Problem Description
I'm trying to run an Apache Beam pipeline on Flink on our test cluster. It has been failing with an EOFException at
org.apache.flink.runtime.io.disk.SimpleCollectingOutputView:79
during the encoding of an object through serialisation. I haven't been able to reproduce the error locally yet. You can find the entire job log here; some values have been replaced with fake data.
Command used to run the pipeline:
bin/flink run \
-m yarn-cluster \
--yarncontainer 1 \
--yarnslots 4 \
--yarnjobManagerMemory 2000 \
--yarntaskManagerMemory 2000 \
--yarnname "EBI" \
pipeline.jar \
--runner=FlinkRunner \
--zookeeperQuorum=hdp-master-001.fake.org:2181
While I think it's not related, the object to be serialised is serialisable and has been given both an implicit and an explicit coder, but neither affects the situation.
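For context, a minimal sketch of how an explicit coder can be attached in Beam; Event is a hypothetical stand-in for the pipeline's actual record class:

import java.io.Serializable;

import org.apache.beam.sdk.coders.DefaultCoder;
import org.apache.beam.sdk.coders.SerializableCoder;

// Hypothetical record type standing in for the pipeline's real element class.
// @DefaultCoder attaches SerializableCoder implicitly at the type level;
// calling setCoder(SerializableCoder.of(Event.class)) on the PCollection
// is the explicit route.
@DefaultCoder(SerializableCoder.class)
public class Event implements Serializable {
  public String id;
  public long timestamp;
}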
What might be causing this situation and what can I do to address it?
For now, increasing the manager's heap memory to somewhere between 4 and 8 GiB seems to prevent the exception. I'm still not sure whether this should be normal Flink behaviour (shouldn't it spill to disk?), and it doesn't seem like a solution that scales.
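For reference, this is the submission command with the memory bumped; it's an assumption on my part that the task manager memory (where the serialisation happens) is the one that was raised, and 8000 is just an example value:
bin/flink run \
-m yarn-cluster \
--yarncontainer 1 \
--yarnslots 4 \
--yarnjobManagerMemory 2000 \
--yarntaskManagerMemory 8000 \
--yarnname "EBI" \
pipeline.jar \
--runner=FlinkRunner \
--zookeeperQuorum=hdp-master-001.fake.org:2181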
Accepted Answer
The EOFException is thrown because Flink ran out of memory buffers. Flink expects an EOFException as a notification to start writing data to disk.
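To make the mechanism concrete, here is an illustrative sketch of that pattern, not Flink's actual code: the in-memory view signals buffer exhaustion with an EOFException, and the caller treats it as the cue to spill to disk.

import java.io.EOFException;
import java.io.IOException;

// Illustrative only: a simplified stand-in for Flink's in-memory output view,
// which (like SimpleCollectingOutputView) throws EOFException once its
// memory segments are used up.
public class SpillOnEofSketch {

  interface OutputView {
    void write(byte[] record) throws IOException;
  }

  static class InMemoryView implements OutputView {
    private final byte[] buffer = new byte[1024];
    private int position = 0;

    @Override
    public void write(byte[] record) throws IOException {
      if (position + record.length > buffer.length) {
        // Out of memory buffers: EOFException is the expected signal.
        throw new EOFException("no memory segments left");
      }
      System.arraycopy(record, 0, buffer, position, record.length);
      position += record.length;
    }
  }

  // The caller catches the EOFException and falls back to disk; if the
  // exception arrives wrapped in something else, this catch never fires.
  static void collect(OutputView inMemory, OutputView onDisk, byte[] record)
      throws IOException {
    try {
      inMemory.write(record);
    } catch (EOFException e) {
      onDisk.write(record);
    }
  }
}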
This problem is caused by Beam's SerializableCoder wrapping the EOFException in a CoderException. Hence, Flink does not catch the expected EOFException and fails.
The problem can be solved by using a custom coder that does not wrap the EOFException but forwards it.
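A minimal sketch of such a coder, assuming Java serialisation like Beam's SerializableCoder; the class name ForwardingSerializableCoder is hypothetical:

import java.io.IOException;
import java.io.InputStream;
import java.io.ObjectInputStream;
import java.io.ObjectOutputStream;
import java.io.OutputStream;
import java.io.Serializable;

import org.apache.beam.sdk.coders.CoderException;
import org.apache.beam.sdk.coders.CustomCoder;

// Hypothetical coder: serialises values like SerializableCoder, but does
// not catch and wrap IOExceptions, so an EOFException thrown by Flink's
// output view propagates unchanged and Flink can start spilling to disk.
public class ForwardingSerializableCoder<T extends Serializable> extends CustomCoder<T> {

  private final Class<T> type;

  private ForwardingSerializableCoder(Class<T> type) {
    this.type = type;
  }

  public static <T extends Serializable> ForwardingSerializableCoder<T> of(Class<T> type) {
    return new ForwardingSerializableCoder<>(type);
  }

  @Override
  public void encode(T value, OutputStream outStream) throws IOException {
    // No try/catch around the write: the EOFException must reach Flink.
    ObjectOutputStream oos = new ObjectOutputStream(outStream);
    oos.writeObject(value);
    oos.flush();
  }

  @Override
  public T decode(InputStream inStream) throws IOException {
    try {
      ObjectInputStream ois = new ObjectInputStream(inStream);
      return type.cast(ois.readObject());
    } catch (ClassNotFoundException e) {
      // Only non-IO failures get wrapped; IO exceptions still propagate.
      throw new CoderException("unable to deserialize record", e);
    }
  }
}

It would then be attached explicitly, e.g. events.setCoder(ForwardingSerializableCoder.of(Event.class)), so that it takes precedence over the coder the registry would otherwise pick.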