本文介绍了EMR Spark-TransportClient:无法发送RPC的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我遇到此错误,我试图增加群集实例以及执行程序和驱动程序参数上的内存,但没有成功.

I'm getting this error, I tried to increase memory on cluster instances and in the executor and driver parameters without success.

17/05/07 23:17:07 ERROR TransportClient: Failed to send RPC 6465703946954088562 to ip-172-30-12-164.eu-central-1.compute.internal/172.30.12.164:34706: java.nio.channels.ClosedChannelException

有人能解决此错误吗?

顺便说一句,我正在使用YARN作为集群管理器

BTW I'm using YARN as cluster manager

预先感谢

推荐答案

最后,我解决了这个问题.这是由于磁盘空间不足.一列hadoop日志显示:

Finally I resolved the problem. It was due to insufficient disk space. One column of hadoop logs showed:

Hadoop纱:1/1本地目录不正确:/var/lib/hadoop-yarn/cache/yarn/nm-local-dir; 1/1日志目录不正确:/var/log/hadoop-yarn/containers

Hadoop YARN: 1/1 local-dirs are bad: /var/lib/hadoop-yarn/cache/yarn/nm-local-dir; 1/1 log-dirs are bad: /var/log/hadoop-yarn/containers

使用Google搜索,发现 http://gethue.com/hadoop-yarn-11-local-dirs-are-bad-varlibhadoop-yarncacheyarnnmnm-local-dir-11-log-dirs-are -bad-varloghadoop-yarncontainers/

Googling it I found http://gethue.com/hadoop-yarn-11-local-dirs-are-bad-varlibhadoop-yarncacheyarnnm-local-dir-11-log-dirs-are-bad-varloghadoop-yarncontainers/

要查看此错误,我必须激活EMR中的纱线记录.参见

To see this error I have to activate the yarn logs in EMR. See

要访问群集ec2实例中的日志端口,我为此更改了安全组

To have access to the logs port in the cluster ec2 instances I changed security groups for it

即:

最后,我解决了在etl.py中更改实例类型,使其他实例具有更大磁盘的问题:

Finally I fixed the problem changing in etl.py the type of instances by other ones with bigger disks:

这篇关于EMR Spark-TransportClient:无法发送RPC的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持!

08-20 14:45