What is the difference between the WebHDFS REST API and HttpFS?
If I understand correctly:
- HttpFS is an independent service that exposes a REST API on top of HDFS
- WebHDFS is a REST API built into HDFS. It doesn't require any further installation
Am I correct?
When would it be advisable to use one instead of the other?
I have read an article related to your question. Its title is:
WebHDFS vs HttpFs
The major difference between WebHDFS and HttpFs: WebHDFS needs access to all nodes of the cluster, and when some data is read, it is transmitted directly from the node that holds it. In HttpFs, a single node acts as a "gateway" and is the single point of data transfer to the client. So HttpFs could be choked during a large file transfer, but the good thing is that it minimizes the footprint required to access HDFS, since the client only needs to reach that one node.
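To make the difference concrete, here is a minimal sketch of how the two requests might look from the client side. HttpFS exposes the same REST paths as WebHDFS, so only the host and port change. The hostnames, the file path, and the user name below are hypothetical, and the ports (9870 for the NameNode web port, 14000 for HttpFS) are common defaults that your cluster may override. The helper `webhdfs_url` is my own illustration, not part of any Hadoop client library.

```python
from urllib.parse import quote, urlencode


def webhdfs_url(host: str, port: int, path: str, op: str, **params: str) -> str:
    """Build a URL of the form http://host:port/webhdfs/v1/<path>?op=<OP>&..."""
    query = urlencode({"op": op, **params})
    return f"http://{host}:{port}/webhdfs/v1/{quote(path.lstrip('/'))}?{query}"


# WebHDFS: the request goes to the NameNode (hypothetical host, assumed port).
# For OPEN, the NameNode answers with an HTTP redirect to a DataNode, so the
# client must also be able to reach the DataNodes directly.
direct = webhdfs_url("namenode.example.com", 9870, "/data/big.csv", "OPEN")

# HttpFS: the same REST path, but sent to the gateway (hypothetical host,
# assumed port). The gateway streams the bytes itself, so it is the only
# node the client needs to reach.
gateway = webhdfs_url(
    "httpfs.example.com", 14000, "/data/big.csv", "OPEN", **{"user.name": "alice"}
)

print(direct)
print(gateway)
```

In practice this usually means WebHDFS suits clients that sit inside the cluster network and move large files, while HttpFS suits clients behind a firewall that can only be given access to one host.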