Cloudera允许我配置fs.trash.interval。
但这不允许我配置fs.trash.checkpoint.interval。
那么hdfs何时创建检查点?
这里存在类似的问题,但没有回应:
When does Hadoop Framework creates a checkpoint (expunge) to its "current" directory in trash?
最佳答案
Apache Hadoop文档在左侧导航中包含指向各种* -default.xml文件的链接。这些文件包含所有配置属性的默认设置。
如果单击* -default.xml链接,则网站会在漂亮的表格中显示它们。这是core-site.xml中讨论垃圾属性的原始XML版本。
<property>
<name>fs.trash.interval</name>
<value>0</value>
<description>Number of minutes after which the checkpoint
gets deleted. If zero, the trash feature is disabled.
This option may be configured both on the server and the
client. If trash is disabled server side then the client
side configuration is checked. If trash is enabled on the
server side then the value configured on the server is
used and the client configuration value is ignored.
</description>
</property>
<property>
<name>fs.trash.checkpoint.interval</name>
<value>0</value>
<description>Number of minutes between trash checkpoints.
Should be smaller or equal to fs.trash.interval. If zero,
the value is set to the value of fs.trash.interval.
Every time the checkpointer runs it creates a new checkpoint
out of current and removes checkpoints created more than
fs.trash.interval minutes ago.
</description>
</property>
根据此描述,如果您尚未更改
fs.trash.checkpoint.interval
,则它使用与fs.trash.interval
相同的值,这就是它创建垃圾回收点的频率。Apache Hadoop 2.x发行版中引入了
fs.trash.checkpoint.interval
配置属性。较早的发行版不支持此配置属性,您可以认为该行为等效于fs.trash.checkpoint.interval
等于fs.trash.interval
。关于hadoop - cloudera垃圾检查点间隔配置,我们在Stack Overflow上找到一个类似的问题:https://stackoverflow.com/questions/41841744/