Cloudera允许我配置fs.trash.interval。

但这不允许我配置fs.trash.checkpoint.interval。

那么hdfs何时创建检查点?

这里存在类似的问题,但没有回应:
When does Hadoop Framework creates a checkpoint (expunge) to its "current" directory in trash?

最佳答案

Apache Hadoop文档在左侧导航中包含指向各种* -default.xml文件的链接。这些文件包含所有配置属性的默认设置。

如果单击* -default.xml链接,则网站会在漂亮的表格中显示它们。这是core-site.xml中讨论垃圾属性的原始XML版本。

<property>
  <name>fs.trash.interval</name>
  <value>0</value>
  <description>Number of minutes after which the checkpoint
  gets deleted.  If zero, the trash feature is disabled.
  This option may be configured both on the server and the
  client. If trash is disabled server side then the client
  side configuration is checked. If trash is enabled on the
  server side then the value configured on the server is
  used and the client configuration value is ignored.
  </description>
</property>

<property>
  <name>fs.trash.checkpoint.interval</name>
  <value>0</value>
  <description>Number of minutes between trash checkpoints.
  Should be smaller or equal to fs.trash.interval. If zero,
  the value is set to the value of fs.trash.interval.
  Every time the checkpointer runs it creates a new checkpoint
  out of current and removes checkpoints created more than
  fs.trash.interval minutes ago.
  </description>
</property>

根据此描述,如果您尚未更改fs.trash.checkpoint.interval,则它使用与fs.trash.interval相同的值,这就是它创建垃圾回收点的频率。

Apache Hadoop 2.x发行版中引入了fs.trash.checkpoint.interval配置属性。较早的发行版不支持此配置属性,您可以认为该行为等效于fs.trash.checkpoint.interval等于fs.trash.interval

关于hadoop - cloudera垃圾检查点间隔配置,我们在Stack Overflow上找到一个类似的问题:https://stackoverflow.com/questions/41841744/

10-11 11:00