<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: How to increase DFS space on existing cluster in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/How-to-increase-DFS-space-on-existing-cluster/m-p/187840#M149941</link>
    <description>&lt;P&gt;As you have heterogeneous worker nodes, I'd recommend setting up two separate host config groups first, then manage HDFS separately.&lt;/P&gt;&lt;P&gt;Here is the link to how to set up config groups in Ambari: &lt;/P&gt;&lt;P&gt;&lt;A href="https://docs.hortonworks.com/HDPDocuments/Ambari-2.5.1.0/bk_ambari-operations/content/using_host_config_groups.html" target="_blank"&gt;https://docs.hortonworks.com/HDPDocuments/Ambari-2.5.1.0/bk_ambari-operations/content/using_host_config_groups.html&lt;/A&gt;&lt;/P&gt;&lt;P&gt;For each host group, you can config the non DSF use by setting the proper value for 'dfs.datanode.du.reserved' (&lt;STRONG&gt;in bytes per volume&lt;/STRONG&gt;), normally it should be 20%- 25% of disk storage.  &lt;/P&gt;&lt;P&gt;Also, keep in mind non DFS can grow into reserved DFS storage, you should regularly delete logs and other non HDFS data that are taking large local storage, I normally use commands like 'du -hsx * | sort -rh | head -10' to identify top 10 largest folders.  &lt;/P&gt;</description>
    <pubDate>Tue, 18 Jul 2017 00:10:22 GMT</pubDate>
    <dc:creator>dsun</dc:creator>
    <dc:date>2017-07-18T00:10:22Z</dc:date>
  </channel>
</rss>

