<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: re balance the data size on data node disks in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/re-balance-the-data-size-on-data-node-disks/m-p/236400#M198213</link>
    <description>&lt;P&gt;Hello!&lt;/P&gt;&lt;P&gt;It seems the disk balancer utility was introduced in Apache Hadoop 3.0.0-alpha1, &lt;A rel="noopener noreferrer noopener noreferrer" href="https://issues.apache.org/jira/browse/HDFS-1312" target="_blank"&gt;see here&lt;/A&gt;.&lt;/P&gt;&lt;P&gt;Someone mentioned that it is technically possible to backport it to earlier HDP versions, but there seems to be no progress on that.&lt;/P&gt;&lt;PRE&gt;As far as I know, we have not backported this change to HDP 2.1 or 2.4.2. There is nothing technically preventing us from doing so; Disk balancer does not depend on any of the newer 3.0 features. &lt;/PRE&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;In &lt;A rel="noopener noreferrer noopener noreferrer" href="https://community.hortonworks.com/questions/69852/hdfs-data-disk-size-is-exceeding-90-threshold-whil.html" target="_blank"&gt;another discussion&lt;/A&gt;, they suggest decommissioning the node and then recommissioning it. Yes, it is an arduous task, but better than nothing.&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;&lt;A rel="noopener noreferrer noopener noreferrer" href="https://hadoop.apache.org/docs/current/hadoop-project-dist/hadoop-hdfs/HDFSDiskbalancer.html" target="_blank"&gt;Apache documentation for Disk rebalancing&lt;/A&gt;&lt;/P&gt;</description>
    <pubDate>Thu, 25 Jul 2019 08:09:43 GMT</pubDate>
    <dc:creator>david_sanchez_p</dc:creator>
    <dc:date>2019-07-25T08:09:43Z</dc:date>
    <item>
      <title>re balance the data size on data node disks</title>
      <link>https://community.cloudera.com/t5/Support-Questions/re-balance-the-data-size-on-data-node-disks/m-p/236399#M198212</link>
      <description>&lt;P&gt;Hi all,&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;We have a production cluster running HDP 2.6.4.&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;We have 186 DataNode machines (Dell machines with 10 disks each).&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;&lt;U&gt;We tried to rebalance the data across the disks so that all disks have the same used size, but without success.&lt;/U&gt;&lt;/P&gt;&lt;P&gt;It seems to us that version &lt;STRONG&gt;2.6.4&lt;/STRONG&gt; does not have any tools that support rebalancing!&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;As I mentioned, each DataNode machine has 10 disks, and each disk is 1.8T.&lt;/P&gt;&lt;P&gt;Some of the disks are 55% used,&lt;/P&gt;&lt;P&gt;and some of them are only 1% used.&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;So the disks are unbalanced (it is as if some disks are not being used). Why did HDFS not balance the data across all disks?&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;My question: from which HDP version can we rebalance the DataNode disks?&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Does version 2.6.5 support rebalancing?&lt;/P&gt;&lt;P&gt;Or only from 3.X?&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Please advise: what can we do?&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;As I mentioned, this is a very large cluster, and&lt;/P&gt;&lt;P&gt;we have a bad feeling that the current HDP version (2.6.4) does not support any disk rebalancing. Is that true?&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Example:&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;PRE&gt;/dev/sdc &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 3842878616 357409860 3485452372 &amp;nbsp;10% /data_hdfs/sdc
/dev/sde &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 3842878616 460433776 3382428456 &amp;nbsp;42% /data_hdfs/sde
/dev/sdi &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 3842878616 &amp;nbsp; 8606628   34255604 &amp;nbsp; 1% /data_hdfs/sdi
/dev/sdg &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 3842878616 256937520   85924712 &amp;nbsp; 7% /data_hdfs/sdg
/dev/sdd &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 3842878616 465520852 3377341380 &amp;nbsp;53% /data_hdfs/sdd
/dev/sdh &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 3842878616 &amp;nbsp; &amp;nbsp; 90136   42772096 &amp;nbsp; 1% /data_hdfs/sdh
/dev/sdb &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 3842878616 466423860 3376438372 &amp;nbsp;53% /data_hdfs/sdb
&lt;/PRE&gt;</description>
      <pubDate>Thu, 25 Jul 2019 04:48:35 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/re-balance-the-data-size-on-data-node-disks/m-p/236399#M198212</guid>
      <dc:creator>mike_bronson7</dc:creator>
      <dc:date>2019-07-25T04:48:35Z</dc:date>
    </item>
    <item>
      <title>Re: re balance the data size on data node disks</title>
      <link>https://community.cloudera.com/t5/Support-Questions/re-balance-the-data-size-on-data-node-disks/m-p/236400#M198213</link>
      <description>&lt;P&gt;Hello!&lt;/P&gt;&lt;P&gt;It seems the disk balancer utility was introduced in Apache Hadoop 3.0.0-alpha1, &lt;A rel="noopener noreferrer noopener noreferrer" href="https://issues.apache.org/jira/browse/HDFS-1312" target="_blank"&gt;see here&lt;/A&gt;.&lt;/P&gt;&lt;P&gt;Someone mentioned that it is technically possible to backport it to earlier HDP versions, but there seems to be no progress on that.&lt;/P&gt;&lt;PRE&gt;As far as I know, we have not backported this change to HDP 2.1 or 2.4.2. There is nothing technically preventing us from doing so; Disk balancer does not depend on any of the newer 3.0 features. &lt;/PRE&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;In &lt;A rel="noopener noreferrer noopener noreferrer" href="https://community.hortonworks.com/questions/69852/hdfs-data-disk-size-is-exceeding-90-threshold-whil.html" target="_blank"&gt;another discussion&lt;/A&gt;, they suggest decommissioning the node and then recommissioning it. Yes, it is an arduous task, but better than nothing.&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;&lt;A rel="noopener noreferrer noopener noreferrer" href="https://hadoop.apache.org/docs/current/hadoop-project-dist/hadoop-hdfs/HDFSDiskbalancer.html" target="_blank"&gt;Apache documentation for Disk rebalancing&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 25 Jul 2019 08:09:43 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/re-balance-the-data-size-on-data-node-disks/m-p/236400#M198213</guid>
      <dc:creator>david_sanchez_p</dc:creator>
      <dc:date>2019-07-25T08:09:43Z</dc:date>
    </item>
    <item>
      <title>Re: re balance the data size on data node disks</title>
      <link>https://community.cloudera.com/t5/Support-Questions/re-balance-the-data-size-on-data-node-disks/m-p/236401#M198214</link>
      <description>&lt;P&gt;@&lt;A rel="user" href="https://community.hortonworks.com/users/115564/davidsanchezplaza.html"&gt;David Sanchez&lt;/A&gt; One other thing, please: can you help me with this post? &lt;A href="https://community.hortonworks.com/questions/249557/is-it-necessary-to-restart-the-ambari-server-after.html"&gt;https://community.hortonworks.com/questions/249557/is-it-necessary-to-restart-the-ambari-server-after.html&lt;/A&gt; &lt;/P&gt;</description>
      <pubDate>Thu, 25 Jul 2019 16:22:20 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/re-balance-the-data-size-on-data-node-disks/m-p/236401#M198214</guid>
      <dc:creator>mike_bronson7</dc:creator>
      <dc:date>2019-07-25T16:22:20Z</dc:date>
    </item>
  </channel>
</rss>

