<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question hdfs rebalancing in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/hdfs-rebalancing/m-p/162351#M57333</link>
    <description>&lt;P&gt;We are having a 10 dat node cluster. But data in hdfs is not spread across the nodes.  some nodes are under utilized. Will a hdfs rebalancing activity resolve this issue. And During rebalancing  will we be able to use the cluster.&lt;/P&gt;</description>
    <pubDate>Fri, 17 Mar 2017 16:54:47 GMT</pubDate>
    <dc:creator>arunpoy</dc:creator>
    <dc:date>2017-03-17T16:54:47Z</dc:date>
    <item>
      <title>hdfs rebalancing</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/hdfs-rebalancing/m-p/162351#M57333</link>
      <description>&lt;P&gt;We are having a 10 dat node cluster. But data in hdfs is not spread across the nodes.  some nodes are under utilized. Will a hdfs rebalancing activity resolve this issue. And During rebalancing  will we be able to use the cluster.&lt;/P&gt;</description>
      <pubDate>Fri, 17 Mar 2017 16:54:47 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/hdfs-rebalancing/m-p/162351#M57333</guid>
      <dc:creator>arunpoy</dc:creator>
      <dc:date>2017-03-17T16:54:47Z</dc:date>
    </item>
    <item>
      <title>Re: hdfs rebalancing</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/hdfs-rebalancing/m-p/162352#M57334</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/2302/arunpoy.html" nodeid="2302"&gt;@ARUN&lt;/A&gt;&lt;/P&gt;&lt;P&gt;The whole purpose of “balancer” utility is to help balance the blocks across DataNodes in the cluster.  So it should do the job, if there is no major issue at the cluster level. &lt;/P&gt;&lt;P&gt;It is usually recommend to run the balancer periodically during times when the cluster load is expected to be lower than usual.&lt;/P&gt;&lt;P&gt;Also please refer to the following article that explains the importance of balancer and the performance improvement facts: &lt;A href="https://community.hortonworks.com/articles/43615/hdfs-balancer-1-100x-performance-improvement.html" target="_blank"&gt;https://community.hortonworks.com/articles/43615/hdfs-balancer-1-100x-performance-improvement.html&lt;/A&gt;&lt;/P&gt;&lt;P&gt;You can run the HDFS balancer in Maintenance window as well as Without a Maintenance window. Few things you should keep in mind while running the balancer as mentioned in :  &lt;A href="https://community.hortonworks.com/articles/43849/hdfs-balancer-2-configurations-cli-options.html" target="_blank"&gt;https://community.hortonworks.com/articles/43849/hdfs-balancer-2-configurations-cli-options.html&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 17 Mar 2017 16:59:25 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/hdfs-rebalancing/m-p/162352#M57334</guid>
      <dc:creator>jsensharma</dc:creator>
      <dc:date>2017-03-17T16:59:25Z</dc:date>
    </item>
  </channel>
</rss>

