<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: hdfs balancer slow to move data around in cdh 5 in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/hdfs-balancer-slow-to-move-data-around-in-cdh-5/m-p/17240#M2637</link>
    <description>&lt;P&gt;The issue sounds close, but issue I am seeing that the thread sleeps for 35s and moves more data.&lt;/P&gt;&lt;P&gt;It doesnt exit out.&amp;nbsp;&amp;nbsp; It continues to run, but sleeps 35s then moves, then sleeps, then moves.&lt;/P&gt;&lt;P&gt;I gathered total rate to be about 15MB/s doing this it this way.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;like so:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;14/08/14 14:22:44 INFO balancer.Balancer: Successfully moved blk_1075451687_1713835 with size=132482663 from 10.2.2.3:50010 to 10.2.1.247:50010 through 10.2.1.248:50010&lt;BR /&gt;14/08/14 14:23:19 INFO net.NetworkTopology: Adding a new node: /default/10.2.1.253:50010&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;14/08/14 14:23:22 INFO balancer.Balancer: Successfully moved blk_1076792220_3054863 with size=134217728 from 10.2.1.253:50010 to 10.2.1.247:50010 through 10.2.2.7:50010&lt;BR /&gt;14/08/14 14:23:56 INFO net.NetworkTopology: Adding a new node: /default/10.2.2.2:50010&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;You can see the last move and next time it wakesup todo the move again its about 35s every time!&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Tue, 19 Aug 2014 22:47:01 GMT</pubDate>
    <dc:creator>kal.pan</dc:creator>
    <dc:date>2014-08-19T22:47:01Z</dc:date>
    <item>
      <title>hdfs balancer slow to move data around in cdh 5</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/hdfs-balancer-slow-to-move-data-around-in-cdh-5/m-p/17226#M2635</link>
      <description>&lt;P&gt;New hdfs balancer the command (tried via CM as well same result)&lt;/P&gt;&lt;P&gt;"hdfs balancer" command is slow to move data around to balance the nodes.&lt;/P&gt;&lt;P&gt;I have tried it with different thresholds and poicies.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;The old hdfs balancer in version cdh4.1.3 is pretty fast.&amp;nbsp; It actually moves around data at the speed specified by&lt;/P&gt;&lt;P&gt;dfs.balance.bandwidthPerSec.&amp;nbsp;&amp;nbsp; The old cluster moves data around rapidly, ie the balancer thread doesnt sleep.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;The new one however, seems to either ignore it or is superceeded by another value that is not tunable.&amp;nbsp; The balancer sleeps for 35s then moves a few blocks around, then sleeps again.&lt;/P&gt;&lt;P&gt;I have dfs.balance.bandwidthPerSec set to 1GB/s (or 10Gb)&lt;/P&gt;&lt;P&gt;Yet the cluster only gets balanced at 15MB/s&amp;nbsp; which is really slow when you have TBs of data.&lt;/P&gt;&lt;P&gt;This cluster is completely idle of 18 nodes, nothing is happening other than prep work to move data around and testing to get the cluster to prod status.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;If anyone knows the quirks of the new hadoop please help!&amp;nbsp;&amp;nbsp; At this snail's pace adding / remove nodes would be disasterous to the hadoop infrastructure.&amp;nbsp;&amp;nbsp; This would indeed become small data platform not big data.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 16 Sep 2022 09:05:31 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/hdfs-balancer-slow-to-move-data-around-in-cdh-5/m-p/17226#M2635</guid>
      <dc:creator>kal.pan</dc:creator>
      <dc:date>2022-09-16T09:05:31Z</dc:date>
    </item>
    <item>
      <title>Re: hdfs balancer slow to move data around in cdh 5</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/hdfs-balancer-slow-to-move-data-around-in-cdh-5/m-p/17236#M2636</link>
      <description>More than likely you are hitting HDFS-6621. This is fairly new so it hasn't been fixed in CDH5 as of yet.</description>
      <pubDate>Tue, 19 Aug 2014 22:07:08 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/hdfs-balancer-slow-to-move-data-around-in-cdh-5/m-p/17236#M2636</guid>
      <dc:creator>Rolando</dc:creator>
      <dc:date>2014-08-19T22:07:08Z</dc:date>
    </item>
    <item>
      <title>Re: hdfs balancer slow to move data around in cdh 5</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/hdfs-balancer-slow-to-move-data-around-in-cdh-5/m-p/17240#M2637</link>
      <description>&lt;P&gt;The issue sounds close, but issue I am seeing that the thread sleeps for 35s and moves more data.&lt;/P&gt;&lt;P&gt;It doesnt exit out.&amp;nbsp;&amp;nbsp; It continues to run, but sleeps 35s then moves, then sleeps, then moves.&lt;/P&gt;&lt;P&gt;I gathered total rate to be about 15MB/s doing this it this way.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;like so:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;14/08/14 14:22:44 INFO balancer.Balancer: Successfully moved blk_1075451687_1713835 with size=132482663 from 10.2.2.3:50010 to 10.2.1.247:50010 through 10.2.1.248:50010&lt;BR /&gt;14/08/14 14:23:19 INFO net.NetworkTopology: Adding a new node: /default/10.2.1.253:50010&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;14/08/14 14:23:22 INFO balancer.Balancer: Successfully moved blk_1076792220_3054863 with size=134217728 from 10.2.1.253:50010 to 10.2.1.247:50010 through 10.2.2.7:50010&lt;BR /&gt;14/08/14 14:23:56 INFO net.NetworkTopology: Adding a new node: /default/10.2.2.2:50010&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;You can see the last move and next time it wakesup todo the move again its about 35s every time!&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 19 Aug 2014 22:47:01 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/hdfs-balancer-slow-to-move-data-around-in-cdh-5/m-p/17240#M2637</guid>
      <dc:creator>kal.pan</dc:creator>
      <dc:date>2014-08-19T22:47:01Z</dc:date>
    </item>
    <item>
      <title>Re: hdfs balancer slow to move data around in cdh 5</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/hdfs-balancer-slow-to-move-data-around-in-cdh-5/m-p/20722#M2639</link>
      <description>Sorry for the late response. What exact version of CDH 5.x are you running?</description>
      <pubDate>Thu, 23 Oct 2014 04:11:07 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/hdfs-balancer-slow-to-move-data-around-in-cdh-5/m-p/20722#M2639</guid>
      <dc:creator>GautamG</dc:creator>
      <dc:date>2014-10-23T04:11:07Z</dc:date>
    </item>
    <item>
      <title>Re: hdfs balancer slow to move data around in cdh 5</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/hdfs-balancer-slow-to-move-data-around-in-cdh-5/m-p/20754#M2640</link>
      <description>&lt;P&gt;There was a patch released that fixed this problem.&lt;/P&gt;&lt;P&gt;I updated the bundle give to me by support.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Thu, 23 Oct 2014 18:06:49 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/hdfs-balancer-slow-to-move-data-around-in-cdh-5/m-p/20754#M2640</guid>
      <dc:creator>kal.pan</dc:creator>
      <dc:date>2014-10-23T18:06:49Z</dc:date>
    </item>
    <item>
      <title>Re: hdfs balancer slow to move data around in cdh 5</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/hdfs-balancer-slow-to-move-data-around-in-cdh-5/m-p/20776#M2641</link>
      <description>That's great news, thanks for the feedback</description>
      <pubDate>Fri, 24 Oct 2014 02:34:46 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/hdfs-balancer-slow-to-move-data-around-in-cdh-5/m-p/20776#M2641</guid>
      <dc:creator>GautamG</dc:creator>
      <dc:date>2014-10-24T02:34:46Z</dc:date>
    </item>
    <item>
      <title>Re: hdfs balancer slow to move data around in cdh 5</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/hdfs-balancer-slow-to-move-data-around-in-cdh-5/m-p/23062#M2642</link>
      <description>&lt;P&gt;it seems i have the same issue exactly with CDH 5.1.0&lt;/P&gt;&lt;P&gt;where can i get the patch you are talking about?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks,&lt;/P&gt;&lt;P&gt;Eliran&lt;/P&gt;</description>
      <pubDate>Wed, 24 Dec 2014 09:58:32 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/hdfs-balancer-slow-to-move-data-around-in-cdh-5/m-p/23062#M2642</guid>
      <dc:creator>eliran007</dc:creator>
      <dc:date>2014-12-24T09:58:32Z</dc:date>
    </item>
    <item>
      <title>Re: hdfs balancer slow to move data around in cdh 5</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/hdfs-balancer-slow-to-move-data-around-in-cdh-5/m-p/23064#M2643</link>
      <description>The patch is part of CDH 5.1.4 and CDH 5.2.0 (and later versions). You can simply upgrade to v5.1.4 from v5.1.0 for the fix.</description>
      <pubDate>Wed, 24 Dec 2014 11:17:58 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/hdfs-balancer-slow-to-move-data-around-in-cdh-5/m-p/23064#M2643</guid>
      <dc:creator>GautamG</dc:creator>
      <dc:date>2014-12-24T11:17:58Z</dc:date>
    </item>
    <item>
      <title>Re: hdfs balancer slow to move data around in cdh 5</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/hdfs-balancer-slow-to-move-data-around-in-cdh-5/m-p/23074#M2644</link>
      <description>&lt;P&gt;god bless you!!!&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;i have been rebalancing fro almost two month and its been on 40%, and now in and hour it finished almost 10%&lt;/P&gt;&lt;P&gt;we were going crazy here.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks alot&lt;/P&gt;</description>
      <pubDate>Wed, 24 Dec 2014 15:41:28 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/hdfs-balancer-slow-to-move-data-around-in-cdh-5/m-p/23074#M2644</guid>
      <dc:creator>eliran007</dc:creator>
      <dc:date>2014-12-24T15:41:28Z</dc:date>
    </item>
    <item>
      <title>Re: hdfs balancer slow to move data around in cdh 5</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/hdfs-balancer-slow-to-move-data-around-in-cdh-5/m-p/23783#M2645</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;We are also experiencing this. Could you point us to which patch this is?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Kind regards,&lt;/P&gt;&lt;P&gt;Jakob&lt;/P&gt;</description>
      <pubDate>Fri, 16 Jan 2015 14:56:07 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/hdfs-balancer-slow-to-move-data-around-in-cdh-5/m-p/23783#M2645</guid>
      <dc:creator>jakeri</dc:creator>
      <dc:date>2015-01-16T14:56:07Z</dc:date>
    </item>
    <item>
      <title>Re: hdfs balancer slow to move data around in cdh 5</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/hdfs-balancer-slow-to-move-data-around-in-cdh-5/m-p/23809#M2646</link>
      <description>&lt;P&gt;We have created &lt;SPAN class="il"&gt;PATCH&lt;/SPAN&gt;-434 for your s8cluster that is a backport of HDFS-6621 for CDH 5.1.0.&lt;BR /&gt;This &lt;SPAN class="il"&gt;patch&lt;/SPAN&gt; was tested against CentOS 5.7 and CentOS 6+ which are our supported OS versions for CDH 5.1.0.&lt;/P&gt;</description>
      <pubDate>Fri, 16 Jan 2015 20:16:47 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/hdfs-balancer-slow-to-move-data-around-in-cdh-5/m-p/23809#M2646</guid>
      <dc:creator>kal.pan</dc:creator>
      <dc:date>2015-01-16T20:16:47Z</dc:date>
    </item>
    <item>
      <title>Re: hdfs balancer slow to move data around in cdh 5</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/hdfs-balancer-slow-to-move-data-around-in-cdh-5/m-p/23825#M2647</link>
      <description>&lt;BLOCKQUOTE&gt;&lt;HR /&gt;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/9337"&gt;@jakeri&lt;/a&gt; wrote:&lt;BR /&gt;&lt;P&gt;Hello,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;We are also experiencing this. Could you point us to which patch this is?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Kind regards,&lt;/P&gt;&lt;P&gt;Jakob&lt;/P&gt;&lt;HR /&gt;&lt;/BLOCKQUOTE&gt;&lt;P&gt;The patch is part of CDH 5.1.4 and CDH 5.2.0 (and later versions). You can simply upgrade to v5.1.4 from v5.1.0 for the fix. If you have a support contract, you can log a case for a patch on a specific version of CDH.&lt;/P&gt;</description>
      <pubDate>Sat, 17 Jan 2015 23:49:56 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/hdfs-balancer-slow-to-move-data-around-in-cdh-5/m-p/23825#M2647</guid>
      <dc:creator>GautamG</dc:creator>
      <dc:date>2015-01-17T23:49:56Z</dc:date>
    </item>
  </channel>
</rss>

