<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Load not distributed in the cluster in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Load-not-distributed-in-the-cluster/m-p/163244#M36778</link>
    <description>&lt;P&gt;Thanks &lt;A rel="user" href="https://community.cloudera.com/users/504/kkulkarni.html" nodeid="504"&gt;@Kuldeep Kulkarni&lt;/A&gt; and &lt;A rel="user" href="https://community.cloudera.com/users/11611/shivanageshchowdary1994.html" nodeid="11611"&gt;@Shiva Nagesh&lt;/A&gt;&lt;/P&gt;</description>
    <pubDate>Thu, 04 Aug 2016 12:10:25 GMT</pubDate>
    <dc:creator>arunpoy</dc:creator>
    <dc:date>2016-08-04T12:10:25Z</dc:date>
    <item>
      <title>Load not distributed in the cluster</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Load-not-distributed-in-the-cluster/m-p/163240#M36774</link>
      <description>&lt;P&gt;We have a 5-node cluster (2 masters and 3 slaves) running MapReduce jobs. We always see that only 2 of the slave nodes are loaded and utilized, while the third remains idle. What could be the reasons for this? All 3 slave nodes are in the same rack.&lt;/P&gt;</description>
      <pubDate>Wed, 03 Aug 2016 19:28:55 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Load-not-distributed-in-the-cluster/m-p/163240#M36774</guid>
      <dc:creator>arunpoy</dc:creator>
      <dc:date>2016-08-03T19:28:55Z</dc:date>
    </item>
    <item>
      <title>Re: Load not distributed in the cluster</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Load-not-distributed-in-the-cluster/m-p/163241#M36775</link>
      <description>&lt;P&gt;Hi &lt;A rel="user" href="https://community.cloudera.com/users/2302/arunpoy.html" nodeid="2302"&gt;@ARUN&lt;/A&gt;&lt;/P&gt;&lt;P&gt;The main reason might be that the data blocks the MapReduce job needs are located only on those two nodes.&lt;/P&gt;&lt;P&gt;Can you please check the data blocks of the file you are processing and verify that the data is distributed across all 3 nodes?&lt;/P&gt;&lt;P&gt;Speculative execution (where a duplicate attempt of a slow-running task is launched on another node, such as the third one) may also not be happening.&lt;/P&gt;</description>
      <pubDate>Wed, 03 Aug 2016 20:00:12 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Load-not-distributed-in-the-cluster/m-p/163241#M36775</guid>
      <dc:creator>shivanageshch</dc:creator>
      <dc:date>2016-08-03T20:00:12Z</dc:date>
    </item>
    <item>
      <title>Re: Load not distributed in the cluster</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Load-not-distributed-in-the-cluster/m-p/163242#M36776</link>
      <description>&lt;P&gt;Please try the following; it helped me.&lt;/P&gt;&lt;P&gt;1) Log in to the Ambari web UI.&lt;/P&gt;&lt;P&gt;HDFS -&amp;gt; Quick Links -&amp;gt; NameNode UI -&amp;gt; Datanodes&lt;/P&gt;&lt;P&gt;Check the HDFS capacity of each DataNode, how much of it is used, and how much storage is left, to see whether the blocks are distributed unevenly.&lt;/P&gt;&lt;P&gt;Tasks run on the DataNodes that hold the data, which is why the job is running on only two nodes.&lt;/P&gt;&lt;P&gt;2) When an MR application runs, it always tries to achieve data locality.&lt;/P&gt;&lt;P&gt;3) Run the HDFS balancer on the cluster so that the data is distributed across the DataNodes.&lt;/P&gt;&lt;P&gt;4) After the balancer completes, check how the jobs are running.&lt;/P&gt;&lt;P&gt;If the jobs still show the same behavior, please post an update.&lt;/P&gt;&lt;P&gt;If this was helpful, closing the question would be appreciated.&lt;/P&gt;</description>
      <pubDate>Thu, 04 Aug 2016 11:00:50 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Load-not-distributed-in-the-cluster/m-p/163242#M36776</guid>
      <dc:creator>shivkumar82015</dc:creator>
      <dc:date>2016-08-04T11:00:50Z</dc:date>
    </item>
    <item>
      <title>Re: Load not distributed in the cluster</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Load-not-distributed-in-the-cluster/m-p/163243#M36777</link>
      <description>&lt;A rel="user" href="https://community.cloudera.com/users/2302/arunpoy.html" nodeid="2302"&gt;@ARUN&lt;/A&gt;&lt;P&gt;In addition to the above answers:&lt;/P&gt;&lt;P&gt;1. Can you please check whether the NodeManager on the 3rd node is in a healthy state? Sometimes, because of a disk failure or reserved disk space, a NodeManager goes into an unhealthy state even though the NodeManager daemon is still running, and jobs will not be scheduled on the problematic node.&lt;/P&gt;&lt;P&gt;2. Most importantly, try running the HDFS balancer if the data distribution is uneven across the DataNodes.&lt;/P&gt;&lt;P&gt;3. Below is the command to run the HDFS balancer:&lt;/P&gt;&lt;PRE&gt;sudo -u hdfs hadoop balancer -threshold &amp;lt;threshold-value&amp;gt;&lt;/PRE&gt;&lt;P&gt;Note: the default threshold is 10 (percent); you can reduce it to as low as 1, depending on how closely you want to balance your cluster.&lt;/P&gt;&lt;P&gt;Hope this information helps!&lt;/P&gt;</description>
      <pubDate>Thu, 04 Aug 2016 11:22:13 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Load-not-distributed-in-the-cluster/m-p/163243#M36777</guid>
      <dc:creator>KuldeepK</dc:creator>
      <dc:date>2016-08-04T11:22:13Z</dc:date>
    </item>
    <item>
      <title>Re: Load not distributed in the cluster</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Load-not-distributed-in-the-cluster/m-p/163244#M36778</link>
      <description>&lt;P&gt;Thanks &lt;A rel="user" href="https://community.cloudera.com/users/504/kkulkarni.html" nodeid="504"&gt;@Kuldeep Kulkarni&lt;/A&gt; and &lt;A rel="user" href="https://community.cloudera.com/users/11611/shivanageshchowdary1994.html" nodeid="11611"&gt;@Shiva Nagesh&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 04 Aug 2016 12:10:25 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Load-not-distributed-in-the-cluster/m-p/163244#M36778</guid>
      <dc:creator>arunpoy</dc:creator>
      <dc:date>2016-08-04T12:10:25Z</dc:date>
    </item>
    <item>
      <title>Re: Load not distributed in the cluster</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Load-not-distributed-in-the-cluster/m-p/163245#M36779</link>
      <description>&lt;P&gt;Hi all,&lt;/P&gt;&lt;P&gt;Could you please share more information about the following scenario? I am trying to run exactly the command you mentioned, from the active master node:&lt;/P&gt;&lt;P&gt;16/09/26 17:42:53 INFO balancer.Balancer: namenodes  = [hdfs://hadoop2, hdfs://linux.lab.domain.com:8020]
16/09/26 17:42:53 INFO balancer.Balancer: parameters = Balancer.Parameters [BalancingPolicy.Node, threshold = 5.0, max idle iteration = 5, #excluded nodes = 0, #included nodes = 0, #source nodes = 0, run during upgrade = false]
16/09/26 17:42:53 INFO balancer.Balancer: included nodes = []
16/09/26 17:42:53 INFO balancer.Balancer: excluded nodes = []
16/09/26 17:42:53 INFO balancer.Balancer: source nodes = []
Time Stamp               Iteration#  Bytes Already Moved  Bytes Left To Move  Bytes Being Moved
16/09/26 17:42:53 INFO balancer.KeyManager: Block token params received from NN: update interval=10hrs, 0sec, token lifetime=10hrs, 0sec
16/09/26 17:42:53 INFO block.BlockTokenSecretManager: Setting block keys
16/09/26 17:42:53 INFO balancer.KeyManager: Update block keys every 2hrs, 30mins, 0sec
16/09/26 17:42:54 INFO block.BlockTokenSecretManager: Setting block keys
org.apache.hadoop.ipc.RemoteException(org.apache.hadoop.ipc.StandbyException): Operation category READ is not supported in state standby
at org.apache.hadoop.hdfs.server.namenode.ha.StandbyState.checkOperation(StandbyState.java:87)
at org.apache.hadoop.hdfs.server.namenode.NameNode$NameNodeHAContext.checkOperation(NameNode.java:1872)
at org.apache.hadoop.hdfs.server.namenode.FSNamesystem.checkOperation(FSNamesystem.java:1306)
at org.apache.hadoop.hdfs.server.namenode.FSNamesystem.getServerDefaults(FSNamesystem.java:1618)
at org.apache.hadoop.hdfs.server.namenode.NameNodeRpcServer.getServerDefaults(NameNodeRpcServer.java:595)
at org.apache.hadoop.hdfs.protocolPB.ClientNamenodeProtocolServerSideTranslatorPB.getServerDefaults(ClientNamenodeProtocolServerSideTranslatorPB.java:383)
at org.apache.hadoop.hdfs.protocol.proto.ClientNamenodeProtocolProtos$ClientNamenodeProtocol$2.callBlockingMethod(ClientNamenodeProtocolProtos.java)
at org.apache.hadoop.ipc.ProtobufRpcEngine$Server$ProtoBufRpcInvoker.call(ProtobufRpcEngine.java:616)
at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:969)
at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:2137)
at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:2133)
at java.security.AccessController.doPrivileged(Native Method)
at javax.security.auth.Subject.doAs(Unknown Source)
at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1657)
at org.apache.hadoop.ipc.Server$Handler.run(Server.java:2131)
.  Exiting ...&lt;/P&gt;&lt;P&gt;.  Exiting ...
Sep 26, 2016 5:42:54 PM  Balancing took 1.314 seconds &lt;/P&gt;&lt;P&gt;It finishes in less than 2 seconds, which suggests it is not actually running, right?&lt;/P&gt;&lt;P&gt;How long does it take on average when it runs properly? Is there a log I can check for more information if an error is occurring?&lt;/P&gt;&lt;P&gt;I also tried running it from the Ambari console (balance all nodes), with the same result.&lt;/P&gt;&lt;P&gt;Thanks and best regards&lt;/P&gt;</description>
      <pubDate>Wed, 28 Sep 2016 01:19:38 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Load-not-distributed-in-the-cluster/m-p/163245#M36779</guid>
      <dc:creator>leonardo_costa</dc:creator>
      <dc:date>2016-09-28T01:19:38Z</dc:date>
    </item>
  </channel>
</rss>

