<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Configure Storage capacity of Hadoop cluster in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Configure-Storage-capacity-of-Hadoop-cluster/m-p/112108#M21901</link>
    <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/2758/vinaykumarpotnuru.html" nodeid="2758"&gt;@vinay kumar&lt;/A&gt; I was going to add /hadoop and then remove other directories after sometime. &lt;/P&gt;</description>
    <pubDate>Mon, 07 Mar 2016 18:34:51 GMT</pubDate>
    <dc:creator>nsabharwal</dc:creator>
    <dc:date>2016-03-07T18:34:51Z</dc:date>
    <item>
      <title>Configure Storage capacity of Hadoop cluster</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Configure-Storage-capacity-of-Hadoop-cluster/m-p/112085#M21878</link>
      <description>&lt;P&gt;we have 5 node cluster with following configurations for master and slaves.&lt;/P&gt;&lt;PRE&gt;HDPMaster   35 GB   500 GB
HDPSlave1   15 GB   500 GB
HDPSlave2   15 GB   500 GB
HDPSlave3   15 GB   500 GB
HDPSlave4   15 GB   500 GB
HDPSlave5   15 GB   500 GB
&lt;/PRE&gt;&lt;P&gt;But the cluster is not taking much space. I am aware of the fact that it will reserve some space for non-dfs use.But,it is taking uneven capacity for each slave node. Is there a way to reconfigure hdfs ?&lt;/P&gt;&lt;P&gt;PFA.&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="2586-namenode.png" style="width: 1142px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/23237iE8CDEB968F4177C5/image-size/medium?v=v2&amp;amp;px=400" role="button" title="2586-namenode.png" alt="2586-namenode.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;Even though all the nodes have same hard disk, only slave 4 is taking 431GB, remaining all nodes are utilizing very small space. Is there a way to resolve this ? &lt;/P&gt;</description>
      <pubDate>Mon, 19 Aug 2019 11:17:42 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Configure-Storage-capacity-of-Hadoop-cluster/m-p/112085#M21878</guid>
      <dc:creator>vinaykumarpotnu</dc:creator>
      <dc:date>2019-08-19T11:17:42Z</dc:date>
    </item>
    <item>
      <title>Re: Configure Storage capacity of Hadoop cluster</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Configure-Storage-capacity-of-Hadoop-cluster/m-p/112086#M21879</link>
      <description>&lt;P&gt;Is your replication factor set to 3? Are you using one reducer in your ingestion? You can use hdfs balancer to spread the data around your cluster &lt;A href="https://hadoop.apache.org/docs/current/hadoop-project-dist/hadoop-hdfs/HDFSCommands.html#Administration_Commands" target="_blank"&gt;https://hadoop.apache.org/docs/current/hadoop-project-dist/hadoop-hdfs/HDFSCommands.html#Administration_Commands&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 04 Mar 2016 18:28:30 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Configure-Storage-capacity-of-Hadoop-cluster/m-p/112086#M21879</guid>
      <dc:creator>aervits</dc:creator>
      <dc:date>2016-03-04T18:28:30Z</dc:date>
    </item>
    <item>
      <title>Re: Configure Storage capacity of Hadoop cluster</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Configure-Storage-capacity-of-Hadoop-cluster/m-p/112087#M21880</link>
      <description>&lt;P&gt;I think you're interpreting it wrong it's the opposite, only slave 4 is not taking up data, the other nodes are filled.&lt;/P&gt;</description>
      <pubDate>Fri, 04 Mar 2016 18:31:17 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Configure-Storage-capacity-of-Hadoop-cluster/m-p/112087#M21880</guid>
      <dc:creator>aervits</dc:creator>
      <dc:date>2016-03-04T18:31:17Z</dc:date>
    </item>
    <item>
      <title>Re: Configure Storage capacity of Hadoop cluster</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Configure-Storage-capacity-of-Hadoop-cluster/m-p/112088#M21881</link>
      <description>&lt;P&gt;yes, Replication factor is 3. But how spreading the data around the cluster help us in changing capacity of the nodes ? &lt;/P&gt;</description>
      <pubDate>Fri, 04 Mar 2016 18:32:04 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Configure-Storage-capacity-of-Hadoop-cluster/m-p/112088#M21881</guid>
      <dc:creator>vinaykumarpotnu</dc:creator>
      <dc:date>2016-03-04T18:32:04Z</dc:date>
    </item>
    <item>
      <title>Re: Configure Storage capacity of Hadoop cluster</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Configure-Storage-capacity-of-Hadoop-cluster/m-p/112089#M21882</link>
      <description>&lt;P&gt;That's remaining capacity not total&lt;/P&gt;</description>
      <pubDate>Fri, 04 Mar 2016 18:36:05 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Configure-Storage-capacity-of-Hadoop-cluster/m-p/112089#M21882</guid>
      <dc:creator>aervits</dc:creator>
      <dc:date>2016-03-04T18:36:05Z</dc:date>
    </item>
    <item>
      <title>Re: Configure Storage capacity of Hadoop cluster</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Configure-Storage-capacity-of-Hadoop-cluster/m-p/112090#M21883</link>
      <description>&lt;P&gt;Yeah probably. I am new to this and I am not able to understand this whole configuration thing. Capacity is available space and non dfs is the space available for linux system use, if i am not wrong. so I still didn't understand the answer to my question. Why is the capacity(the available space  ) is more for slave 4 alone when all the nodes including master have the same harddisk capacity. &lt;/P&gt;</description>
      <pubDate>Fri, 04 Mar 2016 18:39:14 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Configure-Storage-capacity-of-Hadoop-cluster/m-p/112090#M21883</guid>
      <dc:creator>vinaykumarpotnu</dc:creator>
      <dc:date>2016-03-04T18:39:14Z</dc:date>
    </item>
    <item>
      <title>Re: Configure Storage capacity of Hadoop cluster</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Configure-Storage-capacity-of-Hadoop-cluster/m-p/112091#M21884</link>
      <description>&lt;P&gt;Go to the node and investigate the data dir directory you specified. Run hdfs fsck / command see if you have issue  with hdfs, post screenshot of main page of Ambari with all widgets.&lt;/P&gt;</description>
      <pubDate>Fri, 04 Mar 2016 18:56:52 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Configure-Storage-capacity-of-Hadoop-cluster/m-p/112091#M21884</guid>
      <dc:creator>aervits</dc:creator>
      <dc:date>2016-03-04T18:56:52Z</dc:date>
    </item>
    <item>
      <title>Re: Configure Storage capacity of Hadoop cluster</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Configure-Storage-capacity-of-Hadoop-cluster/m-p/112092#M21885</link>
      <description>&lt;P&gt;Cluster is new. It hardly contain any data in it. &lt;/P&gt;</description>
      <pubDate>Fri, 04 Mar 2016 20:23:38 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Configure-Storage-capacity-of-Hadoop-cluster/m-p/112092#M21885</guid>
      <dc:creator>vinaykumarpotnu</dc:creator>
      <dc:date>2016-03-04T20:23:38Z</dc:date>
    </item>
    <item>
      <title>Re: Configure Storage capacity of Hadoop cluster</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Configure-Storage-capacity-of-Hadoop-cluster/m-p/112093#M21886</link>
      <description>&lt;P&gt;OK you need to confirm which directories you specified for datanode in Ambari &amp;gt; hdfs &amp;gt; configs&lt;/P&gt;</description>
      <pubDate>Fri, 04 Mar 2016 21:10:54 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Configure-Storage-capacity-of-Hadoop-cluster/m-p/112093#M21886</guid>
      <dc:creator>aervits</dc:creator>
      <dc:date>2016-03-04T21:10:54Z</dc:date>
    </item>
    <item>
      <title>Re: Configure Storage capacity of Hadoop cluster</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Configure-Storage-capacity-of-Hadoop-cluster/m-p/112094#M21887</link>
      <description>&lt;A rel="user" href="https://community.cloudera.com/users/2758/vinaykumarpotnuru.html" nodeid="2758"&gt;@vinay kumar&lt;/A&gt;&lt;P&gt;&lt;EM&gt;I have never seen the same number for all the slave nodes because of the data distribution. &lt;/EM&gt;&lt;/P&gt;&lt;P&gt;&lt;A target="_blank" href="http://www.srinivastata.com/uncategorized/hdfs-block-placement-load-balance-options/"&gt;Link&lt;/A&gt;&lt;EM&gt;
&lt;/EM&gt;&lt;/P&gt;&lt;P&gt;To overcome uneven block distribution scenario across the cluster, a utility program called &lt;STRONG&gt;balancer&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;
&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;&lt;A href="http://hadoop.apache.org/docs/current/hadoop-project-dist/hadoop-hdfs/HDFSCommands.html#balancer" target="_blank"&gt;http://hadoop.apache.org/docs/current/hadoop-project-dist/hadoop-hdfs/HDFSCommands.html#balancer&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Sat, 05 Mar 2016 18:26:43 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Configure-Storage-capacity-of-Hadoop-cluster/m-p/112094#M21887</guid>
      <dc:creator>nsabharwal</dc:creator>
      <dc:date>2016-03-05T18:26:43Z</dc:date>
    </item>
    <item>
      <title>Re: Configure Storage capacity of Hadoop cluster</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Configure-Storage-capacity-of-Hadoop-cluster/m-p/112095#M21888</link>
      <description>&lt;A rel="user" href="https://community.cloudera.com/users/2758/vinaykumarpotnuru.html" nodeid="2758"&gt;@vinay kumar&lt;/A&gt;&lt;P&gt;Maybe you have problem in disk partitioning. Can you check how much space you have allocated for partitions used by HDP?&lt;/P&gt;&lt;P&gt;Here's a link for partitioning recommendations &lt;A href="http://docs.hortonworks.com/HDPDocuments/HDP2/HDP-2.3.0/bk_cluster-planning-guide/content/ch_partitioning_chapter.html" target="_blank"&gt;http://docs.hortonworks.com/HDPDocuments/HDP2/HDP-2.3.0/bk_cluster-planning-guide/content/ch_partitioning_chapter.html&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Sat, 05 Mar 2016 23:00:38 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Configure-Storage-capacity-of-Hadoop-cluster/m-p/112095#M21888</guid>
      <dc:creator>ahadjidj</dc:creator>
      <dc:date>2016-03-05T23:00:38Z</dc:date>
    </item>
    <item>
      <title>Re: Configure Storage capacity of Hadoop cluster</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Configure-Storage-capacity-of-Hadoop-cluster/m-p/112096#M21889</link>
      <description>&lt;P&gt;/opt/hadoop/hdfs/data,/tmp/hadoop/hdfs/data,/usr/hadoop/hdfs/data,/usr/local/hadoop/hdfs/data&lt;/P&gt;</description>
      <pubDate>Mon, 07 Mar 2016 13:55:08 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Configure-Storage-capacity-of-Hadoop-cluster/m-p/112096#M21889</guid>
      <dc:creator>vinaykumarpotnu</dc:creator>
      <dc:date>2016-03-07T13:55:08Z</dc:date>
    </item>
    <item>
      <title>Re: Configure Storage capacity of Hadoop cluster</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Configure-Storage-capacity-of-Hadoop-cluster/m-p/112097#M21890</link>
      <description>&lt;P&gt;You need to pick just one there, especially don't choose /tmp as your parent dir, asking for trouble &lt;/P&gt;</description>
      <pubDate>Mon, 07 Mar 2016 14:07:24 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Configure-Storage-capacity-of-Hadoop-cluster/m-p/112097#M21890</guid>
      <dc:creator>aervits</dc:creator>
      <dc:date>2016-03-07T14:07:24Z</dc:date>
    </item>
    <item>
      <title>Re: Configure Storage capacity of Hadoop cluster</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Configure-Storage-capacity-of-Hadoop-cluster/m-p/112098#M21891</link>
      <description>&lt;P&gt;I will make it clear.The Cluster is new and it don't have much data in it. As per my understanding, Available capacity is the storage available for  data node(hdfs) , if i am not wrong. The actual hard disk size of each node being 500 GB and the available capacity for 5 of them is far to less than the slave 4.root folder disk capacity has more than 400gb space allocated and the same should be allocated to hdfs. My concern is where did the rest of the space go? How did the distribution thing come here when my only concern is about hdfs capacity. PFA. &lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="2649-df.png" style="width: 746px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/23235i153CC3D12AA2BEF6/image-size/medium?v=v2&amp;amp;px=400" role="button" title="2649-df.png" alt="2649-df.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="2637-df-h.png" style="width: 2px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/23236i1308FA276A16038D/image-size/medium?v=v2&amp;amp;px=400" role="button" title="2637-df-h.png" alt="2637-df-h.png" /&gt;&lt;/span&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 19 Aug 2019 11:17:34 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Configure-Storage-capacity-of-Hadoop-cluster/m-p/112098#M21891</guid>
      <dc:creator>vinaykumarpotnu</dc:creator>
      <dc:date>2019-08-19T11:17:34Z</dc:date>
    </item>
    <item>
      <title>Re: Configure Storage capacity of Hadoop cluster</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Configure-Storage-capacity-of-Hadoop-cluster/m-p/112099#M21892</link>
      <description>&lt;P&gt;Like you said, i removed /tmp from that directories list and the capacity of all the nodes reduced to 40 gb or less including slave 4&lt;/P&gt;</description>
      <pubDate>Mon, 07 Mar 2016 14:27:26 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Configure-Storage-capacity-of-Hadoop-cluster/m-p/112099#M21892</guid>
      <dc:creator>vinaykumarpotnu</dc:creator>
      <dc:date>2016-03-07T14:27:26Z</dc:date>
    </item>
    <item>
      <title>Re: Configure Storage capacity of Hadoop cluster</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Configure-Storage-capacity-of-Hadoop-cluster/m-p/112100#M21893</link>
      <description>&lt;P&gt;I have allocated around 400 GB for / partition. PFA. &lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="2650-df.png" style="width: 746px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/23234i583E14C3086C3361/image-size/medium?v=v2&amp;amp;px=400" role="button" title="2650-df.png" alt="2650-df.png" /&gt;&lt;/span&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 19 Aug 2019 11:17:21 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Configure-Storage-capacity-of-Hadoop-cluster/m-p/112100#M21893</guid>
      <dc:creator>vinaykumarpotnu</dc:creator>
      <dc:date>2019-08-19T11:17:21Z</dc:date>
    </item>
    <item>
      <title>Re: Configure Storage capacity of Hadoop cluster</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Configure-Storage-capacity-of-Hadoop-cluster/m-p/112101#M21894</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/2758/vinaykumarpotnuru.html" nodeid="2758"&gt;@vinay kumar&lt;/A&gt;  What do you have for dfs.datanode.data.dir?&lt;/P&gt;&lt;P&gt;slave 4 --&amp;gt; /  ?&lt;/P&gt;&lt;P&gt;Rest of the nodes are using other mounts...&lt;/P&gt;</description>
      <pubDate>Mon, 07 Mar 2016 17:23:33 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Configure-Storage-capacity-of-Hadoop-cluster/m-p/112101#M21894</guid>
      <dc:creator>nsabharwal</dc:creator>
      <dc:date>2016-03-07T17:23:33Z</dc:date>
    </item>
    <item>
      <title>Re: Configure Storage capacity of Hadoop cluster</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Configure-Storage-capacity-of-Hadoop-cluster/m-p/112102#M21895</link>
      <description>&lt;P&gt;dfs.datanode.data.dir have: /opt/hadoop/hdfs/data,/tmp/hadoop/hdfs/data,/usr/hadoop/hdfs/data,/usr/local/hadoop/hdfs/data&lt;/P&gt;&lt;P&gt;All the nodes have same mounts. &lt;/P&gt;</description>
      <pubDate>Mon, 07 Mar 2016 17:43:09 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Configure-Storage-capacity-of-Hadoop-cluster/m-p/112102#M21895</guid>
      <dc:creator>vinaykumarpotnu</dc:creator>
      <dc:date>2016-03-07T17:43:09Z</dc:date>
    </item>
    <item>
      <title>Re: Configure Storage capacity of Hadoop cluster</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Configure-Storage-capacity-of-Hadoop-cluster/m-p/112103#M21896</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/140/nsabharwal.html" nodeid="140"&gt;@Neeraj Sabharwal&lt;/A&gt; Did say anything wrong ? So the capacity is the Space allocated to hdfs right? &lt;/P&gt;</description>
      <pubDate>Mon, 07 Mar 2016 17:46:49 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Configure-Storage-capacity-of-Hadoop-cluster/m-p/112103#M21896</guid>
      <dc:creator>vinaykumarpotnu</dc:creator>
      <dc:date>2016-03-07T17:46:49Z</dc:date>
    </item>
    <item>
      <title>Re: Configure Storage capacity of Hadoop cluster</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Configure-Storage-capacity-of-Hadoop-cluster/m-p/112104#M21897</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/2758/vinaykumarpotnuru.html" nodeid="2758"&gt;@vinay kumar&lt;/A&gt; &lt;/P&gt;&lt;P&gt;As expected, the problem is with the  disks allocated to datanode settings.&lt;/P&gt;&lt;P&gt;Ambari picks up all the mounts except /boot and /mnt&lt;/P&gt;&lt;P&gt;You were suppose to modify the settings during the installs. As you can see, data is going on /opt and other mounts and you were suppose to give only /hadoop " / has 400GB"&lt;/P&gt;&lt;P&gt;Now , there is no way we want to store the data on /tmp&lt;/P&gt;&lt;P&gt;/opt/hadoop/hdfs/data,/tmp/hadoop/hdfs/data,/usr/hadoop/hdfs/data,/usr/local/hadoop/hdfs/data&lt;/P&gt;&lt;P&gt;You need to create a directory as /hadoop and modify the settings to read the data from /hadoop.&lt;/P&gt;</description>
      <pubDate>Mon, 07 Mar 2016 17:57:10 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Configure-Storage-capacity-of-Hadoop-cluster/m-p/112104#M21897</guid>
      <dc:creator>nsabharwal</dc:creator>
      <dc:date>2016-03-07T17:57:10Z</dc:date>
    </item>
  </channel>
</rss>

