<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: how to increase hdfs disk space in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/how-to-increase-hdfs-disk-space/m-p/120666#M26478</link>
    <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/6243/simrankauradept.html" nodeid="6243"&gt;@simran kaur&lt;/A&gt;
&lt;/P&gt;&lt;P&gt;Too many questions &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt; &lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;Now, there are .staging folder created in the hdfs directory which I believe is because service check could not be completed?&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;--&amp;gt; YARN requires a staging directory for temporary files created by running jobs.&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;My disk space is only 1GB I guess in the cluster. Do I need to increase it? (I guess I do). If yes, How do I increase it? What would be the ideal amount?&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;--&amp;gt; Please have a look at below properties&lt;/P&gt;&lt;PRE&gt;   &amp;lt;property&amp;gt;
      &amp;lt;name&amp;gt;dfs.datanode.data.dir&amp;lt;/name&amp;gt;
      &amp;lt;value&amp;gt;/hadoop/hdfs/data&amp;lt;/value&amp;gt;
      &amp;lt;final&amp;gt;true&amp;lt;/final&amp;gt;&lt;/PRE&gt;&lt;P&gt;This property has list of disks to be used for HDFS, you can add new disks to your linux machine and mention here by comma separated list.&lt;/P&gt;&lt;P&gt;Have a look at &lt;A href="https://community.hortonworks.com/questions/9772/how-to-add-more-disks-to-hdfs.html" target="_blank"&gt;https://community.hortonworks.com/questions/9772/how-to-add-more-disks-to-hdfs.html&lt;/A&gt; for more details&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;Also, theoretically, there are always multiple data nodes in the cluster. Ambari shows only one. Do I need to create new ones myself?&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;Yes. You can spin up one more VM and add it using ambari. Here is the guide to add new node using ambari - &lt;A href="http://hortonworks.com/hadoop-tutorial/using-apache-ambari-add-new-nodes-existing-cluster/" target="_blank"&gt;http://hortonworks.com/hadoop-tutorial/using-apache-ambari-add-new-nodes-existing-cluster/&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;What are the advantages and how do I create them?&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;with multiple nodes, you will get more storage capacity and processing power.&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;Also, what should be the ideal number of data nodes and why?&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;You can run every component on single node, it really depends on your use case.&lt;/P&gt;&lt;P&gt;Hope this information helps!&lt;/P&gt;</description>
    <pubDate>Thu, 28 Apr 2016 21:59:30 GMT</pubDate>
    <dc:creator>KuldeepK</dc:creator>
    <dc:date>2016-04-28T21:59:30Z</dc:date>
    <item>
      <title>how to increase hdfs disk space</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/how-to-increase-hdfs-disk-space/m-p/120665#M26477</link>
      <description>&lt;P&gt;I have hdp installed on server with ambari. &lt;/P&gt;&lt;P&gt; HDFS disk space is 100% utilized after I have run service check. As per my understanding, some folders were created during the process. I have minimal understanding of it though. &lt;/P&gt;&lt;P&gt;Now, there are .staging folder created in the hdfs directory which I believe is because service check could not be completed?&lt;/P&gt;&lt;P&gt;My disk space is only 1GB I guess in the cluster. Do I need to increase it? (I guess I do). If yes, How do I increase it? What would be the ideal amount?&lt;/P&gt;&lt;P&gt;Also, theoretically, there are always multiple data nodes in the cluster. Ambari shows only one. Do I need to create new ones myself?&lt;/P&gt;&lt;P&gt;What are the advantages and how do I create them?&lt;/P&gt;&lt;P&gt;Also, what should be the ideal number of data nodes and why?&lt;/P&gt;</description>
      <pubDate>Thu, 28 Apr 2016 21:31:48 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/how-to-increase-hdfs-disk-space/m-p/120665#M26477</guid>
      <dc:creator>sim6</dc:creator>
      <dc:date>2016-04-28T21:31:48Z</dc:date>
    </item>
    <item>
      <title>Re: how to increase hdfs disk space</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/how-to-increase-hdfs-disk-space/m-p/120666#M26478</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/6243/simrankauradept.html" nodeid="6243"&gt;@simran kaur&lt;/A&gt;
&lt;/P&gt;&lt;P&gt;Too many questions &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt; &lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;Now, there are .staging folder created in the hdfs directory which I believe is because service check could not be completed?&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;--&amp;gt; YARN requires a staging directory for temporary files created by running jobs.&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;My disk space is only 1GB I guess in the cluster. Do I need to increase it? (I guess I do). If yes, How do I increase it? What would be the ideal amount?&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;--&amp;gt; Please have a look at below properties&lt;/P&gt;&lt;PRE&gt;   &amp;lt;property&amp;gt;
      &amp;lt;name&amp;gt;dfs.datanode.data.dir&amp;lt;/name&amp;gt;
      &amp;lt;value&amp;gt;/hadoop/hdfs/data&amp;lt;/value&amp;gt;
      &amp;lt;final&amp;gt;true&amp;lt;/final&amp;gt;&lt;/PRE&gt;&lt;P&gt;This property has list of disks to be used for HDFS, you can add new disks to your linux machine and mention here by comma separated list.&lt;/P&gt;&lt;P&gt;Have a look at &lt;A href="https://community.hortonworks.com/questions/9772/how-to-add-more-disks-to-hdfs.html" target="_blank"&gt;https://community.hortonworks.com/questions/9772/how-to-add-more-disks-to-hdfs.html&lt;/A&gt; for more details&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;Also, theoretically, there are always multiple data nodes in the cluster. Ambari shows only one. Do I need to create new ones myself?&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;Yes. You can spin up one more VM and add it using ambari. Here is the guide to add new node using ambari - &lt;A href="http://hortonworks.com/hadoop-tutorial/using-apache-ambari-add-new-nodes-existing-cluster/" target="_blank"&gt;http://hortonworks.com/hadoop-tutorial/using-apache-ambari-add-new-nodes-existing-cluster/&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;What are the advantages and how do I create them?&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;with multiple nodes, you will get more storage capacity and processing power.&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;Also, what should be the ideal number of data nodes and why?&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;You can run every component on single node, it really depends on your use case.&lt;/P&gt;&lt;P&gt;Hope this information helps!&lt;/P&gt;</description>
      <pubDate>Thu, 28 Apr 2016 21:59:30 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/how-to-increase-hdfs-disk-space/m-p/120666#M26478</guid>
      <dc:creator>KuldeepK</dc:creator>
      <dc:date>2016-04-28T21:59:30Z</dc:date>
    </item>
  </channel>
</rss>

