<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Increase size of HDFS. in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/Increase-size-of-HDFS/m-p/300038#M220005</link>
    <description>&lt;P&gt;I am running Hortonworks Sandbox HDP 2.6.5 on VirtualBox. I have increased the size of my virtual hard disk (.vdi) to 500 GB. However, when I log in to Ambari and view the size of my disk, it shows only 106 GB. What should I do to increase the HDFS capacity from 106 GB to 500 GB?&lt;/P&gt;</description>
    <pubDate>Tue, 21 Jul 2020 03:27:58 GMT</pubDate>
    <dc:creator>focal_fossa</dc:creator>
    <dc:date>2020-07-21T03:27:58Z</dc:date>
    <item>
      <title>Increase size of HDFS.</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Increase-size-of-HDFS/m-p/300038#M220005</link>
      <description>&lt;P&gt;I am running Hortonworks Sandbox HDP 2.6.5 on VirtualBox. I have increased the size of my virtual hard disk (.vdi) to 500 GB. However, when I log in to Ambari and view the size of my disk, it shows only 106 GB. What should I do to increase the HDFS capacity from 106 GB to 500 GB?&lt;/P&gt;</description>
      <pubDate>Tue, 21 Jul 2020 03:27:58 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Increase-size-of-HDFS/m-p/300038#M220005</guid>
      <dc:creator>focal_fossa</dc:creator>
      <dc:date>2020-07-21T03:27:58Z</dc:date>
    </item>
    <item>
      <title>Re: Increase size of HDFS.</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Increase-size-of-HDFS/m-p/300040#M220007</link>
      <description>&lt;P&gt;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/80138"&gt;@focal_fossa&lt;/a&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;AFAIK, these sandboxes use dynamically allocated storage. You can test that by &lt;SPAN&gt;generating and loading data for TPC-DS.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;General usage is:&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;LI-CODE lang="markup"&gt;tpcds-setup.sh scale_factor [directory]&lt;/LI-CODE&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;For example, the command below will generate &lt;SPAN&gt;200 GB of TPC-DS data&lt;/SPAN&gt; in /user/data [HDFS]:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;LI-CODE lang="markup"&gt;./tpcds-setup.sh 200 /user/data&lt;/LI-CODE&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;This should prove that the disk allocation is dynamic. See the links below:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;A href="https://github.com/hortonworks/hive-testbench/blob/hive14/tpch-build.sh" target="_blank" rel="nofollow noopener noreferrer"&gt;https://github.com/hortonworks/hive-testbench/blob/hive14/tpch-build.sh&lt;/A&gt;&lt;SPAN&gt;&amp;nbsp;and&amp;nbsp;&lt;/SPAN&gt;&lt;A href="https://github.com/hortonworks/hive-testbench/blob/hive14/tpch-setup.sh" target="_blank" rel="nofollow noopener noreferrer"&gt;https://github.com/hortonworks/hive-testbench/blob/hive14/tpch-setup.sh&lt;/A&gt;&lt;SPAN&gt;&amp;nbsp;for the build and setup scripts.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Hope that helps.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 20 Jul 2020 21:39:56 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Increase-size-of-HDFS/m-p/300040#M220007</guid>
      <dc:creator>Shelton</dc:creator>
      <dc:date>2020-07-20T21:39:56Z</dc:date>
    </item>
    <item>
      <title>Re: Increase size of HDFS.</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Increase-size-of-HDFS/m-p/300050#M220014</link>
      <description>&lt;P&gt;I'll look into it. I'll have to install gcc and then later Maven to run those shell scripts. Thanks for your input.&lt;/P&gt;</description>
      <pubDate>Mon, 20 Jul 2020 23:04:10 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Increase-size-of-HDFS/m-p/300050#M220014</guid>
      <dc:creator>focal_fossa</dc:creator>
      <dc:date>2020-07-20T23:04:10Z</dc:date>
    </item>
    <item>
      <title>Re: Increase size of HDFS.</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Increase-size-of-HDFS/m-p/300070#M220020</link>
      <description>&lt;P&gt;I don't think it is dynamically allocated, or at least it doesn't seem to be working. I've run out of space trying to load a ~70 GB file. How can I increase the capacity?&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Capture.PNG" style="width: 999px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/28321iA0F73EDD7FFEA38C/image-size/large?v=v2&amp;amp;px=999" role="button" title="Capture.PNG" alt="Capture.PNG" /&gt;&lt;/span&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Capture1.PNG" style="width: 999px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/28322iEC38356E83F3DD04/image-size/large?v=v2&amp;amp;px=999" role="button" title="Capture1.PNG" alt="Capture1.PNG" /&gt;&lt;/span&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 21 Jul 2020 03:15:19 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Increase-size-of-HDFS/m-p/300070#M220020</guid>
      <dc:creator>focal_fossa</dc:creator>
      <dc:date>2020-07-21T03:15:19Z</dc:date>
    </item>
    <item>
      <title>Re: Increase size of HDFS.</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Increase-size-of-HDFS/m-p/300092#M220029</link>
      <description>&lt;P&gt;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/80138"&gt;@focal_fossa&lt;/a&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Can you share the method you used to extend your VM disk? What is the VM disk file extension,&amp;nbsp;&lt;FONT color="#FF6600"&gt;vmdk&lt;/FONT&gt; or &lt;FONT color="#FF6600"&gt;vdi&lt;/FONT&gt;? Note that&amp;nbsp;&lt;SPAN&gt;VirtualBox&amp;nbsp;does not allow resizing vmdk images.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Does your disk show dynamically allocated storage, as shown below?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="002.JPG" style="width: 842px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/28325i2F3EDE25CDD3458F/image-size/large?v=v2&amp;amp;px=999" role="button" title="002.JPG" alt="002.JPG" /&gt;&lt;/span&gt;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Please let me know.&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 21 Jul 2020 08:56:47 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Increase-size-of-HDFS/m-p/300092#M220029</guid>
      <dc:creator>Shelton</dc:creator>
      <dc:date>2020-07-21T08:56:47Z</dc:date>
    </item>
    <item>
      <title>Re: Increase size of HDFS.</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Increase-size-of-HDFS/m-p/300139#M220049</link>
      <description>&lt;P&gt;It is a VDI. I have used Virtual Media Manager to increase the size of my disk. How can I get HDFS to expand and make use of the unallocated space?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I'm assuming this is how one would do it:&lt;/P&gt;&lt;P&gt;1. Create a new partition in the guest OS and assign a mount point to it.&lt;/P&gt;&lt;P&gt;2. Add that path to the DataNode directories.&lt;/P&gt;&lt;P&gt;(or)&lt;/P&gt;&lt;P&gt;Extend the current partition to fill the unused disk space so that the DataNode automatically increases the HDFS size?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="wapo.PNG" style="width: 695px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/28336i34693313EBD54C06/image-size/large?v=v2&amp;amp;px=999" role="button" title="wapo.PNG" alt="wapo.PNG" /&gt;&lt;/span&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="wapo 1.PNG" style="width: 898px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/28337i3C745493709BD158/image-size/large?v=v2&amp;amp;px=999" role="button" title="wapo 1.PNG" alt="wapo 1.PNG" /&gt;&lt;/span&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 21 Jul 2020 13:51:05 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Increase-size-of-HDFS/m-p/300139#M220049</guid>
      <dc:creator>focal_fossa</dc:creator>
      <dc:date>2020-07-21T13:51:05Z</dc:date>
    </item>
    <item>
      <title>Re: Increase size of HDFS.</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Increase-size-of-HDFS/m-p/300258#M220112</link>
      <description>&lt;P&gt;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/20288"&gt;@Shelton&lt;/a&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;So, I've been able to:&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;create a new partition&lt;/LI&gt;&lt;LI&gt;format it as an ext4 filesystem&lt;/LI&gt;&lt;LI&gt;mount it&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;How do I add this new partition to my DataNode? Is it as simple as putting the drive path in the Ambari DataNode config?&lt;/P&gt;</description>
      <pubDate>Wed, 22 Jul 2020 16:06:50 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Increase-size-of-HDFS/m-p/300258#M220112</guid>
      <dc:creator>focal_fossa</dc:creator>
      <dc:date>2020-07-22T16:06:50Z</dc:date>
    </item>
    <item>
      <title>Re: Increase size of HDFS.</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Increase-size-of-HDFS/m-p/300259#M220113</link>
      <description>&lt;P&gt;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/80138"&gt;@focal_fossa&lt;/a&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;To increase the HDFS capacity, give &lt;FONT color="#FF6600"&gt;dfs.datanode.data.dir&lt;/FONT&gt; more mount points or directories. The new disk needs to be formatted and mounted before you add the mount point in Ambari.&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;In HDP using Ambari, you should add the new mount point to the list of directories in the &lt;FONT color="#FF6600"&gt;dfs.datanode.data.dir&lt;/FONT&gt; property. Depending on the version of Ambari, it may be in the Advanced section; the property belongs to &lt;FONT color="#FF6600"&gt;hdfs-site.xml&lt;/FONT&gt;. The more disks you provide through the comma-separated list, the more capacity you will have. Preferably, every machine should have the same disk and mount point structure.&lt;BR /&gt;&lt;BR /&gt;You will need to&amp;nbsp;&lt;SPAN&gt;run the HDFS balancer, which rebalances data across the DataNodes, moving blocks from overutilized to underutilized nodes.&lt;BR /&gt;Running the balancer without parameters:&lt;BR /&gt;&lt;/SPAN&gt;&lt;/P&gt;&lt;LI-CODE lang="markup"&gt;sudo -u hdfs hdfs balancer&lt;/LI-CODE&gt;&lt;P&gt;&lt;SPAN&gt;This runs the balancer with the default threshold of 10%, meaning that it will ensure that disk usage on each DataNode differs from the overall usage in the cluster by no more than 10%.&lt;BR /&gt;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;You can use a different threshold:&lt;/SPAN&gt;&lt;/P&gt;&lt;LI-CODE lang="markup"&gt;sudo -u hdfs hdfs balancer -threshold 5&lt;/LI-CODE&gt;&lt;P&gt;&lt;SPAN&gt;This specifies that each DataNode's disk usage must be (or will be adjusted to be) within 5% of the cluster's overall usage.&lt;BR /&gt;This process can take a long time, depending on the amount of data in your cluster.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Hope that helps.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 22 Jul 2020 16:37:47 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Increase-size-of-HDFS/m-p/300259#M220113</guid>
      <dc:creator>Shelton</dc:creator>
      <dc:date>2020-07-22T16:37:47Z</dc:date>
    </item>
    <item>
      <title>Re: Increase size of HDFS.</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Increase-size-of-HDFS/m-p/300285#M220126</link>
      <description>&lt;P&gt;Thank you for your inputs. I have been able to expand the size of my HDFS finally.&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Thu, 23 Jul 2020 02:48:49 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Increase-size-of-HDFS/m-p/300285#M220126</guid>
      <dc:creator>focal_fossa</dc:creator>
      <dc:date>2020-07-23T02:48:49Z</dc:date>
    </item>
    <item>
      <title>Re: Increase size of HDFS.</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Increase-size-of-HDFS/m-p/300304#M220132</link>
      <description>&lt;P&gt;Hi &lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/80138"&gt;@focal_fossa&lt;/a&gt;,&amp;nbsp; I'm happy to see you resolved your issue. Can you please mark the appropriate reply as the solution? It will make it easier for others to find the answer in the future.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Thu, 23 Jul 2020 07:43:47 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Increase-size-of-HDFS/m-p/300304#M220132</guid>
      <dc:creator>VidyaSargur</dc:creator>
      <dc:date>2020-07-23T07:43:47Z</dc:date>
    </item>
    <item>
      <title>Re: Increase size of HDFS.</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Increase-size-of-HDFS/m-p/300311#M220139</link>
      <description>&lt;P&gt;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/80138"&gt;@focal_fossa&lt;/a&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Great to hear, happy hadooping!&lt;BR /&gt;To help others, please mark the best answer that helped you resolve your problem, so that others searching for a similar solution can use it to resolve similar issues.&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 23 Jul 2020 09:16:39 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Increase-size-of-HDFS/m-p/300311#M220139</guid>
      <dc:creator>Shelton</dc:creator>
      <dc:date>2020-07-23T09:16:39Z</dc:date>
    </item>
    <item>
      <title>Re: Increase size of HDFS.</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Increase-size-of-HDFS/m-p/300324#M220152</link>
      <description>&lt;P&gt;Since the solution is scattered across many posts, I'm posting a short summary of what I did.&lt;/P&gt;&lt;P&gt;I am running the HDP 2.6.5 image on VirtualBox.&lt;/P&gt;&lt;OL&gt;&lt;LI&gt;Increased the size of my virtual hard disk through Virtual Media Manager&lt;/LI&gt;&lt;LI&gt;In the guest OS,&amp;nbsp;&lt;OL&gt;&lt;LI&gt;Partitioned the unused space&lt;/LI&gt;&lt;LI&gt;Formatted the new partition as an ext4 file system&lt;/LI&gt;&lt;LI&gt;Mounted the file system&lt;/LI&gt;&lt;LI&gt;Update /etc/fstab (I couldn't do it, as I did not find that file)&lt;/LI&gt;&lt;/OL&gt;&lt;/LI&gt;&lt;LI&gt;In Ambari, under the DataNode directory config, added the newly mounted file system as a comma-separated value&lt;/LI&gt;&lt;LI&gt;Restarted HDFS (my cluster did not have any files, therefore I did not run the balancer command below)&lt;/LI&gt;&lt;/OL&gt;&lt;P&gt;Thanks to&amp;nbsp;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/20288"&gt;@Shelton&lt;/a&gt;&amp;nbsp;for his guidance.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;LI-CODE lang="markup"&gt;sudo -u hdfs hdfs balancer&lt;/LI-CODE&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Thu, 23 Jul 2020 13:19:13 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Increase-size-of-HDFS/m-p/300324#M220152</guid>
      <dc:creator>focal_fossa</dc:creator>
      <dc:date>2020-07-23T13:19:13Z</dc:date>
    </item>
  </channel>
</rss>

