<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Adding New Hosts with Extra Disks in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/Adding-New-Hosts-with-Extra-Disks/m-p/300446#M220232</link>
    <description>&lt;P&gt;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/79929"&gt;@SagarKanani&lt;/a&gt;&amp;nbsp;Thank you for your reply.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Referring to the documentation, I found the following:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P class="lia-indent-padding-left-30px"&gt;&lt;STRONG&gt;&lt;SPAN class="term"&gt;dfs.datanode.data.dir&lt;/SPAN&gt;&lt;/STRONG&gt;&lt;/P&gt;&lt;P class="lia-indent-padding-left-30px"&gt;Determines where on the local filesystem a DFS data node should store its blocks. If this is a comma-delimited list of directories, then data is stored in all named directories, typically on different devices. Directories that do not exist are ignored. Heterogeneous storage allows specifying that each directory resides on a different type of storage:&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;DISK,&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;SSD,&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;ARCHIVE,&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;or&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;RAM_DISK.&lt;/P&gt;&lt;P class="lia-indent-padding-left-30px"&gt;(&lt;A href="https://docs.cloudera.com/HDPDocuments/HDP2/HDP-2.6.3/bk_hdfs-administration/content/configuration_props.html" target="_blank" rel="noopener"&gt;https://docs.cloudera.com/HDPDocuments/HDP2/HDP-2.6.3/bk_hdfs-administration/content/configuration_props.html&lt;/A&gt;)&lt;/P&gt;&lt;P&gt;I think that means paths&amp;nbsp;&lt;SPAN&gt;/data/disk3 &amp;amp; /data/disk4 will be ignored on old nodes, right?&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Have anyone tried this scenario before?&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Mon, 27 Jul 2020 09:51:18 GMT</pubDate>
    <dc:creator>aen0180</dc:creator>
    <dc:date>2020-07-27T09:51:18Z</dc:date>
    <item>
      <title>Adding New Hosts with Extra Disks</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Adding-New-Hosts-with-Extra-Disks/m-p/300274#M220117</link>
      <description>&lt;P&gt;Hi, I need to add new hosts to an existing cluster using Ambari but the new hosts have more disks than the old nodes that I want to add to the HDFS (old nodes have /data/disk1&amp;amp; /data/disk2 while new nodes have/data/disk1, /data/disk2, /data/disk3 &amp;amp; /data/disk4). How can I add those disks after adding the nodes? can I just update&amp;nbsp; dfs.datanode.data.dir?&lt;/P&gt;</description>
      <pubDate>Wed, 22 Jul 2020 20:11:04 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Adding-New-Hosts-with-Extra-Disks/m-p/300274#M220117</guid>
      <dc:creator>aen0180</dc:creator>
      <dc:date>2020-07-22T20:11:04Z</dc:date>
    </item>
    <item>
      <title>Re: Adding New Hosts with Extra Disks</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Adding-New-Hosts-with-Extra-Disks/m-p/300310#M220138</link>
      <description>&lt;P&gt;I am not a 100% sure but i dont think you can add more disks for new machines. &lt;SPAN&gt;HDFS does a round robin writes on all disk, hence y&lt;/SPAN&gt;ou have to either have the same no. of disks or increase the disks on the existing data nodes. Then you update&amp;nbsp;&lt;SPAN&gt;dfs.datanode.data.dir accordingly.&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 23 Jul 2020 08:52:34 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Adding-New-Hosts-with-Extra-Disks/m-p/300310#M220138</guid>
      <dc:creator>SagarKanani</dc:creator>
      <dc:date>2020-07-23T08:52:34Z</dc:date>
    </item>
    <item>
      <title>Re: Adding New Hosts with Extra Disks</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Adding-New-Hosts-with-Extra-Disks/m-p/300446#M220232</link>
      <description>&lt;P&gt;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/79929"&gt;@SagarKanani&lt;/a&gt;&amp;nbsp;Thank you for your reply.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Referring to the documentation, I found the following:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P class="lia-indent-padding-left-30px"&gt;&lt;STRONG&gt;&lt;SPAN class="term"&gt;dfs.datanode.data.dir&lt;/SPAN&gt;&lt;/STRONG&gt;&lt;/P&gt;&lt;P class="lia-indent-padding-left-30px"&gt;Determines where on the local filesystem a DFS data node should store its blocks. If this is a comma-delimited list of directories, then data is stored in all named directories, typically on different devices. Directories that do not exist are ignored. Heterogeneous storage allows specifying that each directory resides on a different type of storage:&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;DISK,&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;SSD,&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;ARCHIVE,&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;or&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;RAM_DISK.&lt;/P&gt;&lt;P class="lia-indent-padding-left-30px"&gt;(&lt;A href="https://docs.cloudera.com/HDPDocuments/HDP2/HDP-2.6.3/bk_hdfs-administration/content/configuration_props.html" target="_blank" rel="noopener"&gt;https://docs.cloudera.com/HDPDocuments/HDP2/HDP-2.6.3/bk_hdfs-administration/content/configuration_props.html&lt;/A&gt;)&lt;/P&gt;&lt;P&gt;I think that means paths&amp;nbsp;&lt;SPAN&gt;/data/disk3 &amp;amp; /data/disk4 will be ignored on old nodes, right?&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Have anyone tried this scenario before?&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 27 Jul 2020 09:51:18 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Adding-New-Hosts-with-Extra-Disks/m-p/300446#M220232</guid>
      <dc:creator>aen0180</dc:creator>
      <dc:date>2020-07-27T09:51:18Z</dc:date>
    </item>
    <item>
      <title>Re: Adding New Hosts with Extra Disks</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Adding-New-Hosts-with-Extra-Disks/m-p/300512#M220271</link>
      <description>&lt;P&gt;Ahh ok...didnt check the documentation my bad. But, the question still lies if it will ignore the directory on all nodes or only old nodes. I am interested how this turns out. Maybe you can do a quick trial? I dont have a dev environment to try at the moment.&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 29 Jul 2020 06:39:34 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Adding-New-Hosts-with-Extra-Disks/m-p/300512#M220271</guid>
      <dc:creator>SagarKanani</dc:creator>
      <dc:date>2020-07-29T06:39:34Z</dc:date>
    </item>
    <item>
      <title>Re: Adding New Hosts with Extra Disks</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Adding-New-Hosts-with-Extra-Disks/m-p/301004#M220477</link>
      <description>&lt;P&gt;I finally found the correct way to do that. I used Ambari to create a new &lt;STRONG&gt;configuration group&lt;/STRONG&gt; that includes the new hosts only, and then I added the extra disks paths to the&amp;nbsp;&lt;SPAN&gt;&lt;STRONG&gt;dfs.datanode.data.dir parameter&lt;/STRONG&gt; in the new configuration group only. That will integrate the extra disk on the new nodes only into the HDFS. Older nodes will not be impacted by the change in the parameter.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Reference:&amp;nbsp;&lt;A href="https://docs.cloudera.com/HDPDocuments/Ambari-2.7.5.0/managing-and-monitoring-ambari/content/amb_managing_host_configuration_groups.html" target="_blank"&gt;https://docs.cloudera.com/HDPDocuments/Ambari-2.7.5.0/managing-and-monitoring-ambari/content/amb_managing_host_configuration_groups.html&lt;/A&gt;&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 06 Aug 2020 14:49:23 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Adding-New-Hosts-with-Extra-Disks/m-p/301004#M220477</guid>
      <dc:creator>aen0180</dc:creator>
      <dc:date>2020-08-06T14:49:23Z</dc:date>
    </item>
  </channel>
</rss>

