<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: HDFS sizing and the right model in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/HDFS-sizing-and-the-right-model/m-p/88375#M36481</link>
    <description>&lt;P&gt;Hi &lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/31306"&gt;@Adilm&lt;/a&gt;,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;1) If you want to migrate all data, you can compress them and allocated in other nodes/servers. And not need 20TB of disk.&lt;/P&gt;&lt;P&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp;Althow if you need availble the data information, yo have 2 scenarios:&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;- &lt;STRONG&gt;Ten replication factor&lt;/STRONG&gt;: then need 20TB per server.&lt;/P&gt;&lt;P&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;- &lt;STRONG&gt;One replication facto&lt;/STRONG&gt;r: only need 20TB distributed in 10 servers.&lt;/P&gt;&lt;P&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;- &lt;STRONG&gt;Best&lt;/STRONG&gt;: replication factor 5 and 4TB per server.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;2) Its depends, you need one namenode, one secondarynamenode, and for example 8 datanodes. You need to put attention of resources of your hosts.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Regards,&lt;/P&gt;&lt;P&gt;Manu.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Thu, 28 Mar 2019 10:40:49 GMT</pubDate>
    <dc:creator>manuroman</dc:creator>
    <dc:date>2019-03-28T10:40:49Z</dc:date>
    <item>
      <title>HDFS sizing and the right model</title>
      <link>https://community.cloudera.com/t5/Support-Questions/HDFS-sizing-and-the-right-model/m-p/88360#M36480</link>
      <description>&lt;P&gt;Good day guys, im newby in Cloudera and wanted to ask 2 questions.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;1) I got 20TB of data and i should migrate it to 10 servers, do i need to have 20TB of disk on each server ?&lt;/P&gt;&lt;P&gt;2) How do i organize the right HDFS model (NameNode, DataNode, SecondaryNameNone) on those 10 servers ?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks, i hope to receive the answer very soon )&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Thu, 28 Mar 2019 07:15:49 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/HDFS-sizing-and-the-right-model/m-p/88360#M36480</guid>
      <dc:creator>Adilm</dc:creator>
      <dc:date>2019-03-28T07:15:49Z</dc:date>
    </item>
    <item>
      <title>Re: HDFS sizing and the right model</title>
      <link>https://community.cloudera.com/t5/Support-Questions/HDFS-sizing-and-the-right-model/m-p/88375#M36481</link>
      <description>&lt;P&gt;Hi &lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/31306"&gt;@Adilm&lt;/a&gt;,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;1) If you want to migrate all data, you can compress them and allocated in other nodes/servers. And not need 20TB of disk.&lt;/P&gt;&lt;P&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp;Althow if you need availble the data information, yo have 2 scenarios:&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;- &lt;STRONG&gt;Ten replication factor&lt;/STRONG&gt;: then need 20TB per server.&lt;/P&gt;&lt;P&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;- &lt;STRONG&gt;One replication facto&lt;/STRONG&gt;r: only need 20TB distributed in 10 servers.&lt;/P&gt;&lt;P&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;- &lt;STRONG&gt;Best&lt;/STRONG&gt;: replication factor 5 and 4TB per server.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;2) Its depends, you need one namenode, one secondarynamenode, and for example 8 datanodes. You need to put attention of resources of your hosts.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Regards,&lt;/P&gt;&lt;P&gt;Manu.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Thu, 28 Mar 2019 10:40:49 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/HDFS-sizing-and-the-right-model/m-p/88375#M36481</guid>
      <dc:creator>manuroman</dc:creator>
      <dc:date>2019-03-28T10:40:49Z</dc:date>
    </item>
    <item>
      <title>Re: HDFS sizing and the right model</title>
      <link>https://community.cloudera.com/t5/Support-Questions/HDFS-sizing-and-the-right-model/m-p/88378#M36482</link>
      <description>&lt;P&gt;Thanks for your reply, so if i get it the right way, size on each server depends on replication factor i put, is there any table of dependencies of replication factor and disk sizing ?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Also wanted to ask about the resources on each node, so summary i need some documentation about replica factor, sizing and ram usage.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Thu, 28 Mar 2019 11:07:28 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/HDFS-sizing-and-the-right-model/m-p/88378#M36482</guid>
      <dc:creator>Adilm</dc:creator>
      <dc:date>2019-03-28T11:07:28Z</dc:date>
    </item>
    <item>
      <title>Re: HDFS sizing and the right model</title>
      <link>https://community.cloudera.com/t5/Support-Questions/HDFS-sizing-and-the-right-model/m-p/88380#M36483</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/31306"&gt;@Adilm&lt;/a&gt;,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;You are right. There are not any table, you must to study your scenario(HA, security, access number ...).&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Some questions:&lt;/P&gt;&lt;P&gt;&amp;nbsp; &amp;nbsp; - Volume users?&lt;/P&gt;&lt;P&gt;&amp;nbsp; &amp;nbsp; - Volume data?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;All documentation is available here, according your version:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&lt;A href="https://www.cloudera.com/documentation/enterprise/latest.html" target="_self"&gt;https://www.cloudera.com/documentation/enterprise/latest.html&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp; &amp;nbsp;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Regards,&lt;/P&gt;&lt;P&gt;Manu.&lt;/P&gt;</description>
      <pubDate>Thu, 28 Mar 2019 11:14:15 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/HDFS-sizing-and-the-right-model/m-p/88380#M36483</guid>
      <dc:creator>manuroman</dc:creator>
      <dc:date>2019-03-28T11:14:15Z</dc:date>
    </item>
    <item>
      <title>Re: HDFS sizing and the right model</title>
      <link>https://community.cloudera.com/t5/Support-Questions/HDFS-sizing-and-the-right-model/m-p/88382#M36484</link>
      <description>Thanks )</description>
      <pubDate>Thu, 28 Mar 2019 11:18:11 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/HDFS-sizing-and-the-right-model/m-p/88382#M36484</guid>
      <dc:creator>Adilm</dc:creator>
      <dc:date>2019-03-28T11:18:11Z</dc:date>
    </item>
  </channel>
</rss>

