<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Minimum number of nodes, and specs for a real cluster in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/Minimum-number-of-nodes-and-specs-for-a-real-cluster/m-p/33096#M52140</link>
    <description>&lt;P&gt;Hi&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I've been tasked with setting up a Hadoop cluster for testing&amp;nbsp;a new big data initiative. However I'm pretty much completely new to all of this. I know that one can set up a single node cluster for proof of concept, but I would like to know what is the minimum number of nodes, and what spec (amount of&amp;nbsp;RAM &amp;amp; disk space) for a proper cluster. Imagine a low throughput as it's only an initial test cluster (fewer than 10 users). And we only&amp;nbsp;&lt;EM&gt;need&lt;/EM&gt; Kafka, HDFS, Pig &amp;amp; Hive services to run.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;We generally have the ability to spin up Centos 6 VM's with 4GB RAM each, and I might be able to up that to 8GB each. But Reading many of the setup pages, it's quoting minimums of 10s of GB of RAM (e.g.&amp;nbsp;&lt;A href="http://blog.cloudera.com/blog/2013/08/how-to-select-the-right-hardware-for-your-new-hadoop-cluster/)," target="_blank"&gt;http://blog.cloudera.com/blog/2013/08/how-to-select-the-right-hardware-for-your-new-hadoop-cluster/),&lt;/A&gt; but the cloudera manager setup only asks for at least 4GB on that node (&lt;A href="http://www.cloudera.com/content/www/en-us/documentation/enterprise/5-3-x/topics/cm_ig_cm_requirements.html)" target="_blank"&gt;http://www.cloudera.com/content/www/en-us/documentation/enterprise/5-3-x/topics/cm_ig_cm_requirements.html)&lt;/A&gt; and mentions nothing around the other node's specs.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Let me know if you need any more information. I realise it's probably too vague as is.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Cheers,&lt;/P&gt;
&lt;P&gt;Ed&lt;/P&gt;</description>
    <pubDate>Fri, 16 Sep 2022 09:44:40 GMT</pubDate>
    <dc:creator>edwardmlyte</dc:creator>
    <dc:date>2022-09-16T09:44:40Z</dc:date>
  </channel>
</rss>

