<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Minimum number of nodes to add in a multi-node cluster in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Minimum-number-of-nodes-to-add-in-a-multi-node-cluster/m-p/123935#M43195</link>
    <description>&lt;A rel="user" href="https://community.cloudera.com/users/10486/simrank.html" nodeid="10486"&gt;@Simran Kaur&lt;/A&gt;&lt;P&gt;Please find answers inline -&lt;/P&gt;&lt;P&gt;1. a node means a server, right? No VM'S ?&lt;/P&gt;&lt;P&gt;- Node means server. A server can be physical hardware or virtual machine also.&lt;/P&gt;&lt;P&gt;2. How many servers I would need to add to have a healthy cluster&lt;/P&gt;&lt;P&gt;- It depends upon what type of configuration you use for production. Generally a broader question to discuss. For Master services I would recommend to deploy on individual node and slave nodes as per your requirement.&lt;/P&gt;&lt;P&gt;In case of HA you need to revisit placement of the above services.&lt;/P&gt;&lt;P&gt;master1 - Active NN,ZK,JN&lt;/P&gt;&lt;P&gt;master1 - Standby NN, ZK, JN,RM, AM,HS&lt;/P&gt;&lt;P&gt;master1 - Ambari, ZK, HIVE,SQOOP,OOZIE,HUE,Ranger,etc..&lt;/P&gt;&lt;P&gt;Slave Nodes - DN,N,etc..&lt;/P&gt;&lt;P&gt;3. Which of the above mentioned services should be co-located?&lt;/P&gt;&lt;P&gt;- For HDFS make sure JN should run most probably on both namenodes, also if possible you should have dedicated disk for JN and ZK.&lt;/P&gt;&lt;P&gt;4. What should be the distribution like?&lt;/P&gt;&lt;P&gt;- You can go for n-1 distribution [where is n=latest stable release from hdp]&lt;/P&gt;&lt;P&gt;You can migrate services after installation.&lt;/P&gt;</description>
    <pubDate>Tue, 11 Oct 2016 16:58:25 GMT</pubDate>
    <dc:creator>sshimpi</dc:creator>
    <dc:date>2016-10-11T16:58:25Z</dc:date>
    <item>
      <title>Minimum number of nodes to add in a multi-node cluster</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Minimum-number-of-nodes-to-add-in-a-multi-node-cluster/m-p/123934#M43194</link>
      <description>&lt;P&gt;I have &lt;/P&gt;&lt;P&gt;1. Hive&lt;/P&gt;&lt;P&gt;2. Pig&lt;/P&gt;&lt;P&gt;3. Zookeeper&lt;/P&gt;&lt;P&gt;4.HDFS&lt;/P&gt;&lt;P&gt;5. Hue&lt;/P&gt;&lt;P&gt;6. Oozie&lt;/P&gt;&lt;P&gt;7. Sqoop&lt;/P&gt;&lt;P&gt;8. Yarn&lt;/P&gt;&lt;P&gt;9. Ranger&lt;/P&gt;&lt;P&gt;Currently, all of these are deployed on the same host.  Now, I would like to add more hosts to it. &lt;/P&gt;&lt;P&gt;But I have a few doubts:&lt;/P&gt;&lt;P&gt;In production, &lt;/P&gt;&lt;P&gt;1. a node means a server, right? No VM'S ?&lt;/P&gt;&lt;P&gt;2. How many servers I would need to add to have a healthy cluster&lt;/P&gt;&lt;P&gt;3. Which of the above mentioned services should be co-located?&lt;/P&gt;&lt;P&gt;4. What should be the distribution like?&lt;/P&gt;&lt;P&gt;Pig is relatively used less but sqoop, Hive , Oozie and Hue most of the times and ofcourse Ranger for authorization part. &lt;/P&gt;&lt;P&gt;What should be the distribution like? Which of these services should be moved to new hosts?Which of these should be co located?&lt;/P&gt;&lt;P&gt;Which of these should have entirely dedicated server to them? I am new to it and would appreciate if you could give the specifications to establishing a multi-node cluster . &lt;/P&gt;</description>
      <pubDate>Tue, 21 Apr 2026 13:48:50 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Minimum-number-of-nodes-to-add-in-a-multi-node-cluster/m-p/123934#M43194</guid>
      <dc:creator>simran_k</dc:creator>
      <dc:date>2026-04-21T13:48:50Z</dc:date>
    </item>
    <item>
      <title>Re: Minimum number of nodes to add in a multi-node cluster</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Minimum-number-of-nodes-to-add-in-a-multi-node-cluster/m-p/123935#M43195</link>
      <description>&lt;A rel="user" href="https://community.cloudera.com/users/10486/simrank.html" nodeid="10486"&gt;@Simran Kaur&lt;/A&gt;&lt;P&gt;Please find answers inline -&lt;/P&gt;&lt;P&gt;1. a node means a server, right? No VM'S ?&lt;/P&gt;&lt;P&gt;- Node means server. A server can be physical hardware or virtual machine also.&lt;/P&gt;&lt;P&gt;2. How many servers I would need to add to have a healthy cluster&lt;/P&gt;&lt;P&gt;- It depends upon what type of configuration you use for production. Generally a broader question to discuss. For Master services I would recommend to deploy on individual node and slave nodes as per your requirement.&lt;/P&gt;&lt;P&gt;In case of HA you need to revisit placement of the above services.&lt;/P&gt;&lt;P&gt;master1 - Active NN,ZK,JN&lt;/P&gt;&lt;P&gt;master1 - Standby NN, ZK, JN,RM, AM,HS&lt;/P&gt;&lt;P&gt;master1 - Ambari, ZK, HIVE,SQOOP,OOZIE,HUE,Ranger,etc..&lt;/P&gt;&lt;P&gt;Slave Nodes - DN,N,etc..&lt;/P&gt;&lt;P&gt;3. Which of the above mentioned services should be co-located?&lt;/P&gt;&lt;P&gt;- For HDFS make sure JN should run most probably on both namenodes, also if possible you should have dedicated disk for JN and ZK.&lt;/P&gt;&lt;P&gt;4. What should be the distribution like?&lt;/P&gt;&lt;P&gt;- You can go for n-1 distribution [where is n=latest stable release from hdp]&lt;/P&gt;&lt;P&gt;You can migrate services after installation.&lt;/P&gt;</description>
      <pubDate>Tue, 11 Oct 2016 16:58:25 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Minimum-number-of-nodes-to-add-in-a-multi-node-cluster/m-p/123935#M43195</guid>
      <dc:creator>sshimpi</dc:creator>
      <dc:date>2016-10-11T16:58:25Z</dc:date>
    </item>
  </channel>
</rss>

