Created on 10-31-2016 04:04 AM - edited 09-16-2022 03:46 AM
I want to deploy HDP on two servers. Each server has 32 GB of RAM.
What is the best design for installing HDP on two servers?
What should be assigned to the master?
What should be assigned to the slave?
And in practice, is it better to use one server with 64 GB of RAM or two servers with 32 GB of RAM each?
Thanks.
Created 10-31-2016 04:24 AM
I am assuming this is just a sandbox-type deployment where you are trying to experiment and learn Hadoop. Feel free to distribute the processes evenly across both nodes and set the replication factor to 2, making both nodes DataNodes. This means one of your nodes will serve as both a DataNode and the NameNode. You cannot have any meaningful Hadoop deployment on two nodes, not even for POC/evaluation purposes.
You might use these two nodes to host five VMs, and in that case have two masters and three data nodes.
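As a rough illustration of the two-node layout described above, here is a minimal Python sketch. The host names `node1` and `node2` and the helper `plan_two_node_layout` are hypothetical, made up for this example; this is not an Ambari or HDP API, just a way to write the plan down and sanity-check it. The check reflects the fact that a replication factor larger than the number of DataNodes would leave blocks under-replicated.

```python
def plan_two_node_layout(hosts, replication=2):
    """Sketch of a tiny learning cluster: every host is a DataNode,
    and the first host also runs the NameNode (hypothetical helper)."""
    if replication > len(hosts):
        # HDFS cannot place more replicas of a block than there are DataNodes,
        # so such a setting would leave blocks under-replicated.
        raise ValueError("replication factor exceeds the number of DataNodes")
    layout = {host: ["DataNode"] for host in hosts}
    # One node doubles as NameNode and DataNode, as suggested in the answer.
    layout[hosts[0]].insert(0, "NameNode")
    return layout


# Hypothetical host names, for illustration only.
layout = plan_two_node_layout(["node1", "node2"], replication=2)
for host, roles in layout.items():
    print(host, "->", ", ".join(roles))
```

The replication factor itself would normally be set through the `dfs.replication` property in the HDFS configuration; the sketch only checks that the chosen value fits the number of DataNodes.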
Created 10-31-2016 05:29 AM
Thanks for your answer.
So in terms of performance, are two nodes the same as one node if they have the same total hardware (and is it also the same if I split them into 5 VM nodes)?
Can you suggest a small cluster design for 3 or 5 nodes? I want to use this in production.
I found the typical cluster reference in the documentation:
Masters -- HDFS NameNode, YARN ResourceManager, and HBase Master.
Slaves -- HDFS DataNodes, YARN NodeManagers, and HBase RegionServers
Created 10-31-2016 05:38 AM
@Hoang Le If you have 5 nodes, you can use 2 nodes as master nodes and 3 as worker nodes (slave nodes).
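To make the 2-master / 3-worker split concrete, here is a small Python sketch that writes the plan out as a host-to-services map, using the master and slave services listed in the question. The host names and the `plan_cluster` helper are hypothetical, and spreading the master services round-robin across the two masters is only one possible assumption, not a recommendation from the HDP documentation.

```python
# Service names taken from the typical layout quoted in the question.
MASTER_SERVICES = ["HDFS NameNode", "YARN ResourceManager", "HBase Master"]
WORKER_SERVICES = ["HDFS DataNode", "YARN NodeManager", "HBase RegionServer"]


def plan_cluster(hosts, master_count=2):
    """Split hosts into masters and workers and assign services (hypothetical helper)."""
    if len(hosts) <= master_count:
        raise ValueError("need at least one worker node besides the masters")
    masters, workers = hosts[:master_count], hosts[master_count:]
    layout = {host: [] for host in hosts}
    # Assumption: spread master services round-robin over the master nodes.
    for i, service in enumerate(MASTER_SERVICES):
        layout[masters[i % len(masters)]].append(service)
    # Every worker runs the full set of slave services.
    for host in workers:
        layout[host].extend(WORKER_SERVICES)
    return layout


# Hypothetical host names, for illustration only.
cluster = plan_cluster(["node1", "node2", "node3", "node4", "node5"])
for host, services in cluster.items():
    print(host, "->", ", ".join(services))
```

With five hosts this puts the NameNode and HBase Master on the first master, the ResourceManager on the second, and the DataNode, NodeManager, and RegionServer on each of the three workers.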