Created 05-21-2019 02:03 AM
I have a very small physical cluster of 4 hosts where I want to run a CDH with Cloudera Manager. In order to have at least 3 worker hosts I'd like to install the utility and edge node along with the master node on the same host. Is this possible?
Does it make sense to separete master and utility/edge nodes on two distinct hosts and go with only 2 worker hosts?
Is there an alternative setup being restricted to 4 hosts? Thanks for any advice!
Created 05-22-2019 10:30 AM
@rdbb ,
It is possible to even have a one-node cluster (With CM and CDH), but that is certainly not something that you would want to do if you want to leverage the inherent redundancy and load distrobution of CDH.
Functionally, you can put roles most anywhere you please; as far as what is best for your desired use cases, though, we can't really comment without more context.
If you only have 4 hosts, you still need to consider what sort of memory the roles that are on those hosts will demand (as well as CPU and disk). Also, do you want to use High Availability for HDFS and YARN (NameNode and Resource Manager)?
Given enough RAM, disk, etc. you could put master host and edge node on one node and then use the other 3 as worker nodes.
It is more about what you want to get out of this cluster, how important uptime is, and what resources you have.
If you don't care too much and just want to play around, the configuration I described with master/edge on one host and workers on the other 3 is fine. In fact, that's basically what we do for our nightly builds here at Cloudera.
I hope that halps a bit. Feel free to ask more questions.
Created 05-22-2019 02:22 AM
Hi,
do you want to use this configuration for production or only for evaluation and testing?
BR, cabguy0815
Created 05-22-2019 02:58 AM
Created 05-22-2019 02:44 AM
Hi @rdbb, Its depends of your objetive.
There are not any restriction, but really you need a edge node?
Regards, Manu.
Created 05-22-2019 03:45 AM
Hi @manuroman ,
thanks for the hint. Maybe we can omit the services typically running on an edge node.
But reading Cloudera's CDH Cluster Hosts and Role Assignments my understanding is, that at least three different types of hosts are recommended even for a very small installation.
- One master host
- One host combining utitliy and edge functionality
- Three worker hosts
Omitting the edge functionality now comes to the question if it is reasonable to install utilities like the Cloudera Manager on the master host? As mentioned in the original post, we are restricted to 4 hosts.
Best regards,
rdbb
Created 05-22-2019 10:30 AM
@rdbb ,
It is possible to even have a one-node cluster (With CM and CDH), but that is certainly not something that you would want to do if you want to leverage the inherent redundancy and load distrobution of CDH.
Functionally, you can put roles most anywhere you please; as far as what is best for your desired use cases, though, we can't really comment without more context.
If you only have 4 hosts, you still need to consider what sort of memory the roles that are on those hosts will demand (as well as CPU and disk). Also, do you want to use High Availability for HDFS and YARN (NameNode and Resource Manager)?
Given enough RAM, disk, etc. you could put master host and edge node on one node and then use the other 3 as worker nodes.
It is more about what you want to get out of this cluster, how important uptime is, and what resources you have.
If you don't care too much and just want to play around, the configuration I described with master/edge on one host and workers on the other 3 is fine. In fact, that's basically what we do for our nightly builds here at Cloudera.
I hope that halps a bit. Feel free to ask more questions.
Created 05-23-2019 01:56 AM