Support Questions

Find answers, ask questions, and share your expertise

Hello All,

 

We would like to install a cluster CDP 7.1 on our DEV environment composed of 8 nodes :
3 Masters
3 Slaves
1 Edge
1 Cloudera-manager

 

What are the best practices for distributing the services (Spark,Hbase,Zookeeper...) on the different nodes of the cluster?

 

Thanks for your feedback.

1 ACCEPTED SOLUTION

avatar
Master Guru

@Cluster-CDP You can follow below Cloudera Recommendation about Role distribution as per number of nodes in the cluster. Though in your case this seems a best fit for me. 

https://docs.cloudera.com/cloudera-manager/7.0.3/installation/topics/cdpdc-runtime-cluster-hosts-rol...

 

Master Hosts Utility Hosts Gateway Hosts Worker Hosts

Master Host 1:
  • NameNode
  • YARN ResourceManager
  • JobHistory Server
  • ZooKeeper
  • Kudu master
  • Spark History Server
One host for all Utility and Gateway roles:
  • Secondary NameNode
  • Cloudera Manager
  • Cloudera Manager Management Service
  • Hive Metastore
  • HiveServer2
  • Impala Catalog Server
  • Impala StateStore
  • Hue
  • Oozie
  • Flume
  • Gateway configuration
  • HBase backup master
  • Ranger Admin, Tagsync, Usersync servers
  • Atlas server
  • Solr server (CDP-INFRA-SOLR instance to support Atlas)
  • Streams Messaging Manager
  • Streams Replication Manager Service
  • ZooKeeper
3 - 10 Worker Hosts:
  • DataNode
  • NodeManager
  • Impalad
  • Kudu tablet server

Cheers!
Was your question answered? Make sure to mark the answer as the accepted solution.
If you find a reply useful, say thanks by clicking on the thumbs up button.

View solution in original post

1 REPLY 1

avatar
Master Guru

@Cluster-CDP You can follow below Cloudera Recommendation about Role distribution as per number of nodes in the cluster. Though in your case this seems a best fit for me. 

https://docs.cloudera.com/cloudera-manager/7.0.3/installation/topics/cdpdc-runtime-cluster-hosts-rol...

 

Master Hosts Utility Hosts Gateway Hosts Worker Hosts

Master Host 1:
  • NameNode
  • YARN ResourceManager
  • JobHistory Server
  • ZooKeeper
  • Kudu master
  • Spark History Server
One host for all Utility and Gateway roles:
  • Secondary NameNode
  • Cloudera Manager
  • Cloudera Manager Management Service
  • Hive Metastore
  • HiveServer2
  • Impala Catalog Server
  • Impala StateStore
  • Hue
  • Oozie
  • Flume
  • Gateway configuration
  • HBase backup master
  • Ranger Admin, Tagsync, Usersync servers
  • Atlas server
  • Solr server (CDP-INFRA-SOLR instance to support Atlas)
  • Streams Messaging Manager
  • Streams Replication Manager Service
  • ZooKeeper
3 - 10 Worker Hosts:
  • DataNode
  • NodeManager
  • Impalad
  • Kudu tablet server

Cheers!
Was your question answered? Make sure to mark the answer as the accepted solution.
If you find a reply useful, say thanks by clicking on the thumbs up button.