Best practices re: putting services with spiky CPU and memory usage on different servers from Hbase?

Are there any best practices for deciding which services should be co-tenants on the same cluster nodes? As I understand it, Spark is designed to use as much of the host's memory and CPU as it is allowed to, within configured limits. Our Spark usage has been very spiky so far: periods of inactivity followed by short periods in which the entire cluster is pegged. In contrast, HBase is designed to behave as a database, with fairly consistent performance, lower latency, etc. Would you recommend putting the HBase and Spark services on different nodes so that CPU spikes from Spark don't impact HBase?

1 REPLY

Re: Best practices re: putting services with spiky CPU and memory usage on different servers from Hb

If HBase latencies are SLA-critical for you, we have often recommended running HBase on its own dedicated cluster. This gives much more reliable performance, and we do have multiple customers running it this way.

Otherwise, it is important to configure the cluster's memory and CPU resource tunables so that each service has an explicit share of every node's resources rather than competing for them.
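
As a rough illustration of that kind of split, here is a minimal sketch that assumes Spark runs on YARN (the thread does not say which resource manager is in use). It sets aside fixed amounts for the operating system and for the HBase RegionServer on a node, and gives what remains to the YARN NodeManager. The function name and all the numbers are hypothetical placeholders, not recommendations; `yarn.nodemanager.resource.memory-mb` and `yarn.nodemanager.resource.cpu-vcores` are the standard YARN properties for a NodeManager's capacity.

```python
def split_node_resources(total_mem_mb, total_vcores,
                         os_mem_mb=8192, os_vcores=2,
                         hbase_mem_mb=32768, hbase_vcores=4):
    """Return YARN NodeManager limits that leave headroom for the OS and HBase.

    All default reservations are illustrative assumptions; size them from
    your own RegionServer heap settings and observed load.
    """
    # Whatever is not reserved for the OS or HBase is offered to YARN.
    yarn_mem_mb = total_mem_mb - os_mem_mb - hbase_mem_mb
    yarn_vcores = total_vcores - os_vcores - hbase_vcores
    if yarn_mem_mb <= 0 or yarn_vcores <= 0:
        raise ValueError("reservations leave nothing for YARN on this node")
    return {
        "yarn.nodemanager.resource.memory-mb": yarn_mem_mb,
        "yarn.nodemanager.resource.cpu-vcores": yarn_vcores,
    }


if __name__ == "__main__":
    # Hypothetical node: 128 GB of RAM and 32 cores.
    for key, value in split_node_resources(128 * 1024, 32).items():
        print(f"{key} = {value}")
```

Keep in mind that these values only cap what YARN will schedule on the node. A vcore limit by itself is mostly a scheduling figure, so stricter CPU isolation generally needs something like cgroups on top of it.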