Support Questions
Find answers, ask questions, and share your expertise
Announcements
Alert: Welcome to the Unified Cloudera Community. Former HCC members be sure to read and learn how to activate your account here.

• Hortonworks Hadoop cluster health check and maintenance – Do’s and Don’ts

• Hortonworks Hadoop cluster health check and maintenance – Do’s and Don’ts

New Contributor

We have just started using HWX in our lab cluster . We are using Ambari to manage the cluster. It will be a great help if someone can give best practices for cluster health check and maintenance . Do’s and Don’ts for cluster .

3 REPLIES 3

Re: • Hortonworks Hadoop cluster health check and maintenance – Do’s and Don’ts

Super Mentor

@Ajay Sharma

Ambari provides a quick service check scripts to quickly verify different cluster services to verify if they are running fine or not. You can run it manually as well.

Ambari --> HDFS --> Service Actions --> Run Service Check.

- Same you can do with other services as well.

.

Re: • Hortonworks Hadoop cluster health check and maintenance – Do’s and Don’ts

Super Mentor

@Ajay Sharma

In addition to Service Checks for various services, Ambari also provides the various types of predefined "Alert & Notification" services that can be used to monitor the whole cluster and get notified when there is something wrong/unexpected.

https://docs.hortonworks.com/HDPDocuments/Ambari-2.4.2.0/bk_ambari-user-guide/content/list_of_predef...

.

- Additionally Ambari also allows users to create & register their own custom alerts (based on your own logic & need)

https://community.hortonworks.com/articles/38149/how-to-create-and-register-custom-ambari-alerts.htm...

.

- For HDP cluster Maintenance best practices you can refer to the following article written by @Kuldeep Kulkarni which talks about few very basic & useful maintenance scenarios.

https://community.hortonworks.com/articles/26518/hadoop-cluster-maintenance.html

.

Re: • Hortonworks Hadoop cluster health check and maintenance – Do’s and Don’ts

Hi @Ajay Sharma, I'd also strongly recommend our support subscription and implementing SmartSense. http://hortonworks.com/products/subscriptions/smartsense/. This is the quickest and best way to get a complete overview of your cluster health based on best practices and real-time workloads.

Don't have an account?
Coming from Hortonworks? Activate your account here