Hortonworks Hadoop cluster health check and maintenance – Do’s and Don’ts

New Contributor

We have just started using HWX in our lab cluster, and we are using Ambari to manage it. It would be a great help if someone could share best practices for cluster health checks and maintenance, including do’s and don’ts for the cluster.

3 REPLIES

Super Mentor

@Ajay Sharma

Ambari provides service check scripts that quickly verify whether the individual cluster services are running properly. You can also run them manually:

Ambari --> HDFS --> Service Actions --> Run Service Check.

- You can do the same for other services as well.
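If you want to trigger the same check without clicking through the UI, it can also be requested through the Ambari REST API. Below is a minimal sketch that only builds the request and leaves sending it to you. The host name, cluster name and credentials are placeholders, and the command name is assumed to follow the usual `<SERVICE>_SERVICE_CHECK` pattern; some services use a different name (ZooKeeper, as far as I know, uses `ZOOKEEPER_QUORUM_SERVICE_CHECK`), so please verify it against your Ambari version.

```python
import base64
import json
import urllib.request


def build_service_check_request(base_url, cluster, service, user, password):
    """Build (but do not send) a POST request asking Ambari to run a service check.

    Assumption: the command is named <SERVICE>_SERVICE_CHECK, which holds for
    HDFS, YARN and most other services but not for every one of them.
    """
    service = service.upper()
    payload = {
        "RequestInfo": {
            "context": "%s Service Check" % service,
            "command": "%s_SERVICE_CHECK" % service,
        },
        "Requests/resource_filters": [{"service_name": service}],
    }
    url = "%s/api/v1/clusters/%s/requests" % (base_url.rstrip("/"), cluster)
    credentials = ("%s:%s" % (user, password)).encode("utf-8")
    token = base64.b64encode(credentials).decode("ascii")
    return urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": "Basic " + token,
            # Ambari rejects modifying requests that lack this header.
            "X-Requested-By": "ambari",
        },
        method="POST",
    )


# Placeholder values for illustration only.
request = build_service_check_request(
    "http://ambari.example.com:8080", "lab", "hdfs", "admin", "secret"
)
# To actually submit it: urllib.request.urlopen(request)
```

The progress and result of the check then show up under the background operations in Ambari, the same as when it is started from the Service Actions menu.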

Super Mentor

@Ajay Sharma

In addition to the Service Checks for the various services, Ambari provides a set of predefined alerts and notifications that you can use to monitor the whole cluster and get notified when something goes wrong or behaves unexpectedly.

https://docs.hortonworks.com/HDPDocuments/Ambari-2.4.2.0/bk_ambari-user-guide/content/list_of_predef...
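For a quick health overview outside the UI, the current alerts can also be read from the Ambari REST API under `/api/v1/clusters/<cluster>/alerts`. The sketch below picks the WARNING and CRITICAL entries out of such a response. The response shape (an `items` list whose entries carry an `Alert` object with `state`, `service_name`, `label` and `text`) is what I would expect from Ambari 2.x, but treat it as an assumption and check it against your own server; the sample payload is made up for illustration.

```python
import json


def alerts_url(base_url, cluster):
    """URL for listing all current alerts of a cluster with every field included."""
    return "%s/api/v1/clusters/%s/alerts?fields=*" % (base_url.rstrip("/"), cluster)


def failing_alerts(response_text):
    """Return (state, service, label, text) tuples for WARNING and CRITICAL alerts.

    CRITICAL entries are listed first; the original order is kept otherwise.
    """
    items = json.loads(response_text).get("items", [])
    failing = []
    for item in items:
        alert = item.get("Alert", {})
        if alert.get("state") in ("WARNING", "CRITICAL"):
            failing.append((
                alert.get("state"),
                alert.get("service_name"),
                alert.get("label"),
                alert.get("text"),
            ))
    failing.sort(key=lambda entry: 0 if entry[0] == "CRITICAL" else 1)
    return failing


# Made-up sample payload, only to show the expected shape.
sample = json.dumps({"items": [
    {"Alert": {"state": "OK", "service_name": "HDFS",
               "label": "NameNode Web UI", "text": "HTTP 200 response"}},
    {"Alert": {"state": "WARNING", "service_name": "YARN",
               "label": "NodeManager Health", "text": "1 NodeManager unhealthy"}},
    {"Alert": {"state": "CRITICAL", "service_name": "HDFS",
               "label": "DataNode Process", "text": "Connection refused"}},
]})

for state, service, label, text in failing_alerts(sample):
    print("%-8s %-6s %s: %s" % (state, service, label, text))
```

Running something like this from cron and mailing the output is a cheap addition to the built-in notifications, not a replacement for them.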

- Ambari also lets you create and register your own custom alerts, based on your own logic and needs:

https://community.hortonworks.com/articles/38149/how-to-create-and-register-custom-ambari-alerts.htm...
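To give an idea of what such a custom alert looks like: a script-based alert is a Python file that exposes `get_tokens()` and `execute(...)`, where `execute` returns a result code (`OK`, `WARNING`, `CRITICAL` or `UNKNOWN`) together with a list holding the message. The sketch below checks how full a filesystem is. The parameter names (`check.path`, `warning.percent`, `critical.percent`) and the thresholds are invented for this example, and the JSON definition that registers the script with Ambari is not shown; the article linked above covers that part.

```python
import os

# Hypothetical parameter names; a real alert would declare its own in the
# JSON alert definition that points at this script.
PATH_KEY = "check.path"
WARN_KEY = "warning.percent"
CRIT_KEY = "critical.percent"


def get_tokens():
    """Cluster configuration properties the alert needs; none in this sketch."""
    return ()


def used_percent(path):
    """Percentage of the filesystem holding `path` that is in use."""
    stats = os.statvfs(path)
    total = stats.f_blocks * stats.f_frsize
    available = stats.f_bavail * stats.f_frsize
    if total == 0:
        return 0.0
    return 100.0 * (total - available) / total


def classify(percent, warn, crit):
    """Map a usage percentage to an alert state."""
    if percent >= crit:
        return "CRITICAL"
    if percent >= warn:
        return "WARNING"
    return "OK"


def execute(configurations=None, parameters=None, host_name=None):
    """Entry point called by the alert framework; returns (state, [message])."""
    parameters = parameters or {}
    path = parameters.get(PATH_KEY, "/")
    warn = float(parameters.get(WARN_KEY, 80))
    crit = float(parameters.get(CRIT_KEY, 90))
    try:
        percent = used_percent(path)
    except OSError as err:
        return ("UNKNOWN", ["Could not check {0}: {1}".format(path, err)])
    state = classify(percent, warn, crit)
    return (state, ["{0} is {1:.1f}% full".format(path, percent)])
```

You can try the script on any node before registering it by calling `execute(parameters={"check.path": "/"})` from a Python shell.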

- For HDP cluster maintenance best practices, you can refer to the following article by @Kuldeep Kulkarni, which covers a few basic and useful maintenance scenarios:

https://community.hortonworks.com/articles/26518/hadoop-cluster-maintenance.html

Hi @Ajay Sharma, I'd also strongly recommend our support subscription and implementing SmartSense: http://hortonworks.com/products/subscriptions/smartsense/. It is the quickest way to get a complete overview of your cluster health, based on best practices and your real-time workloads.