We have just started using HWX in our lab cluster, and we are using Ambari to manage it. It would be a great help if someone could share best practices for cluster health checks and maintenance, including do's and don'ts for the cluster.
Ambari provides service check scripts that quickly verify whether each cluster service is running correctly. You can also run them manually from the Ambari UI:
Ambari --> HDFS --> Service Actions --> Run Service Check.
- You can do the same for the other services as well.
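If you would rather script the check than click through the UI, the same action can usually be triggered through Ambari's REST API. The snippet below is only a rough sketch: the host, cluster name, and credentials you pass in are placeholders, and the `<SERVICE>_SERVICE_CHECK` command name is a convention that holds for HDFS and many other services but not all of them (ZooKeeper, for instance, uses `ZOOKEEPER_QUORUM_SERVICE_CHECK`), so please verify it against your Ambari version.

```python
import base64
import json
import urllib.request


def build_service_check_request(ambari_url, cluster, service, user, password):
    """Build (but do not send) the POST request asking Ambari to run a service check.

    Assumption: the command is named "<SERVICE>_SERVICE_CHECK". This is true for
    HDFS and many services, but some use a different command name.
    """
    payload = {
        "RequestInfo": {
            "context": f"{service} Service Check (via REST)",
            "command": f"{service}_SERVICE_CHECK",
        },
        "Requests/resource_filters": [{"service_name": service}],
    }
    token = base64.b64encode(f"{user}:{password}".encode()).decode()
    return urllib.request.Request(
        url=f"{ambari_url}/api/v1/clusters/{cluster}/requests",
        data=json.dumps(payload).encode(),
        method="POST",
        headers={
            "Authorization": f"Basic {token}",
            # Ambari rejects modifying requests that lack this header.
            "X-Requested-By": "ambari",
        },
    )


def send(request):
    """Send the request and return (HTTP status, response body as text)."""
    with urllib.request.urlopen(request) as resp:
        return resp.status, resp.read().decode()
```

To use it, build a request with your own values, for example `build_service_check_request("http://ambari.example.com:8080", "mycluster", "HDFS", "admin", "admin")` (all hypothetical), and pass it to `send`. The result of the check then shows up in Ambari's background operations list, the same as when it is started from the UI.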
In addition to the service checks, Ambari also provides a set of predefined alerts and notifications that you can use to monitor the whole cluster and get notified when something goes wrong or behaves unexpectedly.
- Ambari also allows you to create and register your own custom alerts, based on your own logic and needs.
- For HDP cluster maintenance best practices, you can refer to the following article written by @Kuldeep Kulkarni, which covers a few very basic and useful maintenance scenarios.
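As a small illustration of using the alerts mentioned above for a routine health check, here is a sketch that picks the non-OK alerts out of an Ambari alerts response. It assumes you have already fetched the JSON body from `GET /api/v1/clusters/<cluster>/alerts?fields=*`, and that each item carries an `Alert` object with `state`, `label`, and `service_name` fields; please confirm that shape on your own Ambari version. The function name `failing_alerts` is my own, not part of Ambari.

```python
import json


def failing_alerts(response_body, bad_states=("CRITICAL", "WARNING")):
    """Return (service_name, label, state) for every alert in a bad state.

    Assumption: response_body is the JSON text returned by Ambari's alerts
    endpoint, shaped like {"items": [{"Alert": {...}}, ...]}.
    """
    items = json.loads(response_body).get("items", [])
    failing = []
    for item in items:
        alert = item.get("Alert", {})
        if alert.get("state") in bad_states:
            failing.append(
                (alert.get("service_name"), alert.get("label"), alert.get("state"))
            )
    return failing
```

Something like this could run from cron as a daily check, in addition to (not instead of) the email or SNMP notifications that Ambari can send by itself.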
Hi @Ajay Sharma, I'd also strongly recommend our support subscription and implementing SmartSense: http://hortonworks.com/products/subscriptions/smartsense/ . It is the quickest and best way to get a complete overview of your cluster health, based on best practices and your real-time workloads.