Can somebody give me a list of things to monitor at hadoop level (including yarn application level) and OS level as well, apart from the things Ambari monitors already by default..... for a Production environment?
Apart from ambari metrics, I will suggest to refer metrics from Grafana.
Grafana is by default provided as a service in Ambari [latest version] and has all the metrics needed for monitoring Production environment.
Pls refer few links below -
@PJ I see Grafana4.0 is available with alerting. Latest version of HDP is available with below grafana version and doesnot include alerting.
Grafana version: 2.6.0, commit: v2.6.0, build date: 2015-12-14 19:48:01
Currently you can refer grafana only for metrics.