I would like to know if there is a way to get metrics about cpu and memory usage.
For example, I would like to highlight the effect of a skew join on reducers, disk I/O, mapper memory usage during the querying, etc.
I saw really interesting slides about performance comparison (Hortonworks), with some graphes and bars, etc. and I was wondering the method to get those values...
One approach will be to use hive JMX based approach to collect some of those details like: https://community.hortonworks.com/articles/62211/enabling-jmx-monitoring-for-hiveserver2.html
Grafana also can provide many graphs related to Hive:
Thanks a lot for your reply.
But, through JMX, will I be able to monitor a particular query ? Or just the global activity of the JVM ?