I would like to know how the load_1, load_5 and load_15 metrics are generated.
I am running CDH version 5.4.5.
For a multi-core node, how can we identify the maximum safe value for the LOAD_1 average? Our nodes with 24 cores can show a LOAD_1 of almost 300 when the load is high, while nodes with fewer cores usually only reach about 40 to 50.
Currently, the hardware in our system is monitored by another system, with a threshold of 90. All the 24-core nodes constantly generate alerts even when they are not actually busy.
So we need this information to set the right alert threshold for nodes with different numbers of cores.
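
To illustrate what I have in mind, here is a minimal sketch of a per-core threshold. It assumes that load_1 corresponds to the standard Linux 1-minute load average, which I have not confirmed for CDH. The function names and the `per_core_threshold` value are hypothetical placeholders, not recommended settings.

```python
import os


def per_core_load(load_avg: float, cores: int) -> float:
    """Return the load average divided by the number of cores."""
    if cores <= 0:
        raise ValueError("cores must be a positive integer")
    return load_avg / cores


def should_alert(load_avg: float, cores: int, per_core_threshold: float = 1.0) -> bool:
    """Return True when the per-core load exceeds the threshold.

    per_core_threshold is a hypothetical tuning value, not a known safe limit.
    """
    return per_core_load(load_avg, cores) > per_core_threshold


if __name__ == "__main__":
    # os.getloadavg() returns the 1, 5 and 15 minute load averages on Unix.
    load_1, load_5, load_15 = os.getloadavg()
    cores = os.cpu_count() or 1  # cpu_count() may return None
    print(f"load_1={load_1:.2f} cores={cores} "
          f"per_core={per_core_load(load_1, cores):.2f} "
          f"alert={should_alert(load_1, cores)}")
```

With something like this, a fixed threshold of 90 would become a per-core value, so a 24-core node and a smaller node would be judged on the same scale. What I still do not know is which per-core value is actually safe.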