We keep getting this kind of alert. Because too many queries are being submitted to the cluster, we are getting these alerts. Even after restarting the NameNode, the alerts do not go away. Is there a way to clear them?
There are two variants of the "NameNode Heap Usage" alert (Daily and Weekly). It is a service-level alert that is triggered if the NameNode heap usage deviation has grown beyond the specified threshold within the given period.
It only reports the deviation in heap usage when it goes beyond the configured threshold. That helps us plan in advance whether we need to increase the NameNode heap (provided the threshold has been set properly).
The default heap growth rate is set to 50% for CRITICAL and 20% for WARNING. The right values depend on the usage and type of your cluster and HDFS, so you can adjust them based on your requirements and on how heavily the cluster is used.
You can check the value mentioned in the alert message to see whether heap usage increased by 20% or 50% compared with the previous day. (Please share a screenshot of the alert or its message.)
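To make the threshold comparison concrete, here is a minimal Python sketch of the check as described above: compare today's heap usage with the previous day's and classify the growth against the 20% / 50% defaults. This is only an illustration, not Ambari's actual alert code. The function names, the sample readings, and the choice to treat a value exactly at the threshold as a breach are all assumptions.

```python
# Default thresholds as stated in the answer above (percent growth).
WARNING_PCT = 20.0
CRITICAL_PCT = 50.0


def growth_percent(previous_mb: float, current_mb: float) -> float:
    """Percentage change in heap usage relative to the earlier reading."""
    if previous_mb <= 0:
        raise ValueError("previous_mb must be positive")
    return (current_mb - previous_mb) / previous_mb * 100.0


def classify(growth_pct: float,
             warning: float = WARNING_PCT,
             critical: float = CRITICAL_PCT) -> str:
    """Map a growth percentage to an alert state.

    Assumption: reaching a threshold exactly counts as breaching it.
    """
    if growth_pct >= critical:
        return "CRITICAL"
    if growth_pct >= warning:
        return "WARNING"
    return "OK"


if __name__ == "__main__":
    # Hypothetical readings: 4000 MB used yesterday, 5000 MB used today.
    pct = growth_percent(4000, 5000)
    print(f"{pct:.1f}% -> {classify(pct)}")
```

If you raise the thresholds for a busy cluster, pass them explicitly, for example `classify(pct, warning=40.0, critical=80.0)`, to see how the same growth would be reported.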