How to get rid of stale alerts in Ambari
Labels: Apache Ambari
Created on ‎01-13-2016 04:27 PM - edited ‎08-19-2019 05:15 AM
I have restarted Ambari Server, all agents, and the complete HDP stack multiple times in the past 5 days for different activities, but these alerts don't go away.
Created ‎01-20-2016 07:20 AM
Hi @Pardeep, with Support's help we got rid of those alerts by adding 'misfire_grace_time':10 to APS_CONFIG in /usr/lib/python2.6/site-packages/ambari_agent/AlertSchedulerHandler.py on every node. After the update, that section should read:
APS_CONFIG = { 'threadpool.core_threads': 3, 'coalesce': True, 'standalone': False, 'misfire_grace_time':10 }
With this setting, a scheduled alert check is still allowed to run if it starts up to 10 seconds late. After making the change, restart all ambari-agents. We tried it on one cluster and it worked. This is most likely fixed in Ambari 2.2 but happens in 2.1.2.
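For readers wondering what the grace period changes, here is a minimal sketch of the idea, not the Ambari or APScheduler code itself. It assumes the usual APScheduler meaning of misfire_grace_time (how late a job may start and still run); `should_run` is a hypothetical helper written only for illustration, and the dict is the one from the post above, laid out one key per line.

```python
from datetime import datetime, timedelta

# Same keys and values as in the post, one per line so that a missing
# comma is easy to spot.
APS_CONFIG = {
    'threadpool.core_threads': 3,
    'coalesce': True,
    'standalone': False,
    'misfire_grace_time': 10,
}


def should_run(scheduled_at, now, grace_seconds):
    """Hypothetical helper: True if a late job is still inside its grace period."""
    lateness = now - scheduled_at
    return lateness <= timedelta(seconds=grace_seconds)


scheduled = datetime(2016, 1, 20, 7, 0, 0)
grace = APS_CONFIG['misfire_grace_time']

# A check that starts 4 seconds late is inside the 10 second window.
print(should_run(scheduled, scheduled + timedelta(seconds=4), grace))
# A check that starts 12 seconds late counts as a misfire and is skipped.
print(should_run(scheduled, scheduled + timedelta(seconds=12), grace))
```

On a busy agent, checks that start late are skipped when the window is too short, which would fit the stale alerts described in this thread.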
Created ‎04-08-2016 01:31 PM
Hi,
Today I had the same problem, and I fixed it by setting this on all nodes in this file:
/usr/lib/python2.6/site-packages/ambari_agent/AlertSchedulerHandler.py
APS_CONFIG = { 'threadpool.core_threads': 3, 'coalesce': True, 'standalone': False, 'misfire_grace_time':10 }
But I would like to know what this is: is it a bug, or do I have a problem with my cluster?
Thanks
Created ‎08-11-2020 01:40 AM
I did this as the root user: I found the file and changed it there. But how do I change it on each node?
Created ‎04-11-2016 11:16 AM
Hi,
Today the alert appeared again, so I think there is another problem. Any suggestions?
Many thanks.
Created ‎04-11-2016 06:06 PM
FWIW, you can disable this alert. Click "Enabled" next to State in the upper-right corner and this alert will no longer be checked.
Created ‎05-02-2016 08:23 AM
I have the same issue with a fresh install of HDP 2.4 on SLES 11.4.
Many of the alerts are about timeouts when checking the UIs:
Data Node UI, Node Manager UI, Atlas UI, Oozie UI, etc.
All these URLs are reachable both from the Windows laptops with a browser and from the HDP nodes using wget in the console.
How does Ambari check these URLs? Is it possible that the check scripts ignore OS-wide proxy and firewall configurations?
I would also like to know what the "24-Hour" column means. Does anyone have an idea? This field is not mentioned at all in the docs. Its content is always "0" in my cluster (as in the posted screenshot).
Many thanks
Created ‎05-02-2016 08:27 AM
Hi,
Have you done this?
/usr/lib/python2.6/site-packages/ambari_agent/AlertSchedulerHandler.py
APS_CONFIG = { 'threadpool.core_threads': 3, 'coalesce': True, 'standalone': False, 'misfire_grace_time':10 }
Also, check the alerts in the Ambari database.
Regards
Created ‎05-02-2016 08:54 AM
In my Ambari the code looks like this:
self.APS_CONFIG = { 'apscheduler.threadpool.core_threads': 3, 'apscheduler.coalesce': True, 'apscheduler.standalone': False, 'apscheduler.misfire_grace_time': alert_grace_period }
Created ‎05-15-2017 08:21 AM
In Ambari 2.2 misfire_grace_time is configurable by changing the value for the variable alert_grace_period.
This variable is configured in /etc/ambari-agent/conf/ambari-agent.ini and the default is:
alert_grace_period=5
You can increase it to 10 seconds to match the answer above.
This needs to be done on all hosts running ambari-agent, and afterwards ambari-agent needs to be restarted.
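If you would rather script the change than edit the file by hand on every host, something like the following might work as a starting point. It is only a sketch: the function name `set_alert_grace_period` is made up for illustration, and it assumes alert_grace_period sits in the `[agent]` section of ambari-agent.ini, which you should confirm in your own file. It works on the file contents as a string, so reading and writing /etc/ambari-agent/conf/ambari-agent.ini is left to you.

```python
import configparser
import io


def set_alert_grace_period(ini_text, seconds, section="agent"):
    """Return ini_text with alert_grace_period set to `seconds`.

    The section name "agent" is an assumption; check your own
    ambari-agent.ini before relying on it.
    """
    parser = configparser.ConfigParser(interpolation=None)
    parser.optionxform = str  # keep key names exactly as written
    parser.read_string(ini_text)
    if not parser.has_section(section):
        parser.add_section(section)
    parser.set(section, "alert_grace_period", str(seconds))
    out = io.StringIO()
    parser.write(out, space_around_delimiters=False)
    return out.getvalue()


# Small made-up sample, not a complete ambari-agent.ini.
SAMPLE = """[agent]
loglevel=INFO
alert_grace_period=5
"""

updated = set_alert_grace_period(SAMPLE, 10)
print(updated)
```

Note that configparser drops comment lines when it writes the file back, so keep a backup of the original, and restart ambari-agent afterwards as described above.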
Created ‎06-14-2017 09:34 PM
I tried setting this parameter:
APS_CONFIG ={'threadpool.core_threads':3,'coalesce':True,'standalone':False,'misfire_grace_time':10}
in the following file: /usr/lib/python2.6/site-packages/ambari_agent/AlertSchedulerHandler.py
I changed the parameter and restarted all ambari-agents, but they failed with the following error:
====================================================================================
Traceback (most recent call last):
  File "/usr/lib/python2.6/site-packages/ambari_agent/AmbariAgent.py", line 24, in <module>
    from Controller import AGENT_AUTO_RESTART_EXIT_CODE
  File "/usr/lib/python2.6/site-packages/ambari_agent/Controller.py", line 44, in <module>
    from ambari_agent.AlertSchedulerHandler import AlertSchedulerHandler
  File "/usr/lib/python2.6/site-packages/ambari_agent/AlertSchedulerHandler.py", line 50
    'misfire_grace_time':10
    ^
SyntaxError: invalid syntax
=========================================================================================
Please help me fix this issue.
I appreciate the help.
