WARNING 2018-10-03 01:16:49,660 base_alert.py:138 - [Alert][datanode_health_summary] Unable to execute alert. [Alert][datanode_health_summary] Unable to extract JSON from JMX response
WARNING 2018-10-03 01:16:49,671 base_alert.py:138 - [Alert][namenode_directory_status] Unable to execute alert. [Alert][namenode_directory_status] Unable to extract JSON from JMX response
WARNING 2018-10-03 01:16:49,672 base_alert.py:138 - [Alert][smartsense_gateway_status] Unable to execute alert. [Alert][smartsense_gateway_status] Unable to extract JSON from JMX response
WARNING 2018-10-03 01:16:49,704 base_alert.py:138 - [Alert][ambari_agent_disk_usage] Unable to execute alert. [Errno 2] No such file or directory: '/usr/hdp'
INFO 2018-10-03 01:16:49,705 logger.py:75 - Pid file /var/run/ambari-metrics-monitor/ambari-metrics-monitor.pid is empty or does not exist
INFO 2018-10-03 01:16:49,705 logger.py:75 - Pid file /var/run/ambari-metrics-monitor/ambari-metrics-monitor.pid is empty or does not exist
ERROR 2018-10-03 01:16:49,705 script_alert.py:123 - [Alert][ams_metrics_monitor_process] Failed with result CRITICAL: ['Ambari Monitor is NOT running on testserver-gbs-auto-cog1.sl1027443.sl.edst.ibm.com']
ERROR 2018-10-03 01:16:49,705 script_alert.py:123 - [Alert][ams_metrics_monitor_process] Failed with result CRITICAL: ['Ambari Monitor is NOT running on testserver-gbs-auto-cog1.sl1027443.sl.edst.ibm.com']
INFO 2018-10-03 01:17:10,764 Controller.py:304 - Heartbeat (response id = 179641) with server is running...
INFO 2018-10-03 01:17:10,765 Controller.py:311 - Building heartbeat message
INFO 2018-10-03 01:17:10,767 Heartbeat.py:87 - Adding host info/state to heartbeat message.
INFO 2018-10-03 01:17:11,107 logger.py:75 - Testing the JVM's JCE policy to see it if supports an unlimited key length.
INFO 2018-10-03 01:17:11,107 logger.py:75 - Testing the JVM's JCE policy to see it if supports an unlimited key length.
INFO 2018-10-03 01:17:11,477 Hardware.py:188 - Some mount points were ignored: /dev/shm
INFO 2018-10-03 01:17:11,478 Controller.py:320 - Sending Heartbeat (id = 179641)
INFO 2018-10-03 01:17:11,521 Controller.py:333 - Heartbeat response received (id = 179642)
INFO 2018-10-03 01:17:11,521 Controller.py:342 - Heartbeat interval is 1 seconds
INFO 2018-10-03 01:17:11,521 Controller.py:380 - Updating configurations from heartbeat
INFO 2018-10-03 01:17:11,521 Controller.py:389 - Adding cancel/execution commands
INFO 2018-10-03 01:17:11,521 Controller.py:475 - Waiting 0.9 for next heartbeat
INFO 2018-10-03 01:17:12,422 Controller.py:482 - Wait for next heartbeat over
ERROR 2018-10-03 01:17:49,543 script_alert.py:123 - [Alert][hive_webhcat_server_status] Failed with result CRITICAL: ['Connection failed to http://testserver-gbs-auto-cog1.sl1027443.sl.edst.ibm.com:50111/templeton/v1/status?user.name=ambari-qa + \nTraceback (most recent call last):\n File "/var/lib/ambari-agent/cache/common-services/HIVE/0.12.0.2.0/package/alerts/alert_webhcat_server.py", line 190, in execute\n url_response = urllib2.urlopen(query_url, timeout=connection_timeout)\n File "/usr/lib64/python2.6/urllib2.py", line 126, in urlopen\n return _opener.open(url, data, timeout)\n File "/usr/lib64/python2.6/urllib2.py", line 391, in open\n response = self._open(req, data)\n File "/usr/lib64/python2.6/urllib2.py", line 409, in _open\n \'_open\', req)\n File "/usr/lib64/python2.6/urllib2.py", line 369, in _call_chain\n result = func(*args)\n File "/usr/lib64/python2.6/urllib2.py", line 1190, in http_open\n return self.do_open(httplib.HTTPConnection, req)\n File "/usr/lib64/python2.6/urllib2.py", line 1165, in do_open\n raise URLError(err)\nURLError: \n']
ERROR 2018-10-03 01:17:49,543 script_alert.py:123 - [Alert][hive_webhcat_server_status] Failed with result CRITICAL: ['Connection failed to http://testserver-gbs-auto-cog1.sl1027443.sl.edst.ibm.com:50111/templeton/v1/status?user.name=ambari-qa + \nTraceback (most recent call last):\n File "/var/lib/ambari-agent/cache/common-services/HIVE/0.12.0.2.0/package/alerts/alert_webhcat_server.py", line 190, in execute\n url_response = urllib2.urlopen(query_url, timeout=connection_timeout)\n File "/usr/lib64/python2.6/urllib2.py", line 126, in urlopen\n return _opener.open(url, data, timeout)\n File "/usr/lib64/python2.6/urllib2.py", line 391, in open\n response = self._open(req, data)\n File "/usr/lib64/python2.6/urllib2.py", line 409, in _open\n \'_open\', req)\n File "/usr/lib64/python2.6/urllib2.py", line 369, in _call_chain\n result = func(*args)\n File "/usr/lib64/python2.6/urllib2.py", line 1190, in http_open\n return self.do_open(httplib.HTTPConnection, req)\n File "/usr/lib64/python2.6/urllib2.py", line 1165, in do_open\n raise URLError(err)\nURLError: \n']
ERROR 2018-10-03 01:17:49,553 script_alert.py:123 - [Alert][yarn_nodemanager_health] Failed with result CRITICAL: ['Connection failed to http://testserver-gbs-auto-cog1.sl1027443.sl.edst.ibm.com:8042/ws/v1/node/info (Traceback (most recent call last):\n File "/var/lib/ambari-agent/cache/common-services/YARN/2.1.0.2.0/package/alerts/alert_nodemanager_health.py", line 171, in execute\n url_response = urllib2.urlopen(query, timeout=connection_timeout)\n File "/usr/lib64/python2.6/urllib2.py", line 126, in urlopen\n return _opener.open(url, data, timeout)\n File "/usr/lib64/python2.6/urllib2.py", line 391, in open\n response = self._open(req, data)\n File "/usr/lib64/python2.6/urllib2.py", line 409, in _open\n \'_open\', req)\n File "/usr/lib64/python2.6/urllib2.py", line 369, in _call_chain\n result = func(*args)\n File "/usr/lib64/python2.6/urllib2.py", line 1190, in http_open\n return self.do_open(httplib.HTTPConnection, req)\n File "/usr/lib64/python2.6/urllib2.py", line 1165, in do_open\n raise URLError(err)\nURLError: \n)']
ERROR 2018-10-03 01:17:49,553 script_alert.py:123 - [Alert][yarn_nodemanager_health] Failed with result CRITICAL: ['Connection failed to http://testserver-gbs-auto-cog1.sl1027443.sl.edst.ibm.com:8042/ws/v1/node/info (Traceback (most recent call last):\n File "/var/lib/ambari-agent/cache/common-services/YARN/2.1.0.2.0/package/alerts/alert_nodemanager_health.py", line 171, in execute\n url_response = urllib2.urlopen(query, timeout=connection_timeout)\n File "/usr/lib64/python2.6/urllib2.py", line 126, in urlopen\n return _opener.open(url, data, timeout)\n File "/usr/lib64/python2.6/urllib2.py", line 391, in open\n response = self._open(req, data)\n File "/usr/lib64/python2.6/urllib2.py", line 409, in _open\n \'_open\', req)\n File "/usr/lib64/python2.6/urllib2.py", line 369, in _call_chain\n result = func(*args)\n File "/usr/lib64/python2.6/urllib2.py", line 1190, in http_open\n return self.do_open(httplib.HTTPConnection, req)\n File "/usr/lib64/python2.6/urllib2.py", line 1165, in do_open\n raise URLError(err)\nURLError: \n)']
WARNING 2018-10-03 01:17:49,579 base_alert.py:138 - [Alert][namenode_hdfs_pending_deletion_blocks] Unable to execute alert. [Alert][namenode_hdfs_pending_deletion_blocks] Unable to extract JSON from JMX response
WARNING 2018-10-03 01:17:49,583 base_alert.py:138 - [Alert][datanode_health_summary] Unable to execute alert. [Alert][datanode_health_summary] Unable to extract JSON from JMX response
WARNING 2018-10-03 01:17:49,585 base_alert.py:138 - [Alert][datanode_heap_usage] Unable to execute alert. [Alert][datanode_heap_usage] Unable to extract JSON from JMX response
ERROR 2018-10-03 01:17:49,589 script_alert.py:123 - [Alert][datanode_unmounted_data_dir] Failed with result CRITICAL: ['The following data dir(s) were not found: /data/hadoop/hdfs/data\n/hadoop/hdfs/data\n']
WARNING 2018-10-03 01:17:49,590 base_alert.py:138 - [Alert][namenode_hdfs_blocks_health] Unable to execute alert. [Alert][namenode_hdfs_blocks_health] Unable to extract JSON from JMX response
ERROR 2018-10-03 01:17:49,589 script_alert.py:123 - [Alert][datanode_unmounted_data_dir] Failed with result CRITICAL: ['The following data dir(s) were not found: /data/hadoop/hdfs/data\n/hadoop/hdfs/data\n']
WARNING 2018-10-03 01:17:49,596 base_alert.py:138 - [Alert][datanode_storage] Unable to execute alert. [Alert][datanode_storage] Unable to extract JSON from JMX response
WARNING 2018-10-03 01:17:49,600 base_alert.py:138 - [Alert][namenode_directory_status] Unable to execute alert. [Alert][namenode_directory_status] Unable to extract JSON from JMX response
WARNING 2018-10-03 01:17:49,602 base_alert.py:138 - [Alert][namenode_hdfs_capacity_utilization] Unable to execute alert. [Alert][namenode_hdfs_capacity_utilization] Unable to extract JSON from JMX response
WARNING 2018-10-03 01:17:49,610 base_alert.py:138 - [Alert][namenode_rpc_latency] Unable to execute alert. [Alert][namenode_rpc_latency] Unable to extract JSON from JMX response
WARNING 2018-10-03 01:17:49,617 base_alert.py:138 - [Alert][smartsense_bundle_failed_or_timedout] Unable to execute alert. [Alert][smartsense_bundle_failed_or_timedout] Unable to extract JSON from JMX response
WARNING 2018-10-03 01:17:49,620 base_alert.py:138 - [Alert][smartsense_gateway_status] Unable to execute alert. [Alert][smartsense_gateway_status] Unable to extract JSON from JMX response
WARNING 2018-10-03 01:17:49,651 base_alert.py:138 - [Alert][ambari_agent_disk_usage] Unable to execute alert. [Errno 2] No such file or directory: '/usr/hdp'
INFO 2018-10-03 01:17:49,652 logger.py:75 - Pid file /var/run/ambari-metrics-monitor/ambari-metrics-monitor.pid is empty or does not exist
INFO 2018-10-03 01:17:49,652 logger.py:75 - Pid file /var/run/ambari-metrics-monitor/ambari-metrics-monitor.pid is empty or does not exist
ERROR 2018-10-03 01:17:49,653 script_alert.py:123 - [Alert][ams_metrics_monitor_process] Failed with result CRITICAL: ['Ambari Monitor is NOT running on testserver-gbs-auto-cog1.sl1027443.sl.edst.ibm.com']
ERROR 2018-10-03 01:17:49,653 script_alert.py:123 - [Alert][ams_metrics_monitor_process] Failed with result CRITICAL: ['Ambari Monitor is NOT running on testserver-gbs-auto-cog1.sl1027443.sl.edst.ibm.com']
INFO 2018-10-03 01:18:10,892 Controller.py:304 - Heartbeat (response id = 179704) with server is running...
INFO 2018-10-03 01:18:10,893 Controller.py:311 - Building heartbeat message
INFO 2018-10-03 01:18:10,895 Heartbeat.py:87 - Adding host info/state to heartbeat message.
INFO 2018-10-03 01:18:11,141 logger.py:75 - Testing the JVM's JCE policy to see it if supports an unlimited key length.
INFO 2018-10-03 01:18:11,141 logger.py:75 - Testing the JVM's JCE policy to see it if supports an unlimited key length.
INFO 2018-10-03 01:18:11,478 Hardware.py:188 - Some mount points were ignored: /dev/shm
INFO 2018-10-03 01:18:11,480 Controller.py:320 - Sending Heartbeat (id = 179704)
INFO 2018-10-03 01:18:11,522 Controller.py:333 - Heartbeat response received (id = 179705)
INFO 2018-10-03 01:18:11,522 Controller.py:342 - Heartbeat interval is 1 seconds
INFO 2018-10-03 01:18:11,522 Controller.py:380 - Updating configurations from heartbeat
INFO 2018-10-03 01:18:11,522 Controller.py:389 - Adding cancel/execution commands
INFO 2018-10-03 01:18:11,522 Controller.py:475 - Waiting 0.9 for next heartbeat
INFO 2018-10-03 01:18:12,423 Controller.py:482 - Wait for next heartbeat over
ERROR 2018-10-03 01:18:49,574 script_alert.py:123 - [Alert][hive_webhcat_server_status] Failed with result CRITICAL: ['Connection failed to http://testserver-gbs-auto-cog1.sl1027443.sl.edst.ibm.com:50111/templeton/v1/status?user.name=ambari-qa + \nTraceback (most recent call last):\n File "/var/lib/ambari-agent/cache/common-services/HIVE/0.12.0.2.0/package/alerts/alert_webhcat_server.py", line 190, in execute\n url_response = urllib2.urlopen(query_url, timeout=connection_timeout)\n File "/usr/lib64/python2.6/urllib2.py", line 126, in urlopen\n return _opener.open(url, data, timeout)\n File "/usr/lib64/python2.6/urllib2.py", line 391, in open\n response = self._open(req, data)\n File "/usr/lib64/python2.6/urllib2.py", line 409, in _open\n \'_open\', req)\n File "/usr/lib64/python2.6/urllib2.py", line 369, in _call_chain\n result = func(*args)\n File "/usr/lib64/python2.6/urllib2.py", line 1190, in http_open\n return self.do_open(httplib.HTTPConnection, req)\n File "/usr/lib64/python2.6/urllib2.py", line 1165, in do_open\n raise URLError(err)\nURLError: \n']
ERROR 2018-10-03 01:18:49,574 script_alert.py:123 - [Alert][hive_webhcat_server_status] Failed with result CRITICAL: ['Connection failed to http://testserver-gbs-auto-cog1.sl1027443.sl.edst.ibm.com:50111/templeton/v1/status?user.name=ambari-qa + \nTraceback (most recent call last):\n File "/var/lib/ambari-agent/cache/common-services/HIVE/0.12.0.2.0/package/alerts/alert_webhcat_server.py", line 190, in execute\n url_response = urllib2.urlopen(query_url, timeout=connection_timeout)\n File "/usr/lib64/python2.6/urllib2.py", line 126, in urlopen\n return _opener.open(url, data, timeout)\n File "/usr/lib64/python2.6/urllib2.py", line 391, in open\n response = self._open(req, data)\n File "/usr/lib64/python2.6/urllib2.py", line 409, in _open\n \'_open\', req)\n File "/usr/lib64/python2.6/urllib2.py", line 369, in _call_chain\n result = func(*args)\n File "/usr/lib64/python2.6/urllib2.py", line 1190, in http_open\n return self.do_open(httplib.HTTPConnection, req)\n File "/usr/lib64/python2.6/urllib2.py", line 1165, in do_open\n raise URLError(err)\nURLError: \n']
ERROR 2018-10-03 01:18:49,591 script_alert.py:123 - [Alert][yarn_nodemanager_health] Failed with result CRITICAL: ['Connection failed to http://testserver-gbs-auto-cog1.sl1027443.sl.edst.ibm.com:8042/ws/v1/node/info (Traceback (most recent call last):\n File "/var/lib/ambari-agent/cache/common-services/YARN/2.1.0.2.0/package/alerts/alert_nodemanager_health.py", line 171, in execute\n url_response = urllib2.urlopen(query, timeout=connection_timeout)\n File "/usr/lib64/python2.6/urllib2.py", line 126, in urlopen\n return _opener.open(url, data, timeout)\n File "/usr/lib64/python2.6/urllib2.py", line 391, in open\n response = self._open(req, data)\n File "/usr/lib64/python2.6/urllib2.py", line 409, in _open\n \'_open\', req)\n File "/usr/lib64/python2.6/urllib2.py", line 369, in _call_chain\n result = func(*args)\n File "/usr/lib64/python2.6/urllib2.py", line 1190, in http_open\n return self.do_open(httplib.HTTPConnection, req)\n File "/usr/lib64/python2.6/urllib2.py", line 1165, in do_open\n raise URLError(err)\nURLError: \n)']
ERROR 2018-10-03 01:18:49,591 script_alert.py:123 - [Alert][yarn_nodemanager_health] Failed with result CRITICAL: ['Connection failed to http://testserver-gbs-auto-cog1.sl1027443.sl.edst.ibm.com:8042/ws/v1/node/info (Traceback (most recent call last):\n File "/var/lib/ambari-agent/cache/common-services/YARN/2.1.0.2.0/package/alerts/alert_nodemanager_health.py", line 171, in execute\n url_response = urllib2.urlopen(query, timeout=connection_timeout)\n File "/usr/lib64/python2.6/urllib2.py", line 126, in urlopen\n return _opener.open(url, data, timeout)\n File "/usr/lib64/python2.6/urllib2.py", line 391, in open\n response = self._open(req, data)\n File "/usr/lib64/python2.6/urllib2.py", line 409, in _open\n \'_open\', req)\n File "/usr/lib64/python2.6/urllib2.py", line 369, in _call_chain\n result = func(*args)\n File "/usr/lib64/python2.6/urllib2.py", line 1190, in http_open\n return self.do_open(httplib.HTTPConnection, req)\n File "/usr/lib64/python2.6/urllib2.py", line 1165, in do_open\n raise URLError(err)\nURLError: \n)']
WARNING 2018-10-03 01:18:49,607 base_alert.py:138 - [Alert][datanode_health_summary] Unable to execute alert. [Alert][datanode_health_summary] Unable to extract JSON from JMX response
WARNING 2018-10-03 01:18:49,610 base_alert.py:138 - [Alert][namenode_directory_status] Unable to execute alert. [Alert][namenode_directory_status] Unable to extract JSON from JMX response
WARNING 2018-10-03 01:18:49,619 base_alert.py:138 - [Alert][smartsense_gateway_status] Unable to execute alert. [Alert][smartsense_gateway_status] Unable to extract JSON from JMX response
INFO 2018-10-03 01:18:49,623 logger.py:75 - Execute['export HIVE_CONF_DIR='/etc/hive/conf.server' ; hive --hiveconf hive.metastore.uris=thrift://testserver-gbs-auto-cog1.sl1027443.sl.edst.ibm.com:9083 --hiveconf hive.metastore.client.connect.retry.delay=1 --hiveconf hive.metastore.failure.retries=1 --hiveconf hive.metastore.connect.retries=1 --hiveconf hive.metastore.client.socket.timeout=14 --hiveconf hive.execution.engine=mr -e 'show databases;''] {'path': ['/bin/', '/usr/bin/', '/usr/sbin/', '/usr/lib/hive/bin'], 'timeout_kill_strategy': 2, 'timeout': 60, 'user': 'ambari-qa'}
INFO 2018-10-03 01:18:49,623 logger.py:75 - Execute['export HIVE_CONF_DIR='/etc/hive/conf.server' ; hive --hiveconf hive.metastore.uris=thrift://testserver-gbs-auto-cog1.sl1027443.sl.edst.ibm.com:9083 --hiveconf hive.metastore.client.connect.retry.delay=1 --hiveconf hive.metastore.failure.retries=1 --hiveconf hive.metastore.connect.retries=1 --hiveconf hive.metastore.client.socket.timeout=14 --hiveconf hive.execution.engine=mr -e 'show databases;''] {'path': ['/bin/', '/usr/bin/', '/usr/sbin/', '/usr/lib/hive/bin'], 'timeout_kill_strategy': 2, 'timeout': 60, 'user': 'ambari-qa'}
INFO 2018-10-03 01:18:49,688 logger.py:75 - Execute['! beeline -u 'jdbc:hive2://testserver-gbs-auto-cog1.sl1027443.sl.edst.ibm.com:10000/;transportMode=binary' -e '' 2>&1| awk '{print}'|grep -i -e 'Connection refused' -e 'Invalid URL''] {'path': ['/bin/', '/usr/bin/', '/usr/lib/hive/bin/', '/usr/sbin/'], 'timeout_kill_strategy': 2, 'timeout': 60, 'user': 'ambari-qa'}
INFO 2018-10-03 01:18:49,688 logger.py:75 - Execute['! beeline -u 'jdbc:hive2://testserver-gbs-auto-cog1.sl1027443.sl.edst.ibm.com:10000/;transportMode=binary' -e '' 2>&1| awk '{print}'|grep -i -e 'Connection refused' -e 'Invalid URL''] {'path': ['/bin/', '/usr/bin/', '/usr/lib/hive/bin/', '/usr/sbin/'], 'timeout_kill_strategy': 2, 'timeout': 60, 'user': 'ambari-qa'}
INFO 2018-10-03 01:18:49,708 logger.py:75 - Pid file /var/run/ambari-metrics-monitor/ambari-metrics-monitor.pid is empty or does not exist
WARNING 2018-10-03 01:18:49,709 base_alert.py:138 - [Alert][ambari_agent_disk_usage] Unable to execute alert. [Errno 2] No such file or directory: '/usr/hdp'
INFO 2018-10-03 01:18:49,708 logger.py:75 - Pid file /var/run/ambari-metrics-monitor/ambari-metrics-monitor.pid is empty or does not exist
ERROR 2018-10-03 01:18:49,712 script_alert.py:123 - [Alert][ams_metrics_monitor_process] Failed with result CRITICAL: ['Ambari Monitor is NOT running on testserver-gbs-auto-cog1.sl1027443.sl.edst.ibm.com']
ERROR 2018-10-03 01:18:49,712 script_alert.py:123 - [Alert][ams_metrics_monitor_process] Failed with result CRITICAL: ['Ambari Monitor is NOT running on testserver-gbs-auto-cog1.sl1027443.sl.edst.ibm.com']
ERROR 2018-10-03 01:18:49,758 script_alert.py:123 - [Alert][hive_metastore_process] Failed with result CRITICAL: ['Metastore on testserver-gbs-auto-cog1.sl1027443.sl.edst.ibm.com failed (Traceback (most recent call last):\n File "/var/lib/ambari-agent/cache/common-services/HIVE/0.12.0.2.0/package/alerts/alert_hive_metastore.py", line 203, in execute\n timeout_kill_strategy=TerminateStrategy.KILL_PROCESS_TREE,\n File "/usr/lib/ambari-agent/lib/resource_management/core/base.py", line 166, in __init__\n self.env.run()\n File "/usr/lib/ambari-agent/lib/resource_management/core/environment.py", line 160, in run\n self.run_action(resource, action)\n File "/usr/lib/ambari-agent/lib/resource_management/core/environment.py", line 124, in run_action\n provider_action()\n File "/usr/lib/ambari-agent/lib/resource_management/core/providers/system.py", line 262, in action_run\n tries=self.resource.tries, try_sleep=self.resource.try_sleep)\n File "/usr/lib/ambari-agent/lib/resource_management/core/shell.py", line 72, in inner\n result = function(command, **kwargs)\n File "/usr/lib/ambari-agent/lib/resource_management/core/shell.py", line 102, in checked_call\n tries=tries, try_sleep=try_sleep, timeout_kill_strategy=timeout_kill_strategy)\n File "/usr/lib/ambari-agent/lib/resource_management/core/shell.py", line 150, in _call_wrapper\n result = _call(command, **kwargs_copy)\n File "/usr/lib/ambari-agent/lib/resource_management/core/shell.py", line 303, in _call\n raise ExecutionFailed(err_msg, code, out, err)\nExecutionFailed: Execution of \'export HIVE_CONF_DIR=\'/etc/hive/conf.server\' ; hive --hiveconf hive.metastore.uris=thrift://testserver-gbs-auto-cog1.sl1027443.sl.edst.ibm.com:9083 --hiveconf hive.metastore.client.connect.retry.delay=1 --hiveconf hive.metastore.failure.retries=1 --hiveconf hive.metastore.connect.retries=1 --hiveconf hive.metastore.client.socket.timeout=14 --hiveconf hive.execution.engine=mr -e \'show databases;\'\' returned 127. -bash: hive: command not found\n)']
ERROR 2018-10-03 01:18:49,758 script_alert.py:123 - [Alert][hive_metastore_process] Failed with result CRITICAL: ['Metastore on testserver-gbs-auto-cog1.sl1027443.sl.edst.ibm.com failed (Traceback (most recent call last):\n File "/var/lib/ambari-agent/cache/common-services/HIVE/0.12.0.2.0/package/alerts/alert_hive_metastore.py", line 203, in execute\n timeout_kill_strategy=TerminateStrategy.KILL_PROCESS_TREE,\n File "/usr/lib/ambari-agent/lib/resource_management/core/base.py", line 166, in __init__\n self.env.run()\n File "/usr/lib/ambari-agent/lib/resource_management/core/environment.py", line 160, in run\n self.run_action(resource, action)\n File "/usr/lib/ambari-agent/lib/resource_management/core/environment.py", line 124, in run_action\n provider_action()\n File "/usr/lib/ambari-agent/lib/resource_management/core/providers/system.py", line 262, in action_run\n tries=self.resource.tries, try_sleep=self.resource.try_sleep)\n File "/usr/lib/ambari-agent/lib/resource_management/core/shell.py", line 72, in inner\n result = function(command, **kwargs)\n File "/usr/lib/ambari-agent/lib/resource_management/core/shell.py", line 102, in checked_call\n tries=tries, try_sleep=try_sleep, timeout_kill_strategy=timeout_kill_strategy)\n File "/usr/lib/ambari-agent/lib/resource_management/core/shell.py", line 150, in _call_wrapper\n result = _call(command, **kwargs_copy)\n File "/usr/lib/ambari-agent/lib/resource_management/core/shell.py", line 303, in _call\n raise ExecutionFailed(err_msg, code, out, err)\nExecutionFailed: Execution of \'export HIVE_CONF_DIR=\'/etc/hive/conf.server\' ; hive --hiveconf hive.metastore.uris=thrift://testserver-gbs-auto-cog1.sl1027443.sl.edst.ibm.com:9083 --hiveconf hive.metastore.client.connect.retry.delay=1 --hiveconf hive.metastore.failure.retries=1 --hiveconf hive.metastore.connect.retries=1 --hiveconf hive.metastore.client.socket.timeout=14 --hiveconf hive.execution.engine=mr -e \'show databases;\'\' returned 127. -bash: hive: command not found\n)']