<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question ambari-agent connection refused after host reboot in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/ambari-agent-connection-refused-after-host-reboot/m-p/241206#M203010</link>
    <description>&lt;P&gt;I have a 1-node ambari managed cluster which was working correctly, auto starting server, host and components when restarting the system.&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;I've changed to a public ip and a diferent hostname and after solving FQDN problems, I have problem with auto start when rebooting. Ambari server and agent are auto starting but heartbeat is lost and ambari-agent logs show connection refused, but if I manually restart ambari-agent, connection is correct and I can start services.&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;There's ambari-server UI just after rebooting.&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="108906-1558694191895.png" style="width: 1355px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/13763i7E0416EA53829BE3/image-size/medium?v=v2&amp;amp;px=400" role="button" title="108906-1558694191895.png" alt="108906-1558694191895.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;ambari-agent log shows the nex tail.&lt;/P&gt;&lt;PRE&gt;ERROR 2019-05-24 12:28:47,235 script_alert.py:119 - [Alert][hive_metastore_process] Failed with result CRITICAL: ['Metastore on bigdata.es failed (Traceback (most recent call last):\n &amp;nbsp;File "/var/lib/ambari-agent/cache/common-services/HIVE/0.12.0.2.0/package/alerts/alert_hive_metastore.py", line 200, in execute\n &amp;nbsp; &amp;nbsp;timeout_kill_strategy=TerminateStrategy.KILL_PROCESS_TREE,\n &amp;nbsp;File "/usr/lib/python2.6/site-packages/resource_management/core/base.py", line 155, in __init__\n &amp;nbsp; &amp;nbsp;self.env.run()\n &amp;nbsp;File "/usr/lib/python2.6/site-packages/resource_management/core/environment.py", line 160, in run\n &amp;nbsp; &amp;nbsp;self.run_action(resource, action)\n &amp;nbsp;File "/usr/lib/python2.6/site-packages/resource_management/core/environment.py", line 124, in run_action\n &amp;nbsp; &amp;nbsp;provider_action()\n &amp;nbsp;File "/usr/lib/python2.6/site-packages/resource_management/core/providers/system.py", line 262, in action_run\n &amp;nbsp; &amp;nbsp;tries=self.resource.tries, try_sleep=self.resource.try_sleep)\n &amp;nbsp;File "/usr/lib/python2.6/site-packages/resource_management/core/shell.py", line 72, in inner\n &amp;nbsp; &amp;nbsp;result = function(command, **kwargs)\n &amp;nbsp;File "/usr/lib/python2.6/site-packages/resource_management/core/shell.py", line 102, in checked_call\n &amp;nbsp; &amp;nbsp;tries=tries, try_sleep=try_sleep, timeout_kill_strategy=timeout_kill_strategy)\n &amp;nbsp;File "/usr/lib/python2.6/site-packages/resource_management/core/shell.py", line 150, in _call_wrapper\n &amp;nbsp; &amp;nbsp;result = _call(command, **kwargs_copy)\n &amp;nbsp;File "/usr/lib/python2.6/site-packages/resource_management/core/shell.py", line 303, in _call\n &amp;nbsp; &amp;nbsp;raise ExecutionFailed(err_msg, code, out, err)\nExecutionFailed: Execution of \'export HIVE_CONF_DIR=\'/usr/hdp/current/hive-metastore/conf/conf.server\' ; hive --hiveconf hive.metastore.uris=thrift://bigdata.es:9083 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; --hiveconf hive.metastore.client.connect.retry.delay=1 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; --hiveconf hive.metastore.failure.retries=1 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; --hiveconf hive.metastore.connect.retries=1 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; --hiveconf hive.metastore.client.socket.timeout=14 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; --hiveconf hive.execution.engine=mr -e \'show databases;\'\' returned 1. Logging initialized using configuration in file:/etc/hive/2.6.3.0-71/0/conf.server/hive-log4j.properties\nException in thread "main" java.lang.RuntimeException: java.lang.RuntimeException: Unable to instantiate org.apache.hadoop.hive.ql.metadata.SessionHiveMetaStoreClient\n\tat org.apache.hadoop.hive.ql.session.SessionState.start(SessionState.java:547)\n\tat org.apache.hadoop.hive.cli.CliDriver.run(CliDriver.java:681)\n\tat org.apache.hadoop.hive.cli.CliDriver.main(CliDriver.java:625)\n\tat sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)\n\tat sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)\n\tat sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)\n\tat java.lang.reflect.Method.invoke(Method.java:498)\n\tat org.apache.hadoop.util.RunJar.run(RunJar.java:233)\n\tat org.apache.hadoop.util.RunJar.main(RunJar.java:148)\nCaused by: java.lang.RuntimeException: Unable to instantiate org.apache.hadoop.hive.ql.metadata.SessionHiveMetaStoreClient\n\tat org.apache.hadoop.hive.metastore.MetaStoreUtils.newInstance(MetaStoreUtils.java:1566)\n\tat org.apache.hadoop.hive.metastore.RetryingMetaStoreClient.&amp;lt;init&amp;gt;(RetryingMetaStoreClient.java:92)\n\tat org.apache.hadoop.hive.metastore.RetryingMetaStoreClient.getProxy(RetryingMetaStoreClient.java:138)\n\tat org.apache.hadoop.hive.metastore.RetryingMetaStoreClient.getProxy(RetryingMetaStoreClient.java:110)\n\tat org.apache.hadoop.hive.ql.metadata.Hive.createMetaStoreClient(Hive.java:3510)\n\tat org.apache.hadoop.hive.ql.metadata.Hive.getMSC(Hive.java:3542)\n\tat org.apache.hadoop.hive.ql.session.SessionState.start(SessionState.java:528)\n\t... 8 more\nCaused by: java.lang.reflect.InvocationTargetException\n\tat sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)\n\tat sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:62)\n\tat sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:45)\n\tat java.lang.reflect.Constructor.newInstance(Constructor.java:423)\n\tat org.apache.hadoop.hive.metastore.MetaStoreUtils.newInstance(MetaStoreUtils.java:1564)\n\t... 14 more\nCaused by: MetaException(message:Could not connect to meta store using any of the URIs provided. Most recent failure: org.apache.thrift.transport.TTransportException: java.net.ConnectException: Conexi\xc3\xb3n rehusada (Connection refused)\n\tat org.apache.thrift.transport.TSocket.open(TSocket.java:226)\n\tat org.apache.hadoop.hive.metastore.HiveMetaStoreClient.open(HiveMetaStoreClient.java:487)\n\tat org.apache.hadoop.hive.metastore.HiveMetaStoreClient.&amp;lt;init&amp;gt;(HiveMetaStoreClient.java:282)\n\tat org.apache.hadoop.hive.ql.metadata.SessionHiveMetaStoreClient.&amp;lt;init&amp;gt;(SessionHiveMetaStoreClient.java:76)\n\tat sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)\n\tat sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:62)\n\tat sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:45)\n\tat java.lang.reflect.Constructor.newInstance(Constructor.java:423)\n\tat org.apache.hadoop.hive.metastore.MetaStoreUtils.newInstance(MetaStoreUtils.java:1564)\n\tat org.apache.hadoop.hive.metastore.RetryingMetaStoreClient.&amp;lt;init&amp;gt;(RetryingMetaStoreClient.java:92)\n\tat org.apache.hadoop.hive.metastore.RetryingMetaStoreClient.getProxy(RetryingMetaStoreClient.java:138)\n\tat org.apache.hadoop.hive.metastore.RetryingMetaStoreClient.getProxy(RetryingMetaStoreClient.java:110)\n\tat org.apache.hadoop.hive.ql.metadata.Hive.createMetaStoreClient(Hive.java:3510)\n\tat org.apache.hadoop.hive.ql.metadata.Hive.getMSC(Hive.java:3542)\n\tat org.apache.hadoop.hive.ql.session.SessionState.start(SessionState.java:528)\n\tat org.apache.hadoop.hive.cli.CliDriver.run(CliDriver.java:681)\n\tat org.apache.hadoop.hive.cli.CliDriver.main(CliDriver.java:625)\n\tat sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)\n\tat sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)\n\tat sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)\n\tat java.lang.reflect.Method.invoke(Method.java:498)\n\tat org.apache.hadoop.util.RunJar.run(RunJar.java:233)\n\tat org.apache.hadoop.util.RunJar.main(RunJar.java:148)\nCaused by: java.net.ConnectException: Conexi\xc3\xb3n rehusada (Connection refused)\n\tat java.net.PlainSocketImpl.socketConnect(Native Method)\n\tat java.net.AbstractPlainSocketImpl.doConnect(AbstractPlainSocketImpl.java:350)\n\tat java.net.AbstractPlainSocketImpl.connectToAddress(AbstractPlainSocketImpl.java:206)\n\tat java.net.AbstractPlainSocketImpl.connect(AbstractPlainSocketImpl.java:188)\n\tat java.net.SocksSocketImpl.connect(SocksSocketImpl.java:392)\n\tat java.net.Socket.connect(Socket.java:589)\n\tat org.apache.thrift.transport.TSocket.open(TSocket.java:221)\n\t... 22 more\n)\n\tat org.apache.hadoop.hive.metastore.HiveMetaStoreClient.open(HiveMetaStoreClient.java:534)\n\tat org.apache.hadoop.hive.metastore.HiveMetaStoreClient.&amp;lt;init&amp;gt;(HiveMetaStoreClient.java:282)\n\tat org.apache.hadoop.hive.ql.metadata.SessionHiveMetaStoreClient.&amp;lt;init&amp;gt;(SessionHiveMetaStoreClient.java:76)\n\t... 19 more\n)']
INFO 2019-05-24 12:28:58,095 logger.py:71 - call[['test', '-w', '/dev']] {'sudo': True, 'quiet': False, 'timeout': 5}
INFO 2019-05-24 12:28:58,108 logger.py:71 - call returned (0, '')
INFO 2019-05-24 12:28:58,119 logger.py:71 - call[['test', '-w', '/']] {'sudo': True, 'quiet': False, 'timeout': 5}
INFO 2019-05-24 12:28:58,131 logger.py:71 - call returned (0, '')
INFO 2019-05-24 12:28:58,143 logger.py:71 - call[['test', '-w', '/Datos']] {'sudo': True, 'quiet': False, 'timeout': 5}
INFO 2019-05-24 12:28:58,154 logger.py:71 - call returned (0, '')
ERROR 2019-05-24 12:29:39,995 script_alert.py:119 - [Alert][hive_webhcat_server_status] Failed with result CRITICAL: ['Connection failed to &lt;A href="http://bigdata.es:50111/templeton/v1/status?user.name=ambari-qa" target="_blank" rel="nofollow noopener noreferrer"&gt;http://bigdata.es:50111/templeton/v1/status?user.name=ambari-qa&lt;/A&gt; + \nTraceback (most recent call last):\n &amp;nbsp;File "/var/lib/ambari-agent/cache/common-services/HIVE/0.12.0.2.0/package/alerts/alert_webhcat_server.py", line 190, in execute\n &amp;nbsp; &amp;nbsp;url_response = urllib2.urlopen(query_url, timeout=connection_timeout)\n &amp;nbsp;File "/usr/lib/python2.7/urllib2.py", line 154, in urlopen\n &amp;nbsp; &amp;nbsp;return opener.open(url, data, timeout)\n &amp;nbsp;File "/usr/lib/python2.7/urllib2.py", line 429, in open\n &amp;nbsp; &amp;nbsp;response = self._open(req, data)\n &amp;nbsp;File "/usr/lib/python2.7/urllib2.py", line 447, in _open\n &amp;nbsp; &amp;nbsp;\'_open\', req)\n &amp;nbsp;File "/usr/lib/python2.7/urllib2.py", line 407, in _call_chain\n &amp;nbsp; &amp;nbsp;result = func(*args)\n &amp;nbsp;File "/usr/lib/python2.7/urllib2.py", line 1228, in http_open\n &amp;nbsp; &amp;nbsp;return self.do_open(httplib.HTTPConnection, req)\n &amp;nbsp;File "/usr/lib/python2.7/urllib2.py", line 1198, in do_open\n &amp;nbsp; &amp;nbsp;raise URLError(err)\nURLError: &amp;lt;urlopen error [Errno 111] Conexi\xc3\xb3n rehusada&amp;gt;\n']
ERROR 2019-05-24 12:29:40,000 script_alert.py:119 - [Alert][yarn_nodemanager_health] Failed with result CRITICAL: ['Connection failed to &lt;A href="http://bigdata.es:8042/ws/v1/node/info" target="_blank" rel="nofollow noopener noreferrer"&gt;http://bigdata.es:8042/ws/v1/node/info&lt;/A&gt; (Traceback (most recent call last):\n &amp;nbsp;File "/var/lib/ambari-agent/cache/common-services/YARN/2.1.0.2.0/package/alerts/alert_nodemanager_health.py", line 171, in execute\n &amp;nbsp; &amp;nbsp;url_response = urllib2.urlopen(query, timeout=connection_timeout)\n &amp;nbsp;File "/usr/lib/python2.7/urllib2.py", line 154, in urlopen\n &amp;nbsp; &amp;nbsp;return opener.open(url, data, timeout)\n &amp;nbsp;File "/usr/lib/python2.7/urllib2.py", line 429, in open\n &amp;nbsp; &amp;nbsp;response = self._open(req, data)\n &amp;nbsp;File "/usr/lib/python2.7/urllib2.py", line 447, in _open\n &amp;nbsp; &amp;nbsp;\'_open\', req)\n &amp;nbsp;File "/usr/lib/python2.7/urllib2.py", line 407, in _call_chain\n &amp;nbsp; &amp;nbsp;result = func(*args)\n &amp;nbsp;File "/usr/lib/python2.7/urllib2.py", line 1228, in http_open\n &amp;nbsp; &amp;nbsp;return self.do_open(httplib.HTTPConnection, req)\n &amp;nbsp;File "/usr/lib/python2.7/urllib2.py", line 1198, in do_open\n &amp;nbsp; &amp;nbsp;raise URLError(err)\nURLError: &amp;lt;urlopen error [Errno 111] Conexi\xc3\xb3n rehusada&amp;gt;\n)']
INFO 2019-05-24 13:30:00,155 main.py:96 - loglevel=logging.INFO
INFO 2019-05-24 13:30:00,157 main.py:96 - loglevel=logging.INFO
INFO 2019-05-24 13:30:00,157 main.py:96 - loglevel=logging.INFO
INFO 2019-05-24 13:30:00,159 DataCleaner.py:39 - Data cleanup thread started
INFO 2019-05-24 13:30:00,164 DataCleaner.py:120 - Data cleanup started
INFO 2019-05-24 13:30:00,295 DataCleaner.py:122 - Data cleanup finished
INFO 2019-05-24 13:30:00,314 PingPortListener.py:50 - Ping port listener started on port: 8670
INFO 2019-05-24 13:30:00,314 main.py:132 - Newloglevel=logging.DEBUG
INFO 2019-05-24 13:30:00,314 main.py:405 - Connecting to Ambari server at &lt;A href="https://bigdata.es:8440" target="_blank" rel="nofollow noopener noreferrer"&gt;https://bigdata.es:8440&lt;/A&gt; (10.61.2.10)
DEBUG 2019-05-24 13:30:00,314 NetUtil.py:110 - Trying to connect to &lt;A href="https://bigdata.es:8440" target="_blank" rel="nofollow noopener noreferrer"&gt;https://bigdata.es:8440&lt;/A&gt;
INFO 2019-05-24 13:30:00,315 NetUtil.py:67 - Connecting to &lt;A href="https://bigdata.es:8440/ca" target="_blank" rel="nofollow noopener noreferrer"&gt;https://bigdata.es:8440/ca&lt;/A&gt;
WARNING 2019-05-24 13:30:00,317 NetUtil.py:98 - Failed to connect to &lt;A href="https://bigdata.es:8440/ca" target="_blank" rel="nofollow noopener noreferrer"&gt;https://bigdata.es:8440/ca&lt;/A&gt; due to [Errno 111] ConexiÃ³n rehusada
WARNING 2019-05-24 13:30:00,317 NetUtil.py:121 - Server at &lt;A href="https://bigdata.es:8440" target="_blank" rel="nofollow noopener noreferrer"&gt;https://bigdata.es:8440&lt;/A&gt; is not reachable, sleeping for 10 seconds...&lt;/PRE&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;And ambari-agent.ini config is the following&lt;/P&gt;&lt;PRE&gt;[server]
hostname = bigdata.es
url_port = 8440
secured_url_port = 8441
connect_retry_delay = 10
max_reconnect_retry_delay = 30

[agent]
logdir = /var/log/ambari-agent
piddir = /var/run/ambari-agent
prefix = /var/lib/ambari-agent/data
loglevel = DEBUG
data_cleanup_interval = 86400
data_cleanup_max_age = 2592000
data_cleanup_max_size_mb = 100
ping_port = 8670
cache_dir = /var/lib/ambari-agent/cache
tolerate_download_failures = true
run_as_user = root
parallel_execution = 0
alert_grace_period = 5
status_command_timeout = 5
alert_kinit_timeout = 14400000
system_resource_overrides = /etc/resource_overrides

[security]
keysdir = /var/lib/ambari-agent/keys
server_crt = ca.crt
passphrase_env_var_name = AMBARI_PASSPHRASE
ssl_verify_cert = 0
credential_lib_dir = /var/lib/ambari-agent/cred/lib
credential_conf_dir = /var/lib/ambari-agent/cred/conf
credential_shell_cmd = org.apache.hadoop.security.alias.CredentialShell
force_https_protocol = PROTOCOL_TLSv1_2

[services]
pidlookuppath = /var/run/

[heartbeat]
state_interval_seconds = 60
dirs = /etc/hadoop,/etc/hadoop/conf,/etc/hbase,/etc/hcatalog,/etc/hive,/etc/oozie,
&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; /etc/sqoop,
&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; /var/run/hadoop,/var/run/zookeeper,/var/run/hbase,/var/run/templeton,/var/run/oozie,
&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; /var/log/hadoop,/var/log/zookeeper,/var/log/hbase,/var/run/templeton,/var/log/hive
log_lines_count = 300
idle_interval_min = 1
idle_interval_max = 10

[logging]
syslog_enabled = 0&lt;/PRE&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;If I run sudo ambari-agent restart command, then I can connect and start services in ambari-server ui.&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="108943-1558694501380.png" style="width: 1064px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/13764i227D5D770E7BE31F/image-size/medium?v=v2&amp;amp;px=400" role="button" title="108943-1558694501380.png" alt="108943-1558694501380.png" /&gt;&lt;/span&gt;&lt;/P&gt;</description>
    <pubDate>Sat, 17 Aug 2019 22:18:55 GMT</pubDate>
    <dc:creator>gilandresadrian</dc:creator>
    <dc:date>2019-08-17T22:18:55Z</dc:date>
  </channel>
</rss>

