Created 10-31-2016 01:09 PM
Hi Team,
We have set up a 3-node HDP 2.5 cluster on CentOS 6.5. When we try to add the Hive service to that cluster from Ambari, we receive a connection refused error (it happens on all 3 nodes). However, we have added other services successfully without any issues. Please note that while adding the service we select New MySQL Database; there is no existing MySQL database in that cluster. We need your help to address the issue. The error stack below appeared under "Hive Client Install" while adding the Hive service.
Traceback (most recent call last):
  File "/var/lib/ambari-agent/cache/common-services/HIVE/0.12.0.2.0/package/scripts/hive_client.py", line 68, in <module>
    HiveClient().execute()
  File "/usr/lib/python2.6/site-packages/resource_management/libraries/script/script.py", line 280, in execute
    method(env)
  File "/var/lib/ambari-agent/cache/common-services/HIVE/0.12.0.2.0/package/scripts/hive_client.py", line 35, in install
    self.configure(env)
  File "/var/lib/ambari-agent/cache/common-services/HIVE/0.12.0.2.0/package/scripts/hive_client.py", line 43, in configure
    hive(name='client')
  File "/usr/lib/python2.6/site-packages/ambari_commons/os_family_impl.py", line 89, in thunk
    return fn(*args, **kwargs)
  File "/var/lib/ambari-agent/cache/common-services/HIVE/0.12.0.2.0/package/scripts/hive.py", line 282, in hive
    mode = 0644,
  File "/usr/lib/python2.6/site-packages/resource_management/core/base.py", line 155, in __init__
    self.env.run()
  File "/usr/lib/python2.6/site-packages/resource_management/core/environment.py", line 160, in run
    self.run_action(resource, action)
  File "/usr/lib/python2.6/site-packages/resource_management/core/environment.py", line 124, in run_action
    provider_action()
  File "/usr/lib/python2.6/site-packages/resource_management/core/providers/system.py", line 123, in action_create
    content = self._get_content()
  File "/usr/lib/python2.6/site-packages/resource_management/core/providers/system.py", line 160, in _get_content
    return content()
  File "/usr/lib/python2.6/site-packages/resource_management/core/source.py", line 51, in __call__
    return self.get_content()
  File "/usr/lib/python2.6/site-packages/resource_management/core/source.py", line 193, in get_content
    web_file = opener.open(req)
  File "/usr/lib64/python2.6/urllib2.py", line 391, in open
    response = self._open(req, data)
  File "/usr/lib64/python2.6/urllib2.py", line 409, in _open
    '_open', req)
  File "/usr/lib64/python2.6/urllib2.py", line 369, in _call_chain
    result = func(*args)
  File "/usr/lib64/python2.6/urllib2.py", line 1190, in http_open
    return self.do_open(httplib.HTTPConnection, req)
  File "/usr/lib64/python2.6/urllib2.py", line 1165, in do_open
    raise URLError(err)
urllib2.URLError: <urlopen error [Errno 111] Connection refused>
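The last frames of the trace show a plain HTTP download failing with Errno 111, which means the target host actively refused the TCP connection (nothing was listening on that host and port). Below is a minimal sketch for reproducing that kind of fetch outside Ambari. It assumes Python 3 is available on the node (the agent in the trace runs Python 2.6), and `probe_url` is a hypothetical helper, not part of Ambari. The URL to test is not shown in the trace and has to be supplied by you.

```python
import errno
import urllib.error
import urllib.request


def probe_url(url, timeout=5.0):
    """Try to fetch `url` directly and return a short description of the outcome."""
    # An empty ProxyHandler ignores proxy environment variables, so the
    # result reflects a direct connection to the target host.
    opener = urllib.request.build_opener(urllib.request.ProxyHandler({}))
    try:
        with opener.open(url, timeout=timeout) as response:
            return "ok: HTTP %d" % response.status
    except urllib.error.HTTPError as exc:
        # The server answered, so the port is open; only the resource failed.
        return "http error: %d" % exc.code
    except urllib.error.URLError as exc:
        reason = exc.reason
        if isinstance(reason, OSError) and reason.errno == errno.ECONNREFUSED:
            return "connection refused: nothing is listening at that host:port"
        return "failed: %s" % reason
```

Calling `print(probe_url(url))` with the URL the agent is trying to download should distinguish a refused connection (wrong host or port, or service down) from an HTTP-level error such as a missing file.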
Created 10-31-2016 02:11 PM
Please note that we are using a local repo ......
Created 10-31-2016 02:20 PM
@Kuldeep Kulkarni - we are able to telnet to and ping the ambari-server host from the reported host. Screenshots attached.
Created 10-31-2016 02:22 PM
@rajdip chaudhuri - Sorry, I thought dn1.tcsgegdc.com:8080 was your Ambari server. What is this machine? Is it your repo server? Can you please check whether you can telnet to this machine?
Created 10-31-2016 02:32 PM
@Kuldeep Kulkarni - we are not able to telnet from the Ambari server host to dn1.tcsgegdc.com. Can you please guide me on what needs to be done here? Please note that telnet is installed on all the nodes in the cluster.
[root@nn ambari-agent]# telnet dn1.tcsgegdc.com 8080
Trying 3.209.124.206...
telnet: connect to address 3.209.124.206: Connection refused
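The same telnet-style check can be scripted so that several host and port pairs are tested in one go. This is a rough sketch, assuming Python 3; `check_port` is a hypothetical helper name, and the hosts and ports to pass in are whatever your cluster actually uses.

```python
import socket


def check_port(host, port, timeout=3.0):
    """Attempt a TCP connect and return 'open', 'refused', 'timeout', or 'error: ...'."""
    try:
        # create_connection resolves the name and completes the TCP handshake.
        with socket.create_connection((host, port), timeout=timeout):
            return "open"
    except ConnectionRefusedError:
        # Host is reachable, but nothing is listening on this port.
        return "refused"
    except socket.timeout:
        # No answer at all; often a firewall silently dropping packets.
        return "timeout"
    except OSError as exc:
        # Covers name resolution failures and unreachable networks.
        return "error: %s" % exc
```

For example, `check_port("dn1.tcsgegdc.com", 8080)` would be expected to return `"refused"` in the situation shown above, since telnet reported the same refusal.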
Created 10-31-2016 02:43 PM
@Kuldeep Kulkarni - on dn1.tcsgedc.com we have the ZK service running, and we are able to telnet to port 2181 from the Ambari server host. To give you a brief synopsis:
nn.tcsgegdc.com -->> NN and Ambari server host
dn1.tcsgedc.com -->> Data Node and Hive host
dn2.tcsgedc.com -->> Data Node and Spark host
Created 10-31-2016 02:53 PM
It should download DBConnectionVerification.jar from the Ambari server; I just double-checked. I am not sure why it is picking up a different host instead of the Ambari server. Can you please check your /etc/hosts file to see if there is any conflicting entry?
This problem is happening while installing hive-client on your system.
Can you please make sure DBConnectionVerification.jar is present on all the ambari-agents, at these locations?
/usr/lib/ambari-agent/DBConnectionVerification.jar
/var/lib/ambari-agent/tmp/DBConnectionVerification.jar
Typically, on the Ambari server, DBConnectionVerification.jar is located at:
/var/lib/ambari-server/resources/DBConnectionVerification.jar
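Both checks suggested above (conflicting /etc/hosts entries and the jar being present on each agent) can be scripted. The following is only a sketch, assuming Python 3; `missing_jars` and `conflicting_hosts` are hypothetical helper names, and "conflict" is taken here to mean one hostname listed under more than one IP address.

```python
import os
from collections import defaultdict

# Agent-side locations mentioned in this thread.
AGENT_JAR_PATHS = [
    "/usr/lib/ambari-agent/DBConnectionVerification.jar",
    "/var/lib/ambari-agent/tmp/DBConnectionVerification.jar",
]


def missing_jars(paths=AGENT_JAR_PATHS):
    """Return the subset of `paths` that do not exist as regular files."""
    return [p for p in paths if not os.path.isfile(p)]


def conflicting_hosts(hosts_text):
    """Map each hostname that appears under more than one IP to its sorted IPs."""
    ips_by_name = defaultdict(set)
    for line in hosts_text.splitlines():
        line = line.split("#", 1)[0].strip()  # strip comments and whitespace
        if not line:
            continue
        fields = line.split()
        ip, names = fields[0], fields[1:]
        for name in names:
            ips_by_name[name.lower()].add(ip)
    return {n: sorted(ips) for n, ips in ips_by_name.items() if len(ips) > 1}
```

On a node, `conflicting_hosts(open("/etc/hosts").read())` lists hostnames that resolve to more than one address. Note that `localhost` under both 127.0.0.1 and ::1 is normal and will also be reported by this simple version.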
Created 10-31-2016 03:08 PM
@Kuldeep Kulkarni - your solution of copying the jar file worked. Is this some kind of bug in HDP 2.5? We did not face this in 2.4 or 2.3.
Created 10-31-2016 03:11 PM
@rajdip chaudhuri - That's great. Can you please accept my answer above? I will look into this and file a bug if needed.