Reply
Mv
New Contributor
Posts: 1
Registered: ‎09-01-2015

The Cloudera Manager Agent is not able to communicate with this role's web server.

Hi 

 

I am getting  "The Cloudera Manager Agent is not able to communicate with this role's web server." error in HDFS, HBASE and YARN services,

rest services are not giving any error.

 

I am able to execute the code in all services but still getting above error message.  

 

Following are the logs in "cloudera-scm-agent.log" :

 

ERROR Failed to collect java-based DNS names
Traceback (most recent call last):
File "/usr/lib/cmf/agent/src/cmf/monitor/host/dns_names.py", line 64, in collect
result, stdout, stderr = self._subprocess_with_timeout(args, self._poll_timeout)
File "/usr/lib/cmf/agent/src/cmf/monitor/host/dns_names.py", line 46, in _subprocess_with_timeout
return subprocess_with_timeout(args, timeout)
File "/usr/lib/cmf/agent/src/cmf/monitor/host/subprocess_timeout.py", line 40, in subprocess_with_timeout
close_fds=True)
File "/usr/lib/python2.7/subprocess.py", line 679, in __init__
errread, errwrite)
File "/usr/lib/python2.7/subprocess.py", line 1249, in _execute_child
raise child_exception

 

48487 Monitor-DataNodeMonitor throttling_logger ERROR Error fetching metrics at 'http://hdslave01.cibil.com:50075/jmx'
Traceback (most recent call last):
File "/usr/lib/cmf/agent/src/cmf/monitor/abstract_monitor.py", line 399, in collect_metrics_from_url
openedUrl = self.urlopen(url, username=username, password=password)
File "/usr/lib/cmf/agent/src/cmf/monitor/abstract_monitor.py", line 363, in urlopen
password=password)
File "/usr/lib/cmf/agent/src/cmf/url_util.py", line 58, in urlopen_with_timeout
return opener.open(url, data, timeout)
File "/usr/lib/python2.7/urllib2.py", line 406, in open
response = meth(req, response)

 

File "/usr/lib/python2.7/urllib2.py", line 519, in http_response
'http', request, response, code, msg, hdrs)
File "/usr/lib/python2.7/urllib2.py", line 444, in error
return self._call_chain(*args)
File "/usr/lib/python2.7/urllib2.py", line 378, in _call_chain
result = func(*args)
File "/usr/lib/python2.7/urllib2.py", line 527, in http_error_default
raise HTTPError(req.get_full_url(), code, msg, hdrs, fp)
HTTPError: HTTP Error 503: Service Unavailable

 

Correct me if i am wrong, is these error are related to HTTPs server?

I didn't get these error in CDH4.5, after installation fresh copy of CDH5.2 I am getting these error.

Does HBase,HDFS,YARN services requires mandate HTTPS server or TLS configuration, if so how to fix the configuration ?

 

Thanks in advance   

New Contributor
Posts: 2
Registered: ‎05-24-2017

Re: The Cloudera Manager Agent is not able to communicate with this role's web server.

Hii,

Where you able to find the cause to your problem? we expriance the same problem.

Posts: 1,106
Topics: 1
Kudos: 285
Solutions: 134
Registered: ‎04-22-2014

Re: The Cloudera Manager Agent is not able to communicate with this role's web server.

@galzoran,

 

That original post was from years ago, so let's get your information so we can make sure we are troubleshooting the same thing.  The stack traces from your issue will be more useful than the old ones.

 

The agent will periodically make an HTTP request of roles running on the same host as the agent to load JMX output and supply that to Service Monitor for metrics collection.  If that JMX loading fails, you can see events listed in Cloudera Manager indicating as much.

 

The best thing you can do to start off is to get the stack traces that occur when the agent fails to access the JMX information in the web resource.  This information will be in the agent logs on that host (/var/log/cloudera-scm-agent/cloudera-scm-agent.log by default).

 

If you can show us a few of those, it will give us a good idea of what we can look at next.