Hi, I have a similar problem, please help!
The difference is that my cluster uses /etc/hosts to resolve hostnames.
/etc/hosts contains the following:
cloudera-manager-server is installed on VM-N2, and all nodes use the same username (root) and password for ssh.
However, the installation wizard only succeeds on VM-N2; the other nodes always report an error at the "Detecting Cloudera Manager Server..." step.
I have made sure that:
(1) the server is running:
$ service cloudera-scm-server status
cloudera-scm-server (pid 8011) is running...
(2) port 7182 accepts connections:
$ echo "quit" | nc -v localhost 7182
nc: connect to localhost port 7182 (tcp) failed: Connection refused
Connection to localhost 7182 port [tcp/*] succeeded!
HTTP/1.1 400 Bad Request
(3) I ran this script in a shell:
python -c 'import socket; import sys; s = socket.socket(socket.AF_INET); s.settimeout(5.0); s.connect((sys.argv[1], int(sys.argv[2]))); s.close();' localhost 7182
It prints no error.
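For readability, the one-liner in (3) can be spelled out as a small script. This is only a sketch of the same check: the function name `can_connect` and its return-value convention are hypothetical, not part of Cloudera Manager. Note that testing against localhost on VM-N2 only proves the server is reachable from VM-N2 itself.

```python
import socket


def can_connect(host, port, timeout=5.0):
    """Return True if a TCP connection to host:port succeeds within timeout.

    Hypothetical helper that mirrors the one-liner: open an IPv4 socket,
    set a timeout, connect, and close.
    """
    s = socket.socket(socket.AF_INET)
    s.settimeout(timeout)
    try:
        s.connect((host, int(port)))
        return True
    except (socket.error, socket.timeout) as exc:
        # Covers refused connections, timeouts, and name resolution failures.
        print("could not connect to %s:%s: %s" % (host, port, exc))
        return False
    finally:
        s.close()
```

Running `can_connect("VM-N2", 7182)` from one of the failing nodes, rather than `localhost` on the server, would test the path the other nodes actually use.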
Red Hat Enterprise Linux Server release 6.3 (Santiago)
BEGIN host -t PTR 192.168.0.56
56.0.168.192.in-addr.arpa domain name pointer localhost.
using localhost as scm server hostname
BEGIN which python
BEGIN python -c 'import socket; import sys; s = socket.socket(socket.AF_INET); s.settimeout(5.0); s.connect((sys.argv[1], int(sys.argv[2]))); s.close();' localhost 7182
Traceback (most recent call last):
  File "<string>", line 1, in <module>
  File "<string>", line 1, in connect
socket.error: [Errno 111] Connection refused
could not contact scm server at localhost:7182, giving up
waiting for rollback request
I also faced a similar issue. After working with our IT support team, I learned that the problem was on the server where Cloudera is being installed (the master server). The DNS entry that is used to look up the SCM server's details needs to be removed.
Hope this helps.
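In the log above, the PTR lookup for the server's IP returns localhost, and that name is then used as the SCM server hostname, so each remote node ends up trying to connect to itself. Keep in mind that `host` queries DNS directly and does not read /etc/hosts. Below is a minimal sketch for spotting this condition from the output of `host -t PTR <ip>`. The function names and the list of loopback names are assumptions made for illustration.

```python
# Names treated as "points back to the local machine" (an assumption;
# extend the tuple if your environment uses other loopback aliases).
LOOPBACK_NAMES = ("localhost", "localhost.localdomain")


def ptr_target(host_output):
    """Extract NAME from a line like '<rev>.in-addr.arpa domain name pointer NAME.'

    Returns None if no such line is present.
    """
    marker = "domain name pointer"
    for line in host_output.splitlines():
        if marker in line:
            # Take the text after the marker and drop the trailing dot.
            return line.split(marker, 1)[1].strip().rstrip(".")
    return None


def points_to_loopback(name):
    """Return True if the PTR target is a loopback name."""
    return name is not None and name.lower() in LOOPBACK_NAMES
```

If `points_to_loopback(ptr_target(output))` is true for the server's IP, the reverse DNS record is the likely culprit, which matches the fix described in the reply above.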