FAILED - RETRYING: Make sure all of the Ambari Agents have registered (100 retries left).

I am using ansible-hortonworks (https://github.com/hortonworks/ansible-hortonworks) to install Hadoop on EC2 instances. I also tried ansible-hadoop. In both cases I get an error at the Ambari Agent registration step.

TASK [ambari-config : Modify the HDF base_url repository for HDF 3.0] ******************************************************************************************************************

skipping: [master01]

TASK [ambari-config : Make sure all of the Ambari Agents have registered] **************************************************************************************************************

ok: [master01] => (item=master01)

FAILED - RETRYING: Make sure all of the Ambari Agents have registered (100 retries left).

FAILED - RETRYING: Make sure all of the Ambari Agents have registered (99 retries left).

FAILED - RETRYING: Make sure all of the Ambari Agents have registered (98 retries left).

FAILED - RETRYING: Make sure all of the Ambari Agents have registered (97 retries left).

FAILED - RETRYING: Make sure all of the Ambari Agents have registered (96 retries left).

FAILED - RETRYING: Make sure all of the Ambari Agents have registered (95 retries left).

FAILED - RETRYING: Make sure all of the Ambari Agents have registered (94 retries left).

FAILED - RETRYING: Make sure all of the Ambari Agents have registered (93 retries left).

I can SSH to the hosts with passwordless login, and the rest of the setup looks fine.

I don't understand what is going on, because both the Ambari server and the Ambari agent are up. I think it is related to the network and ports, possibly something to do with IPv4 versus IPv6. The instances run Ubuntu (gcc version 4.8.4 (Ubuntu 4.8.4-2ubuntu1~14.04.3)). This is what I see when I run netstat -anp:

Slave: netstat -anp

tcp        0      0 0.0.0.0:8670      0.0.0.0:*      LISTEN      521/python

Master: netstat -anp

tcp        0      0 0.0.0.0:8670      0.0.0.0:*      LISTEN      4202/python
tcp6       0      0 :::8440           :::*           LISTEN      26419/java
tcp6       0      0 :::5432           :::*           LISTEN      16983/postgres
tcp6       0      0 :::8441           :::*           LISTEN      26419/java
tcp6       0      0 ::1:25            :::*           LISTEN      2264/master
tcp6       0      0 :::8080           :::*           LISTEN      26419/java
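To test the "network and port" suspicion directly, one option is to try opening TCP connections from the slave to the ports the master's Java process is listening on (8080, 8440 and 8441 in the listing above). The following is only a minimal sketch: the helper names `port_reachable` and `check_ports` are made up for illustration, and it checks plain TCP reachability only, not whether the agent registration itself succeeds.

```python
import socket


def port_reachable(host, port, timeout=3.0):
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        # create_connection resolves the name and tries each returned
        # address (IPv4 and IPv6) until one connects.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts and DNS resolution failures.
        return False


def check_ports(host, ports, timeout=3.0):
    """Map each port to True/False depending on whether it is reachable."""
    return {port: port_reachable(host, port, timeout) for port in ports}
```

Run on the slave, a call such as `check_ports("ip-10-16-42-20.us-west-2.compute.internal", [8080, 8440, 8441])` (hostname taken from the error URL in this thread) would show whether a security group or firewall is blocking any of these ports.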

Re: FAILED - RETRYING: Make sure all of the Ambari Agents have registered (100 retries left).

After the last retry I got the error below. The ports the Ambari agent is listening on are shown in my post above.

FAILED - RETRYING: Make sure all of the Ambari Agents have registered (2 retries left).

FAILED - RETRYING: Make sure all of the Ambari Agents have registered (1 retries left).

failed: [master01] (item=slave01) => {"attempts": 100, "cache_control": "no-store", "changed": false, "connection": "close", "content_type": "text/plain", "expires": "Thu, 01 Jan 1970 00:00:00 GMT", "failed": true, "item": "slave01", "msg": "HTTP Error 404: Not Found", "pragma": "no-cache", "redirected": false, "set_cookie": "AMBARISESSIONID=1ccd4t86mvfvo10ijbzzoae1dv;Path=/;HttpOnly", "status": 404, "url": "http://ip-10-16-42-20.us-west-2.compute.internal:8080/api/v1/hosts/ip-10-16-46-142.us-west-2.compute.internal", "user": "VALUE_SPECIFIED_IN_NO_LOG_PARAMETER", "x_content_type_options": "nosniff", "x_frame_options": "DENY", "x_xss_protection": "1; mode=block"}

NO MORE HOSTS LEFT *********************************************************************************************************************************************************************

to retry, use: --limit @/root/ansible-hortonworks/playbooks/install_cluster.retry

PLAY RECAP *****************************************************************************************************************************************************************************

localhost                  : ok=9    changed=6    unreachable=0    failed=0
master01                   : ok=68   changed=4    unreachable=0    failed=1
slave01                    : ok=21   changed=1    unreachable=0    failed=0

(ansible)root@ip-10-16-42-20:~/ansible-hortonworks# netstat -anp
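The 404 above comes from a GET on /api/v1/hosts/<hostname>, which suggests the Ambari server has no registered host under exactly that name. A possible next step is to list what the server does have registered and compare it with the names the playbook expects. The sketch below assumes the /api/v1/hosts response is JSON with an "items" list whose entries carry "Hosts" -> "host_name"; the function names, and the credentials you pass in, are placeholders rather than anything taken from the playbook.

```python
import base64
import json
import urllib.request


def fetch_hosts(base_url, user, password):
    """GET <base_url>/api/v1/hosts with HTTP basic auth and return the parsed JSON."""
    req = urllib.request.Request(base_url.rstrip("/") + "/api/v1/hosts")
    token = base64.b64encode(f"{user}:{password}".encode()).decode()
    req.add_header("Authorization", "Basic " + token)
    with urllib.request.urlopen(req, timeout=10) as resp:
        return json.load(resp)


def registered_host_names(payload):
    """Extract the set of host names from a /api/v1/hosts response.

    Assumes the shape {"items": [{"Hosts": {"host_name": ...}}, ...]}.
    """
    return {
        item["Hosts"]["host_name"].lower()
        for item in payload.get("items", [])
    }


def missing_hosts(expected, payload):
    """Return the expected host names that the server has not registered."""
    registered = registered_host_names(payload)
    return [name for name in expected if name.lower() not in registered]
```

For example, `missing_hosts(["ip-10-16-46-142.us-west-2.compute.internal"], fetch_hosts("http://ip-10-16-42-20.us-west-2.compute.internal:8080", user, password))` would return the slave's name if it has not registered. If the slave does show up, but under a different name, that mismatch between the inventory name and the name the agent reports would explain the 404.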