Created 04-11-2016 12:41 PM
Hi:
When start the ambari-server i can see this error:
2016 14:02:15,986 INFO [qtp-ambari-client-1397] MetricsReportPropertyProvider:153 - METRICS_COLLECTOR is not live. Skip populating resources with metrics, next message will be logged after 1000 attempts. 11 Apr 2016 14:02:30,746 WARN [ambari-action-scheduler] ActionScheduler:654 - Host component information has not been found. Details:cluster=rsicluster01; host=lnxbig04.cajarural.gcr; service=YARN; component=APP_TIMELINE_SERVER; 11 Apr 2016 14:02:30,747 WARN [ambari-action-scheduler] ActionScheduler:682 - Host lnxbig04.cajarural.gcr has been detected as non-available. Host not found when trying to schedule an execution command. The most probable reason for that is that host or host component has been deleted recently. The command has been aborted and dequeued.Execution command details: cmdId: 894-11; taskId: 5031; roleCommand: CUSTOM_COMMAND 11 Apr 2016 14:02:31,777 WARN [ambari-action-scheduler] ActionScheduler:654 - Host component information has not been found. Details:cluster=rsicluster01; host=lnxbig04.cajarural.gcr; service=YARN; component=APP_TIMELINE_SERVER; 11 Apr 2016 14:02:31,777 WARN [ambari-action-scheduler] ActionScheduler:682 - Host lnxbig04.cajarural.gcr has been detected as non-available. Host not found when trying to schedule an execution command. The most probable reason for that is that host or host component has been deleted recently. The command has been aborted and dequeued.Execution command details: cmdId: 894-37; taskId: 5085; roleCommand: CUSTOM_COMMAND 11 Apr 2016 14:03:05,354 INFO [qtp-ambari-agent-1417] HeartBeatHandler:683 - State of service component METRICS_MONITOR of service AMBARI_METRICS of cluster rsicluster01 has changed from STARTED to INSTALLED at host lnxbig05.cajarural.gcr
but i dont have any APP_TIMELINE_SERVER in the host lnxbig04.cajarural.gcr, so how i cant delete this alert??? also the upgrade doesnt finish for this error.
Thanks
Created 04-11-2016 05:59 PM
Does this host lnxbigO4.cajarural.gcr exist at all? Could you please invoke the following URL and check for existence of this node and the services running on them:
http://:8080/api/v1/clusters//hosts>:8080/api/v1/clusters/<cluster_name>/hosts
Then drill down on this host to check the services installed.
If ATS is listed here, copy the URL, ensure that it really is not required and invoke an appropriate API to delete the service as listed in the Ambari Cwiki page
Is this a test cluster? if so, please obtain the following from ambari db:
select hostname from hosts;
select * from hostcomponentstate;
select * from hostcomponentdesiredstate;
Thanks & Rgds
Venkat
Created 04-11-2016 12:55 PM
I see the alert as WARNING which should not create a problem. Is it possible to attach full log ?
Also what is the error while you issue ambari upgrade ? Please send logs also.
Created 04-11-2016 01:10 PM
Here is the ambari-server log, just to say, the is not apptime server isntalled in this host.
11 Apr 2016 15:10:10,866 WARN [ambari-action-scheduler] ActionScheduler:654 - Host component information has not been found. Details:cluster=rsicluster01; host=lnxbig04.cajarural.gcr; service=YARN; component=APP_TIMELINE_SERVER; 11 Apr 2016 15:10:10,875 WARN [ambari-action-scheduler] ActionScheduler:682 - Host lnxbig04.cajarural.gcr has been detected as non-available. Host not found when trying to schedule an execution command. The most probable reason for that is that host or host component has been deleted recently. The command has been aborted and dequeued.Execution command details: cmdId: 894-11; taskId: 5031; roleCommand: CUSTOM_COMMAND 11 Apr 2016 15:10:11,908 WARN [ambari-action-scheduler] ActionScheduler:654 - Host component information has not been found. Details:cluster=rsicluster01; host=lnxbig04.cajarural.gcr; service=YARN; component=APP_TIMELINE_SERVER; 11 Apr 2016 15:10:11,908 WARN [ambari-action-scheduler] ActionScheduler:682 - Host lnxbig04.cajarural.gcr has been detected as non-available. Host not found when trying to schedule an execution command. The most probable reason for that is that host or host component has been deleted recently. The command has been aborted and dequeued.Execution command details: cmdId: 894-37; taskId: 5085; roleCommand: CUSTOM_COMMAND
Created 04-11-2016 05:59 PM
Does this host lnxbigO4.cajarural.gcr exist at all? Could you please invoke the following URL and check for existence of this node and the services running on them:
http://:8080/api/v1/clusters//hosts>:8080/api/v1/clusters/<cluster_name>/hosts
Then drill down on this host to check the services installed.
If ATS is listed here, copy the URL, ensure that it really is not required and invoke an appropriate API to delete the service as listed in the Ambari Cwiki page
Is this a test cluster? if so, please obtain the following from ambari db:
select hostname from hosts;
select * from hostcomponentstate;
select * from hostcomponentdesiredstate;
Thanks & Rgds
Venkat
Created 04-13-2016 04:09 AM
Also, it is important to take a backup of the Ambari database before doing any API add/delete service operations.