Created 07-24-2017 02:33 PM
I had a working CDH5 on RHEL in cluster mode.
My machine hanged somehow,so I had to reboot it.
I see everything in bad shape. I am trying to restart CDH management services, but I see timeout message and no service is starting.
if I start from command line, I see following message:
[root@pc1 Downloads]# service cloudera-scm-agent status
● cloudera-scm-agent.service - LSB: Cloudera SCM Agent
Loaded: loaded (/etc/rc.d/init.d/cloudera-scm-agent; bad; vendor preset: disabled)
Active: active (exited) since Mon 2017-07-24 15:14:15 CDT; 1h 15min ago
Docs: man:systemd-sysv-generator(8)
Process: 9482 ExecStop=/etc/rc.d/init.d/cloudera-scm-agent stop (code=exited, status=0/SUCCESS)
Process: 9531 ExecStart=/etc/rc.d/init.d/cloudera-scm-agent start (code=exited, status=0/SUCCESS)
Jul 24 15:14:14 pc1.localdomain.com systemd[1]: Starting LSB: Cloudera SCM A....
Jul 24 15:14:14 pc1.localdomain.com su[9545]: (to root) root on none
Jul 24 15:14:15 pc1.localdomain.com cloudera-scm-agent[9531]: Starting cloude...
Jul 24 15:14:15 pc1.localdomain.com systemd[1]: Started LSB: Cloudera SCM Agent.I am not sure what has gone wrong.
Created 07-24-2017 08:05 PM
It is very wierd that I rebooted the system, started the cloudera-management-services, and then started others services individually. It is working now.
I am not sure why it happened. It was really a nightmare, and I am really surprised what should I do in such scenarios (without rebooting system).
Created 07-24-2017 03:19 PM
There could be many reasons
1. CM -> hosts -> check the hosts are in green
2. Login as root and start ntpd service if it is not started automatically
service ntpd status
service ntpd start
chkconfig --list ntpd
chkconfig ntpd on;
wait for few mins and check the hosts status again.
post more logs if you are still facing issue
Created 07-24-2017 03:50 PM
following are the logs with ntpd:
[root@pc1 Downloads]# service ntpd status
Redirecting to /bin/systemctl status ntpd.service
● ntpd.service - Network Time Service
Loaded: loaded (/usr/lib/systemd/system/ntpd.service; enabled; vendor preset: disabled)
Active: active (running) since Mon 2017-07-24 17:40:31 CDT; 1min 8s ago
Process: 16053 ExecStart=/usr/sbin/ntpd -u ntp:ntp $OPTIONS (code=exited, status=0/SUCCESS)
Main PID: 16054 (ntpd)
CGroup: /system.slice/ntpd.service
└─16054 /usr/sbin/ntpd -u ntp:ntp -g
Jul 24 17:40:31 pc1.localdomain.com ntpd[16054]: Listen normally on 3 wlp3s0 192.168.1.115 UDP 123
[root@pc1 Downloads]# chkconfig --list ntpd
error reading information on service ntpd: No such file or directory
[root@pc1 Downloads]# chkconfig ntpd on
Note: Forwarding request to 'systemctl enable ntpd.service'.
I am in a fix now where to start now.
Created on 07-24-2017 06:51 PM - edited 07-24-2017 06:55 PM
Could you let me know if you are able to bring up your Cloudera-scm-server / Cloudera-scm-server-db ?
whats their status ?
where you able to bring up the Cloudera manager web ui ?
could you check if you multiple java version hanging around in the host ?
/usr/java/ /usr/lib/jvm/
or you use alternatives
Meantime using the Code tag tool bar could you share the logs cloudera-scm-server / cloudera-scm-agent ?
is the location for those logs
/var/log/
Created 07-24-2017 08:05 PM
It is very wierd that I rebooted the system, started the cloudera-management-services, and then started others services individually. It is working now.
I am not sure why it happened. It was really a nightmare, and I am really surprised what should I do in such scenarios (without rebooting system).