Support Questions

Find answers, ask questions, and share your expertise

Hortonworks nodes crash

Explorer

Hi,

How do we find why all services went down ?We have HDP2.6 all services went down, do we just restart the all services? .We are able to connect to Ambari.

5 REPLIES 5

Super Mentor

@vishal dutt

In order to find the cause of all service failure we will have to take a look at individual components logs like NameNode log / DataNode log ..etc

If you want to start all services back then try using the "Restart All" Option from Ambari UI.

Also can you please check the "uptime" of your host where services went down. May be your host is restarted.

# uptime

.

Mentor

@vishal dutt

Can you see the cause in the /var/log/ambari-server/ambari-server.log I suspect it could have crashed due to memory leak? Can you share the contents ambari-server.log


Explorer

Is this on the edgenode? don't see this file

Mentor

@vishal dutt

On your ambari-server host. I had a typo error in the path it should be /var/log/ambari-server/ Also check the other components logs for errors on the hosts running those services in /var/log/{service_name}/

Explorer

After restarting the services i am getting error on all data nodes "Ambari Monitor not running on ". Should i start ambari-agent on all data nodes?

Also All HBASE region servers are down getting error " Connection failed: [Errno 111] Connection refused to ctcl-hdpdata1.com:16030"