If a DataNode goes down, what does the admin have to do?
- Labels: Apache Hadoop
Created ‎07-26-2017 06:53 PM
Created ‎07-26-2017 07:01 PM
If there are many DataNodes, then losing one of them will not cause much harm: because of the replication factor, the data will still have replicas on other DNs.
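One way to confirm that the remaining DataNodes still hold enough replicas is to filter the summary that `hdfs fsck /` prints. The sketch below is only an illustration and not part of the original answer: `replication_summary` is a hypothetical helper name, and the labels it matches are the ones Hadoop 2.x fsck prints, so the wording may differ on other versions.

```shell
# Hypothetical helper: keep only the replication-health lines from an
# "hdfs fsck" summary read on stdin. The labels are assumed to match the
# Hadoop 2.x fsck summary; adjust the pattern if your version differs.
replication_summary() {
  grep -iE 'under-replicated blocks|missing replicas|corrupt blocks'
}
```

For example, as the hdfs user: `hdfs fsck / | replication_summary`. In addition, `hdfs dfsadmin -report -dead` should list the DataNodes that the NameNode currently considers dead.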
However, it is good to investigate why the DN went down. The DN log and the Garbage Collection (GC) log can help.
Sometimes a JVM crash can also cause the DN to go down.
The immediate admin task is to attempt to bring the DataNode back up. Then the admin can start looking at the DN log and the GC log to see why the DN went down. (In case of a DataNode JVM crash, an "hs_err_pid" file is usually generated, and it can be reviewed.)
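A rough sketch of that first look on the DataNode host follows. It is only an illustration under some assumptions: the helper names are hypothetical, the log lines are assumed to use the usual log4j layout where the level appears as " ERROR " or " FATAL ", and the directories to search for the crash report are a guess (the JVM normally writes hs_err_pid files to its working directory unless -XX:ErrorFile points elsewhere).

```shell
# Hypothetical helpers for a first look at a DataNode host after a failure.

# Print any JVM crash reports (hs_err_pid<pid>.log) found under the given
# directories, searching at most two levels deep.
find_crash_reports() {
  find "$@" -maxdepth 2 -type f -name 'hs_err_pid*' 2>/dev/null
}

# Print the last N (default 20) ERROR or FATAL lines from a log file.
# Assumes the log level is surrounded by spaces, as in the usual log4j layout.
recent_errors() {
  log_file="$1"
  count="${2:-20}"
  grep -E ' (ERROR|FATAL) ' "$log_file" | tail -n "$count"
}
```

For example: `find_crash_reports /var/log/hadoop/hdfs /tmp`, then `recent_errors` with the path to the DataNode .log file as its argument. The exact log file name depends on the installation.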
From Ambari 2.5 onwards there is a new feature called "Service Auto Start", which you might want to look at: https://docs.hortonworks.com/HDPDocuments/Ambari-2.5.1.0/bk_ambari-operations/content/ch07s04.html
Enabling auto-start for a service causes the ambari-agent to attempt to restart service components that are in a stopped state, without manual effort by a user.
Created ‎07-26-2017 07:27 PM
Where will we find the DN logs and the Garbage Collection log?
Created ‎07-26-2017 07:33 PM
Normally you will find the DataNode logs (including the GC log) in the following directory:
/var/log/hadoop/hdfs/
If you (or another admin) have customized the path, you can find the log directory in the output of the following command on the DataNode host:
# ps -ef | grep DataNode
Look for parameters like these in the "ps" command output:
"-Dhadoop.log.dir=/var/log/hadoop/hdfs" and "-Xloggc:/var/log/hadoop/hdfs/gc.log-201707171201"
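Picking those two parameters out of a long "ps" line by eye is error-prone, so here is a small sketch that extracts them. `extract_jvm_opt` is a hypothetical helper name, not a Hadoop tool; it assumes the option values contain no spaces, and it returns only the first match if several DataNode processes are listed.

```shell
# Hypothetical helper: read a process command line on stdin and print the
# value of the first JVM option that starts with the given prefix.
extract_jvm_opt() {
  prefix="$1"
  # Put each argument on its own line, keep the first one that starts
  # with the prefix, then strip the prefix so only the value remains.
  tr ' ' '\n' | grep -- "^${prefix}" | head -n 1 | sed "s|^${prefix}||"
}
```

On the DataNode host this could be used as `ps -ef | grep '[D]ataNode' | extract_jvm_opt '-Dhadoop.log.dir='` for the log directory and with `'-Xloggc:'` for the GC log path. The `[D]` in the pattern keeps grep from matching its own process.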
Created ‎07-26-2017 07:43 PM
Thanks for your reply, but could you tell me on which node I will find the logs?
Created ‎07-26-2017 07:53 PM
On every DataNode you will find the relevant logs in the path mentioned above. You need to look on the DataNode host that went down.
