<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Datanode goes dows after few secs of starting in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Datanode-goes-dows-after-few-secs-of-starting/m-p/143976#M52399</link>
    <description>&lt;P&gt;@&lt;A href="https://community.hortonworks.com/users/3418/jsensharma.html"&gt;Jay SenSharma&lt;/A&gt;&lt;/P&gt;&lt;P&gt;1. i didnt got any error on datanode log.&lt;/P&gt;&lt;P&gt;2. ambari-server.log&lt;/P&gt;&lt;PRE&gt;22:53:19,873  WARN [Thread-1] HeartbeatMonitor:150 - Heartbeat lost from host datanode.ec2.internal
22:53:19,874  WARN [Thread-1] HeartbeatMonitor:150 - Heartbeat lost from host datanode.ec2.internal
22:53:19,874  WARN [Thread-1] HeartbeatMonitor:165 - Setting component state to UNKNOWN for component GANGLIA_MONITOR on datanode.ec2.internal
22:53:19,874  WARN [Thread-1] HeartbeatMonitor:165 - Setting component state to UNKNOWN for component DATANODE on datanode.ec2.internal
22:53:19,874  WARN [Thread-1] HeartbeatMonitor:165 - Setting component state to UNKNOWN for component NODEMANAGER on datanode.ec2.internal
22:53:19,890  WARN [Thread-1] HeartbeatMonitor:150 - Heartbeat lost from host &lt;/PRE&gt;&lt;P&gt;datanode.ec2.internal&lt;/P&gt;22:53:19,890  WARN [Thread-1] HeartbeatMonitor:165 - Setting component state to UNKNOWN for component GANGLIA_MONITOR on &lt;P&gt;datanode.ec2.internal&lt;/P&gt;</description>
    <pubDate>Tue, 24 Jan 2017 17:52:17 GMT</pubDate>
    <dc:creator>punit9876231</dc:creator>
    <dc:date>2017-01-24T17:52:17Z</dc:date>
    <item>
      <title>Datanode goes dows after few secs of starting</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Datanode-goes-dows-after-few-secs-of-starting/m-p/143971#M52394</link>
      <description>&lt;P&gt;Datanode automatically goes down after a few sec on starting from ambari. i check that ambari agent is working.&lt;/P&gt;&lt;P&gt;datanode receives the heartbeat but no commands from namenode.&lt;/P&gt;&lt;P&gt;ambari agent log file.&lt;/P&gt;&lt;PRE&gt;INFO 2017-01-24 03:44:59,747 PythonExecutor.py:118 - Result: {'structuredOut': {}, 'stdout': '', 'stderr': '', 'exitcode': 1}
INFO 2017-01-24 03:45:07,970 Heartbeat.py:78 - Building Heartbeat: {responseId = 210, timestamp = 1485247507970, commandsInProgress = False, componentsMapped = True}
INFO 2017-01-24 03:45:08,129 Controller.py:214 - Heartbeat response received (id = 211)
INFO 2017-01-24 03:45:08,129 Controller.py:249 - No commands sent from ip-172-31-17-251.ec2.internal
INFO 2017-01-24 03:45:18,130 Heartbeat.py:78 - Building Heartbeat: {responseId = 211, timestamp = 1485247518130, commandsInProgress = False, componentsMapped = True}
INFO 2017-01-24 03:45:18,274 Controller.py:214 - Heartbeat response received (id = 212)
INFO 2017-01-24 03:45:18,274 Controller.py:249 - No commands sent from NAMENODE.ec2.internal



&lt;/PRE&gt;&lt;PRE&gt;


&lt;/PRE&gt;</description>
      <pubDate>Tue, 24 Jan 2017 16:59:36 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Datanode-goes-dows-after-few-secs-of-starting/m-p/143971#M52394</guid>
      <dc:creator>punit9876231</dc:creator>
      <dc:date>2017-01-24T16:59:36Z</dc:date>
    </item>
    <item>
      <title>Re: Datanode goes dows after few secs of starting</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Datanode-goes-dows-after-few-secs-of-starting/m-p/143972#M52395</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/15585/punit9876231.html" nodeid="15585"&gt;@Punit kumar&lt;/A&gt;&lt;/P&gt;&lt;P&gt;1. Do you see any error / exception in the DataNode log?&lt;/P&gt;&lt;P&gt;2. After triggering DataNode start operation from Ambari UI do you see any Error/Exception in ambari-server.log?&lt;/P&gt;&lt;P&gt;If yest hen can you please share those log snippets here? &lt;/P&gt;&lt;P&gt;3. Are you able to start/stop  the other components present on that agent host?  (or only DataNode is having this issue)&lt;/P&gt;&lt;P&gt;4. The output of "top" command so that we can see if memory is available sufficiently.&lt;/P&gt;&lt;P&gt;5. Once you triger the commands from Ambari UI to start the DataNode you might see following kind of files getting created in "/var/lib/ambari-agent/data". Do you see any error in the errors file?
command-3231.json  &lt;EM&gt;(Number might be different in your case but the time stamp should be latest for these files)&lt;/EM&gt;
errors-3231.txt
output-3231.txt&lt;/P&gt;&lt;P&gt;.&lt;/P&gt;</description>
      <pubDate>Tue, 24 Jan 2017 17:03:41 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Datanode-goes-dows-after-few-secs-of-starting/m-p/143972#M52395</guid>
      <dc:creator>jsensharma</dc:creator>
      <dc:date>2017-01-24T17:03:41Z</dc:date>
    </item>
    <item>
      <title>Re: Datanode goes dows after few secs of starting</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Datanode-goes-dows-after-few-secs-of-starting/m-p/143973#M52396</link>
      <description>&lt;P&gt;@&lt;A href="https://community.hortonworks.com/users/3418/jsensharma.html"&gt;Jay SenSharma&lt;/A&gt;&lt;/P&gt;&lt;P&gt;Hi Jay,&lt;/P&gt;&lt;P&gt;thnx for reply.&lt;/P&gt;&lt;P&gt;i got error on output-30684.txt.&lt;/P&gt;&lt;PRE&gt;2017-01-24 03:39:17,877 - File['/etc/hadoop/conf/slaves'] {'content': Template('slaves.j2'), 'owner': 'hdfs'}
2017-01-24 03:39:17,877 - Directory['/var/lib/hadoop-hdfs'] {'owner': 'hdfs', 'group': 'hadoop', 'mode': 0751, 'recursive': True}
2017-01-24 03:39:17,893 - Host contains mounts: ['/', '/proc', '/sys', '/dev/pts', '/dev/shm', '/mnt/disk1', '/mnt/disk2', '/proc/sys/fs/binfmt_misc'].
2017-01-24 03:39:17,894 - Mount point for directory /mnt/disk1/hadoop/hdfs/data is /mnt/disk1
2017-01-24 03:39:17,894 - Mount point for directory /mnt/disk2/hadoop/hdfs/data is /mnt/disk2
2017-01-24 03:39:17,895 - Directory['/var/run/hadoop/hdfs'] {'owner': 'hdfs', 'recursive': True}
2017-01-24 03:39:17,895 - Directory['/var/log/hadoop/hdfs'] {'owner': 'hdfs', 'recursive': True}
2017-01-24 03:39:17,896 - File['/var/run/hadoop/hdfs/hadoop-hdfs-datanode.pid'] {'action': ['delete'], 'not_if': 'ls /var/run/hadoop/hdfs/hadoop-hdfs-datanode.pid &amp;gt;/dev/null 2&amp;gt;&amp;amp;1 &amp;amp;&amp;amp; ps `cat /var/run/hadoop/hdfs/hadoop-hdfs-datanode.pid` &amp;gt;/dev/null 2&amp;gt;&amp;amp;1'}
2017-01-24 03:39:17,919 - Deleting File['/var/run/hadoop/hdfs/hadoop-hdfs-datanode.pid']
2017-01-24 03:39:17,919 - Execute['ulimit -c unlimited;  su -s /bin/bash - hdfs -c 'export HADOOP_LIBEXEC_DIR=/usr/hdp/current/hadoop-client/libexec &amp;amp;&amp;amp; /usr/hdp/current/hadoop-client/sbin/hadoop-daemon.sh --config /etc/hadoop/conf start datanode''] {'not_if': 'ls /var/run/hadoop/hdfs/hadoop-hdfs-datanode.pid &amp;gt;/dev/null 2&amp;gt;&amp;amp;1 &amp;amp;&amp;amp; ps `cat /var/run/hadoop/hdfs/hadoop-hdfs-datanode.pid` &amp;gt;/dev/null 2&amp;gt;&amp;amp;1'}


&lt;/PRE&gt;</description>
      <pubDate>Tue, 24 Jan 2017 17:40:19 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Datanode-goes-dows-after-few-secs-of-starting/m-p/143973#M52396</guid>
      <dc:creator>punit9876231</dc:creator>
      <dc:date>2017-01-24T17:40:19Z</dc:date>
    </item>
    <item>
      <title>Re: Datanode goes dows after few secs of starting</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Datanode-goes-dows-after-few-secs-of-starting/m-p/143974#M52397</link>
      <description>&lt;P&gt;that was not error. that was the output of that file.&lt;/P&gt;&lt;P&gt;i got nothing on error-30684.txt&lt;/P&gt;&lt;P&gt;output of command-30684.txt&lt;/P&gt;&lt;PRE&gt;          "namenode.ec2.internal"
        ],
        "hs_host": [
            "namenode.ec2.internal"
        ],
        "hive_server_host": [
            "namenode.ec2.internal"
        ]
    }
}


&lt;/PRE&gt;</description>
      <pubDate>Tue, 24 Jan 2017 17:44:21 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Datanode-goes-dows-after-few-secs-of-starting/m-p/143974#M52397</guid>
      <dc:creator>punit9876231</dc:creator>
      <dc:date>2017-01-24T17:44:21Z</dc:date>
    </item>
    <item>
      <title>Re: Datanode goes dows after few secs of starting</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Datanode-goes-dows-after-few-secs-of-starting/m-p/143975#M52398</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/15585/punit9876231.html" nodeid="15585"&gt;@Punit kumar&lt;/A&gt;&lt;/P&gt;&lt;P&gt;Here based on the output of "output-30684.txt" file we see that the  DataNode start instruction has been already given to the ambari-agent and following is the command snippet:&lt;/P&gt;&lt;PRE&gt; /usr/hdp/current/hadoop-client/sbin/hadoop-daemon.sh --config /etc/hadoop/conf start datanode&lt;/PRE&gt;&lt;P&gt;- So after that "hadoop-daemon.sh" script is actually responsible to start the DataNode with the given arguments.   &lt;/P&gt;&lt;P&gt;- Hence we should check the DataNode logs (.log and .out files) to finds out what is going wrong. &lt;/P&gt;&lt;P&gt;- There might be some OS resource constraints as well like (Less memory/disk space ..etc)  We might get information about using some OS tools like "top" and "df -h"  .  But looking at the DataNode log / out will give more  better idea here.&lt;/P&gt;&lt;P&gt;.&lt;/P&gt;</description>
      <pubDate>Tue, 24 Jan 2017 17:51:14 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Datanode-goes-dows-after-few-secs-of-starting/m-p/143975#M52398</guid>
      <dc:creator>jsensharma</dc:creator>
      <dc:date>2017-01-24T17:51:14Z</dc:date>
    </item>
    <item>
      <title>Re: Datanode goes dows after few secs of starting</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Datanode-goes-dows-after-few-secs-of-starting/m-p/143976#M52399</link>
      <description>&lt;P&gt;@&lt;A href="https://community.hortonworks.com/users/3418/jsensharma.html"&gt;Jay SenSharma&lt;/A&gt;&lt;/P&gt;&lt;P&gt;1. i didnt got any error on datanode log.&lt;/P&gt;&lt;P&gt;2. ambari-server.log&lt;/P&gt;&lt;PRE&gt;22:53:19,873  WARN [Thread-1] HeartbeatMonitor:150 - Heartbeat lost from host datanode.ec2.internal
22:53:19,874  WARN [Thread-1] HeartbeatMonitor:150 - Heartbeat lost from host datanode.ec2.internal
22:53:19,874  WARN [Thread-1] HeartbeatMonitor:165 - Setting component state to UNKNOWN for component GANGLIA_MONITOR on datanode.ec2.internal
22:53:19,874  WARN [Thread-1] HeartbeatMonitor:165 - Setting component state to UNKNOWN for component DATANODE on datanode.ec2.internal
22:53:19,874  WARN [Thread-1] HeartbeatMonitor:165 - Setting component state to UNKNOWN for component NODEMANAGER on datanode.ec2.internal
22:53:19,890  WARN [Thread-1] HeartbeatMonitor:150 - Heartbeat lost from host &lt;/PRE&gt;&lt;P&gt;datanode.ec2.internal&lt;/P&gt;22:53:19,890  WARN [Thread-1] HeartbeatMonitor:165 - Setting component state to UNKNOWN for component GANGLIA_MONITOR on &lt;P&gt;datanode.ec2.internal&lt;/P&gt;</description>
      <pubDate>Tue, 24 Jan 2017 17:52:17 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Datanode-goes-dows-after-few-secs-of-starting/m-p/143976#M52399</guid>
      <dc:creator>punit9876231</dc:creator>
      <dc:date>2017-01-24T17:52:17Z</dc:date>
    </item>
    <item>
      <title>Re: Datanode goes dows after few secs of starting</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Datanode-goes-dows-after-few-secs-of-starting/m-p/143977#M52400</link>
      <description>&lt;P&gt;@&lt;A href="https://community.hortonworks.com/users/3418/jsensharma.html"&gt;Jay SenSharma&lt;/A&gt;&lt;/P&gt;&lt;P&gt;3. other components on agent are running without an issues only issues is in datanode which goes down after few sec.&lt;/P&gt;&lt;P&gt;4. on running 'top' command i have enough space on agent&lt;/P&gt;</description>
      <pubDate>Tue, 24 Jan 2017 17:55:29 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Datanode-goes-dows-after-few-secs-of-starting/m-p/143977#M52400</guid>
      <dc:creator>punit9876231</dc:creator>
      <dc:date>2017-01-24T17:55:29Z</dc:date>
    </item>
    <item>
      <title>Re: Datanode goes dows after few secs of starting</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Datanode-goes-dows-after-few-secs-of-starting/m-p/143978#M52401</link>
      <description>&lt;P&gt;output of datanode.log&lt;/P&gt;&lt;PRE&gt;2017-01-24 04:59:13,837 INFO  datanode.DataNode (DataNode.java:shutdown(1720)) - Shutdown complete.
2017-01-24 04:59:13,839 FATAL datanode.DataNode (DataNode.java:secureMain(2385)) - Exception in secureMain
java.io.IOException: the path component: '/var/lib/hadoop-hdfs' is owned by a user who is not root and not you.  Your effective user id is 0; the path is owned by user id 508, and its permissions are 0751.  Please fix this or select a different socket path.
        at org.apache.hadoop.net.unix.DomainSocket.validateSocketPathSecurity0(Native Method)
        at org.apache.hadoop.net.unix.DomainSocket.bindAndListen(DomainSocket.java:189)
        at org.apache.hadoop.hdfs.net.DomainPeerServer.&amp;lt;init&amp;gt;(DomainPeerServer.java:40)
        at org.apache.hadoop.hdfs.server.datanode.DataNode.getDomainPeerServer(DataNode.java:892)
        at org.apache.hadoop.hdfs.server.datanode.DataNode.initDataXceiver(DataNode.java:858)
        at org.apache.hadoop.hdfs.server.datanode.DataNode.startDataNode(DataNode.java:1056)
        at org.apache.hadoop.hdfs.server.datanode.DataNode.&amp;lt;init&amp;gt;(DataNode.java:415)
        at org.apache.hadoop.hdfs.server.datanode.DataNode.makeInstance(DataNode.java:2268)
        at org.apache.hadoop.hdfs.server.datanode.DataNode.instantiateDataNode(DataNode.java:2155)
        at org.apache.hadoo
p.hdfs.server.datanode.DataNode.createDataNode(DataNode.java:2202)
        at org.apache.hadoop.hdfs.server.datanode.DataNode.secureMain(DataNode.java:2378)
        at org.apache.hadoop.hdfs.server.datanode.DataNode.main(DataNode.java:2402)
2017-01-24 04:59:13,841 INFO  util.ExitUtil (ExitUtil.java:terminate(124)) - Exiting with status 1
2017-01-24 04:59:13,843 INFO  datanode.DataNode (StringUtils.java:run(659)) - SHUTDOWN_MSG:
/************************************************************
SHUTDOWN_MSG: Shutting down DataNode at datanode.ec2.internal/datanode
************************************************************/

&lt;/PRE&gt;</description>
      <pubDate>Tue, 24 Jan 2017 18:06:58 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Datanode-goes-dows-after-few-secs-of-starting/m-p/143978#M52401</guid>
      <dc:creator>punit9876231</dc:creator>
      <dc:date>2017-01-24T18:06:58Z</dc:date>
    </item>
    <item>
      <title>Re: Datanode goes dows after few secs of starting</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Datanode-goes-dows-after-few-secs-of-starting/m-p/143979#M52402</link>
      <description>&lt;P&gt;df -h&lt;/P&gt;&lt;PRE&gt;Filesystem      Size  Used Avail Use% Mounted on
/dev/xvda1       30G  9.9G   19G  36% /
tmpfs            16G     0   16G   0% /dev/shm
/dev/xvdf       1.1T  905G   75G  93% /mnt/disk1
/dev/xvdg       1.1T  890G   90G  91% /mnt/disk2
&lt;/PRE&gt;</description>
      <pubDate>Tue, 24 Jan 2017 18:08:14 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Datanode-goes-dows-after-few-secs-of-starting/m-p/143979#M52402</guid>
      <dc:creator>punit9876231</dc:creator>
      <dc:date>2017-01-24T18:08:14Z</dc:date>
    </item>
    <item>
      <title>Re: Datanode goes dows after few secs of starting</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Datanode-goes-dows-after-few-secs-of-starting/m-p/143980#M52403</link>
      <description>&lt;P&gt;top&lt;/P&gt;&lt;PRE&gt;top - 05:03:36 up  4:41,  1 user,  load average: 0.00, 0.00, 0.00
Tasks: 186 total,   1 running, 185 sleeping,   0 stopped,   0 zombie
Cpu(s):  0.1%us,  0.4%sy,  0.0%ni, 99.5%id,  0.0%wa,  0.0%hi,  0.0%si,  0.0%st
Mem:  32877652k total,  1678960k used, 31198692k free,   335884k buffers
Swap:        0k total,        0k used,        0k free,   517928k cached



&lt;/PRE&gt;</description>
      <pubDate>Tue, 24 Jan 2017 18:09:09 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Datanode-goes-dows-after-few-secs-of-starting/m-p/143980#M52403</guid>
      <dc:creator>punit9876231</dc:creator>
      <dc:date>2017-01-24T18:09:09Z</dc:date>
    </item>
    <item>
      <title>Re: Datanode goes dows after few secs of starting</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Datanode-goes-dows-after-few-secs-of-starting/m-p/143981#M52404</link>
      <description>&lt;P&gt;@&lt;A href="https://community.hortonworks.com/users/3418/jsensharma.html"&gt;Jay SenSharma&lt;/A&gt;&lt;/P&gt;&lt;P&gt;so the error is in log file of permissions&lt;/P&gt;</description>
      <pubDate>Tue, 24 Jan 2017 18:11:57 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Datanode-goes-dows-after-few-secs-of-starting/m-p/143981#M52404</guid>
      <dc:creator>punit9876231</dc:creator>
      <dc:date>2017-01-24T18:11:57Z</dc:date>
    </item>
    <item>
      <title>Re: Datanode goes dows after few secs of starting</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Datanode-goes-dows-after-few-secs-of-starting/m-p/143982#M52405</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/15585/punit9876231.html" nodeid="15585"&gt;@Punit kumar
&lt;/A&gt;&lt;/P&gt;&lt;P&gt;Problem seems to be directory permission related:&lt;/P&gt;&lt;PRE&gt;java.io.IOException: the path component: '/var/lib/hadoop-hdfs' is owned by a user who is not root and not you.  Your effective user id is 0; the path is owned by user id 508, and its permissions are 0751.  Please fix this or select a different socket path.&lt;/PRE&gt;&lt;P&gt;.&lt;/P&gt;&lt;P&gt;- As the DN log is complaining about the permission on "/var/lib/hadoop-hdfs" so please check what kind of permission do you have there.    By default it should be owned by  "hdfs:hadoop"    as following:&lt;/P&gt;&lt;PRE&gt;# ls -lart /var/lib/hadoop-hdfs
drwxrwxrwt.  2 hdfs hadoop 4096 Aug 10 11:23 cache
srw-rw-rw-.  1 hdfs hadoop    0 Jan 24 09:09 dn_socket&lt;/PRE&gt;&lt;P&gt;- It would be best if you compare the permission on this Directory "/var/lib/hadoop-hdfs" from your Working DataNode hosts. &lt;/P&gt;&lt;P&gt;- In order to get more information about this exception, please see the use of "validateSocketPathSecurity0" method:&lt;/P&gt;&lt;P&gt;&lt;A href="https://github.com/apache/hadoop/blob/trunk/hadoop-common-project/hadoop-common/src/main/java/org/apache/hadoop/net/unix/DomainSocket.java#L82-L105" target="_blank"&gt;https://github.com/apache/hadoop/blob/trunk/hadoop-common-project/hadoop-common/src/main/java/org/apache/hadoop/net/unix/DomainSocket.java#L82-L105&lt;/A&gt;&lt;/P&gt;&lt;P&gt;.&lt;/P&gt;</description>
      <pubDate>Tue, 24 Jan 2017 18:27:37 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Datanode-goes-dows-after-few-secs-of-starting/m-p/143982#M52405</guid>
      <dc:creator>jsensharma</dc:creator>
      <dc:date>2017-01-24T18:27:37Z</dc:date>
    </item>
    <item>
      <title>Re: Datanode goes dows after few secs of starting</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Datanode-goes-dows-after-few-secs-of-starting/m-p/143983#M52406</link>
      <description>&lt;P&gt;yeah that was right&lt;/P&gt;&lt;PRE&gt;total 4
drwxrwxrwt. 2 hdfs hadoop 4096 Nov 19  2014 cache
srw-rw-rw-. 1 hdfs hadoop    0 Jan 24 03:39 dn_socket


actually non of my datanode host is working.
is that memory issue.
&lt;/PRE&gt;</description>
      <pubDate>Tue, 24 Jan 2017 18:40:20 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Datanode-goes-dows-after-few-secs-of-starting/m-p/143983#M52406</guid>
      <dc:creator>punit9876231</dc:creator>
      <dc:date>2017-01-24T18:40:20Z</dc:date>
    </item>
    <item>
      <title>Re: Datanode goes dows after few secs of starting</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Datanode-goes-dows-after-few-secs-of-starting/m-p/143984#M52407</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/15585/punit9876231.html" nodeid="15585"&gt;@Punit kumar&lt;/A&gt;&lt;/P&gt;&lt;P&gt;I will suggest you to kill these DataNodes ()if there are any DN daemon processes running) and then try manually starting them as "hdfs" user.    to see if those are getting started fine or not?    In Parallel put the DataNode log in "tail" so that we can see if it is showing the same error  or not  ?&lt;/P&gt;&lt;P&gt;Once they come up successfully then next time try from Ambari.&lt;/P&gt;</description>
      <pubDate>Tue, 24 Jan 2017 18:48:13 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Datanode-goes-dows-after-few-secs-of-starting/m-p/143984#M52407</guid>
      <dc:creator>jsensharma</dc:creator>
      <dc:date>2017-01-24T18:48:13Z</dc:date>
    </item>
    <item>
      <title>Re: Datanode goes dows after few secs of starting</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Datanode-goes-dows-after-few-secs-of-starting/m-p/143985#M52408</link>
      <description>&lt;P&gt;@&lt;A href="https://community.hortonworks.com/users/3418/jsensharma.html"&gt;Jay SenSharma&lt;/A&gt;&lt;/P&gt;&lt;P&gt;now am getting this error in datanode.log&lt;/P&gt;&lt;PRE&gt; 2017-01-24 03:39:19,891 INFO  common.Storage (Storage.java:tryLock(715)) - Lock on /mnt/disk1/hadoop/hdfs/data/in_use.lock acquired by nodename 1491@datanode.ec2.internal
2017-01-24 03:39:19,902 INFO  common.Storage (Storage.java:tryLock(715)) - Lock on /mnt/disk2/hadoop/hdfs/data/in_use.lock acquired by nodename 1491@datanode.ec2.internal
2017-01-24 03:39:19,903 FATAL datanode.DataNode (BPServiceActor.java:run(840)) - Initialization failed for Block pool &amp;lt;registering&amp;gt; (Datanode Uuid unassigned) service to namenode.ec2.internal/&lt;/PRE&gt;&lt;P&gt;namenode:8020. Exiting.&lt;/P&gt;java.io.IOException: Incompatible clusterIDs in /mnt/disk1/hadoop/hdfs/data: namenode clusterID = CID-297a140f-7cd6-4c73-afc8-bd0a7d01c0ee; datanode clusterID = CID-7591e6bd-ce9b-4b14-910c-c9603892a0f1
        at org.apache.hadoop.hdfs.server.datanode.DataStorage.doTransition(DataStorage.java:646)
        at org.apache.hadoop.hdfs.server.datanode.DataStorage.addStorageLocations(DataStorage.java:320)
        at org.apache.hadoop.hdfs.server.datanode.DataStorage.recoverTransitionRead(DataStorage.java:403)
        at org.apache.hadoop.hdfs.server.datanode.DataStorage.recoverTransitionRead(DataStorage.java:422)
        at org.apache.hadoop.hdfs.server.datanode.DataNode.initStorage(DataNode.java:1311)
        at org.apache.hadoop.hdfs.server.datanode.DataNode.initBlockPool(DataNode.java:1276)
        at org.apache.hadoop.hdfs.server.datanode.BPOfferService.verifyAndSetNamespaceInfo(BPOfferService.java:314)
        at org.apache.hadoop.hdfs.server.datanode.BPServiceActor.connectToNNAndHandshake(BPServiceActor.java:220)
        at org.apache.hadoop.hdfs.server.datanode.BPServiceActor.run(BPServiceActor.java:828)
        at java.lang.Thread.run(Thread.java:745)
2017-01-24 03:39:19,904 WARN  datanode.DataNode (BPServiceActor.java:run(861)) - Ending block pool service for: Block pool &amp;lt;registering&amp;gt; (Datanode Uuid unassigned) service to ip-172-31-17-251.ec2.internal/172.31.17.251:8020
2017-01-24 03:39:20,005 INFO  datanode.DataNode (BlockPoolManager.java:remove(103)) - Removed Block pool &amp;lt;registering&amp;gt; (Datanode Uuid unassigned)
2017-01-24 03:39:22,005 WARN  datanode.DataNode (DataNode.java:secureMain(2392)) - Exiting Datanode
2017-01-24 03:39:22,007 INFO  util.ExitUtil (ExitUtil.java:terminate(124)) - Exiting with status 0
2017-01-24 03:39:22,008 INFO  datanode.DataNode (StringUtils.java:run(659)) - SHUTDOWN_MSG:
/************************************************************
SHUTDOWN_MSG: Shutting down DataNode at datanode.ec2.internal/datanode
************************************************************/</description>
      <pubDate>Tue, 24 Jan 2017 18:52:19 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Datanode-goes-dows-after-few-secs-of-starting/m-p/143985#M52408</guid>
      <dc:creator>punit9876231</dc:creator>
      <dc:date>2017-01-24T18:52:19Z</dc:date>
    </item>
    <item>
      <title>Re: Datanode goes dows after few secs of starting</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Datanode-goes-dows-after-few-secs-of-starting/m-p/143986#M52409</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/15585/punit9876231.html" nodeid="15585"&gt;@Punit kumar&lt;/A&gt;&lt;/P&gt;&lt;P&gt;Regarding your latest error:&lt;/P&gt;&lt;PRE&gt;java.io.IOException: Incompatible clusterIDs in 
/mnt/disk1/hadoop/hdfs/data: namenode clusterID = 
CID-297a140f-7cd6-4c73-afc8-bd0a7d01c0ee; datanode clusterID = 
CID-7591e6bd-ce9b-4b14-910c-c9603892a0f1 at &lt;/PRE&gt;&lt;P&gt;Looks like your VERSION file has different cluster IDs present in NameNode and DataNode that need to be correct. So please check.&lt;/P&gt;&lt;PRE&gt;cat &amp;lt;dfs.namenode.name.dir&amp;gt;/current/VERSION
cat &amp;lt;dfs.datanode.data.dir&amp;gt;/current/VERSION &lt;/PRE&gt;&lt;P&gt;&lt;STRONG&gt;Hence Copy the clusterID from nematode and put it in the VERSION file of datanode and then try again.&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;Please refer to: &lt;A href="http://www.dedunu.info/2015/05/how-to-fix-incompatible-clusterids-in.html" target="_blank"&gt;http://www.dedunu.info/2015/05/how-to-fix-incompatible-clusterids-in.html&lt;/A&gt;&lt;/P&gt;&lt;P&gt;.&lt;/P&gt;</description>
      <pubDate>Tue, 24 Jan 2017 20:26:07 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Datanode-goes-dows-after-few-secs-of-starting/m-p/143986#M52409</guid>
      <dc:creator>jsensharma</dc:creator>
      <dc:date>2017-01-24T20:26:07Z</dc:date>
    </item>
    <item>
      <title>Re: Datanode goes dows after few secs of starting</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Datanode-goes-dows-after-few-secs-of-starting/m-p/143987#M52410</link>
      <description>&lt;P&gt;@&lt;A href="https://community.hortonworks.com/users/3418/jsensharma.html"&gt;Jay SenSharma&lt;/A&gt;&lt;/P&gt;&lt;P&gt;Thnx, now its working.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 25 Jan 2017 00:12:42 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Datanode-goes-dows-after-few-secs-of-starting/m-p/143987#M52410</guid>
      <dc:creator>punit9876231</dc:creator>
      <dc:date>2017-01-25T00:12:42Z</dc:date>
    </item>
  </channel>
</rss>

