Support Questions
Find answers, ask questions, and share your expertise

One DataNode is started but no live

Contributor

I just added new DataNodes in my cluster but one of them isn't live.

The DataNode's log is:

2017-11-24 10:18:57,761 WARN  datanode.DataNode (BPServiceActor.java:retrieveNamespaceInfo(227)) - Problem connecting to server: namenode.example.com/192.168.0.2:8020
2017-11-24 10:19:18,785 INFO  ipc.Client (Client.java:handleConnectionFailure(906)) - Retrying connect to server: namenode.example.com/192.168.0.2:8020. Already tried 0 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=50, sleepTime=1000 MILLISECONDS)

Referrer to basic issues:

  • The /etc/hosts file in each device has the IP of all hosts
  • IPv6 is disabled on the interface dedicated to Hadoop
  • firewalld is stopped
  • SELinux is disabled
  • I can ping in both directions

So, I restarted the DataNode but the problem persists, see the startup logging:

2017-11-24 10:25:34,053 INFO  ipc.Server (Server.java:run(821)) - Starting Socket Reader #1 for port 8010
2017-11-24 10:25:34,115 INFO  datanode.DataNode (DataNode.java:initIpcServer(941)) - Opened IPC server at /0.0.0.0:8010
2017-11-24 10:25:34,155 INFO  datanode.DataNode (BlockPoolManager.java:refreshNamenodes(152)) - Refresh request received for nameservices: null
2017-11-24 10:25:34,171 INFO  datanode.DataNode (BlockPoolManager.java:doRefreshNamenodes(201)) - Starting BPOfferServices for nameservices: <default>
2017-11-24 10:25:34,179 INFO  datanode.DataNode (BPServiceActor.java:run(761)) - Block pool <registering> (Datanode Uuid unassigned) service to namenode.example.com/192.168.0.2:8020 starting to offer service
2017-11-24 10:25:34,183 INFO  ipc.Server (Server.java:run(1064)) - IPC Server Responder: starting
2017-11-24 10:25:34,183 INFO  ipc.Server (Server.java:run(900)) - IPC Server listener on 8010: starting
2017-11-24 10:25:50,309 INFO  ipc.Client (Client.java:handleConnectionFailure(906)) - Retrying connect to server: namenode.example.com/192.168.0.2:8020. Already tried 0 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=50, sleepTime=1000 MILLISECONDS)

What should I do?

Thanks in advance.

1 REPLY 1

Explorer

Same problem here…

; ;