Created on 01-20-2022 11:56 PM - edited 01-20-2022 11:58 PM
ip:9866:DataXceiver error processing WRITE_BLOCK operation src: /ip:30458 dst: /ip:9866
java.io.EOFException: Unexpected EOF while trying to read response from server
    at org.apache.hadoop.hdfs.protocolPB.PBHelperClient.vintPrefixed(PBHelperClient.java:539)
    at org.apache.hadoop.hdfs.server.datanode.DataXceiver.writeBlock(DataXceiver.java:846)
    at org.apache.hadoop.hdfs.protocol.datatransfer.Receiver.opWriteBlock(Receiver.java:173)
    at org.apache.hadoop.hdfs.protocol.datatransfer.Receiver.processOp(Receiver.java:107)
    at org.apache.hadoop.hdfs.server.datanode.DataXceiver.run(DataXceiver.java:291)
    at java.lang.Thread.run(Thread.java:748)

ip:9866:DataXceiver error processing WRITE_BLOCK operation src: /ip:46217 dst: /ip:9866
java.io.IOException: Not ready to serve the block pool, BP-711568456-10.13.6.7-1597913882399.
    at org.apache.hadoop.hdfs.server.datanode.DataXceiver.checkAndWaitForBP(DataXceiver.java:1402)
    at org.apache.hadoop.hdfs.server.datanode.DataXceiver.checkAccess(DataXceiver.java:1418)
    at org.apache.hadoop.hdfs.server.datanode.DataXceiver.writeBlock(DataXceiver.java:715)
    at org.apache.hadoop.hdfs.protocol.datatransfer.Receiver.opWriteBlock(Receiver.java:173)
    at org.apache.hadoop.hdfs.protocol.datatransfer.Receiver.processOp(Receiver.java:107)
    at org.apache.hadoop.hdfs.server.datanode.DataXceiver.run(DataXceiver.java:291)
    at java.lang.Thread.run(Thread.java:748)
This problem has been bothering me. Which direction should I investigate?
Created 01-21-2022 12:43 AM
I think the DataNode has restarted and is still scanning its blocks. It should return to normal once the block scan is complete.
You can test it again once a line like the following appears in the DataNode log:
2022-01-17 22:41:56,564 INFO org.apache.hadoop.hdfs.server.datanode.fsdataset.impl.FsDatasetImpl: Time to add replicas to map for block pool BP-1115033092-xx.xx.xx.xx-1532155849771 on volume /data4/xxx/hdfs/dfs/data/current: 5270395ms
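If it helps, here is a minimal Python sketch for spotting that message in a DataNode log. The regular expression is modeled only on the single sample line above, so it is an assumption that other Hadoop versions word the message the same way. The function name find_replica_map_events is made up for this example.

```python
import re

# Pattern modeled on the one sample log line quoted in this thread (assumption:
# the wording is the same in the Hadoop version being checked).
REPLICA_MAP_RE = re.compile(
    r"Time to add replicas to map for block pool (?P<pool>\S+) "
    r"on volume (?P<volume>\S+): (?P<ms>\d+)ms"
)


def find_replica_map_events(lines):
    """Return (block_pool, volume, milliseconds) for each matching log line."""
    events = []
    for line in lines:
        match = REPLICA_MAP_RE.search(line)
        if match:
            events.append(
                (match.group("pool"), match.group("volume"), int(match.group("ms")))
            )
    return events


# The sample line from the reply above, used here as test input.
sample = [
    "2022-01-17 22:41:56,564 INFO "
    "org.apache.hadoop.hdfs.server.datanode.fsdataset.impl.FsDatasetImpl: "
    "Time to add replicas to map for block pool "
    "BP-1115033092-xx.xx.xx.xx-1532155849771 on volume "
    "/data4/xxx/hdfs/dfs/data/current: 5270395ms",
]

for pool, volume, ms in find_replica_map_events(sample):
    # Convert milliseconds to minutes for easier reading.
    print(f"{pool} on {volume}: {ms / 60000:.1f} minutes")
```

To check a real log, pass an open file object instead of the sample list. One matching line is logged per volume, so a DataNode with several data directories should produce several events.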
Created 01-26-2022 11:49 PM
You mean I can ignore this error, right? However, it is an error and it will affect my cluster, so I should find out what causes it.