- Subscribe to RSS Feed
- Mark Question as New
- Mark Question as Read
- Float this Question for Current User
- Bookmark
- Subscribe
- Mute
- Printer Friendly Page
HDFS blocks missing at every restart of CM cluster
- Labels:
-
Cloudera Manager
-
HDFS
Created on ‎12-21-2021 07:41 AM - edited ‎12-21-2021 07:43 AM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Hi,
I am facing issues with HDFS in my Cloudera Manager cluster. I have a cluster of 4 virtual machines (1 master and 3 slaves) on cloud and at every shutdown and restart of the cluster/vms HDFS shows some missing blocks.
I have found a "workaround" as deleting the missing files and restarting the service would solve the problem but sometimes the missing files contain important data related to other hadoop services that generate issues if they are removed. Is there another solution to avoid deleting data ?
Thank you.
Created ‎01-16-2022 01:27 AM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Missing blocks can happen because of 2 reasons,
1. Data corruption - Either the disk is corrupted or the VM is down. This is a permanently failure means the data is lost.
2. Delay is Datanode report - This is because of a delay in datanode report and this is a temporary failure and in some time if datanode reports back the report then the cluster will be back to normal.
How to find and solve the issue.
1. Make sure no data volume failures or VM failures.
2. Ensure that the namenode received heartbeats from all datanodes.
Namenode UI -> Datanodes -> last contact
If all datanodes found in the list and not in dead list or not found then no issues with receiving block report.
Apart from this to debug further we need Namenode and datanode logs.
Created on ‎01-18-2022 01:23 AM - edited ‎01-18-2022 01:30 AM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Thank you for your response , so do you think that maybe because i shut down the virtual machines everyday the data is sometimes lost ? it is very unusual since the issue now is not happening every time but sometimes hdfs is corrupt and shows blocks missing and sometimes it's healthy from the start so i don't know what to think of it.
Created ‎01-18-2022 01:57 AM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
I suspect that your datanodes report is slow.Because after restart of namenode you are trigger the datanode restart so it will take time to come up with reports during that interval you can except for missing blocks this will be an intermediate issue. So that you can wait for few more min's and check the namenode ui. Else during the time of issue copy the logs and share it.
Make sure to mark the answer as the accepted solution. If it resolves your issue !
Created ‎01-16-2022 05:56 PM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Are you make sure your Datanodes are all started and working properly?When I restarted the cluster earlier,there are also many missingBlocks before the DataNode restarts completely.
Created ‎01-16-2022 07:47 PM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
When I restarted the cluster earlier,there are also many missingBlocks before the DataNode restarts completely.
This can cause the missing blocks.ex: Namenode got restarted but still the datanode restart is inprogress. So the heartbeat from datanode might be missed. To confirm that you can check the namenode UI post restart and during the time of missing blocks.
