Created on 05-31-2016 03:00 PM - edited 09-16-2022 03:22 AM
Hi Folks,
We are running into an issue where one of the region has been stuck in transition state for a few weeks.
8befbfc89993b7e57e7e21cf17113dfc state=OFFLINE, ts=1464546604126, server=null}
From the hdfs logs it loosk like one of the 3 files in the region directory is missing blocks .
/var/shn/data/hbase/data/default/user_counters/8befbfc89993b7e57e7e21cf17113dfc/stats-monthly/c4b15e7f297c498d8d935378804af732: CORRUPT blockpool BP-1312200060-10.0.4.237-1411698869524 block blk_1287541730
MISSING 1 blocks of total size 663467 B
0. BP-1312200060-10.0.4.237-1411698869524:blk_1287541730_213952706 len=663467 MISSING!
If i remove just this one corrupted file and do assign 'region id' can i recover all my remaining files. please advise
Thank You,
Mastan
Created 06-01-2016 03:12 PM
Created 05-31-2016 11:04 PM
Created 06-01-2016 03:03 PM
Thank You harsh that worked, Unfortunately the logs for NN have rolled over and no rca could be found.
This is possibly due to multiple node issues we had a month back or so..
On the same note i have a quick question. Say if i'm taking down a node for patching , How long does NN wait before starting the replication of the under replicated blocks?
Created 06-01-2016 03:12 PM