Support Questions

Find answers, ask questions, and share your expertise
Announcements
Celebrating as our community reaches 100,000 members! Thank you!

Dead region servers

avatar
Contributor

Hello everybody, basically there was an electric problem and the cluster was suddently shutdown.
After restarting everything Hbase results to have all the Region Servers online (but with 0 regions each) and the Region Server with the same names are shown in Dead Region Servers.
Everytime i restart hbase, new rows are add in the Dead Region Server .
This already happened to me long time ago and the problem was related to zookeeper, but i can't find the old post.
Do you know what i can do? Thanks

P.S. my cluster is kerberized, hbase version 2.0

1 ACCEPTED SOLUTION

avatar
Super Collaborator
hide-solution

This problem has been solved!

Want to get a detailed solution you have to login/registered on the community

Register/Login
6 REPLIES 6

avatar
Super Collaborator
hide-solution

This problem has been solved!

Want to get a detailed solution you have to login/registered on the community

Register/Login

avatar
Contributor

Hi @rki_and thanks for your answer, was exaclty was needed.
But, if i may ask, after that i see all regions server online, 0 offline and all regions on 1 region server execept for meta that is on another one (in total i have 3).
The problem is that i got this error in master:

org.apache.hadoop.hbase.NotServingRegionException: hbase:quota,,1620896369946.28dd7c81713c9347e8dfe4e6993b1ec7. is not online on my-server3.domain.com,16020,1658432084980


Do you have any idea on what could i do?

Thanks

avatar
Super Collaborator

Hello @loridigia 

You can try to assign the region from hbase shell.

> assign '28dd7c81713c9347e8dfe4e6993b1ec7'

If you can attach the below command output (with valid ticket ), we can check which all regions are offiline or in transition.

 

# hbase hbck -details

avatar
Contributor

Hi RKI, the command worked, that error now is gone... but doing "hbase hbck -details" i goit 560 inconsistencies all equals: 

ERROR: There is a hole in the region chain between  and .  You need to create a new .regioninfo and region dir in hdfs to plug the hole.


avatar
Super Collaborator

Hi,

A hole in region chain most probably indicates there are some regions which are not yet online and hence creates a hole.

 

# cat hbck.report | grep "not deployed on any region server"

 

If you see regions in the above command output, you will need to assign them using hbase shell.

avatar
Contributor

You are a SAVIOUR !!
I made a script to assign all regions with ""not deployed on any region server"" and now it works fine!!
Awesome thanks a lot mate!