Created on 04-18-2016 06:07 AM - edited 08-19-2019 05:07 AM
Hello,
after upgrading HDP from 2.2.4 to 2.3.4.7 using Ambari 2.2.1.1 the service HBase shows a strange alert.
It notifies me about dead region servers, and in detail there shall be "7 out of 5 dead region servers"... hmmmm, really ?!? A total of "5" is correct, this is the number of Regionservers, and they are all up and running (and marked "green" in Ambari itself)
How to get rid of that wrong alert notification ?!? ...just waiting a couple of days didn't solve the problem 😉
Thanks, Gerd
Created 04-18-2016 10:10 AM
Next try: Login as user hdfs, and do "hdfs dfs -ls /apps/hbase/data/WALs".
Created 04-18-2016 08:08 AM
HBase ports changed in HDP-2.3.4, details here. And region servers has logical names which include RS port. An example RS name: "sandbox.hortonworks.com,16020,1460965964168". Before, the RS port was 60020, and now it's 16020. So, if you have 5 machines running RS, before and after the upgrade there are 10 RSs to be taken care of, and in your case HBase master may still think that RS names with 60020 are still being used. Restarting HBase (or just restarting HBase master) is supposed to remove them, and solve your issues. Before and after the restart you can check Regions servers in your Hbase Web UI (HBase --> Quick Links).
Created on 04-18-2016 09:12 AM - edited 08-19-2019 05:07 AM
Hi @Predrag Minovic , thanks for answering.
The observation is exactly what is happening here, but even after restarting HBase multiple times this alert doesn't disappear. The output of the Dead Region servers still looks wired, because some of them are multiples and some are even not there =>
next try ?
Created 04-18-2016 10:10 AM
Next try: Login as user hdfs, and do "hdfs dfs -ls /apps/hbase/data/WALs".
Created 04-18-2016 02:45 PM
Brilliant @Predrag Minovic , that solved the issue. Thanks !