Created 05-07-2018 01:29 PM
Hi Everyone,
We are getting the exception below.
org.apache.zookeeper.KeeperException$SessionExpiredException: KeeperErrorCode = Session expired
After that, the Region Server shuts down. According to the technical notes we found, this happens when a garbage collection pause lasts longer than the ZooKeeper session timeout.
gc-log.pdf contains the GC stats from just before the shutdown. We have long GC pauses, and they happen very often.
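To put numbers on those pauses, here is a minimal sketch that pulls the wall-clock (`real=`) time out of each `[Times: ...]` line that HotSpot writes when `-XX:+PrintGCDetails` is enabled, and lists the pauses above a threshold. It assumes a plain-text GC log rather than the PDF. The sample lines and the 40-second threshold are made up for illustration; the threshold should be whatever session timeout ZooKeeper actually negotiated for the Region Server, which is not stated here.

```python
import re

# Matches the wall-clock part of a HotSpot
# "[Times: user=... sys=..., real=... secs]" line.
REAL_TIME = re.compile(r"real=(\d+(?:\.\d+)?) secs")


def gc_pauses(log_lines):
    """Return the real (wall-clock) time of every GC event found, in seconds."""
    pauses = []
    for line in log_lines:
        match = REAL_TIME.search(line)
        if match:
            pauses.append(float(match.group(1)))
    return pauses


def pauses_over(log_lines, threshold_secs):
    """Return only the pauses strictly longer than threshold_secs."""
    return [p for p in gc_pauses(log_lines) if p > threshold_secs]


if __name__ == "__main__":
    # Hypothetical log lines, not taken from gc-log.pdf.
    sample = [
        "[GC (Allocation Failure) [Times: user=0.40 sys=0.02, real=0.06 secs]",
        "a line without GC timing information",
        "[GC (Allocation Failure) [Times: user=1.10 sys=0.30, real=45.20 secs]",
    ]
    # Assumed session timeout of 40 seconds; replace with the real value.
    print(pauses_over(sample, 40.0))
```

Any pause this reports is long enough, under the assumed timeout, for ZooKeeper to expire the session while the JVM is stopped.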
The long pauses are specifically in the young generation GC, so we decreased the young generation heap to 2500 MB. We also added the GC parameters below, but that didn't solve our problem.
-XX:+UseConcMarkSweepGC -XX:PermSize=128m -XX:MaxPermSize=128m -XX:SurvivorRatio=4 -XX:+PerfDisableSharedMem -XX:ParallelGCThreads=8 -XX:CMSInitiatingOccupancyFraction=50 -XX:+UseCMSInitiatingOccupancyOnly
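For reference, a rough sketch of what these settings imply for the young generation layout. HotSpot defines `SurvivorRatio` as the size of eden divided by the size of one survivor space, so the young generation is eden plus two survivor spaces. The 2500 MB figure is the one mentioned above; how it was set (for example with `-Xmn`) is not shown in the flag list, and the JVM aligns the real sizes, so the numbers are approximate. The function name is only illustrative.

```python
def young_gen_layout(young_mb, survivor_ratio):
    """Split a young generation into eden and two equal survivor spaces.

    SurvivorRatio = eden / one survivor space, therefore
    young = eden + 2 * survivor = (survivor_ratio + 2) * survivor.
    """
    survivor_mb = young_mb / (survivor_ratio + 2)
    eden_mb = survivor_mb * survivor_ratio
    return eden_mb, survivor_mb


if __name__ == "__main__":
    # 2500 MB young generation with -XX:SurvivorRatio=4, as in the post.
    eden, survivor = young_gen_layout(2500, 4)
    print(f"eden ~ {eden:.0f} MB, each survivor ~ {survivor:.0f} MB")
    # prints: eden ~ 1667 MB, each survivor ~ 417 MB
```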
What steps should I take to solve this?
Thanks.
Created 05-08-2018 06:34 AM
This question has been answered in this community before; please search for it.
Also have a look at https://superuser.blog/hbase-dead-regionserver/
Created 05-08-2018 08:23 AM
Thanks for the response, but as I said in my question, I have read the similar threads, and the solution they give is to tune the GC.
I didn't find anything that resolves the Region Server GC pause problem.
Created 08-10-2018 04:45 AM
Any update on this? Can anyone please take a look and help?