We are getting the exception below.
org.apache.zookeeper.KeeperException$SessionExpiredException: KeeperErrorCode = Session expired
After that, the Region Server shuts down. According to the technical notes, the problem is caused by garbage collector pauses: when a pause lasts longer than the ZooKeeper session timeout, the session expires.
gc-log.pdf shows the GC statistics from just before the shutdown. We have long GC pauses, and they happen very often.
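To check which pauses are long enough to expire the session, the GC log can be scanned for stop-the-world times. This is only a minimal sketch: it assumes the log was written with -XX:+PrintGCApplicationStoppedTime, so that it contains "Total time for which application threads were stopped" lines. The function name find_long_pauses and the 40-second threshold are placeholders I made up; the real threshold should be whatever session timeout is actually in effect between the Region Server and ZooKeeper.

```python
import re

# Matches the lines HotSpot writes when -XX:+PrintGCApplicationStoppedTime is set.
STOPPED_RE = re.compile(
    r"Total time for which application threads were stopped: ([0-9]+\.[0-9]+) seconds"
)


def find_long_pauses(log_lines, threshold_seconds):
    """Return (line_number, pause_seconds) for each pause at or above the threshold."""
    long_pauses = []
    for line_number, line in enumerate(log_lines, start=1):
        match = STOPPED_RE.search(line)
        if match is None:
            continue  # not a stopped-time line
        pause = float(match.group(1))
        if pause >= threshold_seconds:
            long_pauses.append((line_number, pause))
    return long_pauses


if __name__ == "__main__":
    # Hypothetical sample lines, not taken from gc-log.pdf.
    sample = [
        "1.234: Total time for which application threads were stopped: 0.0123450 seconds",
        "9.876: Total time for which application threads were stopped: 45.6789010 seconds",
    ]
    # 40.0 is an assumed session timeout in seconds; replace it with the real value.
    for line_number, pause in find_long_pauses(sample, 40.0):
        print("line %d: paused for %.3f s" % (line_number, pause))
```

If no pause in the log reaches the threshold, the expiry may not be caused by a single long pause, and the time between consecutive pauses would be worth looking at as well.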
The pauses come specifically from young generation GC, so we decreased the young generation heap to 2500 MB. We also added the GC parameters below, but that didn't solve our problem.
-XX:+UseConcMarkSweepGC -XX:PermSize=128m -XX:MaxPermSize=128m -XX:SurvivorRatio=4 -XX:+PerfDisableSharedMem -XX:ParallelGCThreads=8 -XX:CMSInitiatingOccupancyFraction=50 -XX:+UseCMSInitiatingOccupancyOnly
What steps should I take to solve this?
Thanks for the response, but as I said in my question, I have already read about similar problems, and the suggested solution is always to tune the GC.
I didn't find any content that actually resolves the Region Server GC pause problem.