Region Server down frequently

The region server on one particular worker node goes down very frequently. Restarting the service doesn't resolve the issue. Can anyone give me an idea of what the cause might be, or what I should look for in the logs?

Re: Region Server down frequently

Hello

Your description is a little too high level for a precise diagnosis. I understand this affects one specific Region Server, not all of them and not a random one. Several things can take a Region Server down. A usual culprit is skew: this Region Server receives a disproportionate share of traffic, for example writes. It then flushes the memstore very often and runs many GCs to free memory. If those GC pauses last too long, the server may fail to heartbeat to ZooKeeper within the predefined time window, and ZooKeeper will then take it out. Look in the Region Server logs for memstore flush and GC messages. You should also see ZooKeeper timeout warnings.
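
To make that log check concrete, here is a minimal sketch that counts how many lines in a Region Server log mention each of the three symptoms described above (memstore flushes, GC pauses, ZooKeeper timeouts). The keyword patterns and the names `SYMPTOM_PATTERNS`, `count_symptoms`, and `count_symptoms_in_file` are my own assumptions for illustration, not HBase identifiers. Exact message wording differs between HBase versions, so adjust the patterns to what your logs actually contain.

```python
import re
from collections import Counter

# Assumed keyword patterns (not taken from HBase itself). Log wording varies
# by version, so treat these as a starting point and tune them to your logs.
SYMPTOM_PATTERNS = {
    "memstore_flush": re.compile(r"flush", re.IGNORECASE),
    "gc_pause": re.compile(r"pause|garbage collect", re.IGNORECASE),
    "zookeeper_timeout": re.compile(
        r"session.*(expired|timed out)|timed out.*session", re.IGNORECASE
    ),
}


def count_symptoms(lines):
    """Count how many lines match each symptom pattern."""
    counts = Counter()
    for line in lines:
        for name, pattern in SYMPTOM_PATTERNS.items():
            if pattern.search(line):
                counts[name] += 1
    return counts


def count_symptoms_in_file(path):
    """Run count_symptoms over a log file, tolerating odd bytes in the log."""
    with open(path, errors="replace") as handle:
        return count_symptoms(handle)
```

Calling `count_symptoms_in_file` with the path to the affected node's Region Server log, and then with the log of a healthy node, gives a rough comparison. A much higher flush or pause count on the failing node would be consistent with the skew explanation, though it does not prove it.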

Re: Region Server down frequently

@nmaillard I am getting this error in the Ambari UI when checking the response link for a particular worker node. Can you tell me why this is happening and how the issue can be resolved? I also tried restarting the ZooKeeper service, but to no effect.

Re: Region Server down frequently

Grep for WARN or ERROR lines in the Region Server logs. Also check your system logs for resource availability errors.
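
If you would rather script this than run grep by hand, the following is a rough Python equivalent. It assumes the log level appears in each line as a standalone word, which is typical of log4j-style output but worth verifying against your own logs. The function names `filter_by_level` and `summarize_levels` are hypothetical helpers, not part of any HBase or Ambari tooling.

```python
import re
from collections import Counter


def filter_by_level(lines, levels=("WARN", "ERROR")):
    """Yield (level, line) for lines containing one of the levels as a whole word."""
    pattern = re.compile(r"\b(" + "|".join(map(re.escape, levels)) + r")\b")
    for line in lines:
        match = pattern.search(line)
        if match:
            yield match.group(1), line.rstrip("\n")


def summarize_levels(lines, levels=("WARN", "ERROR")):
    """Count matching lines per level, as a quick overview before reading them."""
    return Counter(level for level, _ in filter_by_level(lines, levels))
```

Both functions accept any iterable of lines, so an open file handle on the Region Server log can be passed in directly. Checking the summary first shows whether the log is dominated by warnings or by errors before you read individual lines.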
