<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Hbase region server is getting stopped frequently without any error log in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/Hbase-region-server-is-getting-stopped-frequently-without/m-p/101623#M64588</link>
    <description>&lt;P&gt;I am not able to access the logs attached by you. Is it possible for you to share the logs through some shared ftp links?&lt;/P&gt;&lt;P&gt;would be intereseted in zookeeper logs, gc logs, datanodes logs, hbase regionserver logs, hbase master logs as well.&lt;/P&gt;</description>
    <pubDate>Thu, 07 Jan 2016 01:07:48 GMT</pubDate>
    <dc:creator>asinghal</dc:creator>
    <dc:date>2016-01-07T01:07:48Z</dc:date>
    <item>
      <title>Hbase region server is getting stopped frequently without any error log</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Hbase-region-server-is-getting-stopped-frequently-without/m-p/101617#M64582</link>
      <description>&lt;P&gt;We are having five nodes hadoop cluster powered by HDP (Version 2.1), Ambari (Version 1.6). We have 1 hbase(Version 0.98) master. 3 are data nodes which are having 3 region servers. We are having hbase application running on this.&lt;/P&gt;&lt;P&gt;For last couple of weeks, region server of data node2 was getting stopped arbitrarily without any error logs. What we observed is that region server was going down on weekly basis. But from last couple of days, region server of data node3 is also going down without any error logs. &lt;/P&gt;&lt;P&gt;Region server logs are as follows-&lt;/P&gt;&lt;P&gt;Log list-&lt;/P&gt;&lt;P&gt;-rw-r--r-- 1 hbase hadoop  191 Dec 29 18:17 
hbase-hbase-regionserver-fsdata2c.corp.arc.com.out.2 &lt;/P&gt;&lt;P&gt;-rw-r--r-- 1 hbase hadoop 814M Dec 29 18:17 gc.log-201511240826

-rw-r--r-- 1 hbase hadoop  191 Jan  4 18:27 
hbase-hbase-regionserver-fsdata2c.corp.arc.com.out.1

-rw-r--r-- 1 hbase hadoop 186M Jan  4 18:27 gc.log-201512300433&lt;/P&gt;&lt;P&gt;[root@fsdata2c hbase]# more 
hbase-hbase-regionserver-fsdata2c.corp.arc.com.out.1

/usr/lib/hbase/bin/hbase-daemon.sh: line 197: 19217 
Killed  nice -n $HBASE_NICENESS "$HBASE_HOME"/bin/hbase 
--config "${HBASE_CONF_DIR

}" $command "$@" start &amp;gt;&amp;gt; "$logout" 2&amp;gt;&amp;amp;1

[root@fsdata2c hbase]#&lt;/P&gt;&lt;P&gt;As of now, we are starting region server manually which is solving the problem on temporary basis until region server stops again. &lt;/P&gt;&lt;P&gt;We require permanent solutions. Can anyone please help on this issue. &lt;/P&gt;</description>
      <pubDate>Wed, 06 Jan 2016 15:37:03 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Hbase-region-server-is-getting-stopped-frequently-without/m-p/101617#M64582</guid>
      <dc:creator>raja_ray</dc:creator>
      <dc:date>2016-01-06T15:37:03Z</dc:date>
    </item>
    <item>
      <title>Re: Hbase region server is getting stopped frequently without any error log</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Hbase-region-server-is-getting-stopped-frequently-without/m-p/101618#M64583</link>
      <description>&lt;P&gt;can you please look for JVM pauses in regionserver logs?&lt;/P&gt;</description>
      <pubDate>Wed, 06 Jan 2016 15:42:56 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Hbase-region-server-is-getting-stopped-frequently-without/m-p/101618#M64583</guid>
      <dc:creator>asinghal</dc:creator>
      <dc:date>2016-01-06T15:42:56Z</dc:date>
    </item>
    <item>
      <title>Re: Hbase region server is getting stopped frequently without any error log</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Hbase-region-server-is-getting-stopped-frequently-without/m-p/101619#M64584</link>
      <description>&lt;P&gt;There is no error is GC log and in regionserver log corresponding to JVM pauses(org.apache.hadoop.hbase.util.JvmPauseMonitor).&lt;/P&gt;&lt;P&gt;We are getting following error in logs-&lt;/P&gt;&lt;P&gt;[main-SendThread(fsdata2c.corp.arc.com:2181)] zookeeper.ClientCnxn: Unable to read additional data from server sessionid 0x0, likely server has closed socket, closing socket connection and attempting reconnect&lt;/P&gt;&lt;P&gt;I have attached logs.&lt;/P&gt;&lt;P&gt;Also, we have increased Hbase region server heap space to 4096MB and zookeeper session timeout to 40 seconds.&lt;/P&gt;&lt;P&gt;Please share your thought.
&lt;A href="https://community.cloudera.com/legacyfs/online/attachments/1210-hbase-region-server-log.txt"&gt;hbase-region-server-log.txt&lt;/A&gt; (4.5 kB)
&lt;A href="https://community.cloudera.com/legacyfs/online/attachments/1211-hbase-gc-log.txt"&gt;hbase-gc-log.txt&lt;/A&gt; (3.7 kB) &lt;/P&gt;</description>
      <pubDate>Wed, 06 Jan 2016 18:09:30 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Hbase-region-server-is-getting-stopped-frequently-without/m-p/101619#M64584</guid>
      <dc:creator>raja_ray</dc:creator>
      <dc:date>2016-01-06T18:09:30Z</dc:date>
    </item>
    <item>
      <title>Re: Hbase region server is getting stopped frequently without any error log</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Hbase-region-server-is-getting-stopped-frequently-without/m-p/101620#M64585</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/1947/rajaray.html" nodeid="1947"&gt;@Raja Ray&lt;/A&gt; are all standard requirements set, i.e. ulimit, swappiness? Also, can you check the disk health? Also, what OS are you running, in case of RPM based, do you have Transparent Huge Pages off?&lt;/P&gt;</description>
      <pubDate>Wed, 06 Jan 2016 21:41:01 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Hbase-region-server-is-getting-stopped-frequently-without/m-p/101620#M64585</guid>
      <dc:creator>aervits</dc:creator>
      <dc:date>2016-01-06T21:41:01Z</dc:date>
    </item>
    <item>
      <title>Re: Hbase region server is getting stopped frequently without any error log</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Hbase-region-server-is-getting-stopped-frequently-without/m-p/101621#M64586</link>
      <description>&lt;P&gt;Hi &lt;A rel="user" href="https://community.cloudera.com/users/393/aervits.html" nodeid="393"&gt;@Artem Ervits&lt;/A&gt; : ulimit is unlimited, Swapiness is 30, OS is: RHEL 7, Transparent Huge Pages was enabled but now we have turned THP off. please suggest to tune these parameters!! Thanks in Advance&lt;/P&gt;</description>
      <pubDate>Wed, 06 Jan 2016 23:02:45 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Hbase-region-server-is-getting-stopped-frequently-without/m-p/101621#M64586</guid>
      <dc:creator>raja_ray</dc:creator>
      <dc:date>2016-01-06T23:02:45Z</dc:date>
    </item>
    <item>
      <title>Re: Hbase region server is getting stopped frequently without any error log</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Hbase-region-server-is-getting-stopped-frequently-without/m-p/101622#M64587</link>
      <description>&lt;P&gt;THP on Centos7 is not a big deal but no biggie, you should turn swappinness to 0. I have a script with some important parameters, take a look. &lt;/P&gt;&lt;P&gt;&lt;A href="https://github.com/dbist/scripts/blob/master/administration/hbase.sh"&gt;https://github.com/dbist/scripts/blob/master/administration/hbase.sh&lt;/A&gt;&lt;/P&gt;&lt;P&gt;also check for number of open files.&lt;/P&gt;</description>
      <pubDate>Wed, 06 Jan 2016 23:08:56 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Hbase-region-server-is-getting-stopped-frequently-without/m-p/101622#M64587</guid>
      <dc:creator>aervits</dc:creator>
      <dc:date>2016-01-06T23:08:56Z</dc:date>
    </item>
    <item>
      <title>Re: Hbase region server is getting stopped frequently without any error log</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Hbase-region-server-is-getting-stopped-frequently-without/m-p/101623#M64588</link>
      <description>&lt;P&gt;I am not able to access the logs attached by you. Is it possible for you to share the logs through some shared ftp links?&lt;/P&gt;&lt;P&gt;would be intereseted in zookeeper logs, gc logs, datanodes logs, hbase regionserver logs, hbase master logs as well.&lt;/P&gt;</description>
      <pubDate>Thu, 07 Jan 2016 01:07:48 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Hbase-region-server-is-getting-stopped-frequently-without/m-p/101623#M64588</guid>
      <dc:creator>asinghal</dc:creator>
      <dc:date>2016-01-07T01:07:48Z</dc:date>
    </item>
    <item>
      <title>Re: Hbase region server is getting stopped frequently without any error log</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Hbase-region-server-is-getting-stopped-frequently-without/m-p/101624#M64589</link>
      <description>&lt;P&gt;Hi &lt;A rel="user" href="https://community.cloudera.com/users/393/aervits.html" nodeid="393"&gt;@Artem Ervits&lt;/A&gt; thanks for your prompt response. 
we have set swapiness to 0 and executed hbase.sh script given by you on all regionserver and Hbase master node.
here is the number of files that are open. &lt;/P&gt;&lt;P&gt;Hmaster:
cat /proc/sys/fs/file-nr &lt;/P&gt;&lt;P&gt;2208    0      1557484 &lt;/P&gt;&lt;P&gt;Regionserver #1: cat /proc/sys/fs/file-nr &lt;/P&gt;&lt;P&gt;2976     0     3126194 &lt;/P&gt;&lt;P&gt;Regionserver #2:
cat /proc/sys/fs/file-nr &lt;/P&gt;&lt;P&gt;3008     0     3126194 &lt;/P&gt;&lt;P&gt;Regionserver #3:
cat /proc/sys/fs/file-nr &lt;/P&gt;&lt;P&gt;2752     0     3126194&lt;/P&gt;&lt;P&gt;
we will observe the hbase components for next couple of weeks and will let you know. Thanks for your expert help!&lt;/P&gt;</description>
      <pubDate>Thu, 07 Jan 2016 14:54:27 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Hbase-region-server-is-getting-stopped-frequently-without/m-p/101624#M64589</guid>
      <dc:creator>raja_ray</dc:creator>
      <dc:date>2016-01-07T14:54:27Z</dc:date>
    </item>
    <item>
      <title>Re: Hbase region server is getting stopped frequently without any error log</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Hbase-region-server-is-getting-stopped-frequently-without/m-p/101625#M64590</link>
      <description>&lt;P&gt;can you still provide the regionserver logs? &lt;A rel="user" href="https://community.cloudera.com/users/1947/rajaray.html" nodeid="1947"&gt;@Raja Ray&lt;/A&gt; what I suggested are just common practices and not necesserily a solution for your problem.&lt;/P&gt;</description>
      <pubDate>Thu, 07 Jan 2016 21:52:58 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Hbase-region-server-is-getting-stopped-frequently-without/m-p/101625#M64590</guid>
      <dc:creator>aervits</dc:creator>
      <dc:date>2016-01-07T21:52:58Z</dc:date>
    </item>
    <item>
      <title>Re: Hbase region server is getting stopped frequently without any error log</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Hbase-region-server-is-getting-stopped-frequently-without/m-p/101626#M64591</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/393/aervits.html" nodeid="393"&gt;@Artem Ervits&lt;/A&gt; After two weeks of observation, it seems that the issue is not occurring again. Thanks for your expert advice, help and solution on the issue. One more thing I am observing that "Blocks health CRIT for about a minute
CRITICAL: corrupt_blocks:&amp;lt;1&amp;gt;, missing_blocks:&amp;lt;0&amp;gt;, total_blocks:&amp;lt;1765&amp;gt;", although there is no missing blocks. I will create a separate thread for that. &lt;/P&gt;</description>
      <pubDate>Tue, 19 Jan 2016 14:17:16 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Hbase-region-server-is-getting-stopped-frequently-without/m-p/101626#M64591</guid>
      <dc:creator>raja_ray</dc:creator>
      <dc:date>2016-01-19T14:17:16Z</dc:date>
    </item>
    <item>
      <title>Re: Hbase region server is getting stopped frequently without any error log</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Hbase-region-server-is-getting-stopped-frequently-without/m-p/101627#M64592</link>
      <description>&lt;P&gt;Excellent can you please accept the answer to close it out &lt;A rel="user" href="https://community.cloudera.com/users/1947/rajaray.html" nodeid="1947"&gt;@Raja Ray&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 19 Jan 2016 20:12:31 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Hbase-region-server-is-getting-stopped-frequently-without/m-p/101627#M64592</guid>
      <dc:creator>aervits</dc:creator>
      <dc:date>2016-01-19T20:12:31Z</dc:date>
    </item>
  </channel>
</rss>

