<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Open File Descriptors warning in Cloudera Manager in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Open-File-Descriptors-warning-in-Cloudera-Manager/m-p/46850#M859</link>
    <description>&lt;P&gt;Hi... I am facing the same issue on my dev cluster.&amp;nbsp;&lt;/P&gt;&lt;P&gt;It has the CM5.7 and CDH5.6.&lt;/P&gt;&lt;P&gt;Also it has Java 1.8.&lt;/P&gt;&lt;P&gt;The number of open files descriptor just keeps on increasing. After restart of machine it brings down to a saner number.&lt;/P&gt;&lt;P&gt;Note: The only diff between this cluster and other clusters is that it also has Kafka installed on it. Can it be a reason?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Any idea why this is happening??&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Ran below on dev nodes.&lt;/P&gt;&lt;P&gt;[root@hdp-dev01 ~]# /usr/sbin/lsof | sort -k 2 -n | awk ' { num [$1]++ } END { for (a in num) { print a,num[a] } }' | sort -k 2 -rn | head -n 20&lt;BR /&gt;cmf-agent 22642&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;netstat -anp | grep CLOSE_WAIT - output as below&lt;/P&gt;&lt;P&gt;tcp 1430 0 10.12.15.114:34176 10.12.15.114:7180 CLOSE_WAIT 16602/python2.6&lt;/P&gt;</description>
    <pubDate>Tue, 01 Nov 2016 15:55:43 GMT</pubDate>
    <dc:creator>ABaaya</dc:creator>
    <dc:date>2016-11-01T15:55:43Z</dc:date>
    <item>
      <title>Open File Descriptors warning in Cloudera Manager</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Open-File-Descriptors-warning-in-Cloudera-Manager/m-p/5087#M854</link>
      <description>&lt;P&gt;We are facing issues with Open File descriptors in Task tracker from cloudera manager and most of the Task tracker are in critical health.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Cloudera Manager: ver 4.7&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Please find below the status of one such Task Tracker:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;"Open file descriptors: 24,241. File descriptor limit: 32,768. Percentage in use: 73.98%. Critical threshold: 70.00%."&lt;/P&gt;&lt;P&gt;we have checked the File descriptor with the OS using the command "ulimit -a" and it shows "unlimited",&lt;/P&gt;&lt;P&gt;core file size (blocks, -c) 0&lt;BR /&gt;data seg size (kbytes, -d) unlimited&lt;BR /&gt;scheduling priority (-e) 0&lt;BR /&gt;file size (blocks, -f) unlimited&lt;BR /&gt;pending signals (-i) 118683&lt;BR /&gt;max locked memory (kbytes, -l) 64&lt;BR /&gt;max memory size (kbytes, -m) unlimited&lt;BR /&gt;open files (-n) 1024&lt;BR /&gt;pipe size (512 bytes, -p) 8&lt;BR /&gt;POSIX message queues (bytes, -q) 819200&lt;BR /&gt;real-time priority (-r) 0&lt;BR /&gt;stack size (kbytes, -s) 10240&lt;BR /&gt;cpu time (seconds, -t) unlimited&lt;BR /&gt;max user processes (-u) 1024&lt;BR /&gt;virtual memory (kbytes, -v) unlimited&lt;BR /&gt;file locks (-x) unlimited&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="line-height: 14px;"&gt;In Cloudera documentation, &amp;nbsp;we see the description for TaskTracker File Descriptors as:&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;"The health check thresholds of the number of file descriptors used. A failure of this health check may indicate a bug in either Hadoop or Cloudera Manager. Contact Cloudera support."&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;We found a similar issue in the issue list, but not sure whether they are talking about the same underlying problem:&lt;/P&gt;&lt;P&gt;&lt;A target="_blank" href="https://issues.cloudera.org/browse/DISTRO-530"&gt;https://issues.cloudera.org/browse/DISTRO-530&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Is this indeed a bug in the underlying framework or is there a way to increase the&amp;nbsp;&lt;SPAN style="line-height: 14px;"&gt;Open File Descriptors thorugh Cloudera Manager? Any other approach to recover from this issue is also appreciated.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="line-height: 14px;"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="line-height: 14px;"&gt;Thanks&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 16 Sep 2022 08:52:42 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Open-File-Descriptors-warning-in-Cloudera-Manager/m-p/5087#M854</guid>
      <dc:creator>Ashok</dc:creator>
      <dc:date>2022-09-16T08:52:42Z</dc:date>
    </item>
    <item>
      <title>Re: Open File Descriptors warning in Cloudera Manager</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Open-File-Descriptors-warning-in-Cloudera-Manager/m-p/5099#M855</link>
      <description>&lt;P&gt;Hi Ashok, to increase the file descriptors for the task tracker with a CM controlled cluster you can do the following. Keep in mind that the Alert is only a warning.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Verify your current file descriptor ulimit value to your task tracker processes.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;pid=$(ps faux | grep java | grep tasktracker | awk -F " " '{print $2}');cat /proc/$pid/limits;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;&lt;BR /&gt;Perform the following on each node in the cluster. A ulimit of 65536 probably would be enough for most use cases. I just used 262144 as an extreme example.&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;1. Edit /usr/sbin/cmf-agent and change the ulimit -n setting.&amp;nbsp;&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;2. Edit /usr/lib64/cmf/service/mapreduce/mapreduce.sh and add&amp;nbsp;&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;ulimit -n 262144&amp;nbsp;&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;at the top of the script.&amp;nbsp;&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;3. Push these script changes out to all the slave nodes&amp;nbsp;&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;4. A few nodes at a time in the same rack, shutdown datanode and tasktracker services on them.&amp;nbsp;&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;5. On these stopped nodes run a: service cloudera-scm-agent hard_restart&amp;nbsp;&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;6. Restart the datanode and tasktracker.&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;Verify your file descriptor ulimit value to your task tracker processes after making the change.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;pid=$(ps faux | grep java | grep tasktracker | awk -F " " '{print $2}');cat /proc/$pid/limits;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;-roland&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 21 Jan 2014 16:30:09 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Open-File-Descriptors-warning-in-Cloudera-Manager/m-p/5099#M855</guid>
      <dc:creator>Rolando</dc:creator>
      <dc:date>2014-01-21T16:30:09Z</dc:date>
    </item>
    <item>
      <title>Re: Open File Descriptors warning in Cloudera Manager</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Open-File-Descriptors-warning-in-Cloudera-Manager/m-p/5109#M856</link>
      <description>Tanq rteague.</description>
      <pubDate>Wed, 22 Jan 2014 11:45:32 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Open-File-Descriptors-warning-in-Cloudera-Manager/m-p/5109#M856</guid>
      <dc:creator>Ashok</dc:creator>
      <dc:date>2014-01-22T11:45:32Z</dc:date>
    </item>
    <item>
      <title>Re: Open File Descriptors warning in Cloudera Manager</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Open-File-Descriptors-warning-in-Cloudera-Manager/m-p/5471#M857</link>
      <description>&lt;P&gt;I am also experiencing large numbers of open connections on my datanodes.&lt;/P&gt;&lt;P&gt;using lsof I see ~23,000 open tcp connections to the datanode with status CLOSE_WAIT&lt;/P&gt;&lt;P&gt;Restarting the cluster brings the open file descriptors down to a sane number.&lt;/P&gt;&lt;P&gt;I believe this is being caused by&amp;nbsp;&lt;A target="_blank" href="https://issues.apache.org/jira/browse/HDFS-5671"&gt;https://issues.apache.org/jira/browse/HDFS-5671&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Is cloudera aware of this? Any timeline on when it will be patched into CDH?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Thu, 30 Jan 2014 21:21:10 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Open-File-Descriptors-warning-in-Cloudera-Manager/m-p/5471#M857</guid>
      <dc:creator>wpeterson328</dc:creator>
      <dc:date>2014-01-30T21:21:10Z</dc:date>
    </item>
    <item>
      <title>Re: Open File Descriptors warning in Cloudera Manager</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Open-File-Descriptors-warning-in-Cloudera-Manager/m-p/5607#M858</link>
      <description>&lt;P&gt;What process is responsible for the open sockets? &amp;nbsp;Is it a local JVM, like an HBase regionserver, or some remote network IP? &amp;nbsp;Can you paste a few example lines from the following output?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;sudo netstat -anp | grep CLOSE_WAIT&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Also, what version of CDH are you on? &amp;nbsp;There can be multiple causes for an issue like you're seeing and there have been bugs fixed for this in the past with workarounds too.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Finally, can you give us the "$JAVA_HOME/bin/java -version" output? there was a JVM bug in ConcurrentMarkSweep GC in versions below 1.6.0_31, the workaround for that is to add the following JVM property to your runtime settings for the various hadoop daemons:&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;PRE&gt;-XX:-CMSConcurrent&lt;SPAN&gt;MTEnabled&lt;/SPAN&gt;&lt;/PRE&gt;
&lt;P&gt;Newer versions of CM add that for you automatically.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 04 Feb 2014 15:31:28 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Open-File-Descriptors-warning-in-Cloudera-Manager/m-p/5607#M858</guid>
      <dc:creator>Clint</dc:creator>
      <dc:date>2014-02-04T15:31:28Z</dc:date>
    </item>
    <item>
      <title>Re: Open File Descriptors warning in Cloudera Manager</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Open-File-Descriptors-warning-in-Cloudera-Manager/m-p/46850#M859</link>
      <description>&lt;P&gt;Hi... I am facing the same issue on my dev cluster.&amp;nbsp;&lt;/P&gt;&lt;P&gt;It has the CM5.7 and CDH5.6.&lt;/P&gt;&lt;P&gt;Also it has Java 1.8.&lt;/P&gt;&lt;P&gt;The number of open files descriptor just keeps on increasing. After restart of machine it brings down to a saner number.&lt;/P&gt;&lt;P&gt;Note: The only diff between this cluster and other clusters is that it also has Kafka installed on it. Can it be a reason?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Any idea why this is happening??&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Ran below on dev nodes.&lt;/P&gt;&lt;P&gt;[root@hdp-dev01 ~]# /usr/sbin/lsof | sort -k 2 -n | awk ' { num [$1]++ } END { for (a in num) { print a,num[a] } }' | sort -k 2 -rn | head -n 20&lt;BR /&gt;cmf-agent 22642&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;netstat -anp | grep CLOSE_WAIT - output as below&lt;/P&gt;&lt;P&gt;tcp 1430 0 10.12.15.114:34176 10.12.15.114:7180 CLOSE_WAIT 16602/python2.6&lt;/P&gt;</description>
      <pubDate>Tue, 01 Nov 2016 15:55:43 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Open-File-Descriptors-warning-in-Cloudera-Manager/m-p/46850#M859</guid>
      <dc:creator>ABaaya</dc:creator>
      <dc:date>2016-11-01T15:55:43Z</dc:date>
    </item>
    <item>
      <title>Re: Open File Descriptors warning in Cloudera Manager</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Open-File-Descriptors-warning-in-Cloudera-Manager/m-p/81394#M860</link>
      <description>&lt;P&gt;Resurrecting this topic with some clarity on the issue and it's remedy.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;If RegionServers would be keeping dead connections to the Datanodes, the same symnptoms would be seen, many connection in CLOSE_WAIT, and file descriptor number increasing.&lt;/P&gt;&lt;P&gt;In extreme cases the limit could be reached, whioch would case the host node to fail with no more open file descriptors to use issue.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;There wasa bug in HBase prior to CDH5.13 which is described in this upstream JIRA in more detail[1]:&lt;/P&gt;&lt;P class="p1"&gt;HBASE-9393 Hbase does not closing a closed socket resulting in many CLOSE_WAIT]&lt;/P&gt;&lt;P class="p1"&gt;&amp;nbsp;&lt;/P&gt;&lt;P class="p1"&gt;This issue was patched in the following CDH releases:&lt;/P&gt;&lt;P class="p1"&gt;CDH5.13.0, CDH5.13.1, CDH5.13.2, CDH5.13.3,&lt;/P&gt;&lt;P class="p1"&gt;CDH5.14.0,&lt;SPAN class="Apple-converted-space"&gt;&amp;nbsp; &lt;/SPAN&gt;CDH5.14.2, CDH5.14.4,&lt;/P&gt;&lt;P class="p1"&gt;CDH5.15.0,&lt;SPAN class="Apple-converted-space"&gt;&amp;nbsp; &lt;/SPAN&gt;CDH5.15.1,&lt;/P&gt;&lt;P class="p1"&gt;CDH6.0.0.&lt;/P&gt;&lt;P class="p1"&gt;&amp;nbsp;&lt;/P&gt;&lt;P class="p1"&gt;&lt;SPAN class="s1"&gt;[1] - upstream HBase JIRA - &lt;A href="https://issues.apache.org/jira/browse/HBASE-9393?attachmentOrder=asc" target="_blank"&gt;&lt;SPAN class="s2"&gt;https://issues.apache.org/jira/browse/HBASE-9393?attachmentOrder=asc&lt;/SPAN&gt;&lt;/A&gt;&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 23 Oct 2018 09:09:41 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Open-File-Descriptors-warning-in-Cloudera-Manager/m-p/81394#M860</guid>
      <dc:creator>Andoroid</dc:creator>
      <dc:date>2018-10-23T09:09:41Z</dc:date>
    </item>
  </channel>
</rss>

