<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: File Descriptor usage in Datanode climbing steadily in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/File-Descriptor-usage-in-Datanode-climbing-steadily/m-p/51063#M54570</link>
    <description>Is the number of datanodes the same? Is the block size the same? How many blocks are on each cluster?&lt;BR /&gt;&lt;BR /&gt;The *.meta files are metadata files for the blocks. This may have been a change compared to Hadoop 1; I am not sure.&lt;BR /&gt;&lt;BR /&gt;It is a bit weird for it to never go down though. I have a cluster with millions of blocks and hundreds of TBs and I'll get spikes, but the open FDs average around 2k per node.&lt;BR /&gt;&lt;BR /&gt;It does depend on how much work the DNs are under as well.&lt;BR /&gt;&lt;BR /&gt;Can you increase the FD limits?</description>
    <pubDate>Fri, 17 Feb 2017 08:27:02 GMT</pubDate>
    <dc:creator>mbigelow</dc:creator>
    <dc:date>2017-02-17T08:27:02Z</dc:date>
    <item>
      <title>File Descriptor usage in Datanode climbing steadily</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/File-Descriptor-usage-in-Datanode-climbing-steadily/m-p/50973#M54569</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;We just started using Cloudera Manager Express 5.9 (the same version for Namenodes and Datanodes) for our HDFS cluster. When our internal client posts logs to HDFS, we see file descriptor usage on the Datanodes climb continually until it reaches the Warning level of 50% and ultimately crosses the Critical threshold of 70% (the default limits configured in the health tests). The only way to bring the usage down is to restart the Datanode service on each of the Datanodes. This is really disruptive to our usage of HDFS.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;In the past we were using Cloudera Standard 4.6.2, and with the same setup we never saw file descriptor usage this high.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I checked the number of configured file descriptors in both 5.9 and 4.6.2, and it is the same 32k value.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Investigation report:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I used ps -ef --cols 9999 | grep hdfs to find the hdfs pid, then /usr/sbin/lsof -p [pid] | wc -l to count the open files. 
Here are the counts:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Open files on Datanode1, Datanode2, Datanode3:&lt;/P&gt;&lt;P&gt;Before send: 16490, 16490, 16486&lt;/P&gt;&lt;P&gt;After send: 16580, 16580, 16576&lt;/P&gt;&lt;P&gt;In all three data nodes, there are many open files like this:&lt;/P&gt;&lt;P&gt;java&amp;nbsp;&amp;nbsp;&amp;nbsp; 23757 hdfs&amp;nbsp; 593r&amp;nbsp;&amp;nbsp; REG&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 202,81&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 59&amp;nbsp;&amp;nbsp; 29658600 /opt/dfs/dn/current/BP-832824084-10.189.101.91-1484719231606/current/finalized/subdir0/subdir148/blk_1073779777_70965.meta&lt;/P&gt;&lt;P&gt;java&amp;nbsp;&amp;nbsp;&amp;nbsp; 23757 hdfs&amp;nbsp; 594r&amp;nbsp;&amp;nbsp; REG&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 202,81&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 519&amp;nbsp;&amp;nbsp; 29658617 /opt/dfs/dn/current/BP-832824084-10.189.101.91-1484719231606/current/finalized/subdir0/subdir148/blk_1073779790_71007.meta&lt;/P&gt;&lt;P&gt;java&amp;nbsp;&amp;nbsp;&amp;nbsp; 23757 hdfs&amp;nbsp; 595w&amp;nbsp;&amp;nbsp; REG&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 202,81&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 119&amp;nbsp;&amp;nbsp; 29658629 /opt/dfs/dn/current/BP-832824084-10.189.101.91-1484719231606/current/finalized/subdir0/subdir148/blk_1073779801_71047.meta&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;The situation was the same a couple of hours later: the open file descriptors had not decreased.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Has anyone else seen the same problem and found a solution? 
We will be really grateful for your support.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Please let me know if you have any questions.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks,&lt;/P&gt;&lt;P&gt;Raj&lt;/P&gt;</description>
      <pubDate>Wed, 15 Feb 2017 19:50:51 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/File-Descriptor-usage-in-Datanode-climbing-steadily/m-p/50973#M54569</guid>
      <dc:creator>rajpalparyani</dc:creator>
      <dc:date>2017-02-15T19:50:51Z</dc:date>
    </item>
    <item>
      <title>Re: File Descriptor usage in Datanode climbing steadily</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/File-Descriptor-usage-in-Datanode-climbing-steadily/m-p/51063#M54570</link>
      <description>Is the number of datanodes the same? Is the block size the same? How many blocks are on each cluster?&lt;BR /&gt;&lt;BR /&gt;The *.meta files are metadata files for the blocks. This may have been a change compared to Hadoop 1; I am not sure.&lt;BR /&gt;&lt;BR /&gt;It is a bit weird for it to never go down though. I have a cluster with millions of blocks and hundreds of TBs and I'll get spikes, but the open FDs average around 2k per node.&lt;BR /&gt;&lt;BR /&gt;It does depend on how much work the DNs are under as well.&lt;BR /&gt;&lt;BR /&gt;Can you increase the FD limits?</description>
      <pubDate>Fri, 17 Feb 2017 08:27:02 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/File-Descriptor-usage-in-Datanode-climbing-steadily/m-p/51063#M54570</guid>
      <dc:creator>mbigelow</dc:creator>
      <dc:date>2017-02-17T08:27:02Z</dc:date>
    </item>
    <item>
      <title>Re: File Descriptor usage in Datanode climbing steadily</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/File-Descriptor-usage-in-Datanode-climbing-steadily/m-p/51481#M54571</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I haven't seen the file descriptors rising since I opened this ticket. Feel free to close it.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks for the suggestions though &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks, Raj&lt;/P&gt;</description>
      <pubDate>Sun, 26 Feb 2017 02:12:26 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/File-Descriptor-usage-in-Datanode-climbing-steadily/m-p/51481#M54571</guid>
      <dc:creator>rajpalparyani</dc:creator>
      <dc:date>2017-02-26T02:12:26Z</dc:date>
    </item>
    <item>
      <title>Re: File Descriptor usage in Datanode climbing steadily</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/File-Descriptor-usage-in-Datanode-climbing-steadily/m-p/52359#M54572</link>
      <description>&lt;P&gt;This is happening again.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;We now have 4 large machines handling 1/6th of the load. We similarly have 4 data nodes on version 4.6.2 and did not see file descriptors climbing there, so this seems to have something to do with version 5.9 itself.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Can someone please confirm the next course of action in this case?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Looking forward to your response.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks,&lt;/P&gt;&lt;P&gt;Raj&lt;/P&gt;</description>
      <pubDate>Sat, 18 Mar 2017 14:41:13 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/File-Descriptor-usage-in-Datanode-climbing-steadily/m-p/52359#M54572</guid>
      <dc:creator>rajpalparyani</dc:creator>
      <dc:date>2017-03-18T14:41:13Z</dc:date>
    </item>
    <item>
      <title>Re: File Descriptor usage in Datanode climbing steadily</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/File-Descriptor-usage-in-Datanode-climbing-steadily/m-p/52401#M54573</link>
      <description>&lt;P&gt;We have the same issue.&lt;BR /&gt;We upgraded from 2.6.0 CDH 5.7.6 to 2.6.0 CDH 5.9.1.&lt;BR /&gt;Since then, our data nodes have been leaking open file descriptors to block .meta files.&lt;BR /&gt;We didn't have any issues before the upgrade.&lt;BR /&gt;The attached screenshot shows the change in behavior after the upgrade for one of our data nodes.&lt;BR /&gt;The drops occur when we restart the HDFS service.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="sc.png" style="width: 600px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/2771iE105412C981C515F/image-size/large?v=v2&amp;amp;px=999" role="button" title="sc.png" alt="sc.png" /&gt;&lt;/span&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 20 Mar 2017 22:20:52 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/File-Descriptor-usage-in-Datanode-climbing-steadily/m-p/52401#M54573</guid>
      <dc:creator>nymous</dc:creator>
      <dc:date>2017-03-20T22:20:52Z</dc:date>
    </item>
    <item>
      <title>Re: File Descriptor usage in Datanode climbing steadily</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/File-Descriptor-usage-in-Datanode-climbing-steadily/m-p/52440#M54574</link>
      <description>&lt;P&gt;Downgrading from &lt;SPAN&gt;2.6.0-cdh5.9.1&lt;/SPAN&gt; back to &lt;SPAN&gt;2.6.0-cdh5.8.4 looks to have fixed the problem.&lt;BR /&gt;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Our HDFS is back to being usable and stable.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 21 Mar 2017 12:53:32 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/File-Descriptor-usage-in-Datanode-climbing-steadily/m-p/52440#M54574</guid>
      <dc:creator>nymous</dc:creator>
      <dc:date>2017-03-21T12:53:32Z</dc:date>
    </item>
    <item>
      <title>Re: File Descriptor usage in Datanode climbing steadily</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/File-Descriptor-usage-in-Datanode-climbing-steadily/m-p/52460#M54575</link>
      <description>&lt;P&gt;Hi nymous,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;How did you downgrade from 5.9 to 5.8.4? Can you please tell me if there is a link to the documentation?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Looking forward to your response.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks, Raj&lt;/P&gt;</description>
      <pubDate>Tue, 21 Mar 2017 18:31:56 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/File-Descriptor-usage-in-Datanode-climbing-steadily/m-p/52460#M54575</guid>
      <dc:creator>rajpalparyani</dc:creator>
      <dc:date>2017-03-21T18:31:56Z</dc:date>
    </item>
    <item>
      <title>Re: File Descriptor usage in Datanode climbing steadily</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/File-Descriptor-usage-in-Datanode-climbing-steadily/m-p/52511#M54576</link>
      <description>&lt;P&gt;We are only running HDFS, so that was the only thing we needed to change.&lt;/P&gt;&lt;P&gt;Since it was a dev environment, we shut all of HDFS down, downloaded&lt;/P&gt;&lt;P&gt;&lt;A href="http://archive.cloudera.com/cdh5/cdh/5/hadoop-2.6.0-cdh5.8.4.tar.gz" target="_blank"&gt;hadoop-2.6.0-cdh5.8.4.tar.gz&lt;/A&gt; from &lt;A href="http://archive.cloudera.com/cdh5/cdh/5/" target="_blank"&gt;http://archive.cloudera.com/cdh5/cdh/5/&lt;/A&gt;&lt;/P&gt;&lt;P&gt;and ran with that.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;(We are actually running HDFS on Mesos, so the artifacts get packaged up into an uberjar with the Mesos executor, but there's no real magic there.&amp;nbsp; I think it just uses the stuff in hadoop/common and hadoop/hdfs and some of the run scripts.)&lt;/P&gt;</description>
      <pubDate>Wed, 22 Mar 2017 14:57:31 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/File-Descriptor-usage-in-Datanode-climbing-steadily/m-p/52511#M54576</guid>
      <dc:creator>nymous</dc:creator>
      <dc:date>2017-03-22T14:57:31Z</dc:date>
    </item>
  </channel>
</rss>

