<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Increase open file limit of the user to scale for large data processing.  ulimit and nofile in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/Increase-open-file-limit-of-the-user-to-scale-for-large-data/m-p/339717#M233160</link>
    <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/45756"&gt;@JLo_Hernandez&lt;/a&gt;&amp;nbsp;I am having same question as of you. If you got the answer please let me know.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks&lt;/P&gt;</description>
    <pubDate>Fri, 25 Mar 2022 13:20:46 GMT</pubDate>
    <dc:creator>Boss</dc:creator>
    <dc:date>2022-03-25T13:20:46Z</dc:date>
    <item>
      <title>Increase open file limit of the user to scale for large data processing.  ulimit and nofile</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Increase-open-file-limit-of-the-user-to-scale-for-large-data/m-p/206502#M168464</link>
      <description>&lt;P&gt;Increase open file limit of the user to scale for large data processing  :  hive, hbase, hdfs, oozie, yarn, mapred, Zookeeper, Spark, HCat&lt;/P&gt;</description>
      <pubDate>Fri, 27 Jul 2018 15:32:45 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Increase-open-file-limit-of-the-user-to-scale-for-large-data/m-p/206502#M168464</guid>
      <dc:creator>rambabuch</dc:creator>
      <dc:date>2018-07-27T15:32:45Z</dc:date>
    </item>
    <item>
      <title>Re: Increase open file limit of the user to scale for large data processing.  ulimit and nofile</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Increase-open-file-limit-of-the-user-to-scale-for-large-data/m-p/206503#M168465</link>
      <description>&lt;P&gt;&lt;STRONG&gt;Here is the solution... &lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;1. Services - Hive, HBase, HDFS, Oozie, YARN, MapReduce, Ambari Metrics&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;These Services we can directly change the file limit from Ambari UI.&lt;/P&gt;&lt;PRE&gt;Ambari UI &amp;gt; ServiceConfigs&amp;gt; &amp;lt;username of the service&amp;gt;_user_nofile_limit
Example: 1. Ambari UI -&amp;gt; HIVE -&amp;gt; Configs -&amp;gt; Advanced -&amp;gt; Advanced hive-env -&amp;gt; hive_user_nofile_limit  64000
         2. Ambari UI &amp;gt; Ambari Metrics &amp;gt; configs &amp;gt; Advanced ams-hbase-env &amp;gt; max_open_files_limit  64000
         3. Ambari UI &amp;gt; Yarn &amp;gt; configs &amp;gt; Advanced yarn-env &amp;gt; yarn_user_nofile_limit  64000
         4. Ambari UI &amp;gt; MAPREDUCE2 &amp;gt; configs &amp;gt; Advanced mapred-env &amp;gt; mapred_user_nofile_limit  64000&lt;/PRE&gt;&lt;P&gt;&lt;STRONG&gt;&lt;BR /&gt;&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;2. Services -  Zookeeper, Spark, WebHCat, Ranger .  Users - zookeeper, Spark, hcat, ranger&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;For users &lt;STRONG style="background-color: initial;"&gt;spark, hcat, zookeeper, ranger&lt;/STRONG&gt;. Add the below lines for their respective nodes in /etc/security/limits.conf&lt;/P&gt;&lt;P&gt;/etc/security/limits.conf file should have below entries.&lt;/P&gt;&lt;PRE&gt;zookeeper  -    nofile    64000 
spark      -    nofile    64000
hcat       -    nofile    64000
ranger     -    nofile    64000&lt;/PRE&gt;&lt;P&gt;After save the changes. Login as spark/hcat/zookeeper user and execute &lt;STRONG&gt;ulimit -a&lt;/STRONG&gt; command.&lt;/P&gt;&lt;P&gt;check the output. The output should contain value as &lt;STRONG&gt;open files (-n) 64000&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;Please find the below&lt;STRONG&gt; ulimit -a &lt;/STRONG&gt;output .&lt;/P&gt;&lt;PRE&gt;[spark@node01]$ ulimit -a 
core file size          (blocks, -c) 0
data seg size           (kbytes, -d) unlimited
scheduling priority             (-e) 0
file size               (blocks, -f) unlimited
pending signals                 (-i) 513179
max locked memory       (kbytes, -l) 64
max memory size         (kbytes, -m) unlimited
open files                      (-n) 64000
pipe size            (512 bytes, -p) 8
POSIX message queues     (bytes, -q) 819200
real-time priority              (-r) 0
stack size              (kbytes, -s) 8192
cpu time               (seconds, -t) unlimited
max user processes              (-u) 64000
virtual memory          (kbytes, -v) unlimited
file locks                      (-x) unlimited&lt;/PRE&gt;&lt;P&gt;If you still see ulimit -a values not updated. Then please add the below lines to file  /etc/pam.d/su . &lt;/P&gt;&lt;PRE&gt;vim /etc/pam.d/su
session         required        pam_limits.so&lt;/PRE&gt;&lt;P&gt;Repeat the above process... It will be successful.&lt;/P&gt;</description>
      <pubDate>Fri, 27 Jul 2018 15:45:45 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Increase-open-file-limit-of-the-user-to-scale-for-large-data/m-p/206503#M168465</guid>
      <dc:creator>rambabuch</dc:creator>
      <dc:date>2018-07-27T15:45:45Z</dc:date>
    </item>
    <item>
      <title>Re: Increase open file limit of the user to scale for large data processing.  ulimit and nofile</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Increase-open-file-limit-of-the-user-to-scale-for-large-data/m-p/206504#M168466</link>
      <description>&lt;P&gt;Is there any sort of formula or how did you came up with this value for users's processes? is it a random value? what can I check within my cluster in order to get a proper value for me?&lt;/P&gt;</description>
      <pubDate>Wed, 07 Aug 2019 22:54:43 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Increase-open-file-limit-of-the-user-to-scale-for-large-data/m-p/206504#M168466</guid>
      <dc:creator>JLo_Hernandez</dc:creator>
      <dc:date>2019-08-07T22:54:43Z</dc:date>
    </item>
    <item>
      <title>Re: Increase open file limit of the user to scale for large data processing.  ulimit and nofile</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Increase-open-file-limit-of-the-user-to-scale-for-large-data/m-p/339717#M233160</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/45756"&gt;@JLo_Hernandez&lt;/a&gt;&amp;nbsp;I am having same question as of you. If you got the answer please let me know.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks&lt;/P&gt;</description>
      <pubDate>Fri, 25 Mar 2022 13:20:46 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Increase-open-file-limit-of-the-user-to-scale-for-large-data/m-p/339717#M233160</guid>
      <dc:creator>Boss</dc:creator>
      <dc:date>2022-03-25T13:20:46Z</dc:date>
    </item>
  </channel>
</rss>

