<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question What triggers du in Impala? in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/What-triggers-du-in-Impala/m-p/40020#M25826</link>
    <description>&lt;P&gt;Hi there,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;We are doing some testing on a very small cluster and we were experiencing some extra load by du command. It is affecting our testing results significantly and we are bypassing it by creating a symbolic link of du to a df command.&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Our testing steps:&lt;/P&gt;&lt;P&gt;1. on all nodes:&amp;nbsp;echo 1 &amp;gt; /proc/sys/vm/drop_caches&lt;/P&gt;&lt;P&gt;2. run scripts&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Can anyone has a detailed&amp;nbsp;explaination of how the du command gets triggered by impala( we assume it's something related with vfs caching). Is there a config or a better way to make it not doing du?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks a lot! Let me know if you need more information. &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Fri, 16 Sep 2022 10:15:13 GMT</pubDate>
    <dc:creator>ZL</dc:creator>
    <dc:date>2022-09-16T10:15:13Z</dc:date>
    <item>
      <title>What triggers du in Impala?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/What-triggers-du-in-Impala/m-p/40020#M25826</link>
      <description>&lt;P&gt;Hi there,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;We are doing some testing on a very small cluster and we were experiencing some extra load by du command. It is affecting our testing results significantly and we are bypassing it by creating a symbolic link of du to a df command.&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Our testing steps:&lt;/P&gt;&lt;P&gt;1. on all nodes:&amp;nbsp;echo 1 &amp;gt; /proc/sys/vm/drop_caches&lt;/P&gt;&lt;P&gt;2. run scripts&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Can anyone has a detailed&amp;nbsp;explaination of how the du command gets triggered by impala( we assume it's something related with vfs caching). Is there a config or a better way to make it not doing du?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks a lot! Let me know if you need more information. &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 16 Sep 2022 10:15:13 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/What-triggers-du-in-Impala/m-p/40020#M25826</guid>
      <dc:creator>ZL</dc:creator>
      <dc:date>2022-09-16T10:15:13Z</dc:date>
    </item>
    <item>
      <title>Re: What triggers du in Impala?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/What-triggers-du-in-Impala/m-p/40021#M25827</link>
      <description>Also we didn't have fs.du.interval setting in our config so by default it should be 600000 ms but we are seeing it much more often than that.</description>
      <pubDate>Thu, 21 Apr 2016 23:10:09 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/What-triggers-du-in-Impala/m-p/40021#M25827</guid>
      <dc:creator>ZL</dc:creator>
      <dc:date>2016-04-21T23:10:09Z</dc:date>
    </item>
    <item>
      <title>Re: What triggers du in Impala?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/What-triggers-du-in-Impala/m-p/40061#M25828</link>
      <description>&lt;P&gt;Are you sure it's Impala that's triggering it? I don't think Impala would use du for anything.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;HDFS apparently does and Cloudera Manager might use it.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Have you tried tracing back what is running 'du'? E.g. run "ps auxf" to get a tree-view of processes.&lt;/P&gt;</description>
      <pubDate>Fri, 22 Apr 2016 16:21:43 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/What-triggers-du-in-Impala/m-p/40061#M25828</guid>
      <dc:creator>Tim Armstrong</dc:creator>
      <dc:date>2016-04-22T16:21:43Z</dc:date>
    </item>
    <item>
      <title>Re: What triggers du in Impala?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/What-triggers-du-in-Impala/m-p/40062#M25829</link>
      <description>Actually it is datanode doing it. I guess I'll ask more about it as an HDFS topic. Thanks!</description>
      <pubDate>Fri, 22 Apr 2016 16:40:55 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/What-triggers-du-in-Impala/m-p/40062#M25829</guid>
      <dc:creator>ZL</dc:creator>
      <dc:date>2016-04-22T16:40:55Z</dc:date>
    </item>
  </channel>
</rss>

