<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Yarn Aggregate Log Retention Setting in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/Yarn-Aggregate-Log-Retention-Setting/m-p/49222#M34574</link>
    <description>&lt;P&gt;Thanks for mentioning the information about the hadoop group and permissions. It seems that after applying these settings, everything is working.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Cheers,&lt;/P&gt;&lt;P&gt;Ben&lt;/P&gt;</description>
    <pubDate>Mon, 09 Jan 2017 19:25:33 GMT</pubDate>
    <dc:creator>benassi</dc:creator>
    <dc:date>2017-01-09T19:25:33Z</dc:date>
    <item>
      <title>Yarn Aggregate Log Retention Setting</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Yarn-Aggregate-Log-Retention-Setting/m-p/49112#M34568</link>
      <description>&lt;P&gt;We have CDH 5.7.2 installed alongside Cloudera Manager 5.8.1 at our company. We have enabled YARN log aggregation and set the YARN log aggregation retain seconds to 1 day. For some reason, the YARN job logs in the default HDFS directory /tmp/logs/ are not being deleted. Can anyone explain why this is?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;BTW, we have both Hive and Spark jobs running on our cluster.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks,&lt;/P&gt;&lt;P&gt;Ben&lt;/P&gt;</description>
      <pubDate>Fri, 16 Sep 2022 10:53:06 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Yarn-Aggregate-Log-Retention-Setting/m-p/49112#M34568</guid>
      <dc:creator>benassi</dc:creator>
      <dc:date>2022-09-16T10:53:06Z</dc:date>
    </item>
    <item>
      <title>Re: Yarn Aggregate Log Retention Setting</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Yarn-Aggregate-Log-Retention-Setting/m-p/49116#M34569</link>
      <description>Are /tmp/logs and all subdirs set to 770 with the hadoop group?&lt;BR /&gt;&lt;BR /&gt;Have you checked for actual log files? The log directories themselves are not removed, so it may appear that the logs are lingering.&lt;BR /&gt;&lt;BR /&gt;Use hdfs dfs -du -s -h /tmp/logs/ to see whether the size decreases over time or only increases.</description>
      <pubDate>Thu, 05 Jan 2017 21:14:42 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Yarn-Aggregate-Log-Retention-Setting/m-p/49116#M34569</guid>
      <dc:creator>mbigelow</dc:creator>
      <dc:date>2017-01-05T21:14:42Z</dc:date>
    </item>
    <item>
      <title>Re: Yarn Aggregate Log Retention Setting</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Yarn-Aggregate-Log-Retention-Setting/m-p/49124#M34570</link>
      <description>&lt;P&gt;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/14495"&gt;@benassi&lt;/a&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;As we know, "Yarn Aggregate Log Retention" controls only YARN logs, but /tmp/logs is not limited to YARN.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;So can you check the YARN log date using the steps below?&lt;BR /&gt;CM -&amp;gt; Yarn -&amp;gt; Web UI -&amp;gt; Resource Manager web UI (this opens the port 8088 link) -&amp;gt; click the Finished link (left side) -&amp;gt; scroll down and click the 'Last' button -&amp;gt; check the log date. You should see only one day of history, since you configured retention to 1 day.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Note: Make sure CM -&amp;gt; Yarn -&amp;gt; Configuration -&amp;gt; Enable Log Aggregation = Enabled&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks&lt;/P&gt;&lt;P&gt;Kumar&lt;/P&gt;</description>
      <pubDate>Fri, 06 Jan 2017 01:22:55 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Yarn-Aggregate-Log-Retention-Setting/m-p/49124#M34570</guid>
      <dc:creator>saranvisa</dc:creator>
      <dc:date>2017-01-06T01:22:55Z</dc:date>
    </item>
    <item>
      <title>Re: Yarn Aggregate Log Retention Setting</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Yarn-Aggregate-Log-Retention-Setting/m-p/49128#M34571</link>
      <description>&lt;P&gt;To answer your questions:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;The /tmp/logs directory and all subdirs are 770 and the group is hdfs. Should the group be hadoop instead? I see that the yarn user is not part of the hdfs group but is in the hadoop group.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;The logs date back to Dec 18 and grow by less than&amp;nbsp;1TB per day. We manually delete the logs to prevent them from getting too big.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks,&lt;/P&gt;&lt;P&gt;Ben&lt;/P&gt;</description>
      <pubDate>Fri, 06 Jan 2017 04:34:07 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Yarn-Aggregate-Log-Retention-Setting/m-p/49128#M34571</guid>
      <dc:creator>benassi</dc:creator>
      <dc:date>2017-01-06T04:34:07Z</dc:date>
    </item>
    <item>
      <title>Re: Yarn Aggregate Log Retention Setting</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Yarn-Aggregate-Log-Retention-Setting/m-p/49129#M34572</link>
      <description>&lt;P&gt;I did as you asked and see that the oldest finished application is from Dec 18, and I see the logs in HDFS under /tmp/logs.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Log Aggregation is enabled.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks,&lt;/P&gt;&lt;P&gt;Ben&lt;/P&gt;</description>
      <pubDate>Fri, 06 Jan 2017 04:35:57 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Yarn-Aggregate-Log-Retention-Setting/m-p/49129#M34572</guid>
      <dc:creator>benassi</dc:creator>
      <dc:date>2017-01-06T04:35:57Z</dc:date>
    </item>
    <item>
      <title>Re: Yarn Aggregate Log Retention Setting</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Yarn-Aggregate-Log-Retention-Setting/m-p/49197#M34573</link>
      <description>&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/14495"&gt;@benassi&lt;/a&gt; check who belongs to the hadoop group. It should contain hdfs, mapred, and yarn. The yarn account, since that is what the RM, NM, and JH run as, will need read/write access to be able to remove any old logs.</description>
      <pubDate>Mon, 09 Jan 2017 08:16:42 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Yarn-Aggregate-Log-Retention-Setting/m-p/49197#M34573</guid>
      <dc:creator>mbigelow</dc:creator>
      <dc:date>2017-01-09T08:16:42Z</dc:date>
    </item>
    <item>
      <title>Re: Yarn Aggregate Log Retention Setting</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Yarn-Aggregate-Log-Retention-Setting/m-p/49222#M34574</link>
      <description>&lt;P&gt;Thanks for mentioning the information about the hadoop group and permissions. It seems that after applying these settings, everything is working.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Cheers,&lt;/P&gt;&lt;P&gt;Ben&lt;/P&gt;</description>
      <pubDate>Mon, 09 Jan 2017 19:25:33 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Yarn-Aggregate-Log-Retention-Setting/m-p/49222#M34574</guid>
      <dc:creator>benassi</dc:creator>
      <dc:date>2017-01-09T19:25:33Z</dc:date>
    </item>
    <item>
      <title>Re: Yarn Aggregate Log Retention Setting</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Yarn-Aggregate-Log-Retention-Setting/m-p/81341#M34575</link>
      <description>&lt;P&gt;Hi, my cluster is CDH 5.7.2 with CM 5.7.0, and I have the same trouble.&lt;/P&gt;&lt;P&gt;We set &lt;SPAN&gt;dfs.permissions.superusergroup=supergroup, and we run the MapReduce application as the 'hdfs' user. The HDFS directory looks like this:&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; drwxrwx---&amp;nbsp; &amp;nbsp;- hdfs&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;supergroup&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 0 2018-06-05 15:01 /tmp/logs/hdfs&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;The Linux user-to-group mapping is:&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;&amp;nbsp;hadoop:x:497:hdfs,mapred,yarn&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;&amp;nbsp;supergroup:x:505:hdfs,yarn&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;What should I do to resolve this problem? Thank you very much.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 22 Oct 2018 08:06:15 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Yarn-Aggregate-Log-Retention-Setting/m-p/81341#M34575</guid>
      <dc:creator>zbz</dc:creator>
      <dc:date>2018-10-22T08:06:15Z</dc:date>
    </item>
    <item>
      <title>Re: Yarn Aggregate Log Retention Setting</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Yarn-Aggregate-Log-Retention-Setting/m-p/81381#M34576</link>
      <description>&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/29747"&gt;@zbz&lt;/a&gt;,&lt;BR /&gt;&lt;BR /&gt;The group ownership of all directories under /tmp/logs must be 'hadoop' or any group that is common to the 'yarn' and 'mapred' IDs. In your case you have it as supergroup, which does not have 'mapred' as a member and is also entirely the wrong group to use: you do not want to grant HDFS superuser access to the YARN service. I'd recommend removing 'yarn' from the 'supergroup' group.&lt;BR /&gt;&lt;BR /&gt;This is what a normal installation should look like:&lt;BR /&gt;&lt;BR /&gt;# id -Gn mapred&lt;BR /&gt;mapred hadoop&lt;BR /&gt;&lt;BR /&gt;# id -Gn yarn&lt;BR /&gt;yarn hadoop&lt;BR /&gt;&lt;BR /&gt;# hadoop fs -ls -d /tmp/logs&lt;BR /&gt;drwxrwxrwt - mapred hadoop 0 2017-08-30 22:36 /tmp/logs&lt;BR /&gt;&lt;BR /&gt;So if the 'hadoop' group is shared by your two IDs (mapred and yarn), then you may execute the command below (as an HDFS superuser) to resolve the issue permanently:&lt;BR /&gt;&lt;BR /&gt;hadoop fs -chgrp -R hadoop /tmp/logs&lt;BR /&gt;</description>
      <pubDate>Tue, 23 Oct 2018 01:23:12 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Yarn-Aggregate-Log-Retention-Setting/m-p/81381#M34576</guid>
      <dc:creator>Harsh J</dc:creator>
      <dc:date>2018-10-23T01:23:12Z</dc:date>
    </item>
    <item>
      <title>Re: Yarn Aggregate Log Retention Setting</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Yarn-Aggregate-Log-Retention-Setting/m-p/81382#M34577</link>
      <description>&lt;P&gt;Thank you so much!&lt;/P&gt;&lt;P&gt;I changed the group of '/tmp/logs' to hadoop and restarted the JobHistoryServer role, and everything is OK now.&lt;/P&gt;&lt;P&gt;So happy!&lt;/P&gt;</description>
      <pubDate>Tue, 23 Oct 2018 01:43:02 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Yarn-Aggregate-Log-Retention-Setting/m-p/81382#M34577</guid>
      <dc:creator>zbz</dc:creator>
      <dc:date>2018-10-23T01:43:02Z</dc:date>
    </item>
  </channel>
</rss>

