<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Oozie not cleaning up old jobs from Oozie database in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Oozie-not-cleaning-up-old-jobs-from-Oozie-database/m-p/30695#M6938</link>
    <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;By default, Oozie will not purge child jobs if the parent is not eligible to be purged. In your case, because the Coordinator job is still running, none of the child Workflow jobs will be purged.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Which version of CDH are you using? Starting with CDH 5.2.0, you can change it so that Oozie will delete the child jobs even if the parent job is still running. To do that, you can set &lt;FONT face="courier new,courier"&gt;oozie.service.PurgeService.purge.old.coord.action=true&lt;/FONT&gt; in oozie-site.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Also, starting with CM 5.4, the Oozie Configuration page has controls for these configs, so you don't need the safety-valve anymore here.&lt;/P&gt;</description>
    <pubDate>Tue, 11 Aug 2015 17:44:39 GMT</pubDate>
    <dc:creator>Robert K</dc:creator>
    <dc:date>2015-08-11T17:44:39Z</dc:date>
    <item>
      <title>Oozie not cleaning up old jobs from Oozie database</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Oozie-not-cleaning-up-old-jobs-from-Oozie-database/m-p/30692#M6937</link>
      <description>&lt;P&gt;&lt;SPAN&gt;Hi, &lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;I have the following properties set in my oozie-site.xml ( Using &lt;/SPAN&gt;&lt;SPAN&gt;&lt;STRONG&gt;safety-valve&lt;/STRONG&gt; in Cloudera Manager ) &lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;oozie.services.ext - org.apache.oozie.service.PurgeService&lt;/P&gt;&lt;P&gt;oozie.service.PurgeService.older.than - 15&lt;/P&gt;&lt;P&gt;oozie.service.PurgeService.coord.older.than - 7&lt;/P&gt;&lt;P&gt;oozie.service.PurgeService.bundle.older.than - 7&lt;/P&gt;&lt;P&gt;oozie.service.PurgeService.purge.interval - 60&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;However, I still see some old jobs which are KILLED or completed as old &lt;/SPAN&gt;&lt;SPAN&gt;as September 2014 &lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;To give an example, &lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;I have a Coordinator which is currently in RUNNING state. When I use the&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN&gt;Oozie Web Console &lt;/SPAN&gt;&lt;SPAN&gt;to list the instances of that Co-ordinator i.e. Click on Co-ordinator tab &lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;and click on my &lt;/SPAN&gt;&lt;SPAN&gt;co-ordinator and in that pop up I can see the oldest job of all &lt;/SPAN&gt;&lt;SPAN&gt;materialised workflow jobs &lt;/SPAN&gt;&lt;SPAN&gt;(co-ordinator actions) is of September 2014. I assume the property &lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;responsible for cleaning &lt;/SPAN&gt;&lt;SPAN&gt;this up is &lt;STRONG&gt;oozie.service.PurgeService.&lt;/STRONG&gt;&lt;/SPAN&gt;&lt;SPAN&gt;&lt;STRONG&gt;older.than&lt;/STRONG&gt; which I have set to &lt;STRONG&gt;15 &lt;/STRONG&gt;&lt;/SPAN&gt;&lt;SPAN&gt;days. So what am I &lt;/SPAN&gt;&lt;SPAN&gt;missing here? &lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 16 Sep 2022 09:37:31 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Oozie-not-cleaning-up-old-jobs-from-Oozie-database/m-p/30692#M6937</guid>
      <dc:creator>HadoopUserAffinity</dc:creator>
      <dc:date>2022-09-16T09:37:31Z</dc:date>
    </item>
    <item>
      <title>Re: Oozie not cleaning up old jobs from Oozie database</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Oozie-not-cleaning-up-old-jobs-from-Oozie-database/m-p/30695#M6938</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;By default, Oozie will not purge child jobs if the parent is not eligible to be purged. In your case, because the Coordinator job is still running, none of the child Workflow jobs will be purged.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Which version of CDH are you using? Starting with CDH 5.2.0, you can change it so that Oozie will delete the child jobs even if the parent job is still running. To do that, you can set &lt;FONT face="courier new,courier"&gt;oozie.service.PurgeService.purge.old.coord.action=true&lt;/FONT&gt; in oozie-site.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Also, starting with CM 5.4, the Oozie Configuration page has controls for these configs, so you don't need the safety-valve anymore here.&lt;/P&gt;</description>
      <pubDate>Tue, 11 Aug 2015 17:44:39 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Oozie-not-cleaning-up-old-jobs-from-Oozie-database/m-p/30695#M6938</guid>
      <dc:creator>Robert K</dc:creator>
      <dc:date>2015-08-11T17:44:39Z</dc:date>
    </item>
    <item>
      <title>Re: Oozie not cleaning up old jobs from Oozie database</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Oozie-not-cleaning-up-old-jobs-from-Oozie-database/m-p/30713#M6939</link>
      <description>&lt;P&gt;&lt;SPAN&gt;Thanks &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt; rkanter.&lt;BR /&gt;We are using&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN&gt;CDH 4.5.0 alongwith&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN&gt;Cloudera Manager 5.2.0. Since, it is not possible to upgrade currently I guess I will have to resort to manually cleaning up the database tables. The database size is too large and we are observing latency in the queries (when observed via show processlist in MySQL). Are there any alternatives to CDH/CM upgrade or manual purging for maintaining the evergrowing database size ?&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 12 Aug 2015 07:49:44 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Oozie-not-cleaning-up-old-jobs-from-Oozie-database/m-p/30713#M6939</guid>
      <dc:creator>HadoopUserAffinity</dc:creator>
      <dc:date>2015-08-12T07:49:44Z</dc:date>
    </item>
    <item>
      <title>Re: Oozie not cleaning up old jobs from Oozie database</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Oozie-not-cleaning-up-old-jobs-from-Oozie-database/m-p/30728#M6940</link>
      <description>&lt;P&gt;&lt;SPAN&gt;What you can do as a workaround, is split up your long-running Coordinators.&amp;nbsp; For example, instead of making your Coordinator run for years? forever?, make it run for, say, 6 months.&amp;nbsp; And have an identical Coordinator scheduled to start exactly when that one ends.&amp;nbsp; This will allow Oozie to cleanup the old child Workflows from that Coordinator every 6 months. &amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;&lt;DIV&gt;Otherwise, you can schedule a cron job to manually delete old jobs from the Database.&amp;nbsp; However, please be careful about this.&amp;nbsp; When deleting a workflow job from the WF_JOBS table, you'll also need to delete the workflow actions from the WF_ACTIONS table that belong to it, as well as the coordinator action from the WF_ACTIONS table that it belongs to.&amp;nbsp; If you miss something, it will likely cause problems. &amp;nbsp;&lt;/DIV&gt;</description>
      <pubDate>Wed, 12 Aug 2015 18:44:46 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Oozie-not-cleaning-up-old-jobs-from-Oozie-database/m-p/30728#M6940</guid>
      <dc:creator>Robert K</dc:creator>
      <dc:date>2015-08-12T18:44:46Z</dc:date>
    </item>
    <item>
      <title>Re: Oozie not cleaning up old jobs from Oozie database</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Oozie-not-cleaning-up-old-jobs-from-Oozie-database/m-p/30744#M6941</link>
      <description>&lt;P&gt;Thanks Robert !&lt;/P&gt;</description>
      <pubDate>Thu, 13 Aug 2015 11:18:36 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Oozie-not-cleaning-up-old-jobs-from-Oozie-database/m-p/30744#M6941</guid>
      <dc:creator>HadoopUserAffinity</dc:creator>
      <dc:date>2015-08-13T11:18:36Z</dc:date>
    </item>
    <item>
      <title>Re: Oozie not cleaning up old jobs from Oozie database</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Oozie-not-cleaning-up-old-jobs-from-Oozie-database/m-p/61281#M6942</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/45"&gt;@Robert K&lt;/a&gt;&lt;/P&gt;&lt;P&gt;My postgresql db still stuck with some old bundle with status KILLED at the old time (year 2016).&lt;/P&gt;&lt;P&gt;I can see some log like 'STARTED Purge to purge Workflow Jobs older than [30] days, Coordinator Jobs older than [7] days, and Bundle jobs older than [7] days' but never see log like '&lt;SPAN&gt;ENDED Purge deleted ...&lt;/SPAN&gt;'&lt;/P&gt;&lt;P&gt;How can I check that (with oozie 4.2)?&lt;/P&gt;&lt;P&gt;Thank you very much.&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Screen Shot 2017-10-27 at 9.27.54 AM.png" style="width: 600px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/3525i756B98F5CABD0186/image-size/large?v=v2&amp;amp;px=999" role="button" title="Screen Shot 2017-10-27 at 9.27.54 AM.png" alt="Screen Shot 2017-10-27 at 9.27.54 AM.png" /&gt;&lt;/span&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 27 Oct 2017 02:28:33 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Oozie-not-cleaning-up-old-jobs-from-Oozie-database/m-p/61281#M6942</guid>
      <dc:creator>DLTU</dc:creator>
      <dc:date>2017-10-27T02:28:33Z</dc:date>
    </item>
  </channel>
</rss>

