<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question How to trigger oozie workflow when input data with dynamic name is available? in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-to-trigger-oozie-workflow-when-input-data-with-dynamic/m-p/171083#M41921</link>
    <description>&lt;P&gt;I am setting up an Oozie coordinator that should trigger a workflow when input data with a dynamic name becomes available. Here is the coordinator.xml:&lt;/P&gt;&lt;PRE&gt; &amp;lt;coordinator-app name="${jobName} Coordinator" frequency="${coord:days(1)}" start="${startTime}" end="2099-01-01T00:00Z" timezone="UTC" xmlns="uri:oozie:coordinator:0.1"&amp;gt;
   &amp;lt;datasets&amp;gt;
    &amp;lt;dataset name="gaSchema" frequency="30" initial-instance="${startTime}" timezone="UTC"&amp;gt;
      &amp;lt;uri-template&amp;gt;${nameNode}/ga/bySchema/&amp;lt;/uri-template&amp;gt;
      &amp;lt;done-flag&amp;gt;ga_${YEAR}${MONTH}${DAY}.avro&amp;lt;/done-flag&amp;gt;
    &amp;lt;/dataset&amp;gt;
  &amp;lt;/datasets&amp;gt;
  &amp;lt;input-events&amp;gt;
      &amp;lt;data-in name="coordInput1" dataset="gaSchema"&amp;gt;
          &amp;lt;start-instance&amp;gt;${coord:current(-23)}&amp;lt;/start-instance&amp;gt;
          &amp;lt;end-instance&amp;gt;${coord:current(0)}&amp;lt;/end-instance&amp;gt;
      &amp;lt;/data-in&amp;gt;
   &amp;lt;/input-events&amp;gt;
   &amp;lt;action&amp;gt;
      &amp;lt;workflow&amp;gt;
         &amp;lt;app-path&amp;gt;${wfApplicationPath}&amp;lt;/app-path&amp;gt;
         &amp;lt;configuration&amp;gt;
            &amp;lt;property&amp;gt;&amp;lt;name&amp;gt;date&amp;lt;/name&amp;gt;&amp;lt;value&amp;gt;${coord:formatTime(coord:nominalTime(), "yyyyMMdd")}&amp;lt;/value&amp;gt;&amp;lt;/property&amp;gt;
            &amp;lt;property&amp;gt;&amp;lt;name&amp;gt;jobTracker&amp;lt;/name&amp;gt;&amp;lt;value&amp;gt;${jobTracker}&amp;lt;/value&amp;gt;&amp;lt;/property&amp;gt;
            &amp;lt;property&amp;gt;&amp;lt;name&amp;gt;nameNode&amp;lt;/name&amp;gt;&amp;lt;value&amp;gt;${nameNode}&amp;lt;/value&amp;gt;&amp;lt;/property&amp;gt;
            &amp;lt;property&amp;gt;&amp;lt;name&amp;gt;jobName&amp;lt;/name&amp;gt;&amp;lt;value&amp;gt;${jobName}&amp;lt;/value&amp;gt;&amp;lt;/property&amp;gt;  
          &amp;lt;/configuration&amp;gt;
      &amp;lt;/workflow&amp;gt;
   &amp;lt;/action&amp;gt;
&amp;lt;/coordinator-app&amp;gt;&lt;/PRE&gt;&lt;P&gt;The workflow should be triggered when the file with the current date arrives in the HDFS folder:&lt;/P&gt;&lt;PRE&gt;  &amp;lt;done-flag&amp;gt;ga_${YEAR}${MONTH}${DAY}.avro&amp;lt;/done-flag&amp;gt;&lt;/PRE&gt;&lt;P&gt;This did not work with a dynamic file name. From what I found online, it seems to work with a dynamic folder and a fixed file name, for example:&lt;/P&gt;&lt;PRE&gt; &amp;lt;uri-template&amp;gt;${nameNode}/ga/bySchema/${YEAR}${MONTH}${DAY}&amp;lt;/uri-template&amp;gt;
  &amp;lt;done-flag&amp;gt;ga.avro&amp;lt;/done-flag&amp;gt;&lt;/PRE&gt;&lt;P&gt;In that case, I would have to create a lot of folders on HDFS because we import data every day.&lt;/P&gt;&lt;P&gt;Is there a way to trigger an Oozie workflow when input data with a dynamic name becomes available?&lt;/P&gt;&lt;P&gt;Thanks&lt;/P&gt;</description>
    <pubDate>Tue, 27 Sep 2016 16:42:07 GMT</pubDate>
    <dc:creator>amydream08</dc:creator>
    <dc:date>2016-09-27T16:42:07Z</dc:date>
    <item>
      <title>How to trigger oozie workflow when input data with dynamic name is available?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-to-trigger-oozie-workflow-when-input-data-with-dynamic/m-p/171083#M41921</link>
      <description>&lt;P&gt;I am setting up an Oozie coordinator that should trigger a workflow when input data with a dynamic name becomes available. Here is the coordinator.xml:&lt;/P&gt;&lt;PRE&gt; &amp;lt;coordinator-app name="${jobName} Coordinator" frequency="${coord:days(1)}" start="${startTime}" end="2099-01-01T00:00Z" timezone="UTC" xmlns="uri:oozie:coordinator:0.1"&amp;gt;
   &amp;lt;datasets&amp;gt;
    &amp;lt;dataset name="gaSchema" frequency="30" initial-instance="${startTime}" timezone="UTC"&amp;gt;
      &amp;lt;uri-template&amp;gt;${nameNode}/ga/bySchema/&amp;lt;/uri-template&amp;gt;
      &amp;lt;done-flag&amp;gt;ga_${YEAR}${MONTH}${DAY}.avro&amp;lt;/done-flag&amp;gt;
    &amp;lt;/dataset&amp;gt;
  &amp;lt;/datasets&amp;gt;
  &amp;lt;input-events&amp;gt;
      &amp;lt;data-in name="coordInput1" dataset="gaSchema"&amp;gt;
          &amp;lt;start-instance&amp;gt;${coord:current(-23)}&amp;lt;/start-instance&amp;gt;
          &amp;lt;end-instance&amp;gt;${coord:current(0)}&amp;lt;/end-instance&amp;gt;
      &amp;lt;/data-in&amp;gt;
   &amp;lt;/input-events&amp;gt;
   &amp;lt;action&amp;gt;
      &amp;lt;workflow&amp;gt;
         &amp;lt;app-path&amp;gt;${wfApplicationPath}&amp;lt;/app-path&amp;gt;
         &amp;lt;configuration&amp;gt;
            &amp;lt;property&amp;gt;&amp;lt;name&amp;gt;date&amp;lt;/name&amp;gt;&amp;lt;value&amp;gt;${coord:formatTime(coord:nominalTime(), "yyyyMMdd")}&amp;lt;/value&amp;gt;&amp;lt;/property&amp;gt;
            &amp;lt;property&amp;gt;&amp;lt;name&amp;gt;jobTracker&amp;lt;/name&amp;gt;&amp;lt;value&amp;gt;${jobTracker}&amp;lt;/value&amp;gt;&amp;lt;/property&amp;gt;
            &amp;lt;property&amp;gt;&amp;lt;name&amp;gt;nameNode&amp;lt;/name&amp;gt;&amp;lt;value&amp;gt;${nameNode}&amp;lt;/value&amp;gt;&amp;lt;/property&amp;gt;
            &amp;lt;property&amp;gt;&amp;lt;name&amp;gt;jobName&amp;lt;/name&amp;gt;&amp;lt;value&amp;gt;${jobName}&amp;lt;/value&amp;gt;&amp;lt;/property&amp;gt;  
          &amp;lt;/configuration&amp;gt;
      &amp;lt;/workflow&amp;gt;
   &amp;lt;/action&amp;gt;
&amp;lt;/coordinator-app&amp;gt;&lt;/PRE&gt;&lt;P&gt;The workflow should be triggered when the file with the current date arrives in the HDFS folder:&lt;/P&gt;&lt;PRE&gt;  &amp;lt;done-flag&amp;gt;ga_${YEAR}${MONTH}${DAY}.avro&amp;lt;/done-flag&amp;gt;&lt;/PRE&gt;&lt;P&gt;This did not work with a dynamic file name. From what I found online, it seems to work with a dynamic folder and a fixed file name, for example:&lt;/P&gt;&lt;PRE&gt; &amp;lt;uri-template&amp;gt;${nameNode}/ga/bySchema/${YEAR}${MONTH}${DAY}&amp;lt;/uri-template&amp;gt;
  &amp;lt;done-flag&amp;gt;ga.avro&amp;lt;/done-flag&amp;gt;&lt;/PRE&gt;&lt;P&gt;In that case, I would have to create a lot of folders on HDFS because we import data every day.&lt;/P&gt;&lt;P&gt;Is there a way to trigger an Oozie workflow when input data with a dynamic name becomes available?&lt;/P&gt;&lt;P&gt;Thanks&lt;/P&gt;</description>
      <pubDate>Tue, 27 Sep 2016 16:42:07 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-to-trigger-oozie-workflow-when-input-data-with-dynamic/m-p/171083#M41921</guid>
      <dc:creator>amydream08</dc:creator>
      <dc:date>2016-09-27T16:42:07Z</dc:date>
    </item>
    <item>
      <title>Re: How to trigger oozie workflow when input data with dynamic name is available?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-to-trigger-oozie-workflow-when-input-data-with-dynamic/m-p/171084#M41922</link>
      <description>&lt;P&gt;I found a workaround. When a file is dropped in the folder /ga/bySchema/, a _SUCCESS file is also created in that folder, and the dataset uses &amp;lt;done-flag&amp;gt;_SUCCESS&amp;lt;/done-flag&amp;gt; to trigger the workflow. In workflow.xml, I move the file (ga_${today}.avro) into an archive folder and delete the _SUCCESS file. ${today} is defined in coordinator.xml. It is now working fine.&lt;/P&gt;</description>
      <pubDate>Wed, 05 Oct 2016 18:14:23 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-to-trigger-oozie-workflow-when-input-data-with-dynamic/m-p/171084#M41922</guid>
      <dc:creator>amydream08</dc:creator>
      <dc:date>2016-10-05T18:14:23Z</dc:date>
    </item>
  </channel>
</rss>

