<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Feed Replication Only after Process Completion in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/Feed-Replication-Only-after-Process-Completion/m-p/100454#M63473</link>
    <description>&lt;P&gt;You are very welcome. Happy to be of help.&lt;/P&gt;</description>
    <pubDate>Wed, 23 Dec 2015 05:46:43 GMT</pubDate>
    <dc:creator>bvellanki</dc:creator>
    <dc:date>2015-12-23T05:46:43Z</dc:date>
    <item>
      <title>Feed Replication Only after Process Completion</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Feed-Replication-Only-after-Process-Completion/m-p/100449#M63468</link>
      <description>&lt;P&gt;&lt;STRONG&gt;Scenario&lt;/STRONG&gt;: We have data that is ingested into the cluster via a Falcon process. The process uses a Falcon feed as an output to replicate the ingested data to a backup cluster. We'd like the feed to &lt;STRONG&gt;not replicate&lt;/STRONG&gt; until the process has completed. We currently use a delay to semi-accomplish this, but it's not perfect.&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;Question&lt;/STRONG&gt;: How can we (if at all) tell Falcon to wait until the process has completed before it begins replication? Currently our process.xml contains the code below, which tells the feed to start "now" but with a delay of 2 hours (the delay being specified in feed.xml).&lt;/P&gt;&lt;P&gt;To be clear, we have replication working just fine; we're just after a more elegant way for Falcon to replicate only after the process is confirmed complete. Is there a way?&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;Process&lt;/STRONG&gt;&lt;/P&gt;&lt;PRE&gt;&amp;lt;outputs&amp;gt;
    &amp;lt;output name="hdp0001-my-feed" feed="hdp0001-my-feed" instance="now(0,0)"/&amp;gt;
&amp;lt;/outputs&amp;gt;&lt;/PRE&gt;&lt;P&gt;&lt;STRONG&gt;Feed&lt;/STRONG&gt;&lt;/P&gt;&lt;PRE&gt;&amp;lt;cluster name="primary-cluster" type="source" delay="hours(1)"&amp;gt;
    &amp;lt;validity start="2015-12-04T09:30Z" end="2099-12-31T23:59Z"/&amp;gt;
    &amp;lt;retention limit="months(9999)" action="archive"/&amp;gt;
    &amp;lt;locations&amp;gt;
        &amp;lt;location type="data" path="/hdfs/data/path/to/my_table/"/&amp;gt;
    &amp;lt;/locations&amp;gt;
&amp;lt;/cluster&amp;gt;&lt;/PRE&gt;</description>
      <pubDate>Tue, 22 Dec 2015 04:08:33 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Feed-Replication-Only-after-Process-Completion/m-p/100449#M63468</guid>
      <dc:creator>landon_t_robins</dc:creator>
      <dc:date>2015-12-22T04:08:33Z</dc:date>
    </item>
    <item>
      <title>Re: Feed Replication Only after Process Completion</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Feed-Replication-Only-after-Process-Completion/m-p/100450#M63469</link>
      <description>&lt;P&gt;In the feed entity specification (&lt;A href="http://falcon.apache.org/EntitySpecification.html#Feed_Specification"&gt;http://falcon.apache.org/EntitySpecification.html#...&lt;/A&gt;), look for the availabilityFlag element. A feed is considered available for downstream consumption, replication, and so on once the availabilityFlag file has been created. You can make your process create that file as its last task. Please let me know if this works.&lt;/P&gt;&lt;PRE&gt;    &amp;lt;availabilityFlag&amp;gt;_SUCCESS&amp;lt;/availabilityFlag&amp;gt;
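
&amp;lt;!-- Sketch only, not a tested configuration: one way the flag might sit in the
     feed entity. The frequency value is an assumption; the feed name and cluster
     name come from the question. Element order is believed to follow the feed
     schema, so verify it against the XSD for your Falcon version. --&amp;gt;
&amp;lt;feed name="hdp0001-my-feed" xmlns="uri:falcon:feed:0.1"&amp;gt;
    &amp;lt;availabilityFlag&amp;gt;_SUCCESS&amp;lt;/availabilityFlag&amp;gt;
    &amp;lt;frequency&amp;gt;hours(1)&amp;lt;/frequency&amp;gt;  &amp;lt;!-- assumed value --&amp;gt;
    &amp;lt;clusters&amp;gt;
        &amp;lt;cluster name="primary-cluster" type="source"&amp;gt;
            &amp;lt;!-- validity, retention and locations as in the question --&amp;gt;
        &amp;lt;/cluster&amp;gt;
    &amp;lt;/clusters&amp;gt;
    &amp;lt;!-- remaining feed elements (locations, ACL, schema) omitted --&amp;gt;
&amp;lt;/feed&amp;gt;

&amp;lt;!-- Hypothetical final Oozie workflow action that writes the flag as the last
     task of the process. The action name, the transition targets and the
     ${nameNode} variable are assumptions; the path is the one from the question. --&amp;gt;
&amp;lt;action name="write-success-flag"&amp;gt;
    &amp;lt;fs&amp;gt;
        &amp;lt;touchz path="${nameNode}/hdfs/data/path/to/my_table/_SUCCESS"/&amp;gt;
    &amp;lt;/fs&amp;gt;
    &amp;lt;ok to="end"/&amp;gt;
    &amp;lt;error to="fail"/&amp;gt;
&amp;lt;/action&amp;gt;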
&lt;/PRE&gt;</description>
      <pubDate>Tue, 22 Dec 2015 05:05:47 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Feed-Replication-Only-after-Process-Completion/m-p/100450#M63469</guid>
      <dc:creator>bvellanki</dc:creator>
      <dc:date>2015-12-22T05:05:47Z</dc:date>
    </item>
    <item>
      <title>Re: Feed Replication Only after Process Completion</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Feed-Replication-Only-after-Process-Completion/m-p/100451#M63470</link>
      <description>&lt;P&gt;Balu, thanks for that answer! We knew of this tag, but weren't sure how to truly use it. I'm testing this now and will let you know asap! Thank you kindly.&lt;/P&gt;</description>
      <pubDate>Wed, 23 Dec 2015 03:58:52 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Feed-Replication-Only-after-Process-Completion/m-p/100451#M63470</guid>
      <dc:creator>landon_t_robins</dc:creator>
      <dc:date>2015-12-23T03:58:52Z</dc:date>
    </item>
    <item>
      <title>Re: Feed Replication Only after Process Completion</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Feed-Replication-Only-after-Process-Completion/m-p/100452#M63471</link>
      <description>&lt;P&gt;Hey Balu, that worked. Perfectly, actually!&lt;/P&gt;&lt;P&gt;The only thing left I was curious about... is there a Falcon-based way to remove the _SUCCESS file after replication has completed? I know how we can do it after some time or after creating it in the process (and waiting a few minutes perhaps), but if there's a Falcon method or tag I'd love to leverage that.&lt;/P&gt;&lt;P&gt;Please let me know! And thanks for all the great help.&lt;/P&gt;</description>
      <pubDate>Wed, 23 Dec 2015 04:32:27 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Feed-Replication-Only-after-Process-Completion/m-p/100452#M63471</guid>
      <dc:creator>landon_t_robins</dc:creator>
      <dc:date>2015-12-23T04:32:27Z</dc:date>
    </item>
    <item>
      <title>Re: Feed Replication Only after Process Completion</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Feed-Replication-Only-after-Process-Completion/m-p/100453#M63472</link>
      <description>&lt;P&gt;Unfortunately, there is no Falcon way to do this. Sorry.&lt;/P&gt;</description>
      <pubDate>Wed, 23 Dec 2015 05:46:15 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Feed-Replication-Only-after-Process-Completion/m-p/100453#M63472</guid>
      <dc:creator>bvellanki</dc:creator>
      <dc:date>2015-12-23T05:46:15Z</dc:date>
    </item>
    <item>
      <title>Re: Feed Replication Only after Process Completion</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Feed-Replication-Only-after-Process-Completion/m-p/100454#M63473</link>
      <description>&lt;P&gt;You are very welcome. Happy to be of help.&lt;/P&gt;</description>
      <pubDate>Wed, 23 Dec 2015 05:46:43 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Feed-Replication-Only-after-Process-Completion/m-p/100454#M63473</guid>
      <dc:creator>bvellanki</dc:creator>
      <dc:date>2015-12-23T05:46:43Z</dc:date>
    </item>
  </channel>
</rss>

