<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: oozie workflow in Production in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/oozie-workflow-in-Production/m-p/126197#M51422</link>
    <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/9789/vamsivalivetiedu.html" nodeid="9789"&gt;@vamsi valiveti&lt;/A&gt; I have to side with &lt;A rel="user" href="https://community.cloudera.com/users/10616/sgowda.html" nodeid="10616"&gt;@Santhosh B Gowda&lt;/A&gt; typically scheduled jobs are handled by coordinator workflow. Implement condition logic in your Oozie workflow and if there's new data, run the actions, otherwise proceed to end action.&lt;/P&gt;</description>
    <pubDate>Fri, 13 Jan 2017 00:25:03 GMT</pubDate>
    <dc:creator>aervits</dc:creator>
    <dc:date>2017-01-13T00:25:03Z</dc:date>
    <item>
      <title>oozie workflow in Production</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/oozie-workflow-in-Production/m-p/126193#M51418</link>
      <description>&lt;P&gt;Currently we are using oozie workflow(consists of hive,pig,sqoop actions) using below command in dev environment. In Production environment we should not run manually.can I create a shell script for below command and I can run that shell script using crontab scheduler.Is my approach is correct if yes what is the timings for the script. If not what is the approach to run below command in Production?&lt;/P&gt;&lt;PRE&gt;oozie job --oozie &lt;A href="http://host_nameofoozieserver:8080/oozie" target="_blank" rel="nofollow noopener noreferrer"&gt;http://host_nameofoozieserver:8080/oozie&lt;/A&gt; -D
oozie.wf.application.path=hdfs://namenodepath/pathof_workflow_xml/workflow.xml-run &lt;/PRE&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="11349-oozie-clarification.jpg" style="width: 488px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/22663iD7FC65C60A59E63A/image-size/medium?v=v2&amp;amp;px=400" role="button" title="11349-oozie-clarification.jpg" alt="11349-oozie-clarification.jpg" /&gt;&lt;/span&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 19 Aug 2019 10:07:23 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/oozie-workflow-in-Production/m-p/126193#M51418</guid>
      <dc:creator>vamsi123</dc:creator>
      <dc:date>2019-08-19T10:07:23Z</dc:date>
    </item>
    <item>
      <title>Re: oozie workflow in Production</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/oozie-workflow-in-Production/m-p/126194#M51419</link>
      <description>&lt;A rel="user" href="https://community.cloudera.com/users/9789/vamsivalivetiedu.html" nodeid="9789"&gt;@vamsi valiveti&lt;/A&gt;&lt;P&gt;Have we explored oozie co-ordinator ? does that not solve the problem&lt;/P&gt;&lt;P&gt;&lt;A href="https://oozie.apache.org/docs/3.1.3-incubating/CoordinatorFunctionalSpec.html" target="_blank"&gt;https://oozie.apache.org/docs/3.1.3-incubating/CoordinatorFunctionalSpec.html&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 12 Jan 2017 17:13:44 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/oozie-workflow-in-Production/m-p/126194#M51419</guid>
      <dc:creator>sgowda</dc:creator>
      <dc:date>2017-01-12T17:13:44Z</dc:date>
    </item>
    <item>
      <title>Re: oozie workflow in Production</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/oozie-workflow-in-Production/m-p/126195#M51420</link>
      <description>&lt;P&gt;HI &lt;A rel="user" href="https://community.cloudera.com/users/10616/sgowda.html" nodeid="10616"&gt;@Santhosh B Gowda&lt;/A&gt;&lt;/P&gt;&lt;P&gt;Thanks for input.&lt;/P&gt;&lt;P&gt;a)My question is how to run below command in production since we should not run manually.&lt;/P&gt;&lt;PRE&gt;oozie job --oozie &lt;A href="http://host_nameofoozieserver:8080/oozie"&gt;http://host_nameofoozieserver:8080/oozie&lt;/A&gt; -Doozie.wf.application.path=hdfs://namenodepath/pathof_workflow_xml/workflow.xml-run 
&lt;/PRE&gt;&lt;P&gt;b)I know about coordinator but at this point of time i am not sure whether i have to use data or time triggers.&lt;/P&gt;&lt;P&gt;currently we are running flume continusoly&lt;/P&gt;</description>
      <pubDate>Thu, 12 Jan 2017 17:43:00 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/oozie-workflow-in-Production/m-p/126195#M51420</guid>
      <dc:creator>vamsi123</dc:creator>
      <dc:date>2017-01-12T17:43:00Z</dc:date>
    </item>
    <item>
      <title>Re: oozie workflow in Production</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/oozie-workflow-in-Production/m-p/126196#M51421</link>
      <description>&lt;P&gt;Hi experts,&lt;/P&gt;&lt;P&gt;Any input on my clarification&lt;/P&gt;</description>
      <pubDate>Thu, 12 Jan 2017 23:40:27 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/oozie-workflow-in-Production/m-p/126196#M51421</guid>
      <dc:creator>vamsi123</dc:creator>
      <dc:date>2017-01-12T23:40:27Z</dc:date>
    </item>
    <item>
      <title>Re: oozie workflow in Production</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/oozie-workflow-in-Production/m-p/126197#M51422</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/9789/vamsivalivetiedu.html" nodeid="9789"&gt;@vamsi valiveti&lt;/A&gt; I have to side with &lt;A rel="user" href="https://community.cloudera.com/users/10616/sgowda.html" nodeid="10616"&gt;@Santhosh B Gowda&lt;/A&gt; typically scheduled jobs are handled by coordinator workflow. Implement condition logic in your Oozie workflow and if there's new data, run the actions, otherwise proceed to end action.&lt;/P&gt;</description>
      <pubDate>Fri, 13 Jan 2017 00:25:03 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/oozie-workflow-in-Production/m-p/126197#M51422</guid>
      <dc:creator>aervits</dc:creator>
      <dc:date>2017-01-13T00:25:03Z</dc:date>
    </item>
    <item>
      <title>Re: oozie workflow in Production</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/oozie-workflow-in-Production/m-p/126198#M51423</link>
      <description>&lt;P&gt;
	If you want to use crontab that you already decided to use a time-trigger/interval right? You should really use a coordinator.&lt;/P&gt;&lt;P&gt;
	If you really want to stick with crontab than the command is more or less correct. You have a typo (--oozie should be -oozie, and the port normally is 11000 but I guess you already confirmed the port?) and normally you refer to a job.properties file (stored locally not on hdfs) with -config.&lt;/P&gt;&lt;P&gt;
	So it should look like:&lt;/P&gt;&lt;PRE&gt;oozie job -oozie &lt;A href="http://sandbox.hortonworks.com:11000/oozie"&gt;http://sandbox.hortonworks.com:11000/oozie&lt;/A&gt; -config /path/to/job.properties -run
&lt;/PRE&gt;&lt;P&gt;In the job.properties file you would have parameters listed like namenode, jobtracker, hcmetastoreuri and of course the one you provide via the -D: oozie.wf.application.path&lt;/P&gt;&lt;P&gt;Normally the hdfs://namenode part can be omitted from the apppath url.&lt;/P&gt;</description>
      <pubDate>Fri, 13 Jan 2017 16:38:03 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/oozie-workflow-in-Production/m-p/126198#M51423</guid>
      <dc:creator>rene_sluiter</dc:creator>
      <dc:date>2017-01-13T16:38:03Z</dc:date>
    </item>
    <item>
      <title>Re: oozie workflow in Production</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/oozie-workflow-in-Production/m-p/126199#M51424</link>
      <description>&lt;P&gt;a)Thanks for input  &lt;A rel="user" href="https://community.cloudera.com/users/393/aervits.html" nodeid="393"&gt;@Artem Ervits &lt;/A&gt;.Your input is always appreciated.&lt;/P&gt;&lt;P&gt;I will go for coordinator Job with time and data availability-based scheduling but still have following clarifications&lt;/P&gt;&lt;PRE&gt;clarification 1:- suppose if i am using below command to trigger the coordinator job.Is it one time activity in production to run this command once in production since it will trigger based on frequency for day 2?please correct me if i am wrong or do i need to run this command on day2 also?

oozie job -oozie &lt;A href="http://sandbox.hortonworks.com:11000/oozie" target="_blank"&gt;http://sandbox.hortonworks.com:11000/oozie&lt;/A&gt; -config /path/to/job.properties -run


&amp;lt;coordinator-app name="my_first_job" start="2014-01-01T02:00Z"
end="2014-12-31T02:00Z" frequency="${coord:days(1)}"
xmlns="uri:oozie:coordinator:0.4"&amp;gt;
&lt;/PRE&gt;&lt;PRE&gt;clarification 2:-How to Implement condition logic in your Oozie workflow and if there's new data, run the actions, otherwise proceed to end action?&lt;/PRE&gt;</description>
      <pubDate>Fri, 13 Jan 2017 23:51:31 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/oozie-workflow-in-Production/m-p/126199#M51424</guid>
      <dc:creator>vamsi123</dc:creator>
      <dc:date>2017-01-13T23:51:31Z</dc:date>
    </item>
    <item>
      <title>Re: oozie workflow in Production</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/oozie-workflow-in-Production/m-p/126200#M51425</link>
      <description>&lt;P&gt;Hi folks&lt;/P&gt;&lt;P&gt;Any input on my clarification&lt;/P&gt;</description>
      <pubDate>Tue, 17 Jan 2017 22:53:06 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/oozie-workflow-in-Production/m-p/126200#M51425</guid>
      <dc:creator>vamsi123</dc:creator>
      <dc:date>2017-01-17T22:53:06Z</dc:date>
    </item>
    <item>
      <title>Re: oozie workflow in Production</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/oozie-workflow-in-Production/m-p/126201#M51426</link>
      <description>&lt;P&gt;1. Only once&lt;/P&gt;&lt;P&gt;2. Use decision property &lt;A href="https://www.infoq.com/articles/oozieexample/" target="_blank"&gt;https://www.infoq.com/articles/oozieexample/&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 17 Jan 2017 22:58:48 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/oozie-workflow-in-Production/m-p/126201#M51426</guid>
      <dc:creator>aervits</dc:creator>
      <dc:date>2017-01-17T22:58:48Z</dc:date>
    </item>
  </channel>
</rss>

