<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Scheduling a Python script in OOZIE in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Scheduling-a-Python-script-in-OOZIE/m-p/117488#M16966</link>
    <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/168/bleonhardi.html" nodeid="168"&gt;@Benjamin Leonhardi&lt;/A&gt; and &lt;A rel="user" href="https://community.cloudera.com/users/393/aervits.html" nodeid="393"&gt;@Artem Ervits&lt;/A&gt;&lt;/P&gt;&lt;P&gt;Thank you so much for your detailed responses! I should have mentioned that I am working on the Sandbox and on Windows. I tried to create the job through Ambari-&amp;gt;Oozie-&amp;gt;Oozie Web UI-&amp;gt;Coordinator but there is no feature to create a job. How do I create the job using the UI?
&lt;/P&gt;</description>
    <pubDate>Mon, 01 Feb 2016 13:44:22 GMT</pubDate>
    <dc:creator>bidyut1</dc:creator>
    <dc:date>2016-02-01T13:44:22Z</dc:date>
    <item>
      <title>Scheduling a Python script in OOZIE</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Scheduling-a-Python-script-in-OOZIE/m-p/117485#M16963</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;I have created a Python script. The script pulls an RSS feed and writes the output to a text file. I would like to execute the Python job once a day. Can this be done using Oozie? Please feel free to suggest a better solution.&lt;/P&gt;&lt;P&gt;Thanks in advance for your help!&lt;/P&gt;</description>
      <pubDate>Fri, 29 Jan 2016 16:26:19 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Scheduling-a-Python-script-in-OOZIE/m-p/117485#M16963</guid>
      <dc:creator>bidyut1</dc:creator>
      <dc:date>2016-01-29T16:26:19Z</dc:date>
    </item>
    <item>
      <title>Re: Scheduling a Python script in OOZIE</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Scheduling-a-Python-script-in-OOZIE/m-p/117486#M16964</link>
      <description>&lt;P&gt;Yes, this can be done in Oozie. I would suggest a shell action. You need to upload all the files you need (libraries etc.) by adding them in file tags. For example, I normally have a shell script that does a kinit for Kerberos if needed (you would need to upload the keytab as well) and then executes the Python scripts with parameters like outputFolder.&lt;/P&gt;&lt;P&gt;Note that the action can run on any datanode, so all of them need access to your RSS feed. Alternatively, you could use an SSH action to connect to an edge node.&lt;/P&gt;&lt;PRE&gt;&amp;lt;action name="mypython"&amp;gt;
        &amp;lt;shell xmlns="uri:oozie:shell-action:0.1"&amp;gt;
            &amp;lt;job-tracker&amp;gt;${jobTracker}&amp;lt;/job-tracker&amp;gt;
            &amp;lt;name-node&amp;gt;${nameNode}&amp;lt;/name-node&amp;gt;
            &amp;lt;configuration&amp;gt;
                &amp;lt;property&amp;gt;
                  &amp;lt;name&amp;gt;mapred.job.queue.name&amp;lt;/name&amp;gt;
                  &amp;lt;value&amp;gt;${queueName}&amp;lt;/value&amp;gt;
                &amp;lt;/property&amp;gt;
            &amp;lt;/configuration&amp;gt;
            &amp;lt;exec&amp;gt;setupAndRun.sh&amp;lt;/exec&amp;gt;
            &amp;lt;env-var&amp;gt;outputFolder=${outputFolder}&amp;lt;/env-var&amp;gt;
            &amp;lt;env-var&amp;gt;targetFolder=${targetFolder}&amp;lt;/env-var&amp;gt;
            &amp;lt;file&amp;gt;${nameNode}/hdfsfolder/setupAndRun.sh#setupAndRun.sh&amp;lt;/file&amp;gt;
            &amp;lt;file&amp;gt;${nameNode}/hdfsfolder/mypython.py#mypython.py&amp;lt;/file&amp;gt;
        &amp;lt;/shell&amp;gt;
        &amp;lt;ok to="end" /&amp;gt;
        &amp;lt;error to="kill" /&amp;gt;
    &amp;lt;/action&amp;gt;
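    &amp;lt;!-- Sketch only, not part of the original reply: a minimal coordinator.xml that
         could trigger the workflow containing the action above once a day.
         Assumptions: the names daily-python, startTime, endTime and workflowAppPath
         are hypothetical and would be defined in job.properties, together with
         oozie.coord.application.path pointing at the HDFS directory of this file.
         The schema version may need adjusting for your Oozie release. --&amp;gt;
    &amp;lt;coordinator-app name="daily-python" frequency="${coord:days(1)}"
                     start="${startTime}" end="${endTime}" timezone="UTC"
                     xmlns="uri:oozie:coordinator:0.4"&amp;gt;
        &amp;lt;action&amp;gt;
            &amp;lt;workflow&amp;gt;
                &amp;lt;!-- HDFS directory that holds workflow.xml --&amp;gt;
                &amp;lt;app-path&amp;gt;${workflowAppPath}&amp;lt;/app-path&amp;gt;
            &amp;lt;/workflow&amp;gt;
        &amp;lt;/action&amp;gt;
    &amp;lt;/coordinator-app&amp;gt;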
&lt;/PRE&gt;</description>
      <pubDate>Fri, 29 Jan 2016 17:43:55 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Scheduling-a-Python-script-in-OOZIE/m-p/117486#M16964</guid>
      <dc:creator>bleonhardi</dc:creator>
      <dc:date>2016-01-29T17:43:55Z</dc:date>
    </item>
    <item>
      <title>Re: Scheduling a Python script in OOZIE</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Scheduling-a-Python-script-in-OOZIE/m-p/117487#M16965</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/1835/bidyut1.html" nodeid="1835"&gt;@Bidyut B&lt;/A&gt;
&lt;/P&gt;&lt;P&gt;This is a common request, and I was also interested in how to do it. As &lt;A rel="user" href="https://community.cloudera.com/users/168/bleonhardi.html" nodeid="168"&gt;@Benjamin Leonhardi&lt;/A&gt; stated, you use the standard shell action. Here's a &lt;A href="https://github.com/dbist/oozie/tree/master/apps/python"&gt;sample&lt;/A&gt; Python workflow I created and tested on the current release of the Sandbox. If you want to use Python 3 with Oozie, I added an &lt;A href="https://github.com/dbist/oozie/tree/master/apps/python3"&gt;example&lt;/A&gt; of that too, though it shouldn't be much different.&lt;/P&gt;</description>
      <pubDate>Mon, 01 Feb 2016 01:30:49 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Scheduling-a-Python-script-in-OOZIE/m-p/117487#M16965</guid>
      <dc:creator>aervits</dc:creator>
      <dc:date>2016-02-01T01:30:49Z</dc:date>
    </item>
    <item>
      <title>Re: Scheduling a Python script in OOZIE</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Scheduling-a-Python-script-in-OOZIE/m-p/117488#M16966</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/168/bleonhardi.html" nodeid="168"&gt;@Benjamin Leonhardi&lt;/A&gt; and &lt;A rel="user" href="https://community.cloudera.com/users/393/aervits.html" nodeid="393"&gt;@Artem Ervits&lt;/A&gt;&lt;/P&gt;&lt;P&gt;Thank you so much for your detailed responses! I should have mentioned that I am working on the Sandbox and on Windows. I tried to create the job through Ambari-&amp;gt;Oozie-&amp;gt;Oozie Web UI-&amp;gt;Coordinator but there is no feature to create a job. How do I create the job using the UI?
&lt;/P&gt;</description>
      <pubDate>Mon, 01 Feb 2016 13:44:22 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Scheduling-a-Python-script-in-OOZIE/m-p/117488#M16966</guid>
      <dc:creator>bidyut1</dc:creator>
      <dc:date>2016-02-01T13:44:22Z</dc:date>
    </item>
    <item>
      <title>Re: Scheduling a Python script in OOZIE</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Scheduling-a-Python-script-in-OOZIE/m-p/117489#M16967</link>
      <description>&lt;P&gt;After some searching I found that I need Hue to create Oozie jobs. Now the fun begins. I would like to list the steps for my own future reference and for other users.&lt;/P&gt;&lt;P&gt;Hue can be accessed at &lt;A href="http://YourHostName:8000"&gt;http://YourHostName:8000&lt;/A&gt; (thanks to this post for the hint on the Hue web address: &lt;A href="https://martin.atlassian.net/wiki/pages/viewpage.action?pageId=22839304" target="_blank"&gt;https://martin.atlassian.net/wiki/pages/viewpage.action?pageId=22839304&lt;/A&gt;).&lt;/P&gt;&lt;P&gt;Once in Hue, go to &lt;A href="http://YourHostName:8000/oozie/list_coordinators/" target="_blank"&gt;http://YourHostName:8000/oozie/list_coordinators/&lt;/A&gt; and create a new coordinator by clicking the 'Create' button on the right-hand side.&lt;/P&gt;&lt;P&gt;The coordinator requires a workflow, which can be created at &lt;A href="http://YourHostName:8000/oozie/list_workflows/" target="_blank"&gt;http://YourHostName:8000/oozie/list_workflows/&lt;/A&gt;.&lt;/P&gt;</description>
      <pubDate>Mon, 01 Feb 2016 14:22:35 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Scheduling-a-Python-script-in-OOZIE/m-p/117489#M16967</guid>
      <dc:creator>bidyut1</dc:creator>
      <dc:date>2016-02-01T14:22:35Z</dc:date>
    </item>
    <item>
      <title>Re: Scheduling a Python script in OOZIE</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Scheduling-a-Python-script-in-OOZIE/m-p/117490#M16968</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/1835/bidyut1.html" nodeid="1835"&gt;@Bidyut B&lt;/A&gt; if you can, please create an article, preferably with images, on how to create a coordinator workflow with Hue.&lt;/P&gt;</description>
      <pubDate>Mon, 01 Feb 2016 19:00:25 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Scheduling-a-Python-script-in-OOZIE/m-p/117490#M16968</guid>
      <dc:creator>aervits</dc:creator>
      <dc:date>2016-02-01T19:00:25Z</dc:date>
    </item>
    <item>
      <title>Re: Scheduling a Python script in OOZIE</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Scheduling-a-Python-script-in-OOZIE/m-p/117491#M16969</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/393/aervits.html" nodeid="393"&gt;@Artem Ervits&lt;/A&gt;&lt;/P&gt;&lt;P&gt;Thanks for the suggestion! I will create an article. Let me know if I need to submit it for review.&lt;/P&gt;</description>
      <pubDate>Tue, 02 Feb 2016 15:03:28 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Scheduling-a-Python-script-in-OOZIE/m-p/117491#M16969</guid>
      <dc:creator>bidyut1</dc:creator>
      <dc:date>2016-02-02T15:03:28Z</dc:date>
    </item>
    <item>
      <title>Re: Scheduling a Python script in OOZIE</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Scheduling-a-Python-script-in-OOZIE/m-p/117492#M16970</link>
      <description>&lt;P&gt;Hi &lt;A href="https://community.hortonworks.com/questions/12354/scheduling-a-python-script-in-oozie.html#"&gt;@Artem Ervits&lt;/A&gt; and @Benjamin Leonhardi,&lt;/P&gt;&lt;P&gt;For the last two days I have been trying to execute a Python script using an Oozie workflow with Hue-&amp;gt;Workflows-&amp;gt;Shell action, but I keep getting the message 'Couldn't save the workflow'.&lt;/P&gt;&lt;P&gt;The script 'hello.py' contains the single statement&lt;/P&gt;&lt;P&gt;print("Hello, World!")&lt;/P&gt;&lt;P&gt;and is placed under the /user/oozie folder with all permissions.&lt;/P&gt;&lt;P&gt;My Shell action contains the attached workflow file.&lt;/P&gt;&lt;P&gt;The params are:&lt;/P&gt;&lt;P&gt;&lt;A href="http://localhost:8050"&gt;http://localhost:8050&lt;/A&gt;&lt;/P&gt;&lt;P&gt;hdfs://sandbox.hortonworks.com:8020&lt;/P&gt;&lt;P&gt;default&lt;/P&gt;&lt;P&gt;In the Job properties section, the property name is 'job-tracker' and the value is shown in the attached file (jobproperties.png).&lt;/P&gt;&lt;BR /&gt;&lt;IMG src="https://community.cloudera.com/t5/image/serverpage/image-id/6178iA5C9F27500DADB09/image-size/large?v=1.0&amp;amp;px=999" border="0" alt="hue-shell-action.png" title="hue-shell-action.png" /&gt;&lt;IMG src="https://community.cloudera.com/t5/image/serverpage/image-id/6179i427058439692299E/image-size/large?v=1.0&amp;amp;px=999" border="0" alt="jobproperties.png" title="jobproperties.png" /&gt;</description>
      <pubDate>Wed, 03 Feb 2016 19:08:19 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Scheduling-a-Python-script-in-OOZIE/m-p/117492#M16970</guid>
      <dc:creator>bidyut1</dc:creator>
      <dc:date>2016-02-03T19:08:19Z</dc:date>
    </item>
    <item>
      <title>Re: Scheduling a Python script in OOZIE</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Scheduling-a-Python-script-in-OOZIE/m-p/117493#M16971</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/1835/bidyut1.html" nodeid="1835"&gt;@Bidyut B&lt;/A&gt; with what I gave you, you don't need Hue. Just use my directories and adjust them. If you get specific errors, open a new question.&lt;/P&gt;</description>
      <pubDate>Wed, 03 Feb 2016 19:45:24 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Scheduling-a-Python-script-in-OOZIE/m-p/117493#M16971</guid>
      <dc:creator>aervits</dc:creator>
      <dc:date>2016-02-03T19:45:24Z</dc:date>
    </item>
    <item>
      <title>Re: Scheduling a Python script in OOZIE</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Scheduling-a-Python-script-in-OOZIE/m-p/117494#M16972</link>
      <description>&lt;A rel="user" href="https://community.cloudera.com/users/393/aervits.html" nodeid="393" target="_blank"&gt;@Artem Ervits&lt;/A&gt;&lt;P&gt;Sure, I am trying the same example now. I copied the files to the 'Oozie' folder on HDFS (see the 'files' screenshot):&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="1801-files.png" style="width: 1219px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/23005iBD83BC9D3E700697/image-size/medium?v=v2&amp;amp;px=400" role="button" title="1801-files.png" alt="1801-files.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;but I encountered a JVM-related error (screenshot attached). To start Oozie, I navigated to the folder /usr/lib/oozie/bin/ and executed the script 'oozie-start.sh':&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="1780-oozie-error.png" style="width: 949px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/23006iA68514BB086F9C2D/image-size/medium?v=v2&amp;amp;px=400" role="button" title="1780-oozie-error.png" alt="1780-oozie-error.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;I am not sure if this is the correct way to execute the job. Any help would be greatly appreciated.&lt;/P&gt;</description>
      <pubDate>Mon, 19 Aug 2019 10:48:58 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Scheduling-a-Python-script-in-OOZIE/m-p/117494#M16972</guid>
      <dc:creator>bidyut1</dc:creator>
      <dc:date>2019-08-19T10:48:58Z</dc:date>
    </item>
    <item>
      <title>Re: Scheduling a Python script in OOZIE</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Scheduling-a-Python-script-in-OOZIE/m-p/117495#M16973</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/1835/bidyut1.html" nodeid="1835"&gt;@Bidyut B&lt;/A&gt; are you not using Ambari? Start and stop Oozie with Ambari. Otherwise, follow the Oozie installation guide to install it.&lt;/P&gt;</description>
      <pubDate>Thu, 04 Feb 2016 17:32:23 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Scheduling-a-Python-script-in-OOZIE/m-p/117495#M16973</guid>
      <dc:creator>aervits</dc:creator>
      <dc:date>2016-02-04T17:32:23Z</dc:date>
    </item>
    <item>
      <title>Re: Scheduling a Python script in OOZIE</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Scheduling-a-Python-script-in-OOZIE/m-p/117496#M16974</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/393/aervits.html" nodeid="393"&gt;@Artem Ervits&lt;/A&gt; Oozie is running, with a green tick mark in Ambari. How do I execute the job using the UI?&lt;/P&gt;</description>
      <pubDate>Fri, 05 Feb 2016 08:14:59 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Scheduling-a-Python-script-in-OOZIE/m-p/117496#M16974</guid>
      <dc:creator>bidyut1</dc:creator>
      <dc:date>2016-02-05T08:14:59Z</dc:date>
    </item>
    <item>
      <title>Re: Scheduling a Python script in OOZIE</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Scheduling-a-Python-script-in-OOZIE/m-p/117497#M16975</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/1835/bidyut1.html" nodeid="1835"&gt;@Bidyut B&lt;/A&gt; unless you use Hue, there is no other UI for executing Oozie workflows. You need to use the shell to submit them.&lt;/P&gt;</description>
      <pubDate>Fri, 05 Feb 2016 08:15:27 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Scheduling-a-Python-script-in-OOZIE/m-p/117497#M16975</guid>
      <dc:creator>aervits</dc:creator>
      <dc:date>2016-02-05T08:15:27Z</dc:date>
    </item>
  </channel>
</rss>

