<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: How to run Oozie Job with Python Script in Sandbox? in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-to-run-Oozie-Job-with-Python-Script-in-Sandbox/m-p/110152#M50705</link>
    <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/393/aervits.html" nodeid="393"&gt;@Artem Ervits&lt;/A&gt; ohh thank you so much .. How do you determine where the output of the job should be store/ how can you see the output to be sure it was what you're looking for &lt;/P&gt;</description>
    <pubDate>Fri, 06 Jan 2017 02:53:03 GMT</pubDate>
    <dc:creator>axel_robinson</dc:creator>
    <dc:date>2017-01-06T02:53:03Z</dc:date>
    <item>
      <title>How to run Oozie Job with Python Script in Sandbox?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-to-run-Oozie-Job-with-Python-Script-in-Sandbox/m-p/110148#M50701</link>
      <description>&lt;P&gt;&lt;A href="https://community.cloudera.com/legacyfs/online/attachments/11053-workflow.xml"&gt;workflow.xml

Attached is my current xml file in the sandbox.

I am new to Hadoop/Oozie so any detail explanation/instruction would be very helpful
&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 04 Jan 2017 01:56:45 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-to-run-Oozie-Job-with-Python-Script-in-Sandbox/m-p/110148#M50701</guid>
      <dc:creator>axel_robinson</dc:creator>
      <dc:date>2017-01-04T01:56:45Z</dc:date>
    </item>
    <item>
      <title>Re: How to run Oozie Job with Python Script in Sandbox?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-to-run-Oozie-Job-with-Python-Script-in-Sandbox/m-p/110149#M50702</link>
      <description>&lt;P&gt;I am mobile and can't comment on your workflows right now but I have example of python2 and python3 WF in my repo &lt;A href="https://github.com/dbist/oozie" target="_blank"&gt;https://github.com/dbist/oozie&lt;/A&gt;&lt;/P&gt;&lt;P&gt;Browse to oozie/apps/ and you will see their respective directories. Use as you wish.&lt;/P&gt;</description>
      <pubDate>Wed, 04 Jan 2017 08:55:54 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-to-run-Oozie-Job-with-Python-Script-in-Sandbox/m-p/110149#M50702</guid>
      <dc:creator>aervits</dc:creator>
      <dc:date>2017-01-04T08:55:54Z</dc:date>
    </item>
    <item>
      <title>Re: How to run Oozie Job with Python Script in Sandbox?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-to-run-Oozie-Job-with-Python-Script-in-Sandbox/m-p/110150#M50703</link>
      <description>&lt;P&gt;Thank you for your prompt reply.  But  I have a very basic question. Python should be install in the Sandbox? &lt;A href="https://community.cloudera.com/legacyfs/online/attachments/11131-apps-directory.png"&gt;apps-directory.png&lt;/A&gt; Currently my python scripts are placed in  /root/examples/apps/map-reduce/
But I am guessing there should be a python folder in  root/examples/apps/Python?
which contains the job.properties and workflow.xml  files in addition to the lib folder 

would be great if you could guide in which directory should I place the python script files&lt;/P&gt;</description>
      <pubDate>Thu, 05 Jan 2017 00:39:12 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-to-run-Oozie-Job-with-Python-Script-in-Sandbox/m-p/110150#M50703</guid>
      <dc:creator>axel_robinson</dc:creator>
      <dc:date>2017-01-05T00:39:12Z</dc:date>
    </item>
    <item>
      <title>Re: How to run Oozie Job with Python Script in Sandbox?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-to-run-Oozie-Job-with-Python-Script-in-Sandbox/m-p/110151#M50704</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/14806/axelrobinson.html" nodeid="14806"&gt;@justlearning&lt;/A&gt; same version of Python needs to be installed on every node that will run oozie containers (nodemanager). Same goes for any Python libraries you're importing into your script. I usually create the following tree&lt;/P&gt;&lt;PRE&gt;admin@u1201:~/oozie/apps/python$ tree
.
|-- job.properties
|-- scripts
|   `-- script.py
`-- workflow.xml


1 directory, 3 files

&lt;/PRE&gt;&lt;P&gt;so what you want is a workflow directory on hdfs with at least workflow.xml and optionally another directory within it with a Python script. job.properties file needs to be on your local filesystem. Then you would execute the oozie wf the following way:&lt;/P&gt;&lt;PRE&gt;oozie job -oozie &lt;A href="http://u1203.ambari.apache.org:11000/oozie" target="_blank"&gt;http://u1203.ambari.apache.org:11000/oozie&lt;/A&gt; -config oozie/apps/python/job.properties -run
&lt;/PRE&gt;</description>
      <pubDate>Thu, 05 Jan 2017 04:07:48 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-to-run-Oozie-Job-with-Python-Script-in-Sandbox/m-p/110151#M50704</guid>
      <dc:creator>aervits</dc:creator>
      <dc:date>2017-01-05T04:07:48Z</dc:date>
    </item>
    <item>
      <title>Re: How to run Oozie Job with Python Script in Sandbox?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-to-run-Oozie-Job-with-Python-Script-in-Sandbox/m-p/110152#M50705</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/393/aervits.html" nodeid="393"&gt;@Artem Ervits&lt;/A&gt; ohh thank you so much .. How do you determine where the output of the job should be store/ how can you see the output to be sure it was what you're looking for &lt;/P&gt;</description>
      <pubDate>Fri, 06 Jan 2017 02:53:03 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-to-run-Oozie-Job-with-Python-Script-in-Sandbox/m-p/110152#M50705</guid>
      <dc:creator>axel_robinson</dc:creator>
      <dc:date>2017-01-06T02:53:03Z</dc:date>
    </item>
    <item>
      <title>Re: How to run Oozie Job with Python Script in Sandbox?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-to-run-Oozie-Job-with-Python-Script-in-Sandbox/m-p/110153#M50706</link>
      <description>&lt;P&gt;You can learn that by following this guide &lt;A href="https://community.hortonworks.com/content/kbentry/9148/troubleshooting-an-oozie-flow.html" target="_blank"&gt;https://community.hortonworks.com/content/kbentry/9148/troubleshooting-an-oozie-flow.html&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 06 Jan 2017 21:05:56 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-to-run-Oozie-Job-with-Python-Script-in-Sandbox/m-p/110153#M50706</guid>
      <dc:creator>aervits</dc:creator>
      <dc:date>2017-01-06T21:05:56Z</dc:date>
    </item>
    <item>
      <title>Re: How to run Oozie Job with Python Script in Sandbox?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-to-run-Oozie-Job-with-Python-Script-in-Sandbox/m-p/110154#M50707</link>
      <description>&lt;P&gt;It's a good practice to accept answer if it satisfies your needs.&lt;/P&gt;</description>
      <pubDate>Sat, 07 Jan 2017 06:37:26 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-to-run-Oozie-Job-with-Python-Script-in-Sandbox/m-p/110154#M50707</guid>
      <dc:creator>aervits</dc:creator>
      <dc:date>2017-01-07T06:37:26Z</dc:date>
    </item>
  </channel>
</rss>

