<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Python Streaming in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/Python-Streaming/m-p/125#M3539</link>
    <description>&lt;P&gt;I'm trying to use my local installation of Cloudera Quickstart VM to do a small mapreduce job in Python.&lt;/P&gt;&lt;P&gt;My test script works when I explicitly add python to the script:&lt;/P&gt;&lt;P&gt;&amp;nbsp; # cat inputfile.txt | python mymapper.py | sort | python myreducer.py&lt;/P&gt;&lt;P&gt;I need to add python to the path in the vm.&amp;nbsp; What's the best way to do this so it finds python from the command line and in Hadoop?&amp;nbsp; I haven't been successful trying to find and modify the right files in the Cloudera VM.&lt;/P&gt;&lt;P&gt;(I was able to run this on AWS.&amp;nbsp; I tried from the hadoop command line also:&lt;/P&gt;&lt;P&gt;hadoop jar /usr/lib/hadoop-0.20-mapreduce/contrib/streaming/hadoop-streaming-2.0.0-mr1-cdh4.3.0.jar \&lt;/P&gt;&lt;P&gt;-input inputfile.txt \&lt;/P&gt;&lt;P&gt;-output output010 \&lt;/P&gt;&lt;P&gt;-mapper mymapper.py \&lt;/P&gt;&lt;P&gt;-file mymapper.py \&lt;/P&gt;&lt;P&gt;-combiner myreducer.py \&lt;/P&gt;&lt;P&gt;-reducer myreducer.py \&lt;/P&gt;&lt;P&gt;-file myreducer.py&lt;/P&gt;&lt;P&gt;&amp;nbsp;... and it fails)&lt;/P&gt;&lt;P&gt;Any help to get the&amp;nbsp; right would be appreciated.&lt;/P&gt;&lt;P&gt;thanks,&lt;/P&gt;&lt;P&gt;jp&lt;/P&gt;</description>
    <pubDate>Fri, 16 Sep 2022 08:46:22 GMT</pubDate>
    <dc:creator>bigpalooka</dc:creator>
    <dc:date>2022-09-16T08:46:22Z</dc:date>
    <item>
      <title>Python Streaming</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Python-Streaming/m-p/125#M3539</link>
      <description>&lt;P&gt;I'm trying to use my local installation of Cloudera Quickstart VM to do a small mapreduce job in Python.&lt;/P&gt;&lt;P&gt;My test script works when I explicitly add python to the script:&lt;/P&gt;&lt;P&gt;&amp;nbsp; # cat inputfile.txt | python mymapper.py | sort | python myreducer.py&lt;/P&gt;&lt;P&gt;I need to add python to the path in the vm.&amp;nbsp; What's the best way to do this so it finds python from the command line and in Hadoop?&amp;nbsp; I haven't been successful trying to find and modify the right files in the Cloudera VM.&lt;/P&gt;&lt;P&gt;(I was able to run this on AWS.&amp;nbsp; I tried from the hadoop command line also:&lt;/P&gt;&lt;P&gt;hadoop jar /usr/lib/hadoop-0.20-mapreduce/contrib/streaming/hadoop-streaming-2.0.0-mr1-cdh4.3.0.jar \&lt;/P&gt;&lt;P&gt;-input inputfile.txt \&lt;/P&gt;&lt;P&gt;-output output010 \&lt;/P&gt;&lt;P&gt;-mapper mymapper.py \&lt;/P&gt;&lt;P&gt;-file mymapper.py \&lt;/P&gt;&lt;P&gt;-combiner myreducer.py \&lt;/P&gt;&lt;P&gt;-reducer myreducer.py \&lt;/P&gt;&lt;P&gt;-file myreducer.py&lt;/P&gt;&lt;P&gt;&amp;nbsp;... and it fails)&lt;/P&gt;&lt;P&gt;Any help to get the&amp;nbsp; right would be appreciated.&lt;/P&gt;&lt;P&gt;thanks,&lt;/P&gt;&lt;P&gt;jp&lt;/P&gt;</description>
      <pubDate>Fri, 16 Sep 2022 08:46:22 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Python-Streaming/m-p/125#M3539</guid>
      <dc:creator>bigpalooka</dc:creator>
      <dc:date>2022-09-16T08:46:22Z</dc:date>
    </item>
    <item>
      <title>Re: Python Streaming</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Python-Streaming/m-p/143#M3540</link>
      <description>&lt;P&gt;jp,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Try inserting the header "#!/usr/bin/env python" as the first line in your scripts. This signals to the operating system that your scripts are executable through Python. If you do this in your local example (and do "chmod +x *.py"), it works without having to add python to the script:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;cat inputfile.txt | ./mymapper.py | sort | ./myreducer.py&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Copy the modified files back into HDFS and MapReduce will now be able to execute your mappers and reducers.&lt;/P&gt;</description>
      <pubDate>Thu, 01 Aug 2013 17:07:05 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Python-Streaming/m-p/143#M3540</guid>
      <dc:creator>Sean</dc:creator>
      <dc:date>2013-08-01T17:07:05Z</dc:date>
    </item>
    <item>
      <title>Re: Python Streaming</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Python-Streaming/m-p/149#M3541</link>
      <description>&lt;P&gt;Thanks.&amp;nbsp; I rebooted, reconstructed new files and again tried both #!/usr/bin/env python&amp;nbsp; and&amp;nbsp; #!/usr/bin/python&amp;nbsp; and changed permissions to include -x .&lt;/P&gt;&lt;P&gt;I'm making it through the file, mymapper, and sort, but I'm getting "no such file or directory" when I pipe it to&amp;nbsp; ./myreducer.py&lt;/P&gt;&lt;P&gt;But when I explicitly add "python" as the executable it works.&lt;/P&gt;&lt;P&gt;I'm guessing this is some obvious newbie issue (new to linux) but I should have this in the bag by now.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;jp&lt;/P&gt;</description>
      <pubDate>Thu, 01 Aug 2013 20:43:20 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Python-Streaming/m-p/149#M3541</guid>
      <dc:creator>bigpalooka</dc:creator>
      <dc:date>2013-08-01T20:43:20Z</dc:date>
    </item>
    <item>
      <title>Re: Python Streaming</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Python-Streaming/m-p/151#M3542</link>
      <description>It sounds like you may have a typo in one of the file paths.&lt;BR /&gt;&lt;BR /&gt;If you see something similar to "bash: ./myreducer.py: No such file or directory" your typo is in the path or filename of the reducer script.&lt;BR /&gt;&lt;BR /&gt;But if you see "bad interpreter" in the error, it means the path you're using to point to python is incorrect.&lt;BR /&gt;&lt;BR /&gt;If you have a hard time finding a typo, try copy / pasting the output of "ls -l", your exact command and the exact output of that command, and possibly your scripts as well. In the Linux terminal windows, Ctrl + Shift + C and Ctrl + Shift + V can be used to copy and paste.</description>
      <pubDate>Thu, 01 Aug 2013 21:04:00 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Python-Streaming/m-p/151#M3542</guid>
      <dc:creator>Sean</dc:creator>
      <dc:date>2013-08-01T21:04:00Z</dc:date>
    </item>
    <item>
      <title>Re: Python Streaming</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Python-Streaming/m-p/159#M3543</link>
      <description>&lt;P&gt;JP,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;One other thought, which may be off track, but since I can't see the command-line data that Sean has mentioned, I'm just guessing, is that you might want to check the permissions on the reducer.py script. &amp;nbsp;In order for it to accept the pipe and execute the sorted data as input, it must be executable. &amp;nbsp;You can assure it is executable by issuing a "chmod 755 reducer.py" on the file.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;HTH,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Clint&lt;/P&gt;</description>
      <pubDate>Fri, 02 Aug 2013 04:08:34 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Python-Streaming/m-p/159#M3543</guid>
      <dc:creator>Clint</dc:creator>
      <dc:date>2013-08-02T04:08:34Z</dc:date>
    </item>
    <item>
      <title>Re: Python Streaming</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Python-Streaming/m-p/161#M3544</link>
      <description>&lt;P&gt;I renamed my mapper and reducer to jpm.py and jpr.py to make sure my spelling is right.&amp;nbsp; The reducer part of the "cat" doesn't work unless it's preceeded by "python".&amp;nbsp; Then it completes successfully.&lt;/P&gt;&lt;P&gt;In hadoop map-reduce, from the command line, I've gotten the process to complete, but it yields no results.&amp;nbsp; I reduced the reducer functionality to just pass on what comes from the mapper.&amp;nbsp; It completes, but doesn't yield any results in the output (file size = 0).&amp;nbsp; I removed the reducer completely and I get what I expect from the mapper.&lt;/P&gt;&lt;P&gt;I'd like to progress to the gui's and get a taste of pig and hive in cloudera by the end of the month.&amp;nbsp; I think I'm going to try all over again with a fresh vm.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 02 Aug 2013 04:22:34 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Python-Streaming/m-p/161#M3544</guid>
      <dc:creator>bigpalooka</dc:creator>
      <dc:date>2013-08-02T04:22:34Z</dc:date>
    </item>
    <item>
      <title>Re: Python Streaming</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Python-Streaming/m-p/163#M3545</link>
      <description>&lt;P&gt;thanks - I did this through the properties screen of the file browser, but I tried it again with the command you supplied.&amp;nbsp; still no luck - the process completes, but outputs nothing, even with a plain vanilla reducer (echoing the mapper output).&lt;/P&gt;&lt;P&gt;jp&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 02 Aug 2013 04:30:36 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Python-Streaming/m-p/163#M3545</guid>
      <dc:creator>bigpalooka</dc:creator>
      <dc:date>2013-08-02T04:30:36Z</dc:date>
    </item>
    <item>
      <title>Re: Python Streaming</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Python-Streaming/m-p/165#M3546</link>
      <description>&lt;P&gt;Odd. &amp;nbsp;I take it you're doing something in your reducer that's smart about reading the "standard input" that's being piped to it? &amp;nbsp;Something like:&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;PRE&gt;for line in sys.stdin:&lt;/PRE&gt;
&lt;P&gt;&amp;nbsp;Also, as Sean indicated, if we could get pastes of your source code and also the actual command-line output/errors you are seeing, that would round out the picture for us.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thanks,&lt;/P&gt;
&lt;P&gt;&lt;BR /&gt;Clint&lt;/P&gt;</description>
      <pubDate>Fri, 02 Aug 2013 04:52:06 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Python-Streaming/m-p/165#M3546</guid>
      <dc:creator>Clint</dc:creator>
      <dc:date>2013-08-02T04:52:06Z</dc:date>
    </item>
    <item>
      <title>Re: Python Streaming</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Python-Streaming/m-p/228#M3547</link>
      <description>&lt;P&gt;It took me a while to figure out.&amp;nbsp; I just got it a minute ago.&amp;nbsp;&lt;/P&gt;&lt;P&gt;I was running scripts that I developed in Windows (where end-of-line = cr+lf).&amp;nbsp; I needed to strip out the "cr" so the python interpreter in Linux wouldn't be looking for /usr/bin/env python/r, but /usr/bin/env python.&amp;nbsp;&lt;/P&gt;&lt;P&gt;Now I can move on.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;jp&lt;/P&gt;</description>
      <pubDate>Wed, 07 Aug 2013 00:36:23 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Python-Streaming/m-p/228#M3547</guid>
      <dc:creator>bigpalooka</dc:creator>
      <dc:date>2013-08-07T00:36:23Z</dc:date>
    </item>
    <item>
      <title>Re: Python Streaming</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Python-Streaming/m-p/230#M3548</link>
      <description>&lt;P&gt;Ouch, the old Windows-Linux end-of-line character conversion problem strikes again! &amp;nbsp;Thanks for closing the loop with us, jp, glad it's resolved!&lt;/P&gt;</description>
      <pubDate>Wed, 07 Aug 2013 02:09:40 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Python-Streaming/m-p/230#M3548</guid>
      <dc:creator>Clint</dc:creator>
      <dc:date>2013-08-07T02:09:40Z</dc:date>
    </item>
    <item>
      <title>Re: Python Streaming</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Python-Streaming/m-p/300#M3549</link>
      <description>&lt;P&gt;Nice!&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;For information: editing the script directly in the File Browser in Hue does this cleaning too!&lt;/P&gt;</description>
      <pubDate>Thu, 08 Aug 2013 18:40:12 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Python-Streaming/m-p/300#M3549</guid>
      <dc:creator>Romainr</dc:creator>
      <dc:date>2013-08-08T18:40:12Z</dc:date>
    </item>
    <item>
      <title>Re: Python Streaming</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Python-Streaming/m-p/521#M3550</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I have a similar problem. I wrote a simple mapper and reducer to read input file and calculate total number of lines.&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;This works great locally&lt;/P&gt;&lt;P&gt;cat access.log | ./linecount_mapper.py | ./linecount_reduce.py&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Same input files and scripts, when used in streaming returns this error message. Any suggestions?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;java.lang.RuntimeException: Error in configuring object&lt;BR /&gt;at org.apache.hadoop.util.ReflectionUtils.setJobConf(ReflectionUtils.java:106)&lt;BR /&gt;at org.apache.hadoop.util.ReflectionUtils.setConf(ReflectionUtils.java:72)&lt;BR /&gt;at org.apache.hadoop.util.ReflectionUtils.newInstance(ReflectionUtils.java:130)&lt;BR /&gt;at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:413)&lt;BR /&gt;at org.apache.hadoop.mapred.MapTask.run(MapTask.java:332)&lt;BR /&gt;at org.apache.hadoop.mapred.Child$4.run(Child.java:268)&lt;BR /&gt;at java.security.AccessController.doPrivileged(Native Method)&lt;BR /&gt;at javax.security.auth.Subject.doAs(Subject.java:396)&lt;BR /&gt;at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1408)&lt;BR /&gt;at org.apache.hadoop.mapred.Child.main(Child.java:262)&lt;BR /&gt;Caused by: java.lang.reflect.InvocationTargetException&lt;BR /&gt;at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)&lt;BR /&gt;at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)&lt;BR /&gt;at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.ja&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks,&lt;/P&gt;&lt;P&gt;Charmee&lt;/P&gt;</description>
      <pubDate>Wed, 14 Aug 2013 20:12:18 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Python-Streaming/m-p/521#M3550</guid>
      <dc:creator>charmeep</dc:creator>
      <dc:date>2013-08-14T20:12:18Z</dc:date>
    </item>
    <item>
      <title>Re: Python Streaming</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Python-Streaming/m-p/525#M3551</link>
      <description>&lt;P&gt;This is how I invoke the mapreduce job&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;hadoop jar /usr/lib/hadoop-mapreduce/hadoop-streaming-2.0.0-cdh4.3.0.jar&lt;/P&gt;&lt;P&gt;-input /user/certification/sandbox/access.log -output /user/certification/sandboxout -mapper /user/certification/sandbox/linecount_mapper.py -reducer /user/certification/sandbox/linecount_reduce.py&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I also tried using&amp;nbsp;&lt;SPAN&gt;hadoop-streaming.jar, it gives me the same error as well.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Any suggestions are greatly appreciated.&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 14 Aug 2013 20:36:52 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Python-Streaming/m-p/525#M3551</guid>
      <dc:creator>charmeep</dc:creator>
      <dc:date>2013-08-14T20:36:52Z</dc:date>
    </item>
    <item>
      <title>Re: Python Streaming</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Python-Streaming/m-p/17582#M3552</link>
      <description>&lt;P&gt;Hi Folks,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Can anyone help me here as well.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I also get same error as follow:&lt;/P&gt;&lt;PRE&gt; /usr/lib/hadoop/bin/hadoop jar /usr/lib/hadoop-mapreduce/hadoop-streaming-2.3.0-cdh5.1.0.jar -files /tmp/mapper.py,/tmp/reducer.py -mapper /tmp/mapper.py -reducer /tmp/reducer.py -input gutenberg/4300.txt -output output2
packageJobJar: [] [/usr/lib/hadoop-mapreduce/hadoop-streaming-2.3.0-cdh5.1.0.jar] /tmp/streamjob725052303650188667.jar tmpDir=null
14/08/26 02:44:06 INFO client.RMProxy: Connecting to ResourceManager at hdmachine1.example.com/128.243.29.224:8032
14/08/26 02:44:06 INFO client.RMProxy: Connecting to ResourceManager at hdmachine1.example.com/128.243.29.224:8032
14/08/26 02:44:06 WARN security.UserGroupInformation: PriviledgedActionException as:hduser (auth:SIMPLE) cause:org.apache.hadoop.mapred.FileAlreadyExistsException: Output directory hdfs://hdmachine1.example.com:8020/user/hduser/output2 already exists
14/08/26 02:44:06 WARN security.UserGroupInformation: PriviledgedActionException as:hduser (auth:SIMPLE) cause:org.apache.hadoop.mapred.FileAlreadyExistsException: Output directory hdfs://hdmachine1.example.com:8020/user/hduser/output2 already exists
14/08/26 02:44:06 ERROR streaming.StreamJob: Error Launching job : Output directory hdfs://hdmachine1.example.com:8020/user/hduser/output2 already exists
Streaming Command Failed!
[hduser@hdmachine1 ~]$ /usr/lib/hadoop/bin/hadoop jar /usr/lib/hadoop-mapreduce/hadoop-streaming-2.3.0-cdh5.1.0.jar -files /tmp/mapper.py,/tmp/reducer.py -mapper /tmp/mapper.py -reducer /tmp/reducer.py -input gutenberg/4300.txt -output op-1
packageJobJar: [] [/usr/lib/hadoop-mapreduce/hadoop-streaming-2.3.0-cdh5.1.0.jar] /tmp/streamjob6895399468399805454.jar tmpDir=null
14/08/26 02:44:21 INFO client.RMProxy: Connecting to ResourceManager at hdmachine1.example.com/128.243.29.224:8032
14/08/26 02:44:21 INFO client.RMProxy: Connecting to ResourceManager at hdmachine1.example.com/128.243.29.224:8032
14/08/26 02:44:22 INFO lzo.GPLNativeCodeLoader: Loaded native gpl library
14/08/26 02:44:22 INFO lzo.LzoCodec: Successfully loaded &amp;amp; initialized native-lzo library [hadoop-lzo rev 8e266e052e423af592871e2dfe09d54c03f6a0e8]
14/08/26 02:44:22 INFO mapred.FileInputFormat: Total input paths to process : 1
14/08/26 02:44:22 INFO mapreduce.JobSubmitter: number of splits:2
14/08/26 02:44:23 INFO mapreduce.JobSubmitter: Submitting tokens for job: job_1409004459008_0013
14/08/26 02:44:23 INFO impl.YarnClientImpl: Submitted application application_1409004459008_0013
14/08/26 02:44:23 INFO mapreduce.Job: The url to track the job: &lt;A href="http://hdmachine1.example.com:8088/proxy/application_1409004459008_0013/" target="_blank"&gt;http://hdmachine1.example.com:8088/proxy/application_1409004459008_0013/&lt;/A&gt;
14/08/26 02:44:23 INFO mapreduce.Job: Running job: job_1409004459008_0013
14/08/26 02:44:27 INFO mapreduce.Job: Job job_1409004459008_0013 running in uber mode : false
14/08/26 02:44:27 INFO mapreduce.Job:  map 0% reduce 0%
14/08/26 02:44:30 INFO mapreduce.Job: Task Id : attempt_1409004459008_0013_m_000000_0, Status : FAILED
Error: java.lang.RuntimeException: Error in configuring object
	at org.apache.hadoop.util.ReflectionUtils.setJobConf(ReflectionUtils.java:109)
	at org.apache.hadoop.util.ReflectionUtils.setConf(ReflectionUtils.java:75)
	at org.apache.hadoop.util.ReflectionUtils.newInstance(ReflectionUtils.java:133)
	at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:426)
	at org.apache.hadoop.mapred.MapTask.run(MapTask.java:342)
	at org.apache.hadoop.mapred.YarnChild$2.run(YarnChild.java:167)
	at java.security.AccessController.doPrivileged(Native Method)
	at javax.security.auth.Subject.doAs(Subject.java:415)
	at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1554)
	at org.apache.hadoop.mapred.YarnChild.main(YarnChild.java:162)
Caused by: java.lang.reflect.InvocationTargetException
	at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
	at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
	at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
	at java.lang.reflect.Method.invoke(Method.java:606)
	at org.apache.hadoop.util.ReflectionUtils.setJobConf(ReflectionUtils.java:106)
	... 9 more
Caused by: java.lang.RuntimeException: Error in configuring object
	at org.apache.hadoop.util.ReflectionUtils.setJobConf(ReflectionUtils.java:109)
	at org.apache.hadoop.util.ReflectionUtils.setConf(ReflectionUtils.java:75)
	at org.apache.hadoop.util.ReflectionUtils.newInstance(ReflectionUtils.java:133)
	at org.apache.hadoop.mapred.MapRunner.configure(MapRunner.java:38)
	... 14 more
Caused by: java.lang.reflect.InvocationTargetException
	at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
	at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
	at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
	at java.lang.reflect.Method.invoke(Method.java:606)
	at org.apache.hadoop.util.ReflectionUtils.setJobConf(ReflectionUtils.java:106)
	... 17 more
Caused by: java.lang.RuntimeException: configuration exception
	at org.apache.hadoop.streaming.PipeMapRed.configure(PipeMapRed.java:222)
	at org.apache.hadoop.streaming.PipeMapper.configure(PipeMapper.java:66)
	... 22 more
Caused by: java.io.IOException: Cannot run program "/tmp/mapper.py": error=2, No such file or directory
	at java.lang.ProcessBuilder.start(ProcessBuilder.java:1047)
	at org.apache.hadoop.streaming.PipeMapRed.configure(PipeMapRed.java:209)
	... 23 more
Caused by: java.io.IOException: error=2, No such file or directory
	at java.lang.UNIXProcess.forkAndExec(Native Method)
	at java.lang.UNIXProcess.&amp;lt;init&amp;gt;(UNIXProcess.java:186)
	at java.lang.ProcessImpl.start(ProcessImpl.java:130)
	at java.lang.ProcessBuilder.start(ProcessBuilder.java:1028)
	... 24 more

14/08/26 02:44:31 INFO mapreduce.Job: Task Id : attempt_1409004459008_0013_m_000001_0, Status : FAILED
Error: java.lang.RuntimeException: Error in configuring object
	at org.apache.hadoop.util.ReflectionUtils.setJobConf(ReflectionUtils.java:109)
	at org.apache.hadoop.util.ReflectionUtils.setConf(ReflectionUtils.java:75)
	at org.apache.hadoop.util.ReflectionUtils.newInstance(ReflectionUtils.java:133)
	at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:426)
	at org.apache.hadoop.mapred.MapTask.run(MapTask.java:342)
	at org.apache.hadoop.mapred.YarnChild$2.run(YarnChild.java:167)
	at java.security.AccessController.doPrivileged(Native Method)
	at javax.security.auth.Subject.doAs(Subject.java:415)
	at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1554)
	at org.apache.hadoop.mapred.YarnChild.main(YarnChild.java:162)
Caused by: java.lang.reflect.InvocationTargetException
	at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
	at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
	at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
	at java.lang.reflect.Method.invoke(Method.java:606)
	at org.apache.hadoop.util.ReflectionUtils.setJobConf(ReflectionUtils.java:106)
	... 9 more
Caused by: java.lang.RuntimeException: Error in configuring object
	at org.apache.hadoop.util.ReflectionUtils.setJobConf(ReflectionUtils.java:109)
	at org.apache.hadoop.util.ReflectionUtils.setConf(ReflectionUtils.java:75)
	at org.apache.hadoop.util.ReflectionUtils.newInstance(ReflectionUtils.java:133)
	at org.apache.hadoop.mapred.MapRunner.configure(MapRunner.java:38)
	... 14 more
Caused by: java.lang.reflect.InvocationTargetException
	at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
	at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
	at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
	at java.lang.reflect.Method.invoke(Method.java:606)
	at org.apache.hadoop.util.ReflectionUtils.setJobConf(ReflectionUtils.java:106)
	... 17 more
Caused by: java.lang.RuntimeException: configuration exception
	at org.apache.hadoop.streaming.PipeMapRed.configure(PipeMapRed.java:222)
	at org.apache.hadoop.streaming.PipeMapper.configure(PipeMapper.java:66)
	... 22 more
Caused by: java.io.IOException: Cannot run program "/tmp/mapper.py": error=2, No such file or directory
	at java.lang.ProcessBuilder.start(ProcessBuilder.java:1047)
	at org.apache.hadoop.streaming.PipeMapRed.configure(PipeMapRed.java:209)
	... 23 more
Caused by: java.io.IOException: error=2, No such file or directory
	at java.lang.UNIXProcess.forkAndExec(Native Method)
	at java.lang.UNIXProcess.&amp;lt;init&amp;gt;(UNIXProcess.java:186)
	at java.lang.ProcessImpl.start(ProcessImpl.java:130)
	at java.lang.ProcessBuilder.start(ProcessBuilder.java:1028)
	... 24 more

14/08/26 02:44:34 INFO mapreduce.Job: Task Id : attempt_1409004459008_0013_m_000001_1, Status : FAILED
Error: java.lang.RuntimeException: Error in configuring object
	at org.apache.hadoop.util.ReflectionUtils.setJobConf(ReflectionUtils.java:109)
	at org.apache.hadoop.util.ReflectionUtils.setConf(ReflectionUtils.java:75)
	at org.apache.hadoop.util.ReflectionUtils.newInstance(ReflectionUtils.java:133)
	at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:426)
	at org.apache.hadoop.mapred.MapTask.run(MapTask.java:342)
	at org.apache.hadoop.mapred.YarnChild$2.run(YarnChild.java:167)
	at java.security.AccessController.doPrivileged(Native Method)
	at javax.security.auth.Subject.doAs(Subject.java:415)
	at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1554)
	at org.apache.hadoop.mapred.YarnChild.main(YarnChild.java:162)
Caused by: java.lang.reflect.InvocationTargetException
	at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
	at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
	at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
	at java.lang.reflect.Method.invoke(Method.java:606)
	at org.apache.hadoop.util.ReflectionUtils.setJobConf(ReflectionUtils.java:106)
	... 9 more
Caused by: java.lang.RuntimeException: Error in configuring object
	at org.apache.hadoop.util.ReflectionUtils.setJobConf(ReflectionUtils.java:109)
	at org.apache.hadoop.util.ReflectionUtils.setConf(ReflectionUtils.java:75)
	at org.apache.hadoop.util.ReflectionUtils.newInstance(ReflectionUtils.java:133)
	at org.apache.hadoop.mapred.MapRunner.configure(MapRunner.java:38)
	... 14 more
Caused by: java.lang.reflect.InvocationTargetException
	at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
	at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
	at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
	at java.lang.reflect.Method.invoke(Method.java:606)
	at org.apache.hadoop.util.ReflectionUtils.setJobConf(ReflectionUtils.java:106)
	... 17 more
Caused by: java.lang.RuntimeException: configuration exception
	at org.apache.hadoop.streaming.PipeMapRed.configure(PipeMapRed.java:222)
	at org.apache.hadoop.streaming.PipeMapper.configure(PipeMapper.java:66)
	... 22 more
Caused by: java.io.IOException: Cannot run program "/tmp/mapper.py": error=2, No such file or directory
	at java.lang.ProcessBuilder.start(ProcessBuilder.java:1047)
	at org.apache.hadoop.streaming.PipeMapRed.configure(PipeMapRed.java:209)
	... 23 more
Caused by: java.io.IOException: error=2, No such file or directory
	at java.lang.UNIXProcess.forkAndExec(Native Method)
	at java.lang.UNIXProcess.&amp;lt;init&amp;gt;(UNIXProcess.java:186)
	at java.lang.ProcessImpl.start(ProcessImpl.java:130)
	at java.lang.ProcessBuilder.start(ProcessBuilder.java:1028)
	... 24 more

14/08/26 02:44:35 INFO mapreduce.Job: Task Id : attempt_1409004459008_0013_m_000000_1, Status : FAILED
Error: java.lang.RuntimeException: Error in configuring object
	at org.apache.hadoop.util.ReflectionUtils.setJobConf(ReflectionUtils.java:109)
	at org.apache.hadoop.util.ReflectionUtils.setConf(ReflectionUtils.java:75)
	at org.apache.hadoop.util.ReflectionUtils.newInstance(ReflectionUtils.java:133)
	at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:426)
	at org.apache.hadoop.mapred.MapTask.run(MapTask.java:342)
	at org.apache.hadoop.mapred.YarnChild$2.run(YarnChild.java:167)
	at java.security.AccessController.doPrivileged(Native Method)
	at javax.security.auth.Subject.doAs(Subject.java:415)
	at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1554)
	at org.apache.hadoop.mapred.YarnChild.main(YarnChild.java:162)
Caused by: java.lang.reflect.InvocationTargetException
	at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
	at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
	at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
	at java.lang.reflect.Method.invoke(Method.java:606)
	at org.apache.hadoop.util.ReflectionUtils.setJobConf(ReflectionUtils.java:106)
	... 9 more
Caused by: java.lang.RuntimeException: Error in configuring object
	at org.apache.hadoop.util.ReflectionUtils.setJobConf(ReflectionUtils.java:109)
	at org.apache.hadoop.util.ReflectionUtils.setConf(ReflectionUtils.java:75)
	at org.apache.hadoop.util.ReflectionUtils.newInstance(ReflectionUtils.java:133)
	at org.apache.hadoop.mapred.MapRunner.configure(MapRunner.java:38)
	... 14 more
Caused by: java.lang.reflect.InvocationTargetException
	at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
	at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
	at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
	at java.lang.reflect.Method.invoke(Method.java:606)
	at org.apache.hadoop.util.ReflectionUtils.setJobConf(ReflectionUtils.java:106)
	... 17 more
Caused by: java.lang.RuntimeException: configuration exception
	at org.apache.hadoop.streaming.PipeMapRed.configure(PipeMapRed.java:222)
	at org.apache.hadoop.streaming.PipeMapper.configure(PipeMapper.java:66)
	... 22 more
Caused by: java.io.IOException: Cannot run program "/tmp/mapper.py": error=2, No such file or directory
	at java.lang.ProcessBuilder.start(ProcessBuilder.java:1047)
	at org.apache.hadoop.streaming.PipeMapRed.configure(PipeMapRed.java:209)
	... 23 more
Caused by: java.io.IOException: error=2, No such file or directory
	at java.lang.UNIXProcess.forkAndExec(Native Method)
	at java.lang.UNIXProcess.&amp;lt;init&amp;gt;(UNIXProcess.java:186)
	at java.lang.ProcessImpl.start(ProcessImpl.java:130)
	at java.lang.ProcessBuilder.start(ProcessBuilder.java:1028)
	... 24 more

14/08/26 02:44:38 INFO mapreduce.Job: Task Id : attempt_1409004459008_0013_m_000001_2, Status : FAILED
Error: java.lang.RuntimeException: Error in configuring object
	at org.apache.hadoop.util.ReflectionUtils.setJobConf(ReflectionUtils.java:109)
	at org.apache.hadoop.util.ReflectionUtils.setConf(ReflectionUtils.java:75)
	at org.apache.hadoop.util.ReflectionUtils.newInstance(ReflectionUtils.java:133)
	at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:426)
	at org.apache.hadoop.mapred.MapTask.run(MapTask.java:342)
	at org.apache.hadoop.mapred.YarnChild$2.run(YarnChild.java:167)
	at java.security.AccessController.doPrivileged(Native Method)
	at javax.security.auth.Subject.doAs(Subject.java:415)
	at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1554)
	at org.apache.hadoop.mapred.YarnChild.main(YarnChild.java:162)
Caused by: java.lang.reflect.InvocationTargetException
	at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
	at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
	at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
	at java.lang.reflect.Method.invoke(Method.java:606)
	at org.apache.hadoop.util.ReflectionUtils.setJobConf(ReflectionUtils.java:106)
	... 9 more
Caused by: java.lang.RuntimeException: Error in configuring object
	at org.apache.hadoop.util.ReflectionUtils.setJobConf(ReflectionUtils.java:109)
	at org.apache.hadoop.util.ReflectionUtils.setConf(ReflectionUtils.java:75)
	at org.apache.hadoop.util.ReflectionUtils.newInstance(ReflectionUtils.java:133)
	at org.apache.hadoop.mapred.MapRunner.configure(MapRunner.java:38)
	... 14 more
Caused by: java.lang.reflect.InvocationTargetException
	at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
	at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
	at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
	at java.lang.reflect.Method.invoke(Method.java:606)
	at org.apache.hadoop.util.ReflectionUtils.setJobConf(ReflectionUtils.java:106)
	... 17 more
Caused by: java.lang.RuntimeException: configuration exception
	at org.apache.hadoop.streaming.PipeMapRed.configure(PipeMapRed.java:222)
	at org.apache.hadoop.streaming.PipeMapper.configure(PipeMapper.java:66)
	... 22 more
Caused by: java.io.IOException: Cannot run program "/tmp/mapper.py": error=2, No such file or directory
	at java.lang.ProcessBuilder.start(ProcessBuilder.java:1047)
	at org.apache.hadoop.streaming.PipeMapRed.configure(PipeMapRed.java:209)
	... 23 more
Caused by: java.io.IOException: error=2, No such file or directory
	at java.lang.UNIXProcess.forkAndExec(Native Method)
	at java.lang.UNIXProcess.&amp;lt;init&amp;gt;(UNIXProcess.java:186)
	at java.lang.ProcessImpl.start(ProcessImpl.java:130)
	at java.lang.ProcessBuilder.start(ProcessBuilder.java:1028)
	... 24 more

14/08/26 02:44:39 INFO mapreduce.Job: Task Id : attempt_1409004459008_0013_m_000000_2, Status : FAILED
Error: java.lang.RuntimeException: Error in configuring object
	at org.apache.hadoop.util.ReflectionUtils.setJobConf(ReflectionUtils.java:109)
	at org.apache.hadoop.util.ReflectionUtils.setConf(ReflectionUtils.java:75)
	at org.apache.hadoop.util.ReflectionUtils.newInstance(ReflectionUtils.java:133)
	at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:426)
	at org.apache.hadoop.mapred.MapTask.run(MapTask.java:342)
	at org.apache.hadoop.mapred.YarnChild$2.run(YarnChild.java:167)
	at java.security.AccessController.doPrivileged(Native Method)
	at javax.security.auth.Subject.doAs(Subject.java:415)
	at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1554)
	at org.apache.hadoop.mapred.YarnChild.main(YarnChild.java:162)
Caused by: java.lang.reflect.InvocationTargetException
	at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
	at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
	at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
	at java.lang.reflect.Method.invoke(Method.java:606)
	at org.apache.hadoop.util.ReflectionUtils.setJobConf(ReflectionUtils.java:106)
	... 9 more
Caused by: java.lang.RuntimeException: Error in configuring object
	at org.apache.hadoop.util.ReflectionUtils.setJobConf(ReflectionUtils.java:109)
	at org.apache.hadoop.util.ReflectionUtils.setConf(ReflectionUtils.java:75)
	at org.apache.hadoop.util.ReflectionUtils.newInstance(ReflectionUtils.java:133)
	at org.apache.hadoop.mapred.MapRunner.configure(MapRunner.java:38)
	... 14 more
Caused by: java.lang.reflect.InvocationTargetException
	at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
	at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
	at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
	at java.lang.reflect.Method.invoke(Method.java:606)
	at org.apache.hadoop.util.ReflectionUtils.setJobConf(ReflectionUtils.java:106)
	... 17 more
Caused by: java.lang.RuntimeException: configuration exception
	at org.apache.hadoop.streaming.PipeMapRed.configure(PipeMapRed.java:222)
	at org.apache.hadoop.streaming.PipeMapper.configure(PipeMapper.java:66)
	... 22 more
Caused by: java.io.IOException: Cannot run program "/tmp/mapper.py": error=2, No such file or directory
	at java.lang.ProcessBuilder.start(ProcessBuilder.java:1047)
	at org.apache.hadoop.streaming.PipeMapRed.configure(PipeMapRed.java:209)
	... 23 more
Caused by: java.io.IOException: error=2, No such file or directory
	at java.lang.UNIXProcess.forkAndExec(Native Method)
	at java.lang.UNIXProcess.&amp;lt;init&amp;gt;(UNIXProcess.java:186)
	at java.lang.ProcessImpl.start(ProcessImpl.java:130)
	at java.lang.ProcessBuilder.start(ProcessBuilder.java:1028)
	... 24 more

14/08/26 02:44:43 INFO mapreduce.Job:  map 100% reduce 100%
14/08/26 02:44:44 INFO mapreduce.Job: Job job_1409004459008_0013 failed with state FAILED due to: Task failed task_1409004459008_0013_m_000000
Job failed as tasks failed. failedMaps:1 failedReduces:0

14/08/26 02:44:44 INFO mapreduce.Job: Counters: 13
	Job Counters 
		Failed map tasks=7
		Killed map tasks=1
		Launched map tasks=8
		Other local map tasks=6
		Rack-local map tasks=2
		Total time spent by all maps in occupied slots (ms)=17357
		Total time spent by all reduces in occupied slots (ms)=0
		Total time spent by all map tasks (ms)=17357
		Total vcore-seconds taken by all map tasks=17357
		Total megabyte-seconds taken by all map tasks=17773568
	Map-Reduce Framework
		CPU time spent (ms)=0
		Physical memory (bytes) snapshot=0
		Virtual memory (bytes) snapshot=0
14/08/26 02:44:44 ERROR streaming.StreamJob: Job not Successful!
Streaming Command Failed!&lt;/PRE&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I tried to make the following from all previous posts I saw&amp;nbsp;&lt;/P&gt;&lt;P&gt;1- replace &amp;nbsp; #!/usr/bin/env python3.2 &amp;nbsp;to &amp;nbsp;#!/usr/bin/python3.2&lt;/P&gt;&lt;P&gt;2- I copied the mapper.py and reducer.py to /tmp and made both scripts with 777 ermissions&lt;/P&gt;&lt;P&gt;3- I restarted all hadoop servies&lt;/P&gt;&lt;P&gt;4- I used both streaming jar files &amp;nbsp;/usr/lib/hadoop-mapreduce/hadoop-streaming-2.3.0-cdh5.1.0.jar &amp;nbsp;and&amp;nbsp;/usr/lib/hadoop-mapreduce/hadoop-streaming.jar &amp;nbsp; but they both gave same Error message&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Here is the command I use again and I will appreciate if someone could explain what the hell is wrong with this&lt;/P&gt;&lt;PRE&gt;$ /usr/lib/hadoop/bin/hadoop jar /usr/lib/hadoop-mapreduce/hadoop-streaming-2.3.0-cdh5.1.0.jar -files /tmp/mapper.py,/tmp/reducer.py -mapper /tmp/mapper.py -reducer /tmp/reducer.py -input gutenberg/4300.txt -output output2&lt;/PRE&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 26 Aug 2014 01:48:58 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Python-Streaming/m-p/17582#M3552</guid>
      <dc:creator>ahegazi</dc:creator>
      <dc:date>2014-08-26T01:48:58Z</dc:date>
    </item>
    <item>
      <title>Re: Python Streaming</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Python-Streaming/m-p/17888#M3553</link>
      <description>The bottom of the stack trace says "Caused by: java.io.IOException: Cannot run program "/tmp/mapper.py": error=2, No such file or directory". How many nodes are there in this cluster? Have you copied the mapper.py program to all the nodes? It also needs to be executable (chmod 755 mapper.py)</description>
      <pubDate>Sat, 30 Aug 2014 03:56:12 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Python-Streaming/m-p/17888#M3553</guid>
      <dc:creator>GautamG</dc:creator>
      <dc:date>2014-08-30T03:56:12Z</dc:date>
    </item>
    <item>
      <title>Re: Python Streaming</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Python-Streaming/m-p/17930#M3554</link>
      <description>&lt;P&gt;Thanks GutamG for your reply,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;The cluster has 10 DataNodes and 1 NameNode&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;I didn't copy the scripts to all nodes, as I expect which is normal that -files option shall copy them to the HDFS where it is by default reachable via all nodes.&amp;nbsp;&lt;/SPAN&gt;I am sure it is executable, I even made it 777.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 01 Sep 2014 03:04:18 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Python-Streaming/m-p/17930#M3554</guid>
      <dc:creator>ahegazi</dc:creator>
      <dc:date>2014-09-01T03:04:18Z</dc:date>
    </item>
    <item>
      <title>Re: Python Streaming</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Python-Streaming/m-p/17932#M3555</link>
      <description>&lt;P&gt;Please refer to &lt;A target="_blank" href="http://hadoop.apache.org/docs/current/hadoop-mapreduce-client/hadoop-mapreduce-client-core/HadoopStreaming.html"&gt;http://hadoop.apache.org/docs/current/hadoop-mapreduce-client/hadoop-mapreduce-client-core/HadoopStreaming.html&lt;/A&gt; which shows the correct usage. Use -file for each file to be copied across.&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;For example, try:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;$ /usr/lib/hadoop/bin/hadoop jar /usr/lib/hadoop-mapreduce/hadoop-streaming-2.3.0-cdh5.1.0.jar -file /tmp/mapper.py -file /tmp/reducer.py -mapper /tmp/mapper.py -reducer /tmp/reducer.py -input gutenberg/4300.txt -output output&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Edit: hyperlink and space char in command&lt;/P&gt;</description>
      <pubDate>Mon, 01 Sep 2014 03:35:37 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Python-Streaming/m-p/17932#M3555</guid>
      <dc:creator>GautamG</dc:creator>
      <dc:date>2014-09-01T03:35:37Z</dc:date>
    </item>
    <item>
      <title>Re: Python Streaming</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Python-Streaming/m-p/53990#M3556</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I have similar problem, my python code is working fine when I am running it locally using cat command, but it is not working fine when I am running it on Hadoop. Please find below my code, error, command to run the program and permission on my files:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Permission on my files:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;-rwxrwxr-x 1 cloudera cloudera 690 Apr 20 14:15 flight_mapper.py&lt;BR /&gt;-rw-r--r-- 1 cloudera cloudera 2865221 Apr 19 08:21 flight_records.csv&lt;BR /&gt;-rwxrwxr-x 1 cloudera cloudera 501 Apr 19 13:39 flight_reducer.py&lt;BR /&gt;-rwxrwxrwx 1 cloudera cloudera 1349 Apr 21 06:20 framework.py&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Framework.py code:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;#! /usr/bin/env python&lt;BR /&gt;import os&lt;BR /&gt;import sys&lt;/P&gt;&lt;P&gt;from itertools import groupby&lt;BR /&gt;from operator import itemgetter&lt;/P&gt;&lt;P&gt;separator = "\t"&lt;/P&gt;&lt;P&gt;class Streaming(object):&lt;/P&gt;&lt;P&gt;@staticmethod&lt;BR /&gt;def GetJobConf(name):&lt;BR /&gt;name = name.replace(".","_").upper()&lt;BR /&gt;return os.environ.get(name)&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;def __init__(self,infile=sys.stdin,separator = separator):&lt;BR /&gt;self.infile = infile&lt;BR /&gt;self.sep = separator&lt;/P&gt;&lt;P&gt;def Status(self,message):&lt;BR /&gt;sys.stderr.write("reporter:status:{}\n".format(message))&lt;/P&gt;&lt;P&gt;def Counter(self,counter,amount=1,group="Python Streaming"):&lt;BR /&gt;msg = "reporter:counter:{0},{1},{2}\n".format(group,counter,amount)&lt;BR /&gt;sys.stderr.write(msg)&lt;/P&gt;&lt;P&gt;def Emit(self,key,value):&lt;BR /&gt;sys.stdout.write("{0}{1}{2}\n".format(key,self.sep,value))&lt;/P&gt;&lt;P&gt;def Read(self):&lt;BR /&gt;for line in self.infile:&lt;BR /&gt;yield line.rstrip()&lt;/P&gt;&lt;P&gt;def __iter__(self):&lt;BR /&gt;for line in self.Read():&lt;BR /&gt;yield line&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;class Mapper(Streaming):&lt;/P&gt;&lt;P&gt;def Map(self):&lt;BR /&gt;raise NotImplementedError("Mapper must implement a Map method")&lt;/P&gt;&lt;P&gt;class Reducer(Streaming):&lt;/P&gt;&lt;P&gt;def Reduce(self):&lt;BR /&gt;raise NotImplementedError("Reducer must implement a Reduce method")&lt;/P&gt;&lt;P&gt;def __iter__(self):&lt;BR /&gt;generator = (line.split(self.sep,1) for line in self.Read())&lt;BR /&gt;for item in groupby(generator,itemgetter(0)):&lt;BR /&gt;yield item&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;flight_mapper.py&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;#! /usr/bin/env python&lt;/P&gt;&lt;P&gt;import sys&lt;BR /&gt;import csv&lt;BR /&gt;from framework import Mapper&lt;/P&gt;&lt;P&gt;class FlightMapper(Mapper):&lt;BR /&gt;def __init__(self,infile=sys.stdin,separator='\t'):&lt;BR /&gt;super(FlightMapper,self).__init__(infile,separator)&lt;BR /&gt;&lt;BR /&gt;def Map(self):&lt;BR /&gt;reader = csv.reader(self)&lt;BR /&gt;for row in reader:&lt;BR /&gt;if len(row[3].strip()) == 0:&lt;BR /&gt;continue&lt;BR /&gt;if len(row[6].strip()) == 0:&lt;BR /&gt;row[6] = 0&lt;BR /&gt;self.Emit(row[3],row[6])&lt;BR /&gt;else:&lt;BR /&gt;sys.stdout.write("{0}\t{1}\n").format(row[3],row[6])&lt;BR /&gt;self.Emit(row[3],row[6])&lt;/P&gt;&lt;P&gt;if __name__ == '__main__':&lt;BR /&gt;mapper = FlightMapper(sys.stdin)&lt;BR /&gt;mapper.Map()&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;flight_reducer.py:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;#! /usr/bin/env python&lt;BR /&gt;import sys&lt;/P&gt;&lt;P&gt;from framework import Reducer&lt;BR /&gt;from itertools import groupby&lt;BR /&gt;from operator import itemgetter&lt;/P&gt;&lt;P&gt;class FlightReducer(Reducer):&lt;/P&gt;&lt;P&gt;def Reduce(self):&lt;BR /&gt;for key, val in self:&lt;BR /&gt;total = 0.0&lt;BR /&gt;count = 0&lt;BR /&gt;for item in val:&lt;BR /&gt;total += float(item[1])&lt;BR /&gt;count += 1&lt;BR /&gt;self.Emit(key,float(total)/float(count))&lt;/P&gt;&lt;P&gt;if __name__ == '__main__':&lt;BR /&gt;reducer = FlightReducer(sys.stdin)&lt;BR /&gt;reducer.Reduce()&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Error log:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;2017-04-21 06:34:14,341 INFO [main] org.apache.hadoop.metrics2.impl.MetricsConfig: loaded properties from hadoop-metrics2.properties&lt;BR /&gt;2017-04-21 06:34:14,411 INFO [main] org.apache.hadoop.metrics2.impl.MetricsSystemImpl: Scheduled snapshot period at 10 second(s).&lt;BR /&gt;2017-04-21 06:34:14,411 INFO [main] org.apache.hadoop.metrics2.impl.MetricsSystemImpl: MapTask metrics system started&lt;BR /&gt;2017-04-21 06:34:14,420 INFO [main] org.apache.hadoop.mapred.YarnChild: Executing with tokens:&lt;BR /&gt;2017-04-21 06:34:14,420 INFO [main] org.apache.hadoop.mapred.YarnChild: Kind: mapreduce.job, Service: job_1492704251350_0012, Ident: (org.apache.hadoop.mapreduce.security.token.JobTokenIdentifier@492e5810)&lt;BR /&gt;2017-04-21 06:34:14,496 INFO [main] org.apache.hadoop.mapred.YarnChild: Sleeping for 0ms before retrying again. Got null now.&lt;BR /&gt;2017-04-21 06:34:14,761 INFO [main] org.apache.hadoop.mapred.YarnChild: mapreduce.cluster.local.dir for child: /yarn/nm/usercache/cloudera/appcache/application_1492704251350_0012&lt;BR /&gt;2017-04-21 06:34:15,329 INFO [main] org.apache.hadoop.conf.Configuration.deprecation: session.id is deprecated. Instead, use dfs.metrics.session-id&lt;BR /&gt;2017-04-21 06:34:15,751 INFO [main] org.apache.hadoop.mapreduce.lib.output.FileOutputCommitter: File Output Committer Algorithm version is 1&lt;BR /&gt;2017-04-21 06:34:15,765 INFO [main] org.apache.hadoop.mapred.Task: Using ResourceCalculatorProcessTree : [ ]&lt;BR /&gt;2017-04-21 06:34:15,955 INFO [main] org.apache.hadoop.mapred.MapTask: Processing split: hdfs://quickstart.cloudera:8020/user/cloudera/hadoop_practicals_input/flight_records.csv:1432610+1432611&lt;BR /&gt;2017-04-21 06:34:15,982 INFO [main] org.apache.hadoop.mapred.MapTask: numReduceTasks: 1&lt;BR /&gt;2017-04-21 06:34:15,996 INFO [main] org.apache.hadoop.mapred.MapTask: (EQUATOR) 0 kvi 4194300(16777200)&lt;BR /&gt;2017-04-21 06:34:15,996 INFO [main] org.apache.hadoop.mapred.MapTask: mapreduce.task.io.sort.mb: 16&lt;BR /&gt;2017-04-21 06:34:15,996 INFO [main] org.apache.hadoop.mapred.MapTask: soft limit at 13421773&lt;BR /&gt;2017-04-21 06:34:15,996 INFO [main] org.apache.hadoop.mapred.MapTask: bufstart = 0; bufvoid = 16777216&lt;BR /&gt;2017-04-21 06:34:15,997 INFO [main] org.apache.hadoop.mapred.MapTask: kvstart = 4194300; length = 1048576&lt;BR /&gt;2017-04-21 06:34:16,000 INFO [main] org.apache.hadoop.mapred.MapTask: Map output collector class = org.apache.hadoop.mapred.MapTask$MapOutputBuffer&lt;BR /&gt;2017-04-21 06:34:16,010 INFO [main] org.apache.hadoop.streaming.PipeMapRed: PipeMapRed exec [/usr/bin/python, ./flight_mapper.py]&lt;BR /&gt;2017-04-21 06:34:16,016 INFO [main] org.apache.hadoop.conf.Configuration.deprecation: mapred.task.id is deprecated. Instead, use mapreduce.task.attempt.id&lt;BR /&gt;2017-04-21 06:34:16,016 INFO [main] org.apache.hadoop.conf.Configuration.deprecation: mapred.task.is.map is deprecated. Instead, use mapreduce.task.ismap&lt;BR /&gt;2017-04-21 06:34:16,017 INFO [main] org.apache.hadoop.conf.Configuration.deprecation: mapred.skip.on is deprecated. Instead, use mapreduce.job.skiprecords&lt;BR /&gt;2017-04-21 06:34:16,018 INFO [main] org.apache.hadoop.conf.Configuration.deprecation: mapred.local.dir is deprecated. Instead, use mapreduce.cluster.local.dir&lt;BR /&gt;2017-04-21 06:34:16,020 INFO [main] org.apache.hadoop.conf.Configuration.deprecation: map.input.file is deprecated. Instead, use mapreduce.map.input.file&lt;BR /&gt;2017-04-21 06:34:16,020 INFO [main] org.apache.hadoop.conf.Configuration.deprecation: mapred.cache.localFiles is deprecated. Instead, use mapreduce.job.cache.local.files&lt;BR /&gt;2017-04-21 06:34:16,020 INFO [main] org.apache.hadoop.conf.Configuration.deprecation: mapred.job.id is deprecated. Instead, use mapreduce.job.id&lt;BR /&gt;2017-04-21 06:34:16,021 INFO [main] org.apache.hadoop.conf.Configuration.deprecation: map.input.start is deprecated. Instead, use mapreduce.map.input.start&lt;BR /&gt;2017-04-21 06:34:16,022 INFO [main] org.apache.hadoop.conf.Configuration.deprecation: mapred.tip.id is deprecated. Instead, use mapreduce.task.id&lt;BR /&gt;2017-04-21 06:34:16,023 INFO [main] org.apache.hadoop.conf.Configuration.deprecation: mapred.task.partition is deprecated. Instead, use mapreduce.task.partition&lt;BR /&gt;2017-04-21 06:34:16,024 INFO [main] org.apache.hadoop.conf.Configuration.deprecation: map.input.length is deprecated. Instead, use mapreduce.map.input.length&lt;BR /&gt;2017-04-21 06:34:16,025 INFO [main] org.apache.hadoop.conf.Configuration.deprecation: job.local.dir is deprecated. Instead, use mapreduce.job.local.dir&lt;BR /&gt;2017-04-21 06:34:16,025 INFO [main] org.apache.hadoop.conf.Configuration.deprecation: mapred.work.output.dir is deprecated. Instead, use mapreduce.task.output.dir&lt;BR /&gt;2017-04-21 06:34:16,049 INFO [main] org.apache.hadoop.streaming.PipeMapRed: R/W/S=1/0/0 in:NA [rec/s] out:NA [rec/s]&lt;BR /&gt;2017-04-21 06:34:16,049 INFO [main] org.apache.hadoop.streaming.PipeMapRed: R/W/S=10/0/0 in:NA [rec/s] out:NA [rec/s]&lt;BR /&gt;2017-04-21 06:34:16,050 INFO [main] org.apache.hadoop.streaming.PipeMapRed: R/W/S=100/0/0 in:NA [rec/s] out:NA [rec/s]&lt;BR /&gt;2017-04-21 06:34:16,063 INFO [Thread-14] org.apache.hadoop.streaming.PipeMapRed: MRErrorThread done&lt;BR /&gt;2017-04-21 06:34:16,068 INFO [main] org.apache.hadoop.streaming.PipeMapRed: R/W/S=1000/0/0 in:NA [rec/s] out:NA [rec/s]&lt;BR /&gt;2017-04-21 06:34:16,076 INFO [main] org.apache.hadoop.streaming.PipeMapRed: R/W/S=1751/0/0 in:NA [rec/s] out:NA [rec/s]&lt;BR /&gt;minRecWrittenToEnableSkip_=9223372036854775807 HOST=null&lt;BR /&gt;USER=cloudera&lt;BR /&gt;HADOOP_USER=null&lt;BR /&gt;last tool output: |null|&lt;/P&gt;&lt;P&gt;java.io.IOException: Stream closed&lt;BR /&gt;at java.lang.ProcessBuilder$NullOutputStream.write(ProcessBuilder.java:434)&lt;BR /&gt;at java.io.OutputStream.write(OutputStream.java:116)&lt;BR /&gt;at java.io.BufferedOutputStream.write(BufferedOutputStream.java:122)&lt;BR /&gt;at java.io.BufferedOutputStream.flushBuffer(BufferedOutputStream.java:82)&lt;BR /&gt;at java.io.BufferedOutputStream.write(BufferedOutputStream.java:126)&lt;BR /&gt;at java.io.DataOutputStream.write(DataOutputStream.java:107)&lt;BR /&gt;at org.apache.hadoop.streaming.io.TextInputWriter.writeUTF8(TextInputWriter.java:72)&lt;BR /&gt;at org.apache.hadoop.streaming.io.TextInputWriter.writeValue(TextInputWriter.java:51)&lt;BR /&gt;at org.apache.hadoop.streaming.PipeMapper.map(PipeMapper.java:106)&lt;BR /&gt;at org.apache.hadoop.mapred.MapRunner.run(MapRunner.java:54)&lt;BR /&gt;at org.apache.hadoop.streaming.PipeMapRunner.run(PipeMapRunner.java:34)&lt;BR /&gt;at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:453)&lt;BR /&gt;at org.apache.hadoop.mapred.MapTask.run(MapTask.java:343)&lt;BR /&gt;at org.apache.hadoop.mapred.YarnChild$2.run(YarnChild.java:164)&lt;BR /&gt;at java.security.AccessController.doPrivileged(Native Method)&lt;BR /&gt;at javax.security.auth.Subject.doAs(Subject.java:415)&lt;BR /&gt;at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1693)&lt;BR /&gt;at org.apache.hadoop.mapred.YarnChild.main(YarnChild.java:158)&lt;BR /&gt;2017-04-21 06:34:16,079 WARN [main] org.apache.hadoop.streaming.PipeMapRed: java.io.IOException: Stream closed&lt;BR /&gt;2017-04-21 06:34:16,079 INFO [main] org.apache.hadoop.streaming.PipeMapRed: PipeMapRed.waitOutputThreads(): subprocess exited with code 1 in org.apache.hadoop.streaming.PipeMapRed&lt;BR /&gt;2017-04-21 06:34:16,079 INFO [main] org.apache.hadoop.streaming.PipeMapRed: mapRedFinished&lt;BR /&gt;2017-04-21 06:34:16,079 WARN [main] org.apache.hadoop.streaming.PipeMapRed: java.io.IOException: Stream closed&lt;BR /&gt;2017-04-21 06:34:16,079 INFO [main] org.apache.hadoop.streaming.PipeMapRed: PipeMapRed.waitOutputThreads(): subprocess exited with code 1 in org.apache.hadoop.streaming.PipeMapRed&lt;BR /&gt;2017-04-21 06:34:16,079 INFO [main] org.apache.hadoop.streaming.PipeMapRed: mapRedFinished&lt;BR /&gt;2017-04-21 06:34:16,079 WARN [main] org.apache.hadoop.security.UserGroupInformation: PriviledgedActionException as:cloudera (auth:SIMPLE) cause:java.io.IOException: Stream closed&lt;BR /&gt;2017-04-21 06:34:16,079 WARN [main] org.apache.hadoop.mapred.YarnChild: Exception running child : java.io.IOException: Stream closed&lt;BR /&gt;at java.lang.ProcessBuilder$NullOutputStream.write(ProcessBuilder.java:434)&lt;BR /&gt;at java.io.OutputStream.write(OutputStream.java:116)&lt;BR /&gt;at java.io.BufferedOutputStream.write(BufferedOutputStream.java:122)&lt;BR /&gt;at java.io.BufferedOutputStream.flushBuffer(BufferedOutputStream.java:82)&lt;BR /&gt;at java.io.BufferedOutputStream.write(BufferedOutputStream.java:126)&lt;BR /&gt;at java.io.DataOutputStream.write(DataOutputStream.java:107)&lt;BR /&gt;at org.apache.hadoop.streaming.io.TextInputWriter.writeUTF8(TextInputWriter.java:72)&lt;BR /&gt;at org.apache.hadoop.streaming.io.TextInputWriter.writeValue(TextInputWriter.java:51)&lt;BR /&gt;at org.apache.hadoop.streaming.PipeMapper.map(PipeMapper.java:106)&lt;BR /&gt;at org.apache.hadoop.mapred.MapRunner.run(MapRunner.java:54)&lt;BR /&gt;at org.apache.hadoop.streaming.PipeMapRunner.run(PipeMapRunner.java:34)&lt;BR /&gt;at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:453)&lt;BR /&gt;at org.apache.hadoop.mapred.MapTask.run(MapTask.java:343)&lt;BR /&gt;at org.apache.hadoop.mapred.YarnChild$2.run(YarnChild.java:164)&lt;BR /&gt;at java.security.AccessController.doPrivileged(Native Method)&lt;BR /&gt;at javax.security.auth.Subject.doAs(Subject.java:415)&lt;BR /&gt;at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1693)&lt;BR /&gt;at org.apache.hadoop.mapred.YarnChild.main(YarnChild.java:158)&lt;/P&gt;&lt;P&gt;2017-04-21 06:34:16,085 INFO [main] org.apache.hadoop.mapred.Task: Runnning cleanup for the task&lt;BR /&gt;2017-04-21 06:34:16,090 WARN [main] org.apache.hadoop.mapreduce.lib.output.FileOutputCommitter: Could not delete hdfs://quickstart.cloudera:8020/user/cloudera/average_delay/_temporary/1/_temporary/attempt_1492704251350_0012_m_000000_0&lt;BR /&gt;2017-04-21 06:34:16,094 INFO [main] org.apache.hadoop.metrics2.impl.MetricsSystemImpl: Stopping MapTask metrics system...&lt;BR /&gt;2017-04-21 06:34:16,094 INFO [main] org.apache.hadoop.metrics2.impl.MetricsSystemImpl: MapTask metrics system stopped.&lt;BR /&gt;2017-04-21 06:34:16,094 INFO [main] org.apache.hadoop.metrics2.impl.MetricsSystemImpl: MapTask metrics system shutdown complete.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Command:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;hadoop jar /usr/lib/hadoop-mapreduce/hadoop-streaming.jar -Dstream.non.zero.exit.is.failure=false -input 'hadoop_practicals_input/flight_records.csv' -output average_delay -mapper 'python ./flight_mapper.py' -reducer 'python ./flight_reducer.py' -file ./flight_mapper.py -file ./flight_reducer.py -file ./framework.py&lt;/P&gt;</description>
      <pubDate>Fri, 21 Apr 2017 13:50:19 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Python-Streaming/m-p/53990#M3556</guid>
      <dc:creator>adityahadoop</dc:creator>
      <dc:date>2017-04-21T13:50:19Z</dc:date>
    </item>
    <item>
      <title>Re: Python Streaming</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Python-Streaming/m-p/76965#M3557</link>
      <description>&lt;P&gt;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/55"&gt;@Sean&lt;/a&gt;,@Clint,&lt;BR /&gt;Can we use mrjob library to execute the mapreduce python code in cloudera quickstart vm ?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Vidya&lt;/P&gt;</description>
      <pubDate>Sun, 15 Jul 2018 19:43:18 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Python-Streaming/m-p/76965#M3557</guid>
      <dc:creator>Vidya821</dc:creator>
      <dc:date>2018-07-15T19:43:18Z</dc:date>
    </item>
  </channel>
</rss>

