Created on 07-17-2014 01:03 PM - edited 09-16-2022 02:02 AM
Wanted to try out some python on hadoop in my CDH5 VM but I need the streaming jar file.
According to the documentation here
the jar should be at this location:
/usr/lib/hadoop-0.20-mapreduce/contrib/streaming/hadoop-streaming-2.0.0-mr1-cdh<version>.jar
or maybe here
/usr/lib/hadoop-mapreduce/hadoop-streaming.jar
but I don't see it at either place. is there a way to install additional components if they are missing?
Created 08-03-2014 04:24 PM
Hey,
I guess the one you are looking for shall be located here
/usr/lib/hadoop-mapreduce/hadoop-streaming-2.3.0-cdh5.1.0.jar /usr/lib/hadoop-mapreduce/hadoop-streaming.jar
Cheers
Created 08-03-2014 04:24 PM
Hey,
I guess the one you are looking for shall be located here
/usr/lib/hadoop-mapreduce/hadoop-streaming-2.3.0-cdh5.1.0.jar /usr/lib/hadoop-mapreduce/hadoop-streaming.jar
Cheers
Created on 08-13-2014 04:48 AM - edited 08-13-2014 05:23 AM
Hi
It seems that it has been moved to
/opt/cloudera/parcels/CDH/lib/hadoop-mapreduce
Cheers
Created on 07-14-2015 11:25 AM - edited 07-14-2015 11:26 AM
I'm on the 5.3.0 VM now and was able to locate the jar in the folder you identified, thanks
/usr/lib/hadoop-mapreduce/
Created 04-19-2017 01:33 PM
Hi,
I have recently downloaded cloudera quick start VM for Virtual Box and I am looking for hadoop streaming jar file so that I can do some python streaming, but I am not able to find it. Please help me to get this jar file.
Thanks
Created 04-20-2017 06:26 AM
I found the location, it was the same where dr3x mentioned. My bad, I didn't noticed properly.