Support Questions

Find answers, ask questions, and share your expertise

Hadoop streaming jar files in CDH5?

Explorer

Wanted to try out some python on hadoop in my CDH5 VM but I need the streaming jar file.

 

According to the documentation here 

http://www.cloudera.com/content/cloudera-content/cloudera-docs/CDH5/latest/CDH5-Installation-Guide/c... 

 

the jar should be at this location:

/usr/lib/hadoop-0.20-mapreduce/contrib/streaming/hadoop-streaming-2.0.0-mr1-cdh<version>.jar

 

or maybe here

/usr/lib/hadoop-mapreduce/hadoop-streaming.jar

 

but I don't see it at either place.  is there a way to install additional components if they are missing?

1 ACCEPTED SOLUTION

Explorer

Hey,

 

I guess the one you are looking for shall be located here

 

/usr/lib/hadoop-mapreduce/hadoop-streaming-2.3.0-cdh5.1.0.jar
/usr/lib/hadoop-mapreduce/hadoop-streaming.jar

 

 

Cheers

View solution in original post

5 REPLIES 5

Explorer

Hey,

 

I guess the one you are looking for shall be located here

 

/usr/lib/hadoop-mapreduce/hadoop-streaming-2.3.0-cdh5.1.0.jar
/usr/lib/hadoop-mapreduce/hadoop-streaming.jar

 

 

Cheers

New Contributor

Hi

 

It seems that it has been moved to

 

/opt/cloudera/parcels/CDH/lib/hadoop-mapreduce

 

Cheers

Explorer

I'm on the 5.3.0 VM now and was able to locate the jar in the folder you identified, thanks

 

/usr/lib/hadoop-mapreduce/

New Contributor

Hi,

 

I have recently downloaded cloudera quick start VM  for Virtual Box and I am looking for hadoop streaming jar file so that I can do some python streaming, but I am not able to find it. Please help me to get this jar file.

 

Thanks

New Contributor

I found the location, it was the same where dr3x mentioned. My bad, I didn't noticed properly.