Hadoop streaming jar files in CDH5?

Contributor

I wanted to try out some Python on Hadoop in my CDH5 VM, but I need the streaming jar file.

 

According to the documentation here 

http://www.cloudera.com/content/cloudera-content/cloudera-docs/CDH5/latest/CDH5-Installation-Guide/c... 

 

the jar should be at this location:

/usr/lib/hadoop-0.20-mapreduce/contrib/streaming/hadoop-streaming-2.0.0-mr1-cdh<version>.jar

 

or maybe here

/usr/lib/hadoop-mapreduce/hadoop-streaming.jar

 

but I don't see it in either place. Is there a way to install additional components if they are missing?

1 ACCEPTED SOLUTION

Explorer

Hey,

 

I think the one you are looking for should be located here:

 

/usr/lib/hadoop-mapreduce/hadoop-streaming-2.3.0-cdh5.1.0.jar
/usr/lib/hadoop-mapreduce/hadoop-streaming.jar

 

 

Cheers

5 REPLIES

New Contributor

Hi

 

It seems that it has been moved to:

 

/opt/cloudera/parcels/CDH/lib/hadoop-mapreduce

 

Cheers
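
Since the jar turns up in different directories depending on how CDH was installed, a quick search over the locations mentioned in this thread can save some guessing. The following is only a minimal sketch: the directory list is taken from the posts above, and the helper name `find_streaming_jars` is made up for illustration. Other installs may keep the jar somewhere else entirely.

```python
import glob
import os

# Directories mentioned in this thread; whether any of them exists
# depends on the CDH version and on a package vs. parcel install.
CANDIDATE_DIRS = [
    "/usr/lib/hadoop-0.20-mapreduce/contrib/streaming",
    "/usr/lib/hadoop-mapreduce",
    "/opt/cloudera/parcels/CDH/lib/hadoop-mapreduce",
]


def find_streaming_jars(dirs=CANDIDATE_DIRS):
    """Return sorted paths of hadoop-streaming*.jar files in the given directories.

    Hypothetical helper: directories that do not exist are simply skipped,
    because glob returns no matches for them.
    """
    found = []
    for directory in dirs:
        pattern = os.path.join(directory, "hadoop-streaming*.jar")
        found.extend(glob.glob(pattern))
    return sorted(found)


if __name__ == "__main__":
    for path in find_streaming_jars():
        print(path)
```

If nothing is printed, the jar is probably not in any of these three directories on that machine.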

Contributor

I'm on the 5.3.0 VM now and was able to locate the jar in the folder you identified, thanks.

 

/usr/lib/hadoop-mapreduce/
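
Once the jar is located, a Python streaming job is normally launched by passing it to `hadoop jar`. The sketch below only builds the command line and does not run anything. The script names `mapper.py` and `reducer.py`, the input and output paths, and the helper `build_streaming_command` are all hypothetical placeholders; the options used (`-files`, `-input`, `-output`, `-mapper`, `-reducer`) are standard Hadoop Streaming options, but check them against the documentation for your CDH release.

```python
import os


def build_streaming_command(jar, input_path, output_path, mapper, reducer):
    """Build the argument list for a Hadoop Streaming job without running it.

    Hypothetical helper: mapper and reducer are local script paths that get
    shipped to the cluster via -files and are then invoked by file name.
    """
    return [
        "hadoop", "jar", jar,
        # Generic options such as -files must come before streaming options.
        "-files", f"{mapper},{reducer}",
        "-input", input_path,
        "-output", output_path,
        "-mapper", os.path.basename(mapper),
        "-reducer", os.path.basename(reducer),
    ]


if __name__ == "__main__":
    cmd = build_streaming_command(
        "/usr/lib/hadoop-mapreduce/hadoop-streaming.jar",  # path from this thread
        "input_dir",    # hypothetical HDFS input path
        "output_dir",   # hypothetical HDFS output path (must not already exist)
        "mapper.py",    # hypothetical mapper script
        "reducer.py",   # hypothetical reducer script
    )
    print(" ".join(cmd))
```

The resulting list can be passed to `subprocess.run(cmd)` on a machine where the `hadoop` command is available.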


Hi,

 

I recently downloaded the Cloudera QuickStart VM for VirtualBox and I am looking for the Hadoop streaming jar file so that I can do some Python streaming, but I am not able to find it. Please help me find this jar file.

 

Thanks


I found the location; it was the same one that dr3x mentioned. My bad, I didn't look properly.