Created on 07-17-2014 01:03 PM - edited 09-16-2022 02:02 AM
I wanted to try out some Python on Hadoop in my CDH5 VM, but I need the streaming jar file.
According to the documentation here
the jar should be at this location:
/usr/lib/hadoop-0.20-mapreduce/contrib/streaming/hadoop-streaming-2.0.0-mr1-cdh<version>.jar
or possibly at this one:
/usr/lib/hadoop-mapreduce/hadoop-streaming.jar
but I don't see it in either place. Is there a way to install additional components if they are missing?
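
In case the jar is installed somewhere other than the two paths above, here is a minimal sketch for searching a directory tree for it. The function name `find_streaming_jars`, the default root `/usr/lib`, and the glob pattern `hadoop-streaming*.jar` are my own choices for illustration and do not come from the documentation.

```python
import fnmatch
import os


def find_streaming_jars(root="/usr/lib", pattern="hadoop-streaming*.jar"):
    """Return sorted paths under `root` whose file name matches `pattern`.

    Both defaults are assumptions made for this sketch, not documented values.
    """
    matches = []
    # By default os.walk skips directories it cannot list instead of raising.
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in fnmatch.filter(filenames, pattern):
            matches.append(os.path.join(dirpath, name))
    return sorted(matches)


if __name__ == "__main__":
    jars = find_streaming_jars()
    if jars:
        print("\n".join(jars))
    else:
        print("no hadoop-streaming jar found under /usr/lib")
```

If this prints nothing, the jar is probably not installed under `/usr/lib` at all; passing a different `root` (for example `/opt`) widens the search.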