Created 03-14-2016 04:41 AM
I have been practicing MapReduce on Ubuntu. Can somebody explain me, on Hortonworks Sandbox
1) How to write a MapReduce Program? Where is Eclipse? What Jars to be included in Build Path?
2) How to execute the MapReduce program?
Created 03-14-2016 04:43 AM
Created 03-14-2016 11:33 AM
You can follow this tutorial to setup an IDE with Sandbox https://github.com/hortonworks-gallery/ambari-vnc-service
Created 03-15-2016 05:04 AM
Thanks for your response. I have gone through the pdf.
1. In the pdf, there is one java program with driver, mapper and reduce programs. Did not mention how this program was written, using which IDE. Suppose, if I connect through VNC client and use Eclipse to write these driver, mapper and reducer programs, I will write them as 3 classes - 3 different files. Do I have to merge them to one file to run the job?
What is the general practice? people write using Eclipse or is there anyway to write using vi editor?
2. While writing these classes in Eclipse, what are the jar files I need to add to build path?
Created 03-15-2016 10:51 PM
You need to compile with hadoop-client and you can package all classes together
Created 08-18-2016 06:03 AM
Created 02-19-2018 05:41 PM
For people from 2018, you need to use `hadoop classpath` for specifying path to Hadoop classes (see https://community.hortonworks.com/questions/83006/java-compile-problem-for-hadoopdfsfilereadwritejav... instead of /usr/share/hadoop/hadoop -core -*.jar from tutorial document.