Support Questions
Find answers, ask questions, and share your expertise
Announcements
Alert: Welcome to the Unified Cloudera Community. Former HCC members be sure to read and learn how to activate your account here.

Multiple output format in MapReduce Job

Multiple output format in MapReduce Job

Rising Star

Let's say I have one input file named input.txt and the content of the file is given below

Hadoop is good

Hortonworks makes the life easy

Hadoop is a framework

This input.txt contains only 3 lines . Now I need the count of the word "Hortonworks" and line number of occurrence in the input file . For this input file the count of "Hortonworks" is 1 and it is in the line number 2. I can find it running separate MapReduce job for each query. Can we find both queries output in one MapReduce Job ? I do not want to run two separate job for this purpose. It will be IO heat for billions of data.

2 REPLIES 2

Re: Multiple output format in MapReduce Job

New Contributor

@Arkaprova Saha

Great question. I think it's not possible ( hope i am not wrong :-) ). Please share if you already got the answer.

.

Re: Multiple output format in MapReduce Job

Rising Star

@Arun Thanks for your reply. I am not sure. I am also looking for solution.