Support Questions

When does the reduce operation actually start relative to the copy phase: after the copy phase completes, or while it is still in progress?

1 ACCEPTED SOLUTION

@suresh bonam One of the best answers is on Stack Overflow: http://stackoverflow.com/questions/11672676/when-d...

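Reducers start shuffling (copying map output) once a configurable percentage of mappers has finished, so the copy phase overlaps with the map phase. The reduce function itself runs only after all map output has been copied and merged. You can change the threshold parameter (mapreduce.job.reduce.slowstart.completedmaps in Hadoop 2 and later) to make reducers start sooner or later.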

Why is starting the reducers early a good thing? Because it spreads the data transfer from the mappers to the reducers out over time, which helps when your network is the bottleneck.
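
To make the threshold concrete, here is a minimal sketch in Python of how a slow-start fraction translates into a number of finished mappers. It is a simplified model and not Hadoop's actual scheduling code: the helper names `maps_needed_before_reducers` and `reducers_may_start` are hypothetical, and rounding the product up is an assumption. The property name and its default of 0.05 are the ones documented for Hadoop 2 and later.

```python
import math

# Property that holds the slow-start fraction in Hadoop 2 and later.
SLOWSTART_PROPERTY = "mapreduce.job.reduce.slowstart.completedmaps"
# Documented default: reducers may be scheduled after 5% of maps finish.
DEFAULT_SLOWSTART = 0.05


def maps_needed_before_reducers(total_maps: int,
                                slowstart: float = DEFAULT_SLOWSTART) -> int:
    """Hypothetical helper: how many maps must finish before reducers
    may be scheduled and begin copying map output.

    Assumption: the fraction is applied to the total map count and
    rounded up to a whole number of tasks.
    """
    if not 0.0 <= slowstart <= 1.0:
        raise ValueError("slowstart must be between 0.0 and 1.0")
    return math.ceil(slowstart * total_maps)


def reducers_may_start(completed_maps: int, total_maps: int,
                       slowstart: float = DEFAULT_SLOWSTART) -> bool:
    """Hypothetical helper: True once enough maps have completed."""
    return completed_maps >= maps_needed_before_reducers(total_maps, slowstart)
```

Under this model, a fraction of 1.0 means reducers wait until every map has finished, so no copying overlaps with the map phase, while a small fraction lets copying begin almost immediately. In both cases only the copy phase moves; the reduce function still waits for all map output.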

2 REPLIES

@Suresh Bonam Are you still having issues with this? Can you accept the best answer or provide your own solution?