Support Questions

Find answers, ask questions, and share your expertise
Announcements
Celebrating as our community reaches 100,000 members! Thank you!

Reduce program hosting location

avatar
Explorer

Documentation says, map tasks are run in DataNodes and have data locality constraints which the scheduler tries to honor and reduce tasks can run anywhere in the cluster. The statement "can run anywhere in the cluster" for reduce tasks, is referring to only DataNodes in the cluster OR is ResourceManager machine also considered part of the cluster, so that Reduce tasks are allowed to run in ResourceManager also ??

1 ACCEPTED SOLUTION

avatar
Contributor

reduce tasks "can run anywhere in the cluster" means on any of the node which has "Node Manager" installed on it

View solution in original post

3 REPLIES 3

avatar
Contributor

reduce tasks "can run anywhere in the cluster" means on any of the node which has "Node Manager" installed on it

avatar
Master Guru
@Fasil Ahamed

"can run anywhere in the cluster" - Meaning anywhere where nodemanagers have been deployed, it can be any slave node or master node if nodemanager is installed on master nodes.

Hope this information helps!

Please do let me know if you have any further question.

avatar
Master Guru

@Fasil Ahamed - Can you please accept the appropriate answer?