Archives of Support Questions (Read Only)

This is an archived board for historical reference. Information and links may no longer be available or relevant
Announcements
This board is archived and read-only for historical reference. To ask a new question, please post a new topic on the appropriate active board.

Reduce program hosting location

avatar
New Member

Documentation says, map tasks are run in DataNodes and have data locality constraints which the scheduler tries to honor and reduce tasks can run anywhere in the cluster. The statement "can run anywhere in the cluster" for reduce tasks, is referring to only DataNodes in the cluster OR is ResourceManager machine also considered part of the cluster, so that Reduce tasks are allowed to run in ResourceManager also ??

1 ACCEPTED SOLUTION

avatar
New Member

reduce tasks "can run anywhere in the cluster" means on any of the node which has "Node Manager" installed on it

View solution in original post

3 REPLIES 3

avatar
New Member

reduce tasks "can run anywhere in the cluster" means on any of the node which has "Node Manager" installed on it

avatar
Master Guru
@Fasil Ahamed

"can run anywhere in the cluster" - Meaning anywhere where nodemanagers have been deployed, it can be any slave node or master node if nodemanager is installed on master nodes.

Hope this information helps!

Please do let me know if you have any further question.

avatar
Master Guru

@Fasil Ahamed - Can you please accept the appropriate answer?