Support Questions
Find answers, ask questions, and share your expertise

Reduce program hosting location

New Contributor

Documentation says, map tasks are run in DataNodes and have data locality constraints which the scheduler tries to honor and reduce tasks can run anywhere in the cluster. The statement "can run anywhere in the cluster" for reduce tasks, is referring to only DataNodes in the cluster OR is ResourceManager machine also considered part of the cluster, so that Reduce tasks are allowed to run in ResourceManager also ??

1 ACCEPTED SOLUTION

Explorer

reduce tasks "can run anywhere in the cluster" means on any of the node which has "Node Manager" installed on it

View solution in original post

3 REPLIES 3

Explorer

reduce tasks "can run anywhere in the cluster" means on any of the node which has "Node Manager" installed on it

Super Guru
@Fasil Ahamed

"can run anywhere in the cluster" - Meaning anywhere where nodemanagers have been deployed, it can be any slave node or master node if nodemanager is installed on master nodes.

Hope this information helps!

Please do let me know if you have any further question.

Super Guru

@Fasil Ahamed - Can you please accept the appropriate answer?

Take a Tour of the Community
Don't have an account?
Your experience may be limited. Sign in to explore more.