Created 06-06-2021 11:01 PM
Hi team,
Could you please tell me why we segregate compute nodes from storage nodes in the Hadoop world? Doesn't this break the philosophy of data locality, since we then get only intra-rack or inter-rack locality rather than node-local data locality? Also, what hurdles did we face with the previous design, in which compute and storage ran on the same node?
Thanks
Created on 06-28-2021 06:35 AM - edited 06-28-2021 06:36 AM
@Faizan123 We are not segregating compute nodes and data nodes. "Compute node" refers to a node running the YARN NodeManager, where YARN containers process the data. "Data node" refers to a node running the HDFS DataNode, which stores the data. Both roles can run on a single node. When you submit a job, YARN tries to create the task containers on the node where the data is located.
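To make the placement preference concrete, here is a rough sketch in Python. It is only an illustration of the idea, not YARN's actual scheduler code: the function `pick_node`, the `rack_of` mapping, and the node names are all hypothetical, and the real schedulers also weigh queues, capacity, and locality delays. The three labels follow the locality levels YARN commonly reports (node-local, rack-local, off-switch).

```python
# Simplified illustration of locality preference; NOT YARN's real scheduler.
# pick_node, rack_of and the node/rack names below are hypothetical.

def pick_node(replica_nodes, free_nodes, rack_of):
    """Return (node, locality) for a task whose input block is on replica_nodes."""
    # 1. Node-local: a free NodeManager on a host that also stores the block.
    for node in free_nodes:
        if node in replica_nodes:
            return node, "NODE_LOCAL"
    # 2. Rack-local: a free node in the same rack as one of the replicas.
    replica_racks = {rack_of[n] for n in replica_nodes}
    for node in free_nodes:
        if rack_of[node] in replica_racks:
            return node, "RACK_LOCAL"
    # 3. Off-switch: any free node; the data has to cross racks.
    if free_nodes:
        return free_nodes[0], "OFF_SWITCH"
    return None, None


# Hypothetical four-node, two-rack cluster.
rack_of = {"n1": "r1", "n2": "r1", "n3": "r2", "n4": "r2"}
replicas = {"n1", "n3"}  # hosts holding a copy of the input block

print(pick_node(replicas, ["n2", "n1"], rack_of))  # n1 also stores the block
print(pick_node(replicas, ["n2"], rack_of))        # n2 shares rack r1 with n1
print(pick_node({"n1"}, ["n4"], rack_of))          # no shared host or rack
```

When the NodeManager and DataNode run on the same hosts, the first branch can succeed. If compute and storage are on separate hosts, only the second and third branches are reachable.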
Please let me know if you have any queries. Also mark "Accept as Solution" if my answer helps you!
Thanks
Shobika S
Created 06-28-2021 06:40 AM
Created 07-05-2021 09:28 AM
@Faizan123, has any of the replies helped resolve your issue? If so, can you kindly mark the appropriate reply as the solution, as it will make it easier for others to find the answer in the future?
Regards,
Vidya Sargur,
Created 07-06-2021 11:50 PM
Hi @Faizan123, I hope the replies provided by @Shelton or @shobikas have helped you resolve your issue. If so, can you kindly accept them as a solution?
Regards,
Vidya Sargur,