Created 03-22-2017 07:58 AM
i want to know internal representation of YARN and MAPREDUCE. i am new to hadoop. i am getting how exactly the jobs get executed.
Created 03-22-2017 11:26 PM
Hi @heta desai
The Application Master will launch one MapTask for each map split. Typically, there is a map split for each input file. If the input file is too big (bigger than the HDFS block size) then we have two or more map splits associated to the same input file.
Also the memory used fro map and reduce task is RAM of Nodemanagers.
Please refer to it for more details -
http://ercoppa.github.io/HadoopInternals/AnatomyMapReduceJob.html
Created 03-22-2017 11:26 PM
Hi @heta desai
The Application Master will launch one MapTask for each map split. Typically, there is a map split for each input file. If the input file is too big (bigger than the HDFS block size) then we have two or more map splits associated to the same input file.
Also the memory used fro map and reduce task is RAM of Nodemanagers.
Please refer to it for more details -
http://ercoppa.github.io/HadoopInternals/AnatomyMapReduceJob.html
Created 03-23-2017 05:10 AM
Thank you.
Created 03-23-2017 08:00 PM
Can you please accept my answer if it answered your question ? 🙂