Archives of Support Questions (Read Only)

This is an archived board for historical reference. Information and links may no longer be available or relevant
Announcements
This board is archived and read-only for historical reference. To ask a new question, please post a new topic on the appropriate active board.

which time actaul block size assign ? Is it provide at spliting time in Mapper or after MapReduce flow?

avatar
@Manish Gupta @Eric Hanson @Jay SenSharma
1 ACCEPTED SOLUTION

avatar
2 REPLIES 2

avatar

@Manish Kumar Yadav

You probably are looking at Hadoop block size vs split size.

Below is a nice read:

https://hadoopjournal.wordpress.com/2015/06/30/mapreduce-input-split-versus-hdfs-blocks/

avatar

@rbiswas yea Actually splitSize is equal to block size, so I want to know if we assign block size at SplitTime then job contains that memory for whole process or after reducer any changes;