Map intermediate data will be written and sorted on local disk before sending to the reducer machines.
You can reduce Map output
bq. Will i get any performance improvement if i increase io.sort.mb paramter when Map() task generates huge amount of data?
Yes (but impact may not be huge), you can use with io.sort.factor