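The input format (more precisely, its RecordReader) ensures records are
intact before they are passed to the map function. As far as I know, nothing
is shipped to the mapper's machine ahead of time: a split boundary is just a
byte offset, so the reader for each split skips the partial record at its
start (unless the split begins at offset 0) and reads past the end of its
split to finish its last record, fetching those bytes from the next block,
over the network if that block lives on another machine. So the overwhelming
majority of the data is processed in place, but a line or so per block may
still be read remotely. Tom White's book Hadoop: The Definitive Guide does a
good job of covering details like this.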
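
To make the boundary handling concrete, here is a minimal Python sketch of how I understand a line-oriented record reader to behave. It is not Hadoop code: `read_split` is a hypothetical helper, the whole "file" is an in-memory byte string, and records are assumed to be newline-terminated lines.

```python
import io


def read_split(data: bytes, start: int, length: int):
    """Yield the complete lines owned by the split [start, start + length).

    Hypothetical helper for illustration only; `data` stands in for the
    whole file, which a real reader would stream from the file system.
    """
    end = start + length
    stream = io.BytesIO(data)
    stream.seek(start)

    if start != 0:
        # Not the first split: whatever precedes the next newline is
        # emitted by the previous split's reader, so discard it here.
        stream.readline()

    # A line belongs to this split if it begins at or before `end`.
    # Reading it may run past `end`, i.e. into the next block.
    while stream.tell() <= end:
        line = stream.readline()
        if not line:
            break  # end of file
        yield line.rstrip(b"\n")


if __name__ == "__main__":
    data = b"alpha\nbravo\ncharlie\ndelta\n"
    for start in range(0, len(data), 10):
        print(start, list(read_split(data, start, 10)))
```

With 10-byte splits, the second boundary falls in the middle of "bravo", yet every line is emitted exactly once: the first reader runs past its end to finish "bravo", and the second reader throws away the leftover fragment before it starts.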