I'm looking for clarification on something I've read in Hadoop: The Definitive Guide. It states that map output is local and not in HDFS. But the map task runs on the node where the data resides (usually) and that is HDFS, correct? Or is it the case that the I/O done by the map is standard Java I/O and not something like hadoop fs -put?
Thanks in advance to all who answer.
Thanks in advance to all who reply.
Kevin