<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question How Reducers know where the mapper results are stored in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-Reducers-know-where-the-mapper-results-are-stored/m-p/129641#M34735</link>
    <description>&lt;P&gt;1. How Reducers know where the mapper results are stored&lt;/P&gt;</description>
    <pubDate>Thu, 14 Jul 2016 21:36:54 GMT</pubDate>
    <dc:creator>gsrao_cse</dc:creator>
    <dc:date>2016-07-14T21:36:54Z</dc:date>
    <item>
      <title>How Reducers know where the mapper results are stored</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-Reducers-know-where-the-mapper-results-are-stored/m-p/129641#M34735</link>
      <description>&lt;P&gt;1. How Reducers know where the mapper results are stored&lt;/P&gt;</description>
      <pubDate>Thu, 14 Jul 2016 21:36:54 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-Reducers-know-where-the-mapper-results-are-stored/m-p/129641#M34735</guid>
      <dc:creator>gsrao_cse</dc:creator>
      <dc:date>2016-07-14T21:36:54Z</dc:date>
    </item>
    <item>
      <title>Re: How Reducers know where the mapper results are stored</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-Reducers-know-where-the-mapper-results-are-stored/m-p/129642#M34736</link>
      <description>&lt;P&gt;on very high level,once map task get complete it notify Application master through heartbeat, AM keeps track mapping between map output and hosts. Reducer polla AM for map output locations untill it get all.&lt;/P&gt;</description>
      <pubDate>Thu, 14 Jul 2016 22:00:38 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-Reducers-know-where-the-mapper-results-are-stored/m-p/129642#M34736</guid>
      <dc:creator>rajkumar_singh</dc:creator>
      <dc:date>2016-07-14T22:00:38Z</dc:date>
    </item>
    <item>
      <title>Re: How Reducers know where the mapper results are stored</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-Reducers-know-where-the-mapper-results-are-stored/m-p/129643#M34737</link>
      <description>&lt;P&gt;
	I use &lt;A href="http://dailyhadoopsoup.blogspot.com/2014/01/shuffle-and-sort.html"&gt;this&lt;/A&gt; blog often when I forget the data movement between map--&amp;gt;reduce&lt;/P&gt;&lt;P&gt;
	The map outputs are copied to the reduce task JVM’s memory if they are small enough (the buffer’s size is controlled by mapred.job.shuffle.input.buffer.percent, which specifies the proportion of the heap to use for this purpose); otherwise, they are copied to disk. When the in-memory buffer reaches a threshold size 
	&lt;B&gt;(controlled by mapred.job.shuffle.merge.percent) &lt;/B&gt;or reaches a threshold number of map outputs&lt;B&gt;(mapred.inmem.merge.threshold)&lt;/B&gt;, it is merged and spilled to disk.&lt;/P&gt;</description>
      <pubDate>Thu, 14 Jul 2016 23:03:25 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-Reducers-know-where-the-mapper-results-are-stored/m-p/129643#M34737</guid>
      <dc:creator>sunile_manjee</dc:creator>
      <dc:date>2016-07-14T23:03:25Z</dc:date>
    </item>
  </channel>
</rss>

