<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Hdfs and map reduce word count? in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hdfs-and-map-reduce-word-count/m-p/34532#M11134</link>
    <description>Ok very cool. Thank you for the reference.&lt;BR /&gt;&lt;BR /&gt;</description>
    <pubDate>Sun, 29 Nov 2015 20:31:56 GMT</pubDate>
    <dc:creator>KasperHansen</dc:creator>
    <dc:date>2015-11-29T20:31:56Z</dc:date>
    <item>
      <title>Hdfs and map reduce word count?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hdfs-and-map-reduce-word-count/m-p/34530#M11132</link>
      <description>can anyone explain pr direct my to material explaining how map reduce word count works? I do not understand how it can work!&lt;BR /&gt;&lt;BR /&gt;If a file is split into blocks and distributed over multiple nodes how can the word count program work? The file/text can be split in the middle of a word fx "be" in one block and "tween" in another block. How can the map reduce job count between as word if it is split over multiple blocks and nodes??&lt;BR /&gt;</description>
      <pubDate>Fri, 16 Sep 2022 09:50:42 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hdfs-and-map-reduce-word-count/m-p/34530#M11132</guid>
      <dc:creator>KasperHansen</dc:creator>
      <dc:date>2022-09-16T09:50:42Z</dc:date>
    </item>
    <item>
      <title>Re: Hdfs and map reduce word count?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hdfs-and-map-reduce-word-count/m-p/34531#M11133</link>
      <description>The data format will ensure records are intact before being sent to the&lt;BR /&gt;mapper function. I believe this is done by sending partial records to the&lt;BR /&gt;machine they will be mapped on (so the overwhelming majority of the data is&lt;BR /&gt;processed in place, but half a line per block or so may still be&lt;BR /&gt;exchanged). Tom White's book The Definitive Guide To Hadoop does a good job&lt;BR /&gt;of covering details like this.&lt;BR /&gt;&lt;BR /&gt;</description>
      <pubDate>Sun, 29 Nov 2015 19:47:56 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hdfs-and-map-reduce-word-count/m-p/34531#M11133</guid>
      <dc:creator>Sean</dc:creator>
      <dc:date>2015-11-29T19:47:56Z</dc:date>
    </item>
    <item>
      <title>Re: Hdfs and map reduce word count?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hdfs-and-map-reduce-word-count/m-p/34532#M11134</link>
      <description>Ok very cool. Thank you for the reference.&lt;BR /&gt;&lt;BR /&gt;</description>
      <pubDate>Sun, 29 Nov 2015 20:31:56 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hdfs-and-map-reduce-word-count/m-p/34532#M11134</guid>
      <dc:creator>KasperHansen</dc:creator>
      <dc:date>2015-11-29T20:31:56Z</dc:date>
    </item>
  </channel>
</rss>

