<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: &amp;quot;Number of reduce tasks is set to 0 since there's no reduce operator&amp;quot;: a problem? in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/quot-Number-of-reduce-tasks-is-set-to-0-since-there-s-no/m-p/156776#M28917</link>
    <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/4232/rachelwijsmuller.html" nodeid="4232"&gt;@R Wys&lt;/A&gt;&lt;/P&gt;&lt;P&gt;This is not an issue since you are using "select *" which doesn't require any kind of computation therefore Mapreduce framework is smart enough to figure out when reducer tasks is required as per provided operators. &lt;/P&gt;</description>
    <pubDate>Thu, 19 May 2016 18:21:30 GMT</pubDate>
    <dc:creator>jyadav</dc:creator>
    <dc:date>2016-05-19T18:21:30Z</dc:date>
    <item>
      <title>"Number of reduce tasks is set to 0 since there's no reduce operator": a problem?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/quot-Number-of-reduce-tasks-is-set-to-0-since-there-s-no/m-p/156775#M28916</link>
      <description>&lt;P&gt;Hi all, &lt;/P&gt;&lt;P&gt;Odd question - I'm just starting out in Hadoop and am in the process of moving all my test work into production, however I get a strange message on the prod system when working in Hive: "number of reduce tasks is set to 0 since there's no reduce operator". The queries are not failing (yet...?), and there are no strange records in any logs I have looked at. I don't know how to troubleshoot this if indeed it is a problem at all. Any advice? &lt;/P&gt;&lt;P&gt;Example: &lt;/P&gt;&lt;PRE&gt;hive&amp;gt; select * from myTable where daily_date='2015-12-29' limit 10;
Query ID = root_20160519113838_73d2b4dc-efb8-4ea6-b0a4-cdc4dc64c33a
Total jobs = 1
Launching Job 1 out of 1
Number of reduce tasks is set to 0 since there's no reduce operator
Starting Job = job_1418226366907_2316, Tracking URL = &lt;A href="http://hadoop-head01:8088/proxy/application_1418226366907_2316/" target="_blank"&gt;http://hadoop-head01:8088/proxy/application_1418226366907_2316/&lt;/A&gt;
Kill Command = /usr/lib/hadoop/bin/hadoop job  -kill job_1418226366907_2316
Hadoop job information for Stage-1: number of mappers: 1; number of reducers: 0
2016-05-19 11:39:07,038 Stage-1 map = 0%,  reduce = 0%
2016-05-19 11:39:12,653 Stage-1 map = 100%,  reduce = 0%, Cumulative CPU 2.65 sec
MapReduce Total cumulative CPU time: 2 seconds 650 msec
Ended Job = job_1418226366907_2316
MapReduce Jobs Launched:
Job 0: Map: 1   Cumulative CPU: 2.65 sec   HDFS Read: 64722 HDFS Write: 831 SUCCESS
Total MapReduce CPU Time Spent: 2 seconds 650 msec
OK
[..... records ....]
Time taken: 15.876 seconds, Fetched: 10 row(s)
&lt;/PRE&gt;&lt;P&gt;Hadoop version- 2.4.0.2.1.3.0-563&lt;/P&gt;&lt;P&gt;Hive version...(?) 0.13.0.2.1.3.0-563 &lt;/P&gt;</description>
      <pubDate>Thu, 19 May 2016 18:17:19 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/quot-Number-of-reduce-tasks-is-set-to-0-since-there-s-no/m-p/156775#M28916</guid>
      <dc:creator>rachel_wijsmull</dc:creator>
      <dc:date>2016-05-19T18:17:19Z</dc:date>
    </item>
    <item>
      <title>Re: "Number of reduce tasks is set to 0 since there's no reduce operator": a problem?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/quot-Number-of-reduce-tasks-is-set-to-0-since-there-s-no/m-p/156776#M28917</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/4232/rachelwijsmuller.html" nodeid="4232"&gt;@R Wys&lt;/A&gt;&lt;/P&gt;&lt;P&gt;This is not an issue since you are using "select *" which doesn't require any kind of computation therefore Mapreduce framework is smart enough to figure out when reducer tasks is required as per provided operators. &lt;/P&gt;</description>
      <pubDate>Thu, 19 May 2016 18:21:30 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/quot-Number-of-reduce-tasks-is-set-to-0-since-there-s-no/m-p/156776#M28917</guid>
      <dc:creator>jyadav</dc:creator>
      <dc:date>2016-05-19T18:21:30Z</dc:date>
    </item>
    <item>
      <title>Re: "Number of reduce tasks is set to 0 since there's no reduce operator": a problem?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/quot-Number-of-reduce-tasks-is-set-to-0-since-there-s-no/m-p/156777#M28918</link>
      <description>&lt;P&gt;This is not a problem at all.&lt;/P&gt;&lt;P&gt;Hive is just telling you that you are doing a "Map only" job.&lt;/P&gt;&lt;P&gt;Usually, in MapReduce (now in Hive we prefer using Tez instead of MapReduce but let's talk about MapReduce here because it is easier to understand) your job will have the following steps: Map -&amp;gt; Shuffle -&amp;gt; Reduce.&lt;/P&gt;&lt;P&gt;The Map and Reduce steps are where computations (in Hive: projections, aggregations, filtering...) happen.
Shuffle is just data going on the network, to go from the nodes that launched the mappers to the one that launch the reducers.&lt;/P&gt;&lt;P&gt;So if there is a possibility to do some "Map only" job and to avoid the "Shuffle" and "Reduce" steps, better: your job will be much faster in general and will involve less cluster resources (network, CPU, disk &amp;amp; memory).&lt;/P&gt;&lt;P&gt;The query you are showing on this example is very simple, that is why it can be transformed by Hive into a "Map only" job.&lt;/P&gt;&lt;P&gt;To understand better how the Hive queries are transformed into some MapReduce/Tez jobs, you can have a look at the "explain" command:&lt;/P&gt;&lt;P&gt;&lt;A href="https://cwiki.apache.org/confluence/display/Hive/LanguageManual+Explain"&gt;https://cwiki.apache.org/confluence/display/Hive/LanguageManual+Explain&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 19 May 2016 18:24:54 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/quot-Number-of-reduce-tasks-is-set-to-0-since-there-s-no/m-p/156777#M28918</guid>
      <dc:creator>sluangsay</dc:creator>
      <dc:date>2016-05-19T18:24:54Z</dc:date>
    </item>
    <item>
      <title>Re: "Number of reduce tasks is set to 0 since there's no reduce operator": a problem?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/quot-Number-of-reduce-tasks-is-set-to-0-since-there-s-no/m-p/156778#M28919</link>
      <description>&lt;A rel="user" href="https://community.cloudera.com/users/4232/rachelwijsmuller.html" nodeid="4232"&gt;@R Wys&lt;/A&gt;&lt;P&gt;There is no problem with hive here, hive has generated an execution plan with no reduce phase in your case. you can see the plan by running 'explain select*from myTable where daily_date='2015-12-29' limit 10'&lt;/P&gt;</description>
      <pubDate>Thu, 19 May 2016 18:27:48 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/quot-Number-of-reduce-tasks-is-set-to-0-since-there-s-no/m-p/156778#M28919</guid>
      <dc:creator>rajkumar_singh</dc:creator>
      <dc:date>2016-05-19T18:27:48Z</dc:date>
    </item>
  </channel>
</rss>

