<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Oozie streaming fails: stream.map.streamprocessor not set In JobConf [CDH 5.2] in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Oozie-streaming-fails-stream-map-streamprocessor-not-set-In/m-p/23371#M3673</link>
    <description>It will be resolved in the next 5.2.x and 5.3.x bugfix releases (along with 5.4.0 in future).</description>
    <pubDate>Wed, 07 Jan 2015 06:15:27 GMT</pubDate>
    <dc:creator>Harsh J</dc:creator>
    <dc:date>2015-01-07T06:15:27Z</dc:date>
    <item>
      <title>Oozie streaming fails: stream.map.streamprocessor not set In JobConf [CDH 5.2]</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Oozie-streaming-fails-stream-map-streamprocessor-not-set-In/m-p/21517#M3666</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;We have had CDH 5.1 running in production for a few months with no issues.&lt;/P&gt;&lt;P&gt;Recently we created another cluster for our QA environment and installed CDH 5.2 through Cloudera Manager.&lt;/P&gt;&lt;P&gt;When we tried to run some Oozie workflows (the same jobs as in production), we got the following error:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;PRE&gt;Error: java.io.IOException: Type mismatch in key from map: expected org.apache.hadoop.io.LongWritable, received org.apache.hadoop.io.Text&lt;/PRE&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I figured out that this error occurred because Hadoop tried to use the IdentityMapper class instead of our streaming processors.&lt;/P&gt;&lt;P&gt;I've tried a lot of different options, but nothing has helped so far.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;The closest I could get was to compare the actual jobConf files that we get in production (CDH 5.1) and on the new cluster (CDH 5.2). I found that on the new cluster the jobConf doesn't contain the following properties:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;stream.map.streamprocessor&lt;/P&gt;&lt;P&gt;stream.reduce.streamprocessor&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;which, if I understand correctly, are used by hadoop-streaming.jar.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I have no idea where to look next.&lt;/P&gt;&lt;P&gt;I would really appreciate any help with this issue.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks,&lt;/P&gt;&lt;P&gt;Anatoly&lt;/P&gt;</description>
      <pubDate>Fri, 16 Sep 2022 09:12:46 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Oozie-streaming-fails-stream-map-streamprocessor-not-set-In/m-p/21517#M3666</guid>
      <dc:creator>Anatoly</dc:creator>
      <dc:date>2022-09-16T09:12:46Z</dc:date>
    </item>
    <item>
      <title>Re: Oozie streaming fails: stream.map.streamprocessor not set In JobConf [CDH 5.2]</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Oozie-streaming-fails-stream-map-streamprocessor-not-set-In/m-p/21985#M3667</link>
      <description>&lt;P&gt;Anyone?&lt;/P&gt;</description>
      <pubDate>Tue, 25 Nov 2014 05:33:57 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Oozie-streaming-fails-stream-map-streamprocessor-not-set-In/m-p/21985#M3667</guid>
      <dc:creator>Anatoly</dc:creator>
      <dc:date>2014-11-25T05:33:57Z</dc:date>
    </item>
    <item>
      <title>Re: Oozie streaming fails: stream.map.streamprocessor not set In JobConf [CDH 5.2]</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Oozie-streaming-fails-stream-map-streamprocessor-not-set-In/m-p/22104#M3668</link>
      <description>Which streaming jar are you using specifically? Are you using the Oozie streaming action?&lt;BR /&gt;&lt;BR /&gt;Can you share your actual job-launching command (or the relevant Oozie workflow and associated scripts/files)?</description>
      <pubDate>Sun, 30 Nov 2014 11:50:57 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Oozie-streaming-fails-stream-map-streamprocessor-not-set-In/m-p/22104#M3668</guid>
      <dc:creator>Harsh J</dc:creator>
      <dc:date>2014-11-30T11:50:57Z</dc:date>
    </item>
    <item>
      <title>Re: Oozie streaming fails: stream.map.streamprocessor not set In JobConf [CDH 5.2]</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Oozie-streaming-fails-stream-map-streamprocessor-not-set-In/m-p/22164#M3669</link>
      <description>&lt;P&gt;Yes, I'm using the Oozie streaming action, and the streaming jar is the one bundled with CDH.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;What is strange is that we use the same workflows and coordinators in our production cluster, which is on CDH 5.1.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Here is an example workflow action we use:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;PRE&gt;&amp;lt;action name="raw-pass" retry-max="3" retry-interval="1"&amp;gt;
        &amp;lt;map-reduce&amp;gt;
            &amp;lt;job-tracker&amp;gt;${jobTracker}&amp;lt;/job-tracker&amp;gt;
            &amp;lt;name-node&amp;gt;${nameNode}&amp;lt;/name-node&amp;gt;
            &amp;lt;prepare&amp;gt;
                &amp;lt;delete path="${rawPassOutput}"/&amp;gt;
            &amp;lt;/prepare&amp;gt;
            &amp;lt;streaming&amp;gt;
                &amp;lt;mapper&amp;gt;mapreducers/bin/mapred run MyJobMapper&amp;lt;/mapper&amp;gt;
                &amp;lt;reducer&amp;gt;mapreducers/bin/mapred run MyJobReducer&amp;lt;/reducer&amp;gt;
            &amp;lt;/streaming&amp;gt;
            &amp;lt;configuration&amp;gt;
                &amp;lt;property&amp;gt;
                    &amp;lt;!--
                        This will add avro.jar and avro-mapred.jar dependencies to the job
                        (@see mapred.input.format.class property below)
                    --&amp;gt;
                    &amp;lt;name&amp;gt;oozie.action.sharelib.for.map-reduce&amp;lt;/name&amp;gt;
                    &amp;lt;value&amp;gt;mapreduce-streaming,hcatalog,sqoop&amp;lt;/value&amp;gt;
                &amp;lt;/property&amp;gt;
                &amp;lt;property&amp;gt;
                    &amp;lt;name&amp;gt;mapred.reduce.tasks&amp;lt;/name&amp;gt;
                    &amp;lt;value&amp;gt;1&amp;lt;/value&amp;gt;
                &amp;lt;/property&amp;gt;
                &amp;lt;property&amp;gt;
                    &amp;lt;name&amp;gt;mapred.job.queue.name&amp;lt;/name&amp;gt;
                    &amp;lt;value&amp;gt;${queueName}&amp;lt;/value&amp;gt;
                &amp;lt;/property&amp;gt;
                &amp;lt;property&amp;gt;
                    &amp;lt;name&amp;gt;mapred.input.dir&amp;lt;/name&amp;gt;
                    &amp;lt;value&amp;gt;${rawPassInput}&amp;lt;/value&amp;gt;
                &amp;lt;/property&amp;gt;
                &amp;lt;property&amp;gt;
                    &amp;lt;name&amp;gt;mapred.output.dir&amp;lt;/name&amp;gt;
                    &amp;lt;value&amp;gt;${rawPassOutput}&amp;lt;/value&amp;gt;
                &amp;lt;/property&amp;gt;
                &amp;lt;property&amp;gt;
                    &amp;lt;!--
                        This input format will automagically decode avro files so
                        that our mappers will get plain json as input.
                    --&amp;gt;
                    &amp;lt;name&amp;gt;mapred.input.format.class&amp;lt;/name&amp;gt;
                    &amp;lt;value&amp;gt;org.apache.avro.mapred.AvroAsTextInputFormat&amp;lt;/value&amp;gt;
                &amp;lt;/property&amp;gt;
            &amp;lt;/configuration&amp;gt;
            &amp;lt;archive&amp;gt;${mapreducersArchive}#mapreducers&amp;lt;/archive&amp;gt;
        &amp;lt;/map-reduce&amp;gt;
        &amp;lt;ok to="aggregate-pass"/&amp;gt;
        &amp;lt;error to="failure-email-notification"/&amp;gt;
    &amp;lt;/action&amp;gt;&lt;/PRE&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I cannot share the actual mapper and reducer scripts, though.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Also, is there a relatively easy way to downgrade a CDH installation from 5.2 to 5.1? Or at least to install 5.1 from scratch? I'm not sure there was an option in Cloudera Manager to install 5.1, only 5.2...&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks!&lt;/P&gt;</description>
      <pubDate>Sun, 30 Nov 2014 20:10:39 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Oozie-streaming-fails-stream-map-streamprocessor-not-set-In/m-p/22164#M3669</guid>
      <dc:creator>Anatoly</dc:creator>
      <dc:date>2014-11-30T20:10:39Z</dc:date>
    </item>
    <item>
      <title>Re: Oozie streaming fails: stream.map.streamprocessor not set In JobConf [CDH 5.2]</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Oozie-streaming-fails-stream-map-streamprocessor-not-set-In/m-p/22187#M3670</link>
      <description>&lt;P&gt;OK, so I've managed to downgrade the cluster to 5.1 and start my coordinator jobs. It seems to be working as expected now.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;So it seems to me that some change in 5.2 breaks backward compatibility, but I really don't know what it is. At least I know that we shouldn't upgrade to 5.2 until we figure out how to solve this.&lt;/P&gt;</description>
      <pubDate>Mon, 01 Dec 2014 20:18:56 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Oozie-streaming-fails-stream-map-streamprocessor-not-set-In/m-p/22187#M3670</guid>
      <dc:creator>Anatoly</dc:creator>
      <dc:date>2014-12-01T20:18:56Z</dc:date>
    </item>
    <item>
      <title>Re: Oozie streaming fails: stream.map.streamprocessor not set In JobConf [CDH 5.2]</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Oozie-streaming-fails-stream-map-streamprocessor-not-set-In/m-p/23122#M3671</link>
      <description>This is caused by &lt;A target="_blank" href="https://issues.apache.org/jira/browse/OOZIE-2102"&gt;https://issues.apache.org/jira/browse/OOZIE-2102&lt;/A&gt;. It will be resolved in a future release of CDH5.</description>
      <pubDate>Sun, 28 Dec 2014 04:19:53 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Oozie-streaming-fails-stream-map-streamprocessor-not-set-In/m-p/23122#M3671</guid>
      <dc:creator>Harsh J</dc:creator>
      <dc:date>2014-12-28T04:19:53Z</dc:date>
    </item>
    <item>
      <title>Re: Oozie streaming fails: stream.map.streamprocessor not set In JobConf [CDH 5.2]</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Oozie-streaming-fails-stream-map-streamprocessor-not-set-In/m-p/23196#M3672</link>
      <description>&lt;P&gt;Thanks, Harsh!&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Really looking forward to this being released! Just to clarify: is it going to be included in 5.3.x or only in 5.4?&lt;/P&gt;</description>
      <pubDate>Wed, 31 Dec 2014 00:37:33 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Oozie-streaming-fails-stream-map-streamprocessor-not-set-In/m-p/23196#M3672</guid>
      <dc:creator>Anatoly</dc:creator>
      <dc:date>2014-12-31T00:37:33Z</dc:date>
    </item>
    <item>
      <title>Re: Oozie streaming fails: stream.map.streamprocessor not set In JobConf [CDH 5.2]</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Oozie-streaming-fails-stream-map-streamprocessor-not-set-In/m-p/23371#M3673</link>
      <description>It will be resolved in the next 5.2.x and 5.3.x bugfix releases (along with 5.4.0 in future).</description>
      <pubDate>Wed, 07 Jan 2015 06:15:27 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Oozie-streaming-fails-stream-map-streamprocessor-not-set-In/m-p/23371#M3673</guid>
      <dc:creator>Harsh J</dc:creator>
      <dc:date>2015-01-07T06:15:27Z</dc:date>
    </item>
  </channel>
</rss>

