<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Getting error while running mahout job through Oozie in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Getting-error-while-running-mahout-job-through-Oozie/m-p/42341#M33064</link>
    <description>&lt;P&gt;I have a guess: you may need to put each of those tokens in a separate arg tag. I don't know Oozie well myself, but something similar is needed in Maven config files. That is, it may be reading this as one arg, "-xm mapreduce", rather than two.&lt;/P&gt;</description>
    <pubDate>Mon, 27 Jun 2016 16:28:44 GMT</pubDate>
    <dc:creator>srowen</dc:creator>
    <dc:date>2016-06-27T16:28:44Z</dc:date>
    <item>
      <title>Getting error while running mahout job through Oozie</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Getting-error-while-running-mahout-job-through-Oozie/m-p/42339#M33063</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;I am trying to run a Mahout job on the Reuters dataset. I have executed the first two steps manually:&lt;/P&gt;&lt;P&gt;Step 1: mahout org.apache.lucene.benchmark.utils.ExtractReuters reuters_dataset reuters-out&lt;/P&gt;&lt;P&gt;Step 2: hdfs dfs -put -f reuters-out /user/cloudera/mahout/kmeans/reuters-out&lt;/P&gt;&lt;P&gt;I want to execute the third step using Oozie, i.e.:&lt;/P&gt;&lt;P&gt;mahout seqdirectory -i /user/cloudera/mahout/kmeans/reuters-out -o /user/cloudera/mahout/kmeans/reuters-out-seqdir -c UTF-8 -chunk 5 -xm mapreduce&lt;/P&gt;&lt;P&gt;I have created the following workflow.xml and job.properties for this:&lt;/P&gt;&lt;P&gt;&amp;lt;workflow-app name="My_Workflow" xmlns="uri:oozie:workflow:0.5"&amp;gt;&lt;BR /&gt;&amp;lt;start to="mahout-testing"/&amp;gt;&lt;BR /&gt;&amp;lt;kill name="Kill"&amp;gt;&lt;BR /&gt;&amp;lt;message&amp;gt;Action failed, error message[${wf:errorMessage(wf:lastErrorNode())}]&amp;lt;/message&amp;gt;&lt;BR /&gt;&amp;lt;/kill&amp;gt;&lt;BR /&gt;&amp;lt;action name="mahout-testing"&amp;gt;&lt;BR /&gt;&amp;lt;java&amp;gt;&lt;BR /&gt;&amp;lt;job-tracker&amp;gt;${jobTracker}&amp;lt;/job-tracker&amp;gt;&lt;BR /&gt;&amp;lt;name-node&amp;gt;${nameNode}&amp;lt;/name-node&amp;gt;&lt;BR /&gt;&amp;lt;prepare&amp;gt;&lt;BR /&gt;&amp;lt;delete path="${nameNode}/user/cloudera/mahout/kmeans/reuters-out-seqdir"/&amp;gt;&lt;BR /&gt;&amp;lt;/prepare&amp;gt;&lt;BR /&gt;&amp;lt;main-class&amp;gt;org.apache.mahout.driver.MahoutDriver&amp;lt;/main-class&amp;gt;&lt;BR /&gt;&amp;lt;arg&amp;gt;seqdirectory&amp;lt;/arg&amp;gt;&lt;BR /&gt;&amp;lt;arg&amp;gt;-i /user/cloudera/mahout/kmeans/reuters-out&amp;lt;/arg&amp;gt;&lt;BR /&gt;&amp;lt;arg&amp;gt;-o /user/cloudera/mahout/kmeans/reuters-out-seqdir&amp;lt;/arg&amp;gt;&lt;BR /&gt;&amp;lt;arg&amp;gt;-xm mapreduce&amp;lt;/arg&amp;gt;&lt;BR /&gt;&amp;lt;/java&amp;gt;&lt;BR /&gt;&amp;lt;ok to="End"/&amp;gt;&lt;BR /&gt;&amp;lt;error to="Kill"/&amp;gt;&lt;BR /&gt;&amp;lt;/action&amp;gt;&lt;BR /&gt;&amp;lt;end name="End"/&amp;gt;&lt;BR /&gt;&amp;lt;/workflow-app&amp;gt;&lt;/P&gt;&lt;P&gt;job.properties:&lt;/P&gt;&lt;P&gt;oozie.use.system.libpath=True&lt;BR /&gt;security_enabled=False&lt;BR /&gt;dryrun=False&lt;BR /&gt;jobTracker=localhost:8032&lt;BR /&gt;nameNode=hdfs://quickstart.cloudera:8020&lt;BR /&gt;oozie.wf.application.path=${nameNode}/user/cloudera/app/mahout/kmeans/workflow.xml&lt;/P&gt;&lt;P&gt;I have also placed all Mahout-related jars under /user/cloudera/app/mahout/kmeans/lib.&lt;/P&gt;&lt;P&gt;But I am getting the following error:&lt;/P&gt;&lt;P&gt;&amp;gt;&amp;gt;&amp;gt; Invoking Main class now &amp;gt;&amp;gt;&amp;gt;&lt;/P&gt;&lt;P&gt;Fetching child yarn jobs&lt;BR /&gt;tag id : oozie-e3010996ec4154408748b70b6f44d85e&lt;BR /&gt;Child yarn jobs are found - &lt;BR /&gt;Main class : org.apache.mahout.driver.MahoutDriver&lt;BR /&gt;Arguments :&lt;BR /&gt;seqdirectory&lt;BR /&gt;-i /user/cloudera/mahout/kmeans/reuters-out&lt;BR /&gt;-o /user/cloudera/mahout/kmeans/reuters-out-seqdir&lt;BR /&gt;-xm mapreduce&lt;/P&gt;&lt;P&gt;Unexpected -xm mapreduce while processing Job-Specific Options: &lt;BR /&gt;Usage: &lt;BR /&gt;[--input &amp;lt;input&amp;gt; --output &amp;lt;output&amp;gt; --overwrite --method &amp;lt;method&amp;gt; --chunkSize &lt;BR /&gt;&amp;lt;chunkSize&amp;gt; --fileFilterClass &amp;lt;fileFilterClass&amp;gt; --keyPrefix &amp;lt;keyPrefix&amp;gt; &lt;BR /&gt;--charset &amp;lt;charset&amp;gt; --method &amp;lt;method&amp;gt; --overwrite --help --tempDir &amp;lt;tempDir&amp;gt; &lt;BR /&gt;--startPhase &amp;lt;startPhase&amp;gt; --endPhase &amp;lt;endPhase&amp;gt;] &lt;BR /&gt;Job-Specific Options: &lt;BR /&gt;--input (-i) input Path to job input directory. &lt;BR /&gt;--output (-o) output The directory pathname for &lt;BR /&gt;output. &lt;BR /&gt;--overwrite (-ow) If present, overwrite the &lt;BR /&gt;output directory before &lt;BR /&gt;running job &lt;BR /&gt;--method (-xm) method The execution method to use: &lt;BR /&gt;sequential or mapreduce. &lt;BR /&gt;Default is mapreduce&lt;/P&gt;&lt;P&gt;Could you please explain what the issue is here?&lt;/P&gt;</description>
      <pubDate>Fri, 16 Sep 2022 10:27:49 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Getting-error-while-running-mahout-job-through-Oozie/m-p/42339#M33063</guid>
      <dc:creator>Anishsk10</dc:creator>
      <dc:date>2022-09-16T10:27:49Z</dc:date>
    </item>
    <item>
      <title>Re: Getting error while running mahout job through Oozie</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Getting-error-while-running-mahout-job-through-Oozie/m-p/42341#M33064</link>
      <description>&lt;P&gt;I have a guess: you may need to put each of those tokens in a separate arg tag. I don't know Oozie well myself, but something similar is needed in Maven config files. That is, it may be reading this as one arg, "-xm mapreduce", rather than two.&lt;/P&gt;</description>
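      <content:encoded>&lt;P&gt;A minimal sketch of the guess that "-xm mapreduce" is being read as one arg, assuming Oozie hands the value of each arg element to the main class as exactly one argument. The "Arguments :" listing in the log, which prints "-xm mapreduce" on a single line, is consistent with that. Each flag and each value gets its own arg element inside the java action. This is untested on this cluster, and splitting -i and -o the same way is an assumption made for consistency:&lt;/P&gt;&lt;P&gt;&amp;lt;arg&amp;gt;seqdirectory&amp;lt;/arg&amp;gt;&lt;BR /&gt;&amp;lt;arg&amp;gt;-i&amp;lt;/arg&amp;gt;&lt;BR /&gt;&amp;lt;arg&amp;gt;/user/cloudera/mahout/kmeans/reuters-out&amp;lt;/arg&amp;gt;&lt;BR /&gt;&amp;lt;arg&amp;gt;-o&amp;lt;/arg&amp;gt;&lt;BR /&gt;&amp;lt;arg&amp;gt;/user/cloudera/mahout/kmeans/reuters-out-seqdir&amp;lt;/arg&amp;gt;&lt;BR /&gt;&amp;lt;arg&amp;gt;-xm&amp;lt;/arg&amp;gt;&lt;BR /&gt;&amp;lt;arg&amp;gt;mapreduce&amp;lt;/arg&amp;gt;&lt;/P&gt;&lt;P&gt;If the -c UTF-8 and -chunk 5 options from the manual command are needed, they would presumably be split in the same way, one arg element per token.&lt;/P&gt;</content:encoded>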
      <pubDate>Mon, 27 Jun 2016 16:28:44 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Getting-error-while-running-mahout-job-through-Oozie/m-p/42341#M33064</guid>
      <dc:creator>srowen</dc:creator>
      <dc:date>2016-06-27T16:28:44Z</dc:date>
    </item>
  </channel>
</rss>

