<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Apache Mahout K-Means Algorithm in HDP 2.4 on Hortonworks Sandbox with MapReduce! in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Apache-Mahout-K-Means-Algorithm-in-HDP-2-4-on-Hortonworks/m-p/149641#M28448</link>
    <description>&lt;P&gt;Get data into the cluster? Easiest way is to have a delimited file and do hadoop fs -put file &amp;lt;hdfs location&amp;gt; You can then read those files with sc.textFile. &lt;/P&gt;&lt;P&gt;You should go through a couple of basic tutorials I think to work with hadoop:&lt;/P&gt;&lt;P&gt;&lt;A href="http://hortonworks.com/hadoop-tutorial/using-commandline-manage-files-hdfs/" target="_blank"&gt;http://hortonworks.com/hadoop-tutorial/using-commandline-manage-files-hdfs/&lt;/A&gt;&lt;/P&gt;</description>
    <pubDate>Mon, 16 May 2016 21:38:01 GMT</pubDate>
    <dc:creator>bleonhardi</dc:creator>
    <dc:date>2016-05-16T21:38:01Z</dc:date>
    <item>
      <title>Apache Mahout K-Means Algorithm in HDP 2.4 on Hortonworks Sandbox with MapReduce!</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Apache-Mahout-K-Means-Algorithm-in-HDP-2-4-on-Hortonworks/m-p/149638#M28445</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt;&lt;P&gt;i am studing Hortonworks Data Platform and i needs to run the K-Means algorithm. I know that the K-Means algorithm are in Mahout but the problem is that i don't know how i can execute this algorithm. I don't know how i can introduce any information to HDP Sandbox.&lt;/P&gt;&lt;P&gt;Can someone help me?&lt;/P&gt;&lt;P&gt;Thank you very much.&lt;/P&gt;&lt;P&gt;David.&lt;/P&gt;</description>
      <pubDate>Fri, 16 Sep 2022 10:19:50 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Apache-Mahout-K-Means-Algorithm-in-HDP-2-4-on-Hortonworks/m-p/149638#M28445</guid>
      <dc:creator>davidrebe</dc:creator>
      <dc:date>2022-09-16T10:19:50Z</dc:date>
    </item>
    <item>
      <title>Re: Apache Mahout K-Means Algorithm in HDP 2.4 on Hortonworks Sandbox with MapReduce!</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Apache-Mahout-K-Means-Algorithm-in-HDP-2-4-on-Hortonworks/m-p/149639#M28446</link>
      <description>&lt;P&gt;Hello David ,&lt;/P&gt;&lt;P&gt;Does it have to be Mahout? In general spark mlib is just quote a bit "cooler" now. Here is the Web page of it with an example code. ( If it has to be Mahout I am sure someone can help too)&lt;/P&gt;&lt;P&gt;&lt;A href="http://spark.apache.org/docs/latest/mllib-clustering.html" target="_blank"&gt;http://spark.apache.org/docs/latest/mllib-clustering.html&lt;/A&gt;&lt;/P&gt;&lt;P&gt;Regarding Mahout I suppose you found that one already:&lt;/P&gt;&lt;P&gt;&lt;A href="https://mahout.apache.org/users/clustering/k-means-clustering.html" target="_blank"&gt;https://mahout.apache.org/users/clustering/k-means-clustering.html&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 16 May 2016 19:17:57 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Apache-Mahout-K-Means-Algorithm-in-HDP-2-4-on-Hortonworks/m-p/149639#M28446</guid>
      <dc:creator>bleonhardi</dc:creator>
      <dc:date>2016-05-16T19:17:57Z</dc:date>
    </item>
    <item>
      <title>Re: Apache Mahout K-Means Algorithm in HDP 2.4 on Hortonworks Sandbox with MapReduce!</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Apache-Mahout-K-Means-Algorithm-in-HDP-2-4-on-Hortonworks/m-p/149640#M28447</link>
      <description>&lt;P&gt;Hello Benjamin,&lt;/P&gt;&lt;P&gt;the problem is that i need thw two implementations (Spark and MapReduce) for to make a comparision. To realise the task in Spark i don't know how i can introduce the data to Sandbox to execute the algorithm. In the link: &lt;A href="http://spark.apache.org/docs/latest/mllib-clustering.html"&gt;http://spark.apache.org/docs/latest/mllib-clustering.html&lt;/A&gt;&lt;/P&gt;&lt;P&gt;only appears the code, but how i can create one task of Spark in Sandbox?&lt;/P&gt;&lt;P&gt;Can you help me?&lt;/P&gt;</description>
      <pubDate>Mon, 16 May 2016 21:01:55 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Apache-Mahout-K-Means-Algorithm-in-HDP-2-4-on-Hortonworks/m-p/149640#M28447</guid>
      <dc:creator>davidrebe</dc:creator>
      <dc:date>2016-05-16T21:01:55Z</dc:date>
    </item>
    <item>
      <title>Re: Apache Mahout K-Means Algorithm in HDP 2.4 on Hortonworks Sandbox with MapReduce!</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Apache-Mahout-K-Means-Algorithm-in-HDP-2-4-on-Hortonworks/m-p/149641#M28448</link>
      <description>&lt;P&gt;Get data into the cluster? Easiest way is to have a delimited file and do hadoop fs -put file &amp;lt;hdfs location&amp;gt; You can then read those files with sc.textFile. &lt;/P&gt;&lt;P&gt;You should go through a couple of basic tutorials I think to work with hadoop:&lt;/P&gt;&lt;P&gt;&lt;A href="http://hortonworks.com/hadoop-tutorial/using-commandline-manage-files-hdfs/" target="_blank"&gt;http://hortonworks.com/hadoop-tutorial/using-commandline-manage-files-hdfs/&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 16 May 2016 21:38:01 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Apache-Mahout-K-Means-Algorithm-in-HDP-2-4-on-Hortonworks/m-p/149641#M28448</guid>
      <dc:creator>bleonhardi</dc:creator>
      <dc:date>2016-05-16T21:38:01Z</dc:date>
    </item>
  </channel>
</rss>

