<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Hadoop + Spark Use Case in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/Hadoop-Spark-Use-Case/m-p/167767#M130097</link>
    <description>&lt;P&gt;If you think of Hadoop as HDFS and YARN, Spark can take advantage of both: HDFS (storage that scales horizontally by adding more nodes) by reading input data from HDFS and writing the final processed data back to HDFS, and YARN (compute that scales horizontally by adding more nodes) by running on YARN.&lt;/P&gt;&lt;P&gt;If you are looking for use cases, look at the MLlib algorithms, which cover a lot of use cases that can run on top of Spark.&lt;/P&gt;</description>
    <pubDate>Thu, 26 May 2016 03:36:43 GMT</pubDate>
    <dc:creator>ravi1</dc:creator>
    <dc:date>2016-05-26T03:36:43Z</dc:date>
    <item>
      <title>Hadoop + Spark Use Case</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Hadoop-Spark-Use-Case/m-p/167765#M130095</link>
      <description>&lt;P&gt;Is there a use case that shows how Hadoop and Spark work together? I have already read the theory, but I want to see something practical to get a better understanding.

Thanks!&lt;/P&gt;</description>
      <pubDate>Thu, 26 May 2016 02:12:35 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Hadoop-Spark-Use-Case/m-p/167765#M130095</guid>
      <dc:creator>prodgers125</dc:creator>
      <dc:date>2016-05-26T02:12:35Z</dc:date>
    </item>
    <item>
      <title>Re: Hadoop + Spark Use Case</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Hadoop-Spark-Use-Case/m-p/167766#M130096</link>
      <description>&lt;P&gt;There is a good blog post over at MapR regarding this. I personally think the Network Security use case is especially compelling.&lt;/P&gt;&lt;P&gt;&lt;A href="https://www.mapr.com/blog/game-changing-real-time-use-cases-apache-spark-on-hadoop" target="_blank"&gt;https://www.mapr.com/blog/game-changing-real-time-use-cases-apache-spark-on-hadoop&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 26 May 2016 03:24:06 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Hadoop-Spark-Use-Case/m-p/167766#M130096</guid>
      <dc:creator>bhopp</dc:creator>
      <dc:date>2016-05-26T03:24:06Z</dc:date>
    </item>
    <item>
      <title>Re: Hadoop + Spark Use Case</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Hadoop-Spark-Use-Case/m-p/167767#M130097</link>
      <description>&lt;P&gt;If you think of Hadoop as HDFS and YARN, Spark can take advantage of both: HDFS (storage that scales horizontally by adding more nodes) by reading input data from HDFS and writing the final processed data back to HDFS, and YARN (compute that scales horizontally by adding more nodes) by running on YARN.&lt;/P&gt;&lt;P&gt;If you are looking for use cases, look at the MLlib algorithms, which cover a lot of use cases that can run on top of Spark.&lt;/P&gt;</description>
      <pubDate>Thu, 26 May 2016 03:36:43 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Hadoop-Spark-Use-Case/m-p/167767#M130097</guid>
      <dc:creator>ravi1</dc:creator>
      <dc:date>2016-05-26T03:36:43Z</dc:date>
    </item>
    <item>
      <title>Re: Hadoop + Spark Use Case</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Hadoop-Spark-Use-Case/m-p/167768#M130098</link>
      <description>&lt;P&gt;Spark and Hadoop go together like peanut butter and jelly.&lt;/P&gt;&lt;P&gt;Check out my slides:&lt;/P&gt;&lt;P&gt;&lt;A href="https://community.hortonworks.com/content/idea/28342/apache-zeppelin-with-scala-spark-introduction-to-r.html" target="_blank"&gt;https://community.hortonworks.com/content/idea/28342/apache-zeppelin-with-scala-spark-introduction-to-r.html&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&lt;A href="https://community.hortonworks.com/content/kbentry/34784/data-ingest-with-apache-zeppelin-apache-spark-16-h.html" target="_blank"&gt;https://community.hortonworks.com/content/kbentry/34784/data-ingest-with-apache-zeppelin-apache-spark-16-h.html&lt;/A&gt;&lt;/P&gt;&lt;P&gt;I worked at a few places that used Spark and Spark Streaming to ingest data into HDFS and HBase, then Spark + Spark MLlib and H2O to run machine learning on the data, then Hive and Spark SQL for queries, with reporting through the Hive Thrift server to Tableau.&lt;/P&gt;&lt;P&gt;Spark without Hadoop is really missing out on a lot.&lt;/P&gt;&lt;P&gt;With Spark 1.6 on HDP you get all the benefits of running YARN applications, common security, and locality of data access.&lt;/P&gt;&lt;P&gt;I wouldn't run Spark without Hadoop unless you are running Spark standalone for development.&lt;/P&gt;&lt;P&gt;Even there, Zeppelin + Spark 1.6 on HDP is an awesome development environment.&lt;/P&gt;</description>
      <pubDate>Thu, 26 May 2016 03:55:16 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Hadoop-Spark-Use-Case/m-p/167768#M130098</guid>
      <dc:creator>TimothySpann</dc:creator>
      <dc:date>2016-05-26T03:55:16Z</dc:date>
    </item>
    <item>
      <title>Re: Hadoop + Spark Use Case</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Hadoop-Spark-Use-Case/m-p/167769#M130099</link>
      <description>&lt;P&gt;Yes, when I think about Hadoop, I mean storing the data in HDFS. I don't know what kind of advantage I can get from Spark. Data cleansing?&lt;/P&gt;</description>
      <pubDate>Sun, 29 May 2016 06:28:58 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Hadoop-Spark-Use-Case/m-p/167769#M130099</guid>
      <dc:creator>prodgers125</dc:creator>
      <dc:date>2016-05-29T06:28:58Z</dc:date>
    </item>
  </channel>
</rss>

