<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: stream processing runtimes in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/stream-processing-runtimes/m-p/158002#M120401</link>
    <description>&lt;P&gt;Thanks &lt;A rel="user" href="https://community.cloudera.com/users/2707/tkiss.html" nodeid="2707"&gt;@Tibor Kiss&lt;/A&gt; - I am looking for more information around distributed mode, is there a name to the cluster managers in storm or spark stremaing.&lt;/P&gt;</description>
    <pubDate>Thu, 02 Feb 2017 19:58:13 GMT</pubDate>
    <dc:creator>avijeetd</dc:creator>
    <dc:date>2017-02-02T19:58:13Z</dc:date>
    <item>
      <title>stream processing runtimes</title>
      <link>https://community.cloudera.com/t5/Support-Questions/stream-processing-runtimes/m-p/158000#M120399</link>
      <description>&lt;P&gt;Hi All,&lt;/P&gt;&lt;P&gt;most of the batch processing frameworks (MR, Spark) support a local mode and a distributed mode (standalone, yarn, mesos) of deployment and execution.&lt;/P&gt;&lt;P&gt;what about stream processing frameworks such as STORM, Spark-streaming? Do they manage the distributed mode on their own? is it even realistic to expect them to be work on YARN?&lt;/P&gt;&lt;P&gt;How to monitor a distributed spark streaming job? And do we need to specify master as yarn to make it distributed?&lt;/P&gt;&lt;P&gt;Thanks,&lt;/P&gt;&lt;P&gt;Avijeet&lt;/P&gt;</description>
      <pubDate>Thu, 02 Feb 2017 18:42:04 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/stream-processing-runtimes/m-p/158000#M120399</guid>
      <dc:creator>avijeetd</dc:creator>
      <dc:date>2017-02-02T18:42:04Z</dc:date>
    </item>
    <item>
      <title>Re: stream processing runtimes</title>
      <link>https://community.cloudera.com/t5/Support-Questions/stream-processing-runtimes/m-p/158001#M120400</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt;&lt;P&gt;Both storm &amp;amp; spark supports local mode. &lt;/P&gt;&lt;P&gt;In Storm you need to create a LocalCluster instance then you can submit your job onto that. You can find description and example in the links: &lt;/P&gt;&lt;P&gt;&lt;A href="http://storm.apache.org/releases/1.0.2/Local-mode.html" target="_blank"&gt;http://storm.apache.org/releases/1.0.2/Local-mode.html&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&lt;A href="https://github.com/apache/storm/blob/1.0.x-branch/examples/storm-starter/src/jvm/org/apache/storm/starter/WordCountTopology.java#L98" target="_blank"&gt;https://github.com/apache/storm/blob/1.0.x-branch/examples/storm-starter/src/jvm/org/apache/storm/starter/WordCountTopology.java#L98&lt;/A&gt;&lt;/P&gt;&lt;P&gt;Spark's approach on local mode is somewhat different. The allocation is controlled through the spark-master variable which can be set to local (or local[*], local[N] where N is a number). If local is specified executors will be started on your machine.&lt;/P&gt;&lt;P&gt;Both Storm and Spark has monitoring capabilities through a web interface. You can find details about them here:&lt;/P&gt;&lt;P&gt;&lt;A href="https://docs.hortonworks.com/HDPDocuments/HDP2/HDP-2.5.0/bk_storm-component-guide/content/using-storm-ui.html" target="_blank"&gt;https://docs.hortonworks.com/HDPDocuments/HDP2/HDP-2.5.0/bk_storm-component-guide/content/using-storm-ui.html&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&lt;A href="http://spark.apache.org/docs/latest/monitoring.html" target="_blank"&gt;http://spark.apache.org/docs/latest/monitoring.html&lt;/A&gt;&lt;/P&gt;&lt;P&gt;Yarn is not a requirement but an option for distributed mode, both Spark &amp;amp; Storm is able to function on their own.&lt;/P&gt;</description>
      <pubDate>Thu, 02 Feb 2017 19:39:34 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/stream-processing-runtimes/m-p/158001#M120400</guid>
      <dc:creator>tkiss</dc:creator>
      <dc:date>2017-02-02T19:39:34Z</dc:date>
    </item>
    <item>
      <title>Re: stream processing runtimes</title>
      <link>https://community.cloudera.com/t5/Support-Questions/stream-processing-runtimes/m-p/158002#M120401</link>
      <description>&lt;P&gt;Thanks &lt;A rel="user" href="https://community.cloudera.com/users/2707/tkiss.html" nodeid="2707"&gt;@Tibor Kiss&lt;/A&gt; - I am looking for more information around distributed mode, is there a name to the cluster managers in storm or spark stremaing.&lt;/P&gt;</description>
      <pubDate>Thu, 02 Feb 2017 19:58:13 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/stream-processing-runtimes/m-p/158002#M120401</guid>
      <dc:creator>avijeetd</dc:creator>
      <dc:date>2017-02-02T19:58:13Z</dc:date>
    </item>
    <item>
      <title>Re: stream processing runtimes</title>
      <link>https://community.cloudera.com/t5/Support-Questions/stream-processing-runtimes/m-p/158003#M120402</link>
      <description>&lt;P&gt;In Storm's nomenclature 'nimbus' is the cluster manager:&lt;/P&gt;&lt;P&gt;&lt;A href="http://storm.apache.org/releases/1.0.1/Setting-up-a-Storm-cluster.html"&gt;http://storm.apache.org/releases/1.0.1/Setting-up-a-Storm-cluster.html&lt;/A&gt;&lt;/P&gt;&lt;P&gt;Spark calls the cluster manager as 'master':&lt;/P&gt;&lt;P&gt;&lt;A href="http://spark.apache.org/docs/latest/spark-standalone.html"&gt;http://spark.apache.org/docs/latest/spark-standalone.html&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 02 Feb 2017 20:15:37 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/stream-processing-runtimes/m-p/158003#M120402</guid>
      <dc:creator>tkiss</dc:creator>
      <dc:date>2017-02-02T20:15:37Z</dc:date>
    </item>
    <item>
      <title>Re: stream processing runtimes</title>
      <link>https://community.cloudera.com/t5/Support-Questions/stream-processing-runtimes/m-p/158004#M120403</link>
      <description>&lt;P&gt;That's great &lt;A rel="user" href="https://community.cloudera.com/users/2707/tkiss.html" nodeid="2707"&gt;@Tibor Kiss&lt;/A&gt; - I am trying to run a spark streaming - how do I say to run on standalone cluster mode? &lt;/P&gt;</description>
      <pubDate>Fri, 03 Feb 2017 12:36:23 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/stream-processing-runtimes/m-p/158004#M120403</guid>
      <dc:creator>avijeetd</dc:creator>
      <dc:date>2017-02-03T12:36:23Z</dc:date>
    </item>
  </channel>
</rss>

