<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: [HDF-3.0] Difference between Nifi and Stream builder module of Streaming analytics manager in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/HDF-3-0-Difference-between-Nifi-and-Stream-builder-module-of/m-p/201483#M163494</link>
    <description>&lt;A rel="user" href="https://community.cloudera.com/users/19209/quentintoulou.html" nodeid="19209"&gt;@Quentin T&lt;/A&gt;&lt;P&gt;Here is a link to a tutorial that will help show how these tools work together and what role they play.&lt;/P&gt;&lt;P&gt;&lt;A href="https://hortonworks.com/tutorial/real-time-event-processing-in-nifi-sam-schema-registry-and-superset/"&gt;REAL-TIME EVENT PROCESSING IN NIFI, SAM, SCHEMA REGISTRY AND SUPERSET&lt;/A&gt;&lt;/P&gt;</description>
    <pubDate>Sat, 29 Jul 2017 01:50:33 GMT</pubDate>
    <dc:creator>Wynner</dc:creator>
    <dc:date>2017-07-29T01:50:33Z</dc:date>
    <item>
      <title>[HDF-3.0] Difference between Nifi and Stream builder module of Streaming analytics manager</title>
      <link>https://community.cloudera.com/t5/Support-Questions/HDF-3-0-Difference-between-Nifi-and-Stream-builder-module-of/m-p/201482#M163493</link>
      <description>&lt;P&gt;With the new version of HDF (3.0), we find Nifi and the new application : Streaming analystics manager. What is the main difference between Nifi and the stream builder module of SAM ? How can/should we use these two applications together ?&lt;/P&gt;&lt;P&gt;After a quick time to read the description of this module, it seems it's really close to Nifi.&lt;/P&gt;&lt;P&gt;Thank you !&lt;/P&gt;</description>
      <pubDate>Fri, 28 Jul 2017 20:15:47 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/HDF-3-0-Difference-between-Nifi-and-Stream-builder-module-of/m-p/201482#M163493</guid>
      <dc:creator>quentin_toulou</dc:creator>
      <dc:date>2017-07-28T20:15:47Z</dc:date>
    </item>
    <item>
      <title>Re: [HDF-3.0] Difference between Nifi and Stream builder module of Streaming analytics manager</title>
      <link>https://community.cloudera.com/t5/Support-Questions/HDF-3-0-Difference-between-Nifi-and-Stream-builder-module-of/m-p/201483#M163494</link>
      <description>&lt;A rel="user" href="https://community.cloudera.com/users/19209/quentintoulou.html" nodeid="19209"&gt;@Quentin T&lt;/A&gt;&lt;P&gt;Here is a link to a tutorial that will help show how these tools work together and what role they play.&lt;/P&gt;&lt;P&gt;&lt;A href="https://hortonworks.com/tutorial/real-time-event-processing-in-nifi-sam-schema-registry-and-superset/"&gt;REAL-TIME EVENT PROCESSING IN NIFI, SAM, SCHEMA REGISTRY AND SUPERSET&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Sat, 29 Jul 2017 01:50:33 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/HDF-3-0-Difference-between-Nifi-and-Stream-builder-module-of/m-p/201483#M163494</guid>
      <dc:creator>Wynner</dc:creator>
      <dc:date>2017-07-29T01:50:33Z</dc:date>
    </item>
    <item>
      <title>Re: [HDF-3.0] Difference between Nifi and Stream builder module of Streaming analytics manager</title>
      <link>https://community.cloudera.com/t5/Support-Questions/HDF-3-0-Difference-between-Nifi-and-Stream-builder-module-of/m-p/201484#M163495</link>
      <description>&lt;P&gt;Both are similar in their awesome drag-and-drop UI to process data in motion,  However, they differ fundamentally in purpose and underlying technology.&lt;/P&gt;&lt;H4&gt;Differences&lt;/H4&gt;&lt;P&gt;&lt;EM&gt;&lt;STRONG&gt;Purpose&lt;/STRONG&gt;&lt;/EM&gt;&lt;/P&gt;&lt;P&gt;NiFi is meant for data flow management while Streaming Analytics Manager (SAM) is meant for advanced (complex) real-time analytics.  In general, for NiFi think acquiring, transforming and routing data to target destinations and for SAM think complex analytics on data as it is flowing across the wire.&lt;/P&gt;&lt;P&gt;Here is a more detailed comparison between flow management (NiFI) and stream analytics (SAM)&lt;/P&gt;&lt;TABLE&gt;&lt;TBODY&gt;&lt;TR&gt;&lt;TD&gt;&lt;/TD&gt;&lt;TD&gt;Flow Management (NiFi)&lt;/TD&gt;&lt;TD&gt;Stream Analytics (SAM)&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;data velocity&lt;/TD&gt;&lt;TD&gt;batch, microbatch or streaming (from diverse sources)&lt;/TD&gt;&lt;TD&gt;streaming (from diverse sources)&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;data size (per content)&lt;/TD&gt;&lt;TD&gt;small (kb) to large (GB) &lt;/TD&gt;&lt;TD&gt;small (KB, MB) per message in stream&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;data manipulation&lt;/TD&gt;&lt;TD&gt;rich: parse, filter, join, transform, enrich, reformat&lt;/TD&gt;&lt;TD&gt;minimal changes to data&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;data flow management&lt;/TD&gt;&lt;TD&gt;powerful: queue prioritization, back pressure, route/merge, persist to target&lt;/TD&gt;&lt;TD&gt;minimal: mostly route/merge and persist to target&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;real-time analytics&lt;/TD&gt;&lt;TD&gt;basic&lt;/TD&gt;&lt;TD&gt;powerful&lt;/TD&gt;&lt;/TR&gt;&lt;/TBODY&gt;&lt;/TABLE&gt;&lt;P&gt;So NiFi is great to manage the movement of data from diverse sources (from small sensors, ftp locations, relational databases, rest apis in the cloud, and so on) to similar targets while modifying and making decisions on the data in between.  SAM is great at watching real-time streams of data and doing advanced analytics (dashboarding/visualizations, alerting, predictions, etc) as it flows by.  &lt;/P&gt;&lt;P&gt;&lt;EM&gt;&lt;STRONG&gt;Technology&lt;/STRONG&gt;&lt;/EM&gt;&lt;/P&gt;&lt;P&gt;NiFi is built around processors and connections with repositories underneath.  SAM is built on top of Storm and Kafka (and Druid).  &lt;/P&gt;&lt;H4&gt;Shared&lt;/H4&gt;&lt;P&gt;What do they have in common? Both have easy UI development that hides complexity underneath.  Both are components of Hortonworks Data Flow (HDF) distribution.  Both share Kafka (see below).  Both are managed by the Ambari (admin and monitoring) and Ranger (authorization and security).  Both can use the same Schema Registry to work with data structure of content.&lt;/P&gt;&lt;H4&gt;Do they connect?&lt;/H4&gt;&lt;P&gt;A very common pattern is this: stream data using NiFi (and possibly filter, transform, enrich) and pass it to a Kafka queue to make it durable (persistent until consumed).  SAM pulls from the queue (subscribes to a topic) and does advanced analytics from there (dashboarding/visualizations, alerting, predictions, etc).  SAM pushes to hadoop (HBase or Hive) to persist for further historical analysis and exploration (data science, business intelligence, etc)  Tutorial mentioned by &lt;A rel="user" href="https://community.cloudera.com/users/286/dwynne.html" nodeid="286"&gt;@Wynner&lt;/A&gt; is an excellent example of this pattern and the separate strengths of NiFi and SAM.&lt;/P&gt;</description>
      <pubDate>Sat, 29 Jul 2017 03:15:45 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/HDF-3-0-Difference-between-Nifi-and-Stream-builder-module-of/m-p/201484#M163495</guid>
      <dc:creator>gkeys</dc:creator>
      <dc:date>2017-07-29T03:15:45Z</dc:date>
    </item>
    <item>
      <title>Re: [HDF-3.0] Difference between Nifi and Stream builder module of Streaming analytics manager</title>
      <link>https://community.cloudera.com/t5/Support-Questions/HDF-3-0-Difference-between-Nifi-and-Stream-builder-module-of/m-p/201485#M163496</link>
      <description>&lt;P&gt;Thanks a lot for this detailed response ! &lt;A rel="user" href="https://community.cloudera.com/users/11288/gkeys.html" nodeid="11288"&gt;@Greg Keys&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 31 Jul 2017 15:00:38 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/HDF-3-0-Difference-between-Nifi-and-Stream-builder-module-of/m-p/201485#M163496</guid>
      <dc:creator>quentin_toulou</dc:creator>
      <dc:date>2017-07-31T15:00:38Z</dc:date>
    </item>
  </channel>
</rss>

