<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: When to use Nifi PutHDFS and when to use Nifi+Kafka+storm ? in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/When-to-use-Nifi-PutHDFS-and-when-to-use-Nifi-Kafka-storm/m-p/134570#M51874</link>
    <description>&lt;P&gt;Hi &lt;A rel="user" href="https://community.cloudera.com/users/15487/toandyliang.html" nodeid="15487"&gt;@Andy Liang&lt;/A&gt;,&lt;/P&gt;&lt;P&gt;The use of Kafka and Storm will generally occur when you need to perform complex operations on your data before pushing the data in your HDP cluster (operations that cannot be performed by NiFi). Such operations can be, for example, window aggregations, complex joins, etc.&lt;/P&gt;&lt;P&gt;If you don't need to perform such operations before your data land in the HDP cluster, then you can use NiFi + PutHDFS.&lt;/P&gt;&lt;P&gt;Hope this helps.&lt;/P&gt;</description>
    <pubDate>Wed, 18 Jan 2017 22:52:37 GMT</pubDate>
    <dc:creator>pvillard</dc:creator>
    <dc:date>2017-01-18T22:52:37Z</dc:date>
    <item>
      <title>When to use Nifi PutHDFS and when to use Nifi+Kafka+storm ?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/When-to-use-Nifi-PutHDFS-and-when-to-use-Nifi-Kafka-storm/m-p/134569#M51873</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;I am new to Nifi.   I just wonder when should I use Nifi  writing direct to HDFS via PutHDFS and When should I use Nifi+kafka+storm?  What's the difference?     Could I do data manipulation on Nifi instead of storm?&lt;/P&gt;&lt;P&gt;Thanks&lt;/P&gt;&lt;P&gt;Andy&lt;/P&gt;</description>
      <pubDate>Wed, 18 Jan 2017 22:49:21 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/When-to-use-Nifi-PutHDFS-and-when-to-use-Nifi-Kafka-storm/m-p/134569#M51873</guid>
      <dc:creator>toandyliang</dc:creator>
      <dc:date>2017-01-18T22:49:21Z</dc:date>
    </item>
    <item>
      <title>Re: When to use Nifi PutHDFS and when to use Nifi+Kafka+storm ?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/When-to-use-Nifi-PutHDFS-and-when-to-use-Nifi-Kafka-storm/m-p/134570#M51874</link>
      <description>&lt;P&gt;Hi &lt;A rel="user" href="https://community.cloudera.com/users/15487/toandyliang.html" nodeid="15487"&gt;@Andy Liang&lt;/A&gt;,&lt;/P&gt;&lt;P&gt;The use of Kafka and Storm will generally occur when you need to perform complex operations on your data before pushing the data in your HDP cluster (operations that cannot be performed by NiFi). Such operations can be, for example, window aggregations, complex joins, etc.&lt;/P&gt;&lt;P&gt;If you don't need to perform such operations before your data land in the HDP cluster, then you can use NiFi + PutHDFS.&lt;/P&gt;&lt;P&gt;Hope this helps.&lt;/P&gt;</description>
      <pubDate>Wed, 18 Jan 2017 22:52:37 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/When-to-use-Nifi-PutHDFS-and-when-to-use-Nifi-Kafka-storm/m-p/134570#M51874</guid>
      <dc:creator>pvillard</dc:creator>
      <dc:date>2017-01-18T22:52:37Z</dc:date>
    </item>
    <item>
      <title>Re: When to use Nifi PutHDFS and when to use Nifi+Kafka+storm ?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/When-to-use-Nifi-PutHDFS-and-when-to-use-Nifi-Kafka-storm/m-p/134571#M51875</link>
      <description>&lt;P&gt;Thank you very much for your quick response, Pierre.   &lt;/P&gt;&lt;P&gt;Thanks for the tutorial on your blog too.  I am reading your nifi &amp;amp; dropbox example now.&lt;/P&gt;&lt;P&gt;Andy&lt;/P&gt;</description>
      <pubDate>Wed, 18 Jan 2017 23:50:23 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/When-to-use-Nifi-PutHDFS-and-when-to-use-Nifi-Kafka-storm/m-p/134571#M51875</guid>
      <dc:creator>toandyliang</dc:creator>
      <dc:date>2017-01-18T23:50:23Z</dc:date>
    </item>
    <item>
      <title>Re: When to use Nifi PutHDFS and when to use Nifi+Kafka+storm ?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/When-to-use-Nifi-PutHDFS-and-when-to-use-Nifi-Kafka-storm/m-p/134572#M51876</link>
      <description>&lt;P&gt;Hi &lt;A rel="user" href="https://community.cloudera.com/users/15487/toandyliang.html" nodeid="15487"&gt;@Andy Liang&lt;/A&gt;,&lt;/P&gt;&lt;P&gt;In addition to &lt;A rel="user" href="https://community.cloudera.com/users/5078/pvillard.html" nodeid="5078"&gt;@Pierre Villard&lt;/A&gt;'s answer.  There are three aspects of data processing joined up here:&lt;/P&gt;&lt;H4&gt;&lt;STRONG&gt;Streaming - Simple Event Processing&lt;/STRONG&gt;&lt;/H4&gt;&lt;P&gt;&lt;STRONG&gt;&lt;/STRONG&gt;This is what NiFi is very good at. All the information needed to do the processing is contained in the event. For example:&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;Log processing: If the log contains an error then separate from the flow and send an email alert&lt;/LI&gt;&lt;LI&gt;Transformation: Our legacy system uses XML but we want to use AVRO. Convert each XML event to AVRO&lt;/LI&gt;&lt;/UL&gt;&lt;H4&gt;&lt;STRONG&gt;Streaming - Complex Event Processing&lt;/STRONG&gt;&lt;/H4&gt;&lt;P&gt;This is what Storm is good at covered by Pierre.&lt;/P&gt;&lt;H4&gt;&lt;STRONG&gt;Batch&lt;/STRONG&gt;&lt;/H4&gt;&lt;P&gt;This is where MR/Hive/Spark (not spark streaming) come in. Land on HDFS and then the data can be processed and/or explored.&lt;/P&gt;</description>
      <pubDate>Wed, 18 Jan 2017 23:57:59 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/When-to-use-Nifi-PutHDFS-and-when-to-use-Nifi-Kafka-storm/m-p/134572#M51876</guid>
      <dc:creator>scarroll</dc:creator>
      <dc:date>2017-01-18T23:57:59Z</dc:date>
    </item>
    <item>
      <title>Re: When to use Nifi PutHDFS and when to use Nifi+Kafka+storm ?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/When-to-use-Nifi-PutHDFS-and-when-to-use-Nifi-Kafka-storm/m-p/134573#M51877</link>
      <description>&lt;P&gt;Thank you @Sebastian Carroll for the detail explaination.&lt;/P&gt;</description>
      <pubDate>Fri, 20 Jan 2017 01:26:41 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/When-to-use-Nifi-PutHDFS-and-when-to-use-Nifi-Kafka-storm/m-p/134573#M51877</guid>
      <dc:creator>toandyliang</dc:creator>
      <dc:date>2017-01-20T01:26:41Z</dc:date>
    </item>
  </channel>
</rss>

