Created 09-15-2016 05:34 AM
Created 09-15-2016 09:22 AM
Your question is very broad, so if you need more info please be specific.
Based on the question tags I'm going to assume you're asking about NiFi integration with Spark, Kafka, Storm, and Solr.
The short answer is yes - we can integrate NiFi with other Apache Software 🙂
NiFi provides an easy way to stream data between different systems, and has in-built processors for dealing with most of the common Apache stack.
Kafka
NiFi has in-built processors to stream into and read data from Kafka:
I'd suggest checking out the following tutorial to get you started:
http://hortonworks.com/hadoop-tutorial/realtime-event-processing-nifi-kafka-storm/
Solr
NiFi has a PutSolrContentStream processor that allows you to stream data directly into a Solr index. Check out the following tutorial that uses NiFi to index twitter data directly into Solr and visualises it using banana:
Spark
Spark doesn't supply a mechanism to have data pushed to it, and instead pulls from other sources. You can integrate NiFi directly with Spark Streaming by exposing an Output Port in NiFi that Spark can consume from. The following article explains how to set up this integration:
https://community.hortonworks.com/articles/12708/nifi-feeding-data-to-spark-streaming.html
Note that typical streaming architecture involves NiFi pushing data to Kafka, and Spark Streaming (or Storm) reading from Kafka.
Storm
As above, typically NiFi is integrated with Storm with Kafka acting as the message buffer. The tutorial realtime event processing tutorial linked above covers the details of building a streaming application in NiFi Kafka & Storm.
Created 09-15-2016 09:20 AM
Hi,
Yes NiFi, by default, offers a very large number of processors to interact with other Apache softwares. Here is a non exhaustive list: Spark, Storm, Hive, HBase, Cassandra, Solr, Ignite, HDFS, Kafka, etc. You have the full list of available processors here:
Created 09-15-2016 09:22 AM
Your question is very broad, so if you need more info please be specific.
Based on the question tags I'm going to assume you're asking about NiFi integration with Spark, Kafka, Storm, and Solr.
The short answer is yes - we can integrate NiFi with other Apache Software 🙂
NiFi provides an easy way to stream data between different systems, and has in-built processors for dealing with most of the common Apache stack.
Kafka
NiFi has in-built processors to stream into and read data from Kafka:
I'd suggest checking out the following tutorial to get you started:
http://hortonworks.com/hadoop-tutorial/realtime-event-processing-nifi-kafka-storm/
Solr
NiFi has a PutSolrContentStream processor that allows you to stream data directly into a Solr index. Check out the following tutorial that uses NiFi to index twitter data directly into Solr and visualises it using banana:
Spark
Spark doesn't supply a mechanism to have data pushed to it, and instead pulls from other sources. You can integrate NiFi directly with Spark Streaming by exposing an Output Port in NiFi that Spark can consume from. The following article explains how to set up this integration:
https://community.hortonworks.com/articles/12708/nifi-feeding-data-to-spark-streaming.html
Note that typical streaming architecture involves NiFi pushing data to Kafka, and Spark Streaming (or Storm) reading from Kafka.
Storm
As above, typically NiFi is integrated with Storm with Kafka acting as the message buffer. The tutorial realtime event processing tutorial linked above covers the details of building a streaming application in NiFi Kafka & Storm.