Integration between Apache Pig, Apache NiFi and Apache Spark
- Labels: Apache NiFi, Apache Pig, Apache Spark
Created ‎06-17-2016 03:23 PM
What are the various ways to integrate Apache Pig, NiFi and Spark?
I know I can connect some of them through Kafka or via files.
Created ‎06-17-2016 03:30 PM
Hello Timothy,
There are multiple ways to integrate these three services. As a starting point, NiFi will probably handle your ingestion flow. Within that flow you could:
- publish your data to Kafka and have Spark read from it
- push your NiFi data directly to Spark: https://blogs.apache.org/nifi/entry/stream_processing_nifi_and_spark
- use an ExecuteScript processor to start a Pig job
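For the Kafka route, the Spark side could look roughly like the sketch below. It assumes Spark 2.x or later with the Kafka source package for Structured Streaming on the classpath, and that NiFi is already publishing to the topic. The broker address, the topic name, and both function names are placeholders of mine, not something from your setup.

```python
def kafka_source_options(bootstrap_servers, topic, starting_offsets="latest"):
    """Build the option map for Spark's Kafka source.

    bootstrap_servers and topic are placeholders for your own cluster.
    """
    return {
        "kafka.bootstrap.servers": bootstrap_servers,
        "subscribe": topic,
        "startingOffsets": starting_offsets,
    }


def read_nifi_topic(spark, bootstrap_servers, topic):
    """Return a streaming DataFrame of the records NiFi published to Kafka.

    `spark` is an existing SparkSession created by the caller.
    """
    options = kafka_source_options(bootstrap_servers, topic)
    raw = spark.readStream.format("kafka").options(**options).load()
    # Kafka delivers key and value as binary, so cast the payload to a string.
    return raw.selectExpr("CAST(value AS STRING) AS value")
```

You would call it as `read_nifi_topic(spark, "broker1:9092", "nifi-events")` and then attach whatever transformations and sink you need.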
In summary, you can use a push-and-forget connection, push to a service and pick the data up in the next flow, or, as a corner case, execute the job from within a processor.
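For the "execute in a processor" case, a small launcher script is one way to do it. The sketch below only assembles and runs a `pig` command line; it assumes the `pig` client is on the PATH of the NiFi host and that a processor able to run an external command (for example ExecuteStreamCommand or ExecuteProcess) invokes the script. The script path, parameter names, and function names are hypothetical.

```python
import subprocess


def build_pig_command(script_path, params=None, exec_type="mapreduce"):
    """Assemble the pig command line as an argument list.

    params is a dict of Pig script parameters, passed as -param name=value.
    """
    command = ["pig", "-x", exec_type]
    # Sort the parameters so the resulting command line is deterministic.
    for name, value in sorted((params or {}).items()):
        command += ["-param", "{}={}".format(name, value)]
    command += ["-f", script_path]
    return command


def run_pig_job(script_path, params=None, exec_type="mapreduce"):
    """Run the Pig script and return the exit code of the pig process."""
    return subprocess.call(build_pig_command(script_path, params, exec_type))
```

A non-zero return value from `run_pig_job` can then be used to route the flow to a failure path. Keep in mind that the processor blocks while the Pig job runs, which is why I would treat this as the corner case.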
Hope this gives you some insight.
