<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Integration between Apache Pig, Apache Nifi and Apache Spark in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/Integration-between-Apache-Pig-Apache-Nifi-and-Apache-Spark/m-p/145081#M107653</link>
    <description>&lt;P&gt;Hello Timothy,&lt;/P&gt;&lt;P&gt;There are multiple ways to integrate these three services. As a starting point, NiFi will probably be your ingestion flow. During this flow you could:&lt;/P&gt;&lt;P&gt;- put your data into Kafka and have Spark read from it&lt;/P&gt;&lt;P&gt;- push your NiFi data to Spark: &lt;A href="https://blogs.apache.org/nifi/entry/stream_processing_nifi_and_spark"&gt;https://blogs.apache.org/nifi/entry/stream_processing_nifi_and_spark&lt;/A&gt;&lt;/P&gt;&lt;P&gt;- use an ExecuteScript processor to start a Pig job&lt;/P&gt;&lt;P&gt;In summary, you can have a push-and-forget connection, you can push to a service and pick the data up in the next flow, or, as a corner case, you can even execute the job from within a processor.&lt;/P&gt;&lt;P&gt;Hope this gives some insight.&lt;/P&gt;</description>
    <pubDate>Fri, 17 Jun 2016 22:30:35 GMT</pubDate>
    <dc:creator>nmaillard1</dc:creator>
    <dc:date>2016-06-17T22:30:35Z</dc:date>
    <item>
      <title>Integration between Apache Pig, Apache Nifi and Apache Spark</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Integration-between-Apache-Pig-Apache-Nifi-and-Apache-Spark/m-p/145080#M107652</link>
      <description>&lt;P&gt;What are the various ways to integrate Apache Pig, NiFi and Spark?&lt;/P&gt;&lt;P&gt;I know I can connect some of them through Kafka or via files.&lt;/P&gt;</description>
      <pubDate>Fri, 17 Jun 2016 22:23:48 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Integration-between-Apache-Pig-Apache-Nifi-and-Apache-Spark/m-p/145080#M107652</guid>
      <dc:creator>TimothySpann</dc:creator>
      <dc:date>2016-06-17T22:23:48Z</dc:date>
    </item>
    <item>
      <title>Re: Integration between Apache Pig, Apache Nifi and Apache Spark</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Integration-between-Apache-Pig-Apache-Nifi-and-Apache-Spark/m-p/145081#M107653</link>
      <description>&lt;P&gt;Hello Timothy,&lt;/P&gt;&lt;P&gt;There are multiple ways to integrate these three services. As a starting point, NiFi will probably be your ingestion flow. During this flow you could:&lt;/P&gt;&lt;P&gt;- put your data into Kafka and have Spark read from it&lt;/P&gt;&lt;P&gt;- push your NiFi data to Spark: &lt;A href="https://blogs.apache.org/nifi/entry/stream_processing_nifi_and_spark"&gt;https://blogs.apache.org/nifi/entry/stream_processing_nifi_and_spark&lt;/A&gt;&lt;/P&gt;&lt;P&gt;- use an ExecuteScript processor to start a Pig job&lt;/P&gt;&lt;P&gt;In summary, you can have a push-and-forget connection, you can push to a service and pick the data up in the next flow, or, as a corner case, you can even execute the job from within a processor.&lt;/P&gt;&lt;P&gt;Hope this gives some insight.&lt;/P&gt;</description>
      <pubDate>Fri, 17 Jun 2016 22:30:35 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Integration-between-Apache-Pig-Apache-Nifi-and-Apache-Spark/m-p/145081#M107653</guid>
      <dc:creator>nmaillard1</dc:creator>
      <dc:date>2016-06-17T22:30:35Z</dc:date>
    </item>
  </channel>
</rss>

