- Subscribe to RSS Feed
- Mark Question as New
- Mark Question as Read
- Float this Question for Current User
- Bookmark
- Subscribe
- Mute
- Printer Friendly Page
Can we ingest Rss feeds using apache nifi into hive?
- Labels:
-
Apache Hive
-
Apache NiFi
Created ‎01-22-2016 09:45 AM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Any thoughts on which processors to use in nifi for ingesting rss feeds into hive tables.
http://feeds.bbci.co.uk/news/world/rss.xml
thanks.
Created ‎01-22-2016 11:55 AM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Use this GetHTTP
http://www.nifi.rocks/getting-started-with-apache-nifi/
The URL we are going to grab a file from is XKCD’s rss feed, rss.xml. The GetHTTP procesor is simple to configure and just needs the URL property to be set to http://xkcd.com/rss.xml. Click on the value across from the URL property and enter http://xkcd.com/rss.xml.
Now drag down another processor, EvaluateXPath. Under the properties for this processor, set the following property-value pairs:
- Destination - flowfile-attribute
- Return Type - auto-detect
Created ‎01-22-2016 11:55 AM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Use this GetHTTP
http://www.nifi.rocks/getting-started-with-apache-nifi/
The URL we are going to grab a file from is XKCD’s rss feed, rss.xml. The GetHTTP procesor is simple to configure and just needs the URL property to be set to http://xkcd.com/rss.xml. Click on the value across from the URL property and enter http://xkcd.com/rss.xml.
Now drag down another processor, EvaluateXPath. Under the properties for this processor, set the following property-value pairs:
- Destination - flowfile-attribute
- Return Type - auto-detect
Created ‎01-22-2016 03:18 PM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Thank you @Neeraj Sabharwal. I ll give this a try. Why do we need EvaluateXPath here? Also i beleive at the end use puthdfs to route the data into hdfs. I beleive currently nifi doesnt have a processor to ingest directly into hive table i can see it has putsql. So i beleive best approach is to use puthdfs. Please let me know thanks..
Created ‎01-22-2016 03:20 PM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
@surender nath reddy kudumula I will put data in HDFS and create external Hive table on top of it.
Created ‎01-22-2016 06:21 PM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
thank you @Neeraj Sabharwal
Created ‎01-22-2016 07:38 PM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
@surender nath reddy kudumula As part of the best practice, please accept the best answer to close the thread.
Created ‎04-21-2016 08:14 PM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
@surender nath reddy kudumula - There is a JIRA actively being worked on to add Hive JDBC support to Nifi. https://issues.apache.org/jira/browse/NIFI-981
Created ‎12-17-2016 12:49 AM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Neeraj, thanks for your advice.
