Support Questions

Find answers, ask questions, and share your expertise
Announcements
Celebrating as our community reaches 100,000 members! Thank you!

Does anyone know how can we stream Twitter data of specific Twitter accounts using Apache Flume ?

avatar
New Contributor
 
1 ACCEPTED SOLUTION

avatar
Super Collaborator

@azza messaoudi, check the following Twitter doc: https://dev.twitter.com/streaming/reference/post/statuses/filter

And here is the custom Flume source implementation with support of all twitter streaming parameters: http://www.dataprocessingtips.com/2016/04/24/custom-twitter-source-for-apache-flume/

(including "follow" parameter which you're interested in actually)

View solution in original post

2 REPLIES 2

avatar

Hi @azza messaoudi sorry this isn't a direct answer to your question, but unless you REALLY want to use Flume for this, have you looked at NiFi?

There is already a GetTwitter processor that you can configure for search terms etc, and there's a number of really good demos to show how you can put all this together.

A 3 part demo series here:

https://www.linkedin.com/pulse/apache-nifi-part-1-introduction-neeraj-sabharwal

... and a full tutorial including indexing in solr here: http://hortonworks.com/hadoop-tutorial/how-to-refine-and-visualize-sentiment-data/

I know that's not exactly what you asked for, but I promise that it's absolutely the easiest way to get twitter information into Hadoop that I've ever used.

Hope that helps.

avatar
Super Collaborator

@azza messaoudi, check the following Twitter doc: https://dev.twitter.com/streaming/reference/post/statuses/filter

And here is the custom Flume source implementation with support of all twitter streaming parameters: http://www.dataprocessingtips.com/2016/04/24/custom-twitter-source-for-apache-flume/

(including "follow" parameter which you're interested in actually)