Flume Installation and Streaming Twitter Data Using Flume

Contributor

I want to collect real-time tweets from Twitter by #hashtag.

I would like a step-by-step example of collecting real-time tweets using Flume and Java on Hortonworks 2.3.

2 REPLIES

Re: Flume Installation and Streaming Twitter Data Using Flume

Expert Contributor

1. Install Flume

2. Build a custom Twitter source.

You can use this project as a template: https://github.com/cloudera/cdh-twitter-example/tree/master/flume-sources

By default, it lets you filter the stream by keyword and saves the data in raw JSON format.

3. Adjust the config from the project above to use something like

TwitterAgent.sources.Twitter.keywords = #hashtag1, #hashtag2

4. Enjoy!
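
For step 3, a fuller agent config might look roughly like the sketch below. It is modelled on the sample config in the project linked above and assumes you built that project's source class (com.cloudera.flume.source.TwitterSource) and put the jar on Flume's classpath. The channel and sink names, the HDFS path, and the batch and capacity numbers are placeholders I picked for illustration, and the four credentials have to come from your own Twitter application.

```properties
# Sketch of a Flume agent config; names, paths, and sizes are assumptions.
TwitterAgent.sources = Twitter
TwitterAgent.channels = MemChannel
TwitterAgent.sinks = HDFS

# Custom source built from the cdh-twitter-example project
TwitterAgent.sources.Twitter.type = com.cloudera.flume.source.TwitterSource
TwitterAgent.sources.Twitter.channels = MemChannel
# Replace the placeholders with the keys of your own Twitter application
TwitterAgent.sources.Twitter.consumerKey = <your consumer key>
TwitterAgent.sources.Twitter.consumerSecret = <your consumer secret>
TwitterAgent.sources.Twitter.accessToken = <your access token>
TwitterAgent.sources.Twitter.accessTokenSecret = <your access token secret>
# Comma-separated keywords or hashtags to filter the stream
TwitterAgent.sources.Twitter.keywords = #hashtag1, #hashtag2

# In-memory channel between source and sink (sizes are placeholders)
TwitterAgent.channels.MemChannel.type = memory
TwitterAgent.channels.MemChannel.capacity = 10000
TwitterAgent.channels.MemChannel.transactionCapacity = 100

# HDFS sink writing the raw JSON as text (path is a placeholder)
TwitterAgent.sinks.HDFS.type = hdfs
TwitterAgent.sinks.HDFS.channel = MemChannel
TwitterAgent.sinks.HDFS.hdfs.path = /user/flume/tweets/%Y/%m/%d/%H/
TwitterAgent.sinks.HDFS.hdfs.fileType = DataStream
TwitterAgent.sinks.HDFS.hdfs.writeFormat = Text
TwitterAgent.sinks.HDFS.hdfs.batchSize = 1000
TwitterAgent.sinks.HDFS.hdfs.rollSize = 0
TwitterAgent.sinks.HDFS.hdfs.rollCount = 10000
```

An agent using a file like this would typically be started with flume-ng agent, passing the file via --conf-file and the agent name via --name TwitterAgent. The name has to match the prefix used in the config.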

Re: Flume Installation and Streaming Twitter Data Using Flume

Mentor

Hortonworks does not ship a Twitter source for Flume; you will have to compile your own. Here's a step-by-step guide for everything else: https://acadgild.com/blog/streaming-twitter-data-using-flume/
