Using NiFi, Storm and Kafka to analyze Twitter data
Labels: Apache Ambari
vjain (Guru)
Created on 02-02-2017 04:26 PM
Repo Description
This demo is built for the Hortonworks HDP 2.3 Sandbox and is based on the Hortonworks Twitter Demo.
Purpose: Monitor the Twitter stream for the provided hashtags and act on unexpected increases in tweet volume
Ingest: Listen for Twitter streams related to the hashtags entered in the NiFi Garden Hose (GetHTTP) processor
Processing:
Monitor tweets for unexpected volume
Volume thresholds managed in HBase
Persistence:
HDFS (for future batch processing)
Hive (for interactive query)
HBase (for real-time alerts)
Solr/Banana (for search and reports/dashboards)
Refine:
Update threshold values based on historical analysis of tweet volumes
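The Processing and Refine steps above can be illustrated with a minimal Python sketch. This is not the code from the repository: the function names (`count_hashtags`, `find_alerts`, `refine_threshold`), the plain dictionary standing in for the HBase threshold table, and the mean-plus-k-standard-deviations rule are all assumptions made for illustration.

```python
from collections import Counter
from statistics import mean, pstdev


def count_hashtags(tweets, tracked):
    """Count how many tweets in one time window mention each tracked hashtag."""
    tracked_lower = {tag.lower() for tag in tracked}
    counts = Counter()
    for text in tweets:
        # Normalize each word: lowercase and strip trailing punctuation.
        words = {w.strip(".,!?:;").lower() for w in text.split()}
        for tag in tracked_lower & words:
            counts[tag] += 1
    return counts


def find_alerts(counts, thresholds):
    """Return the hashtags whose window count exceeds the stored threshold."""
    return sorted(
        tag for tag, n in counts.items()
        if tag in thresholds and n > thresholds[tag]
    )


def refine_threshold(history, k=3.0):
    """Derive a new threshold from past window counts.

    Assumed rule (not taken from the repo): mean + k * population std dev.
    """
    return mean(history) + k * pstdev(history)


if __name__ == "__main__":
    # Hypothetical thresholds; in the demo these would live in HBase.
    thresholds = {"#hadoop": 2, "#nifi": 5}
    window = [
        "Loving #Hadoop today",
        "#hadoop summit!",
        "#NiFi rocks",
        "more #hadoop.",
    ]
    counts = count_hashtags(window, thresholds)
    print(find_alerts(counts, thresholds))
    print(refine_threshold([8, 12], k=1))
```

In the demo itself the counting would presumably happen in the Storm topology over tweets delivered through Kafka, with the thresholds read from and written back to HBase; the sketch only shows the comparison and update logic in isolation.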
Demo setup:
Either download and start the prebuilt VM, or start the HDP 2.3 sandbox and run the provided scripts to set up the demo.
Repo Info
GitHub repo URL: https://github.com/vedantja/hdp_nifi_twitter_demo
GitHub account name: vedantja
Repo name: hdp_nifi_twitter_demo
Version history
Last update: 02-02-2017 04:26 PM
Updated by: vjain
Contributors: vjain