Support Questions

oumayma_ajengui · ‎04-13-2016

Hello everyone, I'm streaming data from twitter in json format by using apache flume and storing it then in HDFS, then i load json files from hdfs into an external hive table, but the problem is that tweets texts aren't classified, i need to extract only profils who are against the issue i'm working on , and i was wondering how can i use apache spark to extract these data from my hive table and then apply sentiment analysis algorithms on extraxted tweets texts.

Thanks.

nsabharwal · ‎04-15-2016

@oumayma ajengui

See this demo https://community.hortonworks.com/articles/7819/apache-zeppelin-hive-spark-demo.html

You will have to write your process to make it work for your usecase but above demo can be a good starting point.

View solution in original post

nsabharwal · ‎04-15-2016

@oumayma ajengui

See this demo https://community.hortonworks.com/articles/7819/apache-zeppelin-hive-spark-demo.html

You will have to write your process to make it work for your usecase but above demo can be a good starting point.

Cloudera Community

Support Questions

how to apply sentiment analysis algorithms on data stored in hive by using spark ?