Archives of Support Questions (Read Only)

This is an archived board for historical reference. Information and links may no longer be available or relevant
Announcements
This board is archived and read-only for historical reference. To ask a new question, please post a new topic on the appropriate active board.

how to apply sentiment analysis algorithms on data stored in hive by using spark ?

avatar

Hello everyone, I'm streaming data from twitter in json format by using apache flume and storing it then in HDFS, then i load json files from hdfs into an external hive table, but the problem is that tweets texts aren't classified, i need to extract only profils who are against the issue i'm working on , and i was wondering how can i use apache spark to extract these data from my hive table and then apply sentiment analysis algorithms on extraxted tweets texts.

Thanks.

1 ACCEPTED SOLUTION

avatar
Master Mentor
@oumayma ajengui

See this demo https://community.hortonworks.com/articles/7819/apache-zeppelin-hive-spark-demo.html

You will have to write your process to make it work for your usecase but above demo can be a good starting point.

View solution in original post

1 REPLY 1

avatar
Master Mentor
@oumayma ajengui

See this demo https://community.hortonworks.com/articles/7819/apache-zeppelin-hive-spark-demo.html

You will have to write your process to make it work for your usecase but above demo can be a good starting point.