Hive Streaming in Spark

Is it valid to use hive-streaming inside a Spark program? I have seen examples of Hive streaming as a standalone program, and of Spark Streaming writing to Hive, but I have never seen a program where hive-streaming is used inside a Spark application and submitted to a cluster. Does Hive streaming work inside a Spark application, or is this a totally wrong usage? Please share your thoughts.

4 REPLIES

Explorer

A better option would be to use NiFi for Hive streaming. NiFi has a pre-built processor for streaming data into Hive. See the post below for an example:

https://community.hortonworks.com/articles/52856/stream-data-into-hive-like-a-king-using-nifi.html

Has anybody used hive-streaming inside Spark and deployed it in a cluster? Is this a correct or a wrong usage?

Has anybody used hive-streaming inside Spark and deployed it in a cluster? Is this a correct or a wrong usage? Is there any URL that shows hive-streaming being used inside a Spark program in cluster mode?

Expert Contributor

The Hive streaming API is a Java library, so it should be possible to use it from any Java process, and a Spark executor is one.
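
To make that concrete, here is a rough sketch of the shape such code might take. It is not tested against a cluster and uses no Hive or Spark classes: `TxnSink` and `InMemorySink` are hypothetical stand-ins I made up so the example runs on its own. The assumption is that in a real job `writePartition` would be called from Spark's `foreachPartition`, so that each executor opens its own connection instead of trying to serialize one from the driver, and that the sink would wrap the Hive streaming classes (for example `HiveEndPoint` and `TransactionBatch` in the hive-hcatalog-streaming library).

```java
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Iterator;
import java.util.List;
import java.util.function.Supplier;

// Sketch only: TxnSink and InMemorySink are hypothetical stand-ins, not Hive classes.
class HiveStreamingSketch {

    /** Hypothetical interface mirroring a transactional writer's lifecycle. */
    interface TxnSink extends AutoCloseable {
        void begin();
        void write(byte[] record);
        void commit();
        @Override void close();
    }

    /** In-memory stand-in so the sketch runs without a cluster. */
    static final class InMemorySink implements TxnSink {
        final List<String> committed = new ArrayList<>();
        private final List<String> pending = new ArrayList<>();
        int commits = 0;
        boolean closed = false;

        @Override public void begin() { pending.clear(); }
        @Override public void write(byte[] record) {
            pending.add(new String(record, StandardCharsets.UTF_8));
        }
        @Override public void commit() {
            committed.addAll(pending);
            pending.clear();
            commits++;
        }
        @Override public void close() { closed = true; }
    }

    /**
     * Work intended to run once per partition: open one sink, write the
     * records in transactions of at most batchSize rows, then close the sink.
     * Returns the number of records written.
     */
    static int writePartition(Iterator<String> records,
                              Supplier<TxnSink> sinkFactory,
                              int batchSize) {
        int written = 0;
        try (TxnSink sink = sinkFactory.get()) {
            int inTxn = 0;
            while (records.hasNext()) {
                if (inTxn == 0) {
                    sink.begin();               // start a new transaction
                }
                sink.write(records.next().getBytes(StandardCharsets.UTF_8));
                written++;
                inTxn++;
                if (inTxn == batchSize) {       // transaction is full, commit it
                    sink.commit();
                    inTxn = 0;
                }
            }
            if (inTxn > 0) {
                sink.commit();                  // commit the final partial transaction
            }
        }
        return written;
    }

    public static void main(String[] args) {
        InMemorySink sink = new InMemorySink();
        Iterator<String> rows = Arrays.asList("1,alice", "2,bob", "3,carol").iterator();
        int n = writePartition(rows, () -> sink, 2);
        // prints "3 records, 2 commits, closed=true"
        System.out.println(n + " records, " + sink.commits + " commits, closed=" + sink.closed);
    }
}
```

The point of passing a factory rather than a ready-made sink is that the connection gets created on the executor, inside the partition, which is the usual way to handle non-serializable clients in Spark.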