Kafka Handler for CDH 6.1.1 (Hive 2.1.1)
Labels: Apache Hive, Apache Kafka
Created on ‎03-12-2021 05:48 AM - edited ‎03-12-2021 05:49 AM
We have a CDH 6.1.1 cluster and a Kafka cluster. I want to be able to select data from Kafka topics using Hive. Is there any way to do this? Where can I find a version of the Kafka storage handler for Hive 2.1.1?
Created ‎03-12-2021 09:00 AM
I believe this is what you are looking for:
https://blog.cloudera.com/introducing-hive-kafka-sql/
A more comprehensive doc: https://docs.cloudera.com/HDPDocuments/HDF3/HDF-3.5.1/kafka-hive-integration/hdf-kafka-hive-integrat...
You should not have to worry about versions, as the components are tested for cross-compatibility.
Created ‎03-12-2021 11:38 AM
Thank you for the reply. Is there an installation guide (which jars are needed, where to place them, etc.)?
I tried ADD JAR from the Hive CLI (not Beeline), but when I try to create a Kafka storage table it returns an error (Error: Error running query: java.lang.NoClassDefFoundError: org/apache/hadoop/hive/metastore/DefaultHiveMetaHook).
Created ‎03-16-2021 01:28 AM
Hello @d33play
I checked the documentation again and found an internal Jira, CDH-72683, that tracked Kafka Storage Handler support for CDH.
The Kafka Storage Handler is part of Hive 3.x, but CDH 6.1.1 ships with Apache Hive 2.1.1.
Hence it is not supported in CDH, but it is available in CDP:
https://docs.cloudera.com/runtime/7.2.0/release-notes/topics/rt-runtime-component-versions.html
So the KafkaStorageHandler package will not work with Hive 2.1.1.
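The NoClassDefFoundError reported earlier in the thread is consistent with this: the handler jar appears to reference a metastore class that the Hive 2.1.1 libraries on the cluster do not provide. If you want to confirm that on a node, here is a minimal Python sketch that scans a directory of jars for that class. The function name `jars_containing` is made up for illustration, and the lib path in the usage part is an assumption about a parcel-based install, so adjust it to your layout.

```python
import zipfile
from pathlib import Path

# Class named in the NoClassDefFoundError from this thread, as a jar entry path.
MISSING_CLASS = "org/apache/hadoop/hive/metastore/DefaultHiveMetaHook.class"


def jars_containing(lib_dir, class_entry=MISSING_CLASS):
    """Return the names of jars directly under lib_dir that contain class_entry."""
    hits = []
    for jar in sorted(Path(lib_dir).glob("*.jar")):
        try:
            with zipfile.ZipFile(jar) as zf:
                if class_entry in zf.namelist():
                    hits.append(jar.name)
        except (zipfile.BadZipFile, OSError):
            # Skip unreadable files, broken symlinks, and invalid archives.
            continue
    return hits


if __name__ == "__main__":
    # Assumed path; the real Hive lib directory depends on the installation.
    hive_lib = "/opt/cloudera/parcels/CDH/lib/hive/lib"
    found = jars_containing(hive_lib)
    if found:
        print("Class found in:", ", ".join(found))
    else:
        print("DefaultHiveMetaHook not found under", hive_lib)
```

An empty result for the Hive lib directory would match the error above. It only shows that the class is absent; it does not make the handler usable on Hive 2.1.1.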
Hope this information helps.
Thanks and Regards,
Nandini
