Archives of Support Questions (Read Only)

This is an archived board for historical reference. Information and links may no longer be available or relevant
Announcements
This board is archived and read-only for historical reference. To ask a new question, please post a new topic on the appropriate active board.

Does hive supports query from multiple sources?

avatar
Expert Contributor

I was reading about the presto, where a single Presto query can process data from multiple sources e.g. HDFS, MySQL, Cassandra or even Kafka. Presto, where you can define objects called 'catalogs' which can point to remote data sources.

Do we have such mechanism in Hive to process data from multiple sources?

Also can we access another hive table(from remote source) in same beeline connection?

1 ACCEPTED SOLUTION

avatar
Super Guru

@nyadav

no Hive don't have capability to query other data source until its storage handler is defined. hive has concept of native and non native tables, for native tables it know how to manage it but for non native table it dont have a capability until it has not storage handler. to know more of storage handler you can refer this doc https://cwiki.apache.org/confluence/display/Hive/StorageHandlers

View solution in original post

3 REPLIES 3

avatar
Super Guru

@nyadav

no Hive don't have capability to query other data source until its storage handler is defined. hive has concept of native and non native tables, for native tables it know how to manage it but for non native table it dont have a capability until it has not storage handler. to know more of storage handler you can refer this doc https://cwiki.apache.org/confluence/display/Hive/StorageHandlers

avatar
Expert Contributor

Good info @Rajkumar Singh, As HBase provides a storage handler in hive. What all storage handlers do we have for hive? as per the doc Cassandra, JDBC, MongoDB, and Google Spreadsheets.

avatar
Super Guru