Created 05-04-2016 09:14 AM
Created 05-04-2016 09:21 AM
As far as I know, Hive can read only the current (most recent) data version in HBase. Only when using HBase API or hbase shell you can also read all versions, or only the ones in a specific time interval.
From the Hive HBase integration document: there is currently no way to access the HBase timestamp attribute, and queries always access data with the latest timestamp.
Created 05-04-2016 09:21 AM
As far as I know, Hive can read only the current (most recent) data version in HBase. Only when using HBase API or hbase shell you can also read all versions, or only the ones in a specific time interval.
From the Hive HBase integration document: there is currently no way to access the HBase timestamp attribute, and queries always access data with the latest timestamp.
Created 05-05-2016 05:58 AM
Thanks for replying.
if not hive can impala read all versions in HBase?
Created 05-05-2016 07:58 AM
No, Impala cannot read all versions either. Impala defines tables in HBase using Hive DDL because Impala doesn't support custom SerDe's to define tables, and as we saw Hive doesn't expose HBase timestamps (more detail about Impala and HBase here). So, if you want to access timestamps you have to make them "first class citizens" and include them in your Hbase table key, or among values, if you can ensure unique keys by other means.
Created 05-09-2016 06:19 AM
thank you 🙂
Created 05-09-2016 07:17 AM
Okay, no problems, please consider to accept the answer, to help us manage answered questions. Tnx!
Created 05-04-2016 09:26 AM
currently there is no way available to read all the versions data.