Created 09-27-2016 05:05 PM
I don't see the levels of detail in hbase reference guide whether cell version can be accessed via hbase snapshot. Use case is I'd like to access an older version of cell via hive over hbase snapshot or via spark, is that information retained in snapshot and do I need HBase to access it or this info is retained in hfile and I can access it with anything other than hbase?
Created 09-29-2016 07:00 PM
@Artem Ervits, @Randy Gelhausen
Based on your discussion, I added a few additional lines to the github repo mentioned above. The code will now return older versions of a cell. The maximum number of snapshot versions to fetch is specified in the props file (here).
Created 09-27-2016 05:07 PM
See @Dan Zaratsian's examples reading cell versions and timestamps from a snapshot here.
Created 09-27-2016 05:19 PM
@Randy Gelhausen I don't see any reference to HBase cell versions here
Created 09-27-2016 05:22 PM
He filters all cells by timestamp here. If I understand correctly, without the filter, the DataFrame would expose all cell versions that exist in the snapshot.
Created 09-27-2016 05:24 PM
I'll give it a whirl and let you know, until then you get +1
Created 09-27-2016 05:45 PM
@Randy Gelhausen confirmed with @Devaraj Das snapshot captures all info including version information for completeness, here's an example http://stackoverflow.com/questions/28334940/how-to-get-all-versions-of-an-hbase-cell-in-a-spark-newa... just need to apply to snapshot rather than table
Created 09-29-2016 07:00 PM
@Artem Ervits, @Randy Gelhausen
Based on your discussion, I added a few additional lines to the github repo mentioned above. The code will now return older versions of a cell. The maximum number of snapshot versions to fetch is specified in the props file (here).