About EvanH

Harsh J · ‎11-02-2017

You are right that its all just byte sequences to HBase, and that it sorts everything lexicographically. You do not require a separator character when composing your key for HBase to understand them as boundaries (cause it would not serve as one), unless you prefer the extra bytes for better readability or for recovering back the individual data elements from (variable length) keys if that's a use-case. HBase 'sharding' (splitting) can be manually specified at table create time if you are aware of your key pattern and ranges - this is strongly recommended to scale from the beginning. Otherwise, HBase computes key midpoints by analysing them in byte form and splits them based on that, whenever a split size threshold is reached for a region range.

Online	Offline
Last Visited	‎03-16-2018 06:30 PM

Member Since	‎12-07-2015 10:56 AM
Last Visited	‎03-16-2018 06:30 PM
Posts	1

Cloudera Community

Re: HBase: Composite key for ImportTsv

Re: HBase: Composite key for ImportTsv