I am new to Hbase and trying to get an understanding of how data is stored in Hbase compared to data warehouse which I know. Can someone please provide the right design or data structure for the below design in data warehouse ..
Please suggest a best Hbase design which can address the following:
Use HBase exclusively for data that requires, dominantly, random access to individual records or small ranges of sequentially related records.
For everything else, including tables you primarily require building reports regularly on, you're better off using Kudu+Impala with appropriate partitioning. Checkout https://kudu.apache.org/docs/schema_design.html for a good reference on this.
Queries that perform large scans (hundreds of thousands of rows+ for ex.) including full table scans are not the type of workload HBase is designed for.