we have hbase in our datalake with hdp-2.5. We bulk load onto it from spark sql jobs and we need end users to query the processed data sitting on hbase via a REST-API.
1.is it best practice to provide end user access via api to the data in hbase?
2. Or we have cassandra or DB2 outside of hadoop cluster purely to provide rest api on top of it for user to access it?
3. Is cassandra faster in terms of random read than Hbase?