Created on 09-12-2017 05:12 AM - edited 09-16-2022 05:13 AM
Hi,
1. Can anybody suggest, why rowkey design is so important for create / retrieval of records?
2. Also, how cloudera search (SOLR) can be useful in such cases?
3. Is it mandatory to create secondary indices for retrieval?
Regards,
Deepak Sharma
Created on 09-12-2017 05:39 AM - edited 09-12-2017 05:43 AM
Hi,
1/ These articles describe some usefull information about rowkey and its design :
- http://archive.cloudera.com/cdh5/cdh/5/hbase-0.98.6-cdh5.3.8/book/rowkey.design.html
- https://www.linkedin.com/pulse/performance-tuning-hbase-part-1-rowkey-crux-kuldeep-deshpande/
2/ If you need to query your data in HBase by a cells value .. it will be totaly inefficient. Cloudera search can help you in these cases but you will need to index the data into Cloudera search.
3/ Well, I don't have the answer to that but I would recommend you to stick to querying by the "rowkey only" if you need some "performance".
Hope this helps.
Created 09-12-2017 11:52 PM
Thanks Mathieu
Created 09-12-2017 10:37 AM
Created 09-12-2017 11:56 PM
Hi Nishan,
Does it mean that, I will not have to bother anymore for the Row Key design, if I use solr?
Is there any specific java code, which I need to write inorder to support it ?
I was thinking to use Apache Pheonix, if it does take care of the Row Key design internally.
Any suggestions please ?
Thanks