There are lot of docs available related to the horizontal scalability but no where I see how to handle concurrent users say for eg handling 50K users request at a time. My requirement is I have millions of records stored in the hbase for multiple users. When multiple users try to access their data (say for around 50k), will hbase able to handle this much amount of concurrent users ? Is there any way to scale up connections (there are few param, but not up to 50 K, i believe), Please suggest a solution
Have a look at the below two parameters
The maximum number of concurrent connections the client will maintain to a single Region. That is, if there is already hbase.client.max.perregion.tasks writes in progress for this region, new puts won't be sent to this region until some writes finishes.
hbase.zookeeper.property.maxClientCnxns - Number of concurrent connections which can be made to a single member of ZK ensemble from a single client and this value should match the value in zoo.cfg. This need to be adjusted taking into consideration the expected number of HBase client connections.
It depends on what 50K users is doing. (if your cluster capacity and configuration is right, you can scale horizontally without any problem)