Created 12-13-2017 12:15 PM
Why Hbase database integrated in hadoop and what is the advantage ?
Created 12-15-2017 09:43 AM
the main question is what makes you think you 'need to use' Hbase? If you want MongoDb or cassandra, what stops you using it?
Created 12-22-2017 05:24 AM
Following this are considered Hbase in hadoop
1) Its only nosql Hbase database support/Integrated with HDFS/Hadoop file system for storing its data.
2) We can scale-up the cluster with Horizontal Scaling by adding more Machines, for storage more data and performance improvement without bringing down the cluster/services.
3) It support batch MAP/REDUCE task with help of HDFS file system.
4) Its real-time query capabilities with a key/value store(column family) and even it allow query for individual record as well.
Kindly add if any other feature need to included.
Created 12-22-2017 07:45 AM
OK, but what do you want to get answered with your post? You don't have to use HBase, you can use Cassandra or MongoDB if you want. With both you are able to build a scalable cluster, and integrate it into other jobs running on your hadoop platform. 'real time' I guess means low latency? With key-values stores I guess you really mean without a strict DB structure (like SQL DBs)?
Just keep in mind, that Cassandra is a BigTable DB, as HBase, but MongoDB is a document DB, storing JSONs. So MongoDB implements a different concept.
Created 12-22-2017 12:49 PM
thanks for your comments, it made me to rephrase my questions. 🙂
Created 12-15-2017 02:13 PM
HBase runs on your existing HDFS infrastructure is faster, more secure and more scalable then those other NoSQL choices. You also get Phoenix for SQL ontop of that.
HBase is the best choice especially if you have an existing Hadoop cluster.
Created 12-15-2017 07:46 PM
Not sure I ever thought I'd see "traditional" and "NoSQL" in the same sentence. 😉
Seriously, @Timothy Spann is correct that HBase & Phoenix are part of HDP (thus easily stood-up and managed if you run HDP) and available for your use AND @Harald Berghoff is also correct that you don't "need to use" HBase -- if your solution is solved best with another (notice I didn't say "traditional" -- wasn't HBase here before MongoDB anyways; hehe! I ~think~ they started in 2006 and 2007, respectively) NoSQL database then by all means leverage it.
Good luck and happy Hadooping (or whatever Big Data-y framework you are using)!!
Created 12-22-2017 03:37 AM
why only Hbase is we need to use in Hadoop that is my concern !
Created 12-22-2017 12:45 PM
thanks for pointing the question, corrected it 🙂
Created 12-23-2017 03:14 AM
Following this are considered Hbase in hadoop
1) Its only nosql Hbase database support/Integrated with HDFS/Hadoop file system for storing its data.
2) We can scale-up the cluster with Horizontal Scaling by adding more Machines, for storage more data and performance improvement without bringing down the cluster/services.
3) It support batch MAP/REDUCE task with help of HDFS file system.
4) Its real-time query capabilities with a key/value store(column family) and even it allow query for individual record as well.
5) Real time data Replication using Kafka and Hbase in Hadoop.
Kindly add if any other feature need to included.