I have a huge Hive table, which works fine so far. Now I want to play around with HBase, so I'm looking for a way to load my Hive table data into a (new) HBase table. I already found some solutions for that, but I'm not sure which one is best. By the way, I'm familiar with Spark, so working with RDDs / Datasets is not a problem.
I'm using the Hortonworks Data Platform 2.6.5.
Are there other interesting ways to bulk load Hive data into HBase? Which of the approaches above is the most "common" one?
Thank you for your help!
AFAIK, generating HFiles is the most efficient way to bulk load data into HBase, so either the 3rd or the 4th approach looks good. Personally, I would prefer the 3rd approach (Hive-HBase integration), as it is completely native and simple, and it avoids writing any code.
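For illustration, a minimal sketch of what the Hive-HBase integration route might look like. The table names (`source_events`, `hbase_events`, `events`), the column family `cf`, and the columns `id` and `payload` are all made up for this example; only the storage handler class and the `hbase.columns.mapping` / `hbase.table.name` properties come from the Hive HBase integration itself.

```sql
-- Hypothetical names throughout: adjust tables, columns and column family to your schema.
-- Hive-managed table backed by HBase; the first mapped column (:key) becomes the HBase row key.
CREATE TABLE hbase_events (rowkey STRING, payload STRING)
STORED BY 'org.apache.hadoop.hive.hbase.HBaseStorageHandler'
WITH SERDEPROPERTIES ('hbase.columns.mapping' = ':key,cf:payload')
TBLPROPERTIES ('hbase.table.name' = 'events');

-- Copy the existing Hive data into the HBase-backed table.
INSERT OVERWRITE TABLE hbase_events
SELECT CAST(id AS STRING), payload
FROM source_events;
```

Note that, as far as I know, a plain insert like this writes through the regular HBase write path rather than producing HFiles. If you specifically want HFiles, check the Hive documentation for your version for the HFile-generating options before relying on this for a very large table.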