Archives of Support Questions (Read Only)

Gayathridevi · ‎10-11-2017

Shu_ashu · ‎10-11-2017

You can use spark sql to get data from Hive table and create a dataframe.

There is another best way to get data from HBase table, in this method we are going to construct HbaseRDD from scratch and this is more scalable,better fit for spark catalyst engine

You can refer to the below links how to get data directly from HBase without using Hive table.

https://hortonworks.com/blog/spark-hbase-connector-a-year-in-review/

https://hortonworks.com/blog/spark-hbase-dataframe-based-hbase-connector/

https://github.com/hortonworks-spark/shc

View solution in original post

Shu_ashu · ‎10-11-2017

Hi @Gayathri Devi,