Created 07-03-2018 03:17 PM
Hi Community,
I'm running a basic spark job which reads from an HBase table.
I can see the job is getting complete without any error, but in output I get the empty rows.
Will appreciate any help.
Below is my code
object objectName { def catalog = s"""{ |"table":{"namespace":"namespaceName", "name":"tableName"}, |"rowkey":"rowKeyAttribute", |"columns":{ |"Key":{"cf":"rowkey", "col":"rowKeyAttribute", "type":"string"}, |"col1":{"cf":"cfName", "col":"col1", "type":"bigint"}, |"col2":{"cf":"cfName", "col":"col2", "type":"string"} |} |}""".stripMargin def main(args: Array[String]) { val spark = SparkSession.builder() .appName("dummyApplication") .getOrCreate() val sc = spark.sparkContext val sqlContext = spark.sqlContext import sqlContext.implicits._ def withCatalog(cat: String): DataFrame = { sqlContext .read .options(Map(HBaseTableCatalog.tableCatalog -> cat)) .format("org.apache.spark.sql.execution.datasources.hbase") .load() } }
Created 07-03-2018 05:35 PM
Did you check out the docs?
Did you look at this other HCC post on a similar topic:
https://community.hortonworks.com/questions/49743/read-hbase-table-by-using-sparkscala.html
Created 07-03-2018 05:35 PM
Did you check out the docs?
Did you look at this other HCC post on a similar topic:
https://community.hortonworks.com/questions/49743/read-hbase-table-by-using-sparkscala.html