Code Repositories
Find and share code repositories
Alert: Welcome to the Unified Cloudera Community. Former HCC members be sure to read and learn how to activate your account here.
Repo Description

This repo contains Spark code that will bulkload data from Spark into HBase (via Phoenix). I've also included Spark code (SparkPhoenixSave.scala) to Save a DataFrame directly to HBase, via Phoenix. Similarly, there is code (SparkPhoenixLoad.scala) that'll load data from HBase, via Phoenix, into a Spark dataframe.

The goal is to use Apache Phoenix to speed up bulkloading time, compared to HBase Bulkloading (as I've demonstrated in this repo:

Repo Info
Github Repo URL
Github account name zaratsian
Repo name SparkPhoenix
Don't have an account?
Coming from Hortonworks? Activate your account here
Version history
Revision #:
1 of 1
Last update:
‎10-25-2016 05:28 PM
Updated by:
Top Kudoed Authors