Code Repositories

Find and share code repositories
Repo Description

This repo contains Spark code that will bulkload data from Spark into HBase (via Phoenix). I've also included Spark code (SparkPhoenixSave.scala) to Save a DataFrame directly to HBase, via Phoenix. Similarly, there is code (SparkPhoenixLoad.scala) that'll load data from HBase, via Phoenix, into a Spark dataframe.

The goal is to use Apache Phoenix to speed up bulkloading time, compared to HBase Bulkloading (as I've demonstrated in this repo: https://github.com/zaratsian/SparkHBaseExample

Repo Info
Github Repo URL https://github.com/zaratsian/SparkPhoenix
Github account name zaratsian
Repo name SparkPhoenix
3,109 Views
Take a Tour of the Community
Don't have an account?
Your experience may be limited. Sign in to explore more.
Version history
Last update:
‎10-25-2016 05:28 PM
Updated by:
Contributors
Top Kudoed Authors