Code Repositories
Find and share code repositories
Announcements
Alert: Welcome to the Unified Cloudera Community. Former HCC members be sure to read and learn how to activate your account here.
Guru
Repo Description

Demonstrate how SparkSQL can act as a distributed data federation platform.

Tables are created from three different sources:

  • Data that being processed by Spark
  • Data in Hive
  • Data in Postgres

Spark makes all of these tables available via JDBC as if from as single data store and speeds up processing by caching tables in memory.

Repo Info
Github Repo URL https://github.com/vakshorton/SparkSQLDataFederationDemo.git
Github account name vakshorton
Repo name SparkSQLDataFederationDemo.git
7,310 Views
Don't have an account?
Coming from Hortonworks? Activate your account here
Version history
Revision #:
1 of 1
Last update:
‎04-27-2016 06:57 PM
Updated by:
 
Contributors
Top Kudoed Authors