Repo Description

Demonstrates how SparkSQL can act as a distributed data federation platform.

Tables are created from three different sources:

  • Data being processed by Spark
  • Data in Hive
  • Data in Postgres

Spark makes all of these tables available via JDBC as if they came from a single data store, and speeds up processing by caching the tables in memory.
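The federation pattern described above can be sketched roughly as follows. This is a minimal illustration, not the code from the repository: it assumes the Spark 2.x+ PySpark API (a SparkSession passed in as `spark`, created with Hive support enabled), and every table, view, host, and credential name in it (`spark_events`, `default.hive_customers`, `pg_orders`, and so on) is hypothetical. The Postgres JDBC driver is assumed to be on Spark's classpath.

```python
from typing import Dict


def postgres_jdbc_options(host: str, database: str, table: str,
                          user: str, password: str,
                          port: int = 5432) -> Dict[str, str]:
    """Build the option map for Spark's built-in JDBC data source."""
    return {
        "url": f"jdbc:postgresql://{host}:{port}/{database}",
        "dbtable": table,
        "user": user,
        "password": password,
        "driver": "org.postgresql.Driver",
    }


def register_federated_tables(spark, pg_options: Dict[str, str]) -> None:
    """Register data from Spark, Hive, and Postgres as views in one session.

    `spark` is assumed to be a SparkSession built with enableHiveSupport().
    All table and view names below are hypothetical placeholders.
    """
    # 1. Data being processed by Spark: a small in-memory DataFrame.
    events = spark.createDataFrame(
        [(1, "login"), (2, "purchase")], ["customer_id", "event"])
    events.createOrReplaceTempView("spark_events")

    # 2. Data in Hive: read through the Hive metastore.
    spark.table("default.hive_customers") \
        .createOrReplaceTempView("hive_customers")

    # 3. Data in Postgres: read through the JDBC data source.
    spark.read.format("jdbc").options(**pg_options).load() \
        .createOrReplaceTempView("pg_orders")

    # Cache all three views in memory so repeated queries avoid re-reading
    # the underlying stores.
    for name in ("spark_events", "hive_customers", "pg_orders"):
        spark.catalog.cacheTable(name)


# A single query can then join across all three sources
# (column names are hypothetical).
FEDERATED_QUERY = """
SELECT c.customer_id, e.event, o.order_id
FROM hive_customers c
JOIN spark_events e ON e.customer_id = c.customer_id
JOIN pg_orders o ON o.customer_id = c.customer_id
"""
```

With a session in hand, `register_federated_tables(spark, postgres_jdbc_options(...))` followed by `spark.sql(FEDERATED_QUERY)` would run the join inside Spark. Serving these tables to external JDBC clients is typically done through the Spark Thrift Server, which is not shown here; how the repository wires that up is not stated in this description.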

Repo Info
GitHub Repo URL
GitHub account name vakshorton
Repo name SparkSQLDataFederationDemo.git
Version history
Revision #:
1 of 1
Last update:
‎04-27-2016 06:57 PM