Repo Description

Demonstrate how SparkSQL can act as a distributed data federation platform.

Tables are created from three different sources:

  • Data being processed by Spark
  • Data in Hive
  • Data in Postgres

Spark makes all of these tables available via JDBC as if they came from a single data store, and speeds up processing by caching tables in memory.
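
The federation described above could be sketched with a handful of Spark SQL statements. The Python helper below only builds those statements as strings, so it runs without a Spark installation; they would then be submitted through a Spark SQL session or a JDBC client. Every table name, column name, the Postgres URL, and the credentials are hypothetical placeholders and are not taken from the repo. The sketch also assumes that `hive_orders` is already visible through the Hive metastore and that `spark_events` has already been registered by the Spark job.

```python
# Sketch only: all names, the JDBC URL, and the credentials are hypothetical.

def jdbc_view_ddl(view, url, dbtable, user, password):
    """Build Spark SQL DDL that exposes an external JDBC table as a temporary view."""
    return (
        f"CREATE TEMPORARY VIEW {view} "
        f"USING org.apache.spark.sql.jdbc "
        f"OPTIONS (url '{url}', dbtable '{dbtable}', "
        f"user '{user}', password '{password}')"
    )


def build_statements():
    """Return the statements in the order they would be run."""
    return [
        # 1. Map the Postgres table into Spark SQL (assumed connection details).
        jdbc_view_ddl(
            "pg_customers",
            "jdbc:postgresql://pg-host:5432/demo",
            "public.customers",
            "demo_user",
            "demo_password",
        ),
        # 2. Pin the external and Hive tables in memory to speed up later queries.
        "CACHE TABLE pg_customers",
        "CACHE TABLE hive_orders",
        # 3. Query all three sources as if they lived in one data store.
        (
            "SELECT c.name, o.total, e.event_type "
            "FROM pg_customers c "
            "JOIN hive_orders o ON o.customer_id = c.id "
            "JOIN spark_events e ON e.customer_id = c.id"
        ),
    ]


if __name__ == "__main__":
    for statement in build_statements():
        print(statement + ";")
```

The statements are kept as plain SQL so the same text can be reused from any client that speaks JDBC to Spark. Note that `CREATE TEMPORARY VIEW ... USING` is Spark 2.x syntax; older releases used `CREATE TEMPORARY TABLE ... USING` for the same purpose.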

Repo Info
Github Repo URL https://github.com/vakshorton/SparkSQLDataFederationDemo.git
Github account name vakshorton
Repo name SparkSQLDataFederationDemo.git
Version history
Revision #: 1 of 1
Last update: 04-27-2016 06:57 PM