Code Repositories

Find and share code repositories
Announcements
Welcome to the upgraded Community! Read this blog to see What’s New!
avatar
Guru
Repo Description

Demonstrate how SparkSQL can act as a distributed data federation platform.

Tables are created from three different sources:

  • Data that being processed by Spark
  • Data in Hive
  • Data in Postgres

Spark makes all of these tables available via JDBC as if from as single data store and speeds up processing by caching tables in memory.

Repo Info
Github Repo URL https://github.com/vakshorton/SparkSQLDataFederationDemo.git
Github account name vakshorton
Repo name SparkSQLDataFederationDemo.git
9,080 Views