Code Repositories
Find and share code repositories
Repo Description

As part of this flow, we will ingest data files, that are copied to the landing zone on a gateway server, and then process them at a regular interval automatically using Falcon. When the workflow begins, the files are ingested, stored, transformed and the transformed data is sqooped out of cluster into a MySQL database.

Once the data is processed, the hive processing lineage will be available in Apache Atlas.

Repo Info
Github Repo URL https://github.com/sainib/hadoop-data-pipeline
Github account name sainib
Repo name hadoop-data-pipeline
1,439 Views
Comments
Contributor

Is there any example with Kerberos enabled cluster?

Don't have an account?
Version history
Last update:
‎12-04-2015 06:34 PM
Updated by:
Contributors