Options
- Subscribe to RSS Feed
- Mark as New
- Mark as Read
- Bookmark
- Subscribe
- Printer Friendly Page
- Report Inappropriate Content
Cloudera Employee
Created on
05-02-2020
01:11 PM
- edited on
05-07-2020
11:43 PM
by
VidyaSargur
Hey Everyone,
In this video, I'm going to show you how to run Data Engineering workloads on Cloudera Data Hub. First, we'll deploy a Data Hub cluster with Zeppelin and Spark. Then, I'll show you an example of a PySpark job accessing data on S3. After that, we'll run another PySpark job to access data in Hive.
More Information
- More information on Data Hub
- https://www.cloudera.com/products/data-hub.html
- Documentation on CDP
- https://docs.cloudera.com/