Cloudera Employee

Hey Everyone,

In this video, I'm going to show you how to run Data Engineering workloads on Cloudera Data Hub. First, we'll deploy a Data Hub cluster with Zeppelin and Spark. Then, I'll show you an example of a PySpark job accessing data on S3. After that, we'll run another PySpark job to access data in Hive.
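For readers who want a rough idea of what the two PySpark jobs look like before watching, here is a minimal sketch. It is not the code from the video: the bucket, object key, and Hive table name are hypothetical placeholders passed in by the caller, the `s3a_uri` and `run` helpers are made up for illustration, and the input file is assumed to be a CSV. It also assumes the cluster already has permission to read the bucket. In a Zeppelin notebook a `spark` session is normally provided for you, so building one is only needed in a standalone script.

```python
def s3a_uri(bucket: str, key: str) -> str:
    """Build an s3a:// URI, the scheme used by Hadoop's S3A connector."""
    return f"s3a://{bucket.strip('/')}/{key.lstrip('/')}"


def run(bucket: str, key: str, hive_table: str) -> None:
    """Read a CSV from S3, then query a Hive table (all names are placeholders)."""
    # Imported here so the helper above stays usable without Spark installed.
    from pyspark.sql import SparkSession

    spark = (
        SparkSession.builder
        .appName("datahub-example")
        .enableHiveSupport()  # lets spark.sql() see tables in the Hive metastore
        .getOrCreate()
    )

    # Job 1: read a CSV file stored on S3 (assumed format) and preview it.
    s3_df = spark.read.csv(s3a_uri(bucket, key), header=True, inferSchema=True)
    s3_df.show(5)

    # Job 2: query a Hive table and preview the result.
    hive_df = spark.sql(f"SELECT * FROM {hive_table} LIMIT 5")
    hive_df.show()

    spark.stop()
```

To try it, call `run(...)` with your own bucket, key, and table name from a session on the cluster.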

More Information

More information on Data Hub
https://www.cloudera.com/products/data-hub.html
Documentation on CDP
https://docs.cloudera.com/