Cloudera Employee

Hey Everyone,

In this video, I'm going to show you how to run Data Engineering workloads on Cloudera Data Hub. First, we'll deploy a Data Hub cluster with Zeppelin and Spark. Then, I'll show you an example of a PySpark job accessing data on S3. After that, we'll run another PySpark job to access data in Hive.
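The code from the video is not reproduced in this post, so here is a minimal sketch of what the two PySpark jobs might look like. The bucket, object key, database, and table names are hypothetical placeholders, as are the helper functions `s3a_path`, `hive_query`, and `run_jobs`. It assumes the cluster already has credentials to read the bucket and that the S3 data is a CSV file with a header row.

```python
def s3a_path(bucket, key):
    """Build an s3a:// URI, the scheme used by Hadoop's S3A connector."""
    return f"s3a://{bucket}/{key.lstrip('/')}"


def hive_query(database, table, limit=10):
    """Build a simple preview query against a Hive table."""
    return f"SELECT * FROM {database}.{table} LIMIT {limit}"


def run_jobs(bucket, key, database, table):
    """Read a CSV file from S3, then query a Hive table."""
    # Imported here so the helpers above can be used without Spark installed.
    from pyspark.sql import SparkSession

    # In a Zeppelin %pyspark paragraph a session named `spark` usually
    # already exists; getOrCreate() reuses it rather than starting a new one.
    spark = (
        SparkSession.builder
        .appName("datahub-example")   # hypothetical application name
        .enableHiveSupport()          # required for Hive table access
        .getOrCreate()
    )

    # Job 1: access data on S3 (assumes CSV with a header row).
    s3_df = spark.read.csv(s3a_path(bucket, key), header=True, inferSchema=True)
    s3_df.printSchema()
    s3_df.show(5)

    # Job 2: access data in Hive through Spark SQL.
    hive_df = spark.sql(hive_query(database, table))
    hive_df.show()


# Example call with placeholder names (replace with your own):
# run_jobs("my-bucket", "data/sample.csv", "default", "my_table")
```

The session is deliberately not stopped at the end, since stopping it inside a shared notebook interpreter would end the session for later paragraphs as well.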

More Information

More information on Data Hub: https://www.cloudera.com/products/data-hub.html
Documentation on CDP: https://docs.cloudera.com/