Community Articles

Find and share helpful community-sourced technical articles.
Announcements
Now Live: Explore expert insights and technical deep dives on the new Cloudera Community BlogsRead the Announcement
avatar
Cloudera Employee

Hey Everyone,

In this video, I'm going to show you how to run Data Engineering workloads on Cloudera Data Hub. First, we'll deploy a Data Hub cluster with Zeppelin and Spark. Then, I'll show you an example of a PySpark job accessing data on S3. After that, we'll run another PySpark job to access data in Hive.

 

 

More Information

More information on Data Hub
https://www.cloudera.com/products/data-hub.html
Documentation on CDP
https://docs.cloudera.com/
1,310 Views
0 Kudos