Community Articles

Find and share helpful community-sourced technical articles.
avatar
Cloudera Employee

Hey Everyone,

In this video, I'm going to show you how to run Data Engineering workloads on Cloudera Data Hub. First, we'll deploy a Data Hub cluster with Zeppelin and Spark. Then, I'll show you an example of a PySpark job accessing data on S3. After that, we'll run another PySpark job to access data in Hive.

 

 

More Information

More information on Data Hub
https://www.cloudera.com/products/data-hub.html
Documentation on CDP
https://docs.cloudera.com/
1,010 Views
0 Kudos