Cloudera Community

Cloudera Employee

Hey Everyone,

In this video, I'm going to show you how to run Data Engineering workloads on Cloudera Data Hub. First, we'll deploy a Data Hub cluster with Zeppelin and Spark. Then, I'll show you an example of a PySpark job accessing data on S3. After that, we'll run another PySpark job to access data in Hive.
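For readers who prefer text to video, here is a minimal sketch of what the two PySpark jobs might look like. It is not the code from the video: the bucket name, object key, database, and table name are all hypothetical placeholders. It also assumes the cluster's IDBroker mappings already grant your user access to the bucket, and that the Hive table is one Spark SQL can read directly (managed tables on CDP may need the Hive Warehouse Connector instead).

```python
def s3a_uri(bucket: str, key: str) -> str:
    """Build an s3a:// URI, the scheme used by Hadoop's S3A connector."""
    return f"s3a://{bucket}/{key.lstrip('/')}"


def read_csv_from_s3(spark, bucket: str, key: str):
    """Read a CSV object from S3 into a DataFrame (header row, inferred types)."""
    return spark.read.csv(s3a_uri(bucket, key), header=True, inferSchema=True)


def read_hive_table(spark, database: str, table: str):
    """Query a Hive table through Spark SQL."""
    return spark.sql(f"SELECT * FROM {database}.{table}")


def main():
    # Imported here so the helpers above can be used without PySpark installed.
    from pyspark.sql import SparkSession

    spark = (
        SparkSession.builder
        .appName("datahub-example")
        .enableHiveSupport()  # lets spark.sql() see tables in the Hive metastore
        .getOrCreate()
    )

    # Hypothetical bucket and key: replace with your own.
    read_csv_from_s3(spark, "my-example-bucket", "data/sample.csv").show(5)

    # Hypothetical database and table: replace with your own.
    read_hive_table(spark, "default", "sample_table").show(5)

    spark.stop()
```

In a Zeppelin `%pyspark` paragraph a `spark` session is typically already provided, so you could call `read_csv_from_s3(spark, ...)` and `read_hive_table(spark, ...)` directly. For `spark-submit`, add a call to `main()` at the end of the script.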

More Information

More information on Data Hub: https://www.cloudera.com/products/data-hub.html
Documentation on CDP: https://docs.cloudera.com/
