Created on 05-02-2020 01:11 PM - edited on 04-21-2026 04:29 AM by GrazittiAPI
Hey Everyone,
In this video, I'm going to show you how to run Data Engineering workloads on Cloudera Data Hub. First, we'll deploy a Data Hub cluster with Zeppelin and Spark. Then, I'll show you an example of a PySpark job accessing data on S3. After that, we'll run another PySpark job to access data in Hive.