Created on 04-12-202107:44 AM - edited on 04-13-202111:29 PM by subratadas
Introduction
This post explains the process of submitting Spark jobs on CDP Public Cloud Data Hub to the Livy server and provides a sample wrapper script for job submission. To understand the Livy Spark submit process, please refer to this post and understand how to arrive at the job configuration.
The following steps submit the Spark Pi job from the spark_examples*.jar on S3.
Steps
Get the Livy endpoint for Data hub from the CDP control plane
Ensure that you have the workload password set for the environment
Download the Python wrapper script using the following command:
The job should now be submitted to the Data Hub cluster. The wrapper script can also be used on the CDP Private Cloud Base where Livy is configured with Livy.