Created on 03-04-202506:15 PM - edited on 03-10-202511:19 PM by VidyaSargur
CREATING A CDE REPOSITORY WITH A PRIVATE GIT REPOSITORY
OBJECTIVE
In this article, you will learn how to create a CDE Repository by cloning a private Git repository using CDE Credentials.
Cloudera Data Engineering (CDE) is a cloud-based service that helps businesses build, manage, and scale data pipelines. It is based on Apache Spark and integrates with other Cloudera services.
Git repositories allow teams to collaborate, manage project artifacts, and promote applications from lower to higher environments. CDE Repositories can be used to clone git repositories into a Virtual Cluster to make the imported files available to CDE Jobs as reusable dependencies. CDE currently supports Git providers such as GitHub, GitLab, and Bitbucket.
CDE Credentials allow you to store secrets such as usernames, passwords, and tokens, in secure, reusable objects associated with CDE Jobs, Custom Runtimes and Repositories.
REQUIREMENTS
This example was built with CDE 1.23. To reproduce the commands in your environment, you need a CDE Service, Virtual Cluster, Private Git repository, and a local installation of the CDE CLI.
CREATE PRIVATE GITHUB REPOSITORY
Log in to your GitHub account and create a private GitHub repository. Save the repository's URL, for example:
Cloudera Data Engineering (CDE) provides a command line interface (CLI) client. You can use the CLI to create and update jobs, view job details, manage job resources, run jobs, and so on. In this article, you learned how to use the CLI to create reusable CDE credentials for integrating with GitHub, and generated a CDE Repository to import application code from a private GitHub repository.