Community Articles

Find and share helpful community-sourced technical articles.
Announcements
Now Live: Explore expert insights and technical deep dives on the new Cloudera Community BlogsRead the Announcement
Labels (1)
avatar
Rising Star

CDP has recently been introduced to the market, and we all want to learn about it, so I decided to contribute a bit.

 

In this series of tutorials, I want to explain the basics of CDP Data Hub and provide some ease of automation. Indeed, the CDP control plane user interface provides a thorough workflow that automates the creation of main elements that ensures security, governance and scalability.

 

Screen Shot 2019-11-14 at 11.00.19 AM.png

 

As depicted above, these main elements are:

  • CDP Environment: Cloud hosted (in your cloud) resources for CDP deployment
  • CDP Data Lake: Host of CDP SDX, the shared service layer providing all security and governance
  • CDP Data Hub cluster(s): where users run their workloads

Note: CDP offers a lot more than these basic elements, and a lot more ways to configure them; this is a 101 tutorial and will not address all that CDP has to offer. For more information, visit the CDP documentation or product page.

 

Instead, this tutorial series will teach you:

  1. How to create a CDP environment in AWS with minimal requirements
  2. How to create a datalake from an existing environment
  3. How to launch a CDP Data Hub cluster via CLI

 

Happy scripting!

5,109 Views
Version history
Last update:
‎11-15-2019 01:50 PM
Updated by: