Cloudera Employee


Continuing my previous article on creating a CDP AWS environment, this tutorial shows you how to automate the creation of a data lake, including:

  • Setting up the roles and IDBroker mappings associated with your user and environment
  • Creating a data lake
  • Syncing users to FreeIPA


Here is the TL;DR: go to my GitHub and run the scripts as instructed.


Automation scripts

Step 1: Create IAM and launch Data Lake

Create roles and mapping in your existing environment: <base_dir> <prefix> <region> 

Create data lake: <base_dir> <prefix>
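
As a rough illustration of what Step 1 boils down to, here is a minimal Python sketch that only assembles the two CDP CLI calls involved (IDBroker mappings, then data lake creation) without running them. It is not the content of the scripts above. The function names and all argument values are hypothetical, and the subcommands and flags are written as I recall them from the CDP CLI, so check them against `cdp environments set-id-broker-mappings help` and `cdp datalake create-aws-datalake help` before relying on them.

```python
from typing import List


def idbroker_mapping_cmd(env_name: str,
                         data_access_role_arn: str,
                         ranger_audit_role_arn: str) -> List[str]:
    """Assemble the CLI call that sets IDBroker mappings on an environment.

    Assumption: no per-user mappings are needed, hence --set-empty-mappings.
    """
    return [
        "cdp", "environments", "set-id-broker-mappings",
        "--environment-name", env_name,
        "--data-access-role", data_access_role_arn,
        "--ranger-audit-role", ranger_audit_role_arn,
        "--set-empty-mappings",
    ]


def create_datalake_cmd(datalake_name: str,
                        env_name: str,
                        instance_profile_arn: str,
                        storage_location: str) -> List[str]:
    """Assemble the CLI call that launches an AWS data lake.

    Assumption: the cloud provider configuration is passed in the CLI's
    shorthand form, key=value pairs separated by commas.
    """
    cloud_config = (
        f"instanceProfile={instance_profile_arn},"
        f"storageBucketLocation={storage_location}"
    )
    return [
        "cdp", "datalake", "create-aws-datalake",
        "--datalake-name", datalake_name,
        "--environment-name", env_name,
        "--cloud-provider-configuration", cloud_config,
    ]
```

Each returned list can be handed to `subprocess.run(..., check=True)` once the names and ARNs for your own environment are filled in.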

Step 2: Verify periodically until data lake status is RUNNING <prefix>
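
If you would rather not re-run the check by hand, the "verify periodically" step can be sketched as a small polling loop. This is only an illustration under a few assumptions: that `cdp datalake describe-datalake --datalake-name <name>` prints JSON with the status under `datalake.status`, that any status containing `FAILED` is terminal, and that a 60-second interval is reasonable. The function names are hypothetical.

```python
import json
import subprocess
import time
from typing import Callable


def fetch_status(datalake_name: str) -> str:
    """Ask the CDP CLI for the current data lake status.

    Assumption: the JSON output has the shape {"datalake": {"status": ...}}.
    """
    result = subprocess.run(
        ["cdp", "datalake", "describe-datalake",
         "--datalake-name", datalake_name],
        check=True, capture_output=True, text=True,
    )
    return json.loads(result.stdout)["datalake"]["status"]


def wait_until_running(datalake_name: str,
                       fetch: Callable[[str], str] = fetch_status,
                       interval_s: float = 60.0,
                       max_checks: int = 120,
                       sleep: Callable[[float], None] = time.sleep) -> bool:
    """Poll until the status is RUNNING.

    Returns True once RUNNING is seen, False if max_checks is exhausted,
    and raises RuntimeError on a status containing "FAILED" (an assumption
    about how failures are reported).
    """
    for attempt in range(max_checks):
        status = fetch(datalake_name)
        print(f"{datalake_name}: {status}")
        if status == "RUNNING":
            return True
        if "FAILED" in status:
            raise RuntimeError(f"{datalake_name} ended in status {status}")
        if attempt < max_checks - 1:
            sleep(interval_s)  # wait before the next check
    return False
```

The `fetch` and `sleep` parameters exist so the loop can be exercised without a real CDP account, for example by passing a function that returns canned statuses.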

Step 3: Sync FreeIPA users <base_dir> <prefix>
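
For reference, the user sync can also be triggered from Python. The sketch below assumes the CDP CLI exposes `cdp environments sync-all-users --environment-names <env>` and that it prints a JSON document describing the sync operation. The helper names are hypothetical, and the environment name is whatever your environment from the previous article is called.

```python
import json
import subprocess
from typing import Callable, List


def run_cli(argv: List[str]) -> str:
    """Run a CLI command and return its standard output as text."""
    return subprocess.run(
        argv, check=True, capture_output=True, text=True
    ).stdout


def sync_all_users(env_name: str,
                   run: Callable[[List[str]], str] = run_cli) -> dict:
    """Trigger a FreeIPA user sync for one environment.

    Assumption: the CLI prints JSON; it is returned as a dict unchanged.
    """
    argv = [
        "cdp", "environments", "sync-all-users",
        "--environment-names", env_name,
    ]
    return json.loads(run(argv))
```

The sync is asynchronous, so the returned document describes an operation that may still be in progress when the call returns.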


This was a short and sweet tutorial. More fun to come, playing with data lake clusters and experiences!
Last update: 11-14-2019 07:43 AM