Created on 02-09-202109:32 PM - edited on 02-09-202109:34 PM by subratadas
Cloudera Machine Learning provides a number of methods of connecting to other CDP services and experiences such as a Cloudera Data Warehouse. In this post, we will connect using Python and the Impyla library, as well as using the embedded Cloudera Data Visualization.
Using Impyla
Within Cloudera Machine Learning, create a new project and set the language to Python 3.6. The connection details are available from the Data Warehouse console by copying the JDBC connection details which will look like.
Note: The PASS variable is an Environment variable set in the Project settings under the Advanced tab. This does not protect your password but will mitigate the risk of it being copied into a version control service.
Log out as your default user and log back into Cloudera Data Visualization using the local admin user account. Note: You can raise a support request if you don't have access to this.
Add a new connection under Basic settings using the following parameters.
Connection Name: Name your Connection
Hostname or IP Address: Use the hostname from the JDBC string
Port #: Use the SSL port of 443
Username: CDP Workload Username
Password: CDP Workload Password
Under Advanced Settings, set the following parameters.