Recently, we have set CDH, CDSW Soulutions.
We want to use both Datalake on hadoop and Oracle DB we had on CDSW
So. I have no problem using hadoop.
But I don't know how to use Oracle DB on CDSW.
Can you guide me using Oracle DB on CDSW with Python?
I have questions bellow:
- Can I use without installing oracle client?
- To use oracle DB with CDSW, do I need to set up custom docker image installed oracle client first?
- Can you share a Custom Docker image configured with an Oracle client?
@johnwook You don't have to install any external database for CDSW to interact with your Hadoop cluster. As CDSW will interact with Hadoop using Gateway nodes and those will take care of this.
NOTE: The Cloudera Data Science Workbench uses a PostgreSQL database that runs within a container on the master host at /var/lib/cdsw/current/postgres-data. So you can not use any custom database with CDSW. You have to use this shipped with CDSW.
You want want to see the CDSW architecture to understand how CDSW works in Hadoop.