Support Questions
Find answers, ask questions, and share your expertise
Announcements
Alert: Welcome to the Unified Cloudera Community. Former HCC members be sure to read and learn how to activate your account here.

Using oracle db source in Data Science Workbench

Using oracle db source in Data Science Workbench

New Contributor

Hello

Recently, we have set CDH, CDSW Soulutions.

We want to use both Datalake on hadoop and Oracle DB we had on CDSW

So. I have no problem using hadoop.

But I don't know how to use Oracle DB on CDSW.

 

Can you guide me using Oracle DB on CDSW with Python?

I have questions bellow:

- Can I use without installing oracle client?

- To use oracle DB with CDSW, do I need to set up custom docker image installed oracle client  first?

- Can you share a Custom Docker image configured with an Oracle client?

 

Thanks

 

 

1 REPLY 1
Highlighted

Re: Using oracle db source in Data Science Workbench

Expert Contributor

@johnwook You don't have to install any external database for CDSW to interact with your Hadoop cluster. As CDSW will interact with Hadoop using Gateway nodes and those will take care of this. 

 

NOTE: The Cloudera Data Science Workbench uses a PostgreSQL database that runs within a container on the master host at /var/lib/cdsw/current/postgres-data. So you can not use any custom database with CDSW. You have to use this shipped with CDSW. 

 

You want want to see the CDSW architecture to understand how CDSW works in Hadoop. 

https://docs.cloudera.com/documentation/data-science-workbench/1-6-x/topics/cdsw_overview.html#cdsw_...

Don't have an account?
Coming from Hortonworks? Activate your account here