
Using an Oracle DB source in Data Science Workbench


Hello

We recently set up CDH and CDSW.

We want to use both our data lake on Hadoop and our existing Oracle DB from CDSW.

I have no problem using Hadoop.

But I don't know how to use Oracle DB on CDSW.

 

Can you guide me on using Oracle DB on CDSW with Python?

I have the questions below:

- Can I use it without installing the Oracle client?

- To use Oracle DB with CDSW, do I need to set up a custom Docker image with the Oracle client installed first?

- Can you share a custom Docker image configured with an Oracle client?

 

Thanks

 

 

1 REPLY

@johnwook You don't have to install any external database for CDSW to interact with your Hadoop cluster. CDSW interacts with Hadoop through gateway nodes, which take care of this.

 

NOTE: Cloudera Data Science Workbench uses a PostgreSQL database that runs within a container on the master host, with its data at /var/lib/cdsw/current/postgres-data. You cannot substitute a custom database for it; you have to use the one shipped with CDSW.

 

You may want to look at the CDSW architecture to understand how CDSW works with Hadoop.

https://docs.cloudera.com/documentation/data-science-workbench/1-6-x/topics/cdsw_overview.html#cdsw_...
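
On the original question about reading from Oracle in a Python session (as opposed to CDSW's own internal database), here is a minimal sketch. It assumes the python-oracledb package (the successor to cx_Oracle) can be installed in the session, for example with pip; its default Thin mode connects directly without Oracle Client libraries. This has not been verified on CDSW, and network access from the engine to the Oracle host is also assumed. The host, port, service name, credentials, and query are placeholders, and build_dsn and fetch_rows are hypothetical helper names.

```python
def build_dsn(host, port, service_name):
    """Build an Easy Connect string of the form host:port/service_name."""
    return f"{host}:{port}/{service_name}"


def fetch_rows(user, password, dsn, query):
    """Run a query and return all rows as a list of tuples."""
    # Imported here so the helpers load even if the package is not installed yet.
    import oracledb  # third-party: pip install oracledb

    # Thin mode is the default, so no Oracle Client install is needed here.
    with oracledb.connect(user=user, password=password, dsn=dsn) as connection:
        with connection.cursor() as cursor:
            cursor.execute(query)
            return cursor.fetchall()


# Example call (placeholder values, not from the original post):
# dsn = build_dsn("dbhost.example.com", 1521, "ORCLPDB1")
# rows = fetch_rows("scott", "secret", dsn, "SELECT 1 FROM dual")
```

If a feature you need is only available in the driver's Thick mode, Oracle Client libraries would have to be present in the engine, which is where a custom engine image could come in.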


Cheers!