Support Questions
Find answers, ask questions, and share your expertise

Using oracle db source in Data Science Workbench

Using oracle db source in Data Science Workbench

New Contributor

Hello

Recently, we have set CDH, CDSW Soulutions.

We want to use both Datalake on hadoop and Oracle DB we had on CDSW

So. I have no problem using hadoop.

But I don't know how to use Oracle DB on CDSW.

 

Can you guide me using Oracle DB on CDSW with Python?

I have questions bellow:

- Can I use without installing oracle client?

- To use oracle DB with CDSW, do I need to set up custom docker image installed oracle client  first?

- Can you share a Custom Docker image configured with an Oracle client?

 

Thanks

 

 

1 REPLY 1

Re: Using oracle db source in Data Science Workbench

@johnwook You don't have to install any external database for CDSW to interact with your Hadoop cluster. As CDSW will interact with Hadoop using Gateway nodes and those will take care of this. 

 

NOTE: The Cloudera Data Science Workbench uses a PostgreSQL database that runs within a container on the master host at /var/lib/cdsw/current/postgres-data. So you can not use any custom database with CDSW. You have to use this shipped with CDSW. 

 

You want want to see the CDSW architecture to understand how CDSW works in Hadoop. 

https://docs.cloudera.com/documentation/data-science-workbench/1-6-x/topics/cdsw_overview.html#cdsw_...


Cheers!
Was your question answered? Make sure to mark the answer as the accepted solution.
If you find a reply useful, say thanks by clicking on the thumbs up button.