This is originally an old post, and replies are also old and inconclusive. I hope I may resurrect this thread. Subsequent to the date of this original posting, it appears CDSW will support GPU when the environment is contoured via this guidance: https://www.cloudera.com/documentation/data-science-workbench/latest/topics/cdsw_gpu.html We are selecting a configuration for at least one server that will hold GPU card(s), and will be a Worker Node within our CDSW deployment. We typically use Dell servers and are looking at an R940-series server since it can hold 4GPUs locally + lots of RAM & CPU. Regarding specific GPU compatibility constraints, looking at the nVidia page here: http://us.download.nvidia.com/XFree86/Linux-x86_64/390.25/README/supportedchips.html It looks like Tesla cards are compatible, and are also selectable on a Dell R940 server configuration, however there are many other nVidia cards that are also compatible with the CDSW-compatible driver, but are not selectable on a Dell R940 server configuration. Non-Tesla cards are a fraction of the cost of the Tesla versions. It seems the main difference between "cheap" and "expensive" nVidia cards is "GPU Passthru" necessary for supporting virtualized environments, grid computing, etc. I assume this is why they are supported by default within large commodity enterprise servers. So, CDSW executes processes within a Docker, and this is a form of virtualization whose abstraction details I am not fully versed in; the resulting question is, "does a docker image qualify as a virtualized environment requiring GPU passthru?", or from CDSW's perspective, since an nVidia "Titan V" is compatible with the driver specified in the Cloudera document (link above), then is "GPU Passthru" not required, and CDSW would support a cheaper "Titan V" card, as long as it would work on the selected server? If so, then it would drive potentially another discussion on if Dell can support a non-Tesla nVidia card on an R940 server (which is obviously a question for Dell). To the community: Does this seem like a train of thought that anyone out here has pursued before, and can you share any insight? All will be appreciated.
... View more
Hello, we see some samples related to using PYODBC in Python within CDSW at this at bottom (https://www.cloudera.com/documentation/data-science-workbench/latest/topics/cdsw_import_data.html) I tried this and it failed even after I corrected the URL pointing to the library on google. I am not knowledgeable enough to debug further. And we have received some guidance here related to unixODBC. (https://github.com/mkleehammer/pyodbc/wiki/Connecting-to-SQL-Server-from-RHEL-or-Centos) We are told we must first install this into the CDSW edge node environment. We could do that, but have not yet, because I am confused by existence of two options. Has anybody succeeded in using either of these ODBC drivers for reach back into an MS SQL database instance to pull data directly via SQL queries into CDSW? If so, which did you use, and would you be willing to share your verbose steps/experience? Thank you!
... View more