Member since
07-09-2015
68
Posts
24
Kudos Received
12
Solutions
My Accepted Solutions
Title | Views | Posted |
---|---|---|
8996 | 11-23-2018 03:38 AM | |
2310 | 10-07-2018 11:44 PM | |
2949 | 09-24-2018 12:09 AM | |
4669 | 09-13-2018 02:27 AM | |
2992 | 09-12-2018 02:27 AM |
07-15-2021
03:34 AM
Cloudera ML Runtimes are the default and recommended solution for running user workloads. New Projects will be created with ML Runtimes configured by default and we recommend migrating existing Projects to use ML Runtimes. Legacy Engines are deprecated and will be removed in a future release but workloads running on them remain fully supported. To learn more, visit the documentation about Engine deprecation.
... View more
Labels:
05-20-2021
06:55 AM
RAPIDS Runtimes ship a suite of libraries from NVIDIA that bring the power of accelerated GPUs to standard Data Science operations — be it exploratory data analysis, feature engineering, or model training. The RAPIDS libraries are designed as drop-in replacements for common Python data science libraries like cuDF (pandas), cuPy (numpy), cuML (sklearn) and Dask-CUDA (dask) — enabling GPU acceleration for data science workloads of 5X+ without significant code changes. By leveraging the parallel compute capacity of GPUs the time for complicated data engineering and data science tasks can be dramatically reduced, accelerating the timeframes for Data Scientists to take ideas from concept to production. For more information about RAPIDS see rapids.ai. Data Scientists now can use the RAPIDS Runtimes that enable end-to-end data science and analytics pipelines entirely on GPUs. To learn more, visit the documentation about the RAPIDS ML Runtimes in CML.
... View more
Labels:
04-12-2021
08:58 AM
New, lightweight and customizable Cloudera ML Runtimes are now available in CDP Machine Learning and Cloudera Data Science Workbench 1.9. ML Runtimes have been rebuilt from the ground up to enable greater flexibility in frameworks, processing, and IDEs — powering customizable lightweight deployments without over encumbering the runtime profile. The new profiles are designed to meet the diverse needs of Data Scientists by enabling a variety of ML Runtimes natively and eliminating sizing and versatility issues with previous Cloudera Engine profiles. ML Runtimes with version 2020-02 offer support for Python 3.6, 3.7, and 3.8 with both the Workbench editor and with JupyterLab (GA). All of these runtimes are offered as NVIDIA GPU edition with out-of-the-box GPU acceleration. New R 3.6 and 4.0 Runtimes are also available with the Workbench editor. To learn more about our vision and roadmap read the Announcement blog post. As always, we welcome your feedback. Please send your comments and suggestions on our community forums.
... View more
12-16-2020
12:37 AM
1 Kudo
New, lightweight and customizable Cloudera ML Runtimes are now generally available in CDP Machine Learning and Cloudera Data Science Workbench 1.9. ML Runtimes have been rebuilt from the ground up to enable greater flexibility in frameworks, processing, and IDEs — powering customizable lightweight deployments without over encumbering the runtime profile. The new profiles are designed to meet the diverse needs of Data Scientists by enabling a variety of ML Runtimes natively and eliminating sizing and versatility issues with previous Cloudera Engine profiles. This initial release ships with Python 3.6, 3.7, and 3.8 Runtimes as well as both the Workbench editor and with JupyterLab (Tech Preview). New Runtime options will soon follow! To learn more about our vision and roadmap read the Announcement blog post. Also in these releases: CDP Machine Learning: Support scaling down to zero CPUs or GPUs on Azure Refreshed Data Science Project dashboard experience CDSW 1.9: Availability of Shared Data Experience (SDX for models — enabling model governance and model cataloging on CDP Private Base. Refreshed Data Science Project dashboard experience LDAP Group Sync enables creating teams that synchronize with an LDAP group for easier user management. Applications can now be configured for public, unauthenticated access. CDSW is now Certified with SLES12 SP5 and CentOS/RHEL 7.8 As always, we welcome your feedback. Please send your comments and suggestions on our community forums.
... View more
11-03-2020
01:15 PM
Model Monitoring & Deployment Security is now GA in CDSW 1.8. Available in this release is native MLOps functionalities for model monitoring, enabling tracking of individual model predictions down to the feature level for calculating model drift and ground truthing to production environments. Data scientists can analyze metrics using their preferred libraries and IDEs in any language, ensuring models are performing optimally and compliantly at scale. Additionally, this release includes general availability of Resource Quotas and quota management. These features enable administrators to limit users’ aggregate CPU, memory, and GPU consumption to protect against over-usage resulting in critical compute resource shortages. Default quotas can be configured for a Workspace, and overridden on a per-user basis with Custom quotas. Also in this release: Ability to use custom command-line arguments for jobs. Improved security for model deployments allowing user-level access controls to prevent unauthorized access of endpoints. Read the release notes for the full list of smaller improvements and bug fixes. Links: Download it here Upgrade with the Cloudera Manager Read the Production ML in CDSW blog CDSW Overview Getting started with CDSW As always, we welcome your feedback. Please send your comments and suggestions on our community forums.
... View more
11-26-2018
04:08 AM
We have a collaboration page in the documentation: https://www.cloudera.com/documentation/data-science-workbench/latest/topics/cdsw_collaborate.html We also have a page about Kerberos authentication: https://www.cloudera.com/documentation/data-science-workbench/latest/topics/cdsw_kerberos.html I hope this answers your question. Regards, Peter
... View more
11-23-2018
03:38 AM
1 Kudo
Yes.
... View more
11-23-2018
03:13 AM
The original issue you reported was an UnknownHostException on the clouderamaster. hdfs dfs -put data/sample_text_file.txt /tmp clouderamaster.<domain>.com -put: java.net.UnknownHostException: You need to make sure that this host can be resolved (both forward/reverse) from inside a CDSW session via DNS. As you can start a CDSW session and interact with it, you already configured the DNS entry for the CDSW master properly. Regards, Peter
... View more
11-20-2018
02:09 AM
Hi, We have an overlay network on top of your CDSW hosts where the pods are getting their IPs from (100.66.x.x). Based on your description it seems that DNS resolution is not working from inside the container while it works on the host. This can happen when multiple nameservers are configured in /etc/resolv.conf but some of them can't resolve your clouderamaster. You could figure out what nameserver can resolve your host and drop the rest of them or make sure that all nameservers can resolve the clouderamaster. I like to use `dig @nameserver clouderamaster.com` command to test these. Regards, Peter
... View more
11-19-2018
03:59 AM
You need to make sure that forward/reverse DNS resolution works from the CDSW terminal to host where you have the YARN ResourceManager and HDFS NameNode services. You referred to this as clouderamaster.<domain>.com before. This issue is not related to the CDSW master DNS resolution, you mentioned that you are using the session terminal, as it works, the CDSW master DNS is configured properly. Regards, Peter
... View more