About peter_ableda

peter_ableda · ‎11-03-2021

At Cloudera, we believe that data can make what is impossible today, possible tomorrow. There are many good uses of data. With data, we can monitor our business, the overall business, or specific business units. We can segment based on the customer verticals or whether they run in the public or private cloud. We can understand customers better, see usage patterns and main consumption drivers. We can find customer pain points, see where they get stuck, and understand how different bugs affect them. With data, we can discover new market opportunities, and review where we stand compared to the global market. We can track feature adoption, see how new features are picked up and what usage/consumption they generate. With data, we can set better goals, know where we are and where we want to go. And in the end, we can make better decisions. At Cloudera, we practice what we preach. As the Cloudera Data Platform (CDP) gains popularity and more and more customers make it a critical piece of their infrastructure, we set out to create the best data platform in the enterprise. Today, we will highlight a new feature that showcases one great example of using data in the service of our customers. I’m excited to share this feature because this is a success story! Late last year, we saw our customers struggling to get CML Workspaces up. The elevated escalation count put a strain on our engineering team, trials were slowed down, and even worse our customers had a very bad experience with our product. We needed to figure this out. We tried to understand “is this a systemic issue?” or “how widespread is this problem?”, and the results were alarming. Customers experienced issues more than half, 57% of the time. There are two phases of CML Workspace creation; first, we create a K8s Cluster via the liftie APIs - this is the ‘Provision’ step; second, we install the CML service. The above chart shows the workspace provisioning results broken down for CML releases between June ‘20 and Jan ‘21. Once we saw the results, we dug in and analyzed dozens of failure modes. We discovered that actual product bugs caused only a small portion of the failures. The most common failures we found were instance types requested in unsupported regions, failures due to conflicts between the admin-provided CIDR address ranges, and environments where CML Workspaces were failing due to an unhealthy DataLake. Okay, we identified the problem: we attempt to create CML Workspaces when we know they will surely fail. Preflight checks to the rescue. Liftie and CML engineers teamed up to solve this problem. They built a framework and released a series of checks over the course of the last few quarters to catch issues early. The results are astonishing. For the most recent - Aug ‘21 - release, customers experienced issues with the workspace creation just 7% of the time. For 39% of the attempts, we caught issues early and showed a meaningful error message, this saved hours and hours of work for support, engineering, and our customers. This was a data-driven project. We used data to qualify the problem, to understand the issues, and to measure our progress and the outcome. The result is a significantly more stable platform and a new framework that all other CDP Data Services will benefit from. Get started with Cloudera Machine Learning in CDP Now, you can start here.

peter_ableda · ‎07-15-2021

Administrators can customize the Cloudera provided ML Runtimes to support Data Scientists’ specific use-cases. They can install additional OS packages, Python and R libraries, third-party drivers to enable connecting to external data stores, or even a new editor to be used. CML now enables registering these custom Runtimes and making them available for Data Scientists to use in their projects. Data Scientists have specific requirements for their working environments, they require a set of R or Python libraries and ready-made connections to fetch from third-party data stores. With the new feature, administrators can create custom Runtimes that data scientists can use in CML. To learn more, visit the documentation about Customized ML Runtimes.

peter_ableda · ‎07-15-2021

Cloudera ML Runtimes are the default and recommended solution for running user workloads. New Projects will be created with ML Runtimes configured by default and we recommend migrating existing Projects to use ML Runtimes. Legacy Engines are deprecated and will be removed in a future release but workloads running on them remain fully supported. To learn more, visit the documentation about Engine deprecation.

peter_ableda · ‎05-20-2021

RAPIDS Runtimes ship a suite of libraries from NVIDIA that bring the power of accelerated GPUs to standard Data Science operations — be it exploratory data analysis, feature engineering, or model training. The RAPIDS libraries are designed as drop-in replacements for common Python data science libraries like cuDF (pandas), cuPy (numpy), cuML (sklearn) and Dask-CUDA (dask) — enabling GPU acceleration for data science workloads of 5X+ without significant code changes. By leveraging the parallel compute capacity of GPUs the time for complicated data engineering and data science tasks can be dramatically reduced, accelerating the timeframes for Data Scientists to take ideas from concept to production. For more information about RAPIDS see rapids.ai. Data Scientists now can use the RAPIDS Runtimes that enable end-to-end data science and analytics pipelines entirely on GPUs. To learn more, visit the documentation about the RAPIDS ML Runtimes in CML.

peter_ableda · ‎04-12-2021

New, lightweight and customizable Cloudera ML Runtimes are now available in CDP Machine Learning and Cloudera Data Science Workbench 1.9. ML Runtimes have been rebuilt from the ground up to enable greater flexibility in frameworks, processing, and IDEs — powering customizable lightweight deployments without over encumbering the runtime profile. The new profiles are designed to meet the diverse needs of Data Scientists by enabling a variety of ML Runtimes natively and eliminating sizing and versatility issues with previous Cloudera Engine profiles. ML Runtimes with version 2020-02 offer support for Python 3.6, 3.7, and 3.8 with both the Workbench editor and with JupyterLab (GA). All of these runtimes are offered as NVIDIA GPU edition with out-of-the-box GPU acceleration. New R 3.6 and 4.0 Runtimes are also available with the Workbench editor. To learn more about our vision and roadmap read the Announcement blog post. As always, we welcome your feedback. Please send your comments and suggestions on our community forums.

peter_ableda · ‎12-16-2020

New, lightweight and customizable Cloudera ML Runtimes are now generally available in CDP Machine Learning and Cloudera Data Science Workbench 1.9. ML Runtimes have been rebuilt from the ground up to enable greater flexibility in frameworks, processing, and IDEs — powering customizable lightweight deployments without over encumbering the runtime profile. The new profiles are designed to meet the diverse needs of Data Scientists by enabling a variety of ML Runtimes natively and eliminating sizing and versatility issues with previous Cloudera Engine profiles. This initial release ships with Python 3.6, 3.7, and 3.8 Runtimes as well as both the Workbench editor and with JupyterLab (Tech Preview). New Runtime options will soon follow! To learn more about our vision and roadmap read the Announcement blog post. Also in these releases: CDP Machine Learning: Support scaling down to zero CPUs or GPUs on Azure Refreshed Data Science Project dashboard experience CDSW 1.9: Availability of Shared Data Experience (SDX for models — enabling model governance and model cataloging on CDP Private Base. Refreshed Data Science Project dashboard experience LDAP Group Sync enables creating teams that synchronize with an LDAP group for easier user management. Applications can now be configured for public, unauthenticated access. CDSW is now Certified with SLES12 SP5 and CentOS/RHEL 7.8 As always, we welcome your feedback. Please send your comments and suggestions on our community forums.

peter_ableda · ‎11-03-2020

Model Monitoring & Deployment Security is now GA in CDSW 1.8. Available in this release is native MLOps functionalities for model monitoring, enabling tracking of individual model predictions down to the feature level for calculating model drift and ground truthing to production environments. Data scientists can analyze metrics using their preferred libraries and IDEs in any language, ensuring models are performing optimally and compliantly at scale. Additionally, this release includes general availability of Resource Quotas and quota management. These features enable administrators to limit users’ aggregate CPU, memory, and GPU consumption to protect against over-usage resulting in critical compute resource shortages. Default quotas can be configured for a Workspace, and overridden on a per-user basis with Custom quotas. Also in this release: Ability to use custom command-line arguments for jobs. Improved security for model deployments allowing user-level access controls to prevent unauthorized access of endpoints. Read the release notes for the full list of smaller improvements and bug fixes. Links: Download it here Upgrade with the Cloudera Manager Read the Production ML in CDSW blog CDSW Overview Getting started with CDSW As always, we welcome your feedback. Please send your comments and suggestions on our community forums.

peter_ableda · ‎11-26-2018

We have a collaboration page in the documentation: https://www.cloudera.com/documentation/data-science-workbench/latest/topics/cdsw_collaborate.html We also have a page about Kerberos authentication: https://www.cloudera.com/documentation/data-science-workbench/latest/topics/cdsw_kerberos.html I hope this answers your question. Regards, Peter

peter_ableda · ‎11-23-2018

Yes.

peter_ableda · ‎11-23-2018

The original issue you reported was an UnknownHostException on the clouderamaster. hdfs dfs -put data/sample_text_file.txt /tmp clouderamaster.<domain>.com -put: java.net.UnknownHostException: You need to make sure that this host can be resolved (both forward/reverse) from inside a CDSW session via DNS. As you can start a CDSW session and interact with it, you already configured the DNS entry for the CDSW master properly. Regards, Peter

Online	Offline
Last Visited	‎10-31-2024 01:02 PM

Member Since	‎07-09-2015 12:12 AM
Last Visited	‎10-31-2024 01:02 PM
Posts	70
Kudos received	28

Cloudera Community

Re: Connecting to hdfs/creating the sparksession f...

Re: Configuration step to use GPU fails under prox...

Re: Cannot launch workers in R script

Re: cdsw spark context issue

Re: split GPU for multiple users

Cloudera Machine Learning Workspace Provisioning P...

Cloudera Machine Learning now enables administrato...

Cloudera Machine Learning Runtimes are replacing L...

RAPIDS ML Runtimes are now available for Open GPU ...

[ANNOUNCE] New Cloudera Machine Learning Runtimes ...

[ANNOUNCE] Cloudera Machine Learning Runtimes are ...

[ANNOUNCE] Cloudera Data Science Workbench 1.8.1 R...

Re: Connecting to hdfs/creating the sparksession f...

Re: Connecting to hdfs/creating the sparksession f...

Re: Connecting to hdfs/creating the sparksession f...