🤗 Hugging Face Spaces AMPs Accelerate ML Projects

aakulov · 03-06-2024

Earlier this year, Cloudera Machine Learning (CML) added a new way to accelerate GenAI projects: tapping into Hugging Face Spaces and deploying these projects right inside CML with just a few clicks. With over 6,500 spaces as of this writing, the Hugging Face community is still growing rapidly and provides a convenient platform for practitioners and organizations to share their work in areas from classical machine learning to the latest GenAI research. In this article you will learn how to enable and use this feature to accelerate your own ML projects.

The default Hugging Face Spaces AMP catalog is enabled for all CML Public Cloud workspaces starting from version 2.0.43-b208. Allowing users to launch external Hugging Face Spaces requires additional steps (see the Enable Deployment of External HF Spaces section at the end of this article).

Steps to Deploy Hugging Face Space AMP

Let's dive right in and see how simple it is to deploy a Hugging Face AMP:

1. Click on AMPs in the left sidebar of your ML Workspace. If you don't see this option, your administrator has not enabled AMPs.
2. Click on the Hugging Face tab to narrow the view down to HF AMPs only.
3. On the "Can you run it? LLM version" card, click Deploy.
4. Read through the details of the AMP and the disclosure message. You can also navigate to the HF Space's official GitHub if you wish.
5. Click Configure & Deploy.
6. This particular HF Space answers the question of whether a given LLM can run on a particular hardware spec. On the next screen, note the environment variables that can be passed down to the project. You can leave these at their default values here.
7. Leave the rest of the settings unchanged and click Launch Project.
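
The environment variables mentioned in the steps above reach the project's code like any other process environment. As a minimal sketch, assuming two hypothetical variables (EXAMPLE_MODEL_NAME and EXAMPLE_MAX_TOKENS are invented for illustration and are not the variables this particular Space defines), an application might read them with defaults like this:

```python
import os


def load_settings(environ=None):
    """Read app settings from environment variables, falling back to defaults.

    The variable names below are hypothetical; a real Space defines its own.
    """
    env = os.environ if environ is None else environ
    return {
        # Plain string setting with a default value
        "model_name": env.get("EXAMPLE_MODEL_NAME", "default-model"),
        # Environment values are always strings, so convert numbers explicitly
        "max_tokens": int(env.get("EXAMPLE_MAX_TOKENS", "256")),
    }


if __name__ == "__main__":
    print(load_settings())
```

Leaving the values at their defaults in the Configure & Deploy screen corresponds to the fallback branch here: the code runs with whatever defaults the project author chose.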

At this point CML kicks off the steps required to launch this Hugging Face Space, namely installing dependencies and launching an application. After the steps are completed, the AMP will be fully deployed.

If you click on Applications in the left sidebar, you will see a Gradio app deployed. Clicking on the app's card (Application to serve UI) takes you to the app's UI, which opens in a new tab of your web browser. It will look like this:

What happened in the background?

Applied ML Prototypes (AMPs) are packaged projects that include execution steps that CML can understand and perform. The owner of a project defines .project-metadata.yaml in their project repository to tell CML which steps to perform (run code, schedule a job, deploy a model, etc.). In the case of Hugging Face Spaces, this metadata is injected on the fly by CML as the project is being spun up. The two steps executed for Hugging Face Spaces AMPs are the following:

1. Install the dependencies that a given HF Space requires
2. Deploy an Application (Gradio or Streamlit) if one is present in the HF Space
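
To make the two steps above more concrete, here is a rough sketch of what the injected metadata could look like when expressed as a Python dictionary. This is an illustration under assumptions, not the metadata CML actually generates: the task type names loosely follow the AMP task types and should be verified against the AMP specification, and the script name install_dependencies.py is hypothetical.

```python
import json


def build_space_metadata(space_name, app_script="app.py", has_app=True):
    """Return a dict shaped like AMP metadata for a Hugging Face Space.

    Field names and values are assumptions for illustration only.
    """
    tasks = [
        {
            # Step 1: install whatever the Space lists as its requirements
            "type": "run_session",
            "name": "Install dependencies",
            "script": "install_dependencies.py",  # hypothetical script name
        }
    ]
    if has_app:
        tasks.append(
            {
                # Step 2: serve the Space's UI as a CML Application
                "type": "start_application",
                "name": "Application to serve UI",
                "script": app_script,
            }
        )
    return {"name": space_name, "tasks": tasks}


if __name__ == "__main__":
    print(json.dumps(build_space_metadata("example-space"), indent=2))
```

The ordering of the list matters in this sketch: dependencies are installed before the application is started, mirroring the order of the two steps described above.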

Once a Hugging Face AMP is launched in CML, users can treat it as any other local project, reviewing the code, making changes, breaking things and learning as they go. The goal is to accelerate innovation in the enterprise and adjust open projects to meet the requirements of specific customer use cases.

Enable Deployment of External HF Spaces

While Hugging Face Spaces AMPs is in Tech Preview, a setting must be enabled in the ML Workspace to make deployment of external Spaces available to users. For this you will need the MLAdmin role in the workspace, or you can work with your workspace administrator through the following steps:

1. Inside the ML Workspace, navigate to Site Administration.
2. Go to the Settings tab.
3. In the Feature Flags section, check the box next to Allow users to deploy external Hugging Face Space.

This setting takes effect immediately.

Once this setting is enabled, users can not only deploy Hugging Face Spaces AMPs from the existing catalog but also point to any Hugging Face Space and start working with it as a project within CML. In Tech Preview, this supports Gradio and Streamlit applications only.
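
Before pointing CML at an external Space, it can help to check which SDK the Space uses. Hugging Face Spaces declare this in the YAML block at the top of their README.md (the sdk field). The sketch below is a simplified, standard-library-only check written for this article; it is not how CML itself validates a Space, and the helper names are invented for illustration.

```python
SUPPORTED_SDKS = {"gradio", "streamlit"}


def read_front_matter(readme_text):
    """Parse top-level 'key: value' pairs from a README's leading --- block.

    This is a deliberately simple parser, not a full YAML implementation.
    """
    lines = readme_text.splitlines()
    if not lines or lines[0].strip() != "---":
        return {}
    fields = {}
    for line in lines[1:]:
        if line.strip() == "---":
            break  # end of the front matter block
        key, sep, value = line.partition(":")
        # Skip lines without a colon and indented (nested) entries
        if sep and not line.startswith((" ", "\t")):
            fields[key.strip()] = value.strip().strip("'\"")
    return fields


def is_supported_space(readme_text):
    """Return True if the Space declares a Gradio or Streamlit SDK."""
    sdk = read_front_matter(readme_text).get("sdk", "").lower()
    return sdk in SUPPORTED_SDKS


if __name__ == "__main__":
    sample = "---\ntitle: Demo\nsdk: gradio\napp_file: app.py\n---\n# Demo\n"
    print(is_supported_space(sample))
```

A Space whose README declares a different SDK, or no front matter at all, is reported as unsupported by this check.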

Iterate Faster with CML

At Cloudera we strive to give customers options, from deploying on-prem or in the cloud to using external or internally hosted Large Language Models. The introduction of Hugging Face Spaces integration in CML is intended to significantly accelerate customers' Machine Learning projects, especially those focused on Generative AI.

steven-matison · 04-30-2024

Excellent work here Oleksandr!!
