Member since 08-06-2019 | 17 Posts | 0 Kudos Received | 0 Solutions
05-02-2020
01:11 PM
Hey Everyone,
In this video, I'm going to show you how to run Data Engineering workloads on Cloudera Data Hub. First, we'll deploy a Data Hub cluster with Zeppelin and Spark. Then, I'll show you an example of a PySpark job accessing data on S3. After that, we'll run another PySpark job to access data in Hive.
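For a rough idea of what the two jobs look like, here is a minimal PySpark sketch. The bucket, file path, and table name are hypothetical placeholders rather than the ones used in the video, and it assumes the cluster already has S3 access and Hive support configured.

```python
def s3a_path(bucket, key):
    """Build an s3a:// URI, the scheme used by Hadoop's S3A connector."""
    return "s3a://{}/{}".format(bucket, key.lstrip("/"))


def run_jobs(bucket="my-example-bucket", key="data/trips.csv",
             table="default.trips"):
    """Read a CSV file from S3, then query a Hive table (names are made up)."""
    # Imported here so the helper above works without Spark installed.
    from pyspark.sql import SparkSession

    spark = (SparkSession.builder
             .appName("datahub-example")
             .enableHiveSupport()
             .getOrCreate())

    # Job 1: read a CSV file from S3 and show the first rows.
    s3_df = spark.read.csv(s3a_path(bucket, key), header=True, inferSchema=True)
    s3_df.show(5)

    # Job 2: query a Hive table through Spark SQL.
    hive_df = spark.sql("SELECT * FROM {} LIMIT 5".format(table))
    hive_df.show()

    spark.stop()
```

In a Zeppelin notebook a `spark` session is usually already provided, so the builder step can likely be dropped there.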
More Information
More information on Data Hub
https://www.cloudera.com/products/data-hub.html
Documentation on CDP
https://docs.cloudera.com/
04-23-2020
05:33 PM
Hey Everyone,
In this video, I'll show you how to use experiments in Cloudera Machine Learning (CML). Experiments allow a user to run the same script with different inputs easily and compare metrics. This can be useful when performing hyperparameter optimization. In the video, I walk through an extremely simple example where we get the sum of numbers. Then, I run through an example that reflects a more real-world use case.
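The sum-of-numbers example can be sketched roughly like this. It is a minimal sketch, not the exact script from the video: the function names are my own, and it assumes the `cdsw` helper library is importable inside the CML session (outside of one, the metric tracking step is simply skipped).

```python
import sys


def sum_args(args):
    """Return the sum of the numeric command-line arguments."""
    return sum(int(a) for a in args)


def main(argv):
    total = sum_args(argv[1:])
    print("Sum of the numbers is: {}".format(total))
    try:
        # cdsw is assumed to be available only inside a CML/CDSW session.
        import cdsw
        cdsw.track_metric("Sum", total)
    except ImportError:
        pass
    return total


if __name__ == "__main__":
    main(sys.argv)
```

Running the same script as an experiment with different arguments (for example `1 2 3`, then `4 5`) gives one tracked "Sum" metric per run, which is what makes the runs easy to compare.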
More Information
Cloudera Machine Learning - What You Should Know
https://community.cloudera.com/t5/Community-Articles/Cloudera-Machine-Learning-What-You-Should-Know/ta-p/292935
More information on Cloudera Machine Learning (CML)
https://www.cloudera.com/products/machine-learning.html
Documentation on CDP
https://docs.cloudera.com/
04-21-2020
07:04 AM
1 Kudo
Hey Everyone,
This video will show you how to create a Cloudera Machine Learning (CML) Workspace. A Workspace in CML is where all of your data science work takes place, and it lets you collaborate with your team easily. Creating a basic Workspace is quick and simple if you already have an environment set up. I also cover the advanced options available to help admins and users manage costs and stay secure when creating a new Workspace.
More information:
Cloudera Machine Learning – What You Should Know
https://community.cloudera.com/t5/Community-Articles/Cloudera-Machine-Learning-What-You-Should-Know/ta-p/292935
More information on Cloudera Machine Learning (CML)
https://www.cloudera.com/products/machine-learning.html
Documentation on CDP
https://docs.cloudera.com/
04-01-2020
07:52 PM
Hi Everyone,
Cloudera Machine Learning (CML) is just one of the many experiences you can use on the Cloudera Data Platform (CDP). Cloudera Machine Learning uses Kubernetes to let teams immediately deploy machine learning workspaces that auto-scale to fit their needs and auto-suspend to save costs. All of this is packaged into a portable experience that multiple team members can access easily, giving an organization a consistent way of working. In today's video, I'll walk you through the different high-level features in CML on CDP public cloud.
More information:
More information on Cloudera Machine Learning (CML)
https://www.cloudera.com/products/machine-learning.html
Documentation on CDP
https://docs.cloudera.com/
03-31-2020
07:54 PM
Hi Everyone,
In this video, I'll show you how to get started with Cloudera Data Warehouse in CDP public cloud. I'll walk you through activating an environment for use with the Data Warehouse experience, creating a Virtual Warehouse, and then loading some data. After loading the data, I'll show you how to connect your Virtual Warehouse to Tableau. Although we're talking about Tableau specifically here, the same procedure can be replicated with other BI tools.
More information:
More information on Data Warehouse
https://www.cloudera.com/products/data-warehouse.html
Documentation on CDP
https://docs.cloudera.com/
03-17-2020
02:58 PM
Hey Everybody,
Today I've got a follow-up video about Attribute Based Access Control (ABAC) on CDP. ABAC allows you to tag tables and columns and then set access policies for groups or users that either grant or deny access to the tagged data. At Cloudera Now, Jonathan Hsieh showed us how to implement Column Masking with ABAC, so today I'll show you how to do the same thing. After that, I'll show you how security policies travel with the data, all thanks to SDX.
If you haven't seen the first video, watch it here first:
https://community.cloudera.com/t5/Community-Articles/How-to-use-ABAC-in-CDP/ta-p/288056
More information:
Watch Cloudera Now
https://www.cloudera.com/about/events/cloudera-now-cdp.html
Documentation on CDP
https://docs.cloudera.com/
More information on SDX
https://www.cloudera.com/products/sdx.html
03-11-2020
03:09 PM
Cloudera Data Platform (CDP) on Public Cloud makes being an administrator for a big data platform even easier, thanks to SDX. Watch me spend a day in a temp position at Aperture Cybertronics as their Data Admin. I'll quickly deploy clusters, grant access to users, and change performance parameters for Aperture Cybertronics' staff.
More information:
More information on Data Warehouse
https://www.cloudera.com/products/data-warehouse.html
Documentation on CDP
https://docs.cloudera.com/
03-03-2020
08:47 PM
Hi Again,
Today's video is all about Cloudera Data Warehouse (CDW). CDW is one of the many experiences you can use on the Cloudera Data Platform (CDP). Cloudera Data Warehouse packages up projects you may already know and use, such as Impala and Hive, into a service. This service runs on Kubernetes, giving it the ability to pause, resume, and scale up or down quickly and automatically. The Data Warehouse service allows you to quickly deploy new use cases with the right data at the right time to tackle hard problems while meeting your SLAs. It does all of this with significant cost savings and little effort on your part.
In this first video of the Data Warehouse collection, we'll cover the different pieces of the service and the layout of the UI. We'll also show you Data Analytics Studio (DAS) to get you started querying your data warehouse.
More information:
More information on Data Warehouse
https://www.cloudera.com/products/data-warehouse.html
Documentation on CDP
https://docs.cloudera.com/
01-23-2020
05:25 PM
Hey Everyone,
Column Masking and Row Filtering are powerful tools in CDP that admins can use to implement security rules without creating dozens of views that must be maintained over time. With Column Masking, admins can block or limit how much of a column's data specific users or groups are allowed to see. If an admin instead needs to limit the rows a user can see (for example, US users can only see data from the USA), Row Filtering can be used. In this video, I walk through an example where we limit the amount of data call center employees are allowed to see. I'll show you how we set up these policies in Ranger and what effect they have on the user.
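To make the effect of the two policy types concrete, here is a small plain-Python illustration. It does not call Ranger at all; the function names, the sample rows, and the "show last four characters" mask are assumptions chosen for the example, only loosely modeled on what a user would see once such policies are in place.

```python
def mask_show_last_4(value, mask_char="x"):
    """Replace every character except the last four with mask_char."""
    text = str(value)
    if len(text) <= 4:
        return text
    return mask_char * (len(text) - 4) + text[-4:]


def apply_policies(rows, user_country, masked_columns):
    """Return the rows a user would see after row filtering and masking."""
    visible = []
    for row in rows:
        # Row filter: keep only rows from the user's own country.
        if row["country"] != user_country:
            continue
        # Column mask: hide all but the last four characters.
        visible.append({
            col: mask_show_last_4(val) if col in masked_columns else val
            for col, val in row.items()
        })
    return visible


# Hypothetical sample data.
customers = [
    {"name": "Ann", "country": "US", "card": "4111111111111111"},
    {"name": "Bob", "country": "DE", "card": "5500000000000004"},
]
print(apply_policies(customers, "US", {"card"}))
```

The point of doing this with policies instead of views is that the underlying table stays the same for everyone; only what each user gets back changes.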
More information:
Documentation on CDP
https://docs.cloudera.com/
More information on SDX
https://www.cloudera.com/products/sdx.html
01-21-2020
02:47 PM
1 Kudo
Hey everyone,
We have a new video for the SDX collection, and this time we're talking about Attribute Based Access Control (ABAC). Cloudera Data Platform (CDP) allows you to use ABAC on data access across the entire platform. ABAC allows you to tag tables and columns and then set access policies based on those tags to allow specific groups or users to access the tagged data. In this video, I'll show you how to set up ABAC in CDP using Atlas and Ranger. I'll walk through a couple of examples to show you how ABAC can be safer and easier than implementing resource-based policies.
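The idea behind a tag-based decision can be sketched in a few lines of plain Python. This is a simplified illustration, not how Atlas or Ranger are implemented: the tag name, group names, and the rule that an explicit deny wins are assumptions made for the example.

```python
def is_allowed(user_groups, resource_tags, tag_policies):
    """Decide access from the tags on a resource rather than its name."""
    groups = set(user_groups)
    allowed = False
    for tag in resource_tags:
        policy = tag_policies.get(tag)
        if policy is None:
            continue
        if groups & policy.get("deny", set()):
            return False  # an explicit deny wins in this sketch
        if groups & policy.get("allow", set()):
            allowed = True
    return allowed


# Hypothetical policy: one rule covers every column tagged "PII".
tag_policies = {
    "PII": {"allow": {"compliance"}, "deny": {"contractors"}},
}

print(is_allowed({"compliance"}, {"PII"}, tag_policies))
print(is_allowed({"analysts"}, {"PII"}, tag_policies))
```

Because the policy is attached to the tag, tagging a new column covers it immediately, with no new per-table or per-column policy to write.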
More information:
Documentation on CDP
https://docs.cloudera.com/
More information on SDX
https://www.cloudera.com/products/sdx.html