# Oryx ALS Collaborative filtering essentials

06-05-2016 02:54 AM

Hi,

I need help understanding the basics of Oryx ALS collaborative filtering.

What exactly does "features" mean? For example, each user has 10 features and each item also has 10 features, some of them positive and some negative numbers. At first I thought they represented the user-item matrix, but I have more than 10 users and more than 10 items.

When ingesting data or learning a new model, the *weight* is set to 1 by default. Imagine I have various ratings of movies, where 1 represents a bad rating and 5 a great rating. What weight should I set? I have tried to understand the source code, and it seemed to me that the weight should be in the interval (0,1).

Thank you for your help!

06-05-2016 07:20 AM

"Features" refers to the number of latent features in the factored matrix model. If the user-item matrix A is factored as A ~= X Y', then the number of features f is the number of columns of X and of Y.
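To make the shapes concrete, here is a minimal sketch in plain Python. It is not Oryx code: the sizes, the random factor values, and the helper names `make_factors` and `predict` are all made up for illustration. It only shows that X has one row of f values per user and Y has one row of f values per item, so every user and item gets 10 features no matter how many users and items exist, and that an entry of A is estimated as the dot product of a user row and an item row.

```python
import random

def make_factors(n_rows, n_features, seed):
    """Return an n_rows x n_features matrix of random values (illustrative only,
    a trained ALS model would learn these values from the data)."""
    rng = random.Random(seed)
    return [[rng.uniform(-1.0, 1.0) for _ in range(n_features)]
            for _ in range(n_rows)]

def predict(X, Y, user, item):
    """Estimate A[user][item] under A ~= X Y' as a dot product of feature rows."""
    return sum(x * y for x, y in zip(X[user], Y[item]))

# Hypothetical sizes: many users and items, but only 10 features each.
n_users, n_items, n_features = 50, 200, 10
X = make_factors(n_users, n_features, seed=1)  # one row of 10 features per user
Y = make_factors(n_items, n_features, seed=2)  # one row of 10 features per item

print(len(X), len(X[0]))  # 50 10
print(len(Y), len(Y[0]))  # 200 10
print(predict(X, Y, user=3, item=42))  # estimated strength for one user-item pair
```

The individual feature values can be positive or negative, as in the question; only the dot products are meant to approximate entries of A.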

Weights are not ratings, and they are not restricted to the interval (0,1); a weight can be any value. One approach is to treat any interaction at all as a weight of 1. Alternatively, you might treat bad ratings as negative weights and good ratings as positive weights.
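As a rough sketch of those two options for 1-5 movie ratings: the function names and the choice of 3 as the neutral point below are assumptions for illustration, not something Oryx prescribes, and the sample data is made up.

```python
def implicit_weight(rating):
    """Option 1: treat any interaction at all as a weight of 1,
    ignoring the rating value."""
    return 1.0

def signed_weight(rating, neutral=3.0):
    """Option 2: center the rating on an assumed neutral point, so bad
    ratings become negative weights and good ratings positive ones."""
    return float(rating) - neutral

# Made-up (user, movie, rating) triples.
ratings = [("u1", "m1", 1), ("u1", "m2", 5), ("u2", "m1", 3)]

for user, movie, rating in ratings:
    print(user, movie, implicit_weight(rating), signed_weight(rating))
```

With a neutral point of 3, a rating of 1 maps to -2.0 and a rating of 5 maps to 2.0. Which scheme works better depends on the data, so it is worth evaluating both.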

06-07-2016 12:50 AM - edited 06-07-2016 12:50 AM

Hi,

Thank you for the quick reply.

I am quite new to the recommendation domain. What exactly does "latent features" mean?
