Collaborative Filtering Techniques Explained

Collaborative filtering is a technique used in recommendation systems to predict a user's preferences based on the preferences of similar users. Collaborative filtering algorithms can predict a user-item rating or determine the top-k items or users. Neighborhood-based collaborative filtering forms neighborhoods of similar users or items to make recommendations. User-based collaborative filtering uses ratings from similar users while item-based collaborative filtering uses ratings for similar items.


UNIT – 3

Collaborative filtering is a technique used in recommendation systems to make predictions about an individual’s preferences based on the preferences of similar users. The idea behind this method is that people who had similar preferences in the past are likely to have similar preferences in the future.

Collaborative filtering algorithms, including neighborhood-based collaborative filtering algorithms, can be formulated in one of two ways:
1. Predicting the rating value of a user-item combination: This is the simplest and most primitive formulation of a recommender system. In this case, the missing rating r_uj of user u for item j is predicted.
2. Determining the top-k items or top-k users: In most practical settings, the merchant is not necessarily looking for specific rating values of user-item combinations. Rather, it is more interesting to learn the top-k most relevant items for a particular user, or the top-k most relevant users for a particular item. The problem of determining the top-k items is more common than that of finding the top-k users, because the former formulation is used to present lists of recommended items to users in Web-centric scenarios. In traditional recommender algorithms, the “top-k problem” almost always refers to the process of finding the top-k items, rather than the top-k users.

However, the latter formulation is also useful to the merchant because it can be
used to determine the best users to target with marketing efforts.
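Once a model has produced predicted ratings, the top-k items formulation reduces to picking the k highest-scoring items. A minimal sketch (the `predicted` scores below are made-up illustrative values, not from the text):

```python
# Hypothetical predicted ratings for one user's unseen items (illustrative values).
predicted = {"item_a": 4.2, "item_b": 2.9, "item_c": 4.8, "item_d": 3.5}

def top_k(scores, k):
    """Return the k highest-scoring item ids, best first."""
    return sorted(scores, key=scores.get, reverse=True)[:k]

print(top_k(predicted, 2))  # ['item_c', 'item_a']
```

The same idea applies to the top-k users formulation, with users in place of items.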
Collaborative filtering algorithms use data on the interactions of a large number of users with a particular item, such as their ratings. Collaborative filtering uses two main approaches:

1. User-based collaborative filtering

2. Item-based collaborative filtering

Neighborhood-based Collaborative Filtering

Collaborative Filtering (CF) methods collect preferences in the form of ratings or signals from many users (hence the name) and then recommend items to a user based on the item interactions of people with tastes similar to that user’s past behavior. In other words, these methods assume that if person X likes a subset of the items that person Y likes, then X is more likely to share Y’s opinion on a given item than a random person who may or may not have the same preferences.
The main idea with neighborhood-based methods is to leverage either user-user
similarity or item-item similarity to make recommendations. These methods
assume that similar users tend to have similar behaviors when rating items. We can extend this assumption to items: similar items tend to receive similar ratings from the same user.
In these methods, the interactions between users and items are generally
represented by a user-item matrix, where each row represents a user and each
column represents an item, while the cells represent the interaction between the
two, which, in most cases, are the item ratings made by users. In this context, we
can define two types of neighborhood-based methods:
 User-based Collaborative Filtering: Ratings given by users similar to a user U are used to make recommendations. More specifically, to predict U's rating for a given item I, we calculate the weighted average of the ratings r of the k users (neighbors) most similar to U, where the weights are determined by the similarity between U and each of those users.

 Item-based Collaborative Filtering: Ratings of a group of similar items are used to make recommendations. Similarly, to predict the rating a user U gives to an item I, we calculate the weighted average of U's ratings r of the k items (neighbors) most similar to I, where the weights are determined by the similarity between I and each of those items.
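Both definitions share the same core computation, a similarity-weighted average of the neighbors' ratings. A minimal sketch (function and variable names are my own):

```python
def weighted_average(neighbor_ratings, similarities):
    """Similarity-weighted average of the k neighbors' ratings.

    For user-based CF the neighbors are similar users and the ratings are
    their ratings of the target item; for item-based CF the neighbors are
    similar items and the ratings are the target user's ratings of them.
    """
    numerator = sum(s * r for s, r in zip(similarities, neighbor_ratings))
    denominator = sum(abs(s) for s in similarities)
    return numerator / denominator if denominator else 0.0

# Two neighbors rated the target 4 and 5, with similarities 0.9 and 0.3:
print(weighted_average([4, 5], [0.9, 0.3]))  # 4.25
```

Note the denominator uses absolute similarities so that negative correlations do not inflate the prediction.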
Comparison between User-based and Item-based Methods
The difference is subtle, but user-based collaborative
filtering predicts a user’s rating by using the ratings of neighboring users, while
item-based collaborative filtering leverages the user's ratings on neighboring items,
which allows for more consistent predictions because it follows the rating
behaviors of that user. In the former case, the similarity is calculated between the
rows of the user-item matrix, while the latter looks at similarities between the
columns of the matrix.
These approaches also differ in the problems they are suited to solve. It is common to use the item neighborhood to recommend a list of top-k items to a user. On the other hand, the user neighborhood is useful for retrieving the top-k users from a segment to target them in marketing campaigns.
To understand the reasoning behind a recommendation, item-based methods
provide better explanations than user-based methods. This is because item-based
recommendations can use the item neighborhood to explain the results in the form
of “you bought this, so these are the recommended items”. The item neighborhood
can also be useful for suggesting product bundles to maximize sales. On the other
hand, user-based methods’ recommendations usually cannot be explained directly
because neighbor users are anonymized for privacy reasons.
Additionally, item-based methods may only recommend items very similar to what
the user already liked, whereas user-based methods often recommend a more
diverse set of items. This can encourage users to try new items and potentially keep
their engagement and interest.
Another significant difference between these approaches is related to ratings.
Calculating the similarity between users to predict ratings may be misleading
because users may rate items in a different manner. When you present a range of
values to the user, he/she might interpret them differently. For instance, in a 5-star
rating system, a user may rate an item as 3 because it does what it is expected to do
and nothing more, while others might use 3 to rate an item that barely works. Some
users rate items highly and others rate items less favorably. To address this issue, the ratings should be mean-centered by user: each neighbor's mean rating is subtracted from their raw ratings, and the target user's mean rating is added back to the prediction.
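The mean-centering adjustment can be sketched as follows (names are my own; this is the plain weighted average corrected by each rater's mean):

```python
def predict_mean_centered(target_mean, neighbor_means, neighbor_ratings, sims):
    """Subtract each neighbor's mean from its rating, take the
    similarity-weighted average of those offsets, then add back the
    target's own mean rating."""
    offsets = sum(s * (r - m) for s, r, m in zip(sims, neighbor_ratings, neighbor_means))
    norm = sum(abs(s) for s in sims)
    return target_mean + (offsets / norm if norm else 0.0)

# A neighbor with mean 4.0 rated the item 5.0 (one point above their norm),
# so a target with mean 3.0 is predicted one point above its own mean:
print(predict_mean_centered(3.0, [4.0], [5.0], [1.0]))  # 4.0
```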
Neighborhood Models in Practice
Let’s say we have the following small sample of a user-item matrix, where items
are from a digital commerce store. Notice there are missing ratings, which means
users typically do not rate all products.

To show how the algorithm works in practice, let’s assume we have built an item-
based model. Note that the steps of the algorithm would be analogous to the user-
based model, except for the perspective changes and focus on similarities between
rows (users).
Remember that neighborhood CF algorithms rely on the ratings and similarity
between items/users, so the first step is to define which similarity metric to use.
One of the most common choices is the Pearson similarity, which measures how
correlated a pair of vectors are. The range of values scales from -1 to 1, where
those values indicate negative and positive correlations, respectively, and 0
indicates no correlation between the vectors. For item-based models, the Pearson similarity between items i and j is computed over the set U_ij of users who rated both items:

sim(i, j) = Σ_{u ∈ U_ij} (r_ui − μ_i)(r_uj − μ_j) / ( √(Σ_{u ∈ U_ij} (r_ui − μ_i)²) * √(Σ_{u ∈ U_ij} (r_uj − μ_j)²) )

where r_ui is user u's rating of item i and μ_i is item i's mean rating over the co-rated set.

During this first phase, it’s usual to precompute the similarity matrix beforehand to
obtain a good performance during inference time. In the case of item-based
models, an item-item similarity matrix is built by applying the similarity metric
between all pairs of items. Since the matrix is sparse, we only consider the set of
mutually rated pairs of items during the similarity computation. For instance, the
similarity between items from columns 1 and 4 of the image above will be
computed as the similarity between vectors [4,3,5] and [5,3,4]. It’s possible that a
pair of items may show no co-ratings by users due to the sparsity of the matrix,
resulting in an empty set. In that case, a value of 0 similarity is assigned for that
pair. To improve computational efficiency, it is common to consider only the k
nearest neighbors of an item during inference time.
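This first phase can be sketched as follows, assuming the convention that `None` marks a missing rating: the similarity is computed over only the co-rated positions, and 0 is returned when there are none.

```python
import math

def pearson_co_rated(item_x, item_y):
    """Pearson similarity between two item rating columns; None marks a
    missing rating, and only mutually rated positions are used."""
    pairs = [(a, b) for a, b in zip(item_x, item_y) if a is not None and b is not None]
    if not pairs:
        return 0.0  # no co-ratings: assign 0 similarity, as described above
    mean_x = sum(a for a, _ in pairs) / len(pairs)
    mean_y = sum(b for _, b in pairs) / len(pairs)
    num = sum((a - mean_x) * (b - mean_y) for a, b in pairs)
    den = (math.sqrt(sum((a - mean_x) ** 2 for a, _ in pairs))
           * math.sqrt(sum((b - mean_y) ** 2 for _, b in pairs)))
    return num / den if den else 0.0

# The columns-1-and-4 example from the text: co-rated vectors [4,3,5] and [5,3,4].
print(round(pearson_co_rated([4, 3, None, 5], [5, 3, None, 4]), 2))  # 0.5
```

Precomputing this for all item pairs yields the item-item similarity matrix used at inference time.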
Let’s say we want to predict how Madison rated the Animal Farm book and we defined k=2 as the number of nearest neighbors to consider during the calculation. To simplify the example, we will only manually calculate the similarities between the target item and the items from columns 2 and 4, because they are the nearest neighbors for this item. When calculating the mean rating during the similarity computation, we will consider only the set of ratings that the two items have in common.
The image below shows how the neighborhood is formed. The circle in red is the value we’re trying to predict. The squares in green are ratings from Madison that are going to be used to infer the rating for the target item. The other two ratings marked with an X are not considered because k=2. The rectangles in orange show the set of common ratings between the target item and the item from column 2, while the rectangles in blue show the same for the common ratings between the target item and the item from column 4.
These are the common ratings between the target item (item 3) and the first neighbor (item 2): [4,3,3] and [4,4,3]. The first step is to calculate the mean of each set:

Target item mean = (4 + 3 + 3) / 3 = 3.33
Item 2 mean = (4 + 4 + 3) / 3 = 3.67

The Pearson similarity formula centers the ratings by their mean, so we transform each vector and then plug the results into the equation:

Target item mean-centered vector = [(4 − 3.33), (3 − 3.33), (3 − 3.33)] = [0.67, −0.33, −0.33]
Item 2 mean-centered vector = [(4 − 3.67), (4 − 3.67), (3 − 3.67)] = [0.33, 0.33, −0.67]

To simplify the calculations, we separate the numerator and denominator:

Numerator = (0.67 * 0.33) + (−0.33 * 0.33) + (−0.33 * −0.67) = 0.33
Denominator = √(0.67² + 0.33² + 0.33²) * √(0.33² + 0.33² + 0.67²) = 0.82 * 0.82 = 0.67

Then finally the similarity between items 3 and 2 = 0.33 / 0.67 = 0.5

The same calculation is done for the similarity between items 3 and 4, whose common ratings are [4,3] and [5,4]:

Target item mean = (4 + 3) / 2 = 3.5
Item 4 mean = (5 + 4) / 2 = 4.5

Target item mean-centered vector = [(4 − 3.5), (3 − 3.5)] = [0.5, −0.5]
Item 4 mean-centered vector = [(5 − 4.5), (4 − 4.5)] = [0.5, −0.5]

Numerator = (0.5 * 0.5) + (−0.5 * −0.5) = 0.5
Denominator = √(0.5² + 0.5²) * √(0.5² + 0.5²) = 0.5

Similarity between items 3 and 4 = 0.5 / 0.5 = 1

Next, we calculate the mean for each neighbor item, considering all of the item’s ratings:

Item 2 mean = (4 + 4 + 1 + 2 + 3) / 5 = 2.8
Item 4 mean = (5 + 3 + 4) / 3 = 4

Then, we plug these values, together with Madison’s ratings for items 2 and 4 (1 and 3, respectively), into the mean-centered prediction equation:

Rating(Madison, Item 3) = 3.33 + [0.5 * (1 − 2.8) + 1 * (3 − 4)] / (0.5 + 1) = 2.07
Since ratings are discrete numbers, we round this value to 2. It’s important to note
that in a real-world setting, it’s often recommended to use neighborhood methods
only when k is above a certain threshold because, when the number of neighbors is
small, the predictions are usually not precise. An alternative would be to use
Content-based filtering when we do not have enough data about the user-item
relationship.
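The worked example above can be verified with a short script (the numbers are exactly the ones derived in the text):

```python
# Predict Madison's rating for item 3 from its two nearest neighbors (items 2 and 4).
target_mean = (4 + 3 + 3) / 3   # item 3's mean over its known ratings, ~3.33
sims = [0.5, 1.0]               # sim(item 3, item 2) and sim(item 3, item 4)
madison_ratings = [1, 3]        # Madison's ratings for items 2 and 4
neighbor_means = [2.8, 4.0]     # item 2 and item 4 means over all their ratings

offset = sum(s * (r - m) for s, r, m in zip(sims, madison_ratings, neighbor_means))
prediction = target_mean + offset / sum(sims)

print(round(prediction, 2))  # 2.07
print(round(prediction))     # 2
```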
