RecSys PyData2016

This document discusses recommender systems and how to build them using Python. It begins by explaining why recommender systems are useful and provides examples like movie, product, and friend recommendations. It then discusses three main approaches to building recommender systems: popularity-based, classification-based, and collaborative filtering. Collaborative filtering is further broken down into user-based, item-based, and model-based using matrix factorization. The document provides examples and quizzes to help explain each approach.

Uploaded by

Chong Hoo Chuah

Recommender Systems Using Python
Aug 12, 2016
Slides: https://goo.gl/ehBnhf
Notebook: https://github.com/dvysardana/RecommenderSystems_PyData_2016
Slides by Divya
Outline
1. Why Recommender Systems?
2. Examples of Recommender Systems
3. How to build a Recommender System?
   a. Popularity based
   b. Classification based
   c. Collaborative Filtering
      i. Nearest Neighbor
      ii. Matrix Factorization
4. Evaluation of Recommender Systems

1. Why Recommender Systems?
Goal of a Recommender System: Identify products most relevant to the user (e.g. top n offers).

The long tail phenomenon

2. Some Examples?
Movie/TV show Recommendations
Product Recommendations
Friend Recommendations
Job Recommendations
A Naive understanding of Recommender Systems

Users Matching Items

Quiz
What are the users and matching items in the following cases?
a.) LinkedIn (Users: members, Items: jobs)
b.) Facebook (Users: members, Items: members)
c.) Amazon (Users: members, Items: products, e.g., books)
d.) Netflix (Users: members, Items: movies, TV shows)
Power of Recommendations: A Success story

“In 1988, a British mountain climber named Joe Simpson wrote a book called Touching the Void, a harrowing account of near death in the Peruvian Andes. It got good reviews but, only a modest success, it was soon forgotten. Then, a decade later, a strange thing happened. Jon Krakauer wrote Into Thin Air, another book about a mountain-climbing tragedy, which became a publishing sensation. Suddenly, Touching the Void started to sell again.” ... The Long Tail by Chris Anderson

[Figure: two book covers, captioned "Published in 1988" and "Published in 1996"]
3. Building a Recommender System
Solution 0: Popularity based Recommender System

Recommend items viewed/purchased by most people

Recommendations: Ranked list of items by their purchase count
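
The ranking described above can be sketched in a few lines of Python. This is a minimal illustration, not the notebook's code: the purchase log and the function name popularity_recommendations are invented for the example.

```python
from collections import Counter

# Hypothetical purchase log as (user, item) pairs, invented for illustration.
purchases = [
    ("u1", "A"), ("u2", "A"), ("u3", "A"),
    ("u1", "B"), ("u2", "B"),
    ("u3", "C"),
]

def popularity_recommendations(purchases, n=2):
    """Return the n most purchased items, most popular first."""
    counts = Counter(item for _, item in purchases)
    return [item for item, _ in counts.most_common(n)]

top_items = popularity_recommendations(purchases, n=2)
print(top_items)  # ['A', 'B']
```

The function never looks at who is asking, so every user receives the same list.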
Quiz
Which of the following is true of a popularity based recommender system?
Can generate Personalized Recommendations?

Can use Context (Eg. time of day)?

Can use User Features?

Can use Item Features?

Can use Purchase History?

Is it Scalable?
Solution 1: Classification Model
Use features of both products and users to predict whether a user will like a product or not.

Inputs to the Classifier:
User Features (e.g. age, gender)
Product Features (e.g. cost, quality)
Purchase History

Output of the Classifier: Like / Not like

Limitations
1. It is difficult to collect high quality information about products and users.
Quiz
Which of the following is true of a Classification model based recommender system?
Can generate Personalized Recommendations?

Can use Context (Eg. time of day)?

Can use User Features?

Can use Item Features?

Can use Purchase History?

Is it Scalable?
Solution 2: Nearest neighbor Collaborative Filtering

User-based Collaborative Filtering
Find users who have a similar taste in products as the current user.
Similarity is based upon similarity in users’ purchasing behaviour.
“User x is similar to user y because both purchased items A, B and C.”

Item-based Collaborative Filtering
Recommend items that are similar to the items the user bought.
Similarity is based upon co-occurrence of purchases.
“Items A and B were purchased by both users x and y, so they are similar.”

Fig. Source: http://www.salemmarafi.com/code/collaborative-filtering-with-python/
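
A rough sketch of the user-based variant follows. The slide only says that similarity comes from purchasing behaviour; measuring it as set overlap (Jaccard) is an assumption made here, and the purchase histories and function names are invented for illustration.

```python
# Hypothetical purchase histories (user -> set of items), invented for illustration.
history = {
    "x": {"A", "B", "C"},
    "y": {"A", "B", "C", "D"},
    "z": {"E"},
}

def jaccard(a, b):
    """Size of the overlap of two sets divided by the size of their union."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0

def user_based_recommendations(user, history):
    """Suggest items bought by the most similar other user (assumed similarity: Jaccard)."""
    others = [u for u in history if u != user]
    nearest = max(others, key=lambda u: jaccard(history[user], history[u]))
    # Only items the current user has not bought yet are worth recommending.
    return sorted(history[nearest] - history[user])

recs = user_based_recommendations("x", history)
print(recs)
```

Here user y shares A, B and C with user x, so y is the nearest neighbour and y's remaining purchase is what gets recommended to x.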
Item-based Collaborative Filtering: An Example
(People who bought this also bought)
[Figure: History Matrix built from purchases of items A, B and C]

Bob’s Recommendations = [C, B]

Example source: https://www.mapr.com/blog/inside-look-at-components-of-recommendation-engine
Item-based Collaborative Filtering: Effect of popular items

[Figure: the History Matrix over items A, B and C, with a count of 100,000 shown]
Item-based Collaborative Filtering: Normalize co-occurrence matrix

Normalize by Popularity
Jaccard similarity = (Number of users common for i and j) / (Number of users for either i or j)

[Figure: co-occurrence matrix over items A, B and C; values shown include 100,000, 100,002, 3 and 2]
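
As a sketch of the normalization above, the Jaccard similarity of every item pair can be computed directly from the users' purchase sets. The histories and the function names item_users and jaccard_matrix are hypothetical, chosen only for this example.

```python
from itertools import combinations

# Hypothetical purchase histories (user -> set of items), invented for illustration.
# Item A is the "popular" item: every user bought it.
history = {
    "u1": {"A", "B"},
    "u2": {"A", "C"},
    "u3": {"A", "B", "C"},
    "u4": {"A"},
}

def item_users(history):
    """Invert user -> items into item -> set of users who bought it."""
    users = {}
    for user, items in history.items():
        for item in items:
            users.setdefault(item, set()).add(user)
    return users

def jaccard_matrix(history):
    """Jaccard similarity for every pair of distinct items, stored symmetrically."""
    users = item_users(history)
    sim = {}
    for i, j in combinations(sorted(users), 2):
        common = len(users[i] & users[j])   # users common for i and j
        either = len(users[i] | users[j])   # users for either i or j
        sim[(i, j)] = sim[(j, i)] = common / either
    return sim

sim = jaccard_matrix(history)
print(sim[("A", "B")], sim[("B", "C")])
```

Dividing by the union is what keeps a very popular item from looking similar to everything: its large user count inflates the denominator as well as the numerator.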
Item-based Collaborative Filtering: Effect of multiple items

Rows from normalized co-occurrence matrix

                A       B       C       D
A               0       0.33    1       0.5
D               0.25    0.25    0       0.2
Weighted sum    0.125   0.29    0.5     0.35

Weighted sum = (Scores for movie A + Scores for movie D) / 2

Ranked Recommendations: C (0.5), D (0.35), B (0.29), A (0.125)
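
The arithmetic on this slide can be checked with a short script. The two rows are copied from the slide; the function name rank_by_average is an assumption made for this sketch.

```python
# Rows of the normalized co-occurrence matrix, copied from the slide.
items = ["A", "B", "C", "D"]
rows = {
    "A": [0, 0.33, 1, 0.5],
    "D": [0.25, 0.25, 0, 0.2],
}

def rank_by_average(rows, items):
    """Average the score rows column by column, then sort items by score, highest first."""
    n = len(rows)
    averaged = [sum(column) / n for column in zip(*rows.values())]
    return sorted(zip(items, averaged), key=lambda pair: pair[1], reverse=True)

ranked = rank_by_average(rows, items)
for item, score in ranked:
    print(item, round(score, 3))
```

The output order matches the slide's ranking of C, D, B, A. As on the slide, the list still contains A and D themselves.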
Quiz
Given a user x itemRatings matrix of size 480,189 x 17,770, which model will you apply given the matrix is very sparse?
Popularity based recommender system (Maybe)

Classification model based recommender system

Item similarity based recommender system

User similarity based recommender system

All of the above

None of the above

This is the Million Dollar Matrix!!!!$$$$$$!!!!

480,189 users x 17,770 items
~100 million ratings
Only 100 million out of a possible 8.5 billion ratings are non zero.
Very sparse matrix!
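
A quick check of the sparsity figures, using the matrix size from the quiz and the rounded count of 100 million ratings. The variable names are illustrative only.

```python
n_users, n_items = 480_189, 17_770
n_ratings = 100_000_000  # "~100 million ratings", rounded as on the slide

possible = n_users * n_items     # every user rating every item
density = n_ratings / possible   # fraction of cells that hold a rating

print(f"possible ratings: {possible:,}")
print(f"filled: {density:.2%}")
```

The product is roughly 8.5 billion cells, of which only a little over 1% hold a rating.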

Solution 3: Model based Collaborative Filtering (Matrix Factorization)
Identify latent (hidden) features from the input user x itemRatings matrix to represent users and items as vectors in N dimensional space.

[Figure: Netflix Prize diagram (Koren et al., 2009), with axes "Serious / Escapist?" and "Geared towards Males or Females?"]

User Vector (u) = [1.3 2.8]
Item Vector (v) = [2.5 -1.9]

New user (Known ratings): [4 5 ….3]
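
The slide does not say how the two vectors are combined. In Koren et al. (2009) a user's interest in an item is modelled as the dot product of the user vector and the item vector, so the following sketch assumes that. The function name predicted_affinity is hypothetical.

```python
def predicted_affinity(user_vector, item_vector):
    """Dot product of a user vector and an item vector (assumed scoring rule)."""
    return sum(u * v for u, v in zip(user_vector, item_vector))

u = [1.3, 2.8]    # User Vector from the slide
v = [2.5, -1.9]   # Item Vector from the slide

score = predicted_affinity(u, v)
print(round(score, 2))
```

A negative score here simply means the user's and the item's positions on the latent features point in opposing directions; it is a raw affinity, not a rating on a 1 to 5 scale.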

Solution 3: Model based Collaborative Filtering (Matrix Factorization)

Training: Use Matrix factorization approaches (e.g. Singular Value Decomposition, or SVD) to split the Rating Matrix into constituent User Matrix and Item Matrix with minimum Sum of squared error (SSE).

SVD: $A_{n \times p} = U_{n \times n} \, S_{n \times p} \, V^{T}_{p \times p}$

“The winning entry for the famed Netflix Prize had a number of SVD models including SVD++ blended with Restricted Boltzmann Machines. Using these methods they achieved a 10 percent increase in accuracy over Netflix’s existing algorithm.” --Gower 2014

Goal: Predict unknown ratings for the remaining set of movies using the learned User Matrix and Item Matrix

● Refer to Gower 2014 to read more about the Netflix Prize and SVD (Gower, Stephen. "Netflix Prize and SVD." (2014): 1-10.)
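
To make the factorization concrete, here is a minimal sketch using NumPy's numpy.linalg.svd on a small, fully observed matrix. The ratings and the function name low_rank_approximation are invented for illustration. Real rating matrices have missing entries, which plain SVD cannot handle directly, so treat this as a picture of the decomposition rather than a full training procedure.

```python
import numpy as np

# Hypothetical 4 users x 3 items rating matrix, invented for illustration.
A = np.array([
    [5.0, 4.0, 1.0],
    [4.0, 5.0, 1.0],
    [1.0, 1.0, 5.0],
    [2.0, 1.0, 4.0],
])

def low_rank_approximation(A, k):
    """Keep the k largest singular values and return (User Matrix, Item Matrix)."""
    # full_matrices=False returns the reduced SVD: U is n x min(n, p), not n x n.
    U, s, Vt = np.linalg.svd(A, full_matrices=False)
    user_matrix = U[:, :k] * s[:k]   # one k-dimensional vector per user
    item_matrix = Vt[:k, :]          # one k-dimensional vector per item (as columns)
    return user_matrix, item_matrix

user_matrix, item_matrix = low_rank_approximation(A, k=2)
A_hat = user_matrix @ item_matrix          # reconstructed ratings
sse = float(np.sum((A - A_hat) ** 2))      # sum of squared error of the rank-2 fit
print(np.round(A_hat, 2))
print(round(sse, 4))
```

With k equal to the full rank the product reproduces A exactly; with a smaller k the SSE equals the sum of the squared singular values that were dropped.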
Performance Metric for Recommendation Systems

[Figure: two overlapping sets. One is All Recommendations (made on training dataset); the other is All Relevant Items (all items in the test set). The overlap is relevant items that are also recommended. The rest of the first set is irrelevant items that are recommended; the rest of the second set is relevant items that are not recommended.]

Precision = # of products relevant & recommended / # of items in the recommendations (Measure of exactness)

Recall = # of products relevant & recommended / # of relevant items (Measure of completeness)
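
Both ratios can be computed from two sets. The item lists below are hypothetical, as is the function name precision_recall.

```python
def precision_recall(recommended, relevant):
    """Return (precision, recall) for a set of recommendations against the relevant items."""
    recommended, relevant = set(recommended), set(relevant)
    hits = len(recommended & relevant)   # relevant AND recommended
    precision = hits / len(recommended) if recommended else 0.0
    recall = hits / len(relevant) if relevant else 0.0
    return precision, recall

recommended = ["A", "B", "C", "D"]   # hypothetical recommendations
relevant = ["B", "D", "E"]           # hypothetical relevant items from the test set

precision, recall = precision_recall(recommended, relevant)
print(precision, recall)
```

Two of the four recommendations are relevant, and two of the three relevant items were found.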
Performance Metric for Recommendation Systems

Precision Recall Curve: Evaluation of top n recommendations
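
One way to obtain the points of such a curve is to compute precision and recall for the top n items at every cutoff n. The ranked list, the relevant set and the function name precision_recall_at_n are assumptions made for this sketch.

```python
def precision_recall_at_n(ranked, relevant, n):
    """Precision and recall when only the top n ranked items are recommended."""
    top = ranked[:n]
    hits = sum(1 for item in top if item in relevant)
    return hits / n, hits / len(relevant)

ranked = ["B", "A", "D", "C"]   # hypothetical ranked recommendations, best first
relevant = {"B", "D", "E"}      # hypothetical relevant items from the test set

curve = [precision_recall_at_n(ranked, relevant, n) for n in range(1, len(ranked) + 1)]
for n, (p, r) in enumerate(curve, start=1):
    print(n, round(p, 2), round(r, 2))
```

Recall can only stay level or grow as n increases, while precision moves up or down depending on whether each added item is relevant.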
Performance Metric for Recommendation Systems
Some other metrics
Mean Absolute Error

Accuracy

ROC curve

Gunawardana, Asela, and Guy Shani. "A survey of accuracy evaluation metrics of recommendation tasks." Journal of Machine Learning Research 10 (Dec 2009): 2935-2962.
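
Of the metrics listed, Mean Absolute Error is the simplest to write down: the average absolute gap between known and predicted ratings. The ratings below are invented for illustration.

```python
def mean_absolute_error(actual, predicted):
    """Average of |actual - predicted| over paired ratings."""
    return sum(abs(a - p) for a, p in zip(actual, predicted)) / len(actual)

actual = [4, 5, 3]            # hypothetical held-out ratings
predicted = [3.5, 4.0, 4.0]   # hypothetical model predictions

mae = mean_absolute_error(actual, predicted)
print(round(mae, 3))
```

Unlike precision and recall, this metric scores the predicted rating values themselves rather than the contents of a top n list.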
Quiz: Comparison of Recommendation Systems
Which recommender model can handle brand new items (e.g., a newly released movie)? Cold Start Problem!

Columns (one per model):
Popularity Based
Classification Based
Nearest Neighbor-based CF
Matrix Factorization based CF

Rows (to fill in for each model):
Personalized Recommendations
Uses Context (e.g. time of day)
User Features
Item Features
Purchase History
Scalable
Can handle brand new Items?
Music Recommendation (Python notebook)
Notebook: https://github.com/dvysardana/RecommenderSystems_PyData_2016
Short url: https://goo.gl/kVnNKf
Resources
1. Book: Recommender Systems: An Introduction by Dietmar Jannach
2. Book: Mining Massive Datasets by Jure Leskovec, Anand Rajaraman, Jeff Ullman (www.mmds.org)
3. Coursera course on Recommender Systems, by University of Washington
4. Coursera course on Recommender Systems, by University of Minnesota

Do you have any questions?
