SlideShare a Scribd company logo
1 of 53
Download to read offline
Jure Leskovec (@jure)
Including joint work with
J. McAuley, R. Pandey, L. Riedel
1Jure Leskove, Stanford University & Pinterest
Connecting People & Objects
2Jure Leskove, Stanford University & Pinterest
Internet
Offsite
Save
Do
On Pinterest
Pinterest: Discovery Engine
Visual Discovery
Engine
Pins: Rich Objects
4Jure Leskove, Stanford University & Pinterest
Boards: Collections
5Jure Leskove, Stanford University & Pinterest
Boards: Collections
Pinners
Boards
Pins
Web Pages
Object
Graph
Hyperlink
Graph
From Pins to the Object Graph
30+ Billion Pins
categorized by people into more than
750 Million Boards
50% of pins have been created
in the last 6 months
8
How do we uncover
relationships
between pins?
9
Object Graph
10
Can we
understand how
pins fit together
into a giant
network?
Jure Leskove, Stanford University & Pinterest
Object Graph: Products
Pins & product catalogs:
 10s of millions of products
 100s of millions product reviews
 How do we build the product graph
Three components:
 Link Prediction
 Topic models
 Product hierarchies
11Jure Leskove, Stanford University & Pinterest
Product Graph: Relations
12
Substitutes:
Purchase
instead
Complements:
Purchase
in addition
Jure Leskove, Stanford University & Pinterest
Product Graph: Description
13
: cleaner; quieter
: cheaper; high power
: well made, easy to install
: fits perfectly, great value
Jure Leskove, Stanford University & Pinterest
Product Graph: Overview
14
Substitute
Complement
Jure Leskove, Stanford University & Pinterest
Product Graph:What it does?
1. Understand the notions of
substitute and complement goods
is substitutable for
complements
15Jure Leskove, Stanford University & Pinterest
Product Graph:What it does?
2. Generate explanations
of why certain products are
preferred
“Good quality, soft, light
weight, the colors are
beautiful and exactly like
the picture!”
People prefer this
because:
16Jure Leskove, Stanford University & Pinterest
Product Graph:What it does?
3. Recommends
baskets of related items
Query: Suggested outfit:
Query: Suggested outfit:
17Jure Leskove, Stanford University & Pinterest
Product Graph: Overview
Building networks of products
Modeling: Can we use product data
to model product relationships?
Understanding: Can we explain
why people prefer certain products
over others?
18Jure Leskove, Stanford University & Pinterest
Problem Setting
Binary prediction task:
Given a pair of products, x and y, predict
whether they are related
(substitute/complementary)
Goal: Build a probabilistic model
that encodes
19Jure Leskove, Stanford University & Pinterest
Problem Setting
How to learn
from data
Train by maximum likelihood:
20
XComplementary
Not
Complementary
Jure Leskove, Stanford University & Pinterest
Attempt 1: Big bags of features
21
Features of product i:
[0,0,0,0,0,0,0,1,0,5,0,0,0, … ,0,1,0,0,0,0,0,1,2]
Features of product j:
[0,0,0,1,0,0,0,0,0,0,0,1,0, … ,0,0,0,0,0,0,0,1,0]
aardvark zoetrope
Jure Leskove, Stanford University & Pinterest
Attempt 1: Big bags of features
22
Features of product i:
[0,0,0,0,0,0,0,1,0,5,0,0,0, … ,0,1,0,0,0,0,0,1,2]
Features of product j:
[0,0,0,1,0,0,0,0,0,0,0,1,0, … ,0,0,0,0,0,0,0,1,0]
aardvark zoetrope
Parameterized probability measure
(essentially weighted-nearest-neighbor)
Jure Leskove, Stanford University & Pinterest
Attempt 1: Big bags of features
23
Features of product i:
[0,0,0,0,0,0,0,1,0,5,0,0,0, … ,0,1,0,0,0,0,0,1,2]
Features of product j:
[0,0,0,1,0,0,0,0,0,0,0,1,0, … ,0,0,0,0,0,0,0,1,0]
aardvark zoetrope
• High-dimensional
• Prone to overfitting
• Too fine-grained
Jure Leskove, Stanford University & Pinterest
Attempt 2: Features fromTopics
LDA
Shoes Female
Blei & McAuliffe (2007)
Product topics
Use any kind of
product related features:
brand, price, reviews,
product descriptions, …
Topic models:
24
FashionJure Leskove, Stanford University & Pinterest
Attempt 2: Features fromTopics
Features of product i:
[0.1, 0.4, 0.2, 0.1, 0.2]
Features of product j:
[0.3, 0.1, 0.3, 0.2, 0.1]
Shoes Female
25Jure Leskove, Stanford University & Pinterest
Attempt 2: Features fromTopics
On the right track, but are the
topics we are discovering
relevant to link prediction?
26
Features of product i:
[0.1, 0.4, 0.2, 0.1, 0.2]
Features of product j:
[0.3, 0.1, 0.3, 0.2, 0.1]
Shoes Female
Jure Leskove, Stanford University & Pinterest
Attempt 3: Learn “good” topics
Learn to discover topics that
explain the graph structure
27Jure Leskove, Stanford University & Pinterest
Attempt 3: Learn “good” topics
Link Prediction Product “topics”
Idea: Learn both simultaneously
Discover topics that “explain” product relations
28Jure Leskove, Stanford University & Pinterest
Attempt 3: Learn “good” topics
Conceptually, we want to learn to project
products into topic space such that
related products are nearby
29Jure Leskove, Stanford University & Pinterest
The SCEPTRE Model
Combining topic models with
link prediction
Topic model with topic distribution 𝜽𝜽
But, the topics should be “good” as
features for the link prediction 30Jure Leskove, Stanford University & Pinterest
The SCEPTRE Model: Details
31
Topic
membership
Jure Leskove, Stanford University & Pinterest
The SCEPTRE Model
why do people who view
X eventually buy Y?
There is a link between the two products because
people use similar words to describe them
But in what direction does the link flow?
Issue 1: Relationships we want to learn
are not symmetric
32Jure Leskove, Stanford University & Pinterest
The SCEPTRE Model
why do people why view
X eventually buy Y?
Solution: We solve this issue by learning
“relatedness” in addition to “directedness”
Relationships: Explained by product “properties”
“baby, pajamas, pants, colorful”
Directedness: Subjective/qualitative language
“true size, fits well, items are the same color as on the picture”
33Jure Leskove, Stanford University & Pinterest
Learning Multiple Graphs
35
browsed together
bought together
Issue 2: We want to learn multiple
relationships simultaneously
We could fit two independent models, but learning both at once:
1) Gives us more data on which to train the complete model
2) Helps with interpretability, since both relationships are explained in
terms of the same topicsJure Leskove, Stanford University & Pinterest
Learning Multiple Graphs
36
Solution: We fix this by learning multiple
regressors simultaneously (one for each graph),
that operate on a single set of topics
One regressor
per graph
Jure Leskove, Stanford University & Pinterest
Sceptre is Not tractable
37
Issue 3:The model has a too
many parameters
Thousands of topics multiplied by
millions of products
Jure Leskove, Stanford University & Pinterest
Including Hierarchy
Idea: use the
category
hierarchy to
sparsify the
model
Solution: Product hierarchy
38Jure Leskove, Stanford University & Pinterest
Including Hierarchy
39
Associate each node in the category
tree with a small number of topics:
Now we can fit models with
thousands of topics but only
10-20 are active per product
“Car audio” topics (for example)
have probability zero of being
selected for this product
Topics at the top of the hierarchy are
common to all electronics products, and
will contain generic (though electronics
specific) languageJure Leskove, Stanford University & Pinterest
Training the model: EM
40
E-step (topic assignments)
M-step (link prediction)
Other topic/regression
parameters (word distribution
𝜙𝜙 and topic assignments z)
Jure Leskove, Stanford University & Pinterest
Building the Product Graph
Now, we can generate the product graph
by identifying most probable links
For every product, rank all other products
according to p(x is related to y)
But this is slow!
Quadratic number of comparisons!
Solution: Use product hierarchy and a
matching engine
43Jure Leskove, Stanford University & Pinterest
Experiments
 Just for fun, let’s use the Amazon
product catalog:
44Jure Leskove, Stanford University & Pinterest
Edge Prediction Accuracy
45Jure Leskove, Stanford University & Pinterest
Ranking Performance
Manual examination shows great performance
(false positives are actually very relevant)
46Jure Leskove, Stanford University & Pinterest
Results: Micro-Categories
47Jure Leskove, Stanford University & Pinterest
Results: Micro-Categories
48Jure Leskove, Pinterest & Stanford University
Explaining user preferences
 Explain recommendations by identifying
words that “best explain” the link:
 Topic model we assign a topic to each word
 Logistic regressor uses the words to make predictions
 Identify phrases that maximize the likelihood of the
link in order to explain it
49
Use the “directedness” model to generate explanations as
it selects more subjective language (i.e., how do the products
differ, and why was one product “preferable” over another).
Jure Leskove, Stanford University & Pinterest
Example: Product Graph
50Jure Leskove, Stanford University & Pinterest
Example: Product Graph
51Jure Leskove, Stanford University & Pinterest
Pinterest as a
graph of objects
53
Connecting People & Objects
54Jure Leskove, Stanford University & Pinterest
Tourist
Attractions
Food
Sporting
Venues
San
Francisco
Art
Galleries
Pinterest Graph - Example
User:
●likes classic art
●just viewed a pin
about things to do
in SF Artists
Pinners
Boards
Images
Web Pages
Object
Graph
Hyperlink
Graph
From Pins to the Object Graph
We are hiring!
58
jure@pinterest.com
Inferring Networks of Substitutable and Complementary Products
by J. McAuley, R. Pandey, J. Leskovec. ACM SIGKDD2015.

More Related Content

Recently uploaded

Bacterial Identification and Classifications
Bacterial Identification and ClassificationsBacterial Identification and Classifications
Bacterial Identification and Classifications
Areesha Ahmad
 
Conjugation, transduction and transformation
Conjugation, transduction and transformationConjugation, transduction and transformation
Conjugation, transduction and transformation
Areesha Ahmad
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Sérgio Sacani
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 
development of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virusdevelopment of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virus
NazaninKarimi6
 

Recently uploaded (20)

Clean In Place(CIP).pptx .
Clean In Place(CIP).pptx                 .Clean In Place(CIP).pptx                 .
Clean In Place(CIP).pptx .
 
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceuticsPulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
 
GBSN - Microbiology (Unit 3)
GBSN - Microbiology (Unit 3)GBSN - Microbiology (Unit 3)
GBSN - Microbiology (Unit 3)
 
300003-World Science Day For Peace And Development.pptx
300003-World Science Day For Peace And Development.pptx300003-World Science Day For Peace And Development.pptx
300003-World Science Day For Peace And Development.pptx
 
Bacterial Identification and Classifications
Bacterial Identification and ClassificationsBacterial Identification and Classifications
Bacterial Identification and Classifications
 
COST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptxCOST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptx
 
Molecular markers- RFLP, RAPD, AFLP, SNP etc.
Molecular markers- RFLP, RAPD, AFLP, SNP etc.Molecular markers- RFLP, RAPD, AFLP, SNP etc.
Molecular markers- RFLP, RAPD, AFLP, SNP etc.
 
Conjugation, transduction and transformation
Conjugation, transduction and transformationConjugation, transduction and transformation
Conjugation, transduction and transformation
 
module for grade 9 for distance learning
module for grade 9 for distance learningmodule for grade 9 for distance learning
module for grade 9 for distance learning
 
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)
 
Dubai Call Girls Beauty Face Teen O525547819 Call Girls Dubai Young
Dubai Call Girls Beauty Face Teen O525547819 Call Girls Dubai YoungDubai Call Girls Beauty Face Teen O525547819 Call Girls Dubai Young
Dubai Call Girls Beauty Face Teen O525547819 Call Girls Dubai Young
 
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
 
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 60009654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
 
development of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virusdevelopment of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virus
 
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and SpectrometryFAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
 
Zoology 5th semester notes( Sumit_yadav).pdf
Zoology 5th semester notes( Sumit_yadav).pdfZoology 5th semester notes( Sumit_yadav).pdf
Zoology 5th semester notes( Sumit_yadav).pdf
 
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
 

Featured

How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental Health
ThinkNow
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
Kurio // The Social Media Age(ncy)
 

Featured (20)

Everything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPTEverything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPT
 
Product Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsProduct Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage Engineerings
 
How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental Health
 
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
 
Skeleton Culture Code
Skeleton Culture CodeSkeleton Culture Code
Skeleton Culture Code
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search Intent
 
How to have difficult conversations
How to have difficult conversations How to have difficult conversations
How to have difficult conversations
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best Practices
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project management
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
 
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
 

SF BIG ANALYTICS: Pinterest Chief Scientist Prof. Jure Leskovec: Discovering Networks of Products

  • 1. Jure Leskovec (@jure) Including joint work with J. McAuley, R. Pandey, L. Riedel 1Jure Leskove, Stanford University & Pinterest
  • 2. Connecting People & Objects 2Jure Leskove, Stanford University & Pinterest
  • 4. Pins: Rich Objects 4Jure Leskove, Stanford University & Pinterest
  • 5. Boards: Collections 5Jure Leskove, Stanford University & Pinterest
  • 8. 30+ Billion Pins categorized by people into more than 750 Million Boards 50% of pins have been created in the last 6 months 8
  • 9. How do we uncover relationships between pins? 9
  • 10. Object Graph 10 Can we understand how pins fit together into a giant network? Jure Leskove, Stanford University & Pinterest
  • 11. Object Graph: Products Pins & product catalogs:  10s of millions of products  100s of millions product reviews  How do we build the product graph Three components:  Link Prediction  Topic models  Product hierarchies 11Jure Leskove, Stanford University & Pinterest
  • 12. Product Graph: Relations 12 Substitutes: Purchase instead Complements: Purchase in addition Jure Leskove, Stanford University & Pinterest
  • 13. Product Graph: Description 13 : cleaner; quieter : cheaper; high power : well made, easy to install : fits perfectly, great value Jure Leskove, Stanford University & Pinterest
  • 14. Product Graph: Overview 14 Substitute Complement Jure Leskove, Stanford University & Pinterest
  • 15. Product Graph:What it does? 1. Understand the notions of substitute and complement goods is substitutable for complements 15Jure Leskove, Stanford University & Pinterest
  • 16. Product Graph:What it does? 2. Generate explanations of why certain products are preferred “Good quality, soft, light weight, the colors are beautiful and exactly like the picture!” People prefer this because: 16Jure Leskove, Stanford University & Pinterest
  • 17. Product Graph:What it does? 3. Recommends baskets of related items Query: Suggested outfit: Query: Suggested outfit: 17Jure Leskove, Stanford University & Pinterest
  • 18. Product Graph: Overview Building networks of products Modeling: Can we use product data to model product relationships? Understanding: Can we explain why people prefer certain products over others? 18Jure Leskove, Stanford University & Pinterest
  • 19. Problem Setting Binary prediction task: Given a pair of products, x and y, predict whether they are related (substitute/complementary) Goal: Build a probabilistic model that encodes 19Jure Leskove, Stanford University & Pinterest
  • 20. Problem Setting How to learn from data Train by maximum likelihood: 20 XComplementary Not Complementary Jure Leskove, Stanford University & Pinterest
  • 21. Attempt 1: Big bags of features 21 Features of product i: [0,0,0,0,0,0,0,1,0,5,0,0,0, … ,0,1,0,0,0,0,0,1,2] Features of product j: [0,0,0,1,0,0,0,0,0,0,0,1,0, … ,0,0,0,0,0,0,0,1,0] aardvark zoetrope Jure Leskove, Stanford University & Pinterest
  • 22. Attempt 1: Big bags of features 22 Features of product i: [0,0,0,0,0,0,0,1,0,5,0,0,0, … ,0,1,0,0,0,0,0,1,2] Features of product j: [0,0,0,1,0,0,0,0,0,0,0,1,0, … ,0,0,0,0,0,0,0,1,0] aardvark zoetrope Parameterized probability measure (essentially weighted-nearest-neighbor) Jure Leskove, Stanford University & Pinterest
  • 23. Attempt 1: Big bags of features 23 Features of product i: [0,0,0,0,0,0,0,1,0,5,0,0,0, … ,0,1,0,0,0,0,0,1,2] Features of product j: [0,0,0,1,0,0,0,0,0,0,0,1,0, … ,0,0,0,0,0,0,0,1,0] aardvark zoetrope • High-dimensional • Prone to overfitting • Too fine-grained Jure Leskove, Stanford University & Pinterest
  • 24. Attempt 2: Features fromTopics LDA Shoes Female Blei & McAuliffe (2007) Product topics Use any kind of product related features: brand, price, reviews, product descriptions, … Topic models: 24 FashionJure Leskove, Stanford University & Pinterest
  • 25. Attempt 2: Features fromTopics Features of product i: [0.1, 0.4, 0.2, 0.1, 0.2] Features of product j: [0.3, 0.1, 0.3, 0.2, 0.1] Shoes Female 25Jure Leskove, Stanford University & Pinterest
  • 26. Attempt 2: Features fromTopics On the right track, but are the topics we are discovering relevant to link prediction? 26 Features of product i: [0.1, 0.4, 0.2, 0.1, 0.2] Features of product j: [0.3, 0.1, 0.3, 0.2, 0.1] Shoes Female Jure Leskove, Stanford University & Pinterest
  • 27. Attempt 3: Learn “good” topics Learn to discover topics that explain the graph structure 27Jure Leskove, Stanford University & Pinterest
  • 28. Attempt 3: Learn “good” topics Link Prediction Product “topics” Idea: Learn both simultaneously Discover topics that “explain” product relations 28Jure Leskove, Stanford University & Pinterest
  • 29. Attempt 3: Learn “good” topics Conceptually, we want to learn to project products into topic space such that related products are nearby 29Jure Leskove, Stanford University & Pinterest
  • 30. The SCEPTRE Model Combining topic models with link prediction Topic model with topic distribution 𝜽𝜽 But, the topics should be “good” as features for the link prediction 30Jure Leskove, Stanford University & Pinterest
  • 31. The SCEPTRE Model: Details 31 Topic membership Jure Leskove, Stanford University & Pinterest
  • 32. The SCEPTRE Model why do people who view X eventually buy Y? There is a link between the two products because people use similar words to describe them But in what direction does the link flow? Issue 1: Relationships we want to learn are not symmetric 32Jure Leskove, Stanford University & Pinterest
  • 33. The SCEPTRE Model why do people why view X eventually buy Y? Solution: We solve this issue by learning “relatedness” in addition to “directedness” Relationships: Explained by product “properties” “baby, pajamas, pants, colorful” Directedness: Subjective/qualitative language “true size, fits well, items are the same color as on the picture” 33Jure Leskove, Stanford University & Pinterest
  • 34. Learning Multiple Graphs 35 browsed together bought together Issue 2: We want to learn multiple relationships simultaneously We could fit two independent models, but learning both at once: 1) Gives us more data on which to train the complete model 2) Helps with interpretability, since both relationships are explained in terms of the same topicsJure Leskove, Stanford University & Pinterest
  • 35. Learning Multiple Graphs 36 Solution: We fix this by learning multiple regressors simultaneously (one for each graph), that operate on a single set of topics One regressor per graph Jure Leskove, Stanford University & Pinterest
  • 36. Sceptre is Not tractable 37 Issue 3:The model has a too many parameters Thousands of topics multiplied by millions of products Jure Leskove, Stanford University & Pinterest
  • 37. Including Hierarchy Idea: use the category hierarchy to sparsify the model Solution: Product hierarchy 38Jure Leskove, Stanford University & Pinterest
  • 38. Including Hierarchy 39 Associate each node in the category tree with a small number of topics: Now we can fit models with thousands of topics but only 10-20 are active per product “Car audio” topics (for example) have probability zero of being selected for this product Topics at the top of the hierarchy are common to all electronics products, and will contain generic (though electronics specific) languageJure Leskove, Stanford University & Pinterest
  • 39. Training the model: EM 40 E-step (topic assignments) M-step (link prediction) Other topic/regression parameters (word distribution 𝜙𝜙 and topic assignments z) Jure Leskove, Stanford University & Pinterest
  • 40. Building the Product Graph Now, we can generate the product graph by identifying most probable links For every product, rank all other products according to p(x is related to y) But this is slow! Quadratic number of comparisons! Solution: Use product hierarchy and a matching engine 43Jure Leskove, Stanford University & Pinterest
  • 41. Experiments  Just for fun, let’s use the Amazon product catalog: 44Jure Leskove, Stanford University & Pinterest
  • 42. Edge Prediction Accuracy 45Jure Leskove, Stanford University & Pinterest
  • 43. Ranking Performance Manual examination shows great performance (false positives are actually very relevant) 46Jure Leskove, Stanford University & Pinterest
  • 44. Results: Micro-Categories 47Jure Leskove, Stanford University & Pinterest
  • 45. Results: Micro-Categories 48Jure Leskove, Pinterest & Stanford University
  • 46. Explaining user preferences  Explain recommendations by identifying words that “best explain” the link:  Topic model we assign a topic to each word  Logistic regressor uses the words to make predictions  Identify phrases that maximize the likelihood of the link in order to explain it 49 Use the “directedness” model to generate explanations as it selects more subjective language (i.e., how do the products differ, and why was one product “preferable” over another). Jure Leskove, Stanford University & Pinterest
  • 47. Example: Product Graph 50Jure Leskove, Stanford University & Pinterest
  • 48. Example: Product Graph 51Jure Leskove, Stanford University & Pinterest
  • 49. Pinterest as a graph of objects 53
  • 50. Connecting People & Objects 54Jure Leskove, Stanford University & Pinterest
  • 51. Tourist Attractions Food Sporting Venues San Francisco Art Galleries Pinterest Graph - Example User: ●likes classic art ●just viewed a pin about things to do in SF Artists
  • 53. We are hiring! 58 jure@pinterest.com Inferring Networks of Substitutable and Complementary Products by J. McAuley, R. Pandey, J. Leskovec. ACM SIGKDD2015.