SlideShare a Scribd company logo
Tinderbook: Fall In Love with Culture
Book Recommendation
• 2,210,000 new books are published every
year
• Hard to find a good book to read, most
readers typically give up on a book in the
early chapters
• Recommender Systems help in finding a
good book → several existing websites
(GoodReads, LibraryThing, WhichBook…)
Key concepts
Collaborative filtering
Most of the existing book recommender
systems are based on collaborative filtering
“People who buy X, often buy Y”
Problem:
• Collaborative filtering suffers the cold start
problem, i.e. it does not work well when little
is known about the preferences of the user
• Long onboarding procedures: ask lot of
information about the user, mandatory login,
ask to rate many books, etc...
Linda
Mark
Jane
Competing
systems
Comparison of
existing book
recommender
systems
TinderBook
• Accurate recommendations
given just one book that the
user likes
• No login, no additional
information about the user
Goal
Onboarding
Usage Session
Temperature parameter governs the trade-off
between showing most popular vs random books
Popularity of the book is defined as the fraction of
positive feedback (ratings r ≥ 8) obtained by the book
in the LibraryThing dataset
• Low Temperature → very popular books
• High Temperature → less popular books
Recommendations
User receives five recommended books based
on the onboarding choice
• Positive feedback → Thumbs Up or Right Swipe
• Negative feedback → Thumbs Down or Left Swipe
• To read DBpedia book abstract → Info button
Usage Session
Graphical User Interface
Architecture
User Interface
API
Model
Seed
Discard
Feedback
ρ (ij
, ik
)
Book
cover
Book
metadata
Jane
MongoDB
Approach
• Entity2rec [1]
is a knowledge graph embedding algorithm that has reached
SOTA performance
• Knowledge Graphs are ideal for hybrid RSs
• Research has shown that KG embeddings can be effectively used for
recommendations [1] [2] [3] [4]
• However, in entity2rec as in [1]
, you need information about the user
• Need to extend entity2rec to address cold-start problem
How to generate accurate recommendation with only one book?
Item-to-Item Recommendation
seed
items
ρ (u, i)
0.8
0.6
0.5
0.2
Definition
ranking
function
ρ(ij
,ik
)
Entity2rec
Property-specific subgraphs
Mark Jane
GEORGE
ORWELL
DYSTOPIAN
NOVELS
feedback
feedback
feedback
feedback
dbo:author
dbo:author
dbo:subject
dbo:subject
Property-specific
subgraphs
Mark Jane
DYSTOPIAN
NOVELS
feedback
feedback
feedback
feedback
dbo:subject
dbo:subject
Mark Jane
feedback
feedback
feedback
feedback
dbo:author
dbo:author
GEORGE
ORWELL
Entity2rec
ρdbo:author
ρdct:subject
ρ (ij
,ik
) = avg( ρp
(ij
,ik
))
Item-item recommendation
Mark Jane
GEORGE
ORWELL
Mark Jane
DYSTOPIAN
NOVELS
Property-specific
embeddings using
node2vec [5]
Final
ranking
function
Mark Jane
DYSTOPIAN
NOVELS
Mark Jane
GEORGE
ORWELL
Offline evaluation
Datasets
• LibraryThing: 7,112 users, 37,231 books and 626,000 book ratings ranging from 1 to 10
Training set: 70% Validation set: 10% Test set: 20%
• LibraryThing + DBpedia: LibraryThing books have been mapped to DBpedia entities [6]
6,789 users (95.46%) 9,926 books (26.66%) 410,199 ratings (65.53%)
Metrics
• P@5: fraction of books in top-5 recommendations that are relevant to the user (rel-in-top-5 / 5)
• R@5: fraction of relevant books for the user that are in the top-5 recommendations (rel-in-top-5 / all-rel)
• SER@5: like P@5, but top 5 popular books are not counted. Measures non-obvious accurate recommendations
• NOV@5: how little known (in terms of popularity) are the recommended books. Does not matter accuracy.
Experimental setup
Offline evaluation
System P@5 R@5 SER@5 NOV@5
entity2rec 0.0549 0.0508 0.0514 11.099
ItemKNN 0.0484 0.0472 0.0463 12.2
RDF2Vec
(content-based)
0.0315 0.0288 0.0311 13.913
MostPop 0.0343 0.0256 0.007 8.4525
Results
Online evaluation
• Completeness
• Discard
• Dropout rate
• Seed popularity
• Recommendation time (or latency)
Metrics
Online evaluation
• Time span: two weeks
• Estimate a number of >100 users
• Two different configurations of temperature parameter: governs the degree of
randomness in the books that are presented to the user in the onboarding phase
• low temperature = more popular books
• high temperature = more randomness
All T = 0.3 (1st week) T = 1. (2nd week)
Tot. # seeds 470 358 112
Tot. # feedback 1936 1495 441
Tot. # discarded books 3668 2263 1405
Setup
Online evaluation
Temperature
• T = 0.3 more than 90% of the
seed books are concentrated
the top 10% in terms of
popularity
• T = 1. the popularity bias,
although still strong, decreases
Online evaluation
T = 0.3 T = 1. p value significant
P@5 0.497368 ± 0.026381 0.495833 ± 0.052701 9.79E-01 no
SER@5 0.417105 ± 0.024892 0.437500 ± 0.047382 7.07E-01 no
NOV@5 8.315443 ± 0.176832 10.095039 ± 0.347261 2.30E-05 yes
completeness 0.903947 ± 0.018229 0.937500 ± 0.025108 2.86E-01 no
discard 6.321229 ± 0.663185 12.544643 ± 2.070238 2.09E-03 yes
dropout 0.131285 ± 0.019150 0.178571 ± 0.039930 2.45E-01 no
seed_pop 0.002626 ± 0.000060 0.000835 ± 0.000086 2.74E-48 yes
Results
Summary
• TinderBook is a book recommender system that provides recommendations given
just one book that the user likes
• It is deeply rooted in semantic technologies:
• It presents an extension of a state-of-the-art KG embeddings algorithm to the cold start
scenario
• It is based on DBpedia data
• It has received good feedback from the public saying that it’s fun to use and
recommendations are accurate
• Application is online at http://www.tinderbook.it and data collection is ongoing
Lessons learned and discussion
• The use of semantic technologies have allowed us to:
• Rely on a knowledge graph embeddings algorithm to create a hybrid recommender
system in a cold start scenario
• Use DBpedia data to describe books (“Info” panel, “title”, “author”). Linked Open Data
allowed us to build an application quickly, without creating an ad-hoc curated database.
Semantic Web technologies integrate seamlessly with web applications.
• At the same time…:
• We have used an existing mapping from LibraryThing to DBpedia. If someone wanted to
replicate this approach on a different dataset, she would have to take care of the mapping,
which is time consuming
• Mapping books to DBpedia has produced a huge data loss (26% of the original catalog)
• DBpedia information is not always accurate. For thumbnails, we had to rely on Google to
have more accurate results.
• To foster the development of SW-based application, effort needs to be focused on improving
data quality and in making available to the public as many datasets mapping as possible
References
[1] Palumbo, E., Rizzo, G., Troncy, R.: Entity2rec: Learning user-item relatedness from knowledge graphs for top-n
item recommendation. In: 11th International Conference on Recommender Systems (RecSys). pp. 32–36 (2017)
[2] Palumbo, E., Rizzo, G., Troncy, R., Baralis, E., Osella, M., Ferro, E.: Knowledge graph embeddings with node2vec
for item recommendation. In: European Semantic Web Conference (ESWC), Demo Track. pp. 117–120 (2018)
[3] Palumbo, E., Rizzo, G., Troncy, R., Baralis, E., Osella, M., Ferro, E.: Translational models for item recommendation.
In: European Semantic Web Conference (ESWC) Satellite Events. pp. 478–490 (2018)
[4] Rosati, J., Ristoski, P., Di Noia, T., Leone, R.d., Paulheim, H.: RDF graph embeddings for content-based
recommender systems. In: CEUR workshop proceedings. vol. 1673, pp. 23–30 (2016)
[5] Grover, A., Leskovec, J.: node2vec: Scalable feature learning for networks. In: 22nd International Conference on
Knowledge Discovery and Data Mining (SIGKDD). pp. 855–864 (2016)
[6] Di Noia, T., Ostuni, V.C., Tomeo, P., Di Sciascio, E.: Sprank: Semantic path-based ranking for top-n
recommendations using linked open data. ACM Transactions on Intelligent Systems and Technology (TIST) 8(1) (2016)
http://www.tinderbook.it
https://github.com/D2KLab/entity2rec
https://www.slideshare.net/EnricoPalumbo2
Contacts
Enrico Palumbo - Rec Sys - enrico.palumbo@linksfoundation.com
Alberto Buzio - User Interface - alberto.buzio@linksfoundation.com

More Related Content

Similar to Tinderbook

Replication in Data Science - A Dance Between Data Science & Machine Learning...
Replication in Data Science - A Dance Between Data Science & Machine Learning...Replication in Data Science - A Dance Between Data Science & Machine Learning...
Replication in Data Science - A Dance Between Data Science & Machine Learning...June Andrews
 
The economics of information (1)
The economics of information (1)The economics of information (1)
The economics of information (1)WiLS
 
The How and Why of Feature Engineering
The How and Why of Feature EngineeringThe How and Why of Feature Engineering
The How and Why of Feature EngineeringAlice Zheng
 
ArXiv Literature Exploration using Social Network Analysis
ArXiv Literature Exploration using Social Network AnalysisArXiv Literature Exploration using Social Network Analysis
ArXiv Literature Exploration using Social Network AnalysisTanat Iempreedee
 
Predict oscars (4:17)
Predict oscars (4:17)Predict oscars (4:17)
Predict oscars (4:17)Thinkful
 
Content based recommendation systems
Content based recommendation systemsContent based recommendation systems
Content based recommendation systemsAravindharamanan S
 
BOOK RECALL TYPPPPPPBBBBBBBBBHHHHHHHHHHHHY
BOOK RECALL TYPPPPPPBBBBBBBBBHHHHHHHHHHHHYBOOK RECALL TYPPPPPPBBBBBBBBBHHHHHHHHHHHHY
BOOK RECALL TYPPPPPPBBBBBBBBBHHHHHHHHHHHHY13DikshaDatir
 
Thought Vectors and Knowledge Graphs in AI-powered Search
Thought Vectors and Knowledge Graphs in AI-powered SearchThought Vectors and Knowledge Graphs in AI-powered Search
Thought Vectors and Knowledge Graphs in AI-powered SearchTrey Grainger
 
Recommender Systems! @ASAI 2011
Recommender Systems! @ASAI 2011Recommender Systems! @ASAI 2011
Recommender Systems! @ASAI 2011Ernesto Mislej
 
The (very) basics of AI for the Radiology resident
The (very) basics of AI for the Radiology residentThe (very) basics of AI for the Radiology resident
The (very) basics of AI for the Radiology residentPedro Staziaki
 
Tutorial 13 (explicit ugc + sentiment analysis)
Tutorial 13 (explicit ugc + sentiment analysis)Tutorial 13 (explicit ugc + sentiment analysis)
Tutorial 13 (explicit ugc + sentiment analysis)Kira
 
Using powerpoint report by apalisok and cuyno
Using powerpoint report by apalisok and cuynoUsing powerpoint report by apalisok and cuyno
Using powerpoint report by apalisok and cuynoPrecious Alyana Apalisok
 
Core Methods In Educational Data Mining
Core Methods In Educational Data MiningCore Methods In Educational Data Mining
Core Methods In Educational Data Miningebelani
 
How to Interpret Implicit User Feedback
How to Interpret Implicit User FeedbackHow to Interpret Implicit User Feedback
How to Interpret Implicit User FeedbackLadislav Peska
 
Module 1.3 data exploratory
Module 1.3  data exploratoryModule 1.3  data exploratory
Module 1.3 data exploratorySara Hooker
 
Predict the Oscars with Data Science
Predict the Oscars with Data SciencePredict the Oscars with Data Science
Predict the Oscars with Data ScienceCarlos Edo
 
SFScon18 - Ludovik Coba - rrecsys: an R library for prototyping and assessing...
SFScon18 - Ludovik Coba - rrecsys: an R library for prototyping and assessing...SFScon18 - Ludovik Coba - rrecsys: an R library for prototyping and assessing...
SFScon18 - Ludovik Coba - rrecsys: an R library for prototyping and assessing...South Tyrol Free Software Conference
 

Similar to Tinderbook (20)

Replication in Data Science - A Dance Between Data Science & Machine Learning...
Replication in Data Science - A Dance Between Data Science & Machine Learning...Replication in Data Science - A Dance Between Data Science & Machine Learning...
Replication in Data Science - A Dance Between Data Science & Machine Learning...
 
The economics of information (1)
The economics of information (1)The economics of information (1)
The economics of information (1)
 
The How and Why of Feature Engineering
The How and Why of Feature EngineeringThe How and Why of Feature Engineering
The How and Why of Feature Engineering
 
ArXiv Literature Exploration using Social Network Analysis
ArXiv Literature Exploration using Social Network AnalysisArXiv Literature Exploration using Social Network Analysis
ArXiv Literature Exploration using Social Network Analysis
 
Predict oscars (4:17)
Predict oscars (4:17)Predict oscars (4:17)
Predict oscars (4:17)
 
Content based recommendation systems
Content based recommendation systemsContent based recommendation systems
Content based recommendation systems
 
BOOK RECALL TYPPPPPPBBBBBBBBBHHHHHHHHHHHHY
BOOK RECALL TYPPPPPPBBBBBBBBBHHHHHHHHHHHHYBOOK RECALL TYPPPPPPBBBBBBBBBHHHHHHHHHHHHY
BOOK RECALL TYPPPPPPBBBBBBBBBHHHHHHHHHHHHY
 
NHS May 2015
NHS May 2015 NHS May 2015
NHS May 2015
 
NHS May 2015
NHS May 2015NHS May 2015
NHS May 2015
 
Thought Vectors and Knowledge Graphs in AI-powered Search
Thought Vectors and Knowledge Graphs in AI-powered SearchThought Vectors and Knowledge Graphs in AI-powered Search
Thought Vectors and Knowledge Graphs in AI-powered Search
 
Recommender Systems! @ASAI 2011
Recommender Systems! @ASAI 2011Recommender Systems! @ASAI 2011
Recommender Systems! @ASAI 2011
 
The (very) basics of AI for the Radiology resident
The (very) basics of AI for the Radiology residentThe (very) basics of AI for the Radiology resident
The (very) basics of AI for the Radiology resident
 
Introduction to data visualization
Introduction to data visualizationIntroduction to data visualization
Introduction to data visualization
 
Tutorial 13 (explicit ugc + sentiment analysis)
Tutorial 13 (explicit ugc + sentiment analysis)Tutorial 13 (explicit ugc + sentiment analysis)
Tutorial 13 (explicit ugc + sentiment analysis)
 
Using powerpoint report by apalisok and cuyno
Using powerpoint report by apalisok and cuynoUsing powerpoint report by apalisok and cuyno
Using powerpoint report by apalisok and cuyno
 
Core Methods In Educational Data Mining
Core Methods In Educational Data MiningCore Methods In Educational Data Mining
Core Methods In Educational Data Mining
 
How to Interpret Implicit User Feedback
How to Interpret Implicit User FeedbackHow to Interpret Implicit User Feedback
How to Interpret Implicit User Feedback
 
Module 1.3 data exploratory
Module 1.3  data exploratoryModule 1.3  data exploratory
Module 1.3 data exploratory
 
Predict the Oscars with Data Science
Predict the Oscars with Data SciencePredict the Oscars with Data Science
Predict the Oscars with Data Science
 
SFScon18 - Ludovik Coba - rrecsys: an R library for prototyping and assessing...
SFScon18 - Ludovik Coba - rrecsys: an R library for prototyping and assessing...SFScon18 - Ludovik Coba - rrecsys: an R library for prototyping and assessing...
SFScon18 - Ludovik Coba - rrecsys: an R library for prototyping and assessing...
 

More from Enrico Palumbo

Knowledge Graph Embeddings for Recommender Systems
Knowledge Graph Embeddings for Recommender SystemsKnowledge Graph Embeddings for Recommender Systems
Knowledge Graph Embeddings for Recommender SystemsEnrico Palumbo
 
An Empirical Comparison of Knowledge Graph Embeddings for Item Recommendation
An Empirical Comparison of Knowledge Graph Embeddings for Item RecommendationAn Empirical Comparison of Knowledge Graph Embeddings for Item Recommendation
An Empirical Comparison of Knowledge Graph Embeddings for Item RecommendationEnrico Palumbo
 
Path recommender rec tour
Path recommender   rec tourPath recommender   rec tour
Path recommender rec tourEnrico Palumbo
 
Innovation Diffusion: a (Big) Data-driven approach to the study of the geogra...
Innovation Diffusion: a (Big) Data-driven approach to the study of the geogra...Innovation Diffusion: a (Big) Data-driven approach to the study of the geogra...
Innovation Diffusion: a (Big) Data-driven approach to the study of the geogra...Enrico Palumbo
 
An ensemble approach to financial entity matching
An ensemble approach to financial entity matchingAn ensemble approach to financial entity matching
An ensemble approach to financial entity matchingEnrico Palumbo
 

More from Enrico Palumbo (6)

Knowledge Graph Embeddings for Recommender Systems
Knowledge Graph Embeddings for Recommender SystemsKnowledge Graph Embeddings for Recommender Systems
Knowledge Graph Embeddings for Recommender Systems
 
An Empirical Comparison of Knowledge Graph Embeddings for Item Recommendation
An Empirical Comparison of Knowledge Graph Embeddings for Item RecommendationAn Empirical Comparison of Knowledge Graph Embeddings for Item Recommendation
An Empirical Comparison of Knowledge Graph Embeddings for Item Recommendation
 
Entity2rec recsys
Entity2rec recsysEntity2rec recsys
Entity2rec recsys
 
Path recommender rec tour
Path recommender   rec tourPath recommender   rec tour
Path recommender rec tour
 
Innovation Diffusion: a (Big) Data-driven approach to the study of the geogra...
Innovation Diffusion: a (Big) Data-driven approach to the study of the geogra...Innovation Diffusion: a (Big) Data-driven approach to the study of the geogra...
Innovation Diffusion: a (Big) Data-driven approach to the study of the geogra...
 
An ensemble approach to financial entity matching
An ensemble approach to financial entity matchingAn ensemble approach to financial entity matching
An ensemble approach to financial entity matching
 

Recently uploaded

一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单enxupq
 
一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单enxupq
 
Business update Q1 2024 Lar España Real Estate SOCIMI
Business update Q1 2024 Lar España Real Estate SOCIMIBusiness update Q1 2024 Lar España Real Estate SOCIMI
Business update Q1 2024 Lar España Real Estate SOCIMIAlejandraGmez176757
 
社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .NABLAS株式会社
 
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单yhkoc
 
Computer Presentation.pptx ecommerce advantage s
Computer Presentation.pptx ecommerce advantage sComputer Presentation.pptx ecommerce advantage s
Computer Presentation.pptx ecommerce advantage sMAQIB18
 
2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...
2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...
2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...elinavihriala
 
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...correoyaya
 
Jpolillo Amazon PPC - Bid Optimization Sample
Jpolillo Amazon PPC - Bid Optimization SampleJpolillo Amazon PPC - Bid Optimization Sample
Jpolillo Amazon PPC - Bid Optimization SampleJames Polillo
 
Exploratory Data Analysis - Dilip S.pptx
Exploratory Data Analysis - Dilip S.pptxExploratory Data Analysis - Dilip S.pptx
Exploratory Data Analysis - Dilip S.pptxDilipVasan
 
Q1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year ReboundQ1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year ReboundOppotus
 
Webinar One View, Multiple Systems No-Code Integration of Salesforce and ERPs
Webinar One View, Multiple Systems No-Code Integration of Salesforce and ERPsWebinar One View, Multiple Systems No-Code Integration of Salesforce and ERPs
Webinar One View, Multiple Systems No-Code Integration of Salesforce and ERPsCEPTES Software Inc
 
Pre-ProductionImproveddsfjgndflghtgg.pptx
Pre-ProductionImproveddsfjgndflghtgg.pptxPre-ProductionImproveddsfjgndflghtgg.pptx
Pre-ProductionImproveddsfjgndflghtgg.pptxStephen266013
 
Supply chain analytics to combat the effects of Ukraine-Russia-conflict
Supply chain analytics to combat the effects of Ukraine-Russia-conflictSupply chain analytics to combat the effects of Ukraine-Russia-conflict
Supply chain analytics to combat the effects of Ukraine-Russia-conflictJack Cole
 
Tabula.io Cheatsheet: automate your data workflows
Tabula.io Cheatsheet: automate your data workflowsTabula.io Cheatsheet: automate your data workflows
Tabula.io Cheatsheet: automate your data workflowsalex933524
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单ewymefz
 
Using PDB Relocation to Move a Single PDB to Another Existing CDB
Using PDB Relocation to Move a Single PDB to Another Existing CDBUsing PDB Relocation to Move a Single PDB to Another Existing CDB
Using PDB Relocation to Move a Single PDB to Another Existing CDBAlireza Kamrani
 
How can I successfully sell my pi coins in Philippines?
How can I successfully sell my pi coins in Philippines?How can I successfully sell my pi coins in Philippines?
How can I successfully sell my pi coins in Philippines?DOT TECH
 
Opendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay
 

Recently uploaded (20)

一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单
 
一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单
 
Business update Q1 2024 Lar España Real Estate SOCIMI
Business update Q1 2024 Lar España Real Estate SOCIMIBusiness update Q1 2024 Lar España Real Estate SOCIMI
Business update Q1 2024 Lar España Real Estate SOCIMI
 
社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .
 
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
 
Computer Presentation.pptx ecommerce advantage s
Computer Presentation.pptx ecommerce advantage sComputer Presentation.pptx ecommerce advantage s
Computer Presentation.pptx ecommerce advantage s
 
2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...
2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...
2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...
 
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
 
Jpolillo Amazon PPC - Bid Optimization Sample
Jpolillo Amazon PPC - Bid Optimization SampleJpolillo Amazon PPC - Bid Optimization Sample
Jpolillo Amazon PPC - Bid Optimization Sample
 
Exploratory Data Analysis - Dilip S.pptx
Exploratory Data Analysis - Dilip S.pptxExploratory Data Analysis - Dilip S.pptx
Exploratory Data Analysis - Dilip S.pptx
 
Q1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year ReboundQ1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year Rebound
 
Webinar One View, Multiple Systems No-Code Integration of Salesforce and ERPs
Webinar One View, Multiple Systems No-Code Integration of Salesforce and ERPsWebinar One View, Multiple Systems No-Code Integration of Salesforce and ERPs
Webinar One View, Multiple Systems No-Code Integration of Salesforce and ERPs
 
Pre-ProductionImproveddsfjgndflghtgg.pptx
Pre-ProductionImproveddsfjgndflghtgg.pptxPre-ProductionImproveddsfjgndflghtgg.pptx
Pre-ProductionImproveddsfjgndflghtgg.pptx
 
Supply chain analytics to combat the effects of Ukraine-Russia-conflict
Supply chain analytics to combat the effects of Ukraine-Russia-conflictSupply chain analytics to combat the effects of Ukraine-Russia-conflict
Supply chain analytics to combat the effects of Ukraine-Russia-conflict
 
Tabula.io Cheatsheet: automate your data workflows
Tabula.io Cheatsheet: automate your data workflowsTabula.io Cheatsheet: automate your data workflows
Tabula.io Cheatsheet: automate your data workflows
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
 
Using PDB Relocation to Move a Single PDB to Another Existing CDB
Using PDB Relocation to Move a Single PDB to Another Existing CDBUsing PDB Relocation to Move a Single PDB to Another Existing CDB
Using PDB Relocation to Move a Single PDB to Another Existing CDB
 
Slip-and-fall Injuries: Top Workers' Comp Claims
Slip-and-fall Injuries: Top Workers' Comp ClaimsSlip-and-fall Injuries: Top Workers' Comp Claims
Slip-and-fall Injuries: Top Workers' Comp Claims
 
How can I successfully sell my pi coins in Philippines?
How can I successfully sell my pi coins in Philippines?How can I successfully sell my pi coins in Philippines?
How can I successfully sell my pi coins in Philippines?
 
Opendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptx
 

Tinderbook

  • 1. Tinderbook: Fall In Love with Culture
  • 2. Book Recommendation • 2,210,000 new books are published every year • Hard to find a good book to read, most readers typically give up on a book in the early chapters • Recommender Systems help in finding a good book → several existing websites (GoodReads, LibraryThing, WhichBook…) Key concepts
  • 3. Collaborative filtering Most of the existing book recommender systems are based on collaborative filtering “People who buy X, often buy Y” Problem: • Collaborative filtering suffers the cold start problem, i.e. it does not work well when little is known about the preferences of the user • Long onboarding procedures: ask lot of information about the user, mandatory login, ask to rate many books, etc... Linda Mark Jane
  • 5. TinderBook • Accurate recommendations given just one book that the user likes • No login, no additional information about the user Goal
  • 6. Onboarding Usage Session Temperature parameter governs the trade-off between showing most popular vs random books Popularity of the book is defined as the fraction of positive feedback (ratings r ≥ 8) obtained by the book in the LibraryThing dataset • Low Temperature → very popular books • High Temperature → less popular books
  • 7. Recommendations User receives five recommended books based on the onboarding choice • Positive feedback → Thumbs Up or Right Swipe • Negative feedback → Thumbs Down or Left Swipe • To read DBpedia book abstract → Info button Usage Session
  • 9. Architecture User Interface API Model Seed Discard Feedback ρ (ij , ik ) Book cover Book metadata Jane MongoDB
  • 10. Approach • Entity2rec [1] is a knowledge graph embedding algorithm that has reached SOTA performance • Knowledge Graphs are ideal for hybrid RSs • Research has shown that KG embeddings can be effectively used for recommendations [1] [2] [3] [4] • However, in entity2rec as in [1] , you need information about the user • Need to extend entity2rec to address cold-start problem How to generate accurate recommendation with only one book?
  • 11. Item-to-Item Recommendation seed items ρ (u, i) 0.8 0.6 0.5 0.2 Definition ranking function ρ(ij ,ik )
  • 12. Entity2rec Property-specific subgraphs Mark Jane GEORGE ORWELL DYSTOPIAN NOVELS feedback feedback feedback feedback dbo:author dbo:author dbo:subject dbo:subject Property-specific subgraphs Mark Jane DYSTOPIAN NOVELS feedback feedback feedback feedback dbo:subject dbo:subject Mark Jane feedback feedback feedback feedback dbo:author dbo:author GEORGE ORWELL
  • 13. Entity2rec ρdbo:author ρdct:subject ρ (ij ,ik ) = avg( ρp (ij ,ik )) Item-item recommendation Mark Jane GEORGE ORWELL Mark Jane DYSTOPIAN NOVELS Property-specific embeddings using node2vec [5] Final ranking function Mark Jane DYSTOPIAN NOVELS Mark Jane GEORGE ORWELL
  • 14. Offline evaluation Datasets • LibraryThing: 7,112 users, 37,231 books and 626,000 book ratings ranging from 1 to 10 Training set: 70% Validation set: 10% Test set: 20% • LibraryThing + DBpedia: LibraryThing books have been mapped to DBpedia entities [6] 6,789 users (95.46%) 9,926 books (26.66%) 410,199 ratings (65.53%) Metrics • P@5: fraction of books in top-5 recommendations that are relevant to the user (rel-in-top-5 / 5) • R@5: fraction of relevant books for the user that are in the top-5 recommendations (rel-in-top-5 / all-rel) • SER@5: like P@5, but top 5 popular books are not counted. Measures non-obvious accurate recommendations • NOV@5: how little known (in terms of popularity) are the recommended books. Does not matter accuracy. Experimental setup
  • 15. Offline evaluation System P@5 R@5 SER@5 NOV@5 entity2rec 0.0549 0.0508 0.0514 11.099 ItemKNN 0.0484 0.0472 0.0463 12.2 RDF2Vec (content-based) 0.0315 0.0288 0.0311 13.913 MostPop 0.0343 0.0256 0.007 8.4525 Results
  • 16. Online evaluation • Completeness • Discard • Dropout rate • Seed popularity • Recommendation time (or latency) Metrics
  • 17. Online evaluation • Time span: two weeks • Estimate a number of >100 users • Two different configurations of temperature parameter: governs the degree of randomness in the books that are presented to the user in the onboarding phase • low temperature = more popular books • high temperature = more randomness All T = 0.3 (1st week) T = 1. (2nd week) Tot. # seeds 470 358 112 Tot. # feedback 1936 1495 441 Tot. # discarded books 3668 2263 1405 Setup
  • 18. Online evaluation Temperature • T = 0.3 more than 90% of the seed books are concentrated the top 10% in terms of popularity • T = 1. the popularity bias, although still strong, decreases
  • 19. Online evaluation T = 0.3 T = 1. p value significant P@5 0.497368 ± 0.026381 0.495833 ± 0.052701 9.79E-01 no SER@5 0.417105 ± 0.024892 0.437500 ± 0.047382 7.07E-01 no NOV@5 8.315443 ± 0.176832 10.095039 ± 0.347261 2.30E-05 yes completeness 0.903947 ± 0.018229 0.937500 ± 0.025108 2.86E-01 no discard 6.321229 ± 0.663185 12.544643 ± 2.070238 2.09E-03 yes dropout 0.131285 ± 0.019150 0.178571 ± 0.039930 2.45E-01 no seed_pop 0.002626 ± 0.000060 0.000835 ± 0.000086 2.74E-48 yes Results
  • 20. Summary • TinderBook is a book recommender system that provides recommendations given just one book that the user likes • It is deeply rooted in semantic technologies: • It presents an extension of a state-of-the-art KG embeddings algorithm to the cold start scenario • It is based on DBpedia data • It has received good feedback from the public saying that it’s fun to use and recommendations are accurate • Application is online at http://www.tinderbook.it and data collection is ongoing
  • 21. Lessons learned and discussion • The use of semantic technologies have allowed us to: • Rely on a knowledge graph embeddings algorithm to create a hybrid recommender system in a cold start scenario • Use DBpedia data to describe books (“Info” panel, “title”, “author”). Linked Open Data allowed us to build an application quickly, without creating an ad-hoc curated database. Semantic Web technologies integrate seamlessly with web applications. • At the same time…: • We have used an existing mapping from LibraryThing to DBpedia. If someone wanted to replicate this approach on a different dataset, she would have to take care of the mapping, which is time consuming • Mapping books to DBpedia has produced a huge data loss (26% of the original catalog) • DBpedia information is not always accurate. For thumbnails, we had to rely on Google to have more accurate results. • To foster the development of SW-based application, effort needs to be focused on improving data quality and in making available to the public as many datasets mapping as possible
  • 22. References [1] Palumbo, E., Rizzo, G., Troncy, R.: Entity2rec: Learning user-item relatedness from knowledge graphs for top-n item recommendation. In: 11th International Conference on Recommender Systems (RecSys). pp. 32–36 (2017) [2] Palumbo, E., Rizzo, G., Troncy, R., Baralis, E., Osella, M., Ferro, E.: Knowledge graph embeddings with node2vec for item recommendation. In: European Semantic Web Conference (ESWC), Demo Track. pp. 117–120 (2018) [3] Palumbo, E., Rizzo, G., Troncy, R., Baralis, E., Osella, M., Ferro, E.: Translational models for item recommendation. In: European Semantic Web Conference (ESWC) Satellite Events. pp. 478–490 (2018) [4] Rosati, J., Ristoski, P., Di Noia, T., Leone, R.d., Paulheim, H.: RDF graph embeddings for content-based recommender systems. In: CEUR workshop proceedings. vol. 1673, pp. 23–30 (2016) [5] Grover, A., Leskovec, J.: node2vec: Scalable feature learning for networks. In: 22nd International Conference on Knowledge Discovery and Data Mining (SIGKDD). pp. 855–864 (2016) [6] Di Noia, T., Ostuni, V.C., Tomeo, P., Di Sciascio, E.: Sprank: Semantic path-based ranking for top-n recommendations using linked open data. ACM Transactions on Intelligent Systems and Technology (TIST) 8(1) (2016)
  • 23. http://www.tinderbook.it https://github.com/D2KLab/entity2rec https://www.slideshare.net/EnricoPalumbo2 Contacts Enrico Palumbo - Rec Sys - enrico.palumbo@linksfoundation.com Alberto Buzio - User Interface - alberto.buzio@linksfoundation.com