Scientific Recommender Systems - PG PUSHPIN

Scientific Recommender Systems

Jan Petertonkoker

January 12th, 2012

Contents

1. Motivation (Examples)
2. Recommender Systems
3. Categories of Recommender Systems
3.1 Content-based Recommender: TF-IDF
3.2 Collaborative Recommender: Apache Mahout
3.3 Hybrid Recommender: SciPlore
4. Visualizations (Prototype)
5. Conclusion


Motivation

Example: Amazon


Example: Twitter


Recommender Systems

u : C × S → R

C - set of all users
S - set of all items
R - totally ordered set, which describes the usefulness of the
items to the respective user
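
As a minimal sketch of the definition above: if the utility function is available as a callable, recommending to a user c amounts to picking the item s with the highest u(c, s). The names `recommend` and `U` and the toy values are assumptions made for illustration, and R is taken to be the real numbers.

```python
from typing import Callable, Iterable, Optional


def recommend(user: str, items: Iterable[str],
              utility: Callable[[str, str], float]) -> Optional[str]:
    """Return the item s that maximises utility(user, s), or None if there are no items."""
    return max(items, key=lambda s: utility(user, s), default=None)


# Toy utility table with hypothetical values; keys are (user, item) pairs.
U = {
    ("alice", "paper_a"): 0.2,
    ("alice", "paper_b"): 0.9,
    ("alice", "paper_c"): 0.5,
}

best = recommend("alice", ["paper_a", "paper_b", "paper_c"],
                 lambda c, s: U[(c, s)])
```

In practice u is only known for the items a user has already rated, so the recommender has to estimate the missing values; the categories below differ in how they do that.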


Categories of Recommender Systems


content-based: items are recommended that are similar to
items the user liked in the past
collaborative: items are recommended that were liked by people
who are similar to the user (similar taste/preferences)
hybrid: a combination of content-based and collaborative
recommendation approaches



Content-based Recommender Systems

utility u(c, s) of an item s is estimated with the help of the
utilities u(c, s_i) of all items s_i ∈ S that user c already rated
and that are similar to item s
similarity between items is calculated according to their
attributes
user and item profiles
common problems
limited content analysis
overspecialization
new user problem
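
The content-based estimate described above can be sketched as a similarity-weighted average over the items the user has already rated. This is only one possible reading: representing items as numeric attribute vectors, using cosine similarity, and the function names `cosine` and `estimate_utility` are all assumptions, not something the slides prescribe.

```python
import math


def cosine(a, b):
    """Cosine similarity of two equal-length attribute vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def estimate_utility(target_vector, rated_items):
    """Estimate u(c, s) for one user.

    rated_items: list of (attribute_vector, rating) pairs for items s_i
    that the user already rated. Each rating is weighted by the
    similarity between s_i and the target item s.
    """
    weighted_sum = 0.0
    weight_total = 0.0
    for vector, rating in rated_items:
        sim = cosine(target_vector, vector)
        weighted_sum += sim * rating
        weight_total += abs(sim)
    return weighted_sum / weight_total if weight_total else 0.0


# Hypothetical data: two rated items described by two attributes each.
rated = [([1.0, 0.0], 5.0), ([0.0, 1.0], 1.0)]
```

An item identical to the first rated item inherits its rating, while an item equally similar to both lands halfway between the two ratings.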



Content-based Recommender: TF-IDF
N - total number of documents in the system
n_i - number of documents in which keyword k_i appears
f_{i,j} - number of times keyword k_i appears in document d_j

Term Frequency
TF_{i,j} = \frac{f_{i,j}}{\max_z f_{z,j}}
the maximum in the denominator is calculated over the frequencies
of all keywords k_z that appear in document d_j

Inverse Document Frequency
for a keyword k_i: IDF_i = \log \frac{N}{n_i}

TF-IDF
w_{i,j} = TF_{i,j} \times IDF_i
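
The three formulas above can be tried out with a short sketch. It assumes that documents are already tokenised into keyword lists and that the logarithm is the natural one (the slides do not fix a base); the function name `tf_idf` and the sample documents are made up for illustration.

```python
import math
from collections import Counter


def tf_idf(documents):
    """Compute TF-IDF weights w[i, j] for a list of tokenised documents.

    Returns one {keyword: weight} dict per document.
    """
    total_docs = len(documents)                 # N
    doc_freq = Counter()                        # n_i per keyword
    for tokens in documents:
        doc_freq.update(set(tokens))

    weights = []
    for tokens in documents:
        counts = Counter(tokens)                # f[i, j]
        max_count = max(counts.values(), default=1)
        weights.append({
            keyword: (freq / max_count) * math.log(total_docs / doc_freq[keyword])
            for keyword, freq in counts.items()
        })
    return weights


docs = [
    ["recommender", "system", "recommender"],
    ["system", "evaluation"],
]
w = tf_idf(docs)
```

Here "system" appears in every document, so its IDF and therefore its weight is zero, while "recommender" is both frequent in the first document and rare overall and gets the highest weight there.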


Collaborative Recommender Systems

utility u(c, s) of an item s is estimated with the help of the
utilities u(c_i, s) assigned by users c_i ∈ C that are similar to
user c.
common problems
new user/item problem
cold start
sparsity
scalability
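
A minimal sketch of the collaborative estimate described above: the unknown utility u(c, s) is taken as the similarity-weighted average of the ratings that other users gave to s. The function name `predict`, the data layout, and the choice to ignore users with non-positive similarity are assumptions; the user similarity itself is passed in as a callable.

```python
def predict(user, item, ratings, similarity):
    """Estimate the rating of `user` for `item`.

    ratings: {user: {item: rating}}
    similarity: callable (user_a, user_b) -> float
    Returns None when no similar user has rated the item.
    """
    weighted_sum = 0.0
    weight_total = 0.0
    for other, prefs in ratings.items():
        if other == user or item not in prefs:
            continue
        sim = similarity(user, other)
        if sim <= 0:
            continue  # assumption: only positively similar users contribute
        weighted_sum += sim * prefs[item]
        weight_total += sim
    return weighted_sum / weight_total if weight_total else None


# Hypothetical ratings and a hand-written similarity table.
ratings = {
    "u1": {"a": 4.0},
    "u2": {"a": 5.0, "b": 4.0},
    "u3": {"a": 1.0, "b": 2.0},
}
sims = {("u1", "u2"): 0.75, ("u1", "u3"): 0.25}
estimate = predict("u1", "b", ratings, lambda x, y: sims.get((x, y), 0.0))
```

The `None` result for an item nobody similar has rated is a small illustration of the new item and sparsity problems listed above.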



Collaborative Recommender: Apache Mahout (1)

provides a "toolbox" to create collaborative recommender
systems
input
user (long), item (long), preference (double)
1, 111, 2.5
data model
input from different file formats, database
increase performance with specific data structures
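
To illustrate the input format, here is a sketch that reads such user, item, preference triples into an in-memory structure. It is plain Python and not Mahout's API; the function name `load_preferences` and the extra sample rows are assumptions.

```python
import csv
import io
from collections import defaultdict


def load_preferences(lines):
    """Parse 'user, item, preference' rows into {user_id: {item_id: preference}}."""
    prefs = defaultdict(dict)
    for row in csv.reader(lines):
        if not row:
            continue  # skip blank lines
        user_id, item_id, value = (field.strip() for field in row[:3])
        prefs[int(user_id)][int(item_id)] = float(value)
    return dict(prefs)


# First row taken from the slide; the other rows are made up.
sample = io.StringIO("1, 111, 2.5\n1, 112, 4.0\n2, 111, 3.0\n")
data = load_preferences(sample)
```

A nested dictionary keyed by user keeps all preferences of one user together, which is the access pattern a user-based recommender needs most often.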



user-based recommender

item-based recommender




similarity measures
Pearson correlation (equals cosine similarity on mean-centered data)
Euclidean distance
Spearman correlation
log-likelihood
...
slope-one recommender
other experimental recommender implementations
e.g. cluster-based
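
Two of the similarity measures listed above can be sketched in a few lines. This is an independent illustration, not Mahout's implementation: users are represented as {item: rating} dictionaries, only co-rated items are compared, and mapping a Euclidean distance d to the similarity 1 / (1 + d) is an assumption (one common choice among several).

```python
import math


def pearson(a, b):
    """Pearson correlation over the items rated by both users (0.0 if undefined)."""
    common = sorted(set(a) & set(b))
    n = len(common)
    if n < 2:
        return 0.0
    xs = [a[i] for i in common]
    ys = [b[i] for i in common]
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0
    return cov / math.sqrt(var_x * var_y)


def euclidean_similarity(a, b):
    """Similarity in (0, 1] derived from the Euclidean distance over co-rated items."""
    common = set(a) & set(b)
    if not common:
        return 0.0
    distance = math.sqrt(sum((a[i] - b[i]) ** 2 for i in common))
    return 1.0 / (1.0 + distance)


# Hypothetical users: u2 rates everything twice as high as u1, u3 reverses u1's order.
u1 = {1: 1.0, 2: 2.0, 3: 3.0}
u2 = {1: 2.0, 2: 4.0, 3: 6.0}
u3 = {1: 3.0, 2: 2.0, 3: 1.0}
```

Pearson correlation treats u1 and u2 as perfectly similar because it ignores differences in rating scale, whereas the distance-based measure penalises the gap between their absolute ratings.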



Hybrid Recommender Systems

combination of content-based and collaborative methods
separate content-based and collaborative recommender
systems whose results are combined afterwards
collaborative recommender system with some added aspects of
content-based methods
content-based recommender system with some added aspects
of collaborative methods
a single recommender system which unifies content-based and
collaborative methods from the beginning
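
For the first variant in the list above (two separate recommenders whose results are combined), one simple possibility is a linear blend of the two score lists. The sketch below assumes both recommenders return scores on a comparable scale; the names `combine_scores` and `top_n`, the weight of 0.5, and the sample scores are illustrative assumptions.

```python
def combine_scores(content_scores, collaborative_scores, weight=0.5):
    """Blend two {item: score} dicts; `weight` is the share of the content-based score.

    Items missing from one recommender contribute 0.0 from that side.
    """
    items = set(content_scores) | set(collaborative_scores)
    return {
        item: weight * content_scores.get(item, 0.0)
        + (1.0 - weight) * collaborative_scores.get(item, 0.0)
        for item in items
    }


def top_n(scores, n):
    """Return the n best items, highest score first (ties broken by item name)."""
    return sorted(scores, key=lambda item: (-scores[item], item))[:n]


# Hypothetical outputs of the two recommenders.
content = {"a": 0.8, "b": 0.2}
collaborative = {"b": 1.0, "c": 0.6}
combined = combine_scores(content, collaborative)
ranking = top_n(combined, 2)
```

The weight makes the trade-off explicit: shifting it towards the content-based side helps for new items without ratings, shifting it the other way favours what similar users liked.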



Hybrid Recommender: SciPlore

SciPlore Overview


Visualizations (Prototype)

several recommenders based on given database
visualizations for explaining recommendations

Live Presentation


Conclusion

Summary

utility function
categories of recommender systems
content-based
collaborative
hybrid
implementation with Apache Mahout
possible visualizations


Questions?


References

Apache Mahout: Scalable machine learning and data mining.
http://mahout.apache.org/ - accessed on 6th January 2012
SciPlore: Exploring Science. http://www.sciplore.org -
accessed on 6th January 2012
G Adomavicius and A Tuzhilin. Toward the next generation of
recommender systems: a survey of the state-of-the-art and
possible extensions. IEEE Transactions on Knowledge and
Data Engineering, 17(6):734-749, 2005
B Gipp, J Beel and C Hentschel. Scienstein: A research paper
recommender system, volume 301, pages 309-315. IEEE, 2009
Sean Owen, Robin Anil, Ted Dunning and Ellen Friedman.
Mahout in Action, 2011

