So far it is unclear how different factors of a scientific publication recommender system based on users' tweets have an influence on the recommendation performance. We examine three different factors, namely profiling method, temporal decay, and richness of content. Regarding profiling, we compare CF-IDF that replaces terms in TF-IDF by semantic concepts, HCF-IDF as novel hierarchical variant of CF-IDF, and topic modeling. As temporal decay functions, we apply sliding window and exponential decay. In terms of the richness of content, we compare recommendations using both full-texts and titles of publications and using only titles. Overall, the three factors make twelve recommendation strategies. We have conducted an online experiment with 123 participants and compared the strategies in a within-group design. The best recommendations are achieved by the strategy combining CF-IDF, sliding window, and with full-texts. However, the strategies using the novel HCF-IDF profiling method achieve similar results with just using the titles of the publications. Therefore, HCF-IDF can make recommendations when only short and sparse data is available. http://arxiv.org/abs/1603.07016
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Twitter Profile Recommendations
1. www.moving-project.eu
TraininG towards a society of data-saVvy inforMation prOfessionals
to enable open leadership INnovation
Chifumi Nishioka and Ansgar Scherp
Profiling vs. Time vs. Content: What does
Matter for Top-k Publication
Recommendation based on Twitter Profiles?
Kiel University and Leibniz Information Centre for Economics (ZBW Kiel), Germany
2. www.moving-project.eu
2 of 21
Motivation
• Information overload: too many papers in DLs
• Recommender systems: facilitate researchers by
suggesting papers that may interest them
• Collaborative filtering: cold start problem
• Content-based: based on their papers
• Social media in academia
• Researchers’ ideas on Twitter
[Letierce et al. 14]
• Ongoing research interests
Chifumi Nishioka (chni@informatik.uni-kiel.de)
Recommender System for scientific
papers based on Twitter
3. www.moving-project.eu
3 of 21
Three Factors
(I) Profiling method
• Extract features from social media items and
documents
• How to model user profiles and document profiles
(II) Temporal decay function
• Model the assumption that older
items are less important
• Examine which temporal decay function performs well
(III) Document content
• Investigate whether it is possible to make reasonable
recommendations using only titles of documents
Chifumi Nishioka (chni@informatik.uni-kiel.de)
4. www.moving-project.eu
4 of 21
Recommendation Procedure
Chifumi Nishioka (chni@informatik.uni-kiel.de)
User profiling
Compute similarity scores
between user profile and each of document profiles
Document profile
Recommend documents that have high similarity scores
5. www.moving-project.eu
5 of 21
Related Factors
Chifumi Nishioka (chni@informatik.uni-kiel.de)
User profiling
Compute similarity scores
between user profile and each of document profiles
Document profile
document corpus
Recommend documents that have high similarity scores
6. www.moving-project.eu
6 of 21
Three Factors and Choices
Chifumi Nishioka (chni@informatik.uni-kiel.de)
• Configurations
Factor Design Choices
Profiling method CF-IDF HCF-IDF LDA
Temporal decay function Sliding window Exponential decay
Document content All (title + full-text) Title
3 × 2 × 2 = 12 strategies are experimented
7. www.moving-project.eu
7 of 21
Profiles
• Profiles: represented as a vector, where each
element is a weight of a concept (i.e., feature)
• User profile 𝑝 𝑢 for user 𝑢
• Based on a user’s social media items (tweets) i ∈ 𝐼 𝑢
• Document profile 𝑝 𝑑 for document d
• User profile and document profile are made by
the same method
Chifumi Nishioka (chni@informatik.uni-kiel.de)
𝑝 𝑢 = {𝑤𝑒𝑖𝑔ℎ𝑡 𝑐, 𝐼 𝑢 |∀𝑐 ∈ 𝐶}
𝑤𝑒𝑖𝑔ℎ𝑡 𝑐, 𝐼 𝑢 =
𝑖∈𝐼 𝑢
𝑤𝑒𝑖𝑔ℎ𝑡(𝑐, 𝑖)
𝑝 𝑑 = {𝑤𝑒𝑖𝑔ℎ𝑡 𝑐, 𝑑 |∀𝑐 ∈ 𝐶}
a concept 𝑐 ∈ 𝐶 (i.e.,
feature) is a subject
term or topic
8. www.moving-project.eu
8 of 21
Factor I: Profiling Methods
• CF-IDF [Goossen et al. 11]
• Extension of TF-IDF replacing words with concepts
• Concept: a subject term coming from a taxonomy
• e.g., financial crisis, interest rate (economics)
• Use a domain specific taxonomy to extract only concepts
that are relevant to the target domain
Chifumi Nishioka (chni@informatik.uni-kiel.de)
How to extract features and model users and documents
𝑤𝑒𝑖𝑔ℎ𝑡′ 𝑐𝑓𝑖𝑑𝑓 𝑐, 𝑑 = 𝑐𝑓(𝑐, 𝑑) ∙ 𝑙𝑜𝑔
|𝐷|
|𝑑 ∈ 𝐷: 𝑐 ∈ 𝑑|
1. CF-IDF KB (taxonomy) based
methods2. HCF-IDF
3. LDA Topic modeling
Freely available in many domains!
9. www.moving-project.eu
9 of 21
Factor I: Profiling Methods
Chifumi Nishioka (chni@informatik.uni-kiel.de)
Social
Recommendation
Social
Tagging
Web Searching Web Mining
Site
Wrapping
Web Log
Analysis
World Wide Web
• HCF-IDF (Hierarchical CF-IDF) [Nishioka et al. 15]
• Extension of CF-IDF using spreading activation
• Combine the strength of the semantics with the
statistical strength
• 𝐵𝑒𝑙𝑙𝐿𝑜𝑔 𝑐, 𝑑 : best spreading
activation function
𝑤𝑒𝑖𝑔ℎ𝑡′ℎ𝑐𝑓𝑖𝑑𝑓 𝑐, 𝑑 = 𝐵𝑒𝑙𝑙𝐿𝑜𝑔(𝑐, 𝑑) ∙ 𝑙𝑜𝑔
|𝐷|
|𝑑 ∈ 𝐷: 𝑐 ∈ 𝑑|
Extract concepts which are
not mentioned directly
by spreading activation!
10. www.moving-project.eu
10 of 21
Factor I: Profiling Methods
• Latent Dirichlet Allocation (LDA) [Blei et al. 03]
• Unsupervised topic modeling method
• Topic model
• Document: A probability distribution over topics
• Topic: A probability distribution over words
• Treat a topic as a concept
• Procedure
• Construct a topic model over document corpus
• Infer a topic distribution over the trained topic model in
social media items
Chifumi Nishioka (chni@informatik.uni-kiel.de)
𝑤𝑒𝑖𝑔ℎ𝑡′𝑙𝑑𝑎 𝑐, 𝑑 = 𝑝(𝑐|𝑑)
11. www.moving-project.eu
11 of 21
Factor II: Temporal Decay Function
• Final weight is given after applying decay
• Sliding window
• Give weights only concepts that appear after 𝑡ℎ𝑟𝑒𝑠ℎ
• Parameter setting
• 𝑡ℎ𝑟𝑒𝑠ℎ = 250 𝑑𝑎𝑦𝑠 for social media items
• 𝑡ℎ𝑟𝑒𝑠ℎ = 9.04 𝑦𝑒𝑎𝑟𝑠 for scientific papers
• Exponential decay
• Parameter setting
• 𝜏 = 360 𝑑𝑎𝑦𝑠 for social media items
• 𝜏 = 13.05 𝑦𝑒𝑎𝑟𝑠 for scientific publications
Chifumi Nishioka (chni@informatik.uni-kiel.de)
𝑓𝑠𝑤 𝑡 =
1 for 𝑡 ≥ 𝑡ℎ𝑟𝑒𝑠ℎ
0 for 𝑡 < 𝑡ℎ𝑟𝑒𝑠ℎ
𝑓𝑒𝑥𝑝 𝑡 = 𝑒−(𝑡 𝑐𝑢𝑟𝑟𝑒𝑛𝑡−𝑡)/𝜏
𝑤𝑒𝑖𝑔ℎ𝑡 𝑐, 𝑖 = 𝑓 𝑡 ∙ 𝑤𝑒𝑖𝑔ℎ𝑡′
𝑐, 𝑖 𝑤𝑒𝑖𝑔ℎ𝑡 𝑐, 𝑖 = 𝑓 𝑡 ∙ 𝑤𝑒𝑖𝑔ℎ𝑡′
𝑐, 𝑖
12. www.moving-project.eu
12 of 21
Factor III: Document Content
• Title
• Always freely available
• All (title + full-text)
• Full-text: usually not available due to legal issues
• Extract full-text from PDF files
Chifumi Nishioka (chni@informatik.uni-kiel.de)
How the recommendation performance using only titles is
close to using both title and full-text
13. www.moving-project.eu
13 of 21
Computing Recommendations
• Temporal Cosine Similarity for CF-IDF & HCF-IDF
• 𝑓(𝑡 𝑑) to give higher weights to newer documents
• Dot Product for LDA
• Better performance than cosine similarity and
Kullback-Leibler divergence [Hazen 10]
Chifumi Nishioka (chni@informatik.uni-kiel.de)
𝑠𝑖𝑚 𝑡𝑐𝑜𝑠 𝑝 𝑢, 𝑝 𝑑 = 𝑓(𝑡 𝑑) ∙
𝑝 𝑢 ∙ 𝑝 𝑑
||𝑝 𝑢|| ∙ | 𝑝 𝑑 |
𝑠𝑖𝑚 𝑑𝑝 𝑝 𝑢, 𝑝 𝑑 = 𝑝 𝑢 ∙ 𝑝 𝑑
Recommend documents that have higher similarity scores
with user profile
14. www.moving-project.eu
14 of 21
Experiment Setup (1/3)
• Procedure
• Input his/her public Twitter handles
• The number of recommendations per strategy 𝑘 = 5
[Chen et al. 10]
• Ask participants to assess whether a recommended
paper is interesting or not
• On average 517.54 seconds to complete the
experiment
Chifumi Nishioka (chni@informatik.uni-kiel.de)
Scenario: Recommend scientific papers in the field of
economics based on users’ tweets
15. www.moving-project.eu
15 of 21
Experiment Setup (2/3)
• Web application
• Metadata (i.e., author, title, year) is shown
• Participants can open PDF files by clicking metadata
• Order of strategies is randomized
• Order of recommended papers is randomized
Chifumi Nishioka (chni@informatik.uni-kiel.de)
16. www.moving-project.eu
16 of 21
Experiment Setup (3/3)
• Dataset
• Scientific Papers
• 279,381 open access papers from EconBiz
• EconBiz: a portal for scientific papers in economics managed
by ZBW, the German National Library of Economics
• Hierarchical taxonomy
• STW, a thesaurus specialized for economics
• 3,335 semantic concepts and 37,733 labels
• 123 participants from the field of economics
• 21 bachelor /58 master /32 PhD /12 professor
• Metric: rankscore [Breese et al. 98]
• Assumption: higher ranked items are more likely to be
viewed
Chifumi Nishioka (chni@informatik.uni-kiel.de)
http://www.econbiz.de/
17. www.moving-project.eu
17 of 21
Result: Recommendation Strategy
• The strategy CF-IDF × Sliding window × All
performs best with the rankscore of 0.59
• But, the statistical test shows no significant
difference between the best strategy and the
strategies using HCF-IDF
Chifumi Nishioka (chni@informatik.uni-kiel.de)
18. www.moving-project.eu
18 of 21
Result: Influence of Three Factors
• Three-way repeated-measure ANOVA to analyze
the performance with respect to each factor
• Profiling method has the largest impact on the
recommendation performance
• Best profiling method: HCF-IDF
Chifumi Nishioka (chni@informatik.uni-kiel.de)
19. www.moving-project.eu
19 of 21
Insights from the Results
• Profiling method has the largest impact
• Best profiling method: HCF-IDF
• Spreading activation mitigates sparseness
• Works for users who have less tweets
• Usually, full-texts are unavailable for TDM
• Easy to employ HCF-IDF in many different fields
• MeSH for medicine, ACM CCS for computer science
Chifumi Nishioka (chni@informatik.uni-kiel.de)
Advantage: HCF-IDF can make good
recommendations based on only titles
20. www.moving-project.eu
20 of 21
Conclusion
• User experiment with three factors
• Profiling method: CF-IDF / HCF-IDF / LDA
• Decay function: sliding window / exponential decay
• Document content: Title / All (title + full-text)
• Result
• Profiling method has the largest impact
• Best profiling method: HCF-IDF
• Advantage of HCF-IDF: It performs well even if only
titles are available
Chifumi Nishioka (chni@informatik.uni-kiel.de)
Recommender system for scientific papers based on social
media items
21. www.moving-project.eu
21 of 21
Special thanks to
SIGIR Student Travel Grant
Project consortium and funding agency
Chifumi Nishioka (chni@informatik.uni-kiel.de)
MOVING is funded by the EU Horizon 2020 Programme under the project number INSO-4-2015: 693092
Our demo is online!
http://amygdala.informatik.uni-kiel.de/Demo/TwitterAccount
23. www.moving-project.eu
23 of 21
Reference
• [Blei and Lafferty 06] D. M. Blei and J. D. Lafferty. Dynamic topic models. ICML, 2006.
• [Blei et al. 03] D. M. Blei, A. Y. Ng, and M. I. Jordan. Latent dirichlet allocation. JMLR,
2003.
• [Breese et al. 98] J. S. Breese, D. Heckerman, and C. Kadie. Empirical analysis of
predictive algorithms for collaborative filtering. UAI, 1998.
• [Goossen et al. 11] F. Goossen, W. IJntema, F. Frasincar, F. Hogenboom, and U. Kaymak.
News personalization using the CF-IDF semantic recommender. WIMS, 2011.
• [Griffiths and Steyvers 04] T. L. Griffiths and M. Steyvers. Finding scientific topics. NAS,
2004.
• [Hazen 10] T. J. Hazen. Direct and latent modeling techniques for computing spoken
document similarity. Spoken Language Technology, 2010.
• [Kapanipathi et al. 14] P. Kapanipathi, P. Jain, C. Venkataramani, and A. Sheth. User
interests identification on Twitter using a hierarchical knowledge base. ESWC, 2014.
• [Letierce et al. 10] J. Letierce, A. Passant, J. Breslin, and S. Decker. Understanding how
Twitter is used to spread scientific messages. WebSci, 2010.
• [Nascimento et al. 11] C. Nascimento, A. H. F. Laender, A. S. da Silva, and M. A.
Gonçalves. A source independent framework for research paper recommendation. JCDL,
2011.
Chifumi Nishioka (chni@informatik.uni-kiel.de)
24. www.moving-project.eu
24 of 21
Reference
• [Nishioka et al. 15] C. Nishioka, G. Große-Bölting, A. Scherp. Influence of time on user
profiling and recommending researchers in social media. i-KNOW, 2015.
• [Shen et al. 13] W. Shen, J. Wang, P. Luo, and M. Wang. Linking named entities in tweets
with knowledge base via user interest modeling. KDD, 2013
• [Sugiyama and Kan 10] K. Sugiyama and M.-Y. Kan. Scholarly paper recommendation via
user’s recent research interests. JCDL, 2010.
Chifumi Nishioka (chni@informatik.uni-kiel.de)
25. www.moving-project.eu
25 of 21
Evaluation Metrics
• rankscore [Breese et al. 98]
• Posit that each successive item in a list is less likely to
be viewed with an exponential decay
• Set a parameter 𝜃 = 5, along with [Breese et al. 98]
• Other metrics
• Precision
• Mean Average Precision (MAP)
• Mean Reciprocal Rank (MRR)
• normalized Discounter Cumulative Gain (nDCG)
Chifumi Nishioka (chni@informatik.uni-kiel.de)
𝑟𝑎𝑛𝑘𝑠𝑐𝑜𝑟𝑒 =
𝑑∈ℎ𝑖𝑡𝑠
1
2
𝑟𝑎𝑛𝑘 𝑑−1
𝜃−1
𝑤𝑒𝑖𝑔ℎ𝑡 𝑐, 𝑖
26. www.moving-project.eu
26 of 21
Participants
• Collecting participants
• Mailing lists, tweets, and word-of-month
• 134 started the experiment and 123 completed
• Demographics of participants
• 96 male / 27 female
• 32.83 years old on average (SD: 7.34)
• 21 bachelor / 58 master / 32 a PhD / 12 professor
• 83 working in academia / 40 working in industry
• Incentive for the participation
• Get to know his / her most similar economist among
26 famous economists
• Chance to get one of two Amazon vouchers (50 EUR)
Chifumi Nishioka (chni@informatik.uni-kiel.de)
27. www.moving-project.eu
27 of 21
Tweets of Participants
• Extract user’s tweets via Twitter API
• Enable to extract at most 3,200 tweets per user
• Collect only tweets in English
• The number of English tweets per participant
• Average: 1096.82 English tweets (SD: 1048.46)
• Max: 3192
• Min: 2
• Criteria for the participation
• Participants who had no tweet in the last 250 days
could not participate in the experiment
• Five were rejected for this reason
Chifumi Nishioka (chni@informatik.uni-kiel.de)
28. www.moving-project.eu
28 of 21
Dataset: Scientific Publications
• Result of collaboration with EconBiz
• 279,381 papers in English
• EconBiz: a portal for scientific publications om
economics
• Procedure
• Seed list: 1 million URLs of open access papers
• Successfully download 413,098 papers in PDF
• Convert PDFs into texts using Apache PDFBox
• Detect languages of 413,098 papers
• Finally, get 279,381 papers in English
Chifumi Nishioka (chni@informatik.uni-kiel.de)
29. www.moving-project.eu
29 of 21
Dataset: Taxonomy
• STW (ver. 8.12) enriched by DBpedia redirects
• Maintained by ZBW
• Specialized for economics
• 6,335 semantic concepts and 11,679 labels in English
• Enrichment process
• Goal: get more synonymous labels
• Use the official mapping that connects STW concepts
with DBpedia concepts
• Get redirects from Dbpedia concepts
• e.g., “Telecommunications Operator” and “Telephone
companies”
• Finally, 6,335 semantic concepts and 37,733 labels
Chifumi Nishioka (chni@informatik.uni-kiel.de)
30. www.moving-project.eu
30 of 21
Latent Dirichlet Allocation (LDA)
Chifumi Nishioka (chni@informatik.uni-kiel.de)
source: D. M. Blei. Probabilistic topic models, CACM, 2012.
31. www.moving-project.eu
31 of 21
Latent Dirichlet Allocation (LDA)
• Implementation: JGibbLDA
• Preprocessing
• Lemmatization and stop words removal
• Remove words that appear in fewer than 25 scientific
publications along with [Blei and Lafferty 06]
• Parameters
• Hyper-parameters: 𝛼 = 0.5 and 𝛽 = 0.1
• suggested by [Griffiths and Steyvers 04]
• The number of topics: 𝐾 = 100
• Optimized by the log likelihood
• Experimented 𝐾 = 20, 50, 100, 200, 500, 1000, 5000
• The number of iterations: 500
Chifumi Nishioka (chni@informatik.uni-kiel.de)
32. www.moving-project.eu
32 of 21
Statistical Test for Strategies (1/2)
• Mauchly’s sphericity test
• Verify if the variances of the rankscores of the twelve
strategies are equal
• Reveal a violation of sphericity in the strategies
(𝜒2
65 = 435.90, 𝑝 = .00), which leads to positively
biased F-statistics and increases false positives
• One-way repeated-measure ANOVA with a
Greenhouse-Geisser correction of 𝜖 = .61
• Reveals a significant difference (𝐹 6.60, 805.33 =
21.98, 𝑝 = .00)
Chifumi Nishioka (chni@informatik.uni-kiel.de)
Investigate the difference of the recommendation
performance among the twelve strategies
33. www.moving-project.eu
33 of 21
Statistical Test for Strategies (2/2)
• Shaffer’s modified sequentially rejective
Bonferroni procedure (Shaffer’s MSRB
procedure)
Chifumi Nishioka (chni@informatik.uni-kiel.de)
34. www.moving-project.eu
34 of 21
Statistical Test for Three Factors
• Mendoza’s sphericity test
• Adopt to multi-way repeated-measure ANOVA
• Again, reveal a violation of sphericity
• Three-way repeated-measure ANOVA with a
Greenhouse-Geisser corrections
Chifumi Nishioka (chni@informatik.uni-kiel.de)
Investigate the difference of the recommendation
performance among the twelve strategies
Global 𝜒2
65 = 435.90, 𝑝 = .00
Profiling Method 𝜒2 2 = 12.21, 𝑝 = .00
Profiling Method × Decay Function 𝜒2
2 = 20.02, 𝑝 = .00
Profiling Method × Document Content (𝜒2 2 = 8.61, 𝑝 = .01
35. www.moving-project.eu
35 of 21
Result: Precision
• Values are similar to ones in rankscore
• The order of the strategies are identical
Chifumi Nishioka (chni@informatik.uni-kiel.de)
36. www.moving-project.eu
36 of 21
Result: nDCG
• Values are similar to ones in rankscore
• The order of the strategies are identical
Chifumi Nishioka (chni@informatik.uni-kiel.de)
37. www.moving-project.eu
37 of 21
Result: Mean Average Precision
• Values are higher than ones of rankscore
• The order of the strategies are almost same
Chifumi Nishioka (chni@informatik.uni-kiel.de)
38. www.moving-project.eu
38 of 21
Result: Mean Reciprocal Rank
• Values are higher than ones of rankscore
• The order of the strategies are almost same
Chifumi Nishioka (chni@informatik.uni-kiel.de)
40. www.moving-project.eu
40 of 21
Result: Influence of Three Factors
• Profiling Method
• This factor has the biggest impact
• HCF-IDF is the best profiling method, followed by CF-
IDF and LDA
• Document Content
• All (both title and full-text) significantly performs
better than Title except when using HCF-IDF
• Profiling Method × Document Content
• All (both title and full-text) is better choice for CF-IDF
• Document Content makes no difference for HCF-IDF
Chifumi Nishioka (chni@informatik.uni-kiel.de)
41. www.moving-project.eu
41 of 21
Result: Demographic Factors (1/3)
• Gender
• Female participants are more likely to evaluate
recommendations as interesting
• However, no difference about how each strategy
performs compared to the others, due to no
significant difference in the factor gender × strategy
• Age
• No influence on recommendation performance
Chifumi Nishioka (chni@informatik.uni-kiel.de)
Investigate whether each demographic factor has an
influence on recommendation performance
42. www.moving-project.eu
42 of 21
Result: Demographic Factors (2/3)
• Highest academic degree
• Participants with bachelor are more likely to evaluate
recommendations as interesting than those with
lecturer/professor
• But, no difference about how each strategy performs
compared to the others
• Major
• Manually classify participants into the two groups
• Participants majoring in economics (𝑛 = 92) and other
• No influence on recommendation performance
Chifumi Nishioka (chni@informatik.uni-kiel.de)
43. www.moving-project.eu
43 of 21
Result: Demographic Factors (3/3)
• Years of profession
• On average, working in 7.85 years (SD: 6.85)
• Divide participants into the three groups
• Participants working for ~5 years (𝑛 = 44), working for
5~10 years (𝑛 = 34), and working for 10~ years
• No influence on recommendation performance
• Employment type
• Participants working in academia (𝑛 = 83) and
industry
• No influence on recommendation performance
Chifumi Nishioka (chni@informatik.uni-kiel.de)
46. www.moving-project.eu
46 of 21
List of Taxonomies
• Maintained by W3
• https://www.w3.org/2001/sw/wiki/SKOS/Datasets
Chifumi Nishioka (chni@informatik.uni-kiel.de)
source: https://www.w3.org/2001/sw/wiki/SKOS/Datasets
Taxonomy Domain
Thesaurus for the Social Sciences Social science
NASA Taxonomy Technology areas
Linked Life data Biomedicine
Medical Subject Headings (MeSH) Biomedicine
Australian education vocablaries Education
UNESCO Thesaurus Education, culture, natural sciences, and
social and human sciences
ACM Computing Classification System Computer science
47. www.moving-project.eu
47 of 21
Insights from the Results
• Profiling method has the largest impact
• Best profiling method: HCF-IDF
• Only HCF-IDF enables to make reasonable
recommendations based on only titles
• CF-IDF requires full-texts
• LDA perform poorly even with full-texts
• Possible reason: impossible to infer topic distribution from
social media items, which are short and sparse
• Usually, full-texts are not available for legal issues
• Easy to employ HCF-IDF in many different fields
• e.g., MeSH for medicine, ACM CCS for compute
science
Chifumi Nishioka (chni@informatik.uni-kiel.de)