Evaluating the CC-IDF citation-weighting scheme: How effectively can ‘Inverse Document Frequency’ (IDF) be applied to references?

Evaluating the CC-IDF citation-weighting scheme: How
effectively can ‘Inverse Document Frequency’ (IDF) be
applied to references?
Joeran Beel, Corinna Breitinger, Stefan Langer
iConference 2017 -- 2017/03/24, presented by Maria Gäde

Asst. Prof. Dr. Joeran Beel | Intelligent Systems | Data & Knowledge Engineering Group | beelj@tcd.ie 2
Outline
1. CC-IDF
2. Evaluation
1. Methodology
2. Results
3. Discussion

1. CC-IDF: Common Citation – Inverse
Document Frequency

Introduction
• Introduced in 1998 in the digital library and citation-indexing
system CiteSeer (Bollacker, Lawrence, & Giles, 1998)
• CC-IDF = TF-IDF applied to citations/references
• Assumption: “if a very uncommon citation is shared by two
documents, this should be weighted more highly than a citation
made by a large number of documents” (Giles et al., 1998)
• Similar concept as Bibliographic Coupling (but different weighting
than Coupling Strength)
• In the original CC-IDF, the CC component is binary.
• We focus on IDF

Illustration
CC-IDF != Bibliographic Coupling Strength
Cited
Document dcited
2
Bibliogr. Coupled
Document dBC
3
Bibliogr. Coupled
Document dBC
4
Which bibliographic-coupled document
is more closely related to di?
Cited
Document dcited
1
Input
Document di
Bibliogr. Coupled
Document dBC
1
Bibliogr. Coupled
Document dBC
2
CC-IDF = 1/2CC-IDF = 1/3 CC-IDF = 1/3 CC-IDF = 1/3 + 1/2 = 5/6

Use of CC-IDF
• CC-IDF has been used in several recommender systems, and served
as a baseline in many evaluations.
• Ambiguous reports regarding the effectiveness of CC-IDF
• Sometimes, CC-IDF was found to perform better and other times
worse than simple bibliographic coupling and co-citation strength
(Küçüktunç, Saule, Kaya, & Çatalyürek, 2012; Küçüktunç et al.,
2013; Liang et al., 2011; Pan et al., 2015; Zhang et al., 2012)
• Compared to more advanced approaches such as HITS, PaperRank,
and Katz, CC-IDF performs usually poorly (Küçüktunç et al., 2012;
Pan et al., 2015)

Problem
• CC-IDF has never been compared to CC-Only, i.e. a simple citation
weighting scheme based only on the CC component and ignoring
IDF.
• The basic assumption underlying CC-IDF – namely that “if a very
uncommon citation is shared by two documents, this should be
weighted more highly than a citation made by a large number of
documents” – has never been evaluated for its effectiveness.
--> Goal: Compare CC-IDF vs. CC-Only

2. Evaluation

Methodology
• A/B Test in Docear’s research-paper
recommender system.
• Docear is a reference manager that
allows users to manage references and
PDF files, similar to Mendeley and
Zotero.
• One key difference is that Docear’s
users manage their data in mind-maps.
Users’ mind-maps contain links to
PDFs, as well as the user’s annotations
made within those PDFs.
• To calculate CC-IDF, each mind map of a
user was considered as one document.

CC-IDF in Docear
Recommendation Candidate
Corpus
Contains
Contains
Cites
Cites
Document d2
Document d1
Candidate c1
Cites
Candidate c4
Which candidate is more
related? Candidate c1 or
candidates c2/c3/c4 ?
Candidate c2
Cites
Candidate c3
Document collection of
User u

A/B Test Design
• Random Selection of
• CC-IDF
• CC-Only
• Evaluation with click-through rates (CTR).
• 238,681 recommendations to 3,561 users
• January – September 2014.
• All results are statistically significant (p<0.05), if not stated
otherwise.

Results
• CC-Only: 6.23%
• CC-IDF: 6.15%
• Difference is statistically not significant
1 2 3 4 [5-9] [10-14] [15-24] [25-74] [75-149] >=150
CC-Only (Dspld Recs) 180 451 667 713 1,964 896 2,098 7,329 5,848 4,675
CC-IDF (Dspld Recs) 252 496 600 770 1,660 1,075 2,484 10,114 5,555 4,979
CC-Only (CTR) 3.13% 1.93% 4.74% 5.00% 7.21% 8.08% 6.50% 6.00% 6.83% 5.87%
CC-IDF (CTR) 2.58% 3.86% 5.51% 5.62% 6.28% 8.03% 6.35% 6.64% 6.38% 4.92%
-
2,000
4,000
6,000
8,000
10,000
12,000
0%
1%
2%
3%
4%
5%
6%
7%
8%
9%
Numberofdisplayed
recommendations
CTR
Number of utilized references

Potential Reasons
1. Citations always have some significance
2. Document Frequency (DF) has too little variation based on
citations
3. New research articles have lower Document Frequency
4. No normalization for length of reference list

Outlook
• More research with:
• Traditional related-document scenario
• Larger document corpus
• Users with more documents in their collections
• Higher quality of citation data
• Variation of CC component

Thank You
Questions: Joeran Beel, beel@tcd.ie

Evaluating the CC-IDF citation-weighting scheme: How effectively can ‘Inverse Document Frequency’ (IDF) be applied to references?

Recommended

Recommended

More Related Content

What's hot

What's hot (16)

Similar to Evaluating the CC-IDF citation-weighting scheme: How effectively can ‘Inverse Document Frequency’ (IDF) be applied to references?

Similar to Evaluating the CC-IDF citation-weighting scheme: How effectively can ‘Inverse Document Frequency’ (IDF) be applied to references? (20)

Recently uploaded

Recently uploaded (20)

Evaluating the CC-IDF citation-weighting scheme: How effectively can ‘Inverse Document Frequency’ (IDF) be applied to references?

Editor's Notes