The document summarizes a research paper presented at WikiSym 2012 on calculating the quality of Wikipedia articles. The researchers propose a method to:
1) Identify the editors of each article.
2) Analyze the edit history of each editor to calculate their quality value (QV).
3) Use the editors' QVs to calculate the QV of text within the articles.
4) Iteratively calculate editors' and texts' QVs until they converge to obtain the final article QV.
The method improves precision in identifying high-quality articles compared to ignoring editors' QVs, and it addresses the "chicken-and-egg" problem of text QVs and editor QVs depending on each other.
A bird’s-eye view of academic library ebooks, outlining how different considerations can affect the decisions that libraries make regarding this format. Presented by Sofia Slutskaya and Tessa Minchew at the 24th annual COMO conference (GaCOMO12) in Macon, GA, 2012.
Measuring Impact: Towards a data citation metric (Edward Baker)
How the ViBRANT and eMonocot projects are building tools, including a modified implementation of Bourne and Fink's 'Scholar Factor', the Biodiversity Data Journal, and Scratchpad's user metrics and statistics modules.
Today’s digital information is being linked, cross-linked, and indexed in a multitude of ways across the global Web to give articles greater visibility and reach. The recent proliferation of APIs (application programming interfaces) and social networks has further facilitated and expanded the possibilities for articles. This presentation will focus on the main products and services that extend the reach and impact of articles on the Web.
Presentation by panelist Matthew Cockerill, BioMed Central, for the OASPA-hosted webinar "A Q&A with five publishers working with Open Access" on 20 October 2009. www.oaspa
Sense About Science held a workshop on peer review in collaboration with the Research Information Network, Vitae, Elsevier and the Voice of Young Science.
This afternoon event was held at the University of Sussex, Brighton on 5 March 2010 and was free and for early career researchers in all sciences, engineering and medicine (PhD students, post-docs or equivalent in first job).
The workshop discussed the process of peer review in journal publishing and explored the criticisms of the peer review process. What does peer review do for science? Does it detect fraud and misconduct? Will it illuminate good ideas or shut them down?
The RIN’s Liaison and Partnerships Officer, Branwen Hide, spoke at the event on ‘The changing scholarly communications landscape: What does this mean for peer review?’
For more information on the programme, visit http://www.rin.ac.uk/news/events/research-publishing-it-reviewing-it-and-talking-about-it-publicly
A look at how the GeoWeb Community is embracing the world of Web 2.0 and social media. Ideas on how to grow and develop the community in the future. Presented at the 2009 ESRI Developer Summit in Palm Springs, CA
EDN Team
Publishing Scientific Research & How to Write High-Impact Research Papers (jjuhlrich)
Presentation on publishing scientific research and how to write high-impact scientific papers, by Dr. John Uhlrich, Editor-in-Chief of Energy Technology (Wiley-VCH), given on 25 October 2018 at the National Energy Technology Laboratory, USA.
Event:
Digital Curation Institute Symposium
November 22, 2011
4:30-6:30pm
iSchool, University of Toronto
Abstract:
This presentation reports select findings from two descriptive studies of blogs and bloggers in the areas of history, economics, law, biology, chemistry and physics. The first study focused on scholar bloggers' preferences for digital preservation, as well as their publishing behaviors and blog characteristics that influence preservation action. Findings are drawn from 153 questionnaires, 24 interviews, and content analysis of 93 blogs. Briefly, questionnaire respondents are generally interested in blog preservation with a strong sense of personal responsibility. Most feel their blogs should be preserved for both personal and public access and use into the indefinite, rather than short-term, future. Over half of questionnaire respondents report saving their blog content, in whole or in part, and many interviewees expressed a sophisticated understanding of issues of digital preservation. However, the findings also indicate that bloggers exhibit behaviors and preferences complicating preservation action, including issues related to rights and use, co-producer dependencies, and content integrity.
The second study, currently ongoing, looks toward the public availability of scholar blogs over time, with findings drawn from a sample of 644 blogs. Content analysis is currently underway on inactive blogs, characterized as available, but with no new posts published within three months of coding. Initial analysis of the most recent post published to these inactive blogs shows that some bloggers did provide indicators of their respective blog's declining activity or, in some cases, blog stoppage. However, such indicators are only present in a clear minority of publicly available, yet inactive blogs. These preliminary findings offer implications for both personal and programmatic preservation approaches, including, notably, issues related to selection and appraisal.
This presentation shows how to find where a publication is indexed and how to manage your research profile across international research platforms.
A lecture on evaluating AR interfaces, from the graduate course on Augmented Reality, taught by Mark Billinghurst from the HIT Lab NZ at the University of Canterbury.
Publishing Scientific Research & How to Write High-Impact Research Papers (jjuhlrich)
Presentation on publishing scientific research and how to write high-impact scientific papers, by Dr. John Uhlrich, Editor-in-Chief of Energy Technology (Wiley-VCH), given on 25 October 2018 at the University of Pittsburgh, USA.
UiPath Test Automation using UiPath Test Suite series, part 3 (DianaGray10)
Welcome to the UiPath Test Automation using UiPath Test Suite series, part 3. In this session, we will cover desktop automation along with UI automation.
Topics covered:
UI automation introduction
UI automation sample
Desktop automation flow
Pradeep Chinnala, Senior Consultant Automation Developer @WonderBotz and UiPath MVP
Deepak Rai, Automation Practice Lead, Boundaryless Group and UiPath MVP
Securing your Kubernetes cluster: a step-by-step guide to success! (KatiaHIMEUR1)
Today, after several years of existence, an extremely active community and an ultra-dynamic ecosystem, Kubernetes has established itself as the de facto standard in container orchestration. Thanks to a wide range of managed services, it has never been so easy to set up a ready-to-use Kubernetes cluster.
However, this ease of use means that the subject of security in Kubernetes is often left for later, or even neglected. This exposes companies to significant risks.
In this talk, I'll show you step-by-step how to secure your Kubernetes cluster for greater peace of mind and reliability.
Epistemic Interaction - tuning interfaces to provide information for AI supportAlan Dix
Paper presented at SYNERGY workshop at AVI 2024, Genoa, Italy. 3rd June 2024
https://alandix.com/academic/papers/synergy2024-epistemic/
As machine learning integrates deeper into human-computer interactions, the concept of epistemic interaction emerges, aiming to refine these interactions to enhance system adaptability. This approach encourages minor, intentional adjustments in user behaviour to enrich the data available for system learning. This paper introduces epistemic interaction within the context of human-system communication, illustrating how deliberate interaction design can improve system understanding and adaptation. Through concrete examples, we demonstrate the potential of epistemic interaction to significantly advance human-computer interaction by leveraging intuitive human communication strategies to inform system design and functionality, offering a novel pathway for enriching user-system engagements.
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova... (Ramesh Iyer)
In today's fast-changing business world, companies that fail to adapt and embrace new ideas often struggle to keep up with the competition. Fostering a culture of innovation, however, takes real work: it takes vision, leadership, and a willingness to take risks in the right proportion. Sachin Dev Duggal, co-founder of Builder.ai, has perfected the art of this balance, creating a company culture where creativity and growth are nurtured at every stage.
GraphRAG is All You Need? LLM & Knowledge Graph (Guy Korland)
Guy Korland, CEO and Co-founder of FalkorDB, will review two articles on the integration of language models with knowledge graphs.
1. Unifying Large Language Models and Knowledge Graphs: A Roadmap.
https://arxiv.org/abs/2306.08302
2. Microsoft Research's GraphRAG paper and a review paper on various uses of knowledge graphs:
https://www.microsoft.com/en-us/research/blog/graphrag-unlocking-llm-discovery-on-narrative-private-data/
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do... (UiPathCommunity)
💥 Speed, accuracy, and scaling – discover the superpowers of GenAI in action with UiPath Document Understanding and Communications Mining™:
See how to accelerate model training and optimize model performance with active learning
Learn about the latest enhancements to out-of-the-box document processing – with little to no training required
Get an exclusive demo of the new family of UiPath LLMs – GenAI models specialized for processing different types of documents and messages
This is a hands-on session specifically designed for automation developers and AI enthusiasts seeking to enhance their knowledge in leveraging the latest intelligent document processing capabilities offered by UiPath.
Speakers:
👨🏫 Andras Palfi, Senior Product Manager, UiPath
👩🏫 Lenka Dulovicova, Product Program Manager, UiPath
State of ICS and IoT Cyber Threat Landscape Report 2024 preview (Prayukth K V)
The IoT and OT threat landscape report has been prepared by the Threat Research Team at Sectrio using data from Sectrio's cyber threat intelligence farming facilities spread across over 85 cities around the world. In addition, Sectrio runs AI-based advanced threat and payload engagement facilities that serve as sinks to attract and engage sophisticated threat actors and newer malware, including new variants and latent threats that are at an earlier stage of development.
The latest edition of the OT/ICS and IoT security Threat Landscape Report 2024 also covers:
State of global ICS asset and network exposure
Sectoral targets and attacks as well as the cost of ransom
Global APT activity, AI usage, actor and tactic profiles, and implications
Rise in volumes of AI-powered cyberattacks
Major cyber events in 2024
Malware and malicious payload trends
Cyberattack types and targets
Vulnerability exploit attempts on CVEs
Attacks on counties – USA
Expansion of bot farms – how, where, and why
In-depth analysis of the cyber threat landscape across North America, South America, Europe, APAC, and the Middle East
Why are attacks on smart factories rising?
Cyber risk predictions
Axis of attacks – Europe
Systemic attacks in the Middle East
Download the full report from here:
https://sectrio.com/resources/ot-threat-landscape-reports/sectrio-releases-ot-ics-and-iot-security-threat-landscape-report-2024/
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe (Paige Cruz)
Monitoring and observability aren't traditionally found in software curriculums, and many of us cobble this knowledge together from whatever vendor or ecosystem we were first introduced to and whatever is part of our current company's observability stack.
While the dev and ops silo continues to crumble, many organizations still relegate monitoring and observability to the purview of ops, infra, and SRE teams. This is a mistake: achieving a highly observable system requires collaboration up and down the stack.
I, a former op, would like to extend an invitation to all application developers to join the observability party, and will share these foundational concepts to build on.
Essentials of Automations: Optimizing FME Workflows with Parameters (Safe Software)
Are you looking to streamline your workflows and boost your projects’ efficiency? Do you find yourself searching for ways to add flexibility and control over your FME workflows? If so, you’re in the right place.
Join us for an insightful dive into the world of FME parameters, a critical element in optimizing workflow efficiency. This webinar marks the beginning of our three-part “Essentials of Automation” series. This first webinar is designed to equip you with the knowledge and skills to utilize parameters effectively: enhancing the flexibility, maintainability, and user control of your FME projects.
Here’s what you’ll gain:
- Essentials of FME Parameters: Understand the pivotal role of parameters, including Reader/Writer, Transformer, User, and FME Flow categories. Discover how they are the key to unlocking automation and optimization within your workflows.
- Practical Applications in FME Form: Delve into key user parameter types including choice, connections, and file URLs. Allow users to control how a workflow runs, making your workflows more reusable. Learn to import values and deliver the best user experience for your workflows while enhancing accuracy.
- Optimization Strategies in FME Flow: Explore the creation and strategic deployment of parameters in FME Flow, including the use of deployment and geometry parameters, to maximize workflow efficiency.
- Pro Tips for Success: Gain insights on parameterizing connections and leveraging new features like Conditional Visibility for clarity and simplicity.
We’ll wrap up with a glimpse into future webinars, followed by a Q&A session to address your specific questions surrounding this topic.
Don’t miss this opportunity to elevate your FME expertise and drive your projects to new heights of efficiency.
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024 (Albert Hoitingh)
In this session I delve into the encryption technology used in Microsoft 365 and Microsoft Purview, including the concepts of Customer Key and Double Key Encryption.
Generative AI Deep Dive: Advancing from Proof of Concept to Production (Aggregage)
Join Maher Hanafi, VP of Engineering at Betterworks, in this new session where he'll share a practical framework to transform Gen AI prototypes into impactful products! He'll delve into the complexities of data collection and management, model selection and optimization, and ensuring security, scalability, and responsible use.
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -... (DanBrown980551)
Do you want to learn how to model and simulate an electrical network from scratch in under an hour?
Then welcome to this PowSyBl workshop, hosted by Rte, the French Transmission System Operator (TSO)!
During the webinar, you will discover the PowSyBl ecosystem as well as handle and study an electrical network through an interactive Python notebook.
PowSyBl is an open source project hosted by LF Energy, which offers a comprehensive set of features for electrical grid modelling and simulation. Among other advanced features, PowSyBl provides:
- A fully editable and extendable library for grid component modelling;
- Visualization tools to display your network;
- Grid simulation tools, such as power flows, security analyses (with or without remedial actions) and sensitivity analyses;
The framework is mostly written in Java, with a Python binding so that Python developers can access PowSyBl functionalities as well.
What you will learn during the webinar:
- For beginners: discover PowSyBl's functionalities through a quick general presentation and the notebook, without needing any expert coding skills;
- For advanced developers: master the skills to efficiently apply PowSyBl functionalities to your real-world scenarios.
DevOps and Testing slides at DASA Connect (Kari Kakkonen)
Slides by me and Rik Marselis from the DASA Connect conference on 30 May 2024. We discuss what testing is, then what agile testing is, and finally what testing in DevOps means. We closed with a lovely workshop in which participants explored different ways to think about quality and testing in different parts of the DevOps infinity loop.
Key Trends Shaping the Future of Infrastructure (Cheryl Hung)
Keynote at DIGIT West Expo, Glasgow on 29 May 2024.
Cheryl Hung, ochery.com
Sr Director, Infrastructure Ecosystem, Arm.
A look at the key trends across hardware, cloud and open source: how these areas are likely to mature and develop over the short and long term, and how organisations can position themselves to adapt and thrive.
Welcome to ViralQR, your best QR code generator (ViralQR)
Welcome to ViralQR, your best QR code generator available on the market!
At ViralQR, we design static and dynamic QR codes. Our mission is to make business operations easier and customer engagement more powerful through the use of QR technology. Be it a small-scale business or a huge enterprise, our easy-to-use platform provides multiple choices that can be tailored according to your company's branding and marketing strategies.
Our Vision
We are here to make the process of creating QR codes easy and smooth, enhancing customer interaction and making business more fluid. We firmly believe in the ability of QR codes to change how businesses interact with their customers, and we are set on making that technology accessible and usable far and wide.
Our Achievements
Ever since its inception, ViralQR has successfully served many clients, providing QR codes for marketing, service delivery, and feedback collection across various industries. Our platform has been recognized for its ease of use and impressive features, which help businesses create QR codes.
Our Services
At ViralQR, we offer a comprehensive suite of services that caters to your needs:
Static QR Codes: Create free static QR codes. These QR codes are able to store significant information such as URLs, vCards, plain text, emails and SMS, Wi-Fi credentials, and Bitcoin addresses.
Dynamic QR codes: These also have all the advanced features but are subscription-based. They can directly link to PDF files, images, micro-landing pages, social accounts, review forms, business pages, and applications. In addition, they can be branded with CTAs, frames, patterns, colors, and logos to enhance your branding.
Pricing and Packages
Additionally, ViralQR offers a 14-day free trial, an excellent opportunity for new users to get a feel for the platform. From there, you can easily subscribe and experience the full power of dynamic QR codes. The subscription plans are priced flexibly so that virtually every business can afford to benefit from our service.
Why choose us?
ViralQR provides services for marketing, advertising, catering, retail, and similar industries. QR codes can be placed on fliers, packaging, merchandise, and banners, and can even substitute for cash and cards in a restaurant or coffee shop. By integrating QR codes into your business, you can improve customer engagement and streamline operations.
Comprehensive Analytics
Subscribers of ViralQR receive detailed analytics and tracking tools that give a clear view of QR code performance. Our analytics dashboard shows aggregate views and unique views, as well as detailed information about each impression, including time, device, browser, and estimated location by city and country.
Thank you for choosing ViralQR; we offer nothing but the best in QR code services to meet the needs of diverse businesses!
1. WikiSym 2012
Mutual Evaluation of Editors and Texts for Assessing Quality of Wikipedia Articles
Yu Suzuki, Nagoya University, Japan
Masatoshi Yoshikawa, Kyoto University, Japan
2. Have you ever used Wikipedia?
[Bar chart: percentage usage of Wikipedia (red) and blogs (blue) by age group, from under 18 to 65-74 years old. Source: Oxford University SPIRE Project, results and analysis of a Web 2.0 services survey, http://spire.conted.ox.ac.uk/]
3. Have you ever used Wikipedia?
Users under 18 and over 65 years old (novice users) use Wikipedia frequently.
[Same bar chart as slide 2.]
4. What is the main purpose?
56% of users use Wikipedia for work and study. But really?
5. What is the main purpose?
[Pie chart: For Study 36%, For Fun 28%, For Work 20%, Never used 8%, Never heard 8%.]
56% of users use Wikipedia for work and study. Wikipedia is trusted by many users. But really?
6. Are Wikipedia articles high quality?
80% of all articles are low quality.
[Histogram: number of articles vs. quality degree from low to high, calculated using our proposed method.]
7. Objectives
• Calculate quality values for articles automatically and accurately.
• For readers: readers can tell which articles are high quality.
• For editors: editors can decide which articles need to be edited.
• For administrators: administrators can decide which articles are not appropriate for Wikipedia, to maintain the quality of articles.
8. Output of our proposed system
[Screenshot: an article with quality value 40%; high quality parts and low quality parts are highlighted.]
9. What is quality?
From a dictionary:
【Quality】the degree of excellence of something
【Credibility】the quality of being trusted and believed in
From psychology (Fogg 2003):
Trustworthiness: how many users believe something
Expertise: an expert’s opinion
We use “trustworthiness” as the definition of quality. Quality is not true or false, but how many users believe.
10. Related Work
Link-analysis-based methods [Bellomi 2005, Chin 2011]: identify high quality articles using HITS or PageRank. These methods can easily identify major articles, but cannot identify minor but high quality articles.
Editor-reputation-based methods [Adler 2007, Wilkinson 2007] (the approach we use): identify which articles are high quality using the reputation editors earn from other editors.
Good point: these methods can calculate accurate quality, because editors and viewers do not directly decide text quality.
Bad point: vandals (bad editors) can easily change text quality.
11. Plan for Calculating Quality (slides 11-14, a progressive build)
Who evaluates?
・reader (voting)
・reader themselves (personalization)
・editor (reputation-based) ← our choice
What quality do we measure?
・whole article
・a part of an article
・editor ← our choice
How do we evaluate?
・readers’ voting
・article analysis
・article edit history ← our choice
15. Plan to Measure Quality
• Why do we use a reputation-based approach? Users’ votes are not always truthful: on YouTube, almost all votes are 5 stars (the highest score).
• Why do we calculate editors’ quality? We assume that the same editor writes articles of the same quality.
• Why do we use the edit history? Our proposed system should be language independent.
16. Overview
1. Identify the editors of articles.
2. Get the edit history of each editor.
3. Calculate each text’s quality value (QV).
4. Calculate each editor’s QV.
5. Calculate the article’s QV.
[Diagram: editors A and B identified from the edit history; QV of A = 70%, QV of B = 40%; combined article quality degree 55%.]
17. Key Idea
High quality texts survive beyond multiple edits.
・If a text remains, the QV of the text goes up.
・If a text is deleted, the QV of the text goes down.
[Diagram: Editor A adds text, Editor B adds text, Editor C deletes text.]
18. Calculating a text’s quality value
• A writes 100 letters. A’s text gains no QV from this, because A cannot evaluate herself.
• B deletes 20 letters and keeps A’s remaining 80 letters: B thereby evaluates A’s 80 letters as good.
• C deletes 60 more letters and keeps A’s remaining 20 letters: C thereby evaluates A’s 20 letters as good.
• A’s text QV = log 80 + log 20.
[Chart: number of A’s surviving letters over versions 1-4: 100 → 80 → 20.]
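To make the slide's arithmetic concrete, here is a minimal Python sketch of this survival-based credit. The slide only says "log 80 + log 20", so the log base and any normalization are assumptions, not the authors' implementation.

```python
import math

def text_qv(surviving_letters):
    # Credit a text earns from later editors: each later version in which
    # the text survives contributes log(number of surviving letters).
    # (Assumption: natural log; the slide does not state the base.)
    return sum(math.log(n) for n in surviving_letters if n > 0)

# Slide 18's example: B keeps 80 of A's letters, then C keeps 20.
print(text_qv([80, 20]))  # log 80 + log 20, about 7.38
```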
19. Problem
• The editor’s quality is not considered. When C deletes A’s text, A’s QV decreases. But:
• If C has low quality, C may delete high quality texts, so A’s QV should NOT be decreased.
• If C has high quality, C should delete only low quality texts, so A’s QV should be decreased.
[Diagram: A adds text, B adds text, C deletes text.]
20. Use the editor’s QV for the text’s QV
• If B’s QV is 100%, B should delete only low quality texts: when B deletes 20 of A’s letters, all of them count as deleted.
• If C’s QV is 50%, C may delete high quality texts: when C deletes 60 of A’s letters, only 30 letters (60 letters × 50%) count as deleted.
[Chart: A’s surviving letters per version, without the editor’s QV (100 → 80 → 20) and with the editor’s QV (100 → 80 → 50).]
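A minimal sketch of this deletion discounting, with the numbers taken from the slide's example; the helper name `effective_deleted` is ours, not the paper's.

```python
def effective_deleted(deleted_letters, deleter_qv):
    # Discount a deletion by the deleting editor's QV (slide 20's idea):
    # a 50%-quality editor's deletions only half count against the text.
    return deleted_letters * deleter_qv

remaining = 100                          # A writes 100 letters
remaining -= effective_deleted(20, 1.0)  # B (QV 100%): all 20 count
remaining -= effective_deleted(60, 0.5)  # C (QV 50%): only 30 count
print(remaining)                         # 50, the "with editor's QV" curve
```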
21. Chicken-or-the-egg problem
• A text’s QV is calculated from both the edit history and the editors’ QVs.
• An editor’s QV is calculated from text QVs.
• Editor QV ⇆ text QV is a chicken-or-the-egg problem.
→ Mutually calculate editor and text QVs until they converge.
[Diagram: the running example, with QV of A = 70%, QV of B = 40%, article QV = 60%.]
22. Our proposed method
1. Identify the editors of articles.
2. Get the edit history of each editor.
3. Calculate text QVs using editor QVs. (On the first pass, every editor’s QV is set to 1, the highest value.)
4. Calculate editor QVs.
5. If the text QVs and editor QVs have not converged, return to step 3.
6. Calculate article QVs.
[Diagram: the running example, with QV of A = 70%, QV of B = 40%, article QV = 60%.]
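The fixed-point loop in steps 3-5 can be sketched as below. The slide does not give the exact update formulas, so the rules here are illustrative assumptions: a text's credit is its kept letters, its penalty is deletions discounted by the deleter's QV, and an editor's QV is the kept fraction aggregated over everything they wrote.

```python
# Each judgment event: (author, judging editor, letters kept, letters deleted).
events = [
    ("A", "B", 80, 20),   # B keeps 80 of A's letters and deletes 20
    ("A", "C", 20, 60),   # C keeps 20 of A's letters and deletes 60
]
editors = {"A", "B", "C"}

qv = {e: 1.0 for e in editors}          # step 3: every editor starts at 1
for _ in range(20):                     # slide 24: QVs converge in ~20 rounds
    kept = {e: 0.0 for e in editors}
    lost = {e: 0.0 for e in editors}
    for author, judge, k, d in events:
        kept[author] += k               # survivals count in full
        lost[author] += d * qv[judge]   # deletions discounted by deleter's QV
    new_qv = {e: kept[e] / (kept[e] + lost[e]) if kept[e] + lost[e] > 0
              else qv[e] for e in editors}
    converged = max(abs(new_qv[e] - qv[e]) for e in editors) < 1e-9
    qv = new_qv
    if converged:                       # step 5: stop once the values settle
        break

print(qv)  # A's QV reflects how much of A's writing survived later edits
```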
23. Experimental Setup
• Data set
• Japanese Wikipedia edit history data (as of Nov. 2, 2010)
• 1,889,129 articles, 2,178,003 editors (including bots and anonymous IP users)
• High quality articles (correct dataset)
• “Featured articles” and “Good articles” selected by Wikipedians
• Evaluation measure
• 11-point interpolated recall-precision graph
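The 11-point interpolated recall-precision graph is a standard information-retrieval measure: at each recall level r in {0.0, 0.1, ..., 1.0}, plot the maximum precision achieved at any recall of at least r. A textbook Python sketch, not the authors' evaluation code:

```python
def eleven_point_precision(ranked_is_relevant, n_relevant):
    # ranked_is_relevant: booleans down the system's ranking, True where
    # the article is in the correct set (featured/good articles).
    points, hits = [], 0
    for rank, rel in enumerate(ranked_is_relevant, start=1):
        if rel:
            hits += 1
            points.append((hits / n_relevant, hits / rank))  # (recall, prec.)
    return [max((p for r, p in points if r >= level), default=0.0)
            for level in (i / 10 for i in range(11))]

# Toy ranking of 10 articles, 5 of which are in the correct set.
print(eleven_point_precision(
    [True, False, True, True, False, False, True, False, False, True], 5))
```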
24. Experimental result
[Graph: 11-point interpolated recall-precision curves, with and without the editor’s QV.]
• Precision improves by about 10%.
• At recall 0 to 0.5, precision improves by about 20%, whereas precision does not improve at recall 0.6 to 1.
• When an article about current events is high quality, our system can judge it as high quality, but such articles are not featured articles.
• When one editor writes excellent texts and the other editors do not edit, the article can be “featured” yet not be judged as high quality.
• Text and editor QVs converge when we calculate the QVs about 20 times each.
25. Conclusion
• We calculate texts’ quality values using editors’ QVs.
• The relation between texts’ quality values and editors’ quality values is a chicken-or-the-egg problem.
• We mutually calculate text quality values and editor quality values until they converge.
• Average precision improves by about 10%; at low recall, precision improves by about 20%.
• Future work
• Confidence of quality values: when A edits 100 articles many times, B edits only ONE article once, and A and B have the same QV, the system judges their quality as the same. But it should differ, because the confidence differs.
• Other effective assumptions: when a high quality editor confirms a text, the text should be judged high quality even if it was written by a low quality editor.
26. Open problems
• Using content analysis
• Estimate terms that appear frequently in high quality articles but do not appear in low quality articles.
• Using articles in multiple languages
• If an article in Japanese is similar to its English counterpart, is the article high quality?
• For Web documents, SNS, ...
• How do we calculate quality degrees without an edit history?
I am Yu Suzuki, from the Information Technology Center at Nagoya University. The title of today's presentation is quality assessment of Wikipedia articles using edit history. The purpose of this presentation is to show how to calculate quality values for Wikipedia articles.
This chart shows users' ages against the percentage usage of different services. The survey was conducted by the SPIRE Project at Oxford University. The red bars show Wikipedia, and the blue bars show blogs. From this graph, users under 18 and over 65 years old use Wikipedia more frequently than other Web services. These users may not have enough knowledge, so if there is a wrong story in Wikipedia, they will believe it. This is a problem.
Here is another graph, about the purpose of using Wikipedia. From this graph, more than 56 percent of users use Wikipedia for work and study. This shows that Wikipedia is trusted by many users; at least 56 percent of users trust it. However, do you think Wikipedia is reliable?
This graph shows the relationship between quality degrees and the number of articles. The quality is calculated by our proposed system, which I will describe later. From this graph, when our system calculates quality values, about 80% of all articles are not credible. This means that almost all users trust Wikipedia, whereas almost all articles are not credible. So I think quality values are important to keep users from believing wrong articles.
The objective of this study is to calculate quality degrees automatically, quickly, and accurately. These quality degrees are useful for readers, editors, and administrators. Readers can tell which articles are credible. Editors can decide which articles need to be edited. And administrators can decide which articles are not appropriate for Wikipedia, to maintain the quality of articles.
This is the output of our proposed system. In our system, the original Wikipedia article is overlaid with two kinds of colored lines: blue lines show credible parts, and red lines show non-credible parts. The upper-left corner shows the overall quality degree, and the blue and red bars show the ratio of credible and non-credible parts.
First, we should define what quality is. This is a very difficult question. From a dictionary, quality is defined as the degree of excellence of something, and credibility as the quality of being trusted and believed in. But these definitions are ambiguous, so I took a definition from psychology. Fogg defined quality with two components: trustworthiness and expertise. Trustworthiness is how many users believe something, and expertise is an expert's opinion. In our study, we use trustworthiness as the definition of quality. Therefore, quality is not true or false, but how many users believe.
Next, I introduce several related works. There are two approaches: link-analysis-based methods and editor-reputation-based methods. Link-analysis methods identify high quality articles using techniques such as HITS or PageRank. They can easily identify major articles, but cannot identify minor but high quality articles. The other approach uses editor reputation: editors are rated by the other editors. Our method is based on this approach. The good point of these methods is that they can calculate accurate quality, because editors and viewers of articles do not directly decide text quality; the decision is implicit. The bad point is that vandals, that is, bad editors, can easily change text quality.
To calculate quality values, I should define a quality measurement method. To do so, I should consider three questions: who evaluates articles, what quality we measure, and how we evaluate articles. A reader's decision can be used, such as voting or personalization, but in our system I chose the editors' reputation, because I think this method is fair. Next, I measure the editors' quality instead of measuring whole articles or parts of articles, because I think the same user writes articles of the same quality. And I evaluate using the edit history, because this method is simple and effective.
Let me revisit the plan to measure quality. I used a reputation-based approach because users' votes are not always truthful; on YouTube, almost all votes are the highest score. I used the editors' quality because we assume that the same editor writes articles of the same quality. I used the edit history because this method is simple and our proposed system should be language independent; if I used linguistic analysis, the system would be language dependent.
This is an overview of our proposed system. First, I analyze an article and identify its editors. In this example, I identified editors A and B from the edit history. Next, I get the editors' edit history for the other articles. Then I analyze this edit history and calculate the texts' and editors' quality values. Here, the quality value of A is 70% and that of B is 40%. Finally, combining these two editors' quality values, I calculate the article's quality value. In this case, the article's quality degree is 55%.
The key idea is the survival ratio of texts. If a part of an article is high quality, the part is not deleted by the other editors. If a part is low quality, the part is soon deleted or replaced. Consider the situation where editor A writes a part, editor B adds a part, and editor C deletes editor A's part and replaces it. In this case, editor B keeps editor A's part, so editor B decides that A's part is high quality. Editor C keeps editor B's part, so editor C decides that B's part is high quality. However, editor C deletes editor A's part, so editor C decides that A's part is low quality.
I explain how to calculate the quality value of texts. First, A writes 100 letters to an article, then B deletes 20 letters from A's text, and then C deletes 60 letters from A's text. In this case, at version 1, A gains no quality value because A cannot evaluate herself. At version 2, B keeps 80 of A's letters, so B evaluates those 80 letters as good, and A gains 80 positive evaluations from B. At version 3, C keeps 20 of A's letters, so C evaluates those 20 letters as good, and A gains 20 positive evaluations from C. As a result, from this edit history, editor A gains log 80 plus log 20 quality value from editors B and C.
However, the problem is that this approach does not consider the editors' quality. In this case, C deletes A's text, and our system decreases A's quality value. However, if C has low quality, C may delete high quality texts; in that case A's quality value should not be decreased. But if C has high quality, C should delete only low quality texts; then A's quality value should be decreased. Therefore, the editors' quality is important for calculating text quality values.
I explain how to calculate the quality value of texts using the editors' quality values. If B's quality value is 100%, meaning B is a high quality editor, then B should delete only low quality texts. Therefore, when B deletes 20 letters, all 20 of A's letters count as deleted. However, when C deletes 60 letters and C's quality value is 50%, C may be deleting high quality texts half the time. Therefore, only 30 of A's letters, half of the actually deleted letters, count as deleted by C.
However, there is another problem. A text's quality value is calculated from both the edit history and the editors' quality values, while an editor's quality value is calculated from text quality values. Therefore, calculating the editors' and the texts' quality values is a kind of chicken-or-the-egg problem. To solve this, we mutually calculate the editors' and texts' quality values until the values converge.
Using these observations, we refine our proposed method. First, we identify the editors of articles. Then we get the edit history of each editor. Then we calculate the texts' quality values using the editors' quality values; the first time we calculate text quality values, we set every editor's quality value to 1, the highest value. Then we calculate the editors' quality values. Next, if the text quality values and editor quality values have not converged, we return to step 3. Finally, we calculate the articles' quality values.
I used Japanese Wikipedia edit history data from the Wikipedia site. I used 85,028 articles, about 13.6% of all articles. These articles were written by 705,713 editors, excluding bots. As credible articles I used the featured articles and good articles selected by Wikipedians. In this experiment I used the Japanese Wikipedia, but any language edition could be used. However, the English Wikipedia edit history was not available at the time, so I could not use the English version.
This is the experimental result. From this recall-precision graph, we can confirm that precision improves by about 10%. From recall 0 to 0.5, precision improves by about 20%, whereas precision does not improve at recall 0.6 to 1. When an article about current events is high quality, our system can judge it as high quality, but such articles are not among the featured articles. When one editor writes excellent texts and the other editors do not edit, the article may be featured but is not judged as high quality by our method. Moreover, the quality values of texts and editors converge when we calculate the quality values 20 times each.
Finally, I conclude our study. In this study, we calculate texts' quality values using the editors' quality values. The relation between text and editor quality values is a kind of chicken-or-the-egg problem; to solve it, we mutually calculate text and editor quality values until they converge. As a result, we improve average precision by about 10%; at low recall, precision improves by about 20%. Next, I introduce our future work. The first topic is the confidence of quality values. If A edits 100 articles many times, B edits only one article once, and A and B have the same quality value, the system judges their quality as the same. But they should differ, because the confidence differs. Another topic is other effective assumptions: when a high quality editor confirms a text, the text should be judged high quality even if it was written by a low quality editor.
I am considering several open problems, such as content analysis techniques: estimating terms that appear frequently in credible articles but not in non-credible articles. Next, using articles in multiple languages: I think the English Wikipedia is the richest, so if an article in Japanese is similar to its English counterpart, the article may be credible or rich. I also want to apply my system to Web documents and SNS, but there is no edit history for Web documents, so I must discover how to calculate quality without an edit history.