4.
How can we measure semantic stability?
How can we compare the semantic stabilization process in
different systems?
What impacts semantic stability?
5. Measuring Semantic Stability
State of the Art
• Relative tag proportions per resource become stable with
an increasing number of tag assignments [Golder and
Huberman, 2006]
• KL-divergence of rank-ordered tag frequency distribution per
resource at different time points converges towards zero
[Halpin et al., 2007]
• Power-law distributions [Cattuto et al., 2006] – the scale-invariance
property ensures that, regardless of how large the system grows,
the shape of the distribution stays the same
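The Golder and Huberman criterion can be checked directly by tracking the relative proportion of each tag in growing prefixes of the tag stream. A minimal sketch (the stream and checkpoint sizes below are illustrative, not from the cited study):

```python
from collections import Counter

def relative_proportions(stream, tag, checkpoints):
    """Relative frequency of `tag` within each prefix of the stream."""
    return [Counter(stream[:n])[tag] / n for n in checkpoints]

# Toy stream: the proportion of "web" settles as assignments accumulate.
stream = ["web", "tech", "web", "news", "web", "tech", "web", "web"]
print(relative_proportions(stream, "web", [2, 4, 8]))  # [0.5, 0.5, 0.625]
```

Stability in this sense means these proportion curves flatten out as the number of tag assignments grows.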
6. Some Limitations
• Don’t allow comparing the semantic
stabilization process of different systems
• Prune tag distributions to top-k tags
– Cannot handle non-conjoint lists of tags
• A random tagging process also produces a
“stable” description
– A tag assignment at timepoint t+1 has less impact
on the tag distribution of a resource than one at
timepoint t
7. Example
KL-Divergence
• KL-divergence converges
towards zero.
• But random baseline also
converges towards zero if
we assume a constant
tagging rate.
• We do not always know
the top k tags!
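The random-baseline caveat is easy to reproduce. The sketch below (vocabulary size, stream length, and top-k are arbitrary choices, not values from the paper) compares rank-ordered top-k distributions at growing prefixes of a uniformly random tag stream; the KL-divergence still shrinks as the stream grows:

```python
import math
import random
from collections import Counter

def rank_freq(tags, k=25):
    """Rank-ordered relative frequencies of the top-k tags."""
    counts = [c for _, c in Counter(tags).most_common(k)]
    total = sum(counts)
    return [c / total for c in counts]

def kl(p, q):
    """KL-divergence between two rank-ordered distributions of equal length."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)

# Random tagging: uniform draws from a fixed vocabulary of 50 tags.
rng = random.Random(1)
stream = [rng.randrange(50) for _ in range(4000)]
early = kl(rank_freq(stream[:200]), rank_freq(stream[:400]))
late = kl(rank_freq(stream[:2000]), rank_freq(stream[:4000]))
# `late` comes out smaller than `early`: the random baseline also "converges".
```

Because the comparison is by rank position rather than tag identity, any process with a roughly constant tagging rate flattens into a near-uniform rank profile, which is exactly the limitation the slide points out.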
[Figure: KL-divergence over the number of consecutive tag assignments]
8. Example
Relative Tag Proportion
[Figure: relative tag proportions of the top user list names (e.g. socialmedia, tech, technology, web) over consecutive tag assignments, shown for the full stream and for the first 10,000 assignments]
9. Intuition and Approach
• Stable: some descriptors are more important than others, and the ranking of the (top) descriptors remains stable over time.
• Less stable: all descriptors are equally important, and the ranking of the (top) descriptors changes over time.
[Figure: P(T) distributions at tn and tn+m for a stable and a less stable resource]
11. Requirements
• Rank agreement of the descriptors of a
resource over time
• Weighted rank agreement
• Non-conjoint lists of descriptors
• Random Baseline
12. Rank Biased Overlap (RBO)
[Webber et al., 2010]
• RBO falls in the range [0, 1], where 0 means
disjoint, and 1 means identical
• p lies between 0 and 1 and determines how steep
the decline in weights is
• The smaller p, the more top-weighted the metric
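The measure can be written down in a few lines. This is the finite-prefix truncation (Webber et al. additionally extrapolate the unseen tail overlap, which is omitted here for brevity):

```python
def rbo(s, t, p=0.9):
    """Rank-biased overlap of two rankings, truncated at the longer list.

    The set overlap at each depth d is weighted by p**(d-1); the (1-p)
    factor normalizes the geometric series so scores fall in [0, 1].
    """
    depth = max(len(s), len(t))
    seen_s, seen_t, score = set(), set(), 0.0
    for d in range(1, depth + 1):
        if d <= len(s):
            seen_s.add(s[d - 1])
        if d <= len(t):
            seen_t.add(t[d - 1])
        score += (p ** (d - 1)) * len(seen_s & seen_t) / d
    return (1 - p) * score
```

Disjoint lists score 0; identical lists approach 1 as they grow (the truncation leaves the residual weight p**depth unassigned); and reversing a list lowers the score, because early depths carry the most weight.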
17. Tie correction for
Rank Biased Overlap
• RBO does not penalize ties
• We want to penalize ties since they show that
users have not agreed on a ranking
• Sum only over those depths which occur in at
least one of the two rankings
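One possible reading of this correction in code (a sketch, not necessarily the authors' exact formulation): represent each ranking as a list of tie groups and sum the weighted overlap only at depths that close a group in at least one ranking, keeping the original (1 − p) normalization so that skipped depths cost score:

```python
def group_depths(groups):
    """Depths at which a tie group ends; only these ranks are well-defined."""
    depths, d = set(), 0
    for g in groups:
        d += len(g)
        depths.add(d)
    return depths

def rbo_tie_corrected(groups_s, groups_t, p=0.9):
    """RBO over rankings given as lists of tie groups (lists of items)."""
    s = [x for g in groups_s for x in g]
    t = [x for g in groups_t for x in g]
    valid = group_depths(groups_s) | group_depths(groups_t)
    score = 0.0
    for d in range(1, max(len(s), len(t)) + 1):
        if d not in valid:
            continue  # depth lies inside a tie group in both rankings
        score += (p ** (d - 1)) * len(set(s[:d]) & set(t[:d])) / d
    return (1 - p) * score
```

With singleton groups this reduces to plain truncated RBO; a ranking whose head is one big tie loses the contribution of the skipped depths, which is the intended penalty.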
18. Same concordant pairs: (A,D), (B,D), and (C,D)
[Figure: tag-frequency bar charts for two resources at tn and tn+m.
No ties: rankings A B C D and C B A D give RBOorig = 0.2 and RBOmod = 0.2.
Ties: RBOorig = 0.34 but RBOmod = 0.17, i.e. the modified measure penalizes the ties.]
19. Semantic Stabilization on a
Resource Level
[Figure: RBO over the number of consecutive tag assignments for individual Twitter users and a random baseline]
• Tag distributions of Twitter
users become semantically
stable between 1k and 2k
tag assignments
• The RBO values of random
tagging distributions
increase more slowly and are
significantly lower
20. Semantic Stabilization
on a System Level
• How can we compare the semantic stabilization
process in different systems?
• We call a resource description semantically stable
after tn+m tag assignments if the RBO value
between its tag distributions at points tn and tn+m is
equal to or greater than k.
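This definition turns directly into a system-level summary: given one RBO value per resource (computed between its distributions at tn and tn+m), report the share of resources at or above the threshold k. A sketch with made-up values:

```python
def fraction_stable(rbo_values, k):
    """Share of resources whose RBO between t_n and t_n+m is >= k."""
    return sum(v >= k for v in rbo_values) / len(rbo_values)

# Hypothetical per-resource RBO values after some number of assignments.
scores = [0.9, 0.75, 0.61, 0.55, 0.3]
print(fraction_stable(scores, 0.61))  # 0.6
```

Sweeping k and the number of tag assignments t yields the contour plots used to compare systems.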
21. Semantic Stabilization
on a System Level
After 1250 tag assignments, 90% of all
resources have a stability above 0.61
27. What causes semantic stability?
• Simulations based on the epistemic tagging model
[Dellschaft and Staab, 2008].
• Use the parameter I as imitation rate and produce tag
distributions for I = 0, 0.1, …, 1
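A toy stand-in for the imitation mechanism (not the full Dellschaft–Staab model, which additionally grounds background knowledge in text): with probability I copy a previously assigned tag, otherwise draw a fresh tag from a fixed background vocabulary:

```python
import random

def simulate_stream(n, imitation, vocab_size=1000, seed=42):
    """Generate a tag stream where each assignment imitates an earlier
    one with probability `imitation`, else draws uniformly from the
    background vocabulary (a crude proxy for background knowledge)."""
    rng = random.Random(seed)
    stream = []
    for _ in range(n):
        if stream and rng.random() < imitation:
            stream.append(rng.choice(stream))  # preferential reuse
        else:
            stream.append(rng.randrange(vocab_size))
    return stream

# Higher imitation concentrates the distribution on fewer tags.
low = len(set(simulate_stream(500, imitation=0.0)))
high = len(set(simulate_stream(500, imitation=0.9)))
```

Choosing an earlier assignment uniformly at random makes reuse proportional to a tag's current frequency, which is the preferential-attachment flavor of imitation the simulations rely on.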
30. What causes stability?
If imitation and background knowledge (BK) are
combined and imitation is dominant, higher levels of
semantic stability are reached faster
31. What causes stability?
• Combination of shared background knowledge and imitation
behaviour (where imitation is more important) leads to the fastest
and highest stabilization.
• Natural language systems show stabilization similar to that of social
tagging systems where no imitation is supported
32. Conclusions & Implications
• Attempt to formalize semantic stability in social streams
• Novel approach to measure and compare the semantic
stabilization process in different social streams
Why is that useful?
• Identify social streams (e.g. tag stream of URL or word stream
of hashtags) which are semantically stable
– Extract shared and agreed-upon semantic knowledge from
social streams
• Select systems that provide semantically stable streams
33. References
• D. Bollen and H. Halpin. The role of tag suggestions in folksonomies. In Proceedings of the 20th ACM
conference on Hypertext and hypermedia, HT ’09, pages 359–360, New York, NY, USA, 2009. ACM.
• C. Cattuto. Semiotic dynamics on social tagging communities. The European Physical Journal C –
Particles and Fields, 46(2, Supplement):33–37, August 2006.
• A. Clauset, C. R. Shalizi, and M. E. J. Newman. Power-law distributions in empirical data. SIAM Rev.,
51(4):661–703, Nov. 2009.
• K. Dellschaft and S. Staab. An epistemic dynamic model for tagging systems. In HT ’08: Proceedings of
the nineteenth ACM conference on Hypertext and hypermedia, pages 71–80, New York, NY, USA, 2008.
ACM.
• S. Golder and B. A. Huberman. Usage patterns of collaborative tagging systems. Journal of Information
Science, 32(2):198–208, April 2006.
• H. Halpin, V. Robu, and H. Shepherd. The complex dynamics of collaborative tagging. In Proceedings of
the 16th international conference on World Wide Web, WWW ’07, pages 211–220, New York, NY, USA,
2007. ACM.
• A. Hotho, R. Jäschke, C. Schmitz, and G. Stumme. Bibsonomy: A social bookmark and publication
sharing system. In Proceedings of the Conceptual Structures Tool Interoperability Workshop at the 14th
International Conference on Conceptual Structures, pages 87-102, 2006.
• C. T. Kello, G. D. A. Brown, R. Ferrer-i-Cancho, J. G. Holden, K. Linkenkaer-Hansen, T. Rhodes, and G.
C. Van Orden. Scaling laws in cognitive sciences. Trends in Cognitive Sciences, 14(5):223–232, May
2010.
• W. Webber, A. Moffat, and J. Zobel. A similarity measure for indefinite rankings. ACM Trans. Inf. Syst.,
28(4):20:1–20:38, Nov. 2010.
35. Limitations and Future Work
• RBO measures ranking but ignores the differences
in the frequencies
• Decay function to weight tag counts
– old tag assignments are less important than new ones
• Number and diversity of users who tag a resource
might impact the semantic stabilization process
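The proposed decay idea can be sketched as an exponentially decayed tag count, where old assignments contribute less than new ones (the half-life below is a free parameter of the sketch, not something fixed in the talk):

```python
from collections import defaultdict

def decayed_counts(stream, half_life=1000.0):
    """Tag counts where an assignment's weight halves every
    `half_life` later assignments; the newest one has weight 1."""
    counts = defaultdict(float)
    n = len(stream)
    for i, tag in enumerate(stream):
        age = n - 1 - i  # how many assignments arrived after this one
        counts[tag] += 0.5 ** (age / half_life)
    return dict(counts)

# With a short half-life, recent tags outweigh equally frequent old ones.
weights = decayed_counts(["old", "old", "new", "new"], half_life=1.0)
```

Ranking by these decayed counts instead of raw counts would let the stability measure react to drift in how a resource is described.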
36. Alternatives to RBO
• Unweighted and conjoint measures
– Kendall tau, Spearman rho
• Weighted and conjoint measures
– Weighted Kendall tau
• Unweighted and non-conjoint measures
– Intersection metric
• Weighted and non-conjoint measures
– Cumulative overlap at increasing depths
38. Categories of Semantically
Unstable Resources
• Entity to which a resource refers changes
• Resource (i.e. website) changes
• Entity/Topic to which a resource refers is controversial
– website refers to controversial entity/topic on which
different viewpoints exist
• External conditions which impact viewpoints on
entity/topic change
– Website remains stable but viewpoint of taggers on the
entity or topic related with the site change
40. Relative Tag Proportion
[Golder and Huberman, 2006]
[Figure: relative tag proportions of the top user list names over consecutive tag assignments]
41. KL-Divergence
[Halpin et al., 2007]
• KL divergence between the rank-ordered frequency
distribution of the top 25 tags at different time points
[Figure: rank-ordered tag frequency distributions at tn and tn+m for a stable and a less stable resource]
43. Power Law
[Cattuto, 2006]
• Is the rank-ordered frequency distribution a power law
distribution?
• Is the frequency y of a tag inversely proportional to its
rank r?
44. Power Law
[Cattuto, 2006]
• Is it really a power law?
– Very likely yes, according to the maximum
likelihood estimator and the Kolmogorov–
Smirnov statistic [Clauset et al., 2009]
– Estimate alpha and xmin over some
reasonable range
– Compare power law fit to the fit of the
exponential function, the lognormal
function and the stretched exponential
(Weibull) function. Use the log-likelihood
ratios to indicate which fit is better.
– We do not find significant differences
between the power law fit and the
lognormal fit
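The Clauset-style check can be sketched with the continuous maximum-likelihood estimator for the exponent (the full recipe also scans xmin and runs a Kolmogorov–Smirnov goodness-of-fit test; only the alpha step is shown here):

```python
import math
import random

def fit_alpha(data, xmin=1.0):
    """Continuous MLE for the power-law exponent alpha (Clauset et al.)."""
    xs = [x for x in data if x >= xmin]
    return 1.0 + len(xs) / sum(math.log(x / xmin) for x in xs)

# Sanity check: sample from a known power law via inverse-transform sampling.
rng = random.Random(0)
alpha_true = 2.5
sample = [(1.0 - rng.random()) ** (-1.0 / (alpha_true - 1.0)) for _ in range(20000)]
alpha_hat = fit_alpha(sample)  # recovers a value close to 2.5
```

Comparing fits against the lognormal and stretched-exponential alternatives, as the slide describes, is then done via log-likelihood ratios over the same tail.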
There are many social media apps which allow users to tag and talk… From these distributions… user activities… Folksonomies are collaboratively generated, fuzzy categorization schemas. Ontologies are formally defined classification schemas. Usually they are generated in advance by a group of experts who conceptualize a domain of interest. Ideally this conceptualization represents the agreed-upon and shared semantic view of the domain. If the domain of interest is huge and constantly changing, the manual construction of ontologies fails. Therefore there was a lot of research in the Semantic Web community… Since ontologies represent shared and agreed-upon semantics, we need to ask ourselves to what extent the descriptions of resources which emerge from social media streams represent shared and stable semantic descriptions of resources.
Better example: resources which are very subjective. Ask 100 people and everyone describes them differently.
Some resources change over time (and by resource I mean everything that can have a URI), and therefore their semantic description changes too. People would have described him as a bodybuilder in the 1970s, as an actor in the 1990s, and as a politician nowadays.
Other resources don't change, but people's viewpoints on them may change, or people may have contradicting viewpoints on the same resource. Therefore, again, the descriptions may stabilize but also destabilize over time.
Other resources don't change, and people's viewpoints on them don't change either. Therefore, as people keep tagging them, the description will converge over time to a stable and shared semantic description. In our work we are interested in measuring semantic stability, comparing the semantic stabilization process in different social media systems, and exploring the factors that might impact semantic stability.
Previous researchers also recognized the need to explore the semantic stability of the resource descriptions which emerge when a large number of users tag a resource, since this stability is a prerequisite for learning ontologies from folksonomies.
Three methods have been proposed to measure semantic stability. However, as we show in our paper, those methods have certain limitations; to overcome them, we present a novel approach.
Existing methods have certain limitations: they do not allow… they operate on a resource level.
Their measure converges by definition towards 0 if the number of tag assignments remains stable over time. Only if the number of tags assigned to a resource varies a lot over the time bins can convergence be interpreted as a sign of semantic stability. A single tag assignment in month j always has more impact on the shape of the distribution than a single tag assignment in month j+1.
If we look at the relative proportions of the top k tags (which Scott Golder and Bernardo Huberman did), we see that…
To summarize: Kello et al. provide a good critical reflection on the informativeness of scaling laws. For example, researchers have shown that random sequences of characters also exhibit Zipf's law. So there is an ongoing discussion about scaling laws, since idiosyncratic ways of producing power laws exist. Further, the question of what produces a power law remains open. Additive summation of components as well as systems dominated by multiplicative interactions are known to produce heavy tails.
Given these limitations, we thought we should come up with an alternative approach for measuring semantic stability in social streams. The intuition behind our approach is the following: for one given resource we observe, after tn and tn+m tag assignments, a ranked list of tags which reflects by how many users a tag was assigned to the resource. We consider a resource description semantically stable if… Unstable if all descriptors are equally important or unimportant; this is a sign of disagreement.
Our intuition of semantic stability incorporates two aspects: implicit consensus and stability. Stability means tolerance to perturbations over time, and implicit consensus means that users agree on the relative importance of tags: some tags are picked much more frequently than others.
From this intuition we can infer some requirements for a new measure.
To operationalize our intuition of semantic stability and meet these requirements, we propose a modified version of RBO. It measures the agreement between two ranked lists of items and is based on the cumulative set overlap. It can handle non-conjoint rankings. The set overlap at each rank is weighted by a geometric sequence, providing both top-weightedness and convergence.
Consider this resource, for which we observe the following two tag distributions after tn and tn+m tag assignments. If we want to assess the stability of the resource description, we first compare the overlap at depth 1.
And so on; this is the cumulative set overlap. In the plain cumulative set overlap, the weight of the overlap at depth d depends on the total number of depths D: the element at rank 1 has weight D/D, the element at rank 2 has weight (D-1)/D, and so on. In addition we use p, which defines how fast the weights decline.
In addition, the parameter p defines a convergent series of weights (i.e., a series of weights whose sum is bounded). RBO biases the proportional overlap at each depth by this convergent series of weights. Therefore p ensures that the infinite tail of tags does not dominate the finite head, and this is important since we have heavy-tailed distributions.
The smaller p, the more top-weighted the metric.
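Since RBO's weight at depth d is (1 − p)·p^(d−1), the mass carried by the first d depths is 1 − p^d, which makes the top-weightedness of small p concrete:

```python
def head_weight(p, d):
    """Fraction of the total RBO weight carried by the first d depths."""
    return 1.0 - p ** d

# With p = 0.5 the top 10 depths carry ~99.9% of the total weight;
# with p = 0.9 they carry ~65%.
```

So p trades off how far down the two rankings the comparison effectively looks.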
RBO does not penalize ties.
So we have two resources R1 and R2. For both resources we observe their tag distributions after tn and tn+m tag assignments. Looking at R1 we see concordant pairs: A has a higher rank than D after tn and also after tn+m tag assignments, and the same is true for (B,D) and (C,D). The description of R1 contains ties; the description of R2 does not. But we find the same concordant pairs between the two tag distributions of R2.
Here it makes a difference how we rank the ties. If we say C=1, B=1, A=1 and D=4, we produce the same result as the original measure. If we say C=1/3, B=1/3, A=1/3 and D=4, we produce 0.24, which does not really penalize ties either. Alternative: we could sum only over those depths which occur in the second ranking. Then we would penalize the emergence of ties over time, but not the existence of ties.
So now we have a measure that operationalizes our intuition of semantic stability, and we can use it to explore the stabilization process of individual resources. This figure shows the stabilization of the descriptions of Twitter users. On Twitter, people can tag… One can see that between 1k and 2k tag assignments, high or medium levels of stability are reached, depending on which resource we look at. We can also see that a random baseline process does not really stabilize. If we only looked at the shape of the distribution, a random baseline process would also appear to stabilize, since it would produce a flat distribution which would stay flat over time.
Now we have seen how to use our approach on a resource level; however, it is still unclear… To address this problem we introduce a flexible definition of semantic stability which allows us to compare the semantic stability of different resource streams stemming from different social tagging systems.
This definition allows us to explore the semantic stabilization process per system by looking at the proportion of resources that have stabilized according to our parameters k and t. This figure shows the percentage of resources (in this case heavily tagged Twitter users) stabilized at time t with stability threshold k. For example, point P indicates that after 1250 tag assignments, 90% of resources have an RBO value of 0.61 or higher. The contour lines illustrate the curves along which the function has constant values; the depicted values represent the percentage of stabilization f.
We used this approach to compare the semantic stabilization process in different social media systems. First let's look at tag streams on Twitter; here, tag streams are user list name streams.
Let's compare the stabilization process on Twitter with the stabilization process on Delicious. We can see that resource descriptions on Delicious stabilize faster and reach significantly higher levels of semantic stability.
Next we looked at the semantic stability of book descriptions on LibraryThing.
We also added a random baseline, and we can see that it does not stabilize. This shows that our approach is able to differentiate between real semantic stabilization and the stability which we observe for random tagging. If we only looked at the shape of the distribution, a random tagging process would also appear to stabilize, since its relatively flat list of tags would stay relatively flat. It's important that…
It is actually surprising that tag streams of Twitter users stabilize that much. We wanted to know if this is because people TAG other people, or if we would observe the same stabilization if people just TALKED about other people. We created a dataset of tweets in which a random set of users is mentioned, i.e., a stream of tweets where people talk about users, and used the words in these tweets as descriptors of the person. One can see a stabilization process similar to when people tag other users. This suggests that a medium level of stability can also be explained by the properties of natural language.
This leads me to our final question…
The epistemic tagging model is a generative model which includes both background knowledge (BK) and the influence of previously assigned tags. Since BK is encoded in natural language, we cannot distinguish between natural language and BK at this stage.
Klaas and Steffen showed that a mixture of BK and imitation is best for reproducing the shape of the tag frequency distribution. However, they focus on reproducing the shape of the rank-ordered frequency distribution, while we explore the stabilization process over time. Further, previous research considered the sharp drop between ranks 7 and 10 a typical characteristic of tagging streams which distinguishes them from word-frequency distributions. However, Bollen and Halpin's work suggests that this might only be caused by the user interface, which suggests up to 10 tags; if no tags are suggested, there is no sharp drop.
First we consider a tagging model where people rely ONLY on their BK. This model reflects the properties of natural language, since we use Wikipedia as BK.
We then add a bit of imitation: now people imitate others 30% of the time and use their BK 70% of the time. We do not see differences.
Next we use a 70% imitation rate and 30% BK. We see faster and higher stabilization.
Finally, we use 100% imitation and observe that in this case no stabilization happens, since people fail to introduce new tags if they don't use their BK at all. Overall, our empirical results as well as our simulation results suggest that…
To sum up: where do we go from here? I have presented a simple method for measuring semantic stability in social streams which is quite flexible, can easily be adapted to other count/frequency functions, and can be used on a resource and on a system level. I have shown that existing methods have certain limitations and that the notion of semantic stability requires both concepts: stability and implicit consensus.
So why should we care about semantic stability?
First, because it helps us learn something about the nature of resources on the web. Second, it helps to identify streams which are in a stable phase. Finally, it helps to identify applications with a community that produces stable descriptions.
Empirical results as well as simulation results show that the stabilization process benefits from combining …
Some interesting avenues for future work
The problem with the cumulative overlap is that, if we assume a long and potentially infinite tail, the tail will dominate the head. RBO biases the proportional overlap at each depth by a convergent series of weights (i.e., a series of weights whose sum is bounded).
We used this method on different social resource streams. Here you can see a plot for one Twitter user. I use this Twitter dataset here since it was the starting point for this project, because we thought the dataset was interesting. One can see that the lines indeed become straight for many sample users. Since I did this work during my internship at HP, I had the chance to discuss the plots with Bernardo, and he said that it looks stable, but less stable than Delicious. So how can we quantify that?
A power law is a functional relationship between two quantities, where one quantity varies as a power of the other. The scale-invariance property of power laws makes them interesting, since it suggests that no matter how much the system grows, the shape of the distribution remains the same.
The probability of measuring a particular value of some quantity varies (inversely) as a power of that value.
The probability of observing a tag with frequency y varies as a power of its rank. There are only a few very frequent tags but many less frequent ones; therefore the probability of observing a high-frequency tag is low.
The cumulative distribution function (CDF) (also called the rank-frequency distribution) describes the probability that a random variable X will be found at a value less than or equal to x.
The complementary cumulative distribution function (CCDF) asks how often the random variable is above a particular level.
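For plotting power-law tails, the empirical CCDF can be computed directly; a simple quadratic-time sketch over distinct values:

```python
def ccdf(data):
    """Empirical CCDF: pairs (x, P(X >= x)) for each distinct value x."""
    n = len(data)
    return [(x, sum(v >= x for v in data) / n) for x in sorted(set(data))]

points = ccdf([1, 2, 2, 5])
# [(1, 1.0), (2, 0.75), (5, 0.25)]
```

On log-log axes these points fall on a straight line when the data follow a power law, which is the visual check the cumulative plots discussed here provide.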
Cumulative distributions are sometimes also called rank/frequency plots. Cumulative distributions with a power-law form are sometimes said to follow Zipf's law or a Pareto distribution, after two early researchers.
“Zipf’s law” and “Pareto distribution” are effectively synonymous with “power-law distribution”.
Zipf’s law and the Pareto distribution differ from one another in the way the cumulative distribution is plotted—Zipf made his plots with x on the horizontal axis and P(x) on the vertical one; Pareto did it the other way around. This causes much confusion in the literature, but the data depicted in the plots are of course identical.
Empirical power-law distributions hold only approximately or over a limited range.