More Related Content Similar to Poster presentation 5th BENet (Belgium Network Research Meting), Namur (20) Poster presentation 5th BENet (Belgium Network Research Meting), Namur 1. Temporal Patterns of
cedaysan@gmail.com
http://fcxn.wordpress.com
http://xn.unamur.be
e, the interpretation of the dynamic
eity of the beads in a critical limit
lp to characterize viral memes
#hashtags) in twitter.
Refs:
1āÆ C. Sanlı et al. (arXiv - 2013).
2āÆ L. Berthier (2011).
ā¢āÆ A typical snapshot of an experiment:
The white spots indicate the positions
of the beads floating on surface
waves.
cedaysan@gmail.com
http://fcxn.wordpress.com!
Month 6 General Meeting
http://xn.unamur.be
result, the twitter users collectively advertise and
the beads form groups to move together. Both
systems self-organize and create dynamic
heterogeneity.
Therefore, the interpretation of the dynamic
heterogeneity of the beads in a critical limit
would help to characterize viral memes
(#hashtags) in twitter.
Refs:
1āÆ C. Sanlı et al. (arXiv - 2013).
2āÆ L. Berthier (2011).
Refs:
1āÆ H. Simon (1971).
2āÆ L. Weng et al. (2012).
3āÆ J. P. Gleeson et al. (2014).
0 12 24 36 48 60 72 84
0
time (hours)
nu
Communication
Spike Trains:
How often, who interacts with whom?
fcxn.wordpress.com
cedaysan@gmail.com
@CeydaSanli
Frontiers in Physics 3(79), 2015.
PLoS ONE 10(7): e0131704, 2015.
@RenaudLambiotte
compleXity !
networks
Sanli and Lambiotte Temporal pattern of online communication
accurately described by models on static networks and
consequently the process presents non-Markovian features
with strong inļ¬uence on the time required to explore the
system [25, 26]. Furthermore, the dynamics drives a strong
heterogeneity observed in user activity [27, 28] and user/content
popularity [29ā31]. Speciļ¬cally, in Twitter, the heterogeneity in
popularity has been observed and quantiļ¬ed in diļ¬erent ways
by the size of retweet cascades, i.e., users re-transfer messages
to their own followers with or without modifying them [32ā36]
or by the number of mentions of a user name, identiļ¬ed by the
symbol @, in other peopleās tweets [37].
In this paper, we focus on the dynamics of social interactions
taking place when diļ¬using rumors about the discovery of the
Higgs boson on July 2012 in Twitter [38]. Our main goal is to ļ¬nd
connections between the statistical properties of user time series
established on the same subject, e.g., the announcement of the
discovery of the Higgs boson, and their activity and popularity.
To this end, we analyze tweets including social interactions, such
as retweets of a message (RT), mentions of a user name (@), and
replies to a message (RE). For each type of the interactions, a
user can either play an active, e.g., retweeting, or a passive, e.g.,
being retweeted, role. Therefore, we characterize each user by
8 time series: one active and one passive time series for each
of the 3 types of interaction as well as for the aggregation of
all interactions, as illustrated in Figure 1. Active time series are
denoted as WHO and passive time series are deļ¬ned by WHOM.
We then investigate whether the statistical properties of each
signal is a good predictor for the activity and popularity of a user.
The following sections are organized as follows. In Section 2,
we describe the data set and provide basic statistical properties
of who and whom time series. In Section 3, we introduce a
technique dedicated to the analysis of non-stationary time series,
so-called local variation, originally established for neuron spike
trains [39ā42] and recently applied to hashtag spike trains in
Twitter [43, 44]. In Section 4, we search for statistical relations
between local variation and measures of popularity of a user.
Finally, Section 5 summarizes the key results and raises open
questions.
2. Activity and Popularity of Users
Our aim is to examine the dynamics of user communications in
Twitter. We investigate how frequently users talk to each other
on a certain topic, e.g., the discovery of the Higgs boson, and
identify how complex dynamic patterns of the communications
evolve in time. To this end, we focus on the three diļ¬erent
types of interaction between users, retweet (RT), mention (@),
and reply (RE). Twitter users can adopt a tweet of someone and
use it again in their own tweet by RT or contact to other users
directly by typing user names in a message called @ or simply RE
to any tweets, e.g., regular tweets, retweets, and tweets/retweets
including @s. Typically, @s and REs are associated to personal
interactions between users, whereas RTs are responsible for large-
scale information diļ¬usion in the social network and for the
emergence of cascades. Here, we count all types of interaction as
a part of complex information diļ¬usion in the network.
FIGURE 1 | Illustration of communications in Twitter. The sketch
summarizes the two positions of each user, e.g., active (who) and passive
(whom). Who interacts in time with any other whom by retweeting (RT) the
messages and mentioning (@) the user names of whom in a message and
replying (RE) to the messages from whom. Quantifying temporal patterns in
time series of who with various ranges of the activity of users aU and of whom
by increasing the popularity of users pU is the main scope of this paper.
Interactions in Twitter are performed between at least two
users (for instance, a user can mention several other users in
a single tweet). Each action is directed and characterized by its
timestamp. The users performing the action play active roles
(who users), the users receiving their attention play a passive
role (whom users), and each user can appear in both active and
passive roles described in Figure 1. Therefore, we construct active
and passive RT, @, and RE spike trains for each user.
2.1. Data Set
As a test bed, we consider the publicly available Higgs Twitter
data set [38, 45], ļ¬rst collected to track the spread of the rumor
on the discovery of the Higgs boson via RT, @, or RE. The data set
is composed of tweets containing one of the following keywords
or hashtags related to the discovery of the Higgs boson, ālhc,ā
ācern,ā āboson,ā and āhiggs.ā The start date is the 1st July 2012,
00:00 a.m. and the ļ¬nal date is the 7th July 2012, 11:59 p.m.,
which covers the announcement date of the discovery, the 4th
July 2012, 08:00 a.m. All dates and timestamps in the data are
converted to the Greenwich mean time. Detailed information on
the data collection procedure and basic statistics can be found in
Domenico et al. [38].
In total, the data is composed of 456,631 users (nodes) and
563,069 interactions (edges). Among those, we detect 354,930 RT,
171,237 @, and 36,902 RE, which shows that RT is more popular
than the other communication channels. For RT interactions, we
ļ¬nd 228,560 users join in who, in contrast, only 41,400 users
appear in whom. These numbers are smaller for @, e.g., 102,802
who and 31,477 whom, and even smaller for RE, with 27,227
who and 18,578 whom. In each case, whom is much lower
than who, as expected because a small number of users tend to
attract a large fraction of attention in both friendship [46, 47] and
online social [48ā52] networks. This observation is conļ¬rmed
in Figure 2, where we present Zipf plots associated to each
Frontiers in Physics | www.frontiersin.org 2 September 2015 | Volume 3 | Article 79
summary!
patterns!
tail analysis is required to resolve temporal correlations [31,
], bursts [19ā22], and cascading [53] driven by circadian
ythm [23, 24], complex decision-making of individuals [3,
54], and external factors [6] such as the announcement of
coveries, as considered in the current data [38].
To uncover the dynamics of the communication spike trains
borately, we apply the local variation LV originally deļ¬ned to
aracterize non-stationary neuron spike trains [39ā42] and very
ently has been used to analyze hashtag spike trains [43, 44].
nlike the memory coeļ¬cient and burstiness parameter [15], LV
ovides a local temporal measurement, e.g., at Ļi of a successive
me sequence of a spike train . . ., Ļiā1, Ļi, Ļi+1, . . ., and so
mpares temporal variations with their local rates [41]
LV =
3
N ā 2
Nā1
i = 2
(Ļi+1 ā Ļi) ā (Ļi ā Ļiā1)
(Ļi+1 ā Ļi) + (Ļi ā Ļiā1)
2
(1)
ere N is the total number of spikes. Equation (1) also takes the
m [41]
LV =
3
N ā 2
Nā1
i = 2
Ļi+1 ā Ļi
Ļi+1 + Ļi
2
(2)
re, Ļi+1 = Ļi+1 ā Ļi quantiļ¬es the forward delays and
i = Ļi ā Ļiā1 represents the backward waiting times for
event at Ļi. Importantly, the denominator normalizes the
antity such as to account for local variations of the rate at
ich events take place. By deļ¬nition, LV takes values in the
erval (0:3) [43]. It has been shown that helps at classifying
namical patterns successfully [39, 40, 42ā44]. Following the
alysis of Gamma processes [39, 40, 43] conventional in neuron
ke analysis [42], it is known that LV = 1 for temporarily
correlated (Poisson random) irregular spike trains, and that
gher values are associated to a burstiness of the spike trains.
contrast, smaller values indicate a higher regularity of the time
ies.
We now perform an analysis of LV on the user communication
ke trains. Equation (2) is performed through the spike trains
th removing multiple spikes taking place within 1 s. Such
ents are rare and their impact on the value of LV has been
own to be limited [43]. Figure 3 describes the distribution of
, P(LV) of full spike trains all together with RT, @, and RE
the who (a, b) and whom (c, d). Grouping LV based on the
quency fU, e.g., the activity of the who aU and the popularity
the whom pU, we examine the temporal patterns of the trains
diļ¬erent classes of aU and pU. For the real data in (a, c),
Figure 3A, LV is always larger than 1 in any values of aU,
ggesting that all users playing a role in who contact to the
om in bursty communications. However, in Figure 3C, we
serve distinct behavior of the whom users and bursts present
ly for low pU. By increasing pU, LV ā 1 indicating that there
no temporal correlation among the who referring the whom
d LV is slightly smaller than 1 for the most popular users,
dicating a tendency toward regularity in the time series, as also
served for the hashtag spike trains [43]. These observations
September 2015 | Volume 3 | Article 79
local!
variation!
5mm
ā¢āÆ Inset: The displacement
field demonstrates local
heterogeneities in the flow.
ā¢āÆ A typical snapshot of an experiment:
The white spots indicate the positions
of the beads floating on surface
waves.
cedaysan@gmail.com
http://fcxn.wordpress.com!
Month 6 General Meeting
http://xn.unamur.be
Fluctuations drive viral memes in online social media:
Integrating criticality into network science
Ceyda Sanlı, Vsevolod Salnikov, Lionel Tabourier, and Renaud Lambiotte
CompleXity Networks, naXys, University of Namur, Belgium.
To spread our posts throughout online social network
such as Twitter:
ā¢āÆ When do we need to post?
ā¢āÆ How often should a #hashtag be posted?
These questions emphasize the time features of our
twitting activity. They would be controlled much more easily
compared to the followings: What we post and how many
number of followers we have.
To be mobile in dense granular media such as highly
packed beads on surface waves:
ā¢āÆ Do single beads move independently or form a group?
ā¢āÆ Is the trajectory of each bead regular in time?
The quantification of the bead dynamics shows that the
beads perform heterogenous motion with a distinct time
scale to characterize this heterogeneity.
restricted amount of attention restricted amount of space
ā¢āÆ Restricted amount of sources forces social
and physical systems to present
emergence of order.
hypothesis
ā¢āÆ Twitter users want to spread their messages and
beads under driving want to be mobile. As a
result, the twitter users collectively advertise and
the beads form groups to move together. Both
systems self-organize and create dynamic
heterogeneity.
The origin of the fluctuations in
dynamics would be the same origins:
Therefore, the interpretation of the dynamic
heterogeneity of the beads in a critical limit
would help to characterize viral memes
(#hashtags) in twitter.
Refs:
1āÆ C. Sanlı et al. (arXiv - 2013).
2āÆ L. Berthier (2011).
Refs:
1āÆ H. Simon (1971).
2āÆ L. Weng et al. (2012).
3āÆ J. P. Gleeson et al. (2014).
0 12 24 36 48 60 72 84
0
10
20
30
40
time (hours)
numberoftweets/unittime
daily tweet cycles
propogations of #hashtags
0 10 20 30 40 50 60 70 80 90
0
5
10
15
20
25
Ļ4
(l=2R,Ļ)
Ļ (s)
Ļ=0.652
Ļ=0.725
Ļ=0.741
Ļ=0.749
Ļ=0.753
Ļ=0.755
Ļ=0.760
Ļ=0.761
Ļ=0.762
Ļ=0.766
Ļ=0.770
Ļ=0.771
(a)
ĻĻ
4
(l, Ļ)
hĻ
4
(l, Ļ)
mobility of beads
spatiotemporal granular flow
time
0.2
0.6
1
1.4
1.8
*
P(r=2R,t)MM
Twitter #hashtag analysis
ā¢āÆ single beads: ā¢āÆ groups:
perimeter
ā¢āÆ quantifying
dynamic
heterogeneity:
time (s)
0 0.5 1 1.5 2
[Ref:2]
[Ref:2]
ā¢āÆ #nice:ā¢āÆ #pepsi:
ā¢āÆ observation:ā¢āÆ artificial representation:
0 10 20 30 40 50 60
time (hours)
0 1 2 3 4 5 6 7 8
time (hours)
ā¢āÆ analysis:
time (hours)
cumulative
time (hours)
cumulative
5mm
ā¢āÆ Inset: The displacement
field demonstrates local
heterogeneities in the flow.
ā¢āÆ A typical snapshot of an experiment:
The white spots indicate the positions
of the beads floating on surface
waves.
cedaysan@gmail.com
http://fcxn.wordpress.com!
Month 6 General Meeting
http://xn.unamur.be
Fluctuations drive viral memes in online social media:
Integrating criticality into network science
Ceyda Sanlı, Vsevolod Salnikov, Lionel Tabourier, and Renaud Lambiotte
CompleXity Networks, naXys, University of Namur, Belgium.
To spread our posts throughout online social network
such as Twitter:
ā¢āÆ When do we need to post?
ā¢āÆ How often should a #hashtag be posted?
These questions emphasize the time features of our
twitting activity. They would be controlled much more easily
compared to the followings: What we post and how many
number of followers we have.
To be mobile in dense granular media such as highly
packed beads on surface waves:
ā¢āÆ Do single beads move independently or form a group?
ā¢āÆ Is the trajectory of each bead regular in time?
The quantification of the bead dynamics shows that the
beads perform heterogenous motion with a distinct time
scale to characterize this heterogeneity.
restricted amount of attention restricted amount of space
ā¢āÆ Restricted amount of sources forces social
and physical systems to present
emergence of order.
hypothesis
ā¢āÆ Twitter users want to spread their messages and
beads under driving want to be mobile. As a
result, the twitter users collectively advertise and
the beads form groups to move together. Both
systems self-organize and create dynamic
heterogeneity.
The origin of the fluctuations in
dynamics would be the same origins:
Therefore, the interpretation of the dynamic
heterogeneity of the beads in a critical limit
would help to characterize viral memes
(#hashtags) in twitter.
Refs:
1āÆ C. Sanlı et al. (arXiv - 2013).
2āÆ L. Berthier (2011).
Refs:
1āÆ H. Simon (1971).
2āÆ L. Weng et al. (2012).
3āÆ J. P. Gleeson et al. (2014).
0 12 24 36 48 60 72 84
0
10
20
30
40
time (hours)
numberoftweets/unittime
daily tweet cycles
propogations of #hashtags
0 10 20 30 40 50 60 70 80 90
0
5
10
15
20
25
Ļ
4
(l=2R,Ļ)
Ļ (s)
Ļ=0.652
Ļ=0.725
Ļ=0.741
Ļ=0.749
Ļ=0.753
Ļ=0.755
Ļ=0.760
Ļ=0.761
Ļ=0.762
Ļ=0.766
Ļ=0.770
Ļ=0.771
(a)
ĻĻ
4
(l, Ļ)
h
Ļ
4
(l, Ļ)
mobility of beads
spatiotemporal granular flow
time
0.2
0.6
1
1.4
1.8*
P(r=2R,t)MM
Twitter #hashtag analysis
ā¢āÆ single beads: ā¢āÆ groups:
perimeter
ā¢āÆ quantifying
dynamic
heterogeneity:
time (s)
0 0.5 1 1.5 2
[Ref:2]
[Ref:2]
ā¢āÆ #nice:ā¢āÆ #pepsi:
ā¢āÆ observation:ā¢āÆ artificial representation:
0 10 20 30 40 50 60
time (hours)
0 1 2 3 4 5 6 7 8
time (hours)
ā¢āÆ analysis:
time (hours)
cumulative
time (hours)
cumulative
real !
vs!
artiļ¬cial!
Temporal pattern of online communication
dard Pearson
en from the
nt covers 3
d RT spike
lly RT and @,
or who (A,B)
e temporal
sses, the
coefļ¬cients
he average
that of @.
e three main
s, e.g., bursts
her/his own
or RE or RT
h that their
n a message
A B
C D
FIGURE 7 | Linear correlations of LV of the same users. The procedure
and representation of the coefļ¬cients follow the same strategy as introduced in
Figure 6. However, we now impose the same users in the same frequency
classes. Even though (A,C) present the agreement in the temporal patterns of
full and RT spike trains of the same users, with high correlation coefļ¬cients in
almost all frequency ranges, (B) indicates lower consistency between RT and
@ spike trains during entire activity āØaUā© and (D) provides a signiļ¬cant result.
While less temporal coherence is observed between RT and @ spike trains in
low popularity āØpUā©, the correlation drastically increases with āØpUā©.
boson on July 4, 2012 within a restricted time window, e.g., 6
days [38]. The main aim is to extract salient temporal patterns
of communication in various types of interaction observed in
Twitter such as retweet (RT), mention (@), and reply (RE).
Adopting the technique so-called local variation LV originally
introduced for neuron spike trains [39ā42] and recently has
applied to hashtag spike trains in Twitter [43, 44], we perform
detailed analysis on user communication spike trains. Showing
strong inļ¬uences of the frequency of the hashtag spike trains on
the resultant temporal patterns in the earlier work [43, 44], in
parallel we here examine the diļ¬erences in the patterns induced
by the frequency of the user communication spike trains, fU.
We investigate user communication spike trains in two
categories, the ļ¬rst set of users are the active ones, who users,
and the other set is composed of the passive users, whom users,
in the communication, and each user can appear in both pools.
For who, f simply gives what extend users contact to whom
Sanli and Lambiotte Temporal pattern of online communication
A B
C D
FIGURE 3 | Probability density function of the local variation LV , P(LV )
of who (A,B) and whom (C,D) users in various ranges of the two
communication frequencies, e.g., aU and pU. (A,C) describe the results of
the real data. When we only observe bursty communication patters in who
independently of the average user activity frequency āØaUā© in (A), signiļ¬cant
variations in LV by increasing the average user popularity āØpUā© are clear in (C).
The results prove that popular users are addressed randomly in time and
slightly more regular patterns observed in the most popular users. On the
other hand, (B,D) present the statistics of artiļ¬cially generated random spikes
serving as a null model and all frequency ranges give the distributions around
1, as expected for temporarily uncorrelated signals.
are signiļ¬cantly diļ¬erent for artiļ¬cial spike trains constructed
by randomly permuting the real full spike train and so expected
to generate non-stationary Poisson processes. Therefore, all
distributions are centered around 1 in this case, independently
of aU and pU, as shown in Figures 3B,D. The randomization and
obtaining a null set follow the same procedure explained in detail
in Sanli and Lambiotte [43].
Even though Figure 3 represents P(LV) of full spike trains,
i.e., all interactions together, P(LV) of individual RT, @, and
RE communication spike trains describes very similar temporal
behavior for both the who and whom. Figure 4 summarizes the
detail of P(LV), the mean of LV, Āµ(LV) with the corresponding
standard deviations Ļ(LV) as error bars, comparatively. The
results highlight that to classify the communication temporal
patterns neither the position of the users, whether active or
passive, nor the types of the interaction, but the frequency
of the communication fU such as aU and pU plays a major
role. All Figures 4AāD, we observe three regions: Bursts in low
fU, log10āØfUā© < 2.5, irregular uncorrelated (Poisson random)
dynamics in moderate and high fU, log10āØfUā© ā 2.5ā3, and
regular patterns in very high fU, log10āØfUā© > 3. This conclusion
supports the importance of frequency so time parameter overall
human behavior [14, 16]. Applying standard linear ļ¬ttings to the
underlying data of Figure 4, composed of 5104 data points for
whom, the understanding can be further proven. We observe the
signiļ¬cant negative trend of LV with increasing pU, i.e., the slope
is ā0.32.
A B
C D
FIGURE 4 | Mean Āµ of the local variation LV of the user communication
spike trains vs. the logarithmic average frequency log10āØfUā©. The results
of who are represented by red squares and blue circles describe that of whom.
Types of the interaction are investigated in detail: (A) All communications of
retweet, RT, mention, @, and reply, RE. (B) Only RT. (C) Only @. (D) Only RE.
Independent of the types of the interaction, the frequency of communication,
e.g., the activity of users aU and the popularity of users pU, designs overall
communication patterns. While low fU gives bursty patterns with LV > 1,
moderate fU indicates irregular uncorrelated (Poisson random) signals, e.g.,
LV ā 1. For all high fU, LV < 1 presenting the regularity of the
communications. The error bars show the corresponding standard variations.
We now perform a more thorough comparison in Figure 5,
on the disparity of LV in diļ¬erent frequency ranges. To this end,
we calculate the standard z-values in two ways. First, to compare
LV of the full spike trains with LV of only RT and also with LV of
only @ spike trains, LRT
V and L@
V, respectively, we introduce
z(fU) =
Āµ(Lk
V) ā Āµ0(LV)
Ļ(Lk
V)/ f k
U
(3)
Here, k in superscripts labels the interaction, e.g., either RT or
@. Precisely, Lk
V is determined based on a ļ¬ltered spike train
composed of the user timestamps of either RT or @, as already
used in (Figures 4B,C). In addition, Āµk is the mean of Lk
V, also
presented in Figures 4B,C, and Āµ0 is the mean LV of the full spike
train, given in Figure 4A.
In Figure 5, black squares show z-values of RT and black
circles describe z-values of @. For who in Figure 5A where LV
only presents bursty patterns (orange shaded area) and low aU,
we have small z-values proving the agreement of the temporal
patterns suggested by LV in the same aU. However, for whom
in Figure 5B where we have rich values of pU compared to the
values of aU, while z-values are small in bursty patterns (low pU,
orange area) as also in who and in regular patterns (high pU,
yellow area), larger zā@ value (the black circle) is calculated in
uncorrelated Poisson dynamics (moderate pU, purple area). The
disagreement of LV with large zā@ indicates that even though
Frontiers in Physics | www.frontiersin.org 4 September 2015 | Volume 3 | Article 79
Sanli and Lambiotte Temporal pattern of online communication
A B
C D
FIGURE 3 | Probability density function of the local variation LV , P(LV )
of who (A,B) and whom (C,D) users in various ranges of the two
communication frequencies, e.g., aU and pU. (A,C) describe the results of
the real data. When we only observe bursty communication patters in who
independently of the average user activity frequency āØaUā© in (A), signiļ¬cant
variations in LV by increasing the average user popularity āØpUā© are clear in (C).
The results prove that popular users are addressed randomly in time and
slightly more regular patterns observed in the most popular users. On the
other hand, (B,D) present the statistics of artiļ¬cially generated random spikes
serving as a null model and all frequency ranges give the distributions around
1, as expected for temporarily uncorrelated signals.
are signiļ¬cantly diļ¬erent for artiļ¬cial spike trains constructed
by randomly permuting the real full spike train and so expected
to generate non-stationary Poisson processes. Therefore, all
distributions are centered around 1 in this case, independently
of aU and pU, as shown in Figures 3B,D. The randomization and
obtaining a null set follow the same procedure explained in detail
in Sanli and Lambiotte [43].
Even though Figure 3 represents P(LV) of full spike trains,
i.e., all interactions together, P(LV) of individual RT, @, and
RE communication spike trains describes very similar temporal
behavior for both the who and whom. Figure 4 summarizes the
detail of P(LV), the mean of LV, Āµ(LV) with the corresponding
standard deviations Ļ(LV) as error bars, comparatively. The
results highlight that to classify the communication temporal
patterns neither the position of the users, whether active or
passive, nor the types of the interaction, but the frequency
of the communication fU such as aU and pU plays a major
role. All Figures 4AāD, we observe three regions: Bursts in low
fU, log10āØfUā© < 2.5, irregular uncorrelated (Poisson random)
dynamics in moderate and high fU, log10āØfUā© ā 2.5ā3, and
regular patterns in very high fU, log10āØfUā© > 3. This conclusion
supports the importance of frequency so time parameter overall
human behavior [14, 16]. Applying standard linear ļ¬ttings to the
underlying data of Figure 4, composed of 5104 data points for
whom, the understanding can be further proven. We observe the
signiļ¬cant negative trend of LV with increasing pU, i.e., the slope
is ā0.32.
A B
C D
FIGURE 4 | Mean Āµ of the local variation LV of the user communication
spike trains vs. the logarithmic average frequency log10āØfUā©. The results
of who are represented by red squares and blue circles describe that of whom.
Types of the interaction are investigated in detail: (A) All communications of
retweet, RT, mention, @, and reply, RE. (B) Only RT. (C) Only @. (D) Only RE.
Independent of the types of the interaction, the frequency of communication,
e.g., the activity of users aU and the popularity of users pU, designs overall
communication patterns. While low fU gives bursty patterns with LV > 1,
moderate fU indicates irregular uncorrelated (Poisson random) signals, e.g.,
LV ā 1. For all high fU, LV < 1 presenting the regularity of the
communications. The error bars show the corresponding standard variations.
We now perform a more thorough comparison in Figure 5,
on the disparity of LV in diļ¬erent frequency ranges. To this end,
we calculate the standard z-values in two ways. First, to compare
LV of the full spike trains with LV of only RT and also with LV of
only @ spike trains, LRT
V and L@
V, respectively, we introduce
z(fU) =
Āµ(Lk
V) ā Āµ0(LV)
Ļ(Lk
V)/ f k
U
(3)
Here, k in superscripts labels the interaction, e.g., either RT or
@. Precisely, Lk
V is determined based on a ļ¬ltered spike train
composed of the user timestamps of either RT or @, as already
used in (Figures 4B,C). In addition, Āµk is the mean of Lk
V, also
presented in Figures 4B,C, and Āµ0 is the mean LV of the full spike
train, given in Figure 4A.
In Figure 5, black squares show z-values of RT and black
circles describe z-values of @. For who in Figure 5A where LV
only presents bursty patterns (orange shaded area) and low aU,
we have small z-values proving the agreement of the temporal
patterns suggested by LV in the same aU. However, for whom
in Figure 5B where we have rich values of pU compared to the
values of aU, while z-values are small in bursty patterns (low pU,
orange area) as also in who and in regular patterns (high pU,
yellow area), larger zā@ value (the black circle) is calculated in
uncorrelated Poisson dynamics (moderate pU, purple area). The
disagreement of LV with large zā@ indicates that even though
Frontiers in Physics | www.frontiersin.org 4 September 2015 | Volume 3 | Article 79
Spike Trains
C. Sanli, CompleXity Networks, UNamur
time
count
t t0 f
VARIABLE OF A TIME SERIES
es in social system include interaction among agents. Considering online social
h as Twitter, we address self-organized optimizing of popularity of information.
we create time series of #hashtag propogation, user activity, and user #hashtag
series, if a time delay between successive events, inter-event interval ā§, is a
ndependent events, the distrubution of inter-event interval is Poissonian. If not,
events are observed and therefore forward propogation of a signal is a function
ral history. Thus, quantifying ā§ is crucial.
iable Lv is an alternative way to characterize whether a time series is Poissonian
onian. For a stationarly process, Lv is a ratio of the diāµerence between the
nterval of forward event and the inter-event interval of backward event to the
inter-event intervals. Suppose that a signal propogates in distinct time such as
ā§i, ā§i+1, . . . ā§N . Then, at ā§i, the inter-event interval of forward event is ā§i+1 =
the inter-event interval of backward event is ā§i = ā§i ā§i 1. Consequently, Lv
Lv =
3
N 2
N 1X
i=2
ā
(ā§i+1 ā§i) (ā§i ā§i 1)
(ā§i+1 ā§i) + (ā§i ā§i 1)
ā2
=
ā
ā§i+1 ā§i
ā§i+1 + ā§i
ā2
. (1)
he total appearance of a time series in distinct times. Multiple activity in same
.
1 the distribution of the inter-event interval of a time series is Poissonian. If a
onsiders signiļ¬cant amount of bursty activity, the distribution is non-Poissonian.
3 the distribution is automatically assumed to be Poissonian.
S OF Lv
time series can be deļ¬ned as how many activity proceeded in distinct times ā§i.
ur in multiple diāµerent ā§i, N 3, the signal has high rank. If a time serie is
ā” 3, the signal has low rank.
2
0 50 100 150 200 250 300 350
0
0.5
1
1.5
2
2.5
3
3.5
Ļ
h
(hour)
L
v
< r
h
>=91127
< r
h
>=18553
< r
h
>= 1678
< r
h
>= 318
< r
h
>= 174
< r
h
>= 117
< r
h
>= 86
< r
h
>= 68
< r
h
>= 56
< r
h
>= 47
< r
h
>= 41
< r
h
>= 35
me series versus life time (ā§h) of the corresponsing
Randomly selected #hashtag activity from real
10
ā4
10
ā2
10
0
10
2
10
4
10
0
10
1
10
2
10
3
10
4
10
5
10
6
Ļ
h
(hour)
r
h
r =2h
FIG. 2. Rank of #hashtag rh versus life time of #hashtag ā§h.
2
. . .
10
ā4
10
ā2
10
0
10
2
10
4
10
0
10
1
10
2
10
3
10
4
10
5
10
6
Ļ
h
(hour)
r
h
r =2h
FIG. 2. Rank of #hashtag rh versus life time of #hashtag ā§h.
2
00:0012:0000:0012:0000:0012:0000:0012:0000:0012:0000:0012:0000:00
0
10
20
30
40
50
60
hour
count/min.
#hollande
#sarkozy
#votehollande
#fh2012
#france2012
FIG. 1. ...
hpi =
p
2
Social Dynamic Behavior Patterns
= popularity
data set: Higgs boson
discovery, July 4, 2012.
spike trains: !
user communication in twitter
retweet (RT), reply (RE),
and mention (@).
active (WHO) and
passive (WHOM):
frequency dominates
to design online social
dynamic behavior.
real spikes present signiļ¬cantly
different dynamics, comparing to
artiļ¬cial ones.
bursty who communicates to
popular whom randomly in time,
e.g. no temporal correlation while
addressing valorized whom.
RT and @ - dynamics is in an
alignment only for popular whom.
bursty!
regular!random!
a = activity !
bursty WHO!
popular WHOM
correlation of
LV provides a
comparison in
the patterns
between two
channels.
V
. It has been shown that helps at classifying
s successfully [39, 40, 42ā44]. Following the
a processes [39, 40, 43] conventional in neuron
], it is known that LV = 1 for temporarily
son random) irregular spike trains, and that
associated to a burstiness of the spike trains.
r values indicate a higher regularity of the time
m an analysis of LV on the user communication
ion (2) is performed through the spike trains
ultiple spikes taking place within 1 s. Such
d their impact on the value of LV has been
ed [43]. Figure 3 describes the distribution of
spike trains all together with RT, @, and RE
and whom (c, d). Grouping LV based on the
the activity of the who aU and the popularity
we examine the temporal patterns of the trains
s of aU and pU. For the real data in (a, c),
is always larger than 1 in any values of aU,
users playing a role in who contact to the
ommunications. However, in Figure 3C, we
ehavior of the whom users and bursts present
y increasing pU, LV ā 1 indicating that there
rrelation among the who referring the whom
smaller than 1 for the most popular users,
ncy toward regularity in the time series, as also
hashtag spike trains [43]. These observations
September 2015 | Volume 3 | Article 79
dressed by who such that their
r name is mentioned in a message
om who.
and whom such as the following-
t is to quantify online user
To reduce the complexity in
tudied here consider only a unique
that is the discovery of the Higgs
in the communication, and each user can appear in both pools.
For who, fU simply gives what extend users contact to whom
and so it is the activity of who, aU. On the other hand, for
whom, the generated spike trains present how often who refers
the messages or the user names of whom and therefore, fU is
the popularity of whom, pU. Providing comparative statistics on
LV of who and whom with increasing aU and pU, respectively,
we observe quite distinct temporal behavior of online users.
First, we observe an asymmetry between active and passive
interactions, as only the former give rise to hubs, with few users
attracting a large share of the attention. Moreover, who constantly
presents bursts, LV > 1 for all values of aU, whereas whom
demonstrates various dynamic behavior patterns, depending on
rg 6 September 2015 | Volume 3 | Article 79
Sanli and Lambiotte
the popularity: The
bursty time series, p
contacted by tempo
Poisson random sp
users with the ma
e.g., LV < 1.
These scenarios a
e.g., who or whom,
RT or @, suggestin
dominates to design
is also supported
the user pairs in th
linear correlation o
patterns. There, we
dynamic behavior
both metrics are com
users.
The analysis c
communication sp
relation in Twitter,
of connected users.
period of the data
the announcement
days after this date
of the communicat
during the announ
investigated in our
of the other researc
of the frequency in
i