Noisy information transmission through molecular interaction networks
Are Powerlaws Useful?
1. Are Power Laws Useful?
Michael P.H. Stumpf
Theoretical Systems Biology Group
07/03/2013
Are Power Laws Useful? Michael P.H. Stumpf 1 of 28
2. Outline
Power Laws: Ubiquituous, Universal, Useful?
Critical Phenomena and Scaling Behaviour
Empirical Power Laws
Simple Models of Network Evolution
More Normal Than Normal
Summary
Are Power Laws Useful? Michael P.H. Stumpf 2 of 28
3. What Do We Mean By A Powerlaw?
Power Law Relationships
Y ∝ Xλ
Are Power Laws Useful? Michael P.H. Stumpf Power Laws: Ubiquituous, Universal, Useful? 3 of 28
4. What Do We Mean By A Powerlaw?
Power Law Relationships
log(Y )
Y ∝ Xλ
log(X )
Are Power Laws Useful? Michael P.H. Stumpf Power Laws: Ubiquituous, Universal, Useful? 3 of 28
5. What Do We Mean By A Powerlaw?
Power Law Relationships
log(Y ) log(p(X ))
Y ∝ Xλ
log(X ) log(X )
Are Power Laws Useful? Michael P.H. Stumpf Power Laws: Ubiquituous, Universal, Useful? 3 of 28
6. Power Laws as Physical Scaling Relationships
Laminar Flow
r
R
x
Are Power Laws Useful? Michael P.H. Stumpf Critical Phenomena and Scaling Behaviour 4 of 28
7. Power Laws as Physical Scaling Relationships
Laminar Flow
r
R
x
Universality in Flow
For sufficiently low Reynolds number the velocity profile is universal
for all pipes and fluids:
v (r ) r2
= 1−
v (0) R2
Are Power Laws Useful? Michael P.H. Stumpf Critical Phenomena and Scaling Behaviour 4 of 28
8. Power Laws as Physical Scaling Relationships
1D Josephson-Junction Array in insulating and super conducting phase
Sondhi et al., Rev.Mod.Phys. 69:315 (1997).
As the critical temperature is approached from above, the correlation
length increases as some power of the reduced temperature,
−ν
T − Tc
ξ∝
Tc
Are Power Laws Useful? Michael P.H. Stumpf Critical Phenomena and Scaling Behaviour 4 of 28
9. Making Sense of Power Laws in Physics
The Ising Model
↑ ↓ ↑ ↓ ↓ ↓ ↑ ↓ ↑ ↑ ↑ ↑ ↑ ↑ ↑ ↑ ↑ ↑
↑ ↓ ↑ ↓ ↓ ↑ ↓ ↑ ↑ ↑ ↑ ↑ ↑ ↑ ↑ ↑ ↑ ↑
↓ ↓ ↓ ↑ ↓ ↑ ↑ ↑ ↑ ↑ ↑ ↓ ↑ ↑ ↑ ↑ ↑ ↑
↑ ↓ ↓ ↓ ↑ ↓ ↑ ↑ ↑ ↑ ↑ ↑ ↑ ↑ ↑ ↑ ↑ ↑
↓ ↑ ↑ ↓ ↓ ↓ ↑ ↑ ↑ ↑ ↑ ↑ ↑ ↑ ↑ ↑ ↑ ↑
↓ ↑ ↑ ↓ ↑ ↑ ↑ ↑ ↑ ↑ ↑ ↑ ↑ ↑ ↑ ↑ ↑ ↑
T > TC T ≈ TC T < TC
Are Power Laws Useful? Michael P.H. Stumpf Critical Phenomena and Scaling Behaviour 5 of 28
10. Making Sense of Power Laws in Physics
The Ising Model
↓ ↓ ↑ ↑ ↓ ↑ ↑ ↓ ↑ ↑ ↑ ↑ ↑ ↑ ↑ ↑ ↑ ↑
↓ ↑ ↓ ↓ ↑ ↑ ↓ ↑ ↑ ↑ ↑ ↑ ↑ ↑ ↑ ↑ ↑ ↑
↑ ↑ ↑ ↑ ↓ ↑ ↑ ↑ ↑ ↑ ↑ ↓ ↑ ↑ ↑ ↑ ↑ ↑
↓ ↓ ↓ ↓ ↓ ↓ ↑ ↑ ↑ ↑ ↑ ↑ ↑ ↑ ↑ ↑ ↑ ↑
↓ ↑ ↓ ↑ ↑ ↓ ↑ ↑ ↑ ↑ ↑ ↑ ↑ ↑ ↑ ↑ ↑ ↑
↑ ↓ ↓ ↓ ↓ ↓ ↑ ↑ ↑ ↑ ↑ ↑ ↑ ↑ ↑ ↑ ↑ ↑
T > TC T ≈ TC T < TC
Critical Exponents
For θC = T −TC we have for any macroscopic variable F (θC ) in the
TC
vicinity of the critical point θC ≈ 0,
F (θC ) = const. × θ−λ .
C
Are Power Laws Useful? Michael P.H. Stumpf Critical Phenomena and Scaling Behaviour 5 of 28
11. Making Sense of Power Laws in Physics
Renormalization Group Theory
↑ ↓ ↑ ↑ ↑ ↑
↓ ↑ ↑ ↑ ↑ ↑
↑ ↑ ↑ ↑ ↑ ↓
↑ ↑ ↑ ↑ ↑ ↑
↑ ↑ ↑ ↑ ↑ ↑
↑ ↑ ↑ ↑ ↑ ↑
Renormalization Group Transformation (RGT)
The partition function, Z of a physical system must be invariant under
the RGT:
Z= exp (−βH [Si ]) = exp −βH [Sα ] = Z
[Si ] [Sα ]
Are Power Laws Useful? Michael P.H. Stumpf Critical Phenomena and Scaling Behaviour 5 of 28
12. Making Sense of Power Laws in Physics
Renormalization Group Theory
↑ ↓ ↑ ↑ ↑ ↑
↓ ↑ ↑ ↑ ↑ ↑
↑ ↑ ↑ ↑ ↑ ↓
↑ ↑ ↑ ↑ ↑ ↑
↑ ↑ ↑ ↑ ↑ ↑
↑ ↑ ↑ ↑ ↑ ↑
Renormalization Group Transformation (RGT)
The partition function, Z of a physical system must be invariant under
the RGT:
Z= exp (−βH [Si ]) = exp −βH [Sα ] = Z
[Si ] [Sα ]
Are Power Laws Useful? Michael P.H. Stumpf Critical Phenomena and Scaling Behaviour 5 of 28
13. Making Sense of Power Laws in Physics
Renormalization Group Theory
↑ ↓ ↑ ↑ ↑ ↑
↓ ↑ ↑ ↑ ↑ ↑
↑ ↑ ↑ ↑ ↑ ↓
↑ ↑ ↑ ↑ ↑ ↑
↑ ↑ ↑ ↑ ↑ ↑
↑ ↑ ↑ ↑ ↑ ↑
Renormalization Group Transformation (RGT)
The partition function, Z of a physical system must be invariant under
the RGT:
Z= exp (−βH [Si ]) = exp −βH [Sα ] = Z
[Si ] [Sα ]
Are Power Laws Useful? Michael P.H. Stumpf Critical Phenomena and Scaling Behaviour 5 of 28
14. Making Sense of Power Laws in Physics
Renormalization Group Theory
↑ ↓ ↑ ↑ ↑ ↑
↑ ↑
↓ ↑ ↑ ↑ ↑ ↑
↑ ↑ ↑ ↑ ↑ ↓
↑ ↑
↑ ↑ ↑ ↑ ↑ ↑
↑ ↑ ↑ ↑ ↑ ↑
↑ ↑ ↑
↑ ↑ ↑ ↑ ↑ ↑
Renormalization Group Transformation (RGT)
The partition function, Z of a physical system must be invariant under
the RGT:
Z= exp (−βH [Si ]) = exp −βH [Sα ] = Z
[Si ] [Sα ]
Are Power Laws Useful? Michael P.H. Stumpf Critical Phenomena and Scaling Behaviour 5 of 28
15. Making Sense of Power Laws in Physics
Renormalization Group Theory
↑ ↓ ↑ ↑ ↑ ↑
↑ ↑
↓ ↑ ↑ ↑ ↑ ↑
↑ ↑ ↑ ↑ ↑ ↓
↑ ↑ ↑
↑ ↑ ↑ ↑ ↑ ↑
↑ ↑ ↑ ↑ ↑ ↑
↑ ↑ ↑
↑ ↑ ↑ ↑ ↑ ↑
Renormalization Group Transformation (RGT)
The partition function, Z of a physical system must be invariant under
the RGT:
Z= exp (−βH [Si ]) = exp −βH [Sα ] = Z
[Si ] [Sα ]
Are Power Laws Useful? Michael P.H. Stumpf Critical Phenomena and Scaling Behaviour 5 of 28
16. Making Sense of Power Laws in Physics
Renormalization Group Theory
↑ ↓ ↑ ↑ ↑ ↑
↓ ↑ ↑
↓ ↑ ↑ ↑ ↑ ↑
↑ ↑ ↑ ↑ ↑ ↓
↑ ↑ ↑
↑ ↑ ↑ ↑ ↑ ↑
↑ ↑ ↑ ↑ ↑ ↑
↑ ↑ ↑
↑ ↑ ↑ ↑ ↑ ↑
Renormalization Group Transformation (RGT)
The partition function, Z of a physical system must be invariant under
the RGT:
Z= exp (−βH [Si ]) = exp −βH [Sα ] = Z
[Si ] [Sα ]
Are Power Laws Useful? Michael P.H. Stumpf Critical Phenomena and Scaling Behaviour 5 of 28
17. Making Sense of Power Laws in Physics
Renormalization Group Theory
↑ ↓ ↑ ↑ ↑ ↑
↓ ↑ ↑
↓ ↑ ↑ ↑ ↑ ↑
↑ ↑ ↑ ↑ ↑ ↓
↑ ↑ ↑
↑ ↑ ↑ ↑ ↑ ↑
↑ ↑ ↑ ↑ ↑ ↑
↑ ↑ ↑
↑ ↑ ↑ ↑ ↑ ↑
Renormalization Group Transformation (RGT)
The partition function, Z of a physical system must be invariant under
the RGT:
Z= exp (−βH [Si ]) = exp −βH [Sα ] = Z
[Si ] [Sα ]
Are Power Laws Useful? Michael P.H. Stumpf Critical Phenomena and Scaling Behaviour 5 of 28
18. Making Sense of Power Laws in Physics
Renormalization Group Theory
↑ ↓ ↑ ↑ ↑ ↑
↓ ↑ ↑
↓ ↑ ↑ ↑ ↑ ↑
↑ ↑
↑ ↑ ↑ ↑ ↑ ↓
↑ ↑ ↑
↑ ↑ ↑ ↑ ↑ ↑
↑ ↑ ↑ ↑ ↑ ↑
↑ ↑ ↑
↑ ↑ ↑ ↑ ↑ ↑
Renormalization Group Transformation (RGT)
The partition function, Z of a physical system must be invariant under
the RGT:
Z= exp (−βH [Si ]) = exp −βH [Sα ] = Z
[Si ] [Sα ]
Are Power Laws Useful? Michael P.H. Stumpf Critical Phenomena and Scaling Behaviour 5 of 28
19. Behaviour Around Critical Points
The fixed points of the RGT define the possible macroscopic states of
the system, e.g. ferromagnet vs. paramagnet.
Of particular interest are the non-trivial fixed points, 0 < Tc , J < ∞,
which mark the occurrence of a phase transition.
Are Power Laws Useful? Michael P.H. Stumpf Critical Phenomena and Scaling Behaviour 6 of 28
20. Behaviour Around Critical Points
The fixed points of the RGT define the possible macroscopic states of
the system, e.g. ferromagnet vs. paramagnet.
Of particular interest are the non-trivial fixed points, 0 < Tc , J < ∞,
which mark the occurrence of a phase transition.
In this case changing the scale does not change the physics. For our
spin-system, for example, there is long-range correlation which
extends beyond the lattice spacing.
Are Power Laws Useful? Michael P.H. Stumpf Critical Phenomena and Scaling Behaviour 6 of 28
21. Behaviour Around Critical Points
The fixed points of the RGT define the possible macroscopic states of
the system, e.g. ferromagnet vs. paramagnet.
Of particular interest are the non-trivial fixed points, 0 < Tc , J < ∞,
which mark the occurrence of a phase transition.
In this case changing the scale does not change the physics. For our
spin-system, for example, there is long-range correlation which
extends beyond the lattice spacing.
Critical Points
At the critical point the correlation length diverges as a power law
ξ ∝ θ−ν
c
Note that knowledge of the critical point does not tell us necessarily
what the state of the system on either side is.
Are Power Laws Useful? Michael P.H. Stumpf Critical Phenomena and Scaling Behaviour 6 of 28
22. Heureka!
Mason Porter, http://www.quickmeme.com/meme/3sqh80/
Are Power Laws Useful? Michael P.H. Stumpf Empirical Power Laws 7 of 28
23. Simple Scaling Laws
West et al., PNAS, 99:2473 (2002).
May & Stumpf, Science, 290:2084 (2000).
Are Power Laws Useful? Michael P.H. Stumpf Empirical Power Laws 8 of 28
24. Simple Scaling Laws
West et al., PNAS, 99:2473 (2002).
May & Stumpf, Science, 290:2084 (2000).
Plausible Theories and Simple Models
Even when simple, plausible physical arguments can be put forward
In an area that is twice as large, we can accommodate four
times as many species, N ∝ A2
the empirical results are better described by other phenomenological
distributions.
Are Power Laws Useful? Michael P.H. Stumpf Empirical Power Laws 8 of 28
25. Illusions of Invariance
We consider off-spring weaning weight, w, and maternal weight, m,
and seek to understand w /m. If this ratio is invariant then log(w )
plotted against log(m) should have a regression slope of 1.0.
Are Power Laws Useful? Michael P.H. Stumpf Empirical Power Laws 9 of 28
26. Illusions of Invariance
We consider off-spring weaning weight, w, and maternal weight, m,
and seek to understand w /m. If this ratio is invariant then log(w )
plotted against log(m) should have a regression slope of 1.0.
The Inverse is not true! A slope of 1.0 does not imply invariance.
Are Power Laws Useful? Michael P.H. Stumpf Empirical Power Laws 9 of 28
27. Illusions of Invariance
We consider off-spring weaning weight, w, and maternal weight, m,
and seek to understand w /m. If this ratio is invariant then log(w )
plotted against log(m) should have a regression slope of 1.0.
The Inverse is not true! A slope of 1.0 does not imply invariance.
For example, consider the model where log(w ) = log(m) + log(c )
where log(c ) is a non-normally distributed error, , c ∼ U[0, 1].
Are Power Laws Useful? Michael P.H. Stumpf Empirical Power Laws 9 of 28
28. Illusions of Invariance
We consider off-spring weaning weight, w, and maternal weight, m,
and seek to understand w /m. If this ratio is invariant then log(w )
plotted against log(m) should have a regression slope of 1.0.
The Inverse is not true! A slope of 1.0 does not imply invariance.
For example, consider the model where log(w ) = log(m) + log(c )
where log(c ) is a non-normally distributed error, , c ∼ U[0, 1].
We then have
Var[log(w )] − Var[log(c )] Var[log(m)]
R2 = =
Var[log(w )] Var[log(m)] + Var[log(c )]
Are Power Laws Useful? Michael P.H. Stumpf Empirical Power Laws 9 of 28
29. Illusions of Invariance
We consider off-spring weaning weight, w, and maternal weight, m,
and seek to understand w /m. If this ratio is invariant then log(w )
plotted against log(m) should have a regression slope of 1.0.
The Inverse is not true! A slope of 1.0 does not imply invariance.
For example, consider the model where log(w ) = log(m) + log(c )
where log(c ) is a non-normally distributed error, , c ∼ U[0, 1].
We then have
Var[log(w )] − Var[log(c )] Var[log(m)]
R2 = =
Var[log(w )] Var[log(m)] + Var[log(c )]
Nee et al., Science, 309:1236 (2005).
Are Power Laws Useful? Michael P.H. Stumpf Empirical Power Laws 9 of 28
30. Scale-Free Networks
The Beginnings
A:
Actor collaboration; B: WWW; C: Power grid
Barabasi & Albert, Science, 286:510 (1999).
Ever since many real-world
networks have been
“discovered” to be
Partial alignment of Human and Fly protein-interaction network (PIN).
scale-free.
Are Power Laws Useful? Michael P.H. Stumpf Empirical Power Laws 10 of 28
31. Scale-Free Networks
The Beginnings
A:
Actor collaboration; B: WWW; C: Power grid
Barabasi & Albert, Science, 286:510 (1999).
Ever since many real-world
networks have been
“discovered” to be
Partial alignment of Human and Fly protein-interaction network (PIN).
scale-free.
What Are “Scale-Free” Networks?
Typically this means that the degree distributions is scale-free, i.e.
Pr(αk )
= const. ∀k .
Pr(k )
Are Power Laws Useful? Michael P.H. Stumpf Empirical Power Laws 10 of 28
33. Are Networks Scale-Free?
If D = {d1 , . . . , dn } is the empirical degree
distribution and Prm (k |θ) the probability to
observe degree k for model m ∈ M, with
{M1 , . . . , Mq }, and parameter θ then the
likelihood is given by
n
Lm (θm ) = Prm (k |θ).
i =1
Saccharomyces cerevisiae PIN
This allows us to compare different models in
light of the data (using e.g. the AIC or BIC to
enforce parsimony).
Are Power Laws Useful? Michael P.H. Stumpf Empirical Power Laws 11 of 28
34. Are Networks Scale-Free?
If D = {d1 , . . . , dn } is the empirical degree
distribution and Prm (k |θ) the probability to
observe degree k for model m ∈ M, with
{M1 , . . . , Mq }, and parameter θ then the
likelihood is given by
n
Lm (θm ) = Prm (k |θ).
i =1
Saccharomyces cerevisiae PIN
This allows us to compare different models in
light of the data (using e.g. the AIC or BIC to
enforce parsimony).
So far, no network has been found to be
scale-free when proper statistical analysis was
applied.
Stumpf & Ingram, Europhys. Lett. 71:152 (2005); Tanaka et al., FEBS Lett. 579:5140 (2005); Khanin & Wit, J.Comp.Biol.
13:810 (2006).
Are Power Laws Useful? Michael P.H. Stumpf Empirical Power Laws 11 of 28
35. Real Networks Are Scale Rich
Tanaka, Phys.Rev.Lett. 94:168101 (2005).
Are Power Laws Useful? Michael P.H. Stumpf Empirical Power Laws 12 of 28
36. Other Statistical Challenges
Incomplete and Noisy Data
Sub-nets of scale
free networks are
not scale-free.
Stumpf et al., PNAS 102:4221,
(2005);
Wiuf & Stumpf, Proc.Roy.Soc. A
462:1181 (2006).
Are Power Laws Useful? Michael P.H. Stumpf Empirical Power Laws 13 of 28
37. Other Statistical Challenges
Incomplete and Noisy Data
Sub-nets of scale
free networks are
not scale-free.
Stumpf et al., PNAS 102:4221,
(2005);
Wiuf & Stumpf, Proc.Roy.Soc. A
462:1181 (2006).
Truncated Power Laws
Pr(k ) ∝ k −λ for klow < k < khigh
It is hard to see what is gained by this: the statistical power is lower
than that of other mixture models, and the elegance of the
interpretation is no longer given.
Are Power Laws Useful? Michael P.H. Stumpf Empirical Power Laws 13 of 28
38. Evolving Networks
α
δ
δ
α
δ
γ
δ
Are Power Laws Useful? Michael P.H. Stumpf
γ
Simple Models of Network Evolution 14 of 28
39. Evolving Networks
Model-Based Evolutionary Analysis
• For sequence data we use models of nucleotide substitution in
order to infer phylogenies in a likelihood or Bayesian framework.
• None of these models — even the general time-reversible model
— are particularly realistic; but by allowing for complicating factors
e.g. rate variation we capture much of the variability observed
across a phylogenetic panel.
• Modes of network evolution will be even more complicated and
exhibit high levels of contingency; moreover the structure and
function of different parts of the network will be intricately linked.
• Nevertheless we believe that modelling the processes underlying
the evolution of networks can provide useful insights; in particular
we can study how functionality is distributed across groups of
genes.
Are Power Laws Useful? Michael P.H. Stumpf Simple Models of Network Evolution 14 of 28
40. Network Evolution Models
(a) Duplication attachment (b) Duplication attachment
with complimentarity
wj
(c) Linear preferential wi
(d) General scale-free
attachment
Are Power Laws Useful? Michael P.H. Stumpf Simple Models of Network Evolution 15 of 28
41. ABC on Networks
Summarizing Networks
• Data are noisy and incomplete.
• We can simulate models of network
evolution, but this does not allow us to
calculate likelihoods for all but very
trivial models.
• There is also no sufficient statistic that
would allow us to summarize networks,
so ABC approaches require some
thought.
• Many possible summary statistics of
networks are expensive to calculate.
Full likelihood: Wiuf et al., PNAS (2006).
ABC: Ratman et al., PLoS Comp.Biol. (2008).
Stumpf & Wiuf, J. Roy. Soc. Interface (2010).
Are Power Laws Useful? Michael P.H. Stumpf Simple Models of Network Evolution 16 of 28
42. Graph Spectrum
c a b c d e
0 1 1 1 0 a
a d e
1 0 1 1 0 b
A = 1 1 0 0 0 c
1 1 0 0 1 d
b 0 0 0 1 0 e
Graph Spectra
Given a graph G comprised of a set of nodes N and edges (i , j ) ∈ E
with i , j ∈ N, the adjacency matrix, A, of the graph is defined by
1 if (i , j ) ∈ E ,
ai ,j =
0 otherwise.
The eigenvalues, λ, of this matrix provide one way of defining the
graph spectrum.
Are Power Laws Useful? Michael P.H. Stumpf Simple Models of Network Evolution 17 of 28
43. Spectral Distances
A simple distance measure between graphs having adjacency
matrices A and B, known as the edit distance, is to count the number
of edges that are not shared by both graphs,
D (A, B ) = (ai ,j − bi ,j )2 .
i ,j
Are Power Laws Useful? Michael P.H. Stumpf Simple Models of Network Evolution 18 of 28
44. Spectral Distances
A simple distance measure between graphs having adjacency
matrices A and B, known as the edit distance, is to count the number
of edges that are not shared by both graphs,
D (A, B ) = (ai ,j − bi ,j )2 .
i ,j
However for unlabelled graphs we require some mapping h from
i ∈ NA to i ∈ NB that minimizes the distance
D (A, B ) Dh (A, B ) = (ai ,j − bh(i ),h(j ) )2 ,
i ,j
Are Power Laws Useful? Michael P.H. Stumpf Simple Models of Network Evolution 18 of 28
45. Spectral Distances
A simple distance measure between graphs having adjacency
matrices A and B, known as the edit distance, is to count the number
of edges that are not shared by both graphs,
D (A, B ) = (ai ,j − bi ,j )2 .
i ,j
However for unlabelled graphs we require some mapping h from
i ∈ NA to i ∈ NB that minimizes the distance
D (A, B ) Dh (A, B ) = (ai ,j − bh(i ),h(j ) )2 ,
i ,j
Given a spectrum (which is relatively cheap to compute) we have
(α) (β) 2
D (A, B ) = λl − λl
l
Are Power Laws Useful? Michael P.H. Stumpf Simple Models of Network Evolution 18 of 28
46. ABC using Graph Spectra
For an observed network, N, and a simulated network, Sθ , we use the
distance between the spectra
(N) (S) 2
D (N, Sθ ) = λl − λl ,
l
in our ABC SMC procedure. Note that this distance is a close lower
bound on the distance between the raw data; we therefore do not
have to bother with summary statistics.
Also, calculating graph spectra costs as much as calculating other
O (N 3 ) statistics (such as all shortest paths, the network diameter or
the within-reach distribution).
Thorne & Stumpf, J.Roy.Soc. Interface, 9:2653 (2012).
Are Power Laws Useful? Michael P.H. Stumpf Simple Models of Network Evolution 19 of 28
47. Protein Interaction Network Data
Species Proteins Interactions Genome size Sampling fraction
S.cerevisiae 5035 22118 6532 0.77
D. melanogaster 7506 22871 14076 0.53
H. pylori 715 1423 1589 0.45
E. coli 1888 7008 5416 0.35
Thorne & Stumpf, J.Roy.Soc. Interface, 9:2653 (2012).
Are Power Laws Useful? Michael P.H. Stumpf Simple Models of Network Evolution 20 of 28
48. Protein Interaction Network Data
Species Proteins Interactions Genome size Sampling fraction
S.cerevisiae 5035 22118 6532 0.77
D. melanogaster 7506 22871 14076 0.53
H. pylori 715 1423 1589 0.45
E. coli 1888 7008 5416 0.35
0.5
0.4
Model probability
Organism
0.3 S.cerevisae
D.melanogaster
H.pylori
E.coli
0.2
0.1
0.0
DA DAC LPA
Model
SF DACL DACR Thorne & Stumpf, J.Roy.Soc. Interface, 9:2653 (2012).
Are Power Laws Useful? Michael P.H. Stumpf Simple Models of Network Evolution 20 of 28
49. Protein Interaction Network Data
Species Proteins Interactions Genome size Sampling fraction
S.cerevisiae 5035 22118 6532 0.77
D. melanogaster 7506 22871 14076 0.53
H. pylori 715 1423 1589 0.45
E. coli 1888 7008 5416 0.35
Model Selection
0.5
• Inference here was based on all
0.4 the data, not summary
statistics.
Model probability
Organism
0.3 S.cerevisae
D.melanogaster
H.pylori
• Duplication models receive the
E.coli
0.2 strongest support from the data.
• Several models receive support
0.1
and no model is chosen
0.0
unambiguously.
DA DAC LPA
Model
SF DACL DACR Thorne & Stumpf, J.Roy.Soc. Interface, 9:2653 (2012).
Are Power Laws Useful? Michael P.H. Stumpf Simple Models of Network Evolution 20 of 28
50. PIN Model Evolution
Are Power Laws Useful? Michael P.H. Stumpf Simple Models of Network Evolution 21 of 28
51. Power Laws As Phenomenological Models
We have seen above that power laws emerge naturally in the context
of continuous phase transitions; but we have also seen that they offer
at best limited insights for finite systems.
Why do they nevertheless appear so often?
Are Power Laws Useful? Michael P.H. Stumpf More Normal Than Normal 22 of 28
52. Power Laws As Phenomenological Models
We have seen above that power laws emerge naturally in the context
of continuous phase transitions; but we have also seen that they offer
at best limited insights for finite systems.
Why do they nevertheless appear so often?
Revisiting the Renormalization Group
Let f (X ) be a probability distribution. Then the RGT acting on it is
defined as ∞
Ta f (X = x ) := |a| f (ax − s)f (s)ds.
−∞
Ta f (X ) is the pdf of the random variable
X1 + X2
Y = with X1 , X2 ∼ f (X ).
a
Here a controls the qualitative properties of the transformation.
Calvo et al., J.Stat.Phys. 141:409 (2010).
Are Power Laws Useful? Michael P.H. Stumpf More Normal Than Normal 22 of 28
53. Renormalization Group Transformation of PDFs
Here we are after the fixed points, f0 , of the RGT,
Ta f0 (x ) = f0 (x )
Are Power Laws Useful? Michael P.H. Stumpf More Normal Than Normal 23 of 28
54. Renormalization Group Transformation of PDFs
Here we are after the fixed points, f0 , of the RGT,
Ta f0 (x ) = f0 (x )
When all moments of f (X ) exist and are finite then we can determine
the moments of the transformed distribution,
n
1 n
Eg [Y n ] = ETa f [Y n ] = Ef X i Ef X n−i
an i
i =0
Are Power Laws Useful? Michael P.H. Stumpf More Normal Than Normal 23 of 28
55. Renormalization Group Transformation of PDFs
Here we are after the fixed points, f0 , of the RGT,
Ta f0 (x ) = f0 (x )
When all moments of f (X ) exist and are finite then we can determine
the moments of the transformed distribution,
n
1 n
Eg [Y n ] = ETa f [Y n ] = Ef X i Ef X n−i
an i
i =0
Some Fixed Points of the RGT
√ √ √
a< 2 a= 2 a> 2
(2n)!
lim ETa f0 [x 2 ] = ∞
m Ef0 [x 2n ] = Ef0 [x 2 ])n lim ETa f0 [x 2 ] = 0
m
m→∞ n!2n m→∞
Are Power Laws Useful? Michael P.H. Stumpf More Normal Than Normal 23 of 28
56. Renormalization Group Transformation of PDFs
Here we are after the fixed points, f0 , of the RGT,
Ta f0 (x ) = f0 (x )
When all moments of f (X ) exist and are finite then we can determine
the moments of the transformed distribution,
n
1 n
Eg [Y n ] = ETa f [Y n ] = Ef X i Ef X n−i
an i
i =0
Some Fixed Points of the RGT
√ √ √
a< 2 a= 2 a> 2
(2n)!
lim ETa f0 [x 2 ] = ∞
m Ef0 [x 2n ] = Ef0 [x 2 ])n lim ETa f0 [x 2 ] = 0
m
m→∞ n!2n m→∞
— N(0, 1) δ(x )
Are Power Laws Useful? Michael P.H. Stumpf More Normal Than Normal 23 of 28
57. Central Limit Theorems
The conventional central limit theorem emerges as the fixed point of
the RGT for distributions where all moments are finite.
Are Power Laws Useful? Michael P.H. Stumpf More Normal Than Normal 24 of 28
58. Central Limit Theorems
The conventional central limit theorem emerges as the fixed point of
the RGT for distributions where all moments are finite.
Now we look at the characteristics function, φ(t ) = Ef [eitX ] and obtain
for the fixed points, φ0 (t ),
ϕ0 (t /a)2 = ϕ0 (t )
Are Power Laws Useful? Michael P.H. Stumpf More Normal Than Normal 24 of 28
59. Central Limit Theorems
The conventional central limit theorem emerges as the fixed point of
the RGT for distributions where all moments are finite.
Now we look at the characteristics function, φ(t ) = Ef [eitX ] and obtain
for the fixed points, φ0 (t ),
ϕ0 (t /a)2 = ϕ0 (t )
General Fixed Points
´
The general fixed points of the RGT are given by the Levy-stable laws,
ϕ0 (t ) = Sα,A) (k ) := exp −A|t |α θ(t ) − A|t |α θ(−k )
¯
with |a| = 21/α , A the complex conjugate of A and θ(x ) the Heaviside
¯
step function.
For α = 2 we recover the Gaussian, and for all α < 2 we obtain the
heavy-tailed stable laws. For α < 1 the mean is infinite.
Are Power Laws Useful? Michael P.H. Stumpf More Normal Than Normal 24 of 28
60. Power Law vs. Gaussian Distribution
From the above we see that the conventional CLT is in fact a very
special case of a much more general form of the CLT.
Thus we would expect fat-tailed distributions (by whichever sensible
definition) to occur frequently.
Are Power Laws Useful? Michael P.H. Stumpf More Normal Than Normal 25 of 28
61. Power Law vs. Gaussian Distribution
From the above we see that the conventional CLT is in fact a very
special case of a much more general form of the CLT.
Thus we would expect fat-tailed distributions (by whichever sensible
definition) to occur frequently.
Gaussian Distributions are stable under aggregation and
marginalization.
Are Power Laws Useful? Michael P.H. Stumpf More Normal Than Normal 25 of 28
62. Power Law vs. Gaussian Distribution
From the above we see that the conventional CLT is in fact a very
special case of a much more general form of the CLT.
Thus we would expect fat-tailed distributions (by whichever sensible
definition) to occur frequently.
Gaussian Distributions are stable under aggregation and
marginalization.
Scaling Distributions are stable under aggregation, mixture,
maximisation and marginalization.
Are Power Laws Useful? Michael P.H. Stumpf More Normal Than Normal 25 of 28
63. Power Law vs. Gaussian Distribution
From the above we see that the conventional CLT is in fact a very
special case of a much more general form of the CLT.
Thus we would expect fat-tailed distributions (by whichever sensible
definition) to occur frequently.
Gaussian Distributions are stable under aggregation and
marginalization.
Scaling Distributions are stable under aggregation, mixture,
maximisation and marginalization.
More Normal Than Normal
In this sense scaling or fat-tailed distributions should be expected to
occur very frequently. For low-variability data we obtain the Gaussian
as a fixed point.
Willinger et al., Proceedings of the 2004 Winter Simulation Conference, 130 (2004).
It is thus also easy to come up with simple mechanisms that give rise
to dispersed data.
Are Power Laws Useful? Michael P.H. Stumpf More Normal Than Normal 25 of 28
64. Power Laws, Evidence, Usefulness
Statistical Support
Allometric
Scaling
Internet
Zipf’s Law
C. elegans
nervous Mechanistic
system Sophistication
S. cerevisiae PIN
Stumpf & Porter, Science, 335:665 (2012).
Are Power Laws Useful? Michael P.H. Stumpf Summary 26 of 28
65. How to Check if You Have a Power Law in Your Data?
Y ∝ X −λ YES or NO ?
Are Power Laws Useful? Michael P.H. Stumpf Summary 27 of 28
66. How to Check if You Have a Power Law in Your Data?
Y ∝ X −λ YES or NO ?
1. Would a power law relationship offer profound new insights?
Are Power Laws Useful? Michael P.H. Stumpf Summary 27 of 28
67. How to Check if You Have a Power Law in Your Data?
Y ∝ X −λ YES or NO ?
1. Would a power law relationship offer profound new insights?
◦ Simply reporting a power law or scaling relationship is not exciting.
Are Power Laws Useful? Michael P.H. Stumpf Summary 27 of 28
68. How to Check if You Have a Power Law in Your Data?
Y ∝ X −λ YES or NO ?
1. Would a power law relationship offer profound new insights?
◦ Simply reporting a power law or scaling relationship is not exciting.
◦ Could you just have a very dispersed random variable, Y ?
Are Power Laws Useful? Michael P.H. Stumpf Summary 27 of 28
69. How to Check if You Have a Power Law in Your Data?
Y ∝ X −λ YES or NO ?
1. Would a power law relationship offer profound new insights?
◦ Simply reporting a power law or scaling relationship is not exciting.
◦ Could you just have a very dispersed random variable, Y ?
2. Does it extend over at least three orders of magnitude in both
variables (sanity check)?
Are Power Laws Useful? Michael P.H. Stumpf Summary 27 of 28
70. How to Check if You Have a Power Law in Your Data?
Y ∝ X −λ YES or NO ?
1. Would a power law relationship offer profound new insights?
◦ Simply reporting a power law or scaling relationship is not exciting.
◦ Could you just have a very dispersed random variable, Y ?
2. Does it extend over at least three orders of magnitude in both
variables (sanity check)?
3. Does the power law relationship hold up in comparison to other
distributions (log-normal, stretched exponential, negative
binomial)?
Are Power Laws Useful? Michael P.H. Stumpf Summary 27 of 28
71. How to Check if You Have a Power Law in Your Data?
Y ∝ X −λ YES or NO ?
1. Would a power law relationship offer profound new insights?
◦ Simply reporting a power law or scaling relationship is not exciting.
◦ Could you just have a very dispersed random variable, Y ?
2. Does it extend over at least three orders of magnitude in both
variables (sanity check)?
3. Does the power law relationship hold up in comparison to other
distributions (log-normal, stretched exponential, negative
binomial)?
4. Have you got a non-trivial and meaningful theoretical model that
gives rise to the power law and which yields mechanistic insights?
Are Power Laws Useful? Michael P.H. Stumpf Summary 27 of 28
72. How to Check if You Have a Power Law in Your Data?
Y ∝ X −λ YES or NO ?
1. Would a power law relationship offer profound new insights?
◦ Simply reporting a power law or scaling relationship is not exciting.
◦ Could you just have a very dispersed random variable, Y ?
2. Does it extend over at least three orders of magnitude in both
variables (sanity check)?
3. Does the power law relationship hold up in comparison to other
distributions (log-normal, stretched exponential, negative
binomial)?
4. Have you got a non-trivial and meaningful theoretical model that
gives rise to the power law and which yields mechanistic insights?
A Useful Scientific Theory
Failing at any of these hurdles does not mean that the scientific
problem is boring or trivial. Power laws add a lot to the theory of
critical phenomena/fluid dynamics etc. but very little elsewhere.
Are Power Laws Useful? Michael P.H. Stumpf Summary 27 of 28
73. Acknowledgements
Theoretical Systems Biology Group
• Imperial College
London
◦ Thomas Thorne
◦ William Kelly
• Oxford
University
◦ Robert May
◦ Mason Porter
• Kopenhagen
University
◦ Carsten Wiuf
www.theosysbio.bio.ic.ac.uk
Are Power Laws Useful? Michael P.H. Stumpf Summary 28 of 28