SlideShare a Scribd company logo
Markov Chains as methodology used by PageRank to
rank the Web Pages on Internet.
Sergio S. Guirreri - www.guirreri.host22.com
Google Technology User Group (GTUG) of Palermo.
5th March 2010
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 1 / 14
Overview
1 Concepts on Markov-Chains.
2 The idea of the PageRank algorithm.
3 The PageRank algorithm.
4 Solving the PageRank algorithm.
5 Conclusions.
6 Bibliography.
7 Internet web sites.
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 2 / 14
Concepts on Markov-Chains.
Stochastic Process and Markov-Chains.
Let assume the following stochastic process
{Xn; n = 0, 1, 2, . . . }
with values in a set E, called the state space, while its elements are called
state of the process.
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 3 / 14
Concepts on Markov-Chains.
Stochastic Process and Markov-Chains.
Let assume the following stochastic process
{Xn; n = 0, 1, 2, . . . }
with values in a set E, called the state space, while its elements are called
state of the process.
Let assume the set E is finite or countable.
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 3 / 14
Concepts on Markov-Chains.
Stochastic Process and Markov-Chains.
Let assume the following stochastic process
{Xn; n = 0, 1, 2, . . . }
with values in a set E, called the state space, while its elements are called
state of the process.
Let assume the set E is finite or countable.
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 3 / 14
Concepts on Markov-Chains.
Stochastic Process and Markov-Chains.
Let assume the following stochastic process
{Xn; n = 0, 1, 2, . . . }
with values in a set E, called the state space, while its elements are called
state of the process.
Let assume the set E is finite or countable.
Definition
A Markov Chain is a stochastic process Xn that hold the following feature:
Prob{Xn+1 = j|Xn = i, Xn−1 = in−1, . . . , X0 = i0} =
= Prob{Xn+1 = j|Xn = i} = pij(n)
where E is the state space set and j, i, in−1, . . . , i0 ∈ E, n ∈ N.
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 3 / 14
Concepts on Markov-Chains.
Stochastic Process and Markov-Chains.
Let assume the following stochastic process
{Xn; n = 0, 1, 2, . . . }
with values in a set E, called the state space, while its elements are called
state of the process.
Let assume the set E is finite or countable.
Definition
A Markov Chain is a stochastic process Xn that hold the following feature:
Prob{Xn+1 = j|Xn = i, Xn−1 = in−1, . . . , X0 = i0} =
= Prob{Xn+1 = j|Xn = i} = pij(n)
where E is the state space set and j, i, in−1, . . . , i0 ∈ E, n ∈ N.
The transition probability matrix P of the process Xn is composed of pij,
∀i, j ∈ E.
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 3 / 14
The idea of the PageRank algorithm.
PageRank’s idea.
The idea behind the PageRank algorithm is similar to the idea of the impact
factor index used to rank the Journals [Page et al.(1999)]
[Brin and Page(1998)] [Langville et al.(2008)].
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 4 / 14
The idea of the PageRank algorithm.
PageRank’s idea.
The idea behind the PageRank algorithm is similar to the idea of the impact
factor index used to rank the Journals [Page et al.(1999)]
[Brin and Page(1998)] [Langville et al.(2008)].
PageRank the impact factor of Internet.
The impact factor of a journal is defined as the average number of citations
per recently published papers in that journal.
By regarding each web page as a journal, this idea was then extended to
measure the importance of the web page in the PageRank Algorithm.
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 4 / 14
The idea of the PageRank algorithm.
Elements of the PageRank.
To illustrate the PageRank algorithm I define the following variables
[Ching and Ng(2006)]:
let be N the total number of web pages in the web.
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 5 / 14
The idea of the PageRank algorithm.
Elements of the PageRank.
To illustrate the PageRank algorithm I define the following variables
[Ching and Ng(2006)]:
let be N the total number of web pages in the web.
let be k the outgoing links of web page j.
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 5 / 14
The idea of the PageRank algorithm.
Elements of the PageRank.
To illustrate the PageRank algorithm I define the following variables
[Ching and Ng(2006)]:
let be N the total number of web pages in the web.
let be k the outgoing links of web page j.
let be Q the so called hyperlink matrix with elements:
Qij =



1
k if web page i is an outgoing link of web page j;
0 otherwise;
Qi,i > 0 ∀i.
(1)
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 5 / 14
The idea of the PageRank algorithm.
Elements of the PageRank.
To illustrate the PageRank algorithm I define the following variables
[Ching and Ng(2006)]:
let be N the total number of web pages in the web.
let be k the outgoing links of web page j.
let be Q the so called hyperlink matrix with elements:
Qij =



1
k if web page i is an outgoing link of web page j;
0 otherwise;
Qi,i > 0 ∀i.
(1)
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 5 / 14
The idea of the PageRank algorithm.
Elements of the PageRank.
To illustrate the PageRank algorithm I define the following variables
[Ching and Ng(2006)]:
let be N the total number of web pages in the web.
let be k the outgoing links of web page j.
let be Q the so called hyperlink matrix with elements:
Qij =



1
k if web page i is an outgoing link of web page j;
0 otherwise;
Qi,i > 0 ∀i.
(1)
The hyperlink matrix Q can be regarded as a transition probability matrix of
a Markov chain.
One may regard a surfer on the net as a random walker and the web pages as
the states of the Markov chain.
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 5 / 14
The PageRank algorithm.
The PageRank with irreducible Markov Chain.
Assuming that the Markov chain is irreduciblea
and aperiodicb
then the
steady-state probability distribution (p1, p2, . . . , pN )T
of the states (web
pages) exists.
aA Markov chain is irreducible if all states communicate with each other.
bA chain is periodic if there exists k > 1 such that the interval between two visits to some
state s is always a multiple of k. Therefore a chain is aperiodic if k=1.
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 6 / 14
The PageRank algorithm.
The PageRank with irreducible Markov Chain.
Assuming that the Markov chain is irreduciblea
and aperiodicb
then the
steady-state probability distribution (p1, p2, . . . , pN )T
of the states (web
pages) exists.
aA Markov chain is irreducible if all states communicate with each other.
bA chain is periodic if there exists k > 1 such that the interval between two visits to some
state s is always a multiple of k. Therefore a chain is aperiodic if k=1.
The PageRank
Each pi is the proportion of time that the surfer visiting the web page i.
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 6 / 14
The PageRank algorithm.
The PageRank with irreducible Markov Chain.
Assuming that the Markov chain is irreduciblea
and aperiodicb
then the
steady-state probability distribution (p1, p2, . . . , pN )T
of the states (web
pages) exists.
aA Markov chain is irreducible if all states communicate with each other.
bA chain is periodic if there exists k > 1 such that the interval between two visits to some
state s is always a multiple of k. Therefore a chain is aperiodic if k=1.
The PageRank
Each pi is the proportion of time that the surfer visiting the web page i.
The higher the value of pi is, the more important web page i will be.
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 6 / 14
The PageRank algorithm.
The PageRank with irreducible Markov Chain.
Assuming that the Markov chain is irreduciblea
and aperiodicb
then the
steady-state probability distribution (p1, p2, . . . , pN )T
of the states (web
pages) exists.
aA Markov chain is irreducible if all states communicate with each other.
bA chain is periodic if there exists k > 1 such that the interval between two visits to some
state s is always a multiple of k. Therefore a chain is aperiodic if k=1.
The PageRank
Each pi is the proportion of time that the surfer visiting the web page i.
The higher the value of pi is, the more important web page i will be.
The PageRank of web page i is then defined as pi.
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 6 / 14
The PageRank algorithm.
The PageRank with reducible Markov Chain
Since the matrix Q can be reducible to ensure that the steady-state
probability exists and is unique the following matrix P must be considered:
P = α




Q11 Q12 . . . Q1N
Q21 Q22 . . . Q2N
. . . . . . . . . . . .
QN1 QN2 . . . QNN



 +
(1 − α)
N




1 1 . . . 1
1 1 . . . 1
. . . . . . . . . . . .
1 1 . . . 1



 (2)
Where 0 < α < 1 and the most popular values of α are 0.85 and (1 − 1/N).
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 7 / 14
The PageRank algorithm.
The PageRank with reducible Markov Chain
Since the matrix Q can be reducible to ensure that the steady-state
probability exists and is unique the following matrix P must be considered:
P = α




Q11 Q12 . . . Q1N
Q21 Q22 . . . Q2N
. . . . . . . . . . . .
QN1 QN2 . . . QNN



 +
(1 − α)
N




1 1 . . . 1
1 1 . . . 1
. . . . . . . . . . . .
1 1 . . . 1



 (2)
Where 0 < α < 1 and the most popular values of α are 0.85 and (1 − 1/N).
Interpretation of PageRank
The idea of the PageRank (2) is that, for a network of N web pages, each web
page has an inherent importance of (1 − α)/N.
If a page Pi has an importance of pi, then it will contribute an importance of
α pi which is shared among the web pages that it points to.
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 7 / 14
The PageRank algorithm.
The PageRank with reducible Markov Chain
Solving the following linear system of equations subject to the normalization
constraint one can obtain the importance of web page Pi :





p1
p2
...
pN





= α




Q11 Q12 . . . Q1N
Q21 Q22 . . . Q2N
. . . . . . . . . . . .
QN1 QN2 . . . QNN









p1
p2
...
pN





+
(1 − α)
N





1
1
...
1





(3)
Since
N
i=1
pi = 1
the (3) can be rewritten as
(p1, p2, . . . , pN )T
= P(p1, p2, . . . , pN )T
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 8 / 14
Solving the PageRank algorithm.
The power method.
The power method is an iterative method for solving the dominant eigenvalue
and its corresponding eigenvectors of a matrix.
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 9 / 14
Solving the PageRank algorithm.
The power method.
The power method is an iterative method for solving the dominant eigenvalue
and its corresponding eigenvectors of a matrix.
Given an n × n matrix A, the hypothesis of power method are:
there is a single dominant eigenvalue. The eigenvalues can be sorted:
|λ1| > |λ2| ≥ |λ3| ≥ . . . |λn|
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 9 / 14
Solving the PageRank algorithm.
The power method.
The power method is an iterative method for solving the dominant eigenvalue
and its corresponding eigenvectors of a matrix.
Given an n × n matrix A, the hypothesis of power method are:
there is a single dominant eigenvalue. The eigenvalues can be sorted:
|λ1| > |λ2| ≥ |λ3| ≥ . . . |λn|
there is a linearly independent set of n eigenvectors:
{u(1)
, u(2)
, . . . , u(n)
}
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 9 / 14
Solving the PageRank algorithm.
The power method.
The power method is an iterative method for solving the dominant eigenvalue
and its corresponding eigenvectors of a matrix.
Given an n × n matrix A, the hypothesis of power method are:
there is a single dominant eigenvalue. The eigenvalues can be sorted:
|λ1| > |λ2| ≥ |λ3| ≥ . . . |λn|
there is a linearly independent set of n eigenvectors:
{u(1)
, u(2)
, . . . , u(n)
}
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 9 / 14
Solving the PageRank algorithm.
The power method.
The power method is an iterative method for solving the dominant eigenvalue
and its corresponding eigenvectors of a matrix.
Given an n × n matrix A, the hypothesis of power method are:
there is a single dominant eigenvalue. The eigenvalues can be sorted:
|λ1| > |λ2| ≥ |λ3| ≥ . . . |λn|
there is a linearly independent set of n eigenvectors:
{u(1)
, u(2)
, . . . , u(n)
}
so that
Au(i)
= λiu(i)
, i = 1, . . . , n.
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 9 / 14
Solving the PageRank algorithm.
The power method.
The initial vector x0 can be wrote:
x(0)
= a1u(1)
+ a2u(2)
+ · · · + anu(n)
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 10 / 14
Solving the PageRank algorithm.
The power method.
The initial vector x0 can be wrote:
x(0)
= a1u(1)
+ a2u(2)
+ · · · + anu(n)
iterating the initial vector with the A matrix:
Ak
x(0)
= a1Ak
u(1)
+ a2Ak
u(2)
+ · · · + anAk
u(n)
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 10 / 14
Solving the PageRank algorithm.
The power method.
The initial vector x0 can be wrote:
x(0)
= a1u(1)
+ a2u(2)
+ · · · + anu(n)
iterating the initial vector with the A matrix:
Ak
x(0)
= a1Ak
u(1)
+ a2Ak
u(2)
+ · · · + anAk
u(n)
= a1λk
1u(1)
+ a2λk
2u(2)
+ · · · + anλk
nu(n)
.
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 10 / 14
Solving the PageRank algorithm.
The power method.
The initial vector x0 can be wrote:
x(0)
= a1u(1)
+ a2u(2)
+ · · · + anu(n)
iterating the initial vector with the A matrix:
Ak
x(0)
= a1Ak
u(1)
+ a2Ak
u(2)
+ · · · + anAk
u(n)
= a1λk
1u(1)
+ a2λk
2u(2)
+ · · · + anλk
nu(n)
.
dividing by λk
1
Ak
x(0)
λk
1
= a1u(1)
+ a2
λ2
λ1
k
u(2)
+ · · · + an
λn
λ1
k
u(n)
,
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 10 / 14
Solving the PageRank algorithm.
The power method.
The initial vector x0 can be wrote:
x(0)
= a1u(1)
+ a2u(2)
+ · · · + anu(n)
iterating the initial vector with the A matrix:
Ak
x(0)
= a1Ak
u(1)
+ a2Ak
u(2)
+ · · · + anAk
u(n)
= a1λk
1u(1)
+ a2λk
2u(2)
+ · · · + anλk
nu(n)
.
dividing by λk
1
Ak
x(0)
λk
1
= a1u(1)
+ a2
λ2
λ1
k
u(2)
+ · · · + an
λn
λ1
k
u(n)
,
Since
|λi|
|λ1|
< 1 →
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 10 / 14
Solving the PageRank algorithm.
The power method.
The initial vector x0 can be wrote:
x(0)
= a1u(1)
+ a2u(2)
+ · · · + anu(n)
iterating the initial vector with the A matrix:
Ak
x(0)
= a1Ak
u(1)
+ a2Ak
u(2)
+ · · · + anAk
u(n)
= a1λk
1u(1)
+ a2λk
2u(2)
+ · · · + anλk
nu(n)
.
dividing by λk
1
Ak
x(0)
λk
1
= a1u(1)
+ a2
λ2
λ1
k
u(2)
+ · · · + an
λn
λ1
k
u(n)
,
Since
|λi|
|λ1|
< 1 → lim
k→∞
|λi|k
|λ1|k
= 0 →
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 10 / 14
Solving the PageRank algorithm.
The power method.
The initial vector x0 can be wrote:
x(0)
= a1u(1)
+ a2u(2)
+ · · · + anu(n)
iterating the initial vector with the A matrix:
Ak
x(0)
= a1Ak
u(1)
+ a2Ak
u(2)
+ · · · + anAk
u(n)
= a1λk
1u(1)
+ a2λk
2u(2)
+ · · · + anλk
nu(n)
.
dividing by λk
1
Ak
x(0)
λk
1
= a1u(1)
+ a2
λ2
λ1
k
u(2)
+ · · · + an
λn
λ1
k
u(n)
,
Since
|λi|
|λ1|
< 1 → lim
k→∞
|λi|k
|λ1|k
= 0 → Ak
≈ a1λk
1u(1)
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 10 / 14
Conclusions.
The power method and PageRank.
Results.
The matrix P of the PageRank algorithm is a stochastic matrix therefore
the largest eigenvalue is 1.
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 11 / 14
Conclusions.
The power method and PageRank.
Results.
The matrix P of the PageRank algorithm is a stochastic matrix therefore
the largest eigenvalue is 1.
The convergence rate of the power method depends on the ratio of λ2
λ1
.
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 11 / 14
Conclusions.
The power method and PageRank.
Results.
The matrix P of the PageRank algorithm is a stochastic matrix therefore
the largest eigenvalue is 1.
The convergence rate of the power method depends on the ratio of λ2
λ1
.
It has been showed by [Haveliwala and Kamvar(2003)] that for the second
largest eigenvalue of P, we have
|λ2| ≤ α 0 ≤ α ≤ 1.
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 11 / 14
Conclusions.
The power method and PageRank.
Results.
The matrix P of the PageRank algorithm is a stochastic matrix therefore
the largest eigenvalue is 1.
The convergence rate of the power method depends on the ratio of λ2
λ1
.
It has been showed by [Haveliwala and Kamvar(2003)] that for the second
largest eigenvalue of P, we have
|λ2| ≤ α 0 ≤ α ≤ 1.
Since λ1 = 1 the converge rate depends on α.
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 11 / 14
Conclusions.
The power method and PageRank.
Results.
The matrix P of the PageRank algorithm is a stochastic matrix therefore
the largest eigenvalue is 1.
The convergence rate of the power method depends on the ratio of λ2
λ1
.
It has been showed by [Haveliwala and Kamvar(2003)] that for the second
largest eigenvalue of P, we have
|λ2| ≤ α 0 ≤ α ≤ 1.
Since λ1 = 1 the converge rate depends on α.
The most popular value for α is 0.85. With this value it has been proved
that the power method on web data set of over 80 million pages converges
in about 50 iterations.
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 11 / 14
Conclusions.
Really thanks to GTUG Palermo
and
see you to the next meeting!
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 12 / 14
Bibliography.
Bibliography.
Brin, S. and Page, L. (1998).
The anatomy of a large-scale hypertextual Web search engine.
Computer networks and ISDN systems, 30(1-7), 107–117.
Ching, W. and Ng, M. (2006).
Markov Chains: Models, Algoritms and Applications.
Springer Science + Business Media, Inc.
Haveliwala, T. and Kamvar, M. (2003).
The second eigenvalue of the google matrix.
Technical report, Stanford University.
Langville, A., Meyer, C., and Fern´Andez, P. (2008).
Google’s PageRank and beyond: the science of search engine rankings.
The Mathematical Intelligencer, 30(1), 68–69.
Page, L., Brin, S., Motwani, R., and Winograd, T. (1999).
The PageRank Citation Ranking: Bringing Order to the Web.
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 13 / 14
Internet web sites.
Internet web sites.
Jon Atle Gulla (2007) - From Google Search to Semantic Exploration. -
Norwegian University of Science Technology -
www.slideshare.net/sveino/semantics-and-search?type=presentation
Steven Levy (2010) - Exclusive: How Google’s Algorithm Rules the Web - Wired
Magazine - www.wired.com/magazine/2010/02/ff_google_algorithm/
Ann Smarty (2009) - Let’s Try to Find All 200 Parameters in Google Algorithm -
Search Engine Journal -
www.searchenginejournal.com/200-parameters-in-google-algorithm/15457/.
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 14 / 14

More Related Content

What's hot

Search engine and web crawler
Search engine and web crawlerSearch engine and web crawler
Search engine and web crawlerishmecse13
 
How Google search works ppt
How Google search works pptHow Google search works ppt
How Google search works pptHardik Mahant
 
Introduction to the ethics of machine learning
Introduction to the ethics of machine learningIntroduction to the ethics of machine learning
Introduction to the ethics of machine learningDaniel Wilson
 
page ranking algorithm
page ranking algorithmpage ranking algorithm
page ranking algorithmJaved Khan
 
Everything You Didn't Know About Entity SEO
Everything You Didn't Know About Entity SEO Everything You Didn't Know About Entity SEO
Everything You Didn't Know About Entity SEO Sara Taher
 
The step by step guide to SEO Website Audit
The step by step guide to SEO Website Audit The step by step guide to SEO Website Audit
The step by step guide to SEO Website Audit amandacerry
 
Semantic search Bill Slawski DEEP SEA Con
Semantic search Bill Slawski DEEP SEA ConSemantic search Bill Slawski DEEP SEA Con
Semantic search Bill Slawski DEEP SEA ConBill Slawski
 
Link analysis : Comparative study of HITS and Page Rank Algorithm
Link analysis : Comparative study of HITS and Page Rank AlgorithmLink analysis : Comparative study of HITS and Page Rank Algorithm
Link analysis : Comparative study of HITS and Page Rank AlgorithmKavita Kushwah
 
Keyword Research and Topic Modeling in a Semantic Web
Keyword Research and Topic Modeling in a Semantic WebKeyword Research and Topic Modeling in a Semantic Web
Keyword Research and Topic Modeling in a Semantic WebBill Slawski
 
BrightonSEO - Master Crawl Budget Optimization for Enterprise Websites
BrightonSEO - Master Crawl Budget Optimization for Enterprise WebsitesBrightonSEO - Master Crawl Budget Optimization for Enterprise Websites
BrightonSEO - Master Crawl Budget Optimization for Enterprise WebsitesManick Bhan
 
Server Controls of ASP.Net
Server Controls of ASP.NetServer Controls of ASP.Net
Server Controls of ASP.NetHitesh Santani
 
Google page rank
Google page rankGoogle page rank
Google page rankYifan Li
 
On-Page SEO Techniques for 2022
On-Page SEO Techniques for 2022On-Page SEO Techniques for 2022
On-Page SEO Techniques for 2022Stephen Fraga
 
Opinion-based Article Ranking for Information Retrieval Systems: Factoids and...
Opinion-based Article Ranking for Information Retrieval Systems: Factoids and...Opinion-based Article Ranking for Information Retrieval Systems: Factoids and...
Opinion-based Article Ranking for Information Retrieval Systems: Factoids and...Koray Tugberk GUBUR
 
Introduction to Web Scraping using Python and Beautiful Soup
Introduction to Web Scraping using Python and Beautiful SoupIntroduction to Web Scraping using Python and Beautiful Soup
Introduction to Web Scraping using Python and Beautiful SoupTushar Mittal
 
Importance of Backlinks In SEO
Importance of Backlinks In SEOImportance of Backlinks In SEO
Importance of Backlinks In SEOAarav Infotech
 
What is Keyword Research & How to Do it ?
What is Keyword Research & How to Do it ? What is Keyword Research & How to Do it ?
What is Keyword Research & How to Do it ? Jam Hassan
 

What's hot (20)

Search engine and web crawler
Search engine and web crawlerSearch engine and web crawler
Search engine and web crawler
 
How Google search works ppt
How Google search works pptHow Google search works ppt
How Google search works ppt
 
Introduction to the ethics of machine learning
Introduction to the ethics of machine learningIntroduction to the ethics of machine learning
Introduction to the ethics of machine learning
 
page ranking algorithm
page ranking algorithmpage ranking algorithm
page ranking algorithm
 
Everything You Didn't Know About Entity SEO
Everything You Didn't Know About Entity SEO Everything You Didn't Know About Entity SEO
Everything You Didn't Know About Entity SEO
 
The step by step guide to SEO Website Audit
The step by step guide to SEO Website Audit The step by step guide to SEO Website Audit
The step by step guide to SEO Website Audit
 
Entity seo
Entity seoEntity seo
Entity seo
 
Semantic search Bill Slawski DEEP SEA Con
Semantic search Bill Slawski DEEP SEA ConSemantic search Bill Slawski DEEP SEA Con
Semantic search Bill Slawski DEEP SEA Con
 
Link analysis : Comparative study of HITS and Page Rank Algorithm
Link analysis : Comparative study of HITS and Page Rank AlgorithmLink analysis : Comparative study of HITS and Page Rank Algorithm
Link analysis : Comparative study of HITS and Page Rank Algorithm
 
Keyword Research and Topic Modeling in a Semantic Web
Keyword Research and Topic Modeling in a Semantic WebKeyword Research and Topic Modeling in a Semantic Web
Keyword Research and Topic Modeling in a Semantic Web
 
BrightonSEO - Master Crawl Budget Optimization for Enterprise Websites
BrightonSEO - Master Crawl Budget Optimization for Enterprise WebsitesBrightonSEO - Master Crawl Budget Optimization for Enterprise Websites
BrightonSEO - Master Crawl Budget Optimization for Enterprise Websites
 
Server Controls of ASP.Net
Server Controls of ASP.NetServer Controls of ASP.Net
Server Controls of ASP.Net
 
Foxtail Website Audit
Foxtail Website AuditFoxtail Website Audit
Foxtail Website Audit
 
Google page rank
Google page rankGoogle page rank
Google page rank
 
On-Page SEO Techniques for 2022
On-Page SEO Techniques for 2022On-Page SEO Techniques for 2022
On-Page SEO Techniques for 2022
 
Opinion-based Article Ranking for Information Retrieval Systems: Factoids and...
Opinion-based Article Ranking for Information Retrieval Systems: Factoids and...Opinion-based Article Ranking for Information Retrieval Systems: Factoids and...
Opinion-based Article Ranking for Information Retrieval Systems: Factoids and...
 
Introduction to Web Scraping using Python and Beautiful Soup
Introduction to Web Scraping using Python and Beautiful SoupIntroduction to Web Scraping using Python and Beautiful Soup
Introduction to Web Scraping using Python and Beautiful Soup
 
Image SEO
Image SEOImage SEO
Image SEO
 
Importance of Backlinks In SEO
Importance of Backlinks In SEOImportance of Backlinks In SEO
Importance of Backlinks In SEO
 
What is Keyword Research & How to Do it ?
What is Keyword Research & How to Do it ? What is Keyword Research & How to Do it ?
What is Keyword Research & How to Do it ?
 

Viewers also liked

The Google Pagerank algorithm - How does it work?
The Google Pagerank algorithm - How does it work?The Google Pagerank algorithm - How does it work?
The Google Pagerank algorithm - How does it work?Kundan Bhaduri
 
A Comparative Study of HITS vs PageRank Algorithms for Twitter Users Analysis
A Comparative Study of HITS vs PageRank Algorithms for Twitter Users AnalysisA Comparative Study of HITS vs PageRank Algorithms for Twitter Users Analysis
A Comparative Study of HITS vs PageRank Algorithms for Twitter Users AnalysisPoo Kuan Hoong
 
Population 3.5 - Spearman’S Rank
Population 3.5 - Spearman’S RankPopulation 3.5 - Spearman’S Rank
Population 3.5 - Spearman’S RankEcumene
 
LINEAR ALGEBRA BEHIND GOOGLE SEARCH
LINEAR ALGEBRA BEHIND GOOGLE SEARCHLINEAR ALGEBRA BEHIND GOOGLE SEARCH
LINEAR ALGEBRA BEHIND GOOGLE SEARCHDivyansh Verma
 
PageRank Algorithm In data mining
PageRank Algorithm In data miningPageRank Algorithm In data mining
PageRank Algorithm In data miningMai Mustafa
 
GCSE Geography: How And Why To Use Spearman’s Rank
GCSE Geography: How And Why To Use Spearman’s RankGCSE Geography: How And Why To Use Spearman’s Rank
GCSE Geography: How And Why To Use Spearman’s RankMark Cowan
 
Spearman Rank Correlation Presentation
Spearman Rank Correlation PresentationSpearman Rank Correlation Presentation
Spearman Rank Correlation Presentationcae_021
 
Halloween.Presentacion
Halloween.PresentacionHalloween.Presentacion
Halloween.PresentacionCarolinaMiau
 
Giornata Mondiale del Migrante e del Rifugiato
Giornata Mondiale del Migrante e del RifugiatoGiornata Mondiale del Migrante e del Rifugiato
Giornata Mondiale del Migrante e del RifugiatoMaike Loes
 
Secrets & Lies
Secrets & LiesSecrets & Lies
Secrets & LiesSam Baksh
 
Study skillsppt
Study skillspptStudy skillsppt
Study skillspptrnesbit
 
All-Pro Fasteners
All-Pro FastenersAll-Pro Fasteners
All-Pro FastenersTodd Grzych
 
Action adventure movie shots
Action adventure movie shotsAction adventure movie shots
Action adventure movie shotsstoliros
 
Presentation alevel work max
Presentation alevel work maxPresentation alevel work max
Presentation alevel work maxstoliros
 
A Story of Tiny Frog
A Story of Tiny FrogA Story of Tiny Frog
A Story of Tiny Frogbhatsumanta
 

Viewers also liked (20)

The Google Pagerank algorithm - How does it work?
The Google Pagerank algorithm - How does it work?The Google Pagerank algorithm - How does it work?
The Google Pagerank algorithm - How does it work?
 
A Comparative Study of HITS vs PageRank Algorithms for Twitter Users Analysis
A Comparative Study of HITS vs PageRank Algorithms for Twitter Users AnalysisA Comparative Study of HITS vs PageRank Algorithms for Twitter Users Analysis
A Comparative Study of HITS vs PageRank Algorithms for Twitter Users Analysis
 
Population 3.5 - Spearman’S Rank
Population 3.5 - Spearman’S RankPopulation 3.5 - Spearman’S Rank
Population 3.5 - Spearman’S Rank
 
LINEAR ALGEBRA BEHIND GOOGLE SEARCH
LINEAR ALGEBRA BEHIND GOOGLE SEARCHLINEAR ALGEBRA BEHIND GOOGLE SEARCH
LINEAR ALGEBRA BEHIND GOOGLE SEARCH
 
PageRank Algorithm In data mining
PageRank Algorithm In data miningPageRank Algorithm In data mining
PageRank Algorithm In data mining
 
GCSE Geography: How And Why To Use Spearman’s Rank
GCSE Geography: How And Why To Use Spearman’s RankGCSE Geography: How And Why To Use Spearman’s Rank
GCSE Geography: How And Why To Use Spearman’s Rank
 
Spearman Rank Correlation Presentation
Spearman Rank Correlation PresentationSpearman Rank Correlation Presentation
Spearman Rank Correlation Presentation
 
Halloween.Presentacion
Halloween.PresentacionHalloween.Presentacion
Halloween.Presentacion
 
Giornata Mondiale del Migrante e del Rifugiato
Giornata Mondiale del Migrante e del RifugiatoGiornata Mondiale del Migrante e del Rifugiato
Giornata Mondiale del Migrante e del Rifugiato
 
Secrets & Lies
Secrets & LiesSecrets & Lies
Secrets & Lies
 
Study skillsppt
Study skillspptStudy skillsppt
Study skillsppt
 
All-Pro Fasteners
All-Pro FastenersAll-Pro Fasteners
All-Pro Fasteners
 
Action adventure movie shots
Action adventure movie shotsAction adventure movie shots
Action adventure movie shots
 
birthday
birthdaybirthday
birthday
 
Presentation alevel work max
Presentation alevel work maxPresentation alevel work max
Presentation alevel work max
 
A Story of Tiny Frog
A Story of Tiny FrogA Story of Tiny Frog
A Story of Tiny Frog
 
Tema 2
Tema 2Tema 2
Tema 2
 
Farewell (Mae, batch 2011)
Farewell (Mae, batch 2011)Farewell (Mae, batch 2011)
Farewell (Mae, batch 2011)
 
A3 sergiodelpino
A3 sergiodelpinoA3 sergiodelpino
A3 sergiodelpino
 
Pati alegre (fil eminimizer)
Pati alegre (fil eminimizer)Pati alegre (fil eminimizer)
Pati alegre (fil eminimizer)
 

Similar to PageRank and Markov Chain

PageRank_algorithm_Nfaoui_El_Habib
PageRank_algorithm_Nfaoui_El_HabibPageRank_algorithm_Nfaoui_El_Habib
PageRank_algorithm_Nfaoui_El_HabibEl Habib NFAOUI
 
Pagerank
Pagerank Pagerank
Pagerank C C
 
Analysis of Rayleigh Quotient in Extrapolation Method to Accelerate the Compu...
Analysis of Rayleigh Quotient in Extrapolation Method to Accelerate the Compu...Analysis of Rayleigh Quotient in Extrapolation Method to Accelerate the Compu...
Analysis of Rayleigh Quotient in Extrapolation Method to Accelerate the Compu...IOSR Journals
 
A Generalization of the PageRank Algorithm : NOTES
A Generalization of the PageRank Algorithm : NOTESA Generalization of the PageRank Algorithm : NOTES
A Generalization of the PageRank Algorithm : NOTESSubhajit Sahu
 
page rank explication et exemple formule
page rank explication et exemple  formulepage rank explication et exemple  formule
page rank explication et exemple formuleRamiHarrathi1
 
Done reread deeperinsidepagerank
Done reread deeperinsidepagerankDone reread deeperinsidepagerank
Done reread deeperinsidepagerankJames Arnold
 
Deeper Inside PageRank (NOTES)
Deeper Inside PageRank (NOTES)Deeper Inside PageRank (NOTES)
Deeper Inside PageRank (NOTES)Subhajit Sahu
 
Random web surfer pagerank algorithm
Random web surfer pagerank algorithmRandom web surfer pagerank algorithm
Random web surfer pagerank algorithmalexandrelevada
 
Data.Mining.C.8(Ii).Web Mining 570802461
Data.Mining.C.8(Ii).Web Mining 570802461Data.Mining.C.8(Ii).Web Mining 570802461
Data.Mining.C.8(Ii).Web Mining 570802461Margaret Wang
 
Hui xie 591r_presentation
Hui xie 591r_presentationHui xie 591r_presentation
Hui xie 591r_presentationramiro
 
Markov chains and page rankGraphs.pdf
Markov chains and page rankGraphs.pdfMarkov chains and page rankGraphs.pdf
Markov chains and page rankGraphs.pdfrayyverma
 
PageRank in Multithreading
PageRank in MultithreadingPageRank in Multithreading
PageRank in MultithreadingShujian Zhang
 
Done reread sketchinglandscapesofpagefarmsnpcomplete(3)
Done reread sketchinglandscapesofpagefarmsnpcomplete(3)Done reread sketchinglandscapesofpagefarmsnpcomplete(3)
Done reread sketchinglandscapesofpagefarmsnpcomplete(3)James Arnold
 
Done reread sketchinglandscapesofpagefarmsnpcomplete
Done reread sketchinglandscapesofpagefarmsnpcompleteDone reread sketchinglandscapesofpagefarmsnpcomplete
Done reread sketchinglandscapesofpagefarmsnpcompleteJames Arnold
 
Done reread sketchinglandscapesofpagefarmsnpcomplete(2)
Done reread sketchinglandscapesofpagefarmsnpcomplete(2)Done reread sketchinglandscapesofpagefarmsnpcomplete(2)
Done reread sketchinglandscapesofpagefarmsnpcomplete(2)James Arnold
 
Prediction Model Using Web Usage Mining Techniques
Prediction Model Using Web Usage Mining TechniquesPrediction Model Using Web Usage Mining Techniques
Prediction Model Using Web Usage Mining TechniquesEditor IJCATR
 
Done reread the effect of new links on google pagerank
Done reread the effect of new links on google pagerankDone reread the effect of new links on google pagerank
Done reread the effect of new links on google pagerankJames Arnold
 

Similar to PageRank and Markov Chain (20)

PageRank_algorithm_Nfaoui_El_Habib
PageRank_algorithm_Nfaoui_El_HabibPageRank_algorithm_Nfaoui_El_Habib
PageRank_algorithm_Nfaoui_El_Habib
 
Pagerank
Pagerank Pagerank
Pagerank
 
Analysis of Rayleigh Quotient in Extrapolation Method to Accelerate the Compu...
Analysis of Rayleigh Quotient in Extrapolation Method to Accelerate the Compu...Analysis of Rayleigh Quotient in Extrapolation Method to Accelerate the Compu...
Analysis of Rayleigh Quotient in Extrapolation Method to Accelerate the Compu...
 
A Generalization of the PageRank Algorithm : NOTES
A Generalization of the PageRank Algorithm : NOTESA Generalization of the PageRank Algorithm : NOTES
A Generalization of the PageRank Algorithm : NOTES
 
page rank explication et exemple formule
page rank explication et exemple  formulepage rank explication et exemple  formule
page rank explication et exemple formule
 
Done reread deeperinsidepagerank
Done reread deeperinsidepagerankDone reread deeperinsidepagerank
Done reread deeperinsidepagerank
 
Deeper Inside PageRank (NOTES)
Deeper Inside PageRank (NOTES)Deeper Inside PageRank (NOTES)
Deeper Inside PageRank (NOTES)
 
Random web surfer pagerank algorithm
Random web surfer pagerank algorithmRandom web surfer pagerank algorithm
Random web surfer pagerank algorithm
 
Data.Mining.C.8(Ii).Web Mining 570802461
Data.Mining.C.8(Ii).Web Mining 570802461Data.Mining.C.8(Ii).Web Mining 570802461
Data.Mining.C.8(Ii).Web Mining 570802461
 
Page Rank Link Farm Detection
Page Rank Link Farm DetectionPage Rank Link Farm Detection
Page Rank Link Farm Detection
 
I04015559
I04015559I04015559
I04015559
 
Hui xie 591r_presentation
Hui xie 591r_presentationHui xie 591r_presentation
Hui xie 591r_presentation
 
Markov chains and page rankGraphs.pdf
Markov chains and page rankGraphs.pdfMarkov chains and page rankGraphs.pdf
Markov chains and page rankGraphs.pdf
 
PageRank in Multithreading
PageRank in MultithreadingPageRank in Multithreading
PageRank in Multithreading
 
Done reread sketchinglandscapesofpagefarmsnpcomplete(3)
Done reread sketchinglandscapesofpagefarmsnpcomplete(3)Done reread sketchinglandscapesofpagefarmsnpcomplete(3)
Done reread sketchinglandscapesofpagefarmsnpcomplete(3)
 
Done reread sketchinglandscapesofpagefarmsnpcomplete
Done reread sketchinglandscapesofpagefarmsnpcompleteDone reread sketchinglandscapesofpagefarmsnpcomplete
Done reread sketchinglandscapesofpagefarmsnpcomplete
 
Done reread sketchinglandscapesofpagefarmsnpcomplete(2)
Done reread sketchinglandscapesofpagefarmsnpcomplete(2)Done reread sketchinglandscapesofpagefarmsnpcomplete(2)
Done reread sketchinglandscapesofpagefarmsnpcomplete(2)
 
Prediction Model Using Web Usage Mining Techniques
Prediction Model Using Web Usage Mining TechniquesPrediction Model Using Web Usage Mining Techniques
Prediction Model Using Web Usage Mining Techniques
 
zanardi
zanardizanardi
zanardi
 
Done reread the effect of new links on google pagerank
Done reread the effect of new links on google pagerankDone reread the effect of new links on google pagerank
Done reread the effect of new links on google pagerank
 

Recently uploaded

Quantum Computing: Current Landscape and the Future Role of APIs
Quantum Computing: Current Landscape and the Future Role of APIsQuantum Computing: Current Landscape and the Future Role of APIs
Quantum Computing: Current Landscape and the Future Role of APIsVlad Stirbu
 
UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3DianaGray10
 
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdfSmart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf91mobiles
 
Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*Frank van Harmelen
 
When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...Elena Simperl
 
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...Sri Ambati
 
Assuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyesAssuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyesThousandEyes
 
Connector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a buttonConnector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a buttonDianaGray10
 
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...BookNet Canada
 
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...Ramesh Iyer
 
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024Tobias Schneck
 
Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........Alison B. Lowndes
 
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo DiehlFuture Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo DiehlPeter Udo Diehl
 
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...Product School
 
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...Product School
 
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdfFIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdfFIDO Alliance
 
Accelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish CachingAccelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish CachingThijs Feryn
 
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...Product School
 
Essentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with ParametersEssentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with ParametersSafe Software
 
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...Thierry Lestable
 

Recently uploaded (20)

Quantum Computing: Current Landscape and the Future Role of APIs
Quantum Computing: Current Landscape and the Future Role of APIsQuantum Computing: Current Landscape and the Future Role of APIs
Quantum Computing: Current Landscape and the Future Role of APIs
 
UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3
 
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdfSmart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
 
Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*
 
When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...
 
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
 
Assuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyesAssuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyes
 
Connector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a buttonConnector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a button
 
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
 
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
 
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
 
Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........
 
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo DiehlFuture Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
 
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
 
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
 
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdfFIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
 
Accelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish CachingAccelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish Caching
 
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...
 
Essentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with ParametersEssentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with Parameters
 
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
 

PageRank and Markov Chain

  • 1. Markov Chains as methodology used by PageRank to rank the Web Pages on Internet. Sergio S. Guirreri - www.guirreri.host22.com Google Technology User Group (GTUG) of Palermo. 5th March 2010 Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 1 / 14
  • 2. Overview 1 Concepts on Markov-Chains. 2 The idea of the PageRank algorithm. 3 The PageRank algorithm. 4 Solving the PageRank algorithm. 5 Conclusions. 6 Bibliography. 7 Internet web sites. Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 2 / 14
  • 3. Concepts on Markov-Chains. Stochastic Process and Markov-Chains. Let assume the following stochastic process {Xn; n = 0, 1, 2, . . . } with values in a set E, called the state space, while its elements are called state of the process. Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 3 / 14
  • 4. Concepts on Markov-Chains. Stochastic Process and Markov-Chains. Let assume the following stochastic process {Xn; n = 0, 1, 2, . . . } with values in a set E, called the state space, while its elements are called state of the process. Let assume the set E is finite or countable. Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 3 / 14
  • 5. Concepts on Markov-Chains. Stochastic Process and Markov-Chains. Let assume the following stochastic process {Xn; n = 0, 1, 2, . . . } with values in a set E, called the state space, while its elements are called state of the process. Let assume the set E is finite or countable. Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 3 / 14
  • 6. Concepts on Markov-Chains. Stochastic Process and Markov-Chains. Let assume the following stochastic process {Xn; n = 0, 1, 2, . . . } with values in a set E, called the state space, while its elements are called state of the process. Let assume the set E is finite or countable. Definition A Markov Chain is a stochastic process Xn that hold the following feature: Prob{Xn+1 = j|Xn = i, Xn−1 = in−1, . . . , X0 = i0} = = Prob{Xn+1 = j|Xn = i} = pij(n) where E is the state space set and j, i, in−1, . . . , i0 ∈ E, n ∈ N. Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 3 / 14
  • 7. Concepts on Markov-Chains. Stochastic Process and Markov-Chains. Let assume the following stochastic process {Xn; n = 0, 1, 2, . . . } with values in a set E, called the state space, while its elements are called state of the process. Let assume the set E is finite or countable. Definition A Markov Chain is a stochastic process Xn that hold the following feature: Prob{Xn+1 = j|Xn = i, Xn−1 = in−1, . . . , X0 = i0} = = Prob{Xn+1 = j|Xn = i} = pij(n) where E is the state space set and j, i, in−1, . . . , i0 ∈ E, n ∈ N. The transition probability matrix P of the process Xn is composed of pij, ∀i, j ∈ E. Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 3 / 14
  • 8. The idea of the PageRank algorithm. PageRank’s idea. The idea behind the PageRank algorithm is similar to the idea of the impact factor index used to rank the Journals [Page et al.(1999)] [Brin and Page(1998)] [Langville et al.(2008)]. Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 4 / 14
  • 9. The idea of the PageRank algorithm. PageRank’s idea. The idea behind the PageRank algorithm is similar to the idea of the impact factor index used to rank the Journals [Page et al.(1999)] [Brin and Page(1998)] [Langville et al.(2008)]. PageRank the impact factor of Internet. The impact factor of a journal is defined as the average number of citations per recently published papers in that journal. By regarding each web page as a journal, this idea was then extended to measure the importance of the web page in the PageRank Algorithm. Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 4 / 14
  • 10. The idea of the PageRank algorithm. Elements of the PageRank. To illustrate the PageRank algorithm I define the following variables [Ching and Ng(2006)]: let be N the total number of web pages in the web. Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 5 / 14
  • 11. The idea of the PageRank algorithm. Elements of the PageRank. To illustrate the PageRank algorithm I define the following variables [Ching and Ng(2006)]: let be N the total number of web pages in the web. let be k the outgoing links of web page j. Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 5 / 14
  • 12. The idea of the PageRank algorithm. Elements of the PageRank. To illustrate the PageRank algorithm I define the following variables [Ching and Ng(2006)]: let be N the total number of web pages in the web. let be k the outgoing links of web page j. let be Q the so called hyperlink matrix with elements: Qij =    1 k if web page i is an outgoing link of web page j; 0 otherwise; Qi,i > 0 ∀i. (1) Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 5 / 14
  • 13. The idea of the PageRank algorithm. Elements of the PageRank. To illustrate the PageRank algorithm I define the following variables [Ching and Ng(2006)]: let be N the total number of web pages in the web. let be k the outgoing links of web page j. let be Q the so called hyperlink matrix with elements: Qij =    1 k if web page i is an outgoing link of web page j; 0 otherwise; Qi,i > 0 ∀i. (1) Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 5 / 14
  • 14. The idea of the PageRank algorithm. Elements of the PageRank. To illustrate the PageRank algorithm I define the following variables [Ching and Ng(2006)]: let be N the total number of web pages in the web. let be k the outgoing links of web page j. let be Q the so called hyperlink matrix with elements: Qij =    1 k if web page i is an outgoing link of web page j; 0 otherwise; Qi,i > 0 ∀i. (1) The hyperlink matrix Q can be regarded as a transition probability matrix of a Markov chain. One may regard a surfer on the net as a random walker and the web pages as the states of the Markov chain. Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 5 / 14
  • 15. The PageRank algorithm. The PageRank with irreducible Markov Chain. Assuming that the Markov chain is irreduciblea and aperiodicb then the steady-state probability distribution (p1, p2, . . . , pN )T of the states (web pages) exists. aA Markov chain is irreducible if all states communicate with each other. bA chain is periodic if there exists k > 1 such that the interval between two visits to some state s is always a multiple of k. Therefore a chain is aperiodic if k=1. Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 6 / 14
  • 16. The PageRank algorithm. The PageRank with irreducible Markov Chain. Assuming that the Markov chain is irreduciblea and aperiodicb then the steady-state probability distribution (p1, p2, . . . , pN )T of the states (web pages) exists. aA Markov chain is irreducible if all states communicate with each other. bA chain is periodic if there exists k > 1 such that the interval between two visits to some state s is always a multiple of k. Therefore a chain is aperiodic if k=1. The PageRank Each pi is the proportion of time that the surfer visiting the web page i. Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 6 / 14
  • 17. The PageRank algorithm. The PageRank with irreducible Markov Chain. Assuming that the Markov chain is irreduciblea and aperiodicb then the steady-state probability distribution (p1, p2, . . . , pN )T of the states (web pages) exists. aA Markov chain is irreducible if all states communicate with each other. bA chain is periodic if there exists k > 1 such that the interval between two visits to some state s is always a multiple of k. Therefore a chain is aperiodic if k=1. The PageRank Each pi is the proportion of time that the surfer visiting the web page i. The higher the value of pi is, the more important web page i will be. Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 6 / 14
  • 18. The PageRank algorithm. The PageRank with irreducible Markov Chain. Assuming that the Markov chain is irreduciblea and aperiodicb then the steady-state probability distribution (p1, p2, . . . , pN )T of the states (web pages) exists. aA Markov chain is irreducible if all states communicate with each other. bA chain is periodic if there exists k > 1 such that the interval between two visits to some state s is always a multiple of k. Therefore a chain is aperiodic if k=1. The PageRank Each pi is the proportion of time that the surfer visiting the web page i. The higher the value of pi is, the more important web page i will be. The PageRank of web page i is then defined as pi. Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 6 / 14
  • 19. The PageRank algorithm. The PageRank with reducible Markov Chain Since the matrix Q can be reducible to ensure that the steady-state probability exists and is unique the following matrix P must be considered: P = α     Q11 Q12 . . . Q1N Q21 Q22 . . . Q2N . . . . . . . . . . . . QN1 QN2 . . . QNN     + (1 − α) N     1 1 . . . 1 1 1 . . . 1 . . . . . . . . . . . . 1 1 . . . 1     (2) Where 0 < α < 1 and the most popular values of α are 0.85 and (1 − 1/N). Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 7 / 14
  • 20. The PageRank algorithm. The PageRank with reducible Markov Chain Since the matrix Q can be reducible to ensure that the steady-state probability exists and is unique the following matrix P must be considered: P = α     Q11 Q12 . . . Q1N Q21 Q22 . . . Q2N . . . . . . . . . . . . QN1 QN2 . . . QNN     + (1 − α) N     1 1 . . . 1 1 1 . . . 1 . . . . . . . . . . . . 1 1 . . . 1     (2) Where 0 < α < 1 and the most popular values of α are 0.85 and (1 − 1/N). Interpretation of PageRank The idea of the PageRank (2) is that, for a network of N web pages, each web page has an inherent importance of (1 − α)/N. If a page Pi has an importance of pi, then it will contribute an importance of α pi which is shared among the web pages that it points to. Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 7 / 14
  • 21. The PageRank algorithm. The PageRank with reducible Markov Chain Solving the following linear system of equations subject to the normalization constraint one can obtain the importance of web page Pi :      p1 p2 ... pN      = α     Q11 Q12 . . . Q1N Q21 Q22 . . . Q2N . . . . . . . . . . . . QN1 QN2 . . . QNN          p1 p2 ... pN      + (1 − α) N      1 1 ... 1      (3) Since N i=1 pi = 1 the (3) can be rewritten as (p1, p2, . . . , pN )T = P(p1, p2, . . . , pN )T Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 8 / 14
  • 22. Solving the PageRank algorithm. The power method. The power method is an iterative method for solving the dominant eigenvalue and its corresponding eigenvectors of a matrix. Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 9 / 14
  • 23. Solving the PageRank algorithm. The power method. The power method is an iterative method for solving the dominant eigenvalue and its corresponding eigenvectors of a matrix. Given an n × n matrix A, the hypothesis of power method are: there is a single dominant eigenvalue. The eigenvalues can be sorted: |λ1| > |λ2| ≥ |λ3| ≥ . . . |λn| Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 9 / 14
  • 24. Solving the PageRank algorithm. The power method. The power method is an iterative method for solving the dominant eigenvalue and its corresponding eigenvectors of a matrix. Given an n × n matrix A, the hypothesis of power method are: there is a single dominant eigenvalue. The eigenvalues can be sorted: |λ1| > |λ2| ≥ |λ3| ≥ . . . |λn| there is a linearly independent set of n eigenvectors: {u(1) , u(2) , . . . , u(n) } Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 9 / 14
  • 25. Solving the PageRank algorithm. The power method. The power method is an iterative method for solving the dominant eigenvalue and its corresponding eigenvectors of a matrix. Given an n × n matrix A, the hypothesis of power method are: there is a single dominant eigenvalue. The eigenvalues can be sorted: |λ1| > |λ2| ≥ |λ3| ≥ . . . |λn| there is a linearly independent set of n eigenvectors: {u(1) , u(2) , . . . , u(n) } Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 9 / 14
  • 26. Solving the PageRank algorithm. The power method. The power method is an iterative method for solving the dominant eigenvalue and its corresponding eigenvectors of a matrix. Given an n × n matrix A, the hypothesis of power method are: there is a single dominant eigenvalue. The eigenvalues can be sorted: |λ1| > |λ2| ≥ |λ3| ≥ . . . |λn| there is a linearly independent set of n eigenvectors: {u(1) , u(2) , . . . , u(n) } so that Au(i) = λiu(i) , i = 1, . . . , n. Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 9 / 14
  • 27. Solving the PageRank algorithm. The power method. The initial vector x0 can be wrote: x(0) = a1u(1) + a2u(2) + · · · + anu(n) Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 10 / 14
  • 28. Solving the PageRank algorithm. The power method. The initial vector x0 can be wrote: x(0) = a1u(1) + a2u(2) + · · · + anu(n) iterating the initial vector with the A matrix: Ak x(0) = a1Ak u(1) + a2Ak u(2) + · · · + anAk u(n) Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 10 / 14
  • 29. Solving the PageRank algorithm. The power method. The initial vector x0 can be wrote: x(0) = a1u(1) + a2u(2) + · · · + anu(n) iterating the initial vector with the A matrix: Ak x(0) = a1Ak u(1) + a2Ak u(2) + · · · + anAk u(n) = a1λk 1u(1) + a2λk 2u(2) + · · · + anλk nu(n) . Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 10 / 14
  • 30. Solving the PageRank algorithm. The power method. The initial vector x0 can be wrote: x(0) = a1u(1) + a2u(2) + · · · + anu(n) iterating the initial vector with the A matrix: Ak x(0) = a1Ak u(1) + a2Ak u(2) + · · · + anAk u(n) = a1λk 1u(1) + a2λk 2u(2) + · · · + anλk nu(n) . dividing by λk 1 Ak x(0) λk 1 = a1u(1) + a2 λ2 λ1 k u(2) + · · · + an λn λ1 k u(n) , Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 10 / 14
  • 31. Solving the PageRank algorithm. The power method. The initial vector x0 can be wrote: x(0) = a1u(1) + a2u(2) + · · · + anu(n) iterating the initial vector with the A matrix: Ak x(0) = a1Ak u(1) + a2Ak u(2) + · · · + anAk u(n) = a1λk 1u(1) + a2λk 2u(2) + · · · + anλk nu(n) . dividing by λk 1 Ak x(0) λk 1 = a1u(1) + a2 λ2 λ1 k u(2) + · · · + an λn λ1 k u(n) , Since |λi| |λ1| < 1 → Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 10 / 14
  • 32. Solving the PageRank algorithm. The power method. The initial vector x0 can be wrote: x(0) = a1u(1) + a2u(2) + · · · + anu(n) iterating the initial vector with the A matrix: Ak x(0) = a1Ak u(1) + a2Ak u(2) + · · · + anAk u(n) = a1λk 1u(1) + a2λk 2u(2) + · · · + anλk nu(n) . dividing by λk 1 Ak x(0) λk 1 = a1u(1) + a2 λ2 λ1 k u(2) + · · · + an λn λ1 k u(n) , Since |λi| |λ1| < 1 → lim k→∞ |λi|k |λ1|k = 0 → Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 10 / 14
  • 33. Solving the PageRank algorithm. The power method. The initial vector x0 can be wrote: x(0) = a1u(1) + a2u(2) + · · · + anu(n) iterating the initial vector with the A matrix: Ak x(0) = a1Ak u(1) + a2Ak u(2) + · · · + anAk u(n) = a1λk 1u(1) + a2λk 2u(2) + · · · + anλk nu(n) . dividing by λk 1 Ak x(0) λk 1 = a1u(1) + a2 λ2 λ1 k u(2) + · · · + an λn λ1 k u(n) , Since |λi| |λ1| < 1 → lim k→∞ |λi|k |λ1|k = 0 → Ak ≈ a1λk 1u(1) Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 10 / 14
  • 34. Conclusions. The power method and PageRank. Results. The matrix P of the PageRank algorithm is a stochastic matrix therefore the largest eigenvalue is 1. Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 11 / 14
  • 35. Conclusions. The power method and PageRank. Results. The matrix P of the PageRank algorithm is a stochastic matrix therefore the largest eigenvalue is 1. The convergence rate of the power method depends on the ratio of λ2 λ1 . Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 11 / 14
  • 36. Conclusions. The power method and PageRank. Results. The matrix P of the PageRank algorithm is a stochastic matrix therefore the largest eigenvalue is 1. The convergence rate of the power method depends on the ratio of λ2 λ1 . It has been showed by [Haveliwala and Kamvar(2003)] that for the second largest eigenvalue of P, we have |λ2| ≤ α 0 ≤ α ≤ 1. Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 11 / 14
  • 37. Conclusions. The power method and PageRank. Results. The matrix P of the PageRank algorithm is a stochastic matrix therefore the largest eigenvalue is 1. The convergence rate of the power method depends on the ratio of λ2 λ1 . It has been showed by [Haveliwala and Kamvar(2003)] that for the second largest eigenvalue of P, we have |λ2| ≤ α 0 ≤ α ≤ 1. Since λ1 = 1 the converge rate depends on α. Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 11 / 14
  • 38. Conclusions. The power method and PageRank. Results. The matrix P of the PageRank algorithm is a stochastic matrix therefore the largest eigenvalue is 1. The convergence rate of the power method depends on the ratio of λ2 λ1 . It has been showed by [Haveliwala and Kamvar(2003)] that for the second largest eigenvalue of P, we have |λ2| ≤ α 0 ≤ α ≤ 1. Since λ1 = 1 the converge rate depends on α. The most popular value for α is 0.85. With this value it has been proved that the power method on web data set of over 80 million pages converges in about 50 iterations. Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 11 / 14
  • 39. Conclusions. Really thanks to GTUG Palermo and see you to the next meeting! Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 12 / 14
  • 40. Bibliography. Bibliography. Brin, S. and Page, L. (1998). The anatomy of a large-scale hypertextual Web search engine. Computer networks and ISDN systems, 30(1-7), 107–117. Ching, W. and Ng, M. (2006). Markov Chains: Models, Algoritms and Applications. Springer Science + Business Media, Inc. Haveliwala, T. and Kamvar, M. (2003). The second eigenvalue of the google matrix. Technical report, Stanford University. Langville, A., Meyer, C., and Fern´Andez, P. (2008). Google’s PageRank and beyond: the science of search engine rankings. The Mathematical Intelligencer, 30(1), 68–69. Page, L., Brin, S., Motwani, R., and Winograd, T. (1999). The PageRank Citation Ranking: Bringing Order to the Web. Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 13 / 14
  • 41. Internet web sites. Internet web sites. Jon Atle Gulla (2007) - From Google Search to Semantic Exploration. - Norwegian University of Science Technology - www.slideshare.net/sveino/semantics-and-search?type=presentation Steven Levy (2010) - Exclusive: How Google’s Algorithm Rules the Web - Wired Magazine - www.wired.com/magazine/2010/02/ff_google_algorithm/ Ann Smarty (2009) - Let’s Try to Find All 200 Parameters in Google Algorithm - Search Engine Journal - www.searchenginejournal.com/200-parameters-in-google-algorithm/15457/. Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 14 / 14