Analysing the degree distribution of real graphs by means of several probabilistic models

•

1 like•867 views

The document analyzes the degree distribution of nodes in real-world networks using probabilistic models. It studies networks from datasets like Amazon, California roads, DBLP co-authorships, and others. Maximum likelihood and information criteria are used to determine the best fitting distributions, including Altmann, discrete Weibull, and MOEZipf. The analysis finds the MOEZipf distribution provides the best fit for most networks, followed by the discrete Weibull and Altmann distributions. Future work is proposed to integrate the best fitting distributions into a graph generation tool and analyze correlations between degree distribution and other network structures.

Technology

Analysing the degree distribution of real
graphs by means of several probabilistic
models
A. Duarte-L´opez, A. Prat-P´erez, M. P´erez-Casany
DAMA-UPC, Universitat Polit`ecnica de Catalunya
18th March, 2015

Introduction
Networks are complex structures in which the connections among
the nodes can exhibit complicated patterns.
Given a graph G with set of nodes V, the node degree is the number
of connections that a node has.
Objective: To determine the probability distribution of the de-
gree random variable of a real graph.
2/9

Networks
Diﬀerent real live networks from SNAP were studied.
Network Nodes Edges Type
Amazon 262111 1234876 Directed
CA roadnet 1965206 5533213 Undirected
DBLP Co-autorship 317080 1049865 Undirected
Live Journal 3997962 34681188 Undirected
NotreDame 325729 1497133 Directed
Patents 3774767 16518947 Directed
TX roadnet 1379917 3843319 Undirected
Wikipedia 2394385 5021409 Directed
Youtube 1134890 2987623 Undirected
3/9

Probabilistic Models
The non-negative integer probability distributions considered
are:
Uni-parametric Models Bi-parametric Models
Geometric Right-truncated Zipf
Poisson MOEZipf
Zipf Altmann
Negative Binomial
Discrete Weibull
All the models that contain the zero value in its domain were
truncated at 1.
4/9

Estimation and Goodness of the fit
Maximum likelihood method.
Given k = (k1, k2, ..., kN) a sample for the node degree we
maximize:
l(θ; k) =
N
i=1
log(pθ(ki))
Let M be the dimension of the parameter vector.
Akaike Information Criteria
AIC = −2l(ˆθ, K) + 2M
N
N − M − 1
Bayesian Information Criteria
BIC = −2l(ˆθ, k) + Mlog(N)
5/9

Results
1 10 100 1000 10000
110010000
Youtube (log−log scale)
Degree
Frequency
Observations
Altmann Distribution
Discrete Weibull Distribution
MOEZipf Distribution
Parameters
Distribution Parameters
Altmann ˆγ = 1.64; ˆδ = 0.0036
Discrete Weibull ˆv = 0.1424; ˆp = 0.0044
MOEZipf ˆγ = 2.089; ˆβ = 2.4101
Goodness of the ﬁt
Distribution AIC ∆AIC
Altmann 1688294.56 13402.66
Discrete Weibull 1676831.47 1939.57
MOEZipf 1674891.9 0
Distribution BIC ∆BIC
Altmann 1688316.23 13402.66
Discrete Weibull 1676853.14 1939.57
MOEZipf 1674913.57 0
6/9

Conclusions
The models given better ﬁts are:
MOEZipf model (54%)
Zero truncated discrete Weibull Model(38%)
Altmann Model (8%).
7/9

Future work
To integrate functions in the DataGen (LDBC project) that
allow generate random graphs in which their node degree
follows the MOEZipf, the zero truncated discrete Weibull
or the Altmann distributions.
To analyse the correlation between the degree distribution
and some other structural characteristics of the network
such as the clustering coeﬃcient, the degree assortativity,
etc.
8/9

Why should you come to our poster?
You will ﬁnd a relatively easy approach that allows you to
get more information about your network structure.
You can share with us your experience with respect the
node degree distribution.
Further potential collaborations.
9/9

Similar to Analysing the degree distribution of real graphs by means of several probabilistic models

W+charm posterGiacomo Snidero

Self-sampling Strategies for Multimemetic Algorithms in Unstable Computationa...Rafael Nogueras

F010123235IOSR Journals

Evaluating the Synchronization of a Chaotic Encryption Scheme Using Different...IOSR Journals

Sensitivity Analysis of Checkpointing Strategies for Multimemetic Algorithms ...Rafael Nogueras

FAULT DETECTION AND CLASSIFICATION ON SINGLE CIRCUIT TRANSMISSION LINE USING ...Politeknik Negeri Ujung Pandang

Input analysisBhavik A Shah

ieee nss mic 2016 poster N30-21Dae Woon Kim

FAST実験3：新型大気蛍光望遠鏡の試験観測報告Toshihiro FUJII

Recreation mathematics pptPawan Yadav

NONLINEAR MODELING AND ANALYSIS OF WSN NODE LOCALIZATION METHODijwmn

Cobb-douglas production functionSuniya Sheikh

Network switchingPREMAL GAJJAR

Du3211861191IJMER

08-Switching.pptSanaMateen7

Molecular Weight EstimationPhan Nghia

B1102030610IOSR Journals

Path loss predictionNguyen Minh Thu

Similar to Analysing the degree distribution of real graphs by means of several probabilistic models (20)

W+charm poster

Self-sampling Strategies for Multimemetic Algorithms in Unstable Computationa...

F010123235

Evaluating the Synchronization of a Chaotic Encryption Scheme Using Different...

Sensitivity Analysis of Checkpointing Strategies for Multimemetic Algorithms ...

FAULT DETECTION AND CLASSIFICATION ON SINGLE CIRCUIT TRANSMISSION LINE USING ...

Input analysis

ieee nss mic 2016 poster N30-21

FAST実験3：新型大気蛍光望遠鏡の試験観測報告

Recreation mathematics ppt

NONLINEAR MODELING AND ANALYSIS OF WSN NODE LOCALIZATION METHOD

Cobb-douglas production function

Network switching

Du3211861191

08-Switching.ppt

Molecular Weight Estimation

B1102030610

Path loss prediction

Recently uploaded

Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Alkin Tezuysal

Time Series Foundation Models - current state and future directionsNathaniel Shimoni

[Webinar] SpiraTest - Setting New Standards in Quality AssuranceInflectra

Take control of your SAP testing with UiPath Test SuiteDianaGray10

Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Mark Goldstein

The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3

How to write a Business Continuity PlanDatabarracks

UiPath Community: Communication Mining from Zero to HeroUiPathCommunity

Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3

The State of Passkeys with FIDO Alliance.pptxLoriGlavin3

Rise of the Machines: Known As Drones...Rick Flair

How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe

Assure Ecommerce and Retail Operations Uptime with ThousandEyesThousandEyes

A Journey Into the Emotions of Software DevelopersNicole Novielli

The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech

Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxLoriGlavin3

So einfach geht modernes Roaming fuer Notes und Nomad.pdfpanagenda

Decarbonising Buildings: Making a net-zero built environment a realityIES VE

From Family Reminiscence to Scholarly Archive .Alan Dix

New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada

Recently uploaded (20)

Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...

Time Series Foundation Models - current state and future directions

[Webinar] SpiraTest - Setting New Standards in Quality Assurance

Take control of your SAP testing with UiPath Test Suite

Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24

The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx

How to write a Business Continuity Plan

UiPath Community: Communication Mining from Zero to Hero

Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx

The State of Passkeys with FIDO Alliance.pptx

Rise of the Machines: Known As Drones...

How AI, OpenAI, and ChatGPT impact business and software.

Assure Ecommerce and Retail Operations Uptime with ThousandEyes

A Journey Into the Emotions of Software Developers

The Ultimate Guide to Choosing WordPress Pros and Cons

Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx

So einfach geht modernes Roaming fuer Notes und Nomad.pdf

Decarbonising Buildings: Making a net-zero built environment a reality

From Family Reminiscence to Scholarly Archive .

New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024

Analysing the degree distribution of real graphs by means of several probabilistic models

1. Analysing the degree distribution of real graphs by means of several probabilistic models A. Duarte-López, A. Prat-Pérez, M. Pérez-Casany DAMA-UPC, Universitat Politècnica de Catalunya 18th March, 2015

2. Introduction Networks are complex structures in which the connections among the nodes can exhibit complicated patterns. Given a graph G with set of nodes V, the node degree is the number of connections that a node has. Objective: To determine the probability distribution of the degree random variable of a real graph. 2/9

3. Networks Diﬀerent real live networks from SNAP were studied. Network Nodes Edges Type Amazon 262111 1234876 Directed CA roadnet 1965206 5533213 Undirected DBLP Co-autorship 317080 1049865 Undirected Live Journal 3997962 34681188 Undirected NotreDame 325729 1497133 Directed Patents 3774767 16518947 Directed TX roadnet 1379917 3843319 Undirected Wikipedia 2394385 5021409 Directed Youtube 1134890 2987623 Undirected 3/9

4. Probabilistic Models The non-negative integer probability distributions considered are: Uni-parametric Models Bi-parametric Models Geometric Right-truncated Zipf Poisson MOEZipf Zipf Altmann Negative Binomial Discrete Weibull All the models that contain the zero value in its domain were truncated at 1. 4/9

5. Estimation and Goodness of the fit Maximum likelihood method. Given k = (k1, k2, ..., kN) a sample for the node degree we maximize: l(θ; k) = N i=1 log(pθ(ki)) Let M be the dimension of the parameter vector. Akaike Information Criteria AIC = −2l(ˆθ, K) + 2M N N − M − 1 Bayesian Information Criteria BIC = −2l(ˆθ, k) + Mlog(N) 5/9

6. Results 1 10 100 1000 10000 110010000 Youtube (log−log scale) Degree Frequency Observations Altmann Distribution Discrete Weibull Distribution MOEZipf Distribution Parameters Distribution Parameters Altmann ˆγ = 1.64; ˆδ = 0.0036 Discrete Weibull ˆv = 0.1424; ˆp = 0.0044 MOEZipf ˆγ = 2.089; ˆβ = 2.4101 Goodness of the ﬁt Distribution AIC ∆AIC Altmann 1688294.56 13402.66 Discrete Weibull 1676831.47 1939.57 MOEZipf 1674891.9 0 Distribution BIC ∆BIC Altmann 1688316.23 13402.66 Discrete Weibull 1676853.14 1939.57 MOEZipf 1674913.57 0 6/9

7. Conclusions The models given better ﬁts are: MOEZipf model (54%) Zero truncated discrete Weibull Model(38%) Altmann Model (8%). 7/9

8. Future work To integrate functions in the DataGen (LDBC project) that allow generate random graphs in which their node degree follows the MOEZipf, the zero truncated discrete Weibull or the Altmann distributions. To analyse the correlation between the degree distribution and some other structural characteristics of the network such as the clustering coeﬃcient, the degree assortativity, etc. 8/9

9. Why should you come to our poster? You will ﬁnd a relatively easy approach that allows you to get more information about your network structure. You can share with us your experience with respect the node degree distribution. Further potential collaborations. 9/9

Analysing the degree distribution of real graphs by means of several probabilistic models

Recommended

Recommended

More Related Content

Similar to Analysing the degree distribution of real graphs by means of several probabilistic models

Similar to Analysing the degree distribution of real graphs by means of several probabilistic models (20)

More from Graph-TA

More from Graph-TA (20)

Recently uploaded

Recently uploaded (20)

Analysing the degree distribution of real graphs by means of several probabilistic models