Introduction Belief Propagation Algorithm Methodology and Experiments Conclusions
Vertex Centric Asynchronous Belief Propagation
Algorithm for Large-Scale Graphs
Gabriel Gimenes
Hugo Gualdron
Jose F. Rodrigues-Jr
Instituto de Ciencias Matematicas e de Computacao
University of Sao Paulo - Sao Carlos
DamNet - 2016 ICDM Workshop, Barcelona, Spain
This work has financial support from FAPESP 2014/25337-0
Outline
1 Introduction
2 Belief Propagation Algorithm
3 Methodology and Experiments
4 Conclusions
Context
Ubiquitous data generation
Information availability: pros and cons
Web 2.0: users produce data, not only consume it
Relationships between elements
Facebook, Twitter, Amazon, GooglePlay, Email
Intuitive modelling: graphs (networks)
Problem
Analyzing large-scale networks – efficient and powerful
Some graphs (e.g., YahooWeb and Twitter) may not fit in memory
Naive processing: prohibitive
Alternative: distributed processing
complexity, infrastructure, cost
How to process in a single computational node?
Rationale
New approaches: taking advantage of multi-core architectures
Centralized → Decentralized
Vertex-centric processing techniques
Block-based processing
Asynchronous processing
Proposals: TurboGraph, GraphChi, X-Stream, MMap,
M-Flash, FlashGraph; Pregel, GraphLab, Giraph.
Vertex-centric paradigm
Vertex-centric model
procedure Graph scan(Graph G)
    for i = 1 to |V| do
        sete ← set of edges adjacent to V[i]
        V[i].value ← f(sete)
        for each edge e in sete do
            e.value ← g(V[i].value, e.value)
Outer loop
procedure Graph processing
    while convergence criterion is not satisfied do
        Graph scan(G)
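The scan and outer loop above can be sketched in Python as follows. This is a minimal illustration, not the authors' implementation: the function names, the dictionary-based storage, the rule that isolated vertices keep their value, and the toy task (labelling each vertex with the smallest vertex id in its connected component, using `f = g = min`) are all assumptions made for the example.

```python
from typing import Callable, Dict, List, Tuple

Edge = Tuple[int, int]  # undirected edge stored as (u, v)


def graph_scan(vertices: List[int], edges: List[Edge],
               vertex_value: Dict[int, int], edge_value: Dict[Edge, int],
               f: Callable[[List[int]], int],
               g: Callable[[int, int], int]) -> None:
    """One scan: each vertex reads its adjacent edges, then rewrites them."""
    adjacent: Dict[int, List[Edge]] = {v: [] for v in vertices}
    for e in edges:
        adjacent[e[0]].append(e)
        adjacent[e[1]].append(e)
    for v in vertices:
        sete = adjacent[v]
        if sete:  # assumption: isolated vertices keep their current value
            vertex_value[v] = f([edge_value[e] for e in sete])
        for e in sete:
            edge_value[e] = g(vertex_value[v], edge_value[e])


def graph_processing(vertices, edges, vertex_value, edge_value, f, g) -> int:
    """Repeat scans until no vertex value changes; return the scan count."""
    scans = 0
    while True:
        before = dict(vertex_value)
        graph_scan(vertices, edges, vertex_value, edge_value, f, g)
        scans += 1
        if vertex_value == before:
            return scans


# Toy use: label every vertex with the smallest vertex id in its component.
vertices = [0, 1, 2, 3, 4]
edges = [(0, 1), (1, 2), (3, 4)]
labels = {v: v for v in vertices}
edge_labels = {e: min(e) for e in edges}
scans = graph_processing(vertices, edges, labels, edge_labels, f=min, g=min)
```

Because edge values are rewritten inside the scan, a vertex processed later in the same scan already sees its neighbors' fresh values, which is the property the asynchronous approach relies on.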
Algorithm
Belief propagation: a Bayesian inference method
Estimating the marginal probability distribution for
non-annotated nodes
Message passing: information travels from annotated to
unannotated nodes
Guilt-by-association, or "birds of a feather flock together"
Heterophily vs Homophily
Problem
Original algorithm proposed for trees (graphs with no loops)
Loopy BP (Murphy et al.): the algorithm generalized to graphs with loops
Problems with convergence and performance
Early applications in stereo-imaging and facial reconstruction
Evolution
Performance and scalability: distributed processing
Gonzalez et al. – distributed inefficiencies
Kang et al. – algorithm relevance for anti-malware and fraud
detection applications
Gatterbauer et al. – linear approximation, convergence
guarantees and better performance
BP vs LinBP
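Belief Propagation
$$b_s(i) = e_s(i) \prod_{u \in N(s)} m_{us}(i)$$
$$m_{st}(i) = \sum_{j=0}^{c-1} H_{st}(j,i)\, e_s(j) \prod_{u \in N(s) \setminus t} m_{us}(j)$$
Linearized Belief Propagation
$$\hat{b}_s(i) = \hat{e}_s(i) + \frac{1}{k} \sum_{u \in N(s)} \hat{m}_{us}(i)$$
$$\hat{m}_{st}(i) = k \sum_{j} \hat{H}_{st}(j,i)\, \hat{b}_s(j) - \sum_{j} \hat{H}_{st}(j,i)\, \hat{m}_{ts}(j)$$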
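To make the two linearized equations concrete, here is a small Python sketch that applies them synchronously on a toy graph. The graph, the centered explicit beliefs, the residual coupling matrix, and the name `linbp_step` are hypothetical values chosen for illustration; a single coupling matrix is assumed for every edge.

```python
from typing import Dict, List, Tuple

Vec = List[float]


def linbp_step(neighbors: Dict[int, List[int]], e_hat: Dict[int, Vec],
               m_hat: Dict[Tuple[int, int], Vec], H_hat: List[Vec],
               k: int) -> Tuple[Dict[int, Vec], Dict[Tuple[int, int], Vec]]:
    """One synchronous application of the two linearized equations."""
    # Belief: explicit belief plus the averaged incoming messages.
    b_hat = {
        s: [e_hat[s][i] + sum(m_hat[(u, s)][i] for u in neighbors[s]) / k
            for i in range(k)]
        for s in neighbors
    }
    # Message s -> t: modulated belief minus the echo of the message t -> s.
    new_m = {}
    for s in neighbors:
        for t in neighbors[s]:
            new_m[(s, t)] = [
                k * sum(H_hat[j][i] * b_hat[s][j] for j in range(k))
                - sum(H_hat[j][i] * m_hat[(t, s)][j] for j in range(k))
                for i in range(k)
            ]
    return b_hat, new_m


# Hypothetical toy input: path graph 0 - 1 - 2, two classes, node 0 annotated.
k = 2
neighbors = {0: [1], 1: [0, 2], 2: [1]}
e_hat = {0: [0.1, -0.1], 1: [0.0, 0.0], 2: [0.0, 0.0]}
H_hat = [[0.1, -0.1], [-0.1, 0.1]]  # homophily: same class attracts
m_hat = {(s, t): [0.0] * k for s in neighbors for t in neighbors[s]}
for _ in range(5):
    b_hat, m_hat = linbp_step(neighbors, e_hat, m_hat, H_hat, k)
```

On this tree-shaped input the subtracted term cancels the echoed message, so the annotation on node 0 reaches nodes 1 and 2 with decreasing strength and is not reflected back.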
Proposal and contributions
Algorithm: change of paradigm, asynchronous parallel
vertex-centric processing
Convergence: better convergence speed (number of iterations)
Scalability: commodity computer
Our algorithm
VC-LinBP
procedure VC-LinBP(G(V, E), VExplicit, H, h, t)
    set H = h * H
    set H2 = H^2
    repeat
        for each vertex in V do
            Update(vertex)
    until t iterations or convergence achieved
Our algorithm
Update
procedure Update(vertex)
    set degree = 0
    for each class c in vertex do    // initialize vertex values for each class
        vertex.value(c) = 0
    for each incoming edge e to vertex do    // process incoming messages
        degree += e.weight^2
        for each class cfrom do
            for each class cto do
                vertex.value(cto) += e.weight * e.value(cfrom) * H(cfrom, cto)
    if vertex is not explicit then    // echo cancellation of messages
        for each class cfrom do
            for each class cto do
                vertex.value(cto) -= degree * vertex.value(cfrom) * H2(cfrom, cto)
    else    // add the explicit value of the vertex
        for each class c do
            vertex.value(c) += VExplicit(vertex)(c)
    for each outgoing edge e from vertex do    // send messages to neighbors
        for each class c do
            e.value(c) = vertex.value(c)
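A runnable Python sketch of the Update procedure and its driver loop, following the pseudocode above under stated assumptions. It is not the authors' GraphChi code: the dictionary-based storage, the `eps` convergence threshold, the toy input, and reading the echo-cancellation term from a snapshot of the accumulated sum (rather than updating strictly in place) are choices made for this example.

```python
from typing import Dict, List, Tuple

Edge = Tuple[int, int]  # directed edge (source, target)
Matrix = List[List[float]]


def update(v: int, k: int, in_edges: Dict[int, List[Edge]],
           out_edges: Dict[int, List[Edge]], weight: Dict[Edge, float],
           edge_value: Dict[Edge, List[float]],
           vertex_value: Dict[int, List[float]],
           explicit: Dict[int, List[float]], H: Matrix, H2: Matrix) -> float:
    """Update one vertex in place; return the largest change in its value."""
    degree = 0.0
    value = [0.0] * k
    for e in in_edges[v]:  # process incoming messages
        degree += weight[e] ** 2
        for c_from in range(k):
            for c_to in range(k):
                value[c_to] += weight[e] * edge_value[e][c_from] * H[c_from][c_to]
    if v not in explicit:  # echo cancellation
        acc = list(value)  # assumption: read from a snapshot of the sum
        for c_from in range(k):
            for c_to in range(k):
                value[c_to] -= degree * acc[c_from] * H2[c_from][c_to]
    else:  # add the explicit value of the vertex
        for c in range(k):
            value[c] += explicit[v][c]
    for e in out_edges[v]:  # send messages to neighbors
        edge_value[e] = list(value)
    delta = max(abs(a - b) for a, b in zip(value, vertex_value[v]))
    vertex_value[v] = value
    return delta


def vc_linbp(vertices: List[int], weight: Dict[Edge, float],
             explicit: Dict[int, List[float]], H0: Matrix, h: float,
             t: int, eps: float = 1e-12):
    """Scan all vertices until t iterations or the largest change is below eps."""
    k = len(H0)
    H = [[h * x for x in row] for row in H0]
    H2 = [[sum(H[i][m] * H[m][j] for m in range(k)) for j in range(k)]
          for i in range(k)]
    in_edges = {v: [e for e in weight if e[1] == v] for v in vertices}
    out_edges = {v: [e for e in weight if e[0] == v] for v in vertices}
    edge_value = {e: [0.0] * k for e in weight}
    vertex_value = {v: [0.0] * k for v in vertices}
    iterations = 0
    while iterations < t:
        iterations += 1
        delta = max(update(v, k, in_edges, out_edges, weight, edge_value,
                           vertex_value, explicit, H, H2) for v in vertices)
        if delta < eps:
            break
    return vertex_value, iterations


# Hypothetical toy input: path graph 0 - 1 - 2, node 0 annotated with class 0.
weight = {(0, 1): 1.0, (1, 0): 1.0, (1, 2): 1.0, (2, 1): 1.0}
beliefs, iterations = vc_linbp(
    vertices=[0, 1, 2], weight=weight, explicit={0: [0.1, -0.1]},
    H0=[[1.0, -1.0], [-1.0, 1.0]], h=0.1, t=50)
```

Since each vertex writes its outgoing edge values immediately, vertices 1 and 2 already receive information from vertex 0 during the first scan, whereas a synchronous schedule would need one iteration per hop.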
Experiments
Efficiency and efficacy
i7 CPU 8 cores, 16GB RAM, 240GB SSD
Comparison with LinBP
2 versions: single- and multi-threaded
Utilizing the GraphChi framework
Datasets
Generated with the Kronecker product method – SNAP
4 different networks
Graph    # Nodes      # Edges
1        59,049       1,048,576
2        177,147      4,194,304
3        531,441      16,777,216
4        1,594,323    67,108,864
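The node and edge counts in the table equal 3^k and 4^k for k = 10 to 13, which is what repeated Kronecker products of a 3-node initiator with 4 nonzero entries would produce. The slides do not give the initiator; the matrix below is a hypothetical one with that shape, used only to show how the sizes grow, and the helper names are likewise made up for this sketch.

```python
from typing import List

Matrix = List[List[int]]


def kronecker(a: Matrix, b: Matrix) -> Matrix:
    """Kronecker product of two square 0/1 adjacency matrices."""
    n, m = len(a), len(b)
    return [[a[i // m][j // m] * b[i % m][j % m] for j in range(n * m)]
            for i in range(n * m)]


def count_edges(adj: Matrix) -> int:
    """Number of nonzero entries, i.e. directed edges."""
    return sum(sum(row) for row in adj)


# Hypothetical 3-node initiator with 4 nonzero entries (not from the slides).
initiator = [[1, 1, 0],
             [1, 0, 0],
             [0, 0, 1]]
squared = kronecker(initiator, initiator)

# Sizes predicted by the k-th Kronecker power: 3**k nodes and 4**k edges.
predicted = [(3 ** k, 4 ** k) for k in range(10, 14)]
```

Each product multiplies the node count by 3 and the edge count by 4, so `squared` has 9 nodes and 16 edges, and `predicted` reproduces the four rows of the table.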
Experiments
Coupling Matrix
        1            2            3
1    0.266667    -0.033333     0.366667
2    0.033333    -0.333333     0.366667
3   -0.233333     0.366667    -0.133333
3 classes, 5% randomly initialized (annotated)
Coupling matrix and initialization procedure based on LinBP’s
experimentation
Experiments - Validation
Graph    Top-belief agreement
1        100%
2        100%
3        99%
4        99%
Divergences are related to tiebreak scenarios
Efficacy – to be expected
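One way to compute a top-belief agreement percentage is sketched below. The exact procedure used for the table is not given in the slides, so the function names, the rule that ties go to the lowest class index, and the sample beliefs are assumptions. The third node shows how a near-tie can flip the top class between two implementations.

```python
from typing import Dict, List


def top_belief(beliefs: List[float]) -> int:
    """Index of the largest belief; ties go to the lowest class index."""
    return max(range(len(beliefs)), key=beliefs.__getitem__)


def top_belief_agreement(a: Dict[int, List[float]],
                         b: Dict[int, List[float]]) -> float:
    """Percentage of nodes whose top class is the same in both results."""
    same = sum(1 for v in a if top_belief(a[v]) == top_belief(b[v]))
    return 100.0 * same / len(a)


# Hypothetical final beliefs from two implementations on four nodes.
result_a = {0: [0.3, -0.2, -0.1], 1: [-0.1, 0.2, -0.1],
            2: [0.1, 0.1, -0.2], 3: [-0.3, 0.1, 0.2]}
result_b = {0: [0.29, -0.19, -0.1], 1: [-0.1, 0.21, -0.11],
            2: [0.0999, 0.1001, -0.2], 3: [-0.3, 0.1, 0.2]}
agreement = top_belief_agreement(result_a, result_b)
```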
Experiments - Scalability
Runtime (sec)
Graph    LinBP-SQL    VC-LinBP-1thread    VC-LinBP-8threads
1        39.04        0.31                0.23
2        179          1.27                0.75
3        826          5.90                3.15
4        5000         34.62               18.69
Fixed number of iterations (5 iterations)
Only runtime is considered – excluding pre-processing time
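The speedups implied by the runtime table can be derived with a few lines of Python. The numbers are copied from the table; the variable names are arbitrary.

```python
# Runtime in seconds per graph: (LinBP-SQL, VC-LinBP 1 thread, VC-LinBP 8 threads)
runtimes = {
    1: (39.04, 0.31, 0.23),
    2: (179.0, 1.27, 0.75),
    3: (826.0, 5.90, 3.15),
    4: (5000.0, 34.62, 18.69),
}

# Per graph: speedup of 1 thread over SQL, of 8 threads over SQL,
# and of 8 threads over 1 thread.
speedups = {
    graph: (sql / one, sql / eight, one / eight)
    for graph, (sql, one, eight) in runtimes.items()
}

for graph, (vs_sql_1, vs_sql_8, threads) in speedups.items():
    print(f"graph {graph}: {vs_sql_1:.0f}x, {vs_sql_8:.0f}x, {threads:.2f}x")
```

Across the four graphs the vertex-centric version is more than two orders of magnitude faster than LinBP-SQL, while going from 1 to 8 threads gives less than a 2x gain.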
Experiments
[Figure (a) Scalability: runtime (sec) versus number of edges for VC_LinBP_1thread, VC_LinBP_8threads, and LinBP]
[Figure (b) Convergence: number of iterations per dataset (1 to 4) for LinBP and VC_LinBP]
Elidan et al. – asynchronous version is at worst the same as synchronous
Future Work
In-memory implementation – performance comparison
Experiments with bigger datasets
Detailed tiebreak scenarios
Real-world dataset experiments – DBLP, Malware detection,
Image segmentation
Thank you!
Questions?

Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
 
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
 
Call Girls In Mahipalpur O9654467111 Escorts Service
Call Girls In Mahipalpur O9654467111  Escorts ServiceCall Girls In Mahipalpur O9654467111  Escorts Service
Call Girls In Mahipalpur O9654467111 Escorts Service
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdf
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
 
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
 
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptxAmazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]
 
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
 
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data Storytelling
 

Vertex Centric Asynchronous Belief Propagation Algorithm for Large-Scale Graphs

  • 1. Vertex Centric Asynchronous Belief Propagation Algorithm for Large-Scale Graphs. Gabriel Gimenes, Hugo Gualdron, Jose F. Rodrigues-Jr. Instituto de Ciencias Matematicas e de Computacao, University of Sao Paulo - Sao Carlos. DamNet - 2016 ICDM Workshop, Barcelona, Spain. This work has financial support from FAPESP 2014/25337-0.
  • 2. Outline: 1 Introduction; 2 Belief Propagation Algorithm; 3 Methodology and Experiments; 4 Conclusions
  • 3. Outline: 1 Introduction; 2 Belief Propagation Algorithm; 3 Methodology and Experiments; 4 Conclusions
  • 4. Context. Ubiquitous data generation. Information availability: pros and cons. Web 2.0: users produce data and do not only consume it. Relationships between elements: Facebook, Twitter, Amazon, GooglePlay, Email. Intuitive modelling: graphs (networks).
  • 5. Problem. Analyzing large-scale networks: efficient and powerful. Some graphs (e.g., YahooWeb and Twitter) may not fit in memory. Naive processing: prohibitive. Alternative: distributed processing (complexity, infrastructure, cost). How to process in a single computational node?
  • 6. Rationale. New approaches: taking advantage of multi-core architectures. Centralized → decentralized. Vertex-centric processing techniques. Block-based processing. Asynchronous processing. Proposals: TurboGraph, GraphChi, X-Stream, MMap, M-Flash, FlashGraph; Pregel, GraphLab, Giraph.
  • 7. Vertex-centric paradigm
      Vertex-centric model:
        procedure Graph_scan(Graph G)
          for i = 1 to |V| do
            set_e ← set of edges adjacent to V[i]
            V[i].value ← f(set_e)
            for each edge e in set_e do
              e.value ← g(V[i].value, e.value)
      Outer loop:
        procedure Graph_processing
          while convergence criterion is not satisfied do
            Graph_scan(G)
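The vertex-centric model of slide 7 can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the data layout (`vertex_values`, `edge_values`, `adjacency`), the tolerance-based convergence test and the toy minimum-label example are not from the slides; only the roles of `f` and `g` are.

```python
from typing import Callable, Dict, List, Tuple

Edge = Tuple[int, int]  # undirected edge keyed as (smaller id, larger id); hypothetical layout


def graph_scan(vertex_values: List[float],
               edge_values: Dict[Edge, float],
               adjacency: List[List[Edge]],
               f: Callable[[List[float]], float],
               g: Callable[[float, float], float]) -> float:
    """Visit every vertex once; return the largest change of any vertex value."""
    max_delta = 0.0
    for v in range(len(vertex_values)):
        edges = adjacency[v]
        new_value = f([edge_values[e] for e in edges])  # V[i].value <- f(set_e)
        max_delta = max(max_delta, abs(new_value - vertex_values[v]))
        vertex_values[v] = new_value
        for e in edges:                                 # e.value <- g(V[i].value, e.value)
            edge_values[e] = g(new_value, edge_values[e])
    return max_delta


def graph_processing(vertex_values, edge_values, adjacency, f, g,
                     tol: float = 0.0, max_scans: int = 100) -> int:
    """Outer loop: repeat scans until no vertex value changes by more than tol."""
    for scan in range(1, max_scans + 1):
        if graph_scan(vertex_values, edge_values, adjacency, f, g) <= tol:
            return scan
    return max_scans


# Toy run (assumed example): minimum-label propagation on the path 0 - 1 - 2.
values = [0.0, 1.0, 2.0]
edges = {(0, 1): 0.0, (1, 2): 1.0}  # each edge starts with its smaller endpoint id
adj = [[(0, 1)], [(0, 1), (1, 2)], [(1, 2)]]
scans = graph_processing(values, edges, adj, f=min, g=min)
```

Because edge values are overwritten in place, vertex 2 already reads the label that vertex 1 wrote earlier in the same scan, which is the asynchronous behaviour the talk builds on.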
  • 8. Outline: 1 Introduction; 2 Belief Propagation Algorithm; 3 Methodology and Experiments; 4 Conclusions
  • 9. Algorithm. Belief propagation: a Bayesian inference method. Estimates the marginal probability distribution for non-annotated nodes. Message passing: information travels from annotated to unannotated nodes. Guilt-by-association, or "birds of a feather flock together". Heterophily vs. homophily.
  • 10. Problem. Original algorithm proposed for trees (no loops). Loopy BP (Murphy et al.): generalized algorithm. Problems with convergence and performance. Early applications in stereo imaging and facial reconstruction.
  • 11. Evolution. Performance and scalability: distributed processing. Gonzalez et al.: distributed inefficiencies. Kang et al.: algorithm relevance for anti-malware and fraud detection applications. Gatterbauer et al.: linear approximation, convergence guarantees and better performance.
  • 12. BP vs LinBP
      Belief Propagation:
        $b_s(i) = e_s(i) \prod_{u \in N(s)} m_{us}(i)$
        $m_{st}(i) = \sum_{j=0}^{c-1} H_{st}(j,i)\, e_s(j) \prod_{u \in N(s) \setminus t} m_{us}(j)$
      Linearized Belief Propagation:
        $\hat{b}_s(i) = \hat{e}_s(i) + \frac{1}{k} \sum_{u \in N(s)} \hat{m}_{us}(i)$
        $\hat{m}_{st}(i) = k \sum_{j} \hat{H}_{st}(j,i)\, \hat{b}_s(j) - \sum_{j} \hat{H}_{st}(j,i)\, \hat{m}_{ts}(j)$
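The four update rules of slide 12 translate almost term by term into code. The sketch below is an illustration, not the authors' implementation. It assumes a single coupling matrix shared by all edges (the slide indexes it per edge as H_st), takes k to be the number of classes, and stores messages in a dictionary keyed by (sender, receiver); all names and the toy numbers are hypothetical.

```python
from math import prod
from typing import Dict, List, Tuple

Vec = List[float]
Mat = List[List[float]]
Messages = Dict[Tuple[int, int], Vec]  # (sender, receiver) -> message vector


def bp_belief(s: int, e: Dict[int, Vec], nbrs: Dict[int, List[int]], m: Messages) -> Vec:
    """b_s(i) = e_s(i) * prod over u in N(s) of m_us(i)."""
    return [e[s][i] * prod(m[(u, s)][i] for u in nbrs[s]) for i in range(len(e[s]))]


def bp_message(s: int, t: int, e: Dict[int, Vec], nbrs: Dict[int, List[int]],
               m: Messages, H: Mat) -> Vec:
    """m_st(i) = sum_j H(j, i) e_s(j) * prod over u in N(s) except t of m_us(j)."""
    c = len(e[s])
    return [sum(H[j][i] * e[s][j] * prod(m[(u, s)][j] for u in nbrs[s] if u != t)
                for j in range(c))
            for i in range(c)]


def linbp_belief(s: int, e_hat: Dict[int, Vec], nbrs: Dict[int, List[int]],
                 m_hat: Messages) -> Vec:
    """b^_s(i) = e^_s(i) + (1/k) * sum over u in N(s) of m^_us(i)."""
    k = len(e_hat[s])
    return [e_hat[s][i] + sum(m_hat[(u, s)][i] for u in nbrs[s]) / k for i in range(k)]


def linbp_message(s: int, t: int, b_hat: Dict[int, Vec], m_hat: Messages, H_hat: Mat) -> Vec:
    """m^_st(i) = k * sum_j H^(j, i) b^_s(j) - sum_j H^(j, i) m^_ts(j)."""
    k = len(b_hat[s])
    return [k * sum(H_hat[j][i] * b_hat[s][j] for j in range(k))
            - sum(H_hat[j][i] * m_hat[(t, s)][j] for j in range(k))
            for i in range(k)]


# Toy two-node graph 0 - 1 with two classes (all numbers are made up for illustration).
nbrs = {0: [1], 1: [0]}

# Standard BP: node 0 has a strong prior, node 1 is uninformed.
e = {0: [0.9, 0.1], 1: [0.5, 0.5]}
H = [[0.8, 0.2], [0.2, 0.8]]
m = {(0, 1): [1.0, 1.0], (1, 0): [1.0, 1.0]}
m[(0, 1)] = bp_message(0, 1, e, nbrs, m, H)
bp_b1 = bp_belief(1, e, nbrs, m)

# LinBP: the same setting with centered (residual) quantities.
e_hat = {0: [0.4, -0.4], 1: [0.0, 0.0]}
H_hat = [[0.3, -0.3], [-0.3, 0.3]]
m_hat = {(0, 1): [0.0, 0.0], (1, 0): [0.0, 0.0]}
b_hat = {s: linbp_belief(s, e_hat, nbrs, m_hat) for s in nbrs}
m_hat[(0, 1)] = linbp_message(0, 1, b_hat, m_hat, H_hat)
lin_b1 = linbp_belief(1, e_hat, nbrs, m_hat)
```

The contrast is visible in the code: BP multiplies incoming messages, while LinBP only adds and scales them, which is what makes the linearized form suitable for the accumulate-style vertex updates used later in the talk.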
  • 13. Outline: 1 Introduction; 2 Belief Propagation Algorithm; 3 Methodology and Experiments; 4 Conclusions
  • 14. Proposal and contributions. Algorithm: change of paradigm to asynchronous, parallel, vertex-centric processing. Convergence: better convergence speed (fewer iterations). Scalability: runs on a commodity computer.
  • 15. Our algorithm: VC-LinBP
        procedure VC-LinBP(G(V, E), VExplicit, H, h, t)
          set H = h·H
          set H2 = H^2
          repeat
            for each vertex in V do
              Update(vertex)
          until t iterations or convergence achieved
  • 16. Our algorithm: Update
        procedure Update(vertex)
          set degree = 0
          for each class c in vertex do                // initializing vertex values for each class
            vertex.value(c) = 0
          for each incoming edge e to vertex do        // processing incoming messages
            degree += e.weight^2
            for each class c_from do
              for each class c_to do
                vertex.value(c_to) += e.weight * e.value(c_from) * H(c_from, c_to)
          if vertex is not explicit then               // echo cancellation of messages
            for each class c_from do
              for each class c_to do
                vertex.value(c_to) -= degree * vertex.value(c_from) * H2(c_from, c_to)
          else                                         // adding explicit value of the vertex
            vertex.value(c) += VExplicit(vertex)(c)
          for each outgoing edge e from vertex do      // sending messages to neighbors
            for each class c do
              e.value(c) = vertex.value(c)
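A minimal Python reading of the VC-LinBP and Update procedures from slides 15 and 16 is sketched below. It is not the authors' GraphChi implementation. Several details are assumptions: the in-memory `Graph` class, the tolerance-based convergence test, looping over all classes in the explicit branch, and taking a snapshot of the accumulated value before echo cancellation (the slide writes that step in place).

```python
from typing import Dict, List, Tuple

Vec = List[float]
Mat = List[List[float]]
Edge = Tuple[int, int]  # directed edge (source, target)


class Graph:
    """Hypothetical in-memory graph; each undirected edge becomes two directed ones."""

    def __init__(self, num_vertices: int, edges: List[Tuple[int, int, float]], num_classes: int):
        self.values: List[Vec] = [[0.0] * num_classes for _ in range(num_vertices)]
        self.in_edges: List[List[Edge]] = [[] for _ in range(num_vertices)]
        self.out_edges: List[List[Edge]] = [[] for _ in range(num_vertices)]
        self.weight: Dict[Edge, float] = {}
        self.message: Dict[Edge, Vec] = {}
        for u, v, w in edges:
            for src, dst in ((u, v), (v, u)):
                self.out_edges[src].append((src, dst))
                self.in_edges[dst].append((src, dst))
                self.weight[(src, dst)] = w
                self.message[(src, dst)] = [0.0] * num_classes


def mat_mul(a: Mat, b: Mat) -> Mat:
    n = len(a)
    return [[sum(a[i][x] * b[x][j] for x in range(n)) for j in range(n)] for i in range(n)]


def update(v: int, g: Graph, explicit: Dict[int, Vec], H: Mat, H2: Mat) -> float:
    """Recompute one vertex from its incoming messages; return its largest change."""
    k = len(H)
    degree = 0.0
    value = [0.0] * k
    for e in g.in_edges[v]:                      # processing incoming messages
        w = g.weight[e]
        degree += w * w
        for c_from in range(k):
            for c_to in range(k):
                value[c_to] += w * g.message[e][c_from] * H[c_from][c_to]
    if v not in explicit:                        # echo cancellation of messages
        snapshot = list(value)                   # assumption: read pre-cancellation values
        for c_from in range(k):
            for c_to in range(k):
                value[c_to] -= degree * snapshot[c_from] * H2[c_from][c_to]
    else:                                        # adding explicit value of the vertex
        for c in range(k):                       # assumption: applied to every class
            value[c] += explicit[v][c]
    delta = max(abs(a - b) for a, b in zip(value, g.values[v]))
    g.values[v] = value
    for e in g.out_edges[v]:                     # sending messages to neighbors
        g.message[e] = list(value)
    return delta


def vc_linbp(g: Graph, explicit: Dict[int, Vec], H: Mat, h: float, t: int,
             tol: float = 1e-9) -> List[Vec]:
    """Run at most t sweeps over all vertices, stopping early once changes fall below tol."""
    Hs = [[h * x for x in row] for row in H]     # set H = h * H
    H2 = mat_mul(Hs, Hs)                         # set H2 = H^2
    for _ in range(t):
        delta = 0.0
        for v in range(len(g.values)):
            delta = max(delta, update(v, g, explicit, Hs, H2))
        if delta < tol:
            break
    return g.values


# Toy run (made-up numbers): path 0 - 1 - 2, vertex 0 labelled with class 0, homophily coupling.
toy = Graph(3, [(0, 1, 1.0), (1, 2, 1.0)], num_classes=2)
beliefs = vc_linbp(toy, {0: [0.1, -0.1]}, [[1.0, -1.0], [-1.0, 1.0]], h=0.1, t=5)
```

In the toy run the label of vertex 0 reaches vertex 2 within the first sweep, because vertex 1 publishes its new value on its outgoing edges before vertex 2 is processed.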
  • 17. Experiments. Efficiency and efficacy. Machine: i7 CPU with 8 cores, 16 GB RAM, 240 GB SSD. Comparison with LinBP. 2 versions: single- and multi-threaded. Implemented with the GraphChi framework.
  • 18. Datasets. Generated with the Kronecker product method (SNAP). 4 different networks.
        Graph   # Nodes      # Edges
        1          59,049    1,048,576
        2         177,147    4,194,304
        3         531,441   16,777,216
        4       1,594,323   67,108,864
  • 19. Experiments. Coupling matrix:
                1           2           3
        1    0.266667   -0.033333    0.366667
        2    0.033333   -0.333333    0.366667
        3   -0.233333    0.366667   -0.133333
      3 classes, 5% of nodes randomly initialized (annotated). Coupling matrix and initialization procedure based on LinBP's experimentation.
  • 20. Experiments - Validation
        Graph   Top-beliefs' agreement (%)
        1       100%
        2       100%
        3       99%
        4       99%
      Divergences are related to tiebreak scenarios. Efficacy: to be expected.
  • 21. Experiments - Scalability. Runtime (sec):
        Graph   LinBP-SQL   VC-LinBP-1thread   VC-LinBP-8threads
        1          39.04          0.31               0.23
        2         179             1.27               0.75
        3         826             5.90               3.15
        4        5000            34.62              18.69
      Fixed number of iterations (5 iterations). Only runtime is considered, excluding pre-processing time.
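The runtime table on slide 21 is easier to read as speedup ratios. The short script below only does arithmetic on the numbers reported on the slide; the function and variable names are hypothetical.

```python
from typing import Dict, Tuple

# Runtimes in seconds from slide 21: (LinBP-SQL, VC-LinBP 1 thread, VC-LinBP 8 threads).
RUNTIMES: Dict[int, Tuple[float, float, float]] = {
    1: (39.04, 0.31, 0.23),
    2: (179.0, 1.27, 0.75),
    3: (826.0, 5.90, 3.15),
    4: (5000.0, 34.62, 18.69),
}


def speedups(runtimes: Dict[int, Tuple[float, float, float]]) -> Dict[int, Tuple[float, float, float]]:
    """Per graph: (SQL / 1 thread, SQL / 8 threads, 1 thread / 8 threads)."""
    return {
        graph: (sql / one, sql / eight, one / eight)
        for graph, (sql, one, eight) in runtimes.items()
    }


ratios = speedups(RUNTIMES)
for graph, (vs_one, vs_eight, parallel) in ratios.items():
    print(f"graph {graph}: {vs_one:.0f}x (1 thread), {vs_eight:.0f}x (8 threads), "
          f"{parallel:.2f}x from multi-threading")
```

On these figures the vertex-centric version is more than two orders of magnitude faster than LinBP-SQL on every graph, while going from 1 to 8 threads gives less than a 2x gain.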
  • 22. Experiments. [Figure: (a) Scalability: runtime (sec) vs. number of edges for VC_LinBP_1thread, VC_LinBP_8threads and LinBP; (b) Convergence: number of iterations per dataset for LinBP and VC_LinBP.] Elidan et al.: the asynchronous version is at worst the same as the synchronous one.
  • 23. Outline: 1 Introduction; 2 Belief Propagation Algorithm; 3 Methodology and Experiments; 4 Conclusions
  • 24. Future Work. In-memory implementation with performance comparison. Experiments with bigger datasets. Detailed analysis of tiebreak scenarios. Real-world dataset experiments: DBLP, malware detection, image segmentation.
  • 25. Thank you! Questions?