SlideShare a Scribd company logo
gSkeletonClu [1]
Revealing density-based clustering structure from the core-connected
tree of a network
[1]Huang, J., Sun, H., Song, Q., Deng, H., & Han, J. (2013). Revealing density-based clustering structure from the core-connected tree of a
network. IEEE Transactions on Knowledge and Data Engineering, 25(8), 1876–1889. http://doi.org/10.1109/TKDE.2012.100
http://ieeexplore.ieee.org/xpl/login.jsp?tp=&arnumber=6200274&url=http%3A%2F%2Fieeexplore.ieee.org%2Fiel5%2F69%2F4358933%2F06200274.pdf%
3Farnumber%3D6200274
Abstract
Objective: Identify communities and vertices roles in a weighted network
Overview
Given a weighted network…
1- Calculate its CCMST with the Core-Connectivity Similarity
2- Find the components called (Structure Core-Connected)
● Components that contains the core
3- Attach the vertex classified as border
4 - Identify the Hubs and Outlier
Def1) Neighborhood
Neighborhood of n1:
r(n1) = {n1, n2, n3, n8}
Def2) Structural Similarity
num = 2*weight(n1, n8) = 20
denA = denB = 1
num += 10*10 = 120
denA += Sqr[(10*10) + (10*10)] = 14.14
denB += Sqr[(10*10) + (10*10) + (10*10) + (5*5)] = 18.02
σ(n1, n8) = num/(denA*denB)
*Note = Initial values of num and den are a mysterious
Def2) Structural Similarity
σ(n1, n8) = 0.47
Def3) Ɛ-Neighborhood
Ɛ-Neighborhood for n1:
● Ɛ = 0.47
● rƐ(n1) = {n1, n2, n8}
Def4) Core
if | (u) | >= μ, then u is a core. Denoted by Kε,μ
(u)
Considering,
● Ɛ = 0.47
● μ = 3,
so...
● Kε,μ (n1)
Def5) Directly Structure-Reachable
If u is a core AND v belongs to Ɛ(u).
So:
● u ⟼ ε,μv
○ n1 ⟼ ε,μn8
Def6) Hubs and Outliers
if h does not belong to any cluster
AND
if h bridges multiples cluster, such that:
h E r(u) ^ h E r(v)
then h is hub.
If not hub:
v is Outlier
Def6) Hubs and Outliers
if h does not belong to any cluster
AND
if h bridges multiples cluster, such that:
● h E r(u) ^ h E r(v)
then h is hub.
If not hub:
v is Outlier
hub
outlier
Def7) Structure Core-Similarity
CS(n1) candidates...
1. (n1, n0) - 0.08
2. (n1, n2) - 0.68
3. (n1, n3) - 0.43
4. (n1, n8) - 0.47
Ɛ
Def7) Structure Core-Similarity
CS(n1) candidates...
1. (n1, n0) - 0.08
2. (n1, n2) - 0.68
3. (n1, n3) - 0.43
4. (n1, n8) - 0.47
Ɛ
Def8) Reachability-Similarity
RS(n6, n7) = min {0.51, 0.1} = 0.1
RS(n6, n4) = 0.51
RS(n6, n5) = 0.55
---
RS(n7, n6) = min {0, 0.1} = 0
Asymmetric!!!!
Def9) Core Connectivity Similarity
CCS(n6, n4) = 0.51
CCS(n6, n5) = 0.51
CCS(n6, n7) = 0
Def9) Core Connectivity Similarity
CCS(n6, n4) = 0.51
CCS(n6, n5) = 0.51
CCS(n6, n7) = 0
Def10) Structure Core-Connected
Given Ɛ E IR, μ E IN;
u, v E V;
u, and v are directly core-connected with each other if and only if:
● Kε,μ (u) ^Kε,μ (v) ^ u ⟼ ε,μv
This is denoted by:
u ⟷ ε,μv
gSkeletonClu will first try to find structures that respect this definition above, after
that will append the "borders" ( vertex that are "directly structure reachable" but
don't respect this def. above). At the end, the gSkeletonClu will separate the
clusters, hubs and outliers.
CCMST - Core-Connected Maximal Spanning Tree
Instead to use the complete network the authors proved that it is possible to
identify the Structure Core-Connected components from the CCMST, considering
the weight as the "CCS(u,v)".
Ɛ-Candidates:
● 0.51
● 0.47
● 0.43
● 0.08
● 0
Core-Connected Components from CCMST
Ɛ= 0.51 Ɛ= 0.47
Ɛ= 0.43 Ɛ= 0.08
Attracting Indices for Attaching Borders
RS(2,3) = 0.55
RS(1,3) = 0.43
--
AS(3) = 0.55
Attracting Indices for Attaching Borders
AS(3) = 0.55
Ɛ= 0.47
if AS(3) > Ɛ:
n3 is attached to
the cluster that
contains n2.
So What…. ?
Let`s execute from scratch!
Step 1 - Prepare your weapons!!
Calculate the Weighted Core-Similarity NetworK
Ɛ = 0.47
μ = 3
Weighted NetworK: Weighted Core-Similarity NetworK:
Step 2- Point your weapons...
Calculate the CCMST
.
Step 3A - Fire!
Detect Core-Connected Components...
Ɛ = 0.47
μ = 3
Ɛ= 0.47
Step 3B - Fire again !
Attach the borders!
Ɛ= 0.47
Step 3C - Kill it, before it kills you!
Detect Cluster, hubs and outliers
n0 is a hub because:
● n0 does not belong to any cluster
● n0 bridges the clusters A and B.
n7 is a outlier because:
● it is not a hub =(
hub
outlier
Results - Guard the guns… You are the winner!
(or just a survivor...)
Clustering of Automatically Selected Ɛ
If you have the Ɛ candidates extracted from the CCMST…
AND...
If you adopt a way to measure what is the best Ɛ...
Then, you can automatically select the Ɛ parameter.
One possible choice is to use the modularity Q as a quality measure of network
clustering. The Q value belongs to [0,1]. The higher the value close to 1 indicates a
better clustering result.
In a nutshell… You should run the gSkeletonClu for all Ɛ candidates and based on a
quality index, choose the best partition!!!
Did you like?
There is more!
From the CCMST is possible to extract the clustering hierarchy… (next opportunity)
Limitation
● The gSkletonClu just can be applied on networks!
● In the author`s paper of gSkeletonClu, the tests show that it is slower than
SCAN…
● Maybe it can not work in BIG networks. (more than 1 million of vertex)
○ SCAN ++ (Shiokawa, 2015) [1][2] did tests in BIG networks and could not
perform the gSkeleton on them…
Have fun!
[1] http://www.vldb.org/pvldb/vol8/p1178-shiokawa.pdf
[2] htp://pt.slideshare.net/LazyShion/scan-efficient-algorithm-for-finding-clusters-hubs-and-outliers-on-largescale-graphs-vldb-2015
Presentation created by:
Danilo Amaral de Oliveira
oliveiradanilo@usp.br
Thank you!

More Related Content

What's hot

Cluster analysis using k-means method in R
Cluster analysis using k-means method in RCluster analysis using k-means method in R
Cluster analysis using k-means method in RVladimir Bakhrushin
 
K-means Clustering Algorithm with Matlab Source code
K-means Clustering Algorithm with Matlab Source codeK-means Clustering Algorithm with Matlab Source code
K-means Clustering Algorithm with Matlab Source codegokulprasath06
 
CC282 Unsupervised Learning (Clustering) Lecture 7 slides for ...
CC282 Unsupervised Learning (Clustering) Lecture 7 slides for ...CC282 Unsupervised Learning (Clustering) Lecture 7 slides for ...
CC282 Unsupervised Learning (Clustering) Lecture 7 slides for ...butest
 
Mean shift and Hierarchical clustering
Mean shift and Hierarchical clustering Mean shift and Hierarchical clustering
Mean shift and Hierarchical clustering Yan Xu
 
Fuzzy c-Means Clustering Algorithms
Fuzzy c-Means Clustering AlgorithmsFuzzy c-Means Clustering Algorithms
Fuzzy c-Means Clustering AlgorithmsJustin Cletus
 
Complex models in ecology: challenges and solutions
Complex models in ecology: challenges and solutionsComplex models in ecology: challenges and solutions
Complex models in ecology: challenges and solutionsPeter Solymos
 
K-means Clustering
K-means ClusteringK-means Clustering
K-means ClusteringSajib Sen
 
K-means clustering algorithm
K-means clustering algorithmK-means clustering algorithm
K-means clustering algorithmVinit Dantkale
 
Shortest route and mst
Shortest route and mstShortest route and mst
Shortest route and mstAlona Salva
 
Lecture 5: Neural Networks II
Lecture 5: Neural Networks IILecture 5: Neural Networks II
Lecture 5: Neural Networks IISang Jun Lee
 
Sergei Vassilvitskii, Research Scientist, Google at MLconf NYC - 4/15/16
Sergei Vassilvitskii, Research Scientist, Google at MLconf NYC - 4/15/16Sergei Vassilvitskii, Research Scientist, Google at MLconf NYC - 4/15/16
Sergei Vassilvitskii, Research Scientist, Google at MLconf NYC - 4/15/16MLconf
 

What's hot (20)

Data miningpresentation
Data miningpresentationData miningpresentation
Data miningpresentation
 
Cluster analysis using k-means method in R
Cluster analysis using k-means method in RCluster analysis using k-means method in R
Cluster analysis using k-means method in R
 
K-means Clustering Algorithm with Matlab Source code
K-means Clustering Algorithm with Matlab Source codeK-means Clustering Algorithm with Matlab Source code
K-means Clustering Algorithm with Matlab Source code
 
Clustering tutorial
Clustering tutorialClustering tutorial
Clustering tutorial
 
CC282 Unsupervised Learning (Clustering) Lecture 7 slides for ...
CC282 Unsupervised Learning (Clustering) Lecture 7 slides for ...CC282 Unsupervised Learning (Clustering) Lecture 7 slides for ...
CC282 Unsupervised Learning (Clustering) Lecture 7 slides for ...
 
Project PPT
Project PPTProject PPT
Project PPT
 
Mean shift and Hierarchical clustering
Mean shift and Hierarchical clustering Mean shift and Hierarchical clustering
Mean shift and Hierarchical clustering
 
K-Means manual work
K-Means manual workK-Means manual work
K-Means manual work
 
Fuzzy c-Means Clustering Algorithms
Fuzzy c-Means Clustering AlgorithmsFuzzy c-Means Clustering Algorithms
Fuzzy c-Means Clustering Algorithms
 
Complex models in ecology: challenges and solutions
Complex models in ecology: challenges and solutionsComplex models in ecology: challenges and solutions
Complex models in ecology: challenges and solutions
 
Clustering part 1
Clustering part 1Clustering part 1
Clustering part 1
 
Neural nw k means
Neural nw k meansNeural nw k means
Neural nw k means
 
post119s1-file2
post119s1-file2post119s1-file2
post119s1-file2
 
K means Clustering Algorithm
K means Clustering AlgorithmK means Clustering Algorithm
K means Clustering Algorithm
 
K-means Clustering
K-means ClusteringK-means Clustering
K-means Clustering
 
K-means clustering algorithm
K-means clustering algorithmK-means clustering algorithm
K-means clustering algorithm
 
Shortest route and mst
Shortest route and mstShortest route and mst
Shortest route and mst
 
Lecture 5: Neural Networks II
Lecture 5: Neural Networks IILecture 5: Neural Networks II
Lecture 5: Neural Networks II
 
Sergei Vassilvitskii, Research Scientist, Google at MLconf NYC - 4/15/16
Sergei Vassilvitskii, Research Scientist, Google at MLconf NYC - 4/15/16Sergei Vassilvitskii, Research Scientist, Google at MLconf NYC - 4/15/16
Sergei Vassilvitskii, Research Scientist, Google at MLconf NYC - 4/15/16
 
Hierachical clustering
Hierachical clusteringHierachical clustering
Hierachical clustering
 

Viewers also liked

Seo Web lên TOP Google
Seo Web lên TOP GoogleSeo Web lên TOP Google
Seo Web lên TOP Googlequyendo2508
 
PICKFAIR LOGO DESIGN AND GRAPHICS
PICKFAIR LOGO DESIGN AND GRAPHICSPICKFAIR LOGO DESIGN AND GRAPHICS
PICKFAIR LOGO DESIGN AND GRAPHICSMichelle Scott
 
Integrate Supply Chain Operations, Product Accelerate Product Delivery and Ma...
Integrate Supply Chain Operations, Product Accelerate Product Delivery and Ma...Integrate Supply Chain Operations, Product Accelerate Product Delivery and Ma...
Integrate Supply Chain Operations, Product Accelerate Product Delivery and Ma...YASH Technologies
 
Entrada a la bios
Entrada a la biosEntrada a la bios
Entrada a la biosxabi30
 
Fundraising and the Next Generation 10.2012
Fundraising and the Next Generation 10.2012Fundraising and the Next Generation 10.2012
Fundraising and the Next Generation 10.2012Emily Davis Consulting
 
ИАСУ железнодорожная логистика
ИАСУ железнодорожная логистикаИАСУ железнодорожная логистика
ИАСУ железнодорожная логистикаДенис Евдокимов
 
SAP ERP Success Story @ YASH Chemical Company Achieved Operational Excellence...
SAP ERP Success Story @ YASH Chemical Company Achieved Operational Excellence...SAP ERP Success Story @ YASH Chemical Company Achieved Operational Excellence...
SAP ERP Success Story @ YASH Chemical Company Achieved Operational Excellence...YASH Technologies
 
Ville de pekin
Ville de pekinVille de pekin
Ville de pekinF ztm
 
Vocational Training Providers (VTPs)
Vocational Training Providers (VTPs)Vocational Training Providers (VTPs)
Vocational Training Providers (VTPs)Sujai.G Pillai
 
Fix Steam Errors in Windows 10
Fix Steam Errors in Windows 10Fix Steam Errors in Windows 10
Fix Steam Errors in Windows 10Tech Geek
 
Microsoft .net application management services
Microsoft .net application management servicesMicrosoft .net application management services
Microsoft .net application management servicesYASH Technologies
 
Argus Production Monitoring at Salesforce
Argus Production Monitoring at Salesforce Argus Production Monitoring at Salesforce
Argus Production Monitoring at Salesforce HBaseCon
 
OntoSoft: A Distributed Semantic Registry for Scientific Software
OntoSoft: A Distributed Semantic Registry for Scientific SoftwareOntoSoft: A Distributed Semantic Registry for Scientific Software
OntoSoft: A Distributed Semantic Registry for Scientific Softwaredgarijo
 
Bài giảng khóa huấn luyện SmeBrand - Chiến lược thương hiệu dành cho doanh ng...
Bài giảng khóa huấn luyện SmeBrand - Chiến lược thương hiệu dành cho doanh ng...Bài giảng khóa huấn luyện SmeBrand - Chiến lược thương hiệu dành cho doanh ng...
Bài giảng khóa huấn luyện SmeBrand - Chiến lược thương hiệu dành cho doanh ng...THANHS BRANDING & MANAGEMENT COMPANY
 
Основы шести сигм
Основы шести сигмОсновы шести сигм
Основы шести сигмSixSigmaOnline
 
YASH Services for SAP HANA Migration
YASH Services for SAP HANA MigrationYASH Services for SAP HANA Migration
YASH Services for SAP HANA MigrationYASH Technologies
 

Viewers also liked (20)

Seo Web lên TOP Google
Seo Web lên TOP GoogleSeo Web lên TOP Google
Seo Web lên TOP Google
 
Los valores1
Los valores1Los valores1
Los valores1
 
PICKFAIR LOGO DESIGN AND GRAPHICS
PICKFAIR LOGO DESIGN AND GRAPHICSPICKFAIR LOGO DESIGN AND GRAPHICS
PICKFAIR LOGO DESIGN AND GRAPHICS
 
Creacion de youtube
Creacion de youtubeCreacion de youtube
Creacion de youtube
 
Integrate Supply Chain Operations, Product Accelerate Product Delivery and Ma...
Integrate Supply Chain Operations, Product Accelerate Product Delivery and Ma...Integrate Supply Chain Operations, Product Accelerate Product Delivery and Ma...
Integrate Supply Chain Operations, Product Accelerate Product Delivery and Ma...
 
Casanare
CasanareCasanare
Casanare
 
Christian arturo-garcia-eje4-ejercicio1
Christian arturo-garcia-eje4-ejercicio1Christian arturo-garcia-eje4-ejercicio1
Christian arturo-garcia-eje4-ejercicio1
 
Entrada a la bios
Entrada a la biosEntrada a la bios
Entrada a la bios
 
Fundraising and the Next Generation 10.2012
Fundraising and the Next Generation 10.2012Fundraising and the Next Generation 10.2012
Fundraising and the Next Generation 10.2012
 
ИАСУ железнодорожная логистика
ИАСУ железнодорожная логистикаИАСУ железнодорожная логистика
ИАСУ железнодорожная логистика
 
SAP ERP Success Story @ YASH Chemical Company Achieved Operational Excellence...
SAP ERP Success Story @ YASH Chemical Company Achieved Operational Excellence...SAP ERP Success Story @ YASH Chemical Company Achieved Operational Excellence...
SAP ERP Success Story @ YASH Chemical Company Achieved Operational Excellence...
 
Ville de pekin
Ville de pekinVille de pekin
Ville de pekin
 
Vocational Training Providers (VTPs)
Vocational Training Providers (VTPs)Vocational Training Providers (VTPs)
Vocational Training Providers (VTPs)
 
Fix Steam Errors in Windows 10
Fix Steam Errors in Windows 10Fix Steam Errors in Windows 10
Fix Steam Errors in Windows 10
 
Microsoft .net application management services
Microsoft .net application management servicesMicrosoft .net application management services
Microsoft .net application management services
 
Argus Production Monitoring at Salesforce
Argus Production Monitoring at Salesforce Argus Production Monitoring at Salesforce
Argus Production Monitoring at Salesforce
 
OntoSoft: A Distributed Semantic Registry for Scientific Software
OntoSoft: A Distributed Semantic Registry for Scientific SoftwareOntoSoft: A Distributed Semantic Registry for Scientific Software
OntoSoft: A Distributed Semantic Registry for Scientific Software
 
Bài giảng khóa huấn luyện SmeBrand - Chiến lược thương hiệu dành cho doanh ng...
Bài giảng khóa huấn luyện SmeBrand - Chiến lược thương hiệu dành cho doanh ng...Bài giảng khóa huấn luyện SmeBrand - Chiến lược thương hiệu dành cho doanh ng...
Bài giảng khóa huấn luyện SmeBrand - Chiến lược thương hiệu dành cho doanh ng...
 
Основы шести сигм
Основы шести сигмОсновы шести сигм
Основы шести сигм
 
YASH Services for SAP HANA Migration
YASH Services for SAP HANA MigrationYASH Services for SAP HANA Migration
YASH Services for SAP HANA Migration
 

Similar to gSkeletonClu - Revealing density-based clustering structure from the core-connected tree of a network

Neural Networks: Principal Component Analysis (PCA)
Neural Networks: Principal Component Analysis (PCA)Neural Networks: Principal Component Analysis (PCA)
Neural Networks: Principal Component Analysis (PCA)Mostafa G. M. Mostafa
 
Objects as points (CenterNet) review [CDM]
Objects as points (CenterNet) review [CDM]Objects as points (CenterNet) review [CDM]
Objects as points (CenterNet) review [CDM]Dongmin Choi
 
11ClusAdvanced.ppt
11ClusAdvanced.ppt11ClusAdvanced.ppt
11ClusAdvanced.pptSueMiu
 
Data Mining: Concepts and techniques: Chapter 11,Review: Basic Cluster Analys...
Data Mining: Concepts and techniques: Chapter 11,Review: Basic Cluster Analys...Data Mining: Concepts and techniques: Chapter 11,Review: Basic Cluster Analys...
Data Mining: Concepts and techniques: Chapter 11,Review: Basic Cluster Analys...Salah Amean
 
iiit delhi unsupervised pdf.pdf
iiit delhi unsupervised pdf.pdfiiit delhi unsupervised pdf.pdf
iiit delhi unsupervised pdf.pdfVIKASGUPTA127897
 
Chapter 11. Cluster Analysis Advanced Methods.ppt
Chapter 11. Cluster Analysis Advanced Methods.pptChapter 11. Cluster Analysis Advanced Methods.ppt
Chapter 11. Cluster Analysis Advanced Methods.pptSubrata Kumer Paul
 
Unsupervised learning clustering
Unsupervised learning clusteringUnsupervised learning clustering
Unsupervised learning clusteringArshad Farhad
 
ML basic & clustering
ML basic & clusteringML basic & clustering
ML basic & clusteringmonalisa Das
 
Tailored Bregman Ball Trees for Effective Nearest Neighbors
Tailored Bregman Ball Trees for Effective Nearest NeighborsTailored Bregman Ball Trees for Effective Nearest Neighbors
Tailored Bregman Ball Trees for Effective Nearest NeighborsFrank Nielsen
 
Chapter 11 cluster advanced, Han & Kamber
Chapter 11 cluster advanced, Han & KamberChapter 11 cluster advanced, Han & Kamber
Chapter 11 cluster advanced, Han & KamberHouw Liong The
 
Chapter 11 cluster advanced : web and text mining
Chapter 11 cluster advanced : web and text miningChapter 11 cluster advanced : web and text mining
Chapter 11 cluster advanced : web and text miningHouw Liong The
 
Presentation on unsupervised learning
Presentation on unsupervised learning Presentation on unsupervised learning
Presentation on unsupervised learning ANKUSH PAL
 
Density based methods
Density based methodsDensity based methods
Density based methodsSVijaylakshmi
 
Business analytics course in delhi
Business analytics course in delhiBusiness analytics course in delhi
Business analytics course in delhibhuvan8999
 
data science course in delhi
data science course in delhidata science course in delhi
data science course in delhidevipatnala1
 
business analytics course in delhi
business analytics course in delhibusiness analytics course in delhi
business analytics course in delhidevipatnala1
 
data science training in hyderabad
data science training in hyderabaddata science training in hyderabad
data science training in hyderabaddevipatnala1
 
data science training
data science trainingdata science training
data science trainingdevipatnala1
 

Similar to gSkeletonClu - Revealing density-based clustering structure from the core-connected tree of a network (20)

Neural Networks: Principal Component Analysis (PCA)
Neural Networks: Principal Component Analysis (PCA)Neural Networks: Principal Component Analysis (PCA)
Neural Networks: Principal Component Analysis (PCA)
 
Birch
BirchBirch
Birch
 
Objects as points (CenterNet) review [CDM]
Objects as points (CenterNet) review [CDM]Objects as points (CenterNet) review [CDM]
Objects as points (CenterNet) review [CDM]
 
11ClusAdvanced.ppt
11ClusAdvanced.ppt11ClusAdvanced.ppt
11ClusAdvanced.ppt
 
Data Mining: Concepts and techniques: Chapter 11,Review: Basic Cluster Analys...
Data Mining: Concepts and techniques: Chapter 11,Review: Basic Cluster Analys...Data Mining: Concepts and techniques: Chapter 11,Review: Basic Cluster Analys...
Data Mining: Concepts and techniques: Chapter 11,Review: Basic Cluster Analys...
 
11 clusadvanced
11 clusadvanced11 clusadvanced
11 clusadvanced
 
iiit delhi unsupervised pdf.pdf
iiit delhi unsupervised pdf.pdfiiit delhi unsupervised pdf.pdf
iiit delhi unsupervised pdf.pdf
 
Chapter 11. Cluster Analysis Advanced Methods.ppt
Chapter 11. Cluster Analysis Advanced Methods.pptChapter 11. Cluster Analysis Advanced Methods.ppt
Chapter 11. Cluster Analysis Advanced Methods.ppt
 
Unsupervised learning clustering
Unsupervised learning clusteringUnsupervised learning clustering
Unsupervised learning clustering
 
ML basic & clustering
ML basic & clusteringML basic & clustering
ML basic & clustering
 
Tailored Bregman Ball Trees for Effective Nearest Neighbors
Tailored Bregman Ball Trees for Effective Nearest NeighborsTailored Bregman Ball Trees for Effective Nearest Neighbors
Tailored Bregman Ball Trees for Effective Nearest Neighbors
 
Chapter 11 cluster advanced, Han & Kamber
Chapter 11 cluster advanced, Han & KamberChapter 11 cluster advanced, Han & Kamber
Chapter 11 cluster advanced, Han & Kamber
 
Chapter 11 cluster advanced : web and text mining
Chapter 11 cluster advanced : web and text miningChapter 11 cluster advanced : web and text mining
Chapter 11 cluster advanced : web and text mining
 
Presentation on unsupervised learning
Presentation on unsupervised learning Presentation on unsupervised learning
Presentation on unsupervised learning
 
Density based methods
Density based methodsDensity based methods
Density based methods
 
Business analytics course in delhi
Business analytics course in delhiBusiness analytics course in delhi
Business analytics course in delhi
 
data science course in delhi
data science course in delhidata science course in delhi
data science course in delhi
 
business analytics course in delhi
business analytics course in delhibusiness analytics course in delhi
business analytics course in delhi
 
data science training in hyderabad
data science training in hyderabaddata science training in hyderabad
data science training in hyderabad
 
data science training
data science trainingdata science training
data science training
 

Recently uploaded

Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...Product School
 
AI revolution and Salesforce, Jiří Karpíšek
AI revolution and Salesforce, Jiří KarpíšekAI revolution and Salesforce, Jiří Karpíšek
AI revolution and Salesforce, Jiří KarpíšekCzechDreamin
 
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...Product School
 
Unpacking Value Delivery - Agile Oxford Meetup - May 2024.pptx
Unpacking Value Delivery - Agile Oxford Meetup - May 2024.pptxUnpacking Value Delivery - Agile Oxford Meetup - May 2024.pptx
Unpacking Value Delivery - Agile Oxford Meetup - May 2024.pptxDavid Michel
 
Introduction to Open Source RAG and RAG Evaluation
Introduction to Open Source RAG and RAG EvaluationIntroduction to Open Source RAG and RAG Evaluation
Introduction to Open Source RAG and RAG EvaluationZilliz
 
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...Product School
 
Assuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyesAssuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyesThousandEyes
 
JMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and GrafanaJMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and GrafanaRTTS
 
UiPath Test Automation using UiPath Test Suite series, part 2
UiPath Test Automation using UiPath Test Suite series, part 2UiPath Test Automation using UiPath Test Suite series, part 2
UiPath Test Automation using UiPath Test Suite series, part 2DianaGray10
 
Optimizing NoSQL Performance Through Observability
Optimizing NoSQL Performance Through ObservabilityOptimizing NoSQL Performance Through Observability
Optimizing NoSQL Performance Through ObservabilityScyllaDB
 
Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........Alison B. Lowndes
 
Free and Effective: Making Flows Publicly Accessible, Yumi Ibrahimzade
Free and Effective: Making Flows Publicly Accessible, Yumi IbrahimzadeFree and Effective: Making Flows Publicly Accessible, Yumi Ibrahimzade
Free and Effective: Making Flows Publicly Accessible, Yumi IbrahimzadeCzechDreamin
 
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo DiehlFuture Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo DiehlPeter Udo Diehl
 
Integrating Telephony Systems with Salesforce: Insights and Considerations, B...
Integrating Telephony Systems with Salesforce: Insights and Considerations, B...Integrating Telephony Systems with Salesforce: Insights and Considerations, B...
Integrating Telephony Systems with Salesforce: Insights and Considerations, B...CzechDreamin
 
Essentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with ParametersEssentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with ParametersSafe Software
 
"Impact of front-end architecture on development cost", Viktor Turskyi
"Impact of front-end architecture on development cost", Viktor Turskyi"Impact of front-end architecture on development cost", Viktor Turskyi
"Impact of front-end architecture on development cost", Viktor TurskyiFwdays
 
Exploring UiPath Orchestrator API: updates and limits in 2024 🚀
Exploring UiPath Orchestrator API: updates and limits in 2024 🚀Exploring UiPath Orchestrator API: updates and limits in 2024 🚀
Exploring UiPath Orchestrator API: updates and limits in 2024 🚀DianaGray10
 
Custom Approval Process: A New Perspective, Pavel Hrbacek & Anindya Halder
Custom Approval Process: A New Perspective, Pavel Hrbacek & Anindya HalderCustom Approval Process: A New Perspective, Pavel Hrbacek & Anindya Halder
Custom Approval Process: A New Perspective, Pavel Hrbacek & Anindya HalderCzechDreamin
 
10 Differences between Sales Cloud and CPQ, Blanka Doktorová
10 Differences between Sales Cloud and CPQ, Blanka Doktorová10 Differences between Sales Cloud and CPQ, Blanka Doktorová
10 Differences between Sales Cloud and CPQ, Blanka DoktorováCzechDreamin
 
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualitySoftware Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualityInflectra
 

Recently uploaded (20)

Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
 
AI revolution and Salesforce, Jiří Karpíšek
AI revolution and Salesforce, Jiří KarpíšekAI revolution and Salesforce, Jiří Karpíšek
AI revolution and Salesforce, Jiří Karpíšek
 
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
 
Unpacking Value Delivery - Agile Oxford Meetup - May 2024.pptx
Unpacking Value Delivery - Agile Oxford Meetup - May 2024.pptxUnpacking Value Delivery - Agile Oxford Meetup - May 2024.pptx
Unpacking Value Delivery - Agile Oxford Meetup - May 2024.pptx
 
Introduction to Open Source RAG and RAG Evaluation
Introduction to Open Source RAG and RAG EvaluationIntroduction to Open Source RAG and RAG Evaluation
Introduction to Open Source RAG and RAG Evaluation
 
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...
 
Assuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyesAssuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyes
 
JMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and GrafanaJMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and Grafana
 
UiPath Test Automation using UiPath Test Suite series, part 2
UiPath Test Automation using UiPath Test Suite series, part 2UiPath Test Automation using UiPath Test Suite series, part 2
UiPath Test Automation using UiPath Test Suite series, part 2
 
Optimizing NoSQL Performance Through Observability
Optimizing NoSQL Performance Through ObservabilityOptimizing NoSQL Performance Through Observability
Optimizing NoSQL Performance Through Observability
 
Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........
 
Free and Effective: Making Flows Publicly Accessible, Yumi Ibrahimzade
Free and Effective: Making Flows Publicly Accessible, Yumi IbrahimzadeFree and Effective: Making Flows Publicly Accessible, Yumi Ibrahimzade
Free and Effective: Making Flows Publicly Accessible, Yumi Ibrahimzade
 
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo DiehlFuture Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
 
Integrating Telephony Systems with Salesforce: Insights and Considerations, B...
Integrating Telephony Systems with Salesforce: Insights and Considerations, B...Integrating Telephony Systems with Salesforce: Insights and Considerations, B...
Integrating Telephony Systems with Salesforce: Insights and Considerations, B...
 
Essentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with ParametersEssentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with Parameters
 
"Impact of front-end architecture on development cost", Viktor Turskyi
"Impact of front-end architecture on development cost", Viktor Turskyi"Impact of front-end architecture on development cost", Viktor Turskyi
"Impact of front-end architecture on development cost", Viktor Turskyi
 
Exploring UiPath Orchestrator API: updates and limits in 2024 🚀
Exploring UiPath Orchestrator API: updates and limits in 2024 🚀Exploring UiPath Orchestrator API: updates and limits in 2024 🚀
Exploring UiPath Orchestrator API: updates and limits in 2024 🚀
 
Custom Approval Process: A New Perspective, Pavel Hrbacek & Anindya Halder
Custom Approval Process: A New Perspective, Pavel Hrbacek & Anindya HalderCustom Approval Process: A New Perspective, Pavel Hrbacek & Anindya Halder
Custom Approval Process: A New Perspective, Pavel Hrbacek & Anindya Halder
 
10 Differences between Sales Cloud and CPQ, Blanka Doktorová
10 Differences between Sales Cloud and CPQ, Blanka Doktorová10 Differences between Sales Cloud and CPQ, Blanka Doktorová
10 Differences between Sales Cloud and CPQ, Blanka Doktorová
 
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualitySoftware Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
 

gSkeletonClu - Revealing density-based clustering structure from the core-connected tree of a network

  • 1. gSkeletonClu [1] Revealing density-based clustering structure from the core-connected tree of a network [1]Huang, J., Sun, H., Song, Q., Deng, H., & Han, J. (2013). Revealing density-based clustering structure from the core-connected tree of a network. IEEE Transactions on Knowledge and Data Engineering, 25(8), 1876–1889. http://doi.org/10.1109/TKDE.2012.100 http://ieeexplore.ieee.org/xpl/login.jsp?tp=&arnumber=6200274&url=http%3A%2F%2Fieeexplore.ieee.org%2Fiel5%2F69%2F4358933%2F06200274.pdf% 3Farnumber%3D6200274
  • 2. Abstract Objective: Identify communities and vertices roles in a weighted network
  • 3. Overview Given a weighted network… 1- Calculate its CCMST with the Core-Connectivity Similarity 2- Find the components called (Structure Core-Connected) ● Components that contains the core 3- Attach the vertex classified as border 4 - Identify the Hubs and Outlier
  • 4. Def1) Neighborhood Neighborhood of n1: r(n1) = {n1, n2, n3, n8}
  • 5. Def2) Structural Similarity num = 2*weight(n1, n8) = 20 denA = denB = 1 num += 10*10 = 120 denA += Sqr[(10*10) + (10*10)] = 14.14 denB += Sqr[(10*10) + (10*10) + (10*10) + (5*5)] = 18.02 σ(n1, n8) = num/(denA*denB) *Note = Initial values of num and den are a mysterious
  • 7. Def3) Ɛ-Neighborhood Ɛ-Neighborhood for n1: ● Ɛ = 0.47 ● rƐ(n1) = {n1, n2, n8}
  • 8. Def4) Core if | (u) | >= μ, then u is a core. Denoted by Kε,μ (u) Considering, ● Ɛ = 0.47 ● μ = 3, so... ● Kε,μ (n1)
  • 9. Def5) Directly Structure-Reachable If u is a core AND v belongs to Ɛ(u). So: ● u ⟼ ε,μv ○ n1 ⟼ ε,μn8
  • 10. Def6) Hubs and Outliers if h does not belong to any cluster AND if h bridges multiples cluster, such that: h E r(u) ^ h E r(v) then h is hub. If not hub: v is Outlier
  • 11. Def6) Hubs and Outliers if h does not belong to any cluster AND if h bridges multiples cluster, such that: ● h E r(u) ^ h E r(v) then h is hub. If not hub: v is Outlier hub outlier
  • 12. Def7) Structure Core-Similarity CS(n1) candidates... 1. (n1, n0) - 0.08 2. (n1, n2) - 0.68 3. (n1, n3) - 0.43 4. (n1, n8) - 0.47 Ɛ
  • 13. Def7) Structure Core-Similarity CS(n1) candidates... 1. (n1, n0) - 0.08 2. (n1, n2) - 0.68 3. (n1, n3) - 0.43 4. (n1, n8) - 0.47 Ɛ
  • 14. Def8) Reachability-Similarity RS(n6, n7) = min {0.51, 0.1} = 0.1 RS(n6, n4) = 0.51 RS(n6, n5) = 0.55 --- RS(n7, n6) = min {0, 0.1} = 0 Asymmetric!!!!
  • 15. Def9) Core Connectivity Similarity CCS(n6, n4) = 0.51 CCS(n6, n5) = 0.51 CCS(n6, n7) = 0
  • 16. Def9) Core Connectivity Similarity CCS(n6, n4) = 0.51 CCS(n6, n5) = 0.51 CCS(n6, n7) = 0
  • 17. Def10) Structure Core-Connected Given Ɛ E IR, μ E IN; u, v E V; u, and v are directly core-connected with each other if and only if: ● Kε,μ (u) ^Kε,μ (v) ^ u ⟼ ε,μv This is denoted by: u ⟷ ε,μv gSkeletonClu will first try to find structures that respect this definition above, after that will append the "borders" ( vertex that are "directly structure reachable" but don't respect this def. above). At the end, the gSkeletonClu will separate the clusters, hubs and outliers.
  • 18. CCMST - Core-Connected Maximal Spanning Tree Instead to use the complete network the authors proved that it is possible to identify the Structure Core-Connected components from the CCMST, considering the weight as the "CCS(u,v)". Ɛ-Candidates: ● 0.51 ● 0.47 ● 0.43 ● 0.08 ● 0
  • 19. Core-Connected Components from CCMST Ɛ= 0.51 Ɛ= 0.47 Ɛ= 0.43 Ɛ= 0.08
  • 20. Attracting Indices for Attaching Borders RS(2,3) = 0.55 RS(1,3) = 0.43 -- AS(3) = 0.55
  • 21. Attracting Indices for Attaching Borders AS(3) = 0.55 Ɛ= 0.47 if AS(3) > Ɛ: n3 is attached to the cluster that contains n2.
  • 22. So What…. ? Let`s execute from scratch!
  • 23. Step 1 - Prepare your weapons!! Calculate the Weighted Core-Similarity NetworK Ɛ = 0.47 μ = 3 Weighted NetworK: Weighted Core-Similarity NetworK:
  • 24. Step 2- Point your weapons... Calculate the CCMST .
  • 25. Step 3A - Fire! Detect Core-Connected Components... Ɛ = 0.47 μ = 3 Ɛ= 0.47
  • 26. Step 3B - Fire again ! Attach the borders! Ɛ= 0.47
  • 27. Step 3C - Kill it, before it kills you! Detect Cluster, hubs and outliers n0 is a hub because: ● n0 does not belong to any cluster ● n0 bridges the clusters A and B. n7 is a outlier because: ● it is not a hub =( hub outlier
  • 28. Results - Guard the guns… You are the winner! (or just a survivor...)
  • 29. Clustering of Automatically Selected Ɛ If you have the Ɛ candidates extracted from the CCMST… AND... If you adopt a way to measure what is the best Ɛ... Then, you can automatically select the Ɛ parameter. One possible choice is to use the modularity Q as a quality measure of network clustering. The Q value belongs to [0,1]. The higher the value close to 1 indicates a better clustering result. In a nutshell… You should run the gSkeletonClu for all Ɛ candidates and based on a quality index, choose the best partition!!!
  • 30. Did you like? There is more! From the CCMST is possible to extract the clustering hierarchy… (next opportunity) Limitation ● The gSkletonClu just can be applied on networks! ● In the author`s paper of gSkeletonClu, the tests show that it is slower than SCAN… ● Maybe it can not work in BIG networks. (more than 1 million of vertex) ○ SCAN ++ (Shiokawa, 2015) [1][2] did tests in BIG networks and could not perform the gSkeleton on them… Have fun! [1] http://www.vldb.org/pvldb/vol8/p1178-shiokawa.pdf [2] htp://pt.slideshare.net/LazyShion/scan-efficient-algorithm-for-finding-clusters-hubs-and-outliers-on-largescale-graphs-vldb-2015
  • 31. Presentation created by: Danilo Amaral de Oliveira oliveiradanilo@usp.br Thank you!