SlideShare a Scribd company logo
1 of 22
Download to read offline
Action and Content Based
Community Detection in
Social Networks
Prabhsimran Singh Baweja
Prakhar Sharma
Ritesh Modi
Vaishali Pal
Mentored By: Prateek Mehta
Graph
Graph is a collection of objects where some objects are
connected by link.
Mathematically –
G = (V, E)
V – Vertices
E - Edges
MotivationSocial network analysis focuses on mining hidden semantics in a setting
involving interacting agents. Collaboration between them defines same kind
of behavior.
Our motivation is to use edge attribute weights along with the links to find
communities.
Entities are modeled as vertices, edges capture the relationship between
them.
Community
A community is a collection objects/people sharing the same
interests or having same characteristics.
E.g., People liking Jazz music might belong to one community
and the people like Folk might belong to the other.
Community Detection
Community detection is the task of extracting densely-knit groups within the network.
Unsupervised learning problem, addressed using analysis of linkage, node attributes,
etc.
Given: G = (V, E)
Output: C1, C2, ..., Ck ,
Ci ∩ Cj = φ , ∀i ≠ j , ∪ Ci = V
1≤i≤k
Communities represent a coarse grained view of the network, can be mapped to the
functional units of the network.
Community Structures in Real World
Internet U.S. Football Network
Power Grid Network Books Network
Modeling Communities as Graphs
Notion of communities is often defined as a graph structure, G
= (V,E), representing set of objects E, and their linkages V.
Given a graph, a community is defined as collection of nodes
that are more densely connected to each other than to the
other nodes in the network.
Multiple Communities
State-of-the-art
Modularity
Measure of denseness of connections between nodes of same
module and sparse connections between different modules.
Higher Modularity, well defined compact communities.
Modularity Maximization
A(i,j): Observed number of intra-community edges
KiKj / 2m : expected no. of edges between i and j if placed randomly
Modularity Maximization
Efficient Solution – Louvain Algorithm
Initially each node belongs to its own community
We go through each node and assign them to its neighbours
community as long as its leads to increase in modularity.
This is followed until modularity cannot be maximized further
Vincent D. Blondel, Jean-Loup Guillaume, Renaud Lambiotteand Etienne Lefebvre: Fast
unfolding of communities in large networks
• Edge Weights between community nodes are defined by the number of
inter-community edges.
• Folding ensures rapid decrease in the number of nodes that need to be
examined and thus enables large-scale application of the method.
Dataset
Set of Flickr Images metadata
Number of Images = 268649
Number of Authors (who posted at least 1 image) = 58522
Total numbers of Tags = 4932402
Number of Unique Groups = 203466
Number of Unique Galleries = 67859
Dataset Description
Graph |V| |E| Characteristics (each pair of author)
Edges formed
between authors
who used same
tag(s)
58522 1491950 • Cosine Similarity b/w tags used
• Group Contribution and Popularity
Distribution
• Gallery Contribution and Popularity
Distribution
Edges formed
between authors
who share
comments
589461 6012634 • Jaccard Similarity for comments shared
• Group Contribution and Popularity
Distribution
• Gallery Contribution and Popularity
Distribution
Dataset Analysis
• Tag Distribution
• Average Number of Tags used by an Author per photo
• Photo Distribution
• Number of Authors versus number of photos posted by them
• Logarithmic scale for Y-Axis values
• Group Distribution
• Number of Authors versus number of groups they posted photos in.
• Logarithmic scale for Y-Axis values
• Gallery Distribution
• Number of Authors versus number of galleries they posted photos in.
• Logarithmic scale for Y-Axis values
Formulae
Jaccard Similarity
J(A,B) =
𝑃(𝐴∩𝐵)
𝑃(𝐴∪𝐵)
Symmetrised KL Divergence
where
Group Contribution
P (A1, G1) = No. of Photos by A1 in G1 / Total photos in G1
P (A2, G1) = No. of Photos by A2 in G1 / Total photos in G1
Group Popularity
P (A1, G1) = No. of Photos by A1 in G1 / Total photos of A1
Gallery Contribution
P (A1, G1) = No. of Photos by A1 in G1 / Total photos in G1
Gallery Popularity
P (A1, G1) = No. of Photos by A1 in G1 / Total photos of A1
Results
Graph Modularity Number of Communities
Edges formed between
authors who used same tag(s)
(Weighted) (Cosine Similarity)
0.6432 80
Edges formed between
authors who used same tag(s)
(Weighted) (Jaccard Index)
0.5723 1904
Edges formed between
authors who used same tag(s)
(Unweighted)
0.6092 7
Edges formed between
authors who share comments
(Weighted)
0.4306 1092
Edges formed between
authors who share comments
(Unweighted)
0.3372 1087
Conclusion
As we can see, when we incorporate textual content
i.e. hash tags and their context using Cosine Similarity,
we see a good gain in modularity. Also, it results in
more compact communities.
In the second graph, edge weight is only on the basis
of count of comments shared. The more the number
of comments shared between them, the less the
Future Work
After successful identification of communities, we can also find
most influential author for each communities. With this
information, while someone is posting new images, he can use
the tags used the author or even can mention the author. Since,
the author is the most famous person, the image is likely to get
more hits.

More Related Content

What's hot

Community Detection
Community DetectionCommunity Detection
Community DetectionIlio Catallo
 
Overlapping community detection survey
Overlapping community detection surveyOverlapping community detection survey
Overlapping community detection survey煜林 车
 
Community detection in social networks[1]
Community detection in social networks[1]Community detection in social networks[1]
Community detection in social networks[1]sdnumaygmailcom
 
Clique-based Network Clustering
Clique-based Network ClusteringClique-based Network Clustering
Clique-based Network ClusteringGuang Ouyang
 
Recomendation system: Community Detection Based Recomendation System using Hy...
Recomendation system: Community Detection Based Recomendation System using Hy...Recomendation system: Community Detection Based Recomendation System using Hy...
Recomendation system: Community Detection Based Recomendation System using Hy...Rajul Kukreja
 
Community detection algorithms
Community detection algorithmsCommunity detection algorithms
Community detection algorithmsAlireza Andalib
 
Community Detection
Community Detection Community Detection
Community Detection Kanika Kanwal
 
Community detection
Community detectionCommunity detection
Community detectionScott Pauls
 
Community Detection in Social Media
Community Detection in Social MediaCommunity Detection in Social Media
Community Detection in Social Mediarezahk
 
Entropy based algorithm for community detection in augmented networks
Entropy based algorithm for community detection in augmented networksEntropy based algorithm for community detection in augmented networks
Entropy based algorithm for community detection in augmented networksJuan David Cruz-Gómez
 
Community Extracting Using Intersection Graph and Content Analysis in Complex...
Community Extracting Using Intersection Graph and Content Analysis in Complex...Community Extracting Using Intersection Graph and Content Analysis in Complex...
Community Extracting Using Intersection Graph and Content Analysis in Complex...kuramochi
 
Spatial patterns in evolutionary games on scale-free networks and multiplexes
Spatial patterns in evolutionary games on scale-free networks and multiplexesSpatial patterns in evolutionary games on scale-free networks and multiplexes
Spatial patterns in evolutionary games on scale-free networks and multiplexesKolja Kleineberg
 
FaceNet: A Unified Embedding for Face Recognition and Clustering
FaceNet: A Unified Embedding for Face Recognition and ClusteringFaceNet: A Unified Embedding for Face Recognition and Clustering
FaceNet: A Unified Embedding for Face Recognition and ClusteringWilly Marroquin (WillyDevNET)
 
The Hidden Geometry of Multiplex Networks @ Next Generation Network Analytics
The Hidden Geometry of Multiplex Networks @ Next Generation Network Analytics The Hidden Geometry of Multiplex Networks @ Next Generation Network Analytics
The Hidden Geometry of Multiplex Networks @ Next Generation Network Analytics Kolja Kleineberg
 
Techniques that Facebook use to Analyze and QuerySocial Graphs
Techniques that Facebook use to Analyze and QuerySocial GraphsTechniques that Facebook use to Analyze and QuerySocial Graphs
Techniques that Facebook use to Analyze and QuerySocial GraphsHaneen Droubi
 
Magnet community identification on social networks
Magnet community identification on social networksMagnet community identification on social networks
Magnet community identification on social networksmoresmile
 
MobiCom CHANTS
MobiCom CHANTSMobiCom CHANTS
MobiCom CHANTSgsthakur
 
Topological Data Analysis of Complex Spatial Systems
Topological Data Analysis of Complex Spatial SystemsTopological Data Analysis of Complex Spatial Systems
Topological Data Analysis of Complex Spatial SystemsMason Porter
 
Topological Data Analysis of Complex Spatial Systems
Topological Data Analysis of Complex Spatial SystemsTopological Data Analysis of Complex Spatial Systems
Topological Data Analysis of Complex Spatial SystemsMason Porter
 

What's hot (20)

Community Detection
Community DetectionCommunity Detection
Community Detection
 
Overlapping community detection survey
Overlapping community detection surveyOverlapping community detection survey
Overlapping community detection survey
 
Community detection in social networks[1]
Community detection in social networks[1]Community detection in social networks[1]
Community detection in social networks[1]
 
Clique-based Network Clustering
Clique-based Network ClusteringClique-based Network Clustering
Clique-based Network Clustering
 
Recomendation system: Community Detection Based Recomendation System using Hy...
Recomendation system: Community Detection Based Recomendation System using Hy...Recomendation system: Community Detection Based Recomendation System using Hy...
Recomendation system: Community Detection Based Recomendation System using Hy...
 
Community detection algorithms
Community detection algorithmsCommunity detection algorithms
Community detection algorithms
 
Community Detection
Community Detection Community Detection
Community Detection
 
Community detection
Community detectionCommunity detection
Community detection
 
Community Detection in Social Media
Community Detection in Social MediaCommunity Detection in Social Media
Community Detection in Social Media
 
Entropy based algorithm for community detection in augmented networks
Entropy based algorithm for community detection in augmented networksEntropy based algorithm for community detection in augmented networks
Entropy based algorithm for community detection in augmented networks
 
Community Extracting Using Intersection Graph and Content Analysis in Complex...
Community Extracting Using Intersection Graph and Content Analysis in Complex...Community Extracting Using Intersection Graph and Content Analysis in Complex...
Community Extracting Using Intersection Graph and Content Analysis in Complex...
 
Spatial patterns in evolutionary games on scale-free networks and multiplexes
Spatial patterns in evolutionary games on scale-free networks and multiplexesSpatial patterns in evolutionary games on scale-free networks and multiplexes
Spatial patterns in evolutionary games on scale-free networks and multiplexes
 
FaceNet: A Unified Embedding for Face Recognition and Clustering
FaceNet: A Unified Embedding for Face Recognition and ClusteringFaceNet: A Unified Embedding for Face Recognition and Clustering
FaceNet: A Unified Embedding for Face Recognition and Clustering
 
The Hidden Geometry of Multiplex Networks @ Next Generation Network Analytics
The Hidden Geometry of Multiplex Networks @ Next Generation Network Analytics The Hidden Geometry of Multiplex Networks @ Next Generation Network Analytics
The Hidden Geometry of Multiplex Networks @ Next Generation Network Analytics
 
Techniques that Facebook use to Analyze and QuerySocial Graphs
Techniques that Facebook use to Analyze and QuerySocial GraphsTechniques that Facebook use to Analyze and QuerySocial Graphs
Techniques that Facebook use to Analyze and QuerySocial Graphs
 
Magnet community identification on social networks
Magnet community identification on social networksMagnet community identification on social networks
Magnet community identification on social networks
 
MobiCom CHANTS
MobiCom CHANTSMobiCom CHANTS
MobiCom CHANTS
 
Topological Data Analysis of Complex Spatial Systems
Topological Data Analysis of Complex Spatial SystemsTopological Data Analysis of Complex Spatial Systems
Topological Data Analysis of Complex Spatial Systems
 
Face recognition using LDA
Face recognition using LDAFace recognition using LDA
Face recognition using LDA
 
Topological Data Analysis of Complex Spatial Systems
Topological Data Analysis of Complex Spatial SystemsTopological Data Analysis of Complex Spatial Systems
Topological Data Analysis of Complex Spatial Systems
 

Similar to Action and content based Community Detection in Social Networks

Community DetectionSlide
Community DetectionSlideCommunity DetectionSlide
Community DetectionSlideAshwini Tokekar
 
Quantum persistent k cores for community detection
Quantum persistent k cores for community detectionQuantum persistent k cores for community detection
Quantum persistent k cores for community detectionColleen Farrelly
 
Group and Community Detection in Social Networks
Group and Community Detection in Social NetworksGroup and Community Detection in Social Networks
Group and Community Detection in Social NetworksKent State University
 
Scalable Static and Dynamic Community Detection Using Grappolo : NOTES
Scalable Static and Dynamic Community Detection Using Grappolo : NOTESScalable Static and Dynamic Community Detection Using Grappolo : NOTES
Scalable Static and Dynamic Community Detection Using Grappolo : NOTESSubhajit Sahu
 
From Signal to Symbols
From Signal to SymbolsFrom Signal to Symbols
From Signal to Symbolsgpano
 
lec3_socialnetwork_part1.pptx
lec3_socialnetwork_part1.pptxlec3_socialnetwork_part1.pptx
lec3_socialnetwork_part1.pptxdatapro2
 
Predicting Communication Intention in Social Media
Predicting Communication Intention in Social MediaPredicting Communication Intention in Social Media
Predicting Communication Intention in Social MediaCharalampos Chelmis
 
Taxonomy and survey of community
Taxonomy and survey of communityTaxonomy and survey of community
Taxonomy and survey of communityIJCSES Journal
 
Experimental categorization and deep visualization
 Experimental categorization and deep visualization Experimental categorization and deep visualization
Experimental categorization and deep visualizationEverardo Reyes-García
 
Graph Analyses with Python and NetworkX
Graph Analyses with Python and NetworkXGraph Analyses with Python and NetworkX
Graph Analyses with Python and NetworkXBenjamin Bengfort
 
SocialCom09-tutorial.pdf
SocialCom09-tutorial.pdfSocialCom09-tutorial.pdf
SocialCom09-tutorial.pdfBalasundaramSr
 
User Identity Linkage: Data Collection, DataSet Biases, Method, Control and A...
User Identity Linkage: Data Collection, DataSet Biases, Method, Control and A...User Identity Linkage: Data Collection, DataSet Biases, Method, Control and A...
User Identity Linkage: Data Collection, DataSet Biases, Method, Control and A...IIIT Hyderabad
 
cs224w-79-final
cs224w-79-finalcs224w-79-final
cs224w-79-finalDarren Koh
 
Social Network Analysis: What It Is, Why We Should Care, and What We Can Lear...
Social Network Analysis: What It Is, Why We Should Care, and What We Can Lear...Social Network Analysis: What It Is, Why We Should Care, and What We Can Lear...
Social Network Analysis: What It Is, Why We Should Care, and What We Can Lear...Xiaohan Zeng
 
Social Network Analysis (SNA) 2018
Social Network Analysis  (SNA) 2018Social Network Analysis  (SNA) 2018
Social Network Analysis (SNA) 2018Arsalan Khan
 
2011 IEEE Social Computing Nodexl: Group-In-A-Box
2011 IEEE Social Computing Nodexl: Group-In-A-Box2011 IEEE Social Computing Nodexl: Group-In-A-Box
2011 IEEE Social Computing Nodexl: Group-In-A-BoxMarc Smith
 
COMMUNITY DETECTION IN THE COLLABORATIVE WEB
COMMUNITY DETECTION IN THE COLLABORATIVE WEBCOMMUNITY DETECTION IN THE COLLABORATIVE WEB
COMMUNITY DETECTION IN THE COLLABORATIVE WEBIJMIT JOURNAL
 
A Dynamic Algorithm for Local Community Detection in Graphs : NOTES
A Dynamic Algorithm for Local Community Detection in Graphs : NOTESA Dynamic Algorithm for Local Community Detection in Graphs : NOTES
A Dynamic Algorithm for Local Community Detection in Graphs : NOTESSubhajit Sahu
 

Similar to Action and content based Community Detection in Social Networks (20)

Community DetectionSlide
Community DetectionSlideCommunity DetectionSlide
Community DetectionSlide
 
Quantum persistent k cores for community detection
Quantum persistent k cores for community detectionQuantum persistent k cores for community detection
Quantum persistent k cores for community detection
 
Group and Community Detection in Social Networks
Group and Community Detection in Social NetworksGroup and Community Detection in Social Networks
Group and Community Detection in Social Networks
 
Scalable Static and Dynamic Community Detection Using Grappolo : NOTES
Scalable Static and Dynamic Community Detection Using Grappolo : NOTESScalable Static and Dynamic Community Detection Using Grappolo : NOTES
Scalable Static and Dynamic Community Detection Using Grappolo : NOTES
 
From Signal to Symbols
From Signal to SymbolsFrom Signal to Symbols
From Signal to Symbols
 
lec3_socialnetwork_part1.pptx
lec3_socialnetwork_part1.pptxlec3_socialnetwork_part1.pptx
lec3_socialnetwork_part1.pptx
 
Predicting Communication Intention in Social Media
Predicting Communication Intention in Social MediaPredicting Communication Intention in Social Media
Predicting Communication Intention in Social Media
 
Document 8 1.pdf
Document 8 1.pdfDocument 8 1.pdf
Document 8 1.pdf
 
Taxonomy and survey of community
Taxonomy and survey of communityTaxonomy and survey of community
Taxonomy and survey of community
 
Experimental categorization and deep visualization
 Experimental categorization and deep visualization Experimental categorization and deep visualization
Experimental categorization and deep visualization
 
Graph Analyses with Python and NetworkX
Graph Analyses with Python and NetworkXGraph Analyses with Python and NetworkX
Graph Analyses with Python and NetworkX
 
SocialCom09-tutorial.pdf
SocialCom09-tutorial.pdfSocialCom09-tutorial.pdf
SocialCom09-tutorial.pdf
 
User Identity Linkage: Data Collection, DataSet Biases, Method, Control and A...
User Identity Linkage: Data Collection, DataSet Biases, Method, Control and A...User Identity Linkage: Data Collection, DataSet Biases, Method, Control and A...
User Identity Linkage: Data Collection, DataSet Biases, Method, Control and A...
 
cs224w-79-final
cs224w-79-finalcs224w-79-final
cs224w-79-final
 
Social Network Analysis: What It Is, Why We Should Care, and What We Can Lear...
Social Network Analysis: What It Is, Why We Should Care, and What We Can Lear...Social Network Analysis: What It Is, Why We Should Care, and What We Can Lear...
Social Network Analysis: What It Is, Why We Should Care, and What We Can Lear...
 
Social Network Analysis (SNA) 2018
Social Network Analysis  (SNA) 2018Social Network Analysis  (SNA) 2018
Social Network Analysis (SNA) 2018
 
2011 IEEE Social Computing Nodexl: Group-In-A-Box
2011 IEEE Social Computing Nodexl: Group-In-A-Box2011 IEEE Social Computing Nodexl: Group-In-A-Box
2011 IEEE Social Computing Nodexl: Group-In-A-Box
 
SSRI_pt1.ppt
SSRI_pt1.pptSSRI_pt1.ppt
SSRI_pt1.ppt
 
COMMUNITY DETECTION IN THE COLLABORATIVE WEB
COMMUNITY DETECTION IN THE COLLABORATIVE WEBCOMMUNITY DETECTION IN THE COLLABORATIVE WEB
COMMUNITY DETECTION IN THE COLLABORATIVE WEB
 
A Dynamic Algorithm for Local Community Detection in Graphs : NOTES
A Dynamic Algorithm for Local Community Detection in Graphs : NOTESA Dynamic Algorithm for Local Community Detection in Graphs : NOTES
A Dynamic Algorithm for Local Community Detection in Graphs : NOTES
 

Recently uploaded

Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Cathrine Wilhelmsen
 
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...Biometric Authentication: The Evolution, Applications, Benefits and Challenge...
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...GQ Research
 
Identifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population MeanIdentifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population MeanMYRABACSAFRA2
 
Machine learning classification ppt.ppt
Machine learning classification  ppt.pptMachine learning classification  ppt.ppt
Machine learning classification ppt.pptamreenkhanum0307
 
DBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfDBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfJohn Sterrett
 
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptxNLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptxBoston Institute of Analytics
 
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)jennyeacort
 
Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Seán Kennedy
 
ASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel CanterASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel Cantervoginip
 
How we prevented account sharing with MFA
How we prevented account sharing with MFAHow we prevented account sharing with MFA
How we prevented account sharing with MFAAndrei Kaleshka
 
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...ssuserf63bd7
 
Heart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis ProjectHeart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis ProjectBoston Institute of Analytics
 
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一F sss
 
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degreeyuu sss
 
While-For-loop in python used in college
While-For-loop in python used in collegeWhile-For-loop in python used in college
While-For-loop in python used in collegessuser7a7cd61
 
Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Seán Kennedy
 
Call Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceCall Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceSapana Sha
 
Multiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdfMultiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdfchwongval
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdfHuman37
 

Recently uploaded (20)

Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)
 
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...Biometric Authentication: The Evolution, Applications, Benefits and Challenge...
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...
 
Identifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population MeanIdentifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population Mean
 
Machine learning classification ppt.ppt
Machine learning classification  ppt.pptMachine learning classification  ppt.ppt
Machine learning classification ppt.ppt
 
DBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfDBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdf
 
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptxNLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
 
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
 
Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...
 
ASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel CanterASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel Canter
 
How we prevented account sharing with MFA
How we prevented account sharing with MFAHow we prevented account sharing with MFA
How we prevented account sharing with MFA
 
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
 
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
 
Heart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis ProjectHeart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis Project
 
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
 
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
 
While-For-loop in python used in college
While-For-loop in python used in collegeWhile-For-loop in python used in college
While-For-loop in python used in college
 
Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...
 
Call Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceCall Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts Service
 
Multiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdfMultiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdf
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf
 

Action and content based Community Detection in Social Networks

  • 1. Action and Content Based Community Detection in Social Networks Prabhsimran Singh Baweja Prakhar Sharma Ritesh Modi Vaishali Pal Mentored By: Prateek Mehta
  • 2. Graph Graph is a collection of objects where some objects are connected by link. Mathematically – G = (V, E) V – Vertices E - Edges
  • 3. MotivationSocial network analysis focuses on mining hidden semantics in a setting involving interacting agents. Collaboration between them defines same kind of behavior. Our motivation is to use edge attribute weights along with the links to find communities. Entities are modeled as vertices, edges capture the relationship between them.
  • 4. Community A community is a collection objects/people sharing the same interests or having same characteristics. E.g., People liking Jazz music might belong to one community and the people like Folk might belong to the other.
  • 5. Community Detection Community detection is the task of extracting densely-knit groups within the network. Unsupervised learning problem, addressed using analysis of linkage, node attributes, etc. Given: G = (V, E) Output: C1, C2, ..., Ck , Ci ∩ Cj = φ , ∀i ≠ j , ∪ Ci = V 1≤i≤k Communities represent a coarse grained view of the network, can be mapped to the functional units of the network.
  • 6. Community Structures in Real World Internet U.S. Football Network Power Grid Network Books Network
  • 7. Modeling Communities as Graphs Notion of communities is often defined as a graph structure, G = (V,E), representing set of objects E, and their linkages V. Given a graph, a community is defined as collection of nodes that are more densely connected to each other than to the other nodes in the network.
  • 9. State-of-the-art Modularity Measure of denseness of connections between nodes of same module and sparse connections between different modules. Higher Modularity, well defined compact communities. Modularity Maximization A(i,j): Observed number of intra-community edges KiKj / 2m : expected no. of edges between i and j if placed randomly
  • 10. Modularity Maximization Efficient Solution – Louvain Algorithm Initially each node belongs to its own community We go through each node and assign them to its neighbours community as long as its leads to increase in modularity. This is followed until modularity cannot be maximized further Vincent D. Blondel, Jean-Loup Guillaume, Renaud Lambiotteand Etienne Lefebvre: Fast unfolding of communities in large networks
  • 11. • Edge Weights between community nodes are defined by the number of inter-community edges. • Folding ensures rapid decrease in the number of nodes that need to be examined and thus enables large-scale application of the method.
  • 12. Dataset Set of Flickr Images metadata Number of Images = 268649 Number of Authors (who posted at least 1 image) = 58522 Total numbers of Tags = 4932402 Number of Unique Groups = 203466 Number of Unique Galleries = 67859
  • 13. Dataset Description Graph |V| |E| Characteristics (each pair of author) Edges formed between authors who used same tag(s) 58522 1491950 • Cosine Similarity b/w tags used • Group Contribution and Popularity Distribution • Gallery Contribution and Popularity Distribution Edges formed between authors who share comments 589461 6012634 • Jaccard Similarity for comments shared • Group Contribution and Popularity Distribution • Gallery Contribution and Popularity Distribution
  • 14. Dataset Analysis • Tag Distribution • Average Number of Tags used by an Author per photo
  • 15. • Photo Distribution • Number of Authors versus number of photos posted by them • Logarithmic scale for Y-Axis values
  • 16. • Group Distribution • Number of Authors versus number of groups they posted photos in. • Logarithmic scale for Y-Axis values
  • 17. • Gallery Distribution • Number of Authors versus number of galleries they posted photos in. • Logarithmic scale for Y-Axis values
  • 19. Group Contribution P (A1, G1) = No. of Photos by A1 in G1 / Total photos in G1 P (A2, G1) = No. of Photos by A2 in G1 / Total photos in G1 Group Popularity P (A1, G1) = No. of Photos by A1 in G1 / Total photos of A1 Gallery Contribution P (A1, G1) = No. of Photos by A1 in G1 / Total photos in G1 Gallery Popularity P (A1, G1) = No. of Photos by A1 in G1 / Total photos of A1
  • 20. Results Graph Modularity Number of Communities Edges formed between authors who used same tag(s) (Weighted) (Cosine Similarity) 0.6432 80 Edges formed between authors who used same tag(s) (Weighted) (Jaccard Index) 0.5723 1904 Edges formed between authors who used same tag(s) (Unweighted) 0.6092 7 Edges formed between authors who share comments (Weighted) 0.4306 1092 Edges formed between authors who share comments (Unweighted) 0.3372 1087
  • 21. Conclusion As we can see, when we incorporate textual content i.e. hash tags and their context using Cosine Similarity, we see a good gain in modularity. Also, it results in more compact communities. In the second graph, edge weight is only on the basis of count of comments shared. The more the number of comments shared between them, the less the
  • 22. Future Work After successful identification of communities, we can also find most influential author for each communities. With this information, while someone is posting new images, he can use the tags used the author or even can mention the author. Since, the author is the most famous person, the image is likely to get more hits.