This document discusses applying computational analytics and social network analysis to SPOD data. It introduces concepts like centrality metrics, community detection algorithms, and dynamic graph analysis to study networks over time. Examples of social networks like friendships and email are provided. The document also discusses using the open-source Gephi tool to visualize networks and explore metrics. Overall, it explores how network analysis can provide insights from SPOD data to support tasks like identifying skilled users, encouraging participation, and understanding relationships between entities in the data.
2. ROUTE-TO-PA Prato Plenary Meeting
Overview
15/9/2015
Introduction
Some example
Some Metrics and Methods
Social and Data Network Analysis (SDNA) in SPOD
Discussion
Gephi: The Open Graph Viz Platform
Network over time
6. • Social network analysis and data mining:
– Detection of criminal activity, counter-terrorism, homeland security,
and intelligence
– Analysis of relationships within companies
– Sociological and anthropological studies
– Reciprocal trust schemes such as eBay ratings
– Recommended friends on Facebook
– Filter or recommend social media content
• – …
15/9/2015 ROUTE-TO-PA Prato Plenary Meeting
Applications of social networks
13. 15/9/2015
What can we learn by using these models?
ROUTE-TO-PA Prato Plenary Meeting
14. 15/9/2015 ROUTE-TO-PA Prato Plenary Meeting
• Can we mine SPOD data in order to:
• support other analysis (like Dialogue
Games)?
• encourage users participation?
• identify skilled and/or reliable users?
• …
• Idea: assuming that we are able to
model the data according to some
relationships between different
entities:
• we can exploit Network analysis
tools.
Network analysis in SPOD
16. • Assumption: important actors are
involved with others extensively.
• The problem is find out
automatically which are the central
actors.
• Metrics:
• Closeness Centrality
• Betweenness Centrality
• Eigenvalue Centrality
• Prestige
• PageRank Algorithm
• Hits Algorithm
• …
15/9/2015 ROUTE-TO-PA Prato Plenary Meeting
Centrality
17. • Methods:
o community detection is key to
understanding the structure of
networks.
• In SPOD it can be used to
• identify users with common interest (and
suggest friendship);
• identify related data;
• suggest the proper way of visualizing data.
15/9/2015 ROUTE-TO-PA Prato Plenary Meeting
Community detection
19. 15/9/2015 ROUTE-TO-PA Prato Plenary Meeting
Networks over time
• Does the leader of a discussion change over time?
• Is it true that participate actively in a discussion enhances users’ ability?
• Dynamic graph analysis (DGA) is a novel approach for Network analysis.
• DGA analyses how a network and its metrics evolve over time.
21. • It is easy to analyse a snapshot of a Network at time t (Gephi does it for you).
• The key point is to identify the Network structure and semantics
• Different goals require dedicated approaches and are based on different networks.
• For instance, if we are going to study interaction among users, we need a network
where:
• the users are the entities;
• the relations are "who communicate with whom”.
• Even in this simple case, we should also define some timing intervals in order to
evaluate whether the interactions change along the time.
15/9/2015 ROUTE-TO-PA Prato Plenary Meeting
Social and Data Network Analysis (SDNA) in SPOD
22. 15/9/2015 ROUTE-TO-PA Prato Plenary Meeting
Social and Data Network Analysis (SDNA) in SPOD
• Another approach could be
define a network that map
users and data as a Users-
Datalets Network.
24. • Gephi is an open-source
network analysis and
visualization software
package.
• Gephi offers the most
common metrics for
network analysis:
• Betweenness Centrality;
• Closeness;
• Diameter;
• Clustering Coefficient;
• PageRank;
• Community detection;
• …
15/9/2015 ROUTE-TO-PA Prato Plenary Meeting
Gephi: The Open Graph Viz Platform