SlideShare a Scribd company logo
1 of 43
Download to read offline
Social Network Analysis (SNA)
including a tutorial on concepts and methods
Social Media – Dr. Giorgos Cheliotis (gcheliotis@nus.edu.sg)
Communications and New Media, National University of Singapore
Background: Network Analysis
CNM Social Media Module – Giorgos Cheliotis (gcheliotis@nus.edu.sg)2
Newman et al, 2006
Newman et al, 2006
A very early example of network analysis
comes from the city of Königsberg (now
Kaliningrad). Famous mathematician Leonard
Euler used a graph to prove that there is no
path that crosses each of the city’s bridges
only once (Newman et al, 2006).
SNA has its origins in both social science and in the
broader fields of network analysis and graph theory
Network analysis concerns itself with the
formulation and solution of problems that have a
network structure; such structure is usually
captured in a graph (see the circled structure to the right)
Graph theory provides a set of abstract concepts
and methods for the analysis of graphs. These, in
combination with other analytical tools and with
methods developed specifically for the visualization
and analysis of social (and other) networks, form
the basis of what we call SNA methods.
But SNA is not just a methodology; it is a unique
perspective on how society functions. Instead of
focusing on individuals and their attributes, or on
macroscopic social structures, it centers on relations
between individuals, groups, or social institutions
This is an early depiction of what we call an
‘ego’ network, i.e. a personal network. The
graphic depicts varying tie strengths via
concentric circles (Wellman, 1998)
Background: Social Science
CNM Social Media Module – Giorgos Cheliotis (gcheliotis@nus.edu.sg)3
Wellman, 1998
Studying society from a network perspective is to
study individuals as embedded in a network of
relations and seek explanations for social behavior
in the structure of these networks rather than in
the individuals alone. This ‘network perspective’
becomes increasingly relevant in a society that
Manuel Castells has dubbed the network society.
SNA has a long history in social science, although
much of the work in advancing its methods has also
come from mathematicians, physicists, biologists
and computer scientists (because they too study
networks of different types)
The idea that networks of relations are important
in social science is not new, but widespread
availability of data and advances in computing and
methodology have made it much easier now to
apply SNA to a range of problems
More examples from social science
CNM Social Media Module – Giorgos Cheliotis (gcheliotis@nus.edu.sg)4
These visualizations depict the flow of communications in
an organization before and after the introduction of a
content management system (Garton et al, 1997)
A visualization of US bloggers shows clearly how they tend
to link predominantly to blogs supporting the same party,
forming two distinct clusters (Adamic and Glance, 2005)
In this example researchers collected a very large
amount of data on the links between web pages and
found out that the Web consists of a core of densely
inter-linked pages, while most other web pages either
link to or are linked to from that core. It was one of the
first such insights into very large scale human-generated
structures (Broder et al, 2000).
Background: Other Domains
CNM Social Media Module – Giorgos Cheliotis (gcheliotis@nus.edu.sg)5
Broder et al, 2000
(Social) Network Analysis has found
applications in many domains beyond social
science, although the greatest advances have
generally been in relation to the study of
structures generated by humans
Computer scientists for example have used
(and even developed new) network analysis
methods to study webpages, Internet traffic,
information dissemination, etc.
One example in life sciences is the use of
network analysis to study food chains in
different ecosystems
Mathematicians and (theoretical) physicists
usually focus on producing new and complex
methods for the analysis of networks, that can
be used by anyone, in any domain where
networks are relevant
Practical applications
CNM Social Media Module – Giorgos Cheliotis (gcheliotis@nus.edu.sg)6
Businesses use SNA to analyze and improve
communication flow in their organization, or with
their networks of partners and customers
Law enforcement agencies (and the army) use SNA
to identify criminal and terrorist networks from
traces of communication that they collect; and then
identify key players in these networks
Social Network Sites like Facebook use basic
elements of SNA to identify and recommend
potential friends based on friends-of-friends
Civil society organizations use SNA to uncover
conflicts of interest in hidden connections between
government bodies, lobbies and businesses
Network operators (telephony, cable, mobile) use
SNA-like methods to optimize the structure and
capacity of their networks
Why and when to use SNA
 Whenever you are studying a social network, either offline or online, or when
you wish to understand how to improve the effectiveness of the network
 When you want to visualize your data so as to uncover patterns in
relationships or interactions
 When you want to follow the paths that information (or basically anything)
follows in social networks
 When you do quantitative research, although for qualitative research a
network perspective is also valuable
(a) The range of actions and opportunities afforded to individuals are often a function of their
positions in social networks; uncovering these positions (instead of relying on common
assumptions based on their roles and functions, say as fathers, mothers, teachers,
workers) can yield more interesting and sometimes surprising results
(b) A quantitative analysis of a social network can help you identify different types of actors in
the network or key players, whom you can focus on for your qualitative research
 SNA is clearly also useful in analyzing SNS’s, OC’s and social media in general,
to test hypotheses on online behavior and CMC, to identify the causes for
dysfunctional communities or networks, and to promote social cohesion and
growth in an online community
CNM Social Media Module – Giorgos Cheliotis (gcheliotis@nus.edu.sg)7
Basic Concepts
 Networks
 Tie Strength
 Key Players
 Cohesion
CNM Social Media Module – Giorgos Cheliotis (gcheliotis@nus.edu.sg)8
How to represent various social networks
How to identify strong/weak ties in the network
How to identify key/central nodes in network
Measures of overall network structure
Representing relations as networks
CNM Social Media Module – Giorgos Cheliotis (gcheliotis@nus.edu.sg)9
1
2
3
4
1 2 3 4
Graph
AnneAnne
JimJim
MaryMary
JohnJohn
Vertex
(node) Edge (link)
Can we study their
interactions as a
network?
Communication
Anne: Jim, tell the Murrays they’re invited
Jim: Mary, you and your dad should come for dinner!
Jim: Mr. Murray, you should both come for dinner
Anne: Mary, did Jim tell you about the dinner? You must come.
John: Mary, are you hungry?
…
Entering data on a directed graph
CNM Social Media Module – Giorgos Cheliotis (gcheliotis@nus.edu.sg)10
1
2
3
4
Graph (directed)
Edge list
Adjacency matrix
Representing an undirected graph
CNM Social Media Module – Giorgos Cheliotis (gcheliotis@nus.edu.sg)11
1
2
3
4
Edge list remains the same
Adjacency matrix becomes symmetric
1
2
3
4
Directed
Undirected
(who knows whom)
(who contacts whom)
But interpretation
is different now
Ego networks and ‘whole’ networks
CNM Social Media Module – Giorgos Cheliotis (gcheliotis@nus.edu.sg)12
1
2
3
4
5
6
7
1
2
3
5
4
3’s ego network
1
2
54
3’s ego network
without ego**
‘whole’ network*
* no studied network is ‘whole’ in practice; it’s usually a partial picture of one’s real life networks (boundary specification problem)
** ego not needed for analysis as all alters are by definition connected to ego
ego
alter
isolate
Basic Concepts
Networks
 Tie Strength
Key Players
Cohesion
CNM Social Media Module – Giorgos Cheliotis (gcheliotis@nus.edu.sg)13
How to represent various social networks
How to identify strong/weak ties in the network
How to identify key/central nodes in network
Measures of overall network structure
Adding weights to edges (directed or undirected)
CNM Social Media Module – Giorgos Cheliotis (gcheliotis@nus.edu.sg)14
Edge list: add column of weights
Adjacency matrix: add weights instead of 1
Weights could be:
•Frequency of interaction in
period of observation
•Number of items
exchanged in period
•Individual perceptions of
strength of relationship
•Costs in communication or
exchange, e.g. distance
•Combinations of these
1
2
3
4
30
2
37
22
5
Edge weights as relationship strength
 Edges can represent interactions, flows of
information or goods,
similarities/affiliations, or social relations
 Specifically for social relations, a ‘proxy’
for the strength of a tie can be:
 the frequency of interaction (communication)
or the amount of flow (exchange)
 reciprocity in interaction or flow
 the type of interaction or flow between the
two parties (e.g., intimate or not)
 other attributes of the nodes or ties (e.g., kin
relationships)
 The structure of the nodes’ neighborhood (e.g.
many mutual ‘friends’)
 Surveys and interviews allows us to
establish the existence of mutual or one-
sided strength/affection with greater
certainty, but proxies above are also useful
CNM Social Media Module – Giorgos Cheliotis (gcheliotis@nus.edu.sg)15
Homophily, transitivity, and bridging
 Homophily is the tendency to relate to people with
similar characteristics (status, beliefs, etc.)
 It leads to the formation of homogeneous groups
(clusters) where forming relations is easier
 Extreme homogenization can act counter to
innovation and idea generation (heterophily is thus
desirable in some contexts)
 Homophilous ties can be strong or weak
 Transitivity in SNA is a property of ties: if there is a
tie between A and B and one between B and C,
then in a transitive network A and C will also be
connected
 Strong ties are more often transitive than weak ties;
transitivity is therefore evidence for the existence of
strong ties (but not a necessary or sufficient condition)
 Transitivity and homophily together lead to the
formation of cliques (fully connected clusters)
 Bridges are nodes and edges that connect across
groups
 Facilitate inter-group communication, increase social
cohesion, and help spur innovation
 They are usually weak ties, but not every weak tie is a
bridge
CNM Social Media Module – Giorgos Cheliotis (gcheliotis@nus.edu.sg)16
Homophily
Strong Weak
Transitivity Bridging
Interlinked
groups
Heterophily
Cliques
Social
network
TIES
CLUSTERING
Basic Concepts
Networks
Tie Strength
 Key Players
Cohesion
CNM Social Media Module – Giorgos Cheliotis (gcheliotis@nus.edu.sg)17
How to represent various social networks
How to identify strong/weak ties in the network
How to identify key/central nodes in network
Measures of overall network structure
Note on computational examples
 In the examples that follow values were calculated with the sna
and igraph packages for the R programming environment, which is
widely used by specialists in the field (but is not the most user-
friendly)
 Results may vary across different software packages (e.g. when you
use NodeXL or UCINET), mainly because SNA metrics can take
various roughly equivalent forms
 Consult the documentation of the software you are using when in
doubt
 In most cases, even if there are some differences in the output of
your preferred software when compared to these notes, results
will be qualitatively the same and thus interpretation will also be
the same – but you have been warned!
CNM Social Media Module – Giorgos Cheliotis (gcheliotis@nus.edu.sg)18
Degree centrality
 A node’s (in-) or (out-)degree is the
number of links that lead into or
out of the node
 In an undirected graph they are of
course identical
 Often used as measure of a node’s
degree of connectedness and hence
also influence and/or popularity
 Useful in assessing which nodes are
central with respect to spreading
information and influencing others
in their immediate ‘neighborhood’
CNM Social Media Module – Giorgos Cheliotis (gcheliotis@nus.edu.sg)19
1
2
3
4
5
6
7
2
3
4
1
4
1
1
Nodes 3 and 5 have the highest degree (4)
Values computed with the sna package in the R programming environment. Definitions of centrality measures may vary slightly in other software.
Hypothetical graph
Paths and shortest paths
CNM Social Media Module – Giorgos Cheliotis (gcheliotis@nus.edu.sg)20
1
2
3
4
 A path between two nodes is any
sequence of non-repeating nodes that
connects the two nodes
 The shortest path between two nodes is
the path that connects the two nodes
with the shortest number of edges (also
called the distance between the nodes)
 In the example to the right, between
nodes 1 and 4 there are two shortest
paths of length 2: {1,2,4} and {1,3,4}
 Other, longer paths between the two
nodes are {1,2,3,4}, {1,3,2,4}, {1,2,5,3,4}
and {1,3,5,2,4} (the longest paths)
 Shorter paths are desirable when speed
of communication or exchange is
desired (often the case in many studies, but
sometimes not, e.g. in networks that spread
disease)
Shortest path(s)
5
Hypothetical graph
Betweenness centrality
 For a given node v, calculate the
number of shortest paths between
nodes i and j that pass through v, and
divide by all shortest paths between
nodes i and j
 Sum the above values for all node
pairs i,j
 Sometimes normalized such that the
highest value is 1or that the sum of all
betweenness centralities in the
network is 1
 Shows which nodes are more likely to
be in communication paths between
other nodes
 Also useful in determining points
where the network would break apart
(think who would be cut off if nodes 3
or 5 would disappear)
CNM Social Media Module – Giorgos Cheliotis (gcheliotis@nus.edu.sg)21
1
2
3
4
5
6
7
0
1.5
6.5
0
9
0
0
Node 5 has higher betweenness centrality than 3
Values computed with the sna package in the R programming environment. Definitions of centrality measures may vary slightly in other software.
Closeness centrality
 Calculate the mean length of all
shortest paths from a node to all
other nodes in the network (i.e.
how many hops on average it takes
to reach every other node)
 Take the reciprocal of the above
value so that higher values are
‘better’ (indicate higher closeness)
like in other measures of centrality
 It is a measure of reach, i.e. the
speed with which information can
reach other nodes from a given
starting node
CNM Social Media Module – Giorgos Cheliotis (gcheliotis@nus.edu.sg)22
1
2
3
4
5
6
7
0.5
0.67
0.75
0.46
0.75
0.46
0.46
Nodes 3 and 5 have the highest (i.e. best)
closeness, while node 2 fares almost as well Note: Sometimes closeness is calculated without taking the reciprocal of the
mean shortest path length. Then lower values are ‘better’.
Values computed with the sna package in the R programming environment. Definitions of centrality measures may vary slightly in other software.
Eigenvector centrality
 A node’s eigenvector centrality is
proportional to the sum of the
eigenvector centralities of all nodes
directly connected to it
 In other words, a node with a high
eigenvector centrality is connected to
other nodes with high eigenvector
centrality
 This is similar to how Google ranks
web pages: links from highly linked-to
pages count more
 Useful in determining who is
connected to the most connected
nodes
CNM Social Media Module – Giorgos Cheliotis (gcheliotis@nus.edu.sg)23
1
2
3
4
5
6
7
0.36
0.49
0.54
0.19
0.49
0.17
0.17
Node 3 has the highest eigenvector centrality,
closely followed by 2 and 5
Note: The term ‘eigenvector’ comes from mathematics (matrix algebra),
but it is not necessary for understanding how to interpret this measure
Values computed with the sna package in the R programming environment. Definitions of centrality measures may vary slightly in other software.
Interpretation of measures (1)
CNM Social Media Module – Giorgos Cheliotis (gcheliotis@nus.edu.sg)24
 Degree
 Betweenness
 Closeness
 Eigenvector
How many people can this person reach directly?
How likely is this person to be the most direct route
between two people in the network?
How fast can this person reach everyone in the
network?
How well is this person connected to other well-
connected people?
Centrality measure Interpretation in social networks
Interpretation of measures (2)
CNM Social Media Module – Giorgos Cheliotis (gcheliotis@nus.edu.sg)25
 Degree
 Betweenness
 Closeness
 Eigenvector
In network of music collaborations: how many
people has this person collaborated with?
In network of spies: who is the spy though whom most
of the confidential information is likely to flow?
In network of sexual relations: how fast will an STD
spread from this person to the rest of the network?
In network of paper citations: who is the author that is
most cited by other well-cited authors?
Centrality measure Other possible interpretations…
Identifying sets of key players
 In the network to the right, node 10
is the most central according to
degree centrality
 But nodes 3 and 5 together will reach
more nodes
 Moreover the tie between them is
critical; if severed, the network will
break into two isolated sub-networks
 It follows that other things being
equal, players 3 and 5 together are
more ‘key’ to this network than 10
 Thinking about sets of key players is
helpful!
CNM Social Media Module – Giorgos Cheliotis (gcheliotis@nus.edu.sg)26
1
4
6
7
8
9
10
0
3 5
2
Basic Concepts
Networks
Tie Strength
Key Players
 Cohesion
CNM Social Media Module – Giorgos Cheliotis (gcheliotis@nus.edu.sg)27
How to represent various social networks
How to identify strong/weak ties in the network
How to identify key/central nodes in network
How to characterize a network’s structure
Reciprocity (degree of)
CNM Social Media Module – Giorgos Cheliotis (gcheliotis@nus.edu.sg)28
1 2
3 4
 The ratio of the number of relations
which are reciprocated (i.e. there is an
edge in both directions) over the total
number of relations in the network
 …where two vertices are said to be
related if there is at least one edge
between them
 In the example to the right this would be
2/5=0.4 (whether this is considered high
or low depends on the context)
 A useful indicator of the degree of
mutuality and reciprocal exchange in a
network, which relate to social cohesion
 Only makes sense in directed graphs
Reciprocity for network = 0.4
Density
CNM Social Media Module – Giorgos Cheliotis (gcheliotis@nus.edu.sg)29
1
2
3
4
 A network’s density is the ratio of the number of
edges in the network over the total number of
possible edges between all pairs of nodes (which is
n(n-1)/2, where n is the number of vertices, for an
undirected graph)
 In the example network to the right
density=5/6=0.83 (i.e. it is a fairly dense network;
opposite would be a sparse network)
 It is a common measure of how well connected a
network is (in other words, how closely knit it is) –
a perfectly connected network is called a clique and
has density=1
 A directed graph will have half the density of its
undirected equivalent, because there are twice as
many possible edges, i.e. n(n-1)
 Density is useful in comparing networks against each
other, or in doing the same for different regions
within a single network
1
2
3
4
density = 5/6 = 0.83
density = 5/12 = 0.42
Edge present in network
Possible but not present
Clustering
CNM Social Media Module – Giorgos Cheliotis (gcheliotis@nus.edu.sg)30
1
2
3
4
5
6
7
1
0.67
0.33
N/a
0.17
N/a
N/a
 A node’s clustering coefficient is the number of
closed triplets in the node’s neighborhood
over the total number of triplets in the
neighborhood. It is also known as transitivity.
 E.g., node 1 to the right has a value of 1
because it is only connected to 2 and 3, and
these nodes are also connected to one
another (i.e. the only triplet in the
neighborhood of 1 is closed). We say that
nodes 1,2, and 3 form a clique.
 Clustering algorithms identify clusters or
‘communities’ within networks based on
network structure and specific clustering
criteria (example shown to the right with two
clusters is based on edge betweenness, an
equivalent for edges of the betweenness
centrality presented earlier for nodes) Network clustering coefficient = 0.375
(3 nodes in each triangle x 2 triangles = 6 closed triplets divided by 16 total)
Cluster A
Cluster B
Values computed with the igraph package in the R programming environment. Definitions of centrality measures may vary slightly in other software.
Average and longest distance
CNM Social Media Module – Giorgos Cheliotis (gcheliotis@nus.edu.sg)31
1
2
3
4
5
6
7
 The longest shortest path (distance)
between any two nodes in a
network is called the network’s
diameter
 The diameter of the network on
the right is 3; it is a useful measure
of the reach of the network (as
opposed to looking only at the total
number of vertices or edges)
 It also indicates how long it will
take at most to reach any node in
the network (sparser networks will
generally have greater diameters)
 The average of all shortest paths in
a network is also interesting
because it indicates how far apart
any two nodes will be on average
(average distance)
diameter
Small Worlds
CNM Social Media Module – Giorgos Cheliotis (gcheliotis@nus.edu.sg)32
 A small world is a network that
looks almost random but exhibits a
significantly high clustering coefficient
(nodes tend to cluster locally) and a
relatively short average path length
(nodes can be reached in a few
steps)
 It is a very common structure in
social networks because of
transitivity in strong social ties and
the ability of weak ties to reach
across clusters (see also next page…)
 Such a network will have many
clusters but also many bridges
between clusters that help shorten
the average distance between nodes
local cluster
bridge
You may have heard of the
famous “6 degrees” of
separation
Sketch of small world
structure
Preferential Attachment
CNM Social Media Module – Giorgos Cheliotis (gcheliotis@nus.edu.sg)33
A property of some networks, where, during their evolution and growth in
time, a the great majority of new edges are to nodes with an already high
degree; the degree of these nodes thus increases disproportionately,
compared to most other nodes in the network
 The result is a network with
few very highly connected
nodes and many nodes with a
low degree
 Such networks are said to
exhibit a long-tailed degree
distribution
 And they tend to have a small-
world structure!
(so, as it turns out, transitivity and
strong/weak tie characteristics are not
necessary to explain small world structures,
but they are common and can also lead to
such structures)
nodes ordered in descending degree
degree
short
head
long tail
Example of network with
preferential attachment
Sketch of long-tailed
degree distribution
Reasons for preferential attachment
CNM Social Media Module – Giorgos Cheliotis (gcheliotis@nus.edu.sg)34
We want to be
associated with popular
people, ideas, items, thus
further increasing their
popularity, irrespective of
any objective, measurable
characteristics
We evaluate people and
everything else based on
objective quality criteria,
so higher quality nodes
will naturally attract more
attention, faster
Among nodes of similar
attributes, those that
reach critical mass first
will become ‘stars’ with
many friends and
followers (‘halo effect’)
Popularity Quality Mixed model
May be impossible to
predict who will become a
star, even if quality matters
Also known as
‘the good get better’
Also known as
‘the rich get richer’
Core-Periphery Structures
 A useful and relatively simple metric of the degree
to which a social network is centralized or
decentralized, is the centralization measure
(usually normalized such that it takes values between 0 and 1)
 It is based on calculating the differences in degrees
between nodes; a network that greatly depends on 1-2
highly connected nodes (as a result for example of
preferential attachment) will exhibit greater differences in
degree centrality between nodes
 Centralized structures can perform better at some tasks
(like team-based problem-solving requiring coordination),
but are more prone to failure if key players disconnect
 In addition to centralization, many large groups and
online communities have a core of densely
connected users that are critical for connecting a
much larger periphery
 Cores can be identified visually, or by examining the
location of high-degree nodes and their joint degree
distributions (do high-degree nodes tend to connect to
other high-degree nodes?)
 Bow-tie analysis, famously used to analyze the structure of
the Web, can also be used to distinguish between the core
and other, more peripheral elements in a network (see
earlier example here)
CNM Social Media Module – Giorgos Cheliotis (gcheliotis@nus.edu.sg)35
More peripheral clusters
and other structures
Nodes
in core
Thoughts on Design
CNM Social Media Module – Giorgos Cheliotis (gcheliotis@nus.edu.sg)36
How can an online social media platform (and its
administrators) leverage the methods and insights of social
network analysis?
How can it encourage a network perspective among its
users, such that they are aware of their ‘neighborhood’
and can learn how to work with it and/or expand it?
What measures can an online community take to optimize its
network structure?
Example: cliques can be undesirable because they shun newcomers
SNA inspired some of the
first SNS’s (e.g. SixDegrees),
but still not used so often in
conjunction with design
decisions – much untapped
potential here
How can online communities identify and utilize key players
for the benefit of the community?
What would be desirable structures for different types of
online platforms? (not easy to answer)
Analyzing your own ego-network
CNM Social Media Module – Giorgos Cheliotis (gcheliotis@nus.edu.sg)37
• Use the steps outlined in the following pages
to visualize and analyze your own network
• Think about the key players in your network,
the types of ties that you maintain with them,
identify any clusters or communities within
your network, etc.
• Objective: practice SNA with real data!
• Present your findings in class next week!
Visualizing Facebook ego-network online
 Launch the TouchGraph Facebook Browser
 You should see a visualization of your network like
the one to the right
 Make sure to set
to a value that will allow you to see your entire
network (friends ranked according to highest
betweeness centrality according to TouchGraph Help)
 Go to “Advanced” and remove all filters on the data
so that settings look like below:
 Note: not possible to export your data for further analysis
 You may also want to try TouchGraph Google Browser (it’s fun!)
CNM Social Media Module – Giorgos Cheliotis (gcheliotis@nus.edu.sg)38
Example TouchGraph Facebook Layout
Navigate the graph, examine friend
‘ranks’, friend positions in the network,
clusters and what they have in
common, try to identify weak and
strong ties of yours and assess overall
structure of your ego-network
Exporting data for offline analysis
 Data is more useful when you can extract it from an online platform and analyze
with a variety of more powerful tools
 Facebook, Twitter and other platforms have public Application Programming Interfaces
(API’s) which allow computer programs to extract data among other things
 Web crawlers can also be used to read and extract the data directly from the web pages
which contain them
 Doing either of the above on your own will usually require some programming skills
 Thankfully, if you have no such experience, you can use free tools built by others :)
 Bernie Hogan (Oxford Internet Institute) has developed a Facebook application
that extracts a list of all edges in your ego-network (see instructions on next page)
 Also, NodeXL (Windows only, see later slide) currently imports data from: Twitter,
YouTube, Flickr, and your email client!
 Let’s start with installing and learning to use NodeXL…
CNM Social Media Module – Giorgos Cheliotis (gcheliotis@nus.edu.sg)39
Using NodeXL for visualization & analysis
 Download and install NodeXL
Windows 7/Vista/XP, requires Excel 2007
(installation may take a while if additional software is
needed for NodeXL to work)
CNM Social Media Module – Giorgos Cheliotis (gcheliotis@nus.edu.sg)40
NodeXL sample screenshot
 Launch Excel and select
New -> My Templates ->NodeXLGraph.xltx
 Go to “Import” and select the appropriate
option for the data you wish to import (for
Facebook import see next slide first!)
NM4881A Tweep Network (weekly data)
 Click on to ask NodeXL to
compute centralities, network density,
clustering coefficients, etc.
 Select
to display network graph. You can customize
this using and as well as For more info read this
NodeXL tutorial
 Launch Excel and open the file that you just saved to your computer.
1. Excel will launch the Text Import Wizard. Select “Delimited”. Click “Next >”
2. Select “Space” as the delimiter, as shown here
3. Select “Next >” and then “Finish”.
4. A new file will be created in Excel. It contains a list of all the nodes in your ego-network, followed
by a list of all edges. Scroll down until you find the edges, select all of them and copy them (Ctrl-C)
5. You can now open a new NodeXL file in Excel as explained in the previous slide. Instead of using
NodeXL’s import function, paste the list of edges to the NodeXL worksheet, right here
6. In NodeXL select and “Get Vertices from Edge List”
7. Now you can compute graph metrics and visualize your data like explained in the previous slide!
Exporting Facebook ego-network data
CNM Social Media Module – Giorgos Cheliotis (gcheliotis@nus.edu.sg)41
This will explain how to export your Facebook data for analysis with a tool like NodeXL
 To use Bernie Hogan’s tool on Facebook, click here. From the two options presented,
select “UCInet”. This is a format specific to another tool, not NodeXL, but we will
import this data into NodeXL because it’s easier to use.
 After selecting “UCInet”, right-click on the link given to you and select to save the
generated file to a folder on your computer.
More options
 Many more tools are used for SNA, although they generally require more
expert knowledge. Some of these are:
 Pajek (Windows, free)
 UCInet (Windows, shareware)
 Netdraw (Windows, free)
 Mage (Windows, free)
 GUESS (all platforms, free and open source)
 R packages for SNA (all platforms, free and open source)
 Gephi (all platforms, free and open source)
 The field is continuously growing, so we can expect to see more user-
friendly applications coming out in the next years…
CNM Social Media Module – Giorgos Cheliotis (gcheliotis@nus.edu.sg)42
Credits and licensing
 Front page network graph by ilamont (license: CC BY)
 Bridge routing illustrations in Newman et al, The Structure and Dynamics of Networks, Princeton University
Press
 Personal social network diagram in Mark Wellman (ed.), Networks in the Global Village, Westview Press
 Visualization of interactions in organization in Garton et al, Studying Online Social Networks, JCMC
 Visualization of US political bloggers in Lada Adamic and Natalie Glance, The Political Blogosphere and the
2004 US election: Divided They Blog, Proceedings of the International Conference on Knowledge Discovery
and Data Mining, ACM Press, 2005
 Web bow tie diagram by Broder et al, Graph Structure in the Web, Computer Networks, Elsevier
 Bond/tie photo by ChrisK4u (license: CC BY-ND)
 Visualization of small world network by AJC1 (license: CC BY-NC)
43 CNM Social Media Module – Giorgos Cheliotis (gcheliotis@nus.edu.sg)
Original content in this presentation is licensed under the Creative Commons
Singapore Attribution 3.0 license unless stated otherwise (see above)
Original content in this presentation is licensed under the Creative Commons
Singapore Attribution 3.0 license unless stated otherwise (see above)

More Related Content

What's hot

Mining and analyzing social media part 2 - hicss47 tutorial - dave king
Mining and analyzing social media   part 2 - hicss47 tutorial - dave kingMining and analyzing social media   part 2 - hicss47 tutorial - dave king
Mining and analyzing social media part 2 - hicss47 tutorial - dave kingDave King
 
Subscriber Churn Prediction Model using Social Network Analysis In Telecommun...
Subscriber Churn Prediction Model using Social Network Analysis In Telecommun...Subscriber Churn Prediction Model using Social Network Analysis In Telecommun...
Subscriber Churn Prediction Model using Social Network Analysis In Telecommun...BAINIDA
 
Data mining based social network
Data mining based social networkData mining based social network
Data mining based social networkFiras Husseini
 
LAK13 Tutorial Social Network Analysis 4 Learning Analytics
LAK13 Tutorial Social Network Analysis 4 Learning AnalyticsLAK13 Tutorial Social Network Analysis 4 Learning Analytics
LAK13 Tutorial Social Network Analysis 4 Learning Analyticsgoehnert
 
A Guide to Social Network Analysis
A Guide to Social Network AnalysisA Guide to Social Network Analysis
A Guide to Social Network AnalysisOlivier Serrat
 
Social Network, Metrics and Computational Problem
Social Network, Metrics and Computational ProblemSocial Network, Metrics and Computational Problem
Social Network, Metrics and Computational ProblemAndry Alamsyah
 
Incremental Community Mining in Location-based Social Network
Incremental Community Mining in Location-based Social NetworkIncremental Community Mining in Location-based Social Network
Incremental Community Mining in Location-based Social NetworkIJAEMSJORNAL
 
Social Network Analysis for Competitive Intelligence
Social Network Analysis for Competitive IntelligenceSocial Network Analysis for Competitive Intelligence
Social Network Analysis for Competitive IntelligenceAugust Jackson
 
Network Theory
Network TheoryNetwork Theory
Network TheorySon Maroon
 
10 More than a Pretty Picture: Visual Thinking in Network Studies
10 More than a Pretty Picture: Visual Thinking in Network Studies10 More than a Pretty Picture: Visual Thinking in Network Studies
10 More than a Pretty Picture: Visual Thinking in Network Studiesdnac
 
02 Network Data Collection
02 Network Data Collection02 Network Data Collection
02 Network Data Collectiondnac
 
01 Introduction to Networks Methods and Measures
01 Introduction to Networks Methods and Measures01 Introduction to Networks Methods and Measures
01 Introduction to Networks Methods and Measuresdnac
 

What's hot (20)

03 Communities in Networks (2017)
03 Communities in Networks (2017)03 Communities in Networks (2017)
03 Communities in Networks (2017)
 
Social Network Analysis
Social Network AnalysisSocial Network Analysis
Social Network Analysis
 
01 Network Data Collection (2017)
01 Network Data Collection (2017)01 Network Data Collection (2017)
01 Network Data Collection (2017)
 
02 Descriptive Statistics (2017)
02 Descriptive Statistics (2017)02 Descriptive Statistics (2017)
02 Descriptive Statistics (2017)
 
Mining and analyzing social media part 2 - hicss47 tutorial - dave king
Mining and analyzing social media   part 2 - hicss47 tutorial - dave kingMining and analyzing social media   part 2 - hicss47 tutorial - dave king
Mining and analyzing social media part 2 - hicss47 tutorial - dave king
 
CSE509 Lecture 6
CSE509 Lecture 6CSE509 Lecture 6
CSE509 Lecture 6
 
05 Network Canvas (2017)
05 Network Canvas (2017)05 Network Canvas (2017)
05 Network Canvas (2017)
 
Subscriber Churn Prediction Model using Social Network Analysis In Telecommun...
Subscriber Churn Prediction Model using Social Network Analysis In Telecommun...Subscriber Churn Prediction Model using Social Network Analysis In Telecommun...
Subscriber Churn Prediction Model using Social Network Analysis In Telecommun...
 
11 Keynote (2017)
11 Keynote (2017)11 Keynote (2017)
11 Keynote (2017)
 
Data mining based social network
Data mining based social networkData mining based social network
Data mining based social network
 
LAK13 Tutorial Social Network Analysis 4 Learning Analytics
LAK13 Tutorial Social Network Analysis 4 Learning AnalyticsLAK13 Tutorial Social Network Analysis 4 Learning Analytics
LAK13 Tutorial Social Network Analysis 4 Learning Analytics
 
A Guide to Social Network Analysis
A Guide to Social Network AnalysisA Guide to Social Network Analysis
A Guide to Social Network Analysis
 
Social Network, Metrics and Computational Problem
Social Network, Metrics and Computational ProblemSocial Network, Metrics and Computational Problem
Social Network, Metrics and Computational Problem
 
Incremental Community Mining in Location-based Social Network
Incremental Community Mining in Location-based Social NetworkIncremental Community Mining in Location-based Social Network
Incremental Community Mining in Location-based Social Network
 
Social Network Analysis for Competitive Intelligence
Social Network Analysis for Competitive IntelligenceSocial Network Analysis for Competitive Intelligence
Social Network Analysis for Competitive Intelligence
 
Social Network Analysis (SNA)
Social Network Analysis (SNA)Social Network Analysis (SNA)
Social Network Analysis (SNA)
 
Network Theory
Network TheoryNetwork Theory
Network Theory
 
10 More than a Pretty Picture: Visual Thinking in Network Studies
10 More than a Pretty Picture: Visual Thinking in Network Studies10 More than a Pretty Picture: Visual Thinking in Network Studies
10 More than a Pretty Picture: Visual Thinking in Network Studies
 
02 Network Data Collection
02 Network Data Collection02 Network Data Collection
02 Network Data Collection
 
01 Introduction to Networks Methods and Measures
01 Introduction to Networks Methods and Measures01 Introduction to Networks Methods and Measures
01 Introduction to Networks Methods and Measures
 

Similar to Socialnetworkanalysis 100225055227-phpapp02

01 Introduction to Networks Methods and Measures (2016)
01 Introduction to Networks Methods and Measures (2016)01 Introduction to Networks Methods and Measures (2016)
01 Introduction to Networks Methods and Measures (2016)Duke Network Analysis Center
 
The Mathematics of Social Network Analysis: Metrics for Academic Social Networks
The Mathematics of Social Network Analysis: Metrics for Academic Social NetworksThe Mathematics of Social Network Analysis: Metrics for Academic Social Networks
The Mathematics of Social Network Analysis: Metrics for Academic Social NetworksEditor IJCATR
 
Visually Exploring Social Participation in Encyclopedia of Life
Visually Exploring Social Participation in Encyclopedia of LifeVisually Exploring Social Participation in Encyclopedia of Life
Visually Exploring Social Participation in Encyclopedia of LifeHarish Vaidyanathan
 
Improving Knowledge Handling by building intellegent social systems
Improving Knowledge Handling by building intellegent social systemsImproving Knowledge Handling by building intellegent social systems
Improving Knowledge Handling by building intellegent social systemsnazeeh
 
2010 Catalyst Conference - Trends in Social Network Analysis
2010 Catalyst Conference - Trends in Social Network Analysis2010 Catalyst Conference - Trends in Social Network Analysis
2010 Catalyst Conference - Trends in Social Network AnalysisMarc Smith
 
Current trends of opinion mining and sentiment analysis in social networks
Current trends of opinion mining and sentiment analysis in social networksCurrent trends of opinion mining and sentiment analysis in social networks
Current trends of opinion mining and sentiment analysis in social networkseSAT Publishing House
 
Tepl webinar 20032013
Tepl webinar   20032013Tepl webinar   20032013
Tepl webinar 20032013Nina Pataraia
 
02 Introduction to Social Networks and Health: Key Concepts and Overview
02 Introduction to Social Networks and Health: Key Concepts and Overview02 Introduction to Social Networks and Health: Key Concepts and Overview
02 Introduction to Social Networks and Health: Key Concepts and OverviewDuke Network Analysis Center
 
a modified weight balanced algorithm for influential users community detectio...
a modified weight balanced algorithm for influential users community detectio...a modified weight balanced algorithm for influential users community detectio...
a modified weight balanced algorithm for influential users community detectio...INFOGAIN PUBLICATION
 
Sna based reasoning for multiagent
Sna based reasoning for multiagentSna based reasoning for multiagent
Sna based reasoning for multiagentijaia
 
Community detection in social networks an overview
Community detection in social networks an overviewCommunity detection in social networks an overview
Community detection in social networks an overvieweSAT Publishing House
 
Multimode network based efficient and scalable learning of collective behavior
Multimode network based efficient and scalable learning of collective behaviorMultimode network based efficient and scalable learning of collective behavior
Multimode network based efficient and scalable learning of collective behaviorIAEME Publication
 
00 Introduction to SN&H: Key Concepts and Overview
00 Introduction to SN&H: Key Concepts and Overview00 Introduction to SN&H: Key Concepts and Overview
00 Introduction to SN&H: Key Concepts and OverviewDuke Network Analysis Center
 
The International Journal of Engineering and Science (IJES)
The International Journal of Engineering and Science (IJES)The International Journal of Engineering and Science (IJES)
The International Journal of Engineering and Science (IJES)theijes
 
An updated look at social network extraction system a personal data analysis ...
An updated look at social network extraction system a personal data analysis ...An updated look at social network extraction system a personal data analysis ...
An updated look at social network extraction system a personal data analysis ...eSAT Publishing House
 
[DSC Adria 23] Marija Mitrovic Dankulov Complex networks and data science eff...
[DSC Adria 23] Marija Mitrovic Dankulov Complex networks and data science eff...[DSC Adria 23] Marija Mitrovic Dankulov Complex networks and data science eff...
[DSC Adria 23] Marija Mitrovic Dankulov Complex networks and data science eff...DataScienceConferenc1
 
Analysis Of A Learning Community As A Social Network
Analysis Of A Learning Community As A Social NetworkAnalysis Of A Learning Community As A Social Network
Analysis Of A Learning Community As A Social NetworkMaria Perkins
 
Network Analysis in the Social Sciences
Network Analysis in the Social SciencesNetwork Analysis in the Social Sciences
Network Analysis in the Social SciencesConstantinos Bletsos
 

Similar to Socialnetworkanalysis 100225055227-phpapp02 (20)

01 Introduction to Networks Methods and Measures (2016)
01 Introduction to Networks Methods and Measures (2016)01 Introduction to Networks Methods and Measures (2016)
01 Introduction to Networks Methods and Measures (2016)
 
The Mathematics of Social Network Analysis: Metrics for Academic Social Networks
The Mathematics of Social Network Analysis: Metrics for Academic Social NetworksThe Mathematics of Social Network Analysis: Metrics for Academic Social Networks
The Mathematics of Social Network Analysis: Metrics for Academic Social Networks
 
Visually Exploring Social Participation in Encyclopedia of Life
Visually Exploring Social Participation in Encyclopedia of LifeVisually Exploring Social Participation in Encyclopedia of Life
Visually Exploring Social Participation in Encyclopedia of Life
 
Improving Knowledge Handling by building intellegent social systems
Improving Knowledge Handling by building intellegent social systemsImproving Knowledge Handling by building intellegent social systems
Improving Knowledge Handling by building intellegent social systems
 
2010 Catalyst Conference - Trends in Social Network Analysis
2010 Catalyst Conference - Trends in Social Network Analysis2010 Catalyst Conference - Trends in Social Network Analysis
2010 Catalyst Conference - Trends in Social Network Analysis
 
Current trends of opinion mining and sentiment analysis in social networks
Current trends of opinion mining and sentiment analysis in social networksCurrent trends of opinion mining and sentiment analysis in social networks
Current trends of opinion mining and sentiment analysis in social networks
 
Tepl webinar 20032013
Tepl webinar   20032013Tepl webinar   20032013
Tepl webinar 20032013
 
02 Introduction to Social Networks and Health: Key Concepts and Overview
02 Introduction to Social Networks and Health: Key Concepts and Overview02 Introduction to Social Networks and Health: Key Concepts and Overview
02 Introduction to Social Networks and Health: Key Concepts and Overview
 
a modified weight balanced algorithm for influential users community detectio...
a modified weight balanced algorithm for influential users community detectio...a modified weight balanced algorithm for influential users community detectio...
a modified weight balanced algorithm for influential users community detectio...
 
Sna based reasoning for multiagent
Sna based reasoning for multiagentSna based reasoning for multiagent
Sna based reasoning for multiagent
 
Community detection in social networks an overview
Community detection in social networks an overviewCommunity detection in social networks an overview
Community detection in social networks an overview
 
Multimode network based efficient and scalable learning of collective behavior
Multimode network based efficient and scalable learning of collective behaviorMultimode network based efficient and scalable learning of collective behavior
Multimode network based efficient and scalable learning of collective behavior
 
Internet
InternetInternet
Internet
 
Internet
InternetInternet
Internet
 
00 Introduction to SN&H: Key Concepts and Overview
00 Introduction to SN&H: Key Concepts and Overview00 Introduction to SN&H: Key Concepts and Overview
00 Introduction to SN&H: Key Concepts and Overview
 
The International Journal of Engineering and Science (IJES)
The International Journal of Engineering and Science (IJES)The International Journal of Engineering and Science (IJES)
The International Journal of Engineering and Science (IJES)
 
An updated look at social network extraction system a personal data analysis ...
An updated look at social network extraction system a personal data analysis ...An updated look at social network extraction system a personal data analysis ...
An updated look at social network extraction system a personal data analysis ...
 
[DSC Adria 23] Marija Mitrovic Dankulov Complex networks and data science eff...
[DSC Adria 23] Marija Mitrovic Dankulov Complex networks and data science eff...[DSC Adria 23] Marija Mitrovic Dankulov Complex networks and data science eff...
[DSC Adria 23] Marija Mitrovic Dankulov Complex networks and data science eff...
 
Analysis Of A Learning Community As A Social Network
Analysis Of A Learning Community As A Social NetworkAnalysis Of A Learning Community As A Social Network
Analysis Of A Learning Community As A Social Network
 
Network Analysis in the Social Sciences
Network Analysis in the Social SciencesNetwork Analysis in the Social Sciences
Network Analysis in the Social Sciences
 

More from Adil Alpkoçak

Expert Systems vs Clinical Decision Support Systems
Expert Systems vs Clinical Decision Support SystemsExpert Systems vs Clinical Decision Support Systems
Expert Systems vs Clinical Decision Support SystemsAdil Alpkoçak
 
Fotografcilikta Minimalizm (iFOD)
Fotografcilikta Minimalizm (iFOD)Fotografcilikta Minimalizm (iFOD)
Fotografcilikta Minimalizm (iFOD)Adil Alpkoçak
 
Fotoğrafçılıkta Minimalizm
Fotoğrafçılıkta MinimalizmFotoğrafçılıkta Minimalizm
Fotoğrafçılıkta MinimalizmAdil Alpkoçak
 

More from Adil Alpkoçak (7)

Expert Systems vs Clinical Decision Support Systems
Expert Systems vs Clinical Decision Support SystemsExpert Systems vs Clinical Decision Support Systems
Expert Systems vs Clinical Decision Support Systems
 
Fotografcilikta Minimalizm (iFOD)
Fotografcilikta Minimalizm (iFOD)Fotografcilikta Minimalizm (iFOD)
Fotografcilikta Minimalizm (iFOD)
 
Lecture#5
Lecture#5Lecture#5
Lecture#5
 
Lecture #2 xml
Lecture #2 xmlLecture #2 xml
Lecture #2 xml
 
Lecture#01
Lecture#01Lecture#01
Lecture#01
 
Mir lecture1
Mir lecture1Mir lecture1
Mir lecture1
 
Fotoğrafçılıkta Minimalizm
Fotoğrafçılıkta MinimalizmFotoğrafçılıkta Minimalizm
Fotoğrafçılıkta Minimalizm
 

Recently uploaded

CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdf
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdfCCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdf
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdfAsst.prof M.Gokilavani
 
Artificial-Intelligence-in-Electronics (K).pptx
Artificial-Intelligence-in-Electronics (K).pptxArtificial-Intelligence-in-Electronics (K).pptx
Artificial-Intelligence-in-Electronics (K).pptxbritheesh05
 
HARMONY IN THE HUMAN BEING - Unit-II UHV-2
HARMONY IN THE HUMAN BEING - Unit-II UHV-2HARMONY IN THE HUMAN BEING - Unit-II UHV-2
HARMONY IN THE HUMAN BEING - Unit-II UHV-2RajaP95
 
Call Us ≽ 8377877756 ≼ Call Girls In Shastri Nagar (Delhi)
Call Us ≽ 8377877756 ≼ Call Girls In Shastri Nagar (Delhi)Call Us ≽ 8377877756 ≼ Call Girls In Shastri Nagar (Delhi)
Call Us ≽ 8377877756 ≼ Call Girls In Shastri Nagar (Delhi)dollysharma2066
 
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSAPPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSKurinjimalarL3
 
Concrete Mix Design - IS 10262-2019 - .pptx
Concrete Mix Design - IS 10262-2019 - .pptxConcrete Mix Design - IS 10262-2019 - .pptx
Concrete Mix Design - IS 10262-2019 - .pptxKartikeyaDwivedi3
 
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdfCCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdfAsst.prof M.Gokilavani
 
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...Soham Mondal
 
Application of Residue Theorem to evaluate real integrations.pptx
Application of Residue Theorem to evaluate real integrations.pptxApplication of Residue Theorem to evaluate real integrations.pptx
Application of Residue Theorem to evaluate real integrations.pptx959SahilShah
 
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort service
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort serviceGurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort service
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort servicejennyeacort
 
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...srsj9000
 
IVE Industry Focused Event - Defence Sector 2024
IVE Industry Focused Event - Defence Sector 2024IVE Industry Focused Event - Defence Sector 2024
IVE Industry Focused Event - Defence Sector 2024Mark Billinghurst
 
Oxy acetylene welding presentation note.
Oxy acetylene welding presentation note.Oxy acetylene welding presentation note.
Oxy acetylene welding presentation note.eptoze12
 
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVHARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVRajaP95
 
Call Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call GirlsCall Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call Girlsssuser7cb4ff
 
Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024hassan khalil
 
chaitra-1.pptx fake news detection using machine learning
chaitra-1.pptx  fake news detection using machine learningchaitra-1.pptx  fake news detection using machine learning
chaitra-1.pptx fake news detection using machine learningmisbanausheenparvam
 

Recently uploaded (20)

CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdf
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdfCCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdf
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdf
 
🔝9953056974🔝!!-YOUNG call girls in Rajendra Nagar Escort rvice Shot 2000 nigh...
🔝9953056974🔝!!-YOUNG call girls in Rajendra Nagar Escort rvice Shot 2000 nigh...🔝9953056974🔝!!-YOUNG call girls in Rajendra Nagar Escort rvice Shot 2000 nigh...
🔝9953056974🔝!!-YOUNG call girls in Rajendra Nagar Escort rvice Shot 2000 nigh...
 
Artificial-Intelligence-in-Electronics (K).pptx
Artificial-Intelligence-in-Electronics (K).pptxArtificial-Intelligence-in-Electronics (K).pptx
Artificial-Intelligence-in-Electronics (K).pptx
 
HARMONY IN THE HUMAN BEING - Unit-II UHV-2
HARMONY IN THE HUMAN BEING - Unit-II UHV-2HARMONY IN THE HUMAN BEING - Unit-II UHV-2
HARMONY IN THE HUMAN BEING - Unit-II UHV-2
 
Call Us ≽ 8377877756 ≼ Call Girls In Shastri Nagar (Delhi)
Call Us ≽ 8377877756 ≼ Call Girls In Shastri Nagar (Delhi)Call Us ≽ 8377877756 ≼ Call Girls In Shastri Nagar (Delhi)
Call Us ≽ 8377877756 ≼ Call Girls In Shastri Nagar (Delhi)
 
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSAPPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
 
Concrete Mix Design - IS 10262-2019 - .pptx
Concrete Mix Design - IS 10262-2019 - .pptxConcrete Mix Design - IS 10262-2019 - .pptx
Concrete Mix Design - IS 10262-2019 - .pptx
 
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdfCCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
 
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
 
Application of Residue Theorem to evaluate real integrations.pptx
Application of Residue Theorem to evaluate real integrations.pptxApplication of Residue Theorem to evaluate real integrations.pptx
Application of Residue Theorem to evaluate real integrations.pptx
 
★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
 
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort service
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort serviceGurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort service
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort service
 
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
 
IVE Industry Focused Event - Defence Sector 2024
IVE Industry Focused Event - Defence Sector 2024IVE Industry Focused Event - Defence Sector 2024
IVE Industry Focused Event - Defence Sector 2024
 
Oxy acetylene welding presentation note.
Oxy acetylene welding presentation note.Oxy acetylene welding presentation note.
Oxy acetylene welding presentation note.
 
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVHARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
 
Call Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call GirlsCall Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call Girls
 
Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024
 
young call girls in Green Park🔝 9953056974 🔝 escort Service
young call girls in Green Park🔝 9953056974 🔝 escort Serviceyoung call girls in Green Park🔝 9953056974 🔝 escort Service
young call girls in Green Park🔝 9953056974 🔝 escort Service
 
chaitra-1.pptx fake news detection using machine learning
chaitra-1.pptx  fake news detection using machine learningchaitra-1.pptx  fake news detection using machine learning
chaitra-1.pptx fake news detection using machine learning
 

Socialnetworkanalysis 100225055227-phpapp02

  • 1. Social Network Analysis (SNA) including a tutorial on concepts and methods Social Media – Dr. Giorgos Cheliotis (gcheliotis@nus.edu.sg) Communications and New Media, National University of Singapore
  • 2. Background: Network Analysis CNM Social Media Module – Giorgos Cheliotis (gcheliotis@nus.edu.sg)2 Newman et al, 2006 Newman et al, 2006 A very early example of network analysis comes from the city of Königsberg (now Kaliningrad). Famous mathematician Leonard Euler used a graph to prove that there is no path that crosses each of the city’s bridges only once (Newman et al, 2006). SNA has its origins in both social science and in the broader fields of network analysis and graph theory Network analysis concerns itself with the formulation and solution of problems that have a network structure; such structure is usually captured in a graph (see the circled structure to the right) Graph theory provides a set of abstract concepts and methods for the analysis of graphs. These, in combination with other analytical tools and with methods developed specifically for the visualization and analysis of social (and other) networks, form the basis of what we call SNA methods. But SNA is not just a methodology; it is a unique perspective on how society functions. Instead of focusing on individuals and their attributes, or on macroscopic social structures, it centers on relations between individuals, groups, or social institutions
  • 3. This is an early depiction of what we call an ‘ego’ network, i.e. a personal network. The graphic depicts varying tie strengths via concentric circles (Wellman, 1998) Background: Social Science CNM Social Media Module – Giorgos Cheliotis (gcheliotis@nus.edu.sg)3 Wellman, 1998 Studying society from a network perspective is to study individuals as embedded in a network of relations and seek explanations for social behavior in the structure of these networks rather than in the individuals alone. This ‘network perspective’ becomes increasingly relevant in a society that Manuel Castells has dubbed the network society. SNA has a long history in social science, although much of the work in advancing its methods has also come from mathematicians, physicists, biologists and computer scientists (because they too study networks of different types) The idea that networks of relations are important in social science is not new, but widespread availability of data and advances in computing and methodology have made it much easier now to apply SNA to a range of problems
  • 4. More examples from social science CNM Social Media Module – Giorgos Cheliotis (gcheliotis@nus.edu.sg)4 These visualizations depict the flow of communications in an organization before and after the introduction of a content management system (Garton et al, 1997) A visualization of US bloggers shows clearly how they tend to link predominantly to blogs supporting the same party, forming two distinct clusters (Adamic and Glance, 2005)
  • 5. In this example researchers collected a very large amount of data on the links between web pages and found out that the Web consists of a core of densely inter-linked pages, while most other web pages either link to or are linked to from that core. It was one of the first such insights into very large scale human-generated structures (Broder et al, 2000). Background: Other Domains CNM Social Media Module – Giorgos Cheliotis (gcheliotis@nus.edu.sg)5 Broder et al, 2000 (Social) Network Analysis has found applications in many domains beyond social science, although the greatest advances have generally been in relation to the study of structures generated by humans Computer scientists for example have used (and even developed new) network analysis methods to study webpages, Internet traffic, information dissemination, etc. One example in life sciences is the use of network analysis to study food chains in different ecosystems Mathematicians and (theoretical) physicists usually focus on producing new and complex methods for the analysis of networks, that can be used by anyone, in any domain where networks are relevant
  • 6. Practical applications CNM Social Media Module – Giorgos Cheliotis (gcheliotis@nus.edu.sg)6 Businesses use SNA to analyze and improve communication flow in their organization, or with their networks of partners and customers Law enforcement agencies (and the army) use SNA to identify criminal and terrorist networks from traces of communication that they collect; and then identify key players in these networks Social Network Sites like Facebook use basic elements of SNA to identify and recommend potential friends based on friends-of-friends Civil society organizations use SNA to uncover conflicts of interest in hidden connections between government bodies, lobbies and businesses Network operators (telephony, cable, mobile) use SNA-like methods to optimize the structure and capacity of their networks
  • 7. Why and when to use SNA  Whenever you are studying a social network, either offline or online, or when you wish to understand how to improve the effectiveness of the network  When you want to visualize your data so as to uncover patterns in relationships or interactions  When you want to follow the paths that information (or basically anything) follows in social networks  When you do quantitative research, although for qualitative research a network perspective is also valuable (a) The range of actions and opportunities afforded to individuals are often a function of their positions in social networks; uncovering these positions (instead of relying on common assumptions based on their roles and functions, say as fathers, mothers, teachers, workers) can yield more interesting and sometimes surprising results (b) A quantitative analysis of a social network can help you identify different types of actors in the network or key players, whom you can focus on for your qualitative research  SNA is clearly also useful in analyzing SNS’s, OC’s and social media in general, to test hypotheses on online behavior and CMC, to identify the causes for dysfunctional communities or networks, and to promote social cohesion and growth in an online community CNM Social Media Module – Giorgos Cheliotis (gcheliotis@nus.edu.sg)7
  • 8. Basic Concepts  Networks  Tie Strength  Key Players  Cohesion CNM Social Media Module – Giorgos Cheliotis (gcheliotis@nus.edu.sg)8 How to represent various social networks How to identify strong/weak ties in the network How to identify key/central nodes in network Measures of overall network structure
  • 9. Representing relations as networks CNM Social Media Module – Giorgos Cheliotis (gcheliotis@nus.edu.sg)9 1 2 3 4 1 2 3 4 Graph AnneAnne JimJim MaryMary JohnJohn Vertex (node) Edge (link) Can we study their interactions as a network? Communication Anne: Jim, tell the Murrays they’re invited Jim: Mary, you and your dad should come for dinner! Jim: Mr. Murray, you should both come for dinner Anne: Mary, did Jim tell you about the dinner? You must come. John: Mary, are you hungry? …
  • 10. Entering data on a directed graph CNM Social Media Module – Giorgos Cheliotis (gcheliotis@nus.edu.sg)10 1 2 3 4 Graph (directed) Edge list Adjacency matrix
  • 11. Representing an undirected graph CNM Social Media Module – Giorgos Cheliotis (gcheliotis@nus.edu.sg)11 1 2 3 4 Edge list remains the same Adjacency matrix becomes symmetric 1 2 3 4 Directed Undirected (who knows whom) (who contacts whom) But interpretation is different now
  • 12. Ego networks and ‘whole’ networks CNM Social Media Module – Giorgos Cheliotis (gcheliotis@nus.edu.sg)12 1 2 3 4 5 6 7 1 2 3 5 4 3’s ego network 1 2 54 3’s ego network without ego** ‘whole’ network* * no studied network is ‘whole’ in practice; it’s usually a partial picture of one’s real life networks (boundary specification problem) ** ego not needed for analysis as all alters are by definition connected to ego ego alter isolate
  • 13. Basic Concepts Networks  Tie Strength Key Players Cohesion CNM Social Media Module – Giorgos Cheliotis (gcheliotis@nus.edu.sg)13 How to represent various social networks How to identify strong/weak ties in the network How to identify key/central nodes in network Measures of overall network structure
  • 14. Adding weights to edges (directed or undirected) CNM Social Media Module – Giorgos Cheliotis (gcheliotis@nus.edu.sg)14 Edge list: add column of weights Adjacency matrix: add weights instead of 1 Weights could be: •Frequency of interaction in period of observation •Number of items exchanged in period •Individual perceptions of strength of relationship •Costs in communication or exchange, e.g. distance •Combinations of these 1 2 3 4 30 2 37 22 5
  • 15. Edge weights as relationship strength  Edges can represent interactions, flows of information or goods, similarities/affiliations, or social relations  Specifically for social relations, a ‘proxy’ for the strength of a tie can be:  the frequency of interaction (communication) or the amount of flow (exchange)  reciprocity in interaction or flow  the type of interaction or flow between the two parties (e.g., intimate or not)  other attributes of the nodes or ties (e.g., kin relationships)  The structure of the nodes’ neighborhood (e.g. many mutual ‘friends’)  Surveys and interviews allows us to establish the existence of mutual or one- sided strength/affection with greater certainty, but proxies above are also useful CNM Social Media Module – Giorgos Cheliotis (gcheliotis@nus.edu.sg)15
  • 16. Homophily, transitivity, and bridging  Homophily is the tendency to relate to people with similar characteristics (status, beliefs, etc.)  It leads to the formation of homogeneous groups (clusters) where forming relations is easier  Extreme homogenization can act counter to innovation and idea generation (heterophily is thus desirable in some contexts)  Homophilous ties can be strong or weak  Transitivity in SNA is a property of ties: if there is a tie between A and B and one between B and C, then in a transitive network A and C will also be connected  Strong ties are more often transitive than weak ties; transitivity is therefore evidence for the existence of strong ties (but not a necessary or sufficient condition)  Transitivity and homophily together lead to the formation of cliques (fully connected clusters)  Bridges are nodes and edges that connect across groups  Facilitate inter-group communication, increase social cohesion, and help spur innovation  They are usually weak ties, but not every weak tie is a bridge CNM Social Media Module – Giorgos Cheliotis (gcheliotis@nus.edu.sg)16 Homophily Strong Weak Transitivity Bridging Interlinked groups Heterophily Cliques Social network TIES CLUSTERING
  • 17. Basic Concepts Networks Tie Strength  Key Players Cohesion CNM Social Media Module – Giorgos Cheliotis (gcheliotis@nus.edu.sg)17 How to represent various social networks How to identify strong/weak ties in the network How to identify key/central nodes in network Measures of overall network structure
  • 18. Note on computational examples  In the examples that follow values were calculated with the sna and igraph packages for the R programming environment, which is widely used by specialists in the field (but is not the most user- friendly)  Results may vary across different software packages (e.g. when you use NodeXL or UCINET), mainly because SNA metrics can take various roughly equivalent forms  Consult the documentation of the software you are using when in doubt  In most cases, even if there are some differences in the output of your preferred software when compared to these notes, results will be qualitatively the same and thus interpretation will also be the same – but you have been warned! CNM Social Media Module – Giorgos Cheliotis (gcheliotis@nus.edu.sg)18
  • 19. Degree centrality  A node’s (in-) or (out-)degree is the number of links that lead into or out of the node  In an undirected graph they are of course identical  Often used as measure of a node’s degree of connectedness and hence also influence and/or popularity  Useful in assessing which nodes are central with respect to spreading information and influencing others in their immediate ‘neighborhood’ CNM Social Media Module – Giorgos Cheliotis (gcheliotis@nus.edu.sg)19 1 2 3 4 5 6 7 2 3 4 1 4 1 1 Nodes 3 and 5 have the highest degree (4) Values computed with the sna package in the R programming environment. Definitions of centrality measures may vary slightly in other software. Hypothetical graph
  • 20. Paths and shortest paths CNM Social Media Module – Giorgos Cheliotis (gcheliotis@nus.edu.sg)20 1 2 3 4  A path between two nodes is any sequence of non-repeating nodes that connects the two nodes  The shortest path between two nodes is the path that connects the two nodes with the shortest number of edges (also called the distance between the nodes)  In the example to the right, between nodes 1 and 4 there are two shortest paths of length 2: {1,2,4} and {1,3,4}  Other, longer paths between the two nodes are {1,2,3,4}, {1,3,2,4}, {1,2,5,3,4} and {1,3,5,2,4} (the longest paths)  Shorter paths are desirable when speed of communication or exchange is desired (often the case in many studies, but sometimes not, e.g. in networks that spread disease) Shortest path(s) 5 Hypothetical graph
  • 21. Betweenness centrality  For a given node v, calculate the number of shortest paths between nodes i and j that pass through v, and divide by all shortest paths between nodes i and j  Sum the above values for all node pairs i,j  Sometimes normalized such that the highest value is 1or that the sum of all betweenness centralities in the network is 1  Shows which nodes are more likely to be in communication paths between other nodes  Also useful in determining points where the network would break apart (think who would be cut off if nodes 3 or 5 would disappear) CNM Social Media Module – Giorgos Cheliotis (gcheliotis@nus.edu.sg)21 1 2 3 4 5 6 7 0 1.5 6.5 0 9 0 0 Node 5 has higher betweenness centrality than 3 Values computed with the sna package in the R programming environment. Definitions of centrality measures may vary slightly in other software.
  • 22. Closeness centrality  Calculate the mean length of all shortest paths from a node to all other nodes in the network (i.e. how many hops on average it takes to reach every other node)  Take the reciprocal of the above value so that higher values are ‘better’ (indicate higher closeness) like in other measures of centrality  It is a measure of reach, i.e. the speed with which information can reach other nodes from a given starting node CNM Social Media Module – Giorgos Cheliotis (gcheliotis@nus.edu.sg)22 1 2 3 4 5 6 7 0.5 0.67 0.75 0.46 0.75 0.46 0.46 Nodes 3 and 5 have the highest (i.e. best) closeness, while node 2 fares almost as well Note: Sometimes closeness is calculated without taking the reciprocal of the mean shortest path length. Then lower values are ‘better’. Values computed with the sna package in the R programming environment. Definitions of centrality measures may vary slightly in other software.
  • 23. Eigenvector centrality  A node’s eigenvector centrality is proportional to the sum of the eigenvector centralities of all nodes directly connected to it  In other words, a node with a high eigenvector centrality is connected to other nodes with high eigenvector centrality  This is similar to how Google ranks web pages: links from highly linked-to pages count more  Useful in determining who is connected to the most connected nodes CNM Social Media Module – Giorgos Cheliotis (gcheliotis@nus.edu.sg)23 1 2 3 4 5 6 7 0.36 0.49 0.54 0.19 0.49 0.17 0.17 Node 3 has the highest eigenvector centrality, closely followed by 2 and 5 Note: The term ‘eigenvector’ comes from mathematics (matrix algebra), but it is not necessary for understanding how to interpret this measure Values computed with the sna package in the R programming environment. Definitions of centrality measures may vary slightly in other software.
  • 24. Interpretation of measures (1) CNM Social Media Module – Giorgos Cheliotis (gcheliotis@nus.edu.sg)24  Degree  Betweenness  Closeness  Eigenvector How many people can this person reach directly? How likely is this person to be the most direct route between two people in the network? How fast can this person reach everyone in the network? How well is this person connected to other well- connected people? Centrality measure Interpretation in social networks
  • 25. Interpretation of measures (2) CNM Social Media Module – Giorgos Cheliotis (gcheliotis@nus.edu.sg)25  Degree  Betweenness  Closeness  Eigenvector In network of music collaborations: how many people has this person collaborated with? In network of spies: who is the spy though whom most of the confidential information is likely to flow? In network of sexual relations: how fast will an STD spread from this person to the rest of the network? In network of paper citations: who is the author that is most cited by other well-cited authors? Centrality measure Other possible interpretations…
  • 26. Identifying sets of key players  In the network to the right, node 10 is the most central according to degree centrality  But nodes 3 and 5 together will reach more nodes  Moreover the tie between them is critical; if severed, the network will break into two isolated sub-networks  It follows that other things being equal, players 3 and 5 together are more ‘key’ to this network than 10  Thinking about sets of key players is helpful! CNM Social Media Module – Giorgos Cheliotis (gcheliotis@nus.edu.sg)26 1 4 6 7 8 9 10 0 3 5 2
  • 27. Basic Concepts Networks Tie Strength Key Players  Cohesion CNM Social Media Module – Giorgos Cheliotis (gcheliotis@nus.edu.sg)27 How to represent various social networks How to identify strong/weak ties in the network How to identify key/central nodes in network How to characterize a network’s structure
  • 28. Reciprocity (degree of) CNM Social Media Module – Giorgos Cheliotis (gcheliotis@nus.edu.sg)28 1 2 3 4  The ratio of the number of relations which are reciprocated (i.e. there is an edge in both directions) over the total number of relations in the network  …where two vertices are said to be related if there is at least one edge between them  In the example to the right this would be 2/5=0.4 (whether this is considered high or low depends on the context)  A useful indicator of the degree of mutuality and reciprocal exchange in a network, which relate to social cohesion  Only makes sense in directed graphs Reciprocity for network = 0.4
  • 29. Density CNM Social Media Module – Giorgos Cheliotis (gcheliotis@nus.edu.sg)29 1 2 3 4  A network’s density is the ratio of the number of edges in the network over the total number of possible edges between all pairs of nodes (which is n(n-1)/2, where n is the number of vertices, for an undirected graph)  In the example network to the right density=5/6=0.83 (i.e. it is a fairly dense network; opposite would be a sparse network)  It is a common measure of how well connected a network is (in other words, how closely knit it is) – a perfectly connected network is called a clique and has density=1  A directed graph will have half the density of its undirected equivalent, because there are twice as many possible edges, i.e. n(n-1)  Density is useful in comparing networks against each other, or in doing the same for different regions within a single network 1 2 3 4 density = 5/6 = 0.83 density = 5/12 = 0.42 Edge present in network Possible but not present
  • 30. Clustering CNM Social Media Module – Giorgos Cheliotis (gcheliotis@nus.edu.sg)30 1 2 3 4 5 6 7 1 0.67 0.33 N/a 0.17 N/a N/a  A node’s clustering coefficient is the number of closed triplets in the node’s neighborhood over the total number of triplets in the neighborhood. It is also known as transitivity.  E.g., node 1 to the right has a value of 1 because it is only connected to 2 and 3, and these nodes are also connected to one another (i.e. the only triplet in the neighborhood of 1 is closed). We say that nodes 1,2, and 3 form a clique.  Clustering algorithms identify clusters or ‘communities’ within networks based on network structure and specific clustering criteria (example shown to the right with two clusters is based on edge betweenness, an equivalent for edges of the betweenness centrality presented earlier for nodes) Network clustering coefficient = 0.375 (3 nodes in each triangle x 2 triangles = 6 closed triplets divided by 16 total) Cluster A Cluster B Values computed with the igraph package in the R programming environment. Definitions of centrality measures may vary slightly in other software.
  • 31. Average and longest distance CNM Social Media Module – Giorgos Cheliotis (gcheliotis@nus.edu.sg)31 1 2 3 4 5 6 7  The longest shortest path (distance) between any two nodes in a network is called the network’s diameter  The diameter of the network on the right is 3; it is a useful measure of the reach of the network (as opposed to looking only at the total number of vertices or edges)  It also indicates how long it will take at most to reach any node in the network (sparser networks will generally have greater diameters)  The average of all shortest paths in a network is also interesting because it indicates how far apart any two nodes will be on average (average distance) diameter
  • 32. Small Worlds CNM Social Media Module – Giorgos Cheliotis (gcheliotis@nus.edu.sg)32  A small world is a network that looks almost random but exhibits a significantly high clustering coefficient (nodes tend to cluster locally) and a relatively short average path length (nodes can be reached in a few steps)  It is a very common structure in social networks because of transitivity in strong social ties and the ability of weak ties to reach across clusters (see also next page…)  Such a network will have many clusters but also many bridges between clusters that help shorten the average distance between nodes local cluster bridge You may have heard of the famous “6 degrees” of separation Sketch of small world structure
  • 33. Preferential Attachment CNM Social Media Module – Giorgos Cheliotis (gcheliotis@nus.edu.sg)33 A property of some networks, where, during their evolution and growth in time, a the great majority of new edges are to nodes with an already high degree; the degree of these nodes thus increases disproportionately, compared to most other nodes in the network  The result is a network with few very highly connected nodes and many nodes with a low degree  Such networks are said to exhibit a long-tailed degree distribution  And they tend to have a small- world structure! (so, as it turns out, transitivity and strong/weak tie characteristics are not necessary to explain small world structures, but they are common and can also lead to such structures) nodes ordered in descending degree degree short head long tail Example of network with preferential attachment Sketch of long-tailed degree distribution
  • 34. Reasons for preferential attachment CNM Social Media Module – Giorgos Cheliotis (gcheliotis@nus.edu.sg)34 We want to be associated with popular people, ideas, items, thus further increasing their popularity, irrespective of any objective, measurable characteristics We evaluate people and everything else based on objective quality criteria, so higher quality nodes will naturally attract more attention, faster Among nodes of similar attributes, those that reach critical mass first will become ‘stars’ with many friends and followers (‘halo effect’) Popularity Quality Mixed model May be impossible to predict who will become a star, even if quality matters Also known as ‘the good get better’ Also known as ‘the rich get richer’
  • 35. Core-Periphery Structures  A useful and relatively simple metric of the degree to which a social network is centralized or decentralized, is the centralization measure (usually normalized such that it takes values between 0 and 1)  It is based on calculating the differences in degrees between nodes; a network that greatly depends on 1-2 highly connected nodes (as a result for example of preferential attachment) will exhibit greater differences in degree centrality between nodes  Centralized structures can perform better at some tasks (like team-based problem-solving requiring coordination), but are more prone to failure if key players disconnect  In addition to centralization, many large groups and online communities have a core of densely connected users that are critical for connecting a much larger periphery  Cores can be identified visually, or by examining the location of high-degree nodes and their joint degree distributions (do high-degree nodes tend to connect to other high-degree nodes?)  Bow-tie analysis, famously used to analyze the structure of the Web, can also be used to distinguish between the core and other, more peripheral elements in a network (see earlier example here) CNM Social Media Module – Giorgos Cheliotis (gcheliotis@nus.edu.sg)35 More peripheral clusters and other structures Nodes in core
  • 36. Thoughts on Design CNM Social Media Module – Giorgos Cheliotis (gcheliotis@nus.edu.sg)36 How can an online social media platform (and its administrators) leverage the methods and insights of social network analysis? How can it encourage a network perspective among its users, such that they are aware of their ‘neighborhood’ and can learn how to work with it and/or expand it? What measures can an online community take to optimize its network structure? Example: cliques can be undesirable because they shun newcomers SNA inspired some of the first SNS’s (e.g. SixDegrees), but still not used so often in conjunction with design decisions – much untapped potential here How can online communities identify and utilize key players for the benefit of the community? What would be desirable structures for different types of online platforms? (not easy to answer)
  • 37. Analyzing your own ego-network CNM Social Media Module – Giorgos Cheliotis (gcheliotis@nus.edu.sg)37 • Use the steps outlined in the following pages to visualize and analyze your own network • Think about the key players in your network, the types of ties that you maintain with them, identify any clusters or communities within your network, etc. • Objective: practice SNA with real data! • Present your findings in class next week!
  • 38. Visualizing Facebook ego-network online  Launch the TouchGraph Facebook Browser  You should see a visualization of your network like the one to the right  Make sure to set to a value that will allow you to see your entire network (friends ranked according to highest betweeness centrality according to TouchGraph Help)  Go to “Advanced” and remove all filters on the data so that settings look like below:  Note: not possible to export your data for further analysis  You may also want to try TouchGraph Google Browser (it’s fun!) CNM Social Media Module – Giorgos Cheliotis (gcheliotis@nus.edu.sg)38 Example TouchGraph Facebook Layout Navigate the graph, examine friend ‘ranks’, friend positions in the network, clusters and what they have in common, try to identify weak and strong ties of yours and assess overall structure of your ego-network
  • 39. Exporting data for offline analysis  Data is more useful when you can extract it from an online platform and analyze with a variety of more powerful tools  Facebook, Twitter and other platforms have public Application Programming Interfaces (API’s) which allow computer programs to extract data among other things  Web crawlers can also be used to read and extract the data directly from the web pages which contain them  Doing either of the above on your own will usually require some programming skills  Thankfully, if you have no such experience, you can use free tools built by others :)  Bernie Hogan (Oxford Internet Institute) has developed a Facebook application that extracts a list of all edges in your ego-network (see instructions on next page)  Also, NodeXL (Windows only, see later slide) currently imports data from: Twitter, YouTube, Flickr, and your email client!  Let’s start with installing and learning to use NodeXL… CNM Social Media Module – Giorgos Cheliotis (gcheliotis@nus.edu.sg)39
  • 40. Using NodeXL for visualization & analysis  Download and install NodeXL Windows 7/Vista/XP, requires Excel 2007 (installation may take a while if additional software is needed for NodeXL to work) CNM Social Media Module – Giorgos Cheliotis (gcheliotis@nus.edu.sg)40 NodeXL sample screenshot  Launch Excel and select New -> My Templates ->NodeXLGraph.xltx  Go to “Import” and select the appropriate option for the data you wish to import (for Facebook import see next slide first!) NM4881A Tweep Network (weekly data)  Click on to ask NodeXL to compute centralities, network density, clustering coefficients, etc.  Select to display network graph. You can customize this using and as well as For more info read this NodeXL tutorial
  • 41.  Launch Excel and open the file that you just saved to your computer. 1. Excel will launch the Text Import Wizard. Select “Delimited”. Click “Next >” 2. Select “Space” as the delimiter, as shown here 3. Select “Next >” and then “Finish”. 4. A new file will be created in Excel. It contains a list of all the nodes in your ego-network, followed by a list of all edges. Scroll down until you find the edges, select all of them and copy them (Ctrl-C) 5. You can now open a new NodeXL file in Excel as explained in the previous slide. Instead of using NodeXL’s import function, paste the list of edges to the NodeXL worksheet, right here 6. In NodeXL select and “Get Vertices from Edge List” 7. Now you can compute graph metrics and visualize your data like explained in the previous slide! Exporting Facebook ego-network data CNM Social Media Module – Giorgos Cheliotis (gcheliotis@nus.edu.sg)41 This will explain how to export your Facebook data for analysis with a tool like NodeXL  To use Bernie Hogan’s tool on Facebook, click here. From the two options presented, select “UCInet”. This is a format specific to another tool, not NodeXL, but we will import this data into NodeXL because it’s easier to use.  After selecting “UCInet”, right-click on the link given to you and select to save the generated file to a folder on your computer.
  • 42. More options  Many more tools are used for SNA, although they generally require more expert knowledge. Some of these are:  Pajek (Windows, free)  UCInet (Windows, shareware)  Netdraw (Windows, free)  Mage (Windows, free)  GUESS (all platforms, free and open source)  R packages for SNA (all platforms, free and open source)  Gephi (all platforms, free and open source)  The field is continuously growing, so we can expect to see more user- friendly applications coming out in the next years… CNM Social Media Module – Giorgos Cheliotis (gcheliotis@nus.edu.sg)42
  • 43. Credits and licensing  Front page network graph by ilamont (license: CC BY)  Bridge routing illustrations in Newman et al, The Structure and Dynamics of Networks, Princeton University Press  Personal social network diagram in Mark Wellman (ed.), Networks in the Global Village, Westview Press  Visualization of interactions in organization in Garton et al, Studying Online Social Networks, JCMC  Visualization of US political bloggers in Lada Adamic and Natalie Glance, The Political Blogosphere and the 2004 US election: Divided They Blog, Proceedings of the International Conference on Knowledge Discovery and Data Mining, ACM Press, 2005  Web bow tie diagram by Broder et al, Graph Structure in the Web, Computer Networks, Elsevier  Bond/tie photo by ChrisK4u (license: CC BY-ND)  Visualization of small world network by AJC1 (license: CC BY-NC) 43 CNM Social Media Module – Giorgos Cheliotis (gcheliotis@nus.edu.sg) Original content in this presentation is licensed under the Creative Commons Singapore Attribution 3.0 license unless stated otherwise (see above) Original content in this presentation is licensed under the Creative Commons Singapore Attribution 3.0 license unless stated otherwise (see above)