• Save
Network Analysis Basics (and applications to online networks)
Upcoming SlideShare
Loading in...5
×
 

Network Analysis Basics (and applications to online networks)

on

  • 5,465 views

Basics of network analysis, applications to the Web and social networks. Reference prepared for the Annenberg Program in Online Communities. ...

Basics of network analysis, applications to the Web and social networks. Reference prepared for the Annenberg Program in Online Communities.

Download the presentation PDF from http://kateto.net/resources/ .

Statistics

Views

Total Views
5,465
Views on SlideShare
4,033
Embed Views
1,432

Actions

Likes
22
Downloads
2
Comments
5

11 Embeds 1,432

http://kateto.net 1002
http://juanfelipeherrera.blogspot.com 230
http://ressivenetworks.com 87
http://www.scoop.it 65
http://markhawker.tumblr.com 37
https://elgg.leeds.ac.uk 3
http://wordsilo.blogspot.com 2
http://translate.googleusercontent.com 2
http://www.ressivenetworks.com 2
http://www.tumblr.com 1
http://feeds.feedburner.com 1
More...

Accessibility

Categories

Upload Details

Uploaded via as Microsoft PowerPoint

Usage Rights

CC Attribution-ShareAlike LicenseCC Attribution-ShareAlike License

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Processing…
Post Comment
Edit your comment
  • Density=L/N(N-1)
  • More than half of the people using social networking sites, furthermore, receive and follow links to news items on a daily basis. Blogs are another medium in which news content is propagated. As it is an accepted norm for bloggers to link to their sources (Chin & Chignell, 2006; Ferdig & Trammell, 2004), information diffusion in the blogosphere can be tracked based on hyperlink patterns and time stamps.Taking into account those new trends, scholars have started studying the spread of topics through both social networking platforms (Oh, Susarla, & Tan, 2008) and blogs (Leskovec, Backstrom, & Kleinberg, 2009; Leskovec, McGlohon, Faloutsos, Glance, & Hurst, 2007). The two analytical approaches used to explore the online diffusion of media content involve threshold models (Valente, 1996) and cascade models (Cointet & Roth, 2009). In threshold models, an actor's decision to disseminate a topic is based on the proportion of their connections who have already started discussing the subject. In a cascade model, each time an actor is "infected" with a new topic there is a certain probability that the infection will spread to neighboring nodes.Epidemic models of diffusion are often used for online content as their robustness has been well established through long use in other scientific fields. Much of the research looking into the network flow of information is based on methods initially developed to model the spread of disease through interpersonal connections. Clinical research in epidemiology often uses the SIR (susceptible - infected - recovered) cycle to describe the stages through which a node may go. The same model has been adapted to study audience members and their exposure to media content (Leskovec, et al., 2007). In the online-diffusion version of SIR, users may become susceptible to a topic when it is suggested to them by a friend (either through a blog post or via a service like Twitter or Facebook). The person may then be infected with the topic, in that they write a post about it or publish it on a social networking platform. With this, the individual is considered to have recovered from the topic, although a relapse is possible when something new appears on the subject. Modeling the spread of media content through social networks has allowed researchers to understand topic life-cycles, spikes and declines (Cointet, Faure, & Roth, 2007; Gruhl, Guha, Liben-Nowell, & Tomkins, 2004; Leskovec, et al., 2007). It has also provided a way to explore patterns of influence and identify opinion leaders (Java, 2006; Nakajima, Tatemura, Hara, Tanaka, & Uemura, 2006).

Network Analysis Basics (and applications to online networks) Network Analysis Basics (and applications to online networks) Presentation Transcript

  • Network Analysis Basics and applications to online data
    Katherine Ognyanova
    University of Southern California
    Prepared for the Annenberg Program
    for Online Communities, 2010.
  • Relational data
    Node (actor, vertex, etc.)
    Tie ( link, relation, edge, etc.)
  • Internet
    Internet Backbone: Each line is drawn between two nodes, representing two IP addresses. Source: Wikipedia
  • Global Media Networks
    Media corporations network. Source: Arsenault & Castells, 2008
  • US Senate
    US Senate cosponsorship network 1973-2004. Source: Fowler, 2006
  • High-School Romance
    Source: Easley, Kleinberg (2010) Networks, Crowds and Markets
  • Online Social Networks
    Twitter, 2010
  • SNiF
    Source: noah.cx, 2010
  • Basic Types of Networks
    HP
    MS
    Jill
    John
    Paul
    Jill
    Paul
    John
    Kate
    Kate
    Jim
    Jim
    Tom
    Jill
    Tom
    Jill
  • A closer look at links
    John
    Bob
    John
    Bob
    John
    Bob
    John
    Bob
    +
    5
    colleague
    friend
  • Distance in Networks
    B
    B
    B
    B
    A
    C
    E
    F
    A
    C
    E
    F
    A
    C
    E
    F
    A
    C
    E
    F
    D
    D
    D
    D
  • A Closer Look at Nodes
  • Node Types
    Isolate
    Liaison
    Gatekeeper
    Star
    Bridge
  • Node Centrality
  • Network Centralization & Density
    Degree, closeness and betweenness are centrality measures for individual nodes.
    Centralization is a network-level measure. It measures the degree to which an entire network is focused around a few central nodes. In a decentralized network, the links are more or less evenly distributed among nodes.
    Centralization is calculated based on the differences in degree centrality between nodes divided by the maximum possible sum of differences.
    Density : Ratio of the number of links to the number of possible links in the network
    Size: The number of nodes in a network
  • Example: Terrorist Networks
    Source: Business 2.0 December 2001. Six Degrees of Mohamed Atta
  • Social Network Measures & Mechanisms
  • US Political Blogosphere
    Linking patterns in the US political blogosphere. Source: Adamic & Glance (2005)
  • What does network data look like?
    Although there are other options (edge lists, node lists, etc.) network data is typically used in the form of a matrix .
    Row X column Y is 1 if there is a link from X to Y - and 0 if there is no link.
    The diagonal represents self-loops: links from X back to X.
    C
    D
    A
    B
  • Using UCINET for Network Analysis
    1
    2
    • UCINET: download from this website. You get two months of free trial.
    • Data Input: - Direct: Click on (1) – copy and paste from Excel or enter manually a matrix. - Import: Data menu -> Import via Spreadsheet - > Full matrix w/ multiple sheets. For every network Ucinet will create two files: .##h and .##d. If you move them, move both.
    • Density : Network menu -> Cohesion -> Density
    • Reciprocity / Transitivity / Clustering : Network menu -> Cohesion -> Reciprocity / Transitivity / Clustering
    • Node centrality : Network -> Centrality -> Multiple measures (old)
    • Visualization: Press button (2) for NetDraw.
  • Studying Social Structures Through Network Analysis
  • Collecting Network Data
  • Example: Snowball Sampling
    Source: Valente (2010) Social Networks and Health
  • The Web as a Network
  • E-Mail and Discussion Forum Networks
    Source: Welser, Gleave, Fisher & Smith, 2007
  • Diffusion Through Online Social Networks
  • Online Resources for Social Networks
    How can we collect data about online networks?
    • Facebook:Name Gen Web by Bernie Hogan is a Facebook app that will collect data about your network and let you download it in a UCINET file format.
    • NodeXLis an add-on for network analysis in Excel. It can collect and analyze Twitter, Flickr and e-mail nets.
    • SNAP (Stanford Network Analysis Platform) has a data library with big network datasets from Google, Amazon, Wikipedia, social network sites, etc.
    • IssueCrawler, LexiURLand SocSciBotare some of the available solutions for crawling web pages and collecting hyperlink networks.