SlideShare a Scribd company logo
1 of 47
CSE390 Advanced
Computer Networks
123
Lecture 15: Online Social
Networks
(The network is people)
Based on slides by A. Mislove, F. Schneider,
and W. Willinger. Updated by P Gill. Fall 2014
What are (online) social networks?
2
 Social networks are graphs of people
 Graph edges connect friends
 `Friend’ has different implications
 How hard is it to be Facebook `friends’?
 Online social networking
 Social network hosted by a Web site
 Friendship represents shared interest or trust
 Online friends may have never met
What are online social networks used
for?
3
 Popular for sharing content
 Photos (Flickr), videos (YouTube), blogs (LiveJournal),
profiles (Facebook, Orkut)
 Fixed broadband (Sandvine Q1 2014)
 YouTube 5.5% upload, 13.2% down
 Facebook 2.2% upload, 2.0% down
 Popular with users on the go
 Mobile (Sandvine Q1 2014)
 YouTube 3.8% up, 17.6% down
 Facebook 27.0% up, 14.0% down
Why are social networks
interesting?
4
 Popular way to connect
 Estimated 1.32B users online each day
 Average American spends 40 minutes/day on the site
 Changing the flow of information
 Formerly few ``writers’’ many ``readers’’ online
 Now anyone can write!
 What does this mean for Internet traffic?
 Important in regions with strict media controls
 E.g., Iran, Egypt using social media platforms to get
word out in times of unrest
 Useful in times of disaster
Notable incidents …
5
Not just a social phenomenon…
6
Facebook now contains photo and video
 Content delivery challenges!
YouTube is a large fraction of Google’s traffic!
Understanding properties of these networks is important
to understand how we build systems to support them!
• Graph Properties of OSNs
• (Mislove et al 2007)
• The Demise of MySpace
• (Torkjazi et al 2009)
• How do people use OSNs
• (Schneider et al 2009)
Outline
7
Required reading: Mislove et al. 2007
8
 One of the first measurement studies of online social
networks (OSNs)
 Large-scale measurement study and analysis of
multiple online social networks
 11 M users, 328 M links
 Four diverse OSNs
 Flickr: photosharing
 LiveJournal: blogging
 Orkut: social networking
 YouTube: video sharing
 Goals:
 Measure OSNs at scale
 Understand their static structural properties
How to measure OSNs at scale?
9
 Sites are reluctant to give out data
 Cannot enumerate user list in general
 Instead, performed crawls of the user graph
 Picked known seed user
 Crawled all of his friends
 Added new users to list
 Continued until all known users crawled
 Effectively performed a BFS of graph
Challenges faced
10
 Obtaining data using crawling presents unique
challenges
 Need to crawl quickly!
 Underlying network changes rapidly
 Consistent snapshot is hard to get
 Crawling completely
 Social networks aren’t necessarily connected
 Some users have no links! Or are in small clusters.
 Need to estimate the crawl coverage
How fast could they crawl?
11
 Crawled using a cluster of 58 machines
 Used APIs where available
 Otherwise, used screen scraping
 Crawls took varying times
 Flickr, YouTube 1 day
 LiveJournal 3 days
 Orkut (Partial) 39 days)
 Crawls subject to rate-limiting
 Discovered appropriate rates
Data collected
12
Able to crawl a large portion of the network
Node degrees vary by orders of magnitude
However, networks share many key properties
To ground analysis, will compare to Web [Broder et al.,
How are links distributed
13
What fraction of links are
symmetric?
14
Aside: User relationships on
Twitter
 Broadcasters
 News outlets, radio
stations
 No reason to follow
anyone
 Post playlists,
headlines
13
Aside: User relationships on
Twitter
 Acquaintances
 Similar number of
followers and
following
 Along the diagonal
 Green portion is top
1-percentile of
tweeters
14
Aside: User relationships on
Twitter
 Miscreants?
 Some people follow
many users
(programmatically)
 Hoping some will
follow them back
 Spam, widgets,
celebrities (at top)
15
Aside: User relationships on
Twitter
18
Twitter noticed the miscreants…
… enacted the 10% rule (you can follow 10% more people than follow
Complex network structure
20
Does a core exist?
21
How clustered is the fringe?
22
Implications
23
• Graph Properties of OSNs
• (Mislove et al 2007)
• The Demise of MySpace
• (Torkjazi et al 2009)
• How do people use OSNs
• (Schneider et al 2009)
Outline
24
Hot Today, Gone Tomorrow…
25
 Slides borrowed from W. Willinger
 Paper: Hot Today, Gone Tomorrow: On the
Migration of MySpace Users. M. Torkjazi, R.
Rejaie, and W. Willinger.
Motivation
 A majority of empirical studies of Online Social Networks
(OSNs) has focused on their associated friendship graphs
 What about the temporal dynamics of OSNs?
 What about the “active” portion of an OSN?
 A majority of empirical studies of OSNs has examined the
growth of these systems
 What about the patterns of decline in user population?
 What about changes over time in user activity?
 A majority of empirical studies of OSNs has been based on
connectivity information
 What about timing information?
 How to obtain relevant timing information?
8/17/2009 WOSN 2009 - Barcelona
26
This Study
 We examine the evolution of user population and user activity
in MySpace
 User arrival/activity/departure, life cycle of MySpace
 Why MySpace?
 It is one of the largest and most popular OSNs
 It provides several features making our study feasible
 Main challenges
 OSNs are often studied when they are popular and the
number of departure is negligible
 Popular OSNs tend to hide the information about user
departures
8/17/2009 WOSN 2009 - Barcelona
27
MySpace Features (I)
 Provides explicit profile status
 Public
 Private
 Invalid
 Availability of users’ last login
 Enables assessment of the level of activity among users
 Importantly, allows inference of population growth of MySpace
(see later for details)
 Global visibility
 http://www.myspace.com/user_id
8/17/2009 WOSN 2009 - Barcelona 28
MySpace Features (II)
 Monotonic assignment of numeric ID
 Searched periodically for currently
smallest unassigned ID and checked
that all larger IDs are unassigned; after
waiting for a short period, we observed
that the smallest unassigned ID (and
others after it) are now assigned.
 Found no apparent patterns in gaps
between consecutive invalid IDs
 No evidence for re-assignement of
deleted IDs
 Makes the selection of random
samples of MySpace users easy.
8/17/2009 WOSN 2009 - Barcelona 29
No visible pattern
Measurement
 Feb. 26th 2009: MySpace ID space [1 … 455,881,700]
 50 parallel samplers to collect 360K users in less than 12 hours
(0.1% of MySpace population)
 Using HTML parser to post-process the downloaded profiles
and extract
 User s’ profile status (invalid, public, private)
 Users’ last login date
 Users’ friend list (only for public profiles)
 Unable to parse last login info for 0.96% of public and 0.08% of
private profiles
 Last login info is not provided or is provided with obvious errors
(e.g. 1/1/0001)
8/17/2009 WOSN 2009 - Barcelona
30
On the Population size of MySpace
 Population of valid MySpace users (Feb. 26, 2009) was about
(41.5 + 17.3)% of 455,881,700 = 268M
 Compare with www.myspace.com/tom who has 266,029,430 friends
(Aug. 13, 2009)
 How has MySpace grown during the past years?
 How many “active” users are there in MySpace?
8/17/2009 WOSN 2009 - Barcelona 31
Total Invalid Public Private
362K 149K (41.2%) 150K (41.5%) 63K (17.3%)
On User Arrival
8/17/2009 WOSN 2009 - Barcelona 32
Public users
 What does user ID say about account creation time?
 Plot user ID vs. last login of that user for all our users
Private users
On User Arrival
 32% of public and 18% of private users are tourists
 Discovery of “tourists” enables accurate estimation of user
account creation time based on their associated user ID
8/17/2009 WOSN 2009 - Barcelona 33
Tourists
 What does user ID say about account creation time?
“Clean edge”
=
users whose last login
is shortly after their
account creation time
=
“MySpace tourists”
On MySpace’s Growth
 Use the observed uniform spread
of tourists across entire ID space
 Estimate account creation time by
last login time
 Estimate account creation time of
all sampled accounts based on
their ID.
8/17/2009 WOSN 2009 - Barcelona 34
April 2008
 Estimating the user population of MySpace in the past?
 Slope of the top line shows the growth rate of MySpace population
 Exponential growth until about April 2008
 Visible knee around April 2008 followed by a slow-down in growth
On User Departure
8/17/2009 WOSN 2009 - Barcelona 35
 More public and private profiles in
the first half of ID space
 More invalid profiles in the second
half of ID space
 Users joining the system earlier have been more likely to keep
their accounts than newer users
 Are newer users more likely to leave than older ones?
MySpace Life Cycle (I)
 Slow-down in the growth
rate of MySpace is related to
emergence of Facebook
 Informal evidence
(Alexa.com): Daily accesses
to Facebook surpassed that
of MySpace, at around April
2008
8/17/2009 WOSN 2009 - Barcelona 36
 Possible reasons behind MySpace’s decline?
• Graph Properties of OSNs
• (Mislove et al 2007)
• The Demise of MySpace
• (Torkjazi et al 2009)
• How do people use OSNs
• (Schneider et al 2009)
Outline
37
Understanding Online Social Network
Usage from a Network Perspective
38
 F. Schneider, A. Feldmann, B. Krishnamurthy, and
W. Willinger. ACM Internet Measurement
Conference 2009
 Slides borrowed from F. Schneider.
 This study differs from a lot of related work by
looking at OSN behavior at the network traffic level
 Vs. crawling the application-level social graph
General Approach
39
OSNs studied
40
HTTP Traces
41
Categories of pages
42
Pages manually classified based on small user
generated traces in the lab setting
Session Characteristics
43
Action popularity
44
Feature sequences
45
OSNs: Wrap up
46
 Many different types of OSNs
 Photos, video, profile-based
 Some extremely popular source of much Internet
traffic
 Facebook, YouTube
 New ones emerging
 Instagram, snapchat
 Old ones fading
 MySpace, Friendster
 Studying their properties can inform how we build
End
47
BitTorrent Overview
48
Tracker
Swarm
Leechers
Seeder

More Related Content

Similar to Online Social Networks Graph Properties and User Behavior

Finaly! Untangling Web 2.0
Finaly! Untangling Web 2.0Finaly! Untangling Web 2.0
Finaly! Untangling Web 2.0sydneyblackmore
 
Library 2.0: the Future of Libraries?
Library 2.0: the Future of Libraries?Library 2.0: the Future of Libraries?
Library 2.0: the Future of Libraries?Ben Ropp
 
Online social networking
Online social networkingOnline social networking
Online social networkingGurudutt Reddy
 
Social Networking Informative Speech
Social Networking Informative SpeechSocial Networking Informative Speech
Social Networking Informative SpeechCory Bohon
 
User Behaviour Pattern Recognition On Twitter Social Network
User Behaviour Pattern Recognition On Twitter Social NetworkUser Behaviour Pattern Recognition On Twitter Social Network
User Behaviour Pattern Recognition On Twitter Social NetworkGeorge Konstantakopoulos
 
Step Into Social Media Slides From The Workshop January 19, 2010
Step Into Social Media   Slides From The Workshop   January 19, 2010Step Into Social Media   Slides From The Workshop   January 19, 2010
Step Into Social Media Slides From The Workshop January 19, 2010Promote Public Relations Ltd
 
Research on language and identity
Research on language and identityResearch on language and identity
Research on language and identityAzmi Latiff
 
Do you ever use facebook
Do you ever use facebookDo you ever use facebook
Do you ever use facebookhgfhjjff
 
A review for the online social networks literature
A review for the online social networks literatureA review for the online social networks literature
A review for the online social networks literatureAlexander Decker
 
A review for the online social networks literature
A review for the online social networks literatureA review for the online social networks literature
A review for the online social networks literatureAlexander Decker
 
New microsoft office word document
New microsoft office word documentNew microsoft office word document
New microsoft office word documentShailendra Singh
 
New microsoft office word document
New microsoft office word documentNew microsoft office word document
New microsoft office word documentShailendra Singh
 
11.face location a novel approach to post the user global location
11.face location   a novel approach to post the user global location11.face location   a novel approach to post the user global location
11.face location a novel approach to post the user global locationAlexander Decker
 
Social Web for VU Dagje Studeren
Social Web for VU Dagje Studeren Social Web for VU Dagje Studeren
Social Web for VU Dagje Studeren Victor de Boer
 
Student`S Approach towards Social Network Sites
Student`S Approach towards Social Network SitesStudent`S Approach towards Social Network Sites
Student`S Approach towards Social Network Sitesiosrjce
 

Similar to Online Social Networks Graph Properties and User Behavior (20)

Finaly! Untangling Web 2.0
Finaly! Untangling Web 2.0Finaly! Untangling Web 2.0
Finaly! Untangling Web 2.0
 
Library 2.0: the Future of Libraries?
Library 2.0: the Future of Libraries?Library 2.0: the Future of Libraries?
Library 2.0: the Future of Libraries?
 
Online social networking
Online social networkingOnline social networking
Online social networking
 
Social Networking Informative Speech
Social Networking Informative SpeechSocial Networking Informative Speech
Social Networking Informative Speech
 
User Behaviour Pattern Recognition On Twitter Social Network
User Behaviour Pattern Recognition On Twitter Social NetworkUser Behaviour Pattern Recognition On Twitter Social Network
User Behaviour Pattern Recognition On Twitter Social Network
 
2_Doc5_2.pdf
2_Doc5_2.pdf2_Doc5_2.pdf
2_Doc5_2.pdf
 
Step Into Social Media Slides From The Workshop January 19, 2010
Step Into Social Media   Slides From The Workshop   January 19, 2010Step Into Social Media   Slides From The Workshop   January 19, 2010
Step Into Social Media Slides From The Workshop January 19, 2010
 
Managing government use of social media
Managing government use of social mediaManaging government use of social media
Managing government use of social media
 
Research on language and identity
Research on language and identityResearch on language and identity
Research on language and identity
 
Social networks
Social networksSocial networks
Social networks
 
Do you ever use facebook
Do you ever use facebookDo you ever use facebook
Do you ever use facebook
 
Faculty presentation 21 st december 2015
Faculty presentation 21 st december 2015Faculty presentation 21 st december 2015
Faculty presentation 21 st december 2015
 
A review for the online social networks literature
A review for the online social networks literatureA review for the online social networks literature
A review for the online social networks literature
 
A review for the online social networks literature
A review for the online social networks literatureA review for the online social networks literature
A review for the online social networks literature
 
New microsoft office word document
New microsoft office word documentNew microsoft office word document
New microsoft office word document
 
New microsoft office word document
New microsoft office word documentNew microsoft office word document
New microsoft office word document
 
11.face location a novel approach to post the user global location
11.face location   a novel approach to post the user global location11.face location   a novel approach to post the user global location
11.face location a novel approach to post the user global location
 
Social Web for VU Dagje Studeren
Social Web for VU Dagje Studeren Social Web for VU Dagje Studeren
Social Web for VU Dagje Studeren
 
Student`S Approach towards Social Network Sites
Student`S Approach towards Social Network SitesStudent`S Approach towards Social Network Sites
Student`S Approach towards Social Network Sites
 
L19 Social
L19 SocialL19 Social
L19 Social
 

Recently uploaded

办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一fhwihughh
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdfHuman37
 
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...limedy534
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130Suhani Kapoor
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPramod Kumar Srivastava
 
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...Florian Roscheck
 
DBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfDBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfJohn Sterrett
 
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
vip Sarai Rohilla Call Girls 9999965857 Call or WhatsApp Now Book
vip Sarai Rohilla Call Girls 9999965857 Call or WhatsApp Now Bookvip Sarai Rohilla Call Girls 9999965857 Call or WhatsApp Now Book
vip Sarai Rohilla Call Girls 9999965857 Call or WhatsApp Now Bookmanojkuma9823
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationshipsccctableauusergroup
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]📊 Markus Baersch
 
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfKantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfSocial Samosa
 
Data Science Jobs and Salaries Analysis.pptx
Data Science Jobs and Salaries Analysis.pptxData Science Jobs and Salaries Analysis.pptx
Data Science Jobs and Salaries Analysis.pptxFurkanTasci3
 
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一F sss
 
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Sapana Sha
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsappssapnasaifi408
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptxthyngster
 
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Jack DiGiovanna
 
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)jennyeacort
 
Call Girls In Mahipalpur O9654467111 Escorts Service
Call Girls In Mahipalpur O9654467111  Escorts ServiceCall Girls In Mahipalpur O9654467111  Escorts Service
Call Girls In Mahipalpur O9654467111 Escorts ServiceSapana Sha
 

Recently uploaded (20)

办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf
 
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
 
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
 
DBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfDBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdf
 
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
 
vip Sarai Rohilla Call Girls 9999965857 Call or WhatsApp Now Book
vip Sarai Rohilla Call Girls 9999965857 Call or WhatsApp Now Bookvip Sarai Rohilla Call Girls 9999965857 Call or WhatsApp Now Book
vip Sarai Rohilla Call Girls 9999965857 Call or WhatsApp Now Book
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]
 
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfKantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
 
Data Science Jobs and Salaries Analysis.pptx
Data Science Jobs and Salaries Analysis.pptxData Science Jobs and Salaries Analysis.pptx
Data Science Jobs and Salaries Analysis.pptx
 
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
 
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
 
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
 
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
 
Call Girls In Mahipalpur O9654467111 Escorts Service
Call Girls In Mahipalpur O9654467111  Escorts ServiceCall Girls In Mahipalpur O9654467111  Escorts Service
Call Girls In Mahipalpur O9654467111 Escorts Service
 

Online Social Networks Graph Properties and User Behavior

  • 1. CSE390 Advanced Computer Networks 123 Lecture 15: Online Social Networks (The network is people) Based on slides by A. Mislove, F. Schneider, and W. Willinger. Updated by P Gill. Fall 2014
  • 2. What are (online) social networks? 2  Social networks are graphs of people  Graph edges connect friends  `Friend’ has different implications  How hard is it to be Facebook `friends’?  Online social networking  Social network hosted by a Web site  Friendship represents shared interest or trust  Online friends may have never met
  • 3. What are online social networks used for? 3  Popular for sharing content  Photos (Flickr), videos (YouTube), blogs (LiveJournal), profiles (Facebook, Orkut)  Fixed broadband (Sandvine Q1 2014)  YouTube 5.5% upload, 13.2% down  Facebook 2.2% upload, 2.0% down  Popular with users on the go  Mobile (Sandvine Q1 2014)  YouTube 3.8% up, 17.6% down  Facebook 27.0% up, 14.0% down
  • 4. Why are social networks interesting? 4  Popular way to connect  Estimated 1.32B users online each day  Average American spends 40 minutes/day on the site  Changing the flow of information  Formerly few ``writers’’ many ``readers’’ online  Now anyone can write!  What does this mean for Internet traffic?  Important in regions with strict media controls  E.g., Iran, Egypt using social media platforms to get word out in times of unrest  Useful in times of disaster
  • 6. Not just a social phenomenon… 6 Facebook now contains photo and video  Content delivery challenges! YouTube is a large fraction of Google’s traffic! Understanding properties of these networks is important to understand how we build systems to support them!
  • 7. • Graph Properties of OSNs • (Mislove et al 2007) • The Demise of MySpace • (Torkjazi et al 2009) • How do people use OSNs • (Schneider et al 2009) Outline 7
  • 8. Required reading: Mislove et al. 2007 8  One of the first measurement studies of online social networks (OSNs)  Large-scale measurement study and analysis of multiple online social networks  11 M users, 328 M links  Four diverse OSNs  Flickr: photosharing  LiveJournal: blogging  Orkut: social networking  YouTube: video sharing  Goals:  Measure OSNs at scale  Understand their static structural properties
  • 9. How to measure OSNs at scale? 9  Sites are reluctant to give out data  Cannot enumerate user list in general  Instead, performed crawls of the user graph  Picked known seed user  Crawled all of his friends  Added new users to list  Continued until all known users crawled  Effectively performed a BFS of graph
  • 10. Challenges faced 10  Obtaining data using crawling presents unique challenges  Need to crawl quickly!  Underlying network changes rapidly  Consistent snapshot is hard to get  Crawling completely  Social networks aren’t necessarily connected  Some users have no links! Or are in small clusters.  Need to estimate the crawl coverage
  • 11. How fast could they crawl? 11  Crawled using a cluster of 58 machines  Used APIs where available  Otherwise, used screen scraping  Crawls took varying times  Flickr, YouTube 1 day  LiveJournal 3 days  Orkut (Partial) 39 days)  Crawls subject to rate-limiting  Discovered appropriate rates
  • 12. Data collected 12 Able to crawl a large portion of the network Node degrees vary by orders of magnitude However, networks share many key properties To ground analysis, will compare to Web [Broder et al.,
  • 13. How are links distributed 13
  • 14. What fraction of links are symmetric? 14
  • 15. Aside: User relationships on Twitter  Broadcasters  News outlets, radio stations  No reason to follow anyone  Post playlists, headlines 13
  • 16. Aside: User relationships on Twitter  Acquaintances  Similar number of followers and following  Along the diagonal  Green portion is top 1-percentile of tweeters 14
  • 17. Aside: User relationships on Twitter  Miscreants?  Some people follow many users (programmatically)  Hoping some will follow them back  Spam, widgets, celebrities (at top) 15
  • 18. Aside: User relationships on Twitter 18 Twitter noticed the miscreants… … enacted the 10% rule (you can follow 10% more people than follow
  • 20. Does a core exist? 21
  • 21. How clustered is the fringe? 22
  • 23. • Graph Properties of OSNs • (Mislove et al 2007) • The Demise of MySpace • (Torkjazi et al 2009) • How do people use OSNs • (Schneider et al 2009) Outline 24
  • 24. Hot Today, Gone Tomorrow… 25  Slides borrowed from W. Willinger  Paper: Hot Today, Gone Tomorrow: On the Migration of MySpace Users. M. Torkjazi, R. Rejaie, and W. Willinger.
  • 25. Motivation  A majority of empirical studies of Online Social Networks (OSNs) has focused on their associated friendship graphs  What about the temporal dynamics of OSNs?  What about the “active” portion of an OSN?  A majority of empirical studies of OSNs has examined the growth of these systems  What about the patterns of decline in user population?  What about changes over time in user activity?  A majority of empirical studies of OSNs has been based on connectivity information  What about timing information?  How to obtain relevant timing information? 8/17/2009 WOSN 2009 - Barcelona 26
  • 26. This Study  We examine the evolution of user population and user activity in MySpace  User arrival/activity/departure, life cycle of MySpace  Why MySpace?  It is one of the largest and most popular OSNs  It provides several features making our study feasible  Main challenges  OSNs are often studied when they are popular and the number of departure is negligible  Popular OSNs tend to hide the information about user departures 8/17/2009 WOSN 2009 - Barcelona 27
  • 27. MySpace Features (I)  Provides explicit profile status  Public  Private  Invalid  Availability of users’ last login  Enables assessment of the level of activity among users  Importantly, allows inference of population growth of MySpace (see later for details)  Global visibility  http://www.myspace.com/user_id 8/17/2009 WOSN 2009 - Barcelona 28
  • 28. MySpace Features (II)  Monotonic assignment of numeric ID  Searched periodically for currently smallest unassigned ID and checked that all larger IDs are unassigned; after waiting for a short period, we observed that the smallest unassigned ID (and others after it) are now assigned.  Found no apparent patterns in gaps between consecutive invalid IDs  No evidence for re-assignement of deleted IDs  Makes the selection of random samples of MySpace users easy. 8/17/2009 WOSN 2009 - Barcelona 29 No visible pattern
  • 29. Measurement  Feb. 26th 2009: MySpace ID space [1 … 455,881,700]  50 parallel samplers to collect 360K users in less than 12 hours (0.1% of MySpace population)  Using HTML parser to post-process the downloaded profiles and extract  User s’ profile status (invalid, public, private)  Users’ last login date  Users’ friend list (only for public profiles)  Unable to parse last login info for 0.96% of public and 0.08% of private profiles  Last login info is not provided or is provided with obvious errors (e.g. 1/1/0001) 8/17/2009 WOSN 2009 - Barcelona 30
  • 30. On the Population size of MySpace  Population of valid MySpace users (Feb. 26, 2009) was about (41.5 + 17.3)% of 455,881,700 = 268M  Compare with www.myspace.com/tom who has 266,029,430 friends (Aug. 13, 2009)  How has MySpace grown during the past years?  How many “active” users are there in MySpace? 8/17/2009 WOSN 2009 - Barcelona 31 Total Invalid Public Private 362K 149K (41.2%) 150K (41.5%) 63K (17.3%)
  • 31. On User Arrival 8/17/2009 WOSN 2009 - Barcelona 32 Public users  What does user ID say about account creation time?  Plot user ID vs. last login of that user for all our users Private users
  • 32. On User Arrival  32% of public and 18% of private users are tourists  Discovery of “tourists” enables accurate estimation of user account creation time based on their associated user ID 8/17/2009 WOSN 2009 - Barcelona 33 Tourists  What does user ID say about account creation time? “Clean edge” = users whose last login is shortly after their account creation time = “MySpace tourists”
  • 33. On MySpace’s Growth  Use the observed uniform spread of tourists across entire ID space  Estimate account creation time by last login time  Estimate account creation time of all sampled accounts based on their ID. 8/17/2009 WOSN 2009 - Barcelona 34 April 2008  Estimating the user population of MySpace in the past?  Slope of the top line shows the growth rate of MySpace population  Exponential growth until about April 2008  Visible knee around April 2008 followed by a slow-down in growth
  • 34. On User Departure 8/17/2009 WOSN 2009 - Barcelona 35  More public and private profiles in the first half of ID space  More invalid profiles in the second half of ID space  Users joining the system earlier have been more likely to keep their accounts than newer users  Are newer users more likely to leave than older ones?
  • 35. MySpace Life Cycle (I)  Slow-down in the growth rate of MySpace is related to emergence of Facebook  Informal evidence (Alexa.com): Daily accesses to Facebook surpassed that of MySpace, at around April 2008 8/17/2009 WOSN 2009 - Barcelona 36  Possible reasons behind MySpace’s decline?
  • 36. • Graph Properties of OSNs • (Mislove et al 2007) • The Demise of MySpace • (Torkjazi et al 2009) • How do people use OSNs • (Schneider et al 2009) Outline 37
  • 37. Understanding Online Social Network Usage from a Network Perspective 38  F. Schneider, A. Feldmann, B. Krishnamurthy, and W. Willinger. ACM Internet Measurement Conference 2009  Slides borrowed from F. Schneider.  This study differs from a lot of related work by looking at OSN behavior at the network traffic level  Vs. crawling the application-level social graph
  • 41. Categories of pages 42 Pages manually classified based on small user generated traces in the lab setting
  • 45. OSNs: Wrap up 46  Many different types of OSNs  Photos, video, profile-based  Some extremely popular source of much Internet traffic  Facebook, YouTube  New ones emerging  Instagram, snapchat  Old ones fading  MySpace, Friendster  Studying their properties can inform how we build

Editor's Notes

  1. Stats: http://www.dailymail.co.uk/sciencetech/article-2703440/Theres-no-escape-Facebook-set-record-stock-high-results-beats-expectations-1-32-BILLION-users-30-mobile.html
  2. http://mathworld.wolfram.com/GraphDiameter.html Longest shortest path.
  3. Small-world : everyone can reach each other within a small number of hops