SlideShare a Scribd company logo
CSE390 Advanced
Computer Networks
Lecture 15: Online Social
Networks
(The network is people)
Based on slides by A. Mislove, F. Schneider,
and W. Willinger. Updated by P Gill. Fall 2014
What are (online) social networks?
2
 Social networks are graphs of people
 Graph edges connect friends
 `Friend’ has different implications
 How hard is it to be Facebook `friends’?
 Online social networking
 Social network hosted by a Web site
 Friendship represents shared interest or trust
 Online friends may have never met
What are online social networks used
for?
3
 Popular for sharing content
 Photos (Flickr), videos (YouTube), blogs (LiveJournal),
profiles (Facebook, Orkut)
 Fixed broadband (Sandvine Q1 2014)
 YouTube 5.5% upload, 13.2% down
 Facebook 2.2% upload, 2.0% down
 Popular with users on the go
 Mobile (Sandvine Q1 2014)
 YouTube 3.8% up, 17.6% down
 Facebook 27.0% up, 14.0% down
Why are social networks
interesting?
4
 Popular way to connect
 Estimated 1.32B users online each day
 Average American spends 40 minutes/day on the site
 Changing the flow of information
 Formerly few ``writers’’ many ``readers’’ online
 Now anyone can write!
 What does this mean for Internet traffic?
 Important in regions with strict media controls
 E.g., Iran, Egypt using social media platforms to get
word out in times of unrest
 Useful in times of disaster
Notable incidents …
5
Not just a social phenomenon…
6
Facebook now contains photo and video
 Content delivery challenges!
YouTube is a large fraction of Google’s traffic!
Understanding properties of these networks is important
to understand how we build systems to support them!
• Graph Properties of OSNs
• (Mislove et al 2007)
• The Demise of MySpace
• (Torkjazi et al 2009)
• How do people use OSNs
• (Schneider et al 2009)
Outline
7
Required reading: Mislove et al. 2007
8
 One of the first measurement studies of online social
networks (OSNs)
 Large-scale measurement study and analysis of
multiple online social networks
 11 M users, 328 M links
 Four diverse OSNs
 Flickr: photosharing
 LiveJournal: blogging
 Orkut: social networking
 YouTube: video sharing
 Goals:
 Measure OSNs at scale
 Understand their static structural properties
How to measure OSNs at scale?
9
 Sites are reluctant to give out data
 Cannot enumerate user list in general
 Instead, performed crawls of the user graph
 Picked known seed user
 Crawled all of his friends
 Added new users to list
 Continued until all known users crawled
 Effectively performed a BFS of graph
Challenges faced
10
 Obtaining data using crawling presents unique
challenges
 Need to crawl quickly!
 Underlying network changes rapidly
 Consistent snapshot is hard to get
 Crawling completely
 Social networks aren’t necessarily connected
 Some users have no links! Or are in small clusters.
 Need to estimate the crawl coverage
How fast could they crawl?
11
 Crawled using a cluster of 58 machines
 Used APIs where available
 Otherwise, used screen scraping
 Crawls took varying times
 Flickr, YouTube 1 day
 LiveJournal 3 days
 Orkut (Partial) 39 days)
 Crawls subject to rate-limiting
 Discovered appropriate rates
Data collected
12
Able to crawl a large portion of the network
Node degrees vary by orders of magnitude
However, networks share many key properties
To ground analysis, will compare to Web [Broder et al.,
How are links distributed
13
What fraction of links are
symmetric?
14
Aside: User relationships on
Twitter
 Broadcasters
 News outlets, radio
stations
 No reason to follow
anyone
 Post playlists,
headlines
13
Aside: User relationships on
Twitter
 Acquaintances
 Similar number of
followers and
following
 Along the diagonal
 Green portion is top
1-percentile of
tweeters
14
Aside: User relationships on
Twitter
 Miscreants?
 Some people follow
many users
(programmatically)
 Hoping some will
follow them back
 Spam, widgets,
celebrities (at top)
15
Aside: User relationships on
Twitter
18
Twitter noticed the miscreants…
… enacted the 10% rule (you can follow 10% more people than follow
Complex network structure
20
Does a core exist?
21
How clustered is the fringe?
22
Implications
23
• Graph Properties of OSNs
• (Mislove et al 2007)
• The Demise of MySpace
• (Torkjazi et al 2009)
• How do people use OSNs
• (Schneider et al 2009)
Outline
24
Hot Today, Gone Tomorrow…
25
 Slides borrowed from W. Willinger
 Paper: Hot Today, Gone Tomorrow: On the
Migration of MySpace Users. M. Torkjazi, R.
Rejaie, and W. Willinger.
Motivation
 A majority of empirical studies of Online Social Networks
(OSNs) has focused on their associated friendship graphs
 What about the temporal dynamics of OSNs?
 What about the “active” portion of an OSN?
 A majority of empirical studies of OSNs has examined the
growth of these systems
 What about the patterns of decline in user population?
 What about changes over time in user activity?
 A majority of empirical studies of OSNs has been based on
connectivity information
 What about timing information?
 How to obtain relevant timing information?
8/17/2009 WOSN 2009 - Barcelona
26
This Study
 We examine the evolution of user population and user activity
in MySpace
 User arrival/activity/departure, life cycle of MySpace
 Why MySpace?
 It is one of the largest and most popular OSNs
 It provides several features making our study feasible
 Main challenges
 OSNs are often studied when they are popular and the
number of departure is negligible
 Popular OSNs tend to hide the information about user
departures
8/17/2009 WOSN 2009 - Barcelona
27
MySpace Features (I)
 Provides explicit profile status
 Public
 Private
 Invalid
 Availability of users’ last login
 Enables assessment of the level of activity among users
 Importantly, allows inference of population growth of MySpace
(see later for details)
 Global visibility
 http://www.myspace.com/user_id
8/17/2009 WOSN 2009 - Barcelona 28
MySpace Features (II)
 Monotonic assignment of numeric ID
 Searched periodically for currently
smallest unassigned ID and checked
that all larger IDs are unassigned; after
waiting for a short period, we observed
that the smallest unassigned ID (and
others after it) are now assigned.
 Found no apparent patterns in gaps
between consecutive invalid IDs
 No evidence for re-assignement of
deleted IDs
 Makes the selection of random
samples of MySpace users easy.
8/17/2009 WOSN 2009 - Barcelona 29
No visible pattern
Measurement
 Feb. 26th 2009: MySpace ID space [1 … 455,881,700]
 50 parallel samplers to collect 360K users in less than 12 hours
(0.1% of MySpace population)
 Using HTML parser to post-process the downloaded profiles
and extract
 User s’ profile status (invalid, public, private)
 Users’ last login date
 Users’ friend list (only for public profiles)
 Unable to parse last login info for 0.96% of public and 0.08% of
private profiles
 Last login info is not provided or is provided with obvious errors
(e.g. 1/1/0001)
8/17/2009 WOSN 2009 - Barcelona
30
On the Population size of MySpace
 Population of valid MySpace users (Feb. 26, 2009) was about
(41.5 + 17.3)% of 455,881,700 = 268M
 Compare with www.myspace.com/tom who has 266,029,430 friends
(Aug. 13, 2009)
 How has MySpace grown during the past years?
 How many “active” users are there in MySpace?
8/17/2009 WOSN 2009 - Barcelona 31
Total Invalid Public Private
362K 149K (41.2%) 150K (41.5%) 63K (17.3%)
On User Arrival
8/17/2009 WOSN 2009 - Barcelona 32
Public users
 What does user ID say about account creation time?
 Plot user ID vs. last login of that user for all our users
Private users
On User Arrival
 32% of public and 18% of private users are tourists
 Discovery of “tourists” enables accurate estimation of user
account creation time based on their associated user ID
8/17/2009 WOSN 2009 - Barcelona 33
Tourists
 What does user ID say about account creation time?
“Clean edge”
=
users whose last login
is shortly after their
account creation time
=
“MySpace tourists”
On MySpace’s Growth
 Use the observed uniform spread
of tourists across entire ID space
 Estimate account creation time by
last login time
 Estimate account creation time of
all sampled accounts based on
their ID.
8/17/2009 WOSN 2009 - Barcelona 34
April 2008
 Estimating the user population of MySpace in the past?
 Slope of the top line shows the growth rate of MySpace population
 Exponential growth until about April 2008
 Visible knee around April 2008 followed by a slow-down in growth
On User Departure
8/17/2009 WOSN 2009 - Barcelona 35
 More public and private profiles in
the first half of ID space
 More invalid profiles in the second
half of ID space
 Users joining the system earlier have been more likely to keep
their accounts than newer users
 Are newer users more likely to leave than older ones?
MySpace Life Cycle (I)
 Slow-down in the growth
rate of MySpace is related to
emergence of Facebook
 Informal evidence
(Alexa.com): Daily accesses
to Facebook surpassed that
of MySpace, at around April
2008
8/17/2009 WOSN 2009 - Barcelona 36
 Possible reasons behind MySpace’s decline?
• Graph Properties of OSNs
• (Mislove et al 2007)
• The Demise of MySpace
• (Torkjazi et al 2009)
• How do people use OSNs
• (Schneider et al 2009)
Outline
37
Understanding Online Social Network
Usage from a Network Perspective
38
 F. Schneider, A. Feldmann, B. Krishnamurthy, and
W. Willinger. ACM Internet Measurement
Conference 2009
 Slides borrowed from F. Schneider.
 This study differs from a lot of related work by
looking at OSN behavior at the network traffic level
 Vs. crawling the application-level social graph
General Approach
39
OSNs studied
40
HTTP Traces
41
Categories of pages
42
Pages manually classified based on small user
generated traces in the lab setting
Session Characteristics
43
Action popularity
44
Feature sequences
45
OSNs: Wrap up
46
 Many different types of OSNs
 Photos, video, profile-based
 Some extremely popular source of much Internet
traffic
 Facebook, YouTube
 New ones emerging
 Instagram, snapchat
 Old ones fading
 MySpace, Friendster
 Studying their properties can inform how we build
End
47
BitTorrent Overview
48
Tracker
Swarm
Leechers
Seeder

More Related Content

Similar to OSNs.pptx

Brian solis
Brian solisBrian solis
Brian solis
Bijgespijkerd.nl
 
Online social networking
Online social networkingOnline social networking
Online social networking
Gurudutt Reddy
 
Finaly! Untangling Web 2.0
Finaly! Untangling Web 2.0Finaly! Untangling Web 2.0
Finaly! Untangling Web 2.0
sydneyblackmore
 
Social Networking Informative Speech
Social Networking Informative SpeechSocial Networking Informative Speech
Social Networking Informative Speech
Cory Bohon
 
Step Into Social Media Slides From The Workshop January 19, 2010
Step Into Social Media   Slides From The Workshop   January 19, 2010Step Into Social Media   Slides From The Workshop   January 19, 2010
Step Into Social Media Slides From The Workshop January 19, 2010
Promote Public Relations Ltd
 
User Behaviour Pattern Recognition On Twitter Social Network
User Behaviour Pattern Recognition On Twitter Social NetworkUser Behaviour Pattern Recognition On Twitter Social Network
User Behaviour Pattern Recognition On Twitter Social Network
George Konstantakopoulos
 
2_Doc5_2.pdf
2_Doc5_2.pdf2_Doc5_2.pdf
2_Doc5_2.pdf
Ashishpal342837
 
Research on language and identity
Research on language and identityResearch on language and identity
Research on language and identity
Azmi Latiff
 
Social networks
Social networksSocial networks
Social networks
Pasha Binyatov
 
Do you ever use facebook
Do you ever use facebookDo you ever use facebook
Do you ever use facebook
hgfhjjff
 
Managing government use of social media
Managing government use of social mediaManaging government use of social media
Managing government use of social media
E-Government Center Moldova
 
11.face location a novel approach to post the user global location
11.face location   a novel approach to post the user global location11.face location   a novel approach to post the user global location
11.face location a novel approach to post the user global location
Alexander Decker
 
Faculty presentation 21 st december 2015
Faculty presentation 21 st december 2015Faculty presentation 21 st december 2015
Faculty presentation 21 st december 2015
Kamoru Abiodun Balogun(Bsc,MIT,CCNA,OCA,PhD inview UPM)
 
A review for the online social networks literature
A review for the online social networks literatureA review for the online social networks literature
A review for the online social networks literature
Alexander Decker
 
A review for the online social networks literature
A review for the online social networks literatureA review for the online social networks literature
A review for the online social networks literature
Alexander Decker
 
New microsoft office word document
New microsoft office word documentNew microsoft office word document
New microsoft office word document
Shailendra Singh
 
New microsoft office word document
New microsoft office word documentNew microsoft office word document
New microsoft office word document
Shailendra Singh
 
Student`S Approach towards Social Network Sites
Student`S Approach towards Social Network SitesStudent`S Approach towards Social Network Sites
Student`S Approach towards Social Network Sites
iosrjce
 
L19 Social
L19 SocialL19 Social
L19 Social
L19 SocialL19 Social

Similar to OSNs.pptx (20)

Brian solis
Brian solisBrian solis
Brian solis
 
Online social networking
Online social networkingOnline social networking
Online social networking
 
Finaly! Untangling Web 2.0
Finaly! Untangling Web 2.0Finaly! Untangling Web 2.0
Finaly! Untangling Web 2.0
 
Social Networking Informative Speech
Social Networking Informative SpeechSocial Networking Informative Speech
Social Networking Informative Speech
 
Step Into Social Media Slides From The Workshop January 19, 2010
Step Into Social Media   Slides From The Workshop   January 19, 2010Step Into Social Media   Slides From The Workshop   January 19, 2010
Step Into Social Media Slides From The Workshop January 19, 2010
 
User Behaviour Pattern Recognition On Twitter Social Network
User Behaviour Pattern Recognition On Twitter Social NetworkUser Behaviour Pattern Recognition On Twitter Social Network
User Behaviour Pattern Recognition On Twitter Social Network
 
2_Doc5_2.pdf
2_Doc5_2.pdf2_Doc5_2.pdf
2_Doc5_2.pdf
 
Research on language and identity
Research on language and identityResearch on language and identity
Research on language and identity
 
Social networks
Social networksSocial networks
Social networks
 
Do you ever use facebook
Do you ever use facebookDo you ever use facebook
Do you ever use facebook
 
Managing government use of social media
Managing government use of social mediaManaging government use of social media
Managing government use of social media
 
11.face location a novel approach to post the user global location
11.face location   a novel approach to post the user global location11.face location   a novel approach to post the user global location
11.face location a novel approach to post the user global location
 
Faculty presentation 21 st december 2015
Faculty presentation 21 st december 2015Faculty presentation 21 st december 2015
Faculty presentation 21 st december 2015
 
A review for the online social networks literature
A review for the online social networks literatureA review for the online social networks literature
A review for the online social networks literature
 
A review for the online social networks literature
A review for the online social networks literatureA review for the online social networks literature
A review for the online social networks literature
 
New microsoft office word document
New microsoft office word documentNew microsoft office word document
New microsoft office word document
 
New microsoft office word document
New microsoft office word documentNew microsoft office word document
New microsoft office word document
 
Student`S Approach towards Social Network Sites
Student`S Approach towards Social Network SitesStudent`S Approach towards Social Network Sites
Student`S Approach towards Social Network Sites
 
L19 Social
L19 SocialL19 Social
L19 Social
 
L19 Social
L19 SocialL19 Social
L19 Social
 

Recently uploaded

Senior Software Profiles Backend Sample - Sheet1.pdf
Senior Software Profiles  Backend Sample - Sheet1.pdfSenior Software Profiles  Backend Sample - Sheet1.pdf
Senior Software Profiles Backend Sample - Sheet1.pdf
Vineet
 
一比一原版卡尔加里大学毕业证(uc毕业证)如何办理
一比一原版卡尔加里大学毕业证(uc毕业证)如何办理一比一原版卡尔加里大学毕业证(uc毕业证)如何办理
一比一原版卡尔加里大学毕业证(uc毕业证)如何办理
oaxefes
 
Digital Marketing Performance Marketing Sample .pdf
Digital Marketing Performance Marketing  Sample .pdfDigital Marketing Performance Marketing  Sample .pdf
Digital Marketing Performance Marketing Sample .pdf
Vineet
 
一比一原版兰加拉学院毕业证(Langara毕业证书)学历如何办理
一比一原版兰加拉学院毕业证(Langara毕业证书)学历如何办理一比一原版兰加拉学院毕业证(Langara毕业证书)学历如何办理
一比一原版兰加拉学院毕业证(Langara毕业证书)学历如何办理
hyfjgavov
 
一比一原版南昆士兰大学毕业证如何办理
一比一原版南昆士兰大学毕业证如何办理一比一原版南昆士兰大学毕业证如何办理
一比一原版南昆士兰大学毕业证如何办理
ugydym
 
Q4FY24 Investor-Presentation.pdf bank slide
Q4FY24 Investor-Presentation.pdf bank slideQ4FY24 Investor-Presentation.pdf bank slide
Q4FY24 Investor-Presentation.pdf bank slide
mukulupadhayay1
 
Discovering Digital Process Twins for What-if Analysis: a Process Mining Appr...
Discovering Digital Process Twins for What-if Analysis: a Process Mining Appr...Discovering Digital Process Twins for What-if Analysis: a Process Mining Appr...
Discovering Digital Process Twins for What-if Analysis: a Process Mining Appr...
Marlon Dumas
 
PyData London 2024: Mistakes were made (Dr. Rebecca Bilbro)
PyData London 2024: Mistakes were made (Dr. Rebecca Bilbro)PyData London 2024: Mistakes were made (Dr. Rebecca Bilbro)
PyData London 2024: Mistakes were made (Dr. Rebecca Bilbro)
Rebecca Bilbro
 
06-18-2024-Princeton Meetup-Introduction to Milvus
06-18-2024-Princeton Meetup-Introduction to Milvus06-18-2024-Princeton Meetup-Introduction to Milvus
06-18-2024-Princeton Meetup-Introduction to Milvus
Timothy Spann
 
一比一原版(曼大毕业证书)曼尼托巴大学毕业证如何办理
一比一原版(曼大毕业证书)曼尼托巴大学毕业证如何办理一比一原版(曼大毕业证书)曼尼托巴大学毕业证如何办理
一比一原版(曼大毕业证书)曼尼托巴大学毕业证如何办理
ytypuem
 
一比一原版澳洲西澳大学毕业证(uwa毕业证书)如何办理
一比一原版澳洲西澳大学毕业证(uwa毕业证书)如何办理一比一原版澳洲西澳大学毕业证(uwa毕业证书)如何办理
一比一原版澳洲西澳大学毕业证(uwa毕业证书)如何办理
aguty
 
一比一原版爱尔兰都柏林大学毕业证(本硕)ucd学位证书如何办理
一比一原版爱尔兰都柏林大学毕业证(本硕)ucd学位证书如何办理一比一原版爱尔兰都柏林大学毕业证(本硕)ucd学位证书如何办理
一比一原版爱尔兰都柏林大学毕业证(本硕)ucd学位证书如何办理
hqfek
 
Salesforce AI + Data Community Tour Slides - Canarias
Salesforce AI + Data Community Tour Slides - CanariasSalesforce AI + Data Community Tour Slides - Canarias
Salesforce AI + Data Community Tour Slides - Canarias
davidpietrzykowski1
 
reading_sample_sap_press_operational_data_provisioning_with_sap_bw4hana (1).pdf
reading_sample_sap_press_operational_data_provisioning_with_sap_bw4hana (1).pdfreading_sample_sap_press_operational_data_provisioning_with_sap_bw4hana (1).pdf
reading_sample_sap_press_operational_data_provisioning_with_sap_bw4hana (1).pdf
perranet1
 
Telemetry Solution for Gaming (AWS Summit'24)
Telemetry Solution for Gaming (AWS Summit'24)Telemetry Solution for Gaming (AWS Summit'24)
Telemetry Solution for Gaming (AWS Summit'24)
GeorgiiSteshenko
 
How To Control IO Usage using Resource Manager
How To Control IO Usage using Resource ManagerHow To Control IO Usage using Resource Manager
How To Control IO Usage using Resource Manager
Alireza Kamrani
 
一比一原版英国赫特福德大学毕业证(hertfordshire毕业证书)如何办理
一比一原版英国赫特福德大学毕业证(hertfordshire毕业证书)如何办理一比一原版英国赫特福德大学毕业证(hertfordshire毕业证书)如何办理
一比一原版英国赫特福德大学毕业证(hertfordshire毕业证书)如何办理
nyvan3
 
一比一原版马来西亚博特拉大学毕业证(upm毕业证)如何办理
一比一原版马来西亚博特拉大学毕业证(upm毕业证)如何办理一比一原版马来西亚博特拉大学毕业证(upm毕业证)如何办理
一比一原版马来西亚博特拉大学毕业证(upm毕业证)如何办理
eudsoh
 
一比一原版美国帕森斯设计学院毕业证(parsons毕业证书)如何办理
一比一原版美国帕森斯设计学院毕业证(parsons毕业证书)如何办理一比一原版美国帕森斯设计学院毕业证(parsons毕业证书)如何办理
一比一原版美国帕森斯设计学院毕业证(parsons毕业证书)如何办理
asyed10
 
一比一原版莱斯大学毕业证(rice毕业证)如何办理
一比一原版莱斯大学毕业证(rice毕业证)如何办理一比一原版莱斯大学毕业证(rice毕业证)如何办理
一比一原版莱斯大学毕业证(rice毕业证)如何办理
zsafxbf
 

Recently uploaded (20)

Senior Software Profiles Backend Sample - Sheet1.pdf
Senior Software Profiles  Backend Sample - Sheet1.pdfSenior Software Profiles  Backend Sample - Sheet1.pdf
Senior Software Profiles Backend Sample - Sheet1.pdf
 
一比一原版卡尔加里大学毕业证(uc毕业证)如何办理
一比一原版卡尔加里大学毕业证(uc毕业证)如何办理一比一原版卡尔加里大学毕业证(uc毕业证)如何办理
一比一原版卡尔加里大学毕业证(uc毕业证)如何办理
 
Digital Marketing Performance Marketing Sample .pdf
Digital Marketing Performance Marketing  Sample .pdfDigital Marketing Performance Marketing  Sample .pdf
Digital Marketing Performance Marketing Sample .pdf
 
一比一原版兰加拉学院毕业证(Langara毕业证书)学历如何办理
一比一原版兰加拉学院毕业证(Langara毕业证书)学历如何办理一比一原版兰加拉学院毕业证(Langara毕业证书)学历如何办理
一比一原版兰加拉学院毕业证(Langara毕业证书)学历如何办理
 
一比一原版南昆士兰大学毕业证如何办理
一比一原版南昆士兰大学毕业证如何办理一比一原版南昆士兰大学毕业证如何办理
一比一原版南昆士兰大学毕业证如何办理
 
Q4FY24 Investor-Presentation.pdf bank slide
Q4FY24 Investor-Presentation.pdf bank slideQ4FY24 Investor-Presentation.pdf bank slide
Q4FY24 Investor-Presentation.pdf bank slide
 
Discovering Digital Process Twins for What-if Analysis: a Process Mining Appr...
Discovering Digital Process Twins for What-if Analysis: a Process Mining Appr...Discovering Digital Process Twins for What-if Analysis: a Process Mining Appr...
Discovering Digital Process Twins for What-if Analysis: a Process Mining Appr...
 
PyData London 2024: Mistakes were made (Dr. Rebecca Bilbro)
PyData London 2024: Mistakes were made (Dr. Rebecca Bilbro)PyData London 2024: Mistakes were made (Dr. Rebecca Bilbro)
PyData London 2024: Mistakes were made (Dr. Rebecca Bilbro)
 
06-18-2024-Princeton Meetup-Introduction to Milvus
06-18-2024-Princeton Meetup-Introduction to Milvus06-18-2024-Princeton Meetup-Introduction to Milvus
06-18-2024-Princeton Meetup-Introduction to Milvus
 
一比一原版(曼大毕业证书)曼尼托巴大学毕业证如何办理
一比一原版(曼大毕业证书)曼尼托巴大学毕业证如何办理一比一原版(曼大毕业证书)曼尼托巴大学毕业证如何办理
一比一原版(曼大毕业证书)曼尼托巴大学毕业证如何办理
 
一比一原版澳洲西澳大学毕业证(uwa毕业证书)如何办理
一比一原版澳洲西澳大学毕业证(uwa毕业证书)如何办理一比一原版澳洲西澳大学毕业证(uwa毕业证书)如何办理
一比一原版澳洲西澳大学毕业证(uwa毕业证书)如何办理
 
一比一原版爱尔兰都柏林大学毕业证(本硕)ucd学位证书如何办理
一比一原版爱尔兰都柏林大学毕业证(本硕)ucd学位证书如何办理一比一原版爱尔兰都柏林大学毕业证(本硕)ucd学位证书如何办理
一比一原版爱尔兰都柏林大学毕业证(本硕)ucd学位证书如何办理
 
Salesforce AI + Data Community Tour Slides - Canarias
Salesforce AI + Data Community Tour Slides - CanariasSalesforce AI + Data Community Tour Slides - Canarias
Salesforce AI + Data Community Tour Slides - Canarias
 
reading_sample_sap_press_operational_data_provisioning_with_sap_bw4hana (1).pdf
reading_sample_sap_press_operational_data_provisioning_with_sap_bw4hana (1).pdfreading_sample_sap_press_operational_data_provisioning_with_sap_bw4hana (1).pdf
reading_sample_sap_press_operational_data_provisioning_with_sap_bw4hana (1).pdf
 
Telemetry Solution for Gaming (AWS Summit'24)
Telemetry Solution for Gaming (AWS Summit'24)Telemetry Solution for Gaming (AWS Summit'24)
Telemetry Solution for Gaming (AWS Summit'24)
 
How To Control IO Usage using Resource Manager
How To Control IO Usage using Resource ManagerHow To Control IO Usage using Resource Manager
How To Control IO Usage using Resource Manager
 
一比一原版英国赫特福德大学毕业证(hertfordshire毕业证书)如何办理
一比一原版英国赫特福德大学毕业证(hertfordshire毕业证书)如何办理一比一原版英国赫特福德大学毕业证(hertfordshire毕业证书)如何办理
一比一原版英国赫特福德大学毕业证(hertfordshire毕业证书)如何办理
 
一比一原版马来西亚博特拉大学毕业证(upm毕业证)如何办理
一比一原版马来西亚博特拉大学毕业证(upm毕业证)如何办理一比一原版马来西亚博特拉大学毕业证(upm毕业证)如何办理
一比一原版马来西亚博特拉大学毕业证(upm毕业证)如何办理
 
一比一原版美国帕森斯设计学院毕业证(parsons毕业证书)如何办理
一比一原版美国帕森斯设计学院毕业证(parsons毕业证书)如何办理一比一原版美国帕森斯设计学院毕业证(parsons毕业证书)如何办理
一比一原版美国帕森斯设计学院毕业证(parsons毕业证书)如何办理
 
一比一原版莱斯大学毕业证(rice毕业证)如何办理
一比一原版莱斯大学毕业证(rice毕业证)如何办理一比一原版莱斯大学毕业证(rice毕业证)如何办理
一比一原版莱斯大学毕业证(rice毕业证)如何办理
 

OSNs.pptx

  • 1. CSE390 Advanced Computer Networks Lecture 15: Online Social Networks (The network is people) Based on slides by A. Mislove, F. Schneider, and W. Willinger. Updated by P Gill. Fall 2014
  • 2. What are (online) social networks? 2  Social networks are graphs of people  Graph edges connect friends  `Friend’ has different implications  How hard is it to be Facebook `friends’?  Online social networking  Social network hosted by a Web site  Friendship represents shared interest or trust  Online friends may have never met
  • 3. What are online social networks used for? 3  Popular for sharing content  Photos (Flickr), videos (YouTube), blogs (LiveJournal), profiles (Facebook, Orkut)  Fixed broadband (Sandvine Q1 2014)  YouTube 5.5% upload, 13.2% down  Facebook 2.2% upload, 2.0% down  Popular with users on the go  Mobile (Sandvine Q1 2014)  YouTube 3.8% up, 17.6% down  Facebook 27.0% up, 14.0% down
  • 4. Why are social networks interesting? 4  Popular way to connect  Estimated 1.32B users online each day  Average American spends 40 minutes/day on the site  Changing the flow of information  Formerly few ``writers’’ many ``readers’’ online  Now anyone can write!  What does this mean for Internet traffic?  Important in regions with strict media controls  E.g., Iran, Egypt using social media platforms to get word out in times of unrest  Useful in times of disaster
  • 6. Not just a social phenomenon… 6 Facebook now contains photo and video  Content delivery challenges! YouTube is a large fraction of Google’s traffic! Understanding properties of these networks is important to understand how we build systems to support them!
  • 7. • Graph Properties of OSNs • (Mislove et al 2007) • The Demise of MySpace • (Torkjazi et al 2009) • How do people use OSNs • (Schneider et al 2009) Outline 7
  • 8. Required reading: Mislove et al. 2007 8  One of the first measurement studies of online social networks (OSNs)  Large-scale measurement study and analysis of multiple online social networks  11 M users, 328 M links  Four diverse OSNs  Flickr: photosharing  LiveJournal: blogging  Orkut: social networking  YouTube: video sharing  Goals:  Measure OSNs at scale  Understand their static structural properties
  • 9. How to measure OSNs at scale? 9  Sites are reluctant to give out data  Cannot enumerate user list in general  Instead, performed crawls of the user graph  Picked known seed user  Crawled all of his friends  Added new users to list  Continued until all known users crawled  Effectively performed a BFS of graph
  • 10. Challenges faced 10  Obtaining data using crawling presents unique challenges  Need to crawl quickly!  Underlying network changes rapidly  Consistent snapshot is hard to get  Crawling completely  Social networks aren’t necessarily connected  Some users have no links! Or are in small clusters.  Need to estimate the crawl coverage
  • 11. How fast could they crawl? 11  Crawled using a cluster of 58 machines  Used APIs where available  Otherwise, used screen scraping  Crawls took varying times  Flickr, YouTube 1 day  LiveJournal 3 days  Orkut (Partial) 39 days)  Crawls subject to rate-limiting  Discovered appropriate rates
  • 12. Data collected 12 Able to crawl a large portion of the network Node degrees vary by orders of magnitude However, networks share many key properties To ground analysis, will compare to Web [Broder et al.,
  • 13. How are links distributed 13
  • 14. What fraction of links are symmetric? 14
  • 15. Aside: User relationships on Twitter  Broadcasters  News outlets, radio stations  No reason to follow anyone  Post playlists, headlines 13
  • 16. Aside: User relationships on Twitter  Acquaintances  Similar number of followers and following  Along the diagonal  Green portion is top 1-percentile of tweeters 14
  • 17. Aside: User relationships on Twitter  Miscreants?  Some people follow many users (programmatically)  Hoping some will follow them back  Spam, widgets, celebrities (at top) 15
  • 18. Aside: User relationships on Twitter 18 Twitter noticed the miscreants… … enacted the 10% rule (you can follow 10% more people than follow
  • 20. Does a core exist? 21
  • 21. How clustered is the fringe? 22
  • 23. • Graph Properties of OSNs • (Mislove et al 2007) • The Demise of MySpace • (Torkjazi et al 2009) • How do people use OSNs • (Schneider et al 2009) Outline 24
  • 24. Hot Today, Gone Tomorrow… 25  Slides borrowed from W. Willinger  Paper: Hot Today, Gone Tomorrow: On the Migration of MySpace Users. M. Torkjazi, R. Rejaie, and W. Willinger.
  • 25. Motivation  A majority of empirical studies of Online Social Networks (OSNs) has focused on their associated friendship graphs  What about the temporal dynamics of OSNs?  What about the “active” portion of an OSN?  A majority of empirical studies of OSNs has examined the growth of these systems  What about the patterns of decline in user population?  What about changes over time in user activity?  A majority of empirical studies of OSNs has been based on connectivity information  What about timing information?  How to obtain relevant timing information? 8/17/2009 WOSN 2009 - Barcelona 26
  • 26. This Study  We examine the evolution of user population and user activity in MySpace  User arrival/activity/departure, life cycle of MySpace  Why MySpace?  It is one of the largest and most popular OSNs  It provides several features making our study feasible  Main challenges  OSNs are often studied when they are popular and the number of departure is negligible  Popular OSNs tend to hide the information about user departures 8/17/2009 WOSN 2009 - Barcelona 27
  • 27. MySpace Features (I)  Provides explicit profile status  Public  Private  Invalid  Availability of users’ last login  Enables assessment of the level of activity among users  Importantly, allows inference of population growth of MySpace (see later for details)  Global visibility  http://www.myspace.com/user_id 8/17/2009 WOSN 2009 - Barcelona 28
  • 28. MySpace Features (II)  Monotonic assignment of numeric ID  Searched periodically for currently smallest unassigned ID and checked that all larger IDs are unassigned; after waiting for a short period, we observed that the smallest unassigned ID (and others after it) are now assigned.  Found no apparent patterns in gaps between consecutive invalid IDs  No evidence for re-assignement of deleted IDs  Makes the selection of random samples of MySpace users easy. 8/17/2009 WOSN 2009 - Barcelona 29 No visible pattern
  • 29. Measurement  Feb. 26th 2009: MySpace ID space [1 … 455,881,700]  50 parallel samplers to collect 360K users in less than 12 hours (0.1% of MySpace population)  Using HTML parser to post-process the downloaded profiles and extract  User s’ profile status (invalid, public, private)  Users’ last login date  Users’ friend list (only for public profiles)  Unable to parse last login info for 0.96% of public and 0.08% of private profiles  Last login info is not provided or is provided with obvious errors (e.g. 1/1/0001) 8/17/2009 WOSN 2009 - Barcelona 30
  • 30. On the Population size of MySpace  Population of valid MySpace users (Feb. 26, 2009) was about (41.5 + 17.3)% of 455,881,700 = 268M  Compare with www.myspace.com/tom who has 266,029,430 friends (Aug. 13, 2009)  How has MySpace grown during the past years?  How many “active” users are there in MySpace? 8/17/2009 WOSN 2009 - Barcelona 31 Total Invalid Public Private 362K 149K (41.2%) 150K (41.5%) 63K (17.3%)
  • 31. On User Arrival 8/17/2009 WOSN 2009 - Barcelona 32 Public users  What does user ID say about account creation time?  Plot user ID vs. last login of that user for all our users Private users
  • 32. On User Arrival  32% of public and 18% of private users are tourists  Discovery of “tourists” enables accurate estimation of user account creation time based on their associated user ID 8/17/2009 WOSN 2009 - Barcelona 33 Tourists  What does user ID say about account creation time? “Clean edge” = users whose last login is shortly after their account creation time = “MySpace tourists”
  • 33. On MySpace’s Growth  Use the observed uniform spread of tourists across entire ID space  Estimate account creation time by last login time  Estimate account creation time of all sampled accounts based on their ID. 8/17/2009 WOSN 2009 - Barcelona 34 April 2008  Estimating the user population of MySpace in the past?  Slope of the top line shows the growth rate of MySpace population  Exponential growth until about April 2008  Visible knee around April 2008 followed by a slow-down in growth
  • 34. On User Departure 8/17/2009 WOSN 2009 - Barcelona 35  More public and private profiles in the first half of ID space  More invalid profiles in the second half of ID space  Users joining the system earlier have been more likely to keep their accounts than newer users  Are newer users more likely to leave than older ones?
  • 35. MySpace Life Cycle (I)  Slow-down in the growth rate of MySpace is related to emergence of Facebook  Informal evidence (Alexa.com): Daily accesses to Facebook surpassed that of MySpace, at around April 2008 8/17/2009 WOSN 2009 - Barcelona 36  Possible reasons behind MySpace’s decline?
  • 36. • Graph Properties of OSNs • (Mislove et al 2007) • The Demise of MySpace • (Torkjazi et al 2009) • How do people use OSNs • (Schneider et al 2009) Outline 37
  • 37. Understanding Online Social Network Usage from a Network Perspective 38  F. Schneider, A. Feldmann, B. Krishnamurthy, and W. Willinger. ACM Internet Measurement Conference 2009  Slides borrowed from F. Schneider.  This study differs from a lot of related work by looking at OSN behavior at the network traffic level  Vs. crawling the application-level social graph
  • 41. Categories of pages 42 Pages manually classified based on small user generated traces in the lab setting
  • 45. OSNs: Wrap up 46  Many different types of OSNs  Photos, video, profile-based  Some extremely popular source of much Internet traffic  Facebook, YouTube  New ones emerging  Instagram, snapchat  Old ones fading  MySpace, Friendster  Studying their properties can inform how we build

Editor's Notes

  1. Stats: http://www.dailymail.co.uk/sciencetech/article-2703440/Theres-no-escape-Facebook-set-record-stock-high-results-beats-expectations-1-32-BILLION-users-30-mobile.html
  2. http://mathworld.wolfram.com/GraphDiameter.html Longest shortest path.
  3. Small-world : everyone can reach each other within a small number of hops