SlideShare a Scribd company logo
1 of 76
The evolution of research on
social media
Farida Vis, University of Sheffield
@flygirltwo
European Conference on Social Media, 10 July, Brighton, United Kingdom
ACADEMIA
INDUSTRY
GOVERNMENT
SOCIAL MEDIA = BIG
DATA
REAL-TIME ANALYTICS
SOCIAL MEDIA =
TWITTER
WHERE ARE THE RQs?
WHERE’S THE
THEORY?
ETHICS
METHODS
SAMPLING
DATA SHARING
WHERE’S THE
FUNDING?
WHAT’S THE FUTURE?
Aftermath of Hurricane Katrina
2005: Flickr
2008: YouTube
Fitna: The Video Battle
2011: Twitter
Reading the Riots on Twitter
data
data
unstructured
235 posts – 106 individuals
(Flickr)
Aftermath of Hurricane Katrina
2005
Manual collection possible
1413 videos – 700 individuals
(YouTube)
Fitna: The Video Battle
2008
+ Computer Science
2.6 million tweets – 700K individuals
(Twitter)
Reading the Riots on Twitter
2011
+ Lots of Computer Science
READING
THE RIOTS
ON TWITTER
Rob Procter (University of Manchester)
Farida Vis (University of Leicester)
Alexander Voss (University of St Andrews)
[Funded by JISC]
#readingtheriots
BORDER RUNNER
BIG DATA
“Big data” is high-volume, -velocity and –variety
information assets that demand cost-effective,
innovative forms of information processing for
enhanced insight and decision making’ (Gartner in
Sicular, 2013).
Huge industry now built around ‘social data’ and
‘listening platforms’ feeding on this data (Many
tools not suitable for academic use, black box).
• Technology: maximizing computation power and
algorithmic accuracy to gather, analyze, link, and
compare large data sets.
• Analysis: drawing on large data sets to identify
patterns in order to make economic, social,
technical, and legal claims.
• Mythology: the widespread belief that large data
sets offer a higher form of intelligence and
knowledge that can generate insights that were
previously impossible, with the aura of truth,
objectivity, and accuracy.
(boyd and Crawford p. 663).
Critiques of Big Data
• Important to make visible inherent claims about
objectivity
• Problematic focus on quantitative methods
• How can data answer questions it was not
designed to answer?
• How can the right questions be asked?
• Inherent biases in large linked error prone
datasets
• Focus on text and numbers that can be mined
algorithmically
• Data fundamentalism
Data fundamentalism
The notion that correlation always indicates causation,
and that massive data sets and predictive analytics
always reflect ‘objective truth’. Idea and belief in the
existence of an objective ‘truth’, that something can be
fully understood from a single perspective, again brings
to light tensions about how the social world can be made
known.
CRITICAL BIG DATA
STUDIES?
How do we ground online data?
In the offline: assessing findings against what we
know about an offline population (census data) in
order to better understand online data. Problems
with over/under representation in online data?
In the online: premised on the idea that data
derived from social media should be grounded in
other online data in order to understand it. So
comparing Facebook use to what we know about
Facebook use, rather than connecting it to offline
measurements about citizens. Richard Rogers
Important considerations
1. Asking the right question – research should
be question driven rather than data driven.
2. Accept poor data quality & users gaming
metrics – once online metrics have value users
will try to game them.
3. Limitations of tools (often built in
disconnected way)
4. Transparency – researchers should be upfront
about limitations of research and research
design. Can the data answer the questions?
A critical reflection on Big Data:
considering APIs, researchers and
tools as data makers
Rather than assuming data already exists ‘out there’, waiting to
simply be recovered and turned into findings, the article
examines how data is co-produced through dynamic research
intersections. A particular focus is the intersections between the
Application Programming Interface (API), the researcher
collecting the data as well as the tools used to process it. In light
of this, the article offers three new ways to define and think
about Big Data and proposes a series of practical suggestions for
making data.
(First Monday, October 2013, http://firstmonday.org/)
Twitter data ecosystem
Standard API sampling problems
Sampling from the FIREHOSE
1% random sample of the firehose
If not rate limited – all data collected?
New API sampling problems
New business models: enriched metadata
Social media vs social data
Datasift, GNIP and Topsy
Social media VS social data
• Social Media: User-generated content where one user
communicates and expresses themselves and that content
is delivered to other users. Examples of this are platforms
such as Twitter, Facebook, YouTube, Tumblr and Disqus.
Social media is delivered in a great user experience, and is
focused on sharing and content discovery. Social media
also offers both public and private experiences with the
ability to share messages privately.
• Social Data: Expresses social media in a computer-readable
format (e.g. JSON) and shares metadata about the content
to help provide not only content, but context. Metadata
often includes information about location, engagement and
links shared. Unlike social media, social data is focused
strictly on publicly shared experiences. (Cairns, 2013)
Social media
Social data
Enriched metadata
Location and influence
Where are users?
Are they influential?
Twitter expanding/enriching metadata
Hawskey (2013)
New Profile Geo Enrichment
Geo-locating tweets
Exact location
Lat/long coordinates
Gold standard geo data
Problem: only 1% of users
-> Only 2% of firehose tweets
Early adopters, highly skewed
Where in the world are you?
No Lat/long coordinates
Text field – enter anything
Advantage: more than half of all
tweets contain profile location
Much more evenly distributed
Profile Geo Enrichment
‘our customers can now hear from the whole world of Twitter users
and not just 1%’ (Cairs, 2013 on Gnip company Blog)
• Activity Location – 1% that provide lat/long
• Profile Location – Place provided in their profile. May or may not
be posting from there.
• Mentioned Location – Places a user talks about
‘Both the tweet text and Profile fields contain geographic information,
but not in substantial quantities and have poor accuracy’ (Leetaru et
al, First Monday, May 2013)
Problem with deleted tweets
‘A deleted tweet effectively disappears from the results of searching
Twitter, although a short delay sometimes occurs between deletion
and disappearance. A status deletion notice is distributed via the
Twitter streaming API to relevant users’ clients so that they, in turn,
remove deleted tweets from their records.’
‘Twitter does not provide a bulk-deletion of user’s tweets. It provides,
however, a one-click bulk-deletion of all location data that were
attached to user’s tweets, without deleting the tweets. By clicking on
the “Delete all location information” button on user’s account settings
page, all locations attached to all previous tweets are deleted.
(Almuhimedi et al, 2013)
Profile Geo Enrichment
Linking data
‘Profile location data can be used to unlock demographic data
and other information that is not otherwise possible with
activity location. For instance, US Census Bureau statistics are
aggregated at the locality level and can provide basic stats like
household income. Profile location is also a strong indicator of
activity location when one isn’t provided. (Cairns, 2013)
Social influence: Klout scores
Klout used to make profiles without consent
This is a DM. What is it doing here?
Klout rewards users for giving data
More data = more influential?
Online/offline?
Scores are easily gamed
Fake followers: Mitt Romney’s 100,000 extra followers in one day
As many as 20 million fake follower accounts (200 million active users)
This doesn’t take into account the issue of spoof accounts (clearly in
evidence in riot tweets) (Perlroth, 2013)
Klout scores industry standard measure…
Ability to describe the limitations of our data:
- APIs as data makers. Once data is linked very
hard to untangle how metadata is constructed
and where problems might be. Included in terms
of deleted content.
- Researchers and tools as data makers
- When creating a dataset important to
describe how it was made, what the
limitations are. What the sampling limitations
(both in terms of the API, but also related to
offline ‘population’. What other limitations re:
enriched metadata needs to be described?)
- When creating a dataset how complete is it?
- Limitations need to be known in order to
describe them. This is a real problem.
Tools as data makers
In answering complex questions about social media
data, we need:
1. Know the questions! And know how they might be
answered.
2. Problem with tools: not question driven. Often
developed around available (poor quality) data, often
by non social media experts, but those with data
processing expertise.
3. Tools therefore become data-makers in that they limit
the scope of possibility in the questions researchers
imagine. This is a huge problem!
Need better understanding of complex
ever changing dynamics between
APIs
Researchers
Tools
Organic data / data in the wild
SOCIAL MEDIA SIMPLY AS (BIG)
DATA
VS
SOCIAL MEDIA AS A RESEARCH
AREA
DOMAIN EXPERTISE
TO UNDERSTAND
TWITTER DATA YOU
NEED TO
UNDERSTAND
TWITTER
+ BE ON TWITTER
WHAT’S THE FUTURE?
WHAT GETS LEFT OUT?
750 MILLION
IMAGES
SHARED
DAILY
Images posses the ability to grab our attention
Social media companies know this
Images are key to engagement
Camera: used to be for special occasions
Smartphone: always with us
Everyday snaps Witnessing events
US 65% smartphone penetration
Smartphones overtaken desktop usage to access the internet
Mobile internet accounts for majority of internet use in US (57%)
Users typically access the internet via apps on mobile devices
All figures from comScore, US Digital Future in Focus, 2014
UK: The over-55s will experience the fastest year-on-year rises in
smartphone penetration.
Smartphone ownership should increase to about 50% by year-end, a
25% increase from 2013, but trailing 70% penetration among 18-54s.
The difference in smartphone penetration by age will disappear, but
differences in usage of smartphones remain substantial. Many over
55s use smartphones like feature phones.
All figures from Deloitte, predictions for 2014
Rise of platforms and apps focused on visual content
Pinterest
Tumblr
Instagram
Vine
Snapchat
‘Mobile first… and only’ | simple easy, user friendly design
Facebook daily image uploads: 350 million (November 2013)
Instagram daily image uploads: 60 million (March 2014)
Twitter: 500 million tweets daily (March 2014)
Snapchat daily snaps: 400 million (November 2013)
Images largely ignored in
social media research
Not easy to ‘mine’
Hard to figure out meaning
Huge interest in industry
WHAT DOES THE
FUTURE OF SOCIAL
MEDIA RESEARCH
LOOK LIKE?
QUESTION DRIVEN
(TOOL AWARE) + CRITICAL
BETTER METHODS
MORE THEORY
TRANSPARENT
SUSTAINABLE
ETHICAL
CROSS PLATFORM
INTERDISCIPLINARY
MORE CROSS SECTOR?
MORE FUNDING!
visualsocialmedialab.org
@VisSocMedLab
f.vis@sheffield.ac.uk
@flygirltwo
References
• Hazim Almuhimedia, Shomir Wilsona, Bin Liua, Norman Sadeha, Alessandro Acquistib, 2012. ‘Tweets Are
Forever: A Large-Scale Quantitative Analysis of Deleted Tweets’, CSCW’13, February 23–27, 2013, San
Antonio, Texas, USA, http://www.cs.cmu.edu/~shomir/cscw2013_tweets_are_forever.pdf, accessed 18
September, 2013.
• Ian Cairns, 2013. ‘Get More Geodata From Gnip With Our New Profile Geo Enrichment’, Gnip Company
Blog, 22 August, at http://blog.gnip.com/tag/geolocation/, accessed 13 September 2013.
• Grcommunication, 2012, ‘I will help raise your Klout score by sending you 10Ks and will tweet it out to my
50K+ followers from my 80+ Klout score for $5, http://fiverr.com/grcommunication/help-raise-your-klout-
score-by-sending-you-10ks-and-will-tweet-it-out-to-my-17k-followers-from-my-70-klout-score, accessed
19 September 2013.
• Anthony Ha, 2013. ‘Gnip Expands Its Partnership With Klout, Becoming The Exclusive Provider Of
Klout Topics’, TechCrunch, 8 August, http://techcrunch.com/2013/08/08/gnip-klout/, accessed 19
September 2013.
• Martin Hawksey, 2013. ‘Twitter throws a bone: Increased hits and metadata in Twitter Search API 1.1,’
March 28, at http://mashe.hawksey.info/2013/03/twitterthrows-a-bone-increased-hits-and-metadata-in-
twitter-search-api-1-1/ , accessed 10 September 2013.
• Kalev H. Leetaru, Shaowen Wang, Guofeng Cao, Anand Padmanabhan, and Eric Shook, 2013, ‘Mapping the
global Twitter heartbeat: The geography of Twitter, First Monday, Volume 18, Number 5-6 May,
http://firstmonday.org/article/view/4366/3654
• Nicole Perlroth, 2013. ‘Fake Twitter Followers Become Multimillion-Dollar Business’, New York Times, Bits
blog, 5 April, http://bits.blogs.nytimes.com/2013/04/05/fake-twitter-followers-becomes-multimillion-
dollar-business/, accessed 19 September 2013.
• Farida Vis, 2013. ‘A critical reflection on Big Data: considering APIs, researchers and tools as data makers’,
First Monday, 7 October, http://firstmonday.org

More Related Content

What's hot

CrowdTruth @VU Faculty Colloquium (June 2015)
CrowdTruth @VU Faculty Colloquium (June 2015)CrowdTruth @VU Faculty Colloquium (June 2015)
CrowdTruth @VU Faculty Colloquium (June 2015)Lora Aroyo
 
What Actor-Network Theory (ANT) and digital methods can do for data journalis...
What Actor-Network Theory (ANT) and digital methods can do for data journalis...What Actor-Network Theory (ANT) and digital methods can do for data journalis...
What Actor-Network Theory (ANT) and digital methods can do for data journalis...Liliana Bounegru
 
GitHub as Transparency Device in Data Journalism, Open Data and Data Activism
GitHub as Transparency Device in  Data Journalism, Open Data and Data ActivismGitHub as Transparency Device in  Data Journalism, Open Data and Data Activism
GitHub as Transparency Device in Data Journalism, Open Data and Data ActivismLiliana Bounegru
 
Lecture 7: How to STUDY the Social Web? (2014)
Lecture 7: How to STUDY the Social Web? (2014)Lecture 7: How to STUDY the Social Web? (2014)
Lecture 7: How to STUDY the Social Web? (2014)Lora Aroyo
 
Doing Social and Political Research in a Digital Age: An Introduction to Digi...
Doing Social and Political Research in a Digital Age: An Introduction to Digi...Doing Social and Political Research in a Digital Age: An Introduction to Digi...
Doing Social and Political Research in a Digital Age: An Introduction to Digi...Liliana Bounegru
 
Doing Digital Methods: Some Recent Highlights from Winter and Summer Schools
Doing Digital Methods: Some Recent Highlights from Winter and Summer SchoolsDoing Digital Methods: Some Recent Highlights from Winter and Summer Schools
Doing Digital Methods: Some Recent Highlights from Winter and Summer SchoolsLiliana Bounegru
 
User Behaviour Pattern Recognition On Twitter Social Network
User Behaviour Pattern Recognition On Twitter Social NetworkUser Behaviour Pattern Recognition On Twitter Social Network
User Behaviour Pattern Recognition On Twitter Social NetworkGeorge Konstantakopoulos
 
Mapping Issues with the Web: An Introduction to Digital Methods
Mapping Issues with the Web: An Introduction to Digital MethodsMapping Issues with the Web: An Introduction to Digital Methods
Mapping Issues with the Web: An Introduction to Digital MethodsJonathan Gray
 
CS147 Social Mobile
CS147 Social MobileCS147 Social Mobile
CS147 Social Mobilemor
 
Data Journalism and the Remaking of Data Infrastructures
Data Journalism and the Remaking of Data InfrastructuresData Journalism and the Remaking of Data Infrastructures
Data Journalism and the Remaking of Data InfrastructuresLiliana Bounegru
 
Coding Social Imagery: Learning from a #selfie #humor Image Set from Instagram
Coding Social Imagery:  Learning from a #selfie #humor Image Set from InstagramCoding Social Imagery:  Learning from a #selfie #humor Image Set from Instagram
Coding Social Imagery: Learning from a #selfie #humor Image Set from InstagramShalin Hai-Jew
 
Web 2.0 Collective Intelligence - How to use collective intelligence techniqu...
Web 2.0 Collective Intelligence - How to use collective intelligence techniqu...Web 2.0 Collective Intelligence - How to use collective intelligence techniqu...
Web 2.0 Collective Intelligence - How to use collective intelligence techniqu...Paul Gilbreath
 

What's hot (15)

CrowdTruth @VU Faculty Colloquium (June 2015)
CrowdTruth @VU Faculty Colloquium (June 2015)CrowdTruth @VU Faculty Colloquium (June 2015)
CrowdTruth @VU Faculty Colloquium (June 2015)
 
What Actor-Network Theory (ANT) and digital methods can do for data journalis...
What Actor-Network Theory (ANT) and digital methods can do for data journalis...What Actor-Network Theory (ANT) and digital methods can do for data journalis...
What Actor-Network Theory (ANT) and digital methods can do for data journalis...
 
GitHub as Transparency Device in Data Journalism, Open Data and Data Activism
GitHub as Transparency Device in  Data Journalism, Open Data and Data ActivismGitHub as Transparency Device in  Data Journalism, Open Data and Data Activism
GitHub as Transparency Device in Data Journalism, Open Data and Data Activism
 
Lecture 7: How to STUDY the Social Web? (2014)
Lecture 7: How to STUDY the Social Web? (2014)Lecture 7: How to STUDY the Social Web? (2014)
Lecture 7: How to STUDY the Social Web? (2014)
 
Doing Social and Political Research in a Digital Age: An Introduction to Digi...
Doing Social and Political Research in a Digital Age: An Introduction to Digi...Doing Social and Political Research in a Digital Age: An Introduction to Digi...
Doing Social and Political Research in a Digital Age: An Introduction to Digi...
 
Doing Digital Methods: Some Recent Highlights from Winter and Summer Schools
Doing Digital Methods: Some Recent Highlights from Winter and Summer SchoolsDoing Digital Methods: Some Recent Highlights from Winter and Summer Schools
Doing Digital Methods: Some Recent Highlights from Winter and Summer Schools
 
How to get started with Data Journalism
How to get started with Data JournalismHow to get started with Data Journalism
How to get started with Data Journalism
 
User Behaviour Pattern Recognition On Twitter Social Network
User Behaviour Pattern Recognition On Twitter Social NetworkUser Behaviour Pattern Recognition On Twitter Social Network
User Behaviour Pattern Recognition On Twitter Social Network
 
Mapping Issues with the Web: An Introduction to Digital Methods
Mapping Issues with the Web: An Introduction to Digital MethodsMapping Issues with the Web: An Introduction to Digital Methods
Mapping Issues with the Web: An Introduction to Digital Methods
 
CS147 Social Mobile
CS147 Social MobileCS147 Social Mobile
CS147 Social Mobile
 
Data Journalism and the Remaking of Data Infrastructures
Data Journalism and the Remaking of Data InfrastructuresData Journalism and the Remaking of Data Infrastructures
Data Journalism and the Remaking of Data Infrastructures
 
Coding Social Imagery: Learning from a #selfie #humor Image Set from Instagram
Coding Social Imagery:  Learning from a #selfie #humor Image Set from InstagramCoding Social Imagery:  Learning from a #selfie #humor Image Set from Instagram
Coding Social Imagery: Learning from a #selfie #humor Image Set from Instagram
 
The Birth of Social Media Methods
The Birth of Social Media MethodsThe Birth of Social Media Methods
The Birth of Social Media Methods
 
Rogers data days_2014_slides_opti
Rogers data days_2014_slides_optiRogers data days_2014_slides_opti
Rogers data days_2014_slides_opti
 
Web 2.0 Collective Intelligence - How to use collective intelligence techniqu...
Web 2.0 Collective Intelligence - How to use collective intelligence techniqu...Web 2.0 Collective Intelligence - How to use collective intelligence techniqu...
Web 2.0 Collective Intelligence - How to use collective intelligence techniqu...
 

Similar to The evolution of research on social media

What Your Tweets Tell Us About You, Speaker Notes
What Your Tweets Tell Us About You, Speaker NotesWhat Your Tweets Tell Us About You, Speaker Notes
What Your Tweets Tell Us About You, Speaker NotesKrisKasianovitz
 
Eavesdropping on the Twitter Microblogging Site
Eavesdropping on the Twitter Microblogging SiteEavesdropping on the Twitter Microblogging Site
Eavesdropping on the Twitter Microblogging SiteShalin Hai-Jew
 
RUNNING HEAD BIG DATA IN SOCIAL MEDIA .docx
RUNNING HEAD BIG DATA IN SOCIAL MEDIA                            .docxRUNNING HEAD BIG DATA IN SOCIAL MEDIA                            .docx
RUNNING HEAD BIG DATA IN SOCIAL MEDIA .docxcheryllwashburn
 
Accessing and Using Big Data to Advance Social Science Knowledge
Accessing and Using Big Data to Advance Social Science KnowledgeAccessing and Using Big Data to Advance Social Science Knowledge
Accessing and Using Big Data to Advance Social Science KnowledgeJosh Cowls
 
Big Data for International Development
Big Data for International DevelopmentBig Data for International Development
Big Data for International DevelopmentAlex Rascanu
 
Big data privacy issues in public social media
Big data privacy issues in public social mediaBig data privacy issues in public social media
Big data privacy issues in public social mediaSupriya Radhakrishna
 
Working with Social Media Data: Ethics & good practice around collecting, usi...
Working with Social Media Data: Ethics & good practice around collecting, usi...Working with Social Media Data: Ethics & good practice around collecting, usi...
Working with Social Media Data: Ethics & good practice around collecting, usi...Nicola Osborne
 
Data Science For Social Good: Tackling the Challenge of Homelessness
Data Science For Social Good: Tackling the Challenge of HomelessnessData Science For Social Good: Tackling the Challenge of Homelessness
Data Science For Social Good: Tackling the Challenge of HomelessnessAnita Luthra
 
2010 Catalyst Conference - Trends in Social Network Analysis
2010 Catalyst Conference - Trends in Social Network Analysis2010 Catalyst Conference - Trends in Social Network Analysis
2010 Catalyst Conference - Trends in Social Network AnalysisMarc Smith
 
Lecture series: Using trace data or subjective data, that is the question dur...
Lecture series: Using trace data or subjective data, that is the question dur...Lecture series: Using trace data or subjective data, that is the question dur...
Lecture series: Using trace data or subjective data, that is the question dur...Bart Rienties
 
Semantic Web Investigation within Big Data Context
Semantic Web Investigation within Big Data ContextSemantic Web Investigation within Big Data Context
Semantic Web Investigation within Big Data ContextMurad Daryousse
 
Challenges in-archiving-twitter
Challenges in-archiving-twitterChallenges in-archiving-twitter
Challenges in-archiving-twitterKatrin Weller
 
Beyond-Data-Literacy-2015
Beyond-Data-Literacy-2015Beyond-Data-Literacy-2015
Beyond-Data-Literacy-2015Amanda noonan
 
Insights From Social Media
Insights From Social MediaInsights From Social Media
Insights From Social MediaDr Wasim Ahmed
 
IRJET- Improved Real-Time Twitter Sentiment Analysis using ML & Word2Vec
IRJET-  	  Improved Real-Time Twitter Sentiment Analysis using ML & Word2VecIRJET-  	  Improved Real-Time Twitter Sentiment Analysis using ML & Word2Vec
IRJET- Improved Real-Time Twitter Sentiment Analysis using ML & Word2VecIRJET Journal
 
Making our mark: the important role of social scientists in the ‘era of big d...
Making our mark: the important role of social scientists in the ‘era of big d...Making our mark: the important role of social scientists in the ‘era of big d...
Making our mark: the important role of social scientists in the ‘era of big d...The Higher Education Academy
 

Similar to The evolution of research on social media (20)

What Your Tweets Tell Us About You, Speaker Notes
What Your Tweets Tell Us About You, Speaker NotesWhat Your Tweets Tell Us About You, Speaker Notes
What Your Tweets Tell Us About You, Speaker Notes
 
s00146-014-0549-4.pdf
s00146-014-0549-4.pdfs00146-014-0549-4.pdf
s00146-014-0549-4.pdf
 
Eavesdropping on the Twitter Microblogging Site
Eavesdropping on the Twitter Microblogging SiteEavesdropping on the Twitter Microblogging Site
Eavesdropping on the Twitter Microblogging Site
 
RUNNING HEAD BIG DATA IN SOCIAL MEDIA .docx
RUNNING HEAD BIG DATA IN SOCIAL MEDIA                            .docxRUNNING HEAD BIG DATA IN SOCIAL MEDIA                            .docx
RUNNING HEAD BIG DATA IN SOCIAL MEDIA .docx
 
Accessing and Using Big Data to Advance Social Science Knowledge
Accessing and Using Big Data to Advance Social Science KnowledgeAccessing and Using Big Data to Advance Social Science Knowledge
Accessing and Using Big Data to Advance Social Science Knowledge
 
data, big data, open data
data, big data, open datadata, big data, open data
data, big data, open data
 
Big Data for International Development
Big Data for International DevelopmentBig Data for International Development
Big Data for International Development
 
Big data privacy issues in public social media
Big data privacy issues in public social mediaBig data privacy issues in public social media
Big data privacy issues in public social media
 
Working with Social Media Data: Ethics & good practice around collecting, usi...
Working with Social Media Data: Ethics & good practice around collecting, usi...Working with Social Media Data: Ethics & good practice around collecting, usi...
Working with Social Media Data: Ethics & good practice around collecting, usi...
 
Big data Paper
Big data PaperBig data Paper
Big data Paper
 
Data Science For Social Good: Tackling the Challenge of Homelessness
Data Science For Social Good: Tackling the Challenge of HomelessnessData Science For Social Good: Tackling the Challenge of Homelessness
Data Science For Social Good: Tackling the Challenge of Homelessness
 
2010 Catalyst Conference - Trends in Social Network Analysis
2010 Catalyst Conference - Trends in Social Network Analysis2010 Catalyst Conference - Trends in Social Network Analysis
2010 Catalyst Conference - Trends in Social Network Analysis
 
Lecture series: Using trace data or subjective data, that is the question dur...
Lecture series: Using trace data or subjective data, that is the question dur...Lecture series: Using trace data or subjective data, that is the question dur...
Lecture series: Using trace data or subjective data, that is the question dur...
 
Semantic Web Investigation within Big Data Context
Semantic Web Investigation within Big Data ContextSemantic Web Investigation within Big Data Context
Semantic Web Investigation within Big Data Context
 
Challenges in-archiving-twitter
Challenges in-archiving-twitterChallenges in-archiving-twitter
Challenges in-archiving-twitter
 
Beyond-Data-Literacy-2015
Beyond-Data-Literacy-2015Beyond-Data-Literacy-2015
Beyond-Data-Literacy-2015
 
Insights From Social Media
Insights From Social MediaInsights From Social Media
Insights From Social Media
 
IRJET- Improved Real-Time Twitter Sentiment Analysis using ML & Word2Vec
IRJET-  	  Improved Real-Time Twitter Sentiment Analysis using ML & Word2VecIRJET-  	  Improved Real-Time Twitter Sentiment Analysis using ML & Word2Vec
IRJET- Improved Real-Time Twitter Sentiment Analysis using ML & Word2Vec
 
Making our mark: the important role of social scientists in the ‘era of big d...
Making our mark: the important role of social scientists in the ‘era of big d...Making our mark: the important role of social scientists in the ‘era of big d...
Making our mark: the important role of social scientists in the ‘era of big d...
 
F017433947
F017433947F017433947
F017433947
 

More from Farida Vis

Everyday Growing and Digging Cultures
Everyday Growing and Digging CulturesEveryday Growing and Digging Cultures
Everyday Growing and Digging CulturesFarida Vis
 
Where do images fit in the era of ‘Big Data’?
Where do images fit in the era of ‘Big Data’?Where do images fit in the era of ‘Big Data’?
Where do images fit in the era of ‘Big Data’?Farida Vis
 
How not to measure Twitter Influence
How not to measure Twitter InfluenceHow not to measure Twitter Influence
How not to measure Twitter InfluenceFarida Vis
 
What do Data Visualisations want?
What do Data Visualisations want?What do Data Visualisations want?
What do Data Visualisations want?Farida Vis
 
London Police Tweets
London Police Tweets London Police Tweets
London Police Tweets Farida Vis
 
Allotment (publics): an open data and data driven journalism perspective
Allotment (publics): an open data and data driven journalism perspective Allotment (publics): an open data and data driven journalism perspective
Allotment (publics): an open data and data driven journalism perspective Farida Vis
 
FutureEverything 2012: Social Media, Social Change
FutureEverything 2012: Social Media, Social Change FutureEverything 2012: Social Media, Social Change
FutureEverything 2012: Social Media, Social Change Farida Vis
 
Reading The Riots on Twitter at LIFT12
Reading The Riots on Twitter at LIFT12Reading The Riots on Twitter at LIFT12
Reading The Riots on Twitter at LIFT12Farida Vis
 
Growing Back to the Future
Growing Back to the Future Growing Back to the Future
Growing Back to the Future Farida Vis
 
Future of allotments in the UK: Manchester City Camp
Future of allotments in the UK: Manchester City CampFuture of allotments in the UK: Manchester City Camp
Future of allotments in the UK: Manchester City CampFarida Vis
 
Digging into citizenship: land, waiting lists, broken promises and creating c...
Digging into citizenship: land, waiting lists, broken promises and creating c...Digging into citizenship: land, waiting lists, broken promises and creating c...
Digging into citizenship: land, waiting lists, broken promises and creating c...Farida Vis
 
Amazon.com and the outbreak narrative
Amazon.com and the outbreak narrativeAmazon.com and the outbreak narrative
Amazon.com and the outbreak narrativeFarida Vis
 

More from Farida Vis (13)

Vacantacres
VacantacresVacantacres
Vacantacres
 
Everyday Growing and Digging Cultures
Everyday Growing and Digging CulturesEveryday Growing and Digging Cultures
Everyday Growing and Digging Cultures
 
Where do images fit in the era of ‘Big Data’?
Where do images fit in the era of ‘Big Data’?Where do images fit in the era of ‘Big Data’?
Where do images fit in the era of ‘Big Data’?
 
How not to measure Twitter Influence
How not to measure Twitter InfluenceHow not to measure Twitter Influence
How not to measure Twitter Influence
 
What do Data Visualisations want?
What do Data Visualisations want?What do Data Visualisations want?
What do Data Visualisations want?
 
London Police Tweets
London Police Tweets London Police Tweets
London Police Tweets
 
Allotment (publics): an open data and data driven journalism perspective
Allotment (publics): an open data and data driven journalism perspective Allotment (publics): an open data and data driven journalism perspective
Allotment (publics): an open data and data driven journalism perspective
 
FutureEverything 2012: Social Media, Social Change
FutureEverything 2012: Social Media, Social Change FutureEverything 2012: Social Media, Social Change
FutureEverything 2012: Social Media, Social Change
 
Reading The Riots on Twitter at LIFT12
Reading The Riots on Twitter at LIFT12Reading The Riots on Twitter at LIFT12
Reading The Riots on Twitter at LIFT12
 
Growing Back to the Future
Growing Back to the Future Growing Back to the Future
Growing Back to the Future
 
Future of allotments in the UK: Manchester City Camp
Future of allotments in the UK: Manchester City CampFuture of allotments in the UK: Manchester City Camp
Future of allotments in the UK: Manchester City Camp
 
Digging into citizenship: land, waiting lists, broken promises and creating c...
Digging into citizenship: land, waiting lists, broken promises and creating c...Digging into citizenship: land, waiting lists, broken promises and creating c...
Digging into citizenship: land, waiting lists, broken promises and creating c...
 
Amazon.com and the outbreak narrative
Amazon.com and the outbreak narrativeAmazon.com and the outbreak narrative
Amazon.com and the outbreak narrative
 

Recently uploaded

Lucknow 💋 Dating Call Girls Lucknow | Whatsapp No 8923113531 VIP Escorts Serv...
Lucknow 💋 Dating Call Girls Lucknow | Whatsapp No 8923113531 VIP Escorts Serv...Lucknow 💋 Dating Call Girls Lucknow | Whatsapp No 8923113531 VIP Escorts Serv...
Lucknow 💋 Dating Call Girls Lucknow | Whatsapp No 8923113531 VIP Escorts Serv...anilsa9823
 
Night 7k Call Girls Pari Chowk Escorts Call Me: 8448380779
Night 7k Call Girls Pari Chowk Escorts Call Me: 8448380779Night 7k Call Girls Pari Chowk Escorts Call Me: 8448380779
Night 7k Call Girls Pari Chowk Escorts Call Me: 8448380779Delhi Call girls
 
Enjoy Night⚡Call Girls Palam Vihar Gurgaon >༒8448380779 Escort Service
Enjoy Night⚡Call Girls Palam Vihar Gurgaon >༒8448380779 Escort ServiceEnjoy Night⚡Call Girls Palam Vihar Gurgaon >༒8448380779 Escort Service
Enjoy Night⚡Call Girls Palam Vihar Gurgaon >༒8448380779 Escort ServiceDelhi Call girls
 
9990611130 Find & Book Russian Call Girls In Crossings Republik
9990611130 Find & Book Russian Call Girls In Crossings Republik9990611130 Find & Book Russian Call Girls In Crossings Republik
9990611130 Find & Book Russian Call Girls In Crossings RepublikGenuineGirls
 
Website research Powerpoint for Bauer magazine
Website research Powerpoint for Bauer magazineWebsite research Powerpoint for Bauer magazine
Website research Powerpoint for Bauer magazinesamuelcoulson30
 
Your LinkedIn Makeover: Sociocosmos Presence Package
Your LinkedIn Makeover: Sociocosmos Presence PackageYour LinkedIn Makeover: Sociocosmos Presence Package
Your LinkedIn Makeover: Sociocosmos Presence PackageSocioCosmos
 
SELECTING A SOCIAL MEDIA MARKETING COMPANY
SELECTING A SOCIAL MEDIA MARKETING COMPANYSELECTING A SOCIAL MEDIA MARKETING COMPANY
SELECTING A SOCIAL MEDIA MARKETING COMPANYdizinfo
 
Film show pre-production powerpoint for site
Film show pre-production powerpoint for siteFilm show pre-production powerpoint for site
Film show pre-production powerpoint for siteAshtonCains
 
CALL ON ➥8923113531 🔝Call Girls Ashiyana Colony Lucknow best sexual service O...
CALL ON ➥8923113531 🔝Call Girls Ashiyana Colony Lucknow best sexual service O...CALL ON ➥8923113531 🔝Call Girls Ashiyana Colony Lucknow best sexual service O...
CALL ON ➥8923113531 🔝Call Girls Ashiyana Colony Lucknow best sexual service O...anilsa9823
 
Film show production powerpoint for site
Film show production powerpoint for siteFilm show production powerpoint for site
Film show production powerpoint for siteAshtonCains
 
DickinsonSlides teeeeeeeeeeessssssssssst.pptx
DickinsonSlides teeeeeeeeeeessssssssssst.pptxDickinsonSlides teeeeeeeeeeessssssssssst.pptx
DickinsonSlides teeeeeeeeeeessssssssssst.pptxednyonat
 
Hire↠Young Call Girls in Hari Nagar (Delhi) ☎️ 9205541914 ☎️ Independent Esco...
Hire↠Young Call Girls in Hari Nagar (Delhi) ☎️ 9205541914 ☎️ Independent Esco...Hire↠Young Call Girls in Hari Nagar (Delhi) ☎️ 9205541914 ☎️ Independent Esco...
Hire↠Young Call Girls in Hari Nagar (Delhi) ☎️ 9205541914 ☎️ Independent Esco...Delhi Call girls
 
Elite Class ➥8448380779▻ Call Girls In Nizammuddin Delhi NCR
Elite Class ➥8448380779▻ Call Girls In Nizammuddin Delhi NCRElite Class ➥8448380779▻ Call Girls In Nizammuddin Delhi NCR
Elite Class ➥8448380779▻ Call Girls In Nizammuddin Delhi NCRDelhi Call girls
 
CASH PAYMENT ON GIRL HAND TO HAND HOUSEWIFE
CASH PAYMENT ON GIRL HAND TO HAND HOUSEWIFECASH PAYMENT ON GIRL HAND TO HAND HOUSEWIFE
CASH PAYMENT ON GIRL HAND TO HAND HOUSEWIFECall girl Jaipur
 
Ready to get noticed? Partner with Sociocosmos
Ready to get noticed? Partner with SociocosmosReady to get noticed? Partner with Sociocosmos
Ready to get noticed? Partner with SociocosmosSocioCosmos
 
Production diary Film the city powerpoint
Production diary Film the city powerpointProduction diary Film the city powerpoint
Production diary Film the city powerpointAshtonCains
 
CALL ON ➥8923113531 🔝Call Girls Takrohi Lucknow best Female service 👖
CALL ON ➥8923113531 🔝Call Girls Takrohi Lucknow best Female service  👖CALL ON ➥8923113531 🔝Call Girls Takrohi Lucknow best Female service  👖
CALL ON ➥8923113531 🔝Call Girls Takrohi Lucknow best Female service 👖anilsa9823
 
Call Girls In South Ex. Delhi O9654467111 Women Seeking Men
Call Girls In South Ex. Delhi O9654467111 Women Seeking MenCall Girls In South Ex. Delhi O9654467111 Women Seeking Men
Call Girls In South Ex. Delhi O9654467111 Women Seeking MenSapana Sha
 

Recently uploaded (20)

9953056974 Young Call Girls In Kirti Nagar Indian Quality Escort service
9953056974 Young Call Girls In  Kirti Nagar Indian Quality Escort service9953056974 Young Call Girls In  Kirti Nagar Indian Quality Escort service
9953056974 Young Call Girls In Kirti Nagar Indian Quality Escort service
 
Lucknow 💋 Dating Call Girls Lucknow | Whatsapp No 8923113531 VIP Escorts Serv...
Lucknow 💋 Dating Call Girls Lucknow | Whatsapp No 8923113531 VIP Escorts Serv...Lucknow 💋 Dating Call Girls Lucknow | Whatsapp No 8923113531 VIP Escorts Serv...
Lucknow 💋 Dating Call Girls Lucknow | Whatsapp No 8923113531 VIP Escorts Serv...
 
Night 7k Call Girls Pari Chowk Escorts Call Me: 8448380779
Night 7k Call Girls Pari Chowk Escorts Call Me: 8448380779Night 7k Call Girls Pari Chowk Escorts Call Me: 8448380779
Night 7k Call Girls Pari Chowk Escorts Call Me: 8448380779
 
Enjoy Night⚡Call Girls Palam Vihar Gurgaon >༒8448380779 Escort Service
Enjoy Night⚡Call Girls Palam Vihar Gurgaon >༒8448380779 Escort ServiceEnjoy Night⚡Call Girls Palam Vihar Gurgaon >༒8448380779 Escort Service
Enjoy Night⚡Call Girls Palam Vihar Gurgaon >༒8448380779 Escort Service
 
9990611130 Find & Book Russian Call Girls In Crossings Republik
9990611130 Find & Book Russian Call Girls In Crossings Republik9990611130 Find & Book Russian Call Girls In Crossings Republik
9990611130 Find & Book Russian Call Girls In Crossings Republik
 
Website research Powerpoint for Bauer magazine
Website research Powerpoint for Bauer magazineWebsite research Powerpoint for Bauer magazine
Website research Powerpoint for Bauer magazine
 
Your LinkedIn Makeover: Sociocosmos Presence Package
Your LinkedIn Makeover: Sociocosmos Presence PackageYour LinkedIn Makeover: Sociocosmos Presence Package
Your LinkedIn Makeover: Sociocosmos Presence Package
 
SELECTING A SOCIAL MEDIA MARKETING COMPANY
SELECTING A SOCIAL MEDIA MARKETING COMPANYSELECTING A SOCIAL MEDIA MARKETING COMPANY
SELECTING A SOCIAL MEDIA MARKETING COMPANY
 
Film show pre-production powerpoint for site
Film show pre-production powerpoint for siteFilm show pre-production powerpoint for site
Film show pre-production powerpoint for site
 
CALL ON ➥8923113531 🔝Call Girls Ashiyana Colony Lucknow best sexual service O...
CALL ON ➥8923113531 🔝Call Girls Ashiyana Colony Lucknow best sexual service O...CALL ON ➥8923113531 🔝Call Girls Ashiyana Colony Lucknow best sexual service O...
CALL ON ➥8923113531 🔝Call Girls Ashiyana Colony Lucknow best sexual service O...
 
Film show production powerpoint for site
Film show production powerpoint for siteFilm show production powerpoint for site
Film show production powerpoint for site
 
DickinsonSlides teeeeeeeeeeessssssssssst.pptx
DickinsonSlides teeeeeeeeeeessssssssssst.pptxDickinsonSlides teeeeeeeeeeessssssssssst.pptx
DickinsonSlides teeeeeeeeeeessssssssssst.pptx
 
Hire↠Young Call Girls in Hari Nagar (Delhi) ☎️ 9205541914 ☎️ Independent Esco...
Hire↠Young Call Girls in Hari Nagar (Delhi) ☎️ 9205541914 ☎️ Independent Esco...Hire↠Young Call Girls in Hari Nagar (Delhi) ☎️ 9205541914 ☎️ Independent Esco...
Hire↠Young Call Girls in Hari Nagar (Delhi) ☎️ 9205541914 ☎️ Independent Esco...
 
Elite Class ➥8448380779▻ Call Girls In Nizammuddin Delhi NCR
Elite Class ➥8448380779▻ Call Girls In Nizammuddin Delhi NCRElite Class ➥8448380779▻ Call Girls In Nizammuddin Delhi NCR
Elite Class ➥8448380779▻ Call Girls In Nizammuddin Delhi NCR
 
CASH PAYMENT ON GIRL HAND TO HAND HOUSEWIFE
CASH PAYMENT ON GIRL HAND TO HAND HOUSEWIFECASH PAYMENT ON GIRL HAND TO HAND HOUSEWIFE
CASH PAYMENT ON GIRL HAND TO HAND HOUSEWIFE
 
Ready to get noticed? Partner with Sociocosmos
Ready to get noticed? Partner with SociocosmosReady to get noticed? Partner with Sociocosmos
Ready to get noticed? Partner with Sociocosmos
 
Russian Call Girls Rohini Sector 35 💓 Delhi 9999965857 @Sabina Modi VVIP MODE...
Russian Call Girls Rohini Sector 35 💓 Delhi 9999965857 @Sabina Modi VVIP MODE...Russian Call Girls Rohini Sector 35 💓 Delhi 9999965857 @Sabina Modi VVIP MODE...
Russian Call Girls Rohini Sector 35 💓 Delhi 9999965857 @Sabina Modi VVIP MODE...
 
Production diary Film the city powerpoint
Production diary Film the city powerpointProduction diary Film the city powerpoint
Production diary Film the city powerpoint
 
CALL ON ➥8923113531 🔝Call Girls Takrohi Lucknow best Female service 👖
CALL ON ➥8923113531 🔝Call Girls Takrohi Lucknow best Female service  👖CALL ON ➥8923113531 🔝Call Girls Takrohi Lucknow best Female service  👖
CALL ON ➥8923113531 🔝Call Girls Takrohi Lucknow best Female service 👖
 
Call Girls In South Ex. Delhi O9654467111 Women Seeking Men
Call Girls In South Ex. Delhi O9654467111 Women Seeking MenCall Girls In South Ex. Delhi O9654467111 Women Seeking Men
Call Girls In South Ex. Delhi O9654467111 Women Seeking Men
 

The evolution of research on social media

  • 1. The evolution of research on social media Farida Vis, University of Sheffield @flygirltwo European Conference on Social Media, 10 July, Brighton, United Kingdom
  • 3. SOCIAL MEDIA = BIG DATA
  • 11. Aftermath of Hurricane Katrina 2005: Flickr
  • 12. 2008: YouTube Fitna: The Video Battle
  • 13. 2011: Twitter Reading the Riots on Twitter
  • 15. 235 posts – 106 individuals (Flickr) Aftermath of Hurricane Katrina 2005 Manual collection possible
  • 16. 1413 videos – 700 individuals (YouTube) Fitna: The Video Battle 2008 + Computer Science
  • 17. 2.6 million tweets – 700K individuals (Twitter) Reading the Riots on Twitter 2011 + Lots of Computer Science
  • 18. READING THE RIOTS ON TWITTER Rob Procter (University of Manchester) Farida Vis (University of Leicester) Alexander Voss (University of St Andrews) [Funded by JISC] #readingtheriots
  • 21. “Big data” is high-volume, -velocity and –variety information assets that demand cost-effective, innovative forms of information processing for enhanced insight and decision making’ (Gartner in Sicular, 2013). Huge industry now built around ‘social data’ and ‘listening platforms’ feeding on this data (Many tools not suitable for academic use, black box).
  • 22. • Technology: maximizing computation power and algorithmic accuracy to gather, analyze, link, and compare large data sets. • Analysis: drawing on large data sets to identify patterns in order to make economic, social, technical, and legal claims. • Mythology: the widespread belief that large data sets offer a higher form of intelligence and knowledge that can generate insights that were previously impossible, with the aura of truth, objectivity, and accuracy. (boyd and Crawford p. 663).
  • 23. Critiques of Big Data • Important to make visible inherent claims about objectivity • Problematic focus on quantitative methods • How can data answer questions it was not designed to answer? • How can the right questions be asked? • Inherent biases in large linked error prone datasets • Focus on text and numbers that can be mined algorithmically • Data fundamentalism
  • 24. Data fundamentalism The notion that correlation always indicates causation, and that massive data sets and predictive analytics always reflect ‘objective truth’. Idea and belief in the existence of an objective ‘truth’, that something can be fully understood from a single perspective, again brings to light tensions about how the social world can be made known.
  • 26. How do we ground online data? In the offline: assessing findings against what we know about an offline population (census data) in order to better understand online data. Problems with over/under representation in online data? In the online: premised on the idea that data derived from social media should be grounded in other online data in order to understand it. So comparing Facebook use to what we know about Facebook use, rather than connecting it to offline measurements about citizens. Richard Rogers
  • 27. Important considerations 1. Asking the right question – research should be question driven rather than data driven. 2. Accept poor data quality & users gaming metrics – once online metrics have value users will try to game them. 3. Limitations of tools (often built in disconnected way) 4. Transparency – researchers should be upfront about limitations of research and research design. Can the data answer the questions?
  • 28. A critical reflection on Big Data: considering APIs, researchers and tools as data makers
  • 29. Rather than assuming data already exists ‘out there’, waiting to simply be recovered and turned into findings, the article examines how data is co-produced through dynamic research intersections. A particular focus is the intersections between the Application Programming Interface (API), the researcher collecting the data as well as the tools used to process it. In light of this, the article offers three new ways to define and think about Big Data and proposes a series of practical suggestions for making data. (First Monday, October 2013, http://firstmonday.org/)
  • 31. Standard API sampling problems Sampling from the FIREHOSE 1% random sample of the firehose If not rate limited – all data collected?
  • 32. New API sampling problems New business models: enriched metadata Social media vs social data Datasift, GNIP and Topsy
  • 33. Social media VS social data • Social Media: User-generated content where one user communicates and expresses themselves and that content is delivered to other users. Examples of this are platforms such as Twitter, Facebook, YouTube, Tumblr and Disqus. Social media is delivered in a great user experience, and is focused on sharing and content discovery. Social media also offers both public and private experiences with the ability to share messages privately. • Social Data: Expresses social media in a computer-readable format (e.g. JSON) and shares metadata about the content to help provide not only content, but context. Metadata often includes information about location, engagement and links shared. Unlike social media, social data is focused strictly on publicly shared experiences. (Cairns, 2013)
  • 35.
  • 36. Enriched metadata Location and influence Where are users? Are they influential?
  • 38. New Profile Geo Enrichment
  • 39. Geo-locating tweets Exact location Lat/long coordinates Gold standard geo data Problem: only 1% of users -> Only 2% of firehose tweets Early adopters, highly skewed Where in the world are you? No Lat/long coordinates Text field – enter anything Advantage: more than half of all tweets contain profile location Much more evenly distributed
  • 40. Profile Geo Enrichment ‘our customers can now hear from the whole world of Twitter users and not just 1%’ (Cairs, 2013 on Gnip company Blog) • Activity Location – 1% that provide lat/long • Profile Location – Place provided in their profile. May or may not be posting from there. • Mentioned Location – Places a user talks about ‘Both the tweet text and Profile fields contain geographic information, but not in substantial quantities and have poor accuracy’ (Leetaru et al, First Monday, May 2013)
  • 41. Problem with deleted tweets ‘A deleted tweet effectively disappears from the results of searching Twitter, although a short delay sometimes occurs between deletion and disappearance. A status deletion notice is distributed via the Twitter streaming API to relevant users’ clients so that they, in turn, remove deleted tweets from their records.’ ‘Twitter does not provide a bulk-deletion of user’s tweets. It provides, however, a one-click bulk-deletion of all location data that were attached to user’s tweets, without deleting the tweets. By clicking on the “Delete all location information” button on user’s account settings page, all locations attached to all previous tweets are deleted. (Almuhimedi et al, 2013)
  • 42. Profile Geo Enrichment Linking data ‘Profile location data can be used to unlock demographic data and other information that is not otherwise possible with activity location. For instance, US Census Bureau statistics are aggregated at the locality level and can provide basic stats like household income. Profile location is also a strong indicator of activity location when one isn’t provided. (Cairns, 2013)
  • 44. Klout used to make profiles without consent
  • 45. This is a DM. What is it doing here?
  • 46. Klout rewards users for giving data More data = more influential? Online/offline?
  • 48. Fake followers: Mitt Romney’s 100,000 extra followers in one day As many as 20 million fake follower accounts (200 million active users) This doesn’t take into account the issue of spoof accounts (clearly in evidence in riot tweets) (Perlroth, 2013)
  • 49. Klout scores industry standard measure…
  • 50. Ability to describe the limitations of our data: - APIs as data makers. Once data is linked very hard to untangle how metadata is constructed and where problems might be. Included in terms of deleted content. - Researchers and tools as data makers
  • 51. - When creating a dataset important to describe how it was made, what the limitations are. What the sampling limitations (both in terms of the API, but also related to offline ‘population’. What other limitations re: enriched metadata needs to be described?) - When creating a dataset how complete is it? - Limitations need to be known in order to describe them. This is a real problem.
  • 52. Tools as data makers In answering complex questions about social media data, we need: 1. Know the questions! And know how they might be answered. 2. Problem with tools: not question driven. Often developed around available (poor quality) data, often by non social media experts, but those with data processing expertise. 3. Tools therefore become data-makers in that they limit the scope of possibility in the questions researchers imagine. This is a huge problem!
  • 53. Need better understanding of complex ever changing dynamics between APIs Researchers Tools
  • 54. Organic data / data in the wild SOCIAL MEDIA SIMPLY AS (BIG) DATA VS SOCIAL MEDIA AS A RESEARCH AREA
  • 56. TO UNDERSTAND TWITTER DATA YOU NEED TO UNDERSTAND TWITTER
  • 57. + BE ON TWITTER
  • 60.
  • 62. Images posses the ability to grab our attention Social media companies know this Images are key to engagement
  • 63.
  • 64.
  • 65. Camera: used to be for special occasions Smartphone: always with us
  • 67. US 65% smartphone penetration Smartphones overtaken desktop usage to access the internet Mobile internet accounts for majority of internet use in US (57%) Users typically access the internet via apps on mobile devices All figures from comScore, US Digital Future in Focus, 2014
  • 68. UK: The over-55s will experience the fastest year-on-year rises in smartphone penetration. Smartphone ownership should increase to about 50% by year-end, a 25% increase from 2013, but trailing 70% penetration among 18-54s. The difference in smartphone penetration by age will disappear, but differences in usage of smartphones remain substantial. Many over 55s use smartphones like feature phones. All figures from Deloitte, predictions for 2014
  • 69. Rise of platforms and apps focused on visual content Pinterest Tumblr Instagram Vine Snapchat ‘Mobile first… and only’ | simple easy, user friendly design
  • 70. Facebook daily image uploads: 350 million (November 2013) Instagram daily image uploads: 60 million (March 2014) Twitter: 500 million tweets daily (March 2014) Snapchat daily snaps: 400 million (November 2013)
  • 71.
  • 72. Images largely ignored in social media research Not easy to ‘mine’ Hard to figure out meaning Huge interest in industry
  • 73. WHAT DOES THE FUTURE OF SOCIAL MEDIA RESEARCH LOOK LIKE?
  • 74. QUESTION DRIVEN (TOOL AWARE) + CRITICAL BETTER METHODS MORE THEORY TRANSPARENT SUSTAINABLE ETHICAL CROSS PLATFORM INTERDISCIPLINARY MORE CROSS SECTOR? MORE FUNDING!
  • 76. References • Hazim Almuhimedia, Shomir Wilsona, Bin Liua, Norman Sadeha, Alessandro Acquistib, 2012. ‘Tweets Are Forever: A Large-Scale Quantitative Analysis of Deleted Tweets’, CSCW’13, February 23–27, 2013, San Antonio, Texas, USA, http://www.cs.cmu.edu/~shomir/cscw2013_tweets_are_forever.pdf, accessed 18 September, 2013. • Ian Cairns, 2013. ‘Get More Geodata From Gnip With Our New Profile Geo Enrichment’, Gnip Company Blog, 22 August, at http://blog.gnip.com/tag/geolocation/, accessed 13 September 2013. • Grcommunication, 2012, ‘I will help raise your Klout score by sending you 10Ks and will tweet it out to my 50K+ followers from my 80+ Klout score for $5, http://fiverr.com/grcommunication/help-raise-your-klout- score-by-sending-you-10ks-and-will-tweet-it-out-to-my-17k-followers-from-my-70-klout-score, accessed 19 September 2013. • Anthony Ha, 2013. ‘Gnip Expands Its Partnership With Klout, Becoming The Exclusive Provider Of Klout Topics’, TechCrunch, 8 August, http://techcrunch.com/2013/08/08/gnip-klout/, accessed 19 September 2013. • Martin Hawksey, 2013. ‘Twitter throws a bone: Increased hits and metadata in Twitter Search API 1.1,’ March 28, at http://mashe.hawksey.info/2013/03/twitterthrows-a-bone-increased-hits-and-metadata-in- twitter-search-api-1-1/ , accessed 10 September 2013. • Kalev H. Leetaru, Shaowen Wang, Guofeng Cao, Anand Padmanabhan, and Eric Shook, 2013, ‘Mapping the global Twitter heartbeat: The geography of Twitter, First Monday, Volume 18, Number 5-6 May, http://firstmonday.org/article/view/4366/3654 • Nicole Perlroth, 2013. ‘Fake Twitter Followers Become Multimillion-Dollar Business’, New York Times, Bits blog, 5 April, http://bits.blogs.nytimes.com/2013/04/05/fake-twitter-followers-becomes-multimillion- dollar-business/, accessed 19 September 2013. • Farida Vis, 2013. ‘A critical reflection on Big Data: considering APIs, researchers and tools as data makers’, First Monday, 7 October, http://firstmonday.org