@qutdmrc
AoIR 2018, Montréal, 12 Oct. 2018
Axel Bruns | @snurb_dot_info
A Multi-Institutional Approach to ‘Big Social Data’
The TrISMA Project
TrISMA: Tracking Infrastructure for Social Media Analysis
● TrISMA:
● ARC LIEF project, 2014-16
● QUT, Curtin, Deakin, Swinburne, NLA (+ ECU, Sydney, WSU)
● Now maintained by the QUT Digital Observatory
● Key capabilities:
● Australian Twitter Collection: live tracker of 500,000 most active Twitter accounts
● Australian Twittersphere network: follower relations between 3.7m Australian Twitter accounts
● Australian Facebook Collection: activity on public Australian Facebook pages
Capabilities: Twitter
The Australian Twittersphere
● Twitter in Australia:
● Strong take-up since 2009
● Centred around 25-55 age range, urban, educated, affluent users (but gradually broadening)
● Significant role in crisis communication, political communication, audience engagement, …
● Mapping the Twittersphere:
● Long-term project to identify all Australian Twitter accounts
● First iteration: snowball crawl of follower/followee networks
● Starting with key hashtag populations (#auspol, #spill, …)
● Map of ~1m accounts in early 2012
● Second iteration: full crawl of global Twitter ID numberspace through to Sep. 2013 (~870m accounts)
● Third iteration: full crawl of global Twitter ID numberspace through to Feb. 2016 (~1.4b accounts)
● Filtering by description, location, timezone fields: identifiably Australian cities, states, timezones, etc.
● 4 million Australian accounts identified (by Feb. 2016)
● Retrieval of their follower/followee lists
● Continuous gathering of public tweets for the 500,000 most active accounts
● Capturing ~900,000 new tweets per day
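The profile-field filtering step described above can be sketched roughly as follows. This is a minimal illustration only: the term lists, the `time_zone` key, and the function name `looks_australian` are assumptions made for the example, not the project's actual lists or code.

```python
import re

# Hypothetical marker lists; the slides do not give the real ones.
# Some names are ambiguous (e.g. Perth is also in Scotland), so a real
# filter would need manual checks for false positives.
AU_PLACE_TERMS = [
    "australia", "sydney", "melbourne", "brisbane", "perth", "adelaide",
    "hobart", "darwin", "canberra", "queensland", "tasmania",
    "new south wales",
]
# Assumed values of the profile timezone field.
AU_TIMEZONES = {
    "Sydney", "Melbourne", "Brisbane", "Perth",
    "Adelaide", "Hobart", "Darwin", "Canberra",
}

_PLACE_RE = re.compile(
    r"\b(" + "|".join(re.escape(t) for t in AU_PLACE_TERMS) + r")\b",
    re.IGNORECASE,
)


def looks_australian(profile: dict) -> bool:
    """Return True if description, location, or timezone looks Australian."""
    text = " ".join(
        str(profile.get(field) or "") for field in ("description", "location")
    )
    if _PLACE_RE.search(text):
        return True
    return (profile.get("time_zone") or "") in AU_TIMEZONES
```

A filter of this kind only catches identifiably Australian accounts: profiles with empty or non-geographic fields are missed, which is one reason the resulting count is a lower bound.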
Global Growth
3.7m known Australian accounts
Network of follower connections
Filtered for degree ≥1000
255k nodes (6.4%), 61m edges
Edges not shown in graph
The Australian Twittersphere
[Network map; labelled clusters: Teen Culture, Aspirational, Sports, Netizens, Arts & Culture, Politics, Television, Fashion, Popular Music, Food & Drinks, Agriculture, Activism, Porn, Education, Cycling, News & Generic, Hard Right, Progressive, South Australia, Celebrities, Horse Racing]
(http://bit.ly/QUTDOTwitter)
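The degree filter applied to the follower network for this map can be sketched as below. It assumes that "degree" means followers plus followees counted on the full edge list, and that only edges between surviving accounts are kept; the slides do not specify either detail, and `filter_by_degree` is a name chosen for illustration.

```python
from collections import Counter


def filter_by_degree(edges, min_degree=1000):
    """Keep accounts whose total degree is at least min_degree.

    edges: iterable of (follower_id, followee_id) pairs.
    Returns (kept_nodes, kept_edges), where kept_edges only link kept nodes.
    """
    edges = list(edges)
    degree = Counter()
    for follower, followee in edges:
        degree[follower] += 1   # out-degree: accounts this user follows
        degree[followee] += 1   # in-degree: followers of this user
    kept_nodes = {node for node, d in degree.items() if d >= min_degree}
    kept_edges = [
        (a, b) for a, b in edges if a in kept_nodes and b in kept_nodes
    ]
    return kept_nodes, kept_edges
```

For a network with tens of millions of edges, the same logic would normally run in a graph database or a streaming job rather than in memory.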
Benefits and Challenges
What’s Wrong with Hashtag Studies?
● Hashtag (and keyword) studies:
● Hashtags (often) fail to capture follow-on communication
● Hashtag studies lack context: e.g. what percentage of total tweet volume?
● Only ~18% of tweets by the top 500,000 Australian accounts contain hashtags
● Which hashtags, which keywords?
● Live Twitter API data collection assumes we know tracking terms a priori
● Need for comparative, whole-of-population studies across topics
● What do Australian (not global) Twitter users tweet about?
● It’s the network, stupid:
● Twitter follower relations crucial to message spread and visibility
● Most studies observe activity, not reach: talking, not listening
● Follower network data can show which thematic clusters are active, and which are likely to have seen relevant posts
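The distinction between activity and reach can be illustrated with a small sketch. It assumes three hypothetical inputs: a list of author IDs for the relevant tweets, a follower lookup, and a mapping from account to thematic cluster. "Reached" here means only that an account follows an author, which is an upper bound on who actually saw a post.

```python
from collections import defaultdict


def activity_and_reach(tweet_authors, followers, cluster_of):
    """Compare which clusters posted with which clusters could have seen posts.

    tweet_authors: iterable of author ids, one per relevant tweet.
    followers: dict mapping author id -> set of follower ids.
    cluster_of: dict mapping account id -> cluster label.
    Returns (active, reached): dicts mapping cluster -> set of account ids.
    """
    active = defaultdict(set)
    reached = defaultdict(set)
    for author in tweet_authors:
        active[cluster_of.get(author, "unknown")].add(author)
        for follower in followers.get(author, ()):
            reached[cluster_of.get(follower, "unknown")].add(follower)
    return dict(active), dict(reached)
```

In this toy setup a topic can be tweeted about only by one cluster while being visible mostly to another, which a hashtag dataset alone would not reveal.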
‘Big Social Data’ after the APIcalypse
● Growing challenges:
● Refreshing the data – account population dates from 2016; account IDs no longer consecutive
● Continuous snowballing – add accounts mentioned by tracked accounts?
● Continuing, unforeseeable, inherently anti-researcher API changes
● TrISMA maintenance and management:
● Costly data gathering, storage, processing, and maintenance
● QUT Digital Observatory as long-term investment (well beyond Twitter)
● Need for considerable digital methods research training
● Multi-institutional research project vs. Twitter Terms of Service?
● Twitter Terms of Service vs. public-interest research needs
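The continuous-snowballing idea raised above (adding accounts mentioned by tracked accounts) could look roughly like the sketch below. The mention threshold, the input shape, and the name `snowball_candidates` are assumptions for illustration; the slides pose this as an open question rather than describing an implemented method.

```python
import re
from collections import Counter

# Matches @handles of up to 15 word characters that are not part of an
# email address or another word.
MENTION_RE = re.compile(r"(?<!\w)@(\w{1,15})")


def snowball_candidates(tweets, tracked, min_mentions=2):
    """Suggest untracked accounts that tracked accounts mention repeatedly.

    tweets: iterable of (author_handle, text) pairs.
    tracked: collection of handles already being tracked.
    Returns a list of (handle, tweet_count) sorted by count, then handle.
    """
    tracked_lower = {name.lower() for name in tracked}
    counts = Counter()
    for author, text in tweets:
        if author.lower() not in tracked_lower:
            continue  # only mentions made by tracked accounts count
        # Count each mentioned handle at most once per tweet.
        for name in {m.lower() for m in MENTION_RE.findall(text)}:
            if name not in tracked_lower:
                counts[name] += 1
    candidates = [(n, c) for n, c in counts.items() if c >= min_mentions]
    return sorted(candidates, key=lambda item: (-item[1], item[0]))
```

Candidates found this way would still need to be checked for being Australian before joining the tracked population.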
Intransparency Spun as Data Protection
● The API clampdown:
● Facebook, Twitter, Instagram, … reducing API functionality and access
● Mounting casualties: Netvizz, Texifter, … (but not the commercial services?)
● Spin: ‘we are doing more to protect your data’ after Cambridge Analytica
● Reality: ‘we are doing more to frustrate independent, critical, public-interest scrutiny’
● Hiding problems (hate speech, bullying, ‘fake news’), rather than addressing them
● What can we do?
● Give up, walk away, research other things
● Protest, lobby companies for access, lobby legislators for pro-research regulation
● Use what remains of the APIs to gather what we can, and (carefully) share what we gather
● Build shared repositories for our social media datasets, and develop ethical access frameworks
● Explore scraping and other alternatives even if they break the ToS, in the public interest
(IT University Copenhagen, 27-28 Oct. 2018)
@snurb_dot_info – http://snurb.info/
@socialmediaQUT – http://socialmedia.qut.edu.au/
@qutdmrc – https://www.qut.edu.au/research/dmrc
This research is supported by the ARC Future Fellowship project
“Understanding Intermedia Information Flows in the Australian
Online Public Sphere”, and the ARC LIEF project “TrISMA:
Tracking Infrastructure for Social Media Analysis.”
