SlideShare a Scribd company logo
Conspiracy Stories:
Building Archives to
Facilitate Narrative
Analyses of Online Fake
News
Peter Broadwell (@PeterBroadwell)
Digital Library Program
University of California, Los Angeles
IFLA International News Media Conference
Reykjavík, Iceland, 27-28 April 2017
Peter Broadwell (@PeterBroadwell) - UCLA Digital Library
Building Archives to Facilitate Narrative Analyses of Fake News
IFLA International News Media Conference, 27 April 2017
Fake news is…
2
1 Jacob L. Nelson. 2017. “Is ‘fake news’ a fake problem?” Columbia Journalism Review, January 31, 2017
• Difficult to define and identify
• A badly overused term – especially these days
• Prominent on social media:
30% of traffic to fake news sites is from Facebook;
which is true for only 8% of legitimate news
• Financed by online advertising systems (Google),
and maybe some governments
• Now seen as a serious issue by journalists and
some Internet companies (finally)
• Not as dire a problem as some believe?1
• Worthy of further study
Peter Broadwell (@PeterBroadwell) - UCLA Digital Library
Building Archives to Facilitate Narrative Analyses of Fake News
IFLA International News Media Conference, 27 April 2017
3
Gregor Aisch, Jon Huang, Cecilia Kang,“Dissecting the #PizzaGate Conspiracy Theories,” New York Times, December 10, 2016
Analyzing the “shape” of conspiracy stories
Peter Broadwell (@PeterBroadwell) - UCLA Digital Library
Building Archives to Facilitate Narrative Analyses of Fake News
IFLA International News Media Conference, 27 April 2017
Title
4
TR Tangherlini, V Roychowdhury, B Glenn, CM Crespi, R Bandari, A Wadia, M Falahi, E
Ebrahimzadeh, R Bastani. 2016. “‘Mommy Blogs’ and the Vaccination Exemption
Narrative: Results From A Machine-Learning Approach for Story Aggregation on
Parenting Social Media Sites.” JMIR Public Health Surveillance 2:2 (2016).
Peter Broadwell (@PeterBroadwell) - UCLA Digital Library
Building Archives to Facilitate Narrative Analyses of Fake News
IFLA International News Media Conference, 27 April 2017
Don’t believe everything you read online…
5
http://newsroom.ucla.edu/releases/ucla-researchers-teach-computer-to-read-the-internet
Peter Broadwell (@PeterBroadwell) - UCLA Digital Library
Building Archives to Facilitate Narrative Analyses of Fake News
IFLA International News Media Conference, 27 April 2017
6
TR Tangherlini, V Roychowdhury, B Glenn, CM Crespi, R Bandari, A Wadia, M Falahi, E Ebrahimzadeh, R
Bastani. 2016. “‘Mommy Blogs’ and the Vaccination Exemption Narrative: Results From A Machine-Learning
Approach for Story Aggregation on Parenting Social Media Sites.” JMIR Public Health Surveillance 2:2 (2016).
Narrative framework analysis
Peter Broadwell (@PeterBroadwell) - UCLA Digital Library
Building Archives to Facilitate Narrative Analyses of Fake News
IFLA International News Media Conference, 27 April 2017
Building our own “fake news” web archive
7
Peter Broadwell (@PeterBroadwell) - UCLA Digital Library
Building Archives to Facilitate Narrative Analyses of Fake News
IFLA International News Media Conference, 27 April 2017
8
Or… using someone else’s archive?
Peter Broadwell (@PeterBroadwell) - UCLA Digital Library
Building Archives to Facilitate Narrative Analyses of Fake News
IFLA International News Media Conference, 27 April 2017
9
Small-scale case studies:
Pizzagate
and
Bridgegate
Peter Broadwell (@PeterBroadwell) - UCLA Digital Library
Building Archives to Facilitate Narrative Analyses of Fake News
IFLA International News Media Conference, 27 April 2017
Pre-selected “archives” of coverage
10
Peter Broadwell (@PeterBroadwell) - UCLA Digital Library
Building Archives to Facilitate Narrative Analyses of Fake News
IFLA International News Media Conference, 27 April 2017
Web archives: too much data,
not enough information
11
In WANE files Extracted by DBpedia Spotlight
3,376 unique names 954 unique Wikipedia entities
Christie_(P): 376
Tony_(P): 241
Chris_Christie_(P): 236
David_Wildstein_(P): 190
Bridget_Anne_Kelly_(P): 169
Chris_Christie_(P): 435
David_Wildstein_(P): 263
Bridget_Anne_Kelly_(P): 222
Bill_Baroni_(P): 194
Mark_Sokolich_(P): 130
• 2 seed URLs, 1 from Huffington Post, 1 from NJ Record
• Full Archive-It WARCs contain 124,355 unique linked URLs
• But 47,398 are adverts/spam, 73,544 likely news articles
• Only 415 news articles (163 from HuffPo, 252 from the Record)
are relevant to Bridgegate (.5%)
Peter Broadwell (@PeterBroadwell) - UCLA Digital Library
Building Archives to Facilitate Narrative Analyses of Fake News
IFLA International News Media Conference, 27 April 2017
Returning to Pizzagate…
12
Peter Broadwell (@PeterBroadwell) - UCLA Digital Library
Building Archives to Facilitate Narrative Analyses of Fake News
IFLA International News Media Conference, 27 April 2017
13
Gregor Aisch, Jon Huang, Cecilia Kang,“Dissecting the #PizzaGate Conspiracy Theories,” New York Times, December 10, 2016
Can we generate this computationally?
Peter Broadwell (@PeterBroadwell) - UCLA Digital Library
Building Archives to Facilitate Narrative Analyses of Fake News
IFLA International News Media Conference, 27 April 2017
14
A simple Pizzagate network model
Peter Broadwell (@PeterBroadwell) - UCLA Digital Library
Building Archives to Facilitate Narrative Analyses of Fake News
IFLA International News Media Conference, 27 April 2017
15
Towards a useful Pizzagate network model
Peter Broadwell (@PeterBroadwell) - UCLA Digital Library
Building Archives to Facilitate Narrative Analyses of Fake News
IFLA International News Media Conference, 27 April 2017
16
Bridgegate network, highlighting “degree”
Peter Broadwell (@PeterBroadwell) - UCLA Digital Library
Building Archives to Facilitate Narrative Analyses of Fake News
IFLA International News Media Conference, 27 April 2017
17
Bridgegate network, showing “betweenness”
Peter Broadwell (@PeterBroadwell) - UCLA Digital Library
Building Archives to Facilitate Narrative Analyses of Fake News
IFLA International News Media Conference, 27 April 2017
Archiving fake news for research…
18
• Is difficult to do well
• Requires constant monitoring and checking of
targeted sites, given capabilities of existing tools
• Would benefit from coordination between
institutions to distribute web-crawling tasks
• Entails a great deal of manual and semi-automated
content classification and filtering after collection
• Calls for the use of more sophisticated tools for
named entity resolution, disambiguation and
versioning of archival content
• Is potentially very worthwhile, despite these issues
Peter Broadwell (@PeterBroadwell) - UCLA Digital Library
Building Archives to Facilitate Narrative Analyses of Fake News
IFLA International News Media Conference, 27 April 2017
Thanks!
19
• Prof. Tim Tangherlini, UCLA Scandinavian Section
• Prof. Vwani Roychowdhury, UCLA Electrical Engineering
• Ph.D. students, UCLA Electrical Engineering:
• Ehsan Ebrahimzadeh
• Behnam Shahbazi
• Misagh Falahi
• Mark Graham, Internet Archive
• Karl Blumenthal, Archive-It

More Related Content

What's hot

Tackling fake news wut approach
Tackling fake news   wut approachTackling fake news   wut approach
Tackling fake news wut approach
Gabriela Grosseck
 
Linked Data : Cataloguing and a World Wide Web of Data
Linked Data : Cataloguing and a World Wide Web of DataLinked Data : Cataloguing and a World Wide Web of Data
Linked Data : Cataloguing and a World Wide Web of Data
Thomas Meehan
 
Facebook
FacebookFacebook
Good Riddance: Academic Publishers are Abandoning Publishing
Good Riddance: Academic Publishers are Abandoning PublishingGood Riddance: Academic Publishers are Abandoning Publishing
Good Riddance: Academic Publishers are Abandoning Publishing
Björn Brembs
 
Social Media for Researchers
Social Media for ResearchersSocial Media for Researchers
Social Media for ResearchersRichard Hall
 
Social Media Research Methods
Social Media Research MethodsSocial Media Research Methods
Social Media Research Methods
Katrin Weller
 
Computational Approaches to Studying Anti-Social Behaviour on Social Media
Computational Approaches to Studying Anti-Social Behaviour on Social MediaComputational Approaches to Studying Anti-Social Behaviour on Social Media
Computational Approaches to Studying Anti-Social Behaviour on Social Media
Toronto Metropolitan University
 
Facebook Apps and Libraries' Friendly Future
Facebook Apps and Libraries' Friendly FutureFacebook Apps and Libraries' Friendly Future
Facebook Apps and Libraries' Friendly Future
Cliff Landis
 
Constructing A Professional Presence - HEA Professional Presences For Academi...
Constructing A Professional Presence - HEA Professional Presences For Academi...Constructing A Professional Presence - HEA Professional Presences For Academi...
Constructing A Professional Presence - HEA Professional Presences For Academi...
Thomas Lancaster
 
Echo Chamber? What Echo Chamber? Reviewing the Evidence
Echo Chamber? What Echo Chamber? Reviewing the EvidenceEcho Chamber? What Echo Chamber? Reviewing the Evidence
Echo Chamber? What Echo Chamber? Reviewing the Evidence
Axel Bruns
 
Joining the ‘buzz’ : the role of social media in raising research visibility ...
Joining the ‘buzz’ : the role of social media in raising research visibility ...Joining the ‘buzz’ : the role of social media in raising research visibility ...
Joining the ‘buzz’ : the role of social media in raising research visibility ...
Eileen Shepherd
 
Ifsi library general information sheet september2010
Ifsi library general information sheet september2010Ifsi library general information sheet september2010
Ifsi library general information sheet september2010ifsilibrary
 
Twitter as a First Draft of the Present – and the Challenges of Preserving It...
Twitter as a First Draft of the Present – and the Challenges of Preserving It...Twitter as a First Draft of the Present – and the Challenges of Preserving It...
Twitter as a First Draft of the Present – and the Challenges of Preserving It...
Axel Bruns
 
Managing a (different) Data Deluge - SPARC OA conference
Managing a (different) Data Deluge - SPARC OA conferenceManaging a (different) Data Deluge - SPARC OA conference
Managing a (different) Data Deluge - SPARC OA conference
Cameron Neylon
 
Advancing Open @ Vanderbilt
Advancing Open @ VanderbiltAdvancing Open @ Vanderbilt
Advancing Open @ Vanderbilt
Nicole Allen
 
Why twitter? What can Twitter do for my library & my professional development?
Why twitter? What can Twitter do for my library & my professional development?Why twitter? What can Twitter do for my library & my professional development?
Why twitter? What can Twitter do for my library & my professional development?
Bill Drew
 
Digital Scholarship: building an online scholarly presence
Digital Scholarship: building an online scholarly presenceDigital Scholarship: building an online scholarly presence
Digital Scholarship: building an online scholarly presence
Alison McNab
 
The biggest threat to science today: the scholarly publishing system
The biggest threat to science today: the scholarly publishing systemThe biggest threat to science today: the scholarly publishing system
The biggest threat to science today: the scholarly publishing system
Björn Brembs
 
20140408 digital newspapers collections [idlc kuala lumpur]
20140408 digital newspapers collections [idlc kuala lumpur]20140408 digital newspapers collections [idlc kuala lumpur]
20140408 digital newspapers collections [idlc kuala lumpur]
Frederick Zarndt
 

What's hot (20)

Tackling fake news wut approach
Tackling fake news   wut approachTackling fake news   wut approach
Tackling fake news wut approach
 
Linked Data : Cataloguing and a World Wide Web of Data
Linked Data : Cataloguing and a World Wide Web of DataLinked Data : Cataloguing and a World Wide Web of Data
Linked Data : Cataloguing and a World Wide Web of Data
 
Facebook
FacebookFacebook
Facebook
 
Good Riddance: Academic Publishers are Abandoning Publishing
Good Riddance: Academic Publishers are Abandoning PublishingGood Riddance: Academic Publishers are Abandoning Publishing
Good Riddance: Academic Publishers are Abandoning Publishing
 
Social Media for Researchers
Social Media for ResearchersSocial Media for Researchers
Social Media for Researchers
 
Social Media Research Methods
Social Media Research MethodsSocial Media Research Methods
Social Media Research Methods
 
Computational Approaches to Studying Anti-Social Behaviour on Social Media
Computational Approaches to Studying Anti-Social Behaviour on Social MediaComputational Approaches to Studying Anti-Social Behaviour on Social Media
Computational Approaches to Studying Anti-Social Behaviour on Social Media
 
Facebook Apps and Libraries' Friendly Future
Facebook Apps and Libraries' Friendly FutureFacebook Apps and Libraries' Friendly Future
Facebook Apps and Libraries' Friendly Future
 
Constructing A Professional Presence - HEA Professional Presences For Academi...
Constructing A Professional Presence - HEA Professional Presences For Academi...Constructing A Professional Presence - HEA Professional Presences For Academi...
Constructing A Professional Presence - HEA Professional Presences For Academi...
 
Echo Chamber? What Echo Chamber? Reviewing the Evidence
Echo Chamber? What Echo Chamber? Reviewing the EvidenceEcho Chamber? What Echo Chamber? Reviewing the Evidence
Echo Chamber? What Echo Chamber? Reviewing the Evidence
 
Joining the ‘buzz’ : the role of social media in raising research visibility ...
Joining the ‘buzz’ : the role of social media in raising research visibility ...Joining the ‘buzz’ : the role of social media in raising research visibility ...
Joining the ‘buzz’ : the role of social media in raising research visibility ...
 
Listserv Monitoring Report
Listserv Monitoring ReportListserv Monitoring Report
Listserv Monitoring Report
 
Ifsi library general information sheet september2010
Ifsi library general information sheet september2010Ifsi library general information sheet september2010
Ifsi library general information sheet september2010
 
Twitter as a First Draft of the Present – and the Challenges of Preserving It...
Twitter as a First Draft of the Present – and the Challenges of Preserving It...Twitter as a First Draft of the Present – and the Challenges of Preserving It...
Twitter as a First Draft of the Present – and the Challenges of Preserving It...
 
Managing a (different) Data Deluge - SPARC OA conference
Managing a (different) Data Deluge - SPARC OA conferenceManaging a (different) Data Deluge - SPARC OA conference
Managing a (different) Data Deluge - SPARC OA conference
 
Advancing Open @ Vanderbilt
Advancing Open @ VanderbiltAdvancing Open @ Vanderbilt
Advancing Open @ Vanderbilt
 
Why twitter? What can Twitter do for my library & my professional development?
Why twitter? What can Twitter do for my library & my professional development?Why twitter? What can Twitter do for my library & my professional development?
Why twitter? What can Twitter do for my library & my professional development?
 
Digital Scholarship: building an online scholarly presence
Digital Scholarship: building an online scholarly presenceDigital Scholarship: building an online scholarly presence
Digital Scholarship: building an online scholarly presence
 
The biggest threat to science today: the scholarly publishing system
The biggest threat to science today: the scholarly publishing systemThe biggest threat to science today: the scholarly publishing system
The biggest threat to science today: the scholarly publishing system
 
20140408 digital newspapers collections [idlc kuala lumpur]
20140408 digital newspapers collections [idlc kuala lumpur]20140408 digital newspapers collections [idlc kuala lumpur]
20140408 digital newspapers collections [idlc kuala lumpur]
 

Similar to Conspiracy Stories: Building Archives to Facilitate Narrative Analyses of Online Fake News

Fake news
Fake newsFake news
Fake news
Cendrella Habre
 
Media literacy panel
Media literacy panel Media literacy panel
Media literacy panel
Cody Hennesy
 
2018 BHL Program Director’s Report: Secretariat & Technical Update
2018 BHL Program Director’s Report: Secretariat & Technical Update2018 BHL Program Director’s Report: Secretariat & Technical Update
2018 BHL Program Director’s Report: Secretariat & Technical Update
Martin Kalfatovic
 
ICDM 2017 tutorial misinformation
ICDM 2017 tutorial misinformationICDM 2017 tutorial misinformation
ICDM 2017 tutorial misinformation
Arizona State University
 
#3 DataBeersBCN - "How to get into the news with Social networks analysis" by...
#3 DataBeersBCN - "How to get into the news with Social networks analysis" by...#3 DataBeersBCN - "How to get into the news with Social networks analysis" by...
#3 DataBeersBCN - "How to get into the news with Social networks analysis" by...
DataBeersBCN
 
@WebSciDL PhD Student Project Reviews August 5&6, 2015
@WebSciDL PhD Student Project Reviews August 5&6, 2015@WebSciDL PhD Student Project Reviews August 5&6, 2015
@WebSciDL PhD Student Project Reviews August 5&6, 2015
Michael Nelson
 
Disrupting academic publishing: a future role for libraries
Disrupting academic publishing: a future role for librariesDisrupting academic publishing: a future role for libraries
Disrupting academic publishing: a future role for libraries
Brian Hole
 
Power&5th windsor-2016
Power&5th windsor-2016Power&5th windsor-2016
Fake News, Real Teens: Problems and Possibilities
Fake News, Real Teens: Problems and PossibilitiesFake News, Real Teens: Problems and Possibilities
Fake News, Real Teens: Problems and Possibilities
Tom Mackey
 
Sustainable, Successful Open Data Publication
Sustainable, Successful Open Data PublicationSustainable, Successful Open Data Publication
Sustainable, Successful Open Data Publication
Brian Hole
 
Platform Research Agenda
Platform Research AgendaPlatform Research Agenda
Platform Research Agenda
Peter C. Evans, PhD
 
Media and Information Literacy (MIL) - 5. Media and Information Sources
Media and Information Literacy (MIL) - 5. Media and Information SourcesMedia and Information Literacy (MIL) - 5. Media and Information Sources
Media and Information Literacy (MIL) - 5. Media and Information Sources
Arniel Ping
 
Media and Information Literacy
Media and Information LiteracyMedia and Information Literacy
Media and Information Literacy
John Paul Hallasgo
 
(lc 13,14) mil-massmediaandmediaeffects-160825023930.pdf
(lc 13,14) mil-massmediaandmediaeffects-160825023930.pdf(lc 13,14) mil-massmediaandmediaeffects-160825023930.pdf
(lc 13,14) mil-massmediaandmediaeffects-160825023930.pdf
ClaesTrinio
 
2013 ifla satellite zarndt et al [crowdsourcing the world's cultural heritage...
2013 ifla satellite zarndt et al [crowdsourcing the world's cultural heritage...2013 ifla satellite zarndt et al [crowdsourcing the world's cultural heritage...
2013 ifla satellite zarndt et al [crowdsourcing the world's cultural heritage...Frederick Zarndt
 
Special libraries association meeting march 2014
Special libraries association meeting march 2014Special libraries association meeting march 2014
Special libraries association meeting march 2014
Trish Rose-Sandler
 
Outreach Strategies to Engage Citizen Scientists: Insights from the Biodivers...
Outreach Strategies to Engage Citizen Scientists: Insights from the Biodivers...Outreach Strategies to Engage Citizen Scientists: Insights from the Biodivers...
Outreach Strategies to Engage Citizen Scientists: Insights from the Biodivers...
costantinog
 
Crisis COVID 19: Making Open Access Resources Reachable for Enhancing Online ...
Crisis COVID 19: Making Open Access Resources Reachable for Enhancing Online ...Crisis COVID 19: Making Open Access Resources Reachable for Enhancing Online ...
Crisis COVID 19: Making Open Access Resources Reachable for Enhancing Online ...
Zakir Hossain/ICS, Zurich
 
New Forms Of Communication: Harnessing Collective Knowledge through Web Logs
New Forms Of Communication: Harnessing Collective Knowledge through Web LogsNew Forms Of Communication: Harnessing Collective Knowledge through Web Logs
New Forms Of Communication: Harnessing Collective Knowledge through Web Logs
guestf2c507
 
New Forms Of Communication: Harnessing Collective Knowledge through Web Logs
New Forms Of Communication: Harnessing Collective Knowledge through Web LogsNew Forms Of Communication: Harnessing Collective Knowledge through Web Logs
New Forms Of Communication: Harnessing Collective Knowledge through Web Logs
Bryan Loar
 

Similar to Conspiracy Stories: Building Archives to Facilitate Narrative Analyses of Online Fake News (20)

Fake news
Fake newsFake news
Fake news
 
Media literacy panel
Media literacy panel Media literacy panel
Media literacy panel
 
2018 BHL Program Director’s Report: Secretariat & Technical Update
2018 BHL Program Director’s Report: Secretariat & Technical Update2018 BHL Program Director’s Report: Secretariat & Technical Update
2018 BHL Program Director’s Report: Secretariat & Technical Update
 
ICDM 2017 tutorial misinformation
ICDM 2017 tutorial misinformationICDM 2017 tutorial misinformation
ICDM 2017 tutorial misinformation
 
#3 DataBeersBCN - "How to get into the news with Social networks analysis" by...
#3 DataBeersBCN - "How to get into the news with Social networks analysis" by...#3 DataBeersBCN - "How to get into the news with Social networks analysis" by...
#3 DataBeersBCN - "How to get into the news with Social networks analysis" by...
 
@WebSciDL PhD Student Project Reviews August 5&6, 2015
@WebSciDL PhD Student Project Reviews August 5&6, 2015@WebSciDL PhD Student Project Reviews August 5&6, 2015
@WebSciDL PhD Student Project Reviews August 5&6, 2015
 
Disrupting academic publishing: a future role for libraries
Disrupting academic publishing: a future role for librariesDisrupting academic publishing: a future role for libraries
Disrupting academic publishing: a future role for libraries
 
Power&5th windsor-2016
Power&5th windsor-2016Power&5th windsor-2016
Power&5th windsor-2016
 
Fake News, Real Teens: Problems and Possibilities
Fake News, Real Teens: Problems and PossibilitiesFake News, Real Teens: Problems and Possibilities
Fake News, Real Teens: Problems and Possibilities
 
Sustainable, Successful Open Data Publication
Sustainable, Successful Open Data PublicationSustainable, Successful Open Data Publication
Sustainable, Successful Open Data Publication
 
Platform Research Agenda
Platform Research AgendaPlatform Research Agenda
Platform Research Agenda
 
Media and Information Literacy (MIL) - 5. Media and Information Sources
Media and Information Literacy (MIL) - 5. Media and Information SourcesMedia and Information Literacy (MIL) - 5. Media and Information Sources
Media and Information Literacy (MIL) - 5. Media and Information Sources
 
Media and Information Literacy
Media and Information LiteracyMedia and Information Literacy
Media and Information Literacy
 
(lc 13,14) mil-massmediaandmediaeffects-160825023930.pdf
(lc 13,14) mil-massmediaandmediaeffects-160825023930.pdf(lc 13,14) mil-massmediaandmediaeffects-160825023930.pdf
(lc 13,14) mil-massmediaandmediaeffects-160825023930.pdf
 
2013 ifla satellite zarndt et al [crowdsourcing the world's cultural heritage...
2013 ifla satellite zarndt et al [crowdsourcing the world's cultural heritage...2013 ifla satellite zarndt et al [crowdsourcing the world's cultural heritage...
2013 ifla satellite zarndt et al [crowdsourcing the world's cultural heritage...
 
Special libraries association meeting march 2014
Special libraries association meeting march 2014Special libraries association meeting march 2014
Special libraries association meeting march 2014
 
Outreach Strategies to Engage Citizen Scientists: Insights from the Biodivers...
Outreach Strategies to Engage Citizen Scientists: Insights from the Biodivers...Outreach Strategies to Engage Citizen Scientists: Insights from the Biodivers...
Outreach Strategies to Engage Citizen Scientists: Insights from the Biodivers...
 
Crisis COVID 19: Making Open Access Resources Reachable for Enhancing Online ...
Crisis COVID 19: Making Open Access Resources Reachable for Enhancing Online ...Crisis COVID 19: Making Open Access Resources Reachable for Enhancing Online ...
Crisis COVID 19: Making Open Access Resources Reachable for Enhancing Online ...
 
New Forms Of Communication: Harnessing Collective Knowledge through Web Logs
New Forms Of Communication: Harnessing Collective Knowledge through Web LogsNew Forms Of Communication: Harnessing Collective Knowledge through Web Logs
New Forms Of Communication: Harnessing Collective Knowledge through Web Logs
 
New Forms Of Communication: Harnessing Collective Knowledge through Web Logs
New Forms Of Communication: Harnessing Collective Knowledge through Web LogsNew Forms Of Communication: Harnessing Collective Knowledge through Web Logs
New Forms Of Communication: Harnessing Collective Knowledge through Web Logs
 

More from Peter Broadwell

The East Asian Studies Macroscope: Infrastructure for Collaborative Scholars...
The East Asian Studies Macroscope: Infrastructure for Collaborative Scholars...The East Asian Studies Macroscope: Infrastructure for Collaborative Scholars...
The East Asian Studies Macroscope: Infrastructure for Collaborative Scholars...
Peter Broadwell
 
Integration of a Unique Multimedia Collection into Public Linked Open Data R...
Integration of a Unique Multimedia Collection into Public Linked Open Data R...Integration of a Unique Multimedia Collection into Public Linked Open Data R...
Integration of a Unique Multimedia Collection into Public Linked Open Data R...
Peter Broadwell
 
aiSelections: Computational Techniques for Matching Faculty Research Profiles...
aiSelections: Computational Techniques for Matching Faculty Research Profiles...aiSelections: Computational Techniques for Matching Faculty Research Profiles...
aiSelections: Computational Techniques for Matching Faculty Research Profiles...
Peter Broadwell
 
TrollFinder: Geo-Semantic Exploration of a Very Large Corpus of Danish Folklore
TrollFinder: Geo-Semantic Exploration of a Very Large Corpus of Danish FolkloreTrollFinder: Geo-Semantic Exploration of a Very Large Corpus of Danish Folklore
TrollFinder: Geo-Semantic Exploration of a Very Large Corpus of Danish Folklore
Peter Broadwell
 
From Trot to Cultural Technology: The Historical Development of Production Ne...
From Trot to Cultural Technology:The Historical Development of Production Ne...From Trot to Cultural Technology:The Historical Development of Production Ne...
From Trot to Cultural Technology: The Historical Development of Production Ne...
Peter Broadwell
 
Social Network Analysis of Collaborative Composition in Film Scoring via the ...
Social Network Analysis of Collaborative Composition in Film Scoring via the ...Social Network Analysis of Collaborative Composition in Film Scoring via the ...
Social Network Analysis of Collaborative Composition in Film Scoring via the ...
Peter Broadwell
 
ElfYelp: Geolocated Topic Models for Pattern Discovery in a Large Folklore Co...
ElfYelp: Geolocated Topic Models for Pattern Discovery in a Large Folklore Co...ElfYelp: Geolocated Topic Models for Pattern Discovery in a Large Folklore Co...
ElfYelp: Geolocated Topic Models for Pattern Discovery in a Large Folklore Co...
Peter Broadwell
 

More from Peter Broadwell (7)

The East Asian Studies Macroscope: Infrastructure for Collaborative Scholars...
The East Asian Studies Macroscope: Infrastructure for Collaborative Scholars...The East Asian Studies Macroscope: Infrastructure for Collaborative Scholars...
The East Asian Studies Macroscope: Infrastructure for Collaborative Scholars...
 
Integration of a Unique Multimedia Collection into Public Linked Open Data R...
Integration of a Unique Multimedia Collection into Public Linked Open Data R...Integration of a Unique Multimedia Collection into Public Linked Open Data R...
Integration of a Unique Multimedia Collection into Public Linked Open Data R...
 
aiSelections: Computational Techniques for Matching Faculty Research Profiles...
aiSelections: Computational Techniques for Matching Faculty Research Profiles...aiSelections: Computational Techniques for Matching Faculty Research Profiles...
aiSelections: Computational Techniques for Matching Faculty Research Profiles...
 
TrollFinder: Geo-Semantic Exploration of a Very Large Corpus of Danish Folklore
TrollFinder: Geo-Semantic Exploration of a Very Large Corpus of Danish FolkloreTrollFinder: Geo-Semantic Exploration of a Very Large Corpus of Danish Folklore
TrollFinder: Geo-Semantic Exploration of a Very Large Corpus of Danish Folklore
 
From Trot to Cultural Technology: The Historical Development of Production Ne...
From Trot to Cultural Technology:The Historical Development of Production Ne...From Trot to Cultural Technology:The Historical Development of Production Ne...
From Trot to Cultural Technology: The Historical Development of Production Ne...
 
Social Network Analysis of Collaborative Composition in Film Scoring via the ...
Social Network Analysis of Collaborative Composition in Film Scoring via the ...Social Network Analysis of Collaborative Composition in Film Scoring via the ...
Social Network Analysis of Collaborative Composition in Film Scoring via the ...
 
ElfYelp: Geolocated Topic Models for Pattern Discovery in a Large Folklore Co...
ElfYelp: Geolocated Topic Models for Pattern Discovery in a Large Folklore Co...ElfYelp: Geolocated Topic Models for Pattern Discovery in a Large Folklore Co...
ElfYelp: Geolocated Topic Models for Pattern Discovery in a Large Folklore Co...
 

Recently uploaded

Best Digital Marketing Institute In NOIDA
Best Digital Marketing Institute In NOIDABest Digital Marketing Institute In NOIDA
Best Digital Marketing Institute In NOIDA
deeptiverma2406
 
Lapbook sobre os Regimes Totalitários.pdf
Lapbook sobre os Regimes Totalitários.pdfLapbook sobre os Regimes Totalitários.pdf
Lapbook sobre os Regimes Totalitários.pdf
Jean Carlos Nunes Paixão
 
How to Make a Field invisible in Odoo 17
How to Make a Field invisible in Odoo 17How to Make a Field invisible in Odoo 17
How to Make a Field invisible in Odoo 17
Celine George
 
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
MysoreMuleSoftMeetup
 
A Survey of Techniques for Maximizing LLM Performance.pptx
A Survey of Techniques for Maximizing LLM Performance.pptxA Survey of Techniques for Maximizing LLM Performance.pptx
A Survey of Techniques for Maximizing LLM Performance.pptx
thanhdowork
 
Language Across the Curriculm LAC B.Ed.
Language Across the  Curriculm LAC B.Ed.Language Across the  Curriculm LAC B.Ed.
Language Across the Curriculm LAC B.Ed.
Atul Kumar Singh
 
Chapter -12, Antibiotics (One Page Notes).pdf
Chapter -12, Antibiotics (One Page Notes).pdfChapter -12, Antibiotics (One Page Notes).pdf
Chapter -12, Antibiotics (One Page Notes).pdf
Kartik Tiwari
 
Supporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptxSupporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptx
Jisc
 
2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...
Sandy Millin
 
Francesca Gottschalk - How can education support child empowerment.pptx
Francesca Gottschalk - How can education support child empowerment.pptxFrancesca Gottschalk - How can education support child empowerment.pptx
Francesca Gottschalk - How can education support child empowerment.pptx
EduSkills OECD
 
Digital Tools and AI for Teaching Learning and Research
Digital Tools and AI for Teaching Learning and ResearchDigital Tools and AI for Teaching Learning and Research
Digital Tools and AI for Teaching Learning and Research
Vikramjit Singh
 
The Accursed House by Émile Gaboriau.pptx
The Accursed House by Émile Gaboriau.pptxThe Accursed House by Émile Gaboriau.pptx
The Accursed House by Émile Gaboriau.pptx
DhatriParmar
 
Multithreading_in_C++ - std::thread, race condition
Multithreading_in_C++ - std::thread, race conditionMultithreading_in_C++ - std::thread, race condition
Multithreading_in_C++ - std::thread, race condition
Mohammed Sikander
 
Embracing GenAI - A Strategic Imperative
Embracing GenAI - A Strategic ImperativeEmbracing GenAI - A Strategic Imperative
Embracing GenAI - A Strategic Imperative
Peter Windle
 
S1-Introduction-Biopesticides in ICM.pptx
S1-Introduction-Biopesticides in ICM.pptxS1-Introduction-Biopesticides in ICM.pptx
S1-Introduction-Biopesticides in ICM.pptx
tarandeep35
 
Thesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.pptThesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.ppt
EverAndrsGuerraGuerr
 
special B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdfspecial B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdf
Special education needs
 
The French Revolution Class 9 Study Material pdf free download
The French Revolution Class 9 Study Material pdf free downloadThe French Revolution Class 9 Study Material pdf free download
The French Revolution Class 9 Study Material pdf free download
Vivekanand Anglo Vedic Academy
 
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
siemaillard
 
Chapter 3 - Islamic Banking Products and Services.pptx
Chapter 3 - Islamic Banking Products and Services.pptxChapter 3 - Islamic Banking Products and Services.pptx
Chapter 3 - Islamic Banking Products and Services.pptx
Mohd Adib Abd Muin, Senior Lecturer at Universiti Utara Malaysia
 

Recently uploaded (20)

Best Digital Marketing Institute In NOIDA
Best Digital Marketing Institute In NOIDABest Digital Marketing Institute In NOIDA
Best Digital Marketing Institute In NOIDA
 
Lapbook sobre os Regimes Totalitários.pdf
Lapbook sobre os Regimes Totalitários.pdfLapbook sobre os Regimes Totalitários.pdf
Lapbook sobre os Regimes Totalitários.pdf
 
How to Make a Field invisible in Odoo 17
How to Make a Field invisible in Odoo 17How to Make a Field invisible in Odoo 17
How to Make a Field invisible in Odoo 17
 
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
 
A Survey of Techniques for Maximizing LLM Performance.pptx
A Survey of Techniques for Maximizing LLM Performance.pptxA Survey of Techniques for Maximizing LLM Performance.pptx
A Survey of Techniques for Maximizing LLM Performance.pptx
 
Language Across the Curriculm LAC B.Ed.
Language Across the  Curriculm LAC B.Ed.Language Across the  Curriculm LAC B.Ed.
Language Across the Curriculm LAC B.Ed.
 
Chapter -12, Antibiotics (One Page Notes).pdf
Chapter -12, Antibiotics (One Page Notes).pdfChapter -12, Antibiotics (One Page Notes).pdf
Chapter -12, Antibiotics (One Page Notes).pdf
 
Supporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptxSupporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptx
 
2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...
 
Francesca Gottschalk - How can education support child empowerment.pptx
Francesca Gottschalk - How can education support child empowerment.pptxFrancesca Gottschalk - How can education support child empowerment.pptx
Francesca Gottschalk - How can education support child empowerment.pptx
 
Digital Tools and AI for Teaching Learning and Research
Digital Tools and AI for Teaching Learning and ResearchDigital Tools and AI for Teaching Learning and Research
Digital Tools and AI for Teaching Learning and Research
 
The Accursed House by Émile Gaboriau.pptx
The Accursed House by Émile Gaboriau.pptxThe Accursed House by Émile Gaboriau.pptx
The Accursed House by Émile Gaboriau.pptx
 
Multithreading_in_C++ - std::thread, race condition
Multithreading_in_C++ - std::thread, race conditionMultithreading_in_C++ - std::thread, race condition
Multithreading_in_C++ - std::thread, race condition
 
Embracing GenAI - A Strategic Imperative
Embracing GenAI - A Strategic ImperativeEmbracing GenAI - A Strategic Imperative
Embracing GenAI - A Strategic Imperative
 
S1-Introduction-Biopesticides in ICM.pptx
S1-Introduction-Biopesticides in ICM.pptxS1-Introduction-Biopesticides in ICM.pptx
S1-Introduction-Biopesticides in ICM.pptx
 
Thesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.pptThesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.ppt
 
special B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdfspecial B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdf
 
The French Revolution Class 9 Study Material pdf free download
The French Revolution Class 9 Study Material pdf free downloadThe French Revolution Class 9 Study Material pdf free download
The French Revolution Class 9 Study Material pdf free download
 
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
 
Chapter 3 - Islamic Banking Products and Services.pptx
Chapter 3 - Islamic Banking Products and Services.pptxChapter 3 - Islamic Banking Products and Services.pptx
Chapter 3 - Islamic Banking Products and Services.pptx
 

Conspiracy Stories: Building Archives to Facilitate Narrative Analyses of Online Fake News

  • 1. Conspiracy Stories: Building Archives to Facilitate Narrative Analyses of Online Fake News Peter Broadwell (@PeterBroadwell) Digital Library Program University of California, Los Angeles IFLA International News Media Conference Reykjavík, Iceland, 27-28 April 2017
  • 2. Peter Broadwell (@PeterBroadwell) - UCLA Digital Library Building Archives to Facilitate Narrative Analyses of Fake News IFLA International News Media Conference, 27 April 2017 Fake news is… 2 1 Jacob L. Nelson. 2017. “Is ‘fake news’ a fake problem?” Columbia Journalism Review, January 31, 2017 • Difficult to define and identify • A badly overused term – especially these days • Prominent on social media: 30% of traffic to fake news sites is from Facebook; which is true for only 8% of legitimate news • Financed by online advertising systems (Google), and maybe some governments • Now seen as a serious issue by journalists and some Internet companies (finally) • Not as dire a problem as some believe?1 • Worthy of further study
  • 3. Peter Broadwell (@PeterBroadwell) - UCLA Digital Library Building Archives to Facilitate Narrative Analyses of Fake News IFLA International News Media Conference, 27 April 2017 3 Gregor Aisch, Jon Huang, Cecilia Kang,“Dissecting the #PizzaGate Conspiracy Theories,” New York Times, December 10, 2016 Analyzing the “shape” of conspiracy stories
  • 4. Peter Broadwell (@PeterBroadwell) - UCLA Digital Library Building Archives to Facilitate Narrative Analyses of Fake News IFLA International News Media Conference, 27 April 2017 Title 4 TR Tangherlini, V Roychowdhury, B Glenn, CM Crespi, R Bandari, A Wadia, M Falahi, E Ebrahimzadeh, R Bastani. 2016. “‘Mommy Blogs’ and the Vaccination Exemption Narrative: Results From A Machine-Learning Approach for Story Aggregation on Parenting Social Media Sites.” JMIR Public Health Surveillance 2:2 (2016).
  • 5. Peter Broadwell (@PeterBroadwell) - UCLA Digital Library Building Archives to Facilitate Narrative Analyses of Fake News IFLA International News Media Conference, 27 April 2017 Don’t believe everything you read online… 5 http://newsroom.ucla.edu/releases/ucla-researchers-teach-computer-to-read-the-internet
  • 6. Peter Broadwell (@PeterBroadwell) - UCLA Digital Library Building Archives to Facilitate Narrative Analyses of Fake News IFLA International News Media Conference, 27 April 2017 6 TR Tangherlini, V Roychowdhury, B Glenn, CM Crespi, R Bandari, A Wadia, M Falahi, E Ebrahimzadeh, R Bastani. 2016. “‘Mommy Blogs’ and the Vaccination Exemption Narrative: Results From A Machine-Learning Approach for Story Aggregation on Parenting Social Media Sites.” JMIR Public Health Surveillance 2:2 (2016). Narrative framework analysis
  • 7. Peter Broadwell (@PeterBroadwell) - UCLA Digital Library Building Archives to Facilitate Narrative Analyses of Fake News IFLA International News Media Conference, 27 April 2017 Building our own “fake news” web archive 7
  • 8. Peter Broadwell (@PeterBroadwell) - UCLA Digital Library Building Archives to Facilitate Narrative Analyses of Fake News IFLA International News Media Conference, 27 April 2017 8 Or… using someone else’s archive?
  • 9. Peter Broadwell (@PeterBroadwell) - UCLA Digital Library Building Archives to Facilitate Narrative Analyses of Fake News IFLA International News Media Conference, 27 April 2017 9 Small-scale case studies: Pizzagate and Bridgegate
  • 10. Peter Broadwell (@PeterBroadwell) - UCLA Digital Library Building Archives to Facilitate Narrative Analyses of Fake News IFLA International News Media Conference, 27 April 2017 Pre-selected “archives” of coverage 10
  • 11. Peter Broadwell (@PeterBroadwell) - UCLA Digital Library Building Archives to Facilitate Narrative Analyses of Fake News IFLA International News Media Conference, 27 April 2017 Web archives: too much data, not enough information 11 In WANE files Extracted by DBpedia Spotlight 3,376 unique names 954 unique Wikipedia entities Christie_(P): 376 Tony_(P): 241 Chris_Christie_(P): 236 David_Wildstein_(P): 190 Bridget_Anne_Kelly_(P): 169 Chris_Christie_(P): 435 David_Wildstein_(P): 263 Bridget_Anne_Kelly_(P): 222 Bill_Baroni_(P): 194 Mark_Sokolich_(P): 130 • 2 seed URLs, 1 from Huffington Post, 1 from NJ Record • Full Archive-It WARCs contain 124,355 unique linked URLs • But 47,398 are adverts/spam, 73,544 likely news articles • Only 415 news articles (163 from HuffPo, 252 from the Record) are relevant to Bridgegate (.5%)
  • 12. Peter Broadwell (@PeterBroadwell) - UCLA Digital Library Building Archives to Facilitate Narrative Analyses of Fake News IFLA International News Media Conference, 27 April 2017 Returning to Pizzagate… 12
  • 13. Peter Broadwell (@PeterBroadwell) - UCLA Digital Library Building Archives to Facilitate Narrative Analyses of Fake News IFLA International News Media Conference, 27 April 2017 13 Gregor Aisch, Jon Huang, Cecilia Kang,“Dissecting the #PizzaGate Conspiracy Theories,” New York Times, December 10, 2016 Can we generate this computationally?
  • 14. Peter Broadwell (@PeterBroadwell) - UCLA Digital Library Building Archives to Facilitate Narrative Analyses of Fake News IFLA International News Media Conference, 27 April 2017 14 A simple Pizzagate network model
  • 15. Peter Broadwell (@PeterBroadwell) - UCLA Digital Library Building Archives to Facilitate Narrative Analyses of Fake News IFLA International News Media Conference, 27 April 2017 15 Towards a useful Pizzagate network model
  • 16. Peter Broadwell (@PeterBroadwell) - UCLA Digital Library Building Archives to Facilitate Narrative Analyses of Fake News IFLA International News Media Conference, 27 April 2017 16 Bridgegate network, highlighting “degree”
  • 17. Peter Broadwell (@PeterBroadwell) - UCLA Digital Library Building Archives to Facilitate Narrative Analyses of Fake News IFLA International News Media Conference, 27 April 2017 17 Bridgegate network, showing “betweenness”
  • 18. Peter Broadwell (@PeterBroadwell) - UCLA Digital Library Building Archives to Facilitate Narrative Analyses of Fake News IFLA International News Media Conference, 27 April 2017 Archiving fake news for research… 18 • Is difficult to do well • Requires constant monitoring and checking of targeted sites, given capabilities of existing tools • Would benefit from coordination between institutions to distribute web-crawling tasks • Entails a great deal of manual and semi-automated content classification and filtering after collection • Calls for the use of more sophisticated tools for named entity resolution, disambiguation and versioning of archival content • Is potentially very worthwhile, despite these issues
  • 19. Peter Broadwell (@PeterBroadwell) - UCLA Digital Library Building Archives to Facilitate Narrative Analyses of Fake News IFLA International News Media Conference, 27 April 2017 Thanks! 19 • Prof. Tim Tangherlini, UCLA Scandinavian Section • Prof. Vwani Roychowdhury, UCLA Electrical Engineering • Ph.D. students, UCLA Electrical Engineering: • Ehsan Ebrahimzadeh • Behnam Shahbazi • Misagh Falahi • Mark Graham, Internet Archive • Karl Blumenthal, Archive-It