Twitter Analysis: Fake News

•

2 likes•1,736 views

1) The document analyzes over 740,000 tweets from the second 2016 presidential debate between Donald Trump and Hillary Clinton to understand how quotes were recorded, interpreted, and shared on Twitter. 2) The tweets were filtered into collections based on memorable quotes from the debate and analyzed to find variations in how the quotes were reported, interpretive biases in the tweets, and sentiment toward the quotes. 3) The results showed that Twitter users often reported quotes with some variation or commentary, word trees illustrating changes to quotes over time, and that sarcastic tweets may have skewed sentiment analysis of the quotes.

Data & Analytics

Twitter Analysis: Fake News
Allison Hegel, Liuqing Li, Dallas Pillen,
Erika Siregar, Melanie Walsh

Our Project
Question
Is it “fake news” to misquote a presidential candidate by just one word? What about two? Three? When
exactly does “fake” news become fake?
Hypothesis
“Fake news” doesn’t only happen from the top down, but also happens at the very first moment of
interpretation, especially when shared on social media networks
Goals
● How Twitter users were recording, interpreting, and sharing the words spoken by Donald Trump and
Hillary Clinton in real time.
● Find out how the “facts” (the accurate transcription of the words) began to evolve into counterfacts
or alternate versions of their words.
● Find out if there is any interpretive bias and emotional valence in the tweets.
2

Dataset
Data Type: Tweets (~740,000 unique tweets)
Source: Social Feed Manager
Time Range: During and immediately after the Second Presidential Debate (10/01/2016)
Search terms: #debate, #debates, #debatenight, #debate2016, #debates2016, @HillaryClinton,
@realDonaldTrump, @debates
3

The Data Processing
Create Collections
Filter the json data based on several memorable debate quotes/topics
Collection 1 Quotes: “That was locker room talk.”
Keywords: locker room, locker-room, lockerroom
Collection 2 Quotes: “Nobody has more respect for women than I do.”
Keywords: respect for women
Collection 3 Quotes: “You would be in the jail.”
Keywords: jail
Collection 4 Quotes: “You need both a public and private position on certain issues.”
Keywords: public position, private position
4

Processing the Data
Pre-processing
change into lowercase
remove hashtags, mentions, URLs
remove stopwords
Tweet Variance
use TF-IDF (scikit-learn) to create the term vectors
calculate the cosine similarity among selected tweets
Sentiment
calculate the sentiment value (nltk.sentiment.vader)
Topic Analysis
create topics in each collection (# of topics: 3, # of words / topic: 8) (gensim)
5

Results
● Word trees showing quote and response
variance
6

Results
First topic in each collection Sentiments in each collection
locker room jail
respect for women
7

Conclusion
● Twitter users were recording, interpreting, and sharing the words spoken by
Donald Trump and Hillary Clinton in real time -- often with some variation or
comment
○ Sarcastic/insincere comments likely skewed sentiment analysis
● Further research would require improving the methods for cleaning the data,
analyzing the ways that quotes changed over a longer period of time, how
those interpretations were reflected in other outlets, and how influential
variances and interpretive biases were in shaping public understanding of
what the candidates said compared to deliberate “fake news”
9

Viewers also liked

Good News/ Bad NewsLulwahMA

Where Can We Post Stories Summarizing Web Archive CollectionsShawn Jones

Rediscovering Missing Web Pages Using Link Neighborhood Lexical SignaturesMartin Klein

How Much of the Web is Archived? JCDL 2011Ahmed AlSum

Robust Linking to Web ResourcesMartin Klein

Persistent Annotations Deserve New URIsalasaadi81

Web Archiving Activities of ODU’s Web Science and Digital Library Research G...Michael Nelson

Viewers also liked (7)

Good News/ Bad News

Where Can We Post Stories Summarizing Web Archive Collections

Rediscovering Missing Web Pages Using Link Neighborhood Lexical Signatures

How Much of the Web is Archived? JCDL 2011

Robust Linking to Web Resources

Persistent Annotations Deserve New URIs

Web Archiving Activities of ODU’s Web Science and Digital Library Research G...

Similar to Twitter Analysis: Fake News

Tweeting for Hillary - DS 501 case study 1Yousef Fadila

Document(2)Sutha Guru

SHORTer VERSION - Liminality and Communitas in Social Media - The case of Twi...Jana Herwig

DH 199 Social Media AnalyticsStephanie Wong

Twitter data analysis using Rsantoshi mangalgi

Twitter: Social Network Or News Medium?Serge Beckers

Data Science Poster FinalJesse Hinson

What Your Tweets Tell Us About You, Speaker NotesKrisKasianovitz

Characterizing microblogsEtico Capital

Accessing and analysing your own social media data.pptxLadduAnanu

Grounded theory meets big data: One way to marry ethnography and digital methodsCitizens in the Making

Outreach Through Social Media | Ocean Sciences 2014Christie Wilcox

Twitter 101Tom Dawkins

CDTW Capstone Presentation Todd Rutherford

Trumping the Polls: Event Analysis During the 2016 Presidential ElectionJinho Choi

Linguistic Cues to Deception: Identifying Political Trolls on Social MediaAseel Addawood

Project Media Essay Spring 2015 Professor BattyProject Med.docxwkyra78

SDSU Osher social media class 2Yadira Galindo

Language of Politics on Twitter - 03 AnalysisYelena Mejova

Similar to Twitter Analysis: Fake News (20)

Tweeting for Hillary - DS 501 case study 1

Document(2)

SHORTer VERSION - Liminality and Communitas in Social Media - The case of Twi...

DH 199 Social Media Analytics

Twitter data analysis using R

Twitter: Social Network Or News Medium?

Data Science Poster Final

What Your Tweets Tell Us About You, Speaker Notes

Characterizing microblogs

Accessing and analysing your own social media data.pptx

Grounded theory meets big data: One way to marry ethnography and digital methods

Outreach Through Social Media | Ocean Sciences 2014

Twitter 101

CDTW Capstone Presentation

Trumping the Polls: Event Analysis During the 2016 Presidential Election

Linguistic Cues to Deception: Identifying Political Trolls on Social Media

Project Media Essay Spring 2015 Professor BattyProject Med.docx

SDSU Osher social media class 2

Language of Politics on Twitter - 03 Analysis

Recently uploaded

Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...amitlee9823

➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men 🔝Dindigul🔝 Escor...amitlee9823

Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...amitlee9823

CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE9953056974 Low Rate Call Girls In Saket, Delhi NCR

➥🔝 7737669865 🔝▻ Sambalpur Call-girls in Women Seeking Men 🔝Sambalpur🔝 Esc...amitlee9823

Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service9953056974 Low Rate Call Girls In Saket, Delhi NCR

VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...SUHANI PANDEY

5CL-ADBA,5cladba, Chinese supplier, safety is guaranteedamy56318795

Discover Why Less is More in B2B Researchmichael115558

➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...amitlee9823

Predicting Loan Approval: A Data Science ProjectBoston Institute of Analytics

Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...ZurliaSoop

Detecting Credit Card Fraud: A Machine Learning ApproachBoston Institute of Analytics

➥🔝 7737669865 🔝▻ Mathura Call-girls in Women Seeking Men 🔝Mathura🔝 Escorts...amitlee9823

Call Girls In Nandini Layout ☎ 7737669865 🥵 Book Your One night Standamitlee9823

Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...amitlee9823

Aspirational Block Program Block Syaldey District - AlmoraGovindSinghDasila

Anomaly detection and data imputation within time seriesParis Women in Machine Learning and Data Science

Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...amitlee9823

Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangaloreamitlee9823

Recently uploaded (20)

Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...

➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men 🔝Dindigul🔝 Escor...

Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...

CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE

➥🔝 7737669865 🔝▻ Sambalpur Call-girls in Women Seeking Men 🔝Sambalpur🔝 Esc...

Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service

VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...

5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed

Discover Why Less is More in B2B Research

➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...

Predicting Loan Approval: A Data Science Project

Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...

Detecting Credit Card Fraud: A Machine Learning Approach

➥🔝 7737669865 🔝▻ Mathura Call-girls in Women Seeking Men 🔝Mathura🔝 Escorts...

Call Girls In Nandini Layout ☎ 7737669865 🥵 Book Your One night Stand

Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...

Aspirational Block Program Block Syaldey District - Almora

Anomaly detection and data imputation within time series

Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...

Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore

Twitter Analysis: Fake News

1. Twitter Analysis: Fake News Allison Hegel, Liuqing Li, Dallas Pillen, Erika Siregar, Melanie Walsh

2. Our Project Question Is it “fake news” to misquote a presidential candidate by just one word? What about two? Three? When exactly does “fake” news become fake? Hypothesis “Fake news” doesn’t only happen from the top down, but also happens at the very first moment of interpretation, especially when shared on social media networks Goals ● How Twitter users were recording, interpreting, and sharing the words spoken by Donald Trump and Hillary Clinton in real time. ● Find out how the “facts” (the accurate transcription of the words) began to evolve into counterfacts or alternate versions of their words. ● Find out if there is any interpretive bias and emotional valence in the tweets. 2

3. Dataset Data Type: Tweets (~740,000 unique tweets) Source: Social Feed Manager Time Range: During and immediately after the Second Presidential Debate (10/01/2016) Search terms: #debate, #debates, #debatenight, #debate2016, #debates2016, @HillaryClinton, @realDonaldTrump, @debates 3

4. The Data Processing Create Collections Filter the json data based on several memorable debate quotes/topics Collection 1 Quotes: “That was locker room talk.” Keywords: locker room, locker-room, lockerroom Collection 2 Quotes: “Nobody has more respect for women than I do.” Keywords: respect for women Collection 3 Quotes: “You would be in the jail.” Keywords: jail Collection 4 Quotes: “You need both a public and private position on certain issues.” Keywords: public position, private position 4

5. Processing the Data Pre-processing change into lowercase remove hashtags, mentions, URLs remove stopwords Tweet Variance use TF-IDF (scikit-learn) to create the term vectors calculate the cosine similarity among selected tweets Sentiment calculate the sentiment value (nltk.sentiment.vader) Topic Analysis create topics in each collection (# of topics: 3, # of words / topic: 8) (gensim) 5

6. Results ● Word trees showing quote and response variance 6

7. Results First topic in each collection Sentiments in each collection locker room jail respect for women 7

8. Most Positive Tweets 8

9. Conclusion ● Twitter users were recording, interpreting, and sharing the words spoken by Donald Trump and Hillary Clinton in real time -- often with some variation or comment ○ Sarcastic/insincere comments likely skewed sentiment analysis ● Further research would require improving the methods for cleaning the data, analyzing the ways that quotes changed over a longer period of time, how those interpretations were reflected in other outlets, and how influential variances and interpretive biases were in shaping public understanding of what the candidates said compared to deliberate “fake news” 9

Twitter Analysis: Fake News

Recommended

Recommended

More Related Content

Viewers also liked

Viewers also liked (7)

Similar to Twitter Analysis: Fake News

Similar to Twitter Analysis: Fake News (20)

Recently uploaded

Recently uploaded (20)

Twitter Analysis: Fake News