SlideShare a Scribd company logo
1 of 1
Download to read offline
Using Tweets to Analyze the 2016 Presidential Election.
Nicklaus Glyder, Mitesh Gurav, Jesse Hinson, Ashwin Kumar, Kunwar Deep Singh Tor, Ravi Teja Kandula
	
Sentiment Analysis of 2016 Election Using Twitter Data
Ashwin Kumar, Jesse Hinson, Nicklaus Glyder, Mitesh Gurav, Ravi Teja Kandula, Kunwar Deep Singh Toor
Goal: Perform sentiment analysis on Tweets collected during the third 2016 Presidential Debate (focused on Clinton and Trump).
Identify and visualize keyword content of this Twitter sentiment data.
Process Flow Diagram:
Use Kafka, Zookeper, and
Twitter API on Palmetto Cluster
Clean Data using NLTK
and Python
Mass Extraction of Tweets.
Sentiment Analysis
using VADER
Creation of Visualizations
using D3.js and AngularJS
Accuracy of VADER sentiment analysis:
We hard coded the sentiment classification of 107 tweets, and analyzed how VADER performed on this training set. By disregarding Tweets with weak
sentiment scores, we were able to achieve 68% accuracy in polarity classification. This shows how Tweets (especially those that are political) are hard to classify
correctly due to intrinsic context such as humor and sarcasm.
Hillary Clinton Donald Trump
Visit http://debate-visualization.herokuapp.com to interact with these and
more visualizations that represent our data!
What is Sentiment Analysis?
The objective of sentiment analysis is to determine the positive/negative emotions
(polarity) associated with given text. We also want to determine extremity of the
sentiment. In other words, how positiver or negative was the sentiment of a tweet?
We used the VADER social media sentiment analyzer, a context-aware analyzer
developed at Georgia Tech.
C.J. Hutto and Eric Gilbert, "VADER: A Parsimonious Rule-based Model for Sentiment Analysis of Social Media Text",
Eighth International Conference on Weblogs and Social Media, Ann Arbor, MI, June 2014.
Motivations
* We wanted to use Tweets as a barometer for how people felt about different issues
during the 2016 Presidential Campaign.
* Social media and micro-blogging like Twitter have become a very popular place to
discuss politics; we were able to collect over 100,000 Tweets.
* Led to some interesting results: "terrorism" was found to be the most negative
word in both cases while "campaign", "women", and "lie" were the most frequently
used words.

More Related Content

What's hot

Grasso Fake News Final Presentation
Grasso Fake News Final PresentationGrasso Fake News Final Presentation
Grasso Fake News Final PresentationBrian Grasso
 
Fake news and partisan bias
Fake news and partisan biasFake news and partisan bias
Fake news and partisan biasJay Van Bavel
 
Loveworks - Google Hangout Presentation with Brian Sheehan
Loveworks - Google Hangout Presentation with Brian SheehanLoveworks - Google Hangout Presentation with Brian Sheehan
Loveworks - Google Hangout Presentation with Brian SheehanMichael Pranikoff
 
Twitter Sentiment and Network Analysis
Twitter Sentiment and Network AnalysisTwitter Sentiment and Network Analysis
Twitter Sentiment and Network AnalysisXudong Brandon Liang
 
Big Data Vizualisation
Big Data VizualisationBig Data Vizualisation
Big Data VizualisationDixon Jones
 
PeekAnalytics Social Audience Report @jeffjarvis
PeekAnalytics Social Audience Report @jeffjarvisPeekAnalytics Social Audience Report @jeffjarvis
PeekAnalytics Social Audience Report @jeffjarvisPeekAnaltyics
 
Blockchain News Analysis
Blockchain News AnalysisBlockchain News Analysis
Blockchain News AnalysisPrateek Singal
 
Social Media Monitoring
Social Media MonitoringSocial Media Monitoring
Social Media MonitoringShane Gibson
 
Fake news? Protect yourself!
Fake news? Protect yourself!Fake news? Protect yourself!
Fake news? Protect yourself!Amy LC
 
Symbolic Convergence Theory
Symbolic Convergence TheorySymbolic Convergence Theory
Symbolic Convergence Theoryemilykathryn818
 

What's hot (12)

Fake news
Fake newsFake news
Fake news
 
Grasso Fake News Final Presentation
Grasso Fake News Final PresentationGrasso Fake News Final Presentation
Grasso Fake News Final Presentation
 
Fake news and partisan bias
Fake news and partisan biasFake news and partisan bias
Fake news and partisan bias
 
Don't Torture Data, Ask Nicely!
Don't Torture Data, Ask Nicely!Don't Torture Data, Ask Nicely!
Don't Torture Data, Ask Nicely!
 
Loveworks - Google Hangout Presentation with Brian Sheehan
Loveworks - Google Hangout Presentation with Brian SheehanLoveworks - Google Hangout Presentation with Brian Sheehan
Loveworks - Google Hangout Presentation with Brian Sheehan
 
Twitter Sentiment and Network Analysis
Twitter Sentiment and Network AnalysisTwitter Sentiment and Network Analysis
Twitter Sentiment and Network Analysis
 
Big Data Vizualisation
Big Data VizualisationBig Data Vizualisation
Big Data Vizualisation
 
PeekAnalytics Social Audience Report @jeffjarvis
PeekAnalytics Social Audience Report @jeffjarvisPeekAnalytics Social Audience Report @jeffjarvis
PeekAnalytics Social Audience Report @jeffjarvis
 
Blockchain News Analysis
Blockchain News AnalysisBlockchain News Analysis
Blockchain News Analysis
 
Social Media Monitoring
Social Media MonitoringSocial Media Monitoring
Social Media Monitoring
 
Fake news? Protect yourself!
Fake news? Protect yourself!Fake news? Protect yourself!
Fake news? Protect yourself!
 
Symbolic Convergence Theory
Symbolic Convergence TheorySymbolic Convergence Theory
Symbolic Convergence Theory
 

Similar to Data Science Poster Final

Twitter Based Sentiment Analysis of Each Presidential Candidate Using Long Sh...
Twitter Based Sentiment Analysis of Each Presidential Candidate Using Long Sh...Twitter Based Sentiment Analysis of Each Presidential Candidate Using Long Sh...
Twitter Based Sentiment Analysis of Each Presidential Candidate Using Long Sh...CSCJournals
 
the complete draft about the CA election time tweets -- awaiting final weedin...
the complete draft about the CA election time tweets -- awaiting final weedin...the complete draft about the CA election time tweets -- awaiting final weedin...
the complete draft about the CA election time tweets -- awaiting final weedin...japokh
 
Using Tweets for Understanding Public Opinion During U.S. Primaries and Predi...
Using Tweets for Understanding Public Opinion During U.S. Primaries and Predi...Using Tweets for Understanding Public Opinion During U.S. Primaries and Predi...
Using Tweets for Understanding Public Opinion During U.S. Primaries and Predi...Monica Powell
 
Are Twitter Users Equal in Predicting Elections? Insights from Republican Pri...
Are Twitter Users Equal in Predicting Elections? Insights from Republican Pri...Are Twitter Users Equal in Predicting Elections? Insights from Republican Pri...
Are Twitter Users Equal in Predicting Elections? Insights from Republican Pri...Artificial Intelligence Institute at UofSC
 
Are Twitter Users Equal in Predicting Elections
Are Twitter Users Equal in Predicting ElectionsAre Twitter Users Equal in Predicting Elections
Are Twitter Users Equal in Predicting ElectionsLu Chen
 
Tweeting for Hillary - DS 501 case study 1
Tweeting for Hillary - DS 501 case study 1Tweeting for Hillary - DS 501 case study 1
Tweeting for Hillary - DS 501 case study 1Yousef Fadila
 
Sentiment analysis of pre elections tweets (general elections)
Sentiment analysis of pre elections tweets (general elections)Sentiment analysis of pre elections tweets (general elections)
Sentiment analysis of pre elections tweets (general elections)sahiba javid
 
Analyzing-Threat-Levels-of-Extremists-using-Tweets
Analyzing-Threat-Levels-of-Extremists-using-TweetsAnalyzing-Threat-Levels-of-Extremists-using-Tweets
Analyzing-Threat-Levels-of-Extremists-using-TweetsRESHAN FARAZ
 
SENTIMENT ANALYSIS OF TWITTER DATA
SENTIMENT ANALYSIS OF TWITTER DATASENTIMENT ANALYSIS OF TWITTER DATA
SENTIMENT ANALYSIS OF TWITTER DATAanargha gangadharan
 
REAL TIME SENTIMENT ANALYSIS OF TWITTER DATA
REAL TIME SENTIMENT ANALYSIS OF TWITTER DATAREAL TIME SENTIMENT ANALYSIS OF TWITTER DATA
REAL TIME SENTIMENT ANALYSIS OF TWITTER DATAMary Lis Joseph
 
SENTIMENT ANALYSIS OF TWITTER DATA
SENTIMENT ANALYSIS OF TWITTER DATASENTIMENT ANALYSIS OF TWITTER DATA
SENTIMENT ANALYSIS OF TWITTER DATAParvathy Devaraj
 
Cloudcroft-S Persuasion and the Presidency PREDICT402-56 W2015
Cloudcroft-S Persuasion and the Presidency PREDICT402-56 W2015Cloudcroft-S Persuasion and the Presidency PREDICT402-56 W2015
Cloudcroft-S Persuasion and the Presidency PREDICT402-56 W2015Sheila Cloudcroft
 
CUS 695 Project Presentation
CUS 695 Project PresentationCUS 695 Project Presentation
CUS 695 Project PresentationAdrian Duran
 
Final impoliteness GlideShah
Final impoliteness GlideShahFinal impoliteness GlideShah
Final impoliteness GlideShahMargaret Glide
 
2016Election Final Draft
2016Election Final Draft2016Election Final Draft
2016Election Final DraftSarah Abel
 

Similar to Data Science Poster Final (20)

Twitter Based Sentiment Analysis of Each Presidential Candidate Using Long Sh...
Twitter Based Sentiment Analysis of Each Presidential Candidate Using Long Sh...Twitter Based Sentiment Analysis of Each Presidential Candidate Using Long Sh...
Twitter Based Sentiment Analysis of Each Presidential Candidate Using Long Sh...
 
the complete draft about the CA election time tweets -- awaiting final weedin...
the complete draft about the CA election time tweets -- awaiting final weedin...the complete draft about the CA election time tweets -- awaiting final weedin...
the complete draft about the CA election time tweets -- awaiting final weedin...
 
Senior Thesis
Senior Thesis Senior Thesis
Senior Thesis
 
Using Tweets for Understanding Public Opinion During U.S. Primaries and Predi...
Using Tweets for Understanding Public Opinion During U.S. Primaries and Predi...Using Tweets for Understanding Public Opinion During U.S. Primaries and Predi...
Using Tweets for Understanding Public Opinion During U.S. Primaries and Predi...
 
Are Twitter Users Equal in Predicting Elections? Insights from Republican Pri...
Are Twitter Users Equal in Predicting Elections? Insights from Republican Pri...Are Twitter Users Equal in Predicting Elections? Insights from Republican Pri...
Are Twitter Users Equal in Predicting Elections? Insights from Republican Pri...
 
Are Twitter Users Equal in Predicting Elections
Are Twitter Users Equal in Predicting ElectionsAre Twitter Users Equal in Predicting Elections
Are Twitter Users Equal in Predicting Elections
 
Kottler Thesis 2011
Kottler Thesis 2011Kottler Thesis 2011
Kottler Thesis 2011
 
Tweeting for Hillary - DS 501 case study 1
Tweeting for Hillary - DS 501 case study 1Tweeting for Hillary - DS 501 case study 1
Tweeting for Hillary - DS 501 case study 1
 
Sentiment analysis of pre elections tweets (general elections)
Sentiment analysis of pre elections tweets (general elections)Sentiment analysis of pre elections tweets (general elections)
Sentiment analysis of pre elections tweets (general elections)
 
Analyzing-Threat-Levels-of-Extremists-using-Tweets
Analyzing-Threat-Levels-of-Extremists-using-TweetsAnalyzing-Threat-Levels-of-Extremists-using-Tweets
Analyzing-Threat-Levels-of-Extremists-using-Tweets
 
SENTIMENT ANALYSIS OF TWITTER DATA
SENTIMENT ANALYSIS OF TWITTER DATASENTIMENT ANALYSIS OF TWITTER DATA
SENTIMENT ANALYSIS OF TWITTER DATA
 
REAL TIME SENTIMENT ANALYSIS OF TWITTER DATA
REAL TIME SENTIMENT ANALYSIS OF TWITTER DATAREAL TIME SENTIMENT ANALYSIS OF TWITTER DATA
REAL TIME SENTIMENT ANALYSIS OF TWITTER DATA
 
SENTIMENT ANALYSIS OF TWITTER DATA
SENTIMENT ANALYSIS OF TWITTER DATASENTIMENT ANALYSIS OF TWITTER DATA
SENTIMENT ANALYSIS OF TWITTER DATA
 
Cloudcroft-S Persuasion and the Presidency PREDICT402-56 W2015
Cloudcroft-S Persuasion and the Presidency PREDICT402-56 W2015Cloudcroft-S Persuasion and the Presidency PREDICT402-56 W2015
Cloudcroft-S Persuasion and the Presidency PREDICT402-56 W2015
 
INST633_FinalProject
INST633_FinalProjectINST633_FinalProject
INST633_FinalProject
 
Project_Report
Project_ReportProject_Report
Project_Report
 
CUS 695 Project Presentation
CUS 695 Project PresentationCUS 695 Project Presentation
CUS 695 Project Presentation
 
Document(2)
Document(2)Document(2)
Document(2)
 
Final impoliteness GlideShah
Final impoliteness GlideShahFinal impoliteness GlideShah
Final impoliteness GlideShah
 
2016Election Final Draft
2016Election Final Draft2016Election Final Draft
2016Election Final Draft
 

Data Science Poster Final

  • 1. Using Tweets to Analyze the 2016 Presidential Election. Nicklaus Glyder, Mitesh Gurav, Jesse Hinson, Ashwin Kumar, Kunwar Deep Singh Tor, Ravi Teja Kandula Sentiment Analysis of 2016 Election Using Twitter Data Ashwin Kumar, Jesse Hinson, Nicklaus Glyder, Mitesh Gurav, Ravi Teja Kandula, Kunwar Deep Singh Toor Goal: Perform sentiment analysis on Tweets collected during the third 2016 Presidential Debate (focused on Clinton and Trump). Identify and visualize keyword content of this Twitter sentiment data. Process Flow Diagram: Use Kafka, Zookeper, and Twitter API on Palmetto Cluster Clean Data using NLTK and Python Mass Extraction of Tweets. Sentiment Analysis using VADER Creation of Visualizations using D3.js and AngularJS Accuracy of VADER sentiment analysis: We hard coded the sentiment classification of 107 tweets, and analyzed how VADER performed on this training set. By disregarding Tweets with weak sentiment scores, we were able to achieve 68% accuracy in polarity classification. This shows how Tweets (especially those that are political) are hard to classify correctly due to intrinsic context such as humor and sarcasm. Hillary Clinton Donald Trump Visit http://debate-visualization.herokuapp.com to interact with these and more visualizations that represent our data! What is Sentiment Analysis? The objective of sentiment analysis is to determine the positive/negative emotions (polarity) associated with given text. We also want to determine extremity of the sentiment. In other words, how positiver or negative was the sentiment of a tweet? We used the VADER social media sentiment analyzer, a context-aware analyzer developed at Georgia Tech. C.J. Hutto and Eric Gilbert, "VADER: A Parsimonious Rule-based Model for Sentiment Analysis of Social Media Text", Eighth International Conference on Weblogs and Social Media, Ann Arbor, MI, June 2014. Motivations * We wanted to use Tweets as a barometer for how people felt about different issues during the 2016 Presidential Campaign. * Social media and micro-blogging like Twitter have become a very popular place to discuss politics; we were able to collect over 100,000 Tweets. * Led to some interesting results: "terrorism" was found to be the most negative word in both cases while "campaign", "women", and "lie" were the most frequently used words.