Mining public opinion about
economic issues
Twitter and the U.S. Presidential Election
Ivan Abboud – Wajdy Al-Jaber
Source : https://arxiv.org/pdf/1804.03540.pdf
Introduction
• Opinion polls have been the bridge between public
opinion and politicians in elections
• Social media provides a platform for collecting
large amounts of data
• This paper proposes a computational public
opinion mining approach to explore the discussion
of economic issues in social media during an
election
Opinion Mining
• An “opinion poll is a
type of survey or inquiry
designed to measure the
public’s views regarding
a particular topic or
series of topics”
• face-to-face interviews
• phone interviews
• surveys sent by mail or email or
available online
Opinion Mining
• Opinion polls encourage
political campaigns to track
polls and surveys for possible
changes in public relations
strategies
• Among different technologies,
social media plays the role of
a big focus group in providing
feedback during an election
cycle
Opinion Mining
• Before 2016, the 2012 presidential campaigns of Barack
Obama and Mitt Romney represented the most data-driven
election cycle in history
• The Obama and Romney campaigns spent $52 million and
$26 million on advertising in modern social media,
respectively
• In that election, 40% of U.S. adults engaged politically with
social media.
• 38% of social media users shared and followed political
news
• 20% of the users followed politicians on social media
Opinion Mining
• People share their feelings and
opinions on Twitter on such a
large scale that it can be used
for research
• Collecting and analyzing
Twitter data is a cost-effective
way to survey a large number
of participants in a short
period of time
• This study proposes an economics-based
opinion mining approach to analyze
election-related tweets, to gather
positive and negative economic
feedback within them, and better
understand public opinion on economic
issues.
• The proposed approach applies a
combination of sentiment analysis and
topic modeling methods on millions of
tweets during the 2012 U.S. presidential
election.
(Barack Obama vs. Mitt Romney)
Methodology
This paper proposes an economics-based public
opinion mining approach with four components:
1 - Data Collection
2 - Sentiment Analysis
3 - Topic Discovery
4 - Analysis
Data Collection
Twitter data can be collected with
APIs (Application Programming
Interfaces).
The APIs expose different kinds of
Twitter data for a user, such as tweets, number of followers,
and favorite tweets.
To retrieve the relevant tweets from such a large volume, a set of
related query terms is needed.
Data Collection
Data collection is followed by a cleaning step that removes
stopwords, such as “the,” that carry little semantic
value.
Pipeline: Queries → Raw Tweets → Cleaned Tweets
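As a rough illustration of the cleaning step, the sketch below lowercases a tweet, splits it into tokens, and drops stopwords. The stopword list, the tokenizing regular expression, and the function name `clean_tweet` are assumptions made for this example; the slides do not say which list or tokenizer the authors used.

```python
import re

# Hypothetical, deliberately tiny stopword list; real lists are much longer.
STOPWORDS = {"the", "a", "an", "is", "of", "to", "and", "in", "on"}

# Keep word characters plus the leading @ or # used by Twitter handles and hashtags.
TOKEN_PATTERN = re.compile(r"[@#]?\w+")


def clean_tweet(text):
    """Lowercase a tweet, tokenize it, and remove stopwords."""
    tokens = TOKEN_PATTERN.findall(text.lower())
    return [tok for tok in tokens if tok not in STOPWORDS]


if __name__ == "__main__":
    print(clean_tweet("The economy is on the mind of #Obama voters"))
```

Keeping the @ and # prefixes is a design choice for this sketch, so that handles and hashtags stay distinguishable from ordinary words.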
Data Collection
Queries used for filtering tweets
Candidate       Queries
Barack Obama    barack obama, @barackobama, #barackobama, #obama
Mitt Romney     mitt romney, @mittromney, #mittromney, #romney
The data for this research were collected from September 29, 2012,
to November 16, 2012.
The dataset has 24 million tweets related to the presidential
candidates, Barack Obama and Mitt Romney.
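The query terms from the slide can be applied as a simple offline filter, sketched below. Case-insensitive substring matching and the function name `candidates_mentioned` are assumptions for illustration; the original collection ran against the Twitter APIs, whose matching rules may differ.

```python
# Query terms listed on the slide, grouped by candidate.
QUERIES = {
    "Barack Obama": ["barack obama", "@barackobama", "#barackobama", "#obama"],
    "Mitt Romney": ["mitt romney", "@mittromney", "#mittromney", "#romney"],
}


def candidates_mentioned(tweet):
    """Return the candidates whose query terms appear in the tweet (case-insensitive)."""
    text = tweet.lower()
    return [name for name, terms in QUERIES.items()
            if any(term in text for term in terms)]


if __name__ == "__main__":
    print(candidates_mentioned("Watching #Obama and @MittRomney debate taxes"))
```

A tweet can match both candidates, so the function returns a list rather than a single name.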
Sentiment Analysis
The second step in the proposed approach is sentiment analysis.
Sentiment analysis is concerned with extracting
emotions and opinions from text.
Two main methods can be used for this step:
• Learning-based approaches
• Lexicon-based approaches
Sentiment Analysis
Machine Learning Approach
Sentiment Analysis
The problem with machine learning approaches is that they
need a training data set, which means data that has been
labeled by human raters.
They also need prior knowledge about the data categories.
Sentiment Analysis
Lexicon-based Approaches
These count how often terms from a predefined dictionary of positive
and negative words occur, to reveal sentiment in the data when
there is no prior knowledge about its categories.
This study used the lexicon-based approach
because there was no prior knowledge about the
data categories.
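A minimal sketch of the lexicon-based idea: count the tokens that appear in a positive word list and in a negative word list, then compare the two counts. The word lists here are made up for illustration (they are not the LIWC dictionary), and the "larger count wins" rule is an assumption; the slides do not state the exact decision rule used in the study.

```python
# Hypothetical mini-lexicon for illustration only; it is not the LIWC dictionary.
POSITIVE = {"good", "great", "hope", "win", "support"}
NEGATIVE = {"bad", "fail", "worse", "crisis", "lose"}


def label_sentiment(tokens):
    """Label a tokenized tweet by comparing positive and negative term counts."""
    pos = sum(1 for tok in tokens if tok in POSITIVE)
    neg = sum(1 for tok in tokens if tok in NEGATIVE)
    if pos > neg:
        return "positive"
    if neg > pos:
        return "negative"
    return "neutral"


if __name__ == "__main__":
    print(label_sentiment(["great", "jobs", "plan"]))
```

No labeled training data is needed, which is the property that motivates the lexicon-based choice.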
Sentiment Analysis
Lexicon-based Approaches
Linguistic Inquiry and Word Count (LIWC) is one of the most widely used tools for text
analysis.
It reads a given text and calculates the percentage of words that reflect
different emotions, thinking styles, social concerns, and even parts of speech.
After the processing module has read and accounted for all words in a given
text, it calculates the percentage of total words that match each of the
dictionary categories.
For example, if LIWC analyzed a single speech that was 2,000 words and
compared them to the built-in dictionary, it might find that there were 150
pronouns and 84 positive emotion words used. It would convert these numbers
to percentages, 7.5% pronouns and 4.2% positive emotion words.
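The arithmetic in this example can be reproduced in a few lines. The function name `category_percentages` is hypothetical and this is not LIWC's own code; it only shows how raw category counts become percentages of total words.

```python
def category_percentages(category_counts, total_words):
    """Convert raw category counts into percentages of the total word count."""
    return {category: 100.0 * count / total_words
            for category, count in category_counts.items()}


# Counts taken from the 2,000-word speech example above.
speech = category_percentages({"pronouns": 150, "positive emotion": 84}, 2000)

if __name__ == "__main__":
    for category, pct in speech.items():
        print(f"{category}: {pct:.1f}%")
```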
Sentiment Analysis
Lexicon-based Approaches
The dictionary is composed of
almost 6,400 words, word
stems, and selected
emoticons. For each
dictionary word, there is a
corresponding dictionary
entry that defines one or
more word categories.
Example: the word “cried” falls into five categories: Sadness, Negative Emotion, Overall Affect, Verb, and Past Focus.
Sentiment Analysis
We filter the data to positive and
negative tweets with respect to each
candidate
[Figure: bar chart of the number of positive and negative tweets for Barack Obama and Mitt Romney. Bar labels as extracted: 4,549,496; 2,773,933; 3,075,592; 2,396,873.]
This analysis shows that Obama has
the advantage based on the difference
between positive and negative tweets
Topic Modeling
Topic modeling is a type of statistical modeling for
discovering the abstract “topics” that occur in a
collection of documents.
Topic Modeling
LDA or latent Dirichlet allocation is a “generative probabilistic
model” of a collection of documents made up of words.
The probabilistic topic model estimated by LDA consists of two
tables (matrices):
1 - The first table describes the probability or chance of selecting
a particular word when sampling a particular topic (category).
2- The second table describes the chance of selecting a
particular topic when sampling a particular document or
composite.
Topic Modeling
Example: bag-of-words
          Budget  Tax  Employee
Tweet 0   10      0    0
Tweet 1   0       10   0
Tweet 2   0       0    10
Tweet 3   10      10   10
After running LDA, we end up with two tables:
Word vs. topic probabilistic model
          Topic1  Topic2  Topic3
Budget    0       0       0.999
Tax       0.999   0       0
Employee  0       0.999   0
Tweet vs. topic probabilistic model
          Topic1  Topic2  Topic3
Tweet0    0       0       0.93
Tweet1    0.93    0       0
Tweet2    0       0.93    0
Tweet3    0.333   0.333   0.333
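One way to read the two tables together: under LDA, the probability of seeing a word in a tweet is the sum over topics of P(topic | tweet) × P(word | topic). The sketch below applies that to the numbers from this example. The variable and function names are hypothetical, and the values are the rounded figures from the slide, so rows do not sum exactly to 1.

```python
# Tweet-vs-topic table from the example (columns: Topic1, Topic2, Topic3).
TOPIC_GIVEN_TWEET = {
    "Tweet0": [0.0, 0.0, 0.93],
    "Tweet1": [0.93, 0.0, 0.0],
    "Tweet2": [0.0, 0.93, 0.0],
    "Tweet3": [0.333, 0.333, 0.333],
}

# Word-vs-topic table from the example (columns: Topic1, Topic2, Topic3).
WORD_GIVEN_TOPIC = {
    "Budget": [0.0, 0.0, 0.999],
    "Tax": [0.999, 0.0, 0.0],
    "Employee": [0.0, 0.999, 0.0],
}


def word_probability(tweet, word):
    """P(word | tweet) = sum over topics of P(topic | tweet) * P(word | topic)."""
    return sum(p_topic * p_word
               for p_topic, p_word in zip(TOPIC_GIVEN_TWEET[tweet],
                                          WORD_GIVEN_TOPIC[word]))


if __name__ == "__main__":
    for tweet in TOPIC_GIVEN_TWEET:
        row = {word: round(word_probability(tweet, word), 3)
               for word in WORD_GIVEN_TOPIC}
        print(tweet, row)
```

The result mirrors the bag-of-words input: Tweet0 is dominated by "Budget", while Tweet3 spreads its probability roughly evenly over the three words.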
Topic Modeling
LDA is a generative model that has
two hyperparameters to be tuned:
α and β
Topic Modeling
Alpha controls
the mixture of topics
for any given
document. Turn it
down and
documents will likely
be dominated by
fewer topics.
Turn it up and
documents will likely
have more of a
mixture of topics.
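The effect of alpha can be seen by sampling topic mixtures directly from a symmetric Dirichlet distribution, which is the prior LDA places on each document's topic proportions. This is a sketch using only the standard library: a Dirichlet draw is built by normalizing independent gamma draws. The function names, the number of topics, the sample size, and the seed are all arbitrary choices for illustration.

```python
import random


def sample_topic_mixture(alpha, num_topics, rng):
    """Draw one topic mixture from a symmetric Dirichlet(alpha) distribution."""
    draws = [rng.gammavariate(alpha, 1.0) for _ in range(num_topics)]
    total = sum(draws)
    return [d / total for d in draws]


def average_top_share(alpha, num_topics=3, samples=2000, seed=0):
    """Average weight of the dominant topic across sampled documents."""
    rng = random.Random(seed)
    return sum(max(sample_topic_mixture(alpha, num_topics, rng))
               for _ in range(samples)) / samples


if __name__ == "__main__":
    for alpha in (0.1, 1.0, 10.0):
        print(f"alpha={alpha}: dominant topic share ~ {average_top_share(alpha):.2f}")
```

With a small alpha the dominant topic takes most of each document; with a large alpha the weights move toward an even split across topics.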
Topic Modeling
People posted tweets about different issues (topics)
during the 2012 election, but the focus of this research is
on the main economic issues including:
• Economy in General
• Jobs
• Budget Deficit
• Healthcare
• Tax
DPNT: the difference between the number of positive topics
and the number of negative topics for an issue.
DPNT indicates the overall feedback status.
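DPNT is a per-issue subtraction, sketched below with the bar values from the "Romney's Results" chart in this deck. Two things are assumptions here: that the first series of extracted values holds the positive counts and the second the negative counts, and that the values follow the issue order Economy, Jobs, Budget Deficit, Healthcare, Tax. The extracted chart legend does not make either explicit.

```python
ISSUES = ["Economy", "Jobs", "Budget Deficit", "Healthcare", "Tax"]

# Bar values from the "Romney's Results" chart. Treating the first series as
# positive topic counts and the second as negative counts is an assumption.
ROMNEY_POSITIVE = [19, 22, 3, 18, 21]
ROMNEY_NEGATIVE = [25, 31, 9, 14, 31]


def dpnt(positive, negative):
    """DPNT per issue: number of positive topics minus number of negative topics."""
    return [p - n for p, n in zip(positive, negative)]


romney_dpnt = dict(zip(ISSUES, dpnt(ROMNEY_POSITIVE, ROMNEY_NEGATIVE)))

if __name__ == "__main__":
    for issue, value in romney_dpnt.items():
        print(f"{issue}: {value}")
```

Under these assumptions the only positive value is for Healthcare, which agrees with the statement in the results that Romney has a single positive DPNT.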
Results
After applying topic modeling to the positive and negative tweets, filtering the
topics down to the five economic issues, and calculating the DPNT value for
each issue for both candidates, we obtained these results:
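[Figure: Obama’s Results. Bar chart of the number of positive and negative topics per issue (Economy, Jobs, Budget Deficit, Healthcare, Tax). Bar labels as extracted: 13, 34, 4, 11, 8 and 18, 24, 3, 4, 13.]
[Figure: Romney’s Results. Bar chart of the number of positive and negative topics per issue (Economy, Jobs, Budget Deficit, Healthcare, Tax). Bar labels as extracted: 19, 22, 3, 18, 21 and 25, 31, 9, 14, 31.]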
Results
[Figure: Obama vs. Romney (DPNT). Bar chart of DPNT per issue (Economy, Jobs, Budget Deficit, Healthcare, Tax) for each candidate. Bar labels as extracted: -5, 10, 7, 1, -5 and -6, -9, -6, 4, -10.]
Obama has three positive DPNTs, with the highest for the jobs issue.
Romney has just one positive DPNT, for the healthcare issue.
Although Obama has two negative DPNTs, for the economy in general and tax issues, he
has the advantage on all the economic issues based on DPNT value.
Conclusion
The final election results show
that Obama won decisively,
with an advantage of more than 3 million popular
votes and more than 120
electoral votes over
Romney
In line with the final results, our
analysis indicates that the winner
had the advantage on the most
important issues (economic
issues) in the election
Conclusion
The results show that jobs and
taxes were the most and the
least important issues,
respectively, for the followers of
the two candidates. Although the
overall ranking of the issues for
each candidate is very close,
DPNT values show Obama having
the advantage on all the
economic issues
Thank you!
Ivan Abboud – Wajdy Al-Jaber Questions??
The End
Thanks for listening
Source : https://arxiv.org/pdf/1804.03540.pdf

More Related Content

What's hot

Sentiment mining- The Design and Implementation of an Internet Public Opinion...
Sentiment mining- The Design and Implementation of an Internet PublicOpinion...Sentiment mining- The Design and Implementation of an Internet PublicOpinion...
Sentiment mining- The Design and Implementation of an Internet Public Opinion...Prateek Singh
 
Sentiment analysis
Sentiment analysisSentiment analysis
Sentiment analysisAmenda Joy
 
IRJET - Election Result Prediction using Sentiment Analysis
IRJET - Election Result Prediction using Sentiment AnalysisIRJET - Election Result Prediction using Sentiment Analysis
IRJET - Election Result Prediction using Sentiment AnalysisIRJET Journal
 
Twitter sentimentanalysis report
Twitter sentimentanalysis reportTwitter sentimentanalysis report
Twitter sentimentanalysis reportSavio Aberneithie
 
Managerialstatistics
ManagerialstatisticsManagerialstatistics
ManagerialstatisticsDivyesh Varshney
 
Opinion Mining
Opinion MiningOpinion Mining
Opinion MiningAli Habeeb
 
Twitter Analytics
Twitter AnalyticsTwitter Analytics
Twitter AnalyticsStephen Dann
 
Sentiment Analysis
Sentiment AnalysisSentiment Analysis
Sentiment AnalysisRexNige
 
Project report
Project reportProject report
Project reportUtkarsh Soni
 
Text Analytics Past, Present & Future: An Industry View
Text Analytics Past, Present & Future: An Industry ViewText Analytics Past, Present & Future: An Industry View
Text Analytics Past, Present & Future: An Industry ViewSeth Grimes
 
A review of sentiment analysis approaches in big
A review of sentiment analysis approaches in bigA review of sentiment analysis approaches in big
A review of sentiment analysis approaches in bigNurfadhlina Mohd Sharef
 
Analyzing-Threat-Levels-of-Extremists-using-Tweets
Analyzing-Threat-Levels-of-Extremists-using-TweetsAnalyzing-Threat-Levels-of-Extremists-using-Tweets
Analyzing-Threat-Levels-of-Extremists-using-TweetsRESHAN FARAZ
 
Product Sentiment Analysis
Product Sentiment AnalysisProduct Sentiment Analysis
Product Sentiment Analysisnancy amala
 
New sentiment analysis of tweets using python by Ravi kumar
New sentiment analysis of tweets using python by Ravi kumarNew sentiment analysis of tweets using python by Ravi kumar
New sentiment analysis of tweets using python by Ravi kumarRavi Kumar
 
Ontology based sentiment analysis
Ontology based sentiment analysisOntology based sentiment analysis
Ontology based sentiment analysisprathako
 
sentiment analysis text extraction from social media
sentiment  analysis text extraction from social media sentiment  analysis text extraction from social media
sentiment analysis text extraction from social media Ravindra Chaudhary
 
Politycheck - Political Ideology Detection (Natural Language Processing Chall...
Politycheck - Political Ideology Detection (Natural Language Processing Chall...Politycheck - Political Ideology Detection (Natural Language Processing Chall...
Politycheck - Political Ideology Detection (Natural Language Processing Chall...Hamza Mahmood
 

What's hot (20)

Sentiment mining- The Design and Implementation of an Internet Public Opinion...
Sentiment mining- The Design and Implementation of an Internet PublicOpinion...Sentiment mining- The Design and Implementation of an Internet PublicOpinion...
Sentiment mining- The Design and Implementation of an Internet Public Opinion...
 
SentiCheNews - Sentiment Analysis on Newspapers and Tweets
SentiCheNews - Sentiment Analysis on Newspapers and TweetsSentiCheNews - Sentiment Analysis on Newspapers and Tweets
SentiCheNews - Sentiment Analysis on Newspapers and Tweets
 
Sentiment analysis
Sentiment analysisSentiment analysis
Sentiment analysis
 
IRJET - Election Result Prediction using Sentiment Analysis
IRJET - Election Result Prediction using Sentiment AnalysisIRJET - Election Result Prediction using Sentiment Analysis
IRJET - Election Result Prediction using Sentiment Analysis
 
Twitter sentimentanalysis report
Twitter sentimentanalysis reportTwitter sentimentanalysis report
Twitter sentimentanalysis report
 
Managerialstatistics
ManagerialstatisticsManagerialstatistics
Managerialstatistics
 
Opinion Mining
Opinion MiningOpinion Mining
Opinion Mining
 
Twitter Analytics
Twitter AnalyticsTwitter Analytics
Twitter Analytics
 
7 rm technical writing
7 rm technical writing7 rm technical writing
7 rm technical writing
 
Sentiment Analysis
Sentiment AnalysisSentiment Analysis
Sentiment Analysis
 
Project report
Project reportProject report
Project report
 
Final deck
Final deckFinal deck
Final deck
 
Text Analytics Past, Present & Future: An Industry View
Text Analytics Past, Present & Future: An Industry ViewText Analytics Past, Present & Future: An Industry View
Text Analytics Past, Present & Future: An Industry View
 
A review of sentiment analysis approaches in big
A review of sentiment analysis approaches in bigA review of sentiment analysis approaches in big
A review of sentiment analysis approaches in big
 
Analyzing-Threat-Levels-of-Extremists-using-Tweets
Analyzing-Threat-Levels-of-Extremists-using-TweetsAnalyzing-Threat-Levels-of-Extremists-using-Tweets
Analyzing-Threat-Levels-of-Extremists-using-Tweets
 
Product Sentiment Analysis
Product Sentiment AnalysisProduct Sentiment Analysis
Product Sentiment Analysis
 
New sentiment analysis of tweets using python by Ravi kumar
New sentiment analysis of tweets using python by Ravi kumarNew sentiment analysis of tweets using python by Ravi kumar
New sentiment analysis of tweets using python by Ravi kumar
 
Ontology based sentiment analysis
Ontology based sentiment analysisOntology based sentiment analysis
Ontology based sentiment analysis
 
sentiment analysis text extraction from social media
sentiment  analysis text extraction from social media sentiment  analysis text extraction from social media
sentiment analysis text extraction from social media
 
Politycheck - Political Ideology Detection (Natural Language Processing Chall...
Politycheck - Political Ideology Detection (Natural Language Processing Chall...Politycheck - Political Ideology Detection (Natural Language Processing Chall...
Politycheck - Political Ideology Detection (Natural Language Processing Chall...
 

Similar to Mining public opinion about economic issues

591 Final Report - Team 7 - Political Issues
591 Final Report - Team 7 - Political Issues591 Final Report - Team 7 - Political Issues
591 Final Report - Team 7 - Political IssuesTim Sawicki
 
Entity-oriented sentiment analysis of tweets: results and problems
Entity-oriented sentiment analysis of tweets: results and problemsEntity-oriented sentiment analysis of tweets: results and problems
Entity-oriented sentiment analysis of tweets: results and problemsYuliya Rubtsova
 
Twitter data analysis using R
Twitter data analysis using RTwitter data analysis using R
Twitter data analysis using Rsantoshi mangalgi
 
Evaluation Of Research Methods And Data Collection A...
Evaluation Of Research Methods And Data Collection A...Evaluation Of Research Methods And Data Collection A...
Evaluation Of Research Methods And Data Collection A...Ashley Thomas
 
A Review of machine learning approaches to mine Social Choice of voters.
A Review of machine learning approaches to mine Social Choice of voters.A Review of machine learning approaches to mine Social Choice of voters.
A Review of machine learning approaches to mine Social Choice of voters.IRJET Journal
 
1 Crore Projects | ieee 2016 Projects | 2016 ieee Projects in chennai
1 Crore Projects | ieee 2016 Projects | 2016 ieee Projects in chennai1 Crore Projects | ieee 2016 Projects | 2016 ieee Projects in chennai
1 Crore Projects | ieee 2016 Projects | 2016 ieee Projects in chennai1crore projects
 
State of the Union
State of the UnionState of the Union
State of the Unionjtierney
 
SENTIMENT ANALYSIS OF TWITTER DATA
SENTIMENT ANALYSIS OF TWITTER DATASENTIMENT ANALYSIS OF TWITTER DATA
SENTIMENT ANALYSIS OF TWITTER DATAanargha gangadharan
 
REAL TIME SENTIMENT ANALYSIS OF TWITTER DATA
REAL TIME SENTIMENT ANALYSIS OF TWITTER DATAREAL TIME SENTIMENT ANALYSIS OF TWITTER DATA
REAL TIME SENTIMENT ANALYSIS OF TWITTER DATAMary Lis Joseph
 
Sentiment analysis by using fuzzy logic
Sentiment analysis by using fuzzy logicSentiment analysis by using fuzzy logic
Sentiment analysis by using fuzzy logicijcseit
 
International Journal of Computer Science, Engineering and Information Techno...
International Journal of Computer Science, Engineering and Information Techno...International Journal of Computer Science, Engineering and Information Techno...
International Journal of Computer Science, Engineering and Information Techno...ijcseit
 
Sentiment Analysis using Fuzzy logic
Sentiment Analysis using Fuzzy logicSentiment Analysis using Fuzzy logic
Sentiment Analysis using Fuzzy logicVinay Sawant
 
SENTIMENT ANALYSIS BY USING FUZZY LOGIC
SENTIMENT ANALYSIS BY USING FUZZY LOGICSENTIMENT ANALYSIS BY USING FUZZY LOGIC
SENTIMENT ANALYSIS BY USING FUZZY LOGICijcseit
 
Mediawave, social media monitoring & data analytics
Mediawave, social media monitoring & data analyticsMediawave, social media monitoring & data analytics
Mediawave, social media monitoring & data analyticsDwi Wahyono
 
American inter continental university busn 311 individual project
American inter continental university busn 311 individual projectAmerican inter continental university busn 311 individual project
American inter continental university busn 311 individual projectChristina Walkar
 
Part 1 Individual Factors Affecting Voter Turnout Based on .docx
Part 1 Individual Factors Affecting Voter Turnout Based on .docxPart 1 Individual Factors Affecting Voter Turnout Based on .docx
Part 1 Individual Factors Affecting Voter Turnout Based on .docxdanhaley45372
 
Social Media Dashboarding (reporting)
Social Media Dashboarding (reporting)Social Media Dashboarding (reporting)
Social Media Dashboarding (reporting)Scott K. Wilder
 
Social Media Dashboarding by Scott Wilder and semphonic
Social Media Dashboarding by Scott Wilder and semphonicSocial Media Dashboarding by Scott Wilder and semphonic
Social Media Dashboarding by Scott Wilder and semphonicEdelman Digital
 
Project for executive summary v2
Project for executive summary v2Project for executive summary v2
Project for executive summary v200000000A1
 

Similar to Mining public opinion about economic issues (20)

591 Final Report - Team 7 - Political Issues
591 Final Report - Team 7 - Political Issues591 Final Report - Team 7 - Political Issues
591 Final Report - Team 7 - Political Issues
 
Entity-oriented sentiment analysis of tweets: results and problems
Entity-oriented sentiment analysis of tweets: results and problemsEntity-oriented sentiment analysis of tweets: results and problems
Entity-oriented sentiment analysis of tweets: results and problems
 
Twitter data analysis using R
Twitter data analysis using RTwitter data analysis using R
Twitter data analysis using R
 
Evaluation Of Research Methods And Data Collection A...
Evaluation Of Research Methods And Data Collection A...Evaluation Of Research Methods And Data Collection A...
Evaluation Of Research Methods And Data Collection A...
 
A Review of machine learning approaches to mine Social Choice of voters.
A Review of machine learning approaches to mine Social Choice of voters.A Review of machine learning approaches to mine Social Choice of voters.
A Review of machine learning approaches to mine Social Choice of voters.
 
Abstract
AbstractAbstract
Abstract
 
1 Crore Projects | ieee 2016 Projects | 2016 ieee Projects in chennai
1 Crore Projects | ieee 2016 Projects | 2016 ieee Projects in chennai1 Crore Projects | ieee 2016 Projects | 2016 ieee Projects in chennai
1 Crore Projects | ieee 2016 Projects | 2016 ieee Projects in chennai
 
State of the Union
State of the UnionState of the Union
State of the Union
 
SENTIMENT ANALYSIS OF TWITTER DATA
SENTIMENT ANALYSIS OF TWITTER DATASENTIMENT ANALYSIS OF TWITTER DATA
SENTIMENT ANALYSIS OF TWITTER DATA
 
REAL TIME SENTIMENT ANALYSIS OF TWITTER DATA
REAL TIME SENTIMENT ANALYSIS OF TWITTER DATAREAL TIME SENTIMENT ANALYSIS OF TWITTER DATA
REAL TIME SENTIMENT ANALYSIS OF TWITTER DATA
 
Sentiment analysis by using fuzzy logic
Sentiment analysis by using fuzzy logicSentiment analysis by using fuzzy logic
Sentiment analysis by using fuzzy logic
 
International Journal of Computer Science, Engineering and Information Techno...
International Journal of Computer Science, Engineering and Information Techno...International Journal of Computer Science, Engineering and Information Techno...
International Journal of Computer Science, Engineering and Information Techno...
 
Sentiment Analysis using Fuzzy logic
Sentiment Analysis using Fuzzy logicSentiment Analysis using Fuzzy logic
Sentiment Analysis using Fuzzy logic
 
SENTIMENT ANALYSIS BY USING FUZZY LOGIC
SENTIMENT ANALYSIS BY USING FUZZY LOGICSENTIMENT ANALYSIS BY USING FUZZY LOGIC
SENTIMENT ANALYSIS BY USING FUZZY LOGIC
 
Mediawave, social media monitoring & data analytics
Mediawave, social media monitoring & data analyticsMediawave, social media monitoring & data analytics
Mediawave, social media monitoring & data analytics
 
American inter continental university busn 311 individual project
American inter continental university busn 311 individual projectAmerican inter continental university busn 311 individual project
American inter continental university busn 311 individual project
 
Part 1 Individual Factors Affecting Voter Turnout Based on .docx
Part 1 Individual Factors Affecting Voter Turnout Based on .docxPart 1 Individual Factors Affecting Voter Turnout Based on .docx
Part 1 Individual Factors Affecting Voter Turnout Based on .docx
 
Social Media Dashboarding (reporting)
Social Media Dashboarding (reporting)Social Media Dashboarding (reporting)
Social Media Dashboarding (reporting)
 
Social Media Dashboarding by Scott Wilder and semphonic
Social Media Dashboarding by Scott Wilder and semphonicSocial Media Dashboarding by Scott Wilder and semphonic
Social Media Dashboarding by Scott Wilder and semphonic
 
Project for executive summary v2
Project for executive summary v2Project for executive summary v2
Project for executive summary v2
 

Recently uploaded

Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhisoniya singh
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...gurkirankumar98700
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Google AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGGoogle AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGSujit Pal
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 

Recently uploaded (20)

Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptx
 

Mining public opinion about economic issues

  • 1. Mining public opinion about economic issues: Twitter and the U.S. Presidential Election. Ivan Abboud – Wajdy Al-Jaber. Source: https://arxiv.org/pdf/1804.03540.pdf
  • 2. Introduction • Opinion polls have been the bridge between public opinion and politicians in elections • Social media has provided a platform for collecting a large amount of social media data • This paper proposes a computational public opinion mining approach to explore the discussion of economic issues in social media during an election
  • 3. Opinion Mining • An “opinion poll is a type of survey or inquiry designed to measure the public’s views regarding a particular topic or series of topics” • face-to-face interviews • phone interviews • surveys sent by mail or email or available online
  • 4. Opinion Mining • Opinion polls encourage political campaigns to track polls and surveys for possible changes in public relations strategies • Among different technologies, social media plays the role of a big focus group in providing feedback during an election cycle
  • 5. Opinion Mining • Before 2016, the 2012 presidential campaigns of Barack Obama and Mitt Romney represented the most data-driven election cycle in history • The Obama and Romney campaigns spent $52 million and $26 million on advertising in modern social media, respectively • In that election, 40% of U.S. adults engaged politically with social media. • 38% of social media users shared and followed political news • 20% of the users followed politicians on social media
  • 6. Opinion Mining • People share their feelings and opinions on Twitter on such a large scale that it can be used for research • Collecting and analyzing Twitter data is a cost-effective way to survey a large number of participants in a short period of time.
  • 7. • This study proposes an economics-based opinion mining approach to analyze election-related tweets, to gather positive and negative economic feedback within them, and to better understand public opinion on economic issues. • The proposed approach applies a combination of sentiment analysis and topic modeling methods to millions of tweets posted during the 2012 U.S. presidential election (Barack Obama vs. Mitt Romney).
  • 8. Methodology This paper proposes an economic-based public opinion mining approach with four components: Data Collection, Sentiment Analysis, Topic Discovery, and Analysis.
  • 9. Data Collection Twitter data can be collected with APIs (Application Programming Interfaces). APIs collect different forms of Twitter data for a user, such as tweets, number of followers, and favorite tweets. To access a large number of tweets, some related terms are needed to retrieve the relevant ones.
  • 10. Data Collection This step comes with a data cleaning step to remove stopwords, such as “the,” that do not have any semantic value. Pipeline: Queries → Raw Tweets → Cleaned Tweets
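The cleaning step on this slide can be sketched as follows. This is a minimal illustration, not the authors' pipeline: the stopword list, the tokenization rule, and the sample tweet are all assumptions made for the example.

```python
import re

# Tiny illustrative stopword list (an assumption; real lists are much longer).
STOPWORDS = {"the", "a", "an", "and", "of", "to", "is", "in", "on", "for"}

def clean_tweet(text: str) -> list[str]:
    """Lowercase a tweet, split it into tokens, and drop stopwords."""
    # Keep word characters plus the @ and # prefixes used for mentions and hashtags.
    tokens = re.findall(r"[@#]?\w+", text.lower())
    return [token for token in tokens if token not in STOPWORDS]

# Made-up sample tweet, used only to exercise the function.
cleaned = clean_tweet("The economy is on the mind of @BarackObama voters")
print(cleaned)
```

Keeping the `@` and `#` prefixes is a design choice for this sketch, so that mentions and hashtags stay distinguishable from ordinary words in later steps.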
  • 11. Data Collection Queries used for filtering tweets, by candidate. Barack Obama: barack obama, @barackobama, #barackobama, #obama. Mitt Romney: mitt romney, @mittromney, #mittromney, #romney. The data for this research was collected from September 29, 2012, to November 16, 2012. This dataset has 24 million tweets related to the candidates for president, Barack Obama and Mitt Romney.
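A rough sketch of how the query terms on this slide could be used to assign tweets to candidates. The query terms are taken from the slide; the case-insensitive substring matching, the function name `match_candidates`, and the sample tweets are assumptions, since the slides do not say how the retrieval was actually implemented.

```python
# Query terms as listed on the slide, grouped by candidate.
QUERIES = {
    "Barack Obama": ["barack obama", "@barackobama", "#barackobama", "#obama"],
    "Mitt Romney": ["mitt romney", "@mittromney", "#mittromney", "#romney"],
}

def match_candidates(tweet: str) -> list[str]:
    """Return the candidates whose query terms occur in the tweet (case-insensitive)."""
    text = tweet.lower()
    return [
        name
        for name, terms in QUERIES.items()
        if any(term in text for term in terms)
    ]

# Made-up sample tweets, not taken from the dataset.
tweets = [
    "#Obama on jobs tonight",
    "Watching the debate with friends",
    "Mitt Romney and @BarackObama talk taxes",
]
matches = [match_candidates(tweet) for tweet in tweets]
print(matches)
```

A tweet can match both candidates, so the function returns a list rather than a single label.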
  • 12. Sentiment Analysis The second step in the proposed approach is sentiment analysis. Sentiment analysis is concerned with extracting emotions and opinions from text. Two main methods can be used for this step: learning-based approaches and lexicon-based approaches.
  • 14. Sentiment Analysis The problem with machine learning approaches is that they need a training data set, that is, data labeled by human raters. They also need prior knowledge about the data categories.
  • 15. Sentiment Analysis Lexicon-based approaches count the frequency of terms from a predefined dictionary of positive and negative terms to disclose sentiment in the data when there is no prior knowledge about its categories. This study used the lexicon-based approach because there was no prior knowledge about the data categories.
  • 16. Sentiment Analysis Lexicon-based Approaches Linguistic Inquiry and Word Count (LIWC) is a widely used tool for text analysis. It reads a given text and counts the percentage of words that reflect different emotions, thinking styles, social concerns, and even parts of speech. After the processing module has read and accounted for all words in a given text, it calculates the percentage of total words that match each of the dictionary categories. For example, if LIWC analyzed a single speech that was 2,000 words and compared them to the built-in dictionary, it might find that there were 150 pronouns and 84 positive emotion words used. It would convert these numbers to percentages: 7.5% pronouns and 4.2% positive emotion words.
  • 17. Sentiment Analysis Lexicon-based Approaches The dictionary is composed of almost 6,400 words, word stems, and selected emoticons. For each dictionary word, there is a corresponding dictionary entry that defines one or more word categories. Example: the word “cried” belongs to five categories: Sadness, Negative Emotion, Overall Affect, Verb, and Past Focus.
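The counting described on the two LIWC slides above can be sketched with a toy dictionary. This is not the LIWC software or its dictionary: only the "cried" entry follows the slide, while the other entries, the tokenization, and the sample sentence are assumptions for illustration.

```python
import re
from collections import Counter

# Toy dictionary: each word maps to one or more categories.
# Only the "cried" entry follows the slide; the rest are illustrative guesses.
TOY_DICTIONARY = {
    "cried": ["sadness", "negative emotion", "overall affect", "verb", "past focus"],
    "happy": ["positive emotion", "overall affect"],
    "we": ["pronoun"],
    "they": ["pronoun"],
}

def category_percentages(text: str) -> dict[str, float]:
    """Return, per category, the percentage of all words that match it."""
    words = re.findall(r"[a-z']+", text.lower())
    counts = Counter()
    for word in words:
        for category in TOY_DICTIONARY.get(word, []):
            counts[category] += 1
    if not words:
        return {}
    return {category: 100.0 * n / len(words) for category, n in counts.items()}

# Made-up sentence with 7 words, 2 of which are pronouns.
result = category_percentages("We cried and then they were happy")

# The arithmetic from the slide: 150 pronouns and 84 positive emotion words
# in a 2,000-word speech.
slide_pronoun_pct = 100 * 150 / 2000
slide_positive_pct = 100 * 84 / 2000
print(result, slide_pronoun_pct, slide_positive_pct)
```

Because one word can belong to several categories, the percentages across categories do not need to sum to 100.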
  • 18. Sentiment Analysis We filter the data into positive and negative tweets with respect to each candidate. [Bar chart: number of negative and positive tweets for Barack Obama and Mitt Romney; data labels as extracted: 4,549,496; 2,773,933; 3,075,592; 2,396,873] This analysis shows that Obama has the advantage based on the difference between positive and negative tweets.
  • 19. Topic Modeling Topic modeling is a type of statistical modeling for discovering the abstract “topics” that occur in a collection of documents.
  • 20. Topic Modeling LDA or latent Dirichlet allocation is a “generative probabilistic model” of a collection of documents made up of words. The probabilistic topic model estimated by LDA consists of two tables (matrices): 1 - The first table describes the probability or chance of selecting a particular word when sampling a particular topic (category). 2- The second table describes the chance of selecting a particular topic when sampling a particular document or composite.
  • 21. Topic Modeling Example: bag-of-words counts
        Tweet      Budget   Tax   Employee
        Tweet 0    10       0     0
        Tweet 1    0        10    0
        Tweet 2    0        0     10
        Tweet 3    10       10    10
      After running LDA we end up with two tables.
      Word vs. topic probabilistic model:
        Word       Topic1   Topic2   Topic3
        Budget     0        0        0.999
        Tax        0.999    0        0
        Employee   0        0.999    0
      Tweet vs. topic probabilistic model:
        Tweet      Topic1   Topic2   Topic3
        Tweet 0    0        0        0.93
        Tweet 1    0.93     0        0
        Tweet 2    0        0.93     0
        Tweet 3    0.333    0.333    0.333
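One way to read the two tables on this slide together: under the LDA model, the chance of seeing a word in a tweet is the sum over topics of P(topic | tweet) × P(word | topic). The sketch below applies that sum to the numbers from the slide; it does not fit an LDA model, and the variable and function names are illustrative. The slide's values are rounded, so the rows do not sum exactly to 1.

```python
WORDS = ["budget", "tax", "employee"]

# Word vs. topic table from the slide: P(word | topic).
topic_word = {
    "topic1": {"budget": 0.0, "tax": 0.999, "employee": 0.0},
    "topic2": {"budget": 0.0, "tax": 0.0, "employee": 0.999},
    "topic3": {"budget": 0.999, "tax": 0.0, "employee": 0.0},
}

# Tweet vs. topic table from the slide: P(topic | tweet).
doc_topic = {
    "tweet0": {"topic1": 0.0, "topic2": 0.0, "topic3": 0.93},
    "tweet1": {"topic1": 0.93, "topic2": 0.0, "topic3": 0.0},
    "tweet2": {"topic1": 0.0, "topic2": 0.93, "topic3": 0.0},
    "tweet3": {"topic1": 0.333, "topic2": 0.333, "topic3": 0.333},
}

def word_probabilities(doc: str) -> dict[str, float]:
    """Mix the topic-word rows using the tweet's topic weights."""
    return {
        word: sum(doc_topic[doc][topic] * topic_word[topic][word] for topic in topic_word)
        for word in WORDS
    }

tweet0 = word_probabilities("tweet0")
tweet3 = word_probabilities("tweet3")
print(tweet0)
print(tweet3)
```

Tweet 0, which contains only "budget", gets nearly all of its probability on that word, while Tweet 3, which uses the three words equally, spreads its probability evenly across them.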
  • 22. Topic Modeling LDA is a generative model that has two hyperparameters to be tuned: α and β
  • 23. Topic Modeling The alpha controls the mixture of topics for any given document. Turn it down and the documents will likely have less of a mixture of topics. Turn it up and the documents will likely have more of a mixture of topics
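The effect of alpha can be illustrated by sampling topic mixtures from a symmetric Dirichlet distribution, which is the prior LDA places on each document's topic mixture. This is a small simulation under assumed settings (3 topics, alpha values of 0.1 and 10, 2,000 samples, a fixed seed), not part of the study; the function names are illustrative.

```python
import random

def sample_topic_mixture(alpha: float, num_topics: int, rng: random.Random) -> list[float]:
    """Draw one topic mixture from a symmetric Dirichlet(alpha) via normalized gamma draws."""
    while True:
        draws = [rng.gammavariate(alpha, 1.0) for _ in range(num_topics)]
        total = sum(draws)
        if total > 0:  # guard against numerical underflow for very small alpha
            return [draw / total for draw in draws]

def mean_largest_share(alpha: float, num_topics: int = 3, samples: int = 2000, seed: int = 0) -> float:
    """Average share of the dominant topic across many sampled mixtures."""
    rng = random.Random(seed)
    total = sum(max(sample_topic_mixture(alpha, num_topics, rng)) for _ in range(samples))
    return total / samples

low_alpha = mean_largest_share(0.1)    # documents dominated by one topic
high_alpha = mean_largest_share(10.0)  # documents mixing topics more evenly
print(low_alpha, high_alpha)
```

With a low alpha, the dominant topic takes most of each sampled mixture; with a high alpha, the shares move toward an even split across the topics.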
  • 24. Topic Modeling People posted tweets about different issues (topics) during the 2012 election, but the focus of this research is on the main economic issues: the economy in general, jobs, the budget deficit, healthcare, and taxes. DPNT is the difference between the number of positive topics and the number of negative topics. DPNT indicates the overall feedback status.
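The DPNT definition on this slide amounts to a single subtraction per issue. A minimal sketch follows; the topic counts are made up for illustration and are not the study's results, and the function name `dpnt` is an assumption.

```python
def dpnt(positive_topics: int, negative_topics: int) -> int:
    """Difference between the number of positive and negative topics for one issue."""
    return positive_topics - negative_topics

# Hypothetical (positive, negative) topic counts per issue, for illustration only.
example_counts = {
    "Jobs": (12, 5),
    "Tax": (4, 9),
}

example_dpnt = {
    issue: dpnt(positive, negative)
    for issue, (positive, negative) in example_counts.items()
}
print(example_dpnt)
```

A positive DPNT means an issue drew more positive than negative topics for a candidate, and a negative DPNT means the reverse.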
  • 25. Results After applying topic modeling to the positive and negative tweets, filtering the topics down to the five economic issues, and calculating the DPNT value of each issue for both candidates, we got these results. [Bar charts: “Obama’s Results” and “Romney’s Results”, numbers of negative and positive topics for Economy, Jobs, Budget Deficit, Healthcare, and Tax; data labels as extracted, Obama: 13 34 4 11 8 and 18 24 3 4 13; Romney: 19 22 3 18 21 and 25 31 9 14 31]
  • 26. Results [Bar chart: “Obama vs. Romney (DPNT)” for Economy, Jobs, Budget Deficit, Healthcare, and Tax; data labels as extracted: -5 10 7 1 -5 and -6 -9 -6 4 -10] Obama has three positive DPNTs, with the highest DPNT for the jobs issue. Romney has just one positive DPNT, for the healthcare issue. Although Obama has two negative DPNTs, for the economy in general and for taxes, he has the advantage on all the economic issues based on DPNT value.
  • 27. Conclusion The final election results show that Obama won a clear victory, with an advantage over Romney of more than 3 million popular votes and more than 120 electoral votes. In line with the final results, our analysis indicates that the winner had the advantage on the most important issues (economic issues) in the election.
  • 28. Conclusion The results show that jobs and taxes were the most and the least important issues, respectively, for the followers of the two candidates. Although the overall ranking of the issues for each candidate is very close, DPNT values show Obama having the advantage on all the economic issues.
  • 29. Thank you! Ivan Abboud – Wajdy Al-Jaber Questions?
  • 30. The End Thanks for listening Source : https://arxiv.org/pdf/1804.03540.pdf