SlideShare a Scribd company logo
Sentiment Analysis for Twitter
Priyanka Bajaj priyanka.bajaj@students.iiit.ac.in
Kamal Gurala kamal.gurala@students.iiit.ac.in
Faraz Alam faraz.alam@students.iiit.ac.in
Ritesh Kumar Gupta ritesh.kumar.gupta@in.ibm.com
Guided By : Satarupa Guha satarupaguha11@gmail.com
AGENDA
1.Introduction – Sentiment Analysis
2.About Twitter and Our Goal
3.Glossary
4.Challenges
5.Approach
6.Results and Conclusion
7.Tools and Technologies
What is Sentiment Analysis?
Mechanism to extract opinions, emotions and sentiments in
text
Enable us to track attitudes and feelings on the web based on
blog posts, comments, reviews and tweets on different topics
Enable to track products, brands and people and determine
whether they are viewed positively or negatively on the web.
acts: "The painting was more expensive than a Monet"
pinions: "I honestly don't like Monet, Pollock is the
better” artist"
Challenges
• Tweets are highly unstructured and also non-
grammatical
• Out of Vocabulary Words
• Lexical Variation
• Extensive usage of acronyms like asap, lol, afaik
Our System
• Tweet Downloader
– Download the tweets using Twitter API
• Tokenisation
– Twitter specific POS Tagger developed by ARK Social
Media Search
• Preprocessing
– Replacing Emoticons by their polarity, assign scores
– Remove URL, Target Mentions
– Replace #text -> text, since hashtags may contribute to the
sentiment
– Replace Sequence of Repeated Characters eg. ‘cooooool’
by ‘cool’ and assign higher score
– Twitter specific stop word removal
– Acronym expansion
System Details
• Feature Extractor
– Unigrams and Bigrams
– Polarity Score of the Tweet (f1)
– Count of Positive/Negative Words (f2,f3)
– Maximum Positive/Negative Score for Words (f4,f5)
– Count of Positive/Negative Emoticons and assign
scores(contibutes to all f1,f2,f3,f4,f5)
– Positive/Negative special POS Tags Polarity Score
• Classifier and Prediction
– Features extracted are fed into to SVM classifier
– Model built used to predict sentiment of new tweets
System Details Contd.
Results and Conclusion
A baseline model by taking the unigrams, and
compare it with the bigrams and lexicon features
model
Sub-Task Baseline Model Feature Based
Model
Sentence Based 49.81% 57.85%
Accuracy F1 Score (f-Measure)
Sub-Task Baseline Model Feature Based
Model
Sentence Based 55.56 61.17
• We investigated two kinds of models: Baseline and
Feature Based Models
• For our feature-based approach, feature analysis reveals
that the most important features are bigrams and those
that combine the prior polarity of words and their parts-
of-speech tags
Thank You

More Related Content

Viewers also liked

Sant Cugat
Sant CugatSant Cugat
Sant Cugat
RoviraBeatriz1rC
 
Needleman-wunch algorithm harshita
Needleman-wunch algorithm  harshitaNeedleman-wunch algorithm  harshita
Needleman-wunch algorithm harshita
Harshita Bhawsar
 
Cliff sugerman
Cliff sugermanCliff sugerman
Cliff sugerman
clifford sugerman
 
GPGPU algorithms in games
GPGPU algorithms in gamesGPGPU algorithms in games
GPGPU algorithms in gameszlatan4177
 
Algorithms for Computer Games - lecture slides 2009
Algorithms for Computer Games - lecture slides 2009Algorithms for Computer Games - lecture slides 2009
Algorithms for Computer Games - lecture slides 2009
Jouni Smed
 

Viewers also liked (6)

Sant Cugat
Sant CugatSant Cugat
Sant Cugat
 
Needleman-wunch algorithm harshita
Needleman-wunch algorithm  harshitaNeedleman-wunch algorithm  harshita
Needleman-wunch algorithm harshita
 
Sc13 gpu
Sc13 gpuSc13 gpu
Sc13 gpu
 
Cliff sugerman
Cliff sugermanCliff sugerman
Cliff sugerman
 
GPGPU algorithms in games
GPGPU algorithms in gamesGPGPU algorithms in games
GPGPU algorithms in games
 
Algorithms for Computer Games - lecture slides 2009
Algorithms for Computer Games - lecture slides 2009Algorithms for Computer Games - lecture slides 2009
Algorithms for Computer Games - lecture slides 2009
 

Similar to Report v1

Sentiment Analysis
Sentiment AnalysisSentiment Analysis
Sentiment Analysis
prnk08
 
Sentiment Analysis
Sentiment Analysis Sentiment Analysis
Sentiment Analysis
prnk08
 
Sentiment Analysis in Twitter
Sentiment Analysis in TwitterSentiment Analysis in Twitter
Sentiment Analysis in Twitter
prnk08
 
Viva voce presentation-Modeling emotion expression through AOM
Viva voce presentation-Modeling emotion expression through AOMViva voce presentation-Modeling emotion expression through AOM
Viva voce presentation-Modeling emotion expression through AOM
syazwanie filzah
 
Twitter Sentiment Analysis
Twitter Sentiment AnalysisTwitter Sentiment Analysis
Twitter Sentiment Analysis
Ayush Khandelwal
 
Sentimental Analysis - Naive Bayes Algorithm
Sentimental Analysis - Naive Bayes AlgorithmSentimental Analysis - Naive Bayes Algorithm
Sentimental Analysis - Naive Bayes Algorithm
Khushboo Gupta
 
Social Media Audit Assignment Keyhole Social Media Analytics
Social Media Audit Assignment Keyhole Social Media AnalyticsSocial Media Audit Assignment Keyhole Social Media Analytics
Social Media Audit Assignment Keyhole Social Media Analytics
Matthew J. Kushin, Ph.D.
 
110917_0900_Karimi.pdf
110917_0900_Karimi.pdf110917_0900_Karimi.pdf
110917_0900_Karimi.pdf
Jayashankara3
 
Product Recommendations Enhanced with Reviews
Product Recommendations Enhanced with ReviewsProduct Recommendations Enhanced with Reviews
Product Recommendations Enhanced with Reviews
maranlar
 
10 Reasons Why Data-driven App Design Needs Social Science | Julian Runge
10 Reasons Why Data-driven App Design Needs Social Science | Julian Runge10 Reasons Why Data-driven App Design Needs Social Science | Julian Runge
10 Reasons Why Data-driven App Design Needs Social Science | Julian Runge
Jessica Tams
 
Ranking Twitter Conversations
Ranking Twitter ConversationsRanking Twitter Conversations
Ranking Twitter Conversations
Mohammed Faisal Anees
 
Context aware and personalized travel recommendation system travel sl
Context aware and personalized travel recommendation system   travel slContext aware and personalized travel recommendation system   travel sl
Context aware and personalized travel recommendation system travel sl
Senuri Wijenayake
 
Social annotations: utility and predictive modeling
Social annotations: utility and predictive modelingSocial annotations: utility and predictive modeling
Social annotations: utility and predictive modeling
oralonso
 
Industry project part2
Industry project part2Industry project part2
Industry project part2
Tushar Sharma
 
EMOTION RECOGNITION BY TEXTUAL TWEETS CLASSIFICATION USING VOTING CLASSIFIER ...
EMOTION RECOGNITION BY TEXTUAL TWEETS CLASSIFICATION USING VOTING CLASSIFIER ...EMOTION RECOGNITION BY TEXTUAL TWEETS CLASSIFICATION USING VOTING CLASSIFIER ...
EMOTION RECOGNITION BY TEXTUAL TWEETS CLASSIFICATION USING VOTING CLASSIFIER ...
Shakas Technologies
 
Designing Mobile UX
Designing Mobile UXDesigning Mobile UX
Designing Mobile UX
Farah Nuraini
 
Transform your training with simulations speaker deck - march 2011
Transform your training with simulations   speaker deck - march 2011Transform your training with simulations   speaker deck - march 2011
Transform your training with simulations speaker deck - march 2011
The Blockchain Academy
 
Remote User Testing - MSU 12 Nov2021
Remote User Testing - MSU 12 Nov2021Remote User Testing - MSU 12 Nov2021
Remote User Testing - MSU 12 Nov2021
Chris Farnum
 
Backchannels and broadcast
Backchannels and broadcastBackchannels and broadcast
Backchannels and broadcast
Stephen Dann
 

Similar to Report v1 (20)

Sentiment Analysis
Sentiment AnalysisSentiment Analysis
Sentiment Analysis
 
Sentiment Analysis
Sentiment Analysis Sentiment Analysis
Sentiment Analysis
 
Sentiment Analysis in Twitter
Sentiment Analysis in TwitterSentiment Analysis in Twitter
Sentiment Analysis in Twitter
 
Viva voce presentation-Modeling emotion expression through AOM
Viva voce presentation-Modeling emotion expression through AOMViva voce presentation-Modeling emotion expression through AOM
Viva voce presentation-Modeling emotion expression through AOM
 
Twitter Sentiment Analysis
Twitter Sentiment AnalysisTwitter Sentiment Analysis
Twitter Sentiment Analysis
 
Sentimental Analysis - Naive Bayes Algorithm
Sentimental Analysis - Naive Bayes AlgorithmSentimental Analysis - Naive Bayes Algorithm
Sentimental Analysis - Naive Bayes Algorithm
 
Social Media Audit Assignment Keyhole Social Media Analytics
Social Media Audit Assignment Keyhole Social Media AnalyticsSocial Media Audit Assignment Keyhole Social Media Analytics
Social Media Audit Assignment Keyhole Social Media Analytics
 
110917_0900_Karimi.pdf
110917_0900_Karimi.pdf110917_0900_Karimi.pdf
110917_0900_Karimi.pdf
 
Product Recommendations Enhanced with Reviews
Product Recommendations Enhanced with ReviewsProduct Recommendations Enhanced with Reviews
Product Recommendations Enhanced with Reviews
 
10 Reasons Why Data-driven App Design Needs Social Science | Julian Runge
10 Reasons Why Data-driven App Design Needs Social Science | Julian Runge10 Reasons Why Data-driven App Design Needs Social Science | Julian Runge
10 Reasons Why Data-driven App Design Needs Social Science | Julian Runge
 
Ranking Twitter Conversations
Ranking Twitter ConversationsRanking Twitter Conversations
Ranking Twitter Conversations
 
Context aware and personalized travel recommendation system travel sl
Context aware and personalized travel recommendation system   travel slContext aware and personalized travel recommendation system   travel sl
Context aware and personalized travel recommendation system travel sl
 
Social annotations: utility and predictive modeling
Social annotations: utility and predictive modelingSocial annotations: utility and predictive modeling
Social annotations: utility and predictive modeling
 
Industry project part2
Industry project part2Industry project part2
Industry project part2
 
EMOTION RECOGNITION BY TEXTUAL TWEETS CLASSIFICATION USING VOTING CLASSIFIER ...
EMOTION RECOGNITION BY TEXTUAL TWEETS CLASSIFICATION USING VOTING CLASSIFIER ...EMOTION RECOGNITION BY TEXTUAL TWEETS CLASSIFICATION USING VOTING CLASSIFIER ...
EMOTION RECOGNITION BY TEXTUAL TWEETS CLASSIFICATION USING VOTING CLASSIFIER ...
 
Blooms
BloomsBlooms
Blooms
 
Designing Mobile UX
Designing Mobile UXDesigning Mobile UX
Designing Mobile UX
 
Transform your training with simulations speaker deck - march 2011
Transform your training with simulations   speaker deck - march 2011Transform your training with simulations   speaker deck - march 2011
Transform your training with simulations speaker deck - march 2011
 
Remote User Testing - MSU 12 Nov2021
Remote User Testing - MSU 12 Nov2021Remote User Testing - MSU 12 Nov2021
Remote User Testing - MSU 12 Nov2021
 
Backchannels and broadcast
Backchannels and broadcastBackchannels and broadcast
Backchannels and broadcast
 

Report v1

  • 1. Sentiment Analysis for Twitter Priyanka Bajaj priyanka.bajaj@students.iiit.ac.in Kamal Gurala kamal.gurala@students.iiit.ac.in Faraz Alam faraz.alam@students.iiit.ac.in Ritesh Kumar Gupta ritesh.kumar.gupta@in.ibm.com Guided By : Satarupa Guha satarupaguha11@gmail.com
  • 2. AGENDA 1.Introduction – Sentiment Analysis 2.About Twitter and Our Goal 3.Glossary 4.Challenges 5.Approach 6.Results and Conclusion 7.Tools and Technologies
  • 3. What is Sentiment Analysis? Mechanism to extract opinions, emotions and sentiments in text Enable us to track attitudes and feelings on the web based on blog posts, comments, reviews and tweets on different topics Enable to track products, brands and people and determine whether they are viewed positively or negatively on the web. acts: "The painting was more expensive than a Monet" pinions: "I honestly don't like Monet, Pollock is the better” artist"
  • 4. Challenges • Tweets are highly unstructured and also non- grammatical • Out of Vocabulary Words • Lexical Variation • Extensive usage of acronyms like asap, lol, afaik
  • 6. • Tweet Downloader – Download the tweets using Twitter API • Tokenisation – Twitter specific POS Tagger developed by ARK Social Media Search • Preprocessing – Replacing Emoticons by their polarity, assign scores – Remove URL, Target Mentions – Replace #text -> text, since hashtags may contribute to the sentiment – Replace Sequence of Repeated Characters eg. ‘cooooool’ by ‘cool’ and assign higher score – Twitter specific stop word removal – Acronym expansion System Details
  • 7. • Feature Extractor – Unigrams and Bigrams – Polarity Score of the Tweet (f1) – Count of Positive/Negative Words (f2,f3) – Maximum Positive/Negative Score for Words (f4,f5) – Count of Positive/Negative Emoticons and assign scores(contibutes to all f1,f2,f3,f4,f5) – Positive/Negative special POS Tags Polarity Score • Classifier and Prediction – Features extracted are fed into to SVM classifier – Model built used to predict sentiment of new tweets System Details Contd.
  • 8. Results and Conclusion A baseline model by taking the unigrams, and compare it with the bigrams and lexicon features model Sub-Task Baseline Model Feature Based Model Sentence Based 49.81% 57.85% Accuracy F1 Score (f-Measure) Sub-Task Baseline Model Feature Based Model Sentence Based 55.56 61.17 • We investigated two kinds of models: Baseline and Feature Based Models • For our feature-based approach, feature analysis reveals that the most important features are bigrams and those that combine the prior polarity of words and their parts- of-speech tags