2. Problem Statement
Input – textual content of a tweet
Output – label signifying the sentiment of the tweet (Positive, Neutral, or
Negative)
3. Motivation
Tweets express opinions about different topics, and these opinions are
valuable to several audiences:
Consumers can use sentiment analysis to research products or services before
making a purchase, e.g. reviews of the Kindle.
Marketers can use it to gauge public opinion of their company and products,
or to analyze customer satisfaction, e.g. election polls.
Organizations can use it to gather critical feedback about problems in newly
released products, e.g. brand management (Nike, Adidas).
4. Challenges
Noisy text
Lack of context - 140 characters only
Acronyms - lol, brb, gr8
Emoticons - :) , :( , :|
Negation - e.g. 'not good' carries negative, not positive, sentiment
6. Approach
Tweet Downloader
Download the tweets using the Twitter API and the twitter_download script
(https://github.com/aritter/twitter_download).
9,684 training and 8,987 testing tweets were downloaded.
Parser
The parser removes all unavailable tweets from the downloaded data.
After removing these, 7,612 tweets remain for training and 7,868 tweets
for testing.
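The parsing step can be sketched as a simple filter. The placeholder string used for unavailable tweets ("Not Available") is an assumption about what the download script writes out:

```python
def filter_available(rows):
    """Drop tweets whose text could not be fetched.

    `rows` is a list of (label, text) pairs; the placeholder text
    "Not Available" for deleted or protected tweets is an assumption
    about the downloader's output format.
    """
    return [(label, text) for label, text in rows if text != "Not Available"]
```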
7. Approach
Pre-processing
Replace Emoticons by their polarity.
Remove URLs and Targets.
Expand acronyms, e.g. 'brb' to 'be right back'.
Remove stop words.
Tokenization
Stemming
Case-folding
Remove punctuation marks
Replace sequences of repeating characters, e.g. 'hellooooo' with 'helloo'.
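The pre-processing steps above can be sketched as a single pipeline. The lookup tables are tiny illustrative stand-ins for the real emoticon, acronym, and stop-word lists, and the stemming step is omitted for brevity:

```python
import re

# Small illustrative lookup tables; the real lists would be much larger.
EMOTICONS = {":)": "pos_emoticon", ":(": "neg_emoticon", ":|": "neu_emoticon"}
ACRONYMS = {"brb": "be right back", "lol": "laughing out loud", "gr8": "great"}
STOP_WORDS = {"a", "an", "the", "is", "to", "of"}

def preprocess(tweet):
    # Replace emoticons by polarity tokens before punctuation is stripped.
    for emo, tag in EMOTICONS.items():
        tweet = tweet.replace(emo, " " + tag + " ")
    tweet = re.sub(r"https?://\S+|www\.\S+", " ", tweet)  # remove URLs
    tweet = re.sub(r"@\w+", " ", tweet)                   # remove targets
    tokens = tweet.lower().split()                        # case-fold + tokenize
    # Squash runs of 3+ repeated characters to 2: 'hellooooo' -> 'helloo'.
    tokens = [re.sub(r"(.)\1{2,}", r"\1\1", t) for t in tokens]
    tokens = [re.sub(r"[^\w]", "", t) for t in tokens]    # strip punctuation
    # Expand acronyms, then drop stop words and empty tokens
    # (stemming is omitted from this sketch).
    tokens = [w for t in tokens for w in ACRONYMS.get(t, t).split()]
    return [t for t in tokens if t and t not in STOP_WORDS]
```

The emoticon pass runs first on purpose: stripping punctuation earlier would destroy `:)` before its polarity could be recorded.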
8. Approach
Feature Extractor
The pre-processed data file is fed to the feature extractor, which creates the
feature vector.
The basic (baseline) feature considered was unigrams.
A list of all unique unigrams across the training set was constructed; it forms
the basic feature vector for each tweet.
Synsets are used for words that are not found in the list of unique unigrams.
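The baseline unigram feature vector can be sketched as a binary presence vector over the training-set vocabulary. The synset fallback for out-of-vocabulary words is only noted in a comment here, since it needs an external WordNet resource:

```python
def build_vocab(tweets):
    # Unique unigrams across the training set define the vector dimensions.
    vocab = sorted({tok for tweet in tweets for tok in tweet})
    return {tok: i for i, tok in enumerate(vocab)}

def unigram_vector(tokens, vocab):
    # Binary presence vector; out-of-vocabulary tokens are simply skipped
    # in this sketch (the project instead falls back to synsets for them).
    vec = [0] * len(vocab)
    for tok in tokens:
        if tok in vocab:
            vec[vocab[tok]] = 1
    return vec
```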
9. Approach
Add Additional Features
Polarity scores of the tweets
Negation
Hashtags
Special characters (?,!,*)
Capitalized words
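The additional features above can be sketched as simple counts and flags computed on the raw tweet. The exact feature definitions and the lexicon used for the polarity score are not given in the slides, so the choices below (including the negation word list) are illustrative assumptions; the lexicon-based polarity score is omitted since it needs an external word list:

```python
import re

def additional_features(raw_tweet):
    # Hypothetical hand-crafted features appended to the n-gram vector.
    negation = re.search(r"\b(not|no|never)\b|n't", raw_tweet.lower())
    return [
        raw_tweet.count("#"),                          # hashtag count
        sum(raw_tweet.count(c) for c in "?!*"),        # special characters
        len(re.findall(r"\b[A-Z]{2,}\b", raw_tweet)),  # fully capitalized words
        1 if negation else 0,                          # negation present?
    ]
```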
SVM Classification and Prediction
The extracted features are passed to the SVM classifier to build a model.
The model is then used to predict the sentiment of new tweets.
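The classification step can be sketched with scikit-learn's `LinearSVC`; the slides do not name a library or kernel, so this choice, and the toy feature vectors, are assumptions:

```python
from sklearn.svm import LinearSVC

# Toy feature vectors; in the project these come from the feature
# extractor (unigrams plus the additional features).
X_train = [
    [1, 0, 1, 0],
    [0, 1, 0, 1],
    [1, 1, 0, 0],
    [0, 0, 1, 1],
]
y_train = ["positive", "negative", "positive", "neutral"]

clf = LinearSVC()          # linear-kernel SVM, one-vs-rest over the 3 classes
clf.fit(X_train, y_train)
prediction = clf.predict([[1, 0, 1, 0]])[0]
```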
10. Results
Features                        Accuracy   Precision   Recall   F1 score
Unigram                         54.855%    0.5264      0.5061   0.5126
Unigram + additional features   57.079%    0.5525      0.5308   0.5386
Bigrams                         58.579%    0.5713      0.5173   0.5269
Bigrams + additional features   60.739%    0.5930      0.5525   0.5637
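The precision, recall, and F1 columns appear to be averaged over the three classes. A sketch of how such scores are computed, assuming macro-averaging (the averaging scheme is not stated in the slides):

```python
def macro_scores(y_true, y_pred, labels):
    """Accuracy plus macro-averaged precision, recall, and F1."""
    precs, recs, f1s = [], [], []
    for c in labels:
        tp = sum(t == c and p == c for t, p in zip(y_true, y_pred))
        fp = sum(t != c and p == c for t, p in zip(y_true, y_pred))
        fn = sum(t == c and p != c for t, p in zip(y_true, y_pred))
        prec = tp / (tp + fp) if tp + fp else 0.0
        rec = tp / (tp + fn) if tp + fn else 0.0
        precs.append(prec)
        recs.append(rec)
        f1s.append(2 * prec * rec / (prec + rec) if prec + rec else 0.0)
    n = len(labels)
    acc = sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)
    return acc, sum(precs) / n, sum(recs) / n, sum(f1s) / n
```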