SlideShare a Scribd company logo
TITLE
Twitter Sentiment Analysis using Various
Classification Algorithms
Abstract
Twitter is a web application to determine online news and social networking service
where users post and interact with messages, anywhere in the world. Twitter posts are generally
short (140 characters) and generated continuously by public which is well suited for opinion
mining. Twitter messages can be classified either in positive or negative sentiment based on
certain aspects with respect to term based query. The past studies of sentiment classification
are not very conclusive about which features and supervised classification algorithms are good
for designing accurate and efficient sentiment classification system. We propose to combine
many feature extraction techniques like emoticons, exclamation and question mark symbol,
word gazetteer, unigrams to design more accurate sentiment classification system.
Keywords
Twitter; Sentiment Analysis; Opinion Mining; Natural Language Processing
Introduction
Human decision making is extensively influenced by assessment or judgement of others.
Before making any move, customers tend to gather as much information as possible about the
product they want to buy. The investors analyse and predict the stock market movement of a
company based on its popularity among its customers be investing their money in its shares.
With the advent development of social media, gathering data for evaluation become easier and
less time consuming. Different platform like Twitter, Facebook, Linked In serve as repositories
of useful data in terms of reviews, likes, comments etc.
Opinions are linked to almost all human activities because they have key impact on our decision
making. We mostly seek others opinions while taking any decisions. In the real world,
organizations and business entities are always willing to know public and general opinions
about their services and products. On the other hand, consumers also seek the opinions of
existing users of a product or service before making a decision to purchase products and
subscribing to services. Opinions of public about political candidates can be analysed to
forecast results of an election. In the past, organizations, governments and business entities
used to conduct surveys and opinion polls on focused groups for obtaining citizen opinions and
their sentiments [1].
Twitter is a social networking web application with microblogging feature that has a large and
constantly growing user data-base. Thus, the application provides a rich data set in the form of
messages that are usually short status updates from Twitter application users that must be
expressed in not more than 140 characters in length. On Twitter, data that consists of millions
of short messages and user status updates are generated each day on about hundreds of different
topics. The task of extracting data from these small texts has become immensely useful for
sorting and ranking popularity of topics mentioned within the updates. Nowadays twitter has
emerged as one of the most popular platforms for expressing sentiments and thoughts on
Internet. It is very useful and obvious to mine and analyse Twitter data for interesting
information regarding major trending topics in the media and other spaces.
Methodology
Twitter Sentiment Analysis is generally divided into 3 major categories that is
1. Machine Learning Approach
2. Lexicon Based Approach
3. Hybrid Approach
The Machine Learning Approach (ML) uses linguistic features and applies well known
Machine Learning algorithms.
The Lexicon based approach is driven by a opinion lexicon, which is nothing but a collection
of pre-compiled opinion terms. It is mainly divided into two main approaches that is
a) Dictionary based approach
b) Corpus Based approach
The Hybrid Approach combines the above two approaches.
To increase the performance and efficiency of sentiment classification system the combination
of well-known features extraction methods is considered. The proposed method compares 6
supervised classification algorithms that is
a) Naïve Bayes Algorithm
b) Bayes Net Algorithm
c) Discriminative Multinomial Naïve Bayes(DMNB) Algorithm
d) Sequential Minimal Optimization (SMO) Algorithm
e) Hyperpipes Algorithm
f) Random Forest Algorithm
1) Naïve Bayes(NB): This algorithm is a probabilistic classifier in a simple form that counts
the combinations of values and frequency in a data set under consideration and calculates
probabilities set. Bayes theorem is the base of this algorithm and assumes that all the attributes
are completely independent against a set value of the class variable.
2) Bayes Net (BN): Bayesian nets (BN) are a network-based system that are mainly used for
analysing and representing the models that involves uncertainty. Bayesian networks learns the
causal relationships and use it to implement incremental learning. To perform classification,
first the input nodes must be set with the evidence and then the output nodes can be queried
and analysed using standard Bayesian network inference.
3) Discriminative Multinominal Naive Bayes (DMNB): The multinomial Naive Bayes is a
well-known and widely used classifier for classification of documents and tested to yield
satisfactory performance. Discriminative multinomial Naïve Bayes (DMNB) takes a document
and consider it as a bag-of-words. For each class c, P(w|c), the training data is unitized to
estimate the probability of observing the word w against the given class. It works on the
collection of training documents of the particular class by calculating each word’s relative
occurrence frequency. The classifier also needs the prior probability, Pc) which is intuitive to
estimate. If the word w occurs nwd number of times in document d, then given a document
under test the probability of the class c is calculated in the following manner
4) SMO: Sequential Minimal Optimization (SMO) method is generally used in the training
process of Support Vector Machines (SVM) classification algorithm. SMO algorithm consists
of many optimizations designed primarily to increase the analysis performance of large
datasets. It is designed to ensure that the algorithm converges with results even in degenerate
conditions. It works by breaking up a problem into a set of atomic sub-problems, which are
solved using analytical approach
5) Hyperpipes: Hyperpipes is a technique that creates a “hyperpipe” for each class of a data
set. These Classes are the collections of data build around single object template. it can work
extremely fast and effectively.
6) Random Forest: Many trees are produced by this algorithm for classification process. It
classifies new object from an input vector by setting the vector against the forest on each of the
trees. A classification is generated by each tree. In other words, that class is voted by the tree.
The classification having the most votes is chosen by the random forest method across all the
trees. It also runs efficiently on large datasets.
Results Obtained
The six selected classification algorithms were executed on features extracted from Sanders
Twitter dataset on Weka tool. by configuring it with 10-fold cross validation flag building and
testing of the system is carried out. Simulation results in empirical form are presented in Tables
1-9.
False Positive Rate (FPR), True Positive Rate (TPR), Precision (P), recall (R), F-score (F),
and Receiver Operating Characteristic values (ROC) are shown in the following tables.
Table 1: Naïve Bayes Result
Table 2: Bayes Net Results
Table 3: Discriminative Multinominal Naive Bayes(DMNB) Results
Table 4: Sequential Minimal Optimization (SMO) Results
Table 5: Hyperpipes Results
Table 6: Random Forest Results
Performance and Results Comparison
Based on simulation results, the performance of Naive Bayes algorithm is least in comparison
of all six algorithms considered in this study. In general, precision and recall scores are
sufficiently low against the Positive and Negative classes. This is due to large number of
instances in the class ‘other’ in comparison of positive and negative classes. The considered
Sanders dataset is highly imbalanced. Overall, the two most balanced and well-performing
algorithms are DMNB and SMO, with overall F-scores of 0.769 and 0.75 respectively.
Fig 1: Precision Comparison
Fig 2: Recall Comparison
Fig 3: F-Measure Comparison
References
[1] Medhat, Walaa, Ahmed Hassan, and Hoda Korashy. "Sentiment analysis algorithms and
applications: A survey." Ain Shams Engineering Journal 5.4 (2014): 1093-1113.
[2] Liu, Bing. "Sentiment analysis and opinion mining." Synthesis lectures on human language
technologies 5.1 (2012): 1-167.
[3] Agarwal, Apoorv, et al. "Sentiment analysis of twitter data." Proceedings of the workshop
on languages in social media. Association for Computational Linguistics, 2011.
[4] Imran, Muhammad, et al. "Processing social media messages in mass emergency: A
survey." ACM Computing Surveys (CSUR) 47.4 (2015): 67.
[5] Feldman, Ronen. "Techniques and applications for sentiment analysis, “Communications
of the ACM 56.4 (2013): 82-89.
[6] Pang, Bo, and Lillian Lee. “Opinion mining and sentiment analysis. “Foundations and
trends in information retrieval 2.1-2 (2008): 1-135.
[7] Cambria, Erik, et al. “New avenues in opinion mining and sentiment analysis.” IEEE
Intelligent Systems 28.2 (2013): 15- 21.
[8] Witten, Ian H., and Eibe Frank. Data Mining: Practical machine learning tools and
techniques. Morgan Kaufmann, 2005.
[9] Bifet, Albert, and Eibe Frank. "Sentiment knowledge discovery in twitter streaming data."
International Conference on Discovery Science. Springer Berlin Heidelberg, 2010.
[10] Saif, Hassan, Yulan He, and Harith Alani. "Semantic sentiment analysis of twitter.
International Semantic Web Conference. Springer Berlin Heidelberg, 2012.
Abstract

More Related Content

What's hot

P1803018289
P1803018289P1803018289
P1803018289
IOSR Journals
 
IRJET- Twitter Sentimental Analysis for Predicting Election Result using ...
IRJET-  	  Twitter Sentimental Analysis for Predicting Election Result using ...IRJET-  	  Twitter Sentimental Analysis for Predicting Election Result using ...
IRJET- Twitter Sentimental Analysis for Predicting Election Result using ...
IRJET Journal
 
Sentiment analysis of twitter data
Sentiment analysis of twitter dataSentiment analysis of twitter data
Sentiment analysis of twitter data
Bhagyashree Deokar
 
Supervised Sentiment Classification using DTDP algorithm
Supervised Sentiment Classification using DTDP algorithmSupervised Sentiment Classification using DTDP algorithm
Supervised Sentiment Classification using DTDP algorithm
IJSRD
 
IRJET- Fake News Detection using Logistic Regression
IRJET- Fake News Detection using Logistic RegressionIRJET- Fake News Detection using Logistic Regression
IRJET- Fake News Detection using Logistic Regression
IRJET Journal
 
Sentiment Analysis on Twitter
Sentiment Analysis on TwitterSentiment Analysis on Twitter
Sentiment Analysis on Twitter
Subarno Pal
 
Sentiment analysis of Twitter Data
Sentiment analysis of Twitter DataSentiment analysis of Twitter Data
Sentiment analysis of Twitter Data
Nurendra Choudhary
 
Aj35198205
Aj35198205Aj35198205
Aj35198205
IJERA Editor
 
SENTIMENT ANALYSIS OF TWITTER DATA
SENTIMENT ANALYSIS OF TWITTER DATASENTIMENT ANALYSIS OF TWITTER DATA
SENTIMENT ANALYSIS OF TWITTER DATAParvathy Devaraj
 
IRJET - Twitter Sentiment Analysis using Machine Learning
IRJET -  	  Twitter Sentiment Analysis using Machine LearningIRJET -  	  Twitter Sentiment Analysis using Machine Learning
IRJET - Twitter Sentiment Analysis using Machine Learning
IRJET Journal
 
J1803015357
J1803015357J1803015357
J1803015357
IOSR Journals
 
Sentiment Analysis on Twitter Data
Sentiment Analysis on Twitter DataSentiment Analysis on Twitter Data
Sentiment Analysis on Twitter Data
IRJET Journal
 
Sensing Trending Topics in Twitter for Greater Jakarta Area
Sensing Trending Topics in Twitter for Greater Jakarta Area Sensing Trending Topics in Twitter for Greater Jakarta Area
Sensing Trending Topics in Twitter for Greater Jakarta Area
IJECEIAES
 
Sentiment analysis using ml
Sentiment analysis using mlSentiment analysis using ml
Sentiment analysis using ml
Pravin Katiyar
 
Explore the Effects of Emoticons on Twitter Sentiment Analysis
Explore the Effects of Emoticons on Twitter Sentiment Analysis Explore the Effects of Emoticons on Twitter Sentiment Analysis
Explore the Effects of Emoticons on Twitter Sentiment Analysis
csandit
 
IRJET - Implementation of Twitter Sentimental Analysis According to Hash Tag
 IRJET - Implementation of Twitter Sentimental Analysis According to Hash Tag IRJET - Implementation of Twitter Sentimental Analysis According to Hash Tag
IRJET - Implementation of Twitter Sentimental Analysis According to Hash Tag
IRJET Journal
 
IMPROVED SENTIMENT ANALYSIS USING A CUSTOMIZED DISTILBERT NLP CONFIGURATION
IMPROVED SENTIMENT ANALYSIS USING A CUSTOMIZED DISTILBERT NLP CONFIGURATIONIMPROVED SENTIMENT ANALYSIS USING A CUSTOMIZED DISTILBERT NLP CONFIGURATION
IMPROVED SENTIMENT ANALYSIS USING A CUSTOMIZED DISTILBERT NLP CONFIGURATION
adeij1
 
F0363942
F0363942F0363942
F0363942
iosrjournals
 
Modeling Text Independent Speaker Identification with Vector Quantization
Modeling Text Independent Speaker Identification with Vector QuantizationModeling Text Independent Speaker Identification with Vector Quantization
Modeling Text Independent Speaker Identification with Vector Quantization
TELKOMNIKA JOURNAL
 
IRJET- Suspicious Email Detection System
IRJET- Suspicious Email Detection SystemIRJET- Suspicious Email Detection System
IRJET- Suspicious Email Detection System
IRJET Journal
 

What's hot (20)

P1803018289
P1803018289P1803018289
P1803018289
 
IRJET- Twitter Sentimental Analysis for Predicting Election Result using ...
IRJET-  	  Twitter Sentimental Analysis for Predicting Election Result using ...IRJET-  	  Twitter Sentimental Analysis for Predicting Election Result using ...
IRJET- Twitter Sentimental Analysis for Predicting Election Result using ...
 
Sentiment analysis of twitter data
Sentiment analysis of twitter dataSentiment analysis of twitter data
Sentiment analysis of twitter data
 
Supervised Sentiment Classification using DTDP algorithm
Supervised Sentiment Classification using DTDP algorithmSupervised Sentiment Classification using DTDP algorithm
Supervised Sentiment Classification using DTDP algorithm
 
IRJET- Fake News Detection using Logistic Regression
IRJET- Fake News Detection using Logistic RegressionIRJET- Fake News Detection using Logistic Regression
IRJET- Fake News Detection using Logistic Regression
 
Sentiment Analysis on Twitter
Sentiment Analysis on TwitterSentiment Analysis on Twitter
Sentiment Analysis on Twitter
 
Sentiment analysis of Twitter Data
Sentiment analysis of Twitter DataSentiment analysis of Twitter Data
Sentiment analysis of Twitter Data
 
Aj35198205
Aj35198205Aj35198205
Aj35198205
 
SENTIMENT ANALYSIS OF TWITTER DATA
SENTIMENT ANALYSIS OF TWITTER DATASENTIMENT ANALYSIS OF TWITTER DATA
SENTIMENT ANALYSIS OF TWITTER DATA
 
IRJET - Twitter Sentiment Analysis using Machine Learning
IRJET -  	  Twitter Sentiment Analysis using Machine LearningIRJET -  	  Twitter Sentiment Analysis using Machine Learning
IRJET - Twitter Sentiment Analysis using Machine Learning
 
J1803015357
J1803015357J1803015357
J1803015357
 
Sentiment Analysis on Twitter Data
Sentiment Analysis on Twitter DataSentiment Analysis on Twitter Data
Sentiment Analysis on Twitter Data
 
Sensing Trending Topics in Twitter for Greater Jakarta Area
Sensing Trending Topics in Twitter for Greater Jakarta Area Sensing Trending Topics in Twitter for Greater Jakarta Area
Sensing Trending Topics in Twitter for Greater Jakarta Area
 
Sentiment analysis using ml
Sentiment analysis using mlSentiment analysis using ml
Sentiment analysis using ml
 
Explore the Effects of Emoticons on Twitter Sentiment Analysis
Explore the Effects of Emoticons on Twitter Sentiment Analysis Explore the Effects of Emoticons on Twitter Sentiment Analysis
Explore the Effects of Emoticons on Twitter Sentiment Analysis
 
IRJET - Implementation of Twitter Sentimental Analysis According to Hash Tag
 IRJET - Implementation of Twitter Sentimental Analysis According to Hash Tag IRJET - Implementation of Twitter Sentimental Analysis According to Hash Tag
IRJET - Implementation of Twitter Sentimental Analysis According to Hash Tag
 
IMPROVED SENTIMENT ANALYSIS USING A CUSTOMIZED DISTILBERT NLP CONFIGURATION
IMPROVED SENTIMENT ANALYSIS USING A CUSTOMIZED DISTILBERT NLP CONFIGURATIONIMPROVED SENTIMENT ANALYSIS USING A CUSTOMIZED DISTILBERT NLP CONFIGURATION
IMPROVED SENTIMENT ANALYSIS USING A CUSTOMIZED DISTILBERT NLP CONFIGURATION
 
F0363942
F0363942F0363942
F0363942
 
Modeling Text Independent Speaker Identification with Vector Quantization
Modeling Text Independent Speaker Identification with Vector QuantizationModeling Text Independent Speaker Identification with Vector Quantization
Modeling Text Independent Speaker Identification with Vector Quantization
 
IRJET- Suspicious Email Detection System
IRJET- Suspicious Email Detection SystemIRJET- Suspicious Email Detection System
IRJET- Suspicious Email Detection System
 

Viewers also liked

Miguel Sendagorta, Congreso DEC. Casos Lexus
Miguel Sendagorta, Congreso DEC. Casos LexusMiguel Sendagorta, Congreso DEC. Casos Lexus
Miguel Sendagorta, Congreso DEC. Casos Lexus
Asociación DEC
 
N c36 nuestros pastores nos acompañan
N c36 nuestros pastores nos acompañanN c36 nuestros pastores nos acompañan
N c36 nuestros pastores nos acompañan
mangostafeliz
 
(308)long emprendimiento y empleabilidad copia
(308)long  emprendimiento y empleabilidad copia(308)long  emprendimiento y empleabilidad copia
(308)long emprendimiento y empleabilidad copia
ManfredNolte
 
Sexualidad Humana
Sexualidad HumanaSexualidad Humana
Sexualidad Humana
Erick Paz
 
Comunicacion y liderazgo
Comunicacion y liderazgo Comunicacion y liderazgo
Comunicacion y liderazgo
Proyecto Aula
 
Presentación Viernes DEC Iberdrola
Presentación Viernes DEC IberdrolaPresentación Viernes DEC Iberdrola
Presentación Viernes DEC Iberdrola
Asociación DEC
 
Jorge Martínez-Arroyo. Apertura del III Congreso Internacional sobre Custome...
Jorge Martínez-Arroyo.  Apertura del III Congreso Internacional sobre Custome...Jorge Martínez-Arroyo.  Apertura del III Congreso Internacional sobre Custome...
Jorge Martínez-Arroyo. Apertura del III Congreso Internacional sobre Custome...
Asociación DEC
 
Papa Francisco - Liderando con humildad
Papa Francisco - Liderando con humildadPapa Francisco - Liderando con humildad
Papa Francisco - Liderando con humildad
Maynor Mijangos
 
Impacts of cash transfers on schooling
Impacts of cash transfers on schoolingImpacts of cash transfers on schooling
Impacts of cash transfers on schooling
Michelle Mills
 
Gaussian Dictionary for Compressive Sensing of the ECG Signal
Gaussian Dictionary for Compressive Sensing of the ECG SignalGaussian Dictionary for Compressive Sensing of the ECG Signal
Gaussian Dictionary for Compressive Sensing of the ECG Signal
Riccardo Bernardini
 
IT рекрутинг без факапов
IT рекрутинг без факаповIT рекрутинг без факапов
IT рекрутинг без факапов
Viktoriya Pridatko
 

Viewers also liked (11)

Miguel Sendagorta, Congreso DEC. Casos Lexus
Miguel Sendagorta, Congreso DEC. Casos LexusMiguel Sendagorta, Congreso DEC. Casos Lexus
Miguel Sendagorta, Congreso DEC. Casos Lexus
 
N c36 nuestros pastores nos acompañan
N c36 nuestros pastores nos acompañanN c36 nuestros pastores nos acompañan
N c36 nuestros pastores nos acompañan
 
(308)long emprendimiento y empleabilidad copia
(308)long  emprendimiento y empleabilidad copia(308)long  emprendimiento y empleabilidad copia
(308)long emprendimiento y empleabilidad copia
 
Sexualidad Humana
Sexualidad HumanaSexualidad Humana
Sexualidad Humana
 
Comunicacion y liderazgo
Comunicacion y liderazgo Comunicacion y liderazgo
Comunicacion y liderazgo
 
Presentación Viernes DEC Iberdrola
Presentación Viernes DEC IberdrolaPresentación Viernes DEC Iberdrola
Presentación Viernes DEC Iberdrola
 
Jorge Martínez-Arroyo. Apertura del III Congreso Internacional sobre Custome...
Jorge Martínez-Arroyo.  Apertura del III Congreso Internacional sobre Custome...Jorge Martínez-Arroyo.  Apertura del III Congreso Internacional sobre Custome...
Jorge Martínez-Arroyo. Apertura del III Congreso Internacional sobre Custome...
 
Papa Francisco - Liderando con humildad
Papa Francisco - Liderando con humildadPapa Francisco - Liderando con humildad
Papa Francisco - Liderando con humildad
 
Impacts of cash transfers on schooling
Impacts of cash transfers on schoolingImpacts of cash transfers on schooling
Impacts of cash transfers on schooling
 
Gaussian Dictionary for Compressive Sensing of the ECG Signal
Gaussian Dictionary for Compressive Sensing of the ECG SignalGaussian Dictionary for Compressive Sensing of the ECG Signal
Gaussian Dictionary for Compressive Sensing of the ECG Signal
 
IT рекрутинг без факапов
IT рекрутинг без факаповIT рекрутинг без факапов
IT рекрутинг без факапов
 

Similar to Abstract

IRJET- A Review on: Sentiment Polarity Analysis on Twitter Data from Diff...
IRJET-  	  A Review on: Sentiment Polarity Analysis on Twitter Data from Diff...IRJET-  	  A Review on: Sentiment Polarity Analysis on Twitter Data from Diff...
IRJET- A Review on: Sentiment Polarity Analysis on Twitter Data from Diff...
IRJET Journal
 
A Survey on Sentiment Analysis and Opinion Mining
A Survey on Sentiment Analysis and Opinion MiningA Survey on Sentiment Analysis and Opinion Mining
A Survey on Sentiment Analysis and Opinion Mining
IJSRD
 
A Survey on Sentiment Analysis and Opinion Mining
A Survey on Sentiment Analysis and Opinion MiningA Survey on Sentiment Analysis and Opinion Mining
A Survey on Sentiment Analysis and Opinion Mining
IJSRD
 
Analysis Levels And Techniques A Survey
Analysis Levels And Techniques   A SurveyAnalysis Levels And Techniques   A Survey
Analysis Levels And Techniques A Survey
Liz Adams
 
Sentiment Analysis Using Hybrid Approach: A Survey
Sentiment Analysis Using Hybrid Approach: A SurveySentiment Analysis Using Hybrid Approach: A Survey
Sentiment Analysis Using Hybrid Approach: A Survey
IJERA Editor
 
Methods for Sentiment Analysis: A Literature Study
Methods for Sentiment Analysis: A Literature StudyMethods for Sentiment Analysis: A Literature Study
Methods for Sentiment Analysis: A Literature Study
vivatechijri
 
sentiment analysis.pdf
sentiment analysis.pdfsentiment analysis.pdf
sentiment analysis.pdf
manju451965
 
Evaluating and Enhancing Efficiency of Recommendation System using Big Data A...
Evaluating and Enhancing Efficiency of Recommendation System using Big Data A...Evaluating and Enhancing Efficiency of Recommendation System using Big Data A...
Evaluating and Enhancing Efficiency of Recommendation System using Big Data A...
IRJET Journal
 
A Review of machine learning approaches to mine Social Choice of voters.
A Review of machine learning approaches to mine Social Choice of voters.A Review of machine learning approaches to mine Social Choice of voters.
A Review of machine learning approaches to mine Social Choice of voters.
IRJET Journal
 
Sentiment Analysis and Classification of Tweets using Data Mining
Sentiment Analysis and Classification of Tweets using Data MiningSentiment Analysis and Classification of Tweets using Data Mining
Sentiment Analysis and Classification of Tweets using Data Mining
IRJET Journal
 
Multi-Tier Sentiment Analysis System in Big Data Environment
Multi-Tier Sentiment Analysis System in Big Data EnvironmentMulti-Tier Sentiment Analysis System in Big Data Environment
Multi-Tier Sentiment Analysis System in Big Data Environment
IJCSIS Research Publications
 
An Approach To Sentiment Analysis
An Approach To Sentiment AnalysisAn Approach To Sentiment Analysis
An Approach To Sentiment Analysis
Sarah Morrow
 
APPROXIMATE ANALYTICAL SOLUTION OF NON-LINEAR BOUSSINESQ EQUATION FOR THE UNS...
APPROXIMATE ANALYTICAL SOLUTION OF NON-LINEAR BOUSSINESQ EQUATION FOR THE UNS...APPROXIMATE ANALYTICAL SOLUTION OF NON-LINEAR BOUSSINESQ EQUATION FOR THE UNS...
APPROXIMATE ANALYTICAL SOLUTION OF NON-LINEAR BOUSSINESQ EQUATION FOR THE UNS...
mathsjournal
 
FEATURE SELECTION AND CLASSIFICATION APPROACH FOR SENTIMENT ANALYSIS
FEATURE SELECTION AND CLASSIFICATION APPROACH FOR SENTIMENT ANALYSISFEATURE SELECTION AND CLASSIFICATION APPROACH FOR SENTIMENT ANALYSIS
FEATURE SELECTION AND CLASSIFICATION APPROACH FOR SENTIMENT ANALYSIS
mlaij
 
Opinion mining on newspaper headlines using SVM and NLP
Opinion mining on newspaper headlines using SVM and NLPOpinion mining on newspaper headlines using SVM and NLP
Opinion mining on newspaper headlines using SVM and NLP
IJECEIAES
 
Combining Lexicon based and Machine Learning based Methods for Twitter Sentim...
Combining Lexicon based and Machine Learning based Methods for Twitter Sentim...Combining Lexicon based and Machine Learning based Methods for Twitter Sentim...
Combining Lexicon based and Machine Learning based Methods for Twitter Sentim...
IRJET Journal
 
Svm and maximum entropy model for sentiment analysis of tweets
Svm and maximum entropy model for sentiment analysis of tweetsSvm and maximum entropy model for sentiment analysis of tweets
Svm and maximum entropy model for sentiment analysis of tweets
S M Raju
 
76201960
7620196076201960
76201960
IJRAT
 
A SURVEY OF SENTIMENT CLASSSIFICTION TECHNIQUES
A SURVEY OF SENTIMENT CLASSSIFICTION TECHNIQUESA SURVEY OF SENTIMENT CLASSSIFICTION TECHNIQUES
A SURVEY OF SENTIMENT CLASSSIFICTION TECHNIQUES
Journal For Research
 

Similar to Abstract (20)

IRJET- A Review on: Sentiment Polarity Analysis on Twitter Data from Diff...
IRJET-  	  A Review on: Sentiment Polarity Analysis on Twitter Data from Diff...IRJET-  	  A Review on: Sentiment Polarity Analysis on Twitter Data from Diff...
IRJET- A Review on: Sentiment Polarity Analysis on Twitter Data from Diff...
 
A Survey on Sentiment Analysis and Opinion Mining
A Survey on Sentiment Analysis and Opinion MiningA Survey on Sentiment Analysis and Opinion Mining
A Survey on Sentiment Analysis and Opinion Mining
 
A Survey on Sentiment Analysis and Opinion Mining
A Survey on Sentiment Analysis and Opinion MiningA Survey on Sentiment Analysis and Opinion Mining
A Survey on Sentiment Analysis and Opinion Mining
 
Analysis Levels And Techniques A Survey
Analysis Levels And Techniques   A SurveyAnalysis Levels And Techniques   A Survey
Analysis Levels And Techniques A Survey
 
Sentiment Analysis Using Hybrid Approach: A Survey
Sentiment Analysis Using Hybrid Approach: A SurveySentiment Analysis Using Hybrid Approach: A Survey
Sentiment Analysis Using Hybrid Approach: A Survey
 
Methods for Sentiment Analysis: A Literature Study
Methods for Sentiment Analysis: A Literature StudyMethods for Sentiment Analysis: A Literature Study
Methods for Sentiment Analysis: A Literature Study
 
sentiment analysis.pdf
sentiment analysis.pdfsentiment analysis.pdf
sentiment analysis.pdf
 
Evaluating and Enhancing Efficiency of Recommendation System using Big Data A...
Evaluating and Enhancing Efficiency of Recommendation System using Big Data A...Evaluating and Enhancing Efficiency of Recommendation System using Big Data A...
Evaluating and Enhancing Efficiency of Recommendation System using Big Data A...
 
A Review of machine learning approaches to mine Social Choice of voters.
A Review of machine learning approaches to mine Social Choice of voters.A Review of machine learning approaches to mine Social Choice of voters.
A Review of machine learning approaches to mine Social Choice of voters.
 
Sentiment Analysis and Classification of Tweets using Data Mining
Sentiment Analysis and Classification of Tweets using Data MiningSentiment Analysis and Classification of Tweets using Data Mining
Sentiment Analysis and Classification of Tweets using Data Mining
 
Multi-Tier Sentiment Analysis System in Big Data Environment
Multi-Tier Sentiment Analysis System in Big Data EnvironmentMulti-Tier Sentiment Analysis System in Big Data Environment
Multi-Tier Sentiment Analysis System in Big Data Environment
 
An Approach To Sentiment Analysis
An Approach To Sentiment AnalysisAn Approach To Sentiment Analysis
An Approach To Sentiment Analysis
 
APPROXIMATE ANALYTICAL SOLUTION OF NON-LINEAR BOUSSINESQ EQUATION FOR THE UNS...
APPROXIMATE ANALYTICAL SOLUTION OF NON-LINEAR BOUSSINESQ EQUATION FOR THE UNS...APPROXIMATE ANALYTICAL SOLUTION OF NON-LINEAR BOUSSINESQ EQUATION FOR THE UNS...
APPROXIMATE ANALYTICAL SOLUTION OF NON-LINEAR BOUSSINESQ EQUATION FOR THE UNS...
 
FEATURE SELECTION AND CLASSIFICATION APPROACH FOR SENTIMENT ANALYSIS
FEATURE SELECTION AND CLASSIFICATION APPROACH FOR SENTIMENT ANALYSISFEATURE SELECTION AND CLASSIFICATION APPROACH FOR SENTIMENT ANALYSIS
FEATURE SELECTION AND CLASSIFICATION APPROACH FOR SENTIMENT ANALYSIS
 
Opinion mining on newspaper headlines using SVM and NLP
Opinion mining on newspaper headlines using SVM and NLPOpinion mining on newspaper headlines using SVM and NLP
Opinion mining on newspaper headlines using SVM and NLP
 
Combining Lexicon based and Machine Learning based Methods for Twitter Sentim...
Combining Lexicon based and Machine Learning based Methods for Twitter Sentim...Combining Lexicon based and Machine Learning based Methods for Twitter Sentim...
Combining Lexicon based and Machine Learning based Methods for Twitter Sentim...
 
Svm and maximum entropy model for sentiment analysis of tweets
Svm and maximum entropy model for sentiment analysis of tweetsSvm and maximum entropy model for sentiment analysis of tweets
Svm and maximum entropy model for sentiment analysis of tweets
 
76201960
7620196076201960
76201960
 
NLP Ecosystem
NLP EcosystemNLP Ecosystem
NLP Ecosystem
 
A SURVEY OF SENTIMENT CLASSSIFICTION TECHNIQUES
A SURVEY OF SENTIMENT CLASSSIFICTION TECHNIQUESA SURVEY OF SENTIMENT CLASSSIFICTION TECHNIQUES
A SURVEY OF SENTIMENT CLASSSIFICTION TECHNIQUES
 

Recently uploaded

DESIGN A COTTON SEED SEPARATION MACHINE.docx
DESIGN A COTTON SEED SEPARATION MACHINE.docxDESIGN A COTTON SEED SEPARATION MACHINE.docx
DESIGN A COTTON SEED SEPARATION MACHINE.docx
FluxPrime1
 
Final project report on grocery store management system..pdf
Final project report on grocery store management system..pdfFinal project report on grocery store management system..pdf
Final project report on grocery store management system..pdf
Kamal Acharya
 
AP LAB PPT.pdf ap lab ppt no title specific
AP LAB PPT.pdf ap lab ppt no title specificAP LAB PPT.pdf ap lab ppt no title specific
AP LAB PPT.pdf ap lab ppt no title specific
BrazilAccount1
 
Fundamentals of Electric Drives and its applications.pptx
Fundamentals of Electric Drives and its applications.pptxFundamentals of Electric Drives and its applications.pptx
Fundamentals of Electric Drives and its applications.pptx
manasideore6
 
Recycled Concrete Aggregate in Construction Part III
Recycled Concrete Aggregate in Construction Part IIIRecycled Concrete Aggregate in Construction Part III
Recycled Concrete Aggregate in Construction Part III
Aditya Rajan Patra
 
CME397 Surface Engineering- Professional Elective
CME397 Surface Engineering- Professional ElectiveCME397 Surface Engineering- Professional Elective
CME397 Surface Engineering- Professional Elective
karthi keyan
 
ML for identifying fraud using open blockchain data.pptx
ML for identifying fraud using open blockchain data.pptxML for identifying fraud using open blockchain data.pptx
ML for identifying fraud using open blockchain data.pptx
Vijay Dialani, PhD
 
road safety engineering r s e unit 3.pdf
road safety engineering  r s e unit 3.pdfroad safety engineering  r s e unit 3.pdf
road safety engineering r s e unit 3.pdf
VENKATESHvenky89705
 
Investor-Presentation-Q1FY2024 investor presentation document.pptx
Investor-Presentation-Q1FY2024 investor presentation document.pptxInvestor-Presentation-Q1FY2024 investor presentation document.pptx
Investor-Presentation-Q1FY2024 investor presentation document.pptx
AmarGB2
 
HYDROPOWER - Hydroelectric power generation
HYDROPOWER - Hydroelectric power generationHYDROPOWER - Hydroelectric power generation
HYDROPOWER - Hydroelectric power generation
Robbie Edward Sayers
 
Forklift Classes Overview by Intella Parts
Forklift Classes Overview by Intella PartsForklift Classes Overview by Intella Parts
Forklift Classes Overview by Intella Parts
Intella Parts
 
Sachpazis:Terzaghi Bearing Capacity Estimation in simple terms with Calculati...
Sachpazis:Terzaghi Bearing Capacity Estimation in simple terms with Calculati...Sachpazis:Terzaghi Bearing Capacity Estimation in simple terms with Calculati...
Sachpazis:Terzaghi Bearing Capacity Estimation in simple terms with Calculati...
Dr.Costas Sachpazis
 
Design and Analysis of Algorithms-DP,Backtracking,Graphs,B&B
Design and Analysis of Algorithms-DP,Backtracking,Graphs,B&BDesign and Analysis of Algorithms-DP,Backtracking,Graphs,B&B
Design and Analysis of Algorithms-DP,Backtracking,Graphs,B&B
Sreedhar Chowdam
 
block diagram and signal flow graph representation
block diagram and signal flow graph representationblock diagram and signal flow graph representation
block diagram and signal flow graph representation
Divya Somashekar
 
Unbalanced Three Phase Systems and circuits.pptx
Unbalanced Three Phase Systems and circuits.pptxUnbalanced Three Phase Systems and circuits.pptx
Unbalanced Three Phase Systems and circuits.pptx
ChristineTorrepenida1
 
NUMERICAL SIMULATIONS OF HEAT AND MASS TRANSFER IN CONDENSING HEAT EXCHANGERS...
NUMERICAL SIMULATIONS OF HEAT AND MASS TRANSFER IN CONDENSING HEAT EXCHANGERS...NUMERICAL SIMULATIONS OF HEAT AND MASS TRANSFER IN CONDENSING HEAT EXCHANGERS...
NUMERICAL SIMULATIONS OF HEAT AND MASS TRANSFER IN CONDENSING HEAT EXCHANGERS...
ssuser7dcef0
 
DfMAy 2024 - key insights and contributions
DfMAy 2024 - key insights and contributionsDfMAy 2024 - key insights and contributions
DfMAy 2024 - key insights and contributions
gestioneergodomus
 
在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样
在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样
在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样
obonagu
 
6th International Conference on Machine Learning & Applications (CMLA 2024)
6th International Conference on Machine Learning & Applications (CMLA 2024)6th International Conference on Machine Learning & Applications (CMLA 2024)
6th International Conference on Machine Learning & Applications (CMLA 2024)
ClaraZara1
 
Heap Sort (SS).ppt FOR ENGINEERING GRADUATES, BCA, MCA, MTECH, BSC STUDENTS
Heap Sort (SS).ppt FOR ENGINEERING GRADUATES, BCA, MCA, MTECH, BSC STUDENTSHeap Sort (SS).ppt FOR ENGINEERING GRADUATES, BCA, MCA, MTECH, BSC STUDENTS
Heap Sort (SS).ppt FOR ENGINEERING GRADUATES, BCA, MCA, MTECH, BSC STUDENTS
Soumen Santra
 

Recently uploaded (20)

DESIGN A COTTON SEED SEPARATION MACHINE.docx
DESIGN A COTTON SEED SEPARATION MACHINE.docxDESIGN A COTTON SEED SEPARATION MACHINE.docx
DESIGN A COTTON SEED SEPARATION MACHINE.docx
 
Final project report on grocery store management system..pdf
Final project report on grocery store management system..pdfFinal project report on grocery store management system..pdf
Final project report on grocery store management system..pdf
 
AP LAB PPT.pdf ap lab ppt no title specific
AP LAB PPT.pdf ap lab ppt no title specificAP LAB PPT.pdf ap lab ppt no title specific
AP LAB PPT.pdf ap lab ppt no title specific
 
Fundamentals of Electric Drives and its applications.pptx
Fundamentals of Electric Drives and its applications.pptxFundamentals of Electric Drives and its applications.pptx
Fundamentals of Electric Drives and its applications.pptx
 
Recycled Concrete Aggregate in Construction Part III
Recycled Concrete Aggregate in Construction Part IIIRecycled Concrete Aggregate in Construction Part III
Recycled Concrete Aggregate in Construction Part III
 
CME397 Surface Engineering- Professional Elective
CME397 Surface Engineering- Professional ElectiveCME397 Surface Engineering- Professional Elective
CME397 Surface Engineering- Professional Elective
 
ML for identifying fraud using open blockchain data.pptx
ML for identifying fraud using open blockchain data.pptxML for identifying fraud using open blockchain data.pptx
ML for identifying fraud using open blockchain data.pptx
 
road safety engineering r s e unit 3.pdf
road safety engineering  r s e unit 3.pdfroad safety engineering  r s e unit 3.pdf
road safety engineering r s e unit 3.pdf
 
Investor-Presentation-Q1FY2024 investor presentation document.pptx
Investor-Presentation-Q1FY2024 investor presentation document.pptxInvestor-Presentation-Q1FY2024 investor presentation document.pptx
Investor-Presentation-Q1FY2024 investor presentation document.pptx
 
HYDROPOWER - Hydroelectric power generation
HYDROPOWER - Hydroelectric power generationHYDROPOWER - Hydroelectric power generation
HYDROPOWER - Hydroelectric power generation
 
Forklift Classes Overview by Intella Parts
Forklift Classes Overview by Intella PartsForklift Classes Overview by Intella Parts
Forklift Classes Overview by Intella Parts
 
Sachpazis:Terzaghi Bearing Capacity Estimation in simple terms with Calculati...
Sachpazis:Terzaghi Bearing Capacity Estimation in simple terms with Calculati...Sachpazis:Terzaghi Bearing Capacity Estimation in simple terms with Calculati...
Sachpazis:Terzaghi Bearing Capacity Estimation in simple terms with Calculati...
 
Design and Analysis of Algorithms-DP,Backtracking,Graphs,B&B
Design and Analysis of Algorithms-DP,Backtracking,Graphs,B&BDesign and Analysis of Algorithms-DP,Backtracking,Graphs,B&B
Design and Analysis of Algorithms-DP,Backtracking,Graphs,B&B
 
block diagram and signal flow graph representation
block diagram and signal flow graph representationblock diagram and signal flow graph representation
block diagram and signal flow graph representation
 
Unbalanced Three Phase Systems and circuits.pptx
Unbalanced Three Phase Systems and circuits.pptxUnbalanced Three Phase Systems and circuits.pptx
Unbalanced Three Phase Systems and circuits.pptx
 
NUMERICAL SIMULATIONS OF HEAT AND MASS TRANSFER IN CONDENSING HEAT EXCHANGERS...
NUMERICAL SIMULATIONS OF HEAT AND MASS TRANSFER IN CONDENSING HEAT EXCHANGERS...NUMERICAL SIMULATIONS OF HEAT AND MASS TRANSFER IN CONDENSING HEAT EXCHANGERS...
NUMERICAL SIMULATIONS OF HEAT AND MASS TRANSFER IN CONDENSING HEAT EXCHANGERS...
 
DfMAy 2024 - key insights and contributions
DfMAy 2024 - key insights and contributionsDfMAy 2024 - key insights and contributions
DfMAy 2024 - key insights and contributions
 
在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样
在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样
在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样
 
6th International Conference on Machine Learning & Applications (CMLA 2024)
6th International Conference on Machine Learning & Applications (CMLA 2024)6th International Conference on Machine Learning & Applications (CMLA 2024)
6th International Conference on Machine Learning & Applications (CMLA 2024)
 
Heap Sort (SS).ppt FOR ENGINEERING GRADUATES, BCA, MCA, MTECH, BSC STUDENTS
Heap Sort (SS).ppt FOR ENGINEERING GRADUATES, BCA, MCA, MTECH, BSC STUDENTSHeap Sort (SS).ppt FOR ENGINEERING GRADUATES, BCA, MCA, MTECH, BSC STUDENTS
Heap Sort (SS).ppt FOR ENGINEERING GRADUATES, BCA, MCA, MTECH, BSC STUDENTS
 

Abstract

  • 1. TITLE Twitter Sentiment Analysis using Various Classification Algorithms Abstract Twitter is a web application to determine online news and social networking service where users post and interact with messages, anywhere in the world. Twitter posts are generally short (140 characters) and generated continuously by public which is well suited for opinion mining. Twitter messages can be classified either in positive or negative sentiment based on certain aspects with respect to term based query. The past studies of sentiment classification are not very conclusive about which features and supervised classification algorithms are good for designing accurate and efficient sentiment classification system. We propose to combine many feature extraction techniques like emoticons, exclamation and question mark symbol, word gazetteer, unigrams to design more accurate sentiment classification system. Keywords Twitter; Sentiment Analysis; Opinion Mining; Natural Language Processing Introduction Human decision making is extensively influenced by assessment or judgement of others. Before making any move, customers tend to gather as much information as possible about the product they want to buy. The investors analyse and predict the stock market movement of a company based on its popularity among its customers be investing their money in its shares. With the advent development of social media, gathering data for evaluation become easier and less time consuming. Different platform like Twitter, Facebook, Linked In serve as repositories of useful data in terms of reviews, likes, comments etc. Opinions are linked to almost all human activities because they have key impact on our decision making. We mostly seek others opinions while taking any decisions. In the real world, organizations and business entities are always willing to know public and general opinions about their services and products. On the other hand, consumers also seek the opinions of existing users of a product or service before making a decision to purchase products and subscribing to services. Opinions of public about political candidates can be analysed to forecast results of an election. In the past, organizations, governments and business entities used to conduct surveys and opinion polls on focused groups for obtaining citizen opinions and their sentiments [1]. Twitter is a social networking web application with microblogging feature that has a large and constantly growing user data-base. Thus, the application provides a rich data set in the form of
  • 2. messages that are usually short status updates from Twitter application users that must be expressed in not more than 140 characters in length. On Twitter, data that consists of millions of short messages and user status updates are generated each day on about hundreds of different topics. The task of extracting data from these small texts has become immensely useful for sorting and ranking popularity of topics mentioned within the updates. Nowadays twitter has emerged as one of the most popular platforms for expressing sentiments and thoughts on Internet. It is very useful and obvious to mine and analyse Twitter data for interesting information regarding major trending topics in the media and other spaces. Methodology Twitter Sentiment Analysis is generally divided into 3 major categories that is 1. Machine Learning Approach 2. Lexicon Based Approach 3. Hybrid Approach The Machine Learning Approach (ML) uses linguistic features and applies well known Machine Learning algorithms. The Lexicon based approach is driven by a opinion lexicon, which is nothing but a collection of pre-compiled opinion terms. It is mainly divided into two main approaches that is a) Dictionary based approach b) Corpus Based approach The Hybrid Approach combines the above two approaches. To increase the performance and efficiency of sentiment classification system the combination of well-known features extraction methods is considered. The proposed method compares 6 supervised classification algorithms that is a) Naïve Bayes Algorithm b) Bayes Net Algorithm c) Discriminative Multinomial Naïve Bayes(DMNB) Algorithm d) Sequential Minimal Optimization (SMO) Algorithm e) Hyperpipes Algorithm f) Random Forest Algorithm 1) Naïve Bayes(NB): This algorithm is a probabilistic classifier in a simple form that counts the combinations of values and frequency in a data set under consideration and calculates probabilities set. Bayes theorem is the base of this algorithm and assumes that all the attributes are completely independent against a set value of the class variable.
  • 3. 2) Bayes Net (BN): Bayesian nets (BN) are a network-based system that are mainly used for analysing and representing the models that involves uncertainty. Bayesian networks learns the causal relationships and use it to implement incremental learning. To perform classification, first the input nodes must be set with the evidence and then the output nodes can be queried and analysed using standard Bayesian network inference. 3) Discriminative Multinominal Naive Bayes (DMNB): The multinomial Naive Bayes is a well-known and widely used classifier for classification of documents and tested to yield satisfactory performance. Discriminative multinomial Naïve Bayes (DMNB) takes a document and consider it as a bag-of-words. For each class c, P(w|c), the training data is unitized to estimate the probability of observing the word w against the given class. It works on the collection of training documents of the particular class by calculating each word’s relative occurrence frequency. The classifier also needs the prior probability, Pc) which is intuitive to estimate. If the word w occurs nwd number of times in document d, then given a document under test the probability of the class c is calculated in the following manner 4) SMO: Sequential Minimal Optimization (SMO) method is generally used in the training process of Support Vector Machines (SVM) classification algorithm. SMO algorithm consists of many optimizations designed primarily to increase the analysis performance of large datasets. It is designed to ensure that the algorithm converges with results even in degenerate conditions. It works by breaking up a problem into a set of atomic sub-problems, which are solved using analytical approach 5) Hyperpipes: Hyperpipes is a technique that creates a “hyperpipe” for each class of a data set. These Classes are the collections of data build around single object template. it can work extremely fast and effectively. 6) Random Forest: Many trees are produced by this algorithm for classification process. It classifies new object from an input vector by setting the vector against the forest on each of the trees. A classification is generated by each tree. In other words, that class is voted by the tree. The classification having the most votes is chosen by the random forest method across all the trees. It also runs efficiently on large datasets. Results Obtained The six selected classification algorithms were executed on features extracted from Sanders Twitter dataset on Weka tool. by configuring it with 10-fold cross validation flag building and testing of the system is carried out. Simulation results in empirical form are presented in Tables 1-9.
  • 4. False Positive Rate (FPR), True Positive Rate (TPR), Precision (P), recall (R), F-score (F), and Receiver Operating Characteristic values (ROC) are shown in the following tables. Table 1: Naïve Bayes Result Table 2: Bayes Net Results Table 3: Discriminative Multinominal Naive Bayes(DMNB) Results
  • 5. Table 4: Sequential Minimal Optimization (SMO) Results Table 5: Hyperpipes Results Table 6: Random Forest Results
  • 6. Performance and Results Comparison Based on simulation results, the performance of Naive Bayes algorithm is least in comparison of all six algorithms considered in this study. In general, precision and recall scores are sufficiently low against the Positive and Negative classes. This is due to large number of instances in the class ‘other’ in comparison of positive and negative classes. The considered Sanders dataset is highly imbalanced. Overall, the two most balanced and well-performing algorithms are DMNB and SMO, with overall F-scores of 0.769 and 0.75 respectively. Fig 1: Precision Comparison Fig 2: Recall Comparison
  • 7. Fig 3: F-Measure Comparison
  • 8. References [1] Medhat, Walaa, Ahmed Hassan, and Hoda Korashy. "Sentiment analysis algorithms and applications: A survey." Ain Shams Engineering Journal 5.4 (2014): 1093-1113. [2] Liu, Bing. "Sentiment analysis and opinion mining." Synthesis lectures on human language technologies 5.1 (2012): 1-167. [3] Agarwal, Apoorv, et al. "Sentiment analysis of twitter data." Proceedings of the workshop on languages in social media. Association for Computational Linguistics, 2011. [4] Imran, Muhammad, et al. "Processing social media messages in mass emergency: A survey." ACM Computing Surveys (CSUR) 47.4 (2015): 67. [5] Feldman, Ronen. "Techniques and applications for sentiment analysis, “Communications of the ACM 56.4 (2013): 82-89. [6] Pang, Bo, and Lillian Lee. “Opinion mining and sentiment analysis. “Foundations and trends in information retrieval 2.1-2 (2008): 1-135. [7] Cambria, Erik, et al. “New avenues in opinion mining and sentiment analysis.” IEEE Intelligent Systems 28.2 (2013): 15- 21. [8] Witten, Ian H., and Eibe Frank. Data Mining: Practical machine learning tools and techniques. Morgan Kaufmann, 2005. [9] Bifet, Albert, and Eibe Frank. "Sentiment knowledge discovery in twitter streaming data." International Conference on Discovery Science. Springer Berlin Heidelberg, 2010. [10] Saif, Hassan, Yulan He, and Harith Alani. "Semantic sentiment analysis of twitter. International Semantic Web Conference. Springer Berlin Heidelberg, 2012.