• Email
  • Like
  • Save
  • Private Content
  • Embed
 

Alleviating Data Sparsity for Twitter Sentiment Analysis

by on Apr 17, 2012

  • 2,974 views

Twitter has brought much attention recently as a hot research topic in the domain of sentiment analysis. Training sentiment classifiers from tweets data often faces the data sparsity problem partly ...

Twitter has brought much attention recently as a hot research topic in the domain of sentiment analysis. Training sentiment classifiers from tweets data often faces the data sparsity problem partly due to the large variety of short and irregular forms introduced to tweets because of the 140-character limit. In this work we propose using two different sets of features to alleviate the data sparseness problem. One is the semantic feature set where we extract semantically hidden concepts from tweets and then incorporate them into classifier training through interpolation. Another is the sentiment-topic feature set where we extract latent topics and the associated topic sentiment from tweets, then augment the original feature space with these sentiment-topics. Experimental results on the Stanford Twitter Sentiment Dataset show that both feature sets outperform the baseline model using unigrams only. Moreover, using semantic features rivals the previously reported best result. Using sentiment-topic features achieves 86.3% sentiment classification accuracy, which outperforms existing approaches.

Accessibility

Categories

Upload Details

Uploaded via SlideShare as Microsoft PowerPoint

Usage Rights

© All Rights Reserved

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate. If needed, use the feedback form to let us know more details.

Cancel

4 Embeds 1,444

http://www.scoop.it 756
http://kmi.open.ac.uk 673
http://people.kmi.open.ac.uk 14
http://kmi.org 1

Statistics

Likes
2
Downloads
42
Comments
0
Embed Views
1,444
Views on SlideShare
1,530
Total Views
2,974
Post Comment
Edit your comment

Alleviating Data Sparsity for Twitter Sentiment Analysis Alleviating Data Sparsity for Twitter Sentiment Analysis Presentation Transcript