SlideShare a Scribd company logo
1 of 9
Download to read offline
A CLUSTERING ANALYSIS OF
TWEET LENGTH AND ITS
RELATION TO SENTIMENT
Research Project
Matthew Mayo
School of Computer Science
Columbus State University
Columbus, GA
CPSC 6185 Intelligent Agents
Dr. Rania Hodhod
Twitter
• Popular microblogging web service
• 140 character per message (tweet) limit
• Started in 2006, over 645 million users today*
• 58 million tweets per day*
• 9,000 tweets per minute*
* Source: www.statisticbrain.com/twitter-statistics
Sentiment Analysis
• Identifying, extracting & processing subjective
information from source material
• Subjective information includes attitudes,
emotions & opinions
• Appropriate for binary classification (positive vs.
negative, good vs. bad, etc.)
• Useful for movie reviews, political election
opinions, etc.
Project Aim
Interested in exploring the relationship between:
• Length of tweet (number of characters)
AND
• Sentiment score of tweet
Problem Description
The research project tasks:
1. Capture Twitter data
2. Build custom sentiment dictionary
3. Process tweets
4. Create dataset
5. Cluster tweet data
Methodology
● Custom Python scripts to capture and process live
tweets over 4 week schedule
● Use k-means clustering in Weka to look for
natural sentiment patterns
● Any correlation between length of tweet and its
sentiment (positive/negative/neutral)?
Results
Sentiment scores of shorter tweets appear more
tightly-centered around their cluster’s centroid
Longer tweets become less-centered on the
applicable centroid
As the number of characters in a tweet would lead
to a greater number of terms, which would
increase the chances of terms being assigned a
score, this seems intuitive
Results

More Related Content

Similar to project-presentation

Opinion mining for social media
Opinion mining for social mediaOpinion mining for social media
Opinion mining for social mediaDiana Maynard
 
Open Analytics: Building Effective Frameworks for Social Media Analysis
Open Analytics: Building Effective Frameworks for Social Media AnalysisOpen Analytics: Building Effective Frameworks for Social Media Analysis
Open Analytics: Building Effective Frameworks for Social Media Analysisikanow
 
Sentiment tool Project presentaion
Sentiment tool Project presentaionSentiment tool Project presentaion
Sentiment tool Project presentaionRavindra Chaudhary
 
Twitter Sentiment Prediction.pptx
Twitter Sentiment Prediction.pptxTwitter Sentiment Prediction.pptx
Twitter Sentiment Prediction.pptxKrishnesh Pujari
 
final_nlp
final_nlpfinal_nlp
final_nlpaphex34
 
Adressing Volume and Velocity Challenge on the Social Web using Crowd Sourced...
Adressing Volume and Velocity Challenge on the Social Web using Crowd Sourced...Adressing Volume and Velocity Challenge on the Social Web using Crowd Sourced...
Adressing Volume and Velocity Challenge on the Social Web using Crowd Sourced...Pavan Kapanipathi
 
Retweet Prediction with Attention-based Deep Neural Network
Retweet Prediction with Attention-based Deep Neural NetworkRetweet Prediction with Attention-based Deep Neural Network
Retweet Prediction with Attention-based Deep Neural NetworkGUANGYUAN PIAO
 
Twitter Sentiment Analysis
Twitter Sentiment AnalysisTwitter Sentiment Analysis
Twitter Sentiment Analysisijtsrd
 
Social media analytics
Social media analyticsSocial media analytics
Social media analyticsJithu Pettan
 
Twitter sentiment analysis using Azure NLP
Twitter sentiment analysis using Azure NLP Twitter sentiment analysis using Azure NLP
Twitter sentiment analysis using Azure NLP Olusola Amusan
 
New Methodologies for Capturing and Working with Publicly Available Twitter Data
New Methodologies for Capturing and Working with Publicly Available Twitter DataNew Methodologies for Capturing and Working with Publicly Available Twitter Data
New Methodologies for Capturing and Working with Publicly Available Twitter DataAxel Bruns
 
SampleLiteratureReviewTemplate_IVBTechIISEM_MajorProject.pptx
SampleLiteratureReviewTemplate_IVBTechIISEM_MajorProject.pptxSampleLiteratureReviewTemplate_IVBTechIISEM_MajorProject.pptx
SampleLiteratureReviewTemplate_IVBTechIISEM_MajorProject.pptx20211a05p7
 
Harnessing Volume and Velocity Challenge on the Social Web using Crowd-Source...
Harnessing Volume and Velocity Challenge on the Social Web using Crowd-Source...Harnessing Volume and Velocity Challenge on the Social Web using Crowd-Source...
Harnessing Volume and Velocity Challenge on the Social Web using Crowd-Source...Artificial Intelligence Institute at UofSC
 
Twitter Sentiment Analysis.pdf
Twitter Sentiment Analysis.pdfTwitter Sentiment Analysis.pdf
Twitter Sentiment Analysis.pdfRachanasamal3
 
IRJET- Twitter Sentimental Analysis for Predicting Election Result using ...
IRJET-  	  Twitter Sentimental Analysis for Predicting Election Result using ...IRJET-  	  Twitter Sentimental Analysis for Predicting Election Result using ...
IRJET- Twitter Sentimental Analysis for Predicting Election Result using ...IRJET Journal
 
Designing Big Content - Search Exchange 2013
Designing Big Content - Search Exchange 2013Designing Big Content - Search Exchange 2013
Designing Big Content - Search Exchange 2013Brian_Chappell
 
Creating Killer Content for Social Media
Creating Killer Content for Social MediaCreating Killer Content for Social Media
Creating Killer Content for Social MediaGame Day Communications
 
Towards identifying Collaborative Learning groups using Social Media
Towards identifying Collaborative Learning groups using Social MediaTowards identifying Collaborative Learning groups using Social Media
Towards identifying Collaborative Learning groups using Social MediaSelver Softic
 

Similar to project-presentation (20)

Opinion mining for social media
Opinion mining for social mediaOpinion mining for social media
Opinion mining for social media
 
Open Analytics: Building Effective Frameworks for Social Media Analysis
Open Analytics: Building Effective Frameworks for Social Media AnalysisOpen Analytics: Building Effective Frameworks for Social Media Analysis
Open Analytics: Building Effective Frameworks for Social Media Analysis
 
Sentiment tool Project presentaion
Sentiment tool Project presentaionSentiment tool Project presentaion
Sentiment tool Project presentaion
 
Twitter Sentiment Prediction.pptx
Twitter Sentiment Prediction.pptxTwitter Sentiment Prediction.pptx
Twitter Sentiment Prediction.pptx
 
final_nlp
final_nlpfinal_nlp
final_nlp
 
Adressing Volume and Velocity Challenge on the Social Web using Crowd Sourced...
Adressing Volume and Velocity Challenge on the Social Web using Crowd Sourced...Adressing Volume and Velocity Challenge on the Social Web using Crowd Sourced...
Adressing Volume and Velocity Challenge on the Social Web using Crowd Sourced...
 
Retweet Prediction with Attention-based Deep Neural Network
Retweet Prediction with Attention-based Deep Neural NetworkRetweet Prediction with Attention-based Deep Neural Network
Retweet Prediction with Attention-based Deep Neural Network
 
Twitter Sentiment Analysis
Twitter Sentiment AnalysisTwitter Sentiment Analysis
Twitter Sentiment Analysis
 
Social media analytics
Social media analyticsSocial media analytics
Social media analytics
 
CS-IS 027
CS-IS 027CS-IS 027
CS-IS 027
 
Twitter sentiment analysis using Azure NLP
Twitter sentiment analysis using Azure NLP Twitter sentiment analysis using Azure NLP
Twitter sentiment analysis using Azure NLP
 
New Methodologies for Capturing and Working with Publicly Available Twitter Data
New Methodologies for Capturing and Working with Publicly Available Twitter DataNew Methodologies for Capturing and Working with Publicly Available Twitter Data
New Methodologies for Capturing and Working with Publicly Available Twitter Data
 
E017433538
E017433538E017433538
E017433538
 
SampleLiteratureReviewTemplate_IVBTechIISEM_MajorProject.pptx
SampleLiteratureReviewTemplate_IVBTechIISEM_MajorProject.pptxSampleLiteratureReviewTemplate_IVBTechIISEM_MajorProject.pptx
SampleLiteratureReviewTemplate_IVBTechIISEM_MajorProject.pptx
 
Harnessing Volume and Velocity Challenge on the Social Web using Crowd-Source...
Harnessing Volume and Velocity Challenge on the Social Web using Crowd-Source...Harnessing Volume and Velocity Challenge on the Social Web using Crowd-Source...
Harnessing Volume and Velocity Challenge on the Social Web using Crowd-Source...
 
Twitter Sentiment Analysis.pdf
Twitter Sentiment Analysis.pdfTwitter Sentiment Analysis.pdf
Twitter Sentiment Analysis.pdf
 
IRJET- Twitter Sentimental Analysis for Predicting Election Result using ...
IRJET-  	  Twitter Sentimental Analysis for Predicting Election Result using ...IRJET-  	  Twitter Sentimental Analysis for Predicting Election Result using ...
IRJET- Twitter Sentimental Analysis for Predicting Election Result using ...
 
Designing Big Content - Search Exchange 2013
Designing Big Content - Search Exchange 2013Designing Big Content - Search Exchange 2013
Designing Big Content - Search Exchange 2013
 
Creating Killer Content for Social Media
Creating Killer Content for Social MediaCreating Killer Content for Social Media
Creating Killer Content for Social Media
 
Towards identifying Collaborative Learning groups using Social Media
Towards identifying Collaborative Learning groups using Social MediaTowards identifying Collaborative Learning groups using Social Media
Towards identifying Collaborative Learning groups using Social Media
 

project-presentation

  • 1. A CLUSTERING ANALYSIS OF TWEET LENGTH AND ITS RELATION TO SENTIMENT
  • 2. Research Project Matthew Mayo School of Computer Science Columbus State University Columbus, GA CPSC 6185 Intelligent Agents Dr. Rania Hodhod
  • 3. Twitter • Popular microblogging web service • 140 character per message (tweet) limit • Started in 2006, over 645 million users today* • 58 million tweets per day* • 9,000 tweets per minute* * Source: www.statisticbrain.com/twitter-statistics
  • 4. Sentiment Analysis • Identifying, extracting & processing subjective information from source material • Subjective information includes attitudes, emotions & opinions • Appropriate for binary classification (positive vs. negative, good vs. bad, etc.) • Useful for movie reviews, political election opinions, etc.
  • 5. Project Aim Interested in exploring the relationship between: • Length of tweet (number of characters) AND • Sentiment score of tweet
  • 6. Problem Description The research project tasks: 1. Capture Twitter data 2. Build custom sentiment dictionary 3. Process tweets 4. Create dataset 5. Cluster tweet data
  • 7. Methodology ● Custom Python scripts to capture and process live tweets over 4 week schedule ● Use k-means clustering in Weka to look for natural sentiment patterns ● Any correlation between length of tweet and its sentiment (positive/negative/neutral)?
  • 8. Results Sentiment scores of shorter tweets appear more tightly-centered around their cluster’s centroid Longer tweets become less-centered on the applicable centroid As the number of characters in a tweet would lead to a greater number of terms, which would increase the chances of terms being assigned a score, this seems intuitive