SlideShare a Scribd company logo
Sentiment Analysis
our approach and use cases
Karol Chlasta
Antoni Sobkowicz
ś
dBConf 2015
Agenda
2015 Elections in
Poland on Twitter.com
& Onet.pl
Introduction
to Sentiment Analysis
and its applications
Questions
& Answers
How to approach
Sentiment
Analysis?
Introduction to Sentiment
Analysis and its
applications
➔ Author’s sentiment
◆ in all text
◆ in particular sentences
◆ towards an entity(s)
◆ towards specific aspects(s)
➔ Author’s emotions
◆ basic vs other emotions
➔ When to automate?
◆ many texts
◆ texts are untagged
◆ gaps in annotation
Sentiment vs Emotion Analysis
Sentiment Analysis or Opinion Mining
➔ Owner of the opinion
➔ Entity vs aspect
➔ Type of opinion(s) expressed
◆ like / dislike
◆ optimism / pessimism
◆ evaluation, confidence, familiarity
◆ polarization
➔ Value of opinion
➔ Event of sentiment expression
He recorded white Russian trucks crossing the border and
progressing fast towards Białystok.
The candidate looked confident, well prepared and was even very
well received by the audience, but lost the debate after answering
that question.
Sample Applications
Politics - forecasting sentiment towards a
candidate, political party or reform, social issue.
Review - to draw a conclusion if
this review is positive or
negative
Market monitoring
- Clients’ comments
about our company
or the competition.
Product - what
people think
about the
new...
Sample
Applications
Customer Service
How to approach
Sentiment
Analysis?
Approaches
➔ Dictionary methods
◆ Manual or (self)automatic
➔ Statistical methods
◆ Training sets
◆ Various descriptive features
● Words
● Detection of coexistence of words
● Punctuation
● Syntax
● Emoticons
● Language specific analysis
○ http://nlp.stanford.edu/sentiment/
Data Gathering
➔ Social Networks
◆ Many social networks provide APIs
➔ Thematic websites
◆ Need to create customized scrappers
Important thing - before data gathering
we need to correctly select media we are
covering!
Manually Tagging Sentiment
➔ Manually tagged sentiment data can be used to
create sentiment dictionaries and reference
(training) sentiment data for experiments
➔ Data should be tagged by several taggers,
because, especially in politics, Positive and
Negative are very subjective
➔ Data should also be tagged in a way that
combats effects associated with Negative Bias
➔ Negative Bias is an effect which results in
negative feelings and events are treated as more
important by human brain.
“ta d... wykrzykiwała codzień pod
krzyżem”
tagged as neutral
“nie dostal sie, na szczescie, do tego
palacu, wiec wystaje poden co chwila
:)) niedoczekanie twoje, kaczynski!”
tagged as negative
2015 Elections
in Poland
on Twitter.com & Onet.pl
Twitter.com and Onet.pl (+ Wp.pl)
Twitter.com
Over 11,000 tweets related to 10 profiles of
candidates on May 10th
Onet.pl and Wp.pl
Over 2000 articles with one of the tags related to
election (Andrzej Duda, Bronisław Komorowski, Paweł
Kukiz, Ewa Kopacz, Beata Szydło) between 20.05 and
20.08 (second round of election to presidential
swearing-in).
Over 1.5m comments written by the users.
Additional 12,000+ comments from years 2009-2011 for
evaluation purposes
Twitter.com
Implementation
Technology
➔ R 3.2.0 running on i686-pc-linux-gnu, RStudio
0.98.1103
➔ twitteR, dplyr, stringi, ggplot2, tm, e1071, RTextTools
Data
➔ 11,744 tweets related to 10 profiles of candidates
from May 10th, simple sentiment scoring algorithm
➔ Lexicon of 2000 positive & 3693 negative Polish
words. 18 positive and 22 negative emoticons
➔ 6040 tweets, excluding all neutral used to evaluate
Naive Bayes, Maximum Entropy, Support Vector
Machines, and Tree sentiment classifiers using
70/30 principle
Results
➔ Two approaches: Naive Bayes and Maximum Entropy
achieved the best accuracy (71.76% and 77.32%
respectively)
Onet.pl
& Wp.pl
Step I
Technology
➔ .NET
➔ Python 2.7 + Scrapy
➔ Java 8
Data
➔ 1,533,035 comments in 2057 articles total
◆ 923 manually sentiment annotated comments
➔ 5850 comments from 2011, site Gazeta.pl used for
lexicon generation (TRAIN-POL)
➔ 31095 tweets used for lexicon generation (TRAIN-
TWIT
➔ Evaluated methods: Naive Bayes (NB) and three
dictionary based methods
Results
➔ Two approaches: Naive Bayes and Simple Dictionary
Addition achieved the best accuracy (76% and 78%
respectively)
Step II
Technology
➔ .NET
➔ Python 2.7 + Scrapy
➔ Java 8
Data
➔ 1,533,035 comments in 2057 articles total
◆ 923 manually sentiment annotated comments
➔ 6448 comments from 2011, site Gazeta.pl
➔ 4592 comments from 2011, site Wyborcza.pl
➔ 7177 comments from 2010, site Gazeta.pl
➔ Lexicons generated from comments
➔ Datasets cross-tested using dictionaries created
with different datasets
Results
➔ Old datasets can be used to annotate new texts
despite 4 year difference
Step III
Technology
➔ Java 8
Data
➔ 1,533,035 comments in 2057 articles total
➔ Lexicon generated using manually sentiment tagged
data (from Gazeta.pl 2010, 2011 and Wyborcza.pl
2011)
➔ Algorithm threshold tweaked to have both good
binary and 3-category classification
Results
➔ Most comments are negative!
Summary
Questions?
Karol Chlasta (@KarolChlasta)
Antoni Sobkowicz
http://opi-lil.github.io/

More Related Content

What's hot

Sentiment analysis of Twitter data using python
Sentiment analysis of Twitter data using pythonSentiment analysis of Twitter data using python
Sentiment analysis of Twitter data using pythonHetu Bhavsar
 
Sentiment analysis
Sentiment analysisSentiment analysis
Sentiment analysisAmenda Joy
 
Practical sentiment analysis
Practical sentiment analysisPractical sentiment analysis
Practical sentiment analysisDiana Maynard
 
Sentiment Analysis Using Twitter
Sentiment Analysis Using TwitterSentiment Analysis Using Twitter
Sentiment Analysis Using Twitterpiya chauhan
 
Sentiment Analaysis on Twitter
Sentiment Analaysis on TwitterSentiment Analaysis on Twitter
Sentiment Analaysis on TwitterNitish J Prabhu
 
Twitter sentiment analysis ppt
Twitter sentiment analysis pptTwitter sentiment analysis ppt
Twitter sentiment analysis pptSonuCreation
 
Text classification & sentiment analysis
Text classification & sentiment analysisText classification & sentiment analysis
Text classification & sentiment analysisM. Atif Qureshi
 
Twitter sentiment-analysis Jiit2013-14
Twitter sentiment-analysis Jiit2013-14Twitter sentiment-analysis Jiit2013-14
Twitter sentiment-analysis Jiit2013-14Rachit Goel
 
Twitter Sentiment Analysis
Twitter Sentiment AnalysisTwitter Sentiment Analysis
Twitter Sentiment AnalysisAyush Khandelwal
 
Amazon sentimental analysis
Amazon sentimental analysisAmazon sentimental analysis
Amazon sentimental analysisAkhila
 
social network analysis project twitter sentimental analysis
social network analysis project twitter sentimental analysissocial network analysis project twitter sentimental analysis
social network analysis project twitter sentimental analysisAshish Mundra
 
Sentiment analysis using ml
Sentiment analysis using mlSentiment analysis using ml
Sentiment analysis using mlPravin Katiyar
 
Sentiment Analysis in Twitter
Sentiment Analysis in TwitterSentiment Analysis in Twitter
Sentiment Analysis in TwitterAyushi Dalmia
 
Sentiment Analysis
Sentiment AnalysisSentiment Analysis
Sentiment AnalysisAnkur Tyagi
 
Sentiment analysis of Twitter Data
Sentiment analysis of Twitter DataSentiment analysis of Twitter Data
Sentiment analysis of Twitter DataNurendra Choudhary
 
Twitter sentiment analysis project report
Twitter sentiment analysis project reportTwitter sentiment analysis project report
Twitter sentiment analysis project reportBharat Khanna
 
Sentiment Analysis
Sentiment AnalysisSentiment Analysis
Sentiment AnalysisDinesh V
 
Introduction to Sentiment Analysis
Introduction to Sentiment AnalysisIntroduction to Sentiment Analysis
Introduction to Sentiment AnalysisJaganadh Gopinadhan
 

What's hot (20)

Sentiment analysis of Twitter data using python
Sentiment analysis of Twitter data using pythonSentiment analysis of Twitter data using python
Sentiment analysis of Twitter data using python
 
Sentiment analysis
Sentiment analysisSentiment analysis
Sentiment analysis
 
Sentiment Analysis
Sentiment AnalysisSentiment Analysis
Sentiment Analysis
 
Sentiment Analysis
Sentiment AnalysisSentiment Analysis
Sentiment Analysis
 
Practical sentiment analysis
Practical sentiment analysisPractical sentiment analysis
Practical sentiment analysis
 
Sentiment Analysis Using Twitter
Sentiment Analysis Using TwitterSentiment Analysis Using Twitter
Sentiment Analysis Using Twitter
 
Sentiment Analaysis on Twitter
Sentiment Analaysis on TwitterSentiment Analaysis on Twitter
Sentiment Analaysis on Twitter
 
Twitter sentiment analysis ppt
Twitter sentiment analysis pptTwitter sentiment analysis ppt
Twitter sentiment analysis ppt
 
Text classification & sentiment analysis
Text classification & sentiment analysisText classification & sentiment analysis
Text classification & sentiment analysis
 
Twitter sentiment-analysis Jiit2013-14
Twitter sentiment-analysis Jiit2013-14Twitter sentiment-analysis Jiit2013-14
Twitter sentiment-analysis Jiit2013-14
 
Twitter Sentiment Analysis
Twitter Sentiment AnalysisTwitter Sentiment Analysis
Twitter Sentiment Analysis
 
Amazon sentimental analysis
Amazon sentimental analysisAmazon sentimental analysis
Amazon sentimental analysis
 
social network analysis project twitter sentimental analysis
social network analysis project twitter sentimental analysissocial network analysis project twitter sentimental analysis
social network analysis project twitter sentimental analysis
 
Sentiment analysis using ml
Sentiment analysis using mlSentiment analysis using ml
Sentiment analysis using ml
 
Sentiment Analysis in Twitter
Sentiment Analysis in TwitterSentiment Analysis in Twitter
Sentiment Analysis in Twitter
 
Sentiment Analysis
Sentiment AnalysisSentiment Analysis
Sentiment Analysis
 
Sentiment analysis of Twitter Data
Sentiment analysis of Twitter DataSentiment analysis of Twitter Data
Sentiment analysis of Twitter Data
 
Twitter sentiment analysis project report
Twitter sentiment analysis project reportTwitter sentiment analysis project report
Twitter sentiment analysis project report
 
Sentiment Analysis
Sentiment AnalysisSentiment Analysis
Sentiment Analysis
 
Introduction to Sentiment Analysis
Introduction to Sentiment AnalysisIntroduction to Sentiment Analysis
Introduction to Sentiment Analysis
 

Viewers also liked

Data Analytics on Twitter Feeds
Data Analytics on Twitter FeedsData Analytics on Twitter Feeds
Data Analytics on Twitter FeedsEu Jin Lok
 
Discovery & Consumption of Analytics Data @Twitter
Discovery & Consumption of Analytics Data @TwitterDiscovery & Consumption of Analytics Data @Twitter
Discovery & Consumption of Analytics Data @TwitterKamran Munshi
 
Bigdata analytics-twitter
Bigdata analytics-twitterBigdata analytics-twitter
Bigdata analytics-twitterdfilppi
 
Twitter Data Analytics
Twitter Data AnalyticsTwitter Data Analytics
Twitter Data Analyticsrupika08
 
Analyzing Big Data at Twitter (Web 2.0 Expo NYC Sep 2010)
Analyzing Big Data at Twitter (Web 2.0 Expo NYC Sep 2010)Analyzing Big Data at Twitter (Web 2.0 Expo NYC Sep 2010)
Analyzing Big Data at Twitter (Web 2.0 Expo NYC Sep 2010)Kevin Weil
 
Real Time Analytics for Big Data a Twitter Case Study
Real Time Analytics for Big Data a Twitter Case StudyReal Time Analytics for Big Data a Twitter Case Study
Real Time Analytics for Big Data a Twitter Case StudyNati Shalom
 
Real Time Analytics for Big Data - A twitter inspired case study
Real Time Analytics for Big Data - A twitter inspired case studyReal Time Analytics for Big Data - A twitter inspired case study
Real Time Analytics for Big Data - A twitter inspired case studyUri Cohen
 
Sentiment Analysis Using Machine Learning
Sentiment Analysis Using Machine LearningSentiment Analysis Using Machine Learning
Sentiment Analysis Using Machine LearningNihar Suryawanshi
 
Sentiment analysis using naive bayes classifier
Sentiment analysis using naive bayes classifier Sentiment analysis using naive bayes classifier
Sentiment analysis using naive bayes classifier Dev Sahu
 
Pig Tutorial | Twitter Case Study | Apache Pig Script and Commands | Edureka
Pig Tutorial | Twitter Case Study | Apache Pig Script and Commands | EdurekaPig Tutorial | Twitter Case Study | Apache Pig Script and Commands | Edureka
Pig Tutorial | Twitter Case Study | Apache Pig Script and Commands | EdurekaEdureka!
 

Viewers also liked (11)

Data Analytics on Twitter Feeds
Data Analytics on Twitter FeedsData Analytics on Twitter Feeds
Data Analytics on Twitter Feeds
 
Discovery & Consumption of Analytics Data @Twitter
Discovery & Consumption of Analytics Data @TwitterDiscovery & Consumption of Analytics Data @Twitter
Discovery & Consumption of Analytics Data @Twitter
 
Bigdata analytics-twitter
Bigdata analytics-twitterBigdata analytics-twitter
Bigdata analytics-twitter
 
Twitter Data Analytics
Twitter Data AnalyticsTwitter Data Analytics
Twitter Data Analytics
 
Twitter Big Data
Twitter Big DataTwitter Big Data
Twitter Big Data
 
Analyzing Big Data at Twitter (Web 2.0 Expo NYC Sep 2010)
Analyzing Big Data at Twitter (Web 2.0 Expo NYC Sep 2010)Analyzing Big Data at Twitter (Web 2.0 Expo NYC Sep 2010)
Analyzing Big Data at Twitter (Web 2.0 Expo NYC Sep 2010)
 
Real Time Analytics for Big Data a Twitter Case Study
Real Time Analytics for Big Data a Twitter Case StudyReal Time Analytics for Big Data a Twitter Case Study
Real Time Analytics for Big Data a Twitter Case Study
 
Real Time Analytics for Big Data - A twitter inspired case study
Real Time Analytics for Big Data - A twitter inspired case studyReal Time Analytics for Big Data - A twitter inspired case study
Real Time Analytics for Big Data - A twitter inspired case study
 
Sentiment Analysis Using Machine Learning
Sentiment Analysis Using Machine LearningSentiment Analysis Using Machine Learning
Sentiment Analysis Using Machine Learning
 
Sentiment analysis using naive bayes classifier
Sentiment analysis using naive bayes classifier Sentiment analysis using naive bayes classifier
Sentiment analysis using naive bayes classifier
 
Pig Tutorial | Twitter Case Study | Apache Pig Script and Commands | Edureka
Pig Tutorial | Twitter Case Study | Apache Pig Script and Commands | EdurekaPig Tutorial | Twitter Case Study | Apache Pig Script and Commands | Edureka
Pig Tutorial | Twitter Case Study | Apache Pig Script and Commands | Edureka
 

Similar to Sentiment analysis - Our approach and use cases

Sentimental Analysis - Naive Bayes Algorithm
Sentimental Analysis - Naive Bayes AlgorithmSentimental Analysis - Naive Bayes Algorithm
Sentimental Analysis - Naive Bayes AlgorithmKhushboo Gupta
 
Τweetfix: Data Analytics on Match Fixing
Τweetfix: Data Analytics on Match FixingΤweetfix: Data Analytics on Match Fixing
Τweetfix: Data Analytics on Match FixingAntigoni-Maria Founta
 
Explore what people think about PA. A case study. Cristina Menghini, Collabor...
Explore what people think about PA. A case study. Cristina Menghini, Collabor...Explore what people think about PA. A case study. Cristina Menghini, Collabor...
Explore what people think about PA. A case study. Cristina Menghini, Collabor...Data Driven Innovation
 
Twitter Sentiment Analysis.pdf
Twitter Sentiment Analysis.pdfTwitter Sentiment Analysis.pdf
Twitter Sentiment Analysis.pdfRachanasamal3
 
Interestingness of articles using twitter sentiments
Interestingness of articles using twitter sentimentsInterestingness of articles using twitter sentiments
Interestingness of articles using twitter sentimentsKritiKansalK
 
Planning to Evaluate Earned, Social/Digital Media Campaigns
Planning to Evaluate Earned, Social/Digital Media CampaignsPlanning to Evaluate Earned, Social/Digital Media Campaigns
Planning to Evaluate Earned, Social/Digital Media CampaignsEman Aly
 
Interestingness of articles using twitter sentiments
Interestingness of articles using twitter sentimentsInterestingness of articles using twitter sentiments
Interestingness of articles using twitter sentimentsArpit Bhayani
 
Mining public opinion about economic issues
Mining public opinion about economic issuesMining public opinion about economic issues
Mining public opinion about economic issuesIvan Abboud
 
Sentiment Analysis of Social Media Content: A multi-tool for listening to you...
Sentiment Analysis of Social Media Content: A multi-tool for listening to you...Sentiment Analysis of Social Media Content: A multi-tool for listening to you...
Sentiment Analysis of Social Media Content: A multi-tool for listening to you...Eirini Ntoutsi
 
Sentiment of Sentence in Tweets: A Review
Sentiment of Sentence in Tweets: A ReviewSentiment of Sentence in Tweets: A Review
Sentiment of Sentence in Tweets: A Reviewiosrjce
 
Oe peer learning group 1 - session 4 - april 18
Oe peer learning   group 1 - session 4 - april 18 Oe peer learning   group 1 - session 4 - april 18
Oe peer learning group 1 - session 4 - april 18 Beth Kanter
 
LT_presentation
LT_presentationLT_presentation
LT_presentationaphex34
 
MAjor_Project_2_ppt.pptx
MAjor_Project_2_ppt.pptxMAjor_Project_2_ppt.pptx
MAjor_Project_2_ppt.pptxBrighuRaina2
 
Sentiment Analysis and Social Media: How and Why
Sentiment Analysis and Social Media: How and WhySentiment Analysis and Social Media: How and Why
Sentiment Analysis and Social Media: How and WhyDavide Feltoni Gurini
 
Frontiers of Computational Journalism week 3 - Information Filter Design
Frontiers of Computational Journalism week 3 - Information Filter DesignFrontiers of Computational Journalism week 3 - Information Filter Design
Frontiers of Computational Journalism week 3 - Information Filter DesignJonathan Stray
 
ML&AI APPROACH TO USER UNDERSTANDING ECOSYSTEM AT VCCORP Applications to News...
ML&AI APPROACH TO USER UNDERSTANDING ECOSYSTEM AT VCCORP Applications to News...ML&AI APPROACH TO USER UNDERSTANDING ECOSYSTEM AT VCCORP Applications to News...
ML&AI APPROACH TO USER UNDERSTANDING ECOSYSTEM AT VCCORP Applications to News...Tuan Hoang
 
IRJET- Real Time Sentiment Analysis of Political Twitter Data using Machi...
IRJET-  	  Real Time Sentiment Analysis of Political Twitter Data using Machi...IRJET-  	  Real Time Sentiment Analysis of Political Twitter Data using Machi...
IRJET- Real Time Sentiment Analysis of Political Twitter Data using Machi...IRJET Journal
 
Develop Your Tech Skills to Improve Elections in Your Jurisdiction
Develop Your Tech Skills to Improve Elections in Your JurisdictionDevelop Your Tech Skills to Improve Elections in Your Jurisdiction
Develop Your Tech Skills to Improve Elections in Your JurisdictionWhitney May
 

Similar to Sentiment analysis - Our approach and use cases (20)

Sentimental Analysis - Naive Bayes Algorithm
Sentimental Analysis - Naive Bayes AlgorithmSentimental Analysis - Naive Bayes Algorithm
Sentimental Analysis - Naive Bayes Algorithm
 
Τweetfix: Data Analytics on Match Fixing
Τweetfix: Data Analytics on Match FixingΤweetfix: Data Analytics on Match Fixing
Τweetfix: Data Analytics on Match Fixing
 
Explore what people think about PA. A case study. Cristina Menghini, Collabor...
Explore what people think about PA. A case study. Cristina Menghini, Collabor...Explore what people think about PA. A case study. Cristina Menghini, Collabor...
Explore what people think about PA. A case study. Cristina Menghini, Collabor...
 
Twitter Sentiment Analysis.pdf
Twitter Sentiment Analysis.pdfTwitter Sentiment Analysis.pdf
Twitter Sentiment Analysis.pdf
 
Interestingness of articles using twitter sentiments
Interestingness of articles using twitter sentimentsInterestingness of articles using twitter sentiments
Interestingness of articles using twitter sentiments
 
Planning to Evaluate Earned, Social/Digital Media Campaigns
Planning to Evaluate Earned, Social/Digital Media CampaignsPlanning to Evaluate Earned, Social/Digital Media Campaigns
Planning to Evaluate Earned, Social/Digital Media Campaigns
 
Interestingness of articles using twitter sentiments
Interestingness of articles using twitter sentimentsInterestingness of articles using twitter sentiments
Interestingness of articles using twitter sentiments
 
Mining public opinion about economic issues
Mining public opinion about economic issuesMining public opinion about economic issues
Mining public opinion about economic issues
 
Sentiment Analysis of Social Media Content: A multi-tool for listening to you...
Sentiment Analysis of Social Media Content: A multi-tool for listening to you...Sentiment Analysis of Social Media Content: A multi-tool for listening to you...
Sentiment Analysis of Social Media Content: A multi-tool for listening to you...
 
W01761157162
W01761157162W01761157162
W01761157162
 
Sentiment of Sentence in Tweets: A Review
Sentiment of Sentence in Tweets: A ReviewSentiment of Sentence in Tweets: A Review
Sentiment of Sentence in Tweets: A Review
 
Oe peer learning group 1 - session 4 - april 18
Oe peer learning   group 1 - session 4 - april 18 Oe peer learning   group 1 - session 4 - april 18
Oe peer learning group 1 - session 4 - april 18
 
LT_presentation
LT_presentationLT_presentation
LT_presentation
 
MAjor_Project_2_ppt.pptx
MAjor_Project_2_ppt.pptxMAjor_Project_2_ppt.pptx
MAjor_Project_2_ppt.pptx
 
Sentiment Analysis and Social Media: How and Why
Sentiment Analysis and Social Media: How and WhySentiment Analysis and Social Media: How and Why
Sentiment Analysis and Social Media: How and Why
 
Frontiers of Computational Journalism week 3 - Information Filter Design
Frontiers of Computational Journalism week 3 - Information Filter DesignFrontiers of Computational Journalism week 3 - Information Filter Design
Frontiers of Computational Journalism week 3 - Information Filter Design
 
BDACA - Lecture4
BDACA - Lecture4BDACA - Lecture4
BDACA - Lecture4
 
ML&AI APPROACH TO USER UNDERSTANDING ECOSYSTEM AT VCCORP Applications to News...
ML&AI APPROACH TO USER UNDERSTANDING ECOSYSTEM AT VCCORP Applications to News...ML&AI APPROACH TO USER UNDERSTANDING ECOSYSTEM AT VCCORP Applications to News...
ML&AI APPROACH TO USER UNDERSTANDING ECOSYSTEM AT VCCORP Applications to News...
 
IRJET- Real Time Sentiment Analysis of Political Twitter Data using Machi...
IRJET-  	  Real Time Sentiment Analysis of Political Twitter Data using Machi...IRJET-  	  Real Time Sentiment Analysis of Political Twitter Data using Machi...
IRJET- Real Time Sentiment Analysis of Political Twitter Data using Machi...
 
Develop Your Tech Skills to Improve Elections in Your Jurisdiction
Develop Your Tech Skills to Improve Elections in Your JurisdictionDevelop Your Tech Skills to Improve Elections in Your Jurisdiction
Develop Your Tech Skills to Improve Elections in Your Jurisdiction
 

Recently uploaded

Webinar One View, Multiple Systems No-Code Integration of Salesforce and ERPs
Webinar One View, Multiple Systems No-Code Integration of Salesforce and ERPsWebinar One View, Multiple Systems No-Code Integration of Salesforce and ERPs
Webinar One View, Multiple Systems No-Code Integration of Salesforce and ERPsCEPTES Software Inc
 
一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单enxupq
 
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单vcaxypu
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP
 
standardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghhstandardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghhArpitMalhotra16
 
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单ukgaet
 
2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...
2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...
2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...elinavihriala
 
Professional Data Engineer Certification Exam Guide  _  Learn  _  Google Clou...
Professional Data Engineer Certification Exam Guide  _  Learn  _  Google Clou...Professional Data Engineer Certification Exam Guide  _  Learn  _  Google Clou...
Professional Data Engineer Certification Exam Guide  _  Learn  _  Google Clou...Domenico Conte
 
tapal brand analysis PPT slide for comptetive data
tapal brand analysis PPT slide for comptetive datatapal brand analysis PPT slide for comptetive data
tapal brand analysis PPT slide for comptetive datatheahmadsaood
 
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单vcaxypu
 
Opendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay
 
Q1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year ReboundQ1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year ReboundOppotus
 
社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .NABLAS株式会社
 
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...correoyaya
 
Investigate & Recover / StarCompliance.io / Crypto_Crimes
Investigate & Recover / StarCompliance.io / Crypto_CrimesInvestigate & Recover / StarCompliance.io / Crypto_Crimes
Investigate & Recover / StarCompliance.io / Crypto_CrimesStarCompliance.io
 
一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单enxupq
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单ewymefz
 
Tabula.io Cheatsheet: automate your data workflows
Tabula.io Cheatsheet: automate your data workflowsTabula.io Cheatsheet: automate your data workflows
Tabula.io Cheatsheet: automate your data workflowsalex933524
 
一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单ocavb
 

Recently uploaded (20)

Webinar One View, Multiple Systems No-Code Integration of Salesforce and ERPs
Webinar One View, Multiple Systems No-Code Integration of Salesforce and ERPsWebinar One View, Multiple Systems No-Code Integration of Salesforce and ERPs
Webinar One View, Multiple Systems No-Code Integration of Salesforce and ERPs
 
一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单
 
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
 
Slip-and-fall Injuries: Top Workers' Comp Claims
Slip-and-fall Injuries: Top Workers' Comp ClaimsSlip-and-fall Injuries: Top Workers' Comp Claims
Slip-and-fall Injuries: Top Workers' Comp Claims
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
 
standardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghhstandardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghh
 
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
 
2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...
2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...
2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...
 
Professional Data Engineer Certification Exam Guide  _  Learn  _  Google Clou...
Professional Data Engineer Certification Exam Guide  _  Learn  _  Google Clou...Professional Data Engineer Certification Exam Guide  _  Learn  _  Google Clou...
Professional Data Engineer Certification Exam Guide  _  Learn  _  Google Clou...
 
tapal brand analysis PPT slide for comptetive data
tapal brand analysis PPT slide for comptetive datatapal brand analysis PPT slide for comptetive data
tapal brand analysis PPT slide for comptetive data
 
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
 
Opendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptx
 
Q1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year ReboundQ1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year Rebound
 
社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .
 
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
 
Investigate & Recover / StarCompliance.io / Crypto_Crimes
Investigate & Recover / StarCompliance.io / Crypto_CrimesInvestigate & Recover / StarCompliance.io / Crypto_Crimes
Investigate & Recover / StarCompliance.io / Crypto_Crimes
 
一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
 
Tabula.io Cheatsheet: automate your data workflows
Tabula.io Cheatsheet: automate your data workflowsTabula.io Cheatsheet: automate your data workflows
Tabula.io Cheatsheet: automate your data workflows
 
一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单
 

Sentiment analysis - Our approach and use cases

  • 1. Sentiment Analysis our approach and use cases Karol Chlasta Antoni Sobkowicz ś dBConf 2015
  • 2. Agenda 2015 Elections in Poland on Twitter.com & Onet.pl Introduction to Sentiment Analysis and its applications Questions & Answers How to approach Sentiment Analysis?
  • 3. Introduction to Sentiment Analysis and its applications
  • 4. ➔ Author’s sentiment ◆ in all text ◆ in particular sentences ◆ towards an entity(s) ◆ towards specific aspects(s) ➔ Author’s emotions ◆ basic vs other emotions ➔ When to automate? ◆ many texts ◆ texts are untagged ◆ gaps in annotation Sentiment vs Emotion Analysis
  • 5. Sentiment Analysis or Opinion Mining ➔ Owner of the opinion ➔ Entity vs aspect ➔ Type of opinion(s) expressed ◆ like / dislike ◆ optimism / pessimism ◆ evaluation, confidence, familiarity ◆ polarization ➔ Value of opinion ➔ Event of sentiment expression He recorded white Russian trucks crossing the border and progressing fast towards Białystok. The candidate looked confident, well prepared and was even very well received by the audience, but lost the debate after answering that question.
  • 6. Sample Applications Politics - forecasting sentiment towards a candidate, political party or reform, social issue. Review - to draw a conclusion if this review is positive or negative Market monitoring - Clients’ comments about our company or the competition. Product - what people think about the new...
  • 9. Approaches ➔ Dictionary methods ◆ Manual or (self)automatic ➔ Statistical methods ◆ Training sets ◆ Various descriptive features ● Words ● Detection of coexistence of words ● Punctuation ● Syntax ● Emoticons ● Language specific analysis ○ http://nlp.stanford.edu/sentiment/
  • 10. Data Gathering ➔ Social Networks ◆ Many social networks provide APIs ➔ Thematic websites ◆ Need to create customized scrappers Important thing - before data gathering we need to correctly select media we are covering!
  • 11. Manually Tagging Sentiment ➔ Manually tagged sentiment data can be used to create sentiment dictionaries and reference (training) sentiment data for experiments ➔ Data should be tagged by several taggers, because, especially in politics, Positive and Negative are very subjective ➔ Data should also be tagged in a way that combats effects associated with Negative Bias ➔ Negative Bias is an effect which results in negative feelings and events are treated as more important by human brain. “ta d... wykrzykiwała codzień pod krzyżem” tagged as neutral “nie dostal sie, na szczescie, do tego palacu, wiec wystaje poden co chwila :)) niedoczekanie twoje, kaczynski!” tagged as negative
  • 12. 2015 Elections in Poland on Twitter.com & Onet.pl
  • 13. Twitter.com and Onet.pl (+ Wp.pl) Twitter.com Over 11,000 tweets related to 10 profiles of candidates on May 10th Onet.pl and Wp.pl Over 2000 articles with one of the tags related to election (Andrzej Duda, Bronisław Komorowski, Paweł Kukiz, Ewa Kopacz, Beata Szydło) between 20.05 and 20.08 (second round of election to presidential swearing-in). Over 1.5m comments written by the users. Additional 12,000+ comments from years 2009-2011 for evaluation purposes
  • 15. Implementation Technology ➔ R 3.2.0 running on i686-pc-linux-gnu, RStudio 0.98.1103 ➔ twitteR, dplyr, stringi, ggplot2, tm, e1071, RTextTools Data ➔ 11,744 tweets related to 10 profiles of candidates from May 10th, simple sentiment scoring algorithm ➔ Lexicon of 2000 positive & 3693 negative Polish words. 18 positive and 22 negative emoticons ➔ 6040 tweets, excluding all neutral used to evaluate Naive Bayes, Maximum Entropy, Support Vector Machines, and Tree sentiment classifiers using 70/30 principle Results ➔ Two approaches: Naive Bayes and Maximum Entropy achieved the best accuracy (71.76% and 77.32% respectively)
  • 17. Step I Technology ➔ .NET ➔ Python 2.7 + Scrapy ➔ Java 8 Data ➔ 1,533,035 comments in 2057 articles total ◆ 923 manually sentiment annotated comments ➔ 5850 comments from 2011, site Gazeta.pl used for lexicon generation (TRAIN-POL) ➔ 31095 tweets used for lexicon generation (TRAIN- TWIT ➔ Evaluated methods: Naive Bayes (NB) and three dictionary based methods Results ➔ Two approaches: Naive Bayes and Simple Dictionary Addition achieved the best accuracy (76% and 78% respectively)
  • 18.
  • 19. Step II Technology ➔ .NET ➔ Python 2.7 + Scrapy ➔ Java 8 Data ➔ 1,533,035 comments in 2057 articles total ◆ 923 manually sentiment annotated comments ➔ 6448 comments from 2011, site Gazeta.pl ➔ 4592 comments from 2011, site Wyborcza.pl ➔ 7177 comments from 2010, site Gazeta.pl ➔ Lexicons generated from comments ➔ Datasets cross-tested using dictionaries created with different datasets Results ➔ Old datasets can be used to annotate new texts despite 4 year difference
  • 20. Step III Technology ➔ Java 8 Data ➔ 1,533,035 comments in 2057 articles total ➔ Lexicon generated using manually sentiment tagged data (from Gazeta.pl 2010, 2011 and Wyborcza.pl 2011) ➔ Algorithm threshold tweaked to have both good binary and 3-category classification Results ➔ Most comments are negative!
  • 22. Questions? Karol Chlasta (@KarolChlasta) Antoni Sobkowicz http://opi-lil.github.io/