SlideShare a Scribd company logo
On sentiment analysis tools for
software engineering research
Robbert Jongeling Subhajit Datta Alexander Serebrenik
Eindhoven U of
Technology (NL)
Singapore U of Technology
and Design (SG)
Eindhoven U of
Technology (NL)
@jongeling_r @datta_subhajit @aserebrenik
E. Guzman, D. Azócar, and Y. Li,
“Sentiment analysis of commit
comments in GitHub: An empirical
study,” MSR 2014
A.-I. Rousinopoulos, G. Robles, and
J. M. González-Barahona, “Sentiment
analysis of Free/Open Source
developers: preliminary findings from
a case study,” Revista Eletrônica de
Sistemas de Informação, 2014
E. Guzman and B. Bruegge, “Towards
emotional awareness in software
development teams,” in Joint Meeting on
Foundations of Software Engineering, 2013
D. Pletea, B. Vasilescu, and A. Serebrenik,
“Security and emotion: Sentiment analysis
of security discussions on GitHub”, MSR
2014
M. Ortu, B. Adams, G. Destefanis, P. Tourani,
M. Marchesi, and R. Tonelli, “Are bullies
more productive? empirical study of
affectiveness vs. issue fixing time,” in MSR
2015
D. Garcia, M. S. Zanetti, and F. Schweitzer,
“The role of emotions in contributors
activity: A case study on the Gentoo
community,” in International Conference on
Cloud and Green Computing, 2013
E. Guzman, D. Azócar, and Y. Li,
“Sentiment analysis of commit
comments in GitHub: An empirical
study,” MSR 2014
A.-I. Rousinopoulos, G. Robles, and
J. M. González-Barahona, “Sentiment
analysis of Free/Open Source
developers: preliminary findings from
a case study,” Revista Eletrônica de
Sistemas de Informação, 2014
E. Guzman and B. Bruegge, “Towards
emotional awareness in software
development teams,” in Joint Meeting on
Foundations of Software Engineering, 2013
D. Pletea, B. Vasilescu, and A. Serebrenik,
“Security and emotion: Sentiment analysis
of security discussions on GitHub”, MSR
2014
M. Ortu, B. Adams, G. Destefanis, P. Tourani,
M. Marchesi, and R. Tonelli, “Are bullies
more productive? empirical study of
affectiveness vs. issue fixing time,” in MSR
2015
D. Garcia, M. S. Zanetti, and F. Schweitzer,
“The role of emotions in contributors
activity: A case study on the Gentoo
community,” in International Conference on
Cloud and Green Computing, 2013
NLTK SentiStrength
E. Guzman, D. Azócar, and Y. Li,
“Sentiment analysis of commit
comments in GitHub: An empirical
study,” MSR 2014
A.-I. Rousinopoulos, G. Robles, and
J. M. González-Barahona, “Sentiment
analysis of Free/Open Source
developers: preliminary findings from
a case study,” Revista Eletrônica de
Sistemas de Informação, 2014
E. Guzman and B. Bruegge, “Towards
emotional awareness in software
development teams,” in Joint Meeting on
Foundations of Software Engineering, 2013
D. Pletea, B. Vasilescu, and A. Serebrenik,
“Security and emotion: Sentiment analysis
of security discussions on GitHub”, MSR
2014
M. Ortu, B. Adams, G. Destefanis, P. Tourani,
M. Marchesi, and R. Tonelli, “Are bullies
more productive? empirical study of
affectiveness vs. issue fixing time,” in MSR
2015
D. Garcia, M. S. Zanetti, and F. Schweitzer,
“The role of emotions in contributors
activity: A case study on the Gentoo
community,” in International Conference on
Cloud and Green Computing, 2013
NLTK SentiStrength
Trained on movie/product reviews.
Threat: might misidentify (or fail to identify) a
sentiment in a software engineering artefact
• RQ1: To what extent do different sentiment analysis
tools agree with emotions of software developers?
• RQ2: To what extent do different sentiment analysis
tools agree with each other?
• RQ3: Do different sentiment analysis tools lead to
contradictory results in a software engineering
study?
Murgia et al.
MSR 2014
392 comments x 4 evaluators
joy love surprise anger fearsadness
positive negative
{
{
RQ1
RQ2
Murgia et al.
MSR 2014
392 comments x 4 evaluators
joy love surprise anger fearsadness
positive negative
{
{
Consistent:
positive: 3 positive, none negative
negative: 3 negative, none positive
neutral: ≥3 without emotion indication
Alchemy
Stanford NLP
NLTK
SentiStrength
RQ1
Manual
neg neu pos
Tool
neg
neu
pos
RQ2
Tool A
neg neu pos
Tool
B
neg
neu
pos
RQ1
RQ2
Murgia et al.
MSR 2014
392 comments x 4 evaluators
joy love surprise anger fearsadness
positive negative
{
{
Consistent:
positive: 3 positive, none negative
negative: 3 negative, none positive
neutral: ≥3 without emotion indication
Alchemy
Stanford NLP
NLTK
SentiStrength
RQ1
Manual
neg neu pos
Tool
neg
neu
pos
54
24
217
0 ≤ Adjusted Rand Index ≤ 1
[Santos, Embrechts, ICANN 2009]
RQ2
Tool A
neg neu pos
Tool
B
neg
neu
pos
RQ1
RQ2
Murgia et al.
MSR 2014
392 comments x 4 evaluators
joy love surprise anger fearsadness
positive negative
{
{
Consistent:
positive: 3 positive, none negative
negative: 3 negative, none positive
neutral: ≥3 without emotion indication
Alchemy
Stanford NLP
NLTK
SentiStrength
RQ1
Manual
neg neu pos
Tool
neg
neu
pos
54
24
217
0 ≤ Adjusted Rand Index ≤ 1
[Santos, Embrechts, ICANN 2009]
RQ2
Tool A
neg neu pos
Tool
B
neg
neu
pos
RQ1
RQ2
RQ1: To what extent do different sentiment analysis tools
agree with emotions of software developers?
RQ1
Manual
neg neu pos
NLTK
neg 19 51 11
neu 0 138 7
pos 5 28 36
Tool ARI
NLTK 0.239
SentiStrength 0.113
Stanford NLP 0.108
Alchemy 0.079
Tools do not agree with manual evaluation
RQ1
RQ2
RQ2: To what extent do different sentiment analysis tools
agree with each other?
RQ2
SentiStrength
neg neu pos
NLTK
neg 17 39 25
neu 15 96 34
pos 6 20 43
Tool A Tool B ARI
NLTK Alchemy 0.104
NLTK SentiStrength 0.090
Tools do not agree with each other
RQ1
RQ2
RQ3
issue tracker
over
text
response
time
Sentiment
Analysis Tool
compare times for
neg, neu, pos
issues/questions
q & a site
NLTK
issue tracker
over
text
response
time
Sentiment
Anal. Tool
compare times for
neg, neu, pos
issues/questions
q & a site
NLTK ∩
SentiStrength
issue tracker
over
text
response
time
Sentiment
Anal. Tool
compare times for
neg, neu, pos
issues/questions
q & a site
SentiStrength
RQ3
issue tracker
over
text
response
time
Sentiment
Analysis Tool
compare times for
neg, neu, pos
issues/questions
q & a site
NLTK
Are the results the same?
NLTK SentiStrength NLTK ∩ SentiStrength
ASF
descr
neg > neu*** neg > neu***
pos > neu*** pos > neu*** pos > neu***
pos > neg*** pos > neg***
ASF title
neg > neu**
pos > neu*** pos > neu**
pos > neg* pos > neg**
GNOME
descr
neg > neu*** neg > neu*** neg > neu***
pos > neu*** pos > neu*** pos > neu***
pos > neg***
neg > pos***
SO
descr
ø neg > pos* ø
RQ3 RQ3: Do different sentiment analysis tools lead to
contradictory results in a software engineering study?
Choice of the sentiment analysis tool affects results of the
software engineering study
Tools do not agree with manual evaluation
Tools do not agree with each other
Choice of the sentiment analysis tool affects results of the
software engineering study
Summary
Sentiment analysis tools are trained on movie/
product reviews.
Threat: might misidentify (or fail to identify) a
sentiment in a software engineering artefact
Next steps?
• Train sentiment analysis tools on software
engineering data
• Data of Murgia et al.: first step
• More and better-suited data is needed

More Related Content

Viewers also liked

actionreserchpbl
actionreserchpblactionreserchpbl
actionreserchpbl
puniga
 
Hh kehittamistyo esitys_atte_jarvela
Hh kehittamistyo esitys_atte_jarvelaHh kehittamistyo esitys_atte_jarvela
Hh kehittamistyo esitys_atte_jarvelaAtte Järvelä
 
ไตร่ตรองงานวิจัยของฉัน
ไตร่ตรองงานวิจัยของฉันไตร่ตรองงานวิจัยของฉัน
ไตร่ตรองงานวิจัยของฉัน
School in Phatthalung
 
Invited Talk MESOCA 2014: Evolving software systems: emerging trends and chal...
Invited Talk MESOCA 2014: Evolving software systems: emerging trends and chal...Invited Talk MESOCA 2014: Evolving software systems: emerging trends and chal...
Invited Talk MESOCA 2014: Evolving software systems: emerging trends and chal...
Alexander Serebrenik
 
Mock Advertisement for Amphibious Mousetrap Car
Mock Advertisement for Amphibious Mousetrap CarMock Advertisement for Amphibious Mousetrap Car
Mock Advertisement for Amphibious Mousetrap Car
Melissa Anne Lim
 
Icsm 2011 you can't control the unfamiliar
Icsm 2011 you can't control the unfamiliarIcsm 2011 you can't control the unfamiliar
Icsm 2011 you can't control the unfamiliarAlexander Serebrenik
 
EnTagRec: An Enhanced Tag Recommendation System for Software Information Sites
EnTagRec: An Enhanced Tag Recommendation System for Software Information SitesEnTagRec: An Enhanced Tag Recommendation System for Software Information Sites
EnTagRec: An Enhanced Tag Recommendation System for Software Information Sites
Alexander Serebrenik
 
Roman Vorobyev
Roman VorobyevRoman Vorobyev
Roman Vorobyev
Roman Vorobyev
 
Global Entrepreneurship Final Oral Presentation for "Les macarons de Pauline"
Global Entrepreneurship Final Oral Presentation for "Les macarons de Pauline"Global Entrepreneurship Final Oral Presentation for "Les macarons de Pauline"
Global Entrepreneurship Final Oral Presentation for "Les macarons de Pauline"
paulinehalazoune
 
Saxony Germany
Saxony GermanySaxony Germany
Saxony Germany
Nongprue038 Winkler
 
865 social capital
865 social capital865 social capital
865 social capitalIan Pollock
 
Sneak peak at door prizes and silent auction items for Grand Opening reception!
Sneak peak at door prizes and silent auction items for Grand Opening reception!Sneak peak at door prizes and silent auction items for Grand Opening reception!
Sneak peak at door prizes and silent auction items for Grand Opening reception!
Colt
 
Databases Part 3: Searching
Databases Part 3: SearchingDatabases Part 3: Searching
Databases Part 3: Searching
Missio Seminary Library
 
Riverbend Market Cooperative
Riverbend Market CooperativeRiverbend Market Cooperative
Riverbend Market Cooperative
Colt
 
Sneak peak at door prizes and silent auction items for Grand Opening reception!
Sneak peak at door prizes and silent auction items for Grand Opening reception!Sneak peak at door prizes and silent auction items for Grand Opening reception!
Sneak peak at door prizes and silent auction items for Grand Opening reception!
Colt
 
Flowgen: Flowchart-Based Documentation Framework for C++
Flowgen: Flowchart-Based Documentation Framework for C++Flowgen: Flowchart-Based Documentation Framework for C++
Flowgen: Flowchart-Based Documentation Framework for C++
Alexander Serebrenik
 
PresentacióN Pcpi M
PresentacióN Pcpi MPresentacióN Pcpi M
PresentacióN Pcpi Mguestae4440
 
Benevol 2013: Visualizing the complexity of software module upgrades
Benevol 2013: Visualizing the complexity of software module upgradesBenevol 2013: Visualizing the complexity of software module upgrades
Benevol 2013: Visualizing the complexity of software module upgradesAlexander Serebrenik
 
Pbl_action_reserch
Pbl_action_reserchPbl_action_reserch
Pbl_action_reserch
sojikarns
 
Reunió famílies setembre p3 2012 2013
Reunió famílies setembre p3 2012 2013Reunió famílies setembre p3 2012 2013
Reunió famílies setembre p3 2012 2013edumallol
 

Viewers also liked (20)

actionreserchpbl
actionreserchpblactionreserchpbl
actionreserchpbl
 
Hh kehittamistyo esitys_atte_jarvela
Hh kehittamistyo esitys_atte_jarvelaHh kehittamistyo esitys_atte_jarvela
Hh kehittamistyo esitys_atte_jarvela
 
ไตร่ตรองงานวิจัยของฉัน
ไตร่ตรองงานวิจัยของฉันไตร่ตรองงานวิจัยของฉัน
ไตร่ตรองงานวิจัยของฉัน
 
Invited Talk MESOCA 2014: Evolving software systems: emerging trends and chal...
Invited Talk MESOCA 2014: Evolving software systems: emerging trends and chal...Invited Talk MESOCA 2014: Evolving software systems: emerging trends and chal...
Invited Talk MESOCA 2014: Evolving software systems: emerging trends and chal...
 
Mock Advertisement for Amphibious Mousetrap Car
Mock Advertisement for Amphibious Mousetrap CarMock Advertisement for Amphibious Mousetrap Car
Mock Advertisement for Amphibious Mousetrap Car
 
Icsm 2011 you can't control the unfamiliar
Icsm 2011 you can't control the unfamiliarIcsm 2011 you can't control the unfamiliar
Icsm 2011 you can't control the unfamiliar
 
EnTagRec: An Enhanced Tag Recommendation System for Software Information Sites
EnTagRec: An Enhanced Tag Recommendation System for Software Information SitesEnTagRec: An Enhanced Tag Recommendation System for Software Information Sites
EnTagRec: An Enhanced Tag Recommendation System for Software Information Sites
 
Roman Vorobyev
Roman VorobyevRoman Vorobyev
Roman Vorobyev
 
Global Entrepreneurship Final Oral Presentation for "Les macarons de Pauline"
Global Entrepreneurship Final Oral Presentation for "Les macarons de Pauline"Global Entrepreneurship Final Oral Presentation for "Les macarons de Pauline"
Global Entrepreneurship Final Oral Presentation for "Les macarons de Pauline"
 
Saxony Germany
Saxony GermanySaxony Germany
Saxony Germany
 
865 social capital
865 social capital865 social capital
865 social capital
 
Sneak peak at door prizes and silent auction items for Grand Opening reception!
Sneak peak at door prizes and silent auction items for Grand Opening reception!Sneak peak at door prizes and silent auction items for Grand Opening reception!
Sneak peak at door prizes and silent auction items for Grand Opening reception!
 
Databases Part 3: Searching
Databases Part 3: SearchingDatabases Part 3: Searching
Databases Part 3: Searching
 
Riverbend Market Cooperative
Riverbend Market CooperativeRiverbend Market Cooperative
Riverbend Market Cooperative
 
Sneak peak at door prizes and silent auction items for Grand Opening reception!
Sneak peak at door prizes and silent auction items for Grand Opening reception!Sneak peak at door prizes and silent auction items for Grand Opening reception!
Sneak peak at door prizes and silent auction items for Grand Opening reception!
 
Flowgen: Flowchart-Based Documentation Framework for C++
Flowgen: Flowchart-Based Documentation Framework for C++Flowgen: Flowchart-Based Documentation Framework for C++
Flowgen: Flowchart-Based Documentation Framework for C++
 
PresentacióN Pcpi M
PresentacióN Pcpi MPresentacióN Pcpi M
PresentacióN Pcpi M
 
Benevol 2013: Visualizing the complexity of software module upgrades
Benevol 2013: Visualizing the complexity of software module upgradesBenevol 2013: Visualizing the complexity of software module upgrades
Benevol 2013: Visualizing the complexity of software module upgrades
 
Pbl_action_reserch
Pbl_action_reserchPbl_action_reserch
Pbl_action_reserch
 
Reunió famílies setembre p3 2012 2013
Reunió famílies setembre p3 2012 2013Reunió famílies setembre p3 2012 2013
Reunió famílies setembre p3 2012 2013
 

Similar to Sentiment analysis tools for software engineering research cannot be used out of the box

A Benchmark Study on Sentiment Analysis for Software Engineering Research
A Benchmark Study on Sentiment Analysis for Software Engineering ResearchA Benchmark Study on Sentiment Analysis for Software Engineering Research
A Benchmark Study on Sentiment Analysis for Software Engineering Research
Nicole Novielli
 
Some examples of Open Source computational tools for researchers
Some examples of Open Source computational tools for researchersSome examples of Open Source computational tools for researchers
Some examples of Open Source computational tools for researchers
Ashish Sharma
 
Emotion Analysis in Software Ecosystems
Emotion Analysis in Software EcosystemsEmotion Analysis in Software Ecosystems
Emotion Analysis in Software Ecosystems
Alexander Serebrenik
 
Intro to sentiment analysis
Intro to sentiment analysisIntro to sentiment analysis
Intro to sentiment analysis
Timea Turdean
 
IRJET- BDI using NLP for Efficient Depression Identification
IRJET- BDI using NLP for Efficient Depression IdentificationIRJET- BDI using NLP for Efficient Depression Identification
IRJET- BDI using NLP for Efficient Depression Identification
IRJET Journal
 
A Gold Standard for Emotion Annotation in Stack Overflow
A Gold Standard for Emotion Annotation in Stack Overflow A Gold Standard for Emotion Annotation in Stack Overflow
A Gold Standard for Emotion Annotation in Stack Overflow
Fabio Calefato
 
To Label or Not? Advances and Open Challenges in SE-specific Sentiment Analysis
To Label or Not? Advances and Open Challenges in SE-specific Sentiment AnalysisTo Label or Not? Advances and Open Challenges in SE-specific Sentiment Analysis
To Label or Not? Advances and Open Challenges in SE-specific Sentiment Analysis
Nicole Novielli
 
Aspect-Level Sentiment Analysis On Hotel Reviews
Aspect-Level Sentiment Analysis On Hotel ReviewsAspect-Level Sentiment Analysis On Hotel Reviews
Aspect-Level Sentiment Analysis On Hotel Reviews
Kimberly Pulley
 
R programming for psychometrics
R programming for psychometricsR programming for psychometrics
R programming for psychometrics
Diane Talley
 
The Challenges of Affect Detection in the Social Programmer Ecosystem
The Challenges of Affect Detection in the Social Programmer EcosystemThe Challenges of Affect Detection in the Social Programmer Ecosystem
The Challenges of Affect Detection in the Social Programmer Ecosystem
Nicole Novielli
 
Analyzing Big Data's Weakest Link (hint: it might be you)
Analyzing Big Data's Weakest Link  (hint: it might be you)Analyzing Big Data's Weakest Link  (hint: it might be you)
Analyzing Big Data's Weakest Link (hint: it might be you)
HPCC Systems
 
Big Data: the weakest link
Big Data: the weakest linkBig Data: the weakest link
Big Data: the weakest link
CS, NcState
 
Keynote@QUATIC - Recognizing Developer's Emotions: Advances and Open Challenges
Keynote@QUATIC - Recognizing Developer's Emotions: Advances and Open ChallengesKeynote@QUATIC - Recognizing Developer's Emotions: Advances and Open Challenges
Keynote@QUATIC - Recognizing Developer's Emotions: Advances and Open Challenges
Nicole Novielli
 
White paper loren k schwappach
White paper   loren k schwappachWhite paper   loren k schwappach
White paper loren k schwappachLoren Schwappach
 
Micro-Serendipity: Meaningful Coincidences in Everyday Life Shared on Twitter
Micro-Serendipity: Meaningful Coincidences in Everyday Life Shared on TwitterMicro-Serendipity: Meaningful Coincidences in Everyday Life Shared on Twitter
Micro-Serendipity: Meaningful Coincidences in Everyday Life Shared on Twitter
Toine Bogers
 
How does Social Software support Global Software Development?
How does Social Software support Global Software Development?How does Social Software support Global Software Development?
How does Social Software support Global Software Development?
Rosalba Giuffrida
 
Empathic Mixed Reality
Empathic Mixed RealityEmpathic Mixed Reality
Empathic Mixed Reality
Thammathip Piumsomboon
 
A General Architecture for an Emotion-aware Content-based Recommender System
A General Architecture for an Emotion-aware Content-based Recommender SystemA General Architecture for an Emotion-aware Content-based Recommender System
A General Architecture for an Emotion-aware Content-based Recommender System
Lucio Narducci
 
A Subjective Feature Extraction For Sentiment Analysis In Malayalam Language
A Subjective Feature Extraction For Sentiment Analysis In Malayalam LanguageA Subjective Feature Extraction For Sentiment Analysis In Malayalam Language
A Subjective Feature Extraction For Sentiment Analysis In Malayalam Language
Jeff Nelson
 
Neural Network Based Context Sensitive Sentiment Analysis
Neural Network Based Context Sensitive Sentiment AnalysisNeural Network Based Context Sensitive Sentiment Analysis
Neural Network Based Context Sensitive Sentiment Analysis
Editor IJCATR
 

Similar to Sentiment analysis tools for software engineering research cannot be used out of the box (20)

A Benchmark Study on Sentiment Analysis for Software Engineering Research
A Benchmark Study on Sentiment Analysis for Software Engineering ResearchA Benchmark Study on Sentiment Analysis for Software Engineering Research
A Benchmark Study on Sentiment Analysis for Software Engineering Research
 
Some examples of Open Source computational tools for researchers
Some examples of Open Source computational tools for researchersSome examples of Open Source computational tools for researchers
Some examples of Open Source computational tools for researchers
 
Emotion Analysis in Software Ecosystems
Emotion Analysis in Software EcosystemsEmotion Analysis in Software Ecosystems
Emotion Analysis in Software Ecosystems
 
Intro to sentiment analysis
Intro to sentiment analysisIntro to sentiment analysis
Intro to sentiment analysis
 
IRJET- BDI using NLP for Efficient Depression Identification
IRJET- BDI using NLP for Efficient Depression IdentificationIRJET- BDI using NLP for Efficient Depression Identification
IRJET- BDI using NLP for Efficient Depression Identification
 
A Gold Standard for Emotion Annotation in Stack Overflow
A Gold Standard for Emotion Annotation in Stack Overflow A Gold Standard for Emotion Annotation in Stack Overflow
A Gold Standard for Emotion Annotation in Stack Overflow
 
To Label or Not? Advances and Open Challenges in SE-specific Sentiment Analysis
To Label or Not? Advances and Open Challenges in SE-specific Sentiment AnalysisTo Label or Not? Advances and Open Challenges in SE-specific Sentiment Analysis
To Label or Not? Advances and Open Challenges in SE-specific Sentiment Analysis
 
Aspect-Level Sentiment Analysis On Hotel Reviews
Aspect-Level Sentiment Analysis On Hotel ReviewsAspect-Level Sentiment Analysis On Hotel Reviews
Aspect-Level Sentiment Analysis On Hotel Reviews
 
R programming for psychometrics
R programming for psychometricsR programming for psychometrics
R programming for psychometrics
 
The Challenges of Affect Detection in the Social Programmer Ecosystem
The Challenges of Affect Detection in the Social Programmer EcosystemThe Challenges of Affect Detection in the Social Programmer Ecosystem
The Challenges of Affect Detection in the Social Programmer Ecosystem
 
Analyzing Big Data's Weakest Link (hint: it might be you)
Analyzing Big Data's Weakest Link  (hint: it might be you)Analyzing Big Data's Weakest Link  (hint: it might be you)
Analyzing Big Data's Weakest Link (hint: it might be you)
 
Big Data: the weakest link
Big Data: the weakest linkBig Data: the weakest link
Big Data: the weakest link
 
Keynote@QUATIC - Recognizing Developer's Emotions: Advances and Open Challenges
Keynote@QUATIC - Recognizing Developer's Emotions: Advances and Open ChallengesKeynote@QUATIC - Recognizing Developer's Emotions: Advances and Open Challenges
Keynote@QUATIC - Recognizing Developer's Emotions: Advances and Open Challenges
 
White paper loren k schwappach
White paper   loren k schwappachWhite paper   loren k schwappach
White paper loren k schwappach
 
Micro-Serendipity: Meaningful Coincidences in Everyday Life Shared on Twitter
Micro-Serendipity: Meaningful Coincidences in Everyday Life Shared on TwitterMicro-Serendipity: Meaningful Coincidences in Everyday Life Shared on Twitter
Micro-Serendipity: Meaningful Coincidences in Everyday Life Shared on Twitter
 
How does Social Software support Global Software Development?
How does Social Software support Global Software Development?How does Social Software support Global Software Development?
How does Social Software support Global Software Development?
 
Empathic Mixed Reality
Empathic Mixed RealityEmpathic Mixed Reality
Empathic Mixed Reality
 
A General Architecture for an Emotion-aware Content-based Recommender System
A General Architecture for an Emotion-aware Content-based Recommender SystemA General Architecture for an Emotion-aware Content-based Recommender System
A General Architecture for an Emotion-aware Content-based Recommender System
 
A Subjective Feature Extraction For Sentiment Analysis In Malayalam Language
A Subjective Feature Extraction For Sentiment Analysis In Malayalam LanguageA Subjective Feature Extraction For Sentiment Analysis In Malayalam Language
A Subjective Feature Extraction For Sentiment Analysis In Malayalam Language
 
Neural Network Based Context Sensitive Sentiment Analysis
Neural Network Based Context Sensitive Sentiment AnalysisNeural Network Based Context Sensitive Sentiment Analysis
Neural Network Based Context Sensitive Sentiment Analysis
 

More from Alexander Serebrenik

Software development is a human activity: understanding software requires und...
Software development is a human activity: understanding software requires und...Software development is a human activity: understanding software requires und...
Software development is a human activity: understanding software requires und...
Alexander Serebrenik
 
Towards Continuous Performance Assessment of Java Applications With PerfBot
Towards Continuous Performance Assessment of Java Applications With PerfBotTowards Continuous Performance Assessment of Java Applications With PerfBot
Towards Continuous Performance Assessment of Java Applications With PerfBot
Alexander Serebrenik
 
“STILL AROUND”: Experiences and Survival Strategies of Veteran Women Software...
“STILL AROUND”: Experiences and Survival Strategies of Veteran Women Software...“STILL AROUND”: Experiences and Survival Strategies of Veteran Women Software...
“STILL AROUND”: Experiences and Survival Strategies of Veteran Women Software...
Alexander Serebrenik
 
A Qualitative Study of Developers’ Discussions of Their Problems and Joys Dur...
A Qualitative Study of Developers’ Discussions of Their Problems and Joys Dur...A Qualitative Study of Developers’ Discussions of Their Problems and Joys Dur...
A Qualitative Study of Developers’ Discussions of Their Problems and Joys Dur...
Alexander Serebrenik
 
Investigating the Resolution of Vulnerable Dependencies with Dependabot Secur...
Investigating the Resolution of Vulnerable Dependencies with Dependabot Secur...Investigating the Resolution of Vulnerable Dependencies with Dependabot Secur...
Investigating the Resolution of Vulnerable Dependencies with Dependabot Secur...
Alexander Serebrenik
 
Gender and Age in Software Engineering
Gender and Age in Software EngineeringGender and Age in Software Engineering
Gender and Age in Software Engineering
Alexander Serebrenik
 
Alexander - intro
Alexander - introAlexander - intro
Alexander - intro
Alexander Serebrenik
 
Diversity and inclusion in a CS classroom
Diversity and inclusion in a CS classroomDiversity and inclusion in a CS classroom
Diversity and inclusion in a CS classroom
Alexander Serebrenik
 
An Empirical Assessment on Merging and Repositioning of Static Analysis Alarms
An Empirical Assessment on Merging and Repositioning of Static Analysis AlarmsAn Empirical Assessment on Merging and Repositioning of Static Analysis Alarms
An Empirical Assessment on Merging and Repositioning of Static Analysis Alarms
Alexander Serebrenik
 
Classification and Ranking of Delta Static Analysis Alarms
Classification and Ranking of Delta Static Analysis AlarmsClassification and Ranking of Delta Static Analysis Alarms
Classification and Ranking of Delta Static Analysis Alarms
Alexander Serebrenik
 
What Is an AI Engineer? An Empirical Analysis of Job Ads in The Netherlands
What Is an AI Engineer? An Empirical Analysis of Job Ads in The NetherlandsWhat Is an AI Engineer? An Empirical Analysis of Job Ads in The Netherlands
What Is an AI Engineer? An Empirical Analysis of Job Ads in The Netherlands
Alexander Serebrenik
 
Gender and Community Smells
Gender and Community SmellsGender and Community Smells
Gender and Community Smells
Alexander Serebrenik
 
Bias in MSR Research
Bias in MSR ResearchBias in MSR Research
Bias in MSR Research
Alexander Serebrenik
 
From team organisation to software quality
From team organisation to software qualityFrom team organisation to software quality
From team organisation to software quality
Alexander Serebrenik
 
Women in Dutch Computer Science: Best Practices for Recruitment, Onboarding a...
Women in Dutch Computer Science: Best Practices for Recruitment, Onboarding a...Women in Dutch Computer Science: Best Practices for Recruitment, Onboarding a...
Women in Dutch Computer Science: Best Practices for Recruitment, Onboarding a...
Alexander Serebrenik
 
My research story (presentation at ICSE 2021 New Faculty Symposium)
My research story (presentation at ICSE 2021 New Faculty Symposium)My research story (presentation at ICSE 2021 New Faculty Symposium)
My research story (presentation at ICSE 2021 New Faculty Symposium)
Alexander Serebrenik
 
Opinion Mining for Software Engineering
Opinion Mining for Software EngineeringOpinion Mining for Software Engineering
Opinion Mining for Software Engineering
Alexander Serebrenik
 
Removing Self Admitted Technical Debt
Removing Self Admitted Technical DebtRemoving Self Admitted Technical Debt
Removing Self Admitted Technical Debt
Alexander Serebrenik
 
Gender Diversity and Inclusion and Software Engineering
Gender Diversity and Inclusion and Software EngineeringGender Diversity and Inclusion and Software Engineering
Gender Diversity and Inclusion and Software Engineering
Alexander Serebrenik
 
Identifying Developers’ Gender: State of the Art
Identifying Developers’ Gender: State of the ArtIdentifying Developers’ Gender: State of the Art
Identifying Developers’ Gender: State of the Art
Alexander Serebrenik
 

More from Alexander Serebrenik (20)

Software development is a human activity: understanding software requires und...
Software development is a human activity: understanding software requires und...Software development is a human activity: understanding software requires und...
Software development is a human activity: understanding software requires und...
 
Towards Continuous Performance Assessment of Java Applications With PerfBot
Towards Continuous Performance Assessment of Java Applications With PerfBotTowards Continuous Performance Assessment of Java Applications With PerfBot
Towards Continuous Performance Assessment of Java Applications With PerfBot
 
“STILL AROUND”: Experiences and Survival Strategies of Veteran Women Software...
“STILL AROUND”: Experiences and Survival Strategies of Veteran Women Software...“STILL AROUND”: Experiences and Survival Strategies of Veteran Women Software...
“STILL AROUND”: Experiences and Survival Strategies of Veteran Women Software...
 
A Qualitative Study of Developers’ Discussions of Their Problems and Joys Dur...
A Qualitative Study of Developers’ Discussions of Their Problems and Joys Dur...A Qualitative Study of Developers’ Discussions of Their Problems and Joys Dur...
A Qualitative Study of Developers’ Discussions of Their Problems and Joys Dur...
 
Investigating the Resolution of Vulnerable Dependencies with Dependabot Secur...
Investigating the Resolution of Vulnerable Dependencies with Dependabot Secur...Investigating the Resolution of Vulnerable Dependencies with Dependabot Secur...
Investigating the Resolution of Vulnerable Dependencies with Dependabot Secur...
 
Gender and Age in Software Engineering
Gender and Age in Software EngineeringGender and Age in Software Engineering
Gender and Age in Software Engineering
 
Alexander - intro
Alexander - introAlexander - intro
Alexander - intro
 
Diversity and inclusion in a CS classroom
Diversity and inclusion in a CS classroomDiversity and inclusion in a CS classroom
Diversity and inclusion in a CS classroom
 
An Empirical Assessment on Merging and Repositioning of Static Analysis Alarms
An Empirical Assessment on Merging and Repositioning of Static Analysis AlarmsAn Empirical Assessment on Merging and Repositioning of Static Analysis Alarms
An Empirical Assessment on Merging and Repositioning of Static Analysis Alarms
 
Classification and Ranking of Delta Static Analysis Alarms
Classification and Ranking of Delta Static Analysis AlarmsClassification and Ranking of Delta Static Analysis Alarms
Classification and Ranking of Delta Static Analysis Alarms
 
What Is an AI Engineer? An Empirical Analysis of Job Ads in The Netherlands
What Is an AI Engineer? An Empirical Analysis of Job Ads in The NetherlandsWhat Is an AI Engineer? An Empirical Analysis of Job Ads in The Netherlands
What Is an AI Engineer? An Empirical Analysis of Job Ads in The Netherlands
 
Gender and Community Smells
Gender and Community SmellsGender and Community Smells
Gender and Community Smells
 
Bias in MSR Research
Bias in MSR ResearchBias in MSR Research
Bias in MSR Research
 
From team organisation to software quality
From team organisation to software qualityFrom team organisation to software quality
From team organisation to software quality
 
Women in Dutch Computer Science: Best Practices for Recruitment, Onboarding a...
Women in Dutch Computer Science: Best Practices for Recruitment, Onboarding a...Women in Dutch Computer Science: Best Practices for Recruitment, Onboarding a...
Women in Dutch Computer Science: Best Practices for Recruitment, Onboarding a...
 
My research story (presentation at ICSE 2021 New Faculty Symposium)
My research story (presentation at ICSE 2021 New Faculty Symposium)My research story (presentation at ICSE 2021 New Faculty Symposium)
My research story (presentation at ICSE 2021 New Faculty Symposium)
 
Opinion Mining for Software Engineering
Opinion Mining for Software EngineeringOpinion Mining for Software Engineering
Opinion Mining for Software Engineering
 
Removing Self Admitted Technical Debt
Removing Self Admitted Technical DebtRemoving Self Admitted Technical Debt
Removing Self Admitted Technical Debt
 
Gender Diversity and Inclusion and Software Engineering
Gender Diversity and Inclusion and Software EngineeringGender Diversity and Inclusion and Software Engineering
Gender Diversity and Inclusion and Software Engineering
 
Identifying Developers’ Gender: State of the Art
Identifying Developers’ Gender: State of the ArtIdentifying Developers’ Gender: State of the Art
Identifying Developers’ Gender: State of the Art
 

Recently uploaded

Enhancing Project Management Efficiency_ Leveraging AI Tools like ChatGPT.pdf
Enhancing Project Management Efficiency_ Leveraging AI Tools like ChatGPT.pdfEnhancing Project Management Efficiency_ Leveraging AI Tools like ChatGPT.pdf
Enhancing Project Management Efficiency_ Leveraging AI Tools like ChatGPT.pdf
Jay Das
 
SOCRadar Research Team: Latest Activities of IntelBroker
SOCRadar Research Team: Latest Activities of IntelBrokerSOCRadar Research Team: Latest Activities of IntelBroker
SOCRadar Research Team: Latest Activities of IntelBroker
SOCRadar
 
GlobusWorld 2024 Opening Keynote session
GlobusWorld 2024 Opening Keynote sessionGlobusWorld 2024 Opening Keynote session
GlobusWorld 2024 Opening Keynote session
Globus
 
A Comprehensive Look at Generative AI in Retail App Testing.pdf
A Comprehensive Look at Generative AI in Retail App Testing.pdfA Comprehensive Look at Generative AI in Retail App Testing.pdf
A Comprehensive Look at Generative AI in Retail App Testing.pdf
kalichargn70th171
 
Beyond Event Sourcing - Embracing CRUD for Wix Platform - Java.IL
Beyond Event Sourcing - Embracing CRUD for Wix Platform - Java.ILBeyond Event Sourcing - Embracing CRUD for Wix Platform - Java.IL
Beyond Event Sourcing - Embracing CRUD for Wix Platform - Java.IL
Natan Silnitsky
 
Paketo Buildpacks : la meilleure façon de construire des images OCI? DevopsDa...
Paketo Buildpacks : la meilleure façon de construire des images OCI? DevopsDa...Paketo Buildpacks : la meilleure façon de construire des images OCI? DevopsDa...
Paketo Buildpacks : la meilleure façon de construire des images OCI? DevopsDa...
Anthony Dahanne
 
Understanding Globus Data Transfers with NetSage
Understanding Globus Data Transfers with NetSageUnderstanding Globus Data Transfers with NetSage
Understanding Globus Data Transfers with NetSage
Globus
 
Graphic Design Crash Course for beginners
Graphic Design Crash Course for beginnersGraphic Design Crash Course for beginners
Graphic Design Crash Course for beginners
e20449
 
AI Pilot Review: The World’s First Virtual Assistant Marketing Suite
AI Pilot Review: The World’s First Virtual Assistant Marketing SuiteAI Pilot Review: The World’s First Virtual Assistant Marketing Suite
AI Pilot Review: The World’s First Virtual Assistant Marketing Suite
Google
 
Globus Compute Introduction - GlobusWorld 2024
Globus Compute Introduction - GlobusWorld 2024Globus Compute Introduction - GlobusWorld 2024
Globus Compute Introduction - GlobusWorld 2024
Globus
 
Enterprise Resource Planning System in Telangana
Enterprise Resource Planning System in TelanganaEnterprise Resource Planning System in Telangana
Enterprise Resource Planning System in Telangana
NYGGS Automation Suite
 
Exploring Innovations in Data Repository Solutions - Insights from the U.S. G...
Exploring Innovations in Data Repository Solutions - Insights from the U.S. G...Exploring Innovations in Data Repository Solutions - Insights from the U.S. G...
Exploring Innovations in Data Repository Solutions - Insights from the U.S. G...
Globus
 
First Steps with Globus Compute Multi-User Endpoints
First Steps with Globus Compute Multi-User EndpointsFirst Steps with Globus Compute Multi-User Endpoints
First Steps with Globus Compute Multi-User Endpoints
Globus
 
Orion Context Broker introduction 20240604
Orion Context Broker introduction 20240604Orion Context Broker introduction 20240604
Orion Context Broker introduction 20240604
Fermin Galan
 
Prosigns: Transforming Business with Tailored Technology Solutions
Prosigns: Transforming Business with Tailored Technology SolutionsProsigns: Transforming Business with Tailored Technology Solutions
Prosigns: Transforming Business with Tailored Technology Solutions
Prosigns
 
May Marketo Masterclass, London MUG May 22 2024.pdf
May Marketo Masterclass, London MUG May 22 2024.pdfMay Marketo Masterclass, London MUG May 22 2024.pdf
May Marketo Masterclass, London MUG May 22 2024.pdf
Adele Miller
 
Into the Box 2024 - Keynote Day 2 Slides.pdf
Into the Box 2024 - Keynote Day 2 Slides.pdfInto the Box 2024 - Keynote Day 2 Slides.pdf
Into the Box 2024 - Keynote Day 2 Slides.pdf
Ortus Solutions, Corp
 
Large Language Models and the End of Programming
Large Language Models and the End of ProgrammingLarge Language Models and the End of Programming
Large Language Models and the End of Programming
Matt Welsh
 
Accelerate Enterprise Software Engineering with Platformless
Accelerate Enterprise Software Engineering with PlatformlessAccelerate Enterprise Software Engineering with Platformless
Accelerate Enterprise Software Engineering with Platformless
WSO2
 
BoxLang: Review our Visionary Licenses of 2024
BoxLang: Review our Visionary Licenses of 2024BoxLang: Review our Visionary Licenses of 2024
BoxLang: Review our Visionary Licenses of 2024
Ortus Solutions, Corp
 

Recently uploaded (20)

Enhancing Project Management Efficiency_ Leveraging AI Tools like ChatGPT.pdf
Enhancing Project Management Efficiency_ Leveraging AI Tools like ChatGPT.pdfEnhancing Project Management Efficiency_ Leveraging AI Tools like ChatGPT.pdf
Enhancing Project Management Efficiency_ Leveraging AI Tools like ChatGPT.pdf
 
SOCRadar Research Team: Latest Activities of IntelBroker
SOCRadar Research Team: Latest Activities of IntelBrokerSOCRadar Research Team: Latest Activities of IntelBroker
SOCRadar Research Team: Latest Activities of IntelBroker
 
GlobusWorld 2024 Opening Keynote session
GlobusWorld 2024 Opening Keynote sessionGlobusWorld 2024 Opening Keynote session
GlobusWorld 2024 Opening Keynote session
 
A Comprehensive Look at Generative AI in Retail App Testing.pdf
A Comprehensive Look at Generative AI in Retail App Testing.pdfA Comprehensive Look at Generative AI in Retail App Testing.pdf
A Comprehensive Look at Generative AI in Retail App Testing.pdf
 
Beyond Event Sourcing - Embracing CRUD for Wix Platform - Java.IL
Beyond Event Sourcing - Embracing CRUD for Wix Platform - Java.ILBeyond Event Sourcing - Embracing CRUD for Wix Platform - Java.IL
Beyond Event Sourcing - Embracing CRUD for Wix Platform - Java.IL
 
Paketo Buildpacks : la meilleure façon de construire des images OCI? DevopsDa...
Paketo Buildpacks : la meilleure façon de construire des images OCI? DevopsDa...Paketo Buildpacks : la meilleure façon de construire des images OCI? DevopsDa...
Paketo Buildpacks : la meilleure façon de construire des images OCI? DevopsDa...
 
Understanding Globus Data Transfers with NetSage
Understanding Globus Data Transfers with NetSageUnderstanding Globus Data Transfers with NetSage
Understanding Globus Data Transfers with NetSage
 
Graphic Design Crash Course for beginners
Graphic Design Crash Course for beginnersGraphic Design Crash Course for beginners
Graphic Design Crash Course for beginners
 
AI Pilot Review: The World’s First Virtual Assistant Marketing Suite
AI Pilot Review: The World’s First Virtual Assistant Marketing SuiteAI Pilot Review: The World’s First Virtual Assistant Marketing Suite
AI Pilot Review: The World’s First Virtual Assistant Marketing Suite
 
Globus Compute Introduction - GlobusWorld 2024
Globus Compute Introduction - GlobusWorld 2024Globus Compute Introduction - GlobusWorld 2024
Globus Compute Introduction - GlobusWorld 2024
 
Enterprise Resource Planning System in Telangana
Enterprise Resource Planning System in TelanganaEnterprise Resource Planning System in Telangana
Enterprise Resource Planning System in Telangana
 
Exploring Innovations in Data Repository Solutions - Insights from the U.S. G...
Exploring Innovations in Data Repository Solutions - Insights from the U.S. G...Exploring Innovations in Data Repository Solutions - Insights from the U.S. G...
Exploring Innovations in Data Repository Solutions - Insights from the U.S. G...
 
First Steps with Globus Compute Multi-User Endpoints
First Steps with Globus Compute Multi-User EndpointsFirst Steps with Globus Compute Multi-User Endpoints
First Steps with Globus Compute Multi-User Endpoints
 
Orion Context Broker introduction 20240604
Orion Context Broker introduction 20240604Orion Context Broker introduction 20240604
Orion Context Broker introduction 20240604
 
Prosigns: Transforming Business with Tailored Technology Solutions
Prosigns: Transforming Business with Tailored Technology SolutionsProsigns: Transforming Business with Tailored Technology Solutions
Prosigns: Transforming Business with Tailored Technology Solutions
 
May Marketo Masterclass, London MUG May 22 2024.pdf
May Marketo Masterclass, London MUG May 22 2024.pdfMay Marketo Masterclass, London MUG May 22 2024.pdf
May Marketo Masterclass, London MUG May 22 2024.pdf
 
Into the Box 2024 - Keynote Day 2 Slides.pdf
Into the Box 2024 - Keynote Day 2 Slides.pdfInto the Box 2024 - Keynote Day 2 Slides.pdf
Into the Box 2024 - Keynote Day 2 Slides.pdf
 
Large Language Models and the End of Programming
Large Language Models and the End of ProgrammingLarge Language Models and the End of Programming
Large Language Models and the End of Programming
 
Accelerate Enterprise Software Engineering with Platformless
Accelerate Enterprise Software Engineering with PlatformlessAccelerate Enterprise Software Engineering with Platformless
Accelerate Enterprise Software Engineering with Platformless
 
BoxLang: Review our Visionary Licenses of 2024
BoxLang: Review our Visionary Licenses of 2024BoxLang: Review our Visionary Licenses of 2024
BoxLang: Review our Visionary Licenses of 2024
 

Sentiment analysis tools for software engineering research cannot be used out of the box

  • 1. On sentiment analysis tools for software engineering research Robbert Jongeling Subhajit Datta Alexander Serebrenik Eindhoven U of Technology (NL) Singapore U of Technology and Design (SG) Eindhoven U of Technology (NL) @jongeling_r @datta_subhajit @aserebrenik
  • 2. E. Guzman, D. Azócar, and Y. Li, “Sentiment analysis of commit comments in GitHub: An empirical study,” MSR 2014 A.-I. Rousinopoulos, G. Robles, and J. M. González-Barahona, “Sentiment analysis of Free/Open Source developers: preliminary findings from a case study,” Revista Eletrônica de Sistemas de Informação, 2014 E. Guzman and B. Bruegge, “Towards emotional awareness in software development teams,” in Joint Meeting on Foundations of Software Engineering, 2013 D. Pletea, B. Vasilescu, and A. Serebrenik, “Security and emotion: Sentiment analysis of security discussions on GitHub”, MSR 2014 M. Ortu, B. Adams, G. Destefanis, P. Tourani, M. Marchesi, and R. Tonelli, “Are bullies more productive? empirical study of affectiveness vs. issue fixing time,” in MSR 2015 D. Garcia, M. S. Zanetti, and F. Schweitzer, “The role of emotions in contributors activity: A case study on the Gentoo community,” in International Conference on Cloud and Green Computing, 2013
  • 3. E. Guzman, D. Azócar, and Y. Li, “Sentiment analysis of commit comments in GitHub: An empirical study,” MSR 2014 A.-I. Rousinopoulos, G. Robles, and J. M. González-Barahona, “Sentiment analysis of Free/Open Source developers: preliminary findings from a case study,” Revista Eletrônica de Sistemas de Informação, 2014 E. Guzman and B. Bruegge, “Towards emotional awareness in software development teams,” in Joint Meeting on Foundations of Software Engineering, 2013 D. Pletea, B. Vasilescu, and A. Serebrenik, “Security and emotion: Sentiment analysis of security discussions on GitHub”, MSR 2014 M. Ortu, B. Adams, G. Destefanis, P. Tourani, M. Marchesi, and R. Tonelli, “Are bullies more productive? empirical study of affectiveness vs. issue fixing time,” in MSR 2015 D. Garcia, M. S. Zanetti, and F. Schweitzer, “The role of emotions in contributors activity: A case study on the Gentoo community,” in International Conference on Cloud and Green Computing, 2013 NLTK SentiStrength
  • 4. E. Guzman, D. Azócar, and Y. Li, “Sentiment analysis of commit comments in GitHub: An empirical study,” MSR 2014 A.-I. Rousinopoulos, G. Robles, and J. M. González-Barahona, “Sentiment analysis of Free/Open Source developers: preliminary findings from a case study,” Revista Eletrônica de Sistemas de Informação, 2014 E. Guzman and B. Bruegge, “Towards emotional awareness in software development teams,” in Joint Meeting on Foundations of Software Engineering, 2013 D. Pletea, B. Vasilescu, and A. Serebrenik, “Security and emotion: Sentiment analysis of security discussions on GitHub”, MSR 2014 M. Ortu, B. Adams, G. Destefanis, P. Tourani, M. Marchesi, and R. Tonelli, “Are bullies more productive? empirical study of affectiveness vs. issue fixing time,” in MSR 2015 D. Garcia, M. S. Zanetti, and F. Schweitzer, “The role of emotions in contributors activity: A case study on the Gentoo community,” in International Conference on Cloud and Green Computing, 2013 NLTK SentiStrength Trained on movie/product reviews. Threat: might misidentify (or fail to identify) a sentiment in a software engineering artefact
  • 5. • RQ1: To what extent do different sentiment analysis tools agree with emotions of software developers? • RQ2: To what extent do different sentiment analysis tools agree with each other? • RQ3: Do different sentiment analysis tools lead to contradictory results in a software engineering study?
  • 6. Murgia et al. MSR 2014 392 comments x 4 evaluators joy love surprise anger fearsadness positive negative { { RQ1 RQ2
  • 7. Murgia et al. MSR 2014 392 comments x 4 evaluators joy love surprise anger fearsadness positive negative { { Consistent: positive: 3 positive, none negative negative: 3 negative, none positive neutral: ≥3 without emotion indication Alchemy Stanford NLP NLTK SentiStrength RQ1 Manual neg neu pos Tool neg neu pos RQ2 Tool A neg neu pos Tool B neg neu pos RQ1 RQ2
  • 8. Murgia et al. MSR 2014 392 comments x 4 evaluators joy love surprise anger fearsadness positive negative { { Consistent: positive: 3 positive, none negative negative: 3 negative, none positive neutral: ≥3 without emotion indication Alchemy Stanford NLP NLTK SentiStrength RQ1 Manual neg neu pos Tool neg neu pos 54 24 217 0 ≤ Adjusted Rand Index ≤ 1 [Santos, Embrechts, ICANN 2009] RQ2 Tool A neg neu pos Tool B neg neu pos RQ1 RQ2
  • 9. Murgia et al. MSR 2014 392 comments x 4 evaluators joy love surprise anger fearsadness positive negative { { Consistent: positive: 3 positive, none negative negative: 3 negative, none positive neutral: ≥3 without emotion indication Alchemy Stanford NLP NLTK SentiStrength RQ1 Manual neg neu pos Tool neg neu pos 54 24 217 0 ≤ Adjusted Rand Index ≤ 1 [Santos, Embrechts, ICANN 2009] RQ2 Tool A neg neu pos Tool B neg neu pos RQ1 RQ2
  • 10. RQ1: To what extent do different sentiment analysis tools agree with emotions of software developers? RQ1 Manual neg neu pos NLTK neg 19 51 11 neu 0 138 7 pos 5 28 36 Tool ARI NLTK 0.239 SentiStrength 0.113 Stanford NLP 0.108 Alchemy 0.079 Tools do not agree with manual evaluation RQ1 RQ2
  • 11. RQ2: To what extent do different sentiment analysis tools agree with each other? RQ2 SentiStrength neg neu pos NLTK neg 17 39 25 neu 15 96 34 pos 6 20 43 Tool A Tool B ARI NLTK Alchemy 0.104 NLTK SentiStrength 0.090 Tools do not agree with each other RQ1 RQ2
  • 12. RQ3 issue tracker over text response time Sentiment Analysis Tool compare times for neg, neu, pos issues/questions q & a site NLTK
  • 13. issue tracker over text response time Sentiment Anal. Tool compare times for neg, neu, pos issues/questions q & a site NLTK ∩ SentiStrength issue tracker over text response time Sentiment Anal. Tool compare times for neg, neu, pos issues/questions q & a site SentiStrength RQ3 issue tracker over text response time Sentiment Analysis Tool compare times for neg, neu, pos issues/questions q & a site NLTK Are the results the same?
  • 14. NLTK SentiStrength NLTK ∩ SentiStrength ASF descr neg > neu*** neg > neu*** pos > neu*** pos > neu*** pos > neu*** pos > neg*** pos > neg*** ASF title neg > neu** pos > neu*** pos > neu** pos > neg* pos > neg** GNOME descr neg > neu*** neg > neu*** neg > neu*** pos > neu*** pos > neu*** pos > neu*** pos > neg*** neg > pos*** SO descr ø neg > pos* ø RQ3 RQ3: Do different sentiment analysis tools lead to contradictory results in a software engineering study? Choice of the sentiment analysis tool affects results of the software engineering study
  • 15. Tools do not agree with manual evaluation Tools do not agree with each other Choice of the sentiment analysis tool affects results of the software engineering study Summary Sentiment analysis tools are trained on movie/ product reviews. Threat: might misidentify (or fail to identify) a sentiment in a software engineering artefact
  • 16. Next steps? • Train sentiment analysis tools on software engineering data • Data of Murgia et al.: first step • More and better-suited data is needed