SlideShare a Scribd company logo
1 of 33
Download to read offline
1
Sentimine: a cregit Plugin to
Analyze the Sentiment Behind the
Linux Kernel Code
Isabella Ferreira
Polytechnique Montréal
Kate Stewart

Linux Foundation

Daniel German

University of Victoria

Bram Adams

Polytechnique Montréal

Towards
2Linus Torvalds’ temporary leave in September 2018 ...
3
… sparked a
Rich and
Diverse
Discussion
about
Change in
the
Community ...
4
… even after Linus’ comeback, discussions continued
5
6
Almost half a year later, what did
the community learn or change?
7
Code of Conflict became Code of Conduct :-)
8…, but a Code of Conduct just treats the symptoms,
what is the root of the problem?
9
How civil is the Linux community?
Is it a recent problem?
Is it a problem that is increasing over time?
How does civility change across release cycles?
How is the civility in the Linux community
compared to other open source projects?
10
Civility is not (yet) part of CHAOSS
11
Initial idea: Measuring civility through sentiment
Politeness Sentiment Emotion

great work, guys!
WTF, a package
refactoring and class
renaming in one
patch?
Good
civility
Bad

civility
Challenges
Identify the target of the sentiment

● Is the sentiment about the person (civility) or about the
code (review)?
Identify the right granularity to analyze sentiments

● Individual e-mails vs. e-mail threads?

● Subsystem vs. the whole kernel?

● Week vs. month vs. release cycle?
12
13
Can we measure differences in sentiment
between maintainers and non-maintainers?
Can we measure differences in
sentiment across time?
Can we measure differences in
sentiment between releases?
14
Approach
Extract LKML data
Public-inbox e-mails from
Jan 2017 to Jan 2019

Step 1 Step 2
Group e-mails by threads
Mailboxminer tool

Step 3
Filter threads
>= 2 e-mails in a thread

ignore e-mails that have patch
15
Approach
Step 4
Calculate metrics
Release cycle

Month, week, weekday

Maintainer vs. non-maintainers

Results
16
A lot of communication across releases...
0
0
17
Phase 2 in the release cycle has more discussion
Phase 1
Phase 3
Phase 2
18
Approach
Extract LKML data
Public-inbox e-mails from
Jan 2017 to Jan 2019

Step 1 Step 2
Group e-mails by threads
Mailboxminer tool

Step 3
Filter threads
>= 2 e-mails in a thread

ignore e-mails that have patch

Step 5
Detect sentiment
Senti4SD

Step 4
Calculate metrics
Release cycle

Subsystem

Maintainer vs. non-maintainers

Answer questions about
civility
19
Senti4SD
Lexicon-based
features
Keyword-based
features
Semantic features
Training the model
20
Senti4SD
INPUT OUTPUT
Equal proportion of positive/negative e-mail threads
21
22
Equal number of positive/negative discussions across releases
0 0
23
Specific events might change the sentiment…
24
The sentiment fluctuates among weeks
Weeks 31-34: August, 2018Weeks 31-39: September, 2018Weeks 40-44: October, 2018
25
Tuesdays, Wednesdays, and Thursdays have more positive
discussions
26The sentiment does not differ between maintainers and non-
maintainers
27
Challenges and Ongoing Work
28
Noisy
tools
How to act
on results?
Target of
sentiment?Noisy data
29
PollEv.com/isabellaferr986
Let’s do a quick survey?
31
What kind of tool support could help the community?
32
Links to mailing list
reviews generated
by email2git
https://cregit.linuxsources.org
Cregit tracks patches at token level
Ferreira et al. A Longitudinal Study on the Maintainers’ Sentiment of a Large Scale Open Source Ecosystem
Isabella Ferreira

Polytechnique Montréal

MCIS Lab
isabella.ferreira@polymtl.ca
/isabellaferreira
/isabellavieira
@isaferreira_57
We need
your
feedback!

More Related Content

Similar to Sentiment Analysis on the Linux Kernel

Computer Ethics Analyzing Information Technology.docx
Computer Ethics Analyzing Information Technology.docxComputer Ethics Analyzing Information Technology.docx
Computer Ethics Analyzing Information Technology.docx
patricke8
 
Computer Ethics Analyzing Information Technology.docx
Computer Ethics Analyzing Information Technology.docxComputer Ethics Analyzing Information Technology.docx
Computer Ethics Analyzing Information Technology.docx
mccormicknadine86
 
Computer Ethics Analyzing Information Technology
Computer Ethics Analyzing Information TechnologyComputer Ethics Analyzing Information Technology
Computer Ethics Analyzing Information Technology
LinaCovington707
 
Open Source for an Open World
Open Source for an Open WorldOpen Source for an Open World
Open Source for an Open World
Elizabeth Thomsen
 
Class 4 lecture notes learning networks and connective knowledge
Class 4 lecture notes learning networks and connective knowledgeClass 4 lecture notes learning networks and connective knowledge
Class 4 lecture notes learning networks and connective knowledge
Shawn Lennie
 
eMadrid 2014-01-17 uned Salvador Ros (UNED) "Big Data in Education"
eMadrid 2014-01-17 uned Salvador Ros (UNED) "Big Data in Education"eMadrid 2014-01-17 uned Salvador Ros (UNED) "Big Data in Education"
eMadrid 2014-01-17 uned Salvador Ros (UNED) "Big Data in Education"
eMadrid network
 
Workshopvin2 A Socio Legal View On Virtual Individual Networks
Workshopvin2 A Socio Legal View On Virtual Individual NetworksWorkshopvin2 A Socio Legal View On Virtual Individual Networks
Workshopvin2 A Socio Legal View On Virtual Individual Networks
imec.archive
 
Msr2009 emad
Msr2009 emadMsr2009 emad
Msr2009 emad
SAIL_QU
 

Similar to Sentiment Analysis on the Linux Kernel (20)

Computer Ethics Analyzing Information Technology.docx
Computer Ethics Analyzing Information Technology.docxComputer Ethics Analyzing Information Technology.docx
Computer Ethics Analyzing Information Technology.docx
 
Computer Ethics Analyzing Information Technology.docx
Computer Ethics Analyzing Information Technology.docxComputer Ethics Analyzing Information Technology.docx
Computer Ethics Analyzing Information Technology.docx
 
Computer Ethics Analyzing Information Technology
Computer Ethics Analyzing Information TechnologyComputer Ethics Analyzing Information Technology
Computer Ethics Analyzing Information Technology
 
Open Source for an Open World
Open Source for an Open WorldOpen Source for an Open World
Open Source for an Open World
 
MOA Innovation Award: Gamification in MROCs by InSites Consulting
MOA Innovation Award: Gamification in MROCs by InSites ConsultingMOA Innovation Award: Gamification in MROCs by InSites Consulting
MOA Innovation Award: Gamification in MROCs by InSites Consulting
 
Learning about CHAOSS - Ana Jimenez Santamaria, Daniel Izquierdo
Learning about CHAOSS - Ana Jimenez Santamaria, Daniel IzquierdoLearning about CHAOSS - Ana Jimenez Santamaria, Daniel Izquierdo
Learning about CHAOSS - Ana Jimenez Santamaria, Daniel Izquierdo
 
Social influencer relationship management
Social influencer relationship managementSocial influencer relationship management
Social influencer relationship management
 
A startup with no office, hipster tools and open source products
A startup with no office, hipster tools and open source productsA startup with no office, hipster tools and open source products
A startup with no office, hipster tools and open source products
 
Class 4 lecture notes learning networks and connective knowledge
Class 4 lecture notes learning networks and connective knowledgeClass 4 lecture notes learning networks and connective knowledge
Class 4 lecture notes learning networks and connective knowledge
 
eMadrid 2014-01-17 uned Salvador Ros (UNED) "Big Data in Education"
eMadrid 2014-01-17 uned Salvador Ros (UNED) "Big Data in Education"eMadrid 2014-01-17 uned Salvador Ros (UNED) "Big Data in Education"
eMadrid 2014-01-17 uned Salvador Ros (UNED) "Big Data in Education"
 
KASW'08 - Invited Talk
KASW'08 - Invited TalkKASW'08 - Invited Talk
KASW'08 - Invited Talk
 
D1: The NMC Methodology
D1: The NMC MethodologyD1: The NMC Methodology
D1: The NMC Methodology
 
Access report PPT
Access report PPTAccess report PPT
Access report PPT
 
SAD08 - Working With Others
SAD08 - Working With OthersSAD08 - Working With Others
SAD08 - Working With Others
 
Jon Keefe - Online Communities
Jon Keefe - Online CommunitiesJon Keefe - Online Communities
Jon Keefe - Online Communities
 
"Did you say manifesto?" Background and observations on manifestos. RE4SUSY 2014
"Did you say manifesto?" Background and observations on manifestos. RE4SUSY 2014"Did you say manifesto?" Background and observations on manifestos. RE4SUSY 2014
"Did you say manifesto?" Background and observations on manifestos. RE4SUSY 2014
 
Workshopvin2 A Socio Legal View On Virtual Individual Networks
Workshopvin2 A Socio Legal View On Virtual Individual NetworksWorkshopvin2 A Socio Legal View On Virtual Individual Networks
Workshopvin2 A Socio Legal View On Virtual Individual Networks
 
The (R)evolution of Social Media in Software Engineering
The (R)evolution of Social Media in Software EngineeringThe (R)evolution of Social Media in Software Engineering
The (R)evolution of Social Media in Software Engineering
 
A Communications Infrastructure for the UK HE Digital Community
A Communications Infrastructure for the UK HE Digital CommunityA Communications Infrastructure for the UK HE Digital Community
A Communications Infrastructure for the UK HE Digital Community
 
Msr2009 emad
Msr2009 emadMsr2009 emad
Msr2009 emad
 

Recently uploaded

如何办理(WashU毕业证书)圣路易斯华盛顿大学毕业证成绩单本科硕士学位证留信学历认证
如何办理(WashU毕业证书)圣路易斯华盛顿大学毕业证成绩单本科硕士学位证留信学历认证如何办理(WashU毕业证书)圣路易斯华盛顿大学毕业证成绩单本科硕士学位证留信学历认证
如何办理(WashU毕业证书)圣路易斯华盛顿大学毕业证成绩单本科硕士学位证留信学历认证
acoha1
 
如何办理哥伦比亚大学毕业证(Columbia毕业证)成绩单原版一比一
如何办理哥伦比亚大学毕业证(Columbia毕业证)成绩单原版一比一如何办理哥伦比亚大学毕业证(Columbia毕业证)成绩单原版一比一
如何办理哥伦比亚大学毕业证(Columbia毕业证)成绩单原版一比一
fztigerwe
 
NO1 Best Kala Jadu Expert Specialist In Germany Kala Jadu Expert Specialist I...
NO1 Best Kala Jadu Expert Specialist In Germany Kala Jadu Expert Specialist I...NO1 Best Kala Jadu Expert Specialist In Germany Kala Jadu Expert Specialist I...
NO1 Best Kala Jadu Expert Specialist In Germany Kala Jadu Expert Specialist I...
Amil baba
 
Abortion pills in Riyadh Saudi Arabia (+966572737505 buy cytotec
Abortion pills in Riyadh Saudi Arabia (+966572737505 buy cytotecAbortion pills in Riyadh Saudi Arabia (+966572737505 buy cytotec
Abortion pills in Riyadh Saudi Arabia (+966572737505 buy cytotec
Abortion pills in Riyadh +966572737505 get cytotec
 
1:1原版定制利物浦大学毕业证(Liverpool毕业证)成绩单学位证书留信学历认证
1:1原版定制利物浦大学毕业证(Liverpool毕业证)成绩单学位证书留信学历认证1:1原版定制利物浦大学毕业证(Liverpool毕业证)成绩单学位证书留信学历认证
1:1原版定制利物浦大学毕业证(Liverpool毕业证)成绩单学位证书留信学历认证
ppy8zfkfm
 
如何办理(UPenn毕业证书)宾夕法尼亚大学毕业证成绩单本科硕士学位证留信学历认证
如何办理(UPenn毕业证书)宾夕法尼亚大学毕业证成绩单本科硕士学位证留信学历认证如何办理(UPenn毕业证书)宾夕法尼亚大学毕业证成绩单本科硕士学位证留信学历认证
如何办理(UPenn毕业证书)宾夕法尼亚大学毕业证成绩单本科硕士学位证留信学历认证
acoha1
 
原件一样伦敦国王学院毕业证成绩单留信学历认证
原件一样伦敦国王学院毕业证成绩单留信学历认证原件一样伦敦国王学院毕业证成绩单留信学历认证
原件一样伦敦国王学院毕业证成绩单留信学历认证
pwgnohujw
 
Data Analytics for Digital Marketing Lecture for Advanced Digital & Social Me...
Data Analytics for Digital Marketing Lecture for Advanced Digital & Social Me...Data Analytics for Digital Marketing Lecture for Advanced Digital & Social Me...
Data Analytics for Digital Marketing Lecture for Advanced Digital & Social Me...
Valters Lauzums
 
如何办理(UCLA毕业证书)加州大学洛杉矶分校毕业证成绩单学位证留信学历认证原件一样
如何办理(UCLA毕业证书)加州大学洛杉矶分校毕业证成绩单学位证留信学历认证原件一样如何办理(UCLA毕业证书)加州大学洛杉矶分校毕业证成绩单学位证留信学历认证原件一样
如何办理(UCLA毕业证书)加州大学洛杉矶分校毕业证成绩单学位证留信学历认证原件一样
jk0tkvfv
 
Displacement, Velocity, Acceleration, and Second Derivatives
Displacement, Velocity, Acceleration, and Second DerivativesDisplacement, Velocity, Acceleration, and Second Derivatives
Displacement, Velocity, Acceleration, and Second Derivatives
23050636
 
原件一样(UWO毕业证书)西安大略大学毕业证成绩单留信学历认证
原件一样(UWO毕业证书)西安大略大学毕业证成绩单留信学历认证原件一样(UWO毕业证书)西安大略大学毕业证成绩单留信学历认证
原件一样(UWO毕业证书)西安大略大学毕业证成绩单留信学历认证
pwgnohujw
 

Recently uploaded (20)

如何办理(WashU毕业证书)圣路易斯华盛顿大学毕业证成绩单本科硕士学位证留信学历认证
如何办理(WashU毕业证书)圣路易斯华盛顿大学毕业证成绩单本科硕士学位证留信学历认证如何办理(WashU毕业证书)圣路易斯华盛顿大学毕业证成绩单本科硕士学位证留信学历认证
如何办理(WashU毕业证书)圣路易斯华盛顿大学毕业证成绩单本科硕士学位证留信学历认证
 
Seven tools of quality control.slideshare
Seven tools of quality control.slideshareSeven tools of quality control.slideshare
Seven tools of quality control.slideshare
 
Predictive Precipitation: Advanced Rain Forecasting Techniques
Predictive Precipitation: Advanced Rain Forecasting TechniquesPredictive Precipitation: Advanced Rain Forecasting Techniques
Predictive Precipitation: Advanced Rain Forecasting Techniques
 
如何办理哥伦比亚大学毕业证(Columbia毕业证)成绩单原版一比一
如何办理哥伦比亚大学毕业证(Columbia毕业证)成绩单原版一比一如何办理哥伦比亚大学毕业证(Columbia毕业证)成绩单原版一比一
如何办理哥伦比亚大学毕业证(Columbia毕业证)成绩单原版一比一
 
NO1 Best Kala Jadu Expert Specialist In Germany Kala Jadu Expert Specialist I...
NO1 Best Kala Jadu Expert Specialist In Germany Kala Jadu Expert Specialist I...NO1 Best Kala Jadu Expert Specialist In Germany Kala Jadu Expert Specialist I...
NO1 Best Kala Jadu Expert Specialist In Germany Kala Jadu Expert Specialist I...
 
Abortion pills in Riyadh Saudi Arabia (+966572737505 buy cytotec
Abortion pills in Riyadh Saudi Arabia (+966572737505 buy cytotecAbortion pills in Riyadh Saudi Arabia (+966572737505 buy cytotec
Abortion pills in Riyadh Saudi Arabia (+966572737505 buy cytotec
 
1:1原版定制利物浦大学毕业证(Liverpool毕业证)成绩单学位证书留信学历认证
1:1原版定制利物浦大学毕业证(Liverpool毕业证)成绩单学位证书留信学历认证1:1原版定制利物浦大学毕业证(Liverpool毕业证)成绩单学位证书留信学历认证
1:1原版定制利物浦大学毕业证(Liverpool毕业证)成绩单学位证书留信学历认证
 
如何办理(UPenn毕业证书)宾夕法尼亚大学毕业证成绩单本科硕士学位证留信学历认证
如何办理(UPenn毕业证书)宾夕法尼亚大学毕业证成绩单本科硕士学位证留信学历认证如何办理(UPenn毕业证书)宾夕法尼亚大学毕业证成绩单本科硕士学位证留信学历认证
如何办理(UPenn毕业证书)宾夕法尼亚大学毕业证成绩单本科硕士学位证留信学历认证
 
NOAM AAUG Adobe Summit 2024: Summit Slam Dunks
NOAM AAUG Adobe Summit 2024: Summit Slam DunksNOAM AAUG Adobe Summit 2024: Summit Slam Dunks
NOAM AAUG Adobe Summit 2024: Summit Slam Dunks
 
原件一样伦敦国王学院毕业证成绩单留信学历认证
原件一样伦敦国王学院毕业证成绩单留信学历认证原件一样伦敦国王学院毕业证成绩单留信学历认证
原件一样伦敦国王学院毕业证成绩单留信学历认证
 
How to Transform Clinical Trial Management with Advanced Data Analytics
How to Transform Clinical Trial Management with Advanced Data AnalyticsHow to Transform Clinical Trial Management with Advanced Data Analytics
How to Transform Clinical Trial Management with Advanced Data Analytics
 
Identify Customer Segments to Create Customer Offers for Each Segment - Appli...
Identify Customer Segments to Create Customer Offers for Each Segment - Appli...Identify Customer Segments to Create Customer Offers for Each Segment - Appli...
Identify Customer Segments to Create Customer Offers for Each Segment - Appli...
 
Data Analytics for Digital Marketing Lecture for Advanced Digital & Social Me...
Data Analytics for Digital Marketing Lecture for Advanced Digital & Social Me...Data Analytics for Digital Marketing Lecture for Advanced Digital & Social Me...
Data Analytics for Digital Marketing Lecture for Advanced Digital & Social Me...
 
SCI8-Q4-MOD11.pdfwrwujrrjfaajerjrajrrarj
SCI8-Q4-MOD11.pdfwrwujrrjfaajerjrajrrarjSCI8-Q4-MOD11.pdfwrwujrrjfaajerjrajrrarj
SCI8-Q4-MOD11.pdfwrwujrrjfaajerjrajrrarj
 
社内勉強会資料_Object Recognition as Next Token Prediction
社内勉強会資料_Object Recognition as Next Token Prediction社内勉強会資料_Object Recognition as Next Token Prediction
社内勉強会資料_Object Recognition as Next Token Prediction
 
如何办理(UCLA毕业证书)加州大学洛杉矶分校毕业证成绩单学位证留信学历认证原件一样
如何办理(UCLA毕业证书)加州大学洛杉矶分校毕业证成绩单学位证留信学历认证原件一样如何办理(UCLA毕业证书)加州大学洛杉矶分校毕业证成绩单学位证留信学历认证原件一样
如何办理(UCLA毕业证书)加州大学洛杉矶分校毕业证成绩单学位证留信学历认证原件一样
 
Bios of leading Astrologers & Researchers
Bios of leading Astrologers & ResearchersBios of leading Astrologers & Researchers
Bios of leading Astrologers & Researchers
 
Displacement, Velocity, Acceleration, and Second Derivatives
Displacement, Velocity, Acceleration, and Second DerivativesDisplacement, Velocity, Acceleration, and Second Derivatives
Displacement, Velocity, Acceleration, and Second Derivatives
 
原件一样(UWO毕业证书)西安大略大学毕业证成绩单留信学历认证
原件一样(UWO毕业证书)西安大略大学毕业证成绩单留信学历认证原件一样(UWO毕业证书)西安大略大学毕业证成绩单留信学历认证
原件一样(UWO毕业证书)西安大略大学毕业证成绩单留信学历认证
 
Aggregations - The Elasticsearch "GROUP BY"
Aggregations - The Elasticsearch "GROUP BY"Aggregations - The Elasticsearch "GROUP BY"
Aggregations - The Elasticsearch "GROUP BY"
 

Sentiment Analysis on the Linux Kernel