MSR2017-Challenge

•

0 likes•86 views

Masud Rahman

Impact of Continuous Integration on Code Reviews

Technology

RESEARCH PROBLEM: IMPACT OF
AUTOMATED BUILDS ON CODE REVIEWS
 Automated Builds, an
important part of CI for
commit merging & consistency
 Exponential increase of
automated builds over the
years with Travis CI.
 Builds & Code reviews as
interleaving steps in the pull-
based development
RQ1: Does the status of automated builds influence the code
review participation in open source projects?
RQ2: Do frequent automated builds help improve the
overall quality of peer code reviews?
RQ3: Can we automatically predict whether an automated
build would trigger new code reviews or not? 2

DATASET & EXPERIMENTAL SETUP
3
MSR Challenge
Dataset (3702K)
Our dataset
(578K)
(Is build triggered by PR?
i.e., gh_is_pr==true?)
346K builds
(No reviews)
232K builds
(with code reviews)
RQ1 RQ2 RQ3

ANSWERING RQ1: BUILD STATUS &
CODE REVIEW PARTICIPATION
Build Status Build Only Builds + Reviews Total
Canceled 2,616 1,368 3,984
Errored 51,729 27,262 78,991
Failed 55,546 39,025 94,571
Passed 236,573 164,174 400,747
All 346,464 231,829 (40%) 578,293
4
 578K PR-based builds
 Four build statuses
 232K (40%) build entries
with code reviews.
 Chi-squared tests (p-
value=2.2e-16<0.05)

ANSWERING RQ1: BUILD STATUS &
CODE REVIEW PARTICIPATION
5
Previous
Build status
#PR with Review Comments
Only Added↑ Only Removed↓ Total Changed↑↓
Canceled 20 24 65
Errored 510 265 812
Failed 1,542 826 2,316
Passed 4,235 1,788 5,677
All 6,307 2,903 8,870 (28%)
 31,648 PRs for 232K entries from 1000+ projects
 For 28% PR, #review comments changed.
 Passed builds triggered 18% of new reviews.
 Errored + Failed triggered 10%

ANSWERING RQ2: BUILD FREQUENCY &
CODE REVIEW QUALITY
6
Quantile Issue Comments PR Comments All Review Comments
M p-value ∆ M p-value ∆ M p-value ∆
Q1
0.60
<0.001* 0.35
0.24
<0.001* 0.49
0.84
<0.001* 0.41
Q4
0.99 0.52 1.50
M= Mean #review comments, * = Statistically significant, ∆ = Cliff’s Delta

ANSWERING RQ2: BUILD FREQUENCY &
CODE REVIEW QUALITY
 5 projects from Q1, and 5 from Q4, 3-4 years old
 Cumulative #review comments/build over 48 months
 Code review quality (i.e., #comments) improved almost
linearly for frequently built projects
 Didn’t happen so for the counterpart, looks zigzag.
7

ANSWERING RQ3: PREDICTION OF NEW
CODE REVIEW TRIGGERING
Learning
Algorithm
Overall
Accuracy
New Review Triggered?
Precision Recall
Naïve Bayes 58.03% 68.70% 29.50%
Logistic Regression 60.56% 64.50% 47.00%
J48 64.04% 69.50% 50.10%
8
 Features: build status, code change statistics, test
change statistics, and code review comments.
Response: New review triggered or unchanged.
 Three ML algorithms with 10-fold cross-validation.
 26.5K build entries as balanced dataset.
 J48 performed the best, 64% accuracy, 69.50%
precision & 50% recall.

TAKE-HOME MESSAGES
 Automated builds might influence manual code
reviews since they interleave each other in the
modern pull-based development
 Passed builds more associated with review
participations, and with new code reviews.
 Frequently built projects received more review
comments than less frequently built ones.
 Code review activities are steady over time with
frequently built projects. Not true for the
counterparts.
 Our prediction model can predict whether a
build will trigger new code review or not.
9

THANK YOU!! QUESTIONS?
10
Email: chanchal.roy@usask.ca or
masud.rahman@usask.ca

What's hot

Review Participation in Modern Code Review: An Empirical Study of the Android...The University of Adelaide

Facts about open source projects & testingVu Hung Nguyen

Establishing A Defect Prediction Model Using A Combination of Product Metrics...MIMOS Berhad/Open University Malaysia/Universiti Teknologi Malaysia

Revisiting Code Ownership and Its Relationship with Software Quality in the S...The University of Adelaide

Improving Code Review Effectiveness Through Reviewer RecommendationsThe University of Adelaide

Using HPC Resources to Exploit Big Data for Code Review AnalyticsThe University of Adelaide

TddAlexander Zaidel

Presentation slides: "How to get 100% code coverage" Rapita Systems Ltd

Icsm2010 kameiSAIL_QU

TDD CrashCourse Part2: TDDDavid Rodenas

A Regression Analysis Approach for Building a Prediction Model for System Tes...MIMOS Berhad/Open University Malaysia/Universiti Teknologi Malaysia

Code Coverage and Test Suite Effectiveness: Empirical Study with Real Bugs in...Pavneet Singh Kochhar

Technical Practices for Agile Engineering - PNSQC 2019Moss Drake

Rayleigh modelRoy Antony Arnold G

TDD - Test Driven DevelopmentLim Chanmann

Test driven development vs Behavior driven developmentGallop Solutions

Code quality Sunil Prasad

Csqe sample exam 1 solutions 05.00.04binodrit98

Code coverageReturn on Intelligence

Reliability Vs. TestingNicolò Paternoster

What's hot (20)

Review Participation in Modern Code Review: An Empirical Study of the Android...

Facts about open source projects & testing

Establishing A Defect Prediction Model Using A Combination of Product Metrics...

Revisiting Code Ownership and Its Relationship with Software Quality in the S...

Improving Code Review Effectiveness Through Reviewer Recommendations

Using HPC Resources to Exploit Big Data for Code Review Analytics

Tdd

Presentation slides: "How to get 100% code coverage"

Icsm2010 kamei

TDD CrashCourse Part2: TDD

A Regression Analysis Approach for Building a Prediction Model for System Tes...

Code Coverage and Test Suite Effectiveness: Empirical Study with Real Bugs in...

Technical Practices for Agile Engineering - PNSQC 2019

Rayleigh model

TDD - Test Driven Development

Test driven development vs Behavior driven development

Code quality

Csqe sample exam 1 solutions 05.00.04

Code coverage

Reliability Vs. Testing

Similar to MSR2017-Challenge

Test-Driven Code Review: An Empirical StudyDelft University of Technology

Adopting code reviews for agile software developmentmariobernhart

CORRECT-ToolDemo-ASE2016Masud Rahman

Code-Review-COW56-MeetingMasud Rahman

Would Static Analysis Tools Help Developers with Code Reviews?Sebastiano Panichella

CORRECT-ICSE2016Masud Rahman

A Tale of Experiments on Bug PredictionMartin Pinzger

Cukic Promise08 V3gregoryg

STRICT-SANER2017Masud Rahman

Declarative Performance Testing Automation - Automating Performance Testing f...Vincenzo Ferme

Preventive Software Maintenance: The Past, the Present, the FutureNikolaos Tsantalis

Тестирование спецификацийSQALab

Standardized Risk Measurement for IT Executives 101Konstantin Berger

Cross-project Defect Prediction Using A Connectivity-based Unsupervised Class...Feng Zhang

Automating good coding practicesKevin Peterson

CodeInsight-SCAM2015Masud Rahman

MSR2017-RevHelperMasud Rahman

Process Aspects and Social Dynamics of Contemporary Code Review: Insights fro...JeffCarver32

When Testing Meets Code Review: Why and How Developers Review TestsDelft University of Technology

Manual Testing Guide1.pdfKhushal Chate

Similar to MSR2017-Challenge (20)

Test-Driven Code Review: An Empirical Study

Adopting code reviews for agile software development

CORRECT-ToolDemo-ASE2016

Code-Review-COW56-Meeting

Would Static Analysis Tools Help Developers with Code Reviews?

CORRECT-ICSE2016

A Tale of Experiments on Bug Prediction

Cukic Promise08 V3

STRICT-SANER2017

Declarative Performance Testing Automation - Automating Performance Testing f...

Preventive Software Maintenance: The Past, the Present, the Future

Тестирование спецификаций

Standardized Risk Measurement for IT Executives 101

Cross-project Defect Prediction Using A Connectivity-based Unsupervised Class...

Automating good coding practices

CodeInsight-SCAM2015

MSR2017-RevHelper

Process Aspects and Social Dynamics of Contemporary Code Review: Insights fro...

When Testing Meets Code Review: Why and How Developers Review Tests

Manual Testing Guide1.pdf

Recently uploaded

FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhisoniya singh

Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent

WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal

Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard

Presentation on how to chat with PDF using ChatGPT code interpreternaman860154

Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada

SIEMENS: RAPUNZEL – A Tale About Knowledge GraphNeo4j

How to Remove Document Management Hurdles with X-Docs?XfilesPro

Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix

SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren

Key Features Of Token Development (1).pptxLBM Solutions

Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes

Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106

Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst

GenCyber Cyber Security Day PresentationMichael W. Hawkins

Scaling API-first – The story of a global engineering organizationRadu Cotescu

AI as an Interface for Commercial BuildingsMemoori

Benefits Of Flutter Compared To Other FrameworksSoftradix Technologies

Understanding the Laravel MVC ArchitecturePixlogix Infotech

The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad

Recently uploaded (20)

FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi

Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...

WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service

Maximizing Board Effectiveness 2024 Webinar.pptx

Presentation on how to chat with PDF using ChatGPT code interpreter

Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024

SIEMENS: RAPUNZEL – A Tale About Knowledge Graph

How to Remove Document Management Hurdles with X-Docs?

Swan(sea) Song – personal research during my six years at Swansea ... and bey...

SQL Database Design For Developers at php[tek] 2024

Key Features Of Token Development (1).pptx

Enhancing Worker Digital Experience: A Hands-on Workshop for Partners

Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics

Human Factors of XR: Using Human Factors to Design XR Systems

GenCyber Cyber Security Day Presentation

Scaling API-first – The story of a global engineering organization

AI as an Interface for Commercial Buildings

Benefits Of Flutter Compared To Other Frameworks

Understanding the Laravel MVC Architecture

The Codex of Business Writing Software for Real-World Solutions 2.pptx

MSR2017-Challenge

1. IMPACT OF CONTINUOUS INTEGRATION ON CODE REVIEWS Mohammad Masudur Rahman, Chanchal K. Roy Department of Computer Science University of Saskatchewan, Canada 14th International Conference on Mining Software Repositories (MSR 2017) (Challenge Track) Buenos Aires, Argentina

2. RESEARCH PROBLEM: IMPACT OF AUTOMATED BUILDS ON CODE REVIEWS  Automated Builds, an important part of CI for commit merging & consistency  Exponential increase of automated builds over the years with Travis CI.  Builds & Code reviews as interleaving steps in the pull- based development RQ1: Does the status of automated builds influence the code review participation in open source projects? RQ2: Do frequent automated builds help improve the overall quality of peer code reviews? RQ3: Can we automatically predict whether an automated build would trigger new code reviews or not? 2

3. DATASET & EXPERIMENTAL SETUP 3 MSR Challenge Dataset (3702K) Our dataset (578K) (Is build triggered by PR? i.e., gh_is_pr==true?) 346K builds (No reviews) 232K builds (with code reviews) RQ1 RQ2 RQ3

4. ANSWERING RQ1: BUILD STATUS & CODE REVIEW PARTICIPATION Build Status Build Only Builds + Reviews Total Canceled 2,616 1,368 3,984 Errored 51,729 27,262 78,991 Failed 55,546 39,025 94,571 Passed 236,573 164,174 400,747 All 346,464 231,829 (40%) 578,293 4  578K PR-based builds  Four build statuses  232K (40%) build entries with code reviews.  Chi-squared tests (p- value=2.2e-16<0.05)

5. ANSWERING RQ1: BUILD STATUS & CODE REVIEW PARTICIPATION 5 Previous Build status #PR with Review Comments Only Added↑ Only Removed↓ Total Changed↑↓ Canceled 20 24 65 Errored 510 265 812 Failed 1,542 826 2,316 Passed 4,235 1,788 5,677 All 6,307 2,903 8,870 (28%)  31,648 PRs for 232K entries from 1000+ projects  For 28% PR, #review comments changed.  Passed builds triggered 18% of new reviews.  Errored + Failed triggered 10%

6. ANSWERING RQ2: BUILD FREQUENCY & CODE REVIEW QUALITY 6 Quantile Issue Comments PR Comments All Review Comments M p-value ∆ M p-value ∆ M p-value ∆ Q1 0.60 <0.001* 0.35 0.24 <0.001* 0.49 0.84 <0.001* 0.41 Q4 0.99 0.52 1.50 M= Mean #review comments, * = Statistically significant, ∆ = Cliff’s Delta

7. ANSWERING RQ2: BUILD FREQUENCY & CODE REVIEW QUALITY  5 projects from Q1, and 5 from Q4, 3-4 years old  Cumulative #review comments/build over 48 months  Code review quality (i.e., #comments) improved almost linearly for frequently built projects  Didn’t happen so for the counterpart, looks zigzag. 7

8. ANSWERING RQ3: PREDICTION OF NEW CODE REVIEW TRIGGERING Learning Algorithm Overall Accuracy New Review Triggered? Precision Recall Naïve Bayes 58.03% 68.70% 29.50% Logistic Regression 60.56% 64.50% 47.00% J48 64.04% 69.50% 50.10% 8  Features: build status, code change statistics, test change statistics, and code review comments. Response: New review triggered or unchanged.  Three ML algorithms with 10-fold cross-validation.  26.5K build entries as balanced dataset.  J48 performed the best, 64% accuracy, 69.50% precision & 50% recall.

9. TAKE-HOME MESSAGES  Automated builds might influence manual code reviews since they interleave each other in the modern pull-based development  Passed builds more associated with review participations, and with new code reviews.  Frequently built projects received more review comments than less frequently built ones.  Code review activities are steady over time with frequently built projects. Not true for the counterparts.  Our prediction model can predict whether a build will trigger new code review or not. 9

10. THANK YOU!! QUESTIONS? 10 Email: chanchal.roy@usask.ca or masud.rahman@usask.ca

MSR2017-Challenge

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Similar to MSR2017-Challenge

Similar to MSR2017-Challenge (20)

More from Masud Rahman

More from Masud Rahman (20)

Recently uploaded

Recently uploaded (20)

MSR2017-Challenge