Recovering Commit Dependencies for Selective Code Integration in Software Product Lines

•Download as PPTX, PDF•

0 likes•426 views

Foutse Khomh

Technology

Features Code Changes

FA CA1

FB CB1

4

• If change CA1 implements FA and Feature
Code Missing
Changes Dependencies
change CB1 implements FB
FA CA1 CA2 CB1
• If a change CA2 is added to modify
FA and CA2 is dependent on CB1 FB CB1

CA1 CA1 CA2 CB1
Integrate FA

CB1
5
Integrate FB

Automated
Grouping ( during
Define Calibrate the Commit Integration)
Dissimilarity Metrics on Assignment
Metrics Prior Versions Algorithm
Developer Guided
Grouping ( during
Development)

8

Given two commits characterized by files, developers and change requests (CRs)

Metric Description
File Dependency Distance (FD) Captures source code dependencies
between files involved in two commits
File Association Distance (FA) Captures logical dependencies between
files involved in two commits

Developer Dissimilarity Distance (DD) Captures the working relation between
two developers submitting commits

CR Dependency Distance (CRD) Captures the dissimilarity between the
CRs implemented by two commits

9

$For each of the four metrics - b3 • Min_Threshold = Avg(a) b2 • Max_Threshold = Avg(bmin) a • Silhouette= Avg{(bmin-a)/max(bmin,a)} b1 A higher silhouette value is better 11$

• Apply the similarity metrics
in order of their precedence

• If no suitable group is found
for a commit, assign the
commit to a new group

Color > Shape

13

Groups commits incrementally
and uses developers’ feedback
to improve the grouping during
development

Both approaches follow the k-means clustering method which consists
in assigning each item to the cluster with the nearest mean.
15

We analyzed three major versions of a family of mobile
applications

16

• Validate the dissimilarity metrics
Can the proposed metrics be used to identify
commit dependencies ?
• Validate the grouping approaches
How efficient are our proposed grouping
approaches?
• Value for Developers
Can the proposed approaches identify commit
dependencies missed by developers ?

17

The four similarity metrics display good abilities in
grouping commits ( i.e. high silhouette values)
1 0.94 0.96 0.96

0.79
0.8 0.76
0.67 0.67
Silhouette Value

0.63
0.6 0.57
CRD
0.47 0.49
0.46
FA
0.4 DD
FD
0.2

0
Verion 1 Version 2 Version 3

CRD > FA > DD > FD
18

• Efficiency of the Grouping Approaches
– 82% of commit dependencies were recovered by
the automated grouping with a precision of 95%
– The accuracy of the developer-guided grouping
approach is 98%
– We observed that precision/recall improves with
longer history data
• Value for Developers
– Automated grouping and Developer-guided
grouping approaches were able to reduce
integration failures by 76% and 94% respectively
19

Viewers also liked

Do Faster Releases Improve Software Quality? Foutse Khomh

OralLaia Ramírez

Late Propagation in Software ClonesFoutse Khomh

How does Context affect the Distribution of Software Maintainability Metrics?Foutse Khomh

Robi activation-hamzaSouth Asian University

An Entropy Evaluation Approach for Triaging Field Crashes: A Case Study of Mo...Foutse Khomh

Online Journalism in BangladeshSouth Asian University

Computer1 test 2 prep: processing, software, storageCathy Bennett

CountryJhonar Apolitano

蓝天#52Shirley Lee

Materi statitiska smpEndi Sudrajad

daknetaditya127

Viewers also liked (12)

Do Faster Releases Improve Software Quality?

Oral

Late Propagation in Software Clones

How does Context affect the Distribution of Software Maintainability Metrics?

Robi activation-hamza

An Entropy Evaluation Approach for Triaging Field Crashes: A Case Study of Mo...

Online Journalism in Bangladesh

Computer1 test 2 prep: processing, software, storage

Country

蓝天#52

Materi statitiska smp

daknet

Similar to Recovering Commit Dependencies for Selective Code Integration in Software Product Lines

Icsm2012 selective codeintegrationSAIL_QU

Predicting Defects using Network Analysis on Dependency GraphsThomas Zimmermann

VbKuldeep Sharma

Postdoc Symposium - Abram HindleICSM 2011

Collaborate12 Fcerkrivera

Collaborate12 fcerkrivera

WWW Conference 2012 - Web-Engineering - CloudgeniusDr.-Ing. Michael Menzel

Keynote HotSWUp 2012Martin Pinzger

Capacity Planning and ModellingAnthony Dehnashi

Database Change Management | Change Manager 5.1 BetaMichael Findling

eArtius HMGE Algorithm Applied to Optimization Tasks with 10,000 Design Varia...eArtius, Inc.

Cloud Migration: Moving to the CloudDr.-Ing. Michael Menzel

CSMR06a.pptPtidej Team

Auto mapper publicOleksii Duhno

Design1deepinderbedi

Anish Karmakar S C ASOA Symposium

Lead Allocation System's Attribute Driven Design (ADD)Amin Bandeali

Framework Engineering 2.1YoungSu Son

Dollars and Dates are Killing AgileRally Software

Dollars and dates are killing agile finaldrewz lin

Similar to Recovering Commit Dependencies for Selective Code Integration in Software Product Lines (20)

Icsm2012 selective codeintegration

Predicting Defects using Network Analysis on Dependency Graphs

Postdoc Symposium - Abram Hindle

Collaborate12 Fce

Collaborate12 fce

WWW Conference 2012 - Web-Engineering - Cloudgenius

Keynote HotSWUp 2012

Capacity Planning and Modelling

Database Change Management | Change Manager 5.1 Beta

eArtius HMGE Algorithm Applied to Optimization Tasks with 10,000 Design Varia...

Cloud Migration: Moving to the Cloud

CSMR06a.ppt

Auto mapper public

Design1

Anish Karmakar S C A

Lead Allocation System's Attribute Driven Design (ADD)

Framework Engineering 2.1

Dollars and Dates are Killing Agile

Dollars and dates are killing agile final

Recently uploaded

2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong

Real Time Object Detection Using Open CVKhem

The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad

Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal

Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays

🐬 The future of MySQL is Postgres 🐘RTylerCroy

Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC

Artificial Intelligence: Facts and MythsJoaquim Jorge

Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer

The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los

Automating Google Workspace (GWS) & more with Apps Scriptwesley chun

08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls

Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko

Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer

Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies

Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer

Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo

Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2

Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun

[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745

Recently uploaded (20)

2024: Domino Containers - The Next Step. News from the Domino Container commu...

Real Time Object Detection Using Open CV

The Codex of Business Writing Software for Real-World Solutions 2.pptx

Understanding Discord NSFW Servers A Guide for Responsible Users.pdf

Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...

🐬 The future of MySQL is Postgres 🐘

Breaking the Kubernetes Kill Chain: Host Path Mount

Artificial Intelligence: Facts and Myths

Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024

The 7 Things I Know About Cyber Security After 25 Years | April 2024

Automating Google Workspace (GWS) & more with Apps Script

08448380779 Call Girls In Greater Kailash - I Women Seeking Men

Handwritten Text Recognition for manuscripts and early printed texts

Axa Assurance Maroc - Insurer Innovation Award 2024

Factors to Consider When Choosing Accounts Payable Services Providers.pptx

Tata AIG General Insurance Company - Insurer Innovation Award 2024

Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...

Exploring the Future Potential of AI-Enabled Smartphone Processors

Data Cloud, More than a CDP by Matt Robison

[2024]Digital Global Overview Report 2024 Meltwater.pdf

Recovering Commit Dependencies for Selective Code Integration in Software Product Lines

1. 1

2. Production 2

3. 3

4. Features Code Changes FA CA1 FB CB1 4

5. • If change CA1 implements FA and Feature Code Missing Changes Dependencies change CB1 implements FB FA CA1 CA2 CB1 • If a change CA2 is added to modify FA and CA2 is dependent on CB1 FB CB1 CA1 CA1 CA2 CB1 Integrate FA CB1 5 Integrate FB

6. CA1 CA2 CB1 6

7. Automated Grouping ( during Define Calibrate the Commit Integration) Dissimilarity Metrics on Assignment Metrics Prior Versions Algorithm Developer Guided Grouping ( during Development) 8

8. Given two commits characterized by files, developers and change requests (CRs) Metric Description File Dependency Distance (FD) Captures source code dependencies between files involved in two commits File Association Distance (FA) Captures logical dependencies between files involved in two commits Developer Dissimilarity Distance (DD) Captures the working relation between two developers submitting commits CR Dependency Distance (CRD) Captures the dissimilarity between the CRs implemented by two commits 9

9. Automated Grouping ( during Define Calibrate the Commit Integration) Dissimilarity Metrics on Assignment Metrics Prior Versions Algorithm Developer Guided Grouping ( during Development) 10

10. For each of the four metrics - b3 • Min_Threshold = Avg(a) b2 • Max_Threshold = Avg(bmin) a • Silhouette= Avg{(bmin-a)/max(bmin,a)} b1 A higher silhouette value is better 11

11. Automated Grouping ( during Define Calibrate the Commit Integration) Dissimilarity Metrics on Assignment Metrics Prior Versions Algorithm Developer Guided Grouping ( during Development) 12

12. • Apply the similarity metrics in order of their precedence • If no suitable group is found for a commit, assign the commit to a new group Color > Shape 13

13. Automated Grouping ( during Define Calibrate the Commit Integration) Dissimilarity Metrics on Assignment Metrics Prior Versions Algorithm Developer Guided Grouping ( during Development) 14

14. Groups commits incrementally and uses developers’ feedback to improve the grouping during development Both approaches follow the k-means clustering method which consists in assigning each item to the cluster with the nearest mean. 15

15. We analyzed three major versions of a family of mobile applications 16

16. • Validate the dissimilarity metrics Can the proposed metrics be used to identify commit dependencies ? • Validate the grouping approaches How efficient are our proposed grouping approaches? • Value for Developers Can the proposed approaches identify commit dependencies missed by developers ? 17

17. The four similarity metrics display good abilities in grouping commits ( i.e. high silhouette values) 1 0.94 0.96 0.96 0.79 0.8 0.76 0.67 0.67 Silhouette Value 0.63 0.6 0.57 CRD 0.47 0.49 0.46 FA 0.4 DD FD 0.2 0 Verion 1 Version 2 Version 3 CRD > FA > DD > FD 18

18. • Efficiency of the Grouping Approaches – 82% of commit dependencies were recovered by the automated grouping with a precision of 95% – The accuracy of the developer-guided grouping approach is 98% – We observed that precision/recall improves with longer history data • Value for Developers – Automated grouping and Developer-guided grouping approaches were able to reduce integration failures by 76% and 94% respectively 19

19. 20

Editor's Notes

Software products lines allow the development of similar products using common software components
Whenever modifications are performed by developers on the main branch integrators selectively propagate the modifications to the respective products by picking changes relevant for the specific products.
To ensure the success of these selective integration, development teams attempt to maintain clear mappings between code changes performed by developers and features from the products. However this mapping is not always maintains carefully, making this integration process very brittle.

Recovering Commit Dependencies for Selective Code Integration in Software Product Lines

Recommended

Recommended

More Related Content

Viewers also liked

Viewers also liked (12)

Similar to Recovering Commit Dependencies for Selective Code Integration in Software Product Lines

Similar to Recovering Commit Dependencies for Selective Code Integration in Software Product Lines (20)

Recently uploaded

Recently uploaded (20)

Recovering Commit Dependencies for Selective Code Integration in Software Product Lines

Editor's Notes