SlideShare a Scribd company logo
1 of 56
Download to read offline
FROM BUGS TO
DECISION SUPPORT
- Selected Research Highlights
Markus Borg
SSE Meeting Nov 10, 2015
Per Runeson
PHD STUDENT POSITION
”Aligning RE and V&V”
Bjarnason, Runeson, Borg, et al. Challenges and Practices in Aligning Requirements with Verification
and Validation: A Case Study of Six Companies, Empirical Software Engineering, 19(6), 2014.
EMSE
14
”LARGE-SCALE REQUIREMENTS TO
TEST LINKING”
• Traceability recovery
• Establish trace links in system after-the-fact
• ”if two artifacts share much text, they are more
likely to be associated by a link”
- Giuliano Antoniol (2002)
TRACEABILITY RECOVERY TOOLS
LUND CONTRIBUTIONS
• Johan Natt och Dag’s PhD Thesis
• ”Textual approach” to find similar requirements
• Tool: ReqSimile
• A systematic review of traceability recovery:
• Overview of techniques, evaluations, and results
• https://sites.google.com/site/tracerepo/
Issue
reports
Represent
Calculate angles in vector
space
TEXTUAL SIMILARITY FROM THE 60S
PREVIOUS FOCUS ON REQTS.
Borg, Runeson, and Ardö. Recovering from a Decade: A Systematic Mapping of Information
Retrieval Approaches to Software Traceability, Empirical Software Engineering, 19(6), 2014.
EMSE
14
Licentiate Thesis 2012
ISSUE REPORTS AS HUBS
Issue tracker
ISSUE MANAGEMENT MODEL
Change control board
Developers
Resolution
New issue
Q1. Has this issue been
reported before?
Change control board
Developers
Resolution
New issue
Q1. Has this issue been
reported before?
Q2. Who should investigate
the issue?
Change control board
Developers
Resolution
New issue
Q1. Has this issue been
reported before?
Q3. Will this issue result in
a code change?
Q2. Who should investigate
the issue?
Change control board
Developers
Resolution
New issue
Q4. How long time will it take
to correct this bug?
Q1. Has this issue been
reported before?
Q3. Will this issue result in
a code change?
Q2. Who should investigate
the issue?
Change control board
Developers
Resolution
New issue
Q4. How long time will it take
to correct this bug?
Q1. Has this issue been
reported before?
Q3. Will this issue result in
a code change?
Q2. Who should investigate
the issue?
Q5. If we make a corrective code
change, what is the impact?
Change control board
Developers
Resolution
New issue
Q1. HAS THIS ISSUE BEEN
REPORTED BEFORE?
ISSUE DUPLICATE DETECTION
Merge
FINDING TEXTUAL DUPLICATES
• Per’s second most cited paper (ICSE’07 – 278 cit.)
• Applied ReqSimile approach to issue reports
• Evaluated at Sony Ericsson Mobile Communications
?
1.
2.
3.
4.
5.
CONTINUED PER’S WORK
• Now standard feature of issue trackers
• Bugzilla, HP Quality Center, JIRA
• Replication
• Apache Lucene search engine library
• Issue reports from Android
• Fundamental for textual analysis in later thesis work
• Good at finding similar issue report
• Highly scalable solution – fast!
Borg, Runeson, Johansson, and Mäntylä. A Replicated Study on Duplicate Detection: Using
Apache Lucene to Search among Android Defects, In Proc. of the 8th Int’l Symp. On Empirical
Software Engineering and Measurement (ESEM), 2014.
ESEM
14
NETWORK ANALYSIS OF ISSUE
REPORTS
Borg, Pfahl, and Runeson. Analyzing Networks of Issue Reports, In Proc. of the 17th
European Conf. on Software Maintenance and Reengineering (CSMR), 2013.
CSMR
13
NETWORKS ARE POWERFUL!
Fundamental for
artifact ranking in
later thesis work
Q2. WHO SHOULD
INVESTIGATE THE ISSUE?
Q1. Has this issue been
reported before?
Q2. Who should investigate
the issue?
Change control board
Developers
Resolution
New issue
AUTOMATED ISSUE
ASSIGNMENT
• Supervised machine learning
• Train on historical bugs
Issue tracker
FEATURE SELECTION
• How to represent an issue report?
COLLABORATION WITH ERICSSON
• Ensemble learners for team
assignment
• Features
• 100 dominant terms
• Development site
• Submitter type
• System version
• Priority
Leif Jonsson
EXPERIMENTAL SETUP
Ericsson
Company A
4 x
Jonsson, Borg, Broman et al. Automated Bug Assignment: Ensemble-based Machine
Learning in Large-scale Industrial Contexts, Empirical Software Engineering, 2015.
EMSE
15
> 50,000
RESULTS
• Prediction accuracy in line with humans
• But instantaneous!
• At least 2.000 bug reports in the training set
A WORD OF WARNING…
• Some systems need fresh training data
Q3. WILL THIS ISSUE RESULT
IN A CODE CHANGE?
Q1. Has this issue been
reported before?
Q3. Will this issue result in
a code change?
Q2. Who should investigate
the issue?
Change control board
Developers
Resolution
New issue
MSC THESIS AT SONY
• Limited resources to deal with issues
• 1,000s of bugs reported in projects
• Resources to resolve roughly 10%
• Will the bug cause a code change?
HIGHEST PREDICTIVE POWER?
• Machine learning to find
patterns
• Empirical validation of
previous suspicions
1. mastership
2. fix for
3. ratl mastership
4. externalsupplier
5. ratl keysite
6. project
7. proj id
8. attachment share saved
9. found during
10. business priority
11. found in product
12. found by
13. abc rank
14. detection
15. impact
16. occurrence
17. priority
18. is platform
19. qa state
Standard
priority far
down the list…• Prediction accuracy 75%
Gulin and Olofsson. Development of a Decision Support
System for Defect Reports, MSc Thesis, Lund
University, 2014.
MSc.
Thesis
14
Q4.
How long time will it take to
correct this bug?
Q4. How long time will it take
to correct this bug?
Q1. Has this issue been
reported before?
Q3. Will this issue result in
a code change?
Q2. Who should investigate
the issue?
Change control board
Developers
Resolution
New issue
UNSUPERVISED MACHINE LEARNING
Automatic
clustering of
issue reports
REPLICATION OF RAJA (2013)
• ”Textual clusters of issue reports
have significantly different
resolution times”
• Conceptual replication
• Fully automatic clustering
• Issue reports from large projects
• Confirmed statistical differences
OPERATIONALIZATION
1. Put new issue in the right cluster
2. Resolution time based on cluster avg.
Assar, Borg, and Pfahl. Using Text Clustering to Predict Defect Resolution Time: A Conceptual
Replication and an Evaluation of Prediction Accuracy, Empirical Software Engineering, 2015.
EMSE
15
Q5.
If we make a corrective change,
what is the impact?
Q4. How long time will it take
to correct this bug?
Q1. Has this issue been
reported before?
Q3. Will this issue result in
a code change?
Q2. Who should investigate
the issue?
Q5. If we make a corrective code
change, what is the impact?
Change control board
Developers
Resolution
New issue
De la Vara, Borg, Wnuk, and Moonen. Safety Evidence Change Impact Analysis in Practice,
In revision, Transactions on Software Engineering, 2015.
TSE
(In rev.)
RECOMMENDATION SYSTEM
• “a software application that aims to support users
in their decision-making while interacting with
large information spaces”
Decision-support system
TWO MAIN APPROACHES
RECOMMENDATIONS BASED ON
HISTORICAL IMPACT (NON-CODE)
Reqs.
Tests
Borg, Gotel, and Wnuk. Enabling Traceability Reuse for Impact Analyses: A Feasibility Study in
a Safety Context, In Proc. of the Int’l WS on Traceability in Emerging Forms of Software
Engineering (TEFSE), 2012.
EMSE
15
1. Mine issue tracker
2. Create network of
previous impact
3. Index text with
4. Calculate centrality measures
IDENTIFY POTENTIAL IMPACT
Find similar issues
using Apache Lucene
Design Doc. X.Y
Req. X.Y
Test case UTC56
Req. Z.Y
Design Doc. X.Y
Follow links to create
candidate impact set
RANK THE POTENTIAL IMPACT
Use centrality measures
to rank candidate impact
1. Requirement X.Y
2. Design Document X.Y
3. Test case UTC56
4. Design Document X.Y
5. Requirement Z.Y
Borg and Runeson. Changes, Evolution and Bugs – Recommendation Systems for Issue
Management, In Recommendation Systems in Software Engineering, Robillard, Maalej,
Walker, and Zimmermann (Ed.), 2013.
RSSE
book
ImpRec
https://github.com/mrksbrg/ImpRec
DELIVER THE RECOMMENDATIONS
TWO UNIT CASE STUDY
• Deployed ImpRec in two teams
• Malmö and Bangalore
• Increases awareness and findability
• Helps project newcomers
Borg, Wnuk, Regnell, and Runeson. Supporting Change Impact Analysis Using a
Recommendation System: An Industrial Case Study in a Safety-Critical Context, In
submission, 2015.
In
subm.
PARAMETER TUNING
• Finding a feasible parameter
setting is difficult
• Presented framework to do it in R
• Factorial designs
• Response surface methodology
Borg. TuneR: A Framework for Tuning Software Engineering Tools with Hands-On
Instructions in R, In revision, Journal of Software: Evolution and Process, 2015.
JSEP
(in Rev)
SUMMARY
Bug tracker
Machine
Learning
“Humans obscured by bug overload, but
machine learning benefits from plentiful
training data. Practitioners confirm value of
developed tools.”
Tiny Transactions on Computer Science
(@TinyToCS), Volume 3, 2015
Bug tracker
Machine
Learning
Thank you!
markus.borg@sics.se
mrksbrg.com
@mrksbrg

More Related Content

What's hot

Taylor: Estimating uncertainty for continental scale measurements.
Taylor:  Estimating uncertainty for continental scale measurements.Taylor:  Estimating uncertainty for continental scale measurements.
Taylor: Estimating uncertainty for continental scale measurements.questRCN
 
Tim Menzies, directions in Data Science
Tim Menzies, directions in Data ScienceTim Menzies, directions in Data Science
Tim Menzies, directions in Data ScienceCS, NcState
 
Metadata Analyser: measuring metadata quality
Metadata Analyser: measuring metadata qualityMetadata Analyser: measuring metadata quality
Metadata Analyser: measuring metadata qualityFrancisco Couto
 
Research software susainability
Research software susainabilityResearch software susainability
Research software susainabilityDaniel S. Katz
 
EASE 2019 keynote
EASE 2019 keynoteEASE 2019 keynote
EASE 2019 keynotePer Runeson
 
Machine Learning Methods for Parameter Acquisition in a Human ...
Machine Learning Methods for Parameter Acquisition in a Human ...Machine Learning Methods for Parameter Acquisition in a Human ...
Machine Learning Methods for Parameter Acquisition in a Human ...butest
 
Instance Space Analysis for Search Based Software Engineering
Instance Space Analysis for Search Based Software EngineeringInstance Space Analysis for Search Based Software Engineering
Instance Space Analysis for Search Based Software EngineeringAldeida Aleti
 
Interactive fault localization leveraging simple user feedback - by Liang Gong
Interactive fault localization leveraging simple user feedback - by Liang GongInteractive fault localization leveraging simple user feedback - by Liang Gong
Interactive fault localization leveraging simple user feedback - by Liang GongLiang Gong
 
Customer Success Story: IEEE Xplore Inspires Innovation
Customer Success Story: IEEE Xplore Inspires InnovationCustomer Success Story: IEEE Xplore Inspires Innovation
Customer Success Story: IEEE Xplore Inspires InnovationIEEE Xplore Digital Library
 
ProspectusPresentationPrinterFriendly
ProspectusPresentationPrinterFriendlyProspectusPresentationPrinterFriendly
ProspectusPresentationPrinterFriendlymartijnetje
 
PhD Proposal talk
PhD Proposal talkPhD Proposal talk
PhD Proposal talkRay Buse
 
Search-based testing of procedural programs:iterative single-target or multi-...
Search-based testing of procedural programs:iterative single-target or multi-...Search-based testing of procedural programs:iterative single-target or multi-...
Search-based testing of procedural programs:iterative single-target or multi-...Vrije Universiteit Brussel
 
On Using Network Science in Mining Developers Collaboration in Software Engin...
On Using Network Science in Mining Developers Collaboration in Software Engin...On Using Network Science in Mining Developers Collaboration in Software Engin...
On Using Network Science in Mining Developers Collaboration in Software Engin...IJDKP
 
Histolab: an Open Source Python Library for Reproducible Digital Pathology
Histolab: an Open Source Python Library for Reproducible Digital PathologyHistolab: an Open Source Python Library for Reproducible Digital Pathology
Histolab: an Open Source Python Library for Reproducible Digital PathologyAlessia Marcolini
 
CASA: Context-Aware Scalable Authentication, at SOUPS 2013
CASA: Context-Aware Scalable Authentication, at SOUPS 2013CASA: Context-Aware Scalable Authentication, at SOUPS 2013
CASA: Context-Aware Scalable Authentication, at SOUPS 2013Jason Hong
 
CASA: Context Aware Scalable Authentication, at SOUPS 2013
CASA: Context Aware Scalable Authentication, at SOUPS 2013CASA: Context Aware Scalable Authentication, at SOUPS 2013
CASA: Context Aware Scalable Authentication, at SOUPS 2013Jason Hong
 
A PARTICLE SWARM OPTIMIZATION TECHNIQUE FOR GENERATING PAIRWISE TEST CASES
A PARTICLE SWARM OPTIMIZATION TECHNIQUE FOR GENERATING PAIRWISE TEST CASESA PARTICLE SWARM OPTIMIZATION TECHNIQUE FOR GENERATING PAIRWISE TEST CASES
A PARTICLE SWARM OPTIMIZATION TECHNIQUE FOR GENERATING PAIRWISE TEST CASESKula Sekhar Reddy Yerraguntla
 

What's hot (20)

Taylor: Estimating uncertainty for continental scale measurements.
Taylor:  Estimating uncertainty for continental scale measurements.Taylor:  Estimating uncertainty for continental scale measurements.
Taylor: Estimating uncertainty for continental scale measurements.
 
Tim Menzies, directions in Data Science
Tim Menzies, directions in Data ScienceTim Menzies, directions in Data Science
Tim Menzies, directions in Data Science
 
Metadata Analyser: measuring metadata quality
Metadata Analyser: measuring metadata qualityMetadata Analyser: measuring metadata quality
Metadata Analyser: measuring metadata quality
 
Research software susainability
Research software susainabilityResearch software susainability
Research software susainability
 
AS_industrialNew
AS_industrialNewAS_industrialNew
AS_industrialNew
 
EASE 2019 keynote
EASE 2019 keynoteEASE 2019 keynote
EASE 2019 keynote
 
Machine Learning Methods for Parameter Acquisition in a Human ...
Machine Learning Methods for Parameter Acquisition in a Human ...Machine Learning Methods for Parameter Acquisition in a Human ...
Machine Learning Methods for Parameter Acquisition in a Human ...
 
Instance Space Analysis for Search Based Software Engineering
Instance Space Analysis for Search Based Software EngineeringInstance Space Analysis for Search Based Software Engineering
Instance Space Analysis for Search Based Software Engineering
 
PTW_CV
PTW_CVPTW_CV
PTW_CV
 
Interactive fault localization leveraging simple user feedback - by Liang Gong
Interactive fault localization leveraging simple user feedback - by Liang GongInteractive fault localization leveraging simple user feedback - by Liang Gong
Interactive fault localization leveraging simple user feedback - by Liang Gong
 
Customer Success Story: IEEE Xplore Inspires Innovation
Customer Success Story: IEEE Xplore Inspires InnovationCustomer Success Story: IEEE Xplore Inspires Innovation
Customer Success Story: IEEE Xplore Inspires Innovation
 
ProspectusPresentationPrinterFriendly
ProspectusPresentationPrinterFriendlyProspectusPresentationPrinterFriendly
ProspectusPresentationPrinterFriendly
 
PhD Proposal talk
PhD Proposal talkPhD Proposal talk
PhD Proposal talk
 
Search-based testing of procedural programs:iterative single-target or multi-...
Search-based testing of procedural programs:iterative single-target or multi-...Search-based testing of procedural programs:iterative single-target or multi-...
Search-based testing of procedural programs:iterative single-target or multi-...
 
On Using Network Science in Mining Developers Collaboration in Software Engin...
On Using Network Science in Mining Developers Collaboration in Software Engin...On Using Network Science in Mining Developers Collaboration in Software Engin...
On Using Network Science in Mining Developers Collaboration in Software Engin...
 
Histolab: an Open Source Python Library for Reproducible Digital Pathology
Histolab: an Open Source Python Library for Reproducible Digital PathologyHistolab: an Open Source Python Library for Reproducible Digital Pathology
Histolab: an Open Source Python Library for Reproducible Digital Pathology
 
CASA: Context-Aware Scalable Authentication, at SOUPS 2013
CASA: Context-Aware Scalable Authentication, at SOUPS 2013CASA: Context-Aware Scalable Authentication, at SOUPS 2013
CASA: Context-Aware Scalable Authentication, at SOUPS 2013
 
Wcre13b.ppt
Wcre13b.pptWcre13b.ppt
Wcre13b.ppt
 
CASA: Context Aware Scalable Authentication, at SOUPS 2013
CASA: Context Aware Scalable Authentication, at SOUPS 2013CASA: Context Aware Scalable Authentication, at SOUPS 2013
CASA: Context Aware Scalable Authentication, at SOUPS 2013
 
A PARTICLE SWARM OPTIMIZATION TECHNIQUE FOR GENERATING PAIRWISE TEST CASES
A PARTICLE SWARM OPTIMIZATION TECHNIQUE FOR GENERATING PAIRWISE TEST CASESA PARTICLE SWARM OPTIMIZATION TECHNIQUE FOR GENERATING PAIRWISE TEST CASES
A PARTICLE SWARM OPTIMIZATION TECHNIQUE FOR GENERATING PAIRWISE TEST CASES
 

Similar to From Bugs to Decision Support - Selected Research Highlights

Automation in the Bug Flow - Machine Learning for Triaging and Tracing
Automation in the Bug Flow - Machine Learning for Triaging and TracingAutomation in the Bug Flow - Machine Learning for Triaging and Tracing
Automation in the Bug Flow - Machine Learning for Triaging and TracingMarkus Borg
 
Software Engineering Research: Leading a Double-Agent Life.
Software Engineering Research: Leading a Double-Agent Life.Software Engineering Research: Leading a Double-Agent Life.
Software Engineering Research: Leading a Double-Agent Life.Lionel Briand
 
Proto Spiral.ppt Proto Spiral.ppt Proto Spiral.ppt Proto Spiral.ppt
Proto Spiral.ppt Proto Spiral.ppt Proto Spiral.ppt Proto Spiral.pptProto Spiral.ppt Proto Spiral.ppt Proto Spiral.ppt Proto Spiral.ppt
Proto Spiral.ppt Proto Spiral.ppt Proto Spiral.ppt Proto Spiral.pptAnirbanBhar3
 
Replication and Benchmarking in Software Analytics
Replication and Benchmarking in Software AnalyticsReplication and Benchmarking in Software Analytics
Replication and Benchmarking in Software AnalyticsUniversity of Zurich
 
Supporting Change Impact Analysis Using a Recommendation System - An Industri...
Supporting Change Impact Analysis Using a Recommendation System - An Industri...Supporting Change Impact Analysis Using a Recommendation System - An Industri...
Supporting Change Impact Analysis Using a Recommendation System - An Industri...Markus Borg
 
Software Mining and Software Datasets
Software Mining and Software DatasetsSoftware Mining and Software Datasets
Software Mining and Software DatasetsTao Xie
 
ExaLearn Overview - ECP Co-Design Center for Machine Learning
ExaLearn Overview - ECP Co-Design Center for Machine LearningExaLearn Overview - ECP Co-Design Center for Machine Learning
ExaLearn Overview - ECP Co-Design Center for Machine Learninginside-BigData.com
 
Collaborative Data Analysis with Taverna Workflows
Collaborative Data Analysis with Taverna WorkflowsCollaborative Data Analysis with Taverna Workflows
Collaborative Data Analysis with Taverna WorkflowsAndrea Wiggins
 
Big Data: the weakest link
Big Data: the weakest linkBig Data: the weakest link
Big Data: the weakest linkCS, NcState
 
Lionel Briand ICSM 2011 Keynote
Lionel Briand ICSM 2011 KeynoteLionel Briand ICSM 2011 Keynote
Lionel Briand ICSM 2011 KeynoteICSM 2011
 
Analyzing Big Data's Weakest Link (hint: it might be you)
Analyzing Big Data's Weakest Link  (hint: it might be you)Analyzing Big Data's Weakest Link  (hint: it might be you)
Analyzing Big Data's Weakest Link (hint: it might be you)HPCC Systems
 
Bug Triage: An Automated Process
Bug Triage: An Automated ProcessBug Triage: An Automated Process
Bug Triage: An Automated ProcessIRJET Journal
 
Software tools to facilitate materials science research
Software tools to facilitate materials science researchSoftware tools to facilitate materials science research
Software tools to facilitate materials science researchAnubhav Jain
 
An Empirical Study on the Adequacy of Testing in Open Source Projects
An Empirical Study on the Adequacy of Testing in Open Source ProjectsAn Empirical Study on the Adequacy of Testing in Open Source Projects
An Empirical Study on the Adequacy of Testing in Open Source ProjectsPavneet Singh Kochhar
 

Similar to From Bugs to Decision Support - Selected Research Highlights (20)

Automation in the Bug Flow - Machine Learning for Triaging and Tracing
Automation in the Bug Flow - Machine Learning for Triaging and TracingAutomation in the Bug Flow - Machine Learning for Triaging and Tracing
Automation in the Bug Flow - Machine Learning for Triaging and Tracing
 
Software Engineering Research: Leading a Double-Agent Life.
Software Engineering Research: Leading a Double-Agent Life.Software Engineering Research: Leading a Double-Agent Life.
Software Engineering Research: Leading a Double-Agent Life.
 
Proto Spiral.ppt Proto Spiral.ppt Proto Spiral.ppt Proto Spiral.ppt
Proto Spiral.ppt Proto Spiral.ppt Proto Spiral.ppt Proto Spiral.pptProto Spiral.ppt Proto Spiral.ppt Proto Spiral.ppt Proto Spiral.ppt
Proto Spiral.ppt Proto Spiral.ppt Proto Spiral.ppt Proto Spiral.ppt
 
Replication and Benchmarking in Software Analytics
Replication and Benchmarking in Software AnalyticsReplication and Benchmarking in Software Analytics
Replication and Benchmarking in Software Analytics
 
Duc le CV
Duc le CVDuc le CV
Duc le CV
 
Software bug prediction
Software bug prediction Software bug prediction
Software bug prediction
 
Supporting Change Impact Analysis Using a Recommendation System - An Industri...
Supporting Change Impact Analysis Using a Recommendation System - An Industri...Supporting Change Impact Analysis Using a Recommendation System - An Industri...
Supporting Change Impact Analysis Using a Recommendation System - An Industri...
 
Software Mining and Software Datasets
Software Mining and Software DatasetsSoftware Mining and Software Datasets
Software Mining and Software Datasets
 
ExaLearn Overview - ECP Co-Design Center for Machine Learning
ExaLearn Overview - ECP Co-Design Center for Machine LearningExaLearn Overview - ECP Co-Design Center for Machine Learning
ExaLearn Overview - ECP Co-Design Center for Machine Learning
 
Collaborative Data Analysis with Taverna Workflows
Collaborative Data Analysis with Taverna WorkflowsCollaborative Data Analysis with Taverna Workflows
Collaborative Data Analysis with Taverna Workflows
 
Big Data: the weakest link
Big Data: the weakest linkBig Data: the weakest link
Big Data: the weakest link
 
Lionel Briand ICSM 2011 Keynote
Lionel Briand ICSM 2011 KeynoteLionel Briand ICSM 2011 Keynote
Lionel Briand ICSM 2011 Keynote
 
bonino
boninobonino
bonino
 
Lopez
LopezLopez
Lopez
 
Analyzing Big Data's Weakest Link (hint: it might be you)
Analyzing Big Data's Weakest Link  (hint: it might be you)Analyzing Big Data's Weakest Link  (hint: it might be you)
Analyzing Big Data's Weakest Link (hint: it might be you)
 
CS846_report_akshat_kumar
CS846_report_akshat_kumarCS846_report_akshat_kumar
CS846_report_akshat_kumar
 
Bug Triage: An Automated Process
Bug Triage: An Automated ProcessBug Triage: An Automated Process
Bug Triage: An Automated Process
 
Paper review
Paper reviewPaper review
Paper review
 
Software tools to facilitate materials science research
Software tools to facilitate materials science researchSoftware tools to facilitate materials science research
Software tools to facilitate materials science research
 
An Empirical Study on the Adequacy of Testing in Open Source Projects
An Empirical Study on the Adequacy of Testing in Open Source ProjectsAn Empirical Study on the Adequacy of Testing in Open Source Projects
An Empirical Study on the Adequacy of Testing in Open Source Projects
 

More from Markus Borg

Agility in Software 2.0 - Notebook Interfaces and MLOps with Buttresses and R...
Agility in Software 2.0 - Notebook Interfaces and MLOps with Buttresses and R...Agility in Software 2.0 - Notebook Interfaces and MLOps with Buttresses and R...
Agility in Software 2.0 - Notebook Interfaces and MLOps with Buttresses and R...Markus Borg
 
Quality Assurance Of Generative Dialog Models in an evolving Conversationa...
Quality Assurance  Of  Generative Dialog Models in an evolving  Conversationa...Quality Assurance  Of  Generative Dialog Models in an evolving  Conversationa...
Quality Assurance Of Generative Dialog Models in an evolving Conversationa...Markus Borg
 
Test Automation with Grad-CAM Heatmaps - A Future Pipe Segment in MLOps for V...
Test Automation with Grad-CAM Heatmaps - A Future Pipe Segment in MLOps for V...Test Automation with Grad-CAM Heatmaps - A Future Pipe Segment in MLOps for V...
Test Automation with Grad-CAM Heatmaps - A Future Pipe Segment in MLOps for V...Markus Borg
 
Digital Twins Are Not Monozygotic - Cross-Replicating ADAS Testing in Two Ind...
Digital Twins Are Not Monozygotic - Cross-Replicating ADAS Testing in Two Ind...Digital Twins Are Not Monozygotic - Cross-Replicating ADAS Testing in Two Ind...
Digital Twins Are Not Monozygotic - Cross-Replicating ADAS Testing in Two Ind...Markus Borg
 
Illuminating a Blind Spot in Digitalization - Software Development in Sweden’...
Illuminating a Blind Spot in Digitalization - Software Development in Sweden’...Illuminating a Blind Spot in Digitalization - Software Development in Sweden’...
Illuminating a Blind Spot in Digitalization - Software Development in Sweden’...Markus Borg
 
Trained, Not Coded - Still Safe?
Trained, Not Coded - Still Safe?Trained, Not Coded - Still Safe?
Trained, Not Coded - Still Safe?Markus Borg
 
SZZ Unleashed: An Open Implementation of the SZZ Algorithm
SZZ Unleashed:  An Open Implementation of the SZZ AlgorithmSZZ Unleashed:  An Open Implementation of the SZZ Algorithm
SZZ Unleashed: An Open Implementation of the SZZ AlgorithmMarkus Borg
 
Explainability First! Cousteauing the Depths of Neural Networks
Explainability First! Cousteauing the Depths of Neural NetworksExplainability First! Cousteauing the Depths of Neural Networks
Explainability First! Cousteauing the Depths of Neural NetworksMarkus Borg
 
Test Automation Research... Is That Really Needed in 2018?
Test Automation Research... Is That Really Needed in 2018?Test Automation Research... Is That Really Needed in 2018?
Test Automation Research... Is That Really Needed in 2018?Markus Borg
 
Component Source Origin Decisions in Practice - A Survey of Decision Making i...
Component Source Origin Decisions in Practice - A Survey of Decision Making i...Component Source Origin Decisions in Practice - A Survey of Decision Making i...
Component Source Origin Decisions in Practice - A Survey of Decision Making i...Markus Borg
 
Enabling Visual Analytics with Unity - Exploring Regression Test Results in A...
Enabling Visual Analytics with Unity - Exploring Regression Test Results in A...Enabling Visual Analytics with Unity - Exploring Regression Test Results in A...
Enabling Visual Analytics with Unity - Exploring Regression Test Results in A...Markus Borg
 
Testing Quality Requirements of a System-of-Systems in the Public Sector - Ch...
Testing Quality Requirements of a System-of-Systems in the Public Sector - Ch...Testing Quality Requirements of a System-of-Systems in the Public Sector - Ch...
Testing Quality Requirements of a System-of-Systems in the Public Sector - Ch...Markus Borg
 
Comparing Cousins – A Harmonized Analysis of Racket Sport Set Scores using Ra...
Comparing Cousins – A Harmonized Analysis of Racket Sport Set Scores using Ra...Comparing Cousins – A Harmonized Analysis of Racket Sport Set Scores using Ra...
Comparing Cousins – A Harmonized Analysis of Racket Sport Set Scores using Ra...Markus Borg
 
Revisiting the Challenges in Aligning RE and V&V: Experiences from the Public...
Revisiting the Challenges in Aligning RE and V&V: Experiences from the Public...Revisiting the Challenges in Aligning RE and V&V: Experiences from the Public...
Revisiting the Challenges in Aligning RE and V&V: Experiences from the Public...Markus Borg
 
Enabling Traceability Reuse for Impact Analyses - Toward a Recommendation Sys...
Enabling Traceability Reuse for Impact Analyses - Toward a Recommendation Sys...Enabling Traceability Reuse for Impact Analyses - Toward a Recommendation Sys...
Enabling Traceability Reuse for Impact Analyses - Toward a Recommendation Sys...Markus Borg
 
Analyzing networks of issue reports
Analyzing networks of issue reportsAnalyzing networks of issue reports
Analyzing networks of issue reportsMarkus Borg
 
Findability through Traceability - A Realistic Application of Candidate Tr...
Findability through Traceability  - A Realistic Application of Candidate Tr...Findability through Traceability  - A Realistic Application of Candidate Tr...
Findability through Traceability - A Realistic Application of Candidate Tr...Markus Borg
 
Recommendation Systems for Issue Management
Recommendation Systems for Issue ManagementRecommendation Systems for Issue Management
Recommendation Systems for Issue ManagementMarkus Borg
 

More from Markus Borg (18)

Agility in Software 2.0 - Notebook Interfaces and MLOps with Buttresses and R...
Agility in Software 2.0 - Notebook Interfaces and MLOps with Buttresses and R...Agility in Software 2.0 - Notebook Interfaces and MLOps with Buttresses and R...
Agility in Software 2.0 - Notebook Interfaces and MLOps with Buttresses and R...
 
Quality Assurance Of Generative Dialog Models in an evolving Conversationa...
Quality Assurance  Of  Generative Dialog Models in an evolving  Conversationa...Quality Assurance  Of  Generative Dialog Models in an evolving  Conversationa...
Quality Assurance Of Generative Dialog Models in an evolving Conversationa...
 
Test Automation with Grad-CAM Heatmaps - A Future Pipe Segment in MLOps for V...
Test Automation with Grad-CAM Heatmaps - A Future Pipe Segment in MLOps for V...Test Automation with Grad-CAM Heatmaps - A Future Pipe Segment in MLOps for V...
Test Automation with Grad-CAM Heatmaps - A Future Pipe Segment in MLOps for V...
 
Digital Twins Are Not Monozygotic - Cross-Replicating ADAS Testing in Two Ind...
Digital Twins Are Not Monozygotic - Cross-Replicating ADAS Testing in Two Ind...Digital Twins Are Not Monozygotic - Cross-Replicating ADAS Testing in Two Ind...
Digital Twins Are Not Monozygotic - Cross-Replicating ADAS Testing in Two Ind...
 
Illuminating a Blind Spot in Digitalization - Software Development in Sweden’...
Illuminating a Blind Spot in Digitalization - Software Development in Sweden’...Illuminating a Blind Spot in Digitalization - Software Development in Sweden’...
Illuminating a Blind Spot in Digitalization - Software Development in Sweden’...
 
Trained, Not Coded - Still Safe?
Trained, Not Coded - Still Safe?Trained, Not Coded - Still Safe?
Trained, Not Coded - Still Safe?
 
SZZ Unleashed: An Open Implementation of the SZZ Algorithm
SZZ Unleashed:  An Open Implementation of the SZZ AlgorithmSZZ Unleashed:  An Open Implementation of the SZZ Algorithm
SZZ Unleashed: An Open Implementation of the SZZ Algorithm
 
Explainability First! Cousteauing the Depths of Neural Networks
Explainability First! Cousteauing the Depths of Neural NetworksExplainability First! Cousteauing the Depths of Neural Networks
Explainability First! Cousteauing the Depths of Neural Networks
 
Test Automation Research... Is That Really Needed in 2018?
Test Automation Research... Is That Really Needed in 2018?Test Automation Research... Is That Really Needed in 2018?
Test Automation Research... Is That Really Needed in 2018?
 
Component Source Origin Decisions in Practice - A Survey of Decision Making i...
Component Source Origin Decisions in Practice - A Survey of Decision Making i...Component Source Origin Decisions in Practice - A Survey of Decision Making i...
Component Source Origin Decisions in Practice - A Survey of Decision Making i...
 
Enabling Visual Analytics with Unity - Exploring Regression Test Results in A...
Enabling Visual Analytics with Unity - Exploring Regression Test Results in A...Enabling Visual Analytics with Unity - Exploring Regression Test Results in A...
Enabling Visual Analytics with Unity - Exploring Regression Test Results in A...
 
Testing Quality Requirements of a System-of-Systems in the Public Sector - Ch...
Testing Quality Requirements of a System-of-Systems in the Public Sector - Ch...Testing Quality Requirements of a System-of-Systems in the Public Sector - Ch...
Testing Quality Requirements of a System-of-Systems in the Public Sector - Ch...
 
Comparing Cousins – A Harmonized Analysis of Racket Sport Set Scores using Ra...
Comparing Cousins – A Harmonized Analysis of Racket Sport Set Scores using Ra...Comparing Cousins – A Harmonized Analysis of Racket Sport Set Scores using Ra...
Comparing Cousins – A Harmonized Analysis of Racket Sport Set Scores using Ra...
 
Revisiting the Challenges in Aligning RE and V&V: Experiences from the Public...
Revisiting the Challenges in Aligning RE and V&V: Experiences from the Public...Revisiting the Challenges in Aligning RE and V&V: Experiences from the Public...
Revisiting the Challenges in Aligning RE and V&V: Experiences from the Public...
 
Enabling Traceability Reuse for Impact Analyses - Toward a Recommendation Sys...
Enabling Traceability Reuse for Impact Analyses - Toward a Recommendation Sys...Enabling Traceability Reuse for Impact Analyses - Toward a Recommendation Sys...
Enabling Traceability Reuse for Impact Analyses - Toward a Recommendation Sys...
 
Analyzing networks of issue reports
Analyzing networks of issue reportsAnalyzing networks of issue reports
Analyzing networks of issue reports
 
Findability through Traceability - A Realistic Application of Candidate Tr...
Findability through Traceability  - A Realistic Application of Candidate Tr...Findability through Traceability  - A Realistic Application of Candidate Tr...
Findability through Traceability - A Realistic Application of Candidate Tr...
 
Recommendation Systems for Issue Management
Recommendation Systems for Issue ManagementRecommendation Systems for Issue Management
Recommendation Systems for Issue Management
 

Recently uploaded

Manassas R - Parkside Middle School 🌎🏫
Manassas R - Parkside Middle School 🌎🏫Manassas R - Parkside Middle School 🌎🏫
Manassas R - Parkside Middle School 🌎🏫qfactory1
 
Analytical Profile of Coleus Forskohlii | Forskolin .pdf
Analytical Profile of Coleus Forskohlii | Forskolin .pdfAnalytical Profile of Coleus Forskohlii | Forskolin .pdf
Analytical Profile of Coleus Forskohlii | Forskolin .pdfSwapnil Therkar
 
Grafana in space: Monitoring Japan's SLIM moon lander in real time
Grafana in space: Monitoring Japan's SLIM moon lander  in real timeGrafana in space: Monitoring Japan's SLIM moon lander  in real time
Grafana in space: Monitoring Japan's SLIM moon lander in real timeSatoshi NAKAHIRA
 
Solution chemistry, Moral and Normal solutions
Solution chemistry, Moral and Normal solutionsSolution chemistry, Moral and Normal solutions
Solution chemistry, Moral and Normal solutionsHajira Mahmood
 
Module 4: Mendelian Genetics and Punnett Square
Module 4:  Mendelian Genetics and Punnett SquareModule 4:  Mendelian Genetics and Punnett Square
Module 4: Mendelian Genetics and Punnett SquareIsiahStephanRadaza
 
TOTAL CHOLESTEROL (lipid profile test).pptx
TOTAL CHOLESTEROL (lipid profile test).pptxTOTAL CHOLESTEROL (lipid profile test).pptx
TOTAL CHOLESTEROL (lipid profile test).pptxdharshini369nike
 
Scheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docxScheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docxyaramohamed343013
 
THE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptx
THE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptxTHE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptx
THE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptxNandakishor Bhaurao Deshmukh
 
Transposable elements in prokaryotes.ppt
Transposable elements in prokaryotes.pptTransposable elements in prokaryotes.ppt
Transposable elements in prokaryotes.pptArshadWarsi13
 
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.aasikanpl
 
Microphone- characteristics,carbon microphone, dynamic microphone.pptx
Microphone- characteristics,carbon microphone, dynamic microphone.pptxMicrophone- characteristics,carbon microphone, dynamic microphone.pptx
Microphone- characteristics,carbon microphone, dynamic microphone.pptxpriyankatabhane
 
Neurodevelopmental disorders according to the dsm 5 tr
Neurodevelopmental disorders according to the dsm 5 trNeurodevelopmental disorders according to the dsm 5 tr
Neurodevelopmental disorders according to the dsm 5 trssuser06f238
 
Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024AyushiRastogi48
 
Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...
Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...
Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...lizamodels9
 
Artificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PArtificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PPRINCE C P
 
‏‏VIRUS - 123455555555555555555555555555555555555555
‏‏VIRUS -  123455555555555555555555555555555555555555‏‏VIRUS -  123455555555555555555555555555555555555555
‏‏VIRUS - 123455555555555555555555555555555555555555kikilily0909
 
Recombinant DNA technology( Transgenic plant and animal)
Recombinant DNA technology( Transgenic plant and animal)Recombinant DNA technology( Transgenic plant and animal)
Recombinant DNA technology( Transgenic plant and animal)DHURKADEVIBASKAR
 
insect anatomy and insect body wall and their physiology
insect anatomy and insect body wall and their  physiologyinsect anatomy and insect body wall and their  physiology
insect anatomy and insect body wall and their physiologyDrAnita Sharma
 

Recently uploaded (20)

Manassas R - Parkside Middle School 🌎🏫
Manassas R - Parkside Middle School 🌎🏫Manassas R - Parkside Middle School 🌎🏫
Manassas R - Parkside Middle School 🌎🏫
 
Analytical Profile of Coleus Forskohlii | Forskolin .pdf
Analytical Profile of Coleus Forskohlii | Forskolin .pdfAnalytical Profile of Coleus Forskohlii | Forskolin .pdf
Analytical Profile of Coleus Forskohlii | Forskolin .pdf
 
Grafana in space: Monitoring Japan's SLIM moon lander in real time
Grafana in space: Monitoring Japan's SLIM moon lander  in real timeGrafana in space: Monitoring Japan's SLIM moon lander  in real time
Grafana in space: Monitoring Japan's SLIM moon lander in real time
 
Solution chemistry, Moral and Normal solutions
Solution chemistry, Moral and Normal solutionsSolution chemistry, Moral and Normal solutions
Solution chemistry, Moral and Normal solutions
 
Module 4: Mendelian Genetics and Punnett Square
Module 4:  Mendelian Genetics and Punnett SquareModule 4:  Mendelian Genetics and Punnett Square
Module 4: Mendelian Genetics and Punnett Square
 
TOTAL CHOLESTEROL (lipid profile test).pptx
TOTAL CHOLESTEROL (lipid profile test).pptxTOTAL CHOLESTEROL (lipid profile test).pptx
TOTAL CHOLESTEROL (lipid profile test).pptx
 
Volatile Oils Pharmacognosy And Phytochemistry -I
Volatile Oils Pharmacognosy And Phytochemistry -IVolatile Oils Pharmacognosy And Phytochemistry -I
Volatile Oils Pharmacognosy And Phytochemistry -I
 
Scheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docxScheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docx
 
THE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptx
THE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptxTHE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptx
THE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptx
 
Hot Sexy call girls in Moti Nagar,🔝 9953056974 🔝 escort Service
Hot Sexy call girls in  Moti Nagar,🔝 9953056974 🔝 escort ServiceHot Sexy call girls in  Moti Nagar,🔝 9953056974 🔝 escort Service
Hot Sexy call girls in Moti Nagar,🔝 9953056974 🔝 escort Service
 
Transposable elements in prokaryotes.ppt
Transposable elements in prokaryotes.pptTransposable elements in prokaryotes.ppt
Transposable elements in prokaryotes.ppt
 
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
 
Microphone- characteristics,carbon microphone, dynamic microphone.pptx
Microphone- characteristics,carbon microphone, dynamic microphone.pptxMicrophone- characteristics,carbon microphone, dynamic microphone.pptx
Microphone- characteristics,carbon microphone, dynamic microphone.pptx
 
Neurodevelopmental disorders according to the dsm 5 tr
Neurodevelopmental disorders according to the dsm 5 trNeurodevelopmental disorders according to the dsm 5 tr
Neurodevelopmental disorders according to the dsm 5 tr
 
Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024
 
Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...
Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...
Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...
 
Artificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PArtificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C P
 
‏‏VIRUS - 123455555555555555555555555555555555555555
‏‏VIRUS -  123455555555555555555555555555555555555555‏‏VIRUS -  123455555555555555555555555555555555555555
‏‏VIRUS - 123455555555555555555555555555555555555555
 
Recombinant DNA technology( Transgenic plant and animal)
Recombinant DNA technology( Transgenic plant and animal)Recombinant DNA technology( Transgenic plant and animal)
Recombinant DNA technology( Transgenic plant and animal)
 
insect anatomy and insect body wall and their physiology
insect anatomy and insect body wall and their  physiologyinsect anatomy and insect body wall and their  physiology
insect anatomy and insect body wall and their physiology
 

From Bugs to Decision Support - Selected Research Highlights

  • 1. FROM BUGS TO DECISION SUPPORT - Selected Research Highlights Markus Borg SSE Meeting Nov 10, 2015
  • 3. PHD STUDENT POSITION ”Aligning RE and V&V” Bjarnason, Runeson, Borg, et al. Challenges and Practices in Aligning Requirements with Verification and Validation: A Case Study of Six Companies, Empirical Software Engineering, 19(6), 2014. EMSE 14
  • 4. ”LARGE-SCALE REQUIREMENTS TO TEST LINKING” • Traceability recovery • Establish trace links in system after-the-fact • ”if two artifacts share much text, they are more likely to be associated by a link” - Giuliano Antoniol (2002)
  • 6. LUND CONTRIBUTIONS • Johan Natt och Dag’s PhD Thesis • ”Textual approach” to find similar requirements • Tool: ReqSimile • A systematic review of traceability recovery: • Overview of techniques, evaluations, and results • https://sites.google.com/site/tracerepo/
  • 7. Issue reports Represent Calculate angles in vector space TEXTUAL SIMILARITY FROM THE 60S
  • 8. PREVIOUS FOCUS ON REQTS. Borg, Runeson, and Ardö. Recovering from a Decade: A Systematic Mapping of Information Retrieval Approaches to Software Traceability, Empirical Software Engineering, 19(6), 2014. EMSE 14
  • 10.
  • 11. ISSUE REPORTS AS HUBS Issue tracker
  • 12. ISSUE MANAGEMENT MODEL Change control board Developers Resolution New issue
  • 13. Q1. Has this issue been reported before? Change control board Developers Resolution New issue
  • 14. Q1. Has this issue been reported before? Q2. Who should investigate the issue? Change control board Developers Resolution New issue
  • 15. Q1. Has this issue been reported before? Q3. Will this issue result in a code change? Q2. Who should investigate the issue? Change control board Developers Resolution New issue
  • 16. Q4. How long time will it take to correct this bug? Q1. Has this issue been reported before? Q3. Will this issue result in a code change? Q2. Who should investigate the issue? Change control board Developers Resolution New issue
  • 17. Q4. How long time will it take to correct this bug? Q1. Has this issue been reported before? Q3. Will this issue result in a code change? Q2. Who should investigate the issue? Q5. If we make a corrective code change, what is the impact? Change control board Developers Resolution New issue
  • 18. Q1. HAS THIS ISSUE BEEN REPORTED BEFORE?
  • 20. FINDING TEXTUAL DUPLICATES • Per’s second most cited paper (ICSE’07 – 278 cit.) • Applied ReqSimile approach to issue reports • Evaluated at Sony Ericsson Mobile Communications ? 1. 2. 3. 4. 5.
  • 21. CONTINUED PER’S WORK • Now standard feature of issue trackers • Bugzilla, HP Quality Center, JIRA • Replication • Apache Lucene search engine library • Issue reports from Android • Fundamental for textual analysis in later thesis work • Good at finding similar issue report • Highly scalable solution – fast! Borg, Runeson, Johansson, and Mäntylä. A Replicated Study on Duplicate Detection: Using Apache Lucene to Search among Android Defects, In Proc. of the 8th Int’l Symp. On Empirical Software Engineering and Measurement (ESEM), 2014. ESEM 14
  • 22. NETWORK ANALYSIS OF ISSUE REPORTS Borg, Pfahl, and Runeson. Analyzing Networks of Issue Reports, In Proc. of the 17th European Conf. on Software Maintenance and Reengineering (CSMR), 2013. CSMR 13
  • 23. NETWORKS ARE POWERFUL! Fundamental for artifact ranking in later thesis work
  • 25. Q1. Has this issue been reported before? Q2. Who should investigate the issue? Change control board Developers Resolution New issue
  • 26. AUTOMATED ISSUE ASSIGNMENT • Supervised machine learning • Train on historical bugs Issue tracker
  • 27. FEATURE SELECTION • How to represent an issue report?
  • 28. COLLABORATION WITH ERICSSON • Ensemble learners for team assignment • Features • 100 dominant terms • Development site • Submitter type • System version • Priority Leif Jonsson
  • 29. EXPERIMENTAL SETUP Ericsson Company A 4 x Jonsson, Borg, Broman et al. Automated Bug Assignment: Ensemble-based Machine Learning in Large-scale Industrial Contexts, Empirical Software Engineering, 2015. EMSE 15 > 50,000
  • 30. RESULTS • Prediction accuracy in line with humans • But instantaneous! • At least 2.000 bug reports in the training set
  • 31. A WORD OF WARNING… • Some systems need fresh training data
  • 32. Q3. WILL THIS ISSUE RESULT IN A CODE CHANGE?
  • 33. Q1. Has this issue been reported before? Q3. Will this issue result in a code change? Q2. Who should investigate the issue? Change control board Developers Resolution New issue
  • 34. MSC THESIS AT SONY • Limited resources to deal with issues • 1,000s of bugs reported in projects • Resources to resolve roughly 10% • Will the bug cause a code change?
  • 35. HIGHEST PREDICTIVE POWER? • Machine learning to find patterns • Empirical validation of previous suspicions 1. mastership 2. fix for 3. ratl mastership 4. externalsupplier 5. ratl keysite 6. project 7. proj id 8. attachment share saved 9. found during 10. business priority 11. found in product 12. found by 13. abc rank 14. detection 15. impact 16. occurrence 17. priority 18. is platform 19. qa state Standard priority far down the list…• Prediction accuracy 75% Gulin and Olofsson. Development of a Decision Support System for Defect Reports, MSc Thesis, Lund University, 2014. MSc. Thesis 14
  • 36. Q4. How long time will it take to correct this bug?
  • 37. Q4. How long time will it take to correct this bug? Q1. Has this issue been reported before? Q3. Will this issue result in a code change? Q2. Who should investigate the issue? Change control board Developers Resolution New issue
  • 39. REPLICATION OF RAJA (2013) • ”Textual clusters of issue reports have significantly different resolution times” • Conceptual replication • Fully automatic clustering • Issue reports from large projects • Confirmed statistical differences
  • 40. OPERATIONALIZATION 1. Put new issue in the right cluster 2. Resolution time based on cluster avg. Assar, Borg, and Pfahl. Using Text Clustering to Predict Defect Resolution Time: A Conceptual Replication and an Evaluation of Prediction Accuracy, Empirical Software Engineering, 2015. EMSE 15
  • 41. Q5. If we make a corrective change, what is the impact?
  • 42. Q4. How long time will it take to correct this bug? Q1. Has this issue been reported before? Q3. Will this issue result in a code change? Q2. Who should investigate the issue? Q5. If we make a corrective code change, what is the impact? Change control board Developers Resolution New issue
  • 43. De la Vara, Borg, Wnuk, and Moonen. Safety Evidence Change Impact Analysis in Practice, In revision, Transactions on Software Engineering, 2015. TSE (In rev.)
  • 44. RECOMMENDATION SYSTEM • “a software application that aims to support users in their decision-making while interacting with large information spaces” Decision-support system
  • 46. RECOMMENDATIONS BASED ON HISTORICAL IMPACT (NON-CODE) Reqs. Tests Borg, Gotel, and Wnuk. Enabling Traceability Reuse for Impact Analyses: A Feasibility Study in a Safety Context, In Proc. of the Int’l WS on Traceability in Emerging Forms of Software Engineering (TEFSE), 2012. EMSE 15
  • 47. 1. Mine issue tracker 2. Create network of previous impact 3. Index text with
  • 49. IDENTIFY POTENTIAL IMPACT Find similar issues using Apache Lucene Design Doc. X.Y Req. X.Y Test case UTC56 Req. Z.Y Design Doc. X.Y Follow links to create candidate impact set
  • 50. RANK THE POTENTIAL IMPACT Use centrality measures to rank candidate impact 1. Requirement X.Y 2. Design Document X.Y 3. Test case UTC56 4. Design Document X.Y 5. Requirement Z.Y
  • 51. Borg and Runeson. Changes, Evolution and Bugs – Recommendation Systems for Issue Management, In Recommendation Systems in Software Engineering, Robillard, Maalej, Walker, and Zimmermann (Ed.), 2013. RSSE book ImpRec https://github.com/mrksbrg/ImpRec DELIVER THE RECOMMENDATIONS
  • 52. TWO UNIT CASE STUDY • Deployed ImpRec in two teams • Malmö and Bangalore • Increases awareness and findability • Helps project newcomers Borg, Wnuk, Regnell, and Runeson. Supporting Change Impact Analysis Using a Recommendation System: An Industrial Case Study in a Safety-Critical Context, In submission, 2015. In subm.
  • 53. PARAMETER TUNING • Finding a feasible parameter setting is difficult • Presented framework to do it in R • Factorial designs • Response surface methodology Borg. TuneR: A Framework for Tuning Software Engineering Tools with Hands-On Instructions in R, In revision, Journal of Software: Evolution and Process, 2015. JSEP (in Rev)
  • 55. Bug tracker Machine Learning “Humans obscured by bug overload, but machine learning benefits from plentiful training data. Practitioners confirm value of developed tools.” Tiny Transactions on Computer Science (@TinyToCS), Volume 3, 2015