Predicting Defects Using Change Genealogies (ISSE 2013)


This document discusses using change genealogies, which model dependencies between code changes, to predict defects. It finds that models using change genealogy metrics outperform those based on code complexity or dependency networks alone, achieving better precision while maintaining close recall. Key metrics include network efficiency and relationships between changes and dependency types. The study confirms that code entities combining functionalities from multiple older changes are more defect-prone.


Predicting Defects Using Change Genealogies
Kim Herzig*, Sascha Just†, Andreas Rau†, Andreas Zeller†
* Microsoft Research, UK
† Saarland University, Germany

Prediction Models
• Goal: determine the likelihood of bugs in code entities
  - Quality assurance limited by time and money.
  - Can be helpful for project outsiders.

• Trained on “ground truth”
  - Known instances and their properties.
  - Idea: learning from past for future.

• Predicting / estimating defect likelihood of new, unknown code entities

Fine-Tuning Prediction Models
Machine Learner

Training Methods
Metrics (independent variables)
Prediction Target

(Social) Network Metrics
• Some participants are more active and central than others.
• Are these participants also more crucial?

Code Network Metrics

[2008] Zimmermann and Nagappan: “Predicting Defects using Network Analysis on Dependency Graphs”

• Code entities communicate with each other.
• Use call graph network to compute network metrics.
• Call graphs do not change significantly over time!
• Assumption: “Central binaries tend to be defect-prone”.

Change Network Metrics
Idea: Use dependencies between code changes
• Code changes depend on each other.
• Central code changes tend to be crucial.

Change Genealogies

• Assumption: “Code that is crucially changed tends to be defect-prone”.

Change Genealogies (in a nutshell)
[2013] Kim Herzig: “Mining and Untangling Change Genealogies” (PhD thesis)

Directed graph structure
Method level dependencies

Multi-dimensional (space & time)

Change Genealogy Metrics
• EGO network metrics
  - Measure the immediate impact of changes on other changes.

• GLOBAL network metrics
  - Express the long-term impact of changes on other changes.

• Considering the type of the change
  - E.g. adding a method definition, modifying a method call.

• Considering parent age
  - How old are the parent changes a change depends on?

Change genealogy metrics must be aggregated to source file level.
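
As a minimal sketch of the two steps named on this slide, the Python below computes one very simple ego-style metric (in-degree and out-degree per change) on a toy directed genealogy and then aggregates it to source file level. The change IDs, file names, and the choice of mean and max as aggregation functions are assumptions made for illustration; the slides do not state which aggregation was used.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical toy genealogy: each edge (a, b) means change a depends on
# the earlier change b. Change IDs and file names are made up.
EDGES = [("c2", "c1"), ("c3", "c1"), ("c3", "c2"), ("c4", "c3")]
CHANGE_TO_FILE = {"c1": "A.java", "c2": "A.java", "c3": "B.java", "c4": "B.java"}


def degree_metrics(edges):
    """Return {change: (in_degree, out_degree)} for a directed edge list."""
    in_deg = defaultdict(int)
    out_deg = defaultdict(int)
    nodes = set()
    for src, dst in edges:
        out_deg[src] += 1  # src depends on one more parent change
        in_deg[dst] += 1   # dst is depended upon by one more later change
        nodes.update((src, dst))
    return {n: (in_deg[n], out_deg[n]) for n in nodes}


def aggregate_to_files(per_change, change_to_file):
    """Aggregate per-change metrics to file level (assumed: mean and max)."""
    grouped = defaultdict(list)
    for change, metrics in per_change.items():
        grouped[change_to_file[change]].append(metrics)
    result = {}
    for file_name, rows in grouped.items():
        ins = [r[0] for r in rows]
        outs = [r[1] for r in rows]
        result[file_name] = {
            "mean_in": mean(ins), "max_in": max(ins),
            "mean_out": mean(outs), "max_out": max(outs),
        }
    return result


FILE_METRICS = aggregate_to_files(degree_metrics(EDGES), CHANGE_TO_FILE)
```

Each file then carries one row of metric values, which is the shape a file-level defect predictor expects as input.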

Experimental Setup
Comparing change genealogies against:
• Code complexity models (e.g. McCabe)
• Code dependency models (Zimmermann & Nagappan)
• Combined network models (change genealogy & code dependency network metrics)

Experimental Setup

Study subjects

Multiple machine learners

Prediction Precision

NM & CGM

Change genealogy metrics
Code dependency network metrics (Zimmermann & Nagappan)
Code complexity metrics

• Confirmed: network metrics outperform complexity metrics.
• Change genealogy models report fewer false positives (higher precision).
• Change genealogy models report slightly more false negatives (lower recall).
• Combining network metrics: good recall but worse precision.
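
The link between false positives and precision, and between false negatives and recall, can be made concrete with a short sketch. The labels below are invented purely for illustration and are not results from the study.

```python
def precision_recall(actual, predicted):
    """Compute (precision, recall) for binary defect labels.

    actual[i] is truthy if file i really had a defect,
    predicted[i] is truthy if the model flagged file i as defect-prone.
    """
    pairs = list(zip(actual, predicted))
    tp = sum(1 for a, p in pairs if a and p)        # correctly flagged
    fp = sum(1 for a, p in pairs if not a and p)    # false alarms
    fn = sum(1 for a, p in pairs if a and not p)    # missed defects
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return precision, recall


# Hypothetical labels for eight files (not data from the slides).
ACTUAL = [1, 1, 1, 0, 0, 0, 1, 0]
PREDICTED = [1, 1, 0, 0, 0, 1, 0, 0]
PRECISION, RECALL = precision_recall(ACTUAL, PREDICTED)
```

Fewer false positives raise precision (the denominator `tp + fp` shrinks), while more false negatives lower recall (the denominator `tp + fn` grows).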

Influential Metrics
• Network efficiency is among the top 10 most influential metrics.
• Metrics relating changes to the type of dependency are the top 2 metrics (for all projects).
• The higher the number of old parents, the higher the probability of adding bugs.
  - Code entities combining multiple older functionalities are more defect-prone.
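
One way to picture a "number of old parents" metric is sketched below. The commit dates, the parent lists, and the 180-day cut-off for calling a parent "old" are all hypothetical; the slides do not define how parent age was measured or thresholded.

```python
from datetime import date

# Hypothetical commit dates and parent dependencies (not from the study).
COMMIT_DATE = {
    "c1": date(2012, 1, 10),
    "c2": date(2012, 11, 1),
    "c3": date(2013, 1, 5),
    "c4": date(2013, 2, 1),
}
PARENTS = {"c3": ["c2"], "c4": ["c1", "c2", "c3"]}


def parent_ages(change, parents, commit_date):
    """Age in days of each parent change, relative to the given change."""
    return [
        (commit_date[change] - commit_date[parent]).days
        for parent in parents.get(change, [])
    ]


def count_old_parents(change, parents, commit_date, threshold_days=180):
    """Count parents older than threshold_days (assumed cut-off)."""
    ages = parent_ages(change, parents, commit_date)
    return sum(1 for age in ages if age > threshold_days)


OLD_PARENTS_C4 = count_old_parents("c4", PARENTS, COMMIT_DATE)
```

A change such as `c4` that builds on several parents of very different ages is the kind of change the slide associates with combining older functionalities.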

Summary
• Adapting social network metrics to change dependency graphs.
• Comparing prediction models.
  - Change genealogies are well suited for defect prediction (better precision, close recall).
  - Code entities combining multiple older functionalities are more defect-prone.


IMPLEMENTATION OF DYNAMIC COUPLING MEASUREMENT OF DISTRIBUTED OBJECT ORIENTED...

IJCSEA Journal

Software metrics are increasingly playing a central role in the planning and control of software development projects. Coupling measures have important applications in software development and maintenance. Existing literature on software metrics is mainly focused on centralized systems, while work in the area of distributed systems, particularly in service-oriented systems, is scarce. Distributed systems with service oriented components are even more heterogeneous networking and execution environment. Traditional coupling measures take into account only “static” couplings. They do not account for “dynamic” couplings due to polymorphism and may significantly underestimate the complexity of software and misjudge the need for code inspection, testing and debugging. This is expected to result in poor predictive accuracy of the quality models in distributed Object Oriented systems that utilize static coupling measurements. In order to overcome these issues, we propose a hybrid model in Distributed Object Oriented Software for measure the coupling dynamically. In the proposed method, there are three steps such as Instrumentation process, Post processing and Coupling measurement. Initially the instrumentation process is done. In this process the instrumented JVM that has been modified to trace method calls. During this process, three trace files are created namely .prf, .clp, .svp. In the second step, the information in these file are merged. At the end of this step, the merged detailed trace of each JVM contains pointers to the merged trace files of the other JVM such that the path of every remote call from the client to the server can be uniquely identified. Finally, the coupling metrics are measured dynamically. The implementation results show that the proposed system will effectively measure the coupling metrics dynamically.

Similar to Predicting Defects Using Change Genealogies (ISSE 2013) (20)

Network Intrusion Detection (1)-converted-1.pptx

MLOps and Data Quality: Deploying Reliable ML Models in Production

laptop price prediction presentation

DATI, AI E ROBOTICA @POLITO

#Interactive Session by Vivek Patle and Jahnavi Umarji, "Empowering Functiona...

Data analytics in computer networking

Predictive Model and Record Description with Segmented Sensitivity Analysis (...

Presentation1.pptx

The Dangers of Machine Learning

ISEN 613_Team3_Final Project Report

ICMCSI 2023 PPT 1074.pptx

SEMI SUPERVISED BASED SPATIAL EM FRAMEWORK FOR MICROARRAY ANALYSIS

Model-Simulation-and-Measurement-Based Systems Engineering of Power System Sy...

The Status of ML Algorithms for Structure-property Relationships Using Matb...

Cyb 5675 class project final

2cee Master Cocomo20071

Study on reliability optimization problem of computer By Dharmendra Singh[Srm...

IMPLEMENTATION OF DYNAMIC COUPLING MEASUREMENT OF DISTRIBUTED OBJECT ORIENTED...

Predicting Defects Using Change Genealogies (ISSE 2013)

1. Predicting Defects Using Change Genealogies
Kim Herzig*, Sascha Just†, Andreas Rau†, Andreas Zeller†
* Microsoft Research, UK
† Saarland University, Germany

2. Prediction Models
• Goal: determine the likelihood of bugs in code entities
  ◦ Quality assurance limited by time and money.
  ◦ Can be helpful for project outsiders.
• Trained on “ground truth”
  ◦ Known instances and their properties.
  ◦ Idea: learning from past for future.
• Predicting / estimating defect likelihood of new, unknown code entities

3. Fine-Tuning Prediction Models
• Machine Learner
• Training Methods
• Metrics (independent variables)
• Prediction Target

4. (Social) Network Metrics
• Some participants more active and central than others.
• Are these participants also more crucial?

5. Code Network Metrics
[2008] Zimmermann and Nagappan: “Predicting Defects using Network Analysis on Dependency Graphs”
• Code entities communicate with each other.
• Use call graph network to compute network metrics.
• Assumption: “Central binaries tend to be defect-prone”.
Call graphs do not change significantly over time!

6. Change Network Metrics
Idea: Use dependencies between code changes (change genealogies)
• Code changes depend on each other.
• Central code changes tend to be crucial.
• Assumption: “Code being crucially changed tends to be defect-prone”.

7. Change Genealogies (in a nutshell)
[2013] Kim Herzig: “Mining and Untangling Change Genealogies” (PhD thesis)
• Directed graph structure
• Method level dependencies
• Multi-dimensional (space & time)

8. Change Genealogy Metrics
• EGO network metrics
  ◦ Measures the immediate impact of changes on other changes.
• GLOBAL network metrics
  ◦ Express the long-term impact of changes on other changes.
• Considering the type of the change
  ◦ Adding method definition, modifying method call
• Considering parent age
  ◦ How old are the parent changes a change depends on.
Change genealogy metrics must be aggregated to source file level.
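The metric families on this slide can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the `Change` record, the four example changes, the metric names, and the choice of `max` as the file-level aggregation are hypothetical and are not taken from the study, which defines its own metric set.

```python
from collections import defaultdict
from dataclasses import dataclass
from statistics import mean


@dataclass(frozen=True)
class Change:
    """One vertex of a toy change genealogy (all fields are illustrative)."""
    cid: str    # change identifier
    file: str   # source file touched by the change
    day: int    # commit time, in days since project start


# Hypothetical changes and dependency edges (child -> parents it depends on).
changes = {
    "c1": Change("c1", "A.java", 0),
    "c2": Change("c2", "B.java", 10),
    "c3": Change("c3", "A.java", 40),
    "c4": Change("c4", "B.java", 45),
}
parents = {"c1": [], "c2": ["c1"], "c3": ["c1", "c2"], "c4": ["c3"]}

# Reverse edges: parent -> changes that directly depend on it.
dependents = defaultdict(list)
for child, ps in parents.items():
    for p in ps:
        dependents[p].append(child)


def transitive_dependents(cid):
    """GLOBAL-style metric: all later changes reachable from `cid`."""
    seen, stack = set(), [cid]
    while stack:
        for nxt in dependents[stack.pop()]:
            if nxt not in seen:
                seen.add(nxt)
                stack.append(nxt)
    return seen


def change_metrics(cid):
    """Compute the illustrative metrics for a single change."""
    c = changes[cid]
    ages = [c.day - changes[p].day for p in parents[cid]]
    return {
        "num_parents": len(parents[cid]),        # EGO: direct dependencies
        "num_dependents": len(dependents[cid]),  # EGO: direct impact
        "num_transitive_dependents": len(transitive_dependents(cid)),  # GLOBAL
        "avg_parent_age": mean(ages) if ages else 0.0,  # parent age in days
    }


def file_metrics():
    """Aggregate per-change metrics to file level (here: max of each metric)."""
    per_file = defaultdict(list)
    for cid, c in changes.items():
        per_file[c.file].append(change_metrics(cid))
    return {
        f: {k: max(m[k] for m in ms) for k in ms[0]}
        for f, ms in per_file.items()
    }


if __name__ == "__main__":
    for name, metrics in sorted(file_metrics().items()):
        print(name, metrics)
```

Other aggregations (sum, mean, median) would be equally plausible at the file level; `max` is used here only to keep the sketch short.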

9. Experimental Setup
Comparing change genealogies against:
• Code complexity models (e.g. McCabe)
• Code dependency models (Zimmermann & Nagappan)
• Combined network models (Change genealogy & code dependency network metrics)

10. Experimental Setup
• Study subjects
• Multiple machine learners

11. Prediction Precision
[Figure: prediction precision per metric set. Legend: NM & CGM; change genealogy metrics; code dependency network metrics (Zimmermann & Nagappan); code complexity metrics]

12. Confirmed: Network metrics outperform complexity metrics.
• Change genealogy models report fewer false positives (higher precision).
• Change genealogy models report slightly more false negatives (lower recall).
• Combining network metrics: good recall but worse precision.
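The trade-off between false positives and false negatives can be made concrete with a small sketch. The file names and predictions below are invented for illustration and are not results from the study. Precision is TP / (TP + FP) and recall is TP / (TP + FN).

```python
def precision_recall(predicted, actual):
    """Return (precision, recall) for sets of predicted and actual defective files."""
    tp = len(predicted & actual)   # correctly flagged files
    fp = len(predicted - actual)   # flagged, but not defective
    fn = len(actual - predicted)   # defective, but missed
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return precision, recall


# Hypothetical ground truth: files that actually contained defects.
actual = {"A", "B", "C", "D"}

# A cautious model flags few files: no false positives, but it misses C and D.
cautious = {"A", "B"}
print(precision_recall(cautious, actual))   # (1.0, 0.5)

# A liberal model flags many files: it finds every defect, with two false alarms.
liberal = {"A", "B", "C", "D", "E", "F"}
print(precision_recall(liberal, actual))
```

In these terms, the slide reports the change genealogy models as closer to the cautious end (higher precision, slightly lower recall) and the combined network models as closer to the liberal end.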

13. Influential Metrics
• Network efficiency is among the top 10 most influential metrics.
• The relationships between changes and the type of dependency are the top 2 metrics (for all projects).
• The higher the number of old parents, the higher the probability of adding bugs.
  ◦ Code entities combining multiple older functionalities are more defect-prone.

14. Summary
Adapting social network metrics to change dependency graphs. Comparing prediction models.
• Change genealogies are well suited for defect prediction (better precision, close recall).
• Code entities combining multiple older functionalities more defect prone.
