SlideShare a Scribd company logo
1 of 44
Empirical Software Engineering (MSE 507 )
Dr. Assad Abbas
Department of Computer Science
COMSATS Institute of Information Technology, Islamabad
assadabbas@comsats.edu.pk
2
October 14, 2022
Introduction
 Engineering requires the development of the product in a scientific,
well formed, and systematic manner
 Software Engineering (IEEE definition)
The application of a systematic, disciplined, quantifiable approach to
the development, operation and maintenance of software, and the
study of these approaches, that is, the application of engineering to
software.
 Empirical
“Empirical” is typically used to define any statement about the world
that is related to observation or experience.
3
October 14, 2022
What is Empirical Software Engineering?
 Empirical Software Engineering
emphasizes the use of empirical methods in the field of
software engineering.
 involves methods for evaluating, assessing, predicting,
monitoring, and controlling the existing artifacts of
software development.
applies quantitative methods to the software engineering
phenomenon to understand software development better.
 Gaining importance because of the availability of vast
data sets from open source repositories that contain
information about software requirements, bugs, and
changes
4
October 14, 2022
Overview of Empirical Studies
 Empirical study (ES) is an attempt to compare the theories
and observations using real-life data for analysis
 ES utilize data analysis and statistical methods for exploring
relationships
 Benefits of ES
Allow to explore relationships
Allow to prove theoretical concepts
Allow to evaluate accuracy of the models
Allow to choose among tools and techniques
Allow to establish quality benchmarks across software
organizations
Allow to assess and improve techniques and methods
5
October 14, 2022
Empirical Studies in Software Engineering
 Empirical studies are important in the area of software
engineering as they allow:
software professionals to evaluate and assess the new
concepts, technologies, tools, and techniques in scientific and
proved manner.
improving, managing, and controlling the existing processes
and techniques by using evidence obtained from the empirical
analysis.
to gather evidence that can be used to support the claims of
efficiency of a given technique or technology
 The empirical information can help software management in
decision making and improving software processes
6
October 14, 2022
Steps in Empirical Studies
Formation of research questions
Formation of a research hypothesis
Gathering data
Analyzing the data
Developing and validating models
Deriving conclusions from the obtained results
7
October 14, 2022
Qualitative and Quantitative Research
Empirical studies can be classified as:
Quantitative &
Qualitative
Quantitative Research
The most widely used scientific method in SE that applies
mathematical or statistical-based methods to derive conclusions.
Is used to prove or disprove a hypothesis (a concept that has to be
tested for further investigation).
The aim of a quantitative research is to generate results that are
generalizable and unbiased and thus can be applied to a larger
population in research.
It uses statistical methods to validate a hypothesis and to explore
causal relationships.
Common quantitative methods include experiments, observations
recorded as numbers, and surveys with closed-ended questions
8
October 14, 2022
Qualitative and Quantitative Research
Qualitative Research
In qualitative research, the researchers study human behavior,
preferences, and nature.
Provides an in-depth analysis of the concept under
investigation and thus uses focused data for research.
Understanding a new process or technique in software
engineering is an example of qualitative research.
Qualitative research provides textual descriptions or pictures
related to human beliefs or behavior.
It can be extended to other studies with similar populations but
generalizations of a particular phenomenon may be difficult.
Common qualitative methods include interviews with open-
ended questions, observations described in words, and
literature reviews that explore concepts and theories.
9
October 14, 2022
Qualitative and Quantitative Research
10
October 14, 2022
Types of Empirical Studies
Experiments
Case studies
Surveys
Systematic Reviews
11
October 14, 2022
Types of Empirical Studies
Experiments
An experimental study tests the established hypothesis by finding the
effect of variables of interest (independent variables) on the outcome
variable (dependent variable) using statistical analysis.
If the experiment is carried out correctly, the hypothesis is either
accepted or rejected.
 For example, one group uses inspections and the other group uses
walkthroughs to identify design defects, which technique is more effective
in detecting a larger number of defects?
The experiment must also control the confounding variables, which
may affect the accuracy of the results produced by the experiment.
The experiments are carried out in a controlled environment and often
referred to as controlled experiments
The key factors involved in the experiments are: independent
variables, dependent variables, hypothesis, and statistical techniques
12
October 14, 2022
Types of Empirical Studies
Experiments
A controlled experiment involves varying the
variables (one or more) and keeping everything else
constant or the same and are usually conducted in
small or laboratory setting
Example: As discussed in previous slide (inspections
and walkthroughs)
13
October 14, 2022
Types of Empirical Studies
Case Study
Case study research represents real-world scenarios and
involves studying a particular phenomenon
For example: to evaluate a particular software tool,
method, or process
The effect of a change in an organization can be studied
using case study research
Case studies are appropriate where a phenomenon is to
be studied for a longer period of time so that the effects of
the phenomenon can be observed.
The disadvantages of case studies include difficulty in
generalization as they represent a typical situation.
Since they are based on a particular case, the validity of
the results is questionable.
14
October 14, 2022
Types of Empirical Studies
Case Study
Phases in Case Study
The case study design phase involves identifying existing
objectives, cases, research questions, and data-collection
strategies.
The case may be a tool, technology, technique, process,
product, individual, or software. Qualitative data is usually
collected in a case study.
The sources include interviews, group discussions, or
observations. The data may be directly or indirectly collected
from participants..
15
October 14, 2022
Types of Empirical Studies
Survey Research
Survey research identifies features or information from a
large scale of a population.
For example, surveys can be used when a researcher
wants to know whether the use of a particular process has
improved the view of clients toward software usability
features. This information can be obtained by asking the
selected software testers to fill questionnaires.
Surveys are usually conducted using questionnaires and
interviews.
The questionnaires are constructed to collect research-
related information.
Surveys can be descriptive, explorative, and
explanatory.
16
October 14, 2022
Types of Empirical Studies
Survey Research
Descriptive survey research—describes a concept or topic in
more detail
Explorative—focuses on the discovery of new ideas and
insights and are usually conducted at the beginning of a
research study to gather initial information
Explanatory—attempts to explain how things work in
connections like cause and effect, meaning a researcher wants
to explain how things interact or work with each other
For example, while exploring relationship between various
independent variables and an outcome variable, a researcher
may want to explain why an independent variable affects the
outcome variable.
17
October 14, 2022
Types of Empirical Studies
Systematic Reviews
Literature review is an important part that examines the
existing position of literature in an area in which the
research is being conducted.
The systematic reviews are methodically undertaken with
a specific search strategy and well-defined methodology
to answer a set of questions
Kitchenham (2007) defines systematic review as:
 A form of secondary study that uses a well-defined
methodology to identify, analyze and interpret all available
evidence related to a specific research question in a way
that is unbiased and (to a degree) repeatable.
The purpose of a systematic review is to summarize the
existing research and provide future guidelines for
research by identifying gaps in the existing literature
18
October 14, 2022
Types of Empirical Studies
Systematic Reviews
A systematic review involves:
 Defining research questions.
 Forming and documenting a search strategy.
 Determining inclusion and exclusion criteria.
 Establishing quality assessment criteria.
Steps in systematic revies
19
October 14, 2022
Empirical vs. Experimental Studies
Are the two same?
Used interchangeably, but there is some difference
In experimental software engineering, the dependent
variable is closely observed in a controlled environment.
Empirical studies are used to define anything related to
observation and experience and are valuable as these
studies consider real-world data.
In experimental studies, data is artificial or synthetic but is
more controlled.
20
October 14, 2022
Empirical Study Process
 Without a sound and proven research process, it is difficult to
carry out efficient and effective research.
 Thus, a research methodology must be complete and
repeatable, which, when followed, in a replicated or empirical
study, will enable comparisons to be made across various
studies
21
October 14, 2022
Empirical Study Process
22
October 14, 2022
Empirical Study Process
Study Definition
Scope: What are the dimensions of the study?
Motivation: Why is it being studied? (e.g. to analyze
and assess the capability of a technique or method)
Object: What entity is being studied? (e.g. process,
product, technique being studied)
Purpose: What is the aim of the study? (e.g. to prove
that technique A is superior to technique B)
Perspective: From whose view is the study being
conducted (e.g. project manager, customer)?
Domain: What is the area of the study?
23
October 14, 2022
Empirical Study Process
Experiment Design
Research Questions
 The first step is to formulate the research problem. This
step states the problem in the form of questions and
identifies the main concepts and relations to be
explored.
RQ1: What will be the effect of software metrics on
quality attributes (such as fault proneness/testing
effort/maintenance effort) of a class?
RQ2. Are machine-learning methods adaptable to
object-oriented systems for predicting quality
attributes?
24
October 14, 2022
Empirical Study Process
Experiment Design
Independent and Dependent Variables
 To analyze relationships, the next step is to define the
dependent and the independent variables.
 The outcome variable predicted by the independent
variables is called the dependent variable (response
variable).
For instance, the dependent variables of the models
chosen for analysis may be fault proneness, testing
effort, and maintenance effort.
 A variable used to predict or estimate a dependent
variable is called the independent (explanatory or
predictor) variable.
25
October 14, 2022
Empirical Study Process
Experiment Design
Hypothesis Formulation
 The researcher should carefully state the hypothesis to
be tested in the study.
 The hypothesis is tested on the sample data. On the
basis of the result from the sample, a decision concerning
the validity of the hypothesis (acceptance or rejection) is
made.
 Consider an example where a hypothesis is to be formed
for comparing a number of methods for predicting fault-
prone classes.
H–M: M outperforms the compared methods for predicting
fault-prone software classes
Null hypothesis: M does not outperform the compared
methods for predicting fault prone software classes.
26
October 14, 2022
Empirical Study Process
Experiment Design
Empirical Data Collection
 e.g. university/academic systems, commercial
systems, or open source software
Empirical Methods
 The data analysis techniques are selected based on
the type of the dependent variables used
 Statistical techniques: linear regression and logistic
regression
 Machine-learning techniques: decision trees, support
vector machines, and so on
27
October 14, 2022
Research Conduct and Analysis
Research Conduct and Analysis
Descriptive Statistics
 Descriptive statistics are brief informational coefficients
that summarize a given data set, which can be either a
representation of the entire population or a sample of a
population.
 Descriptive statistics are broken down into measures of
central tendency and measures of variability (spread).
Measures of central tendency include the mean,
median, and mode
Measures of variability include standard deviation,
variance, minimum and maximum variables
28
October 14, 2022
Empirical Study Process
Research Conduct and Analysis
Attribute Reduction
 Feature selection is an important step that identifies
and removes as much of the irrelevant and redundant
information as possible.
 The dimensionality of the data reduces the size of the
hypothesis space and allows the methods to operate
faster and more effectively
29
October 14, 2022
Research Conduct and Analysis
Research Conduct and Analysis
Statistical Analysis
Following steps are used in analysis
 Model Prediction
Multivariate analysis is used to find the combined effect of
each independent variable on the dependent variable.
Based on the results of performance measures, the
performance of models predicted is evaluated and the
results are interpreted.
 Model Validation
The validation data is used for validating the model
predicted in the previous step
 Hypothesis Testing
It determines whether the null hypothesis can be rejected
at a specified confidence level
30
October 14, 2022
Research Conduct and Analysis
Result Interpretation
In this step, the results computed in the empirical study’s
analysis phase are assessed and discussed.
The reason behind the acceptance or rejection of the
hypothesis is examined.
This process provides insight to the researchers about the
actual reasons of the decision made for hypothesis.
The conclusions are derived from the results obtained in
the study.
The significance and practical relevance of the results are
defined in this phase.
The limitations of the study are also reported in the form
of threats to validity.
31
October 14, 2022
Research Conduct and Analysis
Reporting
Report is generated after the study has been
conducted and results are finalized
To be prepared from readers’ perspective
Clearly document the: background, motivation,
analysis, design, results, and the discussion of the
results
The experiment settings, data-collection methods,
and design processes must be reported in significant
level of detail
The audience should be able to replicate or repeat
the results of a study in a similar context
32
October 14, 2022
Characteristics of a Good Empirical Study
 Clear
 research goals, hypothesis, and data-collection procedure
 Descriptive
 should help in repeating and replication
 Precise
 Should help improve confidence
 Valid
 Should be valid for a wider population
 Unbiased
 No influence of any sort of factors to get the desired results (selection of
participants)
 Control
 Experiment design should be able to control the independent variables to
minimize confounding effects of other variables
 Replicable
 Repeating the experiment with different data under same conditions
 Repeatable
 Reproducing the results of the study under similar settings
33
October 14, 2022
Ethics of Empirical Research
 Researchers, academicians, and sponsors should be aware of research
ethics while conducting and funding empirical research in software
engineering.
 The upholding of ethical standards helps to develop trust between the
researcher and the participant, and thus smoothens the research process.
 An unethical study can harm the reputation of the research conducted in
software engineering area
 In some countries like the United States, the sponsoring agency requires
that the research involving participants must be reviewed by a third-party
ethics committee to verify that the research complies with the ethical
principles and standards
34
October 14, 2022
Ethics of Empirical Research
Examples of Unethical Research
35
October 14, 2022
Ethics of Empirical Research
Informed consent
Five elements— disclosure, comprehension,
voluntariness, consent, and right to withdraw
Consent form— Research title, contact details,
consent and comprehension, withdrawal,
confidentiality, risks and benefits, clarification,
signatures
Scientific value
Confidentiality
Beneficence
36
October 14, 2022
Basic Elements of Empirical Research
37
October 14, 2022
Terminologies
Software quality and software evolution
Software quality determines how well the software is
designed (quality of design), and how well the
software conforms to that design (quality of
conformance).
Software evolution (maintenance) involves making
changes in the software. Changes are required
because of the following reasons:
 Defects reported by the customer
 New functionality requested by the customer
 Improvement in the quality of the software
 To adapt to new platforms
38
October 14, 2022
Terminologies
Software Quality Attributes
39
October 14, 2022
Terminologies
Descriptive, Correlational, and Cause–Effect
Research
Descriptive research provides description of
concepts.
Correlational research provides relation between two
variables.
Cause–effect research is similar to experiment
research in that the effect of one variable on another
is found.
40
October 14, 2022
Terminologies
Classification and Prediction
Classification predicts categorical outcome variables
The training data is used for model development, and
the model can be used for predicting unknown
categories of outcome variables.
41
October 14, 2022
Terminologies
 Qualitative and quantitative data
 Dependent, independent and confounding variables
 Parametric and non-parametric tests
These tests are applied to determine the validity of the
research hypothesis
Parametric tests are used for data samples having normal
distribution (bell-shaped curve)
Nonparametric tests are used when the distribution of
data samples is highly skewed (left, right skewed)
42
October 14, 2022
Terminologies
Goal Question Metric (GQM)
GQM method follows top-down approach for dividing
goals into questions and mapping questions to
metrics, and follows bottom-up approach by
interpreting the measurement to verify whether the
goals have been satisfied.
 Goal: Improve the defect-correction rate in the system.
 Question: How many defects have been corrected in
the maintenance phase?
 Metric: Number of defects corrected/Number of defects
reported.
43
October 14, 2022
Terminologies
 Software Archives or Repositories
The progress of the software is managed using software
repositories that include source code, documentation,
archived communications, and defect-tracking systems.
 For maintaining software systems, improving software quality, and
empirical validation of data and techniques.
Researchers can mine these repositories to understand
the software development, software evolution, and make
predictions. For example, defects can be predicted using
historical data, and this information can be used to
produce less defective future releases.
The data is kept in various types of software repositories
such as CVS, Git, SVN, ClearCase, Perforce, Mercurial,
Veracity, and Fossil.
44
October 14, 2022
Source
Malhotra, Ruchika. Empirical research in software
engineering: concepts, analysis, and applications.
CRC press, 2016.

More Related Content

What's hot

Introductions to Online Machine Learning Algorithms
Introductions to Online Machine Learning AlgorithmsIntroductions to Online Machine Learning Algorithms
Introductions to Online Machine Learning AlgorithmsDataWorks Summit
 
Federated learning
Federated learningFederated learning
Federated learningMindos Cheng
 
Introduction to Data Science.pptx
Introduction to Data Science.pptxIntroduction to Data Science.pptx
Introduction to Data Science.pptxVrishit Saraswat
 
Federated learning in brief
Federated learning in briefFederated learning in brief
Federated learning in briefShashi Perera
 
New Trends in software development
New Trends in software developmentNew Trends in software development
New Trends in software developmentKabir Khanna
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine Learningshivani saluja
 
Computational Intelligence: concepts and applications using Athena
Computational Intelligence: concepts and applications using AthenaComputational Intelligence: concepts and applications using Athena
Computational Intelligence: concepts and applications using AthenaPedro Almir
 
Introduction To Generative Adversarial Networks GANs
Introduction To Generative Adversarial Networks GANsIntroduction To Generative Adversarial Networks GANs
Introduction To Generative Adversarial Networks GANsHichem Felouat
 
Modeling Physical Systems in the Metaverse Easily with Graphs
Modeling Physical Systems in the Metaverse Easily with GraphsModeling Physical Systems in the Metaverse Easily with Graphs
Modeling Physical Systems in the Metaverse Easily with GraphsNeo4j
 
Privacy preserving machine learning
Privacy preserving machine learningPrivacy preserving machine learning
Privacy preserving machine learningMichał Kuźba
 
Software quality assurance
Software quality assuranceSoftware quality assurance
Software quality assuranceAman Adhikari
 
Data science | What is Data science
Data science | What is Data scienceData science | What is Data science
Data science | What is Data scienceShilpaKrishna6
 
Machine learning life cycle
Machine learning life cycleMachine learning life cycle
Machine learning life cycleRamjee Ganti
 
Software quality assurance lecture 1
Software quality assurance lecture 1Software quality assurance lecture 1
Software quality assurance lecture 1Abdul Basit
 
Managing software project, software engineering
Managing software project, software engineeringManaging software project, software engineering
Managing software project, software engineeringRupesh Vaishnav
 
Scientific methods in computer science
Scientific methods in computer scienceScientific methods in computer science
Scientific methods in computer scienceParinda Rajapaksha
 
Explainable AI in Healthcare
Explainable AI in HealthcareExplainable AI in Healthcare
Explainable AI in Healthcarevonaurum
 
Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18Cloudera, Inc.
 

What's hot (20)

Stm unit1
Stm unit1Stm unit1
Stm unit1
 
Introductions to Online Machine Learning Algorithms
Introductions to Online Machine Learning AlgorithmsIntroductions to Online Machine Learning Algorithms
Introductions to Online Machine Learning Algorithms
 
Federated learning
Federated learningFederated learning
Federated learning
 
Introduction to Data Science.pptx
Introduction to Data Science.pptxIntroduction to Data Science.pptx
Introduction to Data Science.pptx
 
Federated learning in brief
Federated learning in briefFederated learning in brief
Federated learning in brief
 
New Trends in software development
New Trends in software developmentNew Trends in software development
New Trends in software development
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine Learning
 
Computational Intelligence: concepts and applications using Athena
Computational Intelligence: concepts and applications using AthenaComputational Intelligence: concepts and applications using Athena
Computational Intelligence: concepts and applications using Athena
 
Introduction To Generative Adversarial Networks GANs
Introduction To Generative Adversarial Networks GANsIntroduction To Generative Adversarial Networks GANs
Introduction To Generative Adversarial Networks GANs
 
Modeling Physical Systems in the Metaverse Easily with Graphs
Modeling Physical Systems in the Metaverse Easily with GraphsModeling Physical Systems in the Metaverse Easily with Graphs
Modeling Physical Systems in the Metaverse Easily with Graphs
 
Analysis modeling
Analysis modelingAnalysis modeling
Analysis modeling
 
Privacy preserving machine learning
Privacy preserving machine learningPrivacy preserving machine learning
Privacy preserving machine learning
 
Software quality assurance
Software quality assuranceSoftware quality assurance
Software quality assurance
 
Data science | What is Data science
Data science | What is Data scienceData science | What is Data science
Data science | What is Data science
 
Machine learning life cycle
Machine learning life cycleMachine learning life cycle
Machine learning life cycle
 
Software quality assurance lecture 1
Software quality assurance lecture 1Software quality assurance lecture 1
Software quality assurance lecture 1
 
Managing software project, software engineering
Managing software project, software engineeringManaging software project, software engineering
Managing software project, software engineering
 
Scientific methods in computer science
Scientific methods in computer scienceScientific methods in computer science
Scientific methods in computer science
 
Explainable AI in Healthcare
Explainable AI in HealthcareExplainable AI in Healthcare
Explainable AI in Healthcare
 
Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18
 

Similar to Lecture 2.pptx

A Standardized Case Study Framework and Methodology to Identify quot Best Pr...
A Standardized Case Study Framework and Methodology to Identify  quot Best Pr...A Standardized Case Study Framework and Methodology to Identify  quot Best Pr...
A Standardized Case Study Framework and Methodology to Identify quot Best Pr...Carrie Tran
 
Overview of research methodology.pptx
Overview of research methodology.pptxOverview of research methodology.pptx
Overview of research methodology.pptxYogeshPatel28169
 
Chapter 7 (research design)
Chapter 7 (research design)Chapter 7 (research design)
Chapter 7 (research design)BoreyThai1
 
Survey Research article.pdf
Survey Research article.pdfSurvey Research article.pdf
Survey Research article.pdfAyeshaMalickk1
 
quantitative-research.pptx
quantitative-research.pptxquantitative-research.pptx
quantitative-research.pptxcjoypingaron
 
Research Design, Process of research with examples
Research Design, Process of research with examplesResearch Design, Process of research with examples
Research Design, Process of research with examplesDr. Anita Rathod
 
Research Methodology.pptx
Research Methodology.pptxResearch Methodology.pptx
Research Methodology.pptxDrKusumGaur
 
MIS 49100 Week 3 Research Methodology
MIS 49100 Week 3 Research MethodologyMIS 49100 Week 3 Research Methodology
MIS 49100 Week 3 Research Methodologyoudesign
 
Validity in Research
Validity in ResearchValidity in Research
Validity in ResearchEcem Ekinci
 
Business research methods
Business  research methodsBusiness  research methods
Business research methodsHanumant singh
 
Research method in operations management final
Research method in  operations management finalResearch method in  operations management final
Research method in operations management finalindonesiabelajar
 
MethodsofDataCollection.pdf
MethodsofDataCollection.pdfMethodsofDataCollection.pdf
MethodsofDataCollection.pdfMohdTaufiqIshak
 

Similar to Lecture 2.pptx (20)

Lecture 3.pptx
Lecture 3.pptxLecture 3.pptx
Lecture 3.pptx
 
A Standardized Case Study Framework and Methodology to Identify quot Best Pr...
A Standardized Case Study Framework and Methodology to Identify  quot Best Pr...A Standardized Case Study Framework and Methodology to Identify  quot Best Pr...
A Standardized Case Study Framework and Methodology to Identify quot Best Pr...
 
BRM_Research Design.ppt
BRM_Research Design.pptBRM_Research Design.ppt
BRM_Research Design.ppt
 
Overview of research methodology.pptx
Overview of research methodology.pptxOverview of research methodology.pptx
Overview of research methodology.pptx
 
Chapter 7 (research design)
Chapter 7 (research design)Chapter 7 (research design)
Chapter 7 (research design)
 
Survey Research article.pdf
Survey Research article.pdfSurvey Research article.pdf
Survey Research article.pdf
 
quantitative-research.pptx
quantitative-research.pptxquantitative-research.pptx
quantitative-research.pptx
 
Research Design
Research DesignResearch Design
Research Design
 
Research Design, Process of research with examples
Research Design, Process of research with examplesResearch Design, Process of research with examples
Research Design, Process of research with examples
 
Research Methodology.pptx
Research Methodology.pptxResearch Methodology.pptx
Research Methodology.pptx
 
Research methodology (2)
Research methodology (2)Research methodology (2)
Research methodology (2)
 
MIS 49100 Week 3 Research Methodology
MIS 49100 Week 3 Research MethodologyMIS 49100 Week 3 Research Methodology
MIS 49100 Week 3 Research Methodology
 
De carlo rizk 2010 icelw
De carlo rizk 2010 icelwDe carlo rizk 2010 icelw
De carlo rizk 2010 icelw
 
Ijetcas14 438
Ijetcas14 438Ijetcas14 438
Ijetcas14 438
 
Validity in Research
Validity in ResearchValidity in Research
Validity in Research
 
Research methodology presentation
Research methodology presentationResearch methodology presentation
Research methodology presentation
 
Business research methods
Business  research methodsBusiness  research methods
Business research methods
 
Research method in operations management final
Research method in  operations management finalResearch method in  operations management final
Research method in operations management final
 
MethodsofDataCollection.pdf
MethodsofDataCollection.pdfMethodsofDataCollection.pdf
MethodsofDataCollection.pdf
 
Research Design _komal-1.docx
Research Design _komal-1.docxResearch Design _komal-1.docx
Research Design _komal-1.docx
 

Recently uploaded

Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer DataAdobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer DataBradBedford3
 
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...Christina Lin
 
Professional Resume Template for Software Developers
Professional Resume Template for Software DevelopersProfessional Resume Template for Software Developers
Professional Resume Template for Software DevelopersVinodh Ram
 
Unveiling Design Patterns: A Visual Guide with UML Diagrams
Unveiling Design Patterns: A Visual Guide with UML DiagramsUnveiling Design Patterns: A Visual Guide with UML Diagrams
Unveiling Design Patterns: A Visual Guide with UML DiagramsAhmed Mohamed
 
Automate your Kamailio Test Calls - Kamailio World 2024
Automate your Kamailio Test Calls - Kamailio World 2024Automate your Kamailio Test Calls - Kamailio World 2024
Automate your Kamailio Test Calls - Kamailio World 2024Andreas Granig
 
The Evolution of Karaoke From Analog to App.pdf
The Evolution of Karaoke From Analog to App.pdfThe Evolution of Karaoke From Analog to App.pdf
The Evolution of Karaoke From Analog to App.pdfPower Karaoke
 
Der Spagat zwischen BIAS und FAIRNESS (2024)
Der Spagat zwischen BIAS und FAIRNESS (2024)Der Spagat zwischen BIAS und FAIRNESS (2024)
Der Spagat zwischen BIAS und FAIRNESS (2024)OPEN KNOWLEDGE GmbH
 
Folding Cheat Sheet #4 - fourth in a series
Folding Cheat Sheet #4 - fourth in a seriesFolding Cheat Sheet #4 - fourth in a series
Folding Cheat Sheet #4 - fourth in a seriesPhilip Schwarz
 
Asset Management Software - Infographic
Asset Management Software - InfographicAsset Management Software - Infographic
Asset Management Software - InfographicHr365.us smith
 
Salesforce Certified Field Service Consultant
Salesforce Certified Field Service ConsultantSalesforce Certified Field Service Consultant
Salesforce Certified Field Service ConsultantAxelRicardoTrocheRiq
 
What is Fashion PLM and Why Do You Need It
What is Fashion PLM and Why Do You Need ItWhat is Fashion PLM and Why Do You Need It
What is Fashion PLM and Why Do You Need ItWave PLM
 
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...MyIntelliSource, Inc.
 
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...gurkirankumar98700
 
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxKnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxTier1 app
 
EY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityEY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityNeo4j
 
Cloud Data Center Network Construction - IEEE
Cloud Data Center Network Construction - IEEECloud Data Center Network Construction - IEEE
Cloud Data Center Network Construction - IEEEVICTOR MAESTRE RAMIREZ
 
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASEBATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASEOrtus Solutions, Corp
 
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...MyIntelliSource, Inc.
 
software engineering Chapter 5 System modeling.pptx
software engineering Chapter 5 System modeling.pptxsoftware engineering Chapter 5 System modeling.pptx
software engineering Chapter 5 System modeling.pptxnada99848
 

Recently uploaded (20)

Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer DataAdobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
 
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
 
Professional Resume Template for Software Developers
Professional Resume Template for Software DevelopersProfessional Resume Template for Software Developers
Professional Resume Template for Software Developers
 
Hot Sexy call girls in Patel Nagar🔝 9953056974 🔝 escort Service
Hot Sexy call girls in Patel Nagar🔝 9953056974 🔝 escort ServiceHot Sexy call girls in Patel Nagar🔝 9953056974 🔝 escort Service
Hot Sexy call girls in Patel Nagar🔝 9953056974 🔝 escort Service
 
Unveiling Design Patterns: A Visual Guide with UML Diagrams
Unveiling Design Patterns: A Visual Guide with UML DiagramsUnveiling Design Patterns: A Visual Guide with UML Diagrams
Unveiling Design Patterns: A Visual Guide with UML Diagrams
 
Automate your Kamailio Test Calls - Kamailio World 2024
Automate your Kamailio Test Calls - Kamailio World 2024Automate your Kamailio Test Calls - Kamailio World 2024
Automate your Kamailio Test Calls - Kamailio World 2024
 
The Evolution of Karaoke From Analog to App.pdf
The Evolution of Karaoke From Analog to App.pdfThe Evolution of Karaoke From Analog to App.pdf
The Evolution of Karaoke From Analog to App.pdf
 
Der Spagat zwischen BIAS und FAIRNESS (2024)
Der Spagat zwischen BIAS und FAIRNESS (2024)Der Spagat zwischen BIAS und FAIRNESS (2024)
Der Spagat zwischen BIAS und FAIRNESS (2024)
 
Folding Cheat Sheet #4 - fourth in a series
Folding Cheat Sheet #4 - fourth in a seriesFolding Cheat Sheet #4 - fourth in a series
Folding Cheat Sheet #4 - fourth in a series
 
Asset Management Software - Infographic
Asset Management Software - InfographicAsset Management Software - Infographic
Asset Management Software - Infographic
 
Salesforce Certified Field Service Consultant
Salesforce Certified Field Service ConsultantSalesforce Certified Field Service Consultant
Salesforce Certified Field Service Consultant
 
What is Fashion PLM and Why Do You Need It
What is Fashion PLM and Why Do You Need ItWhat is Fashion PLM and Why Do You Need It
What is Fashion PLM and Why Do You Need It
 
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
 
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
 
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxKnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
 
EY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityEY_Graph Database Powered Sustainability
EY_Graph Database Powered Sustainability
 
Cloud Data Center Network Construction - IEEE
Cloud Data Center Network Construction - IEEECloud Data Center Network Construction - IEEE
Cloud Data Center Network Construction - IEEE
 
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASEBATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
 
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
 
software engineering Chapter 5 System modeling.pptx
software engineering Chapter 5 System modeling.pptxsoftware engineering Chapter 5 System modeling.pptx
software engineering Chapter 5 System modeling.pptx
 

Lecture 2.pptx

  • 1. Empirical Software Engineering (MSE 507 ) Dr. Assad Abbas Department of Computer Science COMSATS Institute of Information Technology, Islamabad assadabbas@comsats.edu.pk
  • 2. 2 October 14, 2022 Introduction  Engineering requires the development of the product in a scientific, well formed, and systematic manner  Software Engineering (IEEE definition) The application of a systematic, disciplined, quantifiable approach to the development, operation and maintenance of software, and the study of these approaches, that is, the application of engineering to software.  Empirical “Empirical” is typically used to define any statement about the world that is related to observation or experience.
  • 3. 3 October 14, 2022 What is Empirical Software Engineering?  Empirical Software Engineering emphasizes the use of empirical methods in the field of software engineering.  involves methods for evaluating, assessing, predicting, monitoring, and controlling the existing artifacts of software development. applies quantitative methods to the software engineering phenomenon to understand software development better.  Gaining importance because of the availability of vast data sets from open source repositories that contain information about software requirements, bugs, and changes
  • 4. 4 October 14, 2022 Overview of Empirical Studies  Empirical study (ES) is an attempt to compare the theories and observations using real-life data for analysis  ES utilize data analysis and statistical methods for exploring relationships  Benefits of ES Allow to explore relationships Allow to prove theoretical concepts Allow to evaluate accuracy of the models Allow to choose among tools and techniques Allow to establish quality benchmarks across software organizations Allow to assess and improve techniques and methods
  • 5. 5 October 14, 2022 Empirical Studies in Software Engineering  Empirical studies are important in the area of software engineering as they allow: software professionals to evaluate and assess the new concepts, technologies, tools, and techniques in scientific and proved manner. improving, managing, and controlling the existing processes and techniques by using evidence obtained from the empirical analysis. to gather evidence that can be used to support the claims of efficiency of a given technique or technology  The empirical information can help software management in decision making and improving software processes
  • 6. 6 October 14, 2022 Steps in Empirical Studies Formation of research questions Formation of a research hypothesis Gathering data Analyzing the data Developing and validating models Deriving conclusions from the obtained results
  • 7. 7 October 14, 2022 Qualitative and Quantitative Research Empirical studies can be classified as: Quantitative & Qualitative Quantitative Research The most widely used scientific method in SE that applies mathematical or statistical-based methods to derive conclusions. Is used to prove or disprove a hypothesis (a concept that has to be tested for further investigation). The aim of a quantitative research is to generate results that are generalizable and unbiased and thus can be applied to a larger population in research. It uses statistical methods to validate a hypothesis and to explore causal relationships. Common quantitative methods include experiments, observations recorded as numbers, and surveys with closed-ended questions
  • 8. 8 October 14, 2022 Qualitative and Quantitative Research Qualitative Research In qualitative research, the researchers study human behavior, preferences, and nature. Provides an in-depth analysis of the concept under investigation and thus uses focused data for research. Understanding a new process or technique in software engineering is an example of qualitative research. Qualitative research provides textual descriptions or pictures related to human beliefs or behavior. It can be extended to other studies with similar populations but generalizations of a particular phenomenon may be difficult. Common qualitative methods include interviews with open- ended questions, observations described in words, and literature reviews that explore concepts and theories.
  • 9. 9 October 14, 2022 Qualitative and Quantitative Research
  • 10. 10 October 14, 2022 Types of Empirical Studies Experiments Case studies Surveys Systematic Reviews
  • 11. 11 October 14, 2022 Types of Empirical Studies Experiments An experimental study tests the established hypothesis by finding the effect of variables of interest (independent variables) on the outcome variable (dependent variable) using statistical analysis. If the experiment is carried out correctly, the hypothesis is either accepted or rejected.  For example, one group uses inspections and the other group uses walkthroughs to identify design defects, which technique is more effective in detecting a larger number of defects? The experiment must also control the confounding variables, which may affect the accuracy of the results produced by the experiment. The experiments are carried out in a controlled environment and often referred to as controlled experiments The key factors involved in the experiments are: independent variables, dependent variables, hypothesis, and statistical techniques
  • 12. 12 October 14, 2022 Types of Empirical Studies Experiments A controlled experiment involves varying the variables (one or more) and keeping everything else constant or the same and are usually conducted in small or laboratory setting Example: As discussed in previous slide (inspections and walkthroughs)
  • 13. 13 October 14, 2022 Types of Empirical Studies Case Study Case study research represents real-world scenarios and involves studying a particular phenomenon For example: to evaluate a particular software tool, method, or process The effect of a change in an organization can be studied using case study research Case studies are appropriate where a phenomenon is to be studied for a longer period of time so that the effects of the phenomenon can be observed. The disadvantages of case studies include difficulty in generalization as they represent a typical situation. Since they are based on a particular case, the validity of the results is questionable.
  • 14. 14 October 14, 2022 Types of Empirical Studies Case Study Phases in Case Study The case study design phase involves identifying existing objectives, cases, research questions, and data-collection strategies. The case may be a tool, technology, technique, process, product, individual, or software. Qualitative data is usually collected in a case study. The sources include interviews, group discussions, or observations. The data may be directly or indirectly collected from participants..
  • 15. 15 October 14, 2022 Types of Empirical Studies Survey Research Survey research identifies features or information from a large scale of a population. For example, surveys can be used when a researcher wants to know whether the use of a particular process has improved the view of clients toward software usability features. This information can be obtained by asking the selected software testers to fill questionnaires. Surveys are usually conducted using questionnaires and interviews. The questionnaires are constructed to collect research- related information. Surveys can be descriptive, explorative, and explanatory.
  • 16. 16 October 14, 2022 Types of Empirical Studies Survey Research Descriptive survey research—describes a concept or topic in more detail Explorative—focuses on the discovery of new ideas and insights and are usually conducted at the beginning of a research study to gather initial information Explanatory—attempts to explain how things work in connections like cause and effect, meaning a researcher wants to explain how things interact or work with each other For example, while exploring relationship between various independent variables and an outcome variable, a researcher may want to explain why an independent variable affects the outcome variable.
  • 17. 17 October 14, 2022 Types of Empirical Studies Systematic Reviews Literature review is an important part that examines the existing position of literature in an area in which the research is being conducted. The systematic reviews are methodically undertaken with a specific search strategy and well-defined methodology to answer a set of questions Kitchenham (2007) defines systematic review as:  A form of secondary study that uses a well-defined methodology to identify, analyze and interpret all available evidence related to a specific research question in a way that is unbiased and (to a degree) repeatable. The purpose of a systematic review is to summarize the existing research and provide future guidelines for research by identifying gaps in the existing literature
  • 18. 18 October 14, 2022 Types of Empirical Studies Systematic Reviews A systematic review involves:  Defining research questions.  Forming and documenting a search strategy.  Determining inclusion and exclusion criteria.  Establishing quality assessment criteria. Steps in systematic revies
  • 19. 19 October 14, 2022 Empirical vs. Experimental Studies Are the two same? Used interchangeably, but there is some difference In experimental software engineering, the dependent variable is closely observed in a controlled environment. Empirical studies are used to define anything related to observation and experience and are valuable as these studies consider real-world data. In experimental studies, data is artificial or synthetic but is more controlled.
  • 20. 20 October 14, 2022 Empirical Study Process  Without a sound and proven research process, it is difficult to carry out efficient and effective research.  Thus, a research methodology must be complete and repeatable, which, when followed, in a replicated or empirical study, will enable comparisons to be made across various studies
  • 22. 22 October 14, 2022 Empirical Study Process Study Definition Scope: What are the dimensions of the study? Motivation: Why is it being studied? (e.g. to analyze and assess the capability of a technique or method) Object: What entity is being studied? (e.g. process, product, technique being studied) Purpose: What is the aim of the study? (e.g. to prove that technique A is superior to technique B) Perspective: From whose view is the study being conducted (e.g. project manager, customer)? Domain: What is the area of the study?
  • 23. 23 October 14, 2022 Empirical Study Process Experiment Design Research Questions  The first step is to formulate the research problem. This step states the problem in the form of questions and identifies the main concepts and relations to be explored. RQ1: What will be the effect of software metrics on quality attributes (such as fault proneness/testing effort/maintenance effort) of a class? RQ2. Are machine-learning methods adaptable to object-oriented systems for predicting quality attributes?
  • 24. 24 October 14, 2022 Empirical Study Process Experiment Design Independent and Dependent Variables  To analyze relationships, the next step is to define the dependent and the independent variables.  The outcome variable predicted by the independent variables is called the dependent variable (response variable). For instance, the dependent variables of the models chosen for analysis may be fault proneness, testing effort, and maintenance effort.  A variable used to predict or estimate a dependent variable is called the independent (explanatory or predictor) variable.
  • 25. 25 October 14, 2022 Empirical Study Process Experiment Design Hypothesis Formulation  The researcher should carefully state the hypothesis to be tested in the study.  The hypothesis is tested on the sample data. On the basis of the result from the sample, a decision concerning the validity of the hypothesis (acceptance or rejection) is made.  Consider an example where a hypothesis is to be formed for comparing a number of methods for predicting fault- prone classes. H–M: M outperforms the compared methods for predicting fault-prone software classes Null hypothesis: M does not outperform the compared methods for predicting fault prone software classes.
  • 26. 26 October 14, 2022 Empirical Study Process Experiment Design Empirical Data Collection  e.g. university/academic systems, commercial systems, or open source software Empirical Methods  The data analysis techniques are selected based on the type of the dependent variables used  Statistical techniques: linear regression and logistic regression  Machine-learning techniques: decision trees, support vector machines, and so on
  • 27. 27 October 14, 2022 Research Conduct and Analysis Research Conduct and Analysis Descriptive Statistics  Descriptive statistics are brief informational coefficients that summarize a given data set, which can be either a representation of the entire population or a sample of a population.  Descriptive statistics are broken down into measures of central tendency and measures of variability (spread). Measures of central tendency include the mean, median, and mode Measures of variability include standard deviation, variance, minimum and maximum variables
  • 28. 28 October 14, 2022 Empirical Study Process Research Conduct and Analysis Attribute Reduction  Feature selection is an important step that identifies and removes as much of the irrelevant and redundant information as possible.  The dimensionality of the data reduces the size of the hypothesis space and allows the methods to operate faster and more effectively
  • 29. 29 October 14, 2022 Research Conduct and Analysis Research Conduct and Analysis Statistical Analysis Following steps are used in analysis  Model Prediction Multivariate analysis is used to find the combined effect of each independent variable on the dependent variable. Based on the results of performance measures, the performance of models predicted is evaluated and the results are interpreted.  Model Validation The validation data is used for validating the model predicted in the previous step  Hypothesis Testing It determines whether the null hypothesis can be rejected at a specified confidence level
  • 30. 30 October 14, 2022 Research Conduct and Analysis Result Interpretation In this step, the results computed in the empirical study’s analysis phase are assessed and discussed. The reason behind the acceptance or rejection of the hypothesis is examined. This process provides insight to the researchers about the actual reasons of the decision made for hypothesis. The conclusions are derived from the results obtained in the study. The significance and practical relevance of the results are defined in this phase. The limitations of the study are also reported in the form of threats to validity.
  • 31. 31 October 14, 2022 Research Conduct and Analysis Reporting Report is generated after the study has been conducted and results are finalized To be prepared from readers’ perspective Clearly document the: background, motivation, analysis, design, results, and the discussion of the results The experiment settings, data-collection methods, and design processes must be reported in significant level of detail The audience should be able to replicate or repeat the results of a study in a similar context
  • 32. 32 October 14, 2022 Characteristics of a Good Empirical Study  Clear  research goals, hypothesis, and data-collection procedure  Descriptive  should help in repeating and replication  Precise  Should help improve confidence  Valid  Should be valid for a wider population  Unbiased  No influence of any sort of factors to get the desired results (selection of participants)  Control  Experiment design should be able to control the independent variables to minimize confounding effects of other variables  Replicable  Repeating the experiment with different data under same conditions  Repeatable  Reproducing the results of the study under similar settings
  • 33. 33 October 14, 2022 Ethics of Empirical Research  Researchers, academicians, and sponsors should be aware of research ethics while conducting and funding empirical research in software engineering.  The upholding of ethical standards helps to develop trust between the researcher and the participant, and thus smoothens the research process.  An unethical study can harm the reputation of the research conducted in software engineering area  In some countries like the United States, the sponsoring agency requires that the research involving participants must be reviewed by a third-party ethics committee to verify that the research complies with the ethical principles and standards
  • 34. 34 October 14, 2022 Ethics of Empirical Research Examples of Unethical Research
  • 35. 35 October 14, 2022 Ethics of Empirical Research Informed consent Five elements— disclosure, comprehension, voluntariness, consent, and right to withdraw Consent form— Research title, contact details, consent and comprehension, withdrawal, confidentiality, risks and benefits, clarification, signatures Scientific value Confidentiality Beneficence
  • 36. 36 October 14, 2022 Basic Elements of Empirical Research
  • 37. 37 October 14, 2022 Terminologies Software quality and software evolution Software quality determines how well the software is designed (quality of design), and how well the software conforms to that design (quality of conformance). Software evolution (maintenance) involves making changes in the software. Changes are required because of the following reasons:  Defects reported by the customer  New functionality requested by the customer  Improvement in the quality of the software  To adapt to new platforms
  • 39. 39 October 14, 2022 Terminologies Descriptive, Correlational, and Cause–Effect Research Descriptive research provides description of concepts. Correlational research provides relation between two variables. Cause–effect research is similar to experiment research in that the effect of one variable on another is found.
  • 40. 40 October 14, 2022 Terminologies Classification and Prediction Classification predicts categorical outcome variables The training data is used for model development, and the model can be used for predicting unknown categories of outcome variables.
  • 41. 41 October 14, 2022 Terminologies  Qualitative and quantitative data  Dependent, independent and confounding variables  Parametric and non-parametric tests These tests are applied to determine the validity of the research hypothesis Parametric tests are used for data samples having normal distribution (bell-shaped curve) Nonparametric tests are used when the distribution of data samples is highly skewed (left, right skewed)
  • 42. 42 October 14, 2022 Terminologies Goal Question Metric (GQM) GQM method follows top-down approach for dividing goals into questions and mapping questions to metrics, and follows bottom-up approach by interpreting the measurement to verify whether the goals have been satisfied.  Goal: Improve the defect-correction rate in the system.  Question: How many defects have been corrected in the maintenance phase?  Metric: Number of defects corrected/Number of defects reported.
  • 43. 43 October 14, 2022 Terminologies  Software Archives or Repositories The progress of the software is managed using software repositories that include source code, documentation, archived communications, and defect-tracking systems.  For maintaining software systems, improving software quality, and empirical validation of data and techniques. Researchers can mine these repositories to understand the software development, software evolution, and make predictions. For example, defects can be predicted using historical data, and this information can be used to produce less defective future releases. The data is kept in various types of software repositories such as CVS, Git, SVN, ClearCase, Perforce, Mercurial, Veracity, and Fossil.
  • 44. 44 October 14, 2022 Source Malhotra, Ruchika. Empirical research in software engineering: concepts, analysis, and applications. CRC press, 2016.

Editor's Notes

  1. 1