SlideShare a Scribd company logo
Empirical Software Engineering (MSE 507 )
Dr. Assad Abbas
Department of Computer Science
COMSATS Institute of Information Technology, Islamabad
assadabbas@comsats.edu.pk
2
October 14, 2022
Introduction
 Engineering requires the development of the product in a scientific,
well formed, and systematic manner
 Software Engineering (IEEE definition)
The application of a systematic, disciplined, quantifiable approach to
the development, operation and maintenance of software, and the
study of these approaches, that is, the application of engineering to
software.
 Empirical
“Empirical” is typically used to define any statement about the world
that is related to observation or experience.
3
October 14, 2022
What is Empirical Software Engineering?
 Empirical Software Engineering
emphasizes the use of empirical methods in the field of
software engineering.
 involves methods for evaluating, assessing, predicting,
monitoring, and controlling the existing artifacts of
software development.
applies quantitative methods to the software engineering
phenomenon to understand software development better.
 Gaining importance because of the availability of vast
data sets from open source repositories that contain
information about software requirements, bugs, and
changes
4
October 14, 2022
Overview of Empirical Studies
 Empirical study (ES) is an attempt to compare the theories
and observations using real-life data for analysis
 ES utilize data analysis and statistical methods for exploring
relationships
 Benefits of ES
Allow to explore relationships
Allow to prove theoretical concepts
Allow to evaluate accuracy of the models
Allow to choose among tools and techniques
Allow to establish quality benchmarks across software
organizations
Allow to assess and improve techniques and methods
5
October 14, 2022
Empirical Studies in Software Engineering
 Empirical studies are important in the area of software
engineering as they allow:
software professionals to evaluate and assess the new
concepts, technologies, tools, and techniques in scientific and
proved manner.
improving, managing, and controlling the existing processes
and techniques by using evidence obtained from the empirical
analysis.
to gather evidence that can be used to support the claims of
efficiency of a given technique or technology
 The empirical information can help software management in
decision making and improving software processes
6
October 14, 2022
Steps in Empirical Studies
Formation of research questions
Formation of a research hypothesis
Gathering data
Analyzing the data
Developing and validating models
Deriving conclusions from the obtained results
7
October 14, 2022
Qualitative and Quantitative Research
Empirical studies can be classified as:
Quantitative &
Qualitative
Quantitative Research
The most widely used scientific method in SE that applies
mathematical or statistical-based methods to derive conclusions.
Is used to prove or disprove a hypothesis (a concept that has to be
tested for further investigation).
The aim of a quantitative research is to generate results that are
generalizable and unbiased and thus can be applied to a larger
population in research.
It uses statistical methods to validate a hypothesis and to explore
causal relationships.
Common quantitative methods include experiments, observations
recorded as numbers, and surveys with closed-ended questions
8
October 14, 2022
Qualitative and Quantitative Research
Qualitative Research
In qualitative research, the researchers study human behavior,
preferences, and nature.
Provides an in-depth analysis of the concept under
investigation and thus uses focused data for research.
Understanding a new process or technique in software
engineering is an example of qualitative research.
Qualitative research provides textual descriptions or pictures
related to human beliefs or behavior.
It can be extended to other studies with similar populations but
generalizations of a particular phenomenon may be difficult.
Common qualitative methods include interviews with open-
ended questions, observations described in words, and
literature reviews that explore concepts and theories.
9
October 14, 2022
Qualitative and Quantitative Research
10
October 14, 2022
Types of Empirical Studies
Experiments
Case studies
Surveys
Systematic Reviews
11
October 14, 2022
Types of Empirical Studies
Experiments
An experimental study tests the established hypothesis by finding the
effect of variables of interest (independent variables) on the outcome
variable (dependent variable) using statistical analysis.
If the experiment is carried out correctly, the hypothesis is either
accepted or rejected.
 For example, one group uses inspections and the other group uses
walkthroughs to identify design defects, which technique is more effective
in detecting a larger number of defects?
The experiment must also control the confounding variables, which
may affect the accuracy of the results produced by the experiment.
The experiments are carried out in a controlled environment and often
referred to as controlled experiments
The key factors involved in the experiments are: independent
variables, dependent variables, hypothesis, and statistical techniques
12
October 14, 2022
Types of Empirical Studies
Experiments
A controlled experiment involves varying the
variables (one or more) and keeping everything else
constant or the same and are usually conducted in
small or laboratory setting
Example: As discussed in previous slide (inspections
and walkthroughs)
13
October 14, 2022
Types of Empirical Studies
Case Study
Case study research represents real-world scenarios and
involves studying a particular phenomenon
For example: to evaluate a particular software tool,
method, or process
The effect of a change in an organization can be studied
using case study research
Case studies are appropriate where a phenomenon is to
be studied for a longer period of time so that the effects of
the phenomenon can be observed.
The disadvantages of case studies include difficulty in
generalization as they represent a typical situation.
Since they are based on a particular case, the validity of
the results is questionable.
14
October 14, 2022
Types of Empirical Studies
Case Study
Phases in Case Study
The case study design phase involves identifying existing
objectives, cases, research questions, and data-collection
strategies.
The case may be a tool, technology, technique, process,
product, individual, or software. Qualitative data is usually
collected in a case study.
The sources include interviews, group discussions, or
observations. The data may be directly or indirectly collected
from participants..
15
October 14, 2022
Types of Empirical Studies
Survey Research
Survey research identifies features or information from a
large scale of a population.
For example, surveys can be used when a researcher
wants to know whether the use of a particular process has
improved the view of clients toward software usability
features. This information can be obtained by asking the
selected software testers to fill questionnaires.
Surveys are usually conducted using questionnaires and
interviews.
The questionnaires are constructed to collect research-
related information.
Surveys can be descriptive, explorative, and
explanatory.
16
October 14, 2022
Types of Empirical Studies
Survey Research
Descriptive survey research—describes a concept or topic in
more detail
Explorative—focuses on the discovery of new ideas and
insights and are usually conducted at the beginning of a
research study to gather initial information
Explanatory—attempts to explain how things work in
connections like cause and effect, meaning a researcher wants
to explain how things interact or work with each other
For example, while exploring relationship between various
independent variables and an outcome variable, a researcher
may want to explain why an independent variable affects the
outcome variable.
17
October 14, 2022
Types of Empirical Studies
Systematic Reviews
Literature review is an important part that examines the
existing position of literature in an area in which the
research is being conducted.
The systematic reviews are methodically undertaken with
a specific search strategy and well-defined methodology
to answer a set of questions
Kitchenham (2007) defines systematic review as:
 A form of secondary study that uses a well-defined
methodology to identify, analyze and interpret all available
evidence related to a specific research question in a way
that is unbiased and (to a degree) repeatable.
The purpose of a systematic review is to summarize the
existing research and provide future guidelines for
research by identifying gaps in the existing literature
18
October 14, 2022
Types of Empirical Studies
Systematic Reviews
A systematic review involves:
 Defining research questions.
 Forming and documenting a search strategy.
 Determining inclusion and exclusion criteria.
 Establishing quality assessment criteria.
Steps in systematic revies
19
October 14, 2022
Empirical vs. Experimental Studies
Are the two same?
Used interchangeably, but there is some difference
In experimental software engineering, the dependent
variable is closely observed in a controlled environment.
Empirical studies are used to define anything related to
observation and experience and are valuable as these
studies consider real-world data.
In experimental studies, data is artificial or synthetic but is
more controlled.
20
October 14, 2022
Empirical Study Process
 Without a sound and proven research process, it is difficult to
carry out efficient and effective research.
 Thus, a research methodology must be complete and
repeatable, which, when followed, in a replicated or empirical
study, will enable comparisons to be made across various
studies
21
October 14, 2022
Empirical Study Process
22
October 14, 2022
Empirical Study Process
Study Definition
Scope: What are the dimensions of the study?
Motivation: Why is it being studied? (e.g. to analyze
and assess the capability of a technique or method)
Object: What entity is being studied? (e.g. process,
product, technique being studied)
Purpose: What is the aim of the study? (e.g. to prove
that technique A is superior to technique B)
Perspective: From whose view is the study being
conducted (e.g. project manager, customer)?
Domain: What is the area of the study?
23
October 14, 2022
Empirical Study Process
Experiment Design
Research Questions
 The first step is to formulate the research problem. This
step states the problem in the form of questions and
identifies the main concepts and relations to be
explored.
RQ1: What will be the effect of software metrics on
quality attributes (such as fault proneness/testing
effort/maintenance effort) of a class?
RQ2. Are machine-learning methods adaptable to
object-oriented systems for predicting quality
attributes?
24
October 14, 2022
Empirical Study Process
Experiment Design
Independent and Dependent Variables
 To analyze relationships, the next step is to define the
dependent and the independent variables.
 The outcome variable predicted by the independent
variables is called the dependent variable (response
variable).
For instance, the dependent variables of the models
chosen for analysis may be fault proneness, testing
effort, and maintenance effort.
 A variable used to predict or estimate a dependent
variable is called the independent (explanatory or
predictor) variable.
25
October 14, 2022
Empirical Study Process
Experiment Design
Hypothesis Formulation
 The researcher should carefully state the hypothesis to
be tested in the study.
 The hypothesis is tested on the sample data. On the
basis of the result from the sample, a decision concerning
the validity of the hypothesis (acceptance or rejection) is
made.
 Consider an example where a hypothesis is to be formed
for comparing a number of methods for predicting fault-
prone classes.
H–M: M outperforms the compared methods for predicting
fault-prone software classes
Null hypothesis: M does not outperform the compared
methods for predicting fault prone software classes.
26
October 14, 2022
Empirical Study Process
Experiment Design
Empirical Data Collection
 e.g. university/academic systems, commercial
systems, or open source software
Empirical Methods
 The data analysis techniques are selected based on
the type of the dependent variables used
 Statistical techniques: linear regression and logistic
regression
 Machine-learning techniques: decision trees, support
vector machines, and so on
27
October 14, 2022
Research Conduct and Analysis
Research Conduct and Analysis
Descriptive Statistics
 Descriptive statistics are brief informational coefficients
that summarize a given data set, which can be either a
representation of the entire population or a sample of a
population.
 Descriptive statistics are broken down into measures of
central tendency and measures of variability (spread).
Measures of central tendency include the mean,
median, and mode
Measures of variability include standard deviation,
variance, minimum and maximum variables
28
October 14, 2022
Empirical Study Process
Research Conduct and Analysis
Attribute Reduction
 Feature selection is an important step that identifies
and removes as much of the irrelevant and redundant
information as possible.
 The dimensionality of the data reduces the size of the
hypothesis space and allows the methods to operate
faster and more effectively
29
October 14, 2022
Research Conduct and Analysis
Research Conduct and Analysis
Statistical Analysis
Following steps are used in analysis
 Model Prediction
Multivariate analysis is used to find the combined effect of
each independent variable on the dependent variable.
Based on the results of performance measures, the
performance of models predicted is evaluated and the
results are interpreted.
 Model Validation
The validation data is used for validating the model
predicted in the previous step
 Hypothesis Testing
It determines whether the null hypothesis can be rejected
at a specified confidence level
30
October 14, 2022
Research Conduct and Analysis
Result Interpretation
In this step, the results computed in the empirical study’s
analysis phase are assessed and discussed.
The reason behind the acceptance or rejection of the
hypothesis is examined.
This process provides insight to the researchers about the
actual reasons of the decision made for hypothesis.
The conclusions are derived from the results obtained in
the study.
The significance and practical relevance of the results are
defined in this phase.
The limitations of the study are also reported in the form
of threats to validity.
31
October 14, 2022
Research Conduct and Analysis
Reporting
Report is generated after the study has been
conducted and results are finalized
To be prepared from readers’ perspective
Clearly document the: background, motivation,
analysis, design, results, and the discussion of the
results
The experiment settings, data-collection methods,
and design processes must be reported in significant
level of detail
The audience should be able to replicate or repeat
the results of a study in a similar context
32
October 14, 2022
Characteristics of a Good Empirical Study
 Clear
 research goals, hypothesis, and data-collection procedure
 Descriptive
 should help in repeating and replication
 Precise
 Should help improve confidence
 Valid
 Should be valid for a wider population
 Unbiased
 No influence of any sort of factors to get the desired results (selection of
participants)
 Control
 Experiment design should be able to control the independent variables to
minimize confounding effects of other variables
 Replicable
 Repeating the experiment with different data under same conditions
 Repeatable
 Reproducing the results of the study under similar settings
33
October 14, 2022
Ethics of Empirical Research
 Researchers, academicians, and sponsors should be aware of research
ethics while conducting and funding empirical research in software
engineering.
 The upholding of ethical standards helps to develop trust between the
researcher and the participant, and thus smoothens the research process.
 An unethical study can harm the reputation of the research conducted in
software engineering area
 In some countries like the United States, the sponsoring agency requires
that the research involving participants must be reviewed by a third-party
ethics committee to verify that the research complies with the ethical
principles and standards
34
October 14, 2022
Ethics of Empirical Research
Examples of Unethical Research
35
October 14, 2022
Ethics of Empirical Research
Informed consent
Five elements— disclosure, comprehension,
voluntariness, consent, and right to withdraw
Consent form— Research title, contact details,
consent and comprehension, withdrawal,
confidentiality, risks and benefits, clarification,
signatures
Scientific value
Confidentiality
Beneficence
36
October 14, 2022
Basic Elements of Empirical Research
37
October 14, 2022
Terminologies
Software quality and software evolution
Software quality determines how well the software is
designed (quality of design), and how well the
software conforms to that design (quality of
conformance).
Software evolution (maintenance) involves making
changes in the software. Changes are required
because of the following reasons:
 Defects reported by the customer
 New functionality requested by the customer
 Improvement in the quality of the software
 To adapt to new platforms
38
October 14, 2022
Terminologies
Software Quality Attributes
39
October 14, 2022
Terminologies
Descriptive, Correlational, and Cause–Effect
Research
Descriptive research provides description of
concepts.
Correlational research provides relation between two
variables.
Cause–effect research is similar to experiment
research in that the effect of one variable on another
is found.
40
October 14, 2022
Terminologies
Classification and Prediction
Classification predicts categorical outcome variables
The training data is used for model development, and
the model can be used for predicting unknown
categories of outcome variables.
41
October 14, 2022
Terminologies
 Qualitative and quantitative data
 Dependent, independent and confounding variables
 Parametric and non-parametric tests
These tests are applied to determine the validity of the
research hypothesis
Parametric tests are used for data samples having normal
distribution (bell-shaped curve)
Nonparametric tests are used when the distribution of
data samples is highly skewed (left, right skewed)
42
October 14, 2022
Terminologies
Goal Question Metric (GQM)
GQM method follows top-down approach for dividing
goals into questions and mapping questions to
metrics, and follows bottom-up approach by
interpreting the measurement to verify whether the
goals have been satisfied.
 Goal: Improve the defect-correction rate in the system.
 Question: How many defects have been corrected in
the maintenance phase?
 Metric: Number of defects corrected/Number of defects
reported.
43
October 14, 2022
Terminologies
 Software Archives or Repositories
The progress of the software is managed using software
repositories that include source code, documentation,
archived communications, and defect-tracking systems.
 For maintaining software systems, improving software quality, and
empirical validation of data and techniques.
Researchers can mine these repositories to understand
the software development, software evolution, and make
predictions. For example, defects can be predicted using
historical data, and this information can be used to
produce less defective future releases.
The data is kept in various types of software repositories
such as CVS, Git, SVN, ClearCase, Perforce, Mercurial,
Veracity, and Fossil.
44
October 14, 2022
Source
Malhotra, Ruchika. Empirical research in software
engineering: concepts, analysis, and applications.
CRC press, 2016.

More Related Content

What's hot

What is Machine Learning | Introduction to Machine Learning | Machine Learnin...
What is Machine Learning | Introduction to Machine Learning | Machine Learnin...What is Machine Learning | Introduction to Machine Learning | Machine Learnin...
What is Machine Learning | Introduction to Machine Learning | Machine Learnin...
Simplilearn
 
Data Mining: Association Rules Basics
Data Mining: Association Rules BasicsData Mining: Association Rules Basics
Data Mining: Association Rules Basics
Benazir Income Support Program (BISP)
 
Low-cost convolutional neural network for tomato plant diseases classification
Low-cost convolutional neural network for tomato plant diseases classificationLow-cost convolutional neural network for tomato plant diseases classification
Low-cost convolutional neural network for tomato plant diseases classification
IAESIJAI
 
Machine Learning and Real-World Applications
Machine Learning and Real-World ApplicationsMachine Learning and Real-World Applications
Machine Learning and Real-World Applications
MachinePulse
 
linear classification
linear classificationlinear classification
linear classification
nep_test_account
 
Machine Learning Interview Questions and Answers | Machine Learning Interview...
Machine Learning Interview Questions and Answers | Machine Learning Interview...Machine Learning Interview Questions and Answers | Machine Learning Interview...
Machine Learning Interview Questions and Answers | Machine Learning Interview...
Edureka!
 
Machine Learning
Machine LearningMachine Learning
Machine Learning
Rahul Kumar
 
K-Nearest Neighbor Classifier
K-Nearest Neighbor ClassifierK-Nearest Neighbor Classifier
K-Nearest Neighbor Classifier
Neha Kulkarni
 
Data Science Full Course | Edureka
Data Science Full Course | EdurekaData Science Full Course | Edureka
Data Science Full Course | Edureka
Edureka!
 
KNN Algorithm - How KNN Algorithm Works With Example | Data Science For Begin...
KNN Algorithm - How KNN Algorithm Works With Example | Data Science For Begin...KNN Algorithm - How KNN Algorithm Works With Example | Data Science For Begin...
KNN Algorithm - How KNN Algorithm Works With Example | Data Science For Begin...
Simplilearn
 
NAIVE BAYES CLASSIFIER
NAIVE BAYES CLASSIFIERNAIVE BAYES CLASSIFIER
NAIVE BAYES CLASSIFIER
Knoldus Inc.
 
APRIORI ALGORITHM -PPT.pptx
APRIORI ALGORITHM -PPT.pptxAPRIORI ALGORITHM -PPT.pptx
APRIORI ALGORITHM -PPT.pptx
SABITHARASSISTANTPRO
 
Anomaly detection with machine learning at scale
Anomaly detection with machine learning at scaleAnomaly detection with machine learning at scale
Anomaly detection with machine learning at scale
Impetus Technologies
 
Neural Networks: Principal Component Analysis (PCA)
Neural Networks: Principal Component Analysis (PCA)Neural Networks: Principal Component Analysis (PCA)
Neural Networks: Principal Component Analysis (PCA)
Mostafa G. M. Mostafa
 
Scikit Learn Tutorial | Machine Learning with Python | Python for Data Scienc...
Scikit Learn Tutorial | Machine Learning with Python | Python for Data Scienc...Scikit Learn Tutorial | Machine Learning with Python | Python for Data Scienc...
Scikit Learn Tutorial | Machine Learning with Python | Python for Data Scienc...
Edureka!
 
Naive Bayes Classifier
Naive Bayes ClassifierNaive Bayes Classifier
Naive Bayes Classifier
Arunabha Saha
 
Intro to Model Selection
Intro to Model SelectionIntro to Model Selection
Intro to Model Selectionchenhm
 
Machine Learning
Machine LearningMachine Learning
Machine Learning
Anastasia Jakubow
 

What's hot (20)

What is Machine Learning | Introduction to Machine Learning | Machine Learnin...
What is Machine Learning | Introduction to Machine Learning | Machine Learnin...What is Machine Learning | Introduction to Machine Learning | Machine Learnin...
What is Machine Learning | Introduction to Machine Learning | Machine Learnin...
 
Data Mining: Association Rules Basics
Data Mining: Association Rules BasicsData Mining: Association Rules Basics
Data Mining: Association Rules Basics
 
Low-cost convolutional neural network for tomato plant diseases classification
Low-cost convolutional neural network for tomato plant diseases classificationLow-cost convolutional neural network for tomato plant diseases classification
Low-cost convolutional neural network for tomato plant diseases classification
 
Machine Learning and Real-World Applications
Machine Learning and Real-World ApplicationsMachine Learning and Real-World Applications
Machine Learning and Real-World Applications
 
linear classification
linear classificationlinear classification
linear classification
 
Machine Learning Interview Questions and Answers | Machine Learning Interview...
Machine Learning Interview Questions and Answers | Machine Learning Interview...Machine Learning Interview Questions and Answers | Machine Learning Interview...
Machine Learning Interview Questions and Answers | Machine Learning Interview...
 
Machine Learning
Machine LearningMachine Learning
Machine Learning
 
K-Nearest Neighbor Classifier
K-Nearest Neighbor ClassifierK-Nearest Neighbor Classifier
K-Nearest Neighbor Classifier
 
Data Science Full Course | Edureka
Data Science Full Course | EdurekaData Science Full Course | Edureka
Data Science Full Course | Edureka
 
KNN Algorithm - How KNN Algorithm Works With Example | Data Science For Begin...
KNN Algorithm - How KNN Algorithm Works With Example | Data Science For Begin...KNN Algorithm - How KNN Algorithm Works With Example | Data Science For Begin...
KNN Algorithm - How KNN Algorithm Works With Example | Data Science For Begin...
 
NAIVE BAYES CLASSIFIER
NAIVE BAYES CLASSIFIERNAIVE BAYES CLASSIFIER
NAIVE BAYES CLASSIFIER
 
APRIORI ALGORITHM -PPT.pptx
APRIORI ALGORITHM -PPT.pptxAPRIORI ALGORITHM -PPT.pptx
APRIORI ALGORITHM -PPT.pptx
 
Anomaly detection with machine learning at scale
Anomaly detection with machine learning at scaleAnomaly detection with machine learning at scale
Anomaly detection with machine learning at scale
 
Neural Networks: Principal Component Analysis (PCA)
Neural Networks: Principal Component Analysis (PCA)Neural Networks: Principal Component Analysis (PCA)
Neural Networks: Principal Component Analysis (PCA)
 
Scikit Learn Tutorial | Machine Learning with Python | Python for Data Scienc...
Scikit Learn Tutorial | Machine Learning with Python | Python for Data Scienc...Scikit Learn Tutorial | Machine Learning with Python | Python for Data Scienc...
Scikit Learn Tutorial | Machine Learning with Python | Python for Data Scienc...
 
Naive Bayes Classifier
Naive Bayes ClassifierNaive Bayes Classifier
Naive Bayes Classifier
 
Naive bayes
Naive bayesNaive bayes
Naive bayes
 
Intro to Model Selection
Intro to Model SelectionIntro to Model Selection
Intro to Model Selection
 
Knn
KnnKnn
Knn
 
Machine Learning
Machine LearningMachine Learning
Machine Learning
 

Similar to Lecture 2.pptx

Lecture 3.pptx
Lecture 3.pptxLecture 3.pptx
Lecture 3.pptx
assadabbas22
 
A Standardized Case Study Framework and Methodology to Identify quot Best Pr...
A Standardized Case Study Framework and Methodology to Identify  quot Best Pr...A Standardized Case Study Framework and Methodology to Identify  quot Best Pr...
A Standardized Case Study Framework and Methodology to Identify quot Best Pr...
Carrie Tran
 
BRM_Research Design.ppt
BRM_Research Design.pptBRM_Research Design.ppt
BRM_Research Design.ppt
AbdifatahAhmedHurre
 
Overview of research methodology.pptx
Overview of research methodology.pptxOverview of research methodology.pptx
Overview of research methodology.pptx
YogeshPatel28169
 
Chapter 7 (research design)
Chapter 7 (research design)Chapter 7 (research design)
Chapter 7 (research design)
BoreyThai1
 
Survey Research article.pdf
Survey Research article.pdfSurvey Research article.pdf
Survey Research article.pdf
AyeshaMalickk1
 
quantitative-research.pptx
quantitative-research.pptxquantitative-research.pptx
quantitative-research.pptx
cjoypingaron
 
Research Design
Research DesignResearch Design
Research Design
Jignesh Kariya
 
Research Design, Process of research with examples
Research Design, Process of research with examplesResearch Design, Process of research with examples
Research Design, Process of research with examples
Dr. Anita Rathod
 
Research Methodology.pptx
Research Methodology.pptxResearch Methodology.pptx
Research Methodology.pptx
DrKusumGaur
 
Research methodology (2)
Research methodology (2)Research methodology (2)
Research methodology (2)
Sandeep Soni Kanpur
 
MIS 49100 Week 3 Research Methodology
MIS 49100 Week 3 Research MethodologyMIS 49100 Week 3 Research Methodology
MIS 49100 Week 3 Research Methodology
oudesign
 
Validity in Research
Validity in ResearchValidity in Research
Validity in Research
Ecem Ekinci
 
Research methodology presentation
Research methodology presentationResearch methodology presentation
Business research methods
Business  research methodsBusiness  research methods
Business research methods
Hanumant singh
 
Research method in operations management final
Research method in  operations management finalResearch method in  operations management final
Research method in operations management finalindonesiabelajar
 
MethodsofDataCollection.pdf
MethodsofDataCollection.pdfMethodsofDataCollection.pdf
MethodsofDataCollection.pdf
MohdTaufiqIshak
 
Research Design _komal-1.docx
Research Design _komal-1.docxResearch Design _komal-1.docx
Research Design _komal-1.docx
PREETICHOUDHARY100
 

Similar to Lecture 2.pptx (20)

Lecture 3.pptx
Lecture 3.pptxLecture 3.pptx
Lecture 3.pptx
 
A Standardized Case Study Framework and Methodology to Identify quot Best Pr...
A Standardized Case Study Framework and Methodology to Identify  quot Best Pr...A Standardized Case Study Framework and Methodology to Identify  quot Best Pr...
A Standardized Case Study Framework and Methodology to Identify quot Best Pr...
 
BRM_Research Design.ppt
BRM_Research Design.pptBRM_Research Design.ppt
BRM_Research Design.ppt
 
Overview of research methodology.pptx
Overview of research methodology.pptxOverview of research methodology.pptx
Overview of research methodology.pptx
 
Chapter 7 (research design)
Chapter 7 (research design)Chapter 7 (research design)
Chapter 7 (research design)
 
Survey Research article.pdf
Survey Research article.pdfSurvey Research article.pdf
Survey Research article.pdf
 
quantitative-research.pptx
quantitative-research.pptxquantitative-research.pptx
quantitative-research.pptx
 
Research Design
Research DesignResearch Design
Research Design
 
Research Design, Process of research with examples
Research Design, Process of research with examplesResearch Design, Process of research with examples
Research Design, Process of research with examples
 
Research Methodology.pptx
Research Methodology.pptxResearch Methodology.pptx
Research Methodology.pptx
 
Research methodology (2)
Research methodology (2)Research methodology (2)
Research methodology (2)
 
MIS 49100 Week 3 Research Methodology
MIS 49100 Week 3 Research MethodologyMIS 49100 Week 3 Research Methodology
MIS 49100 Week 3 Research Methodology
 
De carlo rizk 2010 icelw
De carlo rizk 2010 icelwDe carlo rizk 2010 icelw
De carlo rizk 2010 icelw
 
Ijetcas14 438
Ijetcas14 438Ijetcas14 438
Ijetcas14 438
 
Validity in Research
Validity in ResearchValidity in Research
Validity in Research
 
Research methodology presentation
Research methodology presentationResearch methodology presentation
Research methodology presentation
 
Business research methods
Business  research methodsBusiness  research methods
Business research methods
 
Research method in operations management final
Research method in  operations management finalResearch method in  operations management final
Research method in operations management final
 
MethodsofDataCollection.pdf
MethodsofDataCollection.pdfMethodsofDataCollection.pdf
MethodsofDataCollection.pdf
 
Research Design _komal-1.docx
Research Design _komal-1.docxResearch Design _komal-1.docx
Research Design _komal-1.docx
 

Recently uploaded

How to Position Your Globus Data Portal for Success Ten Good Practices
How to Position Your Globus Data Portal for Success Ten Good PracticesHow to Position Your Globus Data Portal for Success Ten Good Practices
How to Position Your Globus Data Portal for Success Ten Good Practices
Globus
 
Orion Context Broker introduction 20240604
Orion Context Broker introduction 20240604Orion Context Broker introduction 20240604
Orion Context Broker introduction 20240604
Fermin Galan
 
BoxLang: Review our Visionary Licenses of 2024
BoxLang: Review our Visionary Licenses of 2024BoxLang: Review our Visionary Licenses of 2024
BoxLang: Review our Visionary Licenses of 2024
Ortus Solutions, Corp
 
Top 7 Unique WhatsApp API Benefits | Saudi Arabia
Top 7 Unique WhatsApp API Benefits | Saudi ArabiaTop 7 Unique WhatsApp API Benefits | Saudi Arabia
Top 7 Unique WhatsApp API Benefits | Saudi Arabia
Yara Milbes
 
Top Features to Include in Your Winzo Clone App for Business Growth (4).pptx
Top Features to Include in Your Winzo Clone App for Business Growth (4).pptxTop Features to Include in Your Winzo Clone App for Business Growth (4).pptx
Top Features to Include in Your Winzo Clone App for Business Growth (4).pptx
rickgrimesss22
 
Navigating the Metaverse: A Journey into Virtual Evolution"
Navigating the Metaverse: A Journey into Virtual Evolution"Navigating the Metaverse: A Journey into Virtual Evolution"
Navigating the Metaverse: A Journey into Virtual Evolution"
Donna Lenk
 
Gamify Your Mind; The Secret Sauce to Delivering Success, Continuously Improv...
Gamify Your Mind; The Secret Sauce to Delivering Success, Continuously Improv...Gamify Your Mind; The Secret Sauce to Delivering Success, Continuously Improv...
Gamify Your Mind; The Secret Sauce to Delivering Success, Continuously Improv...
Shahin Sheidaei
 
Large Language Models and the End of Programming
Large Language Models and the End of ProgrammingLarge Language Models and the End of Programming
Large Language Models and the End of Programming
Matt Welsh
 
A Sighting of filterA in Typelevel Rite of Passage
A Sighting of filterA in Typelevel Rite of PassageA Sighting of filterA in Typelevel Rite of Passage
A Sighting of filterA in Typelevel Rite of Passage
Philip Schwarz
 
In 2015, I used to write extensions for Joomla, WordPress, phpBB3, etc and I ...
In 2015, I used to write extensions for Joomla, WordPress, phpBB3, etc and I ...In 2015, I used to write extensions for Joomla, WordPress, phpBB3, etc and I ...
In 2015, I used to write extensions for Joomla, WordPress, phpBB3, etc and I ...
Juraj Vysvader
 
AI Pilot Review: The World’s First Virtual Assistant Marketing Suite
AI Pilot Review: The World’s First Virtual Assistant Marketing SuiteAI Pilot Review: The World’s First Virtual Assistant Marketing Suite
AI Pilot Review: The World’s First Virtual Assistant Marketing Suite
Google
 
GraphSummit Paris - The art of the possible with Graph Technology
GraphSummit Paris - The art of the possible with Graph TechnologyGraphSummit Paris - The art of the possible with Graph Technology
GraphSummit Paris - The art of the possible with Graph Technology
Neo4j
 
Enterprise Software Development with No Code Solutions.pptx
Enterprise Software Development with No Code Solutions.pptxEnterprise Software Development with No Code Solutions.pptx
Enterprise Software Development with No Code Solutions.pptx
QuickwayInfoSystems3
 
Custom Healthcare Software for Managing Chronic Conditions and Remote Patient...
Custom Healthcare Software for Managing Chronic Conditions and Remote Patient...Custom Healthcare Software for Managing Chronic Conditions and Remote Patient...
Custom Healthcare Software for Managing Chronic Conditions and Remote Patient...
Mind IT Systems
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
Safe Software
 
Pro Unity Game Development with C-sharp Book
Pro Unity Game Development with C-sharp BookPro Unity Game Development with C-sharp Book
Pro Unity Game Development with C-sharp Book
abdulrafaychaudhry
 
Utilocate provides Smarter, Better, Faster, Safer Locate Ticket Management
Utilocate provides Smarter, Better, Faster, Safer Locate Ticket ManagementUtilocate provides Smarter, Better, Faster, Safer Locate Ticket Management
Utilocate provides Smarter, Better, Faster, Safer Locate Ticket Management
Utilocate
 
How Recreation Management Software Can Streamline Your Operations.pptx
How Recreation Management Software Can Streamline Your Operations.pptxHow Recreation Management Software Can Streamline Your Operations.pptx
How Recreation Management Software Can Streamline Your Operations.pptx
wottaspaceseo
 
Enterprise Resource Planning System in Telangana
Enterprise Resource Planning System in TelanganaEnterprise Resource Planning System in Telangana
Enterprise Resource Planning System in Telangana
NYGGS Automation Suite
 
Understanding Globus Data Transfers with NetSage
Understanding Globus Data Transfers with NetSageUnderstanding Globus Data Transfers with NetSage
Understanding Globus Data Transfers with NetSage
Globus
 

Recently uploaded (20)

How to Position Your Globus Data Portal for Success Ten Good Practices
How to Position Your Globus Data Portal for Success Ten Good PracticesHow to Position Your Globus Data Portal for Success Ten Good Practices
How to Position Your Globus Data Portal for Success Ten Good Practices
 
Orion Context Broker introduction 20240604
Orion Context Broker introduction 20240604Orion Context Broker introduction 20240604
Orion Context Broker introduction 20240604
 
BoxLang: Review our Visionary Licenses of 2024
BoxLang: Review our Visionary Licenses of 2024BoxLang: Review our Visionary Licenses of 2024
BoxLang: Review our Visionary Licenses of 2024
 
Top 7 Unique WhatsApp API Benefits | Saudi Arabia
Top 7 Unique WhatsApp API Benefits | Saudi ArabiaTop 7 Unique WhatsApp API Benefits | Saudi Arabia
Top 7 Unique WhatsApp API Benefits | Saudi Arabia
 
Top Features to Include in Your Winzo Clone App for Business Growth (4).pptx
Top Features to Include in Your Winzo Clone App for Business Growth (4).pptxTop Features to Include in Your Winzo Clone App for Business Growth (4).pptx
Top Features to Include in Your Winzo Clone App for Business Growth (4).pptx
 
Navigating the Metaverse: A Journey into Virtual Evolution"
Navigating the Metaverse: A Journey into Virtual Evolution"Navigating the Metaverse: A Journey into Virtual Evolution"
Navigating the Metaverse: A Journey into Virtual Evolution"
 
Gamify Your Mind; The Secret Sauce to Delivering Success, Continuously Improv...
Gamify Your Mind; The Secret Sauce to Delivering Success, Continuously Improv...Gamify Your Mind; The Secret Sauce to Delivering Success, Continuously Improv...
Gamify Your Mind; The Secret Sauce to Delivering Success, Continuously Improv...
 
Large Language Models and the End of Programming
Large Language Models and the End of ProgrammingLarge Language Models and the End of Programming
Large Language Models and the End of Programming
 
A Sighting of filterA in Typelevel Rite of Passage
A Sighting of filterA in Typelevel Rite of PassageA Sighting of filterA in Typelevel Rite of Passage
A Sighting of filterA in Typelevel Rite of Passage
 
In 2015, I used to write extensions for Joomla, WordPress, phpBB3, etc and I ...
In 2015, I used to write extensions for Joomla, WordPress, phpBB3, etc and I ...In 2015, I used to write extensions for Joomla, WordPress, phpBB3, etc and I ...
In 2015, I used to write extensions for Joomla, WordPress, phpBB3, etc and I ...
 
AI Pilot Review: The World’s First Virtual Assistant Marketing Suite
AI Pilot Review: The World’s First Virtual Assistant Marketing SuiteAI Pilot Review: The World’s First Virtual Assistant Marketing Suite
AI Pilot Review: The World’s First Virtual Assistant Marketing Suite
 
GraphSummit Paris - The art of the possible with Graph Technology
GraphSummit Paris - The art of the possible with Graph TechnologyGraphSummit Paris - The art of the possible with Graph Technology
GraphSummit Paris - The art of the possible with Graph Technology
 
Enterprise Software Development with No Code Solutions.pptx
Enterprise Software Development with No Code Solutions.pptxEnterprise Software Development with No Code Solutions.pptx
Enterprise Software Development with No Code Solutions.pptx
 
Custom Healthcare Software for Managing Chronic Conditions and Remote Patient...
Custom Healthcare Software for Managing Chronic Conditions and Remote Patient...Custom Healthcare Software for Managing Chronic Conditions and Remote Patient...
Custom Healthcare Software for Managing Chronic Conditions and Remote Patient...
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
 
Pro Unity Game Development with C-sharp Book
Pro Unity Game Development with C-sharp BookPro Unity Game Development with C-sharp Book
Pro Unity Game Development with C-sharp Book
 
Utilocate provides Smarter, Better, Faster, Safer Locate Ticket Management
Utilocate provides Smarter, Better, Faster, Safer Locate Ticket ManagementUtilocate provides Smarter, Better, Faster, Safer Locate Ticket Management
Utilocate provides Smarter, Better, Faster, Safer Locate Ticket Management
 
How Recreation Management Software Can Streamline Your Operations.pptx
How Recreation Management Software Can Streamline Your Operations.pptxHow Recreation Management Software Can Streamline Your Operations.pptx
How Recreation Management Software Can Streamline Your Operations.pptx
 
Enterprise Resource Planning System in Telangana
Enterprise Resource Planning System in TelanganaEnterprise Resource Planning System in Telangana
Enterprise Resource Planning System in Telangana
 
Understanding Globus Data Transfers with NetSage
Understanding Globus Data Transfers with NetSageUnderstanding Globus Data Transfers with NetSage
Understanding Globus Data Transfers with NetSage
 

Lecture 2.pptx

  • 1. Empirical Software Engineering (MSE 507 ) Dr. Assad Abbas Department of Computer Science COMSATS Institute of Information Technology, Islamabad assadabbas@comsats.edu.pk
  • 2. 2 October 14, 2022 Introduction  Engineering requires the development of the product in a scientific, well formed, and systematic manner  Software Engineering (IEEE definition) The application of a systematic, disciplined, quantifiable approach to the development, operation and maintenance of software, and the study of these approaches, that is, the application of engineering to software.  Empirical “Empirical” is typically used to define any statement about the world that is related to observation or experience.
  • 3. 3 October 14, 2022 What is Empirical Software Engineering?  Empirical Software Engineering emphasizes the use of empirical methods in the field of software engineering.  involves methods for evaluating, assessing, predicting, monitoring, and controlling the existing artifacts of software development. applies quantitative methods to the software engineering phenomenon to understand software development better.  Gaining importance because of the availability of vast data sets from open source repositories that contain information about software requirements, bugs, and changes
  • 4. 4 October 14, 2022 Overview of Empirical Studies  Empirical study (ES) is an attempt to compare the theories and observations using real-life data for analysis  ES utilize data analysis and statistical methods for exploring relationships  Benefits of ES Allow to explore relationships Allow to prove theoretical concepts Allow to evaluate accuracy of the models Allow to choose among tools and techniques Allow to establish quality benchmarks across software organizations Allow to assess and improve techniques and methods
  • 5. 5 October 14, 2022 Empirical Studies in Software Engineering  Empirical studies are important in the area of software engineering as they allow: software professionals to evaluate and assess the new concepts, technologies, tools, and techniques in scientific and proved manner. improving, managing, and controlling the existing processes and techniques by using evidence obtained from the empirical analysis. to gather evidence that can be used to support the claims of efficiency of a given technique or technology  The empirical information can help software management in decision making and improving software processes
  • 6. 6 October 14, 2022 Steps in Empirical Studies Formation of research questions Formation of a research hypothesis Gathering data Analyzing the data Developing and validating models Deriving conclusions from the obtained results
  • 7. 7 October 14, 2022 Qualitative and Quantitative Research Empirical studies can be classified as: Quantitative & Qualitative Quantitative Research The most widely used scientific method in SE that applies mathematical or statistical-based methods to derive conclusions. Is used to prove or disprove a hypothesis (a concept that has to be tested for further investigation). The aim of a quantitative research is to generate results that are generalizable and unbiased and thus can be applied to a larger population in research. It uses statistical methods to validate a hypothesis and to explore causal relationships. Common quantitative methods include experiments, observations recorded as numbers, and surveys with closed-ended questions
  • 8. 8 October 14, 2022 Qualitative and Quantitative Research Qualitative Research In qualitative research, the researchers study human behavior, preferences, and nature. Provides an in-depth analysis of the concept under investigation and thus uses focused data for research. Understanding a new process or technique in software engineering is an example of qualitative research. Qualitative research provides textual descriptions or pictures related to human beliefs or behavior. It can be extended to other studies with similar populations but generalizations of a particular phenomenon may be difficult. Common qualitative methods include interviews with open- ended questions, observations described in words, and literature reviews that explore concepts and theories.
  • 9. 9 October 14, 2022 Qualitative and Quantitative Research
  • 10. 10 October 14, 2022 Types of Empirical Studies Experiments Case studies Surveys Systematic Reviews
  • 11. 11 October 14, 2022 Types of Empirical Studies Experiments An experimental study tests the established hypothesis by finding the effect of variables of interest (independent variables) on the outcome variable (dependent variable) using statistical analysis. If the experiment is carried out correctly, the hypothesis is either accepted or rejected.  For example, one group uses inspections and the other group uses walkthroughs to identify design defects, which technique is more effective in detecting a larger number of defects? The experiment must also control the confounding variables, which may affect the accuracy of the results produced by the experiment. The experiments are carried out in a controlled environment and often referred to as controlled experiments The key factors involved in the experiments are: independent variables, dependent variables, hypothesis, and statistical techniques
  • 12. 12 October 14, 2022 Types of Empirical Studies Experiments A controlled experiment involves varying the variables (one or more) and keeping everything else constant or the same and are usually conducted in small or laboratory setting Example: As discussed in previous slide (inspections and walkthroughs)
  • 13. 13 October 14, 2022 Types of Empirical Studies Case Study Case study research represents real-world scenarios and involves studying a particular phenomenon For example: to evaluate a particular software tool, method, or process The effect of a change in an organization can be studied using case study research Case studies are appropriate where a phenomenon is to be studied for a longer period of time so that the effects of the phenomenon can be observed. The disadvantages of case studies include difficulty in generalization as they represent a typical situation. Since they are based on a particular case, the validity of the results is questionable.
  • 14. 14 October 14, 2022 Types of Empirical Studies Case Study Phases in Case Study The case study design phase involves identifying existing objectives, cases, research questions, and data-collection strategies. The case may be a tool, technology, technique, process, product, individual, or software. Qualitative data is usually collected in a case study. The sources include interviews, group discussions, or observations. The data may be directly or indirectly collected from participants..
  • 15. 15 October 14, 2022 Types of Empirical Studies Survey Research Survey research identifies features or information from a large scale of a population. For example, surveys can be used when a researcher wants to know whether the use of a particular process has improved the view of clients toward software usability features. This information can be obtained by asking the selected software testers to fill questionnaires. Surveys are usually conducted using questionnaires and interviews. The questionnaires are constructed to collect research- related information. Surveys can be descriptive, explorative, and explanatory.
  • 16. 16 October 14, 2022 Types of Empirical Studies Survey Research Descriptive survey research—describes a concept or topic in more detail Explorative—focuses on the discovery of new ideas and insights and are usually conducted at the beginning of a research study to gather initial information Explanatory—attempts to explain how things work in connections like cause and effect, meaning a researcher wants to explain how things interact or work with each other For example, while exploring relationship between various independent variables and an outcome variable, a researcher may want to explain why an independent variable affects the outcome variable.
  • 17. 17 October 14, 2022 Types of Empirical Studies Systematic Reviews Literature review is an important part that examines the existing position of literature in an area in which the research is being conducted. The systematic reviews are methodically undertaken with a specific search strategy and well-defined methodology to answer a set of questions Kitchenham (2007) defines systematic review as:  A form of secondary study that uses a well-defined methodology to identify, analyze and interpret all available evidence related to a specific research question in a way that is unbiased and (to a degree) repeatable. The purpose of a systematic review is to summarize the existing research and provide future guidelines for research by identifying gaps in the existing literature
  • 18. 18 October 14, 2022 Types of Empirical Studies Systematic Reviews A systematic review involves:  Defining research questions.  Forming and documenting a search strategy.  Determining inclusion and exclusion criteria.  Establishing quality assessment criteria. Steps in systematic revies
  • 19. 19 October 14, 2022 Empirical vs. Experimental Studies Are the two same? Used interchangeably, but there is some difference In experimental software engineering, the dependent variable is closely observed in a controlled environment. Empirical studies are used to define anything related to observation and experience and are valuable as these studies consider real-world data. In experimental studies, data is artificial or synthetic but is more controlled.
  • 20. 20 October 14, 2022 Empirical Study Process  Without a sound and proven research process, it is difficult to carry out efficient and effective research.  Thus, a research methodology must be complete and repeatable, which, when followed, in a replicated or empirical study, will enable comparisons to be made across various studies
  • 22. 22 October 14, 2022 Empirical Study Process Study Definition Scope: What are the dimensions of the study? Motivation: Why is it being studied? (e.g. to analyze and assess the capability of a technique or method) Object: What entity is being studied? (e.g. process, product, technique being studied) Purpose: What is the aim of the study? (e.g. to prove that technique A is superior to technique B) Perspective: From whose view is the study being conducted (e.g. project manager, customer)? Domain: What is the area of the study?
  • 23. 23 October 14, 2022 Empirical Study Process Experiment Design Research Questions  The first step is to formulate the research problem. This step states the problem in the form of questions and identifies the main concepts and relations to be explored. RQ1: What will be the effect of software metrics on quality attributes (such as fault proneness/testing effort/maintenance effort) of a class? RQ2. Are machine-learning methods adaptable to object-oriented systems for predicting quality attributes?
  • 24. 24 October 14, 2022 Empirical Study Process Experiment Design Independent and Dependent Variables  To analyze relationships, the next step is to define the dependent and the independent variables.  The outcome variable predicted by the independent variables is called the dependent variable (response variable). For instance, the dependent variables of the models chosen for analysis may be fault proneness, testing effort, and maintenance effort.  A variable used to predict or estimate a dependent variable is called the independent (explanatory or predictor) variable.
  • 25. 25 October 14, 2022 Empirical Study Process Experiment Design Hypothesis Formulation  The researcher should carefully state the hypothesis to be tested in the study.  The hypothesis is tested on the sample data. On the basis of the result from the sample, a decision concerning the validity of the hypothesis (acceptance or rejection) is made.  Consider an example where a hypothesis is to be formed for comparing a number of methods for predicting fault- prone classes. H–M: M outperforms the compared methods for predicting fault-prone software classes Null hypothesis: M does not outperform the compared methods for predicting fault prone software classes.
  • 26. 26 October 14, 2022 Empirical Study Process Experiment Design Empirical Data Collection  e.g. university/academic systems, commercial systems, or open source software Empirical Methods  The data analysis techniques are selected based on the type of the dependent variables used  Statistical techniques: linear regression and logistic regression  Machine-learning techniques: decision trees, support vector machines, and so on
  • 27. 27 October 14, 2022 Research Conduct and Analysis Research Conduct and Analysis Descriptive Statistics  Descriptive statistics are brief informational coefficients that summarize a given data set, which can be either a representation of the entire population or a sample of a population.  Descriptive statistics are broken down into measures of central tendency and measures of variability (spread). Measures of central tendency include the mean, median, and mode Measures of variability include standard deviation, variance, minimum and maximum variables
  • 28. 28 October 14, 2022 Empirical Study Process Research Conduct and Analysis Attribute Reduction  Feature selection is an important step that identifies and removes as much of the irrelevant and redundant information as possible.  The dimensionality of the data reduces the size of the hypothesis space and allows the methods to operate faster and more effectively
  • 29. 29 October 14, 2022 Research Conduct and Analysis Research Conduct and Analysis Statistical Analysis Following steps are used in analysis  Model Prediction Multivariate analysis is used to find the combined effect of each independent variable on the dependent variable. Based on the results of performance measures, the performance of models predicted is evaluated and the results are interpreted.  Model Validation The validation data is used for validating the model predicted in the previous step  Hypothesis Testing It determines whether the null hypothesis can be rejected at a specified confidence level
  • 30. 30 October 14, 2022 Research Conduct and Analysis Result Interpretation In this step, the results computed in the empirical study’s analysis phase are assessed and discussed. The reason behind the acceptance or rejection of the hypothesis is examined. This process provides insight to the researchers about the actual reasons of the decision made for hypothesis. The conclusions are derived from the results obtained in the study. The significance and practical relevance of the results are defined in this phase. The limitations of the study are also reported in the form of threats to validity.
  • 31. 31 October 14, 2022 Research Conduct and Analysis Reporting Report is generated after the study has been conducted and results are finalized To be prepared from readers’ perspective Clearly document the: background, motivation, analysis, design, results, and the discussion of the results The experiment settings, data-collection methods, and design processes must be reported in significant level of detail The audience should be able to replicate or repeat the results of a study in a similar context
  • 32. 32 October 14, 2022 Characteristics of a Good Empirical Study  Clear  research goals, hypothesis, and data-collection procedure  Descriptive  should help in repeating and replication  Precise  Should help improve confidence  Valid  Should be valid for a wider population  Unbiased  No influence of any sort of factors to get the desired results (selection of participants)  Control  Experiment design should be able to control the independent variables to minimize confounding effects of other variables  Replicable  Repeating the experiment with different data under same conditions  Repeatable  Reproducing the results of the study under similar settings
  • 33. 33 October 14, 2022 Ethics of Empirical Research  Researchers, academicians, and sponsors should be aware of research ethics while conducting and funding empirical research in software engineering.  The upholding of ethical standards helps to develop trust between the researcher and the participant, and thus smoothens the research process.  An unethical study can harm the reputation of the research conducted in software engineering area  In some countries like the United States, the sponsoring agency requires that the research involving participants must be reviewed by a third-party ethics committee to verify that the research complies with the ethical principles and standards
  • 34. 34 October 14, 2022 Ethics of Empirical Research Examples of Unethical Research
  • 35. 35 October 14, 2022 Ethics of Empirical Research Informed consent Five elements— disclosure, comprehension, voluntariness, consent, and right to withdraw Consent form— Research title, contact details, consent and comprehension, withdrawal, confidentiality, risks and benefits, clarification, signatures Scientific value Confidentiality Beneficence
  • 36. 36 October 14, 2022 Basic Elements of Empirical Research
  • 37. 37 October 14, 2022 Terminologies Software quality and software evolution Software quality determines how well the software is designed (quality of design), and how well the software conforms to that design (quality of conformance). Software evolution (maintenance) involves making changes in the software. Changes are required because of the following reasons:  Defects reported by the customer  New functionality requested by the customer  Improvement in the quality of the software  To adapt to new platforms
  • 39. 39 October 14, 2022 Terminologies Descriptive, Correlational, and Cause–Effect Research Descriptive research provides description of concepts. Correlational research provides relation between two variables. Cause–effect research is similar to experiment research in that the effect of one variable on another is found.
  • 40. 40 October 14, 2022 Terminologies Classification and Prediction Classification predicts categorical outcome variables The training data is used for model development, and the model can be used for predicting unknown categories of outcome variables.
  • 41. 41 October 14, 2022 Terminologies  Qualitative and quantitative data  Dependent, independent and confounding variables  Parametric and non-parametric tests These tests are applied to determine the validity of the research hypothesis Parametric tests are used for data samples having normal distribution (bell-shaped curve) Nonparametric tests are used when the distribution of data samples is highly skewed (left, right skewed)
  • 42. 42 October 14, 2022 Terminologies Goal Question Metric (GQM) GQM method follows top-down approach for dividing goals into questions and mapping questions to metrics, and follows bottom-up approach by interpreting the measurement to verify whether the goals have been satisfied.  Goal: Improve the defect-correction rate in the system.  Question: How many defects have been corrected in the maintenance phase?  Metric: Number of defects corrected/Number of defects reported.
  • 43. 43 October 14, 2022 Terminologies  Software Archives or Repositories The progress of the software is managed using software repositories that include source code, documentation, archived communications, and defect-tracking systems.  For maintaining software systems, improving software quality, and empirical validation of data and techniques. Researchers can mine these repositories to understand the software development, software evolution, and make predictions. For example, defects can be predicted using historical data, and this information can be used to produce less defective future releases. The data is kept in various types of software repositories such as CVS, Git, SVN, ClearCase, Perforce, Mercurial, Veracity, and Fossil.
  • 44. 44 October 14, 2022 Source Malhotra, Ruchika. Empirical research in software engineering: concepts, analysis, and applications. CRC press, 2016.

Editor's Notes

  1. 1