SlideShare a Scribd company logo
1 of 25
Empowering Empirical Research in Software Design:
Construction and Studies on a Large-Scale Corpus of UML Models
Ph.D Thesis Defence
Ph.D Candidate:
Truong Ho-Quang
Chalmers | Gothenburg Univ.
Sweden
Opponent:
Dr. Klaas-Jan Stol
University College Cork
Ireland
Supervisors:
Dr. Michel R.V. Chaudron
Dr. Regina Hebig
Chalmers | Gothenburg Univ.
Sweden
Grading committee members:
Dr. Maria Teresa Baldassarre (University of Bari Aldo Mori, Italy)
Dr. Christoph Treude (University of Adelaide, Australia)
Dr. Sebastian Herold (Karlstad University, Sweden)
Content
• Context of the study (the ’Why’)
• Research Approach & Methods (the ’How’)
• Findings (the ’What’)
• Conclusion
2
Design & Modeling: House vs Software
3
HOUSE SOFTWARE
Design(v) is a process of making
decision about something that is
to be built or created.
Model(n) is an abstract
representation of a thing/system.
Modeling(v) is the process of
making models (i.e. choosing what
to represent and how to represent it)
Expressing
design into house
design plans
A house plan An UML model
Expressing
design into
models
Context of the study
• Software design and modeling are an essential part of the
software development process.
• There is lack of empirical research about software design
and modeling in practice.
4
Example of contradict findings
• Empirical research about use of UML (*) is contradictory
– [Petre, 2013]: ”… The majority of those interviewed (50)
simply do not use UML. Informants criticized UML for its
complexity, lack of formal semantics, inconsistency …”
– [Scanniello et al., 2010]: ”… the majority of the companies
(20/22) use UML in their projects…”
– [Anda et al., 2006]: ” … The interviewees obtained
immediate improvements as a consequence of
introducing a UML-based development method …”
5
(*) Abbreviation of the Unified Modeling Language
Problem statement
• Lack of practical guidelines on the use of modeling
– How is software design and modeling used?
– What are the impacts of software design and modeling?
• Lack of empirical data about software design & modeling
– Lack of generalisability and replicability
Goal of the PhD:
To empower empirical studies in software design and modeling by
collecting and studying a large corpus of software modeling artifacts
from real-life software systems.
6
RESEARCH GOALS,
SCOPE &
METHODOLOGY
• Goals of the Ph.D study
• Scope of the Ph.D study
• Approach & Research methodology
7
Goals of the Ph.D study
8
Dataset
Knowledge
G1. Building & sharing
a corpus of curated modeling artifacts
G2. Modeling practices
G3. Impacts
of modeling
Modeling languages
Scope: What modeling artifacts?
9
We focus on the Unified Modeling Language (UML) because:
• UML has become the de-facto standard for software
modeling in industry.
Scope: Which sources of UML models?
10
Sources of UML models
Industry OSS projects
Benefits • Industry-relevant context • OSS relevant context
• Data availability
• Data transparency
Challenges • Data availability
• Generalisation
• Replicability
• Identification of UML files
• Filtering out toy-projects
Challenges to identifying impacts:
Modeling is contextual (*)
11
(*) Figure from this paper:
Fernández-Sáez, Ana M., Michel RV Chaudron, and Marcela Genero. "An industrial case study on the use of UML in software
maintenance and its perceived benefits and hurdles." Empirical Software Engineering 23.6 (2018): 3281-3345.
Scope: Studying impacts in context
12
Dataset
Knowledge
G1. Building & sharing
a corpus of curated modeling artifacts
G2. Modeling practices
G3. Impacts
of modeling
G3.
Impacts
G2.
Practices
G1.
Collection
Context of
use
Enables
Enables
Relationship of papers and goals
13
C
A
B D
E
F
G HE
Legend
Paper
21
Paper 2
extends
paper 1
G1
Corpus
G2
Practices
G3
Impacts
Goals
Papers
A B C D E F G H
G3.
Impacts
G2.
Practices
G1.
Collection
Research methodology
14
Empirical Method
Experiment, Case Study,
User Study
Empirical Method
Survey Study
Constructive Method
FINDINGS &
CONTRIBUTIONS
15
Contributions to Goal G1.
Data Contribution
• Lindholmen dataset
– 93k+ UML models from 24k+ GitHub projects
– Meta data of the UML models and projects
– Data are curated
• Researchers are using it
– 12 published papers by other authors
16
http://models.cs.chalmers.se/oss/
Lindholmen Dataset
G1.
Collection
Contributions to Goal G1.
Data Collection Process
• Complete tool set for automatically crawl UML models from a
GitHub project
• This can be extended to crawl other software development artifacts
17
UML
File list
GitHub
1 Data collection
Potential UML file list
3 Extract Meta-data
2 Filter UML files
UML Image
Filter
Textual
Filter
Validation
5 Analyse result
4 Query database
CVSAnalY MySQL
GHTorrent
~ 12 800 000
non-forked repos
Output: 93 648 UML files
& 24 797 projects that use UML
Contributions to Goal G2.
• Scientific insights into how UML is used in OSS
projects.
18
G2.
Practices
Paper B: “How is UML used in OSS projects?”
19
• Most projects work very shortly on UML, usually at the beginning.
• Models are introduced during all possible phases in the lifespan of
OSS projects.
• A few projects are active with UML during their whole lifetime.
G2.
Practices
Paper C: Why is UML used in OSS projects?
20
• The majority of models are intended for creating
software designs and documenting software systems.
• Non-UML Contributors (NUCs) benefit from UML models
for understanding a system and for communication.
G2.
Practices
Contributions to goal G3.
• Scientific insights into impacts of software
design and UML modeling in software
development
21
G3.
Impacts
Paper C: What are the impacts of using UML?
22
• UML is helpful for new contributors to get up to speed.
• Changes of the working routine due to UML, mainly in
the planning phase and in communication.
G3.
Impacts
Paper F: Impacts of using UML to defect proneness?
23
93 GitHub projects
without UML models
50 GitHub projects
that have UML models
• Projects with UML have about 35% fewer bugs reported
compared to projects without using UML.
G3.
Impacts
Paper H: Impacts of using role stereotype to
developer’s understanding?
24
Mean TLX (*) Mean SUS (**) Mean Understanding (**)
RoleViz SoftagramRoleViz Tool (*)
• Participants achieved better scores on completing
software understanding tasks with RoleViz without any
cognitive-load penalty
(*) Demo video is available online at: https://youtu.be/1JYQMPMF9do
G3.
Impacts
Conclusion
25
I am proud that after five years of my PhD study:
• I went from little data to big empirical data of UML use.
• I made qualitative and quantitative observations on the
use and impacts of using software design and
modeling.
• I ran into an area that has lots of room for discussion &
future research.
• Being a part of great teams and colleagues.
Thank you for your attention!

More Related Content

What's hot

What do Practitioners Expect from the Meta-modeling Tools? A Survey
What do Practitioners Expect from the Meta-modeling Tools? A SurveyWhat do Practitioners Expect from the Meta-modeling Tools? A Survey
What do Practitioners Expect from the Meta-modeling Tools? A SurveyObeo
 
Sept 6 2021 BTech Artificial Intelligence and Data Science curriculum
Sept 6 2021 BTech Artificial Intelligence and Data Science curriculumSept 6 2021 BTech Artificial Intelligence and Data Science curriculum
Sept 6 2021 BTech Artificial Intelligence and Data Science curriculumFerdin Joe John Joseph PhD
 
Mit109 – software engineering
Mit109 – software engineeringMit109 – software engineering
Mit109 – software engineeringsmumbahelp
 
Needs challenges and_opportunites_in_architectural_languages (bolzano_dec2013)
Needs challenges and_opportunites_in_architectural_languages (bolzano_dec2013)Needs challenges and_opportunites_in_architectural_languages (bolzano_dec2013)
Needs challenges and_opportunites_in_architectural_languages (bolzano_dec2013)Henry Muccini
 
The Gap Between Academic Research and Industrial Practice in Software Testing
The Gap Between Academic Research and Industrial Practice in Software TestingThe Gap Between Academic Research and Industrial Practice in Software Testing
The Gap Between Academic Research and Industrial Practice in Software TestingZoltan Micskei
 
Software engineering Course module
Software engineering Course moduleSoftware engineering Course module
Software engineering Course moduleamandeep0224
 
A Survey on Software Release Planning Models - Slides for the Presentation @ ...
A Survey on Software Release Planning Models - Slides for the Presentation @ ...A Survey on Software Release Planning Models - Slides for the Presentation @ ...
A Survey on Software Release Planning Models - Slides for the Presentation @ ...Supersede
 
Contributions to the multidisciplinarity of computer science and IS
Contributions to the multidisciplinarity of computer science and ISContributions to the multidisciplinarity of computer science and IS
Contributions to the multidisciplinarity of computer science and ISSaïd Assar
 
Systematic software development using vdm by jones 2nd edition
Systematic software development using vdm by jones 2nd editionSystematic software development using vdm by jones 2nd edition
Systematic software development using vdm by jones 2nd editionYasir Raza Khan
 

What's hot (9)

What do Practitioners Expect from the Meta-modeling Tools? A Survey
What do Practitioners Expect from the Meta-modeling Tools? A SurveyWhat do Practitioners Expect from the Meta-modeling Tools? A Survey
What do Practitioners Expect from the Meta-modeling Tools? A Survey
 
Sept 6 2021 BTech Artificial Intelligence and Data Science curriculum
Sept 6 2021 BTech Artificial Intelligence and Data Science curriculumSept 6 2021 BTech Artificial Intelligence and Data Science curriculum
Sept 6 2021 BTech Artificial Intelligence and Data Science curriculum
 
Mit109 – software engineering
Mit109 – software engineeringMit109 – software engineering
Mit109 – software engineering
 
Needs challenges and_opportunites_in_architectural_languages (bolzano_dec2013)
Needs challenges and_opportunites_in_architectural_languages (bolzano_dec2013)Needs challenges and_opportunites_in_architectural_languages (bolzano_dec2013)
Needs challenges and_opportunites_in_architectural_languages (bolzano_dec2013)
 
The Gap Between Academic Research and Industrial Practice in Software Testing
The Gap Between Academic Research and Industrial Practice in Software TestingThe Gap Between Academic Research and Industrial Practice in Software Testing
The Gap Between Academic Research and Industrial Practice in Software Testing
 
Software engineering Course module
Software engineering Course moduleSoftware engineering Course module
Software engineering Course module
 
A Survey on Software Release Planning Models - Slides for the Presentation @ ...
A Survey on Software Release Planning Models - Slides for the Presentation @ ...A Survey on Software Release Planning Models - Slides for the Presentation @ ...
A Survey on Software Release Planning Models - Slides for the Presentation @ ...
 
Contributions to the multidisciplinarity of computer science and IS
Contributions to the multidisciplinarity of computer science and ISContributions to the multidisciplinarity of computer science and IS
Contributions to the multidisciplinarity of computer science and IS
 
Systematic software development using vdm by jones 2nd edition
Systematic software development using vdm by jones 2nd editionSystematic software development using vdm by jones 2nd edition
Systematic software development using vdm by jones 2nd edition
 

Similar to Truong Ho-Quang's Ph.D Defence Presentation

A Survey on Design Pattern Detection Approaches
A Survey on Design Pattern Detection ApproachesA Survey on Design Pattern Detection Approaches
A Survey on Design Pattern Detection ApproachesCSCJournals
 
CS251 Intro. to SE [Lec. 0 - Course Introduction & Plan] Spring 2022.pdf
CS251 Intro. to SE [Lec. 0 - Course Introduction & Plan] Spring 2022.pdfCS251 Intro. to SE [Lec. 0 - Course Introduction & Plan] Spring 2022.pdf
CS251 Intro. to SE [Lec. 0 - Course Introduction & Plan] Spring 2022.pdfTitoMido1
 
Taming Complexity: On Studying the Application of Model-Driven Engineering to...
Taming Complexity: On Studying the Application of Model-Driven Engineering to...Taming Complexity: On Studying the Application of Model-Driven Engineering to...
Taming Complexity: On Studying the Application of Model-Driven Engineering to...Florian Rademacher
 
Discreate eventsimulation idef
Discreate eventsimulation idefDiscreate eventsimulation idef
Discreate eventsimulation idefMandar Trivedi
 
Design Patterns - General Introduction
Design Patterns - General IntroductionDesign Patterns - General Introduction
Design Patterns - General IntroductionAsma CHERIF
 
Generic Model-based Approaches for Software Reverse Engineering and Comprehen...
Generic Model-based Approaches for Software Reverse Engineering and Comprehen...Generic Model-based Approaches for Software Reverse Engineering and Comprehen...
Generic Model-based Approaches for Software Reverse Engineering and Comprehen...Hugo Bruneliere
 
20CB304 - SE - UNIT V - Digital Notes.pptx
20CB304 - SE - UNIT V - Digital Notes.pptx20CB304 - SE - UNIT V - Digital Notes.pptx
20CB304 - SE - UNIT V - Digital Notes.pptxJayaramB11
 
Tommi kramer 2013-06-21-caise-re2-kramer
Tommi kramer   2013-06-21-caise-re2-kramerTommi kramer   2013-06-21-caise-re2-kramer
Tommi kramer 2013-06-21-caise-re2-kramercaise2013vlc
 
Software Engineering Research: Leading a Double-Agent Life.
Software Engineering Research: Leading a Double-Agent Life.Software Engineering Research: Leading a Double-Agent Life.
Software Engineering Research: Leading a Double-Agent Life.Lionel Briand
 
Software Development for Processionals.ppt
Software Development for Processionals.pptSoftware Development for Processionals.ppt
Software Development for Processionals.pptNaeemAhmed654645
 
MK_MSc_Degree_Project_Report ver 5_updated
MK_MSc_Degree_Project_Report ver 5_updatedMK_MSc_Degree_Project_Report ver 5_updated
MK_MSc_Degree_Project_Report ver 5_updatedMohammed Ali Khan
 
Best Practices in Object Oriented Analysis and Design
Best Practices in Object Oriented Analysis and DesignBest Practices in Object Oriented Analysis and Design
Best Practices in Object Oriented Analysis and Designnguyencraft01
 
Design & Evaluation of the Goal-Oriented Design Knowledge Library
Design & Evaluation of the Goal-Oriented Design Knowledge LibraryDesign & Evaluation of the Goal-Oriented Design Knowledge Library
Design & Evaluation of the Goal-Oriented Design Knowledge Libraryandrewhilts
 
Studying Software Engineering Patterns for Designing Machine Learning Systems
Studying Software Engineering Patterns for Designing Machine Learning SystemsStudying Software Engineering Patterns for Designing Machine Learning Systems
Studying Software Engineering Patterns for Designing Machine Learning SystemsHironori Washizaki
 
Declarations of Software Engineering Project Managers Managing Remotely: Prov...
Declarations of Software Engineering Project Managers Managing Remotely: Prov...Declarations of Software Engineering Project Managers Managing Remotely: Prov...
Declarations of Software Engineering Project Managers Managing Remotely: Prov...Dr. Mustafa Değerli
 
View Alignment Techniques
View Alignment TechniquesView Alignment Techniques
View Alignment TechniquesJIGAR MAKHIJA
 

Similar to Truong Ho-Quang's Ph.D Defence Presentation (20)

A Survey on Design Pattern Detection Approaches
A Survey on Design Pattern Detection ApproachesA Survey on Design Pattern Detection Approaches
A Survey on Design Pattern Detection Approaches
 
CS251 Intro. to SE [Lec. 0 - Course Introduction & Plan] Spring 2022.pdf
CS251 Intro. to SE [Lec. 0 - Course Introduction & Plan] Spring 2022.pdfCS251 Intro. to SE [Lec. 0 - Course Introduction & Plan] Spring 2022.pdf
CS251 Intro. to SE [Lec. 0 - Course Introduction & Plan] Spring 2022.pdf
 
Taming Complexity: On Studying the Application of Model-Driven Engineering to...
Taming Complexity: On Studying the Application of Model-Driven Engineering to...Taming Complexity: On Studying the Application of Model-Driven Engineering to...
Taming Complexity: On Studying the Application of Model-Driven Engineering to...
 
Discreate eventsimulation idef
Discreate eventsimulation idefDiscreate eventsimulation idef
Discreate eventsimulation idef
 
Design Patterns - General Introduction
Design Patterns - General IntroductionDesign Patterns - General Introduction
Design Patterns - General Introduction
 
Jar chapter 1
Jar chapter 1Jar chapter 1
Jar chapter 1
 
Generic Model-based Approaches for Software Reverse Engineering and Comprehen...
Generic Model-based Approaches for Software Reverse Engineering and Comprehen...Generic Model-based Approaches for Software Reverse Engineering and Comprehen...
Generic Model-based Approaches for Software Reverse Engineering and Comprehen...
 
20CB304 - SE - UNIT V - Digital Notes.pptx
20CB304 - SE - UNIT V - Digital Notes.pptx20CB304 - SE - UNIT V - Digital Notes.pptx
20CB304 - SE - UNIT V - Digital Notes.pptx
 
Tommi kramer 2013-06-21-caise-re2-kramer
Tommi kramer   2013-06-21-caise-re2-kramerTommi kramer   2013-06-21-caise-re2-kramer
Tommi kramer 2013-06-21-caise-re2-kramer
 
Oopsla07 tooldemo.ppt
Oopsla07 tooldemo.pptOopsla07 tooldemo.ppt
Oopsla07 tooldemo.ppt
 
Software Engineering Research: Leading a Double-Agent Life.
Software Engineering Research: Leading a Double-Agent Life.Software Engineering Research: Leading a Double-Agent Life.
Software Engineering Research: Leading a Double-Agent Life.
 
Software Development for Processionals.ppt
Software Development for Processionals.pptSoftware Development for Processionals.ppt
Software Development for Processionals.ppt
 
UNIT 01 SMD.pptx
UNIT 01 SMD.pptxUNIT 01 SMD.pptx
UNIT 01 SMD.pptx
 
Iwesep19.ppt
Iwesep19.pptIwesep19.ppt
Iwesep19.ppt
 
MK_MSc_Degree_Project_Report ver 5_updated
MK_MSc_Degree_Project_Report ver 5_updatedMK_MSc_Degree_Project_Report ver 5_updated
MK_MSc_Degree_Project_Report ver 5_updated
 
Best Practices in Object Oriented Analysis and Design
Best Practices in Object Oriented Analysis and DesignBest Practices in Object Oriented Analysis and Design
Best Practices in Object Oriented Analysis and Design
 
Design & Evaluation of the Goal-Oriented Design Knowledge Library
Design & Evaluation of the Goal-Oriented Design Knowledge LibraryDesign & Evaluation of the Goal-Oriented Design Knowledge Library
Design & Evaluation of the Goal-Oriented Design Knowledge Library
 
Studying Software Engineering Patterns for Designing Machine Learning Systems
Studying Software Engineering Patterns for Designing Machine Learning SystemsStudying Software Engineering Patterns for Designing Machine Learning Systems
Studying Software Engineering Patterns for Designing Machine Learning Systems
 
Declarations of Software Engineering Project Managers Managing Remotely: Prov...
Declarations of Software Engineering Project Managers Managing Remotely: Prov...Declarations of Software Engineering Project Managers Managing Remotely: Prov...
Declarations of Software Engineering Project Managers Managing Remotely: Prov...
 
View Alignment Techniques
View Alignment TechniquesView Alignment Techniques
View Alignment Techniques
 

More from Ho Quang Truong

Interactive Role Stereotype-Based Visualization To Comprehend Software Archit...
Interactive Role Stereotype-Based Visualization To Comprehend Software Archit...Interactive Role Stereotype-Based Visualization To Comprehend Software Archit...
Interactive Role Stereotype-Based Visualization To Comprehend Software Archit...Ho Quang Truong
 
SPLC 2018 industry forum
SPLC 2018 industry forumSPLC 2018 industry forum
SPLC 2018 industry forumHo Quang Truong
 
Hand-Over Presentation at SPLC’17 in Sevilla
Hand-Over Presentation at SPLC’17 in SevillaHand-Over Presentation at SPLC’17 in Sevilla
Hand-Over Presentation at SPLC’17 in SevillaHo Quang Truong
 

More from Ho Quang Truong (7)

Interactive Role Stereotype-Based Visualization To Comprehend Software Archit...
Interactive Role Stereotype-Based Visualization To Comprehend Software Archit...Interactive Role Stereotype-Based Visualization To Comprehend Software Archit...
Interactive Role Stereotype-Based Visualization To Comprehend Software Archit...
 
SPLC 2018 Booklet
SPLC 2018 BookletSPLC 2018 Booklet
SPLC 2018 Booklet
 
SPLC 2018 industry forum
SPLC 2018 industry forumSPLC 2018 industry forum
SPLC 2018 industry forum
 
Splc18 A4 poster
Splc18 A4 posterSplc18 A4 poster
Splc18 A4 poster
 
Splc18 flyer
Splc18 flyerSplc18 flyer
Splc18 flyer
 
Splc18 cover slide
Splc18 cover slideSplc18 cover slide
Splc18 cover slide
 
Hand-Over Presentation at SPLC’17 in Sevilla
Hand-Over Presentation at SPLC’17 in SevillaHand-Over Presentation at SPLC’17 in Sevilla
Hand-Over Presentation at SPLC’17 in Sevilla
 

Recently uploaded

Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...RohitNehra6
 
Boyles law module in the grade 10 science
Boyles law module in the grade 10 scienceBoyles law module in the grade 10 science
Boyles law module in the grade 10 sciencefloriejanemacaya1
 
Presentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxPresentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxgindu3009
 
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.aasikanpl
 
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCESTERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCEPRINCE C P
 
Is RISC-V ready for HPC workload? Maybe?
Is RISC-V ready for HPC workload? Maybe?Is RISC-V ready for HPC workload? Maybe?
Is RISC-V ready for HPC workload? Maybe?Patrick Diehl
 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Lokesh Kothari
 
Work, Energy and Power for class 10 ICSE Physics
Work, Energy and Power for class 10 ICSE PhysicsWork, Energy and Power for class 10 ICSE Physics
Work, Energy and Power for class 10 ICSE Physicsvishikhakeshava1
 
G9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.pptG9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.pptMAESTRELLAMesa2
 
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |aasikanpl
 
Isotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoIsotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoSérgio Sacani
 
Formation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksFormation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksSérgio Sacani
 
Caco-2 cell permeability assay for drug absorption
Caco-2 cell permeability assay for drug absorptionCaco-2 cell permeability assay for drug absorption
Caco-2 cell permeability assay for drug absorptionPriyansha Singh
 
Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )aarthirajkumar25
 
Analytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptxAnalytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptxSwapnil Therkar
 
Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsAArockiyaNisha
 
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service 🪡
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service  🪡CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service  🪡
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service 🪡anilsa9823
 
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...Sérgio Sacani
 

Recently uploaded (20)

CELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdfCELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdf
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...
 
Boyles law module in the grade 10 science
Boyles law module in the grade 10 scienceBoyles law module in the grade 10 science
Boyles law module in the grade 10 science
 
Presentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxPresentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptx
 
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
 
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCESTERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
 
Is RISC-V ready for HPC workload? Maybe?
Is RISC-V ready for HPC workload? Maybe?Is RISC-V ready for HPC workload? Maybe?
Is RISC-V ready for HPC workload? Maybe?
 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
 
Work, Energy and Power for class 10 ICSE Physics
Work, Energy and Power for class 10 ICSE PhysicsWork, Energy and Power for class 10 ICSE Physics
Work, Energy and Power for class 10 ICSE Physics
 
G9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.pptG9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.ppt
 
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
 
Isotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoIsotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on Io
 
The Philosophy of Science
The Philosophy of ScienceThe Philosophy of Science
The Philosophy of Science
 
Formation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksFormation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disks
 
Caco-2 cell permeability assay for drug absorption
Caco-2 cell permeability assay for drug absorptionCaco-2 cell permeability assay for drug absorption
Caco-2 cell permeability assay for drug absorption
 
Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )
 
Analytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptxAnalytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptx
 
Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based Nanomaterials
 
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service 🪡
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service  🪡CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service  🪡
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service 🪡
 
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
 

Truong Ho-Quang's Ph.D Defence Presentation

  • 1. Empowering Empirical Research in Software Design: Construction and Studies on a Large-Scale Corpus of UML Models Ph.D Thesis Defence Ph.D Candidate: Truong Ho-Quang Chalmers | Gothenburg Univ. Sweden Opponent: Dr. Klaas-Jan Stol University College Cork Ireland Supervisors: Dr. Michel R.V. Chaudron Dr. Regina Hebig Chalmers | Gothenburg Univ. Sweden Grading committee members: Dr. Maria Teresa Baldassarre (University of Bari Aldo Mori, Italy) Dr. Christoph Treude (University of Adelaide, Australia) Dr. Sebastian Herold (Karlstad University, Sweden)
  • 2. Content • Context of the study (the ’Why’) • Research Approach & Methods (the ’How’) • Findings (the ’What’) • Conclusion 2
  • 3. Design & Modeling: House vs Software 3 HOUSE SOFTWARE Design(v) is a process of making decision about something that is to be built or created. Model(n) is an abstract representation of a thing/system. Modeling(v) is the process of making models (i.e. choosing what to represent and how to represent it) Expressing design into house design plans A house plan An UML model Expressing design into models
  • 4. Context of the study • Software design and modeling are an essential part of the software development process. • There is lack of empirical research about software design and modeling in practice. 4
  • 5. Example of contradict findings • Empirical research about use of UML (*) is contradictory – [Petre, 2013]: ”… The majority of those interviewed (50) simply do not use UML. Informants criticized UML for its complexity, lack of formal semantics, inconsistency …” – [Scanniello et al., 2010]: ”… the majority of the companies (20/22) use UML in their projects…” – [Anda et al., 2006]: ” … The interviewees obtained immediate improvements as a consequence of introducing a UML-based development method …” 5 (*) Abbreviation of the Unified Modeling Language
  • 6. Problem statement • Lack of practical guidelines on the use of modeling – How is software design and modeling used? – What are the impacts of software design and modeling? • Lack of empirical data about software design & modeling – Lack of generalisability and replicability Goal of the PhD: To empower empirical studies in software design and modeling by collecting and studying a large corpus of software modeling artifacts from real-life software systems. 6
  • 7. RESEARCH GOALS, SCOPE & METHODOLOGY • Goals of the Ph.D study • Scope of the Ph.D study • Approach & Research methodology 7
  • 8. Goals of the Ph.D study 8 Dataset Knowledge G1. Building & sharing a corpus of curated modeling artifacts G2. Modeling practices G3. Impacts of modeling
  • 9. Modeling languages Scope: What modeling artifacts? 9 We focus on the Unified Modeling Language (UML) because: • UML has become the de-facto standard for software modeling in industry.
  • 10. Scope: Which sources of UML models? 10 Sources of UML models Industry OSS projects Benefits • Industry-relevant context • OSS relevant context • Data availability • Data transparency Challenges • Data availability • Generalisation • Replicability • Identification of UML files • Filtering out toy-projects
  • 11. Challenges to identifying impacts: Modeling is contextual (*) 11 (*) Figure from this paper: Fernández-Sáez, Ana M., Michel RV Chaudron, and Marcela Genero. "An industrial case study on the use of UML in software maintenance and its perceived benefits and hurdles." Empirical Software Engineering 23.6 (2018): 3281-3345.
  • 12. Scope: Studying impacts in context 12 Dataset Knowledge G1. Building & sharing a corpus of curated modeling artifacts G2. Modeling practices G3. Impacts of modeling G3. Impacts G2. Practices G1. Collection Context of use Enables Enables
  • 13. Relationship of papers and goals 13 C A B D E F G HE Legend Paper 21 Paper 2 extends paper 1 G1 Corpus G2 Practices G3 Impacts
  • 14. Goals Papers A B C D E F G H G3. Impacts G2. Practices G1. Collection Research methodology 14 Empirical Method Experiment, Case Study, User Study Empirical Method Survey Study Constructive Method
  • 16. Contributions to Goal G1. Data Contribution • Lindholmen dataset – 93k+ UML models from 24k+ GitHub projects – Meta data of the UML models and projects – Data are curated • Researchers are using it – 12 published papers by other authors 16 http://models.cs.chalmers.se/oss/ Lindholmen Dataset G1. Collection
  • 17. Contributions to Goal G1. Data Collection Process • Complete tool set for automatically crawl UML models from a GitHub project • This can be extended to crawl other software development artifacts 17 UML File list GitHub 1 Data collection Potential UML file list 3 Extract Meta-data 2 Filter UML files UML Image Filter Textual Filter Validation 5 Analyse result 4 Query database CVSAnalY MySQL GHTorrent ~ 12 800 000 non-forked repos Output: 93 648 UML files & 24 797 projects that use UML
  • 18. Contributions to Goal G2. • Scientific insights into how UML is used in OSS projects. 18 G2. Practices
  • 19. Paper B: “How is UML used in OSS projects?” 19 • Most projects work very shortly on UML, usually at the beginning. • Models are introduced during all possible phases in the lifespan of OSS projects. • A few projects are active with UML during their whole lifetime. G2. Practices
  • 20. Paper C: Why is UML used in OSS projects? 20 • The majority of models are intended for creating software designs and documenting software systems. • Non-UML Contributors (NUCs) benefit from UML models for understanding a system and for communication. G2. Practices
  • 21. Contributions to goal G3. • Scientific insights into impacts of software design and UML modeling in software development 21 G3. Impacts
  • 22. Paper C: What are the impacts of using UML? 22 • UML is helpful for new contributors to get up to speed. • Changes of the working routine due to UML, mainly in the planning phase and in communication. G3. Impacts
  • 23. Paper F: Impacts of using UML to defect proneness? 23 93 GitHub projects without UML models 50 GitHub projects that have UML models • Projects with UML have about 35% fewer bugs reported compared to projects without using UML. G3. Impacts
  • 24. Paper H: Impacts of using role stereotype to developer’s understanding? 24 Mean TLX (*) Mean SUS (**) Mean Understanding (**) RoleViz SoftagramRoleViz Tool (*) • Participants achieved better scores on completing software understanding tasks with RoleViz without any cognitive-load penalty (*) Demo video is available online at: https://youtu.be/1JYQMPMF9do G3. Impacts
  • 25. Conclusion 25 I am proud that after five years of my PhD study: • I went from little data to big empirical data of UML use. • I made qualitative and quantitative observations on the use and impacts of using software design and modeling. • I ran into an area that has lots of room for discussion & future research. • Being a part of great teams and colleagues. Thank you for your attention!