SlideShare a Scribd company logo
1 of 11
Download to read offline
ONTHE USE OF DATABASE
FRAMEWORKS IN JAVA PROJECTS
A LARGE-SCALE HISTORICAL EMPIRICAL ANALYSIS
Mathieu Goeminne
mathieu.goeminne@umons.ac.be
Tom Mens
tom.mens@umons.ac.be
Software Engineering Lab
Alexandre Decan
alexandre.decan@umons.ac.be
GENERAL CONTEXT
• FNRS interuniversity research collaboration with the University of Namur.
• Study of software systems that use a database.
• Understanding the dynamics, issues and evolution of such systems.
• Need to take into account their specificities:
• How is their connection managed?
• How do the source code and the database co-evolve?
• How can we assess and increase the quality of such systems?
DATABASE FRAMEWORKS?
.java
.jsp
.php
DBMS
ORM
Connector
Session
manager
Source code DatabaseFrameworks
ONTHIS PRESENTATION
• Focus on database frameworks.
• Empirical study for answering 3 research questions:
• Q1: Which combinations of DB frameworks are being used
simultaneously?
• Q2: How long do DB frameworks ‘survive’ in the projects that use
them?
• Q3: Does the introduction of a DB framework influence the
survivability of another one?
CASE STUDY
• 20 Java DB frameworks.
• Top 5 retained: JDBC, Spring,
JPA, Vaadin, Hibernate.
• 13,307 Java projects from the
Github Java Corpus (MSR10):
3,707 of them use at least one
retained framework)
SURVIVAL ANALYSIS:
KAPLAN-MEIER ESTIMATOR
• Estimates the survival
function of a population
• Takes into account right
censoring (the event may
occur after the last
observation)
• Mantel-Haenszel test for
determining if 2 survival
functions differ.
Q1 - Which combinations of DB frameworks are
being used simultaneously?
• In most cases (98%), different
frameworks used in a project are
used in combination.
• JDBC is used as the single DB
framework in 56% of all projects.
• Most projects (83%) using
Hibernate also use JDBC.
Hibernate is almost never (3%)
used alone.
• JDBC used as support for the other
FW, providing unique services?
# projects using X fw vsY fw simultaneously
# projects using X andY simultaneously
Q2 - How long do database frameworks ‘survive’ in the projects?
• If a framework is introduced, it remains in
>45% of all the projects.
• Different trends. After 30 days:
• Hibernate disappears from 11.7% of
projects
• Spring disappears from 3.7% of
projects
• Partial order: JPA and Spring have higher
survival rates than JDBC and Hibernate.
• No clear relation between the order
and the popularity of the considered
frameworks.
Survival functions of top 5 FW according
to Kaplan-Meier estimator
p-values for difference tests of survival
functions
Q3 - Does the introduction of a database framework
influence the survivability of another one?
• Visually, the introduction of
some frameworks seems to
have a favorable influence.
• After Bonferroni correction,
no significant difference
in the survival
functions… at project level.
• In the future: file level
analyse.
Survival functions of JDBC with and without Spring
CONCLUSIONS
• Important coexistence
• Especially between JDBC and the other considered DB frameworks.
• JDBC is still often present in projects despite the use frameworks
providing more advanced, higher level services.
• In our study, there is no evidence for competition between frameworks.
• An introduced framework has an ≳45% of chance that it will remain.
• The disappearance of a framework doesn’t seem to be related to the
introduction of an other one.
FUTURE WORK
• Finer granularity: from projects to packages, files, methods, …
• Analysis of (un)frequently used framework services.
• Looking for evolutionary patterns:
• New frameworks introduced for their unique services?
• Simple SQL queries are replaced by framework services, while
complex queries remain as it?
• Extension to frameworks for less structured databases, NoSQL, etc.

More Related Content

Viewers also liked

On the topology of package dependency networks: A comparison of programming l...
On the topology of package dependency networks: A comparison of programming l...On the topology of package dependency networks: A comparison of programming l...
On the topology of package dependency networks: A comparison of programming l...
Tom Mens
 

Viewers also liked (9)

Evolution of database access technologies in Java-based software projects
Evolution of database access technologies in Java-based software projectsEvolution of database access technologies in Java-based software projects
Evolution of database access technologies in Java-based software projects
 
GISMO
GISMOGISMO
GISMO
 
Automated Planning for Resolving Model Inconsistencies - A Scalability Study
Automated Planning for Resolving Model Inconsistencies - A Scalability StudyAutomated Planning for Resolving Model Inconsistencies - A Scalability Study
Automated Planning for Resolving Model Inconsistencies - A Scalability Study
 
On the topology of package dependency networks: A comparison of programming l...
On the topology of package dependency networks: A comparison of programming l...On the topology of package dependency networks: A comparison of programming l...
On the topology of package dependency networks: A comparison of programming l...
 
Executable modeling and simulation of system software and processes
Executable modeling and simulation of system software and processesExecutable modeling and simulation of system software and processes
Executable modeling and simulation of system software and processes
 
Model-based testing of executable statecharts
Model-based testing of executable statechartsModel-based testing of executable statecharts
Model-based testing of executable statecharts
 
Challenges in Software Ecosystem Research
Challenges in Software Ecosystem ResearchChallenges in Software Ecosystem Research
Challenges in Software Ecosystem Research
 
Social and Technical Evolution of the Ruby on Rails Software Ecosystem
Social and Technical Evolution of the Ruby on Rails Software EcosystemSocial and Technical Evolution of the Ruby on Rails Software Ecosystem
Social and Technical Evolution of the Ruby on Rails Software Ecosystem
 
Software Ecosystem Evolution. It's complex!
Software Ecosystem Evolution. It's complex!Software Ecosystem Evolution. It's complex!
Software Ecosystem Evolution. It's complex!
 

More from Tom Mens

Comparing semantic versioning practices in Cargo, npm, Packagist and Rubygems
Comparing semantic versioning practices in Cargo, npm, Packagist and RubygemsComparing semantic versioning practices in Cargo, npm, Packagist and Rubygems
Comparing semantic versioning practices in Cargo, npm, Packagist and Rubygems
Tom Mens
 
Comparing dependency issues across software package distributions (FOSDEM 2020)
Comparing dependency issues across software package distributions (FOSDEM 2020)Comparing dependency issues across software package distributions (FOSDEM 2020)
Comparing dependency issues across software package distributions (FOSDEM 2020)
Tom Mens
 
Empirically Analysing the Socio-Technical Health of Software Package Managers
Empirically Analysing the Socio-Technical Health of Software Package ManagersEmpirically Analysing the Socio-Technical Health of Software Package Managers
Empirically Analysing the Socio-Technical Health of Software Package Managers
Tom Mens
 

More from Tom Mens (20)

How to be(come) a successful PhD student
How to be(come) a successful PhD studentHow to be(come) a successful PhD student
How to be(come) a successful PhD student
 
Recognising bot activity in collaborative software development
Recognising bot activity in collaborative software developmentRecognising bot activity in collaborative software development
Recognising bot activity in collaborative software development
 
A Dataset of Bot and Human Activities in GitHub
A Dataset of Bot and Human Activities in GitHubA Dataset of Bot and Human Activities in GitHub
A Dataset of Bot and Human Activities in GitHub
 
The (r)evolution of CI/CD on GitHub
 The (r)evolution of CI/CD on GitHub The (r)evolution of CI/CD on GitHub
The (r)evolution of CI/CD on GitHub
 
Nurturing the Software Ecosystems of the Future
Nurturing the Software Ecosystems of the FutureNurturing the Software Ecosystems of the Future
Nurturing the Software Ecosystems of the Future
 
Comment programmer un robot en 30 minutes?
Comment programmer un robot en 30 minutes?Comment programmer un robot en 30 minutes?
Comment programmer un robot en 30 minutes?
 
On the rise and fall of CI services in GitHub
On the rise and fall of CI services in GitHubOn the rise and fall of CI services in GitHub
On the rise and fall of CI services in GitHub
 
On backporting practices in package dependency networks
On backporting practices in package dependency networksOn backporting practices in package dependency networks
On backporting practices in package dependency networks
 
Comparing semantic versioning practices in Cargo, npm, Packagist and Rubygems
Comparing semantic versioning practices in Cargo, npm, Packagist and RubygemsComparing semantic versioning practices in Cargo, npm, Packagist and Rubygems
Comparing semantic versioning practices in Cargo, npm, Packagist and Rubygems
 
Lost in Zero Space
Lost in Zero SpaceLost in Zero Space
Lost in Zero Space
 
Evaluating a bot detection model on git commit messages
Evaluating a bot detection model on git commit messagesEvaluating a bot detection model on git commit messages
Evaluating a bot detection model on git commit messages
 
Is my software ecosystem healthy? It depends!
Is my software ecosystem healthy? It depends!Is my software ecosystem healthy? It depends!
Is my software ecosystem healthy? It depends!
 
Bot or not? Detecting bots in GitHub pull request activity based on comment s...
Bot or not? Detecting bots in GitHub pull request activity based on comment s...Bot or not? Detecting bots in GitHub pull request activity based on comment s...
Bot or not? Detecting bots in GitHub pull request activity based on comment s...
 
On the fragility of open source software packaging ecosystems
On the fragility of open source software packaging ecosystemsOn the fragility of open source software packaging ecosystems
On the fragility of open source software packaging ecosystems
 
How magic is zero? An Empirical Analysis of Initial Development Releases in S...
How magic is zero? An Empirical Analysis of Initial Development Releases in S...How magic is zero? An Empirical Analysis of Initial Development Releases in S...
How magic is zero? An Empirical Analysis of Initial Development Releases in S...
 
Comparing dependency issues across software package distributions (FOSDEM 2020)
Comparing dependency issues across software package distributions (FOSDEM 2020)Comparing dependency issues across software package distributions (FOSDEM 2020)
Comparing dependency issues across software package distributions (FOSDEM 2020)
 
Measuring Technical Lag in Software Deployments (CHAOSScon 2020)
Measuring Technical Lag in Software Deployments (CHAOSScon 2020)Measuring Technical Lag in Software Deployments (CHAOSScon 2020)
Measuring Technical Lag in Software Deployments (CHAOSScon 2020)
 
SecoHealth 2019 Research Achievements
SecoHealth 2019 Research AchievementsSecoHealth 2019 Research Achievements
SecoHealth 2019 Research Achievements
 
SECO-Assist 2019 research seminar
SECO-Assist 2019 research seminarSECO-Assist 2019 research seminar
SECO-Assist 2019 research seminar
 
Empirically Analysing the Socio-Technical Health of Software Package Managers
Empirically Analysing the Socio-Technical Health of Software Package ManagersEmpirically Analysing the Socio-Technical Health of Software Package Managers
Empirically Analysing the Socio-Technical Health of Software Package Managers
 

Recently uploaded

Module for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learningModule for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learning
levieagacer
 
Phenolics: types, biosynthesis and functions.
Phenolics: types, biosynthesis and functions.Phenolics: types, biosynthesis and functions.
Phenolics: types, biosynthesis and functions.
Silpa
 
POGONATUM : morphology, anatomy, reproduction etc.
POGONATUM : morphology, anatomy, reproduction etc.POGONATUM : morphology, anatomy, reproduction etc.
POGONATUM : morphology, anatomy, reproduction etc.
Silpa
 
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
Scintica Instrumentation
 
THE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptx
THE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptxTHE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptx
THE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptx
ANSARKHAN96
 
Reboulia: features, anatomy, morphology etc.
Reboulia: features, anatomy, morphology etc.Reboulia: features, anatomy, morphology etc.
Reboulia: features, anatomy, morphology etc.
Silpa
 

Recently uploaded (20)

Site Acceptance Test .
Site Acceptance Test                    .Site Acceptance Test                    .
Site Acceptance Test .
 
Grade 7 - Lesson 1 - Microscope and Its Functions
Grade 7 - Lesson 1 - Microscope and Its FunctionsGrade 7 - Lesson 1 - Microscope and Its Functions
Grade 7 - Lesson 1 - Microscope and Its Functions
 
Genetics and epigenetics of ADHD and comorbid conditions
Genetics and epigenetics of ADHD and comorbid conditionsGenetics and epigenetics of ADHD and comorbid conditions
Genetics and epigenetics of ADHD and comorbid conditions
 
Module for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learningModule for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learning
 
Genome sequencing,shotgun sequencing.pptx
Genome sequencing,shotgun sequencing.pptxGenome sequencing,shotgun sequencing.pptx
Genome sequencing,shotgun sequencing.pptx
 
FAIRSpectra - Enabling the FAIRification of Analytical Science
FAIRSpectra - Enabling the FAIRification of Analytical ScienceFAIRSpectra - Enabling the FAIRification of Analytical Science
FAIRSpectra - Enabling the FAIRification of Analytical Science
 
Use of mutants in understanding seedling development.pptx
Use of mutants in understanding seedling development.pptxUse of mutants in understanding seedling development.pptx
Use of mutants in understanding seedling development.pptx
 
Phenolics: types, biosynthesis and functions.
Phenolics: types, biosynthesis and functions.Phenolics: types, biosynthesis and functions.
Phenolics: types, biosynthesis and functions.
 
POGONATUM : morphology, anatomy, reproduction etc.
POGONATUM : morphology, anatomy, reproduction etc.POGONATUM : morphology, anatomy, reproduction etc.
POGONATUM : morphology, anatomy, reproduction etc.
 
Dr. E. Muralinath_ Blood indices_clinical aspects
Dr. E. Muralinath_ Blood indices_clinical  aspectsDr. E. Muralinath_ Blood indices_clinical  aspects
Dr. E. Muralinath_ Blood indices_clinical aspects
 
Atp synthase , Atp synthase complex 1 to 4.
Atp synthase , Atp synthase complex 1 to 4.Atp synthase , Atp synthase complex 1 to 4.
Atp synthase , Atp synthase complex 1 to 4.
 
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
 
Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.
 
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and SpectrometryFAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
 
THE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptx
THE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptxTHE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptx
THE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptx
 
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
 
TransientOffsetin14CAftertheCarringtonEventRecordedbyPolarTreeRings
TransientOffsetin14CAftertheCarringtonEventRecordedbyPolarTreeRingsTransientOffsetin14CAftertheCarringtonEventRecordedbyPolarTreeRings
TransientOffsetin14CAftertheCarringtonEventRecordedbyPolarTreeRings
 
Reboulia: features, anatomy, morphology etc.
Reboulia: features, anatomy, morphology etc.Reboulia: features, anatomy, morphology etc.
Reboulia: features, anatomy, morphology etc.
 
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIACURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
 
module for grade 9 for distance learning
module for grade 9 for distance learningmodule for grade 9 for distance learning
module for grade 9 for distance learning
 

On the Survival of Database Frameworks in Java Projects

  • 1. ONTHE USE OF DATABASE FRAMEWORKS IN JAVA PROJECTS A LARGE-SCALE HISTORICAL EMPIRICAL ANALYSIS Mathieu Goeminne mathieu.goeminne@umons.ac.be Tom Mens tom.mens@umons.ac.be Software Engineering Lab Alexandre Decan alexandre.decan@umons.ac.be
  • 2. GENERAL CONTEXT • FNRS interuniversity research collaboration with the University of Namur. • Study of software systems that use a database. • Understanding the dynamics, issues and evolution of such systems. • Need to take into account their specificities: • How is their connection managed? • How do the source code and the database co-evolve? • How can we assess and increase the quality of such systems?
  • 4. ONTHIS PRESENTATION • Focus on database frameworks. • Empirical study for answering 3 research questions: • Q1: Which combinations of DB frameworks are being used simultaneously? • Q2: How long do DB frameworks ‘survive’ in the projects that use them? • Q3: Does the introduction of a DB framework influence the survivability of another one?
  • 5. CASE STUDY • 20 Java DB frameworks. • Top 5 retained: JDBC, Spring, JPA, Vaadin, Hibernate. • 13,307 Java projects from the Github Java Corpus (MSR10): 3,707 of them use at least one retained framework)
  • 6. SURVIVAL ANALYSIS: KAPLAN-MEIER ESTIMATOR • Estimates the survival function of a population • Takes into account right censoring (the event may occur after the last observation) • Mantel-Haenszel test for determining if 2 survival functions differ.
  • 7. Q1 - Which combinations of DB frameworks are being used simultaneously? • In most cases (98%), different frameworks used in a project are used in combination. • JDBC is used as the single DB framework in 56% of all projects. • Most projects (83%) using Hibernate also use JDBC. Hibernate is almost never (3%) used alone. • JDBC used as support for the other FW, providing unique services? # projects using X fw vsY fw simultaneously # projects using X andY simultaneously
  • 8. Q2 - How long do database frameworks ‘survive’ in the projects? • If a framework is introduced, it remains in >45% of all the projects. • Different trends. After 30 days: • Hibernate disappears from 11.7% of projects • Spring disappears from 3.7% of projects • Partial order: JPA and Spring have higher survival rates than JDBC and Hibernate. • No clear relation between the order and the popularity of the considered frameworks. Survival functions of top 5 FW according to Kaplan-Meier estimator p-values for difference tests of survival functions
  • 9. Q3 - Does the introduction of a database framework influence the survivability of another one? • Visually, the introduction of some frameworks seems to have a favorable influence. • After Bonferroni correction, no significant difference in the survival functions… at project level. • In the future: file level analyse. Survival functions of JDBC with and without Spring
  • 10. CONCLUSIONS • Important coexistence • Especially between JDBC and the other considered DB frameworks. • JDBC is still often present in projects despite the use frameworks providing more advanced, higher level services. • In our study, there is no evidence for competition between frameworks. • An introduced framework has an ≳45% of chance that it will remain. • The disappearance of a framework doesn’t seem to be related to the introduction of an other one.
  • 11. FUTURE WORK • Finer granularity: from projects to packages, files, methods, … • Analysis of (un)frequently used framework services. • Looking for evolutionary patterns: • New frameworks introduced for their unique services? • Simple SQL queries are replaced by framework services, while complex queries remain as it? • Extension to frameworks for less structured databases, NoSQL, etc.