The document describes an empirical study on specialization in open source software communities. The study analyzes activity and workload data from over 1,300 projects and 5,000 developers in the GNOME desktop environment. Key findings include that most developers and projects concentrate their work in a few activity types, particularly coding, documentation, and translation. The analysis uses metrics like the Gini index to measure specialization at the developer and project level across different activities like coding, documentation, translation, and more. Overall, the study finds evidence that open source work exhibits specialization, with developers and projects focusing more intensively on certain types of activities.
I used these slides during my presentation at BeNeVol 2010 in Lille, France.
Paper:
Vasilescu B, Serebrenik A and van den Brand MGJ (2010), "Comparative study of software metrics' aggregation techniques", In Proceedings of the 9th Belgian-Netherlands Software Evolution Seminar, pp. 80-84.
A historical dataset for the Gnome software ecosystemTom Mens
Presentation of co-authored MSR 2013 paper "A historical dataset for Gnome contributors", presented by Maelick Claes at MSR 2013, San Francisco.
Abstract: We present a dataset of the open source software ecosystem GNOME from a social point of view. We have collected historical data about the contributors to all GNOME projects stored on git.gnome.org, taking into account the problem of identity matching, and as- sociating different activity types to the contributors. This type of information is very useful to complement the traditional, source-code related information one can ob- tain by mining and analyzing the actual source code. The dataset can be obtained at https://bitbucket.org/ mgoeminne/sgl-flossmetric-dbmerge.
Multi-Paradigm Modeling of HMI ApplicationsTom Mens
This presentation about the use of multi-paradigm modeling for human-machine interaction was given at the Universiteit Antwerpen as part of a research seminar of the Belgian PAI research project MoVES (Modeling, Verification and Evolution of Software).
It presents an overview of the planned research in the context of the PhD of Romuald Deshayes, under supervision by Tom Mens at the Software Engineering Lab of the University of Mons, Belgium.
Evaluationg software quality practices in European industryTom Mens
Presentation by Javier Perez (Software Engineering Lab at Département d'Informatique of UMONS). Presented during BENEVOL 2012 seminar on software evolution in Delft, The Netherlands, on December 3, 2012.
Migration patterns of open source ecosystem contributors - An empirical case ...Tom Mens
We present our emerging research on evolving open source software ecosystems, collections of software projects maintained by the same community. Within such an ecosystem we aim to understand how contributors join, leave and move across different projects over time. In particular, we study where new contributors to a given subsystem come from, and where people go to if they stop contributing in this subsystem. We define novel metrics to measure these aspects of projects in the ecosystem. We are carrying out an empirical study on the GNOME ecosystem to study these aspects.
Survivability of software projects in Gnome: A replication studyTom Mens
Presentation by Mathieu Goeminne of joint work with Tom Mens, Uzma Raja and Alexander Serebrenik on a replication study of a survivability model for software projects, applied to the GNOME software ecosystem. Presented during the SATToSE 2014 software evolution research seminar in Italy, July 2014
Statechart modeling of interactive gesture-based applicationsTom Mens
Developing intuitive interactive applications that are easy to maintain by developers is quite challenging, due to the complexity and the many technical aspects involved in such applications. In this article, we tackle the problem in two complementary ways. First, we propose a gestural interface to improve the user experience when interacting with applications that require the manipulation of 3D graphical scenes. Second, we reduce the complexity of developing such applications by modeling their executable behaviour using statecharts. We validate our approach by creating a modular and extensible Java framework for the development of in- teractive gesture-based applications. We developed a proof- of-concept application using this framework, that allows the user to construct and manipulate 3D scenes in OpenGL by using hand gestures only. These hand gestures are captured by the Kinect sensor, and translated into events and actions that are interpreted and executed by communicating statecharts that model the main behaviour of the interactive application.
Evidence for the Pareto principle in open source software activityTom Mens
This document studies the distribution of activity among contributors to open source software projects. It analyzes three projects to determine if the Pareto principle applies, where 20% of contributors account for 80% of activity. Results show activity becomes more unequally distributed over time. For two projects, a limited core group performs most commits, emails, and bug changes. Future work aims to better understand how core groups evolve and identify new active contributors.
I used these slides during my presentation at BeNeVol 2010 in Lille, France.
Paper:
Vasilescu B, Serebrenik A and van den Brand MGJ (2010), "Comparative study of software metrics' aggregation techniques", In Proceedings of the 9th Belgian-Netherlands Software Evolution Seminar, pp. 80-84.
A historical dataset for the Gnome software ecosystemTom Mens
Presentation of co-authored MSR 2013 paper "A historical dataset for Gnome contributors", presented by Maelick Claes at MSR 2013, San Francisco.
Abstract: We present a dataset of the open source software ecosystem GNOME from a social point of view. We have collected historical data about the contributors to all GNOME projects stored on git.gnome.org, taking into account the problem of identity matching, and as- sociating different activity types to the contributors. This type of information is very useful to complement the traditional, source-code related information one can ob- tain by mining and analyzing the actual source code. The dataset can be obtained at https://bitbucket.org/ mgoeminne/sgl-flossmetric-dbmerge.
Multi-Paradigm Modeling of HMI ApplicationsTom Mens
This presentation about the use of multi-paradigm modeling for human-machine interaction was given at the Universiteit Antwerpen as part of a research seminar of the Belgian PAI research project MoVES (Modeling, Verification and Evolution of Software).
It presents an overview of the planned research in the context of the PhD of Romuald Deshayes, under supervision by Tom Mens at the Software Engineering Lab of the University of Mons, Belgium.
Evaluationg software quality practices in European industryTom Mens
Presentation by Javier Perez (Software Engineering Lab at Département d'Informatique of UMONS). Presented during BENEVOL 2012 seminar on software evolution in Delft, The Netherlands, on December 3, 2012.
Migration patterns of open source ecosystem contributors - An empirical case ...Tom Mens
We present our emerging research on evolving open source software ecosystems, collections of software projects maintained by the same community. Within such an ecosystem we aim to understand how contributors join, leave and move across different projects over time. In particular, we study where new contributors to a given subsystem come from, and where people go to if they stop contributing in this subsystem. We define novel metrics to measure these aspects of projects in the ecosystem. We are carrying out an empirical study on the GNOME ecosystem to study these aspects.
Survivability of software projects in Gnome: A replication studyTom Mens
Presentation by Mathieu Goeminne of joint work with Tom Mens, Uzma Raja and Alexander Serebrenik on a replication study of a survivability model for software projects, applied to the GNOME software ecosystem. Presented during the SATToSE 2014 software evolution research seminar in Italy, July 2014
Statechart modeling of interactive gesture-based applicationsTom Mens
Developing intuitive interactive applications that are easy to maintain by developers is quite challenging, due to the complexity and the many technical aspects involved in such applications. In this article, we tackle the problem in two complementary ways. First, we propose a gestural interface to improve the user experience when interacting with applications that require the manipulation of 3D graphical scenes. Second, we reduce the complexity of developing such applications by modeling their executable behaviour using statecharts. We validate our approach by creating a modular and extensible Java framework for the development of in- teractive gesture-based applications. We developed a proof- of-concept application using this framework, that allows the user to construct and manipulate 3D scenes in OpenGL by using hand gestures only. These hand gestures are captured by the Kinect sensor, and translated into events and actions that are interpreted and executed by communicating statecharts that model the main behaviour of the interactive application.
Evidence for the Pareto principle in open source software activityTom Mens
This document studies the distribution of activity among contributors to open source software projects. It analyzes three projects to determine if the Pareto principle applies, where 20% of contributors account for 80% of activity. Results show activity becomes more unequally distributed over time. For two projects, a limited core group performs most commits, emails, and bug changes. Future work aims to better understand how core groups evolve and identify new active contributors.
The Center for Learning and Knowledge Technologies (CeLeKT) at Linnæus University in Sweden conducts research on mobile systems and social media to support collaborative learning. Their current projects include LETS GO on mobile science collaboratories, GeM on mobile math learning, mLearn2go on a learning toolbox, and Gifted Math utilizing classroom technologies. They discussed collaborating with Taiwan researchers on future projects and exchanging students and researchers.
NESSHI and GEPHI: sociology of science as a breeding ground for tool building...Clement Levallois
NESSHI was a project that studied the neuro-turn in social sciences and humanities. As part of the project, the presenter developed several data analysis tools to help with tasks like annotating images, sorting lists alphabetically, overlaying bibliographic maps, and creating networks based on text co-occurrences. Rather than leaving these tools as one-off scripts, the presenter advocates releasing them on platforms like Gephi that make the tools cross-platform, easy to install, and discoverable by a wide audience. This increases the likelihood the tools will be reused by others to support analysis in similar projects.
A community of developers stimulating innovation in uk higher educationDevCSI
This document provides an overview of the DevCSI project, which aims to stimulate innovation in UK higher education by supporting a community of developers. It discusses the types of developers involved, including opportunistic, engineers, and connected developers. It outlines events held by DevCSI to bring developers together, such as hack days and challenges, and how these help developers build skills and solutions. It also discusses how supporting local developers can benefit institutions by empowering users and enabling local innovation. Finally, it shares some statistics on DevCSI's events and community outreach.
GOSPL: A Method and Tool for Fact-Oriented Hybrid Ontology EngineeringChristophe Debruyne
In this paper we present GOSPL, which stands for Grounding Ontologies with Social Processes and Natural Language. GOSPL is a method and tool that supports stakeholders in iteratively interpreting and modeling their common hybrid ontologies using their own terminology for semantic interoperability between autonomously developed and maintained information systems. Hybrid ontologies are ontologies in which concepts are both formally and informally described with the help of a special linguistic resource called glossary. Social interactions between the community members drive the ontology evolution process and result in more stable and agreed upon ontologies.
Christophe Debruyne, Robert Meersman: GOSPL: A Method and Tool for Fact-Oriented Hybrid Ontology Engineering. ADBIS 2012: 153-166
Infinit.e is an open analytics platform built using MongoDB and Hadoop. It allows users to collect, store, enrich, retrieve, analyze and visualize both structured and unstructured documents at scale. The platform utilizes open source technologies like Elasticsearch, MongoDB and Hadoop. MongoDB is used for document storage and development due to its flexible document model and ease of data model changes. Hadoop is used for large-scale analytics due to its proven scalability and availability of machine learning libraries. The presentation concludes with a demonstration of Infinit.e's capabilities.
The document summarizes several projects conducted by Microsoft Research related to scholarly communication. It discusses tools developed to aid scientific research through better data analysis, collaboration, dissemination of research outputs, and archiving of published literature and data. Specific projects highlighted include developing semantic markup and chemical drawing tools in Word 2007, integrating gene expression data with research papers using Word 2007's Open Packaging Conventions format, and establishing workflows for archiving datasets submitted with published articles.
EclipseConEurope2012 SOA - Models As Operational DocumentationMarc Dutoo
At Eclipse Con Europe 2012 in the SOA Symposium track, JWT's EMF model export to structure and information in Document Management Systems is explained and demonstrated for in the case of the EasySOA service documentation registry, with JWT workflows producing a basis for SOA operational documentation.
ArchivesSpace: Building a Next-Generation Archives Management ToolMark Matienzo
Presentation at Digital Library Federation Forum, October 31, 2011, by Katherine Kott, ArchivesSpace Development Manager Mark A. Matienzo, ArchivesSpace Technical Architect.
The document discusses Novell's GroupWise and Vibe collaboration products. It highlights how the products address top priorities for CIOs like collaboration/workflow technologies and mobile access. Demos of features are shown, including social tools, presence and telephony integration, the Vibe user interface, GroupWise integration with Vibe, and mobile apps. Upcoming releases of GroupWise, Vibe, and Messenger are previewed with new features.
The document provides an overview of the key phases involved in a digital project - discover, define, design, develop, deploy & deliver, and maintain. It describes the main actions and lingo associated with each phase to help familiarize beginners with digital production terminology. The discover phase involves understanding requirements, stakeholders, and timelines. Define locks in goals, concepts, audiences and technical specifications. Design creates unique approaches and visuals. Develop includes building, testing, and review. Deploy & deliver implements the build and content. Maintain measures results and identifies optimizations.
Governing services, data, rules, processes and moreRandall Hauch
Randall and Kurt will present how Guvnor is being reborn so that it can manage artifacts from a variety of domains, including web services, data services, business rules and processes, and metadata management. Guvnor not only will storing these artifacts, but it will fully manage their lifecycle, enable search and discovery, and provide insight into how, when and where they can be used. They'll also describe Guvnor's architecture and use of JCR, REST, GWT, Atom, and S-RAMP.
Visual Ontology Modeling for Domain Experts and Business Users with metaphactoryPeter Haase
Visual Ontology Modeling for Domain Experts and Business Users with metaphactory
Presentation at the OntoCommons Workshop on Ontology Engineering Tools @ Fri Mar 19, 2021
This document provides information about the capstone project for the Digital Discipleship Boot Camp (DDBC). It outlines the three phases of the capstone project: proposal, implementation, and final report/application. Phase I involves submitting a project proposal by September 1st that describes the project, goals, audience, timeline, technology to be used, and budget. Phase II is the implementation period from September to November where participants carry out their projects. Phase III involves submitting an essay by December 1st summarizing the DDBC experience and how it will apply to ministry, an article sharing the project for others, and a CEU application. The document also provides rubrics for evaluating the proposals and projects.
The effective daum coursework focused on open source software in Jeju Nationa...Channy Yun
The document summarizes an open source software course at Jeju National University in Korea. The course focused on preparing students for careers at companies like Daum that rely heavily on open source technologies. It discussed how the course helped students gain skills in areas like project management, programming languages, and contributing to open source communities. Evaluations found high participation rates and that most students continued learning about open source after the course.
The document summarizes identified synergies between the AEGIS project and other projects. It lists each project and potential synergies in areas such as personas, ontologies, user models, portals, pilots, development, integration, roadmaps, and standards. For each synergy, it identifies the relevant aspects and issues to be discussed further. The overall goal is to define specific actions and a roadmap to better fulfill synergies across projects.
The document summarizes identified synergies between the AEGIS project and other projects. It lists each project and potential synergies in areas such as personas, ontologies, user models, portals, pilots, development, integration, roadmaps, and standards. For each synergy, it identifies the relevant aspects and issues to be discussed further. The overall goal is to define specific actions and a roadmap to better fulfill synergies across projects.
Knowledge Organization System (KOS) for biodiversity information resources, G...Dag Endresen
Presentation of the Global Biodiversity Information Facility (GBIF) knowledge organization system (KOS) work program for the National Center for Biomedical Ontology (NCBO) Web seminar series in October 2012. Available at http://www.bioontology.org/GBIF-vocabulary-management-for-biodiversity-informatics
When a local project becomes beneficial for the whole community (and vice ver...4Science
Hasselt University developed customizations to the open source repository software DSpace that provided useful new features but did not get integrated into the main DSpace codebase. They partnered with 4Science to develop a submission module that would be aligned with DSpace roadmaps and published as open source. 4Science developed the module over a year, resulting in features like configurable workflows, authentication including Shibboleth, and administrative editing that will be included in the upcoming DSpace 7 release.
A simple natural language interface application for launching applications and showing user information based on voice input processed by using natural language programming concepts
Keynote talk targeted to PhD students, during the BENEVOL 2023 research seminar (focused on software evolution) in Nijmegen, 27 November 2023, by Tom Mens (full professor in software engineering at University of Mons, Belgium). The keynote aims to provide tips, tricks and practical advice on how to become successful as a PhD student.
Recognising bot activity in collaborative software developmentTom Mens
Presentation by Natarajan Chidambaram during the International ICSE Workshop on Bots in Software Engineering (BotSE 2023) in Australia. Joint work with Mehdi Golzadeh, Tom Mens, Alexandre Decan of the Software Engineering Lab of the University of Mons and with Eleni Constantinou.
More Related Content
Similar to An empirical study on the Specialisation Effect in Open Source Communities
The Center for Learning and Knowledge Technologies (CeLeKT) at Linnæus University in Sweden conducts research on mobile systems and social media to support collaborative learning. Their current projects include LETS GO on mobile science collaboratories, GeM on mobile math learning, mLearn2go on a learning toolbox, and Gifted Math utilizing classroom technologies. They discussed collaborating with Taiwan researchers on future projects and exchanging students and researchers.
NESSHI and GEPHI: sociology of science as a breeding ground for tool building...Clement Levallois
NESSHI was a project that studied the neuro-turn in social sciences and humanities. As part of the project, the presenter developed several data analysis tools to help with tasks like annotating images, sorting lists alphabetically, overlaying bibliographic maps, and creating networks based on text co-occurrences. Rather than leaving these tools as one-off scripts, the presenter advocates releasing them on platforms like Gephi that make the tools cross-platform, easy to install, and discoverable by a wide audience. This increases the likelihood the tools will be reused by others to support analysis in similar projects.
A community of developers stimulating innovation in uk higher educationDevCSI
This document provides an overview of the DevCSI project, which aims to stimulate innovation in UK higher education by supporting a community of developers. It discusses the types of developers involved, including opportunistic, engineers, and connected developers. It outlines events held by DevCSI to bring developers together, such as hack days and challenges, and how these help developers build skills and solutions. It also discusses how supporting local developers can benefit institutions by empowering users and enabling local innovation. Finally, it shares some statistics on DevCSI's events and community outreach.
GOSPL: A Method and Tool for Fact-Oriented Hybrid Ontology EngineeringChristophe Debruyne
In this paper we present GOSPL, which stands for Grounding Ontologies with Social Processes and Natural Language. GOSPL is a method and tool that supports stakeholders in iteratively interpreting and modeling their common hybrid ontologies using their own terminology for semantic interoperability between autonomously developed and maintained information systems. Hybrid ontologies are ontologies in which concepts are both formally and informally described with the help of a special linguistic resource called glossary. Social interactions between the community members drive the ontology evolution process and result in more stable and agreed upon ontologies.
Christophe Debruyne, Robert Meersman: GOSPL: A Method and Tool for Fact-Oriented Hybrid Ontology Engineering. ADBIS 2012: 153-166
Infinit.e is an open analytics platform built using MongoDB and Hadoop. It allows users to collect, store, enrich, retrieve, analyze and visualize both structured and unstructured documents at scale. The platform utilizes open source technologies like Elasticsearch, MongoDB and Hadoop. MongoDB is used for document storage and development due to its flexible document model and ease of data model changes. Hadoop is used for large-scale analytics due to its proven scalability and availability of machine learning libraries. The presentation concludes with a demonstration of Infinit.e's capabilities.
The document summarizes several projects conducted by Microsoft Research related to scholarly communication. It discusses tools developed to aid scientific research through better data analysis, collaboration, dissemination of research outputs, and archiving of published literature and data. Specific projects highlighted include developing semantic markup and chemical drawing tools in Word 2007, integrating gene expression data with research papers using Word 2007's Open Packaging Conventions format, and establishing workflows for archiving datasets submitted with published articles.
EclipseConEurope2012 SOA - Models As Operational DocumentationMarc Dutoo
At Eclipse Con Europe 2012 in the SOA Symposium track, JWT's EMF model export to structure and information in Document Management Systems is explained and demonstrated for in the case of the EasySOA service documentation registry, with JWT workflows producing a basis for SOA operational documentation.
ArchivesSpace: Building a Next-Generation Archives Management ToolMark Matienzo
Presentation at Digital Library Federation Forum, October 31, 2011, by Katherine Kott, ArchivesSpace Development Manager Mark A. Matienzo, ArchivesSpace Technical Architect.
The document discusses Novell's GroupWise and Vibe collaboration products. It highlights how the products address top priorities for CIOs like collaboration/workflow technologies and mobile access. Demos of features are shown, including social tools, presence and telephony integration, the Vibe user interface, GroupWise integration with Vibe, and mobile apps. Upcoming releases of GroupWise, Vibe, and Messenger are previewed with new features.
The document provides an overview of the key phases involved in a digital project - discover, define, design, develop, deploy & deliver, and maintain. It describes the main actions and lingo associated with each phase to help familiarize beginners with digital production terminology. The discover phase involves understanding requirements, stakeholders, and timelines. Define locks in goals, concepts, audiences and technical specifications. Design creates unique approaches and visuals. Develop includes building, testing, and review. Deploy & deliver implements the build and content. Maintain measures results and identifies optimizations.
Governing services, data, rules, processes and moreRandall Hauch
Randall and Kurt will present how Guvnor is being reborn so that it can manage artifacts from a variety of domains, including web services, data services, business rules and processes, and metadata management. Guvnor not only will storing these artifacts, but it will fully manage their lifecycle, enable search and discovery, and provide insight into how, when and where they can be used. They'll also describe Guvnor's architecture and use of JCR, REST, GWT, Atom, and S-RAMP.
Visual Ontology Modeling for Domain Experts and Business Users with metaphactoryPeter Haase
Visual Ontology Modeling for Domain Experts and Business Users with metaphactory
Presentation at the OntoCommons Workshop on Ontology Engineering Tools @ Fri Mar 19, 2021
This document provides information about the capstone project for the Digital Discipleship Boot Camp (DDBC). It outlines the three phases of the capstone project: proposal, implementation, and final report/application. Phase I involves submitting a project proposal by September 1st that describes the project, goals, audience, timeline, technology to be used, and budget. Phase II is the implementation period from September to November where participants carry out their projects. Phase III involves submitting an essay by December 1st summarizing the DDBC experience and how it will apply to ministry, an article sharing the project for others, and a CEU application. The document also provides rubrics for evaluating the proposals and projects.
The effective daum coursework focused on open source software in Jeju Nationa...Channy Yun
The document summarizes an open source software course at Jeju National University in Korea. The course focused on preparing students for careers at companies like Daum that rely heavily on open source technologies. It discussed how the course helped students gain skills in areas like project management, programming languages, and contributing to open source communities. Evaluations found high participation rates and that most students continued learning about open source after the course.
The document summarizes identified synergies between the AEGIS project and other projects. It lists each project and potential synergies in areas such as personas, ontologies, user models, portals, pilots, development, integration, roadmaps, and standards. For each synergy, it identifies the relevant aspects and issues to be discussed further. The overall goal is to define specific actions and a roadmap to better fulfill synergies across projects.
The document summarizes identified synergies between the AEGIS project and other projects. It lists each project and potential synergies in areas such as personas, ontologies, user models, portals, pilots, development, integration, roadmaps, and standards. For each synergy, it identifies the relevant aspects and issues to be discussed further. The overall goal is to define specific actions and a roadmap to better fulfill synergies across projects.
Knowledge Organization System (KOS) for biodiversity information resources, G...Dag Endresen
Presentation of the Global Biodiversity Information Facility (GBIF) knowledge organization system (KOS) work program for the National Center for Biomedical Ontology (NCBO) Web seminar series in October 2012. Available at http://www.bioontology.org/GBIF-vocabulary-management-for-biodiversity-informatics
When a local project becomes beneficial for the whole community (and vice ver...4Science
Hasselt University developed customizations to the open source repository software DSpace that provided useful new features but did not get integrated into the main DSpace codebase. They partnered with 4Science to develop a submission module that would be aligned with DSpace roadmaps and published as open source. 4Science developed the module over a year, resulting in features like configurable workflows, authentication including Shibboleth, and administrative editing that will be included in the upcoming DSpace 7 release.
A simple natural language interface application for launching applications and showing user information based on voice input processed by using natural language programming concepts
Similar to An empirical study on the Specialisation Effect in Open Source Communities (20)
Keynote talk targeted to PhD students, during the BENEVOL 2023 research seminar (focused on software evolution) in Nijmegen, 27 November 2023, by Tom Mens (full professor in software engineering at University of Mons, Belgium). The keynote aims to provide tips, tricks and practical advice on how to become successful as a PhD student.
Recognising bot activity in collaborative software developmentTom Mens
Presentation by Natarajan Chidambaram during the International ICSE Workshop on Bots in Software Engineering (BotSE 2023) in Australia. Joint work with Mehdi Golzadeh, Tom Mens, Alexandre Decan of the Software Engineering Lab of the University of Mons and with Eleni Constantinou.
A Dataset of Bot and Human Activities in GitHubTom Mens
Presentation at the IEEE International Conference on Mining Software Repositories (MSR 2023) by Natarajan Chidambaram (Software Engineering Lab, University of Mons, Belgium) of a dataset of bot and human activities extracted from GitHub
This document discusses the rise of GitHub Actions (GHA) as a dominant continuous integration (CI) service based on a longitudinal study of 91,810 GitHub repositories. The study analyzed the evolution and usage of seven popular CI services over nine years, focusing on their co-usage and migration patterns. The study provides statistical evidence that GHA became the most used CI service within 18 months of its introduction, coinciding with a decrease in Travis usage likely due to policy changes and migrations to GHA. Interviews with software practitioners revealed competition between services and reasons for co-using or migrating between alternatives.
Nurturing the Software Ecosystems of the FutureTom Mens
In January 2018, four Software Engineering research groups located in different Belgian Universities launched a five year research project to nurture the software ecosystems of the future. We assembled a diverse team of about a dozen researchers and embarked on an exciting journey leading to a rich and diverse suite of papers, tools and datasets. Halfway into the project the corona pandemic intervened, but despite several months of lockdown, we succeeded in increasing inter-university collaboration. In this paper we share our achievements so that the BENEVOL community may benefit from our experience.
Comment programmer un robot en 30 minutes?Tom Mens
Comment apprendre à programmer un robot en 30 minutes? Atelier organisé par Tom Mens (en collaboration avec Pierre Zielinski, Gauvain Devillez et Sebastien Bonte) lors des Journées Math-Sciences du Printemps des Sciences 2022 à l'Université de Mons
On the rise and fall of CI services in GitHubTom Mens
Presentation of SANER 2022 conference article "On the rise and fall of CI services in GitHub" by Mehdi Golzadeh (co-authored with Alexandre Decan and Tom Mens).
On backporting practices in package dependency networksTom Mens
Presentation at FOSDEM 2022 Composition and Dependency Management DevRoom of empirical research on backporting practices in package dependency networks, published in the IEEE Transactions in Software Engineering in 2021 (https://doi.org/10.1109/TSE.2021.3112204)
Joint work by Alexandre Decan, Tom Mens; Ahmed Zeourali, Coen De Roover as part of the Belgian Excellence of Science research project SECOASSIST (https://secoassist.github.io)
Comparing semantic versioning practices in Cargo, npm, Packagist and RubygemsTom Mens
Presentation by Tom Mens at PackagingCon 2021 on Wednesday 10 November 2021.
Abstract: Semantic versioning (semver) is a commonly accepted open source practice, used by many package management systems to inform whether new package releases introduce possibly backward incompatible changes. Maintainers depending on such packages can use this practice to reduce the risk of breaking changes in their own packages by specifying version constraints on their dependencies. Depending on the amount of control a package maintainer desires to assert over her package dependencies, these constraints can range from very permissive to very restrictive. We empirically compared the evolution of semver compliance in four package management systems: Cargo, npm, Packagist and Rubygems. We discuss to what extent ecosystem-specific characteristics influence the degree of semver compliance, and we suggest to develop tools adopting the wisdom of the crowds to help package maintainers decide which type of version constraints they should impose on their dependencies.
We also studied to which extent the packages distributed by these package managers are still using a 0.y.z release, suggesting less stable and immature packages. We explore the effect of such "major zero" packages on semantic versioning adoption.
Our findings shed insight in some important differences between package managers with respect to package versioning policies.
Our empirical results have been published in two peer-reviewed academic journals: the IEEE Transactions in Software Engineering (https://doi.org/10.1109/TSE.2019.2918315) and Elsevier Science of Computer Programming (https://doi.org/10.1016/j.scico.2021.102656).
Achknowledgments: Research conducted in the context of the SECOASSIST "Excellence of Science" Research Project.
Presentation by Tom Mens at FOSDEM21 (Free Open Source Developers Meeting, February 2021). Published in Science of Computer Programming, August 2021.
https://doi.org/10.1016/j.scico.2021.102656
Abstract: When developing open source software end-user applications or reusable software packages, developers depend on software packages distributed through package managers such as npm, Packagist, Cargo, RubyGems. In addition to this, empirical evidence has shown that these package managers adhere to a large extent to semantic versioning principles. Packages that are still in major version zero are considered unstable according to semantic versioning, as some developers consider such packages as immature, still being under initial development.
This presentation reports on large-scale empirical evidence on the use of dependencies towards 0.y.z versions in four different software package distributions: Cargo, npm, Packagist and RubyGems. We study to which extent packages get stuck in the zero version space, never crossing the psychological barrier of major version zero. We compare the effect of the policies and practices of package managers on this phenomenon. We do not reveal the results of our findings in this abstract yet, as it would spoil the fun of the presentation.
Evaluating a bot detection model on git commit messagesTom Mens
Detecting the presence of bots in distributed software development activity is very important in order to prevent bias in socio-technical empirical studies. In previous work, we proposed a classification model to detect bots in GitHub repositories based on the pull request and issue comments of GitHub accounts. The current study generalises the approach to git contributors based on their commit messages. We train and evaluate the classification model on a large dataset of 6,922 git contributors. The original model based on pull request and issue comments obtained a precision of 0.77 on this dataset, whereas retraining the classification model on git commit messages increased the precision to 0.80. As a proof-of-concept, we implemented this model in BoDeGiC, an open source command-line tool to detect bots in git repositories.
Is my software ecosystem healthy? It depends!Tom Mens
QUATIC 2020 keynote presentation by Tom Mens (University of Mons) on dependency-related health issues in software ecosystems and research advances to address such health issues. Part of the presented research has been conducted as part of the Belgian SECO-ASSIST Excellence of Science Research Project.
Bot or not? Detecting bots in GitHub pull request activity based on comment s...Tom Mens
Presentation by Mehdi Golzadeh (Software Engineering Lab, University of Mons) of an article published at the 2nd International ICSE Workshop on Bots In Software Engineering (BotSE). See https://doi.org/10.1145/3387940.3391503
Abstract: Many empirical studies focus on socio-technical activity in social coding platforms such as GitHub, for example to study the onboarding, abandonment, productivity and collaboration among team members. Such studies face the difficulty that GitHub activity can also be generated automatically by bots of a different nature. It therefore becomes imperative to distinguish such bots from human users. We propose an automated approach to detect bots in GitHub pull request activity. Relying on the assumption that bots contain repetitive message patterns in their pull request comments, we analyse the similarity between multiple messages from the same GitHub identity, using a clustering method that combines the Jaccard and Levenshtein distance. We empirically evaluate our approach by analysing 20,090 comments of 250 users and 42 bots in 1,262 GitHub repositories. Our results show that the method is able to clearly separate bots from human users.
How magic is zero? An Empirical Analysis of Initial Development Releases in S...Tom Mens
1. 0.y.z packages are highly prevalent, contributing to 90% of packages in some distributions even though documentation states they are for initial development.
2. It generally takes a few months for packages to reach ≥1.0.0 but 20% take over a year, suggesting packages get stuck in 0.y.z.
3. 0.y.z packages are updated slightly more frequently but the difference is negligible, and there is little practical difference in how 0.y.z and ≥1.0.0 packages are used.
Comparing dependency issues across software package distributions (FOSDEM 2020)Tom Mens
This talk reports on our findings based on multiple empirical studies that we have conducted to understand different aspects of dependency management and their practical implications. This includes:
* the outdatedness of package dependencies, the transitive impact of such "technical lag", and its relation to the presence of bugs and security vulnerabilities.
* the impact of using either more permissive or more restrictive version contraints on dependencies.
* the virtues and limitations of being compliant to semantic versioning, a common policy to inform dependents whether new releases of software packages introduce possibly backward incompatible changes.
* the impact of specific characteristics, policies and tools used by the packaging ecosystem and its supporting community on all of the above.
The contents of the talk is primarily based on the following peer-reviewed scientific articles:
* What do package dependencies tell us about semantic versioning? Alexandre Decan, Tom Mens. IEEE Transactions on Software Engineering, 2019. https://doi.org/10.1109/TSE.2019.2918315
* An empirical comparison of dependency network evolution in seven software packaging ecosystems. Alexandre Decan, Tom Mens, Philippe Grosjean. Empirical Software Engineering 24(1):381-416, 2019. https://doi.org/10.1007/s10664-017-9589-y
* A formal framework for measuring technical lag in component repositories and its application to npm. Ahmed Zerouali, Tom Mens, Jesus Gonzalez‐Barahona, Alexandre Decan, Eleni Constantinou, Gregorio Robles. Journal of Software: Evolution and Process 31(8), 2019. https://doi.org/10.1002/smr.2157
* On the Impact of Security Vulnerabilities in the npm Package Dependency Network. Alexandre Decan, Tom Mens, Eleni Constantinou. International Conference on Mining Software Repositories, 2018. https://doi.org/10.1145/3196398.3196401
* On the Evolution of Technical Lag in the npm Package Dependency Network. Alexandre Decan, Tom Mens, Eleni Constantinou. International Conference on Software Maintenance and Evolution, 2018. https://doi.org/10.1109/ICSME.2018.00050
Measuring Technical Lag in Software Deployments (CHAOSScon 2020)Tom Mens
Presentation at CHAOSSCon Europe 2020 about the generic technical lag software measurement framework. Technical lag measures the increasing difference between deployed software components and the ideal upstream software components.
For more information, see https://doi.org/10.1002/smr.2157
This presentation reports on the research results achieved in the context of the interuniversity interdisciplinary research project SECOHealth "Vers une méthodologie et analyse socio-technique interdisciplinaire de la santé des écosystèmes logiciels" co-financed by FRS-FNRS Belgium and FRQ (FRSC - FRNT, Québec) with principal investigators Tom Mens (UMONS), Bram Adams (Polytechnique Montréal) and Josianne Marsan (Université Laval).
Introduction to the research seminar on empirical analysis of open source software ecosystems, organised by the SECO-ASSIST "excellence of science" research project, on September 4th, 2019 at the University of Mons, Belgium. With invited presentations by Alexander Serebrenik, Jesus Gonzalez-Barahona, Dario Di Nucci and Henrique Nucci. The seminar concludes with the public PhD defense of Ahmed Zerouali (supervised by Tom Mens) on the topic of "A Measurement Framework for Analyzing Technical Lag in Open-Source Software Ecosystems"
Empirically Analysing the Socio-Technical Health of Software Package ManagersTom Mens
Invited presentation at Concordia University (Montreal, Canada) by Eleni Constantinou and Tom Mens on recent research about the socio-technical health issues in software package management ecosystems.
Abstract: The large majority of today’s software is relying on open software software components. Such components are typically distributed through package managers for a wide variety of programming languages, and developed and maintained through online distributed software development services like GitHub. Software component repositories are perceived as software ecosystems that constitute complex and evolving socio-technical software dependency networks. Because of their complexity and evolution, these ecosystems tend to suffer from a wide variety of software health issues that can be either technical or social in nature. Examples of such issues include the ecosystem fragility due to exponential growth and transitive dependencies; the abundance of outdated, unmaintained or obsolete software components; the prolonged presence of unfixed bugs and security vulnerabilities; the abandonment or high turnover of key contributors, suboptimal collaboration between contributors, and many more. This presentation will report on our past and ongoing empirical research that studies such health factors within and across different software packaging ecosystems (such as npm, RubyGems, Cargo, CRAN, CPAN). We provide empirical evidence of some of the health problems, compare their presence across different ecosystems, and suggest ways to reduce their potential impact by providing concrete guidelines and tools. The presented research Is being conducted by researchers of the Software Engineering Lab at the University of Mons in the context of two ongoing projects SECOHealth and SECO-ASSIST, aiming to analyse and improve the health of software ecosystems.
A review of the growth of the Israel Genealogy Research Association Database Collection for the last 12 months. Our collection is now passed the 3 million mark and still growing. See which archives have contributed the most. See the different types of records we have, and which years have had records added. You can also see what we have for the future.
The simplified electron and muon model, Oscillating Spacetime: The Foundation...RitikBhardwaj56
Discover the Simplified Electron and Muon Model: A New Wave-Based Approach to Understanding Particles delves into a groundbreaking theory that presents electrons and muons as rotating soliton waves within oscillating spacetime. Geared towards students, researchers, and science buffs, this book breaks down complex ideas into simple explanations. It covers topics such as electron waves, temporal dynamics, and the implications of this model on particle physics. With clear illustrations and easy-to-follow explanations, readers will gain a new outlook on the universe's fundamental nature.
How to Build a Module in Odoo 17 Using the Scaffold MethodCeline George
Odoo provides an option for creating a module by using a single line command. By using this command the user can make a whole structure of a module. It is very easy for a beginner to make a module. There is no need to make each file manually. This slide will show how to create a module using the scaffold method.
Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...Dr. Vinod Kumar Kanvaria
Exploiting Artificial Intelligence for Empowering Researchers and Faculty,
International FDP on Fundamentals of Research in Social Sciences
at Integral University, Lucknow, 06.06.2024
By Dr. Vinod Kumar Kanvaria
This slide is special for master students (MIBS & MIFB) in UUM. Also useful for readers who are interested in the topic of contemporary Islamic banking.
A Strategic Approach: GenAI in EducationPeter Windle
Artificial Intelligence (AI) technologies such as Generative AI, Image Generators and Large Language Models have had a dramatic impact on teaching, learning and assessment over the past 18 months. The most immediate threat AI posed was to Academic Integrity with Higher Education Institutes (HEIs) focusing their efforts on combating the use of GenAI in assessment. Guidelines were developed for staff and students, policies put in place too. Innovative educators have forged paths in the use of Generative AI for teaching, learning and assessments leading to pockets of transformation springing up across HEIs, often with little or no top-down guidance, support or direction.
This Gasta posits a strategic approach to integrating AI into HEIs to prepare staff, students and the curriculum for an evolving world and workplace. We will highlight the advantages of working with these technologies beyond the realm of teaching, learning and assessment by considering prompt engineering skills, industry impact, curriculum changes, and the need for staff upskilling. In contrast, not engaging strategically with Generative AI poses risks, including falling behind peers, missed opportunities and failing to ensure our graduates remain employable. The rapid evolution of AI technologies necessitates a proactive and strategic approach if we are to remain relevant.
Physiology and chemistry of skin and pigmentation, hairs, scalp, lips and nail, Cleansing cream, Lotions, Face powders, Face packs, Lipsticks, Bath products, soaps and baby product,
Preparation and standardization of the following : Tonic, Bleaches, Dentifrices and Mouth washes & Tooth Pastes, Cosmetics for Nails.
Thinking of getting a dog? Be aware that breeds like Pit Bulls, Rottweilers, and German Shepherds can be loyal and dangerous. Proper training and socialization are crucial to preventing aggressive behaviors. Ensure safety by understanding their needs and always supervising interactions. Stay safe, and enjoy your furry friends!
A Survey of Techniques for Maximizing LLM Performance.pptx
An empirical study on the Specialisation Effect in Open Source Communities
1. An empirical study on the
Specialisation Effect in
Open Source Communities
Mathieu Goeminne & Tom Mens
University of Mons
Bogdan Vasilescu & Alexander Serebrenik
Eindhoven University of Technology
1
2. Our case study: Gnome
• A large ecosystem
• ~1,300 projects having a common
characteristic : integration in the same GNU/
Linux Desktop environment
• A large developer community
• > 5,000 involved authors in the Git repositories
• A set of activity types
• coding, documentation, translation, etc.
BENEVOL 2011 Empirical Study on Specialisation in FLOSS communities 08/12/11
2
3. Methodology
• Goal-Question-Metric approach:
• Define research goals
• Define questions to reach these goals
• Define metrics the answer these
questions
• Use metrics to statistically verify
hypotheses about the questions
BENEVOL 2011 Empirical Study on Specialisation in FLOSS communities 08/12/11
3
4. Research Goals:
understand
1. how development effort varies across
projects;
2. how development effort varies across
developers;
3. how the type of development activity
relates to the development effort.
BENEVOL 2011 Empirical Study on Specialisation in FLOSS communities 08/12/11
4
5. Research Questions
• Are the projects containing more activity
types more active? [Goal 1]
• How specialised are the projects towards
different activity types? [Goal 1]
• What influences the project specialisation?
To which extent are developers specialised
in different activity types? [Goal 2]
• etc.
BENEVOL 2011 Empirical Study on Specialisation in FLOSS communities 08/12/11
5
6. Workload and
Involvement Metrics
• workload APTW(p,d,t)
• # files in project p touched by developer d having touched a file of
activity type t
• involvement APTI(p,d,t)
• 1 if APTW(p,d,t) > 0; 0 otherwise
• Higher level metrics : aggregation over all projects, developers and/or
activity types. Ex:
• NPD(d) = Number of projects in which developer d is involved
• PW(p) = Global workload of project p
• RPTW(p,t) = Workload in p w.r.t activity type t, relative to the global
project workload
• PWS(p) = Specialisation (imbalance or unequal distribution) of
workload across project activity types (using Gini index).
BENEVOL 2011 Empirical Study on Specialisation in FLOSS communities 08/12/11
6
7. The Gini Index
Economic measure of inequality
• Aggregates a collection of values (e.g.
incomes in a country)
• A value between 0 and 1 :
• 0 if everybody has the same income
• 1 if a person has all the income and the
others don’t have anything.
BENEVOL 2011 Empirical Study on Specialisation in FLOSS communities 08/12/11
7
8. Experimental setup
• Extract data from Gnome’s Git project repositories
• Populate databases using this data
• Detect multiple logins belonging the same ‘real’
developer thanks to an identity matching algorithm.
• Define activity types carried out by project
developers
• Compute metrics
• Verify statistical hypotheses
BENEVOL 2011 Empirical Study on Specialisation in FLOSS communities 08/12/11
8
9. Activity types
• 13 activity types, including:
• coding, translation, documentation, building,
development documentation.
• File classification: based on the file path, name
and extension and a set of rules. A rule is a type
and a regular expression.
• Ex: (doc, ‘.*/DOC(-?)BOOK(S?)/.*’)
BENEVOL 2011 Empirical Study on Specialisation in FLOSS communities 08/12/11
9
10. How is the activity distributed
across activity types?
0.4
0.3
0.2
0.1
0.0
develdoc
build
doc
unknown
test
code
translate
mmedia
library
image
config
ui
meta
database
BENEVOL 2011 Empirical Study on Specialisation in FLOSS communities 08/12/11
10
11. Developer workload distribution
Imbalance by activity type
RAWS(p)
● ● ● ●●●●●
● ●● ●●●●
●●●●●
●●●●
●●●●
● ●
0.0 0.2 0.4 0.6 0.8 1.0
Most developers concentrate their
workload in few activities
BENEVOL 2011 Empirical Study on Specialisation in FLOSS communities 08/12/11
11
12. Project workload distribution
Imbalance by activity type
RPWS(p)
● ●●●
●●
0.0 0.2 0.4 0.6 0.8 1.0
Most projects concentrate their
workload in few activities
BENEVOL 2011 Empirical Study on Specialisation in FLOSS communities 08/12/11
12
17. Future work
• Take into account other factors
• time: How do our metrics and their relations evolve
over time?
• project categories (e.g. archived, deprecated, ...)
• project age and maturity
• main programming language
• ...
• Develop a dashboard tool to detect, prevent and predict
health problems in evolving software ecosystems
BENEVOL 2011 Empirical Study on Specialisation in FLOSS communities 08/12/11
17
18. Thank you
BENEVOL 2011 Empirical Study on Specialisation in FLOSS communities 08/12/11
18