A CDE seminar held on 19/4/11: Clare Sansom, structural biologist, Web 2.0 enthusiast and CDE Fellow, will then illustrate how the immersive virtual world, Second Life, can be used to illustrate molecular structures and teach molecular sciences, and discuss its application in teaching other highly "visual" disciplines.
BioCuration 2019 - Evidence and Conclusion Ontology 2019 Updatedolleyj
The Evidence and Conclusion Ontology (ECO) describes types of evidence relevant to biological investigations. First developed in the early 2000s, ECO now consists of over 1700 defined classes and is used by a large, and growing, list of resources. ECO imports close to 1000 classes from the Ontology for Biomedical Investigations and the Gene Ontology for use in logical definitions. Historically, ECO terms have generally been categorized by either the biological context of the evidence (e.g. gene expression) or the technique used to generate the evidence (e.g. PCR-based evidence). The result is that sometimes terms that have related biological context are found under different unrelated nodes. To address this, we have been performing a rigorous review of the structure and logic of the branches of ECO. Working with additional input from collaborators through the issue tracker on GitHub, term labels, definitions, and relationships are being evaluated and updated. The goal of these changes is to increase the logical consistency of ECO, make it easier for users to find and understand terms, and allow for ECO to continue to grow and support its users. In addition to the structural review, we have been working with CollecTF to utilize ECO for automated text mining. To generate a curated corpus for this effort, we have been annotating ECO terms to sentences which contain evidence-based assertions about gene products, taxonomic entities, and sequence features. From this effort we have developed clearly-defined annotation guidelines that have been passed on to a team of undergraduates who are continuing the curation effort.
Annotations are limited to single sentences, or to two consecutive sentences, containing the evidence instance and assertion clause. The quality of the mapping to ECO
and the strength of the author’s assertion are also captured. ECO is freely available at http://evidenceontology.org/ and https://github.com/evidenceontology.
ICBO 2018 Poster - Current Development in the Evidence and Conclusion Ontolog...dolleyj
The Evidence & Conclusion Ontology (ECO) has been developed to provide standardized descriptions for types of evidence within the biological domain. Best
practices in biocuration require that when a biological assertion is made (e.g. linking a Gene Ontology (GO) term for a molecular function to a protein), the type of evidence
supporting it is captured. In recent development efforts, we have been working with other ontology groups to ensure that ECO classes exist for the types of curation they
support. These include the Ontology for Microbial Phenotypes and GO. In addition, we continue to support user-level class requests through our GitHub issue tracker. To
facilitate the addition and maintenance of new classes, we utilize ROBOT (a command line tool for working with Open Biomedical Ontologies) as part of our standard workflow.
ROBOT templates allow us to define classes in a spreadsheet and convert them to Web Ontology Language (OWL) axioms, which can then be merged into ECO. ROBOT is
also part of our automated release process. Additionally, we are engaged in ongoing work to map ECO classes to Ontology for Biomedical Investigation classes using logical
definitions. ECO is currently in use by dozens of groups engaged in biological curation and the number of ECO users continues to grow. The ontology, in OWL and Open
Biomedical Ontology (OBO) formats, and associated resources can be accessed through our GitHub site (https://github.com/evidenceontology/evidenceontology) as well as
the ECO web page (http://evidenceontology.org/).
FAIR Data and Model Management for Systems Biology(and SOPs too!)Carole Goble
MultiScale Biology Network Springboard meeting, Nottingham, UK, 1 June 2015
FAIR Data and model management for Systems Biology
Over the past 5 years we have seen a change in expectations for the management of all the outcomes of research – that is the “assets” of data, models, codes, SOPs and so forth. Don’t stop reading. Yes, data management isn’t likely to win anyone a Nobel prize. But publications should be supported and accompanied by data, methods, procedures, etc. to assure reproducibility of results. Funding agencies expect data (and increasingly software) management retention and access plans as part of the proposal process for projects to be funded. Journals are raising their expectations of the availability of data and codes for pre- and post- publication. And the multi-component, multi-disciplinary nature of Systems Biology demands the interlinking and exchange of assets and the systematic recording of metadata for their interpretation.
Data and model management for the Systems Biology community is a multi-faceted one including: the development and adoption appropriate community standards (and the navigation of the standards maze); the sustaining of international public archives capable of servicing quantitative biology; and the development of the necessary tools and know-how for researchers within their own institutes so that they can steward their assets in a sustainable, coherent and credited manner while minimizing burden and maximising personal benefit.
The FAIRDOM (Findable, Accessible, Interoperable, Reusable Data, Operations and Models) Initiative has grown out of several efforts in European programmes (SysMO and EraSysAPP ERANets and the ISBE ESRFI) and national initiatives (de.NBI, German Virtual Liver Network, SystemsX, UK SynBio centres). It aims to support Systems Biology researchers with data and model management, with an emphasis on standards smuggled in by stealth.
This talk will use the FAIRDOM Initiative to discuss the FAIR management of data, SOPs, and models for Sys Bio, highlighting the challenges multi-scale biology presents.
http://www.fair-dom.org
http://www.fairdomhub.org
http://www.seek4science.org
Being Reproducible: SSBSS Summer School 2017Carole Goble
Lecture 2:
Being Reproducible: Models, Research Objects and R* Brouhaha
Reproducibility is a R* minefield, depending on whether you are testing for robustness (rerun), defence (repeat), certification (replicate), comparison (reproduce) or transferring between researchers (reuse). Different forms of "R" make different demands on the completeness, depth and portability of research. Sharing is another minefield raising concerns of credit and protection from sharp practices.
In practice the exchange, reuse and reproduction of scientific experiments is dependent on bundling and exchanging the experimental methods, computational codes, data, algorithms, workflows and so on along with the narrative. These "Research Objects" are not fixed, just as research is not “finished”: the codes fork, data is updated, algorithms are revised, workflows break, service updates are released. ResearchObject.org is an effort to systematically support more portable and reproducible research exchange.
In this talk I will explore these issues in more depth using the FAIRDOM Platform and its support for reproducible modelling. The talk will cover initiatives and technical issues, and raise social and cultural challenges.
A CDE seminar held on 19/4/11: Clare Sansom, structural biologist, Web 2.0 enthusiast and CDE Fellow, will then illustrate how the immersive virtual world, Second Life, can be used to illustrate molecular structures and teach molecular sciences, and discuss its application in teaching other highly "visual" disciplines.
BioCuration 2019 - Evidence and Conclusion Ontology 2019 Updatedolleyj
The Evidence and Conclusion Ontology (ECO) describes types of evidence relevant to biological investigations. First developed in the early 2000s, ECO now consists of over 1700 defined classes and is used by a large, and growing, list of resources. ECO imports close to 1000 classes from the Ontology for Biomedical Investigations and the Gene Ontology for use in logical definitions. Historically, ECO terms have generally been categorized by either the biological context of the evidence (e.g. gene expression) or the technique used to generate the evidence (e.g. PCR-based evidence). The result is that sometimes terms that have related biological context are found under different unrelated nodes. To address this, we have been performing a rigorous review of the structure and logic of the branches of ECO. Working with additional input from collaborators through the issue tracker on GitHub, term labels, definitions, and relationships are being evaluated and updated. The goal of these changes is to increase the logical consistency of ECO, make it easier for users to find and understand terms, and allow for ECO to continue to grow and support its users. In addition to the structural review, we have been working with CollecTF to utilize ECO for automated text mining. To generate a curated corpus for this effort, we have been annotating ECO terms to sentences which contain evidence-based assertions about gene products, taxonomic entities, and sequence features. From this effort we have developed clearly-defined annotation guidelines that have been passed on to a team of undergraduates who are continuing the curation effort.
Annotations are limited to single sentences, or to two consecutive sentences, containing the evidence instance and assertion clause. The quality of the mapping to ECO
and the strength of the author’s assertion are also captured. ECO is freely available at http://evidenceontology.org/ and https://github.com/evidenceontology.
ICBO 2018 Poster - Current Development in the Evidence and Conclusion Ontolog...dolleyj
The Evidence & Conclusion Ontology (ECO) has been developed to provide standardized descriptions for types of evidence within the biological domain. Best
practices in biocuration require that when a biological assertion is made (e.g. linking a Gene Ontology (GO) term for a molecular function to a protein), the type of evidence
supporting it is captured. In recent development efforts, we have been working with other ontology groups to ensure that ECO classes exist for the types of curation they
support. These include the Ontology for Microbial Phenotypes and GO. In addition, we continue to support user-level class requests through our GitHub issue tracker. To
facilitate the addition and maintenance of new classes, we utilize ROBOT (a command line tool for working with Open Biomedical Ontologies) as part of our standard workflow.
ROBOT templates allow us to define classes in a spreadsheet and convert them to Web Ontology Language (OWL) axioms, which can then be merged into ECO. ROBOT is
also part of our automated release process. Additionally, we are engaged in ongoing work to map ECO classes to Ontology for Biomedical Investigation classes using logical
definitions. ECO is currently in use by dozens of groups engaged in biological curation and the number of ECO users continues to grow. The ontology, in OWL and Open
Biomedical Ontology (OBO) formats, and associated resources can be accessed through our GitHub site (https://github.com/evidenceontology/evidenceontology) as well as
the ECO web page (http://evidenceontology.org/).
FAIR Data and Model Management for Systems Biology(and SOPs too!)Carole Goble
MultiScale Biology Network Springboard meeting, Nottingham, UK, 1 June 2015
FAIR Data and model management for Systems Biology
Over the past 5 years we have seen a change in expectations for the management of all the outcomes of research – that is the “assets” of data, models, codes, SOPs and so forth. Don’t stop reading. Yes, data management isn’t likely to win anyone a Nobel prize. But publications should be supported and accompanied by data, methods, procedures, etc. to assure reproducibility of results. Funding agencies expect data (and increasingly software) management retention and access plans as part of the proposal process for projects to be funded. Journals are raising their expectations of the availability of data and codes for pre- and post- publication. And the multi-component, multi-disciplinary nature of Systems Biology demands the interlinking and exchange of assets and the systematic recording of metadata for their interpretation.
Data and model management for the Systems Biology community is a multi-faceted one including: the development and adoption appropriate community standards (and the navigation of the standards maze); the sustaining of international public archives capable of servicing quantitative biology; and the development of the necessary tools and know-how for researchers within their own institutes so that they can steward their assets in a sustainable, coherent and credited manner while minimizing burden and maximising personal benefit.
The FAIRDOM (Findable, Accessible, Interoperable, Reusable Data, Operations and Models) Initiative has grown out of several efforts in European programmes (SysMO and EraSysAPP ERANets and the ISBE ESRFI) and national initiatives (de.NBI, German Virtual Liver Network, SystemsX, UK SynBio centres). It aims to support Systems Biology researchers with data and model management, with an emphasis on standards smuggled in by stealth.
This talk will use the FAIRDOM Initiative to discuss the FAIR management of data, SOPs, and models for Sys Bio, highlighting the challenges multi-scale biology presents.
http://www.fair-dom.org
http://www.fairdomhub.org
http://www.seek4science.org
Being Reproducible: SSBSS Summer School 2017Carole Goble
Lecture 2:
Being Reproducible: Models, Research Objects and R* Brouhaha
Reproducibility is a R* minefield, depending on whether you are testing for robustness (rerun), defence (repeat), certification (replicate), comparison (reproduce) or transferring between researchers (reuse). Different forms of "R" make different demands on the completeness, depth and portability of research. Sharing is another minefield raising concerns of credit and protection from sharp practices.
In practice the exchange, reuse and reproduction of scientific experiments is dependent on bundling and exchanging the experimental methods, computational codes, data, algorithms, workflows and so on along with the narrative. These "Research Objects" are not fixed, just as research is not “finished”: the codes fork, data is updated, algorithms are revised, workflows break, service updates are released. ResearchObject.org is an effort to systematically support more portable and reproducible research exchange.
In this talk I will explore these issues in more depth using the FAIRDOM Platform and its support for reproducible modelling. The talk will cover initiatives and technical issues, and raise social and cultural challenges.
Annotation of SBML Models Through Rule-Based Semantic IntegrationAllyson Lister
This talk was given on June 28, 2009 at the Bio-Ontologies SIG as part of ISMB/ECCB 2009. You can download the paper this presentation is about from http://hdl.handle.net/10101/npre.2009.3286.1. More information on the ISMB conference is available at http://www.iscb.org/ismbeccb2009/ and http://friendfeed.com/ismbeccb2009
The Seven Deadly Sins of BioinformaticsDuncan Hull
Keynote talk at Bioinformatics Open Source Conference (BOSC) Special Interest Group at the 15th Annual International Conference on Intelligent Systems for Molecular Biology (ISMB 2007) in Vienna, July 2007 by Carole Goble, University of Manchester.
Presentation given at the NBT / ECCB 2020, presenting COMBINE standards. Also providing links to related projects, introducing open model repositories and giving some hints for creating reusable models.
Reproducibility, Research Objects and Reality, Leiden 2016Carole Goble
Presented at the Leiden Bioscience Lecture, 24 November 2016, Reproducibility, Research Objects and Reality
Over the past 5 years we have seen a change in expectations for the management of all the outcomes of research – that is the “assets” of data, models, codes, SOPs, workflows. The “FAIR” (Findable, Accessible, Interoperable, Reusable) Guiding Principles for scientific data management and stewardship have proved to be an effective rallying-cry. Funding agencies expect data (and increasingly software) management retention and access plans. Journals are raising their expectations of the availability of data and codes for pre- and post- publication. It all sounds very laudable and straightforward. BUT…..
Reproducibility is a R* minefield, depending on whether you are testing for robustness (rerun), defence (repeat), certification (replicate), comparison (reproduce) or transferring between researchers (reuse). Different forms of "R" make different demands on the completeness, depth and portability of research. Sharing is another minefield raising concerns of credit and protection from sharp practices.
In practice the exchange, reuse and reproduction of scientific experiments is dependent on bundling and exchanging the experimental methods, computational codes, data, algorithms, workflows and so on along with the narrative. These "Research Objects" are not fixed, just as research is not “finished”: the codes fork, data is updated, algorithms are revised, workflows break, service updates are released. ResearchObject.org is an effort to systematically support more portable and reproducible research exchange
In this talk I will explore these issues in data-driven computational life sciences through the examples and stories from initiatives I am involved, and Leiden is involved in too including:
· FAIRDOM which has built a Commons for Systems and Synthetic Biology projects, with an emphasis on standards smuggled in by stealth and efforts to affecting sharing practices using behavioural interventions
· ELIXIR, the EU Research Data Infrastructure, and its efforts to exchange workflows
· Bioschemas.org, an ELIXIR-NIH-Google effort to support the finding of assets.
The repository ecology: an approach to understanding repository and service i...R. John Robertson
An increasing number of university institutions and other organisations are deciding to deploy repositories and a growing number of formal and informal distributed services are supporting or capitalising on the information these repositories provide. Despite reasonably well understood technical architectures, early majority adopters may struggle to articulate their place within the actualities of a wider information environment. The idea of a repository ecology provides developers and administrators with a useful way of articulating and analysing their place in the information environment, and the technical and organisational interactions they have, or are developing, with other parts of such an environment. This presentation will provide an overview of the concept of a repository ecology and examine some examples from the domains of scholarly communications and elearning.
FAIR Data, Operations and Model management for Systems Biology and Systems Me...Carole Goble
FAIR Data, Operations and Model management for Systems Biology and Systems Medicine Projects given at 1st Conference of the European Association of Systems Medicine, 26-28 October 2016, Berlin. the FAIRDOM project is described.
Presented by Richard Kidd at "The Future Information Needs of Pharmaceutical & Medicinal Chemistry", Monday 28 November 2011 at The Linnean Society, Burlington Square, London run by the RSC CICAG group.
Standards and software: practical aids for reproducibility of computational r...Mike Hucka
My presentation during the session titled "Reproducibility of computational research: methods to avoid madness" on Wednesday, 17 September 2014, during ICSB 2014, held in Melbourne, Australia.
COMBINE 2019, EU-STANDS4PM, Heidelberg, Germany 18 July 2019
FAIR: Findable Accessable Interoperable Reusable. The “FAIR Principles” for research data, software, computational workflows, scripts, or any other kind of Research Object one can think of, is now a mantra; a method; a meme; a myth; a mystery. FAIR is about supporting and tracking the flow and availability of data across research organisations and the portability and sustainability of processing methods to enable transparent and reproducible results. All this is within the context of a bottom up society of collaborating (or burdened?) scientists, a top down collective of compliance-focused funders and policy makers and an in-the-middle posse of e-infrastructure providers.
Making the FAIR principles a reality is tricky. They are aspirations not standards. They are multi-dimensional and dependent on context such as the sensitivity and availability of the data and methods. We already see a jungle of projects, initiatives and programmes wrestling with the challenges. FAIR efforts have particularly focused on the “last mile” – “FAIRifying” destination community archive repositories and measuring their “compliance” to FAIR metrics (or less controversially “indicators”). But what about FAIR at the first mile, at source and how do we help Alice and Bob with their (secure) data management? If we tackle the FAIR first and last mile, what about the FAIR middle? What about FAIR beyond just data – like exchanging and reusing pipelines for precision medicine?
Since 2008 the FAIRDOM collaboration [1] has worked on FAIR asset management and the development of a FAIR asset Commons for multi-partner researcher projects [2], initially in the Systems Biology field. Since 2016 we have been working with the BioCompute Object Partnership [3] on standardising computational records of HTS precision medicine pipelines.
So, using our FAIRDOM and BioCompute Object binoculars let’s go on a FAIR safari! Let’s peruse the ecosystem, observe the different herds and reflect what where we are for FAIR personalised medicine.
References
[1] http://www.fair-dom.org
[2] http://www.fairdomhub.org
[3] http://www.biocomputeobject.org
CDAO presentation.
The idea of the comparative analysis ontoloty has been presented worldwide, including: NESCent (USA), IGBMC (France), UFRJ (Brazil). Providing a semantic framework for evolutionary analysis in a high-throughtput way after the next and third generation sequencing is the way to approach evolutionary-based studies into genome-wide analysis. The darwinian core of reasoning also allows CDAO to be used with other entities.
Presentation to ImmPort Science Meeting, February 27, 2014 on the proper treatment of value sets in the Immport Immunology Database and Analysis Portal
Being FAIR: FAIR data and model management SSBSS 2017 Summer SchoolCarole Goble
Lecture 1:
Being FAIR: FAIR data and model management
In recent years we have seen a change in expectations for the management of all the outcomes of research – that is the “assets” of data, models, codes, SOPs, workflows. The “FAIR” (Findable, Accessible, Interoperable, Reusable) Guiding Principles for scientific data management and stewardship [1] have proved to be an effective rallying-cry. Funding agencies expect data (and increasingly software) management retention and access plans. Journals are raising their expectations of the availability of data and codes for pre- and post- publication. The multi-component, multi-disciplinary nature of Systems and Synthetic Biology demands the interlinking and exchange of assets and the systematic recording of metadata for their interpretation.
Our FAIRDOM project (http://www.fair-dom.org) supports Systems Biology research projects with their research data, methods and model management, with an emphasis on standards smuggled in by stealth and sensitivity to asset sharing and credit anxiety. The FAIRDOM Platform has been installed by over 30 labs or projects. Our public, centrally hosted Asset Commons, the FAIRDOMHub.org, supports the outcomes of 50+ projects.
Now established as a grassroots association, FAIRDOM has over 8 years of experience of practical asset sharing and data infrastructure at the researcher coal-face ranging across European programmes (SysMO and ERASysAPP ERANets), national initiatives (Germany's de.NBI and Systems Medicine of the Liver; Norway's Digital Life) and European Research Infrastructures (ISBE) as well as in PI's labs and Centres such as the SynBioChem Centre at Manchester.
In this talk I will show explore how FAIRDOM has been designed to support Systems Biology projects and show examples of its configuration and use. I will also explore the technical and social challenges we face.
I will also refer to European efforts to support public archives for the life sciences. ELIXIR (http:// http://www.elixir-europe.org/) the European Research Infrastructure of 21 national nodes and a hub funded by national agreements to coordinate and sustain key data repositories and archives for the Life Science community, improve access to them and related tools, support training and create a platform for dataset interoperability. As the Head of the ELIXIR-UK Node and co-lead of the ELIXIR Interoperability Platform I will show how this work relates to your projects.
[1] Wilkinson et al, The FAIR Guiding Principles for scientific data management and stewardship Scientific Data 3, doi:10.1038/sdata.2016.18
Results may vary: Collaborations Workshop, Oxford 2014Carole Goble
Thoughts on computational science reproducibility with a focus on software. Given at the Software Sustainability Institute's 2014 Collaborations Workshop
Annotation of SBML Models Through Rule-Based Semantic IntegrationAllyson Lister
This talk was given on June 28, 2009 at the Bio-Ontologies SIG as part of ISMB/ECCB 2009. You can download the paper this presentation is about from http://hdl.handle.net/10101/npre.2009.3286.1. More information on the ISMB conference is available at http://www.iscb.org/ismbeccb2009/ and http://friendfeed.com/ismbeccb2009
The Seven Deadly Sins of BioinformaticsDuncan Hull
Keynote talk at Bioinformatics Open Source Conference (BOSC) Special Interest Group at the 15th Annual International Conference on Intelligent Systems for Molecular Biology (ISMB 2007) in Vienna, July 2007 by Carole Goble, University of Manchester.
Presentation given at the NBT / ECCB 2020, presenting COMBINE standards. Also providing links to related projects, introducing open model repositories and giving some hints for creating reusable models.
Reproducibility, Research Objects and Reality, Leiden 2016Carole Goble
Presented at the Leiden Bioscience Lecture, 24 November 2016, Reproducibility, Research Objects and Reality
Over the past 5 years we have seen a change in expectations for the management of all the outcomes of research – that is the “assets” of data, models, codes, SOPs, workflows. The “FAIR” (Findable, Accessible, Interoperable, Reusable) Guiding Principles for scientific data management and stewardship have proved to be an effective rallying-cry. Funding agencies expect data (and increasingly software) management retention and access plans. Journals are raising their expectations of the availability of data and codes for pre- and post- publication. It all sounds very laudable and straightforward. BUT…..
Reproducibility is a R* minefield, depending on whether you are testing for robustness (rerun), defence (repeat), certification (replicate), comparison (reproduce) or transferring between researchers (reuse). Different forms of "R" make different demands on the completeness, depth and portability of research. Sharing is another minefield raising concerns of credit and protection from sharp practices.
In practice the exchange, reuse and reproduction of scientific experiments is dependent on bundling and exchanging the experimental methods, computational codes, data, algorithms, workflows and so on along with the narrative. These "Research Objects" are not fixed, just as research is not “finished”: the codes fork, data is updated, algorithms are revised, workflows break, service updates are released. ResearchObject.org is an effort to systematically support more portable and reproducible research exchange
In this talk I will explore these issues in data-driven computational life sciences through the examples and stories from initiatives I am involved, and Leiden is involved in too including:
· FAIRDOM which has built a Commons for Systems and Synthetic Biology projects, with an emphasis on standards smuggled in by stealth and efforts to affecting sharing practices using behavioural interventions
· ELIXIR, the EU Research Data Infrastructure, and its efforts to exchange workflows
· Bioschemas.org, an ELIXIR-NIH-Google effort to support the finding of assets.
The repository ecology: an approach to understanding repository and service i...R. John Robertson
An increasing number of university institutions and other organisations are deciding to deploy repositories and a growing number of formal and informal distributed services are supporting or capitalising on the information these repositories provide. Despite reasonably well understood technical architectures, early majority adopters may struggle to articulate their place within the actualities of a wider information environment. The idea of a repository ecology provides developers and administrators with a useful way of articulating and analysing their place in the information environment, and the technical and organisational interactions they have, or are developing, with other parts of such an environment. This presentation will provide an overview of the concept of a repository ecology and examine some examples from the domains of scholarly communications and elearning.
FAIR Data, Operations and Model management for Systems Biology and Systems Me...Carole Goble
FAIR Data, Operations and Model management for Systems Biology and Systems Medicine Projects given at 1st Conference of the European Association of Systems Medicine, 26-28 October 2016, Berlin. the FAIRDOM project is described.
Presented by Richard Kidd at "The Future Information Needs of Pharmaceutical & Medicinal Chemistry", Monday 28 November 2011 at The Linnean Society, Burlington Square, London run by the RSC CICAG group.
Standards and software: practical aids for reproducibility of computational r...Mike Hucka
My presentation during the session titled "Reproducibility of computational research: methods to avoid madness" on Wednesday, 17 September 2014, during ICSB 2014, held in Melbourne, Australia.
COMBINE 2019, EU-STANDS4PM, Heidelberg, Germany 18 July 2019
FAIR: Findable Accessable Interoperable Reusable. The “FAIR Principles” for research data, software, computational workflows, scripts, or any other kind of Research Object one can think of, is now a mantra; a method; a meme; a myth; a mystery. FAIR is about supporting and tracking the flow and availability of data across research organisations and the portability and sustainability of processing methods to enable transparent and reproducible results. All this is within the context of a bottom up society of collaborating (or burdened?) scientists, a top down collective of compliance-focused funders and policy makers and an in-the-middle posse of e-infrastructure providers.
Making the FAIR principles a reality is tricky. They are aspirations not standards. They are multi-dimensional and dependent on context such as the sensitivity and availability of the data and methods. We already see a jungle of projects, initiatives and programmes wrestling with the challenges. FAIR efforts have particularly focused on the “last mile” – “FAIRifying” destination community archive repositories and measuring their “compliance” to FAIR metrics (or less controversially “indicators”). But what about FAIR at the first mile, at source and how do we help Alice and Bob with their (secure) data management? If we tackle the FAIR first and last mile, what about the FAIR middle? What about FAIR beyond just data – like exchanging and reusing pipelines for precision medicine?
Since 2008 the FAIRDOM collaboration [1] has worked on FAIR asset management and the development of a FAIR asset Commons for multi-partner researcher projects [2], initially in the Systems Biology field. Since 2016 we have been working with the BioCompute Object Partnership [3] on standardising computational records of HTS precision medicine pipelines.
So, using our FAIRDOM and BioCompute Object binoculars let’s go on a FAIR safari! Let’s peruse the ecosystem, observe the different herds and reflect what where we are for FAIR personalised medicine.
References
[1] http://www.fair-dom.org
[2] http://www.fairdomhub.org
[3] http://www.biocomputeobject.org
CDAO presentation.
The idea of the comparative analysis ontoloty has been presented worldwide, including: NESCent (USA), IGBMC (France), UFRJ (Brazil). Providing a semantic framework for evolutionary analysis in a high-throughtput way after the next and third generation sequencing is the way to approach evolutionary-based studies into genome-wide analysis. The darwinian core of reasoning also allows CDAO to be used with other entities.
Presentation to ImmPort Science Meeting, February 27, 2014 on the proper treatment of value sets in the Immport Immunology Database and Analysis Portal
Being FAIR: FAIR data and model management SSBSS 2017 Summer SchoolCarole Goble
Lecture 1:
Being FAIR: FAIR data and model management
In recent years we have seen a change in expectations for the management of all the outcomes of research – that is the “assets” of data, models, codes, SOPs, workflows. The “FAIR” (Findable, Accessible, Interoperable, Reusable) Guiding Principles for scientific data management and stewardship [1] have proved to be an effective rallying-cry. Funding agencies expect data (and increasingly software) management retention and access plans. Journals are raising their expectations of the availability of data and codes for pre- and post- publication. The multi-component, multi-disciplinary nature of Systems and Synthetic Biology demands the interlinking and exchange of assets and the systematic recording of metadata for their interpretation.
Our FAIRDOM project (http://www.fair-dom.org) supports Systems Biology research projects with their research data, methods and model management, with an emphasis on standards smuggled in by stealth and sensitivity to asset sharing and credit anxiety. The FAIRDOM Platform has been installed by over 30 labs or projects. Our public, centrally hosted Asset Commons, the FAIRDOMHub.org, supports the outcomes of 50+ projects.
Now established as a grassroots association, FAIRDOM has over 8 years of experience of practical asset sharing and data infrastructure at the researcher coal-face ranging across European programmes (SysMO and ERASysAPP ERANets), national initiatives (Germany's de.NBI and Systems Medicine of the Liver; Norway's Digital Life) and European Research Infrastructures (ISBE) as well as in PI's labs and Centres such as the SynBioChem Centre at Manchester.
In this talk I will show explore how FAIRDOM has been designed to support Systems Biology projects and show examples of its configuration and use. I will also explore the technical and social challenges we face.
I will also refer to European efforts to support public archives for the life sciences. ELIXIR (http:// http://www.elixir-europe.org/) the European Research Infrastructure of 21 national nodes and a hub funded by national agreements to coordinate and sustain key data repositories and archives for the Life Science community, improve access to them and related tools, support training and create a platform for dataset interoperability. As the Head of the ELIXIR-UK Node and co-lead of the ELIXIR Interoperability Platform I will show how this work relates to your projects.
[1] Wilkinson et al, The FAIR Guiding Principles for scientific data management and stewardship Scientific Data 3, doi:10.1038/sdata.2016.18
Results may vary: Collaborations Workshop, Oxford 2014Carole Goble
Thoughts on computational science reproducibility with a focus on software. Given at the Software Sustainability Institute's 2014 Collaborations Workshop
DERIVATION OF MODIFIED BERNOULLI EQUATION WITH VISCOUS EFFECTS AND TERMINAL V...Wasswaderrick3
In this book, we use conservation of energy techniques on a fluid element to derive the Modified Bernoulli equation of flow with viscous or friction effects. We derive the general equation of flow/ velocity and then from this we derive the Pouiselle flow equation, the transition flow equation and the turbulent flow equation. In the situations where there are no viscous effects , the equation reduces to the Bernoulli equation. From experimental results, we are able to include other terms in the Bernoulli equation. We also look at cases where pressure gradients exist. We use the Modified Bernoulli equation to derive equations of flow rate for pipes of different cross sectional areas connected together. We also extend our techniques of energy conservation to a sphere falling in a viscous medium under the effect of gravity. We demonstrate Stokes equation of terminal velocity and turbulent flow equation. We look at a way of calculating the time taken for a body to fall in a viscous medium. We also look at the general equation of terminal velocity.
Richard's aventures in two entangled wonderlandsRichard Gill
Since the loophole-free Bell experiments of 2020 and the Nobel prizes in physics of 2022, critics of Bell's work have retreated to the fortress of super-determinism. Now, super-determinism is a derogatory word - it just means "determinism". Palmer, Hance and Hossenfelder argue that quantum mechanics and determinism are not incompatible, using a sophisticated mathematical construction based on a subtle thinning of allowed states and measurements in quantum mechanics, such that what is left appears to make Bell's argument fail, without altering the empirical predictions of quantum mechanics. I think however that it is a smoke screen, and the slogan "lost in math" comes to my mind. I will discuss some other recent disproofs of Bell's theorem using the language of causality based on causal graphs. Causal thinking is also central to law and justice. I will mention surprising connections to my work on serial killer nurse cases, in particular the Dutch case of Lucia de Berk and the current UK case of Lucy Letby.
Phenomics assisted breeding in crop improvementIshaGoswami9
As the population is increasing and will reach about 9 billion upto 2050. Also due to climate change, it is difficult to meet the food requirement of such a large population. Facing the challenges presented by resource shortages, climate
change, and increasing global population, crop yield and quality need to be improved in a sustainable way over the coming decades. Genetic improvement by breeding is the best way to increase crop productivity. With the rapid progression of functional
genomics, an increasing number of crop genomes have been sequenced and dozens of genes influencing key agronomic traits have been identified. However, current genome sequence information has not been adequately exploited for understanding
the complex characteristics of multiple gene, owing to a lack of crop phenotypic data. Efficient, automatic, and accurate technologies and platforms that can capture phenotypic data that can
be linked to genomics information for crop improvement at all growth stages have become as important as genotyping. Thus,
high-throughput phenotyping has become the major bottleneck restricting crop breeding. Plant phenomics has been defined as the high-throughput, accurate acquisition and analysis of multi-dimensional phenotypes
during crop growing stages at the organism level, including the cell, tissue, organ, individual plant, plot, and field levels. With the rapid development of novel sensors, imaging technology,
and analysis methods, numerous infrastructure platforms have been developed for phenotyping.
Nutraceutical market, scope and growth: Herbal drug technologyLokesh Patil
As consumer awareness of health and wellness rises, the nutraceutical market—which includes goods like functional meals, drinks, and dietary supplements that provide health advantages beyond basic nutrition—is growing significantly. As healthcare expenses rise, the population ages, and people want natural and preventative health solutions more and more, this industry is increasing quickly. Further driving market expansion are product formulation innovations and the use of cutting-edge technology for customized nutrition. With its worldwide reach, the nutraceutical industry is expected to keep growing and provide significant chances for research and investment in a number of categories, including vitamins, minerals, probiotics, and herbal supplements.
Salas, V. (2024) "John of St. Thomas (Poinsot) on the Science of Sacred Theol...Studia Poinsotiana
I Introduction
II Subalternation and Theology
III Theology and Dogmatic Declarations
IV The Mixed Principles of Theology
V Virtual Revelation: The Unity of Theology
VI Theology as a Natural Science
VII Theology’s Certitude
VIII Conclusion
Notes
Bibliography
All the contents are fully attributable to the author, Doctor Victor Salas. Should you wish to get this text republished, get in touch with the author or the editorial committee of the Studia Poinsotiana. Insofar as possible, we will be happy to broker your contact.
Toxic effects of heavy metals : Lead and Arsenicsanjana502982
Heavy metals are naturally occuring metallic chemical elements that have relatively high density, and are toxic at even low concentrations. All toxic metals are termed as heavy metals irrespective of their atomic mass and density, eg. arsenic, lead, mercury, cadmium, thallium, chromium, etc.
The ability to recreate computational results with minimal effort and actionable metrics provides a solid foundation for scientific research and software development. When people can replicate an analysis at the touch of a button using open-source software, open data, and methods to assess and compare proposals, it significantly eases verification of results, engagement with a diverse range of contributors, and progress. However, we have yet to fully achieve this; there are still many sociotechnical frictions.
Inspired by David Donoho's vision, this talk aims to revisit the three crucial pillars of frictionless reproducibility (data sharing, code sharing, and competitive challenges) with the perspective of deep software variability.
Our observation is that multiple layers — hardware, operating systems, third-party libraries, software versions, input data, compile-time options, and parameters — are subject to variability that exacerbates frictions but is also essential for achieving robust, generalizable results and fostering innovation. I will first review the literature, providing evidence of how the complex variability interactions across these layers affect qualitative and quantitative software properties, thereby complicating the reproduction and replication of scientific studies in various fields.
I will then present some software engineering and AI techniques that can support the strategic exploration of variability spaces. These include the use of abstractions and models (e.g., feature models), sampling strategies (e.g., uniform, random), cost-effective measurements (e.g., incremental build of software configurations), and dimensionality reduction methods (e.g., transfer learning, feature selection, software debloating).
I will finally argue that deep variability is both the problem and solution of frictionless reproducibility, calling the software science community to develop new methods and tools to manage variability and foster reproducibility in software systems.
Exposé invité Journées Nationales du GDR GPL 2024
This presentation explores a brief idea about the structural and functional attributes of nucleotides, the structure and function of genetic materials along with the impact of UV rays and pH upon them.
In silico drugs analogue design: novobiocin analogues.pptx
Keynote ICSB 2014
1. International Conference on Systems Biology 2014, Melbourne
From art to engineering:
From art to engineering:
15 years of standards and tools
15 years of standards and tools
towards digital organisms
towards digital organisms
Nicolas Le Novère
Babraham Institute,
Cambridge, UK
n.lenovere@gmail.com
2. International Conference on Systems Biology 2014, Melbourne
What is the goal of using mathematical models?
Describe
1917
3. International Conference on Systems Biology 2014, Melbourne
What is the goal of using mathematical models?
Describe Explain
1917 1952
4. International Conference on Systems Biology 2014, Melbourne
What is the goal of using mathematical models?
Describe Explain Predict
1917 1952 2000
5. International Conference on Systems Biology 2014, Melbourne
1952 1960 1969 1981
1973
1991
1993
1998
2000
2005 2012
Turing
morphogen
Hodgkin-Huxley
action potential
Chance
catalase
Chance
glycolysis
Noble
pacemaker
Kauffman
Boolean networks
Savageau
BST
Kacser
MCA
Thomas
logic models
Goldbeter
Koshland
ultrasensitive
covalent
cascades
Tyson
cell cycle
Bray
bacterial
chemotaxis
Karr et al
Whole
mycoplasma
McAdams, Arkin
Stochastic simulations
Kitano, Hood
Systems biology revival
Repressilator,
toggle switch
Synthetic biology
2013
Thiele et al
Recon 2
BioModels
1989
Palsson
Erythrocyte
metabolism
6. International Conference on Systems Biology 2014, Melbourne
1952 1960 1969 1981
1973
1991
1993
1998
2000
2005 2012
Turing
morphogen
Hodgkin-Huxley
action potential
Chance
catalase
Chance
glycolysis
Noble
pacemaker
Kauffman
Boolean networks
Savageau
BST
Kacser
MCA
Thomas
logic models
Goldbeter
Koshland
ultrasensitive
covalent
cascades
Tyson
cell cycle
Bray
bacterial
chemotaxis
Karr et al
Whole
mycoplasma
McAdams, Arkin
Stochastic simulations
Kitano, Hood
Systems biology revival
Repressilator,
toggle switch
Synthetic biology
2013
Thiele et al
Recon 2
BioModels
1989
Palsson
Erythrocyte
metabolism
7. International Conference on Systems Biology 2014, Melbourne
1952 1960 1969 1981
1973
1991
1993
1998
2000
2005 2012
Turing
morphogen
Hodgkin-Huxley
action potential
Chance
catalase
Chance
glycolysis
Noble
pacemaker
Kauffman
Boolean networks
Savageau
BST
Kacser
MCA
Thomas
logic models
Goldbeter
Koshland
ultrasensitive
covalent
cascades
Tyson
cell cycle
Bray
bacterial
chemotaxis
Karr et al
Whole
mycoplasma
McAdams, Arkin
Stochastic simulations
Kitano, Hood
Systems biology revival
Repressilator,
toggle switch
Synthetic biology
2013
Thiele et al
Recon 2
BioModels
1989
Palsson
Erythrocyte
metabolism
8. International Conference on Systems Biology 2014, Melbourne
1952 1960 1969 1981
1973
1991
1993
1998
2000
2005 2012
Turing
morphogen
Hodgkin-Huxley
action potential
Chance
catalase
Chance
glycolysis
Noble
pacemaker
Kauffman
Boolean networks
Savageau
BST
Kacser
MCA
Thomas
logic models
Goldbeter
Koshland
ultrasensitive
covalent
cascades
Tyson
cell cycle
Bray
bacterial
chemotaxis
Karr et al
Whole
mycoplasma
McAdams, Arkin
Stochastic simulations
Kitano, Hood
Systems biology revival
Repressilator,
toggle switch
Synthetic biology
2013
Thiele et al
Recon 2
BioModels
1989
Palsson
Erythrocyte
metabolism
9. BioModels Database growth (published models branch) since its creation
Computational models on the rise
http://www.ebi.ac.uk/biomodels/
20 relationships
per model
500 relationships
per model
10. International Conference on Systems Biology 2014, Melbourne
Designed
Many
Standard
Automated production
Collaboration
Workflow
Robust
Improvised
One off
Unique
Manually produced
One or few artists
Produced in one go
Fragile
11. Parametrisations
Parametrisations
Simulation descriptions
Simulation descriptions
Model descriptions
Model descriptions
We need to
Therefore we need to share
Re-use
Re-use
Build upon
Build upon
Verify
Verify Modify
Modify
Integrate with
Integrate with
14th
Intl Workshop on Bioinformatics and Systems Biology, 20 July 2014, Berlin
Biological meaning
Biological meaning
12. Three types of standards
Minimal
requirements
Data-models
Terminologies
What to encode in order to
share experiments and
understand results
How to encode the information defined
above in a computer-readable manner
Structured representations
of knowledge, with
concept definitions
and their relationships
OBO
foundry
WHAT
HOW
14th
Intl Workshop on Bioinformatics and Systems Biology, 20 July 2014, Berlin
13. International Conference on Systems Biology 2014, Melbourne
A language to describe computational models in biology
Data-models
SBRML
Born in Caltech 2000
Model
descriptions
Mike
Hucka
Hamid
Bolouri
Andrew
Finney
Herbert
Sauro
Hiroaki
Kitano
John
Doyle
Hucka et al. (2003)
18. International Conference on Systems Biology 2014, Melbourne
Adding the semantics to the syntax
Minimal
requirements
Data-models
SBRML
Terminologies
Model
descriptions Born in Heidelberg 2004
Le Novère et al. (2005), Courtot et al. (2011)
19. International Conference on Systems Biology 2014, Melbourne
Encoded in public standard format
Single reference description
Reflect biological processes
Simulatable and reproduce results
Model creators details
Time of creation and last modification
Precise terms of distribution
Model components identified
Cross reference as triplets {collection, record, qualifier}
Collection and record in one agreed-upon URI
http://co.mbine.org/standards/miriam
20. International Conference on Systems Biology 2014, Melbourne
Encoded in public standard format
Single reference description
Reflect biological processes
Simulatable and reproduce results
Model creators details
Time of creation and last modification
Precise terms of distribution
Model components identified
Cross reference as triplets {collection, record, qualifier}
Collection and record in one agreed-upon URI
21. International Conference on Systems Biology 2014, Melbourne
(aka new MIRIAM URIs)
http://identifiers.org/namespace/identifier
protocol type of URI collection record
Camille Laibe Nick Juty Sarala Wimalaratne
Juty et al. (2012)
22. International Conference on Systems Biology 2014, Melbourne
http://identifiers.org/namespace/identifier
http://identifiers.org/pubmed/22140103
http://identifiers.org/eccode/1.1.1.1
http://identifiers.org/go/GO:0000186
(aka new MIRIAM URIs)
protocol type of URI collection record
23. International Conference on Systems Biology 2014, Melbourne
http://www.ebi.ac.uk/miriam/
http://identifiers.org/registry
Laibe et al. (2007)
24. International Conference on Systems Biology 2014, Melbourne
Unique Persistent
Resolvable
Homogeneous
Human
readable
Maintained
27. International Conference on Systems Biology 2014, Melbourne
Simulation experiment = model + what to do with it
Edelstein et al 1996 (BIOMD0000000002)
Huang & Ferrell (BIOMD0000000009)
Ueda, Hagiwara, Kitano 2001 (BIOMD0000000022)
Bornheimer et al 2004 (BIOMD0000000086)
28. International Conference on Systems Biology 2014, Melbourne
Choice of algorithm affects behaviour
G1 P1
P2 G2
lsode
Gibson-Bruck
29. International Conference on Systems Biology 2014, Melbourne
Minimal
requirements
Data-models SBRML
Description of simulations and analyses
Born in Hinxton 2007
Simulations
and analysis
Terminologies
Model
descriptions
Dagmar Waltemath
Anna Zhukova
Waltemath et al. (2011, 2011), Courtot et al (2011)
30. International Conference on Systems Biology 2014, Melbourne
Provide models or mean of access
Equations, parameter values and necessary conditions
Standard formats, code available or full description
Modifications required before simulation
Simulation steps, algorithms, order, processing
Information for correct implementation of all steps
If not open source, all information to rewrite
If dependent on platform, how to use this platform
Post-processing steps to generate final results
How to compare results to get insights
http://co.mbine.org/standards/miase
31. International Conference on Systems Biology 2014, Melbourne
Simulation Experiment Description Markup Language
http://sed-ml.org
Any XML format
32. International Conference on Systems Biology 2014, Melbourne
Minimal
requirements
Data-models NuML
Etc.
Simulations
and analysis
Terminologies
Model
descriptions
Numerical
results
34. International Conference on Systems Biology 2014, Melbourne
Camille
Laibe
Marco
Donizelli
Viji Chelliah
Le Novère et al. (2006), Li et al (2010)
http://www.ebi.ac.uk/biomodels
35. International Conference on Systems Biology 2014, Melbourne
>140 000 models
>10 millions math relations
~ 200 millions cross-refs
> 300 journals advise deposition
~ 1000 citations
1 million page requests per year
37. Biochemical models
Fernandez et al. DARPP-32 is a
robust integrator of dopamine
and glutamate signals
PLoS Comput Biol (2006) 2: e176.
BIOMD0000000153
BIOMD0000000153
reaction:
38. International Conference on Systems Biology 2014, Melbourne
Pharmacometrics models
Tham et al (2008) A pharmacodynamic
model for the time course of tumor
shrinkage by gemcitabine + carboplatin in
non-small cell lung cancer patients.
Clin Cancer Res. 2008 14(13): 4213-8.
assignment rule:
rate rule:
BIOMD0000000234
BIOMD0000000234
39. Neuroscience models
Izhikevich EM. Simple
model of spiking neurons.
IEEE Trans Neural Netw
(2003) 14(6):1569-1572.
rate rule:
dv
dt
= 0:042
+ 5 £ V + 140 ¡ U + i
event:
v = c
U = U + d
BIOMD0000000127
BIOMD0000000127
41. International Conference on Systems Biology 2014, Melbourne
“You should not develop standards and easy to use
modelling software. This allows biologists to write models,
and they don't know how to do it properly.”
Biomathematician, 2007
“By developing BioModels you harmed the cause of
modelling in biology. My students do not learn how to make
a model, they download it ready to use instead”
Theoretical biologist, 2006
50. International Conference on Systems Biology 2014, Melbourne
Chemical kinetics models
metabolic pathways
Flux Balance Analysis
whole genome reconstructions
Logical models
signalling pathways
Büchel et al. (2013)
Path2Models
51. International Conference on Systems Biology 2014, Melbourne
Are-we ready to develop comprehensive models of
whole cells, organs and organisms?
52. International Conference on Systems Biology 2014, Melbourne
9 orders of
magnitude
24 orders of
magnitude
Theoretical and
Computational
roadblockers
Coupled
multi-scale
models
Challenge 1: scales
Castiglione et al. (2014)
54. International Conference on Systems Biology 2014, Melbourne
Challenge 2: automatic generation
Need to generate model automatically based on omics datasets. Known on
small scale. The larger, the more human intervention needed.
56. International Conference on Systems Biology 2014, Melbourne
Why is modularity important?
Different parts can be modelled with different approaches
Alternative modules representing the same biological process
with different granularities or different modelling approaches
Model families with alternative parts: avoid combinatorial
explosion
Modules can be developed at their own pace; easier versioning
Distributed maintenance. No single individual or group can
master the entirety of the project.
57. International Conference on Systems Biology 2014, Melbourne
Challenge 4: Single software not sufficient
hybrid simulation system
discrete event detection
adaptive synchronization
modular model
58. International Conference on Systems Biology 2014, Melbourne
Whole cell:
Whole cell: e
electro-biochemical models
lectro-biochemical models
Mattioni et al (2012, 2013)
1504 spines
4761 compartments
16362 channels
59. International Conference on Systems Biology 2014, Melbourne
1504 spines
4761 compartments
16362 channels
Mattioni et al (2012, 2013)
60. International Conference on Systems Biology 2014, Melbourne
Challenge 5: reproducibility and virtualisation
Models must be self-contained and portable for everyone to reuse and
customise → Virtual machines with models+tools
62. International Conference on Systems Biology 2014, Melbourne
The numerous
scientists who
participated to
discussions
The community of
Computational
Systems Biology
You!
BioModels team
Ishan Ajmera
Raza Ali
Duncan Berenguier
Alexander Broicher
Vijayalakshmi Chelliah
Melanie Courtot
Marco Donizelli
Marine Dumousseau
Lukas Endler
Yvon Florent
Mihai Glont
Enuo He
Arnaud Henry
Henning Hermjakob
Gael Jalowicki
Nick Juty
Sarah Keating
Christian Knüpfer
Nicolas Le Novère
Chen Li
Camille Laibe
Stuart Moodie
Kedar Nath Natarajan
Jean-Baptiste Pettit
Michael Schubert
Maciej Swat
Karim Tazibt
Dagmar Waltemath
Sarala Wimalaratne
Anna Zhukova
Standards' editors
Richard Adams
Frank Bergman
Hamid Bolouri
Jonathan Cooper
Tobias Czauderna
Emek Demir
Andrew Finney
Stefan Hoops
Mike Hucka
Sarah Keating
Hiroaki Kitano
Nicolas Le Novère
Yukiko Matsuoka
Huaiyu Mi
Andrew Miller
Stuart Moodie
Ion Moraru
Chris Myers
David Nickerson
Brett Olivier
Sven Sahle
Herbert Sauro
Jim Schaff
Falk Schreiber
Lucian Smith
Anatoly Sorokin
Alice Villeger
Katja Wegner
Darren Wilkinson
Dagmar Waltemath
Path2Models
Finja Büchel
Claudine Chaouiya
Tobias Czauderna
Andreas Dräger
Mihai Glont
Martin Golebievski
Henning Hermjakob
Mike Hucka
Douglas Kell
Roland Keller
Camille Laibe
Nicolas Le Novère
Pedro Mendes
Florent Mittag
Wolfang Müller
Matthias Rall
Nicolas Rodriguez
Julio Saez-Rodriguez
Michael Schubert
Falk Schreiber
Neil Swainston
Martijn van Iersel
Clemens Wrzodek
Andreas Zell