Presentation held at the M3 SIG meeting at the ISMB in Stockholm 2009. Purpose to show the audience the software development activities of the Genomics standards Consortium. See also http://gensc.org
The Ocean Sampling Day's Metagenome Analysis: Standards, Pipelines and First ...Renzo Kottmann
Overview on Ocean Sampling Day and status of metagenomic analysis. Presentation was delivered at final BioMedBridges Symposium http://www.biomedbridges.eu/news/symposium-open-bridges-life-science-data
Metagenomics Session: http://www.biomedbridges.eu/news/workshop-metagenomics-bridging-between-environment-and-life
List of Software Development Model and MethodsRiant Soft
RiantSoft a Software Development Company derived the most useful and different types of Software Development Model for the users who want to know the development process. RiantSoft is specialized in custom software development with latest cutting edge technologies.
The Ocean Sampling Day's Metagenome Analysis: Standards, Pipelines and First ...Renzo Kottmann
Overview on Ocean Sampling Day and status of metagenomic analysis. Presentation was delivered at final BioMedBridges Symposium http://www.biomedbridges.eu/news/symposium-open-bridges-life-science-data
Metagenomics Session: http://www.biomedbridges.eu/news/workshop-metagenomics-bridging-between-environment-and-life
List of Software Development Model and MethodsRiant Soft
RiantSoft a Software Development Company derived the most useful and different types of Software Development Model for the users who want to know the development process. RiantSoft is specialized in custom software development with latest cutting edge technologies.
Functional and Architectural Requirements for Metadata: Supporting Discovery...Jian Qin
The tremendous growth in digital data has led to an increase in metadata initiatives for different types of scientific data, as evident in Ball’s survey (2009). Although individual communities have specific needs, there are shared goals that need to be recognized if systems are to effectively support data sharing within and across all domains. This paper considers this need, and explores systems requirements that are essential for metadata supporting the discovery and management of scientific data. The paper begins with an introduction and a review of selected research specific to metadata modeling in the sciences. Next, the paper’s goals are stated, followed by the presentation of valuable systems requirements. The results include a base-model with three chief principles: principle of least effort, infrastructure service, and portability. The principles are intended to support “data user” tasks. Results also include a set of defined user tasks and functions, and applications scenarios.
AI and Machine Learning for Secondary Metabolite PredictionYannick Djoumbou
In silico metabolism prediction tools provide a unique perspective to studying the chemical exposome, and how its changes affect the environment. Classical applications of such tools include, but are not limited to metabolite discovery, environmental fate prediction, ADMET profiling, and molecular design. Several approaches and methods to address the prediction of secondary metabolites have been described, and implemented in a comprehensive list of tools that include expert-, machine learning-, and QM-based systems, or hybrids thereof. In spite of the numerous reported successes, many limitations still hamper the wide adoption of those tools. In this presentation, we will describe the impact of artificial intelligence in the development of secondary metabolite prediction systems, along with the most commonly implemented approaches. Moreover, we will provide examples of the application of in silico metabolism prediction tools, such as BioTransformer, in the identification of secondary metabolites. Furthermore, we will discuss some of the prevalent limitations that hamper the widespread adoption of such tools, and propose solutions.
Presented by Tony Mathys at a Current Issues and Applications of the Geospatial Technologies Lecture, Department of Geography and Environment, Aberdeen University, 24 February 2012
BioDec, based near Bologna, Italy, provides top-notch services, solutions, and consulting in the field of lab data management and in postgenomics "in silico" research. The presentation summarizes our main achievements and describes our commercial offer.
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024Neo4j
Neha Bajwa, Vice President of Product Marketing, Neo4j
Join us as we explore breakthrough innovations enabled by interconnected data and AI. Discover firsthand how organizations use relationships in data to uncover contextual insights and solve our most pressing challenges – from optimizing supply chains, detecting fraud, and improving customer experiences to accelerating drug discoveries.
More Related Content
Similar to Software Development by the Genomics Standards Consortium
Functional and Architectural Requirements for Metadata: Supporting Discovery...Jian Qin
The tremendous growth in digital data has led to an increase in metadata initiatives for different types of scientific data, as evident in Ball’s survey (2009). Although individual communities have specific needs, there are shared goals that need to be recognized if systems are to effectively support data sharing within and across all domains. This paper considers this need, and explores systems requirements that are essential for metadata supporting the discovery and management of scientific data. The paper begins with an introduction and a review of selected research specific to metadata modeling in the sciences. Next, the paper’s goals are stated, followed by the presentation of valuable systems requirements. The results include a base-model with three chief principles: principle of least effort, infrastructure service, and portability. The principles are intended to support “data user” tasks. Results also include a set of defined user tasks and functions, and applications scenarios.
AI and Machine Learning for Secondary Metabolite PredictionYannick Djoumbou
In silico metabolism prediction tools provide a unique perspective to studying the chemical exposome, and how its changes affect the environment. Classical applications of such tools include, but are not limited to metabolite discovery, environmental fate prediction, ADMET profiling, and molecular design. Several approaches and methods to address the prediction of secondary metabolites have been described, and implemented in a comprehensive list of tools that include expert-, machine learning-, and QM-based systems, or hybrids thereof. In spite of the numerous reported successes, many limitations still hamper the wide adoption of those tools. In this presentation, we will describe the impact of artificial intelligence in the development of secondary metabolite prediction systems, along with the most commonly implemented approaches. Moreover, we will provide examples of the application of in silico metabolism prediction tools, such as BioTransformer, in the identification of secondary metabolites. Furthermore, we will discuss some of the prevalent limitations that hamper the widespread adoption of such tools, and propose solutions.
Presented by Tony Mathys at a Current Issues and Applications of the Geospatial Technologies Lecture, Department of Geography and Environment, Aberdeen University, 24 February 2012
BioDec, based near Bologna, Italy, provides top-notch services, solutions, and consulting in the field of lab data management and in postgenomics "in silico" research. The presentation summarizes our main achievements and describes our commercial offer.
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024Neo4j
Neha Bajwa, Vice President of Product Marketing, Neo4j
Join us as we explore breakthrough innovations enabled by interconnected data and AI. Discover firsthand how organizations use relationships in data to uncover contextual insights and solve our most pressing challenges – from optimizing supply chains, detecting fraud, and improving customer experiences to accelerating drug discoveries.
In his public lecture, Christian Timmerer provides insights into the fascinating history of video streaming, starting from its humble beginnings before YouTube to the groundbreaking technologies that now dominate platforms like Netflix and ORF ON. Timmerer also presents provocative contributions of his own that have significantly influenced the industry. He concludes by looking at future challenges and invites the audience to join in a discussion.
Climate Impact of Software Testing at Nordic Testing DaysKari Kakkonen
My slides at Nordic Testing Days 6.6.2024
Climate impact / sustainability of software testing discussed on the talk. ICT and testing must carry their part of global responsibility to help with the climat warming. We can minimize the carbon footprint but we can also have a carbon handprint, a positive impact on the climate. Quality characteristics can be added with sustainability, and then measured continuously. Test environments can be used less, and in smaller scale and on demand. Test techniques can be used in optimizing or minimizing number of tests. Test automation can be used to speed up testing.
In the rapidly evolving landscape of technologies, XML continues to play a vital role in structuring, storing, and transporting data across diverse systems. The recent advancements in artificial intelligence (AI) present new methodologies for enhancing XML development workflows, introducing efficiency, automation, and intelligent capabilities. This presentation will outline the scope and perspective of utilizing AI in XML development. The potential benefits and the possible pitfalls will be highlighted, providing a balanced view of the subject.
We will explore the capabilities of AI in understanding XML markup languages and autonomously creating structured XML content. Additionally, we will examine the capacity of AI to enrich plain text with appropriate XML markup. Practical examples and methodological guidelines will be provided to elucidate how AI can be effectively prompted to interpret and generate accurate XML markup.
Further emphasis will be placed on the role of AI in developing XSLT, or schemas such as XSD and Schematron. We will address the techniques and strategies adopted to create prompts for generating code, explaining code, or refactoring the code, and the results achieved.
The discussion will extend to how AI can be used to transform XML content. In particular, the focus will be on the use of AI XPath extension functions in XSLT, Schematron, Schematron Quick Fixes, or for XML content refactoring.
The presentation aims to deliver a comprehensive overview of AI usage in XML development, providing attendees with the necessary knowledge to make informed decisions. Whether you’re at the early stages of adopting AI or considering integrating it in advanced XML development, this presentation will cover all levels of expertise.
By highlighting the potential advantages and challenges of integrating AI with XML development tools and languages, the presentation seeks to inspire thoughtful conversation around the future of XML development. We’ll not only delve into the technical aspects of AI-powered XML development but also discuss practical implications and possible future directions.
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdfPaige Cruz
Monitoring and observability aren’t traditionally found in software curriculums and many of us cobble this knowledge together from whatever vendor or ecosystem we were first introduced to and whatever is a part of your current company’s observability stack.
While the dev and ops silo continues to crumble….many organizations still relegate monitoring & observability as the purview of ops, infra and SRE teams. This is a mistake - achieving a highly observable system requires collaboration up and down the stack.
I, a former op, would like to extend an invitation to all application developers to join the observability party will share these foundational concepts to build on:
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...James Anderson
Effective Application Security in Software Delivery lifecycle using Deployment Firewall and DBOM
The modern software delivery process (or the CI/CD process) includes many tools, distributed teams, open-source code, and cloud platforms. Constant focus on speed to release software to market, along with the traditional slow and manual security checks has caused gaps in continuous security as an important piece in the software supply chain. Today organizations feel more susceptible to external and internal cyber threats due to the vast attack surface in their applications supply chain and the lack of end-to-end governance and risk management.
The software team must secure its software delivery process to avoid vulnerability and security breaches. This needs to be achieved with existing tool chains and without extensive rework of the delivery processes. This talk will present strategies and techniques for providing visibility into the true risk of the existing vulnerabilities, preventing the introduction of security issues in the software, resolving vulnerabilities in production environments quickly, and capturing the deployment bill of materials (DBOM).
Speakers:
Bob Boule
Robert Boule is a technology enthusiast with PASSION for technology and making things work along with a knack for helping others understand how things work. He comes with around 20 years of solution engineering experience in application security, software continuous delivery, and SaaS platforms. He is known for his dynamic presentations in CI/CD and application security integrated in software delivery lifecycle.
Gopinath Rebala
Gopinath Rebala is the CTO of OpsMx, where he has overall responsibility for the machine learning and data processing architectures for Secure Software Delivery. Gopi also has a strong connection with our customers, leading design and architecture for strategic implementations. Gopi is a frequent speaker and well-known leader in continuous delivery and integrating security into software delivery.
Dr. Sean Tan, Head of Data Science, Changi Airport Group
Discover how Changi Airport Group (CAG) leverages graph technologies and generative AI to revolutionize their search capabilities. This session delves into the unique search needs of CAG’s diverse passengers and customers, showcasing how graph data structures enhance the accuracy and relevance of AI-generated search results, mitigating the risk of “hallucinations” and improving the overall customer journey.
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...Neo4j
Leonard Jayamohan, Partner & Generative AI Lead, Deloitte
This keynote will reveal how Deloitte leverages Neo4j’s graph power for groundbreaking digital twin solutions, achieving a staggering 100x performance boost. Discover the essential role knowledge graphs play in successful generative AI implementations. Plus, get an exclusive look at an innovative Neo4j + Generative AI solution Deloitte is developing in-house.
Communications Mining Series - Zero to Hero - Session 1DianaGray10
This session provides introduction to UiPath Communication Mining, importance and platform overview. You will acquire a good understand of the phases in Communication Mining as we go over the platform with you. Topics covered:
• Communication Mining Overview
• Why is it important?
• How can it help today’s business and the benefits
• Phases in Communication Mining
• Demo on Platform overview
• Q/A
Essentials of Automations: The Art of Triggers and Actions in FMESafe Software
In this second installment of our Essentials of Automations webinar series, we’ll explore the landscape of triggers and actions, guiding you through the nuances of authoring and adapting workspaces for seamless automations. Gain an understanding of the full spectrum of triggers and actions available in FME, empowering you to enhance your workspaces for efficient automation.
We’ll kick things off by showcasing the most commonly used event-based triggers, introducing you to various automation workflows like manual triggers, schedules, directory watchers, and more. Plus, see how these elements play out in real scenarios.
Whether you’re tweaking your current setup or building from the ground up, this session will arm you with the tools and insights needed to transform your FME usage into a powerhouse of productivity. Join us to discover effective strategies that simplify complex processes, enhancing your productivity and transforming your data management practices with FME. Let’s turn complexity into clarity and make your workspaces work wonders!
Threats to mobile devices are more prevalent and increasing in scope and complexity. Users of mobile devices desire to take full advantage of the features
available on those devices, but many of the features provide convenience and capability but sacrifice security. This best practices guide outlines steps the users can take to better protect personal devices and information.
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
Software Development by the Genomics Standards Consortium
1. Bringing Standards to Life:
Software Development by the
Genomics
Standards Consortium
Renzo Kottmann
Microbial Genomics Group
Max Planck Institute for Marine Microbiology
M3 SIG Stockholm July 2009 1
2. Genomic Standards Consortium (GSC)
Goal
• Promote mechanisms that
standardize the description of genomes
exchange and integrate genomic data
Open-membership, international working body
• Established in Sept 2005
• Participants include DDBJ, EMBL, GenBank, Sanger,
JCVI, JGI, EBI and a range of US, UK and EU research
institutions
• Organized a series of workshops
2 2
http://gensc.org and http://gensc.org/gc_wiki/index.php/GSC_Membership
3. Minimum Information about a Genome Sequence
(MIGS) Specification
MIGS extends what DDBJ/EMBL/GenBank request
upon submission of a genome sequence
• Examples:
Description of geographic location of a sample and
habitat
“Minimum Information about a Metagenomic Sequence”
(MIMS)
– Temperature
– pH
Description of sequence generation
– Sequencing method
– Assembly method
3 3
Field et al. Nat Biotechnol. 2008
6. Software Development for MIGS/MIMS
Mechanisms for
achieving compliance
are needed:
• Such mechanisms
involve
an appropriate reporting
structure for capturing
and exchanging data,
software,
databases
and controlled
vocabularies and/or
ontologies for defining
the terms used in the
annotations.
6
Field et al. Nat Biotechnol. 2008
7. Software Development for MIGS/MIMS
Mechanisms for Supporting Projects:
achieving compliance • Habitat-Lite (Ontology
are needed: specification)
• Such mechanisms
involve
an appropriate reporting
structure for capturing
and exchanging data,
software,
databases
and controlled
vocabularies and/or
ontologies for defining
the terms used in the
annotations.
7
Field et al. Nat Biotechnol. 2008
8. Software Development for MIGS/MIMS
Mechanisms for Supporting Projects:
achieving compliance • Habitat-Lite (Ontology
are needed: specification)
• Such mechanisms • Genomic Rosetta Stone
involve (Identifier Mapping)
an appropriate reporting
structure for capturing
and exchanging data,
software,
databases
and controlled
vocabularies and/or
ontologies for defining
the terms used in the
annotations.
8
Field et al. Nat Biotechnol. 2008
9. Software Development for MIGS/MIMS
Mechanisms for Supporting Projects:
achieving compliance • Habitat-Lite (Ontology
are needed: specification)
• Such mechanisms • Genomic Rosetta Stone
involve (Identifier Mapping)
an appropriate reporting • GCDML (MIGS/MIMS
structure for capturing
and exchanging data, specification in XML)
software,
databases
and controlled
vocabularies and/or
ontologies for defining
the terms used in the
annotations.
9
Field et al. Nat Biotechnol. 2008
10. Software Development for MIGS/MIMS
Mechanisms for Supporting Projects:
achieving compliance • Habitat-Lite (Ontology
are needed: specification)
• Such mechanisms • Genomic Rosetta Stone
involve (Identifier Mapping)
an appropriate reporting • GCDML (MIGS/MIMS
structure for capturing
and exchanging data, specification in XML)
software, • Genomes Catalogue
databases (Database and Web
and controlled Server)
vocabularies and/or
ontologies for defining
the terms used in the
annotations.
10
Field et al. Nat Biotechnol. 2008
11. Aquatic Aquatic: Freshwater Acquatic: Marine Terrestrial Air Fossil Food Organism-Associated Extreme Habitat Other
Habitat-Lite (= EnvO-Lite)
Easy-to-use (small) set of terms
• Captures high-level information about habitat
• Derived from the Environment Ontology (EnvO).
Meet the needs of multiple users
• Annotators, database providers, biologists, and
bioinformaticians alike who need to search and
employ such data in comparative analyses.
Hirschman et al. OMICS. 2008 11 11
12. Habitat-Lite
1. Level 2. Level
Aquatic soil
Aquatic: Freshwater sediment
Aquatic: Marine sludge
Terrestrial waste water
Air hot spring
Fossil hydrothermal vent
Food biofilm
Organism-Associated microbial mat
Extreme Habitat
Other
< 20 terms
Hirschman et al. OMICS. 2008 12 12
14. Genomic Rosetta Stone (GRS)
Create a unified mapping between different genomic
resources
Improve navigation across these resources
Enable the integration of this information in the near
future.
Van Brabant et al. OMICS. 2008 14 14
16. Genomic Rosetta Stone (GRS)
Enable the integration of this information in the near
future
Van Brabant et al. OMICS. 2008 16 16
17. Genomic Contextual Data
Markup Language (GCDML)
An Extensible Markup Language (XML)
Aim
• Implement MIGS/MIMS
• Provide even more descriptors
• Facilitate exchange and integration of genomic data
Kottmann et al. OMICS. 2008 17 17
22. Genome Catalogue
Requirements
• A Rich toolkit/user-friendly
• Designed to give credit to all contributors
• XML-based (GCDML)
Able to maintain all versions of GCDML schemas
• Web services-based
Supporting the automated exchange of content
• Serve as the international GCAT identifier authority
• Comprehensive
Containing reports for all taxa and metagenomes
• Ontology-supportive
• Shared by the GSC
22 22
23. Current Status
We have specifications:
• MIGS/MIMS
• Habitat-Lite
• Genomic Rosetta Stone
Work on supporting software is ongoing:
• Genomes Catalogue is in prototype status
• Funding
This is a long-term endeavour that can not be done on a
voluntary basis
23 23
24. Disscusion
Need of software for:
• Creation of MIGS/MIMS data
• Storage
• Analysis
Expand standardization efforts to
• Software specification/development
• Work on a standardized genomic data management
architecture / cyberinfrastructure
Data intensive science is successful if it works
towards one community with one vision
• World Wide Genomics project
24 24
25. Acknowledgements
All Members of GSC incl.
Dawn Field
Peter Sterk
Saul Kravitz
Tanya Gray
Megx.net team
Frank Oliver Glöckner
Ivaylo Kostadinov
Melissa Beth Duhaime
Pier Luigi Buttigieg
Wolfgang Hankeln
Pelin Yilmaz
25