The MIAPA ontology: An annotation ontology for validating minimum metadata reporting for phylogenetic analyses

Hilmar Lapp
Hilmar LappBioinformatician, Technologist
The MIAPA ontology: An
annotation ontology for validating
minimum metadata reporting for
phylogenetic analyses
Hilmar Lapp, Joachim Baran, Enrico Pontelli,
Arlin Stoltzfus, Ramona Walls
2013 iEvoBio Conference
Snowbird, UT
Phylogenetic trees are published
at a rapidly increasing rate
Abby et al (2012) PNAS
Bininda-Emonds et al (2007)
Betancur et al 2013
Smith et al 2011
Phylogenies are key to
many biological questions
Edwards & Smith 2010
Jetz et al 2012
Emerson & Gillespie, 2008
Reuse of phylogenies is
nonetheless rare
“Most attempts at
phylogenetic data reuse seem
to end in disappointment.”
Part of the challenge is
digital access
{<4% of published phylogenies
The other challenge:
Determining fitness for purpose
•What kind of tree
is it?
•Does it have
branch lengths?
Support values?
•From what kind of
matrix or
alignment?
•How was the
alignment created,
how the tree?
OMICS, 2006
“If inadequately documented, however,
even the most carefully constructed
phylogenetic analysis will languish in
the pages of a journal.”
Standards need
normative documents
Normative
Other
implentations
MIAME
MAGE-ML
MAGE-TAB
SOFT (GEO)
MIGS/MIMS/
MIMARKS
GCDML
Structured
Comments
(NCBI)
MIAPA ❌
Steps towards a Minimum
Information About a Phylogenetic
Analysis (MIAPA) Standard
2011 TDWG Conference
Draft MIAPA checklist
• Topology: including identifier; type of tree, topology, and
consensus
• OTUs: Terminal nodes with appropriate labels and
referenced to meaningful external identifier(s)
• Branch lengths and branch support, unless not
applicable to analysis method.
• Character matrix: aligned data matrix that is the basis
for the tree, with data type; accession numbers of the
sequences; mapping from rows to a tip of the topology
• Alignment method
• Tree inference method Now on Github:
http://github.com/miapa/miapa
MIAPA Ontology
• Initiated at the Phylotastic II hackathon in
Tucson (Feb 2013)
• Developed in OWL
• Designed to follow OBO Foundry best practices
• Open: CC0, http://github.com/miapa/miapa
• Mailing list: miapa-discuss@googlegroups.com
• OBO ID policy:
http://purl.obolibrary.org/obo/
MIAPA_NNNNNNN for term IDs
MIAPA Ontology
• Application Ontology: Reuses most terms and
properties from existing ontologies
• Comparative Data Analysis Ontology (CDAO)
• Information Artifact Ontology (IAO)
• Software Ontology (SWO + EDAM)
• W3C Provenance Ontology (PROV)
• In addition, some properties from Darwin Core
(DwC), Semantic Darwin Core (DSW), and
Dublin Core (DC)
MIAPA Checklist: Example Topology
Already
(mostly)
provided by
CDAO
Reused terms with usage
recommendations
Data provenance
PROV Model
Data provenance
PROV Model
Data annotation using
the MIAPA Ontology
Data annotation using
the MIAPA Ontology
Annotated a total
of 10 published
phylogenies
See http://
github.com/miapa/
miapa
Phylogenetic Tree and Data
RDF Triplestore
RDF, using CDAO
RDFization
MIAPA
Expert KnowledgeManual
Annotation
RDF, using CDAO and MIAPA
Automatic
Annotation
Extract &
Transform
MIAPA
information
Actor
SPARQL
Queries
MIAPA annotations can
facilitate data integration
Incentivizing better metadata
T O B C A T
MIAPA
Acknowledgements
• Jim Leebens-Mack and other authors of the 2006
publication
• Maryam Panahiazar for earlier work on ontology
support for MIAPA
• Nico Cellinese and participants of the 2011 TDWG
Workshop on MIAPA
• HIP Leadership Team (organizers of the 2013
Phylotastic Hackathon in Tucson)
• Andrea Thomer, co-conspirator at the Tucson hackathon
• Phylotastic Hackathon sponsors: NESCent, Biodiversity
Synthesis Center, iPlant Collaborative
1 of 21

Recommended

Linking Data, Linking People by
Linking Data, Linking PeopleLinking Data, Linking People
Linking Data, Linking PeoplefereiraJ
787 views42 slides
Ontologies neo4j-graph-workshop-berlin by
Ontologies neo4j-graph-workshop-berlinOntologies neo4j-graph-workshop-berlin
Ontologies neo4j-graph-workshop-berlinSimon Jupp
679 views26 slides
schema.org and biomedical ontologies by
schema.org and biomedical ontologies schema.org and biomedical ontologies
schema.org and biomedical ontologies Simon Jupp
345 views25 slides
Facilitating semantic alignment.-biohackathon-jupp by
Facilitating semantic alignment.-biohackathon-juppFacilitating semantic alignment.-biohackathon-jupp
Facilitating semantic alignment.-biohackathon-juppSimon Jupp
344 views19 slides
The FAIRDOM Commons for Systems Biology by
The FAIRDOM Commons for Systems BiologyThe FAIRDOM Commons for Systems Biology
The FAIRDOM Commons for Systems BiologyFAIRDOM
2.3K views29 slides
Report of the second FAIRDOM foundry by
Report of the second FAIRDOM foundryReport of the second FAIRDOM foundry
Report of the second FAIRDOM foundryFAIRDOM
1.1K views10 slides

More Related Content

What's hot

Importing life science at a into Neo4j by
Importing life science at a into Neo4jImporting life science at a into Neo4j
Importing life science at a into Neo4jSimon Jupp
798 views19 slides
Reflections on a (slightly unusual) multi-disciplinary academic career by
Reflections on a (slightly unusual) multi-disciplinary academic careerReflections on a (slightly unusual) multi-disciplinary academic career
Reflections on a (slightly unusual) multi-disciplinary academic careerCarole Goble
482 views38 slides
Reproducible and citable data and models: an introduction. by
Reproducible and citable data and models: an introduction.Reproducible and citable data and models: an introduction.
Reproducible and citable data and models: an introduction.FAIRDOM
4.2K views15 slides
How to make your published data findable, accessible, interoperable and reusable by
How to make your published data findable, accessible, interoperable and reusableHow to make your published data findable, accessible, interoperable and reusable
How to make your published data findable, accessible, interoperable and reusablePhoenix Bioinformatics
128 views48 slides
Reproducibility (and the R*) of Science: motivations, challenges and trends by
Reproducibility (and the R*) of Science: motivations, challenges and trendsReproducibility (and the R*) of Science: motivations, challenges and trends
Reproducibility (and the R*) of Science: motivations, challenges and trendsCarole Goble
1.8K views37 slides
Bh14 ogo by
Bh14 ogoBh14 ogo
Bh14 ogojesualdofernandez
365 views17 slides

What's hot(20)

Importing life science at a into Neo4j by Simon Jupp
Importing life science at a into Neo4jImporting life science at a into Neo4j
Importing life science at a into Neo4j
Simon Jupp798 views
Reflections on a (slightly unusual) multi-disciplinary academic career by Carole Goble
Reflections on a (slightly unusual) multi-disciplinary academic careerReflections on a (slightly unusual) multi-disciplinary academic career
Reflections on a (slightly unusual) multi-disciplinary academic career
Carole Goble482 views
Reproducible and citable data and models: an introduction. by FAIRDOM
Reproducible and citable data and models: an introduction.Reproducible and citable data and models: an introduction.
Reproducible and citable data and models: an introduction.
FAIRDOM4.2K views
How to make your published data findable, accessible, interoperable and reusable by Phoenix Bioinformatics
How to make your published data findable, accessible, interoperable and reusableHow to make your published data findable, accessible, interoperable and reusable
How to make your published data findable, accessible, interoperable and reusable
Reproducibility (and the R*) of Science: motivations, challenges and trends by Carole Goble
Reproducibility (and the R*) of Science: motivations, challenges and trendsReproducibility (and the R*) of Science: motivations, challenges and trends
Reproducibility (and the R*) of Science: motivations, challenges and trends
Carole Goble1.8K views
What is Reproducibility? The R* brouhaha (and how Research Objects can help) by Carole Goble
What is Reproducibility? The R* brouhaha (and how Research Objects can help)What is Reproducibility? The R* brouhaha (and how Research Objects can help)
What is Reproducibility? The R* brouhaha (and how Research Objects can help)
Carole Goble1.5K views
Reproducible Research: how could Research Objects help by Carole Goble
Reproducible Research: how could Research Objects helpReproducible Research: how could Research Objects help
Reproducible Research: how could Research Objects help
Carole Goble605 views
GraphConnect Europe 2016 - Building a Repository of Biomedical Ontologies wit... by Neo4j
GraphConnect Europe 2016 - Building a Repository of Biomedical Ontologies wit...GraphConnect Europe 2016 - Building a Repository of Biomedical Ontologies wit...
GraphConnect Europe 2016 - Building a Repository of Biomedical Ontologies wit...
Neo4j784 views
FAIRDOM - FAIR Asset management and sharing experiences in Systems and Synthe... by Carole Goble
FAIRDOM - FAIR Asset management and sharing experiences in Systems and Synthe...FAIRDOM - FAIR Asset management and sharing experiences in Systems and Synthe...
FAIRDOM - FAIR Asset management and sharing experiences in Systems and Synthe...
Carole Goble459 views
Research Objects, SEEK and FAIRDOM by Carole Goble
Research Objects, SEEK and FAIRDOMResearch Objects, SEEK and FAIRDOM
Research Objects, SEEK and FAIRDOM
Carole Goble1.7K views
Annotopia open annotation services platform by Tim Clark
Annotopia open annotation services platformAnnotopia open annotation services platform
Annotopia open annotation services platform
Tim Clark762 views
FAIR Data, Operations and Model management for Systems Biology and Systems Me... by Carole Goble
FAIR Data, Operations and Model management for Systems Biology and Systems Me...FAIR Data, Operations and Model management for Systems Biology and Systems Me...
FAIR Data, Operations and Model management for Systems Biology and Systems Me...
Carole Goble1.5K views
Improving the Management of Computational Models -- Invited talk at the EBI by Martin Scharm
Improving the Management of Computational Models -- Invited talk at the EBIImproving the Management of Computational Models -- Invited talk at the EBI
Improving the Management of Computational Models -- Invited talk at the EBI
Martin Scharm6K views
Small Science: First Impressions of Curation Needs. Presentation at Digital L... by Sarah Shreeves
Small Science: First Impressions of Curation Needs. Presentation at Digital L...Small Science: First Impressions of Curation Needs. Presentation at Digital L...
Small Science: First Impressions of Curation Needs. Presentation at Digital L...
Sarah Shreeves908 views
Reproducibility of model-based results: standards, infrastructure, and recogn... by FAIRDOM
Reproducibility of model-based results: standards, infrastructure, and recogn...Reproducibility of model-based results: standards, infrastructure, and recogn...
Reproducibility of model-based results: standards, infrastructure, and recogn...
FAIRDOM4K views
WIDOCO: A Wizard for Documenting Ontologies by dgarijo
WIDOCO: A Wizard for Documenting OntologiesWIDOCO: A Wizard for Documenting Ontologies
WIDOCO: A Wizard for Documenting Ontologies
dgarijo1.2K views
eXframe: A Semantic Web Platform for Genomic Experiments by Tim Clark
eXframe: A Semantic Web Platform for Genomic ExperimentseXframe: A Semantic Web Platform for Genomic Experiments
eXframe: A Semantic Web Platform for Genomic Experiments
Tim Clark432 views

Viewers also liked

KIERASAYS: by
KIERASAYS:KIERASAYS:
KIERASAYS:kieralove
268 views58 slides
Safe school environments by
Safe school environmentsSafe school environments
Safe school environmentstrtkaren
667 views6 slides
Reasoning over phenotype diversity, character change, and evolutionary descent by
Reasoning over phenotype diversity, character change, and evolutionary descentReasoning over phenotype diversity, character change, and evolutionary descent
Reasoning over phenotype diversity, character change, and evolutionary descentHilmar Lapp
698 views73 slides
Semantics of and for the diversity of life:
 Opportunities and perils of tryi... by
Semantics of and for the diversity of life:
 Opportunities and perils of tryi...Semantics of and for the diversity of life:
 Opportunities and perils of tryi...
Semantics of and for the diversity of life:
 Opportunities and perils of tryi...Hilmar Lapp
2.1K views79 slides
Bringing reason to phenotype diversity, character change, and common descent by
Bringing reason to phenotype diversity, character change, and common descentBringing reason to phenotype diversity, character change, and common descent
Bringing reason to phenotype diversity, character change, and common descentHilmar Lapp
1.2K views50 slides
Open science, open-source, and open data: Collaboration as an emergent property? by
Open science, open-source, and open data: Collaboration as an emergent property?Open science, open-source, and open data: Collaboration as an emergent property?
Open science, open-source, and open data: Collaboration as an emergent property?Hilmar Lapp
2.7K views29 slides

Viewers also liked(9)

KIERASAYS: by kieralove
KIERASAYS:KIERASAYS:
KIERASAYS:
kieralove268 views
Safe school environments by trtkaren
Safe school environmentsSafe school environments
Safe school environments
trtkaren667 views
Reasoning over phenotype diversity, character change, and evolutionary descent by Hilmar Lapp
Reasoning over phenotype diversity, character change, and evolutionary descentReasoning over phenotype diversity, character change, and evolutionary descent
Reasoning over phenotype diversity, character change, and evolutionary descent
Hilmar Lapp698 views
Semantics of and for the diversity of life:
 Opportunities and perils of tryi... by Hilmar Lapp
Semantics of and for the diversity of life:
 Opportunities and perils of tryi...Semantics of and for the diversity of life:
 Opportunities and perils of tryi...
Semantics of and for the diversity of life:
 Opportunities and perils of tryi...
Hilmar Lapp2.1K views
Bringing reason to phenotype diversity, character change, and common descent by Hilmar Lapp
Bringing reason to phenotype diversity, character change, and common descentBringing reason to phenotype diversity, character change, and common descent
Bringing reason to phenotype diversity, character change, and common descent
Hilmar Lapp1.2K views
Open science, open-source, and open data: Collaboration as an emergent property? by Hilmar Lapp
Open science, open-source, and open data: Collaboration as an emergent property?Open science, open-source, and open data: Collaboration as an emergent property?
Open science, open-source, and open data: Collaboration as an emergent property?
Hilmar Lapp2.7K views
Quality professional development by trtkaren
Quality professional developmentQuality professional development
Quality professional development
trtkaren427 views
Phyloinformatics VoCamp by Hilmar Lapp
Phyloinformatics VoCampPhyloinformatics VoCamp
Phyloinformatics VoCamp
Hilmar Lapp775 views

Similar to The MIAPA ontology: An annotation ontology for validating minimum metadata reporting for phylogenetic analyses

Measuring electronic resource availability final version by
Measuring electronic resource availability final versionMeasuring electronic resource availability final version
Measuring electronic resource availability final versionSanjeet Mann
320 views31 slides
Being FAIR: FAIR data and model management SSBSS 2017 Summer School by
Being FAIR:  FAIR data and model management SSBSS 2017 Summer SchoolBeing FAIR:  FAIR data and model management SSBSS 2017 Summer School
Being FAIR: FAIR data and model management SSBSS 2017 Summer SchoolCarole Goble
978 views65 slides
Investigating plant systems using data integration and network analysis by
Investigating plant systems using data integration and network analysisInvestigating plant systems using data integration and network analysis
Investigating plant systems using data integration and network analysisCatherine Canevet
3K views96 slides
Research Objects: more than the sum of the parts by
Research Objects: more than the sum of the partsResearch Objects: more than the sum of the parts
Research Objects: more than the sum of the partsCarole Goble
1.5K views45 slides
Training daypresentation by
Training daypresentationTraining daypresentation
Training daypresentationAmy Fry
279 views27 slides
OhioLINK ERM Forum: The Front End by
OhioLINK ERM Forum: The Front EndOhioLINK ERM Forum: The Front End
OhioLINK ERM Forum: The Front EndAmy Fry
614 views27 slides

Similar to The MIAPA ontology: An annotation ontology for validating minimum metadata reporting for phylogenetic analyses(20)

Measuring electronic resource availability final version by Sanjeet Mann
Measuring electronic resource availability final versionMeasuring electronic resource availability final version
Measuring electronic resource availability final version
Sanjeet Mann320 views
Being FAIR: FAIR data and model management SSBSS 2017 Summer School by Carole Goble
Being FAIR:  FAIR data and model management SSBSS 2017 Summer SchoolBeing FAIR:  FAIR data and model management SSBSS 2017 Summer School
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
Carole Goble978 views
Investigating plant systems using data integration and network analysis by Catherine Canevet
Investigating plant systems using data integration and network analysisInvestigating plant systems using data integration and network analysis
Investigating plant systems using data integration and network analysis
Research Objects: more than the sum of the parts by Carole Goble
Research Objects: more than the sum of the partsResearch Objects: more than the sum of the parts
Research Objects: more than the sum of the parts
Carole Goble1.5K views
Training daypresentation by Amy Fry
Training daypresentationTraining daypresentation
Training daypresentation
Amy Fry279 views
OhioLINK ERM Forum: The Front End by Amy Fry
OhioLINK ERM Forum: The Front EndOhioLINK ERM Forum: The Front End
OhioLINK ERM Forum: The Front End
Amy Fry614 views
Towards a Simple, Standards-Compliant, and Generic Phylogenetic Database by Hilmar Lapp
Towards a Simple, Standards-Compliant, and Generic Phylogenetic DatabaseTowards a Simple, Standards-Compliant, and Generic Phylogenetic Database
Towards a Simple, Standards-Compliant, and Generic Phylogenetic Database
Hilmar Lapp722 views
The PLUTo project @iEvoBio 2014 by Ross Mounce
The PLUTo project @iEvoBio 2014The PLUTo project @iEvoBio 2014
The PLUTo project @iEvoBio 2014
Ross Mounce2.1K views
Next generation sequencing requires next generation publishing: the Biodivers... by Vince Smith
Next generation sequencing requires next generation publishing: the Biodivers...Next generation sequencing requires next generation publishing: the Biodivers...
Next generation sequencing requires next generation publishing: the Biodivers...
Vince Smith1.1K views
Cross-Community User Requirements and the Biodiversity Heritage Library by Chris Freeland
Cross-Community User Requirements and the Biodiversity Heritage LibraryCross-Community User Requirements and the Biodiversity Heritage Library
Cross-Community User Requirements and the Biodiversity Heritage Library
Chris Freeland455 views
Fairification experience clarifying the semantics of data matrices by Pistoia Alliance
Fairification experience clarifying the semantics of data matricesFairification experience clarifying the semantics of data matrices
Fairification experience clarifying the semantics of data matrices
Pistoia Alliance465 views
Yde de Jong & Dave Roberts - ZooBank and EDIT: Towards a business model for Z... by ICZN
Yde de Jong & Dave Roberts - ZooBank and EDIT: Towards a business model for Z...Yde de Jong & Dave Roberts - ZooBank and EDIT: Towards a business model for Z...
Yde de Jong & Dave Roberts - ZooBank and EDIT: Towards a business model for Z...
ICZN924 views
Ontologies for biodiversity informatics, UiO DSC June 2023 by Dag Endresen
 Ontologies for biodiversity informatics, UiO DSC June 2023 Ontologies for biodiversity informatics, UiO DSC June 2023
Ontologies for biodiversity informatics, UiO DSC June 2023
Dag Endresen84 views
Alexandra Basford, InCoB 2011: A Journal’s Perspective on Data Standards and ... by GigaScience, BGI Hong Kong
Alexandra Basford, InCoB 2011: A Journal’s Perspective on Data Standards and ...Alexandra Basford, InCoB 2011: A Journal’s Perspective on Data Standards and ...
Alexandra Basford, InCoB 2011: A Journal’s Perspective on Data Standards and ...
Preserving the Inputs and Outputs of Scholarship by tsbbbu
Preserving the Inputs and Outputs of ScholarshipPreserving the Inputs and Outputs of Scholarship
Preserving the Inputs and Outputs of Scholarship
tsbbbu318 views
Case Study Life Sciences Data: Central for Integrative Systems Biology and Bi... by sesrdm
Case Study Life Sciences Data: Central for Integrative Systems Biology and Bi...Case Study Life Sciences Data: Central for Integrative Systems Biology and Bi...
Case Study Life Sciences Data: Central for Integrative Systems Biology and Bi...
sesrdm461 views
Bio 1010 2012 by nmjb
Bio 1010 2012Bio 1010 2012
Bio 1010 2012
nmjb672 views
The iPlant Tree of Life Project and Toolkit by Naim Matasci
The iPlant Tree of Life Project and ToolkitThe iPlant Tree of Life Project and Toolkit
The iPlant Tree of Life Project and Toolkit
Naim Matasci382 views

More from Hilmar Lapp

Of Trees and Owl: 
The challenges of reasoning over the semantics of shared d... by
Of Trees and Owl: 
The challenges of reasoning over the semantics of shared d...Of Trees and Owl: 
The challenges of reasoning over the semantics of shared d...
Of Trees and Owl: 
The challenges of reasoning over the semantics of shared d...Hilmar Lapp
275 views35 slides
Integrating data with phylogenies, at scale by
Integrating data with phylogenies, at scaleIntegrating data with phylogenies, at scale
Integrating data with phylogenies, at scaleHilmar Lapp
441 views43 slides
Rphenoscape: 
Connecting the semantics of evolutionary morphology to comparat... by
Rphenoscape: 
Connecting the semantics of evolutionary morphology to comparat...Rphenoscape: 
Connecting the semantics of evolutionary morphology to comparat...
Rphenoscape: 
Connecting the semantics of evolutionary morphology to comparat...Hilmar Lapp
499 views24 slides
Towards ubiquitous OWL computing: Simplifying programmatic authoring of and q... by
Towards ubiquitous OWL computing: Simplifying programmatic authoring of and q...Towards ubiquitous OWL computing: Simplifying programmatic authoring of and q...
Towards ubiquitous OWL computing: Simplifying programmatic authoring of and q...Hilmar Lapp
782 views24 slides
Open Bioinformatics Foundation: 2014 Update & Some Introspection by
Open Bioinformatics Foundation: 2014 Update & Some IntrospectionOpen Bioinformatics Foundation: 2014 Update & Some Introspection
Open Bioinformatics Foundation: 2014 Update & Some IntrospectionHilmar Lapp
1.2K views20 slides
Reproducible Science - Panel at iEvoBio 2014 by
Reproducible Science - Panel at iEvoBio 2014 Reproducible Science - Panel at iEvoBio 2014
Reproducible Science - Panel at iEvoBio 2014 Hilmar Lapp
1.1K views12 slides

More from Hilmar Lapp(14)

Of Trees and Owl: 
The challenges of reasoning over the semantics of shared d... by Hilmar Lapp
Of Trees and Owl: 
The challenges of reasoning over the semantics of shared d...Of Trees and Owl: 
The challenges of reasoning over the semantics of shared d...
Of Trees and Owl: 
The challenges of reasoning over the semantics of shared d...
Hilmar Lapp275 views
Integrating data with phylogenies, at scale by Hilmar Lapp
Integrating data with phylogenies, at scaleIntegrating data with phylogenies, at scale
Integrating data with phylogenies, at scale
Hilmar Lapp441 views
Rphenoscape: 
Connecting the semantics of evolutionary morphology to comparat... by Hilmar Lapp
Rphenoscape: 
Connecting the semantics of evolutionary morphology to comparat...Rphenoscape: 
Connecting the semantics of evolutionary morphology to comparat...
Rphenoscape: 
Connecting the semantics of evolutionary morphology to comparat...
Hilmar Lapp499 views
Towards ubiquitous OWL computing: Simplifying programmatic authoring of and q... by Hilmar Lapp
Towards ubiquitous OWL computing: Simplifying programmatic authoring of and q...Towards ubiquitous OWL computing: Simplifying programmatic authoring of and q...
Towards ubiquitous OWL computing: Simplifying programmatic authoring of and q...
Hilmar Lapp782 views
Open Bioinformatics Foundation: 2014 Update & Some Introspection by Hilmar Lapp
Open Bioinformatics Foundation: 2014 Update & Some IntrospectionOpen Bioinformatics Foundation: 2014 Update & Some Introspection
Open Bioinformatics Foundation: 2014 Update & Some Introspection
Hilmar Lapp1.2K views
Reproducible Science - Panel at iEvoBio 2014 by Hilmar Lapp
Reproducible Science - Panel at iEvoBio 2014 Reproducible Science - Panel at iEvoBio 2014
Reproducible Science - Panel at iEvoBio 2014
Hilmar Lapp1.1K views
The Dryad Digital Repository: Published data as part of the greater data ecos... by Hilmar Lapp
The Dryad Digital Repository: Published data as part of the greater data ecos...The Dryad Digital Repository: Published data as part of the greater data ecos...
The Dryad Digital Repository: Published data as part of the greater data ecos...
Hilmar Lapp1.5K views
PhyloCommons: Sharing, annotating, and reusing Phylogenies by Hilmar Lapp
PhyloCommons: Sharing, annotating, and reusing PhylogeniesPhyloCommons: Sharing, annotating, and reusing Phylogenies
PhyloCommons: Sharing, annotating, and reusing Phylogenies
Hilmar Lapp763 views
OBF Address at BOSC 2013 by Hilmar Lapp
OBF Address at BOSC 2013OBF Address at BOSC 2013
OBF Address at BOSC 2013
Hilmar Lapp688 views
The blessing and the curse: handshaking between general and specialist data r... by Hilmar Lapp
The blessing and the curse: handshaking between general and specialist data r...The blessing and the curse: handshaking between general and specialist data r...
The blessing and the curse: handshaking between general and specialist data r...
Hilmar Lapp970 views
Liberating Our Beautiful Trees: A Call to Arms. by Hilmar Lapp
Liberating Our Beautiful Trees: A Call to Arms.Liberating Our Beautiful Trees: A Call to Arms.
Liberating Our Beautiful Trees: A Call to Arms.
Hilmar Lapp2.3K views
OBF Address at BOSC 2012 by Hilmar Lapp
OBF Address at BOSC 2012OBF Address at BOSC 2012
OBF Address at BOSC 2012
Hilmar Lapp608 views
Lapp, ISCB Software Sharing Symposium by Hilmar Lapp
Lapp, ISCB Software Sharing SymposiumLapp, ISCB Software Sharing Symposium
Lapp, ISCB Software Sharing Symposium
Hilmar Lapp433 views
BioSQL Reloaded: v1.0 Release, PhyloDB Module, and Future Features by Hilmar Lapp
BioSQL Reloaded: v1.0 Release, PhyloDB Module, and Future FeaturesBioSQL Reloaded: v1.0 Release, PhyloDB Module, and Future Features
BioSQL Reloaded: v1.0 Release, PhyloDB Module, and Future Features
Hilmar Lapp1.2K views

Recently uploaded

EILO EXCURSION PROGRAMME 2023 by
EILO EXCURSION PROGRAMME 2023EILO EXCURSION PROGRAMME 2023
EILO EXCURSION PROGRAMME 2023info33492
202 views40 slides
StudioX.pptx by
StudioX.pptxStudioX.pptx
StudioX.pptxNikhileshSathyavarap
101 views18 slides
Meet the Bible by
Meet the BibleMeet the Bible
Meet the BibleSteve Thomason
78 views80 slides
Parts of Speech (1).pptx by
Parts of Speech (1).pptxParts of Speech (1).pptx
Parts of Speech (1).pptxmhkpreet001
46 views20 slides
JRN 362 - Lecture Twenty-Three (Epilogue) by
JRN 362 - Lecture Twenty-Three (Epilogue)JRN 362 - Lecture Twenty-Three (Epilogue)
JRN 362 - Lecture Twenty-Three (Epilogue)Rich Hanley
41 views57 slides

Recently uploaded(20)

EILO EXCURSION PROGRAMME 2023 by info33492
EILO EXCURSION PROGRAMME 2023EILO EXCURSION PROGRAMME 2023
EILO EXCURSION PROGRAMME 2023
info33492202 views
Parts of Speech (1).pptx by mhkpreet001
Parts of Speech (1).pptxParts of Speech (1).pptx
Parts of Speech (1).pptx
mhkpreet00146 views
JRN 362 - Lecture Twenty-Three (Epilogue) by Rich Hanley
JRN 362 - Lecture Twenty-Three (Epilogue)JRN 362 - Lecture Twenty-Three (Epilogue)
JRN 362 - Lecture Twenty-Three (Epilogue)
Rich Hanley41 views
Monthly Information Session for MV Asterix (November) by Esquimalt MFRC
Monthly Information Session for MV Asterix (November)Monthly Information Session for MV Asterix (November)
Monthly Information Session for MV Asterix (November)
Esquimalt MFRC107 views
Create a Structure in VBNet.pptx by Breach_P
Create a Structure in VBNet.pptxCreate a Structure in VBNet.pptx
Create a Structure in VBNet.pptx
Breach_P86 views
Education of marginalized and socially disadvantages segments.pptx by GarimaBhati5
Education of marginalized and socially disadvantages segments.pptxEducation of marginalized and socially disadvantages segments.pptx
Education of marginalized and socially disadvantages segments.pptx
GarimaBhati543 views
Creative Restart 2023: Leonard Savage - The Permanent Brief: Unearthing unobv... by Taste
Creative Restart 2023: Leonard Savage - The Permanent Brief: Unearthing unobv...Creative Restart 2023: Leonard Savage - The Permanent Brief: Unearthing unobv...
Creative Restart 2023: Leonard Savage - The Permanent Brief: Unearthing unobv...
Taste55 views
11.30.23A Poverty and Inequality in America.pptx by mary850239
11.30.23A Poverty and Inequality in America.pptx11.30.23A Poverty and Inequality in America.pptx
11.30.23A Poverty and Inequality in America.pptx
mary850239130 views
CUNY IT Picciano.pptx by apicciano
CUNY IT Picciano.pptxCUNY IT Picciano.pptx
CUNY IT Picciano.pptx
apicciano64 views
JQUERY.pdf by ArthyR3
JQUERY.pdfJQUERY.pdf
JQUERY.pdf
ArthyR3105 views
Retail Store Scavenger Hunt.pptx by jmurphy154
Retail Store Scavenger Hunt.pptxRetail Store Scavenger Hunt.pptx
Retail Store Scavenger Hunt.pptx
jmurphy15452 views

The MIAPA ontology: An annotation ontology for validating minimum metadata reporting for phylogenetic analyses

  • 1. The MIAPA ontology: An annotation ontology for validating minimum metadata reporting for phylogenetic analyses Hilmar Lapp, Joachim Baran, Enrico Pontelli, Arlin Stoltzfus, Ramona Walls 2013 iEvoBio Conference Snowbird, UT
  • 2. Phylogenetic trees are published at a rapidly increasing rate Abby et al (2012) PNAS Bininda-Emonds et al (2007) Betancur et al 2013 Smith et al 2011
  • 3. Phylogenies are key to many biological questions Edwards & Smith 2010 Jetz et al 2012 Emerson & Gillespie, 2008
  • 4. Reuse of phylogenies is nonetheless rare “Most attempts at phylogenetic data reuse seem to end in disappointment.”
  • 5. Part of the challenge is digital access {<4% of published phylogenies
  • 6. The other challenge: Determining fitness for purpose •What kind of tree is it? •Does it have branch lengths? Support values? •From what kind of matrix or alignment? •How was the alignment created, how the tree?
  • 7. OMICS, 2006 “If inadequately documented, however, even the most carefully constructed phylogenetic analysis will languish in the pages of a journal.”
  • 8. Standards need normative documents Normative Other implentations MIAME MAGE-ML MAGE-TAB SOFT (GEO) MIGS/MIMS/ MIMARKS GCDML Structured Comments (NCBI) MIAPA ❌
  • 9. Steps towards a Minimum Information About a Phylogenetic Analysis (MIAPA) Standard 2011 TDWG Conference
  • 10. Draft MIAPA checklist • Topology: including identifier; type of tree, topology, and consensus • OTUs: Terminal nodes with appropriate labels and referenced to meaningful external identifier(s) • Branch lengths and branch support, unless not applicable to analysis method. • Character matrix: aligned data matrix that is the basis for the tree, with data type; accession numbers of the sequences; mapping from rows to a tip of the topology • Alignment method • Tree inference method Now on Github: http://github.com/miapa/miapa
  • 11. MIAPA Ontology • Initiated at the Phylotastic II hackathon in Tucson (Feb 2013) • Developed in OWL • Designed to follow OBO Foundry best practices • Open: CC0, http://github.com/miapa/miapa • Mailing list: miapa-discuss@googlegroups.com • OBO ID policy: http://purl.obolibrary.org/obo/ MIAPA_NNNNNNN for term IDs
  • 12. MIAPA Ontology • Application Ontology: Reuses most terms and properties from existing ontologies • Comparative Data Analysis Ontology (CDAO) • Information Artifact Ontology (IAO) • Software Ontology (SWO + EDAM) • W3C Provenance Ontology (PROV) • In addition, some properties from Darwin Core (DwC), Semantic Darwin Core (DSW), and Dublin Core (DC)
  • 13. MIAPA Checklist: Example Topology Already (mostly) provided by CDAO
  • 14. Reused terms with usage recommendations
  • 17. Data annotation using the MIAPA Ontology
  • 18. Data annotation using the MIAPA Ontology Annotated a total of 10 published phylogenies See http:// github.com/miapa/ miapa
  • 19. Phylogenetic Tree and Data RDF Triplestore RDF, using CDAO RDFization MIAPA Expert KnowledgeManual Annotation RDF, using CDAO and MIAPA Automatic Annotation Extract & Transform MIAPA information Actor SPARQL Queries MIAPA annotations can facilitate data integration
  • 21. Acknowledgements • Jim Leebens-Mack and other authors of the 2006 publication • Maryam Panahiazar for earlier work on ontology support for MIAPA • Nico Cellinese and participants of the 2011 TDWG Workshop on MIAPA • HIP Leadership Team (organizers of the 2013 Phylotastic Hackathon in Tucson) • Andrea Thomer, co-conspirator at the Tucson hackathon • Phylotastic Hackathon sponsors: NESCent, Biodiversity Synthesis Center, iPlant Collaborative