Ontology-based Tools to Enhance the Curation Workflow
1. Ontology-based Tools to Enhance
the Curation Workflow
Trish Whetzel
Outreach Coordinator
THE NATIONAL CENTER FOR
BIOMEDICAL ONTOLOGY
2. Enhancing the Curation
Workflow
• Curation
– The activity of organizing, representing and making biological information accessible to both humans and computers [1]
• Constraints
– Large amounts of data
– Time-consuming
• Components to enhance
– Data submission
– Ontology enrichment
– Annotation of textual metadata
[1] Howe et al. Nature. 2008 Oct 2;455(7213):590
4. Enhancing the Curation
Workflow
• Data submission
– Ontology Widgets and Ontology Web services
• Ontology enrichment
– BioPortal Notes
• Annotation of textual metadata
– NCBO Annotator
5. Tools for Data Submission
• Ontology Widgets
– Code to use in your web site
– Search for and select terms from ontologies to
annotate your data
– Enables consistent annotation of data by re-use of
the properly formatted term
– Available for all ontologies in BioPortal
9. Tools for Data Submission
• Ontology Web services
– Access to ontology content via REST services
• Types
– Search across all BioPortal ontologies
– Get Term details
– Get Term parents, children or siblings
– Extract subsets of terms
• Available for all ontologies in BioPortal
– http://www.bioontology.org/wiki/index.php/BioPortal_REST_services
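As a rough illustration of the service types listed above (search, term details, and parents/children/siblings), the sketch below only builds request URLs. The base URL, path layout, and parameter names (`query`, `ontologyids`, `exactmatch`) are hypothetical placeholders, not the actual BioPortal interface; the REST services documentation linked above has the real endpoints.

```python
from urllib.parse import quote, urlencode

# Assumption: placeholder base URL, NOT the real BioPortal service address.
BASE_URL = "http://rest.example.org/bioportal"

# Relations named on the slide: parents, children or siblings.
RELATIONS = {"parents", "children", "siblings"}


def build_search_url(term, ontology_ids=None, exact_match=False, base_url=BASE_URL):
    """Build a URL for a term search, optionally limited to some ontologies.

    The parameter names are hypothetical and chosen only for illustration.
    """
    params = {"query": term}
    if ontology_ids:
        params["ontologyids"] = ",".join(ontology_ids)
    if exact_match:
        params["exactmatch"] = "1"
    return f"{base_url}/search?{urlencode(params)}"


def build_term_url(ontology_id, term_id, relation=None, base_url=BASE_URL):
    """Build a URL for term details, or for one relation of that term."""
    # Term IDs often contain ':' or '/', so percent-encode every reserved character.
    url = f"{base_url}/ontologies/{quote(ontology_id, safe='')}/terms/{quote(term_id, safe='')}"
    if relation is not None:
        if relation not in RELATIONS:
            raise ValueError(f"unknown relation: {relation!r}")
        url += f"/{relation}"
    return url


if __name__ == "__main__":
    print(build_search_url("melanoma", ontology_ids=["1032", "1009"]))
    print(build_term_url("1070", "GO:0008150", relation="children"))
```

Keeping URL construction separate from the HTTP call makes it easy to swap in the documented endpoint and parameter names later without touching the calling code.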
10. Enhancing the Curation
Workflow
• Data submission
– Ontology Widgets and Ontology Web services
• Ontology enrichment
– BioPortal Notes
• Annotation of textual metadata
– NCBO Annotator
11. Tools for Ontology Enrichment
• Ontology Enrichment
– Expansion of the ontology based on user need for
terms
• Constraints
– Existing trackers do not collect structured
information
– No programmatic access to tracker
– Lack of integration with ontology editing software
12. Ontology Development Lifecycle
• Collect feedback from Subject Matter Experts
• Draft prototype ontology
• Collect feedback from Subject Matter Experts
• Develop new ontology
• Publish Ontology
• Refine Ontology
• Collect feedback from community
• Feedback channel: Email
13. Ontology Development Lifecycle
• Collect feedback from Subject Matter Experts
• Draft prototype ontology
• Collect feedback from Subject Matter Experts
• Develop new ontology
• Publish Ontology
• Refine Ontology
• Collect feedback from community
• Feedback channel: BioPortal
14. BioPortal Notes
• Notes
– Provide a mechanism to collect structured
information
– Programmatic access
– Alerts of updates via both email and RSS
– Integration with ontology editing programs
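To make "structured information" concrete, here is a minimal sketch of what a structured term request could look like compared with a free-text tracker entry. The class name and every field in it are hypothetical and do not reflect the actual BioPortal Notes schema.

```python
import json
from dataclasses import asdict, dataclass, field


@dataclass
class TermProposalNote:
    """Hypothetical structured note proposing a new ontology term."""

    ontology: str          # ontology the note is attached to
    proposed_label: str    # preferred name for the requested term
    definition: str        # textual definition supplied by the submitter
    author: str            # who submitted the request
    parent_term: str = ""  # suggested parent, if the submitter knows one
    synonyms: list = field(default_factory=list)

    def to_json(self):
        # A machine-readable form is what allows programmatic access and
        # integration with ontology editing software.
        return json.dumps(asdict(self), sort_keys=True)


if __name__ == "__main__":
    note = TermProposalNote(
        ontology="Example Ontology",
        proposed_label="example term",
        definition="A term used only to illustrate the note structure.",
        author="curator@example.org",
        synonyms=["sample term"],
    )
    print(note.to_json())
```

Because each field is named, an ontology editor can filter requests by ontology or parent term instead of reading through free-text tracker comments.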
16. Enhancing the Curation
Workflow
• Data submission
– Ontology Widgets and Ontology Web services
• Ontology enrichment
– BioPortal Notes
• Annotation of textual metadata
– NCBO Annotator
17. Tools for annotation of textual
metadata
• NCBO Annotator
– Open access, ontology-based Web service that
annotates or “tags” textual metadata
– Annotation is done using ontologies from
BioPortal, which includes OBO Foundry and
Unified Medical Language System ontologies
– Variety of parameters that can be customized
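The idea of tagging free text with ontology terms can be shown with a toy dictionary lookup. This is not the NCBO Annotator's algorithm or interface; the function name, the term index, and the `EX:` identifiers are invented for illustration.

```python
import re


def annotate(text, term_index):
    """Tag occurrences of known term labels in free text.

    term_index maps a term label to a term ID. Matching is case-insensitive
    and restricted to whole words. Returns a list of dicts sorted by position.
    """
    annotations = []
    for label, term_id in term_index.items():
        pattern = r"\b" + re.escape(label) + r"\b"
        for match in re.finditer(pattern, text, flags=re.IGNORECASE):
            annotations.append(
                {
                    "from": match.start(),   # start offset in the text
                    "to": match.end(),       # end offset (exclusive)
                    "text": match.group(0),  # the matched text as written
                    "term_id": term_id,      # ID of the ontology term
                }
            )
    return sorted(annotations, key=lambda a: a["from"])


if __name__ == "__main__":
    # Hypothetical term index; real labels and IDs would come from an ontology.
    index = {"melanoma": "EX:0001", "melanocytes": "EX:0002"}
    for hit in annotate("Melanoma is a malignant tumor of melanocytes.", index):
        print(hit)
```

A production annotator would also handle synonyms, overlapping matches, and parameters such as which ontologies to use; the sketch covers only the basic label-to-term mapping.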
22. Enhancing the Curation
Workflow
• Data submission
– Ontology Widgets and Ontology Web services
• Ontology enrichment
– BioPortal Notes
• Annotation of textual metadata
– NCBO Annotator
23. Acknowledgements
• NCBO Team
– Mark Musen, Stanford University
– Partners: Barry Smith, University at Buffalo; Chris Chute, Mayo Clinic; and Peggy Storey, University of Victoria
– Developers, Driving Biological Projects, and other
Collaborators
24. Thank you!
• Using NCBO Technology in Your Project:
– http://www.bioontology.org/wiki/index.php/Using_NCBO_Technology_In_Your_Project
• Web service documentation:
– http://www.bioontology.org/wiki/index.php/NCBO_REST_services
• Questions:
– support@bioontology.org
Editor's Notes
Curation – making unstructured data structured and searchable
Data submission – collecting structured data (tagged with ontology terms) at the time of data submission
Ontology enrichment – adding new terms to the ontology at the time of submission
Annotation of textual metadata – tagging free text with ontology terms
SimTK – https://simtk.org/home/simtk
– Physics-based simulations of biological structures
GMiner – http://gminer.mcw.edu/
– Ontology-indexed annotations from GEO; widget used to get more information on a term (Jump To)
aTag Generator – http://hcls.deri.org/atag/generator/
– aTags ("associative tags") are snippets of HTML that capture the information that is most important to you in a machine-readable, interlinked format, making it easier for you and others to see the big picture.
RSNA – various tools, e.g. RadLex viewer, propose new terms
ISAcreator – http://isatab.sourceforge.net/isacreator.html
RightField – http://www.sysmo-db.org/rightfield
Jinx – http://ncmir.ucsd.edu/downloads/jinx.shtm
ECG Gadget – http://wiki.cvrgrid.org/index.php/CVRG_Tool_Demonstrations
ODIE – http://www.bioontology.org/ODIE
Word Add-in for Ontology Recognition – http://ucsdbiolit.codeplex.com/
and more…
Elsevier SciVerse
Karen Dowell, Jackson Lab
Shai Shen-Orr, Mark Davis’s lab
Sean Mooney’s group
Ida Sim, UCSF
Simon Twigger, Medical College of Wisconsin
Nathan Baker, Washington Univ.
Amit Sheth, Wright State Univ.
Neil Sarkar, University of Vermont
Larry Hunter, University of Colorado, Denver