SlideShare a Scribd company logo
Ontologies for Semantic Normalization of Immunological Data
Yannick Pouliot1, Atul J. Butte1,2
1. Division of Systems Medicine, Stanford University School of Medicine
2. Center for Pediatric Bioinformatics, Lucile Packard Children’s Hospital, Palo Alto, California

The Problem
Data from experiments probing the immune system are inherently
complex because of the diversity of data types, assay types and the
number of biological agents involved. This complexity is further
increased by the multi-center nature of data generated by HIPC.
One of the goals of HIPC is to deliver a database able to support
broad community access to these complex data sets. Critical to the
success of this database will be its ability to provide conceptual
characterizations of experiments and their results (“data and
metadata encoding”). Such encodings identify data sets according to
experimental properties so that users can quickly narrow their
searches to the most pertinent results.
To this end, conceptual encoding that rely on “industry-standard”
ontologies are preferred is the best way to achieve this. We
determined the extent to which existing ontologies can be used to
encode HIPC data, and ImmPort’s ability to support the application of
these concepts.
Since ImmPort will be the repository of HIPC data, we evaluated its
use of ontologies. Upon determining that ImmPort is not ontologycompliant, we analyzed the universe of ontologies to determine the
extent to which existing ontologies can be used to encode HIPC data,
and ImmPort’s ability to support the application of these concepts.

Ontologies provided by BioPortal1 were first selected
based on their domain of application:
Domain

Description

Example Ontology

Analysis
Anatomy

Process of data analysis
Anatomical structures at all level of
resolution except molecular
Disease states manifested by organisms at
anatomical, spatial, temporal and
functional levels
Conditions/specifications associated with
a scientific or clinical protocol
Process/properties/data types of
modeling, computational or otherwise
Aspects of biomolecules : structure,
sequence, function
Biochemical, signaling pathways used by
organisms
States manifested by organisms at
anatomical, spatial, temporal and
functional levels. “Anatomy” and
“Disease” are components of
“Phenotype” but treated distinctly

Ontology of Data Mining
Cell Ontology

Experimental
conditions
Modeling
Molecule
Pathways
Phenotype

Criterium

Parameter

Design

Must be integrative (ability to draw on similar/identical
concepts from other ontologies)
Must minimize overlap with other ontologies, consistent with
providing terms that can be inter-related across ontologies
Should be “relatable” to clinical applications
Must be applicable to humans, and perhaps one animal model
Must be an ontology, not just a controlled vocabulary (with a
few exceptions)
Evidence of ongoing development and maintenance

Developmental
state

Usage
Content

Developed with, accepted by standards or professional
organizations
Adheres to standards such as the Basic Formal Ontology
Must be released (post beta)
Reasonably widely adopted
Must exhibit a good balance between expressiveness and
usability/understandability:
Conceptual clarity (e.g., no ambiguous classifications for dualuse organs such as reproductive/urinary organs)
Limited redundancy of synonymous concepts
Usable definitions of concepts describing HIPC data or
metadata, including experiment design
Completeness (frequency of missing concepts)
Correctness (how accurately is a concept is expressed)

These ontologies were then analyzed for their ability to
recognize terms from text obtained from Build 1 datasets,
protocols, and metadata, as well as from Stanford’s DataMt
database2 (which stores many Stanford HIPC datasets). An
automated pipeline that relies on the National Center for
Biomedical Ontology’s Annotator3 was written that relies on
BioPortal’s Web services to parse the text and attempt to
map to the reference ontologies.

Methods

Disease

57 Ontologies within these domains were then
screened for preliminary suitability to HIPC data/metadata
according to four broad criteria:

Infectious Disease Ontology (IDO)

Ontology of Clinical Research
(OCRe)
Interaction Network Ontology
(INO)
ONTIE - Ontology of Immune
Epitopes
Pathway Ontology
Phenotypic Quality

Results & Discussion
Ontology
NCI Thesaurus
Medical Subject Headings
Molecule role
SNOMED Clinical Terms
PRotein Ontology (PRO)
Cell Cycle Ontology
Ontology for Biomedical Investigations
Experimental Factor Ontology
SemanticScience Integrated Ontology
Units of measurement
Phenotypic quality
EDAM
Foundational Model of Anatomy
Vaccine Ontology
ICPC-2 PLUS
MGED Ontology
Measurement Method Ontology
Gene Ontology
Ontology of Clinical Research (OCRe)
Protein-protein interaction
Mammalian phenotype

Build 1
(%)
16.7%
7.1%
14.3%
2.4%
16.7%
7.1%
2.4%

n
7
3
6
1
7
3
1

ImmPort
CV
(%)
33.3%
21.4%
0.0%
11.9%
0.0%
0.0%
14.3%
7.1%
2.4%
2.4%
7.1%
2.4%
2.4%
2.4%
2.4%

n
14
9

5

6
3

1
1
3

1

Data Mt
(%)
52.6%
22.9%
24.6%
17.7%
10.3%
12.6%
5.7%
7.4%
2.9%
2.3%
2.3%
0.6%
2.3%
1.1%
1.1%

n
92
40
43
31
18
22
10
13
5
4
4
1
4
2
2

1
1
1

0.6%
0.6%
0.6%

1
1
1

• Many mapping failures attributable to lack of definition for commercial objects
within the reference ontologies (e.g., “Anti-CD27” antibody from BD)
Solution: Contacting ontology owners to have them add commercial terms
to their ontologies
• Many mapping failures are easily correctable
Example: Adding a pre-processor able to recognized instances of the “anti-“
problem (e.g., “anti-CD20” not recognized even though “CD20” is known)
We conclude that ImmPort should be able to migrate toward ontologically-based
encodings.

References
1. Noy et al., (2009) “BioPortal: Ontologies and Integrated Data resources at the Click of a
Mouse”, Nucl. Acids Res., 37:W170-W173.
2. Siebert, J., Munsil, D. & Maecker, H. (2011) "A Novel Approach for Integrating and Exploring
Heterogeneous Translational Data", manuscript in preparation.
3. Jonquet et al., (2009) “The Open Biomedical Annotator”, Summit on Transla. Bioinfo., 56-60.

Acknowledgments
NIAID, Hewlett Packard Foundation, Butte Lab

More Related Content

What's hot

Text and data mining
Text and data miningText and data mining
Text and data mining
Lars Juhl Jensen
 
Machine learning in biology
Machine learning in biologyMachine learning in biology
Machine learning in biology
Pranavathiyani G
 
X-Meeting Poster 2015 - Vallys A Coverage tool
X-Meeting Poster 2015 - Vallys A Coverage toolX-Meeting Poster 2015 - Vallys A Coverage tool
X-Meeting Poster 2015 - Vallys A Coverage tool
Genomika Diagnósticos
 
Ai and biology
Ai and biologyAi and biology
Ai and biology
DhirendraKumarChauha1
 
Introduction to bioinformatics
Introduction to bioinformaticsIntroduction to bioinformatics
Introduction to bioinformatics
philmaweb
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
AkanshaChauhan15
 
Uses of Artificial Intelligence in Bioinformatics
Uses of Artificial Intelligence in BioinformaticsUses of Artificial Intelligence in Bioinformatics
Uses of Artificial Intelligence in Bioinformatics
Pragya Pai
 
GENOMICS
GENOMICSGENOMICS
GENOMICS
resgmasheikh
 
Introduction to Bioinformatics
Introduction to BioinformaticsIntroduction to Bioinformatics
Introduction to Bioinformatics
Denis C. Bauer
 
Ontology-Driven Clinical Intelligence: Removing Data Barriers for Cross-Disci...
Ontology-Driven Clinical Intelligence: Removing Data Barriers for Cross-Disci...Ontology-Driven Clinical Intelligence: Removing Data Barriers for Cross-Disci...
Ontology-Driven Clinical Intelligence: Removing Data Barriers for Cross-Disci...
Remedy Informatics
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
Tanveer Abbas
 
Chapter 1. Introduction
Chapter 1. IntroductionChapter 1. Introduction
Chapter 1. Introduction
butest
 
I NTRODUCTION.doc
I NTRODUCTION.docI NTRODUCTION.doc
I NTRODUCTION.doc
butest
 
Application of blockchain technology in healthcare and biomedicine
Application of blockchain technology in healthcare and biomedicineApplication of blockchain technology in healthcare and biomedicine
Application of blockchain technology in healthcare and biomedicine
Pranavathiyani G
 
Колкер Е. An introduction to MOPED: Multi-Omics Profiling Expression Database
Колкер Е. An introduction to MOPED: Multi-Omics Profiling Expression DatabaseКолкер Е. An introduction to MOPED: Multi-Omics Profiling Expression Database
Колкер Е. An introduction to MOPED: Multi-Omics Profiling Expression Database
bigdatabm
 
03 Guerra, Rudy
03 Guerra, Rudy03 Guerra, Rudy
03 Guerra, Rudy
Hadley Wickham
 
Introduction to Bioinformatics
Introduction to BioinformaticsIntroduction to Bioinformatics
Introduction to Bioinformatics
Alexander Niema Moshiri
 
Machine Learning in Biology and Why It Doesn't Make Sense - Theo Knijnenburg,...
Machine Learning in Biology and Why It Doesn't Make Sense - Theo Knijnenburg,...Machine Learning in Biology and Why It Doesn't Make Sense - Theo Knijnenburg,...
Machine Learning in Biology and Why It Doesn't Make Sense - Theo Knijnenburg,...
Seattle DAML meetup
 
IEEE AP-MTT OSU Seminar
IEEE AP-MTT OSU SeminarIEEE AP-MTT OSU Seminar
IEEE AP-MTT OSU Seminar
Ogan Gurel MD
 
NetBioSIG2014-Talk by David Amar
NetBioSIG2014-Talk by David AmarNetBioSIG2014-Talk by David Amar
NetBioSIG2014-Talk by David Amar
Alexander Pico
 

What's hot (20)

Text and data mining
Text and data miningText and data mining
Text and data mining
 
Machine learning in biology
Machine learning in biologyMachine learning in biology
Machine learning in biology
 
X-Meeting Poster 2015 - Vallys A Coverage tool
X-Meeting Poster 2015 - Vallys A Coverage toolX-Meeting Poster 2015 - Vallys A Coverage tool
X-Meeting Poster 2015 - Vallys A Coverage tool
 
Ai and biology
Ai and biologyAi and biology
Ai and biology
 
Introduction to bioinformatics
Introduction to bioinformaticsIntroduction to bioinformatics
Introduction to bioinformatics
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Uses of Artificial Intelligence in Bioinformatics
Uses of Artificial Intelligence in BioinformaticsUses of Artificial Intelligence in Bioinformatics
Uses of Artificial Intelligence in Bioinformatics
 
GENOMICS
GENOMICSGENOMICS
GENOMICS
 
Introduction to Bioinformatics
Introduction to BioinformaticsIntroduction to Bioinformatics
Introduction to Bioinformatics
 
Ontology-Driven Clinical Intelligence: Removing Data Barriers for Cross-Disci...
Ontology-Driven Clinical Intelligence: Removing Data Barriers for Cross-Disci...Ontology-Driven Clinical Intelligence: Removing Data Barriers for Cross-Disci...
Ontology-Driven Clinical Intelligence: Removing Data Barriers for Cross-Disci...
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Chapter 1. Introduction
Chapter 1. IntroductionChapter 1. Introduction
Chapter 1. Introduction
 
I NTRODUCTION.doc
I NTRODUCTION.docI NTRODUCTION.doc
I NTRODUCTION.doc
 
Application of blockchain technology in healthcare and biomedicine
Application of blockchain technology in healthcare and biomedicineApplication of blockchain technology in healthcare and biomedicine
Application of blockchain technology in healthcare and biomedicine
 
Колкер Е. An introduction to MOPED: Multi-Omics Profiling Expression Database
Колкер Е. An introduction to MOPED: Multi-Omics Profiling Expression DatabaseКолкер Е. An introduction to MOPED: Multi-Omics Profiling Expression Database
Колкер Е. An introduction to MOPED: Multi-Omics Profiling Expression Database
 
03 Guerra, Rudy
03 Guerra, Rudy03 Guerra, Rudy
03 Guerra, Rudy
 
Introduction to Bioinformatics
Introduction to BioinformaticsIntroduction to Bioinformatics
Introduction to Bioinformatics
 
Machine Learning in Biology and Why It Doesn't Make Sense - Theo Knijnenburg,...
Machine Learning in Biology and Why It Doesn't Make Sense - Theo Knijnenburg,...Machine Learning in Biology and Why It Doesn't Make Sense - Theo Knijnenburg,...
Machine Learning in Biology and Why It Doesn't Make Sense - Theo Knijnenburg,...
 
IEEE AP-MTT OSU Seminar
IEEE AP-MTT OSU SeminarIEEE AP-MTT OSU Seminar
IEEE AP-MTT OSU Seminar
 
NetBioSIG2014-Talk by David Amar
NetBioSIG2014-Talk by David AmarNetBioSIG2014-Talk by David Amar
NetBioSIG2014-Talk by David Amar
 

Similar to Ontologies for Semantic Normalization of Immunological Data

2011-10-11 Open PHACTS at BioIT World Europe
2011-10-11 Open PHACTS at BioIT World Europe2011-10-11 Open PHACTS at BioIT World Europe
2011-10-11 Open PHACTS at BioIT World Europe
open_phacts
 
Quantifying the content of biomedical semantic resources as a core for drug d...
Quantifying the content of biomedical semantic resources as a core for drug d...Quantifying the content of biomedical semantic resources as a core for drug d...
Quantifying the content of biomedical semantic resources as a core for drug d...
Syed Muhammad Ali Hasnain
 
IMMUNOINFORMATICS , MICROARRAY and Machine Learning - All about Immunology, I...
IMMUNOINFORMATICS , MICROARRAY and Machine Learning - All about Immunology, I...IMMUNOINFORMATICS , MICROARRAY and Machine Learning - All about Immunology, I...
IMMUNOINFORMATICS , MICROARRAY and Machine Learning - All about Immunology, I...
Mekhla Diwan
 
Online Resources to Support Open Drug Discovery Systems
Online Resources to Support Open Drug Discovery SystemsOnline Resources to Support Open Drug Discovery Systems
Bioinformatics: Building the cornerstones of Sequence Homology and its use fo...
Bioinformatics: Building the cornerstones of Sequence Homology and its use fo...Bioinformatics: Building the cornerstones of Sequence Homology and its use fo...
Bioinformatics: Building the cornerstones of Sequence Homology and its use fo...
OECD Environment
 
Introducción a la bioinformatica
Introducción a la bioinformaticaIntroducción a la bioinformatica
Introducción a la bioinformatica
Martín Arrieta
 
Ben Goertzel AIs, Superflies and the Path to Immortality - singsum au 2011
Ben Goertzel AIs, Superflies and the Path to Immortality - singsum au 2011Ben Goertzel AIs, Superflies and the Path to Immortality - singsum au 2011
Ben Goertzel AIs, Superflies and the Path to Immortality - singsum au 2011
Adam Ford
 
Role of bioinformatics of drug designing
Role of bioinformatics of drug designingRole of bioinformatics of drug designing
Role of bioinformatics of drug designing
Dr NEETHU ASOKAN
 
Haendel clingenetics.3.14.14
Haendel clingenetics.3.14.14Haendel clingenetics.3.14.14
Haendel clingenetics.3.14.14
mhaendel
 
Bioinformatics Introduction and Use of BLAST Tool
Bioinformatics Introduction and Use of BLAST ToolBioinformatics Introduction and Use of BLAST Tool
Bioinformatics Introduction and Use of BLAST Tool
JesminBinti
 
INBIOMEDvision Workshop at MIE 2011. Victoria López
INBIOMEDvision Workshop at MIE 2011. Victoria LópezINBIOMEDvision Workshop at MIE 2011. Victoria López
INBIOMEDvision Workshop at MIE 2011. Victoria López
INBIOMEDvision
 
Bioinformatics .pptx
Bioinformatics .pptxBioinformatics .pptx
Bioinformatics .pptx
UpendraSharmaUS1
 
My ontology is better than yours! Building and evaluating ontologies for inte...
My ontology is better than yours! Building and evaluating ontologies for inte...My ontology is better than yours! Building and evaluating ontologies for inte...
My ontology is better than yours! Building and evaluating ontologies for inte...
Robert Hoehndorf
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
Amna Jalil
 
Gellibolian 2010 Audio Visual2
Gellibolian 2010 Audio Visual2Gellibolian 2010 Audio Visual2
Gellibolian 2010 Audio Visual2
Robert Gellibolian, Ph.D
 
BRN Seminar 13/06/16 12 Revolutionizing healthcare management
BRN Seminar 13/06/16 12 Revolutionizing healthcare managementBRN Seminar 13/06/16 12 Revolutionizing healthcare management
BRN Seminar 13/06/16 12 Revolutionizing healthcare management
brnmomentum
 
Revolutionizing Heathcare and Wellness Management through Systems P4 Medicine
Revolutionizing Heathcare and Wellness Management through Systems P4 MedicineRevolutionizing Heathcare and Wellness Management through Systems P4 Medicine
Revolutionizing Heathcare and Wellness Management through Systems P4 Medicine
brnbarcelona
 
Metrological experiments in biomarker development (mass spectrometry statisti...
Metrological experiments in biomarker development (mass spectrometry statisti...Metrological experiments in biomarker development (mass spectrometry statisti...
Metrological experiments in biomarker development (mass spectrometry statisti...
National Institute of Biologics
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
Dr. Siddhi Upadhyay
 
Assignment Of Protein Function And Discovery Of Novel Nucleolar Proteins Base...
Assignment Of Protein Function And Discovery Of Novel Nucleolar Proteins Base...Assignment Of Protein Function And Discovery Of Novel Nucleolar Proteins Base...
Assignment Of Protein Function And Discovery Of Novel Nucleolar Proteins Base...
Jim Jimenez
 

Similar to Ontologies for Semantic Normalization of Immunological Data (20)

2011-10-11 Open PHACTS at BioIT World Europe
2011-10-11 Open PHACTS at BioIT World Europe2011-10-11 Open PHACTS at BioIT World Europe
2011-10-11 Open PHACTS at BioIT World Europe
 
Quantifying the content of biomedical semantic resources as a core for drug d...
Quantifying the content of biomedical semantic resources as a core for drug d...Quantifying the content of biomedical semantic resources as a core for drug d...
Quantifying the content of biomedical semantic resources as a core for drug d...
 
IMMUNOINFORMATICS , MICROARRAY and Machine Learning - All about Immunology, I...
IMMUNOINFORMATICS , MICROARRAY and Machine Learning - All about Immunology, I...IMMUNOINFORMATICS , MICROARRAY and Machine Learning - All about Immunology, I...
IMMUNOINFORMATICS , MICROARRAY and Machine Learning - All about Immunology, I...
 
Online Resources to Support Open Drug Discovery Systems
Online Resources to Support Open Drug Discovery SystemsOnline Resources to Support Open Drug Discovery Systems
Online Resources to Support Open Drug Discovery Systems
 
Bioinformatics: Building the cornerstones of Sequence Homology and its use fo...
Bioinformatics: Building the cornerstones of Sequence Homology and its use fo...Bioinformatics: Building the cornerstones of Sequence Homology and its use fo...
Bioinformatics: Building the cornerstones of Sequence Homology and its use fo...
 
Introducción a la bioinformatica
Introducción a la bioinformaticaIntroducción a la bioinformatica
Introducción a la bioinformatica
 
Ben Goertzel AIs, Superflies and the Path to Immortality - singsum au 2011
Ben Goertzel AIs, Superflies and the Path to Immortality - singsum au 2011Ben Goertzel AIs, Superflies and the Path to Immortality - singsum au 2011
Ben Goertzel AIs, Superflies and the Path to Immortality - singsum au 2011
 
Role of bioinformatics of drug designing
Role of bioinformatics of drug designingRole of bioinformatics of drug designing
Role of bioinformatics of drug designing
 
Haendel clingenetics.3.14.14
Haendel clingenetics.3.14.14Haendel clingenetics.3.14.14
Haendel clingenetics.3.14.14
 
Bioinformatics Introduction and Use of BLAST Tool
Bioinformatics Introduction and Use of BLAST ToolBioinformatics Introduction and Use of BLAST Tool
Bioinformatics Introduction and Use of BLAST Tool
 
INBIOMEDvision Workshop at MIE 2011. Victoria López
INBIOMEDvision Workshop at MIE 2011. Victoria LópezINBIOMEDvision Workshop at MIE 2011. Victoria López
INBIOMEDvision Workshop at MIE 2011. Victoria López
 
Bioinformatics .pptx
Bioinformatics .pptxBioinformatics .pptx
Bioinformatics .pptx
 
My ontology is better than yours! Building and evaluating ontologies for inte...
My ontology is better than yours! Building and evaluating ontologies for inte...My ontology is better than yours! Building and evaluating ontologies for inte...
My ontology is better than yours! Building and evaluating ontologies for inte...
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Gellibolian 2010 Audio Visual2
Gellibolian 2010 Audio Visual2Gellibolian 2010 Audio Visual2
Gellibolian 2010 Audio Visual2
 
BRN Seminar 13/06/16 12 Revolutionizing healthcare management
BRN Seminar 13/06/16 12 Revolutionizing healthcare managementBRN Seminar 13/06/16 12 Revolutionizing healthcare management
BRN Seminar 13/06/16 12 Revolutionizing healthcare management
 
Revolutionizing Heathcare and Wellness Management through Systems P4 Medicine
Revolutionizing Heathcare and Wellness Management through Systems P4 MedicineRevolutionizing Heathcare and Wellness Management through Systems P4 Medicine
Revolutionizing Heathcare and Wellness Management through Systems P4 Medicine
 
Metrological experiments in biomarker development (mass spectrometry statisti...
Metrological experiments in biomarker development (mass spectrometry statisti...Metrological experiments in biomarker development (mass spectrometry statisti...
Metrological experiments in biomarker development (mass spectrometry statisti...
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Assignment Of Protein Function And Discovery Of Novel Nucleolar Proteins Base...
Assignment Of Protein Function And Discovery Of Novel Nucleolar Proteins Base...Assignment Of Protein Function And Discovery Of Novel Nucleolar Proteins Base...
Assignment Of Protein Function And Discovery Of Novel Nucleolar Proteins Base...
 

More from Yannick Pouliot

Survey of Spark for Data Pre-Processing and Analytics
Survey of Spark for Data Pre-Processing and AnalyticsSurvey of Spark for Data Pre-Processing and Analytics
Survey of Spark for Data Pre-Processing and Analytics
Yannick Pouliot
 
Systems Immunology -- 2014
Systems Immunology -- 2014Systems Immunology -- 2014
Systems Immunology -- 2014
Yannick Pouliot
 
Managing experiment data using Excel and Friends
Managing experiment data using Excel and FriendsManaging experiment data using Excel and Friends
Managing experiment data using Excel and Friends
Yannick Pouliot
 
Essential UNIX skills for biologists
Essential UNIX skills for biologistsEssential UNIX skills for biologists
Essential UNIX skills for biologists
Yannick Pouliot
 
A guided SQL tour of bioinformatics databases
A guided SQL tour of bioinformatics databasesA guided SQL tour of bioinformatics databases
A guided SQL tour of bioinformatics databases
Yannick Pouliot
 
Ontologically-Aware Automated Gating
Ontologically-Aware Automated GatingOntologically-Aware Automated Gating
Ontologically-Aware Automated Gating
Yannick Pouliot
 
Why The Cloud Is A Computational Biologist's Best Friend
Why The Cloud Is A Computational Biologist's Best FriendWhy The Cloud Is A Computational Biologist's Best Friend
Why The Cloud Is A Computational Biologist's Best Friend
Yannick Pouliot
 
There’s No Avoiding It: Programming Skills You’ll Need
There’s No Avoiding It:  Programming Skills You’ll NeedThere’s No Avoiding It:  Programming Skills You’ll Need
There’s No Avoiding It: Programming Skills You’ll Need
Yannick Pouliot
 
Predicting Adverse Drug Reactions Using PubChem Screening Data
Predicting Adverse Drug Reactions Using PubChem Screening DataPredicting Adverse Drug Reactions Using PubChem Screening Data
Predicting Adverse Drug Reactions Using PubChem Screening Data
Yannick Pouliot
 
Repositioning Old Drugs For New Indications Using Computational Approaches
Repositioning Old Drugs For New Indications Using Computational ApproachesRepositioning Old Drugs For New Indications Using Computational Approaches
Repositioning Old Drugs For New Indications Using Computational Approaches
Yannick Pouliot
 
Databases, Web Services and Tools For Systems Immunology
Databases, Web Services and Tools For Systems ImmunologyDatabases, Web Services and Tools For Systems Immunology
Databases, Web Services and Tools For Systems Immunology
Yannick Pouliot
 

More from Yannick Pouliot (11)

Survey of Spark for Data Pre-Processing and Analytics
Survey of Spark for Data Pre-Processing and AnalyticsSurvey of Spark for Data Pre-Processing and Analytics
Survey of Spark for Data Pre-Processing and Analytics
 
Systems Immunology -- 2014
Systems Immunology -- 2014Systems Immunology -- 2014
Systems Immunology -- 2014
 
Managing experiment data using Excel and Friends
Managing experiment data using Excel and FriendsManaging experiment data using Excel and Friends
Managing experiment data using Excel and Friends
 
Essential UNIX skills for biologists
Essential UNIX skills for biologistsEssential UNIX skills for biologists
Essential UNIX skills for biologists
 
A guided SQL tour of bioinformatics databases
A guided SQL tour of bioinformatics databasesA guided SQL tour of bioinformatics databases
A guided SQL tour of bioinformatics databases
 
Ontologically-Aware Automated Gating
Ontologically-Aware Automated GatingOntologically-Aware Automated Gating
Ontologically-Aware Automated Gating
 
Why The Cloud Is A Computational Biologist's Best Friend
Why The Cloud Is A Computational Biologist's Best FriendWhy The Cloud Is A Computational Biologist's Best Friend
Why The Cloud Is A Computational Biologist's Best Friend
 
There’s No Avoiding It: Programming Skills You’ll Need
There’s No Avoiding It:  Programming Skills You’ll NeedThere’s No Avoiding It:  Programming Skills You’ll Need
There’s No Avoiding It: Programming Skills You’ll Need
 
Predicting Adverse Drug Reactions Using PubChem Screening Data
Predicting Adverse Drug Reactions Using PubChem Screening DataPredicting Adverse Drug Reactions Using PubChem Screening Data
Predicting Adverse Drug Reactions Using PubChem Screening Data
 
Repositioning Old Drugs For New Indications Using Computational Approaches
Repositioning Old Drugs For New Indications Using Computational ApproachesRepositioning Old Drugs For New Indications Using Computational Approaches
Repositioning Old Drugs For New Indications Using Computational Approaches
 
Databases, Web Services and Tools For Systems Immunology
Databases, Web Services and Tools For Systems ImmunologyDatabases, Web Services and Tools For Systems Immunology
Databases, Web Services and Tools For Systems Immunology
 

Recently uploaded

The Best Ayurvedic Antacid Tablets in India
The Best Ayurvedic Antacid Tablets in IndiaThe Best Ayurvedic Antacid Tablets in India
The Best Ayurvedic Antacid Tablets in India
Swastik Ayurveda
 
CHEMOTHERAPY_RDP_CHAPTER 6_Anti Malarial Drugs.pdf
CHEMOTHERAPY_RDP_CHAPTER 6_Anti Malarial Drugs.pdfCHEMOTHERAPY_RDP_CHAPTER 6_Anti Malarial Drugs.pdf
CHEMOTHERAPY_RDP_CHAPTER 6_Anti Malarial Drugs.pdf
rishi2789
 
CHEMOTHERAPY_RDP_CHAPTER 1_ANTI TB DRUGS.pdf
CHEMOTHERAPY_RDP_CHAPTER 1_ANTI TB DRUGS.pdfCHEMOTHERAPY_RDP_CHAPTER 1_ANTI TB DRUGS.pdf
CHEMOTHERAPY_RDP_CHAPTER 1_ANTI TB DRUGS.pdf
rishi2789
 
Abortion PG Seminar Power point presentation
Abortion PG Seminar Power point presentationAbortion PG Seminar Power point presentation
Abortion PG Seminar Power point presentation
AksshayaRajanbabu
 
Promoting Wellbeing - Applied Social Psychology - Psychology SuperNotes
Promoting Wellbeing - Applied Social Psychology - Psychology SuperNotesPromoting Wellbeing - Applied Social Psychology - Psychology SuperNotes
Promoting Wellbeing - Applied Social Psychology - Psychology SuperNotes
PsychoTech Services
 
Ketone bodies and metabolism-biochemistry
Ketone bodies and metabolism-biochemistryKetone bodies and metabolism-biochemistry
Ketone bodies and metabolism-biochemistry
Dhayanithi C
 
Top-Vitamin-Supplement-Brands-in-India List
Top-Vitamin-Supplement-Brands-in-India ListTop-Vitamin-Supplement-Brands-in-India List
Top-Vitamin-Supplement-Brands-in-India List
SwisschemDerma
 
Integrating Ayurveda into Parkinson’s Management: A Holistic Approach
Integrating Ayurveda into Parkinson’s Management: A Holistic ApproachIntegrating Ayurveda into Parkinson’s Management: A Holistic Approach
Integrating Ayurveda into Parkinson’s Management: A Holistic Approach
Ayurveda ForAll
 
share - Lions, tigers, AI and health misinformation, oh my!.pptx
share - Lions, tigers, AI and health misinformation, oh my!.pptxshare - Lions, tigers, AI and health misinformation, oh my!.pptx
share - Lions, tigers, AI and health misinformation, oh my!.pptx
Tina Purnat
 
Cardiac Assessment for B.sc Nursing Student.pdf
Cardiac Assessment for B.sc Nursing Student.pdfCardiac Assessment for B.sc Nursing Student.pdf
Cardiac Assessment for B.sc Nursing Student.pdf
shivalingatalekar1
 
TEST BANK For Community Health Nursing A Canadian Perspective, 5th Edition by...
TEST BANK For Community Health Nursing A Canadian Perspective, 5th Edition by...TEST BANK For Community Health Nursing A Canadian Perspective, 5th Edition by...
TEST BANK For Community Health Nursing A Canadian Perspective, 5th Edition by...
Donc Test
 
OCT Training Course for clinical practice Part 1
OCT Training Course for clinical practice Part 1OCT Training Course for clinical practice Part 1
OCT Training Course for clinical practice Part 1
KafrELShiekh University
 
Netter's Atlas of Human Anatomy 7.ed.pdf
Netter's Atlas of Human Anatomy 7.ed.pdfNetter's Atlas of Human Anatomy 7.ed.pdf
Netter's Atlas of Human Anatomy 7.ed.pdf
BrissaOrtiz3
 
Tests for analysis of different pharmaceutical.pptx
Tests for analysis of different pharmaceutical.pptxTests for analysis of different pharmaceutical.pptx
Tests for analysis of different pharmaceutical.pptx
taiba qazi
 
Clinic ^%[+27633867063*Abortion Pills For Sale In Tembisa Central
Clinic ^%[+27633867063*Abortion Pills For Sale In Tembisa CentralClinic ^%[+27633867063*Abortion Pills For Sale In Tembisa Central
Clinic ^%[+27633867063*Abortion Pills For Sale In Tembisa Central
19various
 
Chapter 11 Nutrition and Chronic Diseases.pptx
Chapter 11 Nutrition and Chronic Diseases.pptxChapter 11 Nutrition and Chronic Diseases.pptx
Chapter 11 Nutrition and Chronic Diseases.pptx
Earlene McNair
 
CHEMOTHERAPY_RDP_CHAPTER 2 _LEPROSY.pdf1
CHEMOTHERAPY_RDP_CHAPTER 2 _LEPROSY.pdf1CHEMOTHERAPY_RDP_CHAPTER 2 _LEPROSY.pdf1
CHEMOTHERAPY_RDP_CHAPTER 2 _LEPROSY.pdf1
rishi2789
 
The Electrocardiogram - Physiologic Principles
The Electrocardiogram - Physiologic PrinciplesThe Electrocardiogram - Physiologic Principles
The Electrocardiogram - Physiologic Principles
MedicoseAcademics
 
Best Ayurvedic medicine for Gas and Indigestion
Best Ayurvedic medicine for Gas and IndigestionBest Ayurvedic medicine for Gas and Indigestion
Best Ayurvedic medicine for Gas and Indigestion
Swastik Ayurveda
 
TEST BANK For Basic and Clinical Pharmacology, 14th Edition by Bertram G. Kat...
TEST BANK For Basic and Clinical Pharmacology, 14th Edition by Bertram G. Kat...TEST BANK For Basic and Clinical Pharmacology, 14th Edition by Bertram G. Kat...
TEST BANK For Basic and Clinical Pharmacology, 14th Edition by Bertram G. Kat...
rightmanforbloodline
 

Recently uploaded (20)

The Best Ayurvedic Antacid Tablets in India
The Best Ayurvedic Antacid Tablets in IndiaThe Best Ayurvedic Antacid Tablets in India
The Best Ayurvedic Antacid Tablets in India
 
CHEMOTHERAPY_RDP_CHAPTER 6_Anti Malarial Drugs.pdf
CHEMOTHERAPY_RDP_CHAPTER 6_Anti Malarial Drugs.pdfCHEMOTHERAPY_RDP_CHAPTER 6_Anti Malarial Drugs.pdf
CHEMOTHERAPY_RDP_CHAPTER 6_Anti Malarial Drugs.pdf
 
CHEMOTHERAPY_RDP_CHAPTER 1_ANTI TB DRUGS.pdf
CHEMOTHERAPY_RDP_CHAPTER 1_ANTI TB DRUGS.pdfCHEMOTHERAPY_RDP_CHAPTER 1_ANTI TB DRUGS.pdf
CHEMOTHERAPY_RDP_CHAPTER 1_ANTI TB DRUGS.pdf
 
Abortion PG Seminar Power point presentation
Abortion PG Seminar Power point presentationAbortion PG Seminar Power point presentation
Abortion PG Seminar Power point presentation
 
Promoting Wellbeing - Applied Social Psychology - Psychology SuperNotes
Promoting Wellbeing - Applied Social Psychology - Psychology SuperNotesPromoting Wellbeing - Applied Social Psychology - Psychology SuperNotes
Promoting Wellbeing - Applied Social Psychology - Psychology SuperNotes
 
Ketone bodies and metabolism-biochemistry
Ketone bodies and metabolism-biochemistryKetone bodies and metabolism-biochemistry
Ketone bodies and metabolism-biochemistry
 
Top-Vitamin-Supplement-Brands-in-India List
Top-Vitamin-Supplement-Brands-in-India ListTop-Vitamin-Supplement-Brands-in-India List
Top-Vitamin-Supplement-Brands-in-India List
 
Integrating Ayurveda into Parkinson’s Management: A Holistic Approach
Integrating Ayurveda into Parkinson’s Management: A Holistic ApproachIntegrating Ayurveda into Parkinson’s Management: A Holistic Approach
Integrating Ayurveda into Parkinson’s Management: A Holistic Approach
 
share - Lions, tigers, AI and health misinformation, oh my!.pptx
share - Lions, tigers, AI and health misinformation, oh my!.pptxshare - Lions, tigers, AI and health misinformation, oh my!.pptx
share - Lions, tigers, AI and health misinformation, oh my!.pptx
 
Cardiac Assessment for B.sc Nursing Student.pdf
Cardiac Assessment for B.sc Nursing Student.pdfCardiac Assessment for B.sc Nursing Student.pdf
Cardiac Assessment for B.sc Nursing Student.pdf
 
TEST BANK For Community Health Nursing A Canadian Perspective, 5th Edition by...
TEST BANK For Community Health Nursing A Canadian Perspective, 5th Edition by...TEST BANK For Community Health Nursing A Canadian Perspective, 5th Edition by...
TEST BANK For Community Health Nursing A Canadian Perspective, 5th Edition by...
 
OCT Training Course for clinical practice Part 1
OCT Training Course for clinical practice Part 1OCT Training Course for clinical practice Part 1
OCT Training Course for clinical practice Part 1
 
Netter's Atlas of Human Anatomy 7.ed.pdf
Netter's Atlas of Human Anatomy 7.ed.pdfNetter's Atlas of Human Anatomy 7.ed.pdf
Netter's Atlas of Human Anatomy 7.ed.pdf
 
Tests for analysis of different pharmaceutical.pptx
Tests for analysis of different pharmaceutical.pptxTests for analysis of different pharmaceutical.pptx
Tests for analysis of different pharmaceutical.pptx
 
Clinic ^%[+27633867063*Abortion Pills For Sale In Tembisa Central
Clinic ^%[+27633867063*Abortion Pills For Sale In Tembisa CentralClinic ^%[+27633867063*Abortion Pills For Sale In Tembisa Central
Clinic ^%[+27633867063*Abortion Pills For Sale In Tembisa Central
 
Chapter 11 Nutrition and Chronic Diseases.pptx
Chapter 11 Nutrition and Chronic Diseases.pptxChapter 11 Nutrition and Chronic Diseases.pptx
Chapter 11 Nutrition and Chronic Diseases.pptx
 
CHEMOTHERAPY_RDP_CHAPTER 2 _LEPROSY.pdf1
CHEMOTHERAPY_RDP_CHAPTER 2 _LEPROSY.pdf1CHEMOTHERAPY_RDP_CHAPTER 2 _LEPROSY.pdf1
CHEMOTHERAPY_RDP_CHAPTER 2 _LEPROSY.pdf1
 
The Electrocardiogram - Physiologic Principles
The Electrocardiogram - Physiologic PrinciplesThe Electrocardiogram - Physiologic Principles
The Electrocardiogram - Physiologic Principles
 
Best Ayurvedic medicine for Gas and Indigestion
Best Ayurvedic medicine for Gas and IndigestionBest Ayurvedic medicine for Gas and Indigestion
Best Ayurvedic medicine for Gas and Indigestion
 
TEST BANK For Basic and Clinical Pharmacology, 14th Edition by Bertram G. Kat...
TEST BANK For Basic and Clinical Pharmacology, 14th Edition by Bertram G. Kat...TEST BANK For Basic and Clinical Pharmacology, 14th Edition by Bertram G. Kat...
TEST BANK For Basic and Clinical Pharmacology, 14th Edition by Bertram G. Kat...
 

Ontologies for Semantic Normalization of Immunological Data

  • 1. Ontologies for Semantic Normalization of Immunological Data Yannick Pouliot1, Atul J. Butte1,2 1. Division of Systems Medicine, Stanford University School of Medicine 2. Center for Pediatric Bioinformatics, Lucile Packard Children’s Hospital, Palo Alto, California The Problem Data from experiments probing the immune system are inherently complex because of the diversity of data types, assay types and the number of biological agents involved. This complexity is further increased by the multi-center nature of data generated by HIPC. One of the goals of HIPC is to deliver a database able to support broad community access to these complex data sets. Critical to the success of this database will be its ability to provide conceptual characterizations of experiments and their results (“data and metadata encoding”). Such encodings identify data sets according to experimental properties so that users can quickly narrow their searches to the most pertinent results. To this end, conceptual encoding that rely on “industry-standard” ontologies are preferred is the best way to achieve this. We determined the extent to which existing ontologies can be used to encode HIPC data, and ImmPort’s ability to support the application of these concepts. Since ImmPort will be the repository of HIPC data, we evaluated its use of ontologies. Upon determining that ImmPort is not ontologycompliant, we analyzed the universe of ontologies to determine the extent to which existing ontologies can be used to encode HIPC data, and ImmPort’s ability to support the application of these concepts. Ontologies provided by BioPortal1 were first selected based on their domain of application: Domain Description Example Ontology Analysis Anatomy Process of data analysis Anatomical structures at all level of resolution except molecular Disease states manifested by organisms at anatomical, spatial, temporal and functional levels Conditions/specifications associated with a scientific or clinical protocol Process/properties/data types of modeling, computational or otherwise Aspects of biomolecules : structure, sequence, function Biochemical, signaling pathways used by organisms States manifested by organisms at anatomical, spatial, temporal and functional levels. “Anatomy” and “Disease” are components of “Phenotype” but treated distinctly Ontology of Data Mining Cell Ontology Experimental conditions Modeling Molecule Pathways Phenotype Criterium Parameter Design Must be integrative (ability to draw on similar/identical concepts from other ontologies) Must minimize overlap with other ontologies, consistent with providing terms that can be inter-related across ontologies Should be “relatable” to clinical applications Must be applicable to humans, and perhaps one animal model Must be an ontology, not just a controlled vocabulary (with a few exceptions) Evidence of ongoing development and maintenance Developmental state Usage Content Developed with, accepted by standards or professional organizations Adheres to standards such as the Basic Formal Ontology Must be released (post beta) Reasonably widely adopted Must exhibit a good balance between expressiveness and usability/understandability: Conceptual clarity (e.g., no ambiguous classifications for dualuse organs such as reproductive/urinary organs) Limited redundancy of synonymous concepts Usable definitions of concepts describing HIPC data or metadata, including experiment design Completeness (frequency of missing concepts) Correctness (how accurately is a concept is expressed) These ontologies were then analyzed for their ability to recognize terms from text obtained from Build 1 datasets, protocols, and metadata, as well as from Stanford’s DataMt database2 (which stores many Stanford HIPC datasets). An automated pipeline that relies on the National Center for Biomedical Ontology’s Annotator3 was written that relies on BioPortal’s Web services to parse the text and attempt to map to the reference ontologies. Methods Disease 57 Ontologies within these domains were then screened for preliminary suitability to HIPC data/metadata according to four broad criteria: Infectious Disease Ontology (IDO) Ontology of Clinical Research (OCRe) Interaction Network Ontology (INO) ONTIE - Ontology of Immune Epitopes Pathway Ontology Phenotypic Quality Results & Discussion Ontology NCI Thesaurus Medical Subject Headings Molecule role SNOMED Clinical Terms PRotein Ontology (PRO) Cell Cycle Ontology Ontology for Biomedical Investigations Experimental Factor Ontology SemanticScience Integrated Ontology Units of measurement Phenotypic quality EDAM Foundational Model of Anatomy Vaccine Ontology ICPC-2 PLUS MGED Ontology Measurement Method Ontology Gene Ontology Ontology of Clinical Research (OCRe) Protein-protein interaction Mammalian phenotype Build 1 (%) 16.7% 7.1% 14.3% 2.4% 16.7% 7.1% 2.4% n 7 3 6 1 7 3 1 ImmPort CV (%) 33.3% 21.4% 0.0% 11.9% 0.0% 0.0% 14.3% 7.1% 2.4% 2.4% 7.1% 2.4% 2.4% 2.4% 2.4% n 14 9 5 6 3 1 1 3 1 Data Mt (%) 52.6% 22.9% 24.6% 17.7% 10.3% 12.6% 5.7% 7.4% 2.9% 2.3% 2.3% 0.6% 2.3% 1.1% 1.1% n 92 40 43 31 18 22 10 13 5 4 4 1 4 2 2 1 1 1 0.6% 0.6% 0.6% 1 1 1 • Many mapping failures attributable to lack of definition for commercial objects within the reference ontologies (e.g., “Anti-CD27” antibody from BD) Solution: Contacting ontology owners to have them add commercial terms to their ontologies • Many mapping failures are easily correctable Example: Adding a pre-processor able to recognized instances of the “anti-“ problem (e.g., “anti-CD20” not recognized even though “CD20” is known) We conclude that ImmPort should be able to migrate toward ontologically-based encodings. References 1. Noy et al., (2009) “BioPortal: Ontologies and Integrated Data resources at the Click of a Mouse”, Nucl. Acids Res., 37:W170-W173. 2. Siebert, J., Munsil, D. & Maecker, H. (2011) "A Novel Approach for Integrating and Exploring Heterogeneous Translational Data", manuscript in preparation. 3. Jonquet et al., (2009) “The Open Biomedical Annotator”, Summit on Transla. Bioinfo., 56-60. Acknowledgments NIAID, Hewlett Packard Foundation, Butte Lab