SlideShare a Scribd company logo
Making Semantics do Some
Work
Robert Stevens
BioHealth Informatics Group
School of Computer Science
University of Manchester
Robert.Stevens@manchester.ac.uk
Introduction
• What’s the use of highly axiomatised
ontological descriptions?
• Two use cases:
• Classifying instances based on features:
New discoveries;
• Building a complex terminology.
• Cost and benefit.
• Conclusions
Protein Classification
• Proteins divided into broad functional classes
“Protein Families”
• Families sub-divided to give family
classifications
• Class membership can be determined by
“protein features”, such as domains, etc.
• Resources exist for feature detection via
primary sequence– but not class membership
• Current Limitation of Automated Tools
• Needs human knowledge to recognise class
membership
Finding Domains on a
Sequence
A search of the linear sequence of protein tyrosine
phosphatase type K – identified 9 functional
domains
>uniprot|Q15262|PTPK_HUMAN Receptor-type protein-tyrosine phosphatase kappa precursor (EC
3.1.3.48) (R-PTP-kappa).
MDTTAAAALPAFVALLLLSPWPLLGSAQGQFSAGGCTFDDGPGACDYHQDLYDDFEWVHV
SAQEPHYLPPEMPQGSYMIVDSSDHDPGEKARLQLPTMKENDTHCIDFSYLLYSQKGLNP
GTLNILVRVNKGPLANPIWNVTGFTGRDWLRAELAVSSFWPNEYQVIFEAEVSGGRSGYI
AIDDIQVLSYPCDKSPHFLRLGDVEVNAGQNATFQCIATGRDAVHNKLWLQRRNGEDIPV………..
Why Classify?
• Classification and curation of a genome is the
first step in understanding the processes and
functions happening in an organism
• Classification enables comparative genomic
studies - what is already known in other
organisms
• The similarities and differences between
processes and functions in related organisms
often provide the greatest insight into the
biology
• In silico characterisation is the current
bottleneck
Phosphatase Classification
• Diagnostic phosphatase domains/motifs – sufficient for
membership of the protein phosphatase superfamily
• Any protein having a phosphatase domain is a
member of the phosphatase super-family
• Other motifs determine a protein’s place within the
family
• Usually needs human to recognise that features
detected imply class membership
• Can these be captured in an ontology?
OWL represents
classes of
instances
A
B
C
Necessity and Sufficiency
• An R2A phosphatase must have a fibronectin domain
• Having a fibronectin domain does not a phosphatase
make
• Necessity -- what must a class instance have?
• Any protein that has a phosphatase catalytic domain is a
phosphatase enzyme
• All phosphatase enzymes have a catalytic domain
• Sufficiency – how is an instance recognised to be a
member of a class?
Definition of Tyrosine
Phosphatase
Class:
TyrosineReceptorProteinPhosphatase
EquivalentTo:
Protein That
- contains atLeast 1
ProteinTyrosinePhosphataseDomain
and
- contains 1 TransmembraneDomain
…there are known knowns; there are things
we know we know. We also know there are
known unknowns; that is to say we know there
are some things we do not know. But there
are also unknown unknowns -- the ones we
don't know we don't know.
Definition for R2A Phosphatase
Class: R2A
EquivalentTO: Protein That
- contains 2 ProteinTyrosinePhosphataseDomain and
- contains 1 TransmembraneDomain and
- contains 4 FibronectinDomains and
- contains 1 ImmunoglobulinDomain and
- contains 1 MAMDomain and
- contains 1 Cadherin-LikeDomain and
- contains only TyrosinePhosphataseDomain or
TransmembraneDomain or FibronectinDomain or
ImnunoglobulinDomain or Clathrin-LikeDomain or
ManDomain
Automated Reasoning
• An OWL-DL ontology mapped to its DL form
as a collection of axioms
• An automated reasoner checks for
satisfiability – throws out the inconsistent
and infers subsumption
• Defined classes (where there are necessary
and sufficient restrictions) enable a reasoner
to infer subclass axioms
• Also infer to which class an individual
belongs
Incremental Addition of Protein
Functional Domains
Phosphatase catalytic
Cadherin-like
Immunoglobulin
MAM domain Cellular retinaldehyde
Adhesion recognition Transmembrane
Fibronectin III Glycosylation
Classification of the Classical
Tyrosine Phosphatases
What is the Ontology Telling Us?
• Each class of phosphatase defined in terms of
domain composition
• We know the characteristics by which an
individual protein can be recognised to be a
member of a particular class of phosphatase
• We have this knowledge in a computational form
• If we had protein instances described in terms of
the ontology, we could classify those individual
proteins
• A catalogue of phosphatases
Description of an Instance of a
Protein
Individual: P21592
Types: Protein,
hasDomain 2
ProteinTyrosinePhosphataseDomain
hasdomain 1 TransmembraneDomain,,
hasdomain 4 FibronectinDomain,
hasDomain 1 ImmunoglobulinDomain,
hasdomain 1 MAMDomain,
hasdomain 1 Cadherin-LikeDomain
Instance: P21592        
TypeOf: Protein That
Fact: hasDomain 2 ProteinTyrosinePhosphataseDomain
and
Fact: hasdomain 1 TransmembraneDomain and 
Fact: hasdomain 4 FibronectinDomains and
Fact: hasDomain 1 ImmunoglobulinDomain and
Fact: hasdomain 1 MAMDomain and
Fact: hasdomain 1 Cadherin-LikeDomain
Tyrosine Phosphatase
(containsDomain some TransmembraneDomain) and
(containsDomain at least 1 ProteinTyrosinePhosphataseDomain)
tase
n some MAMDomain) and
n some ProteinTyrosineCatalyticDomain or ImmunoglobulinDomain) and
n some FibronectinDomain or FibronectinTypeIIIFoldDomain) and
n exactly 2 ProteinTyrosinePhosphataseDomain)
ClassifyingProteins
>uniprot|Q15262|PTPK_HUMAN Receptor-type protein-tyrosine
phosphatase kappa precursor (EC 3.1.3.48) (R-PTP-kappa).
MDTTAAAALPAFVALLLLSPWPLLGSAQGQFSAGGCTFDDGPGACDYHQDLYDDFEWVHV
SAQEPHYLPPEMPQGSYMIVDSSDHDPGEKARLQLPTMKENDTHCIDFSYLLYSQKGLNP
GTLNILVRVNKGPLANPIWNVTGFTGRDWLRAELAVSSFWPNEYQVIFEAEVSGGRSGYI
AIDDIQVLSYPCDKSPHFLRLGDVEVNAGQNATFQCIATGRDAVHNKLWLQRRNGEDIPV………..
InterPro
Instance Store
Reasoner
Translate
Codify
So Far…..
• Human phosphatases have been classified using the
system
• The ontology classification performed equally well as
expert classification
• The ontology system refined classification
- DUSC contains zinc finger domain
Characterised and conserved – but not in classification
- DUSA contains a disintegrin domain
previously uncharacterised – evolutionarily conserved
• A new kind of phosphatase?
Aspergillus fumigatus
• Phosphatase compliment very different from human
>100 human <50 A.fumigatus
• Whole subfamilies ‘missing’
Different fungi-specific phosphorylation pathways?
No requirement for tissue-specific variations?
• Novel serine/threonine phosphatase with homeobox
Conserved in aspergillus and closely related species,
but not in any other
Again, a new phosphatase?
Generic Technique
• Feature detection
• Categories defined in terms of those
features
• Produce catalogue of what you currently
know
• Highlight cases that don’t match current
knowledge
The Cell type Ontology
• Some 880 terms
• Describing cell function, lineage,
developmental stage, ploidy, secretion,
species,…
• Not explicitly classified according to anatomy
• Uses is-a and developsFrom
• Used to describe cell types used in
experiments
OBO Cell Type Ontology
Issues with Current CTO
• History: A need was seen and a few days
was spent “lashing” together an ontology
by hand
• Contains lots of knowledge
• Asserted multiple inheritance: Humans will
make slips and it is difficult
• Some biological mistakes
• All the knowledge is within the “is-a”
relationships and implicit in the cell names
CTO Axes of Classification
• Histology: What cells look like
• Lineage: Whence a given cell develops
• Ploidy: How sets of chromosomes in a cell
• Nucleation: How many nuclei
• Secretion & accumulation: What chemicals a
cells secretes or accumulates
• Function: What does the cell do
• Location: In anatomy
• Species: In what taxa does the cell exist
• And some others
Implicit Knowledge
• Anatomy: muscle cell; red blood cell
• Maturity: immature t-lymphocyte
• Cell surface protein: CD45 positive
lymphocyte
• Size
• Shape
Problems
• Tangles
• Hard to maintain
• Difficult to add a new cell
• Inflexible queries: What about hormone
secreting mesodermal cells?
• Information hidden inside term names
A Tangled Ontology of Cars
Describing a Big Blue Ford Car
Class: BigBlueFordCar
SubClassOf: Car
that hasColour some Blue
and hasSize some Big
and hasManufacturer some Ford
Modules
• Choose a primary axis: In this case
Vehicle
• Other axes are represented in separate
modules (Colour, Size (qualities) and
manufacturer)
• Represent other aspects of classes
through restrictions
• (Spot the ontological howler in this toy
example)
Definition of a Red Car
Class: RedCar
EquivalentTo: Car
that hasColour some Red
• Any car that has the colour red is
recognised to be a member of the
class RedCar
• The reasoner works it all out and
builds the hierarchy for you
Normalisation
• This technique of “pulling” apart tangled
ontologies is “normalisation”
• Makes for cleaner modelling
• Makes for re-usable components
• The reasoner builds the taxonomy
“completely”
• A new car (e.g., yellow Saab” is described
and it just appears in the right place
What We Did
• Examined CTO
• Chose primary axis of classification
• All other axes added as restrictions on
class membership
• Describe cells
• Build ontology
• Use reasoner
Ontologies Used
CTO
Ontolog
y
PATO
Ontology
GO
Biological Process
GO
Cellular Component
Species
Taxonomy
Anatom
y
Nucleation
Morphology
Size
Ploidy
Muscle Contraction
Secretion
Bacillus anthracis str. Ames
Chloroplast
Cell Membrane
Epithelium
Kidney
Mammalian Red Blood Cell
Class: RedBloodCell
SubclassOf: Cell
That hasNucleation some Anucleate
and participatesIn some
OxygenTransport
and existsIn some mammalia
and part_of some BloodTissue
and developsFrom some Reticulocyte
Mesodermal Lineage Cells
Class: MesodermalLineageCell
EquivalentTo: Cell
That developsFrom some
MesodermalCell
(developsFrom is transitive)
Spreadsheet
Workflow
Spreadsheet CVS OPPL
OWL
Ontology
Reasoned
Ontology
Secreting Cells
Class: EpinephrinSecretingCell
SubclassOf: Cell
That belongs_to_line some Somatic
and has_nucleation some mononucleate
and has_ploidy some diploid
and potentiality some TerminallyDifferentiated
and participates_in some EpinephrineBiosyntheticProcess
and participates_in EpinephrineSecretion
Class: ProlactinSecretingCell
SubclassOf: Cell
That belongs_to_line some Somatic
and has_nucleation some mononucleate
and has_ploidy some diploid
and potentiality some TerminallyDifferentiated
and participates_in some PeptideHormoneSecretion
and participates_in some ProlactinSecretion
Defined Cells
Class: SecretoryCell
EquivalentTo: Cell
that participates_in some (secretion or
(part_of some secrection)
Class: EndocrineCell
EquivalentTo: Cell
that participates_in some (EndocrineProcess or
(part_of some EndocrineProcess)
Asserted Hierarchy
Inferred Hierarchy
What We Found
• More subsumption relationships
• The “is-a” hierarchy is complete
• Explicitness made us ask questions
• Found bad structure
• Can just slip in a new cell
• Can make arbitrary queries based on any
of the types of axis
Conclusions
• Can use strict semantics and automated
reasoning to build structurally sound ontologies
• Can catalogue instances and make discoveries
• If an object can be recognised by its features
and features can be computationally generated
classification can be automated
• High cost and high benefit
Acknowledgements
• Katy Wolstencroft did the protein
phosphtase work as part of her Ph.D.
• The work on the cell type ontology was
udnertaken by members of the EPSRC
OntoGenesis Network
• All the ontoogy work at Manchester relies
on the support and input of the wider
BioHealth and Information Management
Groups

More Related Content

Similar to Making Semantics do Some Work

Advanced molecular biology.ppt
Advanced molecular biology.pptAdvanced molecular biology.ppt
Advanced molecular biology.ppt
MUHAMMEDBAWAYUSUF
 
Epidermal Growth Factor use in Diabetic Foot Ulcers
Epidermal Growth Factor use in Diabetic Foot UlcersEpidermal Growth Factor use in Diabetic Foot Ulcers
Epidermal Growth Factor use in Diabetic Foot Ulcers
G H PRABHU
 
Cloning and Extracellular Expression of Recombinant Tissue Plasminogen Activa...
Cloning and Extracellular Expression of Recombinant Tissue Plasminogen Activa...Cloning and Extracellular Expression of Recombinant Tissue Plasminogen Activa...
Cloning and Extracellular Expression of Recombinant Tissue Plasminogen Activa...
bioejjournal
 
CLONING AND EXTRACELLULAR EXPRESSION OF RECOMBINANT TISSUE PLASMINOGEN ACTIVA...
CLONING AND EXTRACELLULAR EXPRESSION OF RECOMBINANT TISSUE PLASMINOGEN ACTIVA...CLONING AND EXTRACELLULAR EXPRESSION OF RECOMBINANT TISSUE PLASMINOGEN ACTIVA...
CLONING AND EXTRACELLULAR EXPRESSION OF RECOMBINANT TISSUE PLASMINOGEN ACTIVA...
bioejjournal
 
Cloning and extracellular expression of recombinant tissue plasminogen activa...
Cloning and extracellular expression of recombinant tissue plasminogen activa...Cloning and extracellular expression of recombinant tissue plasminogen activa...
Cloning and extracellular expression of recombinant tissue plasminogen activa...
bioejjournal
 
Cloning and extracellular expression of recombinant tissue plasminogen activa...
Cloning and extracellular expression of recombinant tissue plasminogen activa...Cloning and extracellular expression of recombinant tissue plasminogen activa...
Cloning and extracellular expression of recombinant tissue plasminogen activa...
bioejjournal
 
Cloning and Extracellular Expression of Recombinant Tissue Plasminogen Activa...
Cloning and Extracellular Expression of Recombinant Tissue Plasminogen Activa...Cloning and Extracellular Expression of Recombinant Tissue Plasminogen Activa...
Cloning and Extracellular Expression of Recombinant Tissue Plasminogen Activa...
bioejjournal
 
Biotechnology Chapter One Lecture- Intro to Biotech
Biotechnology Chapter One Lecture- Intro to BiotechBiotechnology Chapter One Lecture- Intro to Biotech
Biotechnology Chapter One Lecture- Intro to Biotech
Mary Beth Smith
 
An introduction to promoter prediction and analysis
An introduction to promoter prediction and analysisAn introduction to promoter prediction and analysis
An introduction to promoter prediction and analysis
Sarbesh D. Dangol
 
2013 10 23_dna_for_dummies_v_presented
2013 10 23_dna_for_dummies_v_presented2013 10 23_dna_for_dummies_v_presented
2013 10 23_dna_for_dummies_v_presented
Prof. Wim Van Criekinge
 
2nd level presentation (1) (1)
2nd level presentation (1) (1)2nd level presentation (1) (1)
2nd level presentation (1) (1)Lynn Yang
 
2009 Crofton DNT Thyroid Disruptors NBTS Meeting
2009 Crofton DNT Thyroid Disruptors NBTS Meeting2009 Crofton DNT Thyroid Disruptors NBTS Meeting
2009 Crofton DNT Thyroid Disruptors NBTS Meeting
KevinCrofton
 
TranskinTM : a novel skin penetration enhancing peptide for the transdermal d...
TranskinTM : a novel skin penetration enhancing peptide for the transdermal d...TranskinTM : a novel skin penetration enhancing peptide for the transdermal d...
TranskinTM : a novel skin penetration enhancing peptide for the transdermal d...
Simba Events
 
intracellular receptors ,signaling,transcription factors , nucleic acids
intracellular receptors ,signaling,transcription factors , nucleic acidsintracellular receptors ,signaling,transcription factors , nucleic acids
intracellular receptors ,signaling,transcription factors , nucleic acids
Dina Kishor
 
Introduction to Bioinformatics DNA, RNA, Transcriotion, Genes
Introduction to Bioinformatics DNA, RNA, Transcriotion, GenesIntroduction to Bioinformatics DNA, RNA, Transcriotion, Genes
Introduction to Bioinformatics DNA, RNA, Transcriotion, Genes
Rana Muhammad ATIF
 
ChEMBL US tour December 2014
ChEMBL US tour December 2014ChEMBL US tour December 2014
ChEMBL US tour December 2014
John Overington
 
BioPrimer.ppt
BioPrimer.pptBioPrimer.ppt
BioPrimer.ppt
Syed Attíyà
 
BioPrimer (1).ppt
BioPrimer (1).pptBioPrimer (1).ppt
BioPrimer (1).ppt
sidra285437
 
Introduction to Bioinformatics Molecular Biology Primer
Introduction to Bioinformatics Molecular Biology PrimerIntroduction to Bioinformatics Molecular Biology Primer
Introduction to Bioinformatics Molecular Biology Primer
vanessajoytimoteogar
 

Similar to Making Semantics do Some Work (20)

Advanced molecular biology.ppt
Advanced molecular biology.pptAdvanced molecular biology.ppt
Advanced molecular biology.ppt
 
Epidermal Growth Factor use in Diabetic Foot Ulcers
Epidermal Growth Factor use in Diabetic Foot UlcersEpidermal Growth Factor use in Diabetic Foot Ulcers
Epidermal Growth Factor use in Diabetic Foot Ulcers
 
Cloning and Extracellular Expression of Recombinant Tissue Plasminogen Activa...
Cloning and Extracellular Expression of Recombinant Tissue Plasminogen Activa...Cloning and Extracellular Expression of Recombinant Tissue Plasminogen Activa...
Cloning and Extracellular Expression of Recombinant Tissue Plasminogen Activa...
 
CLONING AND EXTRACELLULAR EXPRESSION OF RECOMBINANT TISSUE PLASMINOGEN ACTIVA...
CLONING AND EXTRACELLULAR EXPRESSION OF RECOMBINANT TISSUE PLASMINOGEN ACTIVA...CLONING AND EXTRACELLULAR EXPRESSION OF RECOMBINANT TISSUE PLASMINOGEN ACTIVA...
CLONING AND EXTRACELLULAR EXPRESSION OF RECOMBINANT TISSUE PLASMINOGEN ACTIVA...
 
Cloning and extracellular expression of recombinant tissue plasminogen activa...
Cloning and extracellular expression of recombinant tissue plasminogen activa...Cloning and extracellular expression of recombinant tissue plasminogen activa...
Cloning and extracellular expression of recombinant tissue plasminogen activa...
 
Cloning and extracellular expression of recombinant tissue plasminogen activa...
Cloning and extracellular expression of recombinant tissue plasminogen activa...Cloning and extracellular expression of recombinant tissue plasminogen activa...
Cloning and extracellular expression of recombinant tissue plasminogen activa...
 
Cloning and Extracellular Expression of Recombinant Tissue Plasminogen Activa...
Cloning and Extracellular Expression of Recombinant Tissue Plasminogen Activa...Cloning and Extracellular Expression of Recombinant Tissue Plasminogen Activa...
Cloning and Extracellular Expression of Recombinant Tissue Plasminogen Activa...
 
Biotechnology Chapter One Lecture- Intro to Biotech
Biotechnology Chapter One Lecture- Intro to BiotechBiotechnology Chapter One Lecture- Intro to Biotech
Biotechnology Chapter One Lecture- Intro to Biotech
 
An introduction to promoter prediction and analysis
An introduction to promoter prediction and analysisAn introduction to promoter prediction and analysis
An introduction to promoter prediction and analysis
 
targeting
targetingtargeting
targeting
 
2013 10 23_dna_for_dummies_v_presented
2013 10 23_dna_for_dummies_v_presented2013 10 23_dna_for_dummies_v_presented
2013 10 23_dna_for_dummies_v_presented
 
2nd level presentation (1) (1)
2nd level presentation (1) (1)2nd level presentation (1) (1)
2nd level presentation (1) (1)
 
2009 Crofton DNT Thyroid Disruptors NBTS Meeting
2009 Crofton DNT Thyroid Disruptors NBTS Meeting2009 Crofton DNT Thyroid Disruptors NBTS Meeting
2009 Crofton DNT Thyroid Disruptors NBTS Meeting
 
TranskinTM : a novel skin penetration enhancing peptide for the transdermal d...
TranskinTM : a novel skin penetration enhancing peptide for the transdermal d...TranskinTM : a novel skin penetration enhancing peptide for the transdermal d...
TranskinTM : a novel skin penetration enhancing peptide for the transdermal d...
 
intracellular receptors ,signaling,transcription factors , nucleic acids
intracellular receptors ,signaling,transcription factors , nucleic acidsintracellular receptors ,signaling,transcription factors , nucleic acids
intracellular receptors ,signaling,transcription factors , nucleic acids
 
Introduction to Bioinformatics DNA, RNA, Transcriotion, Genes
Introduction to Bioinformatics DNA, RNA, Transcriotion, GenesIntroduction to Bioinformatics DNA, RNA, Transcriotion, Genes
Introduction to Bioinformatics DNA, RNA, Transcriotion, Genes
 
ChEMBL US tour December 2014
ChEMBL US tour December 2014ChEMBL US tour December 2014
ChEMBL US tour December 2014
 
BioPrimer.ppt
BioPrimer.pptBioPrimer.ppt
BioPrimer.ppt
 
BioPrimer (1).ppt
BioPrimer (1).pptBioPrimer (1).ppt
BioPrimer (1).ppt
 
Introduction to Bioinformatics Molecular Biology Primer
Introduction to Bioinformatics Molecular Biology PrimerIntroduction to Bioinformatics Molecular Biology Primer
Introduction to Bioinformatics Molecular Biology Primer
 

More from robertstevens65

Ontologies: Necessary, but not sufficient
Ontologies: Necessary, but not sufficientOntologies: Necessary, but not sufficient
Ontologies: Necessary, but not sufficient
robertstevens65
 
The Pragmatics and Formality of Authoring OntologiesOdsl 2016
The Pragmatics and Formality of Authoring OntologiesOdsl 2016The Pragmatics and Formality of Authoring OntologiesOdsl 2016
The Pragmatics and Formality of Authoring OntologiesOdsl 2016
robertstevens65
 
OBOPedia: An Encyclopaedia of Biology Using OBO OntologiesObopedia swat4ls-20...
OBOPedia: An Encyclopaedia of Biology Using OBO OntologiesObopedia swat4ls-20...OBOPedia: An Encyclopaedia of Biology Using OBO OntologiesObopedia swat4ls-20...
OBOPedia: An Encyclopaedia of Biology Using OBO OntologiesObopedia swat4ls-20...
robertstevens65
 
The Quality of Method Reporting in
The Quality of Method Reporting in The Quality of Method Reporting in
The Quality of Method Reporting in
robertstevens65
 
The Semantics of Genomic Analysis
The Semantics of  Genomic AnalysisThe Semantics of  Genomic Analysis
The Semantics of Genomic Analysis
robertstevens65
 
Issues and activities in authoring ontologies
Issues and activities in authoring ontologiesIssues and activities in authoring ontologies
Issues and activities in authoring ontologies
robertstevens65
 
The state of the nation for ontology development
The state of the nation for ontology developmentThe state of the nation for ontology development
The state of the nation for ontology development
robertstevens65
 
Building and Using Ontologies to do biology
Building and Using Ontologies to do biologyBuilding and Using Ontologies to do biology
Building and Using Ontologies to do biology
robertstevens65
 
Properties and Individuals in OWL: Reasoning About Family History
Properties and Individuals in OWL: Reasoning About Family HistoryProperties and Individuals in OWL: Reasoning About Family History
Properties and Individuals in OWL: Reasoning About Family History
robertstevens65
 
Choosing and Building Knowledge Artefacts
Choosing and Building Knowledge ArtefactsChoosing and Building Knowledge Artefacts
Choosing and Building Knowledge Artefacts
robertstevens65
 
Populous: A tool for Populating OWL Ontologies from Templates
Populous: A tool for Populating OWL Ontologies from TemplatesPopulous: A tool for Populating OWL Ontologies from Templates
Populous: A tool for Populating OWL Ontologies from Templates
robertstevens65
 
Keeping ontology development Agile
Keeping ontology development AgileKeeping ontology development Agile
Keeping ontology development Agile
robertstevens65
 
Spreadsheets to OWL
Spreadsheets to OWLSpreadsheets to OWL
Spreadsheets to OWL
robertstevens65
 
Lessons from teaching non-computer scientists OWL and ontologies
Lessons from teaching non-computer scientists OWL and ontologiesLessons from teaching non-computer scientists OWL and ontologies
Lessons from teaching non-computer scientists OWL and ontologies
robertstevens65
 
Kidney and Urinary Pathways Knowledge Base (part of e-LICO)
Kidney and Urinary Pathways Knowledge Base (part of e-LICO)Kidney and Urinary Pathways Knowledge Base (part of e-LICO)
Kidney and Urinary Pathways Knowledge Base (part of e-LICO)
robertstevens65
 
A Rose by Any Other Name is Still a Rose
A Rose by Any Other Name is Still a RoseA Rose by Any Other Name is Still a Rose
A Rose by Any Other Name is Still a Rose
robertstevens65
 
Working with big biomedical ontologies
Working with big biomedical ontologiesWorking with big biomedical ontologies
Working with big biomedical ontologies
robertstevens65
 
The Big Picture: The Industrial Revolutiona talk in berlin, 2008, about indus...
The Big Picture: The Industrial Revolutiona talk in berlin, 2008, about indus...The Big Picture: The Industrial Revolutiona talk in berlin, 2008, about indus...
The Big Picture: The Industrial Revolutiona talk in berlin, 2008, about indus...
robertstevens65
 
Ontology learning from text
Ontology learning from textOntology learning from text
Ontology learning from textrobertstevens65
 
Knowledge Management in a Knowledge Based Discipline
Knowledge Management in a Knowledge Based DisciplineKnowledge Management in a Knowledge Based Discipline
Knowledge Management in a Knowledge Based Discipline
robertstevens65
 

More from robertstevens65 (20)

Ontologies: Necessary, but not sufficient
Ontologies: Necessary, but not sufficientOntologies: Necessary, but not sufficient
Ontologies: Necessary, but not sufficient
 
The Pragmatics and Formality of Authoring OntologiesOdsl 2016
The Pragmatics and Formality of Authoring OntologiesOdsl 2016The Pragmatics and Formality of Authoring OntologiesOdsl 2016
The Pragmatics and Formality of Authoring OntologiesOdsl 2016
 
OBOPedia: An Encyclopaedia of Biology Using OBO OntologiesObopedia swat4ls-20...
OBOPedia: An Encyclopaedia of Biology Using OBO OntologiesObopedia swat4ls-20...OBOPedia: An Encyclopaedia of Biology Using OBO OntologiesObopedia swat4ls-20...
OBOPedia: An Encyclopaedia of Biology Using OBO OntologiesObopedia swat4ls-20...
 
The Quality of Method Reporting in
The Quality of Method Reporting in The Quality of Method Reporting in
The Quality of Method Reporting in
 
The Semantics of Genomic Analysis
The Semantics of  Genomic AnalysisThe Semantics of  Genomic Analysis
The Semantics of Genomic Analysis
 
Issues and activities in authoring ontologies
Issues and activities in authoring ontologiesIssues and activities in authoring ontologies
Issues and activities in authoring ontologies
 
The state of the nation for ontology development
The state of the nation for ontology developmentThe state of the nation for ontology development
The state of the nation for ontology development
 
Building and Using Ontologies to do biology
Building and Using Ontologies to do biologyBuilding and Using Ontologies to do biology
Building and Using Ontologies to do biology
 
Properties and Individuals in OWL: Reasoning About Family History
Properties and Individuals in OWL: Reasoning About Family HistoryProperties and Individuals in OWL: Reasoning About Family History
Properties and Individuals in OWL: Reasoning About Family History
 
Choosing and Building Knowledge Artefacts
Choosing and Building Knowledge ArtefactsChoosing and Building Knowledge Artefacts
Choosing and Building Knowledge Artefacts
 
Populous: A tool for Populating OWL Ontologies from Templates
Populous: A tool for Populating OWL Ontologies from TemplatesPopulous: A tool for Populating OWL Ontologies from Templates
Populous: A tool for Populating OWL Ontologies from Templates
 
Keeping ontology development Agile
Keeping ontology development AgileKeeping ontology development Agile
Keeping ontology development Agile
 
Spreadsheets to OWL
Spreadsheets to OWLSpreadsheets to OWL
Spreadsheets to OWL
 
Lessons from teaching non-computer scientists OWL and ontologies
Lessons from teaching non-computer scientists OWL and ontologiesLessons from teaching non-computer scientists OWL and ontologies
Lessons from teaching non-computer scientists OWL and ontologies
 
Kidney and Urinary Pathways Knowledge Base (part of e-LICO)
Kidney and Urinary Pathways Knowledge Base (part of e-LICO)Kidney and Urinary Pathways Knowledge Base (part of e-LICO)
Kidney and Urinary Pathways Knowledge Base (part of e-LICO)
 
A Rose by Any Other Name is Still a Rose
A Rose by Any Other Name is Still a RoseA Rose by Any Other Name is Still a Rose
A Rose by Any Other Name is Still a Rose
 
Working with big biomedical ontologies
Working with big biomedical ontologiesWorking with big biomedical ontologies
Working with big biomedical ontologies
 
The Big Picture: The Industrial Revolutiona talk in berlin, 2008, about indus...
The Big Picture: The Industrial Revolutiona talk in berlin, 2008, about indus...The Big Picture: The Industrial Revolutiona talk in berlin, 2008, about indus...
The Big Picture: The Industrial Revolutiona talk in berlin, 2008, about indus...
 
Ontology learning from text
Ontology learning from textOntology learning from text
Ontology learning from text
 
Knowledge Management in a Knowledge Based Discipline
Knowledge Management in a Knowledge Based DisciplineKnowledge Management in a Knowledge Based Discipline
Knowledge Management in a Knowledge Based Discipline
 

Recently uploaded

Toxic effects of heavy metals : Lead and Arsenic
Toxic effects of heavy metals : Lead and ArsenicToxic effects of heavy metals : Lead and Arsenic
Toxic effects of heavy metals : Lead and Arsenic
sanjana502982
 
Mammalian Pineal Body Structure and Also Functions
Mammalian Pineal Body Structure and Also FunctionsMammalian Pineal Body Structure and Also Functions
Mammalian Pineal Body Structure and Also Functions
YOGESH DOGRA
 
in vitro propagation of plants lecture note.pptx
in vitro propagation of plants lecture note.pptxin vitro propagation of plants lecture note.pptx
in vitro propagation of plants lecture note.pptx
yusufzako14
 
Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...
Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...
Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...
Sérgio Sacani
 
GBSN - Biochemistry (Unit 5) Chemistry of Lipids
GBSN - Biochemistry (Unit 5) Chemistry of LipidsGBSN - Biochemistry (Unit 5) Chemistry of Lipids
GBSN - Biochemistry (Unit 5) Chemistry of Lipids
Areesha Ahmad
 
Richard's aventures in two entangled wonderlands
Richard's aventures in two entangled wonderlandsRichard's aventures in two entangled wonderlands
Richard's aventures in two entangled wonderlands
Richard Gill
 
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Sérgio Sacani
 
Unveiling the Energy Potential of Marshmallow Deposits.pdf
Unveiling the Energy Potential of Marshmallow Deposits.pdfUnveiling the Energy Potential of Marshmallow Deposits.pdf
Unveiling the Energy Potential of Marshmallow Deposits.pdf
Erdal Coalmaker
 
THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.
THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.
THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.
Sérgio Sacani
 
BLOOD AND BLOOD COMPONENT- introduction to blood physiology
BLOOD AND BLOOD COMPONENT- introduction to blood physiologyBLOOD AND BLOOD COMPONENT- introduction to blood physiology
BLOOD AND BLOOD COMPONENT- introduction to blood physiology
NoelManyise1
 
nodule formation by alisha dewangan.pptx
nodule formation by alisha dewangan.pptxnodule formation by alisha dewangan.pptx
nodule formation by alisha dewangan.pptx
alishadewangan1
 
role of pramana in research.pptx in science
role of pramana in research.pptx in sciencerole of pramana in research.pptx in science
role of pramana in research.pptx in science
sonaliswain16
 
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
yqqaatn0
 
Deep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless ReproducibilityDeep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless Reproducibility
University of Rennes, INSA Rennes, Inria/IRISA, CNRS
 
Hemoglobin metabolism_pathophysiology.pptx
Hemoglobin metabolism_pathophysiology.pptxHemoglobin metabolism_pathophysiology.pptx
Hemoglobin metabolism_pathophysiology.pptx
muralinath2
 
(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...
(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...
(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...
Scintica Instrumentation
 
GBSN- Microbiology (Lab 3) Gram Staining
GBSN- Microbiology (Lab 3) Gram StainingGBSN- Microbiology (Lab 3) Gram Staining
GBSN- Microbiology (Lab 3) Gram Staining
Areesha Ahmad
 
Salas, V. (2024) "John of St. Thomas (Poinsot) on the Science of Sacred Theol...
Salas, V. (2024) "John of St. Thomas (Poinsot) on the Science of Sacred Theol...Salas, V. (2024) "John of St. Thomas (Poinsot) on the Science of Sacred Theol...
Salas, V. (2024) "John of St. Thomas (Poinsot) on the Science of Sacred Theol...
Studia Poinsotiana
 
erythropoiesis-I_mechanism& clinical significance.pptx
erythropoiesis-I_mechanism& clinical significance.pptxerythropoiesis-I_mechanism& clinical significance.pptx
erythropoiesis-I_mechanism& clinical significance.pptx
muralinath2
 
general properties of oerganologametal.ppt
general properties of oerganologametal.pptgeneral properties of oerganologametal.ppt
general properties of oerganologametal.ppt
IqrimaNabilatulhusni
 

Recently uploaded (20)

Toxic effects of heavy metals : Lead and Arsenic
Toxic effects of heavy metals : Lead and ArsenicToxic effects of heavy metals : Lead and Arsenic
Toxic effects of heavy metals : Lead and Arsenic
 
Mammalian Pineal Body Structure and Also Functions
Mammalian Pineal Body Structure and Also FunctionsMammalian Pineal Body Structure and Also Functions
Mammalian Pineal Body Structure and Also Functions
 
in vitro propagation of plants lecture note.pptx
in vitro propagation of plants lecture note.pptxin vitro propagation of plants lecture note.pptx
in vitro propagation of plants lecture note.pptx
 
Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...
Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...
Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...
 
GBSN - Biochemistry (Unit 5) Chemistry of Lipids
GBSN - Biochemistry (Unit 5) Chemistry of LipidsGBSN - Biochemistry (Unit 5) Chemistry of Lipids
GBSN - Biochemistry (Unit 5) Chemistry of Lipids
 
Richard's aventures in two entangled wonderlands
Richard's aventures in two entangled wonderlandsRichard's aventures in two entangled wonderlands
Richard's aventures in two entangled wonderlands
 
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
 
Unveiling the Energy Potential of Marshmallow Deposits.pdf
Unveiling the Energy Potential of Marshmallow Deposits.pdfUnveiling the Energy Potential of Marshmallow Deposits.pdf
Unveiling the Energy Potential of Marshmallow Deposits.pdf
 
THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.
THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.
THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.
 
BLOOD AND BLOOD COMPONENT- introduction to blood physiology
BLOOD AND BLOOD COMPONENT- introduction to blood physiologyBLOOD AND BLOOD COMPONENT- introduction to blood physiology
BLOOD AND BLOOD COMPONENT- introduction to blood physiology
 
nodule formation by alisha dewangan.pptx
nodule formation by alisha dewangan.pptxnodule formation by alisha dewangan.pptx
nodule formation by alisha dewangan.pptx
 
role of pramana in research.pptx in science
role of pramana in research.pptx in sciencerole of pramana in research.pptx in science
role of pramana in research.pptx in science
 
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
 
Deep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless ReproducibilityDeep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless Reproducibility
 
Hemoglobin metabolism_pathophysiology.pptx
Hemoglobin metabolism_pathophysiology.pptxHemoglobin metabolism_pathophysiology.pptx
Hemoglobin metabolism_pathophysiology.pptx
 
(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...
(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...
(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...
 
GBSN- Microbiology (Lab 3) Gram Staining
GBSN- Microbiology (Lab 3) Gram StainingGBSN- Microbiology (Lab 3) Gram Staining
GBSN- Microbiology (Lab 3) Gram Staining
 
Salas, V. (2024) "John of St. Thomas (Poinsot) on the Science of Sacred Theol...
Salas, V. (2024) "John of St. Thomas (Poinsot) on the Science of Sacred Theol...Salas, V. (2024) "John of St. Thomas (Poinsot) on the Science of Sacred Theol...
Salas, V. (2024) "John of St. Thomas (Poinsot) on the Science of Sacred Theol...
 
erythropoiesis-I_mechanism& clinical significance.pptx
erythropoiesis-I_mechanism& clinical significance.pptxerythropoiesis-I_mechanism& clinical significance.pptx
erythropoiesis-I_mechanism& clinical significance.pptx
 
general properties of oerganologametal.ppt
general properties of oerganologametal.pptgeneral properties of oerganologametal.ppt
general properties of oerganologametal.ppt
 

Making Semantics do Some Work

  • 1. Making Semantics do Some Work Robert Stevens BioHealth Informatics Group School of Computer Science University of Manchester Robert.Stevens@manchester.ac.uk
  • 2. Introduction • What’s the use of highly axiomatised ontological descriptions? • Two use cases: • Classifying instances based on features: New discoveries; • Building a complex terminology. • Cost and benefit. • Conclusions
  • 3. Protein Classification • Proteins divided into broad functional classes “Protein Families” • Families sub-divided to give family classifications • Class membership can be determined by “protein features”, such as domains, etc. • Resources exist for feature detection via primary sequence– but not class membership • Current Limitation of Automated Tools • Needs human knowledge to recognise class membership
  • 4. Finding Domains on a Sequence A search of the linear sequence of protein tyrosine phosphatase type K – identified 9 functional domains >uniprot|Q15262|PTPK_HUMAN Receptor-type protein-tyrosine phosphatase kappa precursor (EC 3.1.3.48) (R-PTP-kappa). MDTTAAAALPAFVALLLLSPWPLLGSAQGQFSAGGCTFDDGPGACDYHQDLYDDFEWVHV SAQEPHYLPPEMPQGSYMIVDSSDHDPGEKARLQLPTMKENDTHCIDFSYLLYSQKGLNP GTLNILVRVNKGPLANPIWNVTGFTGRDWLRAELAVSSFWPNEYQVIFEAEVSGGRSGYI AIDDIQVLSYPCDKSPHFLRLGDVEVNAGQNATFQCIATGRDAVHNKLWLQRRNGEDIPV………..
  • 5. Why Classify? • Classification and curation of a genome is the first step in understanding the processes and functions happening in an organism • Classification enables comparative genomic studies - what is already known in other organisms • The similarities and differences between processes and functions in related organisms often provide the greatest insight into the biology • In silico characterisation is the current bottleneck
  • 6. Phosphatase Classification • Diagnostic phosphatase domains/motifs – sufficient for membership of the protein phosphatase superfamily • Any protein having a phosphatase domain is a member of the phosphatase super-family • Other motifs determine a protein’s place within the family • Usually needs human to recognise that features detected imply class membership • Can these be captured in an ontology?
  • 8. Necessity and Sufficiency • An R2A phosphatase must have a fibronectin domain • Having a fibronectin domain does not a phosphatase make • Necessity -- what must a class instance have? • Any protein that has a phosphatase catalytic domain is a phosphatase enzyme • All phosphatase enzymes have a catalytic domain • Sufficiency – how is an instance recognised to be a member of a class?
  • 9. Definition of Tyrosine Phosphatase Class: TyrosineReceptorProteinPhosphatase EquivalentTo: Protein That - contains atLeast 1 ProteinTyrosinePhosphataseDomain and - contains 1 TransmembraneDomain
  • 10. …there are known knowns; there are things we know we know. We also know there are known unknowns; that is to say we know there are some things we do not know. But there are also unknown unknowns -- the ones we don't know we don't know.
  • 11. Definition for R2A Phosphatase Class: R2A EquivalentTO: Protein That - contains 2 ProteinTyrosinePhosphataseDomain and - contains 1 TransmembraneDomain and - contains 4 FibronectinDomains and - contains 1 ImmunoglobulinDomain and - contains 1 MAMDomain and - contains 1 Cadherin-LikeDomain and - contains only TyrosinePhosphataseDomain or TransmembraneDomain or FibronectinDomain or ImnunoglobulinDomain or Clathrin-LikeDomain or ManDomain
  • 12. Automated Reasoning • An OWL-DL ontology mapped to its DL form as a collection of axioms • An automated reasoner checks for satisfiability – throws out the inconsistent and infers subsumption • Defined classes (where there are necessary and sufficient restrictions) enable a reasoner to infer subclass axioms • Also infer to which class an individual belongs
  • 13. Incremental Addition of Protein Functional Domains Phosphatase catalytic Cadherin-like Immunoglobulin MAM domain Cellular retinaldehyde Adhesion recognition Transmembrane Fibronectin III Glycosylation
  • 14. Classification of the Classical Tyrosine Phosphatases
  • 15. What is the Ontology Telling Us? • Each class of phosphatase defined in terms of domain composition • We know the characteristics by which an individual protein can be recognised to be a member of a particular class of phosphatase • We have this knowledge in a computational form • If we had protein instances described in terms of the ontology, we could classify those individual proteins • A catalogue of phosphatases
  • 16. Description of an Instance of a Protein Individual: P21592 Types: Protein, hasDomain 2 ProteinTyrosinePhosphataseDomain hasdomain 1 TransmembraneDomain,, hasdomain 4 FibronectinDomain, hasDomain 1 ImmunoglobulinDomain, hasdomain 1 MAMDomain, hasdomain 1 Cadherin-LikeDomain
  • 17. Instance: P21592         TypeOf: Protein That Fact: hasDomain 2 ProteinTyrosinePhosphataseDomain and Fact: hasdomain 1 TransmembraneDomain and  Fact: hasdomain 4 FibronectinDomains and Fact: hasDomain 1 ImmunoglobulinDomain and Fact: hasdomain 1 MAMDomain and Fact: hasdomain 1 Cadherin-LikeDomain Tyrosine Phosphatase (containsDomain some TransmembraneDomain) and (containsDomain at least 1 ProteinTyrosinePhosphataseDomain) tase n some MAMDomain) and n some ProteinTyrosineCatalyticDomain or ImmunoglobulinDomain) and n some FibronectinDomain or FibronectinTypeIIIFoldDomain) and n exactly 2 ProteinTyrosinePhosphataseDomain)
  • 18. ClassifyingProteins >uniprot|Q15262|PTPK_HUMAN Receptor-type protein-tyrosine phosphatase kappa precursor (EC 3.1.3.48) (R-PTP-kappa). MDTTAAAALPAFVALLLLSPWPLLGSAQGQFSAGGCTFDDGPGACDYHQDLYDDFEWVHV SAQEPHYLPPEMPQGSYMIVDSSDHDPGEKARLQLPTMKENDTHCIDFSYLLYSQKGLNP GTLNILVRVNKGPLANPIWNVTGFTGRDWLRAELAVSSFWPNEYQVIFEAEVSGGRSGYI AIDDIQVLSYPCDKSPHFLRLGDVEVNAGQNATFQCIATGRDAVHNKLWLQRRNGEDIPV……….. InterPro Instance Store Reasoner Translate Codify
  • 19. So Far….. • Human phosphatases have been classified using the system • The ontology classification performed equally well as expert classification • The ontology system refined classification - DUSC contains zinc finger domain Characterised and conserved – but not in classification - DUSA contains a disintegrin domain previously uncharacterised – evolutionarily conserved • A new kind of phosphatase?
  • 20. Aspergillus fumigatus • Phosphatase compliment very different from human >100 human <50 A.fumigatus • Whole subfamilies ‘missing’ Different fungi-specific phosphorylation pathways? No requirement for tissue-specific variations? • Novel serine/threonine phosphatase with homeobox Conserved in aspergillus and closely related species, but not in any other Again, a new phosphatase?
  • 21. Generic Technique • Feature detection • Categories defined in terms of those features • Produce catalogue of what you currently know • Highlight cases that don’t match current knowledge
  • 22. The Cell type Ontology • Some 880 terms • Describing cell function, lineage, developmental stage, ploidy, secretion, species,… • Not explicitly classified according to anatomy • Uses is-a and developsFrom • Used to describe cell types used in experiments
  • 23. OBO Cell Type Ontology
  • 24. Issues with Current CTO • History: A need was seen and a few days was spent “lashing” together an ontology by hand • Contains lots of knowledge • Asserted multiple inheritance: Humans will make slips and it is difficult • Some biological mistakes • All the knowledge is within the “is-a” relationships and implicit in the cell names
  • 25. CTO Axes of Classification • Histology: What cells look like • Lineage: Whence a given cell develops • Ploidy: How sets of chromosomes in a cell • Nucleation: How many nuclei • Secretion & accumulation: What chemicals a cells secretes or accumulates • Function: What does the cell do • Location: In anatomy • Species: In what taxa does the cell exist • And some others
  • 26. Implicit Knowledge • Anatomy: muscle cell; red blood cell • Maturity: immature t-lymphocyte • Cell surface protein: CD45 positive lymphocyte • Size • Shape
  • 27. Problems • Tangles • Hard to maintain • Difficult to add a new cell • Inflexible queries: What about hormone secreting mesodermal cells? • Information hidden inside term names
  • 29. Describing a Big Blue Ford Car Class: BigBlueFordCar SubClassOf: Car that hasColour some Blue and hasSize some Big and hasManufacturer some Ford
  • 30. Modules • Choose a primary axis: In this case Vehicle • Other axes are represented in separate modules (Colour, Size (qualities) and manufacturer) • Represent other aspects of classes through restrictions • (Spot the ontological howler in this toy example)
  • 31. Definition of a Red Car Class: RedCar EquivalentTo: Car that hasColour some Red • Any car that has the colour red is recognised to be a member of the class RedCar • The reasoner works it all out and builds the hierarchy for you
  • 32. Normalisation • This technique of “pulling” apart tangled ontologies is “normalisation” • Makes for cleaner modelling • Makes for re-usable components • The reasoner builds the taxonomy “completely” • A new car (e.g., yellow Saab” is described and it just appears in the right place
  • 33. What We Did • Examined CTO • Chose primary axis of classification • All other axes added as restrictions on class membership • Describe cells • Build ontology • Use reasoner
  • 34. Ontologies Used CTO Ontolog y PATO Ontology GO Biological Process GO Cellular Component Species Taxonomy Anatom y Nucleation Morphology Size Ploidy Muscle Contraction Secretion Bacillus anthracis str. Ames Chloroplast Cell Membrane Epithelium Kidney
  • 35. Mammalian Red Blood Cell Class: RedBloodCell SubclassOf: Cell That hasNucleation some Anucleate and participatesIn some OxygenTransport and existsIn some mammalia and part_of some BloodTissue and developsFrom some Reticulocyte
  • 36. Mesodermal Lineage Cells Class: MesodermalLineageCell EquivalentTo: Cell That developsFrom some MesodermalCell (developsFrom is transitive)
  • 39. Secreting Cells Class: EpinephrinSecretingCell SubclassOf: Cell That belongs_to_line some Somatic and has_nucleation some mononucleate and has_ploidy some diploid and potentiality some TerminallyDifferentiated and participates_in some EpinephrineBiosyntheticProcess and participates_in EpinephrineSecretion Class: ProlactinSecretingCell SubclassOf: Cell That belongs_to_line some Somatic and has_nucleation some mononucleate and has_ploidy some diploid and potentiality some TerminallyDifferentiated and participates_in some PeptideHormoneSecretion and participates_in some ProlactinSecretion
  • 40. Defined Cells Class: SecretoryCell EquivalentTo: Cell that participates_in some (secretion or (part_of some secrection) Class: EndocrineCell EquivalentTo: Cell that participates_in some (EndocrineProcess or (part_of some EndocrineProcess)
  • 43. What We Found • More subsumption relationships • The “is-a” hierarchy is complete • Explicitness made us ask questions • Found bad structure • Can just slip in a new cell • Can make arbitrary queries based on any of the types of axis
  • 44. Conclusions • Can use strict semantics and automated reasoning to build structurally sound ontologies • Can catalogue instances and make discoveries • If an object can be recognised by its features and features can be computationally generated classification can be automated • High cost and high benefit
  • 45. Acknowledgements • Katy Wolstencroft did the protein phosphtase work as part of her Ph.D. • The work on the cell type ontology was udnertaken by members of the EPSRC OntoGenesis Network • All the ontoogy work at Manchester relies on the support and input of the wider BioHealth and Information Management Groups

Editor's Notes

  1. All of which helps build better ontologies. But can we actually apply this computational amenability more Directly to biological knowledge. In this example, which is work by Katy Wolstencroft, we have codified Community knowledge about protein domains in phosphatases in OWL. We then take unknown protein sequences, Pass then through interpro and stick them into the instance store, which is basically a database and reasoner tied together Qualified Cardiniality!!!
  2. Showing cell by function, cell by histology etc….
  3. Tangled on left, arrow moving to picture of untangled ontology
  4. Shows the asserted hierarchy in OWLViz for epiniephrin and prolactin cell (primitives) as well as Secretory and Endocrine Cell (defined)
  5. Shows the inferred hierarchy (all parents) for epiniephrin and prolactin cell as well as defined secretory and endocrine Cell. Shows lots of multiple inheritance inferred by the reasoner.