Ontology Learning From Text?
Robert Stevens
BioHealth Informatics Group
School of Computer Science
University of Manchester
Robert.stevens@manchester.ac.uk
Introduction
• Can we use ontology learning to build
ontologies?
• Not text-mining research, but ontology
research
• What is ontology learning from text?
• The questions we posed
• The experiment we performed
• The results we obtained
• The conclusions we made
Ontology learning
• Text2Onto: http://ontoware.org/projects/text2onto/
• “The erythrocytes are the blood cells that carry
oxygen to others cells in the body”
• “Lymphocytes, leukocytes, monocytes, phagocytes
and granulocytes are all kinds of white blood cell”
• “These experiments show that the individual
hemopoietic stem cell is a multipotent cell and can
give rise to the complete range of blood cell types,
both myeloid and lymphoid, as well as new stem cells
like itself.”
Ontology Learning
• Blood Cell
– Erythrocyte
– White Blood Cell: Monocyte, Leukocyte, Lymphocyte, Phagocyte, Granulocyte
• Multipotent Stem Cell
– Hemopoietic Stem Cell
• Relation: arise from
Text to Ontology “Workflow”
• Corpus
• Tokenising / sentence splitting
• Part-Of-Speech (POS) tagging
• Lemmatizing / stemming
• JAPE transducer annotates corpus
• Text2Onto algorithms for extracting modelling primitives
• Text2Onto meta-ontology
• Promotion to OWL ontology
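The first steps of the workflow above can be sketched in a few lines of Python. This is only an illustration with naive stand-ins, not the GATE components that Text2Onto uses: the function names (`split_sentences`, `tokenise`, `lemmatise`, `preprocess`) are hypothetical, the lemmatiser merely strips a trailing plural "s", and POS tagging is left out because it needs a trained tagger.

```python
import re


def split_sentences(text):
    """Naive sentence splitter: break after '.', '!' or '?' followed by whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def tokenise(sentence):
    """Return word tokens, keeping internal hyphens and dropping punctuation."""
    return re.findall(r"[A-Za-z0-9]+(?:-[A-Za-z0-9]+)*", sentence)


def lemmatise(token):
    """Crude plural stripping; a stand-in for a real lemmatiser or stemmer."""
    lower = token.lower()
    if (len(lower) > 3 and lower.endswith("s")
            and not lower.endswith("ss") and lower[-2].isalpha()):
        return lower[:-1]
    return lower


def preprocess(corpus):
    """Corpus -> list of sentences, each a list of lemmatised tokens."""
    return [[lemmatise(t) for t in tokenise(s)] for s in split_sentences(corpus)]


if __name__ == "__main__":
    text = "The erythrocytes are the blood cells. CFU-S is a blood stem cell."
    for sentence in preprocess(text):
        print(sentence)
```

The output of such a stage is what a pattern rule (JAPE in the real workflow) would then annotate.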
Extracting Patterns from Text
Sentence: “CFU-S is a blood stem cell”
Part of Speech (POS) Tagging: CFU-S[NNP] is[VBN] a[DT] blood[NN] stem[NN] cell[NN]
Key: NN=noun; DT=determiner; NNP=proper noun; VBN = verb past participle.
Pseudo JAPE rule: Any series of nouns (A) followed by the string “ is a ” followed by a series of nouns (B)
Ontological assertions: A and B are concepts, A is a subclass of B
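The pseudo JAPE rule can be mimicked over a list of (word, tag) pairs. The sketch below is plain Python rather than JAPE, and the names `NOUN_TAGS`, `noun_run` and `extract_is_a` are hypothetical; it assumes the tagger output has already been turned into tuples.

```python
NOUN_TAGS = {"NN", "NNS", "NNP", "NNPS"}


def noun_run(tagged, start, step):
    """Collect consecutive noun tokens from `start`, moving by `step` (+1 or -1)."""
    words = []
    i = start
    while 0 <= i < len(tagged) and tagged[i][1] in NOUN_TAGS:
        words.append(tagged[i][0])
        i += step
    # A backwards scan collects words in reverse order, so restore reading order.
    return words if step > 0 else words[::-1]


def extract_is_a(tagged):
    """Return (A, B) pairs: noun series A, the string 'is a', noun series B."""
    pairs = []
    for i in range(len(tagged) - 1):
        if tagged[i][0].lower() == "is" and tagged[i + 1][0].lower() == "a":
            a = noun_run(tagged, i - 1, -1)
            b = noun_run(tagged, i + 2, +1)
            if a and b:
                pairs.append((" ".join(a), " ".join(b)))
    return pairs


if __name__ == "__main__":
    sentence = [("CFU-S", "NNP"), ("is", "VBN"), ("a", "DT"),
                ("blood", "NN"), ("stem", "NN"), ("cell", "NN")]
    for a, b in extract_is_a(sentence):
        print(f"{a} is a subclass of {b}")
```

Each returned pair corresponds to the assertions on the slide: A and B are concepts, and A is a subclass of B.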
Text2Onto meta-ontology
Some Text2Onto Instances
• Instance: Astrocyte_c
– typeOf: Concept that
– Fact: confidence VALUE 1.0
• Instance: AstrocyteNerveCell
– typeOf: Subclass that
– Fact: domain VALUE NerveCell and
– Fact: range VALUE Astrocyte and
– Fact: confidence VALUE 1.0
The Questions We Asked
• Can we press the button and get a
good ontology?
• If not, can we get something useful?
• Can we do it without having to write too
many rules?
• Does the end-point act as a donor or
recipient ontology?
Strategy
• Collect corpus
• Manually mark up text for cells: definitive list
of terms
• Process corpus through T2O
• Analyse output of T2O for recall and precision
of terms and hierarchy
• Iterate the previous two steps with variants of
the rules
• Evaluation against CTO gold standard
The Experimental Conditions
1. Default T2O
2. T2O plus cell-specific JAPE rules and all algorithms
3. Only cell-specific JAPE rules, /EntropyExtraction Algorithm and some “hierarchy spotting” based on term composition
4. Same as 3, but with VerticalRelationsConceptClassification to include our simple JAPE rules
5. Same as 4, but with WordConceptClassification
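The slides do not spell out how “hierarchy spotting” based on term composition was done. One plausible reading, sketched below as an assumption, is that a multi-word term becomes a child of the longest known term formed by dropping its leading modifiers. The function name `spot_hierarchy` is hypothetical.

```python
def spot_hierarchy(terms):
    """Map each term to its longest proper suffix that is also a known term."""
    known = {t.lower() for t in terms}
    parents = {}
    for term in sorted(known):
        words = term.split()
        # Drop leading modifiers one at a time; the first hit is the longest suffix.
        for i in range(1, len(words)):
            candidate = " ".join(words[i:])
            if candidate in known:
                parents[term] = candidate
                break
    return parents


if __name__ == "__main__":
    terms = ["cell", "stem cell", "blood cell",
             "white blood cell", "hemopoietic stem cell"]
    for child, parent in spot_hierarchy(terms).items():
        print(f"{child} -> {parent}")
```

Under this reading, “white blood cell” is placed under “blood cell”, which in turn sits under “cell”, without any sentence in the corpus having to state the relation.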
Rules for Extracting Cell Types
• Words ending in ‘cyte’, ‘blast’, ‘cell’, ‘glia’, ‘glium’, ‘cell type’, ‘cell line’
and ‘cell lineage’ (together with their plurals)
• Zero or more adjectives followed by zero or more nouns or proper
nouns followed by a ‘cell word’ (together with plural), e.g. ‘renshaw cell’,
‘Muller cell’, ‘immature blood cell’, etc.
• Any stem cell term is a Stem Cell.
• Any term ending with ‘progenitor cell’ is a Progenitor Cell.
• Any term ending with ‘precursor cell’ is a Precursor Cell.
• Any term ending in ‘blast’ is a Blast Cell.
• Any term ending with ‘cyte’ or ‘cell’ is a Differentiated Cell.
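The suffix rules above translate almost directly into code. The sketch below is plain Python rather than JAPE and uses hypothetical names (`CELL_WORD`, `is_cell_word`, `singular`, `classify`). It makes one assumption the slide does not state: the more specific rules are tried first, so that a stem cell term is not also caught by the generic ‘cell’ ending. The adjective and noun pattern is left out because it needs POS tags.

```python
import re

# Single-word 'cell words' from the slide, with an optional plural "s".
CELL_WORD = re.compile(r"(cyte|blast|cell|glia|glium)s?$", re.IGNORECASE)


def is_cell_word(word):
    """True if the word ends in one of the cell-type suffixes."""
    return bool(CELL_WORD.search(word))


def singular(term):
    """Lower-case the term and strip a simple trailing plural 's'."""
    t = term.lower().strip()
    return t[:-1] if t.endswith("s") and not t.endswith("ss") else t


def classify(term):
    """Return the parent class suggested by the term's ending, or None."""
    t = singular(term)
    if t.endswith("stem cell"):
        return "Stem Cell"
    if t.endswith("progenitor cell"):
        return "Progenitor Cell"
    if t.endswith("precursor cell"):
        return "Precursor Cell"
    if t.endswith("blast"):
        return "Blast Cell"
    if t.endswith("cyte") or t.endswith("cell"):
        return "Differentiated Cell"
    return None


if __name__ == "__main__":
    for term in ["multipotent stem cells", "erythroblast",
                 "Astrocyte", "Spermatogonia"]:
        print(term, "->", classify(term))
```

A term such as “Spermatogonia” falls through every rule and returns `None`, which matches the “no rule to extract” case reported later in the talk.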
Evaluation Strategy
• Extraction performance
• Ontology evaluation
• Domain coverage
• Expert evaluation
Term Recognition
• 1,277 terms in our definitive list
• 16,384 terms from whole corpus; 625
relevant
• Increase to 17,851 and 916
• All 118 CTO terms in corpus recalled
• Corpus has anatomical bias
• Simple rules exploit regularity of language
• Many false positives from the adjective-noun rule
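For reference, term precision and recall can be computed as below. The arithmetic at the end rests on an assumption about how to read the slide: that the “relevant” counts (625, then 916) are extracted terms that also appear in the definitive list of 1,277. The function names are hypothetical.

```python
def precision_recall(extracted, reference):
    """Precision and recall of a set of extracted terms against a reference set."""
    extracted, reference = set(extracted), set(reference)
    hits = len(extracted & reference)
    precision = hits / len(extracted) if extracted else 0.0
    recall = hits / len(reference) if reference else 0.0
    return precision, recall


def precision_recall_from_counts(hits, n_extracted, n_reference):
    """Same measures when only the counts are known."""
    return hits / n_extracted, hits / n_reference


if __name__ == "__main__":
    # Counts from the slide, under the assumption stated in the lead-in.
    for label, hits, n_extracted in [("first run", 625, 16384),
                                     ("after increase", 916, 17851)]:
        p, r = precision_recall_from_counts(hits, n_extracted, 1277)
        print(f"{label}: precision={p:.3f} recall={r:.3f}")
```

Read this way, recall rises from roughly 0.49 to roughly 0.72 while precision stays low at about 0.04 to 0.05, since most of the terms extracted from the whole corpus are not cell terms.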
Cell Terms
• Morphology: Stellate cell; columnar cell
• Ploidy: Tetraploid cell; multiploid cell
• Maturity
• Potentiality: Totipotent stem cell; multipotent cell
• Lineage: Mesoderm cell
• Species origin: Animal cell; human cell
• Anatomical location
• Developmental stage: Mitotic cell; S-phase cell
Common errors
Terms manually extracted from the corpus compared with terms automatically extracted from the corpus:
• “+t - cell”: symbols not handled very well
• “contains cell”: false-positive cell type
• “Foam cell”: new cell type extracted
• “leukocyte” (manual) vs “leucocyte” (automatic): spelling errors in corpus
• “naïve cell” (manual) vs “nave cell” (automatic): character encoding problem
• “Spermatogonia”: no rule to extract
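Some of these mismatches could be reduced by normalising terms before the manual and automatic lists are compared. The sketch below is a suggestion, not something the talk reports doing: `normalise_term` and the hand-built `SPELLING_VARIANTS` map are hypothetical. It folds case and whitespace, strips diacritics with the standard `unicodedata` module, and maps known spelling variants to one form.

```python
import unicodedata

# Hypothetical, hand-built map of spelling variants to a preferred form.
SPELLING_VARIANTS = {"leucocyte": "leukocyte"}


def normalise_term(term):
    """Lower-case, collapse whitespace, strip diacritics, unify known variants."""
    decomposed = unicodedata.normalize("NFKD", term)
    stripped = "".join(c for c in decomposed if not unicodedata.combining(c))
    cleaned = " ".join(stripped.lower().split())
    return SPELLING_VARIANTS.get(cleaned, cleaned)


if __name__ == "__main__":
    for term in ["naïve cell", "Leucocyte", "  Foam   cell "]:
        print(repr(term), "->", repr(normalise_term(term)))
```

Normalisation of this kind cannot recover a character that has already been lost, as in “nave cell”; that has to be fixed where the corpus is decoded.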
Term Recall and Precision
Default learnt ontology
Final learnt ontology
Still not perfect!
Ontology evaluation
Learnt Ontology under CTO
Discussion
• Exploiting poor performance to focus learning
• Exploiting regularity of language
• Never really going to find CTO domain
general layer
• Terms highly compositional and conflate axes
• Ask the question “is it useful?” not “is it
good?”
• Is CTO a good standard?
• The extracted hierarchy was not bad from a
cell biology and ontological point of view
Nascent Methodology
• Form a corpus that includes, but is not limited
to, the scope of the target ontology
• Extract terms from corpus
• Filter and massage list of terms to find those
of ontological interest
• Use ontology learning to see what happens
• Inspect and augment rules to recognise and
incorporate into hierarchy
• Iterate
• Use as donor ontology to transfer
useful bits to recipient ontology
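Conclusions
• Can we press the button and get a good ontology? No
• If not, can we get something useful? Yes
• Can we do it without having to write too many rules? Yes
• Does the end-point act as a donor or recipient ontology? Donor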
Acknowledgements
• Simon Jupp has done the work
• Jaclyn Bibby MSc Project prototype
• Johanna Volker for help with Text2Onto
• David Shotton for knowledge about cell
biology

More Related Content

Similar to Ontology Learning From Text Using Text2Onto

The Past, Present and Future of Knowledge in Biology
The Past, Present and Future of Knowledge in BiologyThe Past, Present and Future of Knowledge in Biology
The Past, Present and Future of Knowledge in Biologyrobertstevens65
 
Collaborative Ontology building: So much more than authoring an Ontology
Collaborative Ontology building: So much more than authoring an Ontology Collaborative Ontology building: So much more than authoring an Ontology
Collaborative Ontology building: So much more than authoring an Ontology robertstevens65
 
Biological Basis of Oral Sciences .pptx
Biological Basis of Oral Sciences .pptxBiological Basis of Oral Sciences .pptx
Biological Basis of Oral Sciences .pptxKENWONGSIONGHOU
 
Mitosis And Meiosis
Mitosis And MeiosisMitosis And Meiosis
Mitosis And Meiosismsu
 
Biology Cell transport and cell cycle 12 / 06 / 12 Thursday
 Biology Cell transport and cell cycle 12 / 06 / 12 Thursday Biology Cell transport and cell cycle 12 / 06 / 12 Thursday
Biology Cell transport and cell cycle 12 / 06 / 12 Thursdaymrhunterspage
 
Cell Division-TST3B-BL Molumo.pptx
Cell Division-TST3B-BL Molumo.pptxCell Division-TST3B-BL Molumo.pptx
Cell Division-TST3B-BL Molumo.pptxBenjamin Molumo
 
Anatomy and Physiology Cell Transport and The Cell Cycle
Anatomy and Physiology Cell Transport and The Cell CycleAnatomy and Physiology Cell Transport and The Cell Cycle
Anatomy and Physiology Cell Transport and The Cell Cyclemrhunterspage
 
Cells and Cell Transports
Cells and Cell TransportsCells and Cell Transports
Cells and Cell Transportsmszeron
 
Ewan Birney Biocuration 2013
Ewan Birney Biocuration 2013Ewan Birney Biocuration 2013
Ewan Birney Biocuration 2013Iddo
 
Stem cell and its clinical implications
Stem cell and its clinical implicationsStem cell and its clinical implications
Stem cell and its clinical implicationsDRx.Yogesh Chaudhari
 
Stages of mitotic cell cycle A Level Biology
Stages of mitotic cell cycle A Level BiologyStages of mitotic cell cycle A Level Biology
Stages of mitotic cell cycle A Level Biologysriwidowati10
 
Lecture#01 (Cell structure and function).pptx
Lecture#01 (Cell structure and function).pptxLecture#01 (Cell structure and function).pptx
Lecture#01 (Cell structure and function).pptxSabaMahmood22
 
Cell cycle & cell division
Cell cycle & cell divisionCell cycle & cell division
Cell cycle & cell divisiondebasish prusty
 
CELL CYCLE, MITOSIS & MEIOSIS SMG
CELL CYCLE, MITOSIS & MEIOSIS   SMGCELL CYCLE, MITOSIS & MEIOSIS   SMG
CELL CYCLE, MITOSIS & MEIOSIS SMGsajigeorge64
 
Lecture_1_The_cell_is_a_structural_functional_unit_of_life..pptx
Lecture_1_The_cell_is_a_structural_functional_unit_of_life..pptxLecture_1_The_cell_is_a_structural_functional_unit_of_life..pptx
Lecture_1_The_cell_is_a_structural_functional_unit_of_life..pptxAnkitSingh550318
 
Cellstructure 111113162625-phpapp02
Cellstructure 111113162625-phpapp02Cellstructure 111113162625-phpapp02
Cellstructure 111113162625-phpapp02bajuar
 
L1 Introduction to cells.pptx
L1 Introduction to cells.pptxL1 Introduction to cells.pptx
L1 Introduction to cells.pptxAbdulkarim803288
 

Similar to Ontology Learning From Text Using Text2Onto (20)

The Past, Present and Future of Knowledge in Biology
The Past, Present and Future of Knowledge in BiologyThe Past, Present and Future of Knowledge in Biology
The Past, Present and Future of Knowledge in Biology
 
Collaborative Ontology building: So much more than authoring an Ontology
Collaborative Ontology building: So much more than authoring an Ontology Collaborative Ontology building: So much more than authoring an Ontology
Collaborative Ontology building: So much more than authoring an Ontology
 
Biological Basis of Oral Sciences .pptx
Biological Basis of Oral Sciences .pptxBiological Basis of Oral Sciences .pptx
Biological Basis of Oral Sciences .pptx
 
Mitosis And Meiosis
Mitosis And MeiosisMitosis And Meiosis
Mitosis And Meiosis
 
Biology Cell transport and cell cycle 12 / 06 / 12 Thursday
 Biology Cell transport and cell cycle 12 / 06 / 12 Thursday Biology Cell transport and cell cycle 12 / 06 / 12 Thursday
Biology Cell transport and cell cycle 12 / 06 / 12 Thursday
 
Gcse biology 9 - 1
Gcse biology 9 - 1Gcse biology 9 - 1
Gcse biology 9 - 1
 
The Cell
The CellThe Cell
The Cell
 
Cell Division-TST3B-BL Molumo.pptx
Cell Division-TST3B-BL Molumo.pptxCell Division-TST3B-BL Molumo.pptx
Cell Division-TST3B-BL Molumo.pptx
 
Anatomy and Physiology Cell Transport and The Cell Cycle
Anatomy and Physiology Cell Transport and The Cell CycleAnatomy and Physiology Cell Transport and The Cell Cycle
Anatomy and Physiology Cell Transport and The Cell Cycle
 
Cells and Cell Transports
Cells and Cell TransportsCells and Cell Transports
Cells and Cell Transports
 
Ewan Birney Biocuration 2013
Ewan Birney Biocuration 2013Ewan Birney Biocuration 2013
Ewan Birney Biocuration 2013
 
Stem cell and its clinical implications
Stem cell and its clinical implicationsStem cell and its clinical implications
Stem cell and its clinical implications
 
Stages of mitotic cell cycle A Level Biology
Stages of mitotic cell cycle A Level BiologyStages of mitotic cell cycle A Level Biology
Stages of mitotic cell cycle A Level Biology
 
Lecture#01 (Cell structure and function).pptx
Lecture#01 (Cell structure and function).pptxLecture#01 (Cell structure and function).pptx
Lecture#01 (Cell structure and function).pptx
 
Cellular life
Cellular lifeCellular life
Cellular life
 
Cell cycle & cell division
Cell cycle & cell divisionCell cycle & cell division
Cell cycle & cell division
 
CELL CYCLE, MITOSIS & MEIOSIS SMG
CELL CYCLE, MITOSIS & MEIOSIS   SMGCELL CYCLE, MITOSIS & MEIOSIS   SMG
CELL CYCLE, MITOSIS & MEIOSIS SMG
 
Lecture_1_The_cell_is_a_structural_functional_unit_of_life..pptx
Lecture_1_The_cell_is_a_structural_functional_unit_of_life..pptxLecture_1_The_cell_is_a_structural_functional_unit_of_life..pptx
Lecture_1_The_cell_is_a_structural_functional_unit_of_life..pptx
 
Cellstructure 111113162625-phpapp02
Cellstructure 111113162625-phpapp02Cellstructure 111113162625-phpapp02
Cellstructure 111113162625-phpapp02
 
L1 Introduction to cells.pptx
L1 Introduction to cells.pptxL1 Introduction to cells.pptx
L1 Introduction to cells.pptx
 

More from robertstevens65

Ontologies: Necessary, but not sufficient
Ontologies: Necessary, but not sufficientOntologies: Necessary, but not sufficient
Ontologies: Necessary, but not sufficientrobertstevens65
 
The Pragmatics and Formality of Authoring OntologiesOdsl 2016
The Pragmatics and Formality of Authoring OntologiesOdsl 2016The Pragmatics and Formality of Authoring OntologiesOdsl 2016
The Pragmatics and Formality of Authoring OntologiesOdsl 2016robertstevens65
 
The Quality of Method Reporting in
The Quality of Method Reporting in The Quality of Method Reporting in
The Quality of Method Reporting in robertstevens65
 
The Semantics of Genomic Analysis
The Semantics of  Genomic AnalysisThe Semantics of  Genomic Analysis
The Semantics of Genomic Analysisrobertstevens65
 
Issues and activities in authoring ontologies
Issues and activities in authoring ontologiesIssues and activities in authoring ontologies
Issues and activities in authoring ontologiesrobertstevens65
 
The state of the nation for ontology development
The state of the nation for ontology developmentThe state of the nation for ontology development
The state of the nation for ontology developmentrobertstevens65
 
Building and Using Ontologies to do biology
Building and Using Ontologies to do biologyBuilding and Using Ontologies to do biology
Building and Using Ontologies to do biologyrobertstevens65
 
Properties and Individuals in OWL: Reasoning About Family History
Properties and Individuals in OWL: Reasoning About Family HistoryProperties and Individuals in OWL: Reasoning About Family History
Properties and Individuals in OWL: Reasoning About Family Historyrobertstevens65
 
Choosing and Building Knowledge Artefacts
Choosing and Building Knowledge ArtefactsChoosing and Building Knowledge Artefacts
Choosing and Building Knowledge Artefactsrobertstevens65
 
Populous: A tool for Populating OWL Ontologies from Templates
Populous: A tool for Populating OWL Ontologies from TemplatesPopulous: A tool for Populating OWL Ontologies from Templates
Populous: A tool for Populating OWL Ontologies from Templatesrobertstevens65
 
Keeping ontology development Agile
Keeping ontology development AgileKeeping ontology development Agile
Keeping ontology development Agilerobertstevens65
 
Lessons from teaching non-computer scientists OWL and ontologies
Lessons from teaching non-computer scientists OWL and ontologiesLessons from teaching non-computer scientists OWL and ontologies
Lessons from teaching non-computer scientists OWL and ontologiesrobertstevens65
 
Kidney and Urinary Pathways Knowledge Base (part of e-LICO)
Kidney and Urinary Pathways Knowledge Base (part of e-LICO)Kidney and Urinary Pathways Knowledge Base (part of e-LICO)
Kidney and Urinary Pathways Knowledge Base (part of e-LICO)robertstevens65
 
A Rose by Any Other Name is Still a Rose
A Rose by Any Other Name is Still a RoseA Rose by Any Other Name is Still a Rose
A Rose by Any Other Name is Still a Roserobertstevens65
 
The Big Picture: The Industrial Revolutiona talk in berlin, 2008, about indus...
The Big Picture: The Industrial Revolutiona talk in berlin, 2008, about indus...The Big Picture: The Industrial Revolutiona talk in berlin, 2008, about indus...
The Big Picture: The Industrial Revolutiona talk in berlin, 2008, about indus...robertstevens65
 
Knowledge Management in a Knowledge Based Discipline
Knowledge Management in a Knowledge Based DisciplineKnowledge Management in a Knowledge Based Discipline
Knowledge Management in a Knowledge Based Disciplinerobertstevens65
 
A family History Knowledge Base in OWL 2
A family History Knowledge Base in OWL 2A family History Knowledge Base in OWL 2
A family History Knowledge Base in OWL 2robertstevens65
 
RIO: The Regularities Inspector for Ontologies Plugin for Protégé 4
RIO: The Regularities Inspector for Ontologies Plugin for Protégé 4 RIO: The Regularities Inspector for Ontologies Plugin for Protégé 4
RIO: The Regularities Inspector for Ontologies Plugin for Protégé 4 robertstevens65
 

More from robertstevens65 (20)

Ontologies: Necessary, but not sufficient
Ontologies: Necessary, but not sufficientOntologies: Necessary, but not sufficient
Ontologies: Necessary, but not sufficient
 
The Pragmatics and Formality of Authoring OntologiesOdsl 2016
The Pragmatics and Formality of Authoring OntologiesOdsl 2016The Pragmatics and Formality of Authoring OntologiesOdsl 2016
The Pragmatics and Formality of Authoring OntologiesOdsl 2016
 
The Quality of Method Reporting in
The Quality of Method Reporting in The Quality of Method Reporting in
The Quality of Method Reporting in
 
The Semantics of Genomic Analysis
The Semantics of  Genomic AnalysisThe Semantics of  Genomic Analysis
The Semantics of Genomic Analysis
 
Issues and activities in authoring ontologies
Issues and activities in authoring ontologiesIssues and activities in authoring ontologies
Issues and activities in authoring ontologies
 
The state of the nation for ontology development
The state of the nation for ontology developmentThe state of the nation for ontology development
The state of the nation for ontology development
 
Building and Using Ontologies to do biology
Building and Using Ontologies to do biologyBuilding and Using Ontologies to do biology
Building and Using Ontologies to do biology
 
Properties and Individuals in OWL: Reasoning About Family History
Properties and Individuals in OWL: Reasoning About Family HistoryProperties and Individuals in OWL: Reasoning About Family History
Properties and Individuals in OWL: Reasoning About Family History
 
Choosing and Building Knowledge Artefacts
Choosing and Building Knowledge ArtefactsChoosing and Building Knowledge Artefacts
Choosing and Building Knowledge Artefacts
 
Populous: A tool for Populating OWL Ontologies from Templates
Populous: A tool for Populating OWL Ontologies from TemplatesPopulous: A tool for Populating OWL Ontologies from Templates
Populous: A tool for Populating OWL Ontologies from Templates
 
Keeping ontology development Agile
Keeping ontology development AgileKeeping ontology development Agile
Keeping ontology development Agile
 
Spreadsheets to OWL
Spreadsheets to OWLSpreadsheets to OWL
Spreadsheets to OWL
 
Lessons from teaching non-computer scientists OWL and ontologies
Lessons from teaching non-computer scientists OWL and ontologiesLessons from teaching non-computer scientists OWL and ontologies
Lessons from teaching non-computer scientists OWL and ontologies
 
Kidney and Urinary Pathways Knowledge Base (part of e-LICO)
Kidney and Urinary Pathways Knowledge Base (part of e-LICO)Kidney and Urinary Pathways Knowledge Base (part of e-LICO)
Kidney and Urinary Pathways Knowledge Base (part of e-LICO)
 
A Rose by Any Other Name is Still a Rose
A Rose by Any Other Name is Still a RoseA Rose by Any Other Name is Still a Rose
A Rose by Any Other Name is Still a Rose
 
The Big Picture: The Industrial Revolutiona talk in berlin, 2008, about indus...
The Big Picture: The Industrial Revolutiona talk in berlin, 2008, about indus...The Big Picture: The Industrial Revolutiona talk in berlin, 2008, about indus...
The Big Picture: The Industrial Revolutiona talk in berlin, 2008, about indus...
 
Knowledge Management in a Knowledge Based Discipline
Knowledge Management in a Knowledge Based DisciplineKnowledge Management in a Knowledge Based Discipline
Knowledge Management in a Knowledge Based Discipline
 
Ontology at Manchester
Ontology at ManchesterOntology at Manchester
Ontology at Manchester
 
A family History Knowledge Base in OWL 2
A family History Knowledge Base in OWL 2A family History Knowledge Base in OWL 2
A family History Knowledge Base in OWL 2
 
RIO: The Regularities Inspector for Ontologies Plugin for Protégé 4
RIO: The Regularities Inspector for Ontologies Plugin for Protégé 4 RIO: The Regularities Inspector for Ontologies Plugin for Protégé 4
RIO: The Regularities Inspector for Ontologies Plugin for Protégé 4
 

Recently uploaded

Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...RohitNehra6
 
G9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.pptG9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.pptMAESTRELLAMesa2
 
Types of different blotting techniques.pptx
Types of different blotting techniques.pptxTypes of different blotting techniques.pptx
Types of different blotting techniques.pptxkhadijarafiq2012
 
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsHubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsSérgio Sacani
 
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptxUnlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptxanandsmhk
 
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service 🪡
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service  🪡CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service  🪡
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service 🪡anilsa9823
 
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisRaman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisDiwakar Mishra
 
Isotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoIsotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoSérgio Sacani
 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Lokesh Kothari
 
Cultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptxCultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptxpradhanghanshyam7136
 
Orientation, design and principles of polyhouse
Orientation, design and principles of polyhouseOrientation, design and principles of polyhouse
Orientation, design and principles of polyhousejana861314
 
Is RISC-V ready for HPC workload? Maybe?
Is RISC-V ready for HPC workload? Maybe?Is RISC-V ready for HPC workload? Maybe?
Is RISC-V ready for HPC workload? Maybe?Patrick Diehl
 
Formation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksFormation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksSérgio Sacani
 
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRStunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRDelhi Call girls
 
Analytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptxAnalytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptxSwapnil Therkar
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)PraveenaKalaiselvan1
 
Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )aarthirajkumar25
 
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.aasikanpl
 
Physiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptxPhysiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptxAArockiyaNisha
 

Recently uploaded (20)

Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...
 
G9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.pptG9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.ppt
 
Types of different blotting techniques.pptx
Types of different blotting techniques.pptxTypes of different blotting techniques.pptx
Types of different blotting techniques.pptx
 
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsHubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
 
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptxUnlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
 
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service 🪡
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service  🪡CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service  🪡
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service 🪡
 
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisRaman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
 
Isotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoIsotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on Io
 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
 
Cultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptxCultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptx
 
Orientation, design and principles of polyhouse
Orientation, design and principles of polyhouseOrientation, design and principles of polyhouse
Orientation, design and principles of polyhouse
 
Is RISC-V ready for HPC workload? Maybe?
Is RISC-V ready for HPC workload? Maybe?Is RISC-V ready for HPC workload? Maybe?
Is RISC-V ready for HPC workload? Maybe?
 
Formation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksFormation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disks
 
CELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdfCELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdf
 
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRStunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
 
Analytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptxAnalytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptx
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)
 
Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )
 
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
 
Physiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptxPhysiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptx
 

Ontology Learning From Text Using Text2Onto

  • 1. Ontology Learning From Text? Robert Stevens BioHealth Informatics Group School of Computer Science University of Manchester Robert.stevens@manchester.ac.uk
  • 2. Introduction • Can we use ontology learning to build ontologies? • Not text-mining research, but ontology research • What is ontology learning from text? • The questions we posed • The experiment we performed • The results we obtained • The conclusions we made
  • 3. Ontology learning • Text2Onto: http://ontoware .org/projects/text2onto/ • “The erythrocytes are the blood cells that carry oxygen to others cells in the body” • “Lymphocytes, leukocytes, monocytes, phagocytes and granulocytes are all kinds of white blood cell” • “These experiments show that the individual hemopoietic stem cell is a multipotent cell and can give rise to the complete range of blood cell types, both myeloid and lymphoid, as well as new stem cells like itself.”
  • 4. Ontology Learning Blood Cell Erythrocyte White Blood Cell Monocyte Leukocyte Lymphocyte Phagocyte Granulocyte Multipotent Stem Cell Hemopoietic Stem Cell arise from
  • 5. Text to Ontology “Workflow” Corpus Tokenising / Sentence splitting Part-Of-Speech (POS) tagging Lemmatizing / Stemming JAPE transducer annotates corpus Text2Onto Algorithms for extracting modeling primitive Text2Onto meta-ontology Promotion to OWL ontology
  • 6. Extracting Patterns from Text “CFU-S is a blood stem cell” CFU-S[NNP] is[VBN] a[DT] blood[NN] stem[NN] cell[NN] Sentence: Part of Speech (POS) Tagging: Pseudo JAPE rule: Any series of nouns (A) followed by the string “ is a ” followed by series of nouns (B) Key: NN=noun; DT=determiner; NNP=proper noun; VBN = verb past participle. Ontological assertions: A and B are concepts, A is a subclass of B
  • 8. Some Text2Onto Instances • Instance: Astrocyte_c – typeOf: Concept that – Fact: confidence VALUE 1.0 Instance: AstrocycteNerveCell TypeOf: Subclass that Fact: domain VALUE NerveCell and FACT: Range VALUE Astrocyte and Fact: confidence VALUE 1.0
  • 9. The Questions We Asked • Can we press the button and get a good ontology? • If not, can we get something useful? • Can we do it without having to write too many rules? • Does the end-point act as as a donor or recipient ontology?
  • 10. Strategy • Collect corpus • Manually markup text for cells: Definitive list of terms • Process corpus through T2O • Analyse output of T2O for recall and precision of terms and hierarchy • Iteration of previous two step with variants in rules • Evaluation against CTO gold standard
  • 11. The Experimental Conditions • Default T2O • T2O plus cell-specific JAPE rules and all algorithms • Only cell-specific JAPE rules, /EntropyExtraction Algorithm and some “hierarchy spotting” based on term composition • Same as condition 3, but with VerticalRelationsConceptClassification to include our simple JAPE rules • Same as condition 4, but with WordConceptClassification
  • 12. Rules for Extracting Cell Types • Words ending in ‘cyte’, ‘blast’, ‘cell’, ‘glia’, ‘glium’, ‘cell type’, ‘cell line’ and ‘cell lineage’ (together with their plurals) • Zero or more adjectives followed by zero or more nouns or proper nouns followed by a ‘cell word’ (together with its plural), e.g. ‘renshaw cell’, ‘Muller cell’, ‘immature blood cell’, etc. • Any stem cell term is a Stem Cell. • Any term ending with ‘progenitor cell’ is a Progenitor Cell. • Any term ending with ‘precursor cell’ is a Precursor Cell. • Any term ending in ‘blast’ is a Blast Cell. • Any term ending with ‘cyte’ or ‘cell’ is a Differentiated Cell.
  • 13. Evaluation Strategy • Extraction performance • Ontology evaluation • Domain coverage • Expert evaluation
  • 14. Term Recognition • 1,277 terms in our definitive list • 16,384 terms extracted from whole corpus; 625 relevant • Increased to 17,851 terms extracted; 916 relevant • All 118 CTO terms in corpus recalled • Corpus has anatomical bias • Simple rules exploit regularity of language • Many false positives from adjective-noun rule
  • 15. Cell Terms • Morphology: Stellate cell; columnar cell • Ploidy: Tetraploid cell; multiploid cell • Maturity • Potentiality: Totipotent stem cell; multipotent cell • Lineage • Species origin: Animal cell; human cell • Anatomical location • Developmental stage: Mitotic cell; S-phase cell • Lineage: Mesoderm cell
  • 16. Common errors (table columns: manually extracted from corpus / automatically extracted from corpus / comments) • +t - cell: symbols not handled very well • contains cell: false-positive cell type • Foam cell: new cell type extracted • leukocyte / leucocyte: spelling errors in corpus • naïve cell / nave cell: character encoding problem • Spermatogonia: no rule to extract
  • 17. Term Recall and Precision
  • 22. Discussion • Exploiting poor performance to focus learning • Exploiting regularity of language • Never really going to find the CTO domain-general layer • Terms are highly compositional and conflate axes • Ask the question “is it useful?”, not “is it good?” • Is CTO a good standard? • The extracted hierarchy was not bad from a cell biology and ontological point of view
  • 23. Nascent Methodology • Form a corpus that includes, but is not limited to, the scope of the target ontology • Extract terms from corpus • Filter and massage list of terms to find those of ontological interest • Use ontology learning to see what happens • Inspect and augment rules to recognise terms and incorporate them into the hierarchy • Iterate • Use as donor ontology to transfer useful bits to recipient ontology
  • 25. Acknowledgements • Simon Jupp has done the work • Jaclyn Bibby MSc Project prototype • Johanna Volker for help with Text2Onto • David Shotton for knowledge about cell biology
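
The pseudo JAPE rule on slide 6 and the suffix rules on slide 12 can be illustrated with a minimal Python sketch. This is not JAPE and not Text2Onto code: the function names (`parse_tagged`, `is_a_assertions`, `classify`), the inclusion of the plural tags NNS/NNPS, and the order in which the suffix rules are tried are all assumptions made for illustration, and plural forms of cell terms are not handled.

```python
import re

# Tags treated as nouns. NN and NNP come from the slide's key;
# NNS and NNPS (plural forms) are an added assumption.
NOUN_TAGS = {"NN", "NNS", "NNP", "NNPS"}


def parse_tagged(text):
    """Turn 'CFU-S[NNP] is[VBN] ...' into [('CFU-S', 'NNP'), ('is', 'VBN'), ...]."""
    return re.findall(r"(\S+)\[([A-Z]+)\]", text)


def is_a_assertions(tagged):
    """Find 'nouns (A) is a nouns (B)' and return (A, B) pairs, read as A subclass-of B."""
    pairs = []
    for i in range(1, len(tagged) - 2):
        if tagged[i][0].lower() != "is" or tagged[i + 1][0].lower() != "a":
            continue
        # Walk left over the run of nouns that ends just before "is".
        start = i
        while start > 0 and tagged[start - 1][1] in NOUN_TAGS:
            start -= 1
        # Walk right over the run of nouns that starts just after "a".
        end = i + 2
        while end < len(tagged) and tagged[end][1] in NOUN_TAGS:
            end += 1
        if start < i and end > i + 2:
            a = " ".join(word for word, _ in tagged[start:i])
            b = " ".join(word for word, _ in tagged[i + 2:end])
            pairs.append((a, b))
    return pairs


# Suffix rules from slide 12, tried in order; the first match wins.
# Putting the more specific suffixes first is an assumption of this sketch.
SUFFIX_RULES = [
    ("stem cell", "Stem Cell"),
    ("progenitor cell", "Progenitor Cell"),
    ("precursor cell", "Precursor Cell"),
    ("blast", "Blast Cell"),
    ("cyte", "Differentiated Cell"),
    ("cell", "Differentiated Cell"),
]


def classify(term):
    """Return the parent class suggested by the term's suffix, or None if no rule applies."""
    lowered = term.lower().strip()
    for suffix, parent in SUFFIX_RULES:
        if lowered.endswith(suffix):
            return parent
    return None


if __name__ == "__main__":
    tagged = parse_tagged("CFU-S[NNP] is[VBN] a[DT] blood[NN] stem[NN] cell[NN]")
    print(is_a_assertions(tagged))
    print(classify("blood stem cell"), classify("fibroblast"))
```

Purely lexical rules of this kind are cheap to write, which is the point of the "regularity of language" bullets, but they also place ‘fibroblast’ under Blast Cell, the error that the Editor's Notes flag for the final ontology.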

Editor's Notes

  1. Workflow slide, corpus > tokenising/sentence splitting > POS tagging > lemmatizing/stemming > JAPE transducer annotates corpus > Text2Onto algorithms extract modeling primitives > Text2Onto meta-ontology > Promotion to OWL ontology > ontology
  2. OWLViz image showing Text2Onto meta-ontology
  3. Graph of increasing term recall and precision over the 5 experimental conditions. Recall 50% -> 72%. Precision 4% -> 49%.
  4. OWLViz of default ontology. Shows that fibroblast is a cell, but also incorrectly asserts that fibroblast is a protein. Also shows some other junk terms like ‘a_strong_candidate’ and ‘many_molecules’
  5. OWLViz of final ontology. Shows t-lymphocyte is-a lymphocyte is-a white blood cell is-a blood cell is-a cell. Also shows that it's still not perfect: fibroblast is a blast cell, which is not actually correct.
  6. Graph of OntoEval results showing gradual improvement of taxonomic recall and precision. Lexical Precision 40% -> 50%. In final condition where we placed it under CTO, rose to 72%.
  7. Same image as previous slide, but showing where we manually inserted our learnt ontology under CTO classes. This image is again only for cell_by_function
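
As a rough check on the term-level figures, precision and recall can be computed from the counts on the Term Recognition slide. The sketch below assumes that "relevant" means extracted terms that match the 1,277-term definitive list; the slides do not state this explicitly, and the function name `precision_recall` is introduced only for illustration.

```python
def precision_recall(extracted, relevant, gold):
    """Precision = relevant / extracted; recall = relevant / gold-standard size."""
    return relevant / extracted, relevant / gold


GOLD_TERMS = 1277  # size of the manually built definitive list

# Counts quoted on the Term Recognition slide.
first_p, first_r = precision_recall(extracted=16384, relevant=625, gold=GOLD_TERMS)
later_p, later_r = precision_recall(extracted=17851, relevant=916, gold=GOLD_TERMS)

print(f"first counts: precision {first_p:.1%}, recall {first_r:.1%}")
print(f"later counts: precision {later_p:.1%}, recall {later_r:.1%}")
```

Under that assumption the first pair of counts gives about 4% precision and 49% recall, and the second pair gives about 72% recall, in line with the graph described in note 3. The second pair gives only about 5% precision, so the 49% precision reported for the final condition presumably rests on a smaller set of extracted terms that the transcript does not give.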