Issues and activities in authoring ontologies

Issues and activities in 
authoring ontologies 
Robert Stevens 
School of Computer Science 
University of Manchester 
robert.stevens@manchester.ac.uk
We need to know what we’re talking 
about… 
• … if we don’t, our data are useless 
• If we are to interpret our data then we need 
to know what entities it describes 
• We need to share data and re-use it 
• We need to find data; compare data; analyse 
data 
• We need to know what we know and agree 
about it….
What is an Ontology? 
• Ontology (Socrates & Aristotle 400-360 
BC) 
• The study of being 
•Word borrowed by computing for the 
explicit description of the 
conceptualisation of a domain: 
– concepts 
– properties and attributes of concepts 
– constraints on properties and 
attributes 
– individuals (often, but not always) 
• An ontology defines 
– An agreement on the entities of a 
domain 
– a common vocabulary for the entities 
of a domain
Web Ontology Language (OWL) 
• W3C recommendation for ontologies for the Semantic 
Web 
• OWL-DL mapped to a decidable fragment of first order 
logic 
• Classes, properties and instances 
• Boolean operators, plus existential and universal 
quantification 
• Rich class expressions used in restriction on properties 
– hasDomain some (ImnunoGlobinDomain or 
FibronectinDomain) 
• Automated reasoners reveal entailments 
from the axioms of an ontology in OWL
OWL represents classes of 
instances 
A 
B 
C
Some OWL and why it’s hard 
Class: RanunculusRepens 
SubClassOf: 
* Flower, 
Flower 
and (hasFlowerSymmetry some RadialSymmetry) 
and (hasPart some 
(Androecium 
and (hasAndroecialFusion some Apostemonous) 
and (hasPart some 
(Stamen 
and (hasPart some Filament) 
and (hasPart some 
(Anther 
and (hasAntherAttachment some AdnateAntherAttachment) 
and (hasDehiscenceType some LongitudinalDehiscence))))))) 
and (hasPart some 
(Gynoecium 
and (hasGynoecialFusion some Apocarpous) 
and (hasPart some 
(Pistil 
and (hasPart some Carpel) 
and (hasPart some Style) 
and (hasPart some 
(Stigma 
and (hasStickiness some Stickiness) 
and (hasStigmaShape some HookedStigmaShape))) 
and (hasPart only 
(Carpel 
or Stigma 
or Style)))) 
and (hasSexualPartArrangement some SpiralArrangement))) 
and (hasPart exactly 1 (Perianth
Some OWL and why it’s hard 
Class: RanunculusRepens 
SubClassOf: 
* Flower, 
Flower 
and (hasPart some 
(Calyx 
and (hasPart exactly 5 (Sepal 
and (hasColour some Green) 
and (hasRegion some 
(BaseRegion 
and (hasForm some Truncate))) 
and (hasRegion some 
(MarginRegion 
and (hasSepalPetalFeature some Entire) 
and (hasSepalPetalFeature some Membranous))) 
and (hasRegion some 
(SurfaceRegion 
and (hasSepalPetalFeature some Pubescent) 
and (hasSurfaceSelector some LowerSurfaceSelector))) 
and (hasRegion some 
(SurfaceRegion 
and (hasSepalPetalFeature some Smooth) 
and (hasSurfaceSelector some UpperSurfaceSelector))) 
and (hasRegion some 
(TipRegion 
and (hasForm some Truncate))) 
and (hasSepalPetalFeature some PalmatelyNetted) 
and (hasSepalPetalShape some Ovate) 
and (hasSepalousity some Aposepalos)))))
Some OWL and why it’s hard 
Class: RanunculusRepens 
SubClassOf: 
* Flower, 
Flower 
and (hasPart some 
(Corolla 
and (hasPart exactly 5 (Petal 
and (hasColour some Yellow) 
and (hasPetalousity some Apopetalos) 
and (hasRegion some 
(BaseRegion 
and (hasForm some Acute))) 
and (hasRegion some 
(MarginRegion 
and (hasSepalPetalFeature some Entire))) 
and (hasRegion some 
(TipRegion 
and (hasForm some Acute))) 
and (hasSepalPetalFeature some PalmatelyNetted) 
and (hasSepalPetalShape some Obovate) 
and (hasPart exactly 1 Nectary))))) 
and (hasPerianthArrangement some AlternatingPerianthArrangement) 
and (hasPart only 
(Calyx 
or Corolla))))
Describing potatoes 
Potato 
BoilingPotato LateFirstEarlyPotato 
Accent 
Class: BoilingPotato 
EquivalentTo: Potato and hasPreferredCookingMethod some Boiling 
Class: LateFirstEarlyPotato 
EquivalentTo: Potato and hasCroppingTime some LateFirstEarlyCropping 
Class: Accent 
SubClassOf: 
Potato, 
hasPreferredCookingMethod some Boiling, 
hasYield some HighYield, 
hasCroppingTime some LateFirstEarlyCropping
Protégé 
protege.stanford.edu
Understanding how ontologies are 
authored in OWL 
• We want to understand how these complex, 
cognitively hard artefacts are authored 
• HCI approaches do not pervade all computing 
disciplines 
• Instruments to run user studies are scarce 
• Consequences for the OWL realm 
– No real understanding about the authoring process 
– Authoring tools are not human-centered 
• What if we want to go further? 
– Automatic detection of authoring patterns 
– Intelligent support for authoring
How we tackle the problem 
• Get familiarised with the 
problem 
• Set the scope 
• Acquire insights for the 
quantitative approach 
Qualitative 
approach 
 Interview study 
 Thematic analysis 
• Collection of quantifiable data 
• Use of lab apparatus (eye-tracker, 
video, etc.) 
• Find authoring patterns 
• Quantify and generalise 
Quantitative 
approach 
 Instrumentation of Protégé 
 Lab study 
 Data-driven analysis
Little is known about the human 
factors of ontology authoring 
• What we know is mostly based on anecdotal 
evidence 
• We asked about problems and strategies
Uncovering issues in ontology 
authoring 
• Exploration and navigation 
– Increase situational awareness by giving feedback 
about the consequences of actions: e.g. undo, 
reasoning 
– Provide overviews for those who are not familiar 
with a given ontology 
– For those who are familiar with an ontology allow 
bookmarks and provide landmarks 
– Facilitate the navigation through filters, faceted 
navigation mechanisms and hyperlinking entities
Uncovering issues in ontology 
authoring 
• Search and retrieval 
– Integrated support to search on remote ontologies 
and incorporate entities in the working ontology 
• Efficient authoring 
– Include design templates and spreadsheets 
• Provide on-the-fly reasoning capabilities 
• Remove information overload in explanations 
• Include predefined unit tests for evaluation
Protégé4US: a step towards having 
observational instruments 
• Protégé4US: Protégé for User Studies 
• Logging capabilities of: 
– Interaction events: click, hover, expand hierarchy... 
– Authoring events: add siblings, add restrictions... 
– Environment commands: reason, search, undo... 
76585,2,Classes,Element edited,Juliette subclass of: Potato and hasCroppingTime some ’Main 
cropping’ 
77786,3,Classes,Save ontology,http://owl.cs.manchester.ac.uk/ontology/start-here.owl 
80204,3,Classes,Reasoner invoked,HermiT 1.3.8 
80647,1,Classes,Mouse entered, Class hierarchy (inferred) 
82910,1,Classes,Element hovered,Early_cropping_potato 
83049,1,Classes,Element selected,Early_cropping_potato 
83661,1,Classes,Hierarchy expanded,Early_cropping_potato
User study to show the strengths of 
Protégé4US 
• Experimental design: 
– Participants: 16 expert authors 
– Stimuli: a potato ontology and Protégé4US 
– 3 authoring tasks with an increased complexity 
• Collected data 
– Protégé4US logs: 10K events 
– Completion times 
– Self reported expertise 
– Perceived task difficulty 
– Screen video and eye-tracking
Describing potatoes 
Potato 
BoilingPotato LateFirstEarlyPotato 
Accent 
Class: BoilingPotato 
EquivalentTo: Potato and hasPreferredCookingMethod some Boiling 
Class: LateFirstEarlyPotato 
EquivalentTo: Potato and hasCroppingTime some LateFirstEarlyCropping 
Class: Accent 
SubClassOf: 
Potato, 
hasPreferredCookingMethod some Boiling, 
hasYield some HighYield, 
hasCroppingTime some LateFirstEarlyCropping
Protégé4US in action
Analysis of log data 
• Interaction events account for 65% of events 
while authoring events are 30% 
• The top 3 events (entity selection, description 
selection and invocation of editing menu) 
account for 56% of events
Analysis of log data 
• N-gram analysis of consecutive 
events suggests lots of 
repetition 
• Esp. for entity selection and 
hierarchy expansion 
• Mouse driven functionalities 
makes this possible in Protégé 
• We built adjacency matrices for 
participants: number of 
transitions from event x to 
event y 
1000 
750 
500 
250 
0 
2 4 6 8 10 
N−grams size 
frequency 
Event 
Class addition 
Description selected 
Entity selected 
Entity selected(i) 
Hierarchy expanded 
Hierarchy expanded(i)
Reconstructing the interaction to 
identify patterns through visualisation 
• Left: web diagrams show most frequent 
transitions between states 
• Right: time diagrams show the authoring 
rhythm P8 
Back 
Class addition 
Convert into defined 
Description selected 
Description selected(i) 
Entity deleted 
Entity dragged 
Entity edited:finish 
Entity edited:start 
Entity selected 
Set property Undo 
Run reasoner 
Property addition 
Load ontology 
Get explanation 
Hierarchy expanded(i) 
Hierarchy collapsed(i) 
Hierarchy collapsed 
Entity renamed Entity selected(i) 
Hierarchy expanded 
Save 
Description selected(i) 
Description selected 
Entity dragged 
Entity deleted 
Entity renamed 
Back 
Undo 
Hierarchy collapsed(i) 
Hierarchy collapsed 
Get explanation 
Set property 
Property addition 
Class addition 
Run reasoner 
Save 
Convert into defined 
Entity edited:finish 
Entity edited:start 
Hierarchy expanded(i) 
Hierarchy expanded 
EntitySelected(i) 
Entity selected 
Load ontology 
0 1000 2000 3000 4000
Analysis of eye-tracking data 
• Distribution of aggregated dwell times in the 
areas of interest 
• The class hierarchy 
and the entity 
edition menu get 
the majority of 
fixations and dwell 
time
Analysis of eye-tracking data 
• Number of fixations between areas of interest 
• High frequency 
expected at the 
diagonal 
• Symmetry 
suggests checking 
behaviours 
• The class hierarchy 
is the pivotal 
window
Log data + eye-tracking data 
• Synchronised both data sources 
• Merged same consecutive events 
e.g. class additiont, class additiont+1, class additiont+2, entity selectedt+3 
M_class_additiont+2, entity selectedt+3 
• Synchronised both data sources 
• Computed N-gram analysis and we found 3 
main activities: 
– Exploration activity 
– Authoring activity 
– Reasoning activity
Exploration activity 
Select 
entity 
Expand 
hierarchy 
0.48 
0.31 
Select 
inferred 
entity 
Expand 
inferred 
hierarchy 
0.25 
0.43 
0.12 
0.54 
Load 
ontology 
0.52 
0.31 
Expand 
hierarchy 
Select 
description 
0.29 
0.37 
Exploration activity 
• Expand the asserted class 
hierarchy after loading an 
ontology 
• The exploration of the 
asserted hierarchy is 
about finding a specific 
location to add or modify 
an entity, while exploration 
of the inferred one is to 
check the state of the 
ontology
Editing activity 
Select 
description 
Select 
entity 
0.29 Modify 
entity 
0.37 
0.63 
0.59 
Editing activity 
• Sequence found 362 times 
• 22.6 times per participant 
• The high probabilities along with the frequency 
with which this activity is performed, indicates 
that entities were modified in batches
Reasoning activity 
Run 
reasoner 
0.17 
Convert into 
defined class 
Save 
Select 
description 
0.16 
0.15 
0.40 
Expand 
inferred 
hierarchy 
0.30 
Select 
entity 
0.41 
0.37 
0.43 
Select 
inferred 
entity 
0.54 
0.25 0.12 
Reasoning activity 
• After running the reasoner participants observe 
the consequences of reasoning on the asserted 
hierarchy and the description area OR 
• To check classification, participants expand the 
inferred class hierarchy and make selections on 
inferred entities
Discussion 
• Ontology editing is highly repetitive 
• The class hierarchy received users’ attention 
45% of the time 
– Acts as an external memory of the ontology 
– Plays the role of an index with pointers to extended 
information 
• Navigation of the inferred hierarchy is 
exploratory, while the navigation of the asserted 
hierarchy is directed
Discussion 
• Some outcomes corroborate initial findings: 
repetitiveness of editing task and lack of 
situational awareness after running the 
reasoner 
• Design recommendations 
– Support bulk editing 
– Place editing features close to the class hierarchy 
– Show entity descriptions close to the class 
hierarchy 
– Anticipate reasoner invocation 
– Make changes to the inferred hierarchy explicit
Acknowledgements 
Markel Vigo did the work. 
Caroline Jay and Robert Stevens helped out with design, 
analysis, and so on.
Issues and activities in 
authoring ontologies 
Robert Stevens 
School of Computer Science 
University of Manchester 
robert.stevens@manchester.ac.uk 
WhatIf: Answering “What if...” questions for Ontology Authoring. 
EPSRC reference EP/J014176/1
1 of 32

Recommended

Ontologies: Necessary, but not sufficient by
Ontologies: Necessary, but not sufficientOntologies: Necessary, but not sufficient
Ontologies: Necessary, but not sufficientrobertstevens65
716 views27 slides
OBOPedia: An Encyclopaedia of Biology Using OBO OntologiesObopedia swat4ls-20... by
OBOPedia: An Encyclopaedia of Biology Using OBO OntologiesObopedia swat4ls-20...OBOPedia: An Encyclopaedia of Biology Using OBO OntologiesObopedia swat4ls-20...
OBOPedia: An Encyclopaedia of Biology Using OBO OntologiesObopedia swat4ls-20...robertstevens65
745 views9 slides
GraphConnect Europe 2016 - Building a Repository of Biomedical Ontologies wit... by
GraphConnect Europe 2016 - Building a Repository of Biomedical Ontologies wit...GraphConnect Europe 2016 - Building a Repository of Biomedical Ontologies wit...
GraphConnect Europe 2016 - Building a Repository of Biomedical Ontologies wit...Neo4j
784 views20 slides
Semantic Application for Healthcare by
Semantic Application for HealthcareSemantic Application for Healthcare
Semantic Application for Healthcarescholten
1K views58 slides
Type-Aware Entity Retrieval by
Type-Aware Entity RetrievalType-Aware Entity Retrieval
Type-Aware Entity RetrievalDarío Garigliotti
11 views25 slides
Bio ontologies and semantic technologies[2] by
Bio ontologies and semantic technologies[2]Bio ontologies and semantic technologies[2]
Bio ontologies and semantic technologies[2]Prof. Wim Van Criekinge
1.9K views126 slides

More Related Content

What's hot

Classifications in EOL by
Classifications in EOLClassifications in EOL
Classifications in EOLCyndy Parr
303 views13 slides
Ontology and Ontology Libraries: a critical study by
Ontology and Ontology Libraries: a critical studyOntology and Ontology Libraries: a critical study
Ontology and Ontology Libraries: a critical studyDebashisnaskar
2.4K views42 slides
2019 02 12_biological_databases_part1_v_upload by
2019 02 12_biological_databases_part1_v_upload2019 02 12_biological_databases_part1_v_upload
2019 02 12_biological_databases_part1_v_uploadProf. Wim Van Criekinge
2.9K views96 slides
Ontology engineering by
Ontology engineering Ontology engineering
Ontology engineering Aliabbas Petiwala
16.8K views49 slides
T1 2018 bioinformatics by
T1 2018 bioinformaticsT1 2018 bioinformatics
T1 2018 bioinformaticsProf. Wim Van Criekinge
1.7K views73 slides
2020 02 11_biological_databases_part1 by
2020 02 11_biological_databases_part12020 02 11_biological_databases_part1
2020 02 11_biological_databases_part1Prof. Wim Van Criekinge
1.6K views90 slides

What's hot(15)

Classifications in EOL by Cyndy Parr
Classifications in EOLClassifications in EOL
Classifications in EOL
Cyndy Parr303 views
Ontology and Ontology Libraries: a critical study by Debashisnaskar
Ontology and Ontology Libraries: a critical studyOntology and Ontology Libraries: a critical study
Ontology and Ontology Libraries: a critical study
Debashisnaskar2.4K views
OWL-XML-Summer-School-09 by Duncan Hull
OWL-XML-Summer-School-09OWL-XML-Summer-School-09
OWL-XML-Summer-School-09
Duncan Hull3.5K views
Content Team Update by eMonocot
Content Team UpdateContent Team Update
Content Team Update
eMonocot650 views
Botanists and annotations printer friendly by William Ulate
Botanists and annotations   printer friendlyBotanists and annotations   printer friendly
Botanists and annotations printer friendly
William Ulate334 views

Viewers also liked

The Semantics of Genomic Analysis by
The Semantics of  Genomic AnalysisThe Semantics of  Genomic Analysis
The Semantics of Genomic Analysisrobertstevens65
841 views18 slides
Building and Using Ontologies to do biology by
Building and Using Ontologies to do biologyBuilding and Using Ontologies to do biology
Building and Using Ontologies to do biologyrobertstevens65
585 views38 slides
The Quality of Method Reporting in by
The Quality of Method Reporting in The Quality of Method Reporting in
The Quality of Method Reporting in robertstevens65
471 views21 slides
The Pragmatics and Formality of Authoring OntologiesOdsl 2016 by
The Pragmatics and Formality of Authoring OntologiesOdsl 2016The Pragmatics and Formality of Authoring OntologiesOdsl 2016
The Pragmatics and Formality of Authoring OntologiesOdsl 2016robertstevens65
228 views36 slides
The state of the nation for ontology development by
The state of the nation for ontology developmentThe state of the nation for ontology development
The state of the nation for ontology developmentrobertstevens65
700 views19 slides
Properties and Individuals in OWL: Reasoning About Family History by
Properties and Individuals in OWL: Reasoning About Family HistoryProperties and Individuals in OWL: Reasoning About Family History
Properties and Individuals in OWL: Reasoning About Family Historyrobertstevens65
4.2K views69 slides

Viewers also liked(7)

The Semantics of Genomic Analysis by robertstevens65
The Semantics of  Genomic AnalysisThe Semantics of  Genomic Analysis
The Semantics of Genomic Analysis
robertstevens65841 views
Building and Using Ontologies to do biology by robertstevens65
Building and Using Ontologies to do biologyBuilding and Using Ontologies to do biology
Building and Using Ontologies to do biology
robertstevens65585 views
The Quality of Method Reporting in by robertstevens65
The Quality of Method Reporting in The Quality of Method Reporting in
The Quality of Method Reporting in
robertstevens65471 views
The Pragmatics and Formality of Authoring OntologiesOdsl 2016 by robertstevens65
The Pragmatics and Formality of Authoring OntologiesOdsl 2016The Pragmatics and Formality of Authoring OntologiesOdsl 2016
The Pragmatics and Formality of Authoring OntologiesOdsl 2016
robertstevens65228 views
The state of the nation for ontology development by robertstevens65
The state of the nation for ontology developmentThe state of the nation for ontology development
The state of the nation for ontology development
robertstevens65700 views
Properties and Individuals in OWL: Reasoning About Family History by robertstevens65
Properties and Individuals in OWL: Reasoning About Family HistoryProperties and Individuals in OWL: Reasoning About Family History
Properties and Individuals in OWL: Reasoning About Family History
robertstevens654.2K views

Similar to Issues and activities in authoring ontologies

SMART Protocols in LISC-2014 by
SMART Protocols in LISC-2014 SMART Protocols in LISC-2014
SMART Protocols in LISC-2014 Olga Ximena Giraldo
406 views19 slides
Ontology Design Patterns for Linked Data Tutorial at ISWC2016 - Introduction by
Ontology Design Patterns for Linked Data Tutorial at ISWC2016 - IntroductionOntology Design Patterns for Linked Data Tutorial at ISWC2016 - Introduction
Ontology Design Patterns for Linked Data Tutorial at ISWC2016 - IntroductionAldo Gangemi
955 views51 slides
Working with big biomedical ontologies by
Working with big biomedical ontologiesWorking with big biomedical ontologies
Working with big biomedical ontologiesrobertstevens65
567 views28 slides
OntologyEngineering.ppt by
OntologyEngineering.pptOntologyEngineering.ppt
OntologyEngineering.pptAmitSehgalSETProfess
22 views49 slides
BT02.pptx by
BT02.pptxBT02.pptx
BT02.pptxThAnhonc
3 views35 slides
SMART Protocols by
SMART ProtocolsSMART Protocols
SMART ProtocolsOlga Ximena Giraldo
461 views24 slides

Similar to Issues and activities in authoring ontologies(20)

Ontology Design Patterns for Linked Data Tutorial at ISWC2016 - Introduction by Aldo Gangemi
Ontology Design Patterns for Linked Data Tutorial at ISWC2016 - IntroductionOntology Design Patterns for Linked Data Tutorial at ISWC2016 - Introduction
Ontology Design Patterns for Linked Data Tutorial at ISWC2016 - Introduction
Aldo Gangemi955 views
Working with big biomedical ontologies by robertstevens65
Working with big biomedical ontologiesWorking with big biomedical ontologies
Working with big biomedical ontologies
robertstevens65567 views
Building a repository of biomedical ontologies with Neo4j by Simon Jupp
Building a repository of biomedical ontologies with Neo4jBuilding a repository of biomedical ontologies with Neo4j
Building a repository of biomedical ontologies with Neo4j
Simon Jupp4.3K views
Pharo: a reflective language A first systematic analysis of reflective APIs by ESUG
Pharo: a reflective language A first systematic analysis of reflective APIsPharo: a reflective language A first systematic analysis of reflective APIs
Pharo: a reflective language A first systematic analysis of reflective APIs
ESUG57 views
EvoPat - Pattern-Based Evolution and Refactoring of RDF Knowledge Bases by Sebastian Tramp
EvoPat - Pattern-Based Evolution and Refactoring of RDF Knowledge BasesEvoPat - Pattern-Based Evolution and Refactoring of RDF Knowledge Bases
EvoPat - Pattern-Based Evolution and Refactoring of RDF Knowledge Bases
Sebastian Tramp410 views
Semi-automated Exploration and Extraction of Data in Scientific Tables by Elsevier
Semi-automated Exploration and Extraction of Data in Scientific TablesSemi-automated Exploration and Extraction of Data in Scientific Tables
Semi-automated Exploration and Extraction of Data in Scientific Tables
Elsevier1.6K views
Achille Felicetti - ARIADNE Semantic Integration of Archaeological Information by ariadnenetwork
Achille Felicetti - ARIADNE Semantic Integration of Archaeological InformationAchille Felicetti - ARIADNE Semantic Integration of Archaeological Information
Achille Felicetti - ARIADNE Semantic Integration of Archaeological Information
ariadnenetwork367 views
Research Objects for improved sharing and reproducibility by Oscar Corcho
Research Objects for improved sharing and reproducibilityResearch Objects for improved sharing and reproducibility
Research Objects for improved sharing and reproducibility
Oscar Corcho843 views
Ontology Engineering for the Semantic Web and beyond by Peter Geil
Ontology Engineering for the Semantic Web and beyondOntology Engineering for the Semantic Web and beyond
Ontology Engineering for the Semantic Web and beyond
Peter Geil2.2K views
Tutorial OWL and drug discovery ICBO 2013 by Samuel Croset
Tutorial OWL and drug discovery ICBO 2013Tutorial OWL and drug discovery ICBO 2013
Tutorial OWL and drug discovery ICBO 2013
Samuel Croset612 views
The Past, Present and Future of Knowledge in Biology by robertstevens65
The Past, Present and Future of Knowledge in BiologyThe Past, Present and Future of Knowledge in Biology
The Past, Present and Future of Knowledge in Biology
robertstevens65760 views

More from robertstevens65

Choosing and Building Knowledge Artefacts by
Choosing and Building Knowledge ArtefactsChoosing and Building Knowledge Artefacts
Choosing and Building Knowledge Artefactsrobertstevens65
474 views21 slides
Populous: A tool for Populating OWL Ontologies from Templates by
Populous: A tool for Populating OWL Ontologies from TemplatesPopulous: A tool for Populating OWL Ontologies from Templates
Populous: A tool for Populating OWL Ontologies from Templatesrobertstevens65
965 views35 slides
Keeping ontology development Agile by
Keeping ontology development AgileKeeping ontology development Agile
Keeping ontology development Agilerobertstevens65
429 views23 slides
Spreadsheets to OWL by
Spreadsheets to OWLSpreadsheets to OWL
Spreadsheets to OWLrobertstevens65
534 views31 slides
Lessons from teaching non-computer scientists OWL and ontologies by
Lessons from teaching non-computer scientists OWL and ontologiesLessons from teaching non-computer scientists OWL and ontologies
Lessons from teaching non-computer scientists OWL and ontologiesrobertstevens65
769 views22 slides
Kidney and Urinary Pathways Knowledge Base (part of e-LICO) by
Kidney and Urinary Pathways Knowledge Base (part of e-LICO)Kidney and Urinary Pathways Knowledge Base (part of e-LICO)
Kidney and Urinary Pathways Knowledge Base (part of e-LICO)robertstevens65
495 views27 slides

More from robertstevens65(20)

Choosing and Building Knowledge Artefacts by robertstevens65
Choosing and Building Knowledge ArtefactsChoosing and Building Knowledge Artefacts
Choosing and Building Knowledge Artefacts
robertstevens65474 views
Populous: A tool for Populating OWL Ontologies from Templates by robertstevens65
Populous: A tool for Populating OWL Ontologies from TemplatesPopulous: A tool for Populating OWL Ontologies from Templates
Populous: A tool for Populating OWL Ontologies from Templates
robertstevens65965 views
Keeping ontology development Agile by robertstevens65
Keeping ontology development AgileKeeping ontology development Agile
Keeping ontology development Agile
robertstevens65429 views
Lessons from teaching non-computer scientists OWL and ontologies by robertstevens65
Lessons from teaching non-computer scientists OWL and ontologiesLessons from teaching non-computer scientists OWL and ontologies
Lessons from teaching non-computer scientists OWL and ontologies
robertstevens65769 views
Kidney and Urinary Pathways Knowledge Base (part of e-LICO) by robertstevens65
Kidney and Urinary Pathways Knowledge Base (part of e-LICO)Kidney and Urinary Pathways Knowledge Base (part of e-LICO)
Kidney and Urinary Pathways Knowledge Base (part of e-LICO)
robertstevens65495 views
A Rose by Any Other Name is Still a Rose by robertstevens65
A Rose by Any Other Name is Still a RoseA Rose by Any Other Name is Still a Rose
A Rose by Any Other Name is Still a Rose
robertstevens65576 views
The Big Picture: The Industrial Revolutiona talk in berlin, 2008, about indus... by robertstevens65
The Big Picture: The Industrial Revolutiona talk in berlin, 2008, about indus...The Big Picture: The Industrial Revolutiona talk in berlin, 2008, about indus...
The Big Picture: The Industrial Revolutiona talk in berlin, 2008, about indus...
robertstevens65359 views
Knowledge Management in a Knowledge Based Discipline by robertstevens65
Knowledge Management in a Knowledge Based DisciplineKnowledge Management in a Knowledge Based Discipline
Knowledge Management in a Knowledge Based Discipline
robertstevens65443 views
A family History Knowledge Base in OWL 2 by robertstevens65
A family History Knowledge Base in OWL 2A family History Knowledge Base in OWL 2
A family History Knowledge Base in OWL 2
robertstevens65376 views
RIO: The Regularities Inspector for Ontologies Plugin for Protégé 4 by robertstevens65
RIO: The Regularities Inspector for Ontologies Plugin for Protégé 4 RIO: The Regularities Inspector for Ontologies Plugin for Protégé 4
RIO: The Regularities Inspector for Ontologies Plugin for Protégé 4
robertstevens65199 views
Communities building ontologies: Tensions and Reality by robertstevens65
Communities building ontologies: Tensions and RealityCommunities building ontologies: Tensions and Reality
Communities building ontologies: Tensions and Reality
robertstevens65406 views
Issues in Learning an Ontology from Text by robertstevens65
Issues in Learning an Ontology from Text Issues in Learning an Ontology from Text
Issues in Learning an Ontology from Text
robertstevens65508 views
Can there be such a thing as Ontology Engineering? by robertstevens65
Can there be such a thing as Ontology Engineering?Can there be such a thing as Ontology Engineering?
Can there be such a thing as Ontology Engineering?
robertstevens65483 views
Knowing what we’re talking about by robertstevens65
Knowing what we’re talking aboutKnowing what we’re talking about
Knowing what we’re talking about
robertstevens65470 views
Could Mendelev have Dreamt in OWL? by robertstevens65
Could Mendelev have Dreamt in OWL?Could Mendelev have Dreamt in OWL?
Could Mendelev have Dreamt in OWL?
robertstevens65484 views
Using Ontology to Classify Members of a Protein Family by robertstevens65
Using Ontology to Classify Members of a Protein Family Using Ontology to Classify Members of a Protein Family
Using Ontology to Classify Members of a Protein Family
robertstevens65748 views

Recently uploaded

Factors affecting fluorescence and phosphorescence.pptx by
Factors affecting fluorescence and phosphorescence.pptxFactors affecting fluorescence and phosphorescence.pptx
Factors affecting fluorescence and phosphorescence.pptxSamarthGiri1
7 views11 slides
Small ruminant keepers’ knowledge, attitudes and practices towards peste des ... by
Small ruminant keepers’ knowledge, attitudes and practices towards peste des ...Small ruminant keepers’ knowledge, attitudes and practices towards peste des ...
Small ruminant keepers’ knowledge, attitudes and practices towards peste des ...ILRI
8 views1 slide
DEVELOPMENT OF FROG.pptx by
DEVELOPMENT OF FROG.pptxDEVELOPMENT OF FROG.pptx
DEVELOPMENT OF FROG.pptxsushant292556
11 views21 slides
Bacterial Reproduction.pdf by
Bacterial Reproduction.pdfBacterial Reproduction.pdf
Bacterial Reproduction.pdfNandadulalSannigrahi
24 views32 slides
Experimental animal Guinea pigs.pptx by
Experimental animal Guinea pigs.pptxExperimental animal Guinea pigs.pptx
Experimental animal Guinea pigs.pptxMansee Arya
38 views16 slides
NUTRITION IN BACTERIA.pdf by
NUTRITION IN BACTERIA.pdfNUTRITION IN BACTERIA.pdf
NUTRITION IN BACTERIA.pdfNandadulalSannigrahi
36 views14 slides

Recently uploaded(20)

Factors affecting fluorescence and phosphorescence.pptx by SamarthGiri1
Factors affecting fluorescence and phosphorescence.pptxFactors affecting fluorescence and phosphorescence.pptx
Factors affecting fluorescence and phosphorescence.pptx
SamarthGiri17 views
Small ruminant keepers’ knowledge, attitudes and practices towards peste des ... by ILRI
Small ruminant keepers’ knowledge, attitudes and practices towards peste des ...Small ruminant keepers’ knowledge, attitudes and practices towards peste des ...
Small ruminant keepers’ knowledge, attitudes and practices towards peste des ...
ILRI8 views
Experimental animal Guinea pigs.pptx by Mansee Arya
Experimental animal Guinea pigs.pptxExperimental animal Guinea pigs.pptx
Experimental animal Guinea pigs.pptx
Mansee Arya38 views
Nitrosamine & NDSRI.pptx by NileshBonde4
Nitrosamine & NDSRI.pptxNitrosamine & NDSRI.pptx
Nitrosamine & NDSRI.pptx
NileshBonde418 views
Exploring the nature and synchronicity of early cluster formation in the Larg... by Sérgio Sacani
Exploring the nature and synchronicity of early cluster formation in the Larg...Exploring the nature and synchronicity of early cluster formation in the Larg...
Exploring the nature and synchronicity of early cluster formation in the Larg...
Sérgio Sacani1.2K views
Small ruminant keepers’ knowledge, attitudes and practices towards peste des ... by ILRI
Small ruminant keepers’ knowledge, attitudes and practices towards peste des ...Small ruminant keepers’ knowledge, attitudes and practices towards peste des ...
Small ruminant keepers’ knowledge, attitudes and practices towards peste des ...
ILRI5 views
Discovery of therapeutic agents targeting PKLR for NAFLD using drug repositio... by Trustlife
Discovery of therapeutic agents targeting PKLR for NAFLD using drug repositio...Discovery of therapeutic agents targeting PKLR for NAFLD using drug repositio...
Discovery of therapeutic agents targeting PKLR for NAFLD using drug repositio...
Trustlife142 views
A Ready-to-Analyze High-Plex Spatial Signature Development Workflow for Cance... by InsideScientific
A Ready-to-Analyze High-Plex Spatial Signature Development Workflow for Cance...A Ready-to-Analyze High-Plex Spatial Signature Development Workflow for Cance...
A Ready-to-Analyze High-Plex Spatial Signature Development Workflow for Cance...
InsideScientific105 views
Study on Drug Drug Interaction Through Prescription Analysis of Type II Diabe... by Anmol Vishnu Gupta
Study on Drug Drug Interaction Through Prescription Analysis of Type II Diabe...Study on Drug Drug Interaction Through Prescription Analysis of Type II Diabe...
Study on Drug Drug Interaction Through Prescription Analysis of Type II Diabe...
A giant thin stellar stream in the Coma Galaxy Cluster by Sérgio Sacani
A giant thin stellar stream in the Coma Galaxy ClusterA giant thin stellar stream in the Coma Galaxy Cluster
A giant thin stellar stream in the Coma Galaxy Cluster
Sérgio Sacani18 views
application of genetic engineering 2.pptx by SankSurezz
application of genetic engineering 2.pptxapplication of genetic engineering 2.pptx
application of genetic engineering 2.pptx
SankSurezz14 views

Issues and activities in authoring ontologies

  • 1. Issues and activities in authoring ontologies Robert Stevens School of Computer Science University of Manchester robert.stevens@manchester.ac.uk
  • 2. We need to know what we’re talking about… • … if we don’t, our data are useless • If we are to interpret our data then we need to know what entities it describes • We need to share data and re-use it • We need to find data; compare data; analyse data • We need to know what we know and agree about it….
  • 3. What is an Ontology? • Ontology (Socrates & Aristotle 400-360 BC) • The study of being •Word borrowed by computing for the explicit description of the conceptualisation of a domain: – concepts – properties and attributes of concepts – constraints on properties and attributes – individuals (often, but not always) • An ontology defines – An agreement on the entities of a domain – a common vocabulary for the entities of a domain
  • 4. Web Ontology Language (OWL) • W3C recommendation for ontologies for the Semantic Web • OWL-DL mapped to a decidable fragment of first order logic • Classes, properties and instances • Boolean operators, plus existential and universal quantification • Rich class expressions used in restriction on properties – hasDomain some (ImnunoGlobinDomain or FibronectinDomain) • Automated reasoners reveal entailments from the axioms of an ontology in OWL
  • 5. OWL represents classes of instances A B C
  • 6. Some OWL and why it’s hard Class: RanunculusRepens SubClassOf: * Flower, Flower and (hasFlowerSymmetry some RadialSymmetry) and (hasPart some (Androecium and (hasAndroecialFusion some Apostemonous) and (hasPart some (Stamen and (hasPart some Filament) and (hasPart some (Anther and (hasAntherAttachment some AdnateAntherAttachment) and (hasDehiscenceType some LongitudinalDehiscence))))))) and (hasPart some (Gynoecium and (hasGynoecialFusion some Apocarpous) and (hasPart some (Pistil and (hasPart some Carpel) and (hasPart some Style) and (hasPart some (Stigma and (hasStickiness some Stickiness) and (hasStigmaShape some HookedStigmaShape))) and (hasPart only (Carpel or Stigma or Style)))) and (hasSexualPartArrangement some SpiralArrangement))) and (hasPart exactly 1 (Perianth
  • 7. Some OWL and why it’s hard Class: RanunculusRepens SubClassOf: * Flower, Flower and (hasPart some (Calyx and (hasPart exactly 5 (Sepal and (hasColour some Green) and (hasRegion some (BaseRegion and (hasForm some Truncate))) and (hasRegion some (MarginRegion and (hasSepalPetalFeature some Entire) and (hasSepalPetalFeature some Membranous))) and (hasRegion some (SurfaceRegion and (hasSepalPetalFeature some Pubescent) and (hasSurfaceSelector some LowerSurfaceSelector))) and (hasRegion some (SurfaceRegion and (hasSepalPetalFeature some Smooth) and (hasSurfaceSelector some UpperSurfaceSelector))) and (hasRegion some (TipRegion and (hasForm some Truncate))) and (hasSepalPetalFeature some PalmatelyNetted) and (hasSepalPetalShape some Ovate) and (hasSepalousity some Aposepalos)))))
  • 8. Some OWL and why it’s hard Class: RanunculusRepens SubClassOf: * Flower, Flower and (hasPart some (Corolla and (hasPart exactly 5 (Petal and (hasColour some Yellow) and (hasPetalousity some Apopetalos) and (hasRegion some (BaseRegion and (hasForm some Acute))) and (hasRegion some (MarginRegion and (hasSepalPetalFeature some Entire))) and (hasRegion some (TipRegion and (hasForm some Acute))) and (hasSepalPetalFeature some PalmatelyNetted) and (hasSepalPetalShape some Obovate) and (hasPart exactly 1 Nectary))))) and (hasPerianthArrangement some AlternatingPerianthArrangement) and (hasPart only (Calyx or Corolla))))
  • 9. Describing potatoes Potato BoilingPotato LateFirstEarlyPotato Accent Class: BoilingPotato EquivalentTo: Potato and hasPreferredCookingMethod some Boiling Class: LateFirstEarlyPotato EquivalentTo: Potato and hasCroppingTime some LateFirstEarlyCropping Class: Accent SubClassOf: Potato, hasPreferredCookingMethod some Boiling, hasYield some HighYield, hasCroppingTime some LateFirstEarlyCropping
  • 11. Understanding how ontologies are authored in OWL • We want to understand how these complex, cognitively hard artefacts are authored • HCI approaches do not pervade all computing disciplines • Instruments to run user studies are scarce • Consequences for the OWL realm – No real understanding about the authoring process – Authoring tools are not human-centered • What if we want to go further? – Automatic detection of authoring patterns – Intelligent support for authoring
  • 12. How we tackle the problem • Get familiarised with the problem • Set the scope • Acquire insights for the quantitative approach Qualitative approach  Interview study  Thematic analysis • Collection of quantifiable data • Use of lab apparatus (eye-tracker, video, etc.) • Find authoring patterns • Quantify and generalise Quantitative approach  Instrumentation of Protégé  Lab study  Data-driven analysis
  • 13. Little is known about the human factors of ontology authoring • What we know is mostly based on anecdotal evidence • We asked about problems and strategies
  • 14. Uncovering issues in ontology authoring • Exploration and navigation – Increase situational awareness by giving feedback about the consequences of actions: e.g. undo, reasoning – Provide overviews for those who are not familiar with a given ontology – For those who are familiar with an ontology allow bookmarks and provide landmarks – Facilitate the navigation through filters, faceted navigation mechanisms and hyperlinking entities
  • 15. Uncovering issues in ontology authoring • Search and retrieval – Integrated support to search on remote ontologies and incorporate entities in the working ontology • Efficient authoring – Include design templates and spreadsheets • Provide on-the-fly reasoning capabilities • Remove information overload in explanations • Include predefined unit tests for evaluation
  • 16. Protégé4US: a step towards having observational instruments • Protégé4US: Protégé for User Studies • Logging capabilities of: – Interaction events: click, hover, expand hierarchy... – Authoring events: add siblings, add restrictions... – Environment commands: reason, search, undo... 76585,2,Classes,Element edited,Juliette subclass of: Potato and hasCroppingTime some ’Main cropping’ 77786,3,Classes,Save ontology,http://owl.cs.manchester.ac.uk/ontology/start-here.owl 80204,3,Classes,Reasoner invoked,HermiT 1.3.8 80647,1,Classes,Mouse entered, Class hierarchy (inferred) 82910,1,Classes,Element hovered,Early_cropping_potato 83049,1,Classes,Element selected,Early_cropping_potato 83661,1,Classes,Hierarchy expanded,Early_cropping_potato
  • 17. User study to show the strengths of Protégé4US • Experimental design: – Participants: 16 expert authors – Stimuli: a potato ontology and Protégé4US – 3 authoring tasks with an increased complexity • Collected data – Protégé4US logs: 10K events – Completion times – Self reported expertise – Perceived task difficulty – Screen video and eye-tracking
  • 18. Describing potatoes Potato BoilingPotato LateFirstEarlyPotato Accent Class: BoilingPotato EquivalentTo: Potato and hasPreferredCookingMethod some Boiling Class: LateFirstEarlyPotato EquivalentTo: Potato and hasCroppingTime some LateFirstEarlyCropping Class: Accent SubClassOf: Potato, hasPreferredCookingMethod some Boiling, hasYield some HighYield, hasCroppingTime some LateFirstEarlyCropping
  • 20. Analysis of log data • Interaction events account for 65% of events while authoring events are 30% • The top 3 events (entity selection, description selection and invocation of editing menu) account for 56% of events
  • 21. Analysis of log data • N-gram analysis of consecutive events suggests lots of repetition • Esp. for entity selection and hierarchy expansion • Mouse driven functionalities makes this possible in Protégé • We built adjacency matrices for participants: number of transitions from event x to event y 1000 750 500 250 0 2 4 6 8 10 N−grams size frequency Event Class addition Description selected Entity selected Entity selected(i) Hierarchy expanded Hierarchy expanded(i)
  • 22. Reconstructing the interaction to identify patterns through visualisation • Left: web diagrams show most frequent transitions between states • Right: time diagrams show the authoring rhythm P8 Back Class addition Convert into defined Description selected Description selected(i) Entity deleted Entity dragged Entity edited:finish Entity edited:start Entity selected Set property Undo Run reasoner Property addition Load ontology Get explanation Hierarchy expanded(i) Hierarchy collapsed(i) Hierarchy collapsed Entity renamed Entity selected(i) Hierarchy expanded Save Description selected(i) Description selected Entity dragged Entity deleted Entity renamed Back Undo Hierarchy collapsed(i) Hierarchy collapsed Get explanation Set property Property addition Class addition Run reasoner Save Convert into defined Entity edited:finish Entity edited:start Hierarchy expanded(i) Hierarchy expanded EntitySelected(i) Entity selected Load ontology 0 1000 2000 3000 4000
  • 23. Analysis of eye-tracking data • Distribution of aggregated dwell times in the areas of interest • The class hierarchy and the entity edition menu get the majority of fixations and dwell time
  • 24. Analysis of eye-tracking data • Number of fixations between areas of interest • High frequency expected at the diagonal • Symmetry suggests checking behaviours • The class hierarchy is the pivotal window
  • 25. Log data + eye-tracking data • Synchronised both data sources • Merged same consecutive events e.g. class additiont, class additiont+1, class additiont+2, entity selectedt+3 M_class_additiont+2, entity selectedt+3 • Synchronised both data sources • Computed N-gram analysis and we found 3 main activities: – Exploration activity – Authoring activity – Reasoning activity
  • 26. Exploration activity Select entity Expand hierarchy 0.48 0.31 Select inferred entity Expand inferred hierarchy 0.25 0.43 0.12 0.54 Load ontology 0.52 0.31 Expand hierarchy Select description 0.29 0.37 Exploration activity • Expand the asserted class hierarchy after loading an ontology • The exploration of the asserted hierarchy is about finding a specific location to add or modify an entity, while exploration of the inferred one is to check the state of the ontology
  • 27. Editing activity Select description Select entity 0.29 Modify entity 0.37 0.63 0.59 Editing activity • Sequence found 362 times • 22.6 times per participant • The high probabilities along with the frequency with which this activity is performed, indicates that entities were modified in batches
  • 28. Reasoning activity Run reasoner 0.17 Convert into defined class Save Select description 0.16 0.15 0.40 Expand inferred hierarchy 0.30 Select entity 0.41 0.37 0.43 Select inferred entity 0.54 0.25 0.12 Reasoning activity • After running the reasoner participants observe the consequences of reasoning on the asserted hierarchy and the description area OR • To check classification, participants expand the inferred class hierarchy and make selections on inferred entities
  • 29. Discussion • Ontology editing is highly repetitive • The class hierarchy received users’ attention 45% of the time – Acts as an external memory of the ontology – Plays the role of an index with pointers to extended information • Navigation of the inferred hierarchy is exploratory, while the navigation of the asserted hierarchy is directed
  • 30. Discussion • Some outcomes corroborate initial findings: repetitiveness of editing task and lack of situational awareness after running the reasoner • Design recommendations – Support bulk editing – Place editing features close to the class hierarchy – Show entity descriptions close to the class hierarchy – Anticipate reasoner invocation – Make changes to the inferred hierarchy explicit
  • 31. Acknowledgements Markel Vigo did the work. Caroline Jay and Robert Stevens helped out with design, analysis, and so on.
  • 32. Issues and activities in authoring ontologies Robert Stevens School of Computer Science University of Manchester robert.stevens@manchester.ac.uk WhatIf: Answering “What if...” questions for Ontology Authoring. EPSRC reference EP/J014176/1