SlideShare a Scribd company logo
SMART Protocols: SeMAntic
RepresenTation for
Experimental Protocols
Olga Giraldo
ogiraldo@fi.upm.es
Ontology engineering group (OEG)
Universidad Politécnica de Madrid
Agenda
• What is a lab protocol
• Motivation
• Our general research question
• Our assumption
• Our propose
• Preliminary results
• Future work
What is a lab protocol
• Laboratory protocols are like cooking recipes
• They have ingredients: reagents and sample,
• They have appliances: equipment,
• They have a total time,
• They have a list of instructions,
• They have critical steps.
• The laboratory protocols are “the how to do” an
experiment.
Some problems in lab protocols
 some of them present
insufficient granularity,
 the instructions can be
imprecise or ambiguous due to
the use of natural language.
• Incubate the
centrifuge tubes in a
water bath.
• Incubate the samples
for 5 min with gentle
shaking.
• Rinse DNA briefly in
1-2 ml of wash.
• Incubate at -20C
overnight.
Why do we need to formalize and extract information from
lab protocols?
Because we want a recommendation system…
• That matches protocols according to my situation, for
instance
• samples I have,
• availability of equipment, reagents, lab conditions
• expertise
We also want content based information retrieval
• Meaningful sentences, sample used, purpose of the
protocol, applicability, critical steps, etc. Also,
identification of instructions
• Find all protocols for DNA extraction that have been used in
Oryza sativa that are suitable for processing a large number of
samples with a low execution time.
Motivation
Currently…
Semi-structured information
Unstructured information
How to formalize the information from laboratory
protocols as a knowledge base?
Ontologies + NLP tools
Our assumption
“Experimental protocols
are fundamental
information structures that
should support the
description of the
processes by means of
which results are
generated in experimental
research”
Our propose
Methods to represent and extract information
• Gazetteer-based method: use existing lists of named
entities
 Lists of proper nouns, which refer to real-life entities
• Rule-based approaches: write manual extraction rules
• Combination of the above
• Ontology model representing lab protocols
Ontology development
Methodology used to develop SMART Protocols
Kick-off
• Gathering use cases.
• Gathering competency questions.
Conceptualization
&
Formalization
• DAKA - Domain Analysis and Knowledge Acquisition
 Analysis of 175 experimental protocols.1
• LISA - Linguistic and Semantic Analysis
 Identification of key metadata for reporting protocols,2
 Determination of workflow aspects in protocols (implicit
order in the instructions, following the input output
structure.)
 Extraction of elements pertaining to domain knowledge.
(e.g. classification of protocols in groups according to the
purpose. Within each group were identified basic steps (or
common patterns), according to the type of protocol.
• IO - Iterative Ontology building
 Design of conceptual maps and draft ontologies. The
ontology modules were gathering from DAKA and LISA
activities and exchanged with domain experts.
Evaluation
&
Evolution
• OWL
• Correction of syntactic inconsistencies by using OWLViz3
and OOPS4
• The ontology model evolves as new knowledge goes
through the whole cycle.
1http://goo.gl/MC4mR9
2goo.gl/gAVnn
3http://protegewiki.stanford.edu/wiki/OWLViz
4http://oeg-lia3.dia.fi.upm.es/oops/index-content.jsp
SMART Protocols - document
The Protocol as a document
sp:application of the protocol
sp:advantage of the protocol
sp:limitation of the protocol
sp:provenance of the protocol
sp:purpose of the protocol
sp:introduction section
sp:buffer list
sp:equipment and supplies list
sp:kit list
sp:primer list
sp:reagent list
sp:software list
sp:solution list
sp:materials section
exact:caution
sp:critical step
sp:hint
sp:pause point
sp:storage condition
sp:timing
sp:troubleshooting
sp:methods section
sp:experimental
protocol
iao:document iao:document part
iao:textual entity iao:data set
owl:subClassOf
ro:hasPart
ro:partOf
owl:subClassOf
owl:subClassOfowl:subClassOf
ro:hasPart
ro:hasPart
ro:hasPart
ro:partOf
ro:partOf
ro:partOf
owl:subClassOf owl:subClassOf
exact:alert message
owl:subClassOf
 It is an extension of IAO ontology.
 It supports rhetorical and structural components (e.g. introduction, materials, and methods);
 It supports Information like application of the protocol, advantages and limitations, list of
reagents, critical steps.
SMART Protocols ontology
is available here:
http://vocab.linkeddata.es/S
MARTProtocols/
SMART Protocols - wf
sp:basic step of
DNA extraction
p-plan:Step
p-plan:Variable
sp:cell disruption
sp:plant tissue
Basic Steps of DNA Extraction
sp:DNA purification
obi:DNA extract
p-plan:hasInputVariable
p-plan:hasOutputVariable
p-plan:hasOutputVariable
owl:subClassOf
sp:digestion
reaction
sp:powdered tissue
owl:subClassOf owl:subClassOf
owl:subClassOf
p-plan:hasInputVariable
sp:digested
contaminant
p-plan:hasInputVariable
p-plan:hasOutputVariable
owl:subClassOfowl:subClassOfowl:subClassOfowl:subClassOf
bfo:isPrecededBy bfo:isPrecededBy
• It is an extension of the P-Plan Ontology.
• It represents of the workflow aspects in protocols
 implicit order in the instructions, following the input output structure.
SMART Protocols ontology is
available here:
http://vocab.linkeddata.es/SM
ARTProtocols/
New and reused terms
Resource No. of terms Resource No. of terms
OBI 15 P-Plan 3
NCIthesaurus 9 NPO 3
CHEBI 7 EXACT 2
IAO 7 SO 2
MGEDOntology 3 MeSH 1
• Reused classes = 52
• Reused properties = 4
Property Origen Reused in
isManufacturedBy OBI SMART Protocols-Document
hasInputVar P-Plan SMART Protocols-Workflow
hasOutputVar P-Plan SMART Protocols-Workflow
isStepOfPlan P-Plan SMART Protocols-Workflow
Ontology No. of classes No. of properties
SMART Protocols-Document 60 7
SMART Protocols-Workflow 44 1
Total 104 8
• New terms
Future work
• Analysis of the protocols. Focus on the
identification of keywords and/or constructs in
English –e.g. instructions, actions.
• Writing rules.
• Executing, testing and debugging the rules.
Work in progress
Summarizing…
Our purpose is the
formalization of lab
protocols by using
ontologies and NLP
tools to intelligently
extract information.
Special thanks…
Supervisors
Oscar Corcho Alexander Garcia
OEG’s colleagues
Daniel Garijo María Poveda Pablo Calleja Nandana
Mihindukulasooriya
Olga Giraldo
ogiraldo@fi.upm.es
oxgiraldo@gmail.com
Ontology engineering group (OEG)
Universidad Politécnica de Madrid

More Related Content

Viewers also liked

Using semantics and NLP in experimental protocols
Using semantics and NLP in experimental protocolsUsing semantics and NLP in experimental protocols
Using semantics and NLP in experimental protocolsOlga Ximena Giraldo
 
Semantic Formalization of Laboratory Protocols
Semantic Formalization of Laboratory Protocols Semantic Formalization of Laboratory Protocols
Semantic Formalization of Laboratory Protocols Olga Ximena Giraldo
 
Caching your rails application
Caching your rails applicationCaching your rails application
Caching your rails applicationArrrrCamp
 
Ruby 1.9 And Rails 3.0
Ruby 1.9 And Rails 3.0Ruby 1.9 And Rails 3.0
Ruby 1.9 And Rails 3.0ArrrrCamp
 
February 2010 Canadas S@H
February 2010 Canadas S@HFebruary 2010 Canadas S@H
February 2010 Canadas S@Hstephenfillmore
 
PSAsunday
PSAsundayPSAsunday
PSAsundayobbor
 
September 2000 House and Home
September 2000 House and HomeSeptember 2000 House and Home
September 2000 House and Homestephenfillmore
 
Ruby and Rails Basics
Ruby and Rails BasicsRuby and Rails Basics
Ruby and Rails BasicsArrrrCamp
 
December 2004 House and Home
December 2004  House and HomeDecember 2004  House and Home
December 2004 House and Homestephenfillmore
 
Library 2.0 Presentation: SC&I 550 Fall 2009
Library 2.0 Presentation: SC&I 550 Fall 2009Library 2.0 Presentation: SC&I 550 Fall 2009
Library 2.0 Presentation: SC&I 550 Fall 2009guest4bff3e6
 
SMART PROTOCOLS, Self Describing Executable Elements
SMART PROTOCOLS, Self Describing Executable ElementsSMART PROTOCOLS, Self Describing Executable Elements
SMART PROTOCOLS, Self Describing Executable ElementsOlga Ximena Giraldo
 
Arrrrcamp Radiant Intro
Arrrrcamp Radiant IntroArrrrcamp Radiant Intro
Arrrrcamp Radiant IntroArrrrCamp
 

Viewers also liked (17)

Olga giraldo flashtalk_icbo2015
Olga giraldo flashtalk_icbo2015Olga giraldo flashtalk_icbo2015
Olga giraldo flashtalk_icbo2015
 
Using semantics and NLP in experimental protocols
Using semantics and NLP in experimental protocolsUsing semantics and NLP in experimental protocols
Using semantics and NLP in experimental protocols
 
Semantic Formalization of Laboratory Protocols
Semantic Formalization of Laboratory Protocols Semantic Formalization of Laboratory Protocols
Semantic Formalization of Laboratory Protocols
 
Validation
ValidationValidation
Validation
 
Funny Twitter
Funny TwitterFunny Twitter
Funny Twitter
 
Nanoc
NanocNanoc
Nanoc
 
Caching your rails application
Caching your rails applicationCaching your rails application
Caching your rails application
 
Ruby 1.9 And Rails 3.0
Ruby 1.9 And Rails 3.0Ruby 1.9 And Rails 3.0
Ruby 1.9 And Rails 3.0
 
February 2010 Canadas S@H
February 2010 Canadas S@HFebruary 2010 Canadas S@H
February 2010 Canadas S@H
 
PSAsunday
PSAsundayPSAsunday
PSAsunday
 
September 2000 House and Home
September 2000 House and HomeSeptember 2000 House and Home
September 2000 House and Home
 
Ruby and Rails Basics
Ruby and Rails BasicsRuby and Rails Basics
Ruby and Rails Basics
 
December 2004 House and Home
December 2004  House and HomeDecember 2004  House and Home
December 2004 House and Home
 
Library 2.0 Presentation: SC&I 550 Fall 2009
Library 2.0 Presentation: SC&I 550 Fall 2009Library 2.0 Presentation: SC&I 550 Fall 2009
Library 2.0 Presentation: SC&I 550 Fall 2009
 
SMART PROTOCOLS, Self Describing Executable Elements
SMART PROTOCOLS, Self Describing Executable ElementsSMART PROTOCOLS, Self Describing Executable Elements
SMART PROTOCOLS, Self Describing Executable Elements
 
Arrrrcamp Radiant Intro
Arrrrcamp Radiant IntroArrrrcamp Radiant Intro
Arrrrcamp Radiant Intro
 
2012 Maritime Salary Review
2012 Maritime Salary Review2012 Maritime Salary Review
2012 Maritime Salary Review
 

Similar to SMART Protocols in LISC-2014

Research Objects for improved sharing and reproducibility
Research Objects for improved sharing and reproducibilityResearch Objects for improved sharing and reproducibility
Research Objects for improved sharing and reproducibilityOscar Corcho
 
Ontologies and semantic web
Ontologies and semantic webOntologies and semantic web
Ontologies and semantic webStanley Wang
 
Technical research writing
Technical research writing   Technical research writing
Technical research writing AJAL A J
 
Data analysis patterns, tools and data types in genomics
Data analysis patterns, tools and data types in genomicsData analysis patterns, tools and data types in genomics
Data analysis patterns, tools and data types in genomicsAltuna Akalin
 
OOPS!: on-line ontology diagnosis by Maria Poveda
OOPS!: on-line ontology diagnosis by Maria PovedaOOPS!: on-line ontology diagnosis by Maria Poveda
OOPS!: on-line ontology diagnosis by Maria Povedasemanticsconference
 
Towards Reproducible Science: a few building blocks from my personal experience
Towards Reproducible Science: a few building blocks from my personal experienceTowards Reproducible Science: a few building blocks from my personal experience
Towards Reproducible Science: a few building blocks from my personal experienceOscar Corcho
 
Semantics for integrated laboratory analytical processes - The Allotrope Pers...
Semantics for integrated laboratory analytical processes - The Allotrope Pers...Semantics for integrated laboratory analytical processes - The Allotrope Pers...
Semantics for integrated laboratory analytical processes - The Allotrope Pers...OSTHUS
 
Metagenomic Data Provenance and Management using the ISA infrastructure --- o...
Metagenomic Data Provenance and Management using the ISA infrastructure --- o...Metagenomic Data Provenance and Management using the ISA infrastructure --- o...
Metagenomic Data Provenance and Management using the ISA infrastructure --- o...Alejandra Gonzalez-Beltran
 
ISA-Tab Standards at Metabolomics Society Meeting, Tsuruoka 2014, Japan
ISA-Tab Standards at Metabolomics Society Meeting, Tsuruoka 2014, JapanISA-Tab Standards at Metabolomics Society Meeting, Tsuruoka 2014, Japan
ISA-Tab Standards at Metabolomics Society Meeting, Tsuruoka 2014, JapanPhilippe Rocca-Serra
 
From Zero to Nextflow 2017
From Zero to Nextflow 2017From Zero to Nextflow 2017
From Zero to Nextflow 2017Luca Cozzuto
 
From Scientific Workflows to Research Objects: Publication and Abstraction of...
From Scientific Workflows to Research Objects: Publication and Abstraction of...From Scientific Workflows to Research Objects: Publication and Abstraction of...
From Scientific Workflows to Research Objects: Publication and Abstraction of...dgarijo
 
OpenDiscovery
OpenDiscoveryOpenDiscovery
OpenDiscoverygwprice
 
one complete report from all the 4 labs.pdf
one complete report from all the 4 labs.pdfone complete report from all the 4 labs.pdf
one complete report from all the 4 labs.pdfstudy help
 
one complete report from all the 4 labs.pdf
one complete report from all the 4 labs.pdfone complete report from all the 4 labs.pdf
one complete report from all the 4 labs.pdfstudy help
 

Similar to SMART Protocols in LISC-2014 (20)

Phd tesis olga giraldo 10mayo
Phd tesis olga giraldo 10mayoPhd tesis olga giraldo 10mayo
Phd tesis olga giraldo 10mayo
 
Research Objects for improved sharing and reproducibility
Research Objects for improved sharing and reproducibilityResearch Objects for improved sharing and reproducibility
Research Objects for improved sharing and reproducibility
 
NGSonto and TyPon
NGSonto and TyPon NGSonto and TyPon
NGSonto and TyPon
 
Ontologies and semantic web
Ontologies and semantic webOntologies and semantic web
Ontologies and semantic web
 
Technical research writing
Technical research writing   Technical research writing
Technical research writing
 
Data analysis patterns, tools and data types in genomics
Data analysis patterns, tools and data types in genomicsData analysis patterns, tools and data types in genomics
Data analysis patterns, tools and data types in genomics
 
OOPS!: on-line ontology diagnosis by Maria Poveda
OOPS!: on-line ontology diagnosis by Maria PovedaOOPS!: on-line ontology diagnosis by Maria Poveda
OOPS!: on-line ontology diagnosis by Maria Poveda
 
Credible workshop
Credible workshopCredible workshop
Credible workshop
 
Towards Reproducible Science: a few building blocks from my personal experience
Towards Reproducible Science: a few building blocks from my personal experienceTowards Reproducible Science: a few building blocks from my personal experience
Towards Reproducible Science: a few building blocks from my personal experience
 
Semantics for integrated laboratory analytical processes - The Allotrope Pers...
Semantics for integrated laboratory analytical processes - The Allotrope Pers...Semantics for integrated laboratory analytical processes - The Allotrope Pers...
Semantics for integrated laboratory analytical processes - The Allotrope Pers...
 
Metagenomic Data Provenance and Management using the ISA infrastructure --- o...
Metagenomic Data Provenance and Management using the ISA infrastructure --- o...Metagenomic Data Provenance and Management using the ISA infrastructure --- o...
Metagenomic Data Provenance and Management using the ISA infrastructure --- o...
 
13680904.ppt
13680904.ppt13680904.ppt
13680904.ppt
 
ISA-Tab Standards at Metabolomics Society Meeting, Tsuruoka 2014, Japan
ISA-Tab Standards at Metabolomics Society Meeting, Tsuruoka 2014, JapanISA-Tab Standards at Metabolomics Society Meeting, Tsuruoka 2014, Japan
ISA-Tab Standards at Metabolomics Society Meeting, Tsuruoka 2014, Japan
 
From Zero to Nextflow 2017
From Zero to Nextflow 2017From Zero to Nextflow 2017
From Zero to Nextflow 2017
 
COPO kick-off meeting
COPO kick-off meetingCOPO kick-off meeting
COPO kick-off meeting
 
Ontology at Manchester
Ontology at ManchesterOntology at Manchester
Ontology at Manchester
 
From Scientific Workflows to Research Objects: Publication and Abstraction of...
From Scientific Workflows to Research Objects: Publication and Abstraction of...From Scientific Workflows to Research Objects: Publication and Abstraction of...
From Scientific Workflows to Research Objects: Publication and Abstraction of...
 
OpenDiscovery
OpenDiscoveryOpenDiscovery
OpenDiscovery
 
one complete report from all the 4 labs.pdf
one complete report from all the 4 labs.pdfone complete report from all the 4 labs.pdf
one complete report from all the 4 labs.pdf
 
one complete report from all the 4 labs.pdf
one complete report from all the 4 labs.pdfone complete report from all the 4 labs.pdf
one complete report from all the 4 labs.pdf
 

Recently uploaded

Seminar on Halal AGriculture and Fisheries.pptx
Seminar on Halal AGriculture and Fisheries.pptxSeminar on Halal AGriculture and Fisheries.pptx
Seminar on Halal AGriculture and Fisheries.pptxRUDYLUMAPINET2
 
FAIRSpectra - Towards a common data file format for SIMS images
FAIRSpectra - Towards a common data file format for SIMS imagesFAIRSpectra - Towards a common data file format for SIMS images
FAIRSpectra - Towards a common data file format for SIMS imagesAlex Henderson
 
GEOLOGICAL FIELD REPORT On Kaptai Rangamati Road-Cut Section.pdf
GEOLOGICAL FIELD REPORT  On  Kaptai Rangamati Road-Cut Section.pdfGEOLOGICAL FIELD REPORT  On  Kaptai Rangamati Road-Cut Section.pdf
GEOLOGICAL FIELD REPORT On Kaptai Rangamati Road-Cut Section.pdfUniversity of Barishal
 
insect taxonomy importance systematics and classification
insect taxonomy importance systematics and classificationinsect taxonomy importance systematics and classification
insect taxonomy importance systematics and classificationanitaento25
 
Microbial Type Culture Collection (MTCC)
Microbial Type Culture Collection (MTCC)Microbial Type Culture Collection (MTCC)
Microbial Type Culture Collection (MTCC)abhishekdhamu51
 
The ASGCT Annual Meeting was packed with exciting progress in the field advan...
The ASGCT Annual Meeting was packed with exciting progress in the field advan...The ASGCT Annual Meeting was packed with exciting progress in the field advan...
The ASGCT Annual Meeting was packed with exciting progress in the field advan...Health Advances
 
Cancer cell metabolism: special Reference to Lactate Pathway
Cancer cell metabolism: special Reference to Lactate PathwayCancer cell metabolism: special Reference to Lactate Pathway
Cancer cell metabolism: special Reference to Lactate PathwayAADYARAJPANDEY1
 
Multi-source connectivity as the driver of solar wind variability in the heli...
Multi-source connectivity as the driver of solar wind variability in the heli...Multi-source connectivity as the driver of solar wind variability in the heli...
Multi-source connectivity as the driver of solar wind variability in the heli...Sérgio Sacani
 
A Giant Impact Origin for the First Subduction on Earth
A Giant Impact Origin for the First Subduction on EarthA Giant Impact Origin for the First Subduction on Earth
A Giant Impact Origin for the First Subduction on EarthSérgio Sacani
 
Detectability of Solar Panels as a Technosignature
Detectability of Solar Panels as a TechnosignatureDetectability of Solar Panels as a Technosignature
Detectability of Solar Panels as a TechnosignatureSérgio Sacani
 
RNA INTERFERENCE: UNRAVELING GENETIC SILENCING
RNA INTERFERENCE: UNRAVELING GENETIC SILENCINGRNA INTERFERENCE: UNRAVELING GENETIC SILENCING
RNA INTERFERENCE: UNRAVELING GENETIC SILENCINGAADYARAJPANDEY1
 
electrochemical gas sensors and their uses.pptx
electrochemical gas sensors and their uses.pptxelectrochemical gas sensors and their uses.pptx
electrochemical gas sensors and their uses.pptxHusna Zaheer
 
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...muralinath2
 
GBSN - Biochemistry (Unit 5) Chemistry of Lipids
GBSN - Biochemistry (Unit 5) Chemistry of LipidsGBSN - Biochemistry (Unit 5) Chemistry of Lipids
GBSN - Biochemistry (Unit 5) Chemistry of LipidsAreesha Ahmad
 
Gliese 12 b, a temperate Earth-sized planet at 12 parsecs discovered with TES...
Gliese 12 b, a temperate Earth-sized planet at 12 parsecs discovered with TES...Gliese 12 b, a temperate Earth-sized planet at 12 parsecs discovered with TES...
Gliese 12 b, a temperate Earth-sized planet at 12 parsecs discovered with TES...Sérgio Sacani
 
Shuaib Y-basedComprehensive mahmudj.pptx
Shuaib Y-basedComprehensive mahmudj.pptxShuaib Y-basedComprehensive mahmudj.pptx
Shuaib Y-basedComprehensive mahmudj.pptxMdAbuRayhan16
 
Pests of Green Manures_Bionomics_IPM_Dr.UPR.pdf
Pests of Green Manures_Bionomics_IPM_Dr.UPR.pdfPests of Green Manures_Bionomics_IPM_Dr.UPR.pdf
Pests of Green Manures_Bionomics_IPM_Dr.UPR.pdfPirithiRaju
 
Aerodynamics. flippatterncn5tm5ttnj6nmnynyppt
Aerodynamics. flippatterncn5tm5ttnj6nmnynypptAerodynamics. flippatterncn5tm5ttnj6nmnynyppt
Aerodynamics. flippatterncn5tm5ttnj6nmnynypptsreddyrahul
 
Transport in plants G1.pptx Cambridge IGCSE
Transport in plants G1.pptx Cambridge IGCSETransport in plants G1.pptx Cambridge IGCSE
Transport in plants G1.pptx Cambridge IGCSEjordanparish425
 
GLOBAL AND LOCAL SCENARIO OF FOOD AND NUTRITION.pptx
GLOBAL AND LOCAL SCENARIO OF FOOD AND NUTRITION.pptxGLOBAL AND LOCAL SCENARIO OF FOOD AND NUTRITION.pptx
GLOBAL AND LOCAL SCENARIO OF FOOD AND NUTRITION.pptxSultanMuhammadGhauri
 

Recently uploaded (20)

Seminar on Halal AGriculture and Fisheries.pptx
Seminar on Halal AGriculture and Fisheries.pptxSeminar on Halal AGriculture and Fisheries.pptx
Seminar on Halal AGriculture and Fisheries.pptx
 
FAIRSpectra - Towards a common data file format for SIMS images
FAIRSpectra - Towards a common data file format for SIMS imagesFAIRSpectra - Towards a common data file format for SIMS images
FAIRSpectra - Towards a common data file format for SIMS images
 
GEOLOGICAL FIELD REPORT On Kaptai Rangamati Road-Cut Section.pdf
GEOLOGICAL FIELD REPORT  On  Kaptai Rangamati Road-Cut Section.pdfGEOLOGICAL FIELD REPORT  On  Kaptai Rangamati Road-Cut Section.pdf
GEOLOGICAL FIELD REPORT On Kaptai Rangamati Road-Cut Section.pdf
 
insect taxonomy importance systematics and classification
insect taxonomy importance systematics and classificationinsect taxonomy importance systematics and classification
insect taxonomy importance systematics and classification
 
Microbial Type Culture Collection (MTCC)
Microbial Type Culture Collection (MTCC)Microbial Type Culture Collection (MTCC)
Microbial Type Culture Collection (MTCC)
 
The ASGCT Annual Meeting was packed with exciting progress in the field advan...
The ASGCT Annual Meeting was packed with exciting progress in the field advan...The ASGCT Annual Meeting was packed with exciting progress in the field advan...
The ASGCT Annual Meeting was packed with exciting progress in the field advan...
 
Cancer cell metabolism: special Reference to Lactate Pathway
Cancer cell metabolism: special Reference to Lactate PathwayCancer cell metabolism: special Reference to Lactate Pathway
Cancer cell metabolism: special Reference to Lactate Pathway
 
Multi-source connectivity as the driver of solar wind variability in the heli...
Multi-source connectivity as the driver of solar wind variability in the heli...Multi-source connectivity as the driver of solar wind variability in the heli...
Multi-source connectivity as the driver of solar wind variability in the heli...
 
A Giant Impact Origin for the First Subduction on Earth
A Giant Impact Origin for the First Subduction on EarthA Giant Impact Origin for the First Subduction on Earth
A Giant Impact Origin for the First Subduction on Earth
 
Detectability of Solar Panels as a Technosignature
Detectability of Solar Panels as a TechnosignatureDetectability of Solar Panels as a Technosignature
Detectability of Solar Panels as a Technosignature
 
RNA INTERFERENCE: UNRAVELING GENETIC SILENCING
RNA INTERFERENCE: UNRAVELING GENETIC SILENCINGRNA INTERFERENCE: UNRAVELING GENETIC SILENCING
RNA INTERFERENCE: UNRAVELING GENETIC SILENCING
 
electrochemical gas sensors and their uses.pptx
electrochemical gas sensors and their uses.pptxelectrochemical gas sensors and their uses.pptx
electrochemical gas sensors and their uses.pptx
 
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
 
GBSN - Biochemistry (Unit 5) Chemistry of Lipids
GBSN - Biochemistry (Unit 5) Chemistry of LipidsGBSN - Biochemistry (Unit 5) Chemistry of Lipids
GBSN - Biochemistry (Unit 5) Chemistry of Lipids
 
Gliese 12 b, a temperate Earth-sized planet at 12 parsecs discovered with TES...
Gliese 12 b, a temperate Earth-sized planet at 12 parsecs discovered with TES...Gliese 12 b, a temperate Earth-sized planet at 12 parsecs discovered with TES...
Gliese 12 b, a temperate Earth-sized planet at 12 parsecs discovered with TES...
 
Shuaib Y-basedComprehensive mahmudj.pptx
Shuaib Y-basedComprehensive mahmudj.pptxShuaib Y-basedComprehensive mahmudj.pptx
Shuaib Y-basedComprehensive mahmudj.pptx
 
Pests of Green Manures_Bionomics_IPM_Dr.UPR.pdf
Pests of Green Manures_Bionomics_IPM_Dr.UPR.pdfPests of Green Manures_Bionomics_IPM_Dr.UPR.pdf
Pests of Green Manures_Bionomics_IPM_Dr.UPR.pdf
 
Aerodynamics. flippatterncn5tm5ttnj6nmnynyppt
Aerodynamics. flippatterncn5tm5ttnj6nmnynypptAerodynamics. flippatterncn5tm5ttnj6nmnynyppt
Aerodynamics. flippatterncn5tm5ttnj6nmnynyppt
 
Transport in plants G1.pptx Cambridge IGCSE
Transport in plants G1.pptx Cambridge IGCSETransport in plants G1.pptx Cambridge IGCSE
Transport in plants G1.pptx Cambridge IGCSE
 
GLOBAL AND LOCAL SCENARIO OF FOOD AND NUTRITION.pptx
GLOBAL AND LOCAL SCENARIO OF FOOD AND NUTRITION.pptxGLOBAL AND LOCAL SCENARIO OF FOOD AND NUTRITION.pptx
GLOBAL AND LOCAL SCENARIO OF FOOD AND NUTRITION.pptx
 

SMART Protocols in LISC-2014

  • 1. SMART Protocols: SeMAntic RepresenTation for Experimental Protocols Olga Giraldo ogiraldo@fi.upm.es Ontology engineering group (OEG) Universidad Politécnica de Madrid
  • 2. Agenda • What is a lab protocol • Motivation • Our general research question • Our assumption • Our propose • Preliminary results • Future work
  • 3. What is a lab protocol • Laboratory protocols are like cooking recipes • They have ingredients: reagents and sample, • They have appliances: equipment, • They have a total time, • They have a list of instructions, • They have critical steps. • The laboratory protocols are “the how to do” an experiment.
  • 4. Some problems in lab protocols  some of them present insufficient granularity,  the instructions can be imprecise or ambiguous due to the use of natural language. • Incubate the centrifuge tubes in a water bath. • Incubate the samples for 5 min with gentle shaking. • Rinse DNA briefly in 1-2 ml of wash. • Incubate at -20C overnight.
  • 5. Why do we need to formalize and extract information from lab protocols? Because we want a recommendation system… • That matches protocols according to my situation, for instance • samples I have, • availability of equipment, reagents, lab conditions • expertise We also want content based information retrieval • Meaningful sentences, sample used, purpose of the protocol, applicability, critical steps, etc. Also, identification of instructions • Find all protocols for DNA extraction that have been used in Oryza sativa that are suitable for processing a large number of samples with a low execution time. Motivation
  • 6. Currently… Semi-structured information Unstructured information How to formalize the information from laboratory protocols as a knowledge base? Ontologies + NLP tools
  • 7. Our assumption “Experimental protocols are fundamental information structures that should support the description of the processes by means of which results are generated in experimental research”
  • 9. Methods to represent and extract information • Gazetteer-based method: use existing lists of named entities  Lists of proper nouns, which refer to real-life entities • Rule-based approaches: write manual extraction rules • Combination of the above • Ontology model representing lab protocols
  • 10.
  • 12. Methodology used to develop SMART Protocols Kick-off • Gathering use cases. • Gathering competency questions. Conceptualization & Formalization • DAKA - Domain Analysis and Knowledge Acquisition  Analysis of 175 experimental protocols.1 • LISA - Linguistic and Semantic Analysis  Identification of key metadata for reporting protocols,2  Determination of workflow aspects in protocols (implicit order in the instructions, following the input output structure.)  Extraction of elements pertaining to domain knowledge. (e.g. classification of protocols in groups according to the purpose. Within each group were identified basic steps (or common patterns), according to the type of protocol. • IO - Iterative Ontology building  Design of conceptual maps and draft ontologies. The ontology modules were gathering from DAKA and LISA activities and exchanged with domain experts. Evaluation & Evolution • OWL • Correction of syntactic inconsistencies by using OWLViz3 and OOPS4 • The ontology model evolves as new knowledge goes through the whole cycle. 1http://goo.gl/MC4mR9 2goo.gl/gAVnn 3http://protegewiki.stanford.edu/wiki/OWLViz 4http://oeg-lia3.dia.fi.upm.es/oops/index-content.jsp
  • 13. SMART Protocols - document The Protocol as a document sp:application of the protocol sp:advantage of the protocol sp:limitation of the protocol sp:provenance of the protocol sp:purpose of the protocol sp:introduction section sp:buffer list sp:equipment and supplies list sp:kit list sp:primer list sp:reagent list sp:software list sp:solution list sp:materials section exact:caution sp:critical step sp:hint sp:pause point sp:storage condition sp:timing sp:troubleshooting sp:methods section sp:experimental protocol iao:document iao:document part iao:textual entity iao:data set owl:subClassOf ro:hasPart ro:partOf owl:subClassOf owl:subClassOfowl:subClassOf ro:hasPart ro:hasPart ro:hasPart ro:partOf ro:partOf ro:partOf owl:subClassOf owl:subClassOf exact:alert message owl:subClassOf  It is an extension of IAO ontology.  It supports rhetorical and structural components (e.g. introduction, materials, and methods);  It supports Information like application of the protocol, advantages and limitations, list of reagents, critical steps. SMART Protocols ontology is available here: http://vocab.linkeddata.es/S MARTProtocols/
  • 14. SMART Protocols - wf sp:basic step of DNA extraction p-plan:Step p-plan:Variable sp:cell disruption sp:plant tissue Basic Steps of DNA Extraction sp:DNA purification obi:DNA extract p-plan:hasInputVariable p-plan:hasOutputVariable p-plan:hasOutputVariable owl:subClassOf sp:digestion reaction sp:powdered tissue owl:subClassOf owl:subClassOf owl:subClassOf p-plan:hasInputVariable sp:digested contaminant p-plan:hasInputVariable p-plan:hasOutputVariable owl:subClassOfowl:subClassOfowl:subClassOfowl:subClassOf bfo:isPrecededBy bfo:isPrecededBy • It is an extension of the P-Plan Ontology. • It represents of the workflow aspects in protocols  implicit order in the instructions, following the input output structure. SMART Protocols ontology is available here: http://vocab.linkeddata.es/SM ARTProtocols/
  • 15. New and reused terms Resource No. of terms Resource No. of terms OBI 15 P-Plan 3 NCIthesaurus 9 NPO 3 CHEBI 7 EXACT 2 IAO 7 SO 2 MGEDOntology 3 MeSH 1 • Reused classes = 52 • Reused properties = 4 Property Origen Reused in isManufacturedBy OBI SMART Protocols-Document hasInputVar P-Plan SMART Protocols-Workflow hasOutputVar P-Plan SMART Protocols-Workflow isStepOfPlan P-Plan SMART Protocols-Workflow Ontology No. of classes No. of properties SMART Protocols-Document 60 7 SMART Protocols-Workflow 44 1 Total 104 8 • New terms
  • 17. • Analysis of the protocols. Focus on the identification of keywords and/or constructs in English –e.g. instructions, actions. • Writing rules. • Executing, testing and debugging the rules. Work in progress
  • 18. Summarizing… Our purpose is the formalization of lab protocols by using ontologies and NLP tools to intelligently extract information.
  • 19. Special thanks… Supervisors Oscar Corcho Alexander Garcia OEG’s colleagues Daniel Garijo María Poveda Pablo Calleja Nandana Mihindukulasooriya Olga Giraldo ogiraldo@fi.upm.es oxgiraldo@gmail.com Ontology engineering group (OEG) Universidad Politécnica de Madrid

Editor's Notes

  1. And as I mentioned before an experimental protocol is a how to do an experiment. For this reason our assumption is that experimental protocols are…
  2. What do we propose?
  3. These set of methods to represent and extract intelligent information from laboratory protocols: the first one is an ontology model… The use of gazetteer-based method, this is a list of entities or objects from lab protocols that we like to recovery. The manual creation of rules, And a combination of all of these methods.
  4. which results we have obtained
  5. The development of two ontology modules, one of them represent the metadata to report a laboratory protocol and the another module represent the protocol as a executable element.
  6. Currently, our ontologies reuse 52 classes from 10 ontologies. Also, reuse 4 properties form two ontologies and were proposed 104 new classes and 8 new properties. Both modules were designed in OWL, reuse the BFO ontology and in general follows the good practices recommended by OBO consortium for the design of ontologies.