SlideShare a Scribd company logo
Semantic Web Technologies: A
Paradigm for Medical
Informatics
Chimezie Ogbuji (Owner, Metacognition LLC.)
http://metacognition.info/presentations/SWTMedicalInformatics.pdf
http://metacognition.info/presentations/SWTMedicalInformatics.ppt
Who I am
 Circa 2001: Introduced to web standards and
Semantic Web technologies
 2003-2011: Lead architect of CCF in-house
clinical repository project
 2006-2011: Member representative of CCF in
World-wide Web Consortium (W3C)
◦ Editor of various standards and Semantic Web
Health Care and Life Sciences Interest Group
chair
 2011-2012: Senior Research Associate at
CWRU Center for Clinical Investigations
 2012-current: Started business providing
resource and data management software for
home healthcare agencies (Metacognition
LLC)
Medical Informatics
Challenges
 Semantic interoperability
◦ Exchange of data with common meaning
between sender and receiver
 Most of the intended benefits of HIT
depend on interoperability between
systems
 Difficulties integrating patient record
systems with other information resources
are among the major issues hampering
their effectiveness
◦ Interoperability is a major goal for meaningful
use of Electronic Health Records (EHR)
Rodrigues et al. 2013; Kadry et al. 2010; Shortliffe and Cimino, 2006
Requirements and Solutions
 Semantic interoperability requires:
◦ Structured data
◦ A common controlled vocabulary
 Solutions emphasize the meaning of
data rather than how they are
structured
◦ “Semantic” paradigms
Registries and Research DBs
 Patient registries and clinical research
repositories capture data elements in
a uniform manner
 The structure of the underlying data
needs to be able to evolve along with
the investigations they support
 Thus, schema extensibility is
important
Querying Interfaces
 Standardized interfaces for querying
facilitate:
◦ Accessibility to clinical information
systems
◦ Distributed querying of data from where
they reside
 Requires:
◦ Semantically-equivalent data structures
 Alternatively, data are centralized in
data warehouses
Austin et al. 2007, “Implementation of a query interface for a generic record server”
Biomedical Ontologies
 Ontologies are artifacts that
conceptualize a domain as a taxonomy
of classes and constraints on
relationships between their members
 Represented in a particular formalism
 Increasingly adopted as a foundation for
the next generation of biomedical
vocabularies
 Construction involves representing a
domain of interest independent of
behavior of applications using an
ontology
 Important means towards achieving
semantic interoperability
Biomedical Ontology
Communities
 Prominent examples of adoption by
life science and healthcare
terminology communities:
◦ The Open Biological and Biomedical
Ontologies (OBO) Foundry
◦ Gene Ontology (GO)
◦ National Center for Biomedical Ontology
(NCBO) Bioportal
◦ International Health Terminology
Standards Development Organization
(IHTSDO)
Semantic Web and
Technologies
 The Semantic Web is a vision of how
the existing infrastructure of the
World-wide Web (WWW) can be
extended such that machines can
interpret the meaning of data on it
 Semantic Web technologies are the
standards and technologies that have
been developed to achieve the vision
An Analogy
 (Technological) singularity is a
theoretical moment when artificial
intelligence (AI) will have progressed
to a greater-than-human intelligence
 Despite remaining in the realm of
science fiction, it has motivated many
useful developments along the way
◦ The use of ontologies for knowledge
representation and IBM Watson
capabilities, for example
Background: Graphs
 Graphs are data structures
comprising nodes and edges that
connect them
 The edges can be directional
 Either the nodes, the edges, or both
can be labeled
 The labels provide meaning to the
graphs (edge labels in particular)
Node Nodeedge
Resource Description
Framework
 The Resource Description Framework
(RDF) is a graph-based knowledge
representation language for describing
resources
 It’s edges are directional and both
nodes and edges are labeled
 It uses Universal Resource Identifiers
(URI) for labeling
 Foundation for Semantic Web
technologies
RDF: Continued
 The edges are statements (triples) that
go from a subject to an object
 Some objects are text values
 Some subjects and objects can be left
unlabeled (Blank nodes)
◦ Anonymous resources: not important to label
them uniquely
 The URI of the edge is the predicate
 Predicates used together for a common
purpose are a vocabulary
 Subject: Dr. X (a URI)
 Object: Chime
 Predicate: treats
 Vocabulary:
◦ treats, subject of record, author, and full
name
Chime
Dr. X
treats subject of record
author
"Chimezie Ogbuji"full name
RDF vocabularies
 How meaning is interpreted from an RDF
graph
 There are vocabularies that constrain how
predicates are used
◦ Want a sense of treats where the subject is a
clinician and the object is a patient
 There is a predicate relating resources to the
classes they are a member of (type)
 There are vocabularies that define
constraints on class hierarchies
 These comprise a basic RDF Schema
(RDFS) language
 Represented as an RDF graph
Chime
Dr. X
treats subject of record
Patient
Physician
type
type
Hypertension DX
Clinical Diagnosis
type
is a
authorPerson
is a
is a
Ontologies for RDF
 The Ontology Web Language (OWL)
is used to describe ontologies for RDF
graphs
 More sophisticated constraints than
RDFS
 Commonly expressed as an RDF
graph
 Defines the meaning of RDF
statements through constraints:
◦ On their predicates
◦ On the classes the resources they relate
Chime
Dr. X
treats subject of record
Patient
Physician
type
type
Hypertension DX
Clinical Diagnosis
type
is a
authorPerson
is a
is a
Governed by OWL/ RDFS for domain
OWL Formats
 Most common format for describing
ontologies
 Distribution format of ontologies in the
NCBO BioPortal
 SNOMED CT distributions include an
OWL representation
◦ RDF graphs can describe medical content
in a SNOMED CT-compliant way through
the use of this vocabulary
Validation and Deduction
 OWL is based on a formal,
mathematical logic that can be used
for validating the structure of an
ontology and RDF data that conform
to it (consistency checking)
 Used to deduce additional RDF
statements implied by the meaning of
a given RDF graph (logical inference)
 Logical reasoners are used for this
Inference
 Can infer anatomical location from
SNOMED CT definitions
Hypertension DX
type
finding site
Systemic circulatory
system structure
type
Hypertension DX <-> 1201005 / “Benign essential hypertension (disorder)
Querying RDF Graphs
 SPARQL is the official query language
for RDF graphs
 Comparable to relational query
languages
◦ Primary difference: it queries RDF triples,
whereas SQL queries tables of arbitrary
dimensions
 Includes various web protocols for
querying RDF graphs
 Foundation of SPARQL is the triple
pattern
 (?clinician, treats, ?patient)
◦ ?clinician and ?patient are variables (like a
wildcard)
?patient
?physician ?dx
treats subject of record
author
Hypertension DX
type
Which physicians have given essential hypertension diagnoses and to w
(?physician, author, ?dx)
(?physician, treats, ?patient)
(?dx, subject of record, ?patient)
(?dx, type, Hypertension DX)
?physician ?patient ?dx
Dr. X Chime …
SPARQL over Relational Data
 Most common implementations
convert SPARQL to SQL and evaluate
over:
◦ a relational databases designed for RDF
storage
◦ an existing relational database
 There are products for both
approaches
 Former requires native storage of RDF
◦ Relational structure doesn’t change even
as RDF vocabulary does (schemaElliot et al. 2009, “A Complete Translation from SPARQL into Efficient SQL”
SPARQL over Existing Relation
Data
 “Virtual RDF view”
◦ Translation to SQL follows a given
mapping from existing relational
structures to an RDF vocabulary
◦ Allows non-disruptive evolution of existing
systems
◦ Well-suited as a standard querying
interface over clinical data repositories
◦ They can be queried as SPARQL,
securely over encrypted HTTP
Relational RDF (SNOMED CT perhaps)
Mapping and
Translation layer
Secure HTTP
SPARQL
SQL
Legacy / existing
applications
Patient registry or
data repository
3rd party applications
SQL
Example: Cleveland Clinic
(SemanticDB)
 Content repository and data
production system released in Jan.
2008
 80 million (native) RDF statements
◦ Uses vocabulary from a patient record
OWL ontology for the registry
 Based on
◦ Existing registry of heart surgery and CV
interventions
◦ 200,000 patient records
◦ Generating over 100 publications per year
Pierce et al. 2012, “SemanticDB: A Semantic Web Infrastructure for Clinical Research and Quality Reporting
Cohort Identification
 Interface developed in conjunction
with Cycorp
 Leverage their logical reasoning
system (Cyc)
◦ Identifies cohorts using natural language
(NL) sentence fragments
◦ Converts fragments to SPARQL
◦ SPARQL is evaluated against RDF store
Example: Mayo Clinic
(MCLSS)
 Mayo Clinic Life Sciences System
(MCLSS)
◦ Effort to represent Mayo Clinic EHR data
as RDF graphs
◦ Patient demographics, diagnoses,
procedures, lab results, and free-text
notes
◦ Goal was to wrap MCLSS relational
database and expose as read-only, query-
able RDF graphs that conform to standard
ontologiesPathak et al. 2012, "Using Semantic Web Technologies for Cohort Identification from Electronic
Health Records for Clinical Research"
Example: Mayo Clinic (CEM)
 Clinical Element Model (CEM)
◦ Represents logical structure of data in
EHR
◦ Goal: translate CEM definitions into OWL
and patient (instance) data into
conformant RDF
◦ Use tools (logical reasoners) to check
semantic consistency of the ontology,
instance data, and to extract new
knowledge via deduction
◦ Instance data validation:
 correct number of linked components, value
within data range, existence of units, etc.
Tao et al. 2012, ”A semantic-web oriented representation of the clinical element model for
secondary use of electronic health records data"
Summary
 Schema extensibility
◦ Use of RDF
 Semantic Interoperability
◦ Domain modeling using OWL and RDFS
 Standardized query interfaces
◦ Querying over SPARQL
 Incremental, non-disruptive adoption
◦ Virtual RDF views
 Main challenge: highly disruptive
innovation

More Related Content

What's hot

AnIML: A New Analytical Data Standard
AnIML: A New Analytical Data StandardAnIML: A New Analytical Data Standard
AnIML: A New Analytical Data Standard
Stuart Chalk
 
Cedar OnDemand: An intelligent browser extension to generate ontology-based m...
Cedar OnDemand: An intelligent browser extension to generate ontology-based m...Cedar OnDemand: An intelligent browser extension to generate ontology-based m...
Cedar OnDemand: An intelligent browser extension to generate ontology-based m...
Syed Ahmad Chan Bukhari, PhD
 
Toward Semantic Representation of Science in Electronic Laboratory Notebooks ...
Toward Semantic Representation of Science in Electronic Laboratory Notebooks ...Toward Semantic Representation of Science in Electronic Laboratory Notebooks ...
Toward Semantic Representation of Science in Electronic Laboratory Notebooks ...
Stuart Chalk
 
Ontology For Data Integration
Ontology For Data IntegrationOntology For Data Integration
Ontology For Data Integration
juanesteva
 
Exploration of a Data Landscape using a Collaborative Linked Data Framework.
Exploration of a Data Landscape using a Collaborative Linked Data Framework.Exploration of a Data Landscape using a Collaborative Linked Data Framework.
Exploration of a Data Landscape using a Collaborative Linked Data Framework.
Laurent Alquier
 
FAIRness through a novel combination of Web technologies
FAIRness through a novel combination of Web technologiesFAIRness through a novel combination of Web technologies
FAIRness through a novel combination of Web technologies
Research Data Alliance
 
Semantic Web: Technolgies and Applications for Real-World
Semantic Web: Technolgies and Applications for Real-WorldSemantic Web: Technolgies and Applications for Real-World
Semantic Web: Technolgies and Applications for Real-World
Amit Sheth
 
A semantic framework for biomedical image discovery
A semantic framework for biomedical image discoveryA semantic framework for biomedical image discovery
A semantic framework for biomedical image discovery
Syed Ahmad Chan Bukhari, PhD
 
Aleksandar Zivaljevic - Annotation of clinical datasets using openEHR Archety...
Aleksandar Zivaljevic - Annotation of clinical datasets using openEHR Archety...Aleksandar Zivaljevic - Annotation of clinical datasets using openEHR Archety...
Aleksandar Zivaljevic - Annotation of clinical datasets using openEHR Archety...
Health Informatics New Zealand
 
Bringing Things Together and Linking to Health Information using openEHR
Bringing Things Together and Linking to Health Information using openEHRBringing Things Together and Linking to Health Information using openEHR
Bringing Things Together and Linking to Health Information using openEHR
Koray Atalag
 
SNOMED Bound to (Information) Model | Putting terminology to work
SNOMED Bound to (Information) Model | Putting terminology to workSNOMED Bound to (Information) Model | Putting terminology to work
SNOMED Bound to (Information) Model | Putting terminology to work
Koray Atalag
 
Presentation_euroCRIS_ES
Presentation_euroCRIS_ESPresentation_euroCRIS_ES
Presentation_euroCRIS_ES
Ed Simons
 
Effective Classification of Clinical Reports: Natural Language Processing-Bas...
Effective Classification of Clinical Reports: Natural Language Processing-Bas...Effective Classification of Clinical Reports: Natural Language Processing-Bas...
Effective Classification of Clinical Reports: Natural Language Processing-Bas...
Efsun Kayi
 
Clinical Models - What Are They Good For?
Clinical Models - What Are They Good For?Clinical Models - What Are They Good For?
Clinical Models - What Are They Good For?
Health Informatics New Zealand
 
CDISC2RDF overview with examples
CDISC2RDF overview with examplesCDISC2RDF overview with examples
CDISC2RDF overview with examples
Kerstin Forsberg
 
CDISC's CDASH and SDTM: Why You Need Both!
CDISC's CDASH and SDTM: Why You Need Both!CDISC's CDASH and SDTM: Why You Need Both!
CDISC's CDASH and SDTM: Why You Need Both!
Kit Howard
 
Ontology-based Data Integration
Ontology-based Data IntegrationOntology-based Data Integration
Ontology-based Data Integration
Janna Hastings
 
IHE Cross-Enterprise Document Sharing (XDS)
IHE Cross-Enterprise Document Sharing (XDS)IHE Cross-Enterprise Document Sharing (XDS)
IHE Cross-Enterprise Document Sharing (XDS)
HL7 New Zealand
 
Acupulco cda access v3-1
Acupulco cda access v3-1Acupulco cda access v3-1
Acupulco cda access v3-1
eyetech
 
Embracing Semantic Technology for Better Metadata Authoring in Biomedicine (S...
Embracing Semantic Technology for Better Metadata Authoring in Biomedicine (S...Embracing Semantic Technology for Better Metadata Authoring in Biomedicine (S...
Embracing Semantic Technology for Better Metadata Authoring in Biomedicine (S...
CEDAR: Center for Expanded Data Annotation and Retrieval
 

What's hot (20)

AnIML: A New Analytical Data Standard
AnIML: A New Analytical Data StandardAnIML: A New Analytical Data Standard
AnIML: A New Analytical Data Standard
 
Cedar OnDemand: An intelligent browser extension to generate ontology-based m...
Cedar OnDemand: An intelligent browser extension to generate ontology-based m...Cedar OnDemand: An intelligent browser extension to generate ontology-based m...
Cedar OnDemand: An intelligent browser extension to generate ontology-based m...
 
Toward Semantic Representation of Science in Electronic Laboratory Notebooks ...
Toward Semantic Representation of Science in Electronic Laboratory Notebooks ...Toward Semantic Representation of Science in Electronic Laboratory Notebooks ...
Toward Semantic Representation of Science in Electronic Laboratory Notebooks ...
 
Ontology For Data Integration
Ontology For Data IntegrationOntology For Data Integration
Ontology For Data Integration
 
Exploration of a Data Landscape using a Collaborative Linked Data Framework.
Exploration of a Data Landscape using a Collaborative Linked Data Framework.Exploration of a Data Landscape using a Collaborative Linked Data Framework.
Exploration of a Data Landscape using a Collaborative Linked Data Framework.
 
FAIRness through a novel combination of Web technologies
FAIRness through a novel combination of Web technologiesFAIRness through a novel combination of Web technologies
FAIRness through a novel combination of Web technologies
 
Semantic Web: Technolgies and Applications for Real-World
Semantic Web: Technolgies and Applications for Real-WorldSemantic Web: Technolgies and Applications for Real-World
Semantic Web: Technolgies and Applications for Real-World
 
A semantic framework for biomedical image discovery
A semantic framework for biomedical image discoveryA semantic framework for biomedical image discovery
A semantic framework for biomedical image discovery
 
Aleksandar Zivaljevic - Annotation of clinical datasets using openEHR Archety...
Aleksandar Zivaljevic - Annotation of clinical datasets using openEHR Archety...Aleksandar Zivaljevic - Annotation of clinical datasets using openEHR Archety...
Aleksandar Zivaljevic - Annotation of clinical datasets using openEHR Archety...
 
Bringing Things Together and Linking to Health Information using openEHR
Bringing Things Together and Linking to Health Information using openEHRBringing Things Together and Linking to Health Information using openEHR
Bringing Things Together and Linking to Health Information using openEHR
 
SNOMED Bound to (Information) Model | Putting terminology to work
SNOMED Bound to (Information) Model | Putting terminology to workSNOMED Bound to (Information) Model | Putting terminology to work
SNOMED Bound to (Information) Model | Putting terminology to work
 
Presentation_euroCRIS_ES
Presentation_euroCRIS_ESPresentation_euroCRIS_ES
Presentation_euroCRIS_ES
 
Effective Classification of Clinical Reports: Natural Language Processing-Bas...
Effective Classification of Clinical Reports: Natural Language Processing-Bas...Effective Classification of Clinical Reports: Natural Language Processing-Bas...
Effective Classification of Clinical Reports: Natural Language Processing-Bas...
 
Clinical Models - What Are They Good For?
Clinical Models - What Are They Good For?Clinical Models - What Are They Good For?
Clinical Models - What Are They Good For?
 
CDISC2RDF overview with examples
CDISC2RDF overview with examplesCDISC2RDF overview with examples
CDISC2RDF overview with examples
 
CDISC's CDASH and SDTM: Why You Need Both!
CDISC's CDASH and SDTM: Why You Need Both!CDISC's CDASH and SDTM: Why You Need Both!
CDISC's CDASH and SDTM: Why You Need Both!
 
Ontology-based Data Integration
Ontology-based Data IntegrationOntology-based Data Integration
Ontology-based Data Integration
 
IHE Cross-Enterprise Document Sharing (XDS)
IHE Cross-Enterprise Document Sharing (XDS)IHE Cross-Enterprise Document Sharing (XDS)
IHE Cross-Enterprise Document Sharing (XDS)
 
Acupulco cda access v3-1
Acupulco cda access v3-1Acupulco cda access v3-1
Acupulco cda access v3-1
 
Embracing Semantic Technology for Better Metadata Authoring in Biomedicine (S...
Embracing Semantic Technology for Better Metadata Authoring in Biomedicine (S...Embracing Semantic Technology for Better Metadata Authoring in Biomedicine (S...
Embracing Semantic Technology for Better Metadata Authoring in Biomedicine (S...
 

Similar to Semantic Web Technologies: A Paradigm for Medical Informatics

A Semantic Web based Framework for Linking Healthcare Information with Comput...
A Semantic Web based Framework for Linking Healthcare Information with Comput...A Semantic Web based Framework for Linking Healthcare Information with Comput...
A Semantic Web based Framework for Linking Healthcare Information with Comput...
Koray Atalag
 
The Logical Model Designer - Binding Information Models to Terminology
The Logical Model Designer - Binding Information Models to TerminologyThe Logical Model Designer - Binding Information Models to Terminology
The Logical Model Designer - Binding Information Models to Terminology
Snow Owl
 
Semantic Web Technologies as a Framework for Clinical Informatics
Semantic Web Technologies as a Framework for Clinical InformaticsSemantic Web Technologies as a Framework for Clinical Informatics
Semantic Web Technologies as a Framework for Clinical Informatics
Chimezie Ogbuji
 
Reference Domain Ontologies and Large Medical Language Models.pptx
Reference Domain Ontologies and Large Medical Language Models.pptxReference Domain Ontologies and Large Medical Language Models.pptx
Reference Domain Ontologies and Large Medical Language Models.pptx
Chimezie Ogbuji
 
Health Datapalooza 2013: Open Government Data - Natasha Noy
Health Datapalooza 2013: Open Government Data - Natasha NoyHealth Datapalooza 2013: Open Government Data - Natasha Noy
Health Datapalooza 2013: Open Government Data - Natasha Noy
Health Data Consortium
 
Metadata requirements4healthportals mie2015
Metadata requirements4healthportals mie2015Metadata requirements4healthportals mie2015
Metadata requirements4healthportals mie2015
Tim Benson
 
A Reason Able View To The Web Of Pathway Data
A Reason Able View To The Web Of Pathway DataA Reason Able View To The Web Of Pathway Data
A Reason Able View To The Web Of Pathway Data
guest9fc5f3
 
Implementation and Use of ISO EN 13606 and openEHR
Implementation and Use of ISO EN 13606 and openEHRImplementation and Use of ISO EN 13606 and openEHR
Implementation and Use of ISO EN 13606 and openEHR
Koray Atalag
 
Enhancing the Quality of ImmPort Data
Enhancing the Quality of ImmPort DataEnhancing the Quality of ImmPort Data
Enhancing the Quality of ImmPort Data
Barry Smith
 
Exploring HL7 CDA & Its Structures
Exploring HL7 CDA & Its StructuresExploring HL7 CDA & Its Structures
Exploring HL7 CDA & Its Structures
Nawanan Theera-Ampornpunt
 
UniProt and the Semantic Web
UniProt and the Semantic WebUniProt and the Semantic Web
UniProt and the Semantic Web
Chimezie Ogbuji
 
Linked data presentation for who umc 21 jan 2015
Linked data presentation for who umc 21 jan 2015Linked data presentation for who umc 21 jan 2015
Linked data presentation for who umc 21 jan 2015
Kerstin Forsberg
 
Resource Description Framework Approach to Data Publication and Federation
Resource Description Framework Approach to Data Publication and FederationResource Description Framework Approach to Data Publication and Federation
Resource Description Framework Approach to Data Publication and Federation
Pistoia Alliance
 
Enabling faster analysis of vaccine adverse event reports with ontology support
Enabling faster analysis of vaccine adverse event reports with ontology supportEnabling faster analysis of vaccine adverse event reports with ontology support
Enabling faster analysis of vaccine adverse event reports with ontology support
Melanie Courtot
 
Archetypes and FHIR by Koray Atalag
Archetypes and FHIR by Koray AtalagArchetypes and FHIR by Koray Atalag
Archetypes and FHIR by Koray Atalag
David Hay
 
The Electronic Notebook Ontology
The Electronic Notebook OntologyThe Electronic Notebook Ontology
The Electronic Notebook Ontology
Stuart Chalk
 
Semantic Web use cases in outcomes research
Semantic Web use cases in outcomes researchSemantic Web use cases in outcomes research
Semantic Web use cases in outcomes research
Chimezie Ogbuji
 
AeHIN 28 August, 2014 - Innovation in Healthcare IT Standards: The Path to Bi...
AeHIN 28 August, 2014 - Innovation in Healthcare IT Standards: The Path to Bi...AeHIN 28 August, 2014 - Innovation in Healthcare IT Standards: The Path to Bi...
AeHIN 28 August, 2014 - Innovation in Healthcare IT Standards: The Path to Bi...
Timothy Cook
 
Use of CEDAR Technology for Ontology-based Submission of Biomedical Data to ...
 Use of CEDAR Technology for Ontology-based Submission of Biomedical Data to ... Use of CEDAR Technology for Ontology-based Submission of Biomedical Data to ...
Use of CEDAR Technology for Ontology-based Submission of Biomedical Data to ...
Syed Ahmad Chan Bukhari, PhD
 
Ontology Web Services for Semantic Applications
Ontology Web Services for Semantic Applications Ontology Web Services for Semantic Applications
Ontology Web Services for Semantic Applications
Trish Whetzel
 

Similar to Semantic Web Technologies: A Paradigm for Medical Informatics (20)

A Semantic Web based Framework for Linking Healthcare Information with Comput...
A Semantic Web based Framework for Linking Healthcare Information with Comput...A Semantic Web based Framework for Linking Healthcare Information with Comput...
A Semantic Web based Framework for Linking Healthcare Information with Comput...
 
The Logical Model Designer - Binding Information Models to Terminology
The Logical Model Designer - Binding Information Models to TerminologyThe Logical Model Designer - Binding Information Models to Terminology
The Logical Model Designer - Binding Information Models to Terminology
 
Semantic Web Technologies as a Framework for Clinical Informatics
Semantic Web Technologies as a Framework for Clinical InformaticsSemantic Web Technologies as a Framework for Clinical Informatics
Semantic Web Technologies as a Framework for Clinical Informatics
 
Reference Domain Ontologies and Large Medical Language Models.pptx
Reference Domain Ontologies and Large Medical Language Models.pptxReference Domain Ontologies and Large Medical Language Models.pptx
Reference Domain Ontologies and Large Medical Language Models.pptx
 
Health Datapalooza 2013: Open Government Data - Natasha Noy
Health Datapalooza 2013: Open Government Data - Natasha NoyHealth Datapalooza 2013: Open Government Data - Natasha Noy
Health Datapalooza 2013: Open Government Data - Natasha Noy
 
Metadata requirements4healthportals mie2015
Metadata requirements4healthportals mie2015Metadata requirements4healthportals mie2015
Metadata requirements4healthportals mie2015
 
A Reason Able View To The Web Of Pathway Data
A Reason Able View To The Web Of Pathway DataA Reason Able View To The Web Of Pathway Data
A Reason Able View To The Web Of Pathway Data
 
Implementation and Use of ISO EN 13606 and openEHR
Implementation and Use of ISO EN 13606 and openEHRImplementation and Use of ISO EN 13606 and openEHR
Implementation and Use of ISO EN 13606 and openEHR
 
Enhancing the Quality of ImmPort Data
Enhancing the Quality of ImmPort DataEnhancing the Quality of ImmPort Data
Enhancing the Quality of ImmPort Data
 
Exploring HL7 CDA & Its Structures
Exploring HL7 CDA & Its StructuresExploring HL7 CDA & Its Structures
Exploring HL7 CDA & Its Structures
 
UniProt and the Semantic Web
UniProt and the Semantic WebUniProt and the Semantic Web
UniProt and the Semantic Web
 
Linked data presentation for who umc 21 jan 2015
Linked data presentation for who umc 21 jan 2015Linked data presentation for who umc 21 jan 2015
Linked data presentation for who umc 21 jan 2015
 
Resource Description Framework Approach to Data Publication and Federation
Resource Description Framework Approach to Data Publication and FederationResource Description Framework Approach to Data Publication and Federation
Resource Description Framework Approach to Data Publication and Federation
 
Enabling faster analysis of vaccine adverse event reports with ontology support
Enabling faster analysis of vaccine adverse event reports with ontology supportEnabling faster analysis of vaccine adverse event reports with ontology support
Enabling faster analysis of vaccine adverse event reports with ontology support
 
Archetypes and FHIR by Koray Atalag
Archetypes and FHIR by Koray AtalagArchetypes and FHIR by Koray Atalag
Archetypes and FHIR by Koray Atalag
 
The Electronic Notebook Ontology
The Electronic Notebook OntologyThe Electronic Notebook Ontology
The Electronic Notebook Ontology
 
Semantic Web use cases in outcomes research
Semantic Web use cases in outcomes researchSemantic Web use cases in outcomes research
Semantic Web use cases in outcomes research
 
AeHIN 28 August, 2014 - Innovation in Healthcare IT Standards: The Path to Bi...
AeHIN 28 August, 2014 - Innovation in Healthcare IT Standards: The Path to Bi...AeHIN 28 August, 2014 - Innovation in Healthcare IT Standards: The Path to Bi...
AeHIN 28 August, 2014 - Innovation in Healthcare IT Standards: The Path to Bi...
 
Use of CEDAR Technology for Ontology-based Submission of Biomedical Data to ...
 Use of CEDAR Technology for Ontology-based Submission of Biomedical Data to ... Use of CEDAR Technology for Ontology-based Submission of Biomedical Data to ...
Use of CEDAR Technology for Ontology-based Submission of Biomedical Data to ...
 
Ontology Web Services for Semantic Applications
Ontology Web Services for Semantic Applications Ontology Web Services for Semantic Applications
Ontology Web Services for Semantic Applications
 

More from Chimezie Ogbuji

Using OWL for the RESO Data Dictionary
Using OWL for the RESO Data DictionaryUsing OWL for the RESO Data Dictionary
Using OWL for the RESO Data Dictionary
Chimezie Ogbuji
 
Integrating Large, Disparate, Biomedical Ontologies to Boost Organ Developmen...
Integrating Large, Disparate, Biomedical Ontologies to Boost Organ Developmen...Integrating Large, Disparate, Biomedical Ontologies to Boost Organ Developmen...
Integrating Large, Disparate, Biomedical Ontologies to Boost Organ Developmen...
Chimezie Ogbuji
 
Automated clinicalontologyextraction
Automated clinicalontologyextractionAutomated clinicalontologyextraction
Automated clinicalontologyextraction
Chimezie Ogbuji
 
GRDDL: The Why, What, How, and Where
GRDDL: The Why, What, How, and WhereGRDDL: The Why, What, How, and Where
GRDDL: The Why, What, How, and Where
Chimezie Ogbuji
 
GRDDL: A Pictorial Approach
GRDDL: A Pictorial ApproachGRDDL: A Pictorial Approach
GRDDL: A Pictorial Approach
Chimezie Ogbuji
 
Tools for Next Generation of CMS: XML, RDF, & GRDDL
Tools for Next Generation of CMS: XML, RDF, & GRDDLTools for Next Generation of CMS: XML, RDF, & GRDDL
Tools for Next Generation of CMS: XML, RDF, & GRDDL
Chimezie Ogbuji
 
Segmenting & Merging Domain-specific Modules for Clinical Informatics
Segmenting & Merging Domain-specific Modules for Clinical InformaticsSegmenting & Merging Domain-specific Modules for Clinical Informatics
Segmenting & Merging Domain-specific Modules for Clinical Informatics
Chimezie Ogbuji
 
Overview of CPR Ontology
Overview of CPR OntologyOverview of CPR Ontology
Overview of CPR Ontology
Chimezie Ogbuji
 
The Characteristics of a RESTful Semantic Web and Why They Are Important
The Characteristics of a RESTful Semantic Web and Why They Are ImportantThe Characteristics of a RESTful Semantic Web and Why They Are Important
The Characteristics of a RESTful Semantic Web and Why They Are Important
Chimezie Ogbuji
 

More from Chimezie Ogbuji (9)

Using OWL for the RESO Data Dictionary
Using OWL for the RESO Data DictionaryUsing OWL for the RESO Data Dictionary
Using OWL for the RESO Data Dictionary
 
Integrating Large, Disparate, Biomedical Ontologies to Boost Organ Developmen...
Integrating Large, Disparate, Biomedical Ontologies to Boost Organ Developmen...Integrating Large, Disparate, Biomedical Ontologies to Boost Organ Developmen...
Integrating Large, Disparate, Biomedical Ontologies to Boost Organ Developmen...
 
Automated clinicalontologyextraction
Automated clinicalontologyextractionAutomated clinicalontologyextraction
Automated clinicalontologyextraction
 
GRDDL: The Why, What, How, and Where
GRDDL: The Why, What, How, and WhereGRDDL: The Why, What, How, and Where
GRDDL: The Why, What, How, and Where
 
GRDDL: A Pictorial Approach
GRDDL: A Pictorial ApproachGRDDL: A Pictorial Approach
GRDDL: A Pictorial Approach
 
Tools for Next Generation of CMS: XML, RDF, & GRDDL
Tools for Next Generation of CMS: XML, RDF, & GRDDLTools for Next Generation of CMS: XML, RDF, & GRDDL
Tools for Next Generation of CMS: XML, RDF, & GRDDL
 
Segmenting & Merging Domain-specific Modules for Clinical Informatics
Segmenting & Merging Domain-specific Modules for Clinical InformaticsSegmenting & Merging Domain-specific Modules for Clinical Informatics
Segmenting & Merging Domain-specific Modules for Clinical Informatics
 
Overview of CPR Ontology
Overview of CPR OntologyOverview of CPR Ontology
Overview of CPR Ontology
 
The Characteristics of a RESTful Semantic Web and Why They Are Important
The Characteristics of a RESTful Semantic Web and Why They Are ImportantThe Characteristics of a RESTful Semantic Web and Why They Are Important
The Characteristics of a RESTful Semantic Web and Why They Are Important
 

Recently uploaded

20240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 202420240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 2024
Matthew Sinclair
 
Best 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERPBest 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERP
Pixlogix Infotech
 
Digital Marketing Trends in 2024 | Guide for Staying Ahead
Digital Marketing Trends in 2024 | Guide for Staying AheadDigital Marketing Trends in 2024 | Guide for Staying Ahead
Digital Marketing Trends in 2024 | Guide for Staying Ahead
Wask
 
Serial Arm Control in Real Time Presentation
Serial Arm Control in Real Time PresentationSerial Arm Control in Real Time Presentation
Serial Arm Control in Real Time Presentation
tolgahangng
 
Taking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdfTaking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdf
ssuserfac0301
 
AI 101: An Introduction to the Basics and Impact of Artificial Intelligence
AI 101: An Introduction to the Basics and Impact of Artificial IntelligenceAI 101: An Introduction to the Basics and Impact of Artificial Intelligence
AI 101: An Introduction to the Basics and Impact of Artificial Intelligence
IndexBug
 
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy SurveyTrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc
 
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdfHow to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
Chart Kalyan
 
Webinar: Designing a schema for a Data Warehouse
Webinar: Designing a schema for a Data WarehouseWebinar: Designing a schema for a Data Warehouse
Webinar: Designing a schema for a Data Warehouse
Federico Razzoli
 
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
Edge AI and Vision Alliance
 
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with SlackLet's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
shyamraj55
 
GenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizationsGenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizations
kumardaparthi1024
 
20240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 202420240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 2024
Matthew Sinclair
 
Mariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceXMariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceX
Mariano Tinti
 
Recommendation System using RAG Architecture
Recommendation System using RAG ArchitectureRecommendation System using RAG Architecture
Recommendation System using RAG Architecture
fredae14
 
Energy Efficient Video Encoding for Cloud and Edge Computing Instances
Energy Efficient Video Encoding for Cloud and Edge Computing InstancesEnergy Efficient Video Encoding for Cloud and Edge Computing Instances
Energy Efficient Video Encoding for Cloud and Edge Computing Instances
Alpen-Adria-Universität
 
June Patch Tuesday
June Patch TuesdayJune Patch Tuesday
June Patch Tuesday
Ivanti
 
Building Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and MilvusBuilding Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and Milvus
Zilliz
 
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Speck&Tech
 
Artificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopmentArtificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopment
Octavian Nadolu
 

Recently uploaded (20)

20240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 202420240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 2024
 
Best 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERPBest 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERP
 
Digital Marketing Trends in 2024 | Guide for Staying Ahead
Digital Marketing Trends in 2024 | Guide for Staying AheadDigital Marketing Trends in 2024 | Guide for Staying Ahead
Digital Marketing Trends in 2024 | Guide for Staying Ahead
 
Serial Arm Control in Real Time Presentation
Serial Arm Control in Real Time PresentationSerial Arm Control in Real Time Presentation
Serial Arm Control in Real Time Presentation
 
Taking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdfTaking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdf
 
AI 101: An Introduction to the Basics and Impact of Artificial Intelligence
AI 101: An Introduction to the Basics and Impact of Artificial IntelligenceAI 101: An Introduction to the Basics and Impact of Artificial Intelligence
AI 101: An Introduction to the Basics and Impact of Artificial Intelligence
 
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy SurveyTrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy Survey
 
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdfHow to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
 
Webinar: Designing a schema for a Data Warehouse
Webinar: Designing a schema for a Data WarehouseWebinar: Designing a schema for a Data Warehouse
Webinar: Designing a schema for a Data Warehouse
 
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
 
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with SlackLet's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
 
GenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizationsGenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizations
 
20240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 202420240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 2024
 
Mariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceXMariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceX
 
Recommendation System using RAG Architecture
Recommendation System using RAG ArchitectureRecommendation System using RAG Architecture
Recommendation System using RAG Architecture
 
Energy Efficient Video Encoding for Cloud and Edge Computing Instances
Energy Efficient Video Encoding for Cloud and Edge Computing InstancesEnergy Efficient Video Encoding for Cloud and Edge Computing Instances
Energy Efficient Video Encoding for Cloud and Edge Computing Instances
 
June Patch Tuesday
June Patch TuesdayJune Patch Tuesday
June Patch Tuesday
 
Building Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and MilvusBuilding Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and Milvus
 
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
 
Artificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopmentArtificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopment
 

Semantic Web Technologies: A Paradigm for Medical Informatics

  • 1. Semantic Web Technologies: A Paradigm for Medical Informatics Chimezie Ogbuji (Owner, Metacognition LLC.) http://metacognition.info/presentations/SWTMedicalInformatics.pdf http://metacognition.info/presentations/SWTMedicalInformatics.ppt
  • 2. Who I am  Circa 2001: Introduced to web standards and Semantic Web technologies  2003-2011: Lead architect of CCF in-house clinical repository project  2006-2011: Member representative of CCF in World-wide Web Consortium (W3C) ◦ Editor of various standards and Semantic Web Health Care and Life Sciences Interest Group chair  2011-2012: Senior Research Associate at CWRU Center for Clinical Investigations  2012-current: Started business providing resource and data management software for home healthcare agencies (Metacognition LLC)
  • 3. Medical Informatics Challenges  Semantic interoperability ◦ Exchange of data with common meaning between sender and receiver  Most of the intended benefits of HIT depend on interoperability between systems  Difficulties integrating patient record systems with other information resources are among the major issues hampering their effectiveness ◦ Interoperability is a major goal for meaningful use of Electronic Health Records (EHR) Rodrigues et al. 2013; Kadry et al. 2010; Shortliffe and Cimino, 2006
  • 4. Requirements and Solutions  Semantic interoperability requires: ◦ Structured data ◦ A common controlled vocabulary  Solutions emphasize the meaning of data rather than how they are structured ◦ “Semantic” paradigms
  • 5. Registries and Research DBs  Patient registries and clinical research repositories capture data elements in a uniform manner  The structure of the underlying data needs to be able to evolve along with the investigations they support  Thus, schema extensibility is important
  • 6. Querying Interfaces  Standardized interfaces for querying facilitate: ◦ Accessibility to clinical information systems ◦ Distributed querying of data from where they reside  Requires: ◦ Semantically-equivalent data structures  Alternatively, data are centralized in data warehouses Austin et al. 2007, “Implementation of a query interface for a generic record server”
  • 7. Biomedical Ontologies  Ontologies are artifacts that conceptualize a domain as a taxonomy of classes and constraints on relationships between their members  Represented in a particular formalism  Increasingly adopted as a foundation for the next generation of biomedical vocabularies  Construction involves representing a domain of interest independent of behavior of applications using an ontology  Important means towards achieving semantic interoperability
  • 8. Biomedical Ontology Communities  Prominent examples of adoption by life science and healthcare terminology communities: ◦ The Open Biological and Biomedical Ontologies (OBO) Foundry ◦ Gene Ontology (GO) ◦ National Center for Biomedical Ontology (NCBO) Bioportal ◦ International Health Terminology Standards Development Organization (IHTSDO)
  • 9. Semantic Web and Technologies  The Semantic Web is a vision of how the existing infrastructure of the World-wide Web (WWW) can be extended such that machines can interpret the meaning of data on it  Semantic Web technologies are the standards and technologies that have been developed to achieve the vision
  • 10. An Analogy  (Technological) singularity is a theoretical moment when artificial intelligence (AI) will have progressed to a greater-than-human intelligence  Despite remaining in the realm of science fiction, it has motivated many useful developments along the way ◦ The use of ontologies for knowledge representation and IBM Watson capabilities, for example
  • 11. Background: Graphs  Graphs are data structures comprising nodes and edges that connect them  The edges can be directional  Either the nodes, the edges, or both can be labeled  The labels provide meaning to the graphs (edge labels in particular) Node Nodeedge
  • 12. Resource Description Framework  The Resource Description Framework (RDF) is a graph-based knowledge representation language for describing resources  It’s edges are directional and both nodes and edges are labeled  It uses Universal Resource Identifiers (URI) for labeling  Foundation for Semantic Web technologies
  • 13. RDF: Continued  The edges are statements (triples) that go from a subject to an object  Some objects are text values  Some subjects and objects can be left unlabeled (Blank nodes) ◦ Anonymous resources: not important to label them uniquely  The URI of the edge is the predicate  Predicates used together for a common purpose are a vocabulary
  • 14.  Subject: Dr. X (a URI)  Object: Chime  Predicate: treats  Vocabulary: ◦ treats, subject of record, author, and full name Chime Dr. X treats subject of record author "Chimezie Ogbuji"full name
  • 15. RDF vocabularies  How meaning is interpreted from an RDF graph  There are vocabularies that constrain how predicates are used ◦ Want a sense of treats where the subject is a clinician and the object is a patient  There is a predicate relating resources to the classes they are a member of (type)  There are vocabularies that define constraints on class hierarchies  These comprise a basic RDF Schema (RDFS) language  Represented as an RDF graph
  • 16. Chime Dr. X treats subject of record Patient Physician type type Hypertension DX Clinical Diagnosis type is a authorPerson is a is a
  • 17. Ontologies for RDF  The Ontology Web Language (OWL) is used to describe ontologies for RDF graphs  More sophisticated constraints than RDFS  Commonly expressed as an RDF graph  Defines the meaning of RDF statements through constraints: ◦ On their predicates ◦ On the classes the resources they relate
  • 18. Chime Dr. X treats subject of record Patient Physician type type Hypertension DX Clinical Diagnosis type is a authorPerson is a is a Governed by OWL/ RDFS for domain
  • 19. OWL Formats  Most common format for describing ontologies  Distribution format of ontologies in the NCBO BioPortal  SNOMED CT distributions include an OWL representation ◦ RDF graphs can describe medical content in a SNOMED CT-compliant way through the use of this vocabulary
  • 20. Validation and Deduction  OWL is based on a formal, mathematical logic that can be used for validating the structure of an ontology and RDF data that conform to it (consistency checking)  Used to deduce additional RDF statements implied by the meaning of a given RDF graph (logical inference)  Logical reasoners are used for this
  • 21. Inference  Can infer anatomical location from SNOMED CT definitions Hypertension DX type finding site Systemic circulatory system structure type Hypertension DX <-> 1201005 / “Benign essential hypertension (disorder)
  • 22. Querying RDF Graphs  SPARQL is the official query language for RDF graphs  Comparable to relational query languages ◦ Primary difference: it queries RDF triples, whereas SQL queries tables of arbitrary dimensions  Includes various web protocols for querying RDF graphs  Foundation of SPARQL is the triple pattern  (?clinician, treats, ?patient) ◦ ?clinician and ?patient are variables (like a wildcard)
  • 23. ?patient ?physician ?dx treats subject of record author Hypertension DX type Which physicians have given essential hypertension diagnoses and to w (?physician, author, ?dx) (?physician, treats, ?patient) (?dx, subject of record, ?patient) (?dx, type, Hypertension DX) ?physician ?patient ?dx Dr. X Chime …
  • 24. SPARQL over Relational Data  Most common implementations convert SPARQL to SQL and evaluate over: ◦ a relational databases designed for RDF storage ◦ an existing relational database  There are products for both approaches  Former requires native storage of RDF ◦ Relational structure doesn’t change even as RDF vocabulary does (schemaElliot et al. 2009, “A Complete Translation from SPARQL into Efficient SQL”
  • 25. SPARQL over Existing Relation Data  “Virtual RDF view” ◦ Translation to SQL follows a given mapping from existing relational structures to an RDF vocabulary ◦ Allows non-disruptive evolution of existing systems ◦ Well-suited as a standard querying interface over clinical data repositories ◦ They can be queried as SPARQL, securely over encrypted HTTP
  • 26. Relational RDF (SNOMED CT perhaps) Mapping and Translation layer Secure HTTP SPARQL SQL Legacy / existing applications Patient registry or data repository 3rd party applications SQL
  • 27. Example: Cleveland Clinic (SemanticDB)  Content repository and data production system released in Jan. 2008  80 million (native) RDF statements ◦ Uses vocabulary from a patient record OWL ontology for the registry  Based on ◦ Existing registry of heart surgery and CV interventions ◦ 200,000 patient records ◦ Generating over 100 publications per year Pierce et al. 2012, “SemanticDB: A Semantic Web Infrastructure for Clinical Research and Quality Reporting
  • 28. Cohort Identification  Interface developed in conjunction with Cycorp  Leverage their logical reasoning system (Cyc) ◦ Identifies cohorts using natural language (NL) sentence fragments ◦ Converts fragments to SPARQL ◦ SPARQL is evaluated against RDF store
  • 29. Example: Mayo Clinic (MCLSS)  Mayo Clinic Life Sciences System (MCLSS) ◦ Effort to represent Mayo Clinic EHR data as RDF graphs ◦ Patient demographics, diagnoses, procedures, lab results, and free-text notes ◦ Goal was to wrap MCLSS relational database and expose as read-only, query- able RDF graphs that conform to standard ontologiesPathak et al. 2012, "Using Semantic Web Technologies for Cohort Identification from Electronic Health Records for Clinical Research"
  • 30. Example: Mayo Clinic (CEM)  Clinical Element Model (CEM) ◦ Represents logical structure of data in EHR ◦ Goal: translate CEM definitions into OWL and patient (instance) data into conformant RDF ◦ Use tools (logical reasoners) to check semantic consistency of the ontology, instance data, and to extract new knowledge via deduction ◦ Instance data validation:  correct number of linked components, value within data range, existence of units, etc. Tao et al. 2012, ”A semantic-web oriented representation of the clinical element model for secondary use of electronic health records data"
  • 31. Summary  Schema extensibility ◦ Use of RDF  Semantic Interoperability ◦ Domain modeling using OWL and RDFS  Standardized query interfaces ◦ Querying over SPARQL  Incremental, non-disruptive adoption ◦ Virtual RDF views  Main challenge: highly disruptive innovation