SlideShare a Scribd company logo
SEMANTIC WEB TECHNOLOGIES IN HEALTH CARE ANALYTICS
AN IMPACT SCENARIO FOR DATALOG REASONING WITH RDFOX
Robert Piro
Departmental Seminar
Robert Piro Semantic Web Technologies in Health Care 1/15
OVERVIEW
1 RDFOX
RDF
Datalog
2 PROJECT WITH KAISER PERMANENTE
HEDIS Measures for Diabetic Care
Data Model
Data Model as RDF Triples
The Datalog Rules
3 CONCLUSION & FUTURE WORK
Robert Piro Semantic Web Technologies in Health Care 2/15
RDFox
RDFOX — RESULT OF 4 YEARS OF DEVELOPMENT
RDFOX (BORIS MOTIK, YAVOR NENOV, ROBERT PIRO, IAN HORROCKS)
in memory RDF Triple Store — optimised indexing
parallel Datalog Reasoner — very good scalability
Robert Piro Semantic Web Technologies in Health Care 3/15
RDFox
RDFOX — RESULT OF 4 YEARS OF DEVELOPMENT
RDFOX (BORIS MOTIK, YAVOR NENOV, ROBERT PIRO, IAN HORROCKS)
in memory RDF Triple Store — optimised indexing
parallel Datalog Reasoner — very good scalability
FEATURES
load RDF data (Triples/Turtle)
materialise data — (extended) Datalog language
incremental reasoning / equality reasoning
query data — SPARQL query Language
Robert Piro Semantic Web Technologies in Health Care 3/15
RDFox
RDFOX — RESULT OF 4 YEARS OF DEVELOPMENT
RDFOX (BORIS MOTIK, YAVOR NENOV, ROBERT PIRO, IAN HORROCKS)
in memory RDF Triple Store — optimised indexing
parallel Datalog Reasoner — very good scalability
FEATURES
load RDF data (Triples/Turtle)
materialise data — (extended) Datalog language
incremental reasoning / equality reasoning
query data — SPARQL query Language
INTEGRATION
stand-alone C++ implementation / C++ library
Java/Python Bridge
SPARQL end-point
Robert Piro Semantic Web Technologies in Health Care 3/15
RDFox RDF
RDF — RESOURCE DESCRIPTION FRAMEWORK
RDF
data format with types W3C standard encode semantic data
Triple: subject predicate object (s, p, o)
building blocks: resources & literals
URI — <http://www.w3.org/2001/XMLSchema#double>
String, Boolean, Integer, Decimal — "0.789"ˆˆxsd:double
Robert Piro Semantic Web Technologies in Health Care 4/15
RDFox RDF
RDF — RESOURCE DESCRIPTION FRAMEWORK
RDF
data format with types W3C standard encode semantic data
Triple: subject predicate object (s, p, o)
building blocks: resources & literals
URI — <http://www.w3.org/2001/XMLSchema#double>
String, Boolean, Integer, Decimal — "0.789"ˆˆxsd:double
EXAMPLE (ENCODING A DATABASE TABLE IN RDF)
Table: PATIENT VISIT
REC | MBR | SERV DT | CPT | ... | DIAG1 | ... | DIAG22
001 | 007 | 20151101 | ...
@prefix ex: <http://my.example.com/FieldName/> .
@prefix visit: <http://my.example.com/Rec/PATIENT VISIT/> .
visit:001 ex:MBR "007" .
visit:001 ex:SERV DT "2015-11-01"ˆˆxsd:date .
Robert Piro Semantic Web Technologies in Health Care 4/15
RDFox Datalog
DATALOG
RDF DATALOG RULE
[s0, p0, o0] ← [s1, p1, o1], . . . , [sn, pn, on]. ‘IF...AND...THEN...’
Variables start with ‘?’. Var(head) ⊆ Var(body)
EXAMPLE (MATERIALISATION WITH RDFOX)
[?p, ex:has, ex:Diabetes] ← [?p, ex:MBRNo, ?mbr], [?rec, ex:MBR, ?mbr],
[?rec, ex:DIAG, "Diabetes"].
Robert Piro Semantic Web Technologies in Health Care 5/15
RDFox Datalog
DATALOG
RDF DATALOG RULE
[s0, p0, o0] ← [s1, p1, o1], . . . , [sn, pn, on]. ‘IF...AND...THEN...’
Variables start with ‘?’. Var(head) ⊆ Var(body)
EXAMPLE (MATERIALISATION WITH RDFOX)
[?p, ex:has, ex:Diabetes] ← [?p, ex:MBRNo, ?mbr], [?rec, ex:MBR, ?mbr],
[?rec, ex:DIAG, "Diabetes"].
Data
p:007 ex:MBRNo "007" . v:001 ex:DIAG "Diabetes" .
v:001 ex:MBR "007" . p:001 ex:MBR "001" .
Robert Piro Semantic Web Technologies in Health Care 5/15
RDFox Datalog
DATALOG
RDF DATALOG RULE
[s0, p0, o0] ← [s1, p1, o1], . . . , [sn, pn, on]. ‘IF...AND...THEN...’
Variables start with ‘?’. Var(head) ⊆ Var(body)
EXAMPLE (MATERIALISATION WITH RDFOX)
[?p, ex:has, ex:Diabetes] ← [?p, ex:MBRNo, ?mbr], [?rec, ex:MBR, ?mbr],
[?rec, ex:DIAG, "Diabetes"].
Data
p:007 ex:MBRNo "007" . v:001 ex:DIAG "Diabetes" .
v:001 ex:MBR "007" . p:001 ex:MBR "001" .
Robert Piro Semantic Web Technologies in Health Care 5/15
RDFox Datalog
DATALOG
RDF DATALOG RULE
[s0, p0, o0] ← [s1, p1, o1], . . . , [sn, pn, on]. ‘IF...AND...THEN...’
Variables start with ‘?’. Var(head) ⊆ Var(body)
EXAMPLE (MATERIALISATION WITH RDFOX)
[p:007, ex:has, ex:Diabetes] ← [p:007, ex:MBRNo, "007"], [?rec, ex:MBR, "007"],
[?rec, ex:DIAG, "Diabetes"].
Data
p:007 ex:MBRNo "007" . v:001 ex:DIAG "Diabetes" .
v:001 ex:MBR "007" . p:001 ex:MBR "001" .
Robert Piro Semantic Web Technologies in Health Care 5/15
RDFox Datalog
DATALOG
RDF DATALOG RULE
[s0, p0, o0] ← [s1, p1, o1], . . . , [sn, pn, on]. ‘IF...AND...THEN...’
Variables start with ‘?’. Var(head) ⊆ Var(body)
EXAMPLE (MATERIALISATION WITH RDFOX)
[p:007, ex:has, ex:Diabetes] ← [p:007, ex:MBRNo, "007"], [v:001, ex:MBR, "001"]
[v:001, ex:DIAG, "Diabetes"].
Data
p:007 ex:MBRNo "007" . v:001 ex:DIAG "Diabetes" .
v:001 ex:MBR "007" . p:001 ex:MBR "001" .
Robert Piro Semantic Web Technologies in Health Care 5/15
RDFox Datalog
DATALOG
RDF DATALOG RULE
[s0, p0, o0] ← [s1, p1, o1], . . . , [sn, pn, on]. ‘IF...AND...THEN...’
Variables start with ‘?’. Var(head) ⊆ Var(body)
EXAMPLE (MATERIALISATION WITH RDFOX)
[p:007, ex:has, ex:Diabetes] ← [p:007, ex:MBRNo, "007"], [v:001, ex:MBR, "007"],
[v:001, ex:DIAG, "Diabetes"].
Data
p:007 ex:MBRNo "007" . v:001 ex:DIAG "Diabetes" .
v:001 ex:MBR "007" . p:001 ex:MBR "001" .
Robert Piro Semantic Web Technologies in Health Care 5/15
RDFox Datalog
DATALOG
RDF DATALOG RULE
[s0, p0, o0] ← [s1, p1, o1], . . . , [sn, pn, on]. ‘IF...AND...THEN...’
Variables start with ‘?’. Var(head) ⊆ Var(body)
EXAMPLE (MATERIALISATION WITH RDFOX)
[p:007, ex:has, ex:Diabetes] ← [p:007, ex:MBRNo, "007"], [v:001, ex:MBR, "007"],
[v:001, ex:DIAG, "Diabetes"].
Data
p:007 ex:MBRNo "007" . v:001 ex:DIAG "Diabetes" .
v:001 ex:MBR "007" . p:001 ex:MBR "001" .
p:007 ex:has ex:Diabetes .
Robert Piro Semantic Web Technologies in Health Care 5/15
RDFox Datalog
DATALOG
RDF DATALOG RULE
[s0, p0, o0] ← [s1, p1, o1], . . . , [sn, pn, on]. ‘IF...AND...THEN...’
Variables start with ‘?’. Var(head) ⊆ Var(body)
EXAMPLE (MATERIALISATION WITH RDFOX)
[?p, ex:has, ex:Diabetes] ← [?p, ex:MBRNo, ?mbr], [?rec, ex:MBR, ?mbr],
[?rec, ex:DIAG, "Diabetes"].
Data
p:007 ex:MBR "007" . v:001 ex:DIAG "Diabetes" .
v:001 ex:MBR "007" . p:001 ex:MBR "001" .
p:007 ex:has ex:Diabetes .
Robert Piro Semantic Web Technologies in Health Care 5/15
RDFox Datalog
DATALOG
RDF DATALOG RULE
[s0, p0, o0] ← [s1, p1, o1], . . . , [sn, pn, on]. ‘IF...AND...THEN...’
Variables start with ‘?’. Var(head) ⊆ Var(body)
EXAMPLE (MATERIALISATION WITH RDFOX)
[?p, ex:has, ex:Diabetes] ← [?p, ex:MBRNo, ?mbr], [?rec, ex:MBR, ?mbr],
[?rec, ex:DIAG, "Diabetes"].
Data
p:007 ex:MBR "007" . v:001 ex:DIAG "Diabetes" .
v:001 ex:MBR "007" . p:001 ex:MBR "001" .
p:007 ex:has ex:Diabetes .
RDFOX COMPUTES all CONSEQUENCES . . .
also from newly derived data
in a systematic way
Robert Piro Semantic Web Technologies in Health Care 5/15
RDFox Datalog
DATALOG
RDF DATALOG RULE
[s0, p0, o0] ← [s1, p1, o1], . . . , [sn, pn, on]. ‘IF...AND...THEN...’
Variables start with ‘?’. Var(head) ⊆ Var(body)
EXAMPLE (MATERIALISATION WITH RDFOX)
[?p, ex:has, ex:Diabetes] ← [?p, ex:MBRNo, ?mbr], [?rec, ex:MBR, ?mbr],
[?rec, ex:DIAG, "Diabetes"].
Data
p:007 ex:MBR "007" . v:001 ex:DIAG "Diabetes" .
v:001 ex:MBR "007" . p:001 ex:MBR "001" .
p:007 ex:has ex:Diabetes .
RDFOX COMPUTES all CONSEQUENCES . . . AND TERMINATES
also from newly derived data
in a systematic way
Robert Piro Semantic Web Technologies in Health Care 5/15
RDFox Datalog
RDFOX AND DATALOG
STATS
Name Start (Trp) End (Trp) Mem Cores Time
DBpedia 112M 118M 6.1GB 8 28s
Claros 19M 96 M 4.2GB 16(32) 127s
LUBM-1K 134M 182M 9.3GB 16 8s
LUBM-9K 6G 9G ≈100GB 128(1024) 8s
Robert Piro Semantic Web Technologies in Health Care 6/15
RDFox Datalog
RDFOX AND DATALOG
STATS
Name Start (Trp) End (Trp) Mem Cores Time
DBpedia 112M 118M 6.1GB 8 28s
Claros 19M 96 M 4.2GB 16(32) 127s
LUBM-1K 134M 182M 9.3GB 16 8s
LUBM-9K 6G 9G ≈100GB 128(1024) 8s
FEATURES OF RDFOX DATALOG
Allows many more constructs (arithmetic*, string ops*, comparisons)
Will allow negation, aggregation (can be simulated already)
Generalises OWL 2 RL; Reasoning with OWL 2 EL reduceable to Datalog
Robert Piro Semantic Web Technologies in Health Care 6/15
RDFox Datalog
RDFOX AND DATALOG
STATS
Name Start (Trp) End (Trp) Mem Cores Time
DBpedia 112M 118M 6.1GB 8 28s
Claros 19M 96 M 4.2GB 16(32) 127s
LUBM-1K 134M 182M 9.3GB 16 8s
LUBM-9K 6G 9G ≈100GB 128(1024) 8s
FEATURES OF RDFOX DATALOG
Allows many more constructs (arithmetic*, string ops*, comparisons)
Will allow negation, aggregation (can be simulated already)
Generalises OWL 2 RL; Reasoning with OWL 2 EL reduceable to Datalog
GENERAL FEATURES OF DATALOG
Intuitive if-then-statements
Declarative (say what, not how to compute)
Powerful due to recursion
Robert Piro Semantic Web Technologies in Health Care 6/15
Project with Kaiser Permanente
KAISER PERMANENTE
THE ORGANISATION
Kaiser HealthPlan, Kaiser Hospitals, Permanente Medical Group
KP largest ‘managed care’ organisation in the U.S.
KP HealthConnect; largest private electronic health record system
STATS
9.6M members
38 medical centres
620 medical offices
177k emloyees
17k physicians
50k nurses
Turn over 56.4G USD
Net income 3.1G USD
Robert Piro Semantic Web Technologies in Health Care 7/15
Project with Kaiser Permanente HEDIS Measures for Diabetic Care
HEALTHCARE EFFECTIVENESS DATA AND INFORMATION SET
HEDIS
Performance measure specification issued NCQA1
(USA)
Percentages of a precisely defined eligible population:
#Eligible with eye exam
#Eligible(is Diabetic,≤65yo, etc)
Entry requirements for government funded healthcare (Medicare)
1
National Committee for Quality assurance
Robert Piro Semantic Web Technologies in Health Care 8/15
Project with Kaiser Permanente HEDIS Measures for Diabetic Care
HEALTHCARE EFFECTIVENESS DATA AND INFORMATION SET
HEDIS
Performance measure specification issued NCQA1
(USA)
Percentages of a precisely defined eligible population:
#Eligible with eye exam
#Eligible(is Diabetic,≤65yo, etc)
Entry requirements for government funded healthcare (Medicare)
HEDIS MEASURE COMPUTATION: TODAY
Disparate data sources (historically grown)
Ad-hoc schemas used to store data (meaning implicit)
Involved programs for analytics software
mix data (re)formatting and measuring
difficult to maintain
require high expertise of IT-experts
1
National Committee for Quality assurance
Robert Piro Semantic Web Technologies in Health Care 8/15
Project with Kaiser Permanente HEDIS Measures for Diabetic Care
HEDIS MEASURE COMPUTATION IN OUR PROJECT
NEW APPROACH (PETER HENDLER, ROBERT PIRO)
Separate data aggregation and reformatting from computing measures!
Data model inspired by HL7 RIM: ‘Entities in Roles Participating in Acts’
Data translated as RDF-triples into the data model first (Java/Scala)
RDFox Datalog rules compute measures according to this model
Results are read out through simple queries
Robert Piro Semantic Web Technologies in Health Care 9/15
Project with Kaiser Permanente HEDIS Measures for Diabetic Care
HEDIS MEASURE COMPUTATION IN OUR PROJECT
NEW APPROACH (PETER HENDLER, ROBERT PIRO)
Separate data aggregation and reformatting from computing measures!
Data model inspired by HL7 RIM: ‘Entities in Roles Participating in Acts’
Data translated as RDF-triples into the data model first (Java/Scala)
RDFox Datalog rules compute measures according to this model
Results are read out through simple queries
BENEFITS
Reusability: uniform data model reusable for other tasks
Efficiency: rules are close to natural language & concise
Maintainability: rules are declarative and easy to understand
Robert Piro Semantic Web Technologies in Health Care 9/15
Project with Kaiser Permanente Data Model
DATA MODEL
INSPIRED BY HL7 REFERENCE INFORMATION MODEL (RIM)
Entity Role Participation Act
hasRole hasPart hasAct
ISO standard: ISO/HL7 21731:2014
Process centric (Administrative KR)
Developed for/in the medical community; BUT ‘NHS experience’
Robert Piro Semantic Web Technologies in Health Care 10/15
Project with Kaiser Permanente Data Model
DATA MODEL
INSPIRED BY HL7 REFERENCE INFORMATION MODEL (RIM)
Entity Role Participation Act
hasRole hasPart hasAct
ISO standard: ISO/HL7 21731:2014
Process centric (Administrative KR)
Developed for/in the medical community; BUT ‘NHS experience’
EXAMPLE
Getting a coffee
Person Customer Purchaser
‘Buying a
product’
Person Barista Preparer
Subst Coffee Product
Person Customer Consumer
hasRole hasPart hasAct
hasRole hasPart
hasAct
hasRole hasPart
hasAct
hasRole hasPart
hasAct
Robert Piro Semantic Web Technologies in Health Care 10/15
Project with Kaiser Permanente Data Model
DATA MODEL
INSPIRED BY HL7 REFERENCE INFORMATION MODEL (RIM)
Entity Role Participation Act
hasRole hasPart hasAct
ISO standard: ISO/HL7 21731:2014
Process centric (Administrative KR)
Developed for/in the medical community; BUT ‘NHS experience’
EXAMPLE
Contract for Work
Person Customer Offering Party
‘Buying a
product’
Person Representative Accepting Party
Subst Coffee Work Result
Person Customer Beneficiary
hasRole hasPart hasAct
hasRole hasPart
hasAct
hasRole hasPart
hasAct
hasRole hasPart
hasAct
Robert Piro Semantic Web Technologies in Health Care 10/15
Project with Kaiser Permanente Data Model
DATA MODEL
INSPIRED BY HL7 REFERENCE INFORMATION MODEL (RIM)
Entity Role Participation Act
hasRole hasPart hasAct
ISO standard: ISO/HL7 21731:2014
Process centric (Administrative KR)
Developed for/in the medical community; BUT ‘NHS experience’
EXAMPLE
Prescription
Person Physician Prescriber Prescription
Person Pharmacist Dispenser
Subst Drug Medication
Person Patient Recipient
hasRole hasPart hasAct
hasRole hasPart
hasAct
hasRole hasPart
hasAct
hasRole hasPart
hasAct
Robert Piro Semantic Web Technologies in Health Care 10/15
Project with Kaiser Permanente Data Model as RDF Triples
DATA MODEL AS RDF TRIPLES
DATA MODEL USED FOR HEDIS
Entity(EN00)
Name: ”John Smith”
Gender: kp:male
DoB: ”1973-10-22”ˆˆxsd:date
type: cat:person
Role(RL00)
type : cat:Patient
Act(ACT00)
Date : “2013-03-22”ˆˆxsd:date
type: cat:Diagnosis
Participation(PT00)
type : cat:Subject
kp:hasRole
kp:hasPart
kp:hasContext
Robert Piro Semantic Web Technologies in Health Care 11/15
Project with Kaiser Permanente Data Model as RDF Triples
DATA MODEL AS RDF TRIPLES
DATA MODEL USED FOR HEDIS
Entity(EN00)
Name: ”John Smith”
Gender: kp:male
DoB: ”1973-10-22”ˆˆxsd:date
type: cat:person
Role(RL00)
type : cat:Patient
Act(ACT00)
Date : “2013-03-22”ˆˆxsd:date
type: cat:Diagnosis
Participation(PT00)
type : cat:Subject
kp:hasRole
kp:hasPart
kp:hasContext
ENCODING IN RDF-TRIPLES
EN00 kp:DoB ”1973-10-22”ˆˆxsd:date PT00 kp:hasContext ACT00 .
EN00 kp:hasRole RL00 . ACT00 rdf:type cat:Diagnosis .
RL00 rdf:type kp:Patient .
RL00 kp:hasPart PT00 .
Robert Piro Semantic Web Technologies in Health Care 11/15
Project with Kaiser Permanente Data Model as RDF Triples
DATA TRANSLATION
DATA PROVIDED
Real Data from a KP regional branch2
Data: ASCII-files, one record per line, pipe-separated fields
MBR | SERV DT | CPT | ... | DIAG1 | ... | DIAG22 | PROVNBR
2
The data never left Kaiser
Robert Piro Semantic Web Technologies in Health Care 12/15
Project with Kaiser Permanente Data Model as RDF Triples
DATA TRANSLATION
DATA PROVIDED
Real Data from a KP regional branch2
Data: ASCII-files, one record per line, pipe-separated fields
MBR | SERV DT | CPT | ... | DIAG1 | ... | DIAG22 | PROVNBR
DATA STATS
About Records Size About Records Size
Providers 113k 6.8M Labs 28.3M 1.4GB
Members 466k 84MB Prescriptions 8.9M 892MB
Enrollments 3.3M 332MB Visits 54M 8.6GB
2
The data never left Kaiser
Robert Piro Semantic Web Technologies in Health Care 12/15
Project with Kaiser Permanente Data Model as RDF Triples
DATA TRANSLATION
DATA PROVIDED
Real Data from a KP regional branch2
Data: ASCII-files, one record per line, pipe-separated fields
MBR | SERV DT | CPT | ... | DIAG1 | ... | DIAG22 | PROVNBR
DATA STATS
About Records Size About Records Size
Providers 113k 6.8M Labs 28.3M 1.4GB
Members 466k 84MB Prescriptions 8.9M 892MB
Enrollments 3.3M 332MB Visits 54M 8.6GB
TRANSLATION & IMPORT
Translation time: 45min @ 8threads
902M triples (4.6GB gzipped), 547M unique
RDFox import time 390s @ 8threads
2
The data never left Kaiser
Robert Piro Semantic Web Technologies in Health Care 12/15
Project with Kaiser Permanente The Datalog Rules
DATALOG RULES
RULES HEDIS DIABETES CARE DENOMINATORS AND NUMERATORS
174 rules in 607 lines of code distributed in 21 files
authored on a 200 patient test set using an interactive autoring tool
Robert Piro Semantic Web Technologies in Health Care 13/15
Project with Kaiser Permanente The Datalog Rules
DATALOG RULES
RULES HEDIS DIABETES CARE DENOMINATORS AND NUMERATORS
174 rules in 607 lines of code distributed in 21 files
authored on a 200 patient test set using an interactive autoring tool
MATERIALISATION
8 Intel Xeon E5-2680@2.7GHz with 64GB RAM
Data import + materialisation: 1h40m
Maximal number of triples before subgraph extraction: 731M (43GB)
Subgraph 71.7M triples (4GB), maximal number of triples: 92.2M (4.8GB)
Robert Piro Semantic Web Technologies in Health Care 13/15
Project with Kaiser Permanente The Datalog Rules
DATALOG RULES
RULES HEDIS DIABETES CARE DENOMINATORS AND NUMERATORS
174 rules in 607 lines of code distributed in 21 files
authored on a 200 patient test set using an interactive autoring tool
MATERIALISATION
8 Intel Xeon E5-2680@2.7GHz with 64GB RAM
Data import + materialisation: 1h40m
Maximal number of triples before subgraph extraction: 731M (43GB)
Subgraph 71.7M triples (4GB), maximal number of triples: 92.2M (4.8GB)
SUMMARY
Data is translated into RDF triples
RDFox computes with a Datalog Program and the RDF triples the
materialisation
Results are obtained by querying the triple store (SPARQL)
Robert Piro Semantic Web Technologies in Health Care 13/15
Project with Kaiser Permanente The Datalog Rules
RULE EXAMPLE
EXAMPLE
Patients must be enrolled and can have multiple enrollements in a year.
Enrollments are given as [begin-date,end-date] pair per patient.
“Compute all patients with contintuous enrollments within the
measurement year” i.e. the enrollments must form a connected chain
[x0, x1] . . . [xi , xi+1][xi+1, xi+2] . . . [xn−1, xn]
such that “2013-01-01” and “2013-12-31” are enclosed by some interval
[?Patient, aux : continiousEnrollment, ?PredEnr] ←
[?Patient, aux : continiousEnrollment, ?Enr],
[?Enr, kp : hasBeginConnectDateTime, ?begin],
[?Patient, aux : roleHasEnrollment, ?PredEnr],
[?PredEnr, kp : hasEndDateTime, ?begin] .
Robert Piro Semantic Web Technologies in Health Care 14/15
Conclusion & Future Work
CONCLUSION & FUTURE WORK
CONCLUSION
Created a use-case / Impact Scenario: real requirements, real data
Rooting of reasearch; usefulness of RDFox, new avenues, benchmarks
FUTURE WORK
Rule authoring tool / anoymisation of the data
Research
stratification of the reasoning
negation + aggregates
Big data reasoning + browsing
www.rdfox.org
Robert Piro Semantic Web Technologies in Health Care 15/15

More Related Content

What's hot

Mon norton tut_queryinglinkeddata02
Mon norton tut_queryinglinkeddata02Mon norton tut_queryinglinkeddata02
Mon norton tut_queryinglinkeddata02eswcsummerschool
 
Works 2015-provenance-mileage
Works 2015-provenance-mileageWorks 2015-provenance-mileage
Works 2015-provenance-mileage
Bertram Ludäscher
 
An Introduction to RDF and the Web of Data
An Introduction to RDF and the Web of DataAn Introduction to RDF and the Web of Data
An Introduction to RDF and the Web of Data
Olaf Hartig
 
RDB2RDF, an overview of R2RML and Direct Mapping
RDB2RDF, an overview of R2RML and Direct MappingRDB2RDF, an overview of R2RML and Direct Mapping
RDB2RDF, an overview of R2RML and Direct MappingBoris Villazón-Terrazas
 
Apache Drill Workshop
Apache Drill WorkshopApache Drill Workshop
Apache Drill Workshop
Charles Givre
 
Extreme Scripting July 2009
Extreme Scripting July 2009Extreme Scripting July 2009
Extreme Scripting July 2009
Ian Foster
 
Ease and speed of implementation
Ease and speed of implementationEase and speed of implementation
Ease and speed of implementation
Creative Coeur
 
BioBankCloud: Machine Learning on Genomics + GA4GH @ Med at Scale
BioBankCloud: Machine Learning on Genomics + GA4GH  @ Med at ScaleBioBankCloud: Machine Learning on Genomics + GA4GH  @ Med at Scale
BioBankCloud: Machine Learning on Genomics + GA4GH @ Med at Scale
Andy Petrella
 
Real time data processing with spark & cassandra @ NoSQLMatters 2015 Paris
Real time data processing with spark & cassandra @ NoSQLMatters 2015 ParisReal time data processing with spark & cassandra @ NoSQLMatters 2015 Paris
Real time data processing with spark & cassandra @ NoSQLMatters 2015 Paris
Duyhai Doan
 
Introduction to Pig & Pig Latin | Big Data Hadoop Spark Tutorial | CloudxLab
Introduction to Pig & Pig Latin | Big Data Hadoop Spark Tutorial | CloudxLabIntroduction to Pig & Pig Latin | Big Data Hadoop Spark Tutorial | CloudxLab
Introduction to Pig & Pig Latin | Big Data Hadoop Spark Tutorial | CloudxLab
CloudxLab
 
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 1 (...
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 1 (...Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 1 (...
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 1 (...
Olaf Hartig
 
Stop! Don't make these mistakes in your document database!
Stop! Don't make these mistakes in your document database!Stop! Don't make these mistakes in your document database!
Stop! Don't make these mistakes in your document database!
Lauren Hayward Schaefer
 
RDF, SPARQL and Semantic Repositories
RDF, SPARQL and Semantic RepositoriesRDF, SPARQL and Semantic Repositories
RDF, SPARQL and Semantic Repositories
Marin Dimitrov
 
RDF data model
RDF data modelRDF data model
RDF data model
Jose Emilio Labra Gayo
 
Distributed Query Processing for Federated RDF Data Management
Distributed Query Processing for Federated RDF Data ManagementDistributed Query Processing for Federated RDF Data Management
Distributed Query Processing for Federated RDF Data Management
OlafGoerlitz
 
I Mapreduced a Neo store: Creating large Neo4j Databases with Hadoop
I Mapreduced a Neo store: Creating large Neo4j Databases with HadoopI Mapreduced a Neo store: Creating large Neo4j Databases with Hadoop
I Mapreduced a Neo store: Creating large Neo4j Databases with Hadoop
GoDataDriven
 
Presentation at the EMBL-EBI Industry RDF meeting
Presentation at the EMBL-EBI  Industry RDF meetingPresentation at the EMBL-EBI  Industry RDF meeting
Presentation at the EMBL-EBI Industry RDF meeting
Johannes Keizer
 

What's hot (19)

Mon norton tut_queryinglinkeddata02
Mon norton tut_queryinglinkeddata02Mon norton tut_queryinglinkeddata02
Mon norton tut_queryinglinkeddata02
 
SWT Lecture Session 2 - RDF
SWT Lecture Session 2 - RDFSWT Lecture Session 2 - RDF
SWT Lecture Session 2 - RDF
 
Works 2015-provenance-mileage
Works 2015-provenance-mileageWorks 2015-provenance-mileage
Works 2015-provenance-mileage
 
An Introduction to RDF and the Web of Data
An Introduction to RDF and the Web of DataAn Introduction to RDF and the Web of Data
An Introduction to RDF and the Web of Data
 
SWT Lecture Session 8 - Rules
SWT Lecture Session 8 - RulesSWT Lecture Session 8 - Rules
SWT Lecture Session 8 - Rules
 
RDB2RDF, an overview of R2RML and Direct Mapping
RDB2RDF, an overview of R2RML and Direct MappingRDB2RDF, an overview of R2RML and Direct Mapping
RDB2RDF, an overview of R2RML and Direct Mapping
 
Apache Drill Workshop
Apache Drill WorkshopApache Drill Workshop
Apache Drill Workshop
 
Extreme Scripting July 2009
Extreme Scripting July 2009Extreme Scripting July 2009
Extreme Scripting July 2009
 
Ease and speed of implementation
Ease and speed of implementationEase and speed of implementation
Ease and speed of implementation
 
BioBankCloud: Machine Learning on Genomics + GA4GH @ Med at Scale
BioBankCloud: Machine Learning on Genomics + GA4GH  @ Med at ScaleBioBankCloud: Machine Learning on Genomics + GA4GH  @ Med at Scale
BioBankCloud: Machine Learning on Genomics + GA4GH @ Med at Scale
 
Real time data processing with spark & cassandra @ NoSQLMatters 2015 Paris
Real time data processing with spark & cassandra @ NoSQLMatters 2015 ParisReal time data processing with spark & cassandra @ NoSQLMatters 2015 Paris
Real time data processing with spark & cassandra @ NoSQLMatters 2015 Paris
 
Introduction to Pig & Pig Latin | Big Data Hadoop Spark Tutorial | CloudxLab
Introduction to Pig & Pig Latin | Big Data Hadoop Spark Tutorial | CloudxLabIntroduction to Pig & Pig Latin | Big Data Hadoop Spark Tutorial | CloudxLab
Introduction to Pig & Pig Latin | Big Data Hadoop Spark Tutorial | CloudxLab
 
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 1 (...
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 1 (...Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 1 (...
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 1 (...
 
Stop! Don't make these mistakes in your document database!
Stop! Don't make these mistakes in your document database!Stop! Don't make these mistakes in your document database!
Stop! Don't make these mistakes in your document database!
 
RDF, SPARQL and Semantic Repositories
RDF, SPARQL and Semantic RepositoriesRDF, SPARQL and Semantic Repositories
RDF, SPARQL and Semantic Repositories
 
RDF data model
RDF data modelRDF data model
RDF data model
 
Distributed Query Processing for Federated RDF Data Management
Distributed Query Processing for Federated RDF Data ManagementDistributed Query Processing for Federated RDF Data Management
Distributed Query Processing for Federated RDF Data Management
 
I Mapreduced a Neo store: Creating large Neo4j Databases with Hadoop
I Mapreduced a Neo store: Creating large Neo4j Databases with HadoopI Mapreduced a Neo store: Creating large Neo4j Databases with Hadoop
I Mapreduced a Neo store: Creating large Neo4j Databases with Hadoop
 
Presentation at the EMBL-EBI Industry RDF meeting
Presentation at the EMBL-EBI  Industry RDF meetingPresentation at the EMBL-EBI  Industry RDF meeting
Presentation at the EMBL-EBI Industry RDF meeting
 

Viewers also liked

Semantic web Technology
Semantic web TechnologySemantic web Technology
Semantic web Technology
Abdelrahman Ibrahim
 
Making the semantic web work
Making the semantic web workMaking the semantic web work
Making the semantic web work
Paul Houle
 
Roby bastian
Roby bastianRoby bastian
Roby bastian
Robbie Persie
 
Resume Current with references
Resume Current with referencesResume Current with references
Resume Current with referencesKaylan Maloney
 
Информация и информационные процессы
Информация и информационные процессыИнформация и информационные процессы
Информация и информационные процессы
Сергей Балан
 
Nme double
Nme doubleNme double
Nme double
asmediae15
 
School Journal October 2015 Level 3
School Journal October 2015 Level 3School Journal October 2015 Level 3
School Journal October 2015 Level 3
christinam86
 
IUCEGY - Dr.Mohamed Taher
IUCEGY - Dr.Mohamed TaherIUCEGY - Dr.Mohamed Taher
IUCEGY - Dr.Mohamed Taher
Mohamed Jamal El-Din
 
презентація школи
презентація школипрезентація школи
презентація школи
subbotina_yuliya
 
Land forms cayden
Land forms caydenLand forms cayden
Land forms cayden
peggy Williams
 
Lagos LaidBac co business report first half 2015 v5
Lagos LaidBac co business report first half 2015 v5Lagos LaidBac co business report first half 2015 v5
Lagos LaidBac co business report first half 2015 v5Chris Nwachukwu
 
PuttingPlacesFirst
PuttingPlacesFirstPuttingPlacesFirst
PuttingPlacesFirstCharles Tso
 
2 крок
2 крок2 крок
2 крок
Tet321
 
Компьютерные словари и системы машинного перевода текстов
Компьютерные словари и системы машинного перевода текстов Компьютерные словари и системы машинного перевода текстов
Компьютерные словари и системы машинного перевода текстов
Сергей Балан
 
How to use flickr
How to use flickrHow to use flickr
How to use flickr
Chantel2015
 
Notebook 2 parcial
Notebook 2 parcialNotebook 2 parcial
Notebook 2 parcial
Marurix Jozz
 

Viewers also liked (18)

Semantic web Technology
Semantic web TechnologySemantic web Technology
Semantic web Technology
 
Making the semantic web work
Making the semantic web workMaking the semantic web work
Making the semantic web work
 
Roby bastian
Roby bastianRoby bastian
Roby bastian
 
Resume Current with references
Resume Current with referencesResume Current with references
Resume Current with references
 
HPTtransparency
HPTtransparencyHPTtransparency
HPTtransparency
 
Информация и информационные процессы
Информация и информационные процессыИнформация и информационные процессы
Информация и информационные процессы
 
Nme double
Nme doubleNme double
Nme double
 
School Journal October 2015 Level 3
School Journal October 2015 Level 3School Journal October 2015 Level 3
School Journal October 2015 Level 3
 
fulltext(1)
fulltext(1)fulltext(1)
fulltext(1)
 
IUCEGY - Dr.Mohamed Taher
IUCEGY - Dr.Mohamed TaherIUCEGY - Dr.Mohamed Taher
IUCEGY - Dr.Mohamed Taher
 
презентація школи
презентація школипрезентація школи
презентація школи
 
Land forms cayden
Land forms caydenLand forms cayden
Land forms cayden
 
Lagos LaidBac co business report first half 2015 v5
Lagos LaidBac co business report first half 2015 v5Lagos LaidBac co business report first half 2015 v5
Lagos LaidBac co business report first half 2015 v5
 
PuttingPlacesFirst
PuttingPlacesFirstPuttingPlacesFirst
PuttingPlacesFirst
 
2 крок
2 крок2 крок
2 крок
 
Компьютерные словари и системы машинного перевода текстов
Компьютерные словари и системы машинного перевода текстов Компьютерные словари и системы машинного перевода текстов
Компьютерные словари и системы машинного перевода текстов
 
How to use flickr
How to use flickrHow to use flickr
How to use flickr
 
Notebook 2 parcial
Notebook 2 parcialNotebook 2 parcial
Notebook 2 parcial
 

Similar to Semantic Web Technologies in Health Care Analytics

Visualize open data with Plone - eea.daviz PLOG 2013
Visualize open data with Plone - eea.daviz PLOG 2013Visualize open data with Plone - eea.daviz PLOG 2013
Visualize open data with Plone - eea.daviz PLOG 2013
Antonio De Marinis
 
Behind the Scenes of KnetMiner: Towards Standardised and Interoperable Knowle...
Behind the Scenes of KnetMiner: Towards Standardised and Interoperable Knowle...Behind the Scenes of KnetMiner: Towards Standardised and Interoperable Knowle...
Behind the Scenes of KnetMiner: Towards Standardised and Interoperable Knowle...
Rothamsted Research, UK
 
Querying the Web of Data
Querying the Web of DataQuerying the Web of Data
Querying the Web of Data
Rinke Hoekstra
 
Bio it 2005_rdf_workshop05
Bio it 2005_rdf_workshop05Bio it 2005_rdf_workshop05
Bio it 2005_rdf_workshop05
Joanne Luciano
 
Rdf conjunctive query selectivity estimation
Rdf conjunctive query selectivity estimationRdf conjunctive query selectivity estimation
Rdf conjunctive query selectivity estimation
INRIA-OAK
 
Context-Aware Access Control for RDF Graph Stores
Context-Aware Access Control for RDF Graph StoresContext-Aware Access Control for RDF Graph Stores
Context-Aware Access Control for RDF Graph Stores
Serena Villata
 
Visualizations using Visualbox
Visualizations using VisualboxVisualizations using Visualbox
Visualizations using Visualbox
Alvaro Graves
 
Bringing the Semantic Web closer to reality: PostgreSQL as RDF Graph Database
Bringing the Semantic Web closer to reality: PostgreSQL as RDF Graph DatabaseBringing the Semantic Web closer to reality: PostgreSQL as RDF Graph Database
Bringing the Semantic Web closer to reality: PostgreSQL as RDF Graph Database
Jimmy Angelakos
 
Linking the world with Python and Semantics
Linking the world with Python and SemanticsLinking the world with Python and Semantics
Linking the world with Python and Semantics
Tatiana Al-Chueyr
 
State of the Semantic Web
State of the Semantic WebState of the Semantic Web
State of the Semantic Web
Ivan Herman
 
Next Top Data Model by Ian Plosker
Next Top Data Model by Ian PloskerNext Top Data Model by Ian Plosker
Next Top Data Model by Ian Plosker
SyncConf
 
The Semantic Web #10 - SPARQL
The Semantic Web #10 - SPARQLThe Semantic Web #10 - SPARQL
The Semantic Web #10 - SPARQL
Myungjin Lee
 
Biohackathon2013: Tripling Bioinformatics Productivity
Biohackathon2013: Tripling Bioinformatics ProductivityBiohackathon2013: Tripling Bioinformatics Productivity
Biohackathon2013: Tripling Bioinformatics Productivity
Jerven Bolleman
 
Enabling Exploratory Analysis of Large Data with Apache Spark and R
Enabling Exploratory Analysis of Large Data with Apache Spark and REnabling Exploratory Analysis of Large Data with Apache Spark and R
Enabling Exploratory Analysis of Large Data with Apache Spark and R
Databricks
 
20100614 ISWSA Keynote
20100614 ISWSA Keynote20100614 ISWSA Keynote
20100614 ISWSA Keynote
Axel Polleres
 
Concurrent Stream Processing
Concurrent Stream ProcessingConcurrent Stream Processing
Concurrent Stream Processing
Alex Miller
 
Semantic Web(Web 3.0) SPARQL
Semantic Web(Web 3.0) SPARQLSemantic Web(Web 3.0) SPARQL
Semantic Web(Web 3.0) SPARQL
Daniel D.J. UM
 
Using PostgreSQL with Bibliographic Data
Using PostgreSQL with Bibliographic DataUsing PostgreSQL with Bibliographic Data
Using PostgreSQL with Bibliographic Data
Jimmy Angelakos
 
A Little SPARQL in your Analytics
A Little SPARQL in your AnalyticsA Little SPARQL in your Analytics
A Little SPARQL in your Analytics
Dr. Neil Brittliff
 

Similar to Semantic Web Technologies in Health Care Analytics (20)

Visualize open data with Plone - eea.daviz PLOG 2013
Visualize open data with Plone - eea.daviz PLOG 2013Visualize open data with Plone - eea.daviz PLOG 2013
Visualize open data with Plone - eea.daviz PLOG 2013
 
Behind the Scenes of KnetMiner: Towards Standardised and Interoperable Knowle...
Behind the Scenes of KnetMiner: Towards Standardised and Interoperable Knowle...Behind the Scenes of KnetMiner: Towards Standardised and Interoperable Knowle...
Behind the Scenes of KnetMiner: Towards Standardised and Interoperable Knowle...
 
Querying the Web of Data
Querying the Web of DataQuerying the Web of Data
Querying the Web of Data
 
Bio it 2005_rdf_workshop05
Bio it 2005_rdf_workshop05Bio it 2005_rdf_workshop05
Bio it 2005_rdf_workshop05
 
Rdf conjunctive query selectivity estimation
Rdf conjunctive query selectivity estimationRdf conjunctive query selectivity estimation
Rdf conjunctive query selectivity estimation
 
Context-Aware Access Control for RDF Graph Stores
Context-Aware Access Control for RDF Graph StoresContext-Aware Access Control for RDF Graph Stores
Context-Aware Access Control for RDF Graph Stores
 
Visualizations using Visualbox
Visualizations using VisualboxVisualizations using Visualbox
Visualizations using Visualbox
 
Bringing the Semantic Web closer to reality: PostgreSQL as RDF Graph Database
Bringing the Semantic Web closer to reality: PostgreSQL as RDF Graph DatabaseBringing the Semantic Web closer to reality: PostgreSQL as RDF Graph Database
Bringing the Semantic Web closer to reality: PostgreSQL as RDF Graph Database
 
Linking the world with Python and Semantics
Linking the world with Python and SemanticsLinking the world with Python and Semantics
Linking the world with Python and Semantics
 
State of the Semantic Web
State of the Semantic WebState of the Semantic Web
State of the Semantic Web
 
Next Top Data Model by Ian Plosker
Next Top Data Model by Ian PloskerNext Top Data Model by Ian Plosker
Next Top Data Model by Ian Plosker
 
The Semantic Web #10 - SPARQL
The Semantic Web #10 - SPARQLThe Semantic Web #10 - SPARQL
The Semantic Web #10 - SPARQL
 
Biohackathon2013: Tripling Bioinformatics Productivity
Biohackathon2013: Tripling Bioinformatics ProductivityBiohackathon2013: Tripling Bioinformatics Productivity
Biohackathon2013: Tripling Bioinformatics Productivity
 
Enabling Exploratory Analysis of Large Data with Apache Spark and R
Enabling Exploratory Analysis of Large Data with Apache Spark and REnabling Exploratory Analysis of Large Data with Apache Spark and R
Enabling Exploratory Analysis of Large Data with Apache Spark and R
 
20100614 ISWSA Keynote
20100614 ISWSA Keynote20100614 ISWSA Keynote
20100614 ISWSA Keynote
 
Concurrent Stream Processing
Concurrent Stream ProcessingConcurrent Stream Processing
Concurrent Stream Processing
 
Semantic Web(Web 3.0) SPARQL
Semantic Web(Web 3.0) SPARQLSemantic Web(Web 3.0) SPARQL
Semantic Web(Web 3.0) SPARQL
 
SWT Lecture Session 10 R2RML Part 1
SWT Lecture Session 10 R2RML Part 1SWT Lecture Session 10 R2RML Part 1
SWT Lecture Session 10 R2RML Part 1
 
Using PostgreSQL with Bibliographic Data
Using PostgreSQL with Bibliographic DataUsing PostgreSQL with Bibliographic Data
Using PostgreSQL with Bibliographic Data
 
A Little SPARQL in your Analytics
A Little SPARQL in your AnalyticsA Little SPARQL in your Analytics
A Little SPARQL in your Analytics
 

Recently uploaded

一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
v3tuleee
 
一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单
enxupq
 
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
pchutichetpong
 
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
John Andrews
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP
 
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
g4dpvqap0
 
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
AbhimanyuSinha9
 
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
oz8q3jxlp
 
Empowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptxEmpowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptx
benishzehra469
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
jerlynmaetalle
 
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
vcaxypu
 
Influence of Marketing Strategy and Market Competition on Business Plan
Influence of Marketing Strategy and Market Competition on Business PlanInfluence of Marketing Strategy and Market Competition on Business Plan
Influence of Marketing Strategy and Market Competition on Business Plan
jerlynmaetalle
 
FP Growth Algorithm and its Applications
FP Growth Algorithm and its ApplicationsFP Growth Algorithm and its Applications
FP Growth Algorithm and its Applications
MaleehaSheikh2
 
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
mbawufebxi
 
Machine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptxMachine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptx
balafet
 
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Subhajit Sahu
 
standardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghhstandardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghh
ArpitMalhotra16
 
My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.
rwarrenll
 
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptxData_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
AnirbanRoy608946
 
Opendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptx
Opendatabay
 

Recently uploaded (20)

一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
 
一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单
 
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
 
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
 
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
 
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
 
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
 
Empowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptxEmpowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptx
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
 
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
 
Influence of Marketing Strategy and Market Competition on Business Plan
Influence of Marketing Strategy and Market Competition on Business PlanInfluence of Marketing Strategy and Market Competition on Business Plan
Influence of Marketing Strategy and Market Competition on Business Plan
 
FP Growth Algorithm and its Applications
FP Growth Algorithm and its ApplicationsFP Growth Algorithm and its Applications
FP Growth Algorithm and its Applications
 
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
 
Machine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptxMachine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptx
 
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
 
standardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghhstandardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghh
 
My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.
 
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptxData_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
 
Opendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptx
 

Semantic Web Technologies in Health Care Analytics

  • 1. SEMANTIC WEB TECHNOLOGIES IN HEALTH CARE ANALYTICS AN IMPACT SCENARIO FOR DATALOG REASONING WITH RDFOX Robert Piro Departmental Seminar Robert Piro Semantic Web Technologies in Health Care 1/15
  • 2. OVERVIEW 1 RDFOX RDF Datalog 2 PROJECT WITH KAISER PERMANENTE HEDIS Measures for Diabetic Care Data Model Data Model as RDF Triples The Datalog Rules 3 CONCLUSION & FUTURE WORK Robert Piro Semantic Web Technologies in Health Care 2/15
  • 3. RDFox RDFOX — RESULT OF 4 YEARS OF DEVELOPMENT RDFOX (BORIS MOTIK, YAVOR NENOV, ROBERT PIRO, IAN HORROCKS) in memory RDF Triple Store — optimised indexing parallel Datalog Reasoner — very good scalability Robert Piro Semantic Web Technologies in Health Care 3/15
  • 4. RDFox RDFOX — RESULT OF 4 YEARS OF DEVELOPMENT RDFOX (BORIS MOTIK, YAVOR NENOV, ROBERT PIRO, IAN HORROCKS) in memory RDF Triple Store — optimised indexing parallel Datalog Reasoner — very good scalability FEATURES load RDF data (Triples/Turtle) materialise data — (extended) Datalog language incremental reasoning / equality reasoning query data — SPARQL query Language Robert Piro Semantic Web Technologies in Health Care 3/15
  • 5. RDFox RDFOX — RESULT OF 4 YEARS OF DEVELOPMENT RDFOX (BORIS MOTIK, YAVOR NENOV, ROBERT PIRO, IAN HORROCKS) in memory RDF Triple Store — optimised indexing parallel Datalog Reasoner — very good scalability FEATURES load RDF data (Triples/Turtle) materialise data — (extended) Datalog language incremental reasoning / equality reasoning query data — SPARQL query Language INTEGRATION stand-alone C++ implementation / C++ library Java/Python Bridge SPARQL end-point Robert Piro Semantic Web Technologies in Health Care 3/15
  • 6. RDFox RDF RDF — RESOURCE DESCRIPTION FRAMEWORK RDF data format with types W3C standard encode semantic data Triple: subject predicate object (s, p, o) building blocks: resources & literals URI — <http://www.w3.org/2001/XMLSchema#double> String, Boolean, Integer, Decimal — "0.789"ˆˆxsd:double Robert Piro Semantic Web Technologies in Health Care 4/15
  • 7. RDFox RDF RDF — RESOURCE DESCRIPTION FRAMEWORK RDF data format with types W3C standard encode semantic data Triple: subject predicate object (s, p, o) building blocks: resources & literals URI — <http://www.w3.org/2001/XMLSchema#double> String, Boolean, Integer, Decimal — "0.789"ˆˆxsd:double EXAMPLE (ENCODING A DATABASE TABLE IN RDF) Table: PATIENT VISIT REC | MBR | SERV DT | CPT | ... | DIAG1 | ... | DIAG22 001 | 007 | 20151101 | ... @prefix ex: <http://my.example.com/FieldName/> . @prefix visit: <http://my.example.com/Rec/PATIENT VISIT/> . visit:001 ex:MBR "007" . visit:001 ex:SERV DT "2015-11-01"ˆˆxsd:date . Robert Piro Semantic Web Technologies in Health Care 4/15
  • 8. RDFox Datalog DATALOG RDF DATALOG RULE [s0, p0, o0] ← [s1, p1, o1], . . . , [sn, pn, on]. ‘IF...AND...THEN...’ Variables start with ‘?’. Var(head) ⊆ Var(body) EXAMPLE (MATERIALISATION WITH RDFOX) [?p, ex:has, ex:Diabetes] ← [?p, ex:MBRNo, ?mbr], [?rec, ex:MBR, ?mbr], [?rec, ex:DIAG, "Diabetes"]. Robert Piro Semantic Web Technologies in Health Care 5/15
  • 9. RDFox Datalog DATALOG RDF DATALOG RULE [s0, p0, o0] ← [s1, p1, o1], . . . , [sn, pn, on]. ‘IF...AND...THEN...’ Variables start with ‘?’. Var(head) ⊆ Var(body) EXAMPLE (MATERIALISATION WITH RDFOX) [?p, ex:has, ex:Diabetes] ← [?p, ex:MBRNo, ?mbr], [?rec, ex:MBR, ?mbr], [?rec, ex:DIAG, "Diabetes"]. Data p:007 ex:MBRNo "007" . v:001 ex:DIAG "Diabetes" . v:001 ex:MBR "007" . p:001 ex:MBR "001" . Robert Piro Semantic Web Technologies in Health Care 5/15
  • 10. RDFox Datalog DATALOG RDF DATALOG RULE [s0, p0, o0] ← [s1, p1, o1], . . . , [sn, pn, on]. ‘IF...AND...THEN...’ Variables start with ‘?’. Var(head) ⊆ Var(body) EXAMPLE (MATERIALISATION WITH RDFOX) [?p, ex:has, ex:Diabetes] ← [?p, ex:MBRNo, ?mbr], [?rec, ex:MBR, ?mbr], [?rec, ex:DIAG, "Diabetes"]. Data p:007 ex:MBRNo "007" . v:001 ex:DIAG "Diabetes" . v:001 ex:MBR "007" . p:001 ex:MBR "001" . Robert Piro Semantic Web Technologies in Health Care 5/15
  • 11. RDFox Datalog DATALOG RDF DATALOG RULE [s0, p0, o0] ← [s1, p1, o1], . . . , [sn, pn, on]. ‘IF...AND...THEN...’ Variables start with ‘?’. Var(head) ⊆ Var(body) EXAMPLE (MATERIALISATION WITH RDFOX) [p:007, ex:has, ex:Diabetes] ← [p:007, ex:MBRNo, "007"], [?rec, ex:MBR, "007"], [?rec, ex:DIAG, "Diabetes"]. Data p:007 ex:MBRNo "007" . v:001 ex:DIAG "Diabetes" . v:001 ex:MBR "007" . p:001 ex:MBR "001" . Robert Piro Semantic Web Technologies in Health Care 5/15
  • 12. RDFox Datalog DATALOG RDF DATALOG RULE [s0, p0, o0] ← [s1, p1, o1], . . . , [sn, pn, on]. ‘IF...AND...THEN...’ Variables start with ‘?’. Var(head) ⊆ Var(body) EXAMPLE (MATERIALISATION WITH RDFOX) [p:007, ex:has, ex:Diabetes] ← [p:007, ex:MBRNo, "007"], [v:001, ex:MBR, "001"] [v:001, ex:DIAG, "Diabetes"]. Data p:007 ex:MBRNo "007" . v:001 ex:DIAG "Diabetes" . v:001 ex:MBR "007" . p:001 ex:MBR "001" . Robert Piro Semantic Web Technologies in Health Care 5/15
  • 13. RDFox Datalog DATALOG RDF DATALOG RULE [s0, p0, o0] ← [s1, p1, o1], . . . , [sn, pn, on]. ‘IF...AND...THEN...’ Variables start with ‘?’. Var(head) ⊆ Var(body) EXAMPLE (MATERIALISATION WITH RDFOX) [p:007, ex:has, ex:Diabetes] ← [p:007, ex:MBRNo, "007"], [v:001, ex:MBR, "007"], [v:001, ex:DIAG, "Diabetes"]. Data p:007 ex:MBRNo "007" . v:001 ex:DIAG "Diabetes" . v:001 ex:MBR "007" . p:001 ex:MBR "001" . Robert Piro Semantic Web Technologies in Health Care 5/15
  • 14. RDFox Datalog DATALOG RDF DATALOG RULE [s0, p0, o0] ← [s1, p1, o1], . . . , [sn, pn, on]. ‘IF...AND...THEN...’ Variables start with ‘?’. Var(head) ⊆ Var(body) EXAMPLE (MATERIALISATION WITH RDFOX) [p:007, ex:has, ex:Diabetes] ← [p:007, ex:MBRNo, "007"], [v:001, ex:MBR, "007"], [v:001, ex:DIAG, "Diabetes"]. Data p:007 ex:MBRNo "007" . v:001 ex:DIAG "Diabetes" . v:001 ex:MBR "007" . p:001 ex:MBR "001" . p:007 ex:has ex:Diabetes . Robert Piro Semantic Web Technologies in Health Care 5/15
  • 15. RDFox Datalog DATALOG RDF DATALOG RULE [s0, p0, o0] ← [s1, p1, o1], . . . , [sn, pn, on]. ‘IF...AND...THEN...’ Variables start with ‘?’. Var(head) ⊆ Var(body) EXAMPLE (MATERIALISATION WITH RDFOX) [?p, ex:has, ex:Diabetes] ← [?p, ex:MBRNo, ?mbr], [?rec, ex:MBR, ?mbr], [?rec, ex:DIAG, "Diabetes"]. Data p:007 ex:MBR "007" . v:001 ex:DIAG "Diabetes" . v:001 ex:MBR "007" . p:001 ex:MBR "001" . p:007 ex:has ex:Diabetes . Robert Piro Semantic Web Technologies in Health Care 5/15
  • 16. RDFox Datalog DATALOG RDF DATALOG RULE [s0, p0, o0] ← [s1, p1, o1], . . . , [sn, pn, on]. ‘IF...AND...THEN...’ Variables start with ‘?’. Var(head) ⊆ Var(body) EXAMPLE (MATERIALISATION WITH RDFOX) [?p, ex:has, ex:Diabetes] ← [?p, ex:MBRNo, ?mbr], [?rec, ex:MBR, ?mbr], [?rec, ex:DIAG, "Diabetes"]. Data p:007 ex:MBR "007" . v:001 ex:DIAG "Diabetes" . v:001 ex:MBR "007" . p:001 ex:MBR "001" . p:007 ex:has ex:Diabetes . RDFOX COMPUTES all CONSEQUENCES . . . also from newly derived data in a systematic way Robert Piro Semantic Web Technologies in Health Care 5/15
  • 17. RDFox Datalog DATALOG RDF DATALOG RULE [s0, p0, o0] ← [s1, p1, o1], . . . , [sn, pn, on]. ‘IF...AND...THEN...’ Variables start with ‘?’. Var(head) ⊆ Var(body) EXAMPLE (MATERIALISATION WITH RDFOX) [?p, ex:has, ex:Diabetes] ← [?p, ex:MBRNo, ?mbr], [?rec, ex:MBR, ?mbr], [?rec, ex:DIAG, "Diabetes"]. Data p:007 ex:MBR "007" . v:001 ex:DIAG "Diabetes" . v:001 ex:MBR "007" . p:001 ex:MBR "001" . p:007 ex:has ex:Diabetes . RDFOX COMPUTES all CONSEQUENCES . . . AND TERMINATES also from newly derived data in a systematic way Robert Piro Semantic Web Technologies in Health Care 5/15
  • 18. RDFox Datalog RDFOX AND DATALOG STATS Name Start (Trp) End (Trp) Mem Cores Time DBpedia 112M 118M 6.1GB 8 28s Claros 19M 96 M 4.2GB 16(32) 127s LUBM-1K 134M 182M 9.3GB 16 8s LUBM-9K 6G 9G ≈100GB 128(1024) 8s Robert Piro Semantic Web Technologies in Health Care 6/15
  • 19. RDFox Datalog RDFOX AND DATALOG STATS Name Start (Trp) End (Trp) Mem Cores Time DBpedia 112M 118M 6.1GB 8 28s Claros 19M 96 M 4.2GB 16(32) 127s LUBM-1K 134M 182M 9.3GB 16 8s LUBM-9K 6G 9G ≈100GB 128(1024) 8s FEATURES OF RDFOX DATALOG Allows many more constructs (arithmetic*, string ops*, comparisons) Will allow negation, aggregation (can be simulated already) Generalises OWL 2 RL; Reasoning with OWL 2 EL reduceable to Datalog Robert Piro Semantic Web Technologies in Health Care 6/15
  • 20. RDFox Datalog RDFOX AND DATALOG STATS Name Start (Trp) End (Trp) Mem Cores Time DBpedia 112M 118M 6.1GB 8 28s Claros 19M 96 M 4.2GB 16(32) 127s LUBM-1K 134M 182M 9.3GB 16 8s LUBM-9K 6G 9G ≈100GB 128(1024) 8s FEATURES OF RDFOX DATALOG Allows many more constructs (arithmetic*, string ops*, comparisons) Will allow negation, aggregation (can be simulated already) Generalises OWL 2 RL; Reasoning with OWL 2 EL reduceable to Datalog GENERAL FEATURES OF DATALOG Intuitive if-then-statements Declarative (say what, not how to compute) Powerful due to recursion Robert Piro Semantic Web Technologies in Health Care 6/15
  • 21. Project with Kaiser Permanente KAISER PERMANENTE THE ORGANISATION Kaiser HealthPlan, Kaiser Hospitals, Permanente Medical Group KP largest ‘managed care’ organisation in the U.S. KP HealthConnect; largest private electronic health record system STATS 9.6M members 38 medical centres 620 medical offices 177k emloyees 17k physicians 50k nurses Turn over 56.4G USD Net income 3.1G USD Robert Piro Semantic Web Technologies in Health Care 7/15
  • 22. Project with Kaiser Permanente HEDIS Measures for Diabetic Care HEALTHCARE EFFECTIVENESS DATA AND INFORMATION SET HEDIS Performance measure specification issued NCQA1 (USA) Percentages of a precisely defined eligible population: #Eligible with eye exam #Eligible(is Diabetic,≤65yo, etc) Entry requirements for government funded healthcare (Medicare) 1 National Committee for Quality assurance Robert Piro Semantic Web Technologies in Health Care 8/15
  • 23. Project with Kaiser Permanente HEDIS Measures for Diabetic Care HEALTHCARE EFFECTIVENESS DATA AND INFORMATION SET HEDIS Performance measure specification issued NCQA1 (USA) Percentages of a precisely defined eligible population: #Eligible with eye exam #Eligible(is Diabetic,≤65yo, etc) Entry requirements for government funded healthcare (Medicare) HEDIS MEASURE COMPUTATION: TODAY Disparate data sources (historically grown) Ad-hoc schemas used to store data (meaning implicit) Involved programs for analytics software mix data (re)formatting and measuring difficult to maintain require high expertise of IT-experts 1 National Committee for Quality assurance Robert Piro Semantic Web Technologies in Health Care 8/15
  • 24. Project with Kaiser Permanente HEDIS Measures for Diabetic Care HEDIS MEASURE COMPUTATION IN OUR PROJECT NEW APPROACH (PETER HENDLER, ROBERT PIRO) Separate data aggregation and reformatting from computing measures! Data model inspired by HL7 RIM: ‘Entities in Roles Participating in Acts’ Data translated as RDF-triples into the data model first (Java/Scala) RDFox Datalog rules compute measures according to this model Results are read out through simple queries Robert Piro Semantic Web Technologies in Health Care 9/15
  • 25. Project with Kaiser Permanente HEDIS Measures for Diabetic Care HEDIS MEASURE COMPUTATION IN OUR PROJECT NEW APPROACH (PETER HENDLER, ROBERT PIRO) Separate data aggregation and reformatting from computing measures! Data model inspired by HL7 RIM: ‘Entities in Roles Participating in Acts’ Data translated as RDF-triples into the data model first (Java/Scala) RDFox Datalog rules compute measures according to this model Results are read out through simple queries BENEFITS Reusability: uniform data model reusable for other tasks Efficiency: rules are close to natural language & concise Maintainability: rules are declarative and easy to understand Robert Piro Semantic Web Technologies in Health Care 9/15
  • 26. Project with Kaiser Permanente Data Model DATA MODEL INSPIRED BY HL7 REFERENCE INFORMATION MODEL (RIM) Entity Role Participation Act hasRole hasPart hasAct ISO standard: ISO/HL7 21731:2014 Process centric (Administrative KR) Developed for/in the medical community; BUT ‘NHS experience’ Robert Piro Semantic Web Technologies in Health Care 10/15
  • 27. Project with Kaiser Permanente Data Model DATA MODEL INSPIRED BY HL7 REFERENCE INFORMATION MODEL (RIM) Entity Role Participation Act hasRole hasPart hasAct ISO standard: ISO/HL7 21731:2014 Process centric (Administrative KR) Developed for/in the medical community; BUT ‘NHS experience’ EXAMPLE Getting a coffee Person Customer Purchaser ‘Buying a product’ Person Barista Preparer Subst Coffee Product Person Customer Consumer hasRole hasPart hasAct hasRole hasPart hasAct hasRole hasPart hasAct hasRole hasPart hasAct Robert Piro Semantic Web Technologies in Health Care 10/15
  • 28. Project with Kaiser Permanente Data Model DATA MODEL INSPIRED BY HL7 REFERENCE INFORMATION MODEL (RIM) Entity Role Participation Act hasRole hasPart hasAct ISO standard: ISO/HL7 21731:2014 Process centric (Administrative KR) Developed for/in the medical community; BUT ‘NHS experience’ EXAMPLE Contract for Work Person Customer Offering Party ‘Buying a product’ Person Representative Accepting Party Subst Coffee Work Result Person Customer Beneficiary hasRole hasPart hasAct hasRole hasPart hasAct hasRole hasPart hasAct hasRole hasPart hasAct Robert Piro Semantic Web Technologies in Health Care 10/15
  • 29. Project with Kaiser Permanente Data Model DATA MODEL INSPIRED BY HL7 REFERENCE INFORMATION MODEL (RIM) Entity Role Participation Act hasRole hasPart hasAct ISO standard: ISO/HL7 21731:2014 Process centric (Administrative KR) Developed for/in the medical community; BUT ‘NHS experience’ EXAMPLE Prescription Person Physician Prescriber Prescription Person Pharmacist Dispenser Subst Drug Medication Person Patient Recipient hasRole hasPart hasAct hasRole hasPart hasAct hasRole hasPart hasAct hasRole hasPart hasAct Robert Piro Semantic Web Technologies in Health Care 10/15
  • 30. Project with Kaiser Permanente Data Model as RDF Triples DATA MODEL AS RDF TRIPLES DATA MODEL USED FOR HEDIS Entity(EN00) Name: ”John Smith” Gender: kp:male DoB: ”1973-10-22”ˆˆxsd:date type: cat:person Role(RL00) type : cat:Patient Act(ACT00) Date : “2013-03-22”ˆˆxsd:date type: cat:Diagnosis Participation(PT00) type : cat:Subject kp:hasRole kp:hasPart kp:hasContext Robert Piro Semantic Web Technologies in Health Care 11/15
  • 31. Project with Kaiser Permanente Data Model as RDF Triples DATA MODEL AS RDF TRIPLES DATA MODEL USED FOR HEDIS Entity(EN00) Name: ”John Smith” Gender: kp:male DoB: ”1973-10-22”ˆˆxsd:date type: cat:person Role(RL00) type : cat:Patient Act(ACT00) Date : “2013-03-22”ˆˆxsd:date type: cat:Diagnosis Participation(PT00) type : cat:Subject kp:hasRole kp:hasPart kp:hasContext ENCODING IN RDF-TRIPLES EN00 kp:DoB ”1973-10-22”ˆˆxsd:date PT00 kp:hasContext ACT00 . EN00 kp:hasRole RL00 . ACT00 rdf:type cat:Diagnosis . RL00 rdf:type kp:Patient . RL00 kp:hasPart PT00 . Robert Piro Semantic Web Technologies in Health Care 11/15
  • 32. Project with Kaiser Permanente Data Model as RDF Triples DATA TRANSLATION DATA PROVIDED Real Data from a KP regional branch2 Data: ASCII-files, one record per line, pipe-separated fields MBR | SERV DT | CPT | ... | DIAG1 | ... | DIAG22 | PROVNBR 2 The data never left Kaiser Robert Piro Semantic Web Technologies in Health Care 12/15
  • 33. Project with Kaiser Permanente Data Model as RDF Triples DATA TRANSLATION DATA PROVIDED Real Data from a KP regional branch2 Data: ASCII-files, one record per line, pipe-separated fields MBR | SERV DT | CPT | ... | DIAG1 | ... | DIAG22 | PROVNBR DATA STATS About Records Size About Records Size Providers 113k 6.8M Labs 28.3M 1.4GB Members 466k 84MB Prescriptions 8.9M 892MB Enrollments 3.3M 332MB Visits 54M 8.6GB 2 The data never left Kaiser Robert Piro Semantic Web Technologies in Health Care 12/15
  • 34. Project with Kaiser Permanente Data Model as RDF Triples DATA TRANSLATION DATA PROVIDED Real Data from a KP regional branch2 Data: ASCII-files, one record per line, pipe-separated fields MBR | SERV DT | CPT | ... | DIAG1 | ... | DIAG22 | PROVNBR DATA STATS About Records Size About Records Size Providers 113k 6.8M Labs 28.3M 1.4GB Members 466k 84MB Prescriptions 8.9M 892MB Enrollments 3.3M 332MB Visits 54M 8.6GB TRANSLATION & IMPORT Translation time: 45min @ 8threads 902M triples (4.6GB gzipped), 547M unique RDFox import time 390s @ 8threads 2 The data never left Kaiser Robert Piro Semantic Web Technologies in Health Care 12/15
  • 35. Project with Kaiser Permanente The Datalog Rules DATALOG RULES RULES HEDIS DIABETES CARE DENOMINATORS AND NUMERATORS 174 rules in 607 lines of code distributed in 21 files authored on a 200 patient test set using an interactive autoring tool Robert Piro Semantic Web Technologies in Health Care 13/15
  • 36. Project with Kaiser Permanente The Datalog Rules DATALOG RULES RULES HEDIS DIABETES CARE DENOMINATORS AND NUMERATORS 174 rules in 607 lines of code distributed in 21 files authored on a 200 patient test set using an interactive autoring tool MATERIALISATION 8 Intel Xeon E5-2680@2.7GHz with 64GB RAM Data import + materialisation: 1h40m Maximal number of triples before subgraph extraction: 731M (43GB) Subgraph 71.7M triples (4GB), maximal number of triples: 92.2M (4.8GB) Robert Piro Semantic Web Technologies in Health Care 13/15
  • 37. Project with Kaiser Permanente The Datalog Rules DATALOG RULES RULES HEDIS DIABETES CARE DENOMINATORS AND NUMERATORS 174 rules in 607 lines of code distributed in 21 files authored on a 200 patient test set using an interactive autoring tool MATERIALISATION 8 Intel Xeon E5-2680@2.7GHz with 64GB RAM Data import + materialisation: 1h40m Maximal number of triples before subgraph extraction: 731M (43GB) Subgraph 71.7M triples (4GB), maximal number of triples: 92.2M (4.8GB) SUMMARY Data is translated into RDF triples RDFox computes with a Datalog Program and the RDF triples the materialisation Results are obtained by querying the triple store (SPARQL) Robert Piro Semantic Web Technologies in Health Care 13/15
  • 38. Project with Kaiser Permanente The Datalog Rules RULE EXAMPLE EXAMPLE Patients must be enrolled and can have multiple enrollements in a year. Enrollments are given as [begin-date,end-date] pair per patient. “Compute all patients with contintuous enrollments within the measurement year” i.e. the enrollments must form a connected chain [x0, x1] . . . [xi , xi+1][xi+1, xi+2] . . . [xn−1, xn] such that “2013-01-01” and “2013-12-31” are enclosed by some interval [?Patient, aux : continiousEnrollment, ?PredEnr] ← [?Patient, aux : continiousEnrollment, ?Enr], [?Enr, kp : hasBeginConnectDateTime, ?begin], [?Patient, aux : roleHasEnrollment, ?PredEnr], [?PredEnr, kp : hasEndDateTime, ?begin] . Robert Piro Semantic Web Technologies in Health Care 14/15
  • 39. Conclusion & Future Work CONCLUSION & FUTURE WORK CONCLUSION Created a use-case / Impact Scenario: real requirements, real data Rooting of reasearch; usefulness of RDFox, new avenues, benchmarks FUTURE WORK Rule authoring tool / anoymisation of the data Research stratification of the reasoning negation + aggregates Big data reasoning + browsing www.rdfox.org Robert Piro Semantic Web Technologies in Health Care 15/15