SlideShare a Scribd company logo
1 of 19
Download to read offline
Awakening Clinical Data: Semantics for
Scalable Medical Research Informatics
                    Satya S. Sahoo
                 Division Medical Informatics
  Electrical Engineering and Computer Science Department
           Case Western Reserve University
                   Cleveland, OH, USA
Big Picture of Data in Clinical Research
143, 961 Patients per year
(e.g. Emory)                                                                          MRI: 50-100MB
                                                                                      PET: 60-100MB




                                         National Sleep Research Resource: 500 TB                        MRI, PET scans
     Patient Reports
                                                                                                      source: PRISM project, BME dept CWRU
source: PRISM project CWRU
                                                     Case Western EMU: 250 TB
  Epilepsy Monitoring Unit (EMU) Data
                                    500-600MB per patient
                                    per stay in EMU



                                                                                                         Wireless Health Data
                                                                                                         source: CWRU School of Engineering

                                                                                                         ~5.6 billion wireless
                              1-20GB each                                                                connections and growing
       Polysomnograms
                                                    Pathology Reports, Tissue Bank
       source: Physio-MIMI, PRISM CWRU                    source: NLM and Wikipedia
Big Picture of Data in Clinical Research
143, 961 Patients per year
(e.g. Emory)                                          MRI: 50-100MB
                      •  Ultra large volume of data and growing rapidly
                                                      PET: 60-100MB
                      •  Data is Multi-modal, Heterogeneous
                      •  Heterogeneity: Syntactic, Structural, Semantic

                                         National Sleep Research Resource: 500 TB        MRI, PET scans
     Patient Reports
                                                                                      source: PRISM project, BME dept CWRU
source: PRISM project CWRU
                                                     Case Western EMU: 250 TB
  Epilepsy Monitoring Unit (EMU) Data
                                    500-600MB per patient
                                    per stay in EMU



                                                                                         Wireless Health Data
                                                                                         source: CWRU School of Engineering

                                                                                         ~5.6 billion wireless
                              1-20GB each                                                connections and growing
       Polysomnograms
                                                    Pathology Reports, Tissue Bank
       source: Physio-MIMI, PRISM CWRU                    source: NLM and Wikipedia
Scalability in Medical Informatics: Beyond Volume
                                         Exemplar: Sleep Medicine Research




                                                                                   MRI, PET scans
     Patient Reports
                                                                                source: PRISM project, BME dept CWRU
source: PRISM project CWRU


  Epilepsy Monitoring Unit (EMU) Data




                                                                                   Wireless Health Data
                                                                                   source: CWRU School of Engineering




       Polysomnograms
                                              Pathology Reports, Tissue Bank
       source: Physio-MIMI, PRISM CWRU              source: NLM and Wikipedia
Scalability in Medical Informatics: Beyond Volume
                                         Exemplar: Sleep Medicine Research




            •  Multi-Center Studies with differing
                administrative requirements – business logicscans
   Patient Reports
                                                      MRI, PET
                                                                                source: PRISM project, BME dept CWRU
source: PRISM project CWRU

            •  Dynamic data – grows over project duration
  Epilepsy Monitoring Unit (EMU) Data
            •  Data Semantics as foundation to support a
                wide spectrum of users – clinicians, nurse
                practitioners, research fellows
                                                                                   Wireless Health Data
                                                                                   source: CWRU School of Engineering




       Polysomnograms
                                              Pathology Reports, Tissue Bank
       source: Physio-MIMI, PRISM CWRU              source: NLM and Wikipedia
A Wish List for Scalable Clinical Data Management
•  Reconcile Data Heterogeneity – most critical to successful
   translational research
   o  Syntactic heterogeneity – less of a problem, data dictionaries
      help
   o  Structural heterogeneity – problematic, XML somewhat helpful
   o  Semantic heterogeneity – a huge problem, ontologies to the
      rescue?
•  Provenance – essential for data quality, compliance, insight
   o  Blood Oxygen Baseline: oxygen saturation during the first 15 or
      30 seconds of sleep
   o  Patient blood report last month cause of change in medication
      – Domain Provenance (not just tuple provenance)
•  Intuitive access to information – clinical trials eligibility,
   cohort identification
•  Scalable - Data sources, research partners added or removed
   dynamically
A “not to do” list for Clinical Data Management




                                         Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch




•  No Linked Open Patient Data – HIPAA, HITECH
   Act (US), Data Protection Act (UK)
  o  De-identified data – IRB approval
•  Ontology as global schema – but no RDF
  o  Vast majority as RDB
  o  Practical issues with RDF – cannot be institution-
     specific URI (privacy)
Physio-MIMI: Multi‐Modality, Multi‐Resource Environment for Physiological
                              and Clinical Research

              Clinical
             Researcher




            SNOMED-CT                                 FMA
                                Sleep Domain
                          …       Ontology            OGMS



                                                                     Any
                                                                   number of
                                                                     new
                                                                    centers
Physio-MIMI: Enabling Scalable Medical Research
•  NCRR‐funded, multi‐CTSA site project: Sleep medicine as
   exemplar
•  Federated data management – scalable, adapts to changing
   data access policies
•  Ontology-driven:
   o  Data mappings – Ontology class to data dictionary terms
      (manually curated)
   o  Drive query interface
   o  Manage provenance
•  Privacy aware, IRB-compliant
•  Collaboration among Case Western, U. of Michigan,
   Marshfield Clinic and U. of Wisconsin, Madison
   o  Now Harvard Medical School
Key Resource: Sleep Domain Ontology (SDO)
           https://mimi.case.edu/concepts
Data Mappings: SDO to Data Dictionary
                       Physio-Map Module
                       •  Visual interface
                       •  Stores mappings in XML –
                       moving towards rules
                       •  Dynamically executed in response
                       to user query




       User Voting
Provenance: Contextual Metadata for Clinical
                Research




             Slide courtesy: Remo Mueller
Provenance: To Trace Variations in Data and
                 Results




             Slide courtesy: Remo Mueller
Modified from slide courtesy: Remo
Mueller
Provenance: Source information for Patient Data




                                    Slide courtesy: Remo Mueller
Intuitive Query Interface: Ontology (SDO)-driven
       Visual Aggregator and Explorer (VisAgE)


 DataSets




Ontology Concept – Type of Query Widget
PhysioMIMI in National Sleep Research Resource
•  National Sleep Research Resource (NSSR) – scored and
   awaiting funding review
•  Collaboration between Harvard Medical School (domain
   experts) and Case Western (CS) with 15 projects
    o  50,000 sleep research studies – total size of 500TB
•  Semantic Data Integration – SDO and Sleep Provenance
   Ontology (extending W3C PROV Ontology PROV-O)
•  Signal processing tools – using a common format called
   European Data Format (EDF), XML-based
•  Domain analysis, cross-linking – secure Web access
Challenges: Semantics in Large Scale Clinical Data
•  Incentives for adopting RDF in clinical data management
   – what is already not possible in RDB?
•  OWL2, RDFS reasoning – Privacy aware reasoning,
   semantics-aware access control (Nguyen et al. 2012)
•  Missing Semantics?
    o  Variable, missing provenance in original study - re-
       create provenance with (limited) provenance?
    o  Fine-level granularity for semantic annotation of
       signal data – currently not scalable
•  A little semantics does not go too far in clinical data
    o  Need for greater involvement of Semantic Web
       community in development of EHR systems
Acknowledgements
•  Guo-Qiang Zhang, Remo Mueller, Samden Lhatoo, Susan Redline, Alireza Bozorgi
•  Division of Medical Informatics: Lingyun Luo, Joe Teagno, Meng Zhao, Jake Luo,
   Licong Cui, Chien-Hung Chen, Catherine Jayapandian
•  Physio-MIMI Team: http://physiomimi.case.edu/
•  Contact Information: satya.sahoo@case.edu,
   http://cci.case.edu/cci/index.php/Satya_Sahoo

More Related Content

Similar to Awakening Clinical Data: Semantics for Scalable Medical Research Informatics

Sequencing Genomics: The New Big Data Driver
Sequencing Genomics:The New Big Data DriverSequencing Genomics:The New Big Data Driver
Sequencing Genomics: The New Big Data DriverLarry Smarr
 
Gesundheit geht uns alle an: Smart Data ermöglicht passendere Entscheidungen...
Gesundheit geht uns alle an: Smart Data ermöglicht passendere Entscheidungen...Gesundheit geht uns alle an: Smart Data ermöglicht passendere Entscheidungen...
Gesundheit geht uns alle an: Smart Data ermöglicht passendere Entscheidungen...Matthieu Schapranow
 
The Importance Of Data Mining By Musa Mohd. Nordin, Noor
The Importance Of Data Mining By Musa Mohd. Nordin, NoorThe Importance Of Data Mining By Musa Mohd. Nordin, Noor
The Importance Of Data Mining By Musa Mohd. Nordin, Noormuzkara
 
Fireside chat: Newton Howard, Director of the MIT Synthetic Intelligence Lab ...
Fireside chat: Newton Howard, Director of the MIT Synthetic Intelligence Lab ...Fireside chat: Newton Howard, Director of the MIT Synthetic Intelligence Lab ...
Fireside chat: Newton Howard, Director of the MIT Synthetic Intelligence Lab ...Codiax
 
Bioinformatics as a tool for understanding carcinogenesis
Bioinformatics as a tool for understanding carcinogenesisBioinformatics as a tool for understanding carcinogenesis
Bioinformatics as a tool for understanding carcinogenesisDespoina Kalfakakou
 
Health Sciences Driving UCSD Research Cyberinfrastructure
Health Sciences Driving UCSD Research CyberinfrastructureHealth Sciences Driving UCSD Research Cyberinfrastructure
Health Sciences Driving UCSD Research CyberinfrastructureLarry Smarr
 
Bioinformatics tools for NGS data analysis
Bioinformatics tools for NGS data analysisBioinformatics tools for NGS data analysis
Bioinformatics tools for NGS data analysisDespoina Kalfakakou
 
Cancer genome repository_berkeley
Cancer genome repository_berkeleyCancer genome repository_berkeley
Cancer genome repository_berkeleyShyam Sarkar
 
tranSMART Community Meeting 5-7 Nov 13 - Session 3: Characterization of the c...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: Characterization of the c...tranSMART Community Meeting 5-7 Nov 13 - Session 3: Characterization of the c...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: Characterization of the c...David Peyruc
 
Computational Pathology Workshop July 8 2014
Computational Pathology Workshop July 8 2014Computational Pathology Workshop July 8 2014
Computational Pathology Workshop July 8 2014Joel Saltz
 
Quantitative Medicine Feb 2009
Quantitative Medicine Feb 2009Quantitative Medicine Feb 2009
Quantitative Medicine Feb 2009Ian Foster
 
Data analytics challenges in genomics
Data analytics challenges in genomicsData analytics challenges in genomics
Data analytics challenges in genomicsmikaelhuss
 
Driving Applications on the UCSD Big Data Freeway System
Driving Applications on the UCSD Big Data Freeway SystemDriving Applications on the UCSD Big Data Freeway System
Driving Applications on the UCSD Big Data Freeway SystemLarry Smarr
 
NCI HTAN, cancer trajectories, precision oncology
NCI HTAN, cancer trajectories, precision oncologyNCI HTAN, cancer trajectories, precision oncology
NCI HTAN, cancer trajectories, precision oncologyWarren Kibbe
 

Similar to Awakening Clinical Data: Semantics for Scalable Medical Research Informatics (20)

Sequencing Genomics: The New Big Data Driver
Sequencing Genomics:The New Big Data DriverSequencing Genomics:The New Big Data Driver
Sequencing Genomics: The New Big Data Driver
 
Gesundheit geht uns alle an: Smart Data ermöglicht passendere Entscheidungen...
Gesundheit geht uns alle an: Smart Data ermöglicht passendere Entscheidungen...Gesundheit geht uns alle an: Smart Data ermöglicht passendere Entscheidungen...
Gesundheit geht uns alle an: Smart Data ermöglicht passendere Entscheidungen...
 
The Importance Of Data Mining By Musa Mohd. Nordin, Noor
The Importance Of Data Mining By Musa Mohd. Nordin, NoorThe Importance Of Data Mining By Musa Mohd. Nordin, Noor
The Importance Of Data Mining By Musa Mohd. Nordin, Noor
 
Fireside chat: Newton Howard, Director of the MIT Synthetic Intelligence Lab ...
Fireside chat: Newton Howard, Director of the MIT Synthetic Intelligence Lab ...Fireside chat: Newton Howard, Director of the MIT Synthetic Intelligence Lab ...
Fireside chat: Newton Howard, Director of the MIT Synthetic Intelligence Lab ...
 
Bioinformatics as a tool for understanding carcinogenesis
Bioinformatics as a tool for understanding carcinogenesisBioinformatics as a tool for understanding carcinogenesis
Bioinformatics as a tool for understanding carcinogenesis
 
Health Sciences Driving UCSD Research Cyberinfrastructure
Health Sciences Driving UCSD Research CyberinfrastructureHealth Sciences Driving UCSD Research Cyberinfrastructure
Health Sciences Driving UCSD Research Cyberinfrastructure
 
Brief introduction to Bioinformatics
Brief introduction to BioinformaticsBrief introduction to Bioinformatics
Brief introduction to Bioinformatics
 
Bioinformatics tools for NGS data analysis
Bioinformatics tools for NGS data analysisBioinformatics tools for NGS data analysis
Bioinformatics tools for NGS data analysis
 
Cancer genome repository_berkeley
Cancer genome repository_berkeleyCancer genome repository_berkeley
Cancer genome repository_berkeley
 
tranSMART Community Meeting 5-7 Nov 13 - Session 3: Characterization of the c...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: Characterization of the c...tranSMART Community Meeting 5-7 Nov 13 - Session 3: Characterization of the c...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: Characterization of the c...
 
UNMSymposium2014
UNMSymposium2014UNMSymposium2014
UNMSymposium2014
 
GFII 2014 Big Data
GFII 2014 Big DataGFII 2014 Big Data
GFII 2014 Big Data
 
Computational Pathology Workshop July 8 2014
Computational Pathology Workshop July 8 2014Computational Pathology Workshop July 8 2014
Computational Pathology Workshop July 8 2014
 
Quantitative Medicine Feb 2009
Quantitative Medicine Feb 2009Quantitative Medicine Feb 2009
Quantitative Medicine Feb 2009
 
ANN presentataion
ANN presentataionANN presentataion
ANN presentataion
 
Data analytics challenges in genomics
Data analytics challenges in genomicsData analytics challenges in genomics
Data analytics challenges in genomics
 
Driving Applications on the UCSD Big Data Freeway System
Driving Applications on the UCSD Big Data Freeway SystemDriving Applications on the UCSD Big Data Freeway System
Driving Applications on the UCSD Big Data Freeway System
 
Hybrid imaging
Hybrid imagingHybrid imaging
Hybrid imaging
 
NCI HTAN, cancer trajectories, precision oncology
NCI HTAN, cancer trajectories, precision oncologyNCI HTAN, cancer trajectories, precision oncology
NCI HTAN, cancer trajectories, precision oncology
 
Dr. Leroy Hood Lecuture on P4 Medicine
Dr. Leroy Hood Lecuture on P4 MedicineDr. Leroy Hood Lecuture on P4 Medicine
Dr. Leroy Hood Lecuture on P4 Medicine
 

Recently uploaded

Call Girls Jp Nagar Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Jp Nagar Just Call 7001305949 Top Class Call Girl Service AvailableCall Girls Jp Nagar Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Jp Nagar Just Call 7001305949 Top Class Call Girl Service Availablenarwatsonia7
 
Kolkata Call Girls Services 9907093804 @24x7 High Class Babes Here Call Now
Kolkata Call Girls Services 9907093804 @24x7 High Class Babes Here Call NowKolkata Call Girls Services 9907093804 @24x7 High Class Babes Here Call Now
Kolkata Call Girls Services 9907093804 @24x7 High Class Babes Here Call NowNehru place Escorts
 
VIP Call Girls Mumbai Arpita 9910780858 Independent Escort Service Mumbai
VIP Call Girls Mumbai Arpita 9910780858 Independent Escort Service MumbaiVIP Call Girls Mumbai Arpita 9910780858 Independent Escort Service Mumbai
VIP Call Girls Mumbai Arpita 9910780858 Independent Escort Service Mumbaisonalikaur4
 
call girls in Connaught Place DELHI 🔝 >༒9540349809 🔝 genuine Escort Service ...
call girls in Connaught Place  DELHI 🔝 >༒9540349809 🔝 genuine Escort Service ...call girls in Connaught Place  DELHI 🔝 >༒9540349809 🔝 genuine Escort Service ...
call girls in Connaught Place DELHI 🔝 >༒9540349809 🔝 genuine Escort Service ...saminamagar
 
Call Girls Frazer Town Just Call 7001305949 Top Class Call Girl Service Avail...
Call Girls Frazer Town Just Call 7001305949 Top Class Call Girl Service Avail...Call Girls Frazer Town Just Call 7001305949 Top Class Call Girl Service Avail...
Call Girls Frazer Town Just Call 7001305949 Top Class Call Girl Service Avail...narwatsonia7
 
Noida Sector 135 Call Girls ( 9873940964 ) Book Hot And Sexy Girls In A Few C...
Noida Sector 135 Call Girls ( 9873940964 ) Book Hot And Sexy Girls In A Few C...Noida Sector 135 Call Girls ( 9873940964 ) Book Hot And Sexy Girls In A Few C...
Noida Sector 135 Call Girls ( 9873940964 ) Book Hot And Sexy Girls In A Few C...rajnisinghkjn
 
Pharmaceutical Marketting: Unit-5, Pricing
Pharmaceutical Marketting: Unit-5, PricingPharmaceutical Marketting: Unit-5, Pricing
Pharmaceutical Marketting: Unit-5, PricingArunagarwal328757
 
Call Girls Viman Nagar 7001305949 All Area Service COD available Any Time
Call Girls Viman Nagar 7001305949 All Area Service COD available Any TimeCall Girls Viman Nagar 7001305949 All Area Service COD available Any Time
Call Girls Viman Nagar 7001305949 All Area Service COD available Any Timevijaych2041
 
Glomerular Filtration rate and its determinants.pptx
Glomerular Filtration rate and its determinants.pptxGlomerular Filtration rate and its determinants.pptx
Glomerular Filtration rate and its determinants.pptxDr.Nusrat Tariq
 
Russian Call Girls Chickpet - 7001305949 Booking and charges genuine rate for...
Russian Call Girls Chickpet - 7001305949 Booking and charges genuine rate for...Russian Call Girls Chickpet - 7001305949 Booking and charges genuine rate for...
Russian Call Girls Chickpet - 7001305949 Booking and charges genuine rate for...narwatsonia7
 
Book Call Girls in Yelahanka - For 7001305949 Cheap & Best with original Photos
Book Call Girls in Yelahanka - For 7001305949 Cheap & Best with original PhotosBook Call Girls in Yelahanka - For 7001305949 Cheap & Best with original Photos
Book Call Girls in Yelahanka - For 7001305949 Cheap & Best with original Photosnarwatsonia7
 
call girls in green park DELHI 🔝 >༒9540349809 🔝 genuine Escort Service 🔝✔️✔️
call girls in green park  DELHI 🔝 >༒9540349809 🔝 genuine Escort Service 🔝✔️✔️call girls in green park  DELHI 🔝 >༒9540349809 🔝 genuine Escort Service 🔝✔️✔️
call girls in green park DELHI 🔝 >༒9540349809 🔝 genuine Escort Service 🔝✔️✔️saminamagar
 
97111 47426 Call Girls In Delhi MUNIRKAA
97111 47426 Call Girls In Delhi MUNIRKAA97111 47426 Call Girls In Delhi MUNIRKAA
97111 47426 Call Girls In Delhi MUNIRKAAjennyeacort
 
Call Girls Service Noida Maya 9711199012 Independent Escort Service Noida
Call Girls Service Noida Maya 9711199012 Independent Escort Service NoidaCall Girls Service Noida Maya 9711199012 Independent Escort Service Noida
Call Girls Service Noida Maya 9711199012 Independent Escort Service NoidaPooja Gupta
 
Call Girls Hsr Layout Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Hsr Layout Just Call 7001305949 Top Class Call Girl Service AvailableCall Girls Hsr Layout Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Hsr Layout Just Call 7001305949 Top Class Call Girl Service Availablenarwatsonia7
 
Call Girl Lucknow Mallika 7001305949 Independent Escort Service Lucknow
Call Girl Lucknow Mallika 7001305949 Independent Escort Service LucknowCall Girl Lucknow Mallika 7001305949 Independent Escort Service Lucknow
Call Girl Lucknow Mallika 7001305949 Independent Escort Service Lucknownarwatsonia7
 
Call Girls Whitefield Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Whitefield Just Call 7001305949 Top Class Call Girl Service AvailableCall Girls Whitefield Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Whitefield Just Call 7001305949 Top Class Call Girl Service Availablenarwatsonia7
 
Call Girl Nagpur Sia 7001305949 Independent Escort Service Nagpur
Call Girl Nagpur Sia 7001305949 Independent Escort Service NagpurCall Girl Nagpur Sia 7001305949 Independent Escort Service Nagpur
Call Girl Nagpur Sia 7001305949 Independent Escort Service NagpurRiya Pathan
 
Dwarka Sector 6 Call Girls ( 9873940964 ) Book Hot And Sexy Girls In A Few Cl...
Dwarka Sector 6 Call Girls ( 9873940964 ) Book Hot And Sexy Girls In A Few Cl...Dwarka Sector 6 Call Girls ( 9873940964 ) Book Hot And Sexy Girls In A Few Cl...
Dwarka Sector 6 Call Girls ( 9873940964 ) Book Hot And Sexy Girls In A Few Cl...rajnisinghkjn
 
Call Girls Electronic City Just Call 7001305949 Top Class Call Girl Service A...
Call Girls Electronic City Just Call 7001305949 Top Class Call Girl Service A...Call Girls Electronic City Just Call 7001305949 Top Class Call Girl Service A...
Call Girls Electronic City Just Call 7001305949 Top Class Call Girl Service A...narwatsonia7
 

Recently uploaded (20)

Call Girls Jp Nagar Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Jp Nagar Just Call 7001305949 Top Class Call Girl Service AvailableCall Girls Jp Nagar Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Jp Nagar Just Call 7001305949 Top Class Call Girl Service Available
 
Kolkata Call Girls Services 9907093804 @24x7 High Class Babes Here Call Now
Kolkata Call Girls Services 9907093804 @24x7 High Class Babes Here Call NowKolkata Call Girls Services 9907093804 @24x7 High Class Babes Here Call Now
Kolkata Call Girls Services 9907093804 @24x7 High Class Babes Here Call Now
 
VIP Call Girls Mumbai Arpita 9910780858 Independent Escort Service Mumbai
VIP Call Girls Mumbai Arpita 9910780858 Independent Escort Service MumbaiVIP Call Girls Mumbai Arpita 9910780858 Independent Escort Service Mumbai
VIP Call Girls Mumbai Arpita 9910780858 Independent Escort Service Mumbai
 
call girls in Connaught Place DELHI 🔝 >༒9540349809 🔝 genuine Escort Service ...
call girls in Connaught Place  DELHI 🔝 >༒9540349809 🔝 genuine Escort Service ...call girls in Connaught Place  DELHI 🔝 >༒9540349809 🔝 genuine Escort Service ...
call girls in Connaught Place DELHI 🔝 >༒9540349809 🔝 genuine Escort Service ...
 
Call Girls Frazer Town Just Call 7001305949 Top Class Call Girl Service Avail...
Call Girls Frazer Town Just Call 7001305949 Top Class Call Girl Service Avail...Call Girls Frazer Town Just Call 7001305949 Top Class Call Girl Service Avail...
Call Girls Frazer Town Just Call 7001305949 Top Class Call Girl Service Avail...
 
Noida Sector 135 Call Girls ( 9873940964 ) Book Hot And Sexy Girls In A Few C...
Noida Sector 135 Call Girls ( 9873940964 ) Book Hot And Sexy Girls In A Few C...Noida Sector 135 Call Girls ( 9873940964 ) Book Hot And Sexy Girls In A Few C...
Noida Sector 135 Call Girls ( 9873940964 ) Book Hot And Sexy Girls In A Few C...
 
Pharmaceutical Marketting: Unit-5, Pricing
Pharmaceutical Marketting: Unit-5, PricingPharmaceutical Marketting: Unit-5, Pricing
Pharmaceutical Marketting: Unit-5, Pricing
 
Call Girls Viman Nagar 7001305949 All Area Service COD available Any Time
Call Girls Viman Nagar 7001305949 All Area Service COD available Any TimeCall Girls Viman Nagar 7001305949 All Area Service COD available Any Time
Call Girls Viman Nagar 7001305949 All Area Service COD available Any Time
 
Glomerular Filtration rate and its determinants.pptx
Glomerular Filtration rate and its determinants.pptxGlomerular Filtration rate and its determinants.pptx
Glomerular Filtration rate and its determinants.pptx
 
Russian Call Girls Chickpet - 7001305949 Booking and charges genuine rate for...
Russian Call Girls Chickpet - 7001305949 Booking and charges genuine rate for...Russian Call Girls Chickpet - 7001305949 Booking and charges genuine rate for...
Russian Call Girls Chickpet - 7001305949 Booking and charges genuine rate for...
 
Book Call Girls in Yelahanka - For 7001305949 Cheap & Best with original Photos
Book Call Girls in Yelahanka - For 7001305949 Cheap & Best with original PhotosBook Call Girls in Yelahanka - For 7001305949 Cheap & Best with original Photos
Book Call Girls in Yelahanka - For 7001305949 Cheap & Best with original Photos
 
call girls in green park DELHI 🔝 >༒9540349809 🔝 genuine Escort Service 🔝✔️✔️
call girls in green park  DELHI 🔝 >༒9540349809 🔝 genuine Escort Service 🔝✔️✔️call girls in green park  DELHI 🔝 >༒9540349809 🔝 genuine Escort Service 🔝✔️✔️
call girls in green park DELHI 🔝 >༒9540349809 🔝 genuine Escort Service 🔝✔️✔️
 
97111 47426 Call Girls In Delhi MUNIRKAA
97111 47426 Call Girls In Delhi MUNIRKAA97111 47426 Call Girls In Delhi MUNIRKAA
97111 47426 Call Girls In Delhi MUNIRKAA
 
Call Girls Service Noida Maya 9711199012 Independent Escort Service Noida
Call Girls Service Noida Maya 9711199012 Independent Escort Service NoidaCall Girls Service Noida Maya 9711199012 Independent Escort Service Noida
Call Girls Service Noida Maya 9711199012 Independent Escort Service Noida
 
Call Girls Hsr Layout Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Hsr Layout Just Call 7001305949 Top Class Call Girl Service AvailableCall Girls Hsr Layout Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Hsr Layout Just Call 7001305949 Top Class Call Girl Service Available
 
Call Girl Lucknow Mallika 7001305949 Independent Escort Service Lucknow
Call Girl Lucknow Mallika 7001305949 Independent Escort Service LucknowCall Girl Lucknow Mallika 7001305949 Independent Escort Service Lucknow
Call Girl Lucknow Mallika 7001305949 Independent Escort Service Lucknow
 
Call Girls Whitefield Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Whitefield Just Call 7001305949 Top Class Call Girl Service AvailableCall Girls Whitefield Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Whitefield Just Call 7001305949 Top Class Call Girl Service Available
 
Call Girl Nagpur Sia 7001305949 Independent Escort Service Nagpur
Call Girl Nagpur Sia 7001305949 Independent Escort Service NagpurCall Girl Nagpur Sia 7001305949 Independent Escort Service Nagpur
Call Girl Nagpur Sia 7001305949 Independent Escort Service Nagpur
 
Dwarka Sector 6 Call Girls ( 9873940964 ) Book Hot And Sexy Girls In A Few Cl...
Dwarka Sector 6 Call Girls ( 9873940964 ) Book Hot And Sexy Girls In A Few Cl...Dwarka Sector 6 Call Girls ( 9873940964 ) Book Hot And Sexy Girls In A Few Cl...
Dwarka Sector 6 Call Girls ( 9873940964 ) Book Hot And Sexy Girls In A Few Cl...
 
Call Girls Electronic City Just Call 7001305949 Top Class Call Girl Service A...
Call Girls Electronic City Just Call 7001305949 Top Class Call Girl Service A...Call Girls Electronic City Just Call 7001305949 Top Class Call Girl Service A...
Call Girls Electronic City Just Call 7001305949 Top Class Call Girl Service A...
 

Awakening Clinical Data: Semantics for Scalable Medical Research Informatics

  • 1. Awakening Clinical Data: Semantics for Scalable Medical Research Informatics Satya S. Sahoo Division Medical Informatics Electrical Engineering and Computer Science Department Case Western Reserve University Cleveland, OH, USA
  • 2. Big Picture of Data in Clinical Research 143, 961 Patients per year (e.g. Emory) MRI: 50-100MB PET: 60-100MB National Sleep Research Resource: 500 TB MRI, PET scans Patient Reports source: PRISM project, BME dept CWRU source: PRISM project CWRU Case Western EMU: 250 TB Epilepsy Monitoring Unit (EMU) Data 500-600MB per patient per stay in EMU Wireless Health Data source: CWRU School of Engineering ~5.6 billion wireless 1-20GB each connections and growing Polysomnograms Pathology Reports, Tissue Bank source: Physio-MIMI, PRISM CWRU source: NLM and Wikipedia
  • 3. Big Picture of Data in Clinical Research 143, 961 Patients per year (e.g. Emory) MRI: 50-100MB •  Ultra large volume of data and growing rapidly PET: 60-100MB •  Data is Multi-modal, Heterogeneous •  Heterogeneity: Syntactic, Structural, Semantic National Sleep Research Resource: 500 TB MRI, PET scans Patient Reports source: PRISM project, BME dept CWRU source: PRISM project CWRU Case Western EMU: 250 TB Epilepsy Monitoring Unit (EMU) Data 500-600MB per patient per stay in EMU Wireless Health Data source: CWRU School of Engineering ~5.6 billion wireless 1-20GB each connections and growing Polysomnograms Pathology Reports, Tissue Bank source: Physio-MIMI, PRISM CWRU source: NLM and Wikipedia
  • 4. Scalability in Medical Informatics: Beyond Volume Exemplar: Sleep Medicine Research MRI, PET scans Patient Reports source: PRISM project, BME dept CWRU source: PRISM project CWRU Epilepsy Monitoring Unit (EMU) Data Wireless Health Data source: CWRU School of Engineering Polysomnograms Pathology Reports, Tissue Bank source: Physio-MIMI, PRISM CWRU source: NLM and Wikipedia
  • 5. Scalability in Medical Informatics: Beyond Volume Exemplar: Sleep Medicine Research •  Multi-Center Studies with differing administrative requirements – business logicscans Patient Reports MRI, PET source: PRISM project, BME dept CWRU source: PRISM project CWRU •  Dynamic data – grows over project duration Epilepsy Monitoring Unit (EMU) Data •  Data Semantics as foundation to support a wide spectrum of users – clinicians, nurse practitioners, research fellows Wireless Health Data source: CWRU School of Engineering Polysomnograms Pathology Reports, Tissue Bank source: Physio-MIMI, PRISM CWRU source: NLM and Wikipedia
  • 6. A Wish List for Scalable Clinical Data Management •  Reconcile Data Heterogeneity – most critical to successful translational research o  Syntactic heterogeneity – less of a problem, data dictionaries help o  Structural heterogeneity – problematic, XML somewhat helpful o  Semantic heterogeneity – a huge problem, ontologies to the rescue? •  Provenance – essential for data quality, compliance, insight o  Blood Oxygen Baseline: oxygen saturation during the first 15 or 30 seconds of sleep o  Patient blood report last month cause of change in medication – Domain Provenance (not just tuple provenance) •  Intuitive access to information – clinical trials eligibility, cohort identification •  Scalable - Data sources, research partners added or removed dynamically
  • 7. A “not to do” list for Clinical Data Management Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch •  No Linked Open Patient Data – HIPAA, HITECH Act (US), Data Protection Act (UK) o  De-identified data – IRB approval •  Ontology as global schema – but no RDF o  Vast majority as RDB o  Practical issues with RDF – cannot be institution- specific URI (privacy)
  • 8. Physio-MIMI: Multi‐Modality, Multi‐Resource Environment for Physiological and Clinical Research Clinical Researcher SNOMED-CT FMA Sleep Domain … Ontology OGMS Any number of new centers
  • 9. Physio-MIMI: Enabling Scalable Medical Research •  NCRR‐funded, multi‐CTSA site project: Sleep medicine as exemplar •  Federated data management – scalable, adapts to changing data access policies •  Ontology-driven: o  Data mappings – Ontology class to data dictionary terms (manually curated) o  Drive query interface o  Manage provenance •  Privacy aware, IRB-compliant •  Collaboration among Case Western, U. of Michigan, Marshfield Clinic and U. of Wisconsin, Madison o  Now Harvard Medical School
  • 10. Key Resource: Sleep Domain Ontology (SDO) https://mimi.case.edu/concepts
  • 11. Data Mappings: SDO to Data Dictionary Physio-Map Module •  Visual interface •  Stores mappings in XML – moving towards rules •  Dynamically executed in response to user query User Voting
  • 12. Provenance: Contextual Metadata for Clinical Research Slide courtesy: Remo Mueller
  • 13. Provenance: To Trace Variations in Data and Results Slide courtesy: Remo Mueller
  • 14. Modified from slide courtesy: Remo Mueller
  • 15. Provenance: Source information for Patient Data Slide courtesy: Remo Mueller
  • 16. Intuitive Query Interface: Ontology (SDO)-driven Visual Aggregator and Explorer (VisAgE) DataSets Ontology Concept – Type of Query Widget
  • 17. PhysioMIMI in National Sleep Research Resource •  National Sleep Research Resource (NSSR) – scored and awaiting funding review •  Collaboration between Harvard Medical School (domain experts) and Case Western (CS) with 15 projects o  50,000 sleep research studies – total size of 500TB •  Semantic Data Integration – SDO and Sleep Provenance Ontology (extending W3C PROV Ontology PROV-O) •  Signal processing tools – using a common format called European Data Format (EDF), XML-based •  Domain analysis, cross-linking – secure Web access
  • 18. Challenges: Semantics in Large Scale Clinical Data •  Incentives for adopting RDF in clinical data management – what is already not possible in RDB? •  OWL2, RDFS reasoning – Privacy aware reasoning, semantics-aware access control (Nguyen et al. 2012) •  Missing Semantics? o  Variable, missing provenance in original study - re- create provenance with (limited) provenance? o  Fine-level granularity for semantic annotation of signal data – currently not scalable •  A little semantics does not go too far in clinical data o  Need for greater involvement of Semantic Web community in development of EHR systems
  • 19. Acknowledgements •  Guo-Qiang Zhang, Remo Mueller, Samden Lhatoo, Susan Redline, Alireza Bozorgi •  Division of Medical Informatics: Lingyun Luo, Joe Teagno, Meng Zhao, Jake Luo, Licong Cui, Chien-Hung Chen, Catherine Jayapandian •  Physio-MIMI Team: http://physiomimi.case.edu/ •  Contact Information: satya.sahoo@case.edu, http://cci.case.edu/cci/index.php/Satya_Sahoo