SlideShare a Scribd company logo
1 of 66
Human Studies Database Project CTSA Informatics All Hands Meeting October 13, 2011 Ida Sim, UCSF, for the HSDB team Funding: CTSAs and R01-RR-026040
[object Object],The HSDB Team Jim Brinkley U Wash Simona Carini UCSF Todd Detwiler U Wash Harold Lehmann Hopkins Brad Pollock UTHSC S Ant Shamim Mollah Rockefeller Ida Sim UCSF Harold Solbrig Mayo Samson Tu Stanford Knut Wittkowski Rockefeller BERD BERD
Outline ,[object Object],[object Object],[object Object],[object Object],[object Object]
Broad Long-Term Objective  ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Go for the Gold? Main Results Table  2.7 (1.1 - 4.1) 2.2 (1.7-3.4) 121 (99-129) 110 (87-134) 0.91 (0.93-1.04) 0.83 (0.79-0.99) 45.1 (39.9-50.5) 46.4 (39.2-51.2)
Need Standardized Metadata ,[object Object],2.7 (1.1 - 4.1) 2.2 (1.7-3.4) Creatinine 121 (99-129) 110 (87-134) Weight (kg) 0.91 (0.93-1.04) 0.83 (0.79-0.99) ICa 45.1 (39.9-50.5) 46.4 (39.2-51.2) Age
Description of Study Protocol Critical  for Interpreting Results ,[object Object],2.7 (1.1 - 4.1) 2.2 (1.7-3.4) Creatinine 121 (99-129) 110 (87-134) Weight (kg) 0.91 (0.93-1.04) 0.83 (0.79-0.99) ICa 45.1 (39.9-50.5) 46.4 (39.2-51.2) Age
Description of Study Protocol Critical  for Interpreting Results ,[object Object],[object Object],2.7 (1.1 - 4.1) 2.2 (1.7-3.4) Creatinine Chocolate Garlic 121 (99-129) 110 (87-134) Weight (kg) 0.91 (0.93-1.04) 0.83 (0.79-0.99) ICa 45.1 (39.9-50.5) 46.4 (39.2-51.2) Age
Need Ontology of Clinical Research  ,[object Object],[object Object],2.7 (1.1 - 4.1) 2.2 (1.7-3.4) Creatinine Chocolate Garlic 121 (99-129) 110 (87-134) Weight (kg) 0.91 (0.93-1.04) 0.83 (0.79-0.99) ICa 45.1 (39.9-50.5) 46.4 (39.2-51.2) Age
HSDB Project Aims and Status ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
[object Object]
6 HIV and 186 Studies Studies Study design Intervention/Factor Primary outcome(s) Secondary outcome(s) Taha  (#1,  UCSF* ) Parallel group, randomized Arm 1: metronidazole  + erythomycin Arm 2: placebo - Infant HIV Infection at 4-6 wks  - Composite of infant HIV infection and mortality, at 1 year of age - Infant HIV Infection at 24-48 hours, and 12 months  - etc. Metzger  (#2,  UCSF* ) Parallel group, randomized Arm 1: Buprenorphine/ Naloxone 3 wks + 52 wks Arm 2: Buprenorphine/ Naloxone max 18 days All arms: counseling HIV-1 Infection or death, at 104 week visit - Death, through week 156 - HIV-1 Infection every 6 months at scheduled follow up visits - etc. German  (#3, Hopkins) Cohort HIV status at baseline Recognized HIV Infection, Wave 2 (3 years) - Unrecognized HIV Infection, Wave 2 (3 years)  - etc. Wawer  (#4, Hopkins) Arm 1: Immediate circumcision Arm 2: Delayed circumcision Male-to-female HIV transmission, throughout study El-Sadr  (#5,  UCSF* ) Cohort Assigned to drug conservation (DC) arm or assigned to viral suppression (VS) arm in SMART study HIV transmission risk behavior, end of study HIV transmission risk behavior in participants who are not on ART at enrollment, end of study Cohen  (#6,  UCSF* ) Cohort HIV-1 infection status: proven acute, established, or uninfected - Prevalence of acute HIV infection, throughout study  - etc. Rockefeller: 186 studies Interventional Observational
Data Sources Local Servers Query Integrator Registry XML XML auto-generation XML Manual Bulk Upload Protocol Documents Electronic IRB (iMedRIS) Johns Hopkins Rockefeller UCSF  (AWS) OCRe-XSD OCRe
[object Object],Calls BioPortal with a subsumption query on SNOMED, for children of “macrolide” [428787002] Searches for interventions within arms, for codes matching “macrolide” or children
[object Object]
Outline ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Data Sources Local Servers Query Integrator Registry XML XML auto-generation XML Manual Bulk Upload Protocol Documents Electronic IRB (iMedRIS) Johns Hopkins Rockefeller UCSF  (AWS) OCRe-XSD OCRe
OCRe ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Sim I, et al.  AMIA CRI Summit  2010,  p.51-55.
OCRe Import Graph ,[object Object],[object Object],[object Object]
Modeling Study Outcomes and Analyses ,[object Object],HIV Infection has_code 86406008 has_code_system_name SNOMED-CT has_code_system_version2011_01_31 has_display_name Human immunodeficiency virus infection Primary
Composite Outcomes:  HIV Infection or Death at 1 year of age ,[object Object],Var1: HIV Infection Var2: Death 1 year of age Primary
Outline ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Data Sources Local Servers Query Integrator Registry XML XML auto-generation XML Manual Bulk Upload Protocol Documents Electronic IRB (iMedRIS) Johns Hopkins Rockefeller UCSF  (AWS) OCRe-XSD OCRe
The HSDB 4-Quadrant Diagram
[object Object],October, 2010
[object Object],April, 2011
[object Object],[object Object],May, 2011
[object Object],June, 2011
October, 2010 October, 2011
[object Object]
OCRe-XSD ,[object Object],[object Object],[object Object]
Outline ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
 
RU HSDB Data Mapping Workflow XML Generator: 1. Links IORG number 2. Cleans textual data 3. Generates xml file Oracle DB  RU iMedRIS HSDB xsd schema SQL Mapper RU HSDB xml  XML Generator generates rules for mapping  data elements extracts data elements SQL Mapper: 1. Maps data elements using xsd  2. Transforms extracted data using  analytics (data conversion, masking,  concatenation, etc.)  generates data  elements table
Hopkins and UCSF Workflow ,[object Object],[object Object],[object Object],[object Object]
Registry File of Published XML Instances ,[object Object],[object Object],[object Object],[object Object]
Outline ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
 
Query Integrator: Brinkley Lab, U Wash ,[object Object],[object Object],http://www.si.washington.edu/projects/QI
BioPortal REST services (SNOMED) UCSF HSDB Data OCRe in OWL Remote Services vSPARQL Service DXQuery Service Other Services XML RDF/OWL Other Clients QI Client RDF Store Query Database QI Server QES QI Core QI “Plugins”
Outline ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Four Illustrative Queries ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Demo: Query on Interventions ,[object Object],[object Object],[object Object]
Querying BioPortal
[object Object],Chains a subsumption query to SNOMED, macrolide ID = 428787002
[object Object],REST call to SNOMED in BioPortal Cleans and returns all subclasses of SNOMED ID (e.g., 428787002 for Macrolide)
SNOMED Results: All Children of 42878700
[object Object],Matches studies where Arm tags contain SNOMED code of Macrolide or its children
Arm Structure Explicitly Modeled
Query on Study Design ,[object Object],[object Object],[object Object]
Finding all study designs matching OCRe ID for “interventional” or its children Retrieve all interventional studies
OCRe study design typology OCRe_XSD XML instance
 
Query on Study Design ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Finding all study designs matching OCRe ID for “parallel group”  Retrieve all placebo-controlled randomized trials Finding allocation schemes under OCRe’s “random allocation” hierarchy Finding interventions = SNOMED code for “placebo” [182886004]
OCRe Hierarchy of Allocation Type
Query on Outcome Variables ,[object Object],[object Object],[object Object]
Same BioPortal SNOMED subsumption call as for Macrolide query Matches any outcome variable code to SNOMED ID for HIV Infection or children All studies with HIV Infection as any single variable outcome
Call query for HIV infection as a single variable outcome with outcome priority = Primary All studies with HIV infection as a Primary single variable outcome
Primary outcome is HIV Infection at 4-6 weeks HIV Infection at 24-48 hours, and at 12 months, are Secondary outcomes
[object Object]
Outline ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Summary ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Future Work ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Federate Your Data With Us! ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Links ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]

More Related Content

Viewers also liked

Scaling Self-Experimentation
Scaling Self-ExperimentationScaling Self-Experimentation
Scaling Self-ExperimentationIda Sim
 
Evidence Farming and Open Architecture
Evidence Farming and Open ArchitectureEvidence Farming and Open Architecture
Evidence Farming and Open ArchitectureIda Sim
 
BYO App: Announcing Linq from Open mHealth
BYO App: Announcing Linq from Open mHealthBYO App: Announcing Linq from Open mHealth
BYO App: Announcing Linq from Open mHealthIda Sim
 
Insights into the Success of Text Messaging for Health
Insights into the Success of Text Messaging for HealthInsights into the Success of Text Messaging for Health
Insights into the Success of Text Messaging for Healthmobilecommons
 
Mobile Health for Reducing Disparities: Does it Work and How Will we Know?
Mobile Health for Reducing Disparities: Does it Work and How Will we Know? Mobile Health for Reducing Disparities: Does it Work and How Will we Know?
Mobile Health for Reducing Disparities: Does it Work and How Will we Know? Ida Sim
 
The Uneven Future of Evidence-Based Medicine
The Uneven Future of Evidence-Based MedicineThe Uneven Future of Evidence-Based Medicine
The Uneven Future of Evidence-Based MedicineIda Sim
 

Viewers also liked (7)

Stanford globalm health_p_mechael
Stanford globalm health_p_mechaelStanford globalm health_p_mechael
Stanford globalm health_p_mechael
 
Scaling Self-Experimentation
Scaling Self-ExperimentationScaling Self-Experimentation
Scaling Self-Experimentation
 
Evidence Farming and Open Architecture
Evidence Farming and Open ArchitectureEvidence Farming and Open Architecture
Evidence Farming and Open Architecture
 
BYO App: Announcing Linq from Open mHealth
BYO App: Announcing Linq from Open mHealthBYO App: Announcing Linq from Open mHealth
BYO App: Announcing Linq from Open mHealth
 
Insights into the Success of Text Messaging for Health
Insights into the Success of Text Messaging for HealthInsights into the Success of Text Messaging for Health
Insights into the Success of Text Messaging for Health
 
Mobile Health for Reducing Disparities: Does it Work and How Will we Know?
Mobile Health for Reducing Disparities: Does it Work and How Will we Know? Mobile Health for Reducing Disparities: Does it Work and How Will we Know?
Mobile Health for Reducing Disparities: Does it Work and How Will we Know?
 
The Uneven Future of Evidence-Based Medicine
The Uneven Future of Evidence-Based MedicineThe Uneven Future of Evidence-Based Medicine
The Uneven Future of Evidence-Based Medicine
 

Similar to Human Studies Database Project (demo)

The Human Cell Atlas Data Coordination Platform
The Human Cell Atlas Data Coordination PlatformThe Human Cell Atlas Data Coordination Platform
The Human Cell Atlas Data Coordination PlatformLaura Clarke
 
Being Reproducible: SSBSS Summer School 2017
Being Reproducible: SSBSS Summer School 2017Being Reproducible: SSBSS Summer School 2017
Being Reproducible: SSBSS Summer School 2017Carole Goble
 
Semantic Web Technologies as a Framework for Clinical Informatics
Semantic Web Technologies as a Framework for Clinical InformaticsSemantic Web Technologies as a Framework for Clinical Informatics
Semantic Web Technologies as a Framework for Clinical InformaticsChimezie Ogbuji
 
Scott Edmunds: GigaScience - a journal or a database? Lessons learned from th...
Scott Edmunds: GigaScience - a journal or a database? Lessons learned from th...Scott Edmunds: GigaScience - a journal or a database? Lessons learned from th...
Scott Edmunds: GigaScience - a journal or a database? Lessons learned from th...GigaScience, BGI Hong Kong
 
The Logical Model Designer - Binding Information Models to Terminology
The Logical Model Designer - Binding Information Models to TerminologyThe Logical Model Designer - Binding Information Models to Terminology
The Logical Model Designer - Binding Information Models to TerminologySnow Owl
 
Metagenomic Data Provenance and Management using the ISA infrastructure --- o...
Metagenomic Data Provenance and Management using the ISA infrastructure --- o...Metagenomic Data Provenance and Management using the ISA infrastructure --- o...
Metagenomic Data Provenance and Management using the ISA infrastructure --- o...Alejandra Gonzalez-Beltran
 
The beauty of workflows and models
The beauty of workflows and modelsThe beauty of workflows and models
The beauty of workflows and modelsmyGrid team
 
Being FAIR: Enabling Reproducible Data Science
Being FAIR: Enabling Reproducible Data ScienceBeing FAIR: Enabling Reproducible Data Science
Being FAIR: Enabling Reproducible Data ScienceCarole Goble
 
Production Bioinformatics, emphasis on Production
Production Bioinformatics, emphasis on ProductionProduction Bioinformatics, emphasis on Production
Production Bioinformatics, emphasis on ProductionChris Dwan
 
The Electronic Notebook Ontology
The Electronic Notebook OntologyThe Electronic Notebook Ontology
The Electronic Notebook OntologyStuart Chalk
 
HKU Data Curation MLIM7350 Class 8
HKU Data Curation MLIM7350 Class 8HKU Data Curation MLIM7350 Class 8
HKU Data Curation MLIM7350 Class 8Scott Edmunds
 
fhir and loinc
fhir and loincfhir and loinc
fhir and loincDevDays
 
GigaScience: a new resource for the big-data community.
GigaScience: a new resource for the big-data community.GigaScience: a new resource for the big-data community.
GigaScience: a new resource for the big-data community.GigaScience, BGI Hong Kong
 
Ontologising the Health Level Seven (HL7) Standard
Ontologising the Health Level Seven (HL7) StandardOntologising the Health Level Seven (HL7) Standard
Ontologising the Health Level Seven (HL7) StandardRatnesh Sahay
 
2011-11-28 Open PHACTS at RSC CICAG
2011-11-28 Open PHACTS at RSC CICAG2011-11-28 Open PHACTS at RSC CICAG
2011-11-28 Open PHACTS at RSC CICAGopen_phacts
 
Assessing Galaxy's ability to express scientific workflows in bioinformatics
Assessing Galaxy's ability to express scientific workflows in bioinformaticsAssessing Galaxy's ability to express scientific workflows in bioinformatics
Assessing Galaxy's ability to express scientific workflows in bioinformaticsPeter van Heusden
 
Towards Automatic Classification of LOD Datasets
Towards Automatic Classification of LOD DatasetsTowards Automatic Classification of LOD Datasets
Towards Automatic Classification of LOD DatasetsBlerina Spahiu
 
OpenTox - an open community and framework supporting predictive toxicology an...
OpenTox - an open community and framework supporting predictive toxicology an...OpenTox - an open community and framework supporting predictive toxicology an...
OpenTox - an open community and framework supporting predictive toxicology an...Barry Hardy
 
Computation and Knowledge
Computation and KnowledgeComputation and Knowledge
Computation and KnowledgeIan Foster
 

Similar to Human Studies Database Project (demo) (20)

The Human Cell Atlas Data Coordination Platform
The Human Cell Atlas Data Coordination PlatformThe Human Cell Atlas Data Coordination Platform
The Human Cell Atlas Data Coordination Platform
 
Being Reproducible: SSBSS Summer School 2017
Being Reproducible: SSBSS Summer School 2017Being Reproducible: SSBSS Summer School 2017
Being Reproducible: SSBSS Summer School 2017
 
Semantic Web Technologies as a Framework for Clinical Informatics
Semantic Web Technologies as a Framework for Clinical InformaticsSemantic Web Technologies as a Framework for Clinical Informatics
Semantic Web Technologies as a Framework for Clinical Informatics
 
Scott Edmunds: GigaScience - a journal or a database? Lessons learned from th...
Scott Edmunds: GigaScience - a journal or a database? Lessons learned from th...Scott Edmunds: GigaScience - a journal or a database? Lessons learned from th...
Scott Edmunds: GigaScience - a journal or a database? Lessons learned from th...
 
The Logical Model Designer - Binding Information Models to Terminology
The Logical Model Designer - Binding Information Models to TerminologyThe Logical Model Designer - Binding Information Models to Terminology
The Logical Model Designer - Binding Information Models to Terminology
 
Metagenomic Data Provenance and Management using the ISA infrastructure --- o...
Metagenomic Data Provenance and Management using the ISA infrastructure --- o...Metagenomic Data Provenance and Management using the ISA infrastructure --- o...
Metagenomic Data Provenance and Management using the ISA infrastructure --- o...
 
The beauty of workflows and models
The beauty of workflows and modelsThe beauty of workflows and models
The beauty of workflows and models
 
Being FAIR: Enabling Reproducible Data Science
Being FAIR: Enabling Reproducible Data ScienceBeing FAIR: Enabling Reproducible Data Science
Being FAIR: Enabling Reproducible Data Science
 
Production Bioinformatics, emphasis on Production
Production Bioinformatics, emphasis on ProductionProduction Bioinformatics, emphasis on Production
Production Bioinformatics, emphasis on Production
 
The Electronic Notebook Ontology
The Electronic Notebook OntologyThe Electronic Notebook Ontology
The Electronic Notebook Ontology
 
HKU Data Curation MLIM7350 Class 8
HKU Data Curation MLIM7350 Class 8HKU Data Curation MLIM7350 Class 8
HKU Data Curation MLIM7350 Class 8
 
fhir and loinc
fhir and loincfhir and loinc
fhir and loinc
 
GigaScience: a new resource for the big-data community.
GigaScience: a new resource for the big-data community.GigaScience: a new resource for the big-data community.
GigaScience: a new resource for the big-data community.
 
Ontologising the Health Level Seven (HL7) Standard
Ontologising the Health Level Seven (HL7) StandardOntologising the Health Level Seven (HL7) Standard
Ontologising the Health Level Seven (HL7) Standard
 
2011-11-28 Open PHACTS at RSC CICAG
2011-11-28 Open PHACTS at RSC CICAG2011-11-28 Open PHACTS at RSC CICAG
2011-11-28 Open PHACTS at RSC CICAG
 
Assessing Galaxy's ability to express scientific workflows in bioinformatics
Assessing Galaxy's ability to express scientific workflows in bioinformaticsAssessing Galaxy's ability to express scientific workflows in bioinformatics
Assessing Galaxy's ability to express scientific workflows in bioinformatics
 
Towards Automatic Classification of LOD Datasets
Towards Automatic Classification of LOD DatasetsTowards Automatic Classification of LOD Datasets
Towards Automatic Classification of LOD Datasets
 
OpenTox - an open community and framework supporting predictive toxicology an...
OpenTox - an open community and framework supporting predictive toxicology an...OpenTox - an open community and framework supporting predictive toxicology an...
OpenTox - an open community and framework supporting predictive toxicology an...
 
Computation and Knowledge
Computation and KnowledgeComputation and Knowledge
Computation and Knowledge
 
Overview of Next Gen Sequencing Data Analysis
Overview of Next Gen Sequencing Data AnalysisOverview of Next Gen Sequencing Data Analysis
Overview of Next Gen Sequencing Data Analysis
 

Recently uploaded

How to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxHow to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxmanuelaromero2013
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13Steve Thomason
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104misteraugie
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Sapana Sha
 
URLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppURLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppCeline George
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdfQucHHunhnh
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdfssuser54595a
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Krashi Coaching
 
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...Marc Dusseiller Dusjagr
 
Privatization and Disinvestment - Meaning, Objectives, Advantages and Disadva...
Privatization and Disinvestment - Meaning, Objectives, Advantages and Disadva...Privatization and Disinvestment - Meaning, Objectives, Advantages and Disadva...
Privatization and Disinvestment - Meaning, Objectives, Advantages and Disadva...RKavithamani
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdfSoniaTolstoy
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactPECB
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introductionMaksud Ahmed
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...EduSkills OECD
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxSayali Powar
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeThiyagu K
 
Separation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesSeparation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesFatimaKhan178732
 

Recently uploaded (20)

How to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxHow to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptx
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
 
URLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppURLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website App
 
Staff of Color (SOC) Retention Efforts DDSD
Staff of Color (SOC) Retention Efforts DDSDStaff of Color (SOC) Retention Efforts DDSD
Staff of Color (SOC) Retention Efforts DDSD
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
 
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
 
Privatization and Disinvestment - Meaning, Objectives, Advantages and Disadva...
Privatization and Disinvestment - Meaning, Objectives, Advantages and Disadva...Privatization and Disinvestment - Meaning, Objectives, Advantages and Disadva...
Privatization and Disinvestment - Meaning, Objectives, Advantages and Disadva...
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
 
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
 
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdfTataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and Mode
 
Separation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesSeparation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and Actinides
 

Human Studies Database Project (demo)

  • 1. Human Studies Database Project CTSA Informatics All Hands Meeting October 13, 2011 Ida Sim, UCSF, for the HSDB team Funding: CTSAs and R01-RR-026040
  • 2.
  • 3.
  • 4.
  • 5. Go for the Gold? Main Results Table 2.7 (1.1 - 4.1) 2.2 (1.7-3.4) 121 (99-129) 110 (87-134) 0.91 (0.93-1.04) 0.83 (0.79-0.99) 45.1 (39.9-50.5) 46.4 (39.2-51.2)
  • 6.
  • 7.
  • 8.
  • 9.
  • 10.
  • 11.
  • 12. 6 HIV and 186 Studies Studies Study design Intervention/Factor Primary outcome(s) Secondary outcome(s) Taha (#1, UCSF* ) Parallel group, randomized Arm 1: metronidazole + erythomycin Arm 2: placebo - Infant HIV Infection at 4-6 wks - Composite of infant HIV infection and mortality, at 1 year of age - Infant HIV Infection at 24-48 hours, and 12 months - etc. Metzger (#2, UCSF* ) Parallel group, randomized Arm 1: Buprenorphine/ Naloxone 3 wks + 52 wks Arm 2: Buprenorphine/ Naloxone max 18 days All arms: counseling HIV-1 Infection or death, at 104 week visit - Death, through week 156 - HIV-1 Infection every 6 months at scheduled follow up visits - etc. German (#3, Hopkins) Cohort HIV status at baseline Recognized HIV Infection, Wave 2 (3 years) - Unrecognized HIV Infection, Wave 2 (3 years) - etc. Wawer (#4, Hopkins) Arm 1: Immediate circumcision Arm 2: Delayed circumcision Male-to-female HIV transmission, throughout study El-Sadr (#5, UCSF* ) Cohort Assigned to drug conservation (DC) arm or assigned to viral suppression (VS) arm in SMART study HIV transmission risk behavior, end of study HIV transmission risk behavior in participants who are not on ART at enrollment, end of study Cohen (#6, UCSF* ) Cohort HIV-1 infection status: proven acute, established, or uninfected - Prevalence of acute HIV infection, throughout study - etc. Rockefeller: 186 studies Interventional Observational
  • 13. Data Sources Local Servers Query Integrator Registry XML XML auto-generation XML Manual Bulk Upload Protocol Documents Electronic IRB (iMedRIS) Johns Hopkins Rockefeller UCSF (AWS) OCRe-XSD OCRe
  • 14.
  • 15.
  • 16.
  • 17. Data Sources Local Servers Query Integrator Registry XML XML auto-generation XML Manual Bulk Upload Protocol Documents Electronic IRB (iMedRIS) Johns Hopkins Rockefeller UCSF (AWS) OCRe-XSD OCRe
  • 18.
  • 19.
  • 20.
  • 21.
  • 22.
  • 23. Data Sources Local Servers Query Integrator Registry XML XML auto-generation XML Manual Bulk Upload Protocol Documents Electronic IRB (iMedRIS) Johns Hopkins Rockefeller UCSF (AWS) OCRe-XSD OCRe
  • 25.
  • 26.
  • 27.
  • 28.
  • 30.
  • 31.
  • 32.
  • 33.  
  • 34. RU HSDB Data Mapping Workflow XML Generator: 1. Links IORG number 2. Cleans textual data 3. Generates xml file Oracle DB RU iMedRIS HSDB xsd schema SQL Mapper RU HSDB xml XML Generator generates rules for mapping data elements extracts data elements SQL Mapper: 1. Maps data elements using xsd 2. Transforms extracted data using analytics (data conversion, masking, concatenation, etc.) generates data elements table
  • 35.
  • 36.
  • 37.
  • 38.  
  • 39.
  • 40. BioPortal REST services (SNOMED) UCSF HSDB Data OCRe in OWL Remote Services vSPARQL Service DXQuery Service Other Services XML RDF/OWL Other Clients QI Client RDF Store Query Database QI Server QES QI Core QI “Plugins”
  • 41.
  • 42.
  • 43.
  • 45.
  • 46.
  • 47. SNOMED Results: All Children of 42878700
  • 48.
  • 50.
  • 51. Finding all study designs matching OCRe ID for “interventional” or its children Retrieve all interventional studies
  • 52. OCRe study design typology OCRe_XSD XML instance
  • 53.  
  • 54.
  • 55. Finding all study designs matching OCRe ID for “parallel group” Retrieve all placebo-controlled randomized trials Finding allocation schemes under OCRe’s “random allocation” hierarchy Finding interventions = SNOMED code for “placebo” [182886004]
  • 56. OCRe Hierarchy of Allocation Type
  • 57.
  • 58. Same BioPortal SNOMED subsumption call as for Macrolide query Matches any outcome variable code to SNOMED ID for HIV Infection or children All studies with HIV Infection as any single variable outcome
  • 59. Call query for HIV infection as a single variable outcome with outcome priority = Primary All studies with HIV infection as a Primary single variable outcome
  • 60. Primary outcome is HIV Infection at 4-6 weeks HIV Infection at 24-48 hours, and at 12 months, are Secondary outcomes
  • 61.
  • 62.
  • 63.
  • 64.
  • 65.
  • 66.

Editor's Notes

  1. Our project objective is
  2. The gold is results data, so start sharing there?
  3. Standardized metadata isn’t enough. We need to know more about the study protocol. Were these outcomes…Probably baseline, but do you KNOW?
  4. But still you don’t know enough. What do the columns represent? RCT of garlic vs. chocolate on weight loss? An observational study of garlic vs. chocolate on slowing renal failure?
  5. To make sense of Context around Results Table … not study execution or research administration
  6. These studies are in XML, conformant to an XSD that is automatically generated from OCRe. The XML data is in 3 seoarate servers,and we are going to query over them using the Query Integrator from the U. Washington.
  7. The Quick Pass Demo: Xquery --
  8. These studies are in XML, conformant to an XSD that is automatically generated from OCRe. The XML data is in 3 seoarate servers,and we are going to query over them using the Query Integrator from the U. Washington.
  9. in original studies, observations are acquired directly on or from the study participants (including individual participant-level data from databases, and participant-level meta-analysis) in meta studies, observations are acquired from journal articles, abstracts, etc. reporting on other studies
  10. To give you a sense of how OCRe is set up: (the generic set of possible annotations are defined in 'export_annotation_def.owl)
  11. Protocols have variables (outcome variables, factor or treatment assignment variables). For this demo, we’ll be looking at HIV Infection as a study outcome variable. They may be described by a Clinical Descriptor, like an ID from a code system Outcome variables may be primary or secondary, and may be assessed at one or more timepoints. Each variable plays a role of an independent of dependent variable in one or more statistical analyses. For example, in the main analysis of an interventional study, has as its independent variable the assignment variable (to a marcolide or placebo), and the dependent variable is the primary outcome.
  12. Variables may also be derived from other variables, e.g., composite outcomes, averages, etc.
  13. These studies are in XML, conformant to an XSD that is automatically generated from OCRe. The XML data is in 3 seoarate servers,and we are going to query over them using the Query Integrator from the U. Washington.
  14. We talked already about the OCRe Import graph. For data acquisition, we’ve decided that the normative form of HSDB data will be RDF. But it is a big jump from relational databases to RDF for most institutions. Therefore, we’re defining an intermediate step, OCRe in XSD, as a target for institutions to take their data from relational to XML. Then need to go from XML to RDF, not quite sure how yet. Once data in RDF, we can do logical curation and use various tools including WebProtege and/or VITRO to build curation interfaces. The RDF data can then be queried using Query Integrator, which accesses BioPortal for terminologies and value sets.
  15. For
  16. Interactive Matrix Language http://www.si.washington.edu/projects/QI http ://sig.biostr.washington. edu/projects/queryintegrator
  17. manually supply macrolide SNOMED code for now queries BioPortal REST service for all children of macrolide term query 221: in turn REST query, get paths to leaves (passes in root), calls BioPortal, returns paths from Macrolide to leaf subclasses; and parses the results and pulls out the all leaf and interim SNOMED codes;