SlideShare a Scribd company logo
1 of 15
APPROACHES TO COMBINING
SUPPLEMENTARY DATASETS ACROSS
MULTIPLE TRUSTED RESEARCH
ENVIRONMENTS USING FEDERATED
ANALYSIS
DR. ARAVIND SESAGIRI RAAMKUMAR
FEDERATED ANALYSIS (FA)
• Analysis of datasets or data sources that are present in different
geographic locations or networks
• Raw data generally not shared during analysis
• Statistical parameters or aggregated data are exchanged during
analysis
• The efficiency of FA projects is contingent on both communication
bandwidth and computational complexity
APPROACHES
• Linked Data Approach
• OMOP CDM Approach
• Custom API Approach
• Dedicated Environment Approach
• Hybrid Approach
Linked Data Approach
• Linked Data is based on Semantic
Web standards
• Data needs to be represented in the
Resource Description Framework
(RDF) format
• Each data item needs to be assigned a
URI (Uniform Resource Indicator)
• Relations and class hierarchy in the
datasets need to be represented
using Ontologies
• Data can be queried through SPARQL
endpoints
Linked Data Approach - Examples
https://dbpedia.org/page/Dementia
https://dbpedia.org/sparql
Linked Data Approach – Federated Querying
Example
PREFIX owl: <http://www.w3.org/2002/07/owl#>
PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#>
PREFIX movie: <http://data.linkedmdb.org/resource/movie/>
PREFIX dc: <http://purl.org/dc/terms/>
PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX dbo: <http://dbpedia.org/ontology/>
PREFIX foaf: <http://xmlns.com/foaf/0.1/>
PREFIX dbp: <http://dbpedia.org/property/>
SELECT distinct ?name ?author ?filmname ?imdbID WHERE {
SERVICE <http://dbpedia.org/sparql> {
?book rdf:type dbo:Book .
?book foaf:name ?name .
?book dbp:author ?author .
?author foaf:name ?authname .
?book ^dbo:basedOn ?movie .
?movie a dbo:Film .
?movie foaf:name ?filmname
FILTER (str(?name) IN ("Royal Flash","White Oleander", "Possession: A Romance", "Misery", "Intensity", "The War of The Roses", "Momo", "The
Sicilian", "Derailed", "Ragtime"))
}
SERVICE <http://data.linkedmdb.org/sparql> {
?filmname foaf:page ?imdbID .
?filmname dc:title ?title .
FILTER(regex(str(?imdbID), "www.imdb.com" ) )
}
}
OMOP CDM Approach
• The Observational Medical Outcomes Partnership (OMOP) Common
Data Model (CDM) approach facilitates federated analysis using a
common data format
• Common data format includes concepts (terminologies), vocabularies,
and coding schemes
• The parent organization OHDSI provides a suite of tools for analysis
OMOP CDM Approach – CDM
OMOP CDM Approach – Tools
https://atlas-demo.ohdsi.org/
• Support for custom, OHDSI methods library in R and ATLAS
• Free tool for analyzing standardized, patient-level, observational data
CDM data
• Makes use of OHDSI WebAPI
• Supports Machine Learning
Custom API Approach
• Application Programming Interfaces (API) gateways can be setup to
serve as a black box over the data environment
• HTTP-based REST APIs along with an authentication key can be used
to pass specific data points over the internet
• Data usually shared via JSON, XML formats
• Example Link
Dedicated Environment Approach
• In this approach, the data environments are connected to the other
related data environments through a dedicated network or data access is
provided through a dedicated environment
• Special authentication is provided for remote access purposes
• Facilitated usually via a VPN (virtual private network) or directly through
internet
• Users access the data and tools using a Virtual Desktop Interface (VDI)
• Installation of special software might be required depending on the
setup
Comparison of Standalone Approaches
Approach Advantages Challenges and Barriers
Linked Data
Usage of open standards; Facilitates
inferencing and querying massive
public datasets
Data conversion from existing formats
will be a big effort, Periodic
object/ontology modeling for new
concepts; Dedicated training
OMOP CDM
Usage of a universally accepted
data format; Potential for
international collaborations
Data conversion from existing formats
might be a big effort; Might not be
useful if collaborators do not use the
same approach
Custom API
Convenient; Easiest to deploy;
Suitable for minimal data sharing
Ad-hoc querying would not be
possible; Large-scale data sharing
would be difficult
Dedicated
Environment
Safest in terms of data security;
Facilitates detailed auditing
Higher setup cost; Higher onboarding
time
Hybrid Approaches
• Certain or all aspects of standalone approaches can be combined to
form hybrid approaches
• Example 1: OMOP CDM with Dedicated Network where the health
organizations, government bodies, and academic institutes are
connected using common data standard, and a country-wide
dedicated network
• Partly implemented by National University Health System in
Singapore
• Example 2: Linked Data with API where REST APIs encapsulate the
querying complexities of SPARQL and RDF
Other Factors for Consideration
• Data anonymization
• Data versioning
• Provenance
• Security threats
• Standards and tools selection
THANK YOU

More Related Content

Similar to Approaches to combining supplementary datasets across multiple trusted research environments using federated analysis

Data accessibilityandchallenges
Data accessibilityandchallengesData accessibilityandchallenges
Data accessibilityandchallengesjyotikhadake
 
Wed roman tut_open_datapub
Wed roman tut_open_datapubWed roman tut_open_datapub
Wed roman tut_open_datapubeswcsummerschool
 
A BASILar Approach for Building Web APIs on top of SPARQL Endpoints
A BASILar Approach for Building Web APIs on top of SPARQL EndpointsA BASILar Approach for Building Web APIs on top of SPARQL Endpoints
A BASILar Approach for Building Web APIs on top of SPARQL EndpointsEnrico Daga
 
An Approach to Combining Disparate Clinical Study Data across Multiple Sponso...
An Approach to Combining Disparate Clinical Study Data across Multiple Sponso...An Approach to Combining Disparate Clinical Study Data across Multiple Sponso...
An Approach to Combining Disparate Clinical Study Data across Multiple Sponso...imgcommcall
 
Ontologies for Emergency & Disaster Management
Ontologies for Emergency & Disaster Management Ontologies for Emergency & Disaster Management
Ontologies for Emergency & Disaster Management Stephane Fellah
 
ALIGNED Data Curation Methods and Tools
ALIGNED Data Curation Methods and ToolsALIGNED Data Curation Methods and Tools
ALIGNED Data Curation Methods and ToolsAlignedProject
 
Starfish-A self tuning system for bigdata analytics
Starfish-A self tuning system for bigdata analyticsStarfish-A self tuning system for bigdata analytics
Starfish-A self tuning system for bigdata analyticssai Pramoda
 
data analytics lecture3.ppt
data analytics lecture3.pptdata analytics lecture3.ppt
data analytics lecture3.pptNamrataBhatt8
 
FAIR BioData Management
FAIR BioData ManagementFAIR BioData Management
FAIR BioData ManagementUlrike Wittig
 
EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 7, 2016|...
EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 7, 2016|...EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 7, 2016|...
EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 7, 2016|...EUDAT
 
Tim Pugh-SPEDDEXES 2014
Tim Pugh-SPEDDEXES 2014Tim Pugh-SPEDDEXES 2014
Tim Pugh-SPEDDEXES 2014aceas13tern
 
Experiences In Building Globus Genomics Using Galaxy, Globus Online and AWS
Experiences In Building Globus Genomics Using Galaxy, Globus Online and AWSExperiences In Building Globus Genomics Using Galaxy, Globus Online and AWS
Experiences In Building Globus Genomics Using Galaxy, Globus Online and AWSEd Dodds
 
Hybrid Cloud Journey - Maximizing Private and Public Cloud
Hybrid Cloud Journey - Maximizing Private and Public CloudHybrid Cloud Journey - Maximizing Private and Public Cloud
Hybrid Cloud Journey - Maximizing Private and Public CloudRyan Lynn
 
Bonazzi commons bd2 k ahm 2016 v2
Bonazzi commons bd2 k ahm 2016 v2Bonazzi commons bd2 k ahm 2016 v2
Bonazzi commons bd2 k ahm 2016 v2Vivien Bonazzi
 
Core Geospatial Ontologies
Core Geospatial OntologiesCore Geospatial Ontologies
Core Geospatial OntologiesStephane Fellah
 
Importance of data standards and system validation of software for clinical r...
Importance of data standards and system validation of software for clinical r...Importance of data standards and system validation of software for clinical r...
Importance of data standards and system validation of software for clinical r...Wolfgang Kuchinke
 
Creating enterprise standards 09302010
Creating enterprise standards 09302010Creating enterprise standards 09302010
Creating enterprise standards 09302010ERwin Modeling
 
Cloud-based Linked Data Management for Self-service Application Development
Cloud-based Linked Data Management for Self-service Application DevelopmentCloud-based Linked Data Management for Self-service Application Development
Cloud-based Linked Data Management for Self-service Application DevelopmentPeter Haase
 

Similar to Approaches to combining supplementary datasets across multiple trusted research environments using federated analysis (20)

Data accessibilityandchallenges
Data accessibilityandchallengesData accessibilityandchallenges
Data accessibilityandchallenges
 
Wed roman tut_open_datapub
Wed roman tut_open_datapubWed roman tut_open_datapub
Wed roman tut_open_datapub
 
A BASILar Approach for Building Web APIs on top of SPARQL Endpoints
A BASILar Approach for Building Web APIs on top of SPARQL EndpointsA BASILar Approach for Building Web APIs on top of SPARQL Endpoints
A BASILar Approach for Building Web APIs on top of SPARQL Endpoints
 
An Approach to Combining Disparate Clinical Study Data across Multiple Sponso...
An Approach to Combining Disparate Clinical Study Data across Multiple Sponso...An Approach to Combining Disparate Clinical Study Data across Multiple Sponso...
An Approach to Combining Disparate Clinical Study Data across Multiple Sponso...
 
Ontologies for Emergency & Disaster Management
Ontologies for Emergency & Disaster Management Ontologies for Emergency & Disaster Management
Ontologies for Emergency & Disaster Management
 
ALIGNED Data Curation Methods and Tools
ALIGNED Data Curation Methods and ToolsALIGNED Data Curation Methods and Tools
ALIGNED Data Curation Methods and Tools
 
Starfish-A self tuning system for bigdata analytics
Starfish-A self tuning system for bigdata analyticsStarfish-A self tuning system for bigdata analytics
Starfish-A self tuning system for bigdata analytics
 
data analytics lecture3.ppt
data analytics lecture3.pptdata analytics lecture3.ppt
data analytics lecture3.ppt
 
FAIR BioData Management
FAIR BioData ManagementFAIR BioData Management
FAIR BioData Management
 
EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 7, 2016|...
EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 7, 2016|...EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 7, 2016|...
EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 7, 2016|...
 
Tim Pugh-SPEDDEXES 2014
Tim Pugh-SPEDDEXES 2014Tim Pugh-SPEDDEXES 2014
Tim Pugh-SPEDDEXES 2014
 
Experiences In Building Globus Genomics Using Galaxy, Globus Online and AWS
Experiences In Building Globus Genomics Using Galaxy, Globus Online and AWSExperiences In Building Globus Genomics Using Galaxy, Globus Online and AWS
Experiences In Building Globus Genomics Using Galaxy, Globus Online and AWS
 
Hybrid Cloud Journey - Maximizing Private and Public Cloud
Hybrid Cloud Journey - Maximizing Private and Public CloudHybrid Cloud Journey - Maximizing Private and Public Cloud
Hybrid Cloud Journey - Maximizing Private and Public Cloud
 
Bonazzi commons bd2 k ahm 2016 v2
Bonazzi commons bd2 k ahm 2016 v2Bonazzi commons bd2 k ahm 2016 v2
Bonazzi commons bd2 k ahm 2016 v2
 
Core Geospatial Ontologies
Core Geospatial OntologiesCore Geospatial Ontologies
Core Geospatial Ontologies
 
Data Domain-Driven Design
Data Domain-Driven DesignData Domain-Driven Design
Data Domain-Driven Design
 
Importance of data standards and system validation of software for clinical r...
Importance of data standards and system validation of software for clinical r...Importance of data standards and system validation of software for clinical r...
Importance of data standards and system validation of software for clinical r...
 
MEDIN data guidelines
MEDIN data guidelinesMEDIN data guidelines
MEDIN data guidelines
 
Creating enterprise standards 09302010
Creating enterprise standards 09302010Creating enterprise standards 09302010
Creating enterprise standards 09302010
 
Cloud-based Linked Data Management for Self-service Application Development
Cloud-based Linked Data Management for Self-service Application DevelopmentCloud-based Linked Data Management for Self-service Application Development
Cloud-based Linked Data Management for Self-service Application Development
 

More from Aravind Sesagiri Raamkumar

Measuring the Outreach Efforts of Public Health Authorities and the Public Re...
Measuring the Outreach Efforts of Public Health Authorities and the Public Re...Measuring the Outreach Efforts of Public Health Authorities and the Public Re...
Measuring the Outreach Efforts of Public Health Authorities and the Public Re...Aravind Sesagiri Raamkumar
 
Understanding the Twitter Usage of Science Citation Index (SCI) Journals
Understanding the Twitter Usage of Science Citation Index (SCI) JournalsUnderstanding the Twitter Usage of Science Citation Index (SCI) Journals
Understanding the Twitter Usage of Science Citation Index (SCI) JournalsAravind Sesagiri Raamkumar
 
Investigating the Characteristics and Research Impact of Sentiments in Tweets...
Investigating the Characteristics and Research Impact of Sentiments in Tweets...Investigating the Characteristics and Research Impact of Sentiments in Tweets...
Investigating the Characteristics and Research Impact of Sentiments in Tweets...Aravind Sesagiri Raamkumar
 
Understanding the Twitter Usage of Humanities and Social Sciences Academic Jo...
Understanding the Twitter Usage of Humanities and Social Sciences Academic Jo...Understanding the Twitter Usage of Humanities and Social Sciences Academic Jo...
Understanding the Twitter Usage of Humanities and Social Sciences Academic Jo...Aravind Sesagiri Raamkumar
 
Multi-method Evaluation in Scientific Paper Recommender Systems
Multi-method Evaluation in Scientific Paper Recommender SystemsMulti-method Evaluation in Scientific Paper Recommender Systems
Multi-method Evaluation in Scientific Paper Recommender SystemsAravind Sesagiri Raamkumar
 
A task-based scientific paper recommender system for literature review and ma...
A task-based scientific paper recommender system for literature review and ma...A task-based scientific paper recommender system for literature review and ma...
A task-based scientific paper recommender system for literature review and ma...Aravind Sesagiri Raamkumar
 
Using altmetrics to support research evaluation
Using altmetrics to support research evaluationUsing altmetrics to support research evaluation
Using altmetrics to support research evaluationAravind Sesagiri Raamkumar
 
Evolution and state-of-the art of Altmetric research: Insights from network a...
Evolution and state-of-the art of Altmetric research: Insights from network a...Evolution and state-of-the art of Altmetric research: Insights from network a...
Evolution and state-of-the art of Altmetric research: Insights from network a...Aravind Sesagiri Raamkumar
 
Scientometric Analysis of Research Performance of African Countries in select...
Scientometric Analysis of Research Performance of African Countries in select...Scientometric Analysis of Research Performance of African Countries in select...
Scientometric Analysis of Research Performance of African Countries in select...Aravind Sesagiri Raamkumar
 
New Dialog, New Services with Altmetrics: Lingnan University Library Experience
New Dialog, New Services with Altmetrics: Lingnan University Library ExperienceNew Dialog, New Services with Altmetrics: Lingnan University Library Experience
New Dialog, New Services with Altmetrics: Lingnan University Library ExperienceAravind Sesagiri Raamkumar
 
Field-weighting readership: how does it compare to field-weighting citations?
Field-weighting readership: how does it compare to field-weighting citations?Field-weighting readership: how does it compare to field-weighting citations?
Field-weighting readership: how does it compare to field-weighting citations?Aravind Sesagiri Raamkumar
 
How do Scholars Evaluate and Promote Research Outputs? An NTU Case Study
How do Scholars Evaluate and Promote Research Outputs? An NTU Case StudyHow do Scholars Evaluate and Promote Research Outputs? An NTU Case Study
How do Scholars Evaluate and Promote Research Outputs? An NTU Case StudyAravind Sesagiri Raamkumar
 
Monitoring the broad impact of the journal publication output on country leve...
Monitoring the broad impact of the journal publication output on country leve...Monitoring the broad impact of the journal publication output on country leve...
Monitoring the broad impact of the journal publication output on country leve...Aravind Sesagiri Raamkumar
 
A Comparative Investigation on Citation Counts and Altmetrics between Papers ...
A Comparative Investigation on Citation Counts and Altmetrics between Papers ...A Comparative Investigation on Citation Counts and Altmetrics between Papers ...
A Comparative Investigation on Citation Counts and Altmetrics between Papers ...Aravind Sesagiri Raamkumar
 
Database-Centric Guidelines for Building a Scholarly Metrics Information Syst...
Database-Centric Guidelines for Building a Scholarly Metrics Information Syst...Database-Centric Guidelines for Building a Scholarly Metrics Information Syst...
Database-Centric Guidelines for Building a Scholarly Metrics Information Syst...Aravind Sesagiri Raamkumar
 
Altmetrics for Research Impact Actuation (ARIA)
Altmetrics for Research Impact Actuation (ARIA)Altmetrics for Research Impact Actuation (ARIA)
Altmetrics for Research Impact Actuation (ARIA)Aravind Sesagiri Raamkumar
 
Proposing a Scientific Paper Retrieval and Recommender Framework
Proposing a Scientific Paper Retrieval and Recommender FrameworkProposing a Scientific Paper Retrieval and Recommender Framework
Proposing a Scientific Paper Retrieval and Recommender FrameworkAravind Sesagiri Raamkumar
 
What papers should I cite from my reading list? User evaluation of a manuscri...
What papers should I cite from my reading list? User evaluation of a manuscri...What papers should I cite from my reading list? User evaluation of a manuscri...
What papers should I cite from my reading list? User evaluation of a manuscri...Aravind Sesagiri Raamkumar
 
What’s in a Country Name – Twitter Hashtag Analysis of #singapore
What’s in a Country Name – Twitter Hashtag Analysis of #singaporeWhat’s in a Country Name – Twitter Hashtag Analysis of #singapore
What’s in a Country Name – Twitter Hashtag Analysis of #singaporeAravind Sesagiri Raamkumar
 

More from Aravind Sesagiri Raamkumar (20)

Measuring the Outreach Efforts of Public Health Authorities and the Public Re...
Measuring the Outreach Efforts of Public Health Authorities and the Public Re...Measuring the Outreach Efforts of Public Health Authorities and the Public Re...
Measuring the Outreach Efforts of Public Health Authorities and the Public Re...
 
Understanding the Twitter Usage of Science Citation Index (SCI) Journals
Understanding the Twitter Usage of Science Citation Index (SCI) JournalsUnderstanding the Twitter Usage of Science Citation Index (SCI) Journals
Understanding the Twitter Usage of Science Citation Index (SCI) Journals
 
Investigating the Characteristics and Research Impact of Sentiments in Tweets...
Investigating the Characteristics and Research Impact of Sentiments in Tweets...Investigating the Characteristics and Research Impact of Sentiments in Tweets...
Investigating the Characteristics and Research Impact of Sentiments in Tweets...
 
Understanding the Twitter Usage of Humanities and Social Sciences Academic Jo...
Understanding the Twitter Usage of Humanities and Social Sciences Academic Jo...Understanding the Twitter Usage of Humanities and Social Sciences Academic Jo...
Understanding the Twitter Usage of Humanities and Social Sciences Academic Jo...
 
Multi-method Evaluation in Scientific Paper Recommender Systems
Multi-method Evaluation in Scientific Paper Recommender SystemsMulti-method Evaluation in Scientific Paper Recommender Systems
Multi-method Evaluation in Scientific Paper Recommender Systems
 
A task-based scientific paper recommender system for literature review and ma...
A task-based scientific paper recommender system for literature review and ma...A task-based scientific paper recommender system for literature review and ma...
A task-based scientific paper recommender system for literature review and ma...
 
Using altmetrics to support research evaluation
Using altmetrics to support research evaluationUsing altmetrics to support research evaluation
Using altmetrics to support research evaluation
 
Evolution and state-of-the art of Altmetric research: Insights from network a...
Evolution and state-of-the art of Altmetric research: Insights from network a...Evolution and state-of-the art of Altmetric research: Insights from network a...
Evolution and state-of-the art of Altmetric research: Insights from network a...
 
Feature Analysis of Research Metrics Systems
Feature Analysis of Research Metrics SystemsFeature Analysis of Research Metrics Systems
Feature Analysis of Research Metrics Systems
 
Scientometric Analysis of Research Performance of African Countries in select...
Scientometric Analysis of Research Performance of African Countries in select...Scientometric Analysis of Research Performance of African Countries in select...
Scientometric Analysis of Research Performance of African Countries in select...
 
New Dialog, New Services with Altmetrics: Lingnan University Library Experience
New Dialog, New Services with Altmetrics: Lingnan University Library ExperienceNew Dialog, New Services with Altmetrics: Lingnan University Library Experience
New Dialog, New Services with Altmetrics: Lingnan University Library Experience
 
Field-weighting readership: how does it compare to field-weighting citations?
Field-weighting readership: how does it compare to field-weighting citations?Field-weighting readership: how does it compare to field-weighting citations?
Field-weighting readership: how does it compare to field-weighting citations?
 
How do Scholars Evaluate and Promote Research Outputs? An NTU Case Study
How do Scholars Evaluate and Promote Research Outputs? An NTU Case StudyHow do Scholars Evaluate and Promote Research Outputs? An NTU Case Study
How do Scholars Evaluate and Promote Research Outputs? An NTU Case Study
 
Monitoring the broad impact of the journal publication output on country leve...
Monitoring the broad impact of the journal publication output on country leve...Monitoring the broad impact of the journal publication output on country leve...
Monitoring the broad impact of the journal publication output on country leve...
 
A Comparative Investigation on Citation Counts and Altmetrics between Papers ...
A Comparative Investigation on Citation Counts and Altmetrics between Papers ...A Comparative Investigation on Citation Counts and Altmetrics between Papers ...
A Comparative Investigation on Citation Counts and Altmetrics between Papers ...
 
Database-Centric Guidelines for Building a Scholarly Metrics Information Syst...
Database-Centric Guidelines for Building a Scholarly Metrics Information Syst...Database-Centric Guidelines for Building a Scholarly Metrics Information Syst...
Database-Centric Guidelines for Building a Scholarly Metrics Information Syst...
 
Altmetrics for Research Impact Actuation (ARIA)
Altmetrics for Research Impact Actuation (ARIA)Altmetrics for Research Impact Actuation (ARIA)
Altmetrics for Research Impact Actuation (ARIA)
 
Proposing a Scientific Paper Retrieval and Recommender Framework
Proposing a Scientific Paper Retrieval and Recommender FrameworkProposing a Scientific Paper Retrieval and Recommender Framework
Proposing a Scientific Paper Retrieval and Recommender Framework
 
What papers should I cite from my reading list? User evaluation of a manuscri...
What papers should I cite from my reading list? User evaluation of a manuscri...What papers should I cite from my reading list? User evaluation of a manuscri...
What papers should I cite from my reading list? User evaluation of a manuscri...
 
What’s in a Country Name – Twitter Hashtag Analysis of #singapore
What’s in a Country Name – Twitter Hashtag Analysis of #singaporeWhat’s in a Country Name – Twitter Hashtag Analysis of #singapore
What’s in a Country Name – Twitter Hashtag Analysis of #singapore
 

Recently uploaded

Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsappssapnasaifi408
 
How we prevented account sharing with MFA
How we prevented account sharing with MFAHow we prevented account sharing with MFA
How we prevented account sharing with MFAAndrei Kaleshka
 
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfKantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfSocial Samosa
 
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝DelhiRS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhijennyeacort
 
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Sapana Sha
 
9654467111 Call Girls In Munirka Hotel And Home Service
9654467111 Call Girls In Munirka Hotel And Home Service9654467111 Call Girls In Munirka Hotel And Home Service
9654467111 Call Girls In Munirka Hotel And Home ServiceSapana Sha
 
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一F sss
 
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDINTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDRafezzaman
 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfgstagge
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPramod Kumar Srivastava
 
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...Boston Institute of Analytics
 
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Jack DiGiovanna
 
Call Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceCall Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceSapana Sha
 
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一fhwihughh
 
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样vhwb25kk
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptSonatrach
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]📊 Markus Baersch
 
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理e4aez8ss
 

Recently uploaded (20)

Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
 
How we prevented account sharing with MFA
How we prevented account sharing with MFAHow we prevented account sharing with MFA
How we prevented account sharing with MFA
 
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfKantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
 
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝DelhiRS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
 
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
 
9654467111 Call Girls In Munirka Hotel And Home Service
9654467111 Call Girls In Munirka Hotel And Home Service9654467111 Call Girls In Munirka Hotel And Home Service
9654467111 Call Girls In Munirka Hotel And Home Service
 
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
 
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDINTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdf
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
 
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
 
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
 
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
 
Call Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceCall Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts Service
 
E-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptxE-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptx
 
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
 
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]
 
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
 

Approaches to combining supplementary datasets across multiple trusted research environments using federated analysis

  • 1. APPROACHES TO COMBINING SUPPLEMENTARY DATASETS ACROSS MULTIPLE TRUSTED RESEARCH ENVIRONMENTS USING FEDERATED ANALYSIS DR. ARAVIND SESAGIRI RAAMKUMAR
  • 2. FEDERATED ANALYSIS (FA) • Analysis of datasets or data sources that are present in different geographic locations or networks • Raw data generally not shared during analysis • Statistical parameters or aggregated data are exchanged during analysis • The efficiency of FA projects is contingent on both communication bandwidth and computational complexity
  • 3. APPROACHES • Linked Data Approach • OMOP CDM Approach • Custom API Approach • Dedicated Environment Approach • Hybrid Approach
  • 4. Linked Data Approach • Linked Data is based on Semantic Web standards • Data needs to be represented in the Resource Description Framework (RDF) format • Each data item needs to be assigned a URI (Uniform Resource Indicator) • Relations and class hierarchy in the datasets need to be represented using Ontologies • Data can be queried through SPARQL endpoints
  • 5. Linked Data Approach - Examples https://dbpedia.org/page/Dementia https://dbpedia.org/sparql
  • 6. Linked Data Approach – Federated Querying Example PREFIX owl: <http://www.w3.org/2002/07/owl#> PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#> PREFIX movie: <http://data.linkedmdb.org/resource/movie/> PREFIX dc: <http://purl.org/dc/terms/> PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> PREFIX dbo: <http://dbpedia.org/ontology/> PREFIX foaf: <http://xmlns.com/foaf/0.1/> PREFIX dbp: <http://dbpedia.org/property/> SELECT distinct ?name ?author ?filmname ?imdbID WHERE { SERVICE <http://dbpedia.org/sparql> { ?book rdf:type dbo:Book . ?book foaf:name ?name . ?book dbp:author ?author . ?author foaf:name ?authname . ?book ^dbo:basedOn ?movie . ?movie a dbo:Film . ?movie foaf:name ?filmname FILTER (str(?name) IN ("Royal Flash","White Oleander", "Possession: A Romance", "Misery", "Intensity", "The War of The Roses", "Momo", "The Sicilian", "Derailed", "Ragtime")) } SERVICE <http://data.linkedmdb.org/sparql> { ?filmname foaf:page ?imdbID . ?filmname dc:title ?title . FILTER(regex(str(?imdbID), "www.imdb.com" ) ) } }
  • 7. OMOP CDM Approach • The Observational Medical Outcomes Partnership (OMOP) Common Data Model (CDM) approach facilitates federated analysis using a common data format • Common data format includes concepts (terminologies), vocabularies, and coding schemes • The parent organization OHDSI provides a suite of tools for analysis
  • 9. OMOP CDM Approach – Tools https://atlas-demo.ohdsi.org/ • Support for custom, OHDSI methods library in R and ATLAS • Free tool for analyzing standardized, patient-level, observational data CDM data • Makes use of OHDSI WebAPI • Supports Machine Learning
  • 10. Custom API Approach • Application Programming Interfaces (API) gateways can be setup to serve as a black box over the data environment • HTTP-based REST APIs along with an authentication key can be used to pass specific data points over the internet • Data usually shared via JSON, XML formats • Example Link
  • 11. Dedicated Environment Approach • In this approach, the data environments are connected to the other related data environments through a dedicated network or data access is provided through a dedicated environment • Special authentication is provided for remote access purposes • Facilitated usually via a VPN (virtual private network) or directly through internet • Users access the data and tools using a Virtual Desktop Interface (VDI) • Installation of special software might be required depending on the setup
  • 12. Comparison of Standalone Approaches Approach Advantages Challenges and Barriers Linked Data Usage of open standards; Facilitates inferencing and querying massive public datasets Data conversion from existing formats will be a big effort, Periodic object/ontology modeling for new concepts; Dedicated training OMOP CDM Usage of a universally accepted data format; Potential for international collaborations Data conversion from existing formats might be a big effort; Might not be useful if collaborators do not use the same approach Custom API Convenient; Easiest to deploy; Suitable for minimal data sharing Ad-hoc querying would not be possible; Large-scale data sharing would be difficult Dedicated Environment Safest in terms of data security; Facilitates detailed auditing Higher setup cost; Higher onboarding time
  • 13. Hybrid Approaches • Certain or all aspects of standalone approaches can be combined to form hybrid approaches • Example 1: OMOP CDM with Dedicated Network where the health organizations, government bodies, and academic institutes are connected using common data standard, and a country-wide dedicated network • Partly implemented by National University Health System in Singapore • Example 2: Linked Data with API where REST APIs encapsulate the querying complexities of SPARQL and RDF
  • 14. Other Factors for Consideration • Data anonymization • Data versioning • Provenance • Security threats • Standards and tools selection

Editor's Notes

  1. ICD – International Classification of Diseases SNOMED - Systematized Nomenclature of Medicine