Irida immemxi hsiao

IRIDA: Canada’s federated
platform for genomic
epidemiology
William Hsiao, Ph.D.
William.hsiao@bccdc.ca
@wlhsiao
BC Centre for Disease Control Public Health Laboratory
and University of British Columbia
IRIDA Platform Overview
• IRIDA= Integrated Rapid Infectious Disease Analysis
• A free, open source, standards compliant, high quality genomic
epidemiology analysis platform to support real-time disease
outbreak investigations
Core Functions:
• Management of strain and genomic sequence data
• Rapid processing and analysis of genomic data
• Informative display of genomic results
• Sample, Case, and aggregate data (“metadata”) Management
Target audience:
• Public health agencies who need a platform to manage and
process genomic data
• Public health agencies who need a platform to use genomics for
outbreak investigations
IRIDA
Sequencing
Instruments
Web
Application
Data
management
Built-in
Analytical
Tools
External
Galaxy
Command-
line Tools
10 simple rules (wish list) to build a
better public health microbiology
genomic epidemiology analysis
system
Download
Latest version at https://github.com/phac-nml/irida
1: Engage the Users Through the Entire Software
Development Cycle
National
Public Health Agency
Provincial
Public Health Agency
Academic/Public
- Project Team has direct access to state of the
art research in academia
- Project Team is directly embedded in user
organization
2: Have A Simple User Interface
Line List View (under testing)
Timeline View (Conceptualization)
Selectable fields
Travel
Symptoms and Onset
Exposure Types
Hospitalization
Launch a pipeline
Be Like
3: Build a Robust, Extensible Platform
• IRIDA uses Galaxy to
manage workflows
• Adding additional
pipelines is relatively
easy
• Using a standard
API to allow 3rd party
tools to obtain data
from IRIDA (e.g.
IslandViewer and
GenGIS)
IRIDA
ServletContainer
REST API
Central File
Storage
Web
Interface
ApplicationLogic
Compute
Cluster
Galaxy
$ ~ >_ Galaxy
http://www.pathogenomics.sfu.ca/islandviewer/
http://kiwi.cs.dal.ca/GenGIS/Main_Page
4: Have Extensive Documentation
• Documentation should be available for
• Users – step by step tutorial with screen shots / FAQ
• System Administrators – installation instructions / issue trackers
• Developers – open source, collaborative development / IRC Channel
• Easily Accessible at https://irida.corefacility.ca/documentation/
5: Implement QC Throughout the Whole Application
• Genomics is sensitive and sequence data are inherently noisy
• Genomics is a rapidly advancing technology
• Standardizing pipelines difficult and can stifle innovation
• Better to standardize the performance and reporting metrics and ensure any
validated pipelines meet the testing criteria
• Developing a general QC testing module (RCQC) that use ontology to
standardize QC metrics (https://github.com/Public-Health-Bioinformatics/rcqc)
• Data Provenance and Version Control (data + Pipelines) are must’s for
Diagnostic Labs
6: Build to Enable Collaboration
• Be able to compare pipelines
• Pipeline implemented using Galaxy – transparent
and shareable
• Define QC criteria using ontology to compare the
different pipelines of the same purpose
• Be able to share data in standard formats to
minimize data re-entry from one platform to
another
• Federation of platforms using standard API to
share data and analysis results
7: Use Compatible Data Standards
• Sequence data are more compatible / shareable but
metadata are currently in silo and incompatible
• Collaboration and Sharing are difficult when data are
incompatible
• Compatibility != Sameness
• Use Ontology to allow customization of term list but
all terms with same meaning (semantics) should have
the same universal ID (e.g. an URL) to facilitate
mapping of terms
8: Implement Fine Grained Access Control
Detailed View Restricted View
E.g. User role permissions control visibility and editing of content
Authorization
• Industry-standard
authentication and
authorization mechanisms
• Local authorization per
instance.
• Method-level authorization.
• Object-level authorization.
9: Use Technology to Safeguard Patient Privacy
It’s easy to lose control of the Excel Line List -
someone can make a copy of the content and pass
it around without your knowledge; typos are
common and cumulative!
Technology can control who sees what and when
Separate out sensitive patient data from pathogen
sequence data but be able to bring them together
when necessary without resorting to emailing of
line lists!
10: Have Multiple, Flexible Access Options
• No one size fits all solution; Having many platforms to choose from is
a good thing (but data should be portable across platforms!)
• IRIDA is available in several different flavours:
Local Install Virtual Machine Cloud Instance Public Version
Advantages Full control of the
system; your data
never leave your
centre
Full control of the
system; Easy to setup
Full control of the
system; does not
require local
computing
infrastructure
No setup required,
upload your data and
have it processed
using Compute
Canada Resource
Disadvantages Computing
infrastructure and IT
support needed to
main the resource
Not really scalable if
run on your own
desktop; some
performance loss
Data go into a cloud
environment;
uploading to cloud
environment can be
slow
Data go into a public
instance (data
remain private to
your account);
upload can be slow
Acknowledgements
Project Leaders
Fiona Brinkman – SFU
Will Hsiao – PHMRL
Gary Van Domselaar – NML
University of Lisbon
Joᾶo Carriҫo
National Microbiology Laboratory (NML)
Franklin Bristow
Aaron Petkau
Thomas Matthews
Josh Adam
Adam Olson
Tarah Lynch
Shaun Tyler
Philip Mabon
Philip Au
Celine Nadon
Matthew Stuart-Edwards
Morag Graham
Chrystal Berry
Lorelee Tschetter
Aleisha Reimer
Laboratory for Foodborne Zoonoses (LFZ)
Eduardo Taboada
Peter Kruczkiewicz
Chad Laing
Vic Gannon
Matthew Whiteside
Ross Duncan
Steven Mutschall
Simon Fraser University (SFU)
Melanie Courtot
Emma Griffiths
Geoff Winsor
Julie Shay
Matthew Laird
Bhav Dhillon
Raymond Lo
BC Public Health Microbiology &
Reference Laboratory (PHMRL) and BC
Centre for Disease Control (BCCDC)
Judy Isaac-Renton
Patrick Tang
Natalie Prystajecky
Jennifer Gardy
Damion Dooley
Linda Hoang
Kim MacDonald
Yin Chang
Eleni Galanis
Marsha Taylor
Cletus D’Souza
Ana Paccagnella
University of Maryland
Lynn Schriml
Canadian Food Inspection Agency (CFIA)
Burton Blais
Catherine Carrillo
Dominic Lambert
Dalhousie University
Rob Beiko
Alex Keddy
14
McMaster University
Andrew McArthur
Daim Sardar
European Nucleotide Archive
Guy Cochrane
Petra ten Hoopen
Clara Amid
European Food Safety Agency
Leibana Criado Ernesto
Vernazza Francesco
Rizzi Valentina
15
15
IRIDA Annual General Meeting
Winnipeg, April 8-9, 2015
1 of 15

Recommended

Irida bccdc dec10_2015 by
Irida bccdc dec10_2015Irida bccdc dec10_2015
Irida bccdc dec10_2015IRIDA_community
191 views62 slides
Data mashups skyhook by
Data mashups   skyhookData mashups   skyhook
Data mashups skyhookMassTLC
359 views3 slides
Making obamacare work with Big Data by
Making obamacare work with Big DataMaking obamacare work with Big Data
Making obamacare work with Big Datalaurenstill
727 views41 slides
Enabling faster analysis of vaccine adverse event reports with ontology support by
Enabling faster analysis of vaccine adverse event reports with ontology supportEnabling faster analysis of vaccine adverse event reports with ontology support
Enabling faster analysis of vaccine adverse event reports with ontology supportMelanie Courtot
819 views33 slides
IFD&TC 2018: A Novel Approach for Conveniently and Securely Collecting Person... by
IFD&TC 2018: A Novel Approach for Conveniently and Securely Collecting Person...IFD&TC 2018: A Novel Approach for Conveniently and Securely Collecting Person...
IFD&TC 2018: A Novel Approach for Conveniently and Securely Collecting Person...Lew Berman
112 views18 slides
AIRA Update by
AIRA UpdateAIRA Update
AIRA UpdateBaylorWilliams2
748 views35 slides

More Related Content

What's hot

Relevance feedback algorithm inspired by Quantum detection by
Relevance feedback algorithm inspired by Quantum detectionRelevance feedback algorithm inspired by Quantum detection
Relevance feedback algorithm inspired by Quantum detectionR prasad
935 views28 slides
AFIX IIS Inegration by
AFIX IIS InegrationAFIX IIS Inegration
AFIX IIS InegrationBaylorWilliams2
970 views26 slides
ACK Response Messages by
ACK Response MessagesACK Response Messages
ACK Response MessagesBaylorWilliams2
2.7K views24 slides
Two Clinical Workflows - From Unfiltered Variants to a Clinical Report by
Two Clinical Workflows - From Unfiltered Variants to a Clinical ReportTwo Clinical Workflows - From Unfiltered Variants to a Clinical Report
Two Clinical Workflows - From Unfiltered Variants to a Clinical ReportGolden Helix Inc
99 views17 slides
Curoverse Presentation at ICG-11 (November 2016) by
Curoverse Presentation at ICG-11 (November 2016)Curoverse Presentation at ICG-11 (November 2016)
Curoverse Presentation at ICG-11 (November 2016)Arvados
1.1K views11 slides
Discover Introduction to REDCap by
Discover Introduction to REDCapDiscover Introduction to REDCap
Discover Introduction to REDCapSTARSurg
1.8K views11 slides

What's hot(14)

Relevance feedback algorithm inspired by Quantum detection by R prasad
Relevance feedback algorithm inspired by Quantum detectionRelevance feedback algorithm inspired by Quantum detection
Relevance feedback algorithm inspired by Quantum detection
R prasad935 views
Two Clinical Workflows - From Unfiltered Variants to a Clinical Report by Golden Helix Inc
Two Clinical Workflows - From Unfiltered Variants to a Clinical ReportTwo Clinical Workflows - From Unfiltered Variants to a Clinical Report
Two Clinical Workflows - From Unfiltered Variants to a Clinical Report
Golden Helix Inc99 views
Curoverse Presentation at ICG-11 (November 2016) by Arvados
Curoverse Presentation at ICG-11 (November 2016)Curoverse Presentation at ICG-11 (November 2016)
Curoverse Presentation at ICG-11 (November 2016)
Arvados1.1K views
Discover Introduction to REDCap by STARSurg
Discover Introduction to REDCapDiscover Introduction to REDCap
Discover Introduction to REDCap
STARSurg1.8K views
Covance Laboratory FSPx Solutions by Covance
Covance Laboratory FSPx SolutionsCovance Laboratory FSPx Solutions
Covance Laboratory FSPx Solutions
Covance53 views
eHealth - Mark Yendt by GHBN
eHealth - Mark YendteHealth - Mark Yendt
eHealth - Mark Yendt
GHBN596 views
Developing Apps: Exposing Your Data Through Araport by Matthew Vaughn
Developing Apps: Exposing Your Data Through AraportDeveloping Apps: Exposing Your Data Through Araport
Developing Apps: Exposing Your Data Through Araport
Matthew Vaughn424 views
Website ranking system by R prasad
Website ranking systemWebsite ranking system
Website ranking system
R prasad1K views
Prov4J: A Semantic Web Framework for Generic Provenance Management by Andre Freitas
Prov4J: A Semantic Web Framework for Generic Provenance Management Prov4J: A Semantic Web Framework for Generic Provenance Management
Prov4J: A Semantic Web Framework for Generic Provenance Management
Andre Freitas1.1K views

Similar to Irida immemxi hsiao

Flattening the Curve with Kafka (Rishi Tarar, Northrop Grumman Corp.) Kafka S... by
Flattening the Curve with Kafka (Rishi Tarar, Northrop Grumman Corp.) Kafka S...Flattening the Curve with Kafka (Rishi Tarar, Northrop Grumman Corp.) Kafka S...
Flattening the Curve with Kafka (Rishi Tarar, Northrop Grumman Corp.) Kafka S...confluent
2.1K views18 slides
IMMEM XI: Ten Simple Rules to Build a Better Public Health Genomic Epidemiolo... by
IMMEM XI: Ten Simple Rules to Build a Better Public Health Genomic Epidemiolo...IMMEM XI: Ten Simple Rules to Build a Better Public Health Genomic Epidemiolo...
IMMEM XI: Ten Simple Rules to Build a Better Public Health Genomic Epidemiolo...William Hsiao
392 views15 slides
IRIDA: A Federated Bioinformatics Platform Enabling Richer Genomic Epidemiolo... by
IRIDA: A Federated Bioinformatics Platform Enabling Richer Genomic Epidemiolo...IRIDA: A Federated Bioinformatics Platform Enabling Richer Genomic Epidemiolo...
IRIDA: A Federated Bioinformatics Platform Enabling Richer Genomic Epidemiolo...William Hsiao
926 views56 slides
Data Harmonization for a Molecularly Driven Health System by
Data Harmonization for a Molecularly Driven Health SystemData Harmonization for a Molecularly Driven Health System
Data Harmonization for a Molecularly Driven Health SystemWarren Kibbe
501 views69 slides
NIH NCI Childhood Cancer Data Initiative (CCDI) Symposium Globus Poster by
NIH NCI Childhood Cancer Data Initiative (CCDI) Symposium Globus PosterNIH NCI Childhood Cancer Data Initiative (CCDI) Symposium Globus Poster
NIH NCI Childhood Cancer Data Initiative (CCDI) Symposium Globus PosterGlobus
121 views1 slide
Scalable and Repeatable Machine Learning pipelines: A key requirement for you... by
Scalable and Repeatable Machine Learning pipelines: A key requirement for you...Scalable and Repeatable Machine Learning pipelines: A key requirement for you...
Scalable and Repeatable Machine Learning pipelines: A key requirement for you...All Things Open
77 views20 slides

Similar to Irida immemxi hsiao(20)

Flattening the Curve with Kafka (Rishi Tarar, Northrop Grumman Corp.) Kafka S... by confluent
Flattening the Curve with Kafka (Rishi Tarar, Northrop Grumman Corp.) Kafka S...Flattening the Curve with Kafka (Rishi Tarar, Northrop Grumman Corp.) Kafka S...
Flattening the Curve with Kafka (Rishi Tarar, Northrop Grumman Corp.) Kafka S...
confluent2.1K views
IMMEM XI: Ten Simple Rules to Build a Better Public Health Genomic Epidemiolo... by William Hsiao
IMMEM XI: Ten Simple Rules to Build a Better Public Health Genomic Epidemiolo...IMMEM XI: Ten Simple Rules to Build a Better Public Health Genomic Epidemiolo...
IMMEM XI: Ten Simple Rules to Build a Better Public Health Genomic Epidemiolo...
William Hsiao392 views
IRIDA: A Federated Bioinformatics Platform Enabling Richer Genomic Epidemiolo... by William Hsiao
IRIDA: A Federated Bioinformatics Platform Enabling Richer Genomic Epidemiolo...IRIDA: A Federated Bioinformatics Platform Enabling Richer Genomic Epidemiolo...
IRIDA: A Federated Bioinformatics Platform Enabling Richer Genomic Epidemiolo...
William Hsiao926 views
Data Harmonization for a Molecularly Driven Health System by Warren Kibbe
Data Harmonization for a Molecularly Driven Health SystemData Harmonization for a Molecularly Driven Health System
Data Harmonization for a Molecularly Driven Health System
Warren Kibbe501 views
NIH NCI Childhood Cancer Data Initiative (CCDI) Symposium Globus Poster by Globus
NIH NCI Childhood Cancer Data Initiative (CCDI) Symposium Globus PosterNIH NCI Childhood Cancer Data Initiative (CCDI) Symposium Globus Poster
NIH NCI Childhood Cancer Data Initiative (CCDI) Symposium Globus Poster
Globus 121 views
Scalable and Repeatable Machine Learning pipelines: A key requirement for you... by All Things Open
Scalable and Repeatable Machine Learning pipelines: A key requirement for you...Scalable and Repeatable Machine Learning pipelines: A key requirement for you...
Scalable and Repeatable Machine Learning pipelines: A key requirement for you...
All Things Open77 views
Evaluation of the importance of standards for data and metadata exchange for ... by Wolfgang Kuchinke
Evaluation of the importance of standards for data and metadata exchange for ...Evaluation of the importance of standards for data and metadata exchange for ...
Evaluation of the importance of standards for data and metadata exchange for ...
IRIDA: Canada’s federated platform for genomic epidemiology, ABPHM 2015 WHsiao by IRIDA_community
IRIDA: Canada’s federated platform for genomic epidemiology, ABPHM 2015 WHsiaoIRIDA: Canada’s federated platform for genomic epidemiology, ABPHM 2015 WHsiao
IRIDA: Canada’s federated platform for genomic epidemiology, ABPHM 2015 WHsiao
IRIDA_community436 views
Data Harmonization for a Molecularly Driven Health System by Warren Kibbe
Data Harmonization for a Molecularly Driven Health SystemData Harmonization for a Molecularly Driven Health System
Data Harmonization for a Molecularly Driven Health System
Warren Kibbe110 views
Building an Intelligent Biobank to Power Research Decision-Making by Denodo
Building an Intelligent Biobank to Power Research Decision-MakingBuilding an Intelligent Biobank to Power Research Decision-Making
Building an Intelligent Biobank to Power Research Decision-Making
Denodo 2K views
IRIDA: Canada’s federated platform for genomic epidemiology by William Hsiao
IRIDA: Canada’s federated platform for genomic epidemiology IRIDA: Canada’s federated platform for genomic epidemiology
IRIDA: Canada’s federated platform for genomic epidemiology
William Hsiao379 views
Importance of data standards and system validation of software for clinical r... by Wolfgang Kuchinke
Importance of data standards and system validation of software for clinical r...Importance of data standards and system validation of software for clinical r...
Importance of data standards and system validation of software for clinical r...
Bonazzi commons bd2 k ahm 2016 v2 by Vivien Bonazzi
Bonazzi commons bd2 k ahm 2016 v2Bonazzi commons bd2 k ahm 2016 v2
Bonazzi commons bd2 k ahm 2016 v2
Vivien Bonazzi738 views
iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ... by Bonnie Hurwitz
iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...
iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...
Bonnie Hurwitz1.8K views
The challenges of Analytical Data Management in R&D by Laura Berry
The challenges of Analytical Data Management in R&DThe challenges of Analytical Data Management in R&D
The challenges of Analytical Data Management in R&D
Laura Berry256 views
Reproducible research: theory by C. Tobin Magle
Reproducible research: theoryReproducible research: theory
Reproducible research: theory
C. Tobin Magle1.5K views
What is Data Commons and How Can Your Organization Build One? by Robert Grossman
What is Data Commons and How Can Your Organization Build One?What is Data Commons and How Can Your Organization Build One?
What is Data Commons and How Can Your Organization Build One?
Robert Grossman5.1K views
Docker in Open Science Data Analysis Challenges by Bruce Hoff by Docker, Inc.
Docker in Open Science Data Analysis Challenges by Bruce HoffDocker in Open Science Data Analysis Challenges by Bruce Hoff
Docker in Open Science Data Analysis Challenges by Bruce Hoff
Docker, Inc.652 views

More from IRIDA_community

Robertson immemxi final March 2016 by
Robertson immemxi final March 2016Robertson immemxi final March 2016
Robertson immemxi final March 2016IRIDA_community
367 views22 slides
Hetman immem xi final March 2016 by
Hetman immem xi final March 2016Hetman immem xi final March 2016
Hetman immem xi final March 2016IRIDA_community
486 views21 slides
Barker immemxi final March 2016 by
Barker immemxi final March 2016Barker immemxi final March 2016
Barker immemxi final March 2016IRIDA_community
312 views26 slides
Emma FoodON poster3 by
Emma FoodON poster3Emma FoodON poster3
Emma FoodON poster3IRIDA_community
404 views1 slide
Emma Food on workshop allergy_eg by
Emma Food on workshop allergy_egEmma Food on workshop allergy_eg
Emma Food on workshop allergy_egIRIDA_community
525 views15 slides
Biocuration gen epio_poster by
Biocuration gen epio_posterBiocuration gen epio_poster
Biocuration gen epio_posterIRIDA_community
262 views1 slide

More from IRIDA_community(13)

Robertson immemxi final March 2016 by IRIDA_community
Robertson immemxi final March 2016Robertson immemxi final March 2016
Robertson immemxi final March 2016
IRIDA_community367 views
Hetman immem xi final March 2016 by IRIDA_community
Hetman immem xi final March 2016Hetman immem xi final March 2016
Hetman immem xi final March 2016
IRIDA_community486 views
Emma Food on workshop allergy_eg by IRIDA_community
Emma Food on workshop allergy_egEmma Food on workshop allergy_eg
Emma Food on workshop allergy_eg
IRIDA_community525 views
Emma Griffiths ASM microbe gen_epio_poster by IRIDA_community
Emma Griffiths ASM microbe gen_epio_posterEmma Griffiths ASM microbe gen_epio_poster
Emma Griffiths ASM microbe gen_epio_poster
IRIDA_community383 views
Julie Shay CCBC poster may 11 2016 by IRIDA_community
Julie Shay CCBC poster may 11 2016Julie Shay CCBC poster may 11 2016
Julie Shay CCBC poster may 11 2016
IRIDA_community321 views
Integrate Ontologies into your apps by IRIDA_community
Integrate Ontologies into your appsIntegrate Ontologies into your apps
Integrate Ontologies into your apps
IRIDA_community132 views
Domselaar GMI8 Beijing Canadian WGS Surveillance Experience by IRIDA_community
Domselaar GMI8 Beijing Canadian WGS Surveillance ExperienceDomselaar GMI8 Beijing Canadian WGS Surveillance Experience
Domselaar GMI8 Beijing Canadian WGS Surveillance Experience
IRIDA_community570 views

Recently uploaded

Aminoglycosides by
AminoglycosidesAminoglycosides
AminoglycosidesDr. Ajmer Singh Grewal
11 views26 slides
The Art of naming drugs.pptx by
The Art of naming drugs.pptxThe Art of naming drugs.pptx
The Art of naming drugs.pptxDanaKarem1
12 views48 slides
Histology of pancreas.pdf by
Histology of pancreas.pdfHistology of pancreas.pdf
Histology of pancreas.pdfTimWiyuleMutafyaMD
7 views22 slides
Pharma Franchise For Critical Care Medicine | Saphnix Lifesciences by
Pharma Franchise For Critical Care Medicine | Saphnix LifesciencesPharma Franchise For Critical Care Medicine | Saphnix Lifesciences
Pharma Franchise For Critical Care Medicine | Saphnix LifesciencesSaphnix Lifesciences
9 views8 slides
Examining Pleural Fluid.pptx by
Examining Pleural Fluid.pptxExamining Pleural Fluid.pptx
Examining Pleural Fluid.pptxFareeha Riaz
17 views18 slides
sedative and hypnotics by
sedative and hypnoticssedative and hypnotics
sedative and hypnoticsP.N.DESHMUKH
6 views10 slides

Recently uploaded(20)

The Art of naming drugs.pptx by DanaKarem1
The Art of naming drugs.pptxThe Art of naming drugs.pptx
The Art of naming drugs.pptx
DanaKarem112 views
Pharma Franchise For Critical Care Medicine | Saphnix Lifesciences by Saphnix Lifesciences
Pharma Franchise For Critical Care Medicine | Saphnix LifesciencesPharma Franchise For Critical Care Medicine | Saphnix Lifesciences
Pharma Franchise For Critical Care Medicine | Saphnix Lifesciences
Examining Pleural Fluid.pptx by Fareeha Riaz
Examining Pleural Fluid.pptxExamining Pleural Fluid.pptx
Examining Pleural Fluid.pptx
Fareeha Riaz 17 views
Asthalin Inhaler (Generic Albuterol Sulfate Inhaler) by The Swiss Pharmacy
Asthalin Inhaler (Generic Albuterol Sulfate Inhaler) Asthalin Inhaler (Generic Albuterol Sulfate Inhaler)
Asthalin Inhaler (Generic Albuterol Sulfate Inhaler)
Save 20% on our supplements for kids by novaferrum
Save 20% on our supplements for kidsSave 20% on our supplements for kids
Save 20% on our supplements for kids
novaferrum6 views
Peptic ulcer.pdf by UVAS
Peptic ulcer.pdfPeptic ulcer.pdf
Peptic ulcer.pdf
UVAS9 views
PATIENTCOUNSELLING in.pptx by skShashi1
PATIENTCOUNSELLING  in.pptxPATIENTCOUNSELLING  in.pptx
PATIENTCOUNSELLING in.pptx
skShashi121 views
VarSeq 2.5.0: VSClinical AMP Workflow from the User Perspective by Golden Helix
VarSeq 2.5.0: VSClinical AMP Workflow from the User PerspectiveVarSeq 2.5.0: VSClinical AMP Workflow from the User Perspective
VarSeq 2.5.0: VSClinical AMP Workflow from the User Perspective
Golden Helix83 views
24th oct Pulp Therapy In Young Permanent Teeth.pptx by ismasajjad1
24th oct Pulp Therapy In Young Permanent Teeth.pptx24th oct Pulp Therapy In Young Permanent Teeth.pptx
24th oct Pulp Therapy In Young Permanent Teeth.pptx
ismasajjad113 views
Cholera Romy W. (3).pptx by rweth613
Cholera Romy W. (3).pptxCholera Romy W. (3).pptx
Cholera Romy W. (3).pptx
rweth61353 views

Irida immemxi hsiao

  • 1. IRIDA: Canada’s federated platform for genomic epidemiology William Hsiao, Ph.D. William.hsiao@bccdc.ca @wlhsiao BC Centre for Disease Control Public Health Laboratory and University of British Columbia
  • 2. IRIDA Platform Overview • IRIDA= Integrated Rapid Infectious Disease Analysis • A free, open source, standards compliant, high quality genomic epidemiology analysis platform to support real-time disease outbreak investigations Core Functions: • Management of strain and genomic sequence data • Rapid processing and analysis of genomic data • Informative display of genomic results • Sample, Case, and aggregate data (“metadata”) Management Target audience: • Public health agencies who need a platform to manage and process genomic data • Public health agencies who need a platform to use genomics for outbreak investigations IRIDA Sequencing Instruments Web Application Data management Built-in Analytical Tools External Galaxy Command- line Tools
  • 3. 10 simple rules (wish list) to build a better public health microbiology genomic epidemiology analysis system Download Latest version at https://github.com/phac-nml/irida
  • 4. 1: Engage the Users Through the Entire Software Development Cycle National Public Health Agency Provincial Public Health Agency Academic/Public - Project Team has direct access to state of the art research in academia - Project Team is directly embedded in user organization
  • 5. 2: Have A Simple User Interface Line List View (under testing) Timeline View (Conceptualization) Selectable fields Travel Symptoms and Onset Exposure Types Hospitalization Launch a pipeline Be Like
  • 6. 3: Build a Robust, Extensible Platform • IRIDA uses Galaxy to manage workflows • Adding additional pipelines is relatively easy • Using a standard API to allow 3rd party tools to obtain data from IRIDA (e.g. IslandViewer and GenGIS) IRIDA ServletContainer REST API Central File Storage Web Interface ApplicationLogic Compute Cluster Galaxy $ ~ >_ Galaxy http://www.pathogenomics.sfu.ca/islandviewer/ http://kiwi.cs.dal.ca/GenGIS/Main_Page
  • 7. 4: Have Extensive Documentation • Documentation should be available for • Users – step by step tutorial with screen shots / FAQ • System Administrators – installation instructions / issue trackers • Developers – open source, collaborative development / IRC Channel • Easily Accessible at https://irida.corefacility.ca/documentation/
  • 8. 5: Implement QC Throughout the Whole Application • Genomics is sensitive and sequence data are inherently noisy • Genomics is a rapidly advancing technology • Standardizing pipelines difficult and can stifle innovation • Better to standardize the performance and reporting metrics and ensure any validated pipelines meet the testing criteria • Developing a general QC testing module (RCQC) that use ontology to standardize QC metrics (https://github.com/Public-Health-Bioinformatics/rcqc) • Data Provenance and Version Control (data + Pipelines) are must’s for Diagnostic Labs
  • 9. 6: Build to Enable Collaboration • Be able to compare pipelines • Pipeline implemented using Galaxy – transparent and shareable • Define QC criteria using ontology to compare the different pipelines of the same purpose • Be able to share data in standard formats to minimize data re-entry from one platform to another • Federation of platforms using standard API to share data and analysis results
  • 10. 7: Use Compatible Data Standards • Sequence data are more compatible / shareable but metadata are currently in silo and incompatible • Collaboration and Sharing are difficult when data are incompatible • Compatibility != Sameness • Use Ontology to allow customization of term list but all terms with same meaning (semantics) should have the same universal ID (e.g. an URL) to facilitate mapping of terms
  • 11. 8: Implement Fine Grained Access Control Detailed View Restricted View E.g. User role permissions control visibility and editing of content Authorization • Industry-standard authentication and authorization mechanisms • Local authorization per instance. • Method-level authorization. • Object-level authorization.
  • 12. 9: Use Technology to Safeguard Patient Privacy It’s easy to lose control of the Excel Line List - someone can make a copy of the content and pass it around without your knowledge; typos are common and cumulative! Technology can control who sees what and when Separate out sensitive patient data from pathogen sequence data but be able to bring them together when necessary without resorting to emailing of line lists!
  • 13. 10: Have Multiple, Flexible Access Options • No one size fits all solution; Having many platforms to choose from is a good thing (but data should be portable across platforms!) • IRIDA is available in several different flavours: Local Install Virtual Machine Cloud Instance Public Version Advantages Full control of the system; your data never leave your centre Full control of the system; Easy to setup Full control of the system; does not require local computing infrastructure No setup required, upload your data and have it processed using Compute Canada Resource Disadvantages Computing infrastructure and IT support needed to main the resource Not really scalable if run on your own desktop; some performance loss Data go into a cloud environment; uploading to cloud environment can be slow Data go into a public instance (data remain private to your account); upload can be slow
  • 14. Acknowledgements Project Leaders Fiona Brinkman – SFU Will Hsiao – PHMRL Gary Van Domselaar – NML University of Lisbon Joᾶo Carriҫo National Microbiology Laboratory (NML) Franklin Bristow Aaron Petkau Thomas Matthews Josh Adam Adam Olson Tarah Lynch Shaun Tyler Philip Mabon Philip Au Celine Nadon Matthew Stuart-Edwards Morag Graham Chrystal Berry Lorelee Tschetter Aleisha Reimer Laboratory for Foodborne Zoonoses (LFZ) Eduardo Taboada Peter Kruczkiewicz Chad Laing Vic Gannon Matthew Whiteside Ross Duncan Steven Mutschall Simon Fraser University (SFU) Melanie Courtot Emma Griffiths Geoff Winsor Julie Shay Matthew Laird Bhav Dhillon Raymond Lo BC Public Health Microbiology & Reference Laboratory (PHMRL) and BC Centre for Disease Control (BCCDC) Judy Isaac-Renton Patrick Tang Natalie Prystajecky Jennifer Gardy Damion Dooley Linda Hoang Kim MacDonald Yin Chang Eleni Galanis Marsha Taylor Cletus D’Souza Ana Paccagnella University of Maryland Lynn Schriml Canadian Food Inspection Agency (CFIA) Burton Blais Catherine Carrillo Dominic Lambert Dalhousie University Rob Beiko Alex Keddy 14 McMaster University Andrew McArthur Daim Sardar European Nucleotide Archive Guy Cochrane Petra ten Hoopen Clara Amid European Food Safety Agency Leibana Criado Ernesto Vernazza Francesco Rizzi Valentina
  • 15. 15 15 IRIDA Annual General Meeting Winnipeg, April 8-9, 2015

Editor's Notes

  1. What is IRIDA?
  2. Inspired by Jenn’s keynote, I reworked my slides in the 10 simple rules format Many systems are and will be available for analyzing public health microbiology data and we have seen a few throughout this conference. So I thought I’d present what I think are some of the rules and my wishlist for building a better public health genomic epidemiology platform. Highlighting how some of this thinking apply to our implementation of a platform Some of these rules have been implemented well in others applications
  3. Large Group of People who contributed to this work
  4. We also have a wonderful group of advisors