SlideShare a Scribd company logo
1 of 38
Building a sustainable, open source
community
July 22, 2015
Outline of today’s presentation
● Nikolaus Schultz (Memorial Sloan Kettering Cancer Center, New York)
o cBioPortal background
o License change
o New group structure
o Roadmap (features & data)
● Ethan Cerami (Dana-Farber Cancer Institute, Boston)
o DFCI experience
o Roadmap ctd. (collaborative software development)
● Stuart Watt (Princess Margaret Cancer Centre, Toronto)
o Princess Margaret experience
● Kees van Bochove (The Hyve, The Netherlands)
o The Hyve: Support for cBioPortal
● Nikolaus Schultz
o Possible funding options
o Discussion
cBioPortal:
Data to Knowledge
Intuitive interface, quick response time, reduction of complexity
Gene alteration events per sample:
Which genes are altered in each sample?
Alteration types and thresholds can be customized for each gene.
cBioPortal usage and interest
cbioportal.org
>5,000 unique users per week, doubling every year
cBioPortal use at MSKCC
Integration of MSKCC research sequencing,
clinical sequencing (in-house and Foundation
Medicine), with public data sets (TCGA, ICGC,
manually curated data).
MSK clinical data is loaded from institutional
databases
MSK-internal access control.
cBioPortal usage and interest
cbioportal.org
>5,000 unique users per week, doubling every year
Numerous academic installations of cBioPortal:
Dana-Farber, Princess Margaret, CHOP, Weill Cornell, Fred Hutchinson, UCSC, Columbia, NYU, NY Genome
Center, British Columbia, University of Michigan, SickKids, Vanderbilt, Emory, UNC, University of Pittsburgh,
CRUK, EMBL, Charite Berlin, institutions in Japan, China, …
Interest by several people to modify or customize the code, and to contribute new features
Interest by pharmaceutical companies and others to use cBioPortal
● For internal data analysis (large pharma)
● In customer-facing applications (smaller service companies)
Switch to open source
cBioPortal source code is available via GitHub:
https://github.com/cBioPortal/cbioportal
AGPL license v3 (Affero GPL):
A GPL variant, main difference is that redistribution over a network triggers
the copyleft requirements
Impact on cancer research, patient treatment, drug development through:
● More robust and flexible software
● Accelerated development of new features
● Wider user base, collaborative culture
Core cBioPortal Development group
Memorial Sloan Kettering Cancer Center
Nikolaus Schultz, Chris Sander, Benjamin Gross, JJ Gao
Dana Farber Cancer Institute
Ethan Cerami
Princess Margaret Cancer Centre
Trevor Pugh, Stuart Watt
Re-uniting two cBioPortal founders
Coordination of architectural decisions, feature development, merges, etc.
cBioPortal roadmap overview
Analysis and Visualization Features
- Enhancements to the patient view
- Improved cohort analysis
- Query interface improvements
Software Architecture Improvements
- Performance and scalability
improvements
New Data Types and Pipelines
- Support for new genomic data types
- Data import pipelines
- Manual data curation
Community Engagement
- Users
- Local installations
- Developers
Planned features: Improved study view
Planned features: Improved patient view
Planned features: Enrichment analysis
Example: Mutations enriched in EGFR/KRAS-wildtype lung cancer samples (TCGA)
Planned features: Enrichment analysis
Example: Mutations enriched in EGFR/KRAS-wildtype lung cancer samples (TCGA)
Planned features: Cross-cancer expression
Example: Expression of SMARCA2 across TCGA tumor types
Cohort analysis:
Enrichment analysis
Cross-cancer RNA expression
Cross-cancer query within mixed studies
Improved plotting of clinical vs genomic data
True cross-cancer queries (merged OncoPrints)
OncoPrint enhancements:
Improved rendering speed
Better clinical annotation tracks
More coloring options for mutation
type
Not sequenced vs no mutation
mRNA expression heatmaps
Merging of tracks
Mutation-specific improvements
Support for multiple gene isoforms
Support for silent mutations
Planned features
Patient view:
Variant annotation and tiering
Clinical trial integration
New simplified interface for clinicians
Multi-sample view and timelines
Support for xenografts
Dynamic copy-number view
Integration of pathology images
Query interface improvements:
Building of case sets
Study view improvements
Improved rendering speed
Selection of genes to start
queries
New data:
ICGC data import pipeline
More curated data sets from the literature
Normal expression levels from GTEX
RNA fusions for TCGA samples via PanCanFus
Planned new data
New data types:
Richer support for gene fusions
Support for normal mRNA expression
Support for normal methylation levels
Epigenetic silencing calls
Mass-spec data
microRNA expression
Different types of z-scores
DFCI Experience
Ethan Cerami
Precision Medicine
Garraway, LA, Genomics-Driven Oncology: Framework for an Emerging Paradigm. J. Clin Oncology, 2013 May 20.
Knowledge Systems @ DFCI
DFCI Portal @ DFCI
● Getting data to researchers and clinicians with lowest barrier to access.
● Data transformation.
● Integrating with institutional authentication procedures: SAML
o roles and users, fine grained authorization.
● Prioritized Features
o cBioSkins
o Pan-Cancer Queries
o Build a Case
o Integrating Tiering information
RoadMap Continued
Collaborative Software Development
● All code and issues are now on GitHub
● Improved Documentation
● Stabilizing all Unit Tests
o automation via https://travis-ci.org/
● Finalizing
o branch process
o contributor agreement
o review of pull requests
RoadMap Continued
Community outreach / communication
Users:
● cBioPortal.org website. Approximately 1,000 unique users visiting cBioPortal.org every day -
this site will be used to announce new features, data, and the general availability of the software.
● Google Group. We have an active message board in Google Groups, which users can turn to
for any questions about cBioPortal features and data on the public site.
https://groups.google.com/forum/#!forum/cbioportal
● Twitter. The cBioPortal Twitter handle @cbioportal currently has over 300 followers.
● Mailing list. We have a mailing list with over 300 subscribers.
Developers / installers:
● cBioPortal GitHub page. We will use this page to communicate with cBioPortal software
developers
● cBioPortal Slack channel. We have recently set up a cBioPortal Slack channel for core
developers.
Princess Margaret
Experience
Stuart Watt
Trevor Pugh
Princess Margaret Cancer Centre
● Largest cancer centre in Canada
● Fifth largest in the world
● 3,000 staff, 400,000 patient visits annually
● Central part of the Toronto biotech hub
o University of Toronto teaching hospital
o Close associations with SickKids, OICR
PMCC Cancer Genomics Programme
● Translational research programme
o Aim for comprehensive molecular characterization
as standard practice by 2018
o Joint action by research and clinical groups
o CAP/CLIA-certified diagnostics lab (AMDL)
● Connecting research and the clinic by:
o Designing, developing, validating and implementing
new tests
o E.g., 555-gene sequencing panel for tumor profiling
cBioPortal at PMCC
● It’s the closest to where we want to be
o Most functionality needed is already there
o Integration of clinical and genomic data
o Already used by some clinicians
o Significant uptake from research groups
● Some challenges we face
o Deployment processes
o Data import
o Access control
Current plans: cBioPortal at PMCC
● Improve verifiability of the platform
● Integration into standard clinical pipeline
● Integration into institutional systems
o Clinical records
o Authentication
o Access control
The Hyve
Kees van Bochove
● cBioPortal now available as open source
● Used at many research institutions,
sometimes already in “production” mode
● Coordinated software development across
three groups
● Detailed roadmap of planned features, open
to feedback about other features
● Commercial support by The Hyve
Summary
How can we fund this going forward?
● Institutional funds? Often used for custom internal
work
● Grants? Less available than they used to be
● Pre-competitive funding through pharma?
Pre-competitive funding options
● Create independent foundation
● Join existing foundation (e.g., tranSMART)
● Create a smaller coalition of funders
How can the contributions be used? Who allocates
them?
“Steering” committee vs “advisory” committee?
Small and nimble vs larger and …?
How valuable is access to curated data?
Small consortium of 4-5 pharma with seat on advisory
committee (plus the three of us) @ $100k / year
● Advisory role
● Access to all data, ability to nominate data sets for curation
● Money is allocated for development within our groups
● Lower data-only tier available @ $30k / year
Proposed cBioPortal funding model
2-3 year commitment, determine longer term sustainability plan during year 1
Discussion & Survey
We need your feedback on our ideas
What are your ideas?
What else is needed to make this work?
Please complete our survey at:
http://bit.ly/cbioportal-survey
Thank you
Carmen Nitsche
Richard Holland
Julia Rudolph, MSKCC
cBioPortal developers
present:
JianJiong Gao
Benjamin Gross
Yichao Sun
Hongxin Zhang
Fred Criscuolo
Dong Li
Adam Abeshouse
Ritika Kundra
Annice Chen
Onur Sumer
past:
Caitlin Byrne
Gideon Dresdner
Arthur Goldberg
Michael Heuer
Anders Jacobsen
Erik Larsson

More Related Content

What's hot

Comparative genomics
Comparative genomicsComparative genomics
Comparative genomics
hemantbreeder
 

What's hot (20)

Illumina (sequencing by synthesis) method
Illumina (sequencing by synthesis) methodIllumina (sequencing by synthesis) method
Illumina (sequencing by synthesis) method
 
Tech Talk: UCSC Genome Browser
Tech Talk: UCSC Genome BrowserTech Talk: UCSC Genome Browser
Tech Talk: UCSC Genome Browser
 
Intro bioinfo
Intro bioinfoIntro bioinfo
Intro bioinfo
 
Comparative genomics 2
Comparative genomics 2Comparative genomics 2
Comparative genomics 2
 
Challenges and opportunities in personal omics profiling
Challenges and opportunities in personal omics profilingChallenges and opportunities in personal omics profiling
Challenges and opportunities in personal omics profiling
 
Functional genomics
Functional genomicsFunctional genomics
Functional genomics
 
Databases pathways of genomics and proteomics
Databases pathways of genomics and proteomics Databases pathways of genomics and proteomics
Databases pathways of genomics and proteomics
 
Immunoinformatics
ImmunoinformaticsImmunoinformatics
Immunoinformatics
 
COMPARATIVE GENOMICS.ppt
COMPARATIVE GENOMICS.pptCOMPARATIVE GENOMICS.ppt
COMPARATIVE GENOMICS.ppt
 
Systems biology & Approaches of genomics and proteomics
 Systems biology & Approaches of genomics and proteomics Systems biology & Approaches of genomics and proteomics
Systems biology & Approaches of genomics and proteomics
 
Types of genomics ppt
Types of genomics pptTypes of genomics ppt
Types of genomics ppt
 
Genome analysis2
Genome analysis2Genome analysis2
Genome analysis2
 
Cancer Databases
Cancer Databases Cancer Databases
Cancer Databases
 
Orthologs,Paralogs & Xenologs
 Orthologs,Paralogs & Xenologs  Orthologs,Paralogs & Xenologs
Orthologs,Paralogs & Xenologs
 
Multiple Sequence Alignment by Shubham Kaushik
Multiple Sequence Alignment by Shubham KaushikMultiple Sequence Alignment by Shubham Kaushik
Multiple Sequence Alignment by Shubham Kaushik
 
Structural Genomics
Structural GenomicsStructural Genomics
Structural Genomics
 
GENOMICS AND BIOINFORMATICS
GENOMICS AND BIOINFORMATICSGENOMICS AND BIOINFORMATICS
GENOMICS AND BIOINFORMATICS
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomics
 
2016 Cytoscape 3.3 Tutorial
2016 Cytoscape 3.3 Tutorial2016 Cytoscape 3.3 Tutorial
2016 Cytoscape 3.3 Tutorial
 
String.pptx
String.pptxString.pptx
String.pptx
 

Viewers also liked (7)

cBio Cancer Genomics Portal: Getting started
cBio Cancer Genomics Portal:  Getting startedcBio Cancer Genomics Portal:  Getting started
cBio Cancer Genomics Portal: Getting started
 
cBio Cancer Genomics Portal: Network View
cBio Cancer Genomics Portal:  Network ViewcBio Cancer Genomics Portal:  Network View
cBio Cancer Genomics Portal: Network View
 
cBioPortal Webinar Slides (1/3)
cBioPortal Webinar Slides (1/3)cBioPortal Webinar Slides (1/3)
cBioPortal Webinar Slides (1/3)
 
cBioPortal Webinar Slides (3/3)
cBioPortal Webinar Slides (3/3)cBioPortal Webinar Slides (3/3)
cBioPortal Webinar Slides (3/3)
 
Dr. Ying Xiao: Radiation Therapy Oncology Group Bioinformatics
Dr. Ying Xiao: Radiation Therapy Oncology Group BioinformaticsDr. Ying Xiao: Radiation Therapy Oncology Group Bioinformatics
Dr. Ying Xiao: Radiation Therapy Oncology Group Bioinformatics
 
The Stanford-Cancer Genome Atlas Portal Tutorial
The Stanford-Cancer Genome Atlas Portal TutorialThe Stanford-Cancer Genome Atlas Portal Tutorial
The Stanford-Cancer Genome Atlas Portal Tutorial
 
Dr. Ethan Cerami: cBio Cancer Genomics Portal
Dr. Ethan Cerami: cBio Cancer Genomics PortalDr. Ethan Cerami: cBio Cancer Genomics Portal
Dr. Ethan Cerami: cBio Cancer Genomics Portal
 

Similar to cBioPortal Webinar Slides (2/3)

Predictive in vitro & in silico Methods for Precision Medicine- Robert G. Hun...
Predictive in vitro & in silico Methods for Precision Medicine- Robert G. Hun...Predictive in vitro & in silico Methods for Precision Medicine- Robert G. Hun...
Predictive in vitro & in silico Methods for Precision Medicine- Robert G. Hun...
RobertGHunter
 
GCAT Update June 2013 @ The Clinical Genome Conference
GCAT Update June 2013 @ The Clinical Genome ConferenceGCAT Update June 2013 @ The Clinical Genome Conference
GCAT Update June 2013 @ The Clinical Genome Conference
David Mittelman
 

Similar to cBioPortal Webinar Slides (2/3) (20)

NCI Cancer Genomics, Open Science and PMI: FAIR
NCI Cancer Genomics, Open Science and PMI: FAIR NCI Cancer Genomics, Open Science and PMI: FAIR
NCI Cancer Genomics, Open Science and PMI: FAIR
 
The Cancer Genomics Cloud (CGC) pilots - an Introduction
The Cancer Genomics Cloud (CGC) pilots  - an IntroductionThe Cancer Genomics Cloud (CGC) pilots  - an Introduction
The Cancer Genomics Cloud (CGC) pilots - an Introduction
 
Predictive in vitro & in silico Methods for Precision Medicine- Robert G. Hun...
Predictive in vitro & in silico Methods for Precision Medicine- Robert G. Hun...Predictive in vitro & in silico Methods for Precision Medicine- Robert G. Hun...
Predictive in vitro & in silico Methods for Precision Medicine- Robert G. Hun...
 
Nov 2014 ouellette_windsor_icgc_final
Nov 2014 ouellette_windsor_icgc_finalNov 2014 ouellette_windsor_icgc_final
Nov 2014 ouellette_windsor_icgc_final
 
Data Commons & Data Science Workshop
Data Commons & Data Science WorkshopData Commons & Data Science Workshop
Data Commons & Data Science Workshop
 
Big Data in Genomics: Opportunities and Challenges
Big Data in Genomics: Opportunities and ChallengesBig Data in Genomics: Opportunities and Challenges
Big Data in Genomics: Opportunities and Challenges
 
Cancer Moonshot, Data sharing and the Genomic Data Commons
Cancer Moonshot, Data sharing and the Genomic Data CommonsCancer Moonshot, Data sharing and the Genomic Data Commons
Cancer Moonshot, Data sharing and the Genomic Data Commons
 
tranSMART Community Meeting 5-7 Nov 13 - Session 3: The TraIT user stories fo...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: The TraIT user stories fo...tranSMART Community Meeting 5-7 Nov 13 - Session 3: The TraIT user stories fo...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: The TraIT user stories fo...
 
Introduction to CTIM - the Clinical Trial Information Mediator
Introduction to CTIM - the Clinical Trial Information MediatorIntroduction to CTIM - the Clinical Trial Information Mediator
Introduction to CTIM - the Clinical Trial Information Mediator
 
Biocuration activities for the International Cancer Genome Consortium (ICGC).
Biocuration activities for the International Cancer Genome Consortium (ICGC).Biocuration activities for the International Cancer Genome Consortium (ICGC).
Biocuration activities for the International Cancer Genome Consortium (ICGC).
 
GCAT Update June 2013 @ The Clinical Genome Conference
GCAT Update June 2013 @ The Clinical Genome ConferenceGCAT Update June 2013 @ The Clinical Genome Conference
GCAT Update June 2013 @ The Clinical Genome Conference
 
Markham2009
Markham2009Markham2009
Markham2009
 
ICBO 2014, October 8, 2014
ICBO 2014, October 8, 2014ICBO 2014, October 8, 2014
ICBO 2014, October 8, 2014
 
FDA NGS and Big Data Conference September 2014
FDA NGS and Big Data Conference September 2014FDA NGS and Big Data Conference September 2014
FDA NGS and Big Data Conference September 2014
 
Cancer moonshot and data sharing
Cancer moonshot and data sharingCancer moonshot and data sharing
Cancer moonshot and data sharing
 
EBI Industry programme TCGA Warren KIbbe November 2013
EBI Industry programme TCGA Warren KIbbe November 2013EBI Industry programme TCGA Warren KIbbe November 2013
EBI Industry programme TCGA Warren KIbbe November 2013
 
International Cancer Genomics Consortium (ICGC) Data Coordinating Center
International Cancer Genomics Consortium (ICGC) Data Coordinating CenterInternational Cancer Genomics Consortium (ICGC) Data Coordinating Center
International Cancer Genomics Consortium (ICGC) Data Coordinating Center
 
Federal Research & Development for the Florida system Sept 2014
Federal Research & Development for the Florida system Sept 2014 Federal Research & Development for the Florida system Sept 2014
Federal Research & Development for the Florida system Sept 2014
 
Clinical trial data wants to be free: Lessons from the ImmPort Immunology Dat...
Clinical trial data wants to be free: Lessons from the ImmPort Immunology Dat...Clinical trial data wants to be free: Lessons from the ImmPort Immunology Dat...
Clinical trial data wants to be free: Lessons from the ImmPort Immunology Dat...
 
BRCA exchange: Sharing global knowledge about BRCA1/2
BRCA exchange: Sharing global knowledge about BRCA1/2BRCA exchange: Sharing global knowledge about BRCA1/2
BRCA exchange: Sharing global knowledge about BRCA1/2
 

More from Pistoia Alliance

More from Pistoia Alliance (20)

Fairification experience clarifying the semantics of data matrices
Fairification experience clarifying the semantics of data matricesFairification experience clarifying the semantics of data matrices
Fairification experience clarifying the semantics of data matrices
 
MPS webinar master deck
MPS webinar master deckMPS webinar master deck
MPS webinar master deck
 
Digital webinar master deck final
Digital webinar master deck finalDigital webinar master deck final
Digital webinar master deck final
 
Heartificial intelligence - claudio-mirti
Heartificial intelligence - claudio-mirtiHeartificial intelligence - claudio-mirti
Heartificial intelligence - claudio-mirti
 
Fair by design
Fair by designFair by design
Fair by design
 
Knowledge graphs ilaria maresi the hyve 23apr2020
Knowledge graphs   ilaria maresi the hyve 23apr2020Knowledge graphs   ilaria maresi the hyve 23apr2020
Knowledge graphs ilaria maresi the hyve 23apr2020
 
2020.04.07 automated molecular design and the bradshaw platform webinar
2020.04.07 automated molecular design and the bradshaw platform webinar2020.04.07 automated molecular design and the bradshaw platform webinar
2020.04.07 automated molecular design and the bradshaw platform webinar
 
Data market evolution, a future shaped by FAIR
Data market evolution, a future shaped by FAIRData market evolution, a future shaped by FAIR
Data market evolution, a future shaped by FAIR
 
AI in translational medicine webinar
AI in translational medicine webinarAI in translational medicine webinar
AI in translational medicine webinar
 
CEDAR work bench for metadata management
CEDAR work bench for metadata managementCEDAR work bench for metadata management
CEDAR work bench for metadata management
 
Open interoperability standards, tools and services at EMBL-EBI
Open interoperability standards, tools and services at EMBL-EBIOpen interoperability standards, tools and services at EMBL-EBI
Open interoperability standards, tools and services at EMBL-EBI
 
Fair webinar, Ted slater: progress towards commercial fair data products and ...
Fair webinar, Ted slater: progress towards commercial fair data products and ...Fair webinar, Ted slater: progress towards commercial fair data products and ...
Fair webinar, Ted slater: progress towards commercial fair data products and ...
 
Application of recently developed FAIR metrics to the ELIXIR Core Data Resources
Application of recently developed FAIR metrics to the ELIXIR Core Data ResourcesApplication of recently developed FAIR metrics to the ELIXIR Core Data Resources
Application of recently developed FAIR metrics to the ELIXIR Core Data Resources
 
Implementing Blockchain applications in healthcare
Implementing Blockchain applications in healthcareImplementing Blockchain applications in healthcare
Implementing Blockchain applications in healthcare
 
Building trust and accountability - the role User Experience design can play ...
Building trust and accountability - the role User Experience design can play ...Building trust and accountability - the role User Experience design can play ...
Building trust and accountability - the role User Experience design can play ...
 
Pistoia Alliance-Elsevier Datathon
Pistoia Alliance-Elsevier DatathonPistoia Alliance-Elsevier Datathon
Pistoia Alliance-Elsevier Datathon
 
Data for AI models, the past, the present, the future
Data for AI models, the past, the present, the futureData for AI models, the past, the present, the future
Data for AI models, the past, the present, the future
 
PA webinar on benefits & costs of FAIR implementation in life sciences
PA webinar on benefits & costs of FAIR implementation in life sciences PA webinar on benefits & costs of FAIR implementation in life sciences
PA webinar on benefits & costs of FAIR implementation in life sciences
 
AI & ML in Drug Design: Pistoia Alliance CoE
AI & ML in Drug Design: Pistoia Alliance CoEAI & ML in Drug Design: Pistoia Alliance CoE
AI & ML in Drug Design: Pistoia Alliance CoE
 
Ai in drug design webinar 26 feb 2019
Ai in drug design webinar 26 feb 2019Ai in drug design webinar 26 feb 2019
Ai in drug design webinar 26 feb 2019
 

Recently uploaded

Bacterial Identification and Classifications
Bacterial Identification and ClassificationsBacterial Identification and Classifications
Bacterial Identification and Classifications
Areesha Ahmad
 
Presentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxPresentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptx
gindu3009
 
Seismic Method Estimate velocity from seismic data.pptx
Seismic Method Estimate velocity from seismic  data.pptxSeismic Method Estimate velocity from seismic  data.pptx
Seismic Method Estimate velocity from seismic data.pptx
AlMamun560346
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Sérgio Sacani
 
Conjugation, transduction and transformation
Conjugation, transduction and transformationConjugation, transduction and transformation
Conjugation, transduction and transformation
Areesha Ahmad
 
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
PirithiRaju
 
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
ssuser79fe74
 

Recently uploaded (20)

GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)
 
Bacterial Identification and Classifications
Bacterial Identification and ClassificationsBacterial Identification and Classifications
Bacterial Identification and Classifications
 
Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.
 
Presentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxPresentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptx
 
Clean In Place(CIP).pptx .
Clean In Place(CIP).pptx                 .Clean In Place(CIP).pptx                 .
Clean In Place(CIP).pptx .
 
Seismic Method Estimate velocity from seismic data.pptx
Seismic Method Estimate velocity from seismic  data.pptxSeismic Method Estimate velocity from seismic  data.pptx
Seismic Method Estimate velocity from seismic data.pptx
 
Zoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdfZoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdf
 
module for grade 9 for distance learning
module for grade 9 for distance learningmodule for grade 9 for distance learning
module for grade 9 for distance learning
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
 
Conjugation, transduction and transformation
Conjugation, transduction and transformationConjugation, transduction and transformation
Conjugation, transduction and transformation
 
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls AgencyHire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
 
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
 
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
 
Justdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts Service
Justdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts ServiceJustdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts Service
Justdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts Service
 
Factory Acceptance Test( FAT).pptx .
Factory Acceptance Test( FAT).pptx       .Factory Acceptance Test( FAT).pptx       .
Factory Acceptance Test( FAT).pptx .
 
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
 
Feature-aligned N-BEATS with Sinkhorn divergence (ICLR '24)
Feature-aligned N-BEATS with Sinkhorn divergence (ICLR '24)Feature-aligned N-BEATS with Sinkhorn divergence (ICLR '24)
Feature-aligned N-BEATS with Sinkhorn divergence (ICLR '24)
 
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceuticsPulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
 
GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)
 
GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)
 

cBioPortal Webinar Slides (2/3)

  • 1. Building a sustainable, open source community July 22, 2015
  • 2. Outline of today’s presentation ● Nikolaus Schultz (Memorial Sloan Kettering Cancer Center, New York) o cBioPortal background o License change o New group structure o Roadmap (features & data) ● Ethan Cerami (Dana-Farber Cancer Institute, Boston) o DFCI experience o Roadmap ctd. (collaborative software development) ● Stuart Watt (Princess Margaret Cancer Centre, Toronto) o Princess Margaret experience ● Kees van Bochove (The Hyve, The Netherlands) o The Hyve: Support for cBioPortal ● Nikolaus Schultz o Possible funding options o Discussion
  • 3. cBioPortal: Data to Knowledge Intuitive interface, quick response time, reduction of complexity
  • 4. Gene alteration events per sample: Which genes are altered in each sample? Alteration types and thresholds can be customized for each gene.
  • 5. cBioPortal usage and interest cbioportal.org >5,000 unique users per week, doubling every year
  • 6. cBioPortal use at MSKCC Integration of MSKCC research sequencing, clinical sequencing (in-house and Foundation Medicine), with public data sets (TCGA, ICGC, manually curated data). MSK clinical data is loaded from institutional databases MSK-internal access control.
  • 7. cBioPortal usage and interest cbioportal.org >5,000 unique users per week, doubling every year Numerous academic installations of cBioPortal: Dana-Farber, Princess Margaret, CHOP, Weill Cornell, Fred Hutchinson, UCSC, Columbia, NYU, NY Genome Center, British Columbia, University of Michigan, SickKids, Vanderbilt, Emory, UNC, University of Pittsburgh, CRUK, EMBL, Charite Berlin, institutions in Japan, China, … Interest by several people to modify or customize the code, and to contribute new features Interest by pharmaceutical companies and others to use cBioPortal ● For internal data analysis (large pharma) ● In customer-facing applications (smaller service companies)
  • 8. Switch to open source cBioPortal source code is available via GitHub: https://github.com/cBioPortal/cbioportal AGPL license v3 (Affero GPL): A GPL variant, main difference is that redistribution over a network triggers the copyleft requirements Impact on cancer research, patient treatment, drug development through: ● More robust and flexible software ● Accelerated development of new features ● Wider user base, collaborative culture
  • 9. Core cBioPortal Development group Memorial Sloan Kettering Cancer Center Nikolaus Schultz, Chris Sander, Benjamin Gross, JJ Gao Dana Farber Cancer Institute Ethan Cerami Princess Margaret Cancer Centre Trevor Pugh, Stuart Watt Re-uniting two cBioPortal founders Coordination of architectural decisions, feature development, merges, etc.
  • 10. cBioPortal roadmap overview Analysis and Visualization Features - Enhancements to the patient view - Improved cohort analysis - Query interface improvements Software Architecture Improvements - Performance and scalability improvements New Data Types and Pipelines - Support for new genomic data types - Data import pipelines - Manual data curation Community Engagement - Users - Local installations - Developers
  • 13. Planned features: Enrichment analysis Example: Mutations enriched in EGFR/KRAS-wildtype lung cancer samples (TCGA)
  • 14. Planned features: Enrichment analysis Example: Mutations enriched in EGFR/KRAS-wildtype lung cancer samples (TCGA)
  • 15. Planned features: Cross-cancer expression Example: Expression of SMARCA2 across TCGA tumor types
  • 16. Cohort analysis: Enrichment analysis Cross-cancer RNA expression Cross-cancer query within mixed studies Improved plotting of clinical vs genomic data True cross-cancer queries (merged OncoPrints) OncoPrint enhancements: Improved rendering speed Better clinical annotation tracks More coloring options for mutation type Not sequenced vs no mutation mRNA expression heatmaps Merging of tracks Mutation-specific improvements Support for multiple gene isoforms Support for silent mutations Planned features Patient view: Variant annotation and tiering Clinical trial integration New simplified interface for clinicians Multi-sample view and timelines Support for xenografts Dynamic copy-number view Integration of pathology images Query interface improvements: Building of case sets Study view improvements Improved rendering speed Selection of genes to start queries
  • 17. New data: ICGC data import pipeline More curated data sets from the literature Normal expression levels from GTEX RNA fusions for TCGA samples via PanCanFus Planned new data New data types: Richer support for gene fusions Support for normal mRNA expression Support for normal methylation levels Epigenetic silencing calls Mass-spec data microRNA expression Different types of z-scores
  • 19. Precision Medicine Garraway, LA, Genomics-Driven Oncology: Framework for an Emerging Paradigm. J. Clin Oncology, 2013 May 20.
  • 21.
  • 22.
  • 23. DFCI Portal @ DFCI ● Getting data to researchers and clinicians with lowest barrier to access. ● Data transformation. ● Integrating with institutional authentication procedures: SAML o roles and users, fine grained authorization. ● Prioritized Features o cBioSkins o Pan-Cancer Queries o Build a Case o Integrating Tiering information
  • 24. RoadMap Continued Collaborative Software Development ● All code and issues are now on GitHub ● Improved Documentation ● Stabilizing all Unit Tests o automation via https://travis-ci.org/ ● Finalizing o branch process o contributor agreement o review of pull requests
  • 25. RoadMap Continued Community outreach / communication Users: ● cBioPortal.org website. Approximately 1,000 unique users visiting cBioPortal.org every day - this site will be used to announce new features, data, and the general availability of the software. ● Google Group. We have an active message board in Google Groups, which users can turn to for any questions about cBioPortal features and data on the public site. https://groups.google.com/forum/#!forum/cbioportal ● Twitter. The cBioPortal Twitter handle @cbioportal currently has over 300 followers. ● Mailing list. We have a mailing list with over 300 subscribers. Developers / installers: ● cBioPortal GitHub page. We will use this page to communicate with cBioPortal software developers ● cBioPortal Slack channel. We have recently set up a cBioPortal Slack channel for core developers.
  • 27. Princess Margaret Cancer Centre ● Largest cancer centre in Canada ● Fifth largest in the world ● 3,000 staff, 400,000 patient visits annually ● Central part of the Toronto biotech hub o University of Toronto teaching hospital o Close associations with SickKids, OICR
  • 28. PMCC Cancer Genomics Programme ● Translational research programme o Aim for comprehensive molecular characterization as standard practice by 2018 o Joint action by research and clinical groups o CAP/CLIA-certified diagnostics lab (AMDL) ● Connecting research and the clinic by: o Designing, developing, validating and implementing new tests o E.g., 555-gene sequencing panel for tumor profiling
  • 29. cBioPortal at PMCC ● It’s the closest to where we want to be o Most functionality needed is already there o Integration of clinical and genomic data o Already used by some clinicians o Significant uptake from research groups ● Some challenges we face o Deployment processes o Data import o Access control
  • 30.
  • 31. Current plans: cBioPortal at PMCC ● Improve verifiability of the platform ● Integration into standard clinical pipeline ● Integration into institutional systems o Clinical records o Authentication o Access control
  • 32. The Hyve Kees van Bochove
  • 33. ● cBioPortal now available as open source ● Used at many research institutions, sometimes already in “production” mode ● Coordinated software development across three groups ● Detailed roadmap of planned features, open to feedback about other features ● Commercial support by The Hyve Summary
  • 34. How can we fund this going forward? ● Institutional funds? Often used for custom internal work ● Grants? Less available than they used to be ● Pre-competitive funding through pharma?
  • 35. Pre-competitive funding options ● Create independent foundation ● Join existing foundation (e.g., tranSMART) ● Create a smaller coalition of funders How can the contributions be used? Who allocates them? “Steering” committee vs “advisory” committee? Small and nimble vs larger and …? How valuable is access to curated data?
  • 36. Small consortium of 4-5 pharma with seat on advisory committee (plus the three of us) @ $100k / year ● Advisory role ● Access to all data, ability to nominate data sets for curation ● Money is allocated for development within our groups ● Lower data-only tier available @ $30k / year Proposed cBioPortal funding model 2-3 year commitment, determine longer term sustainability plan during year 1
  • 37. Discussion & Survey We need your feedback on our ideas What are your ideas? What else is needed to make this work? Please complete our survey at: http://bit.ly/cbioportal-survey
  • 38. Thank you Carmen Nitsche Richard Holland Julia Rudolph, MSKCC cBioPortal developers present: JianJiong Gao Benjamin Gross Yichao Sun Hongxin Zhang Fred Criscuolo Dong Li Adam Abeshouse Ritika Kundra Annice Chen Onur Sumer past: Caitlin Byrne Gideon Dresdner Arthur Goldberg Michael Heuer Anders Jacobsen Erik Larsson