SlideShare a Scribd company logo
DREAM Challenge
 Gustavo Stolovitzky, IBM Computational Biology Center
         Andrea Califano, Columbia University
DREAM

• DREAM is a Dialogue for Reverse Engineering
  Assessments and Methods.
• The main objective is to catalyze the interaction
  between experiment and theory in the area of
  cellular network inference and quantitative model
  building in systems biology
Challenges

• Network Topology and Parameter Inference Challenge
• Sage Bionetworks - DREAM Breast Cancer Prognosis C
• The DREAM-Phil Bowen ALS Prediction Prize4Life
• NCI-DREAM Drug Sensitivity Prediction Challenge
Project/Challenges

•   Network Topology and Parameter Inference challenge
    •   Develop/apply optimization methods including the selection of the
        most informative experiments, to accurately estimate parameters
        and predict outcomes of perturbations in Systems Biology models
        given
        •   In Model 1 the complete structure of the model (including
            expressions for the kinetic rate laws) for a gene regulatory
            network composed of 9 genes. Protein and mRNA are
            explicitly modeled.
        •   In Model 2 an incomplete structure of the model, with missing
            regulatory links, for a gene regulatory network composed of
            11 genes. Here, participants will also have to find the missing
            links. Only proteins are explicitly modeled.
DREAM Phil Bowen ALS Prediction Prize4Life


•   Goal
    •   Challenge is to predict the progression of disease in ALS patients based
        on the patient’s current disease status
    •   Specifically develop an approach to predict a given patient’s disease status
        within a year’s time based on 3 months of data
•   Data
    •   Includes demographics, medical and family history data, functional
        measures, vital signs, and lab data (blood
        chemistry/hematology/urinalysis) collected at multiple times
•   Disease progression will be calculated as the average change in (ALSFRS)
    Amyotrophic Lateral Sclerosis Functional Rating Scale over a year’s time
    from enrollment in a clinical trial
•   At the end of the challenge, the prediction submitted (based on 3 months of
    data) will be compared against the actual ALSFRS slope experienced by the
    patient over a year
Output
•   Improve disease prediction beyond the current capabilities by
    •   Developing more accurate (sensitive and specific) methods of
        predicting progression,
    •   Identifying markers (variables) that would enable a
        determination of expected future disease progression earlier
        on in the course of the disease
•   Validation
    •   Validate the model against a subset of patients that are neither
        part of the training set nor the final validation (test) set.
    •   Submit the actual code written in R language (“Validation
        Code”) and InnoCentive will run the code against the interim
        validation data set
NCI-DREAM Drug Sensitivity Prediction
           Challenge




• Use genomic information to build models capable of
  ranking the sensitivity of cancer cell lines to a set of
  small molecule compounds or their combinations
Sub Challenges
•   Sub Challenge 1
    •   Predict the sensitivity of breast cancer cell lines to previously
        untested compounds
        •   Model capable of ranking the sensitivity of 18 breast cancer
            cell lines to 31 compounds
        •   Challenge in this case is to link the drug effects to the
            underlying genetics of the 53 cell lines.
•   Sub Challenge 2
    •   Predicting compound combinations that have a synergistic
        effect in reducing viability of a DLBCL cell line
        •   Predict the activity of pairs of compounds in the DLBCL LY3
            cell line from expression profiles acquired after treatment
            of the cell line with each of 14 individual compounds
Sage Bionetworks - DREAM Breast Cancer Prognosis
                    Challenge
•   Background
    •   Molecular diagnostics for cancer therapeutic decision making are among
        the most promising applications of genomic technology
    •   Molecular profiles have proved particularly powerful in adding prognosis
        information to standard clinical practice in breast cancer
•   Trends emerging
    •   Genes defining predictive signatures of the same phenotype often do
        not overlap across studies
    •   Predictive signatures are not very robust
    •   No consensus regarding the most accurate signatures or computational
        methods for inferring predictive signatures
    •   No consensus regarding the added value of incorporating molecular
        data in addition to or instead of traditionally used clinical covariates
Goal/Challenge
•   Goal
    •   To assess the accuracy of computational models designed to predict
        breast cancer survival, based on clinical information about the patient's
        tumor as well as genome-wide molecular profiling data including gene
        expression and copy number profiles
•   Challenge
    •   Create a community-based effort to provide an unbiased assessment of
        models and methodologies for the prediction of breast cancer survival
    •   Common dataset will be provided to all participants, with a validation
        dataset held out for model evaluation
    •   Novel dataset will be generated at the end of the Challenge and used to
        provide a final, unbiased score for each model
DATA
•   Training data set from METABRIC cohort of 2000 breast cancer samples
    •   Include detailed clinical annotations
    •   10 median year survival time, gene expression and copy number data
•   Additional breast cancer datasets curated by Sage Bionetworks
    •   Can be use in the model development
•   Web based platform called Synapse
    •   Enable transparent reproducible model building and analysis
        workflows as well as sharing of data, tools and models with the
        challenge community
•   Validation dataset
    •   Derived from 300- 500 fresh frozen primary tumors with the same
        clinical annotations and survival data as the METABRIC cohort
DATA
•   Survival data 
    •   Survival data is loaded into R as a Surv object as defined in the R survival package.
    •   This object is simply a 2 column matrix with sample names on the rows and columns:
            •    time – time from diagnosis to last follow up.
            •    status – weather the patient was alive at last follow up time
•   Feature data
    •   Gene expression data.
        •   Performed on the Illumina HT 12v3 platform
        •   Loaded as Bioconductor ExpressionSet object
        •   Data normalized
•   Copy number data.
    •   Performed on the Affymetrix SNP 6.0 platform
    •   Loaded as Bioconductor ExpressionSet object
    •   Data normalized
•   Clinical covariates
    •   Loaded as a data.frame  object with features
Covariates Features
Covariates Features
Submission
•   Models built for this Challenge will be constructed using the
    R programming language and uploaded to a common platform
    (Synapse) provided by Sage Bionetworks
•   Models will be uploaded as R objects implementing a function
    called customPredict() that returns a vector of survival
    predictions when given a set of feature data as input
•   customPredict() function will be run by a validation script for
    each submitted model and resulting predictions will be scored
•   Phase 3 submissions must be accompanied by a write-up that
    includes a short description of the approach used in the final
    model
Scoring
•   Challenge models will be scored by calculating the
    concordance index between the predicted survival and
    the true survival information in the validation dataset
    (accounting for the censor variable indicating whether
    the patient was alive at last follow-up)
•   Final assessment of models and the determination of
    the best performer will be based on the concordance
    index of predictions on the test dataset in Phase 3 of
    the Challenge.
•   In addition, other scoring metrics will be considered
    depending on the suggestions of the community
    throughout the Challenge
Time Line
•   Deadline for submitting models for the Breast Cancer Prognosis Challenge
    is 5PM EST October 15th
•   Best performers will be announced at the DREAM 7 Conference taking
    place in San Francisco on November 12 to 16
•   Final assessment of all models in newly generated data
    •   Additional cohort of 350 breast cancer samples with archived fresh
        frozen tumor samples has been identified by Anne-Lise Borresen-Dale of
        Oslo University Hospital and a generous donation has been made by the
        Avon Foundation to obtain gene expression and copy number data on
        these samples
    •   Currently curation of the clinical records of this patient cohort to
        harmonize with the current METABRIC dataset and generation of the
        genomic profiling data for these samples is being carried on
    •   Aim is to generate these data by the November 12 DREAM conference

More Related Content

What's hot

Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
dagunisa
 
Systems genetics approaches to understand complex traits
Systems genetics approaches to understand complex traitsSystems genetics approaches to understand complex traits
Systems genetics approaches to understand complex traits
SOYEON KIM
 
EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...
EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...
EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...
IJDKP
 
EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...
EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...
EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...
IJDKP
 
dream
dreamdream
NetBioSIG2014-Talk by David Amar
NetBioSIG2014-Talk by David AmarNetBioSIG2014-Talk by David Amar
NetBioSIG2014-Talk by David Amar
Alexander Pico
 
AI in Bioinformatics
AI in BioinformaticsAI in Bioinformatics
AI in Bioinformatics
Ali Kishk
 
BRITEREU_finalposter
BRITEREU_finalposterBRITEREU_finalposter
BRITEREU_finalposter
Elsa Fecke
 
Machine Learning in Biology and Why It Doesn't Make Sense - Theo Knijnenburg,...
Machine Learning in Biology and Why It Doesn't Make Sense - Theo Knijnenburg,...Machine Learning in Biology and Why It Doesn't Make Sense - Theo Knijnenburg,...
Machine Learning in Biology and Why It Doesn't Make Sense - Theo Knijnenburg,...
Seattle DAML meetup
 
Evolution of Knowledge Discovery and Management
Evolution of Knowledge Discovery and Management Evolution of Knowledge Discovery and Management
Evolution of Knowledge Discovery and Management
inscit2006
 
Reconstruction and analysis of cancerspecific Gene regulatory networks from G...
Reconstruction and analysis of cancerspecific Gene regulatory networks from G...Reconstruction and analysis of cancerspecific Gene regulatory networks from G...
Reconstruction and analysis of cancerspecific Gene regulatory networks from G...
ijbbjournal
 
NetBioSIG2012 ugurdogrusoz-cbio
NetBioSIG2012 ugurdogrusoz-cbioNetBioSIG2012 ugurdogrusoz-cbio
NetBioSIG2012 ugurdogrusoz-cbio
Alexander Pico
 
Clustering Approaches for Evaluation and Analysis on Formal Gene Expression C...
Clustering Approaches for Evaluation and Analysis on Formal Gene Expression C...Clustering Approaches for Evaluation and Analysis on Formal Gene Expression C...
Clustering Approaches for Evaluation and Analysis on Formal Gene Expression C...
rahulmonikasharma
 
Digital Pathology, FDA Approval and Precision Medicine
Digital Pathology, FDA Approval and Precision MedicineDigital Pathology, FDA Approval and Precision Medicine
Digital Pathology, FDA Approval and Precision Medicine
Joel Saltz
 
Data preprocessing
Data preprocessingData preprocessing
Data preprocessing
Kimberly Williams
 
Computational biology
Computational biologyComputational biology
Computational biology
Zeina Abdelmoez
 
iOmics
iOmicsiOmics
computer simulation in pharmacokinetics and pharmacodynamics
 computer simulation in pharmacokinetics and pharmacodynamics computer simulation in pharmacokinetics and pharmacodynamics
computer simulation in pharmacokinetics and pharmacodynamics
SUJITHA MARY
 
INBIOMEDvision Workshop at MIE 2011. Victoria López
INBIOMEDvision Workshop at MIE 2011. Victoria LópezINBIOMEDvision Workshop at MIE 2011. Victoria López
INBIOMEDvision Workshop at MIE 2011. Victoria López
INBIOMEDvision
 
Efficiency of Using Sequence Discovery for Polymorphism in DNA Sequence
Efficiency of Using Sequence Discovery for Polymorphism in DNA SequenceEfficiency of Using Sequence Discovery for Polymorphism in DNA Sequence
Efficiency of Using Sequence Discovery for Polymorphism in DNA Sequence
IJSTA
 

What's hot (20)

Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Systems genetics approaches to understand complex traits
Systems genetics approaches to understand complex traitsSystems genetics approaches to understand complex traits
Systems genetics approaches to understand complex traits
 
EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...
EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...
EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...
 
EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...
EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...
EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...
 
dream
dreamdream
dream
 
NetBioSIG2014-Talk by David Amar
NetBioSIG2014-Talk by David AmarNetBioSIG2014-Talk by David Amar
NetBioSIG2014-Talk by David Amar
 
AI in Bioinformatics
AI in BioinformaticsAI in Bioinformatics
AI in Bioinformatics
 
BRITEREU_finalposter
BRITEREU_finalposterBRITEREU_finalposter
BRITEREU_finalposter
 
Machine Learning in Biology and Why It Doesn't Make Sense - Theo Knijnenburg,...
Machine Learning in Biology and Why It Doesn't Make Sense - Theo Knijnenburg,...Machine Learning in Biology and Why It Doesn't Make Sense - Theo Knijnenburg,...
Machine Learning in Biology and Why It Doesn't Make Sense - Theo Knijnenburg,...
 
Evolution of Knowledge Discovery and Management
Evolution of Knowledge Discovery and Management Evolution of Knowledge Discovery and Management
Evolution of Knowledge Discovery and Management
 
Reconstruction and analysis of cancerspecific Gene regulatory networks from G...
Reconstruction and analysis of cancerspecific Gene regulatory networks from G...Reconstruction and analysis of cancerspecific Gene regulatory networks from G...
Reconstruction and analysis of cancerspecific Gene regulatory networks from G...
 
NetBioSIG2012 ugurdogrusoz-cbio
NetBioSIG2012 ugurdogrusoz-cbioNetBioSIG2012 ugurdogrusoz-cbio
NetBioSIG2012 ugurdogrusoz-cbio
 
Clustering Approaches for Evaluation and Analysis on Formal Gene Expression C...
Clustering Approaches for Evaluation and Analysis on Formal Gene Expression C...Clustering Approaches for Evaluation and Analysis on Formal Gene Expression C...
Clustering Approaches for Evaluation and Analysis on Formal Gene Expression C...
 
Digital Pathology, FDA Approval and Precision Medicine
Digital Pathology, FDA Approval and Precision MedicineDigital Pathology, FDA Approval and Precision Medicine
Digital Pathology, FDA Approval and Precision Medicine
 
Data preprocessing
Data preprocessingData preprocessing
Data preprocessing
 
Computational biology
Computational biologyComputational biology
Computational biology
 
iOmics
iOmicsiOmics
iOmics
 
computer simulation in pharmacokinetics and pharmacodynamics
 computer simulation in pharmacokinetics and pharmacodynamics computer simulation in pharmacokinetics and pharmacodynamics
computer simulation in pharmacokinetics and pharmacodynamics
 
INBIOMEDvision Workshop at MIE 2011. Victoria López
INBIOMEDvision Workshop at MIE 2011. Victoria LópezINBIOMEDvision Workshop at MIE 2011. Victoria López
INBIOMEDvision Workshop at MIE 2011. Victoria López
 
Efficiency of Using Sequence Discovery for Polymorphism in DNA Sequence
Efficiency of Using Sequence Discovery for Polymorphism in DNA SequenceEfficiency of Using Sequence Discovery for Polymorphism in DNA Sequence
Efficiency of Using Sequence Discovery for Polymorphism in DNA Sequence
 

Viewers also liked

H-Maps: An Efficient Approach for Graphical Visualization and Navigation of T...
H-Maps: An Efficient Approach for Graphical Visualization and Navigation of T...H-Maps: An Efficient Approach for Graphical Visualization and Navigation of T...
H-Maps: An Efficient Approach for Graphical Visualization and Navigation of T...
tmra
 
Creating Topic Maps Ontologies for Space Experiments
Creating Topic Maps Ontologies for Space ExperimentsCreating Topic Maps Ontologies for Space Experiments
Creating Topic Maps Ontologies for Space Experiments
tmra
 
Computational and Neural Engineering Journal - SciDocPublishers
Computational and Neural Engineering Journal - SciDocPublishersComputational and Neural Engineering Journal - SciDocPublishers
Computational and Neural Engineering Journal - SciDocPublishers
Scidoc Publishers
 
HAP - Measuring Effectiveness in international development
HAP - Measuring Effectiveness in international developmentHAP - Measuring Effectiveness in international development
HAP - Measuring Effectiveness in international development
NIDOS
 
Creating a new language to support open innovation
Creating a new language to support open innovationCreating a new language to support open innovation
Creating a new language to support open innovation
Mike Hucka
 
current evidence based treatment for gynecologic cancer
current evidence based treatment for gynecologic cancercurrent evidence based treatment for gynecologic cancer
current evidence based treatment for gynecologic cancer
Scidoc Publishers
 
Large scale machine learning challenges for systems biology
Large scale machine learning challenges for systems biologyLarge scale machine learning challenges for systems biology
Large scale machine learning challenges for systems biology
Maté Ongenaert
 
01 hap4 4-space - Hourly Analysis Program - Carrier
01 hap4 4-space - Hourly Analysis Program - Carrier01 hap4 4-space - Hourly Analysis Program - Carrier
01 hap4 4-space - Hourly Analysis Program - Carrier
Monzer Salahdine
 
2014-DREAM-Certificates-OS
2014-DREAM-Certificates-OS2014-DREAM-Certificates-OS
2014-DREAM-Certificates-OS
Tatiana Karelina
 
Computational Approaches to Systems Biology
Computational Approaches to Systems BiologyComputational Approaches to Systems Biology
Computational Approaches to Systems Biology
Mike Hucka
 

Viewers also liked (10)

H-Maps: An Efficient Approach for Graphical Visualization and Navigation of T...
H-Maps: An Efficient Approach for Graphical Visualization and Navigation of T...H-Maps: An Efficient Approach for Graphical Visualization and Navigation of T...
H-Maps: An Efficient Approach for Graphical Visualization and Navigation of T...
 
Creating Topic Maps Ontologies for Space Experiments
Creating Topic Maps Ontologies for Space ExperimentsCreating Topic Maps Ontologies for Space Experiments
Creating Topic Maps Ontologies for Space Experiments
 
Computational and Neural Engineering Journal - SciDocPublishers
Computational and Neural Engineering Journal - SciDocPublishersComputational and Neural Engineering Journal - SciDocPublishers
Computational and Neural Engineering Journal - SciDocPublishers
 
HAP - Measuring Effectiveness in international development
HAP - Measuring Effectiveness in international developmentHAP - Measuring Effectiveness in international development
HAP - Measuring Effectiveness in international development
 
Creating a new language to support open innovation
Creating a new language to support open innovationCreating a new language to support open innovation
Creating a new language to support open innovation
 
current evidence based treatment for gynecologic cancer
current evidence based treatment for gynecologic cancercurrent evidence based treatment for gynecologic cancer
current evidence based treatment for gynecologic cancer
 
Large scale machine learning challenges for systems biology
Large scale machine learning challenges for systems biologyLarge scale machine learning challenges for systems biology
Large scale machine learning challenges for systems biology
 
01 hap4 4-space - Hourly Analysis Program - Carrier
01 hap4 4-space - Hourly Analysis Program - Carrier01 hap4 4-space - Hourly Analysis Program - Carrier
01 hap4 4-space - Hourly Analysis Program - Carrier
 
2014-DREAM-Certificates-OS
2014-DREAM-Certificates-OS2014-DREAM-Certificates-OS
2014-DREAM-Certificates-OS
 
Computational Approaches to Systems Biology
Computational Approaches to Systems BiologyComputational Approaches to Systems Biology
Computational Approaches to Systems Biology
 

Similar to DREAM Challenge

Bioinformatics-R program의 실례
Bioinformatics-R program의 실례Bioinformatics-R program의 실례
Bioinformatics-R program의 실례
mothersafe
 
Predicting Life Expectancy of Hepatitis B Patients
Predicting Life Expectancy of Hepatitis B PatientsPredicting Life Expectancy of Hepatitis B Patients
Predicting Life Expectancy of Hepatitis B Patients
nabeelali11101999
 
BFRG AI Investor Aug 2023
BFRG AI Investor Aug 2023BFRG AI Investor Aug 2023
BFRG AI Investor Aug 2023
RedChip Companies, Inc.
 
tranSMART Community Meeting 5-7 Nov 13 - Session 3: tranSMART a Data Warehous...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: tranSMART a Data Warehous...tranSMART Community Meeting 5-7 Nov 13 - Session 3: tranSMART a Data Warehous...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: tranSMART a Data Warehous...
David Peyruc
 
Cancer Care using Quahog Health Decision System
Cancer Care using Quahog Health Decision SystemCancer Care using Quahog Health Decision System
Cancer Care using Quahog Health Decision System
Quahog Life Sciences
 
coad_machine_learning
coad_machine_learningcoad_machine_learning
coad_machine_learning
Ford Sleeman
 
Datamining in BreastCancer.pptx
Datamining in BreastCancer.pptxDatamining in BreastCancer.pptx
Datamining in BreastCancer.pptx
MaligireddyTanujaRed1
 
[DSC Europe 23][DigiHealth] Vesna Pajic - Machine Learning Techniques for omi...
[DSC Europe 23][DigiHealth] Vesna Pajic - Machine Learning Techniques for omi...[DSC Europe 23][DigiHealth] Vesna Pajic - Machine Learning Techniques for omi...
[DSC Europe 23][DigiHealth] Vesna Pajic - Machine Learning Techniques for omi...
DataScienceConferenc1
 
COMPUTER SIMULATIONS IN PHARMACOKINETICS AND PHARMACODYNAMICS
COMPUTER SIMULATIONS INPHARMACOKINETICS ANDPHARMACODYNAMICSCOMPUTER SIMULATIONS INPHARMACOKINETICS ANDPHARMACODYNAMICS
COMPUTER SIMULATIONS IN PHARMACOKINETICS AND PHARMACODYNAMICS
naazmohd2
 
Pathomics, Clinical Studies, and Cancer Surveillance
Pathomics, Clinical Studies, and Cancer SurveillancePathomics, Clinical Studies, and Cancer Surveillance
Pathomics, Clinical Studies, and Cancer Surveillance
Joel Saltz
 
Artificial Intelligence in pathology
Artificial Intelligence in pathologyArtificial Intelligence in pathology
Artificial Intelligence in pathology
nehaSingh1543
 
DataMining Techniques in BreastCancer.pptx
DataMining Techniques in BreastCancer.pptxDataMining Techniques in BreastCancer.pptx
DataMining Techniques in BreastCancer.pptx
MaligireddyTanujaRed1
 
computer simulations.pptx
computer simulations.pptxcomputer simulations.pptx
computer simulations.pptx
Dr Pavani A
 
Genomic Big Data Management, Integration and Mining - Emanuel Weitschek
Genomic Big Data Management, Integration and Mining - Emanuel WeitschekGenomic Big Data Management, Integration and Mining - Emanuel Weitschek
Genomic Big Data Management, Integration and Mining - Emanuel Weitschek
Data Driven Innovation
 
Lisa_Grimm
Lisa_GrimmLisa_Grimm
Lisa_Grimm
Lisa (Prior) Grimm
 
PSN for Precision Medicine
PSN for Precision MedicinePSN for Precision Medicine
PSN for Precision Medicine
Thi K. Tran-Nguyen, PhD
 
cloudapplications.pptx
cloudapplications.pptxcloudapplications.pptx
cloudapplications.pptx
Dr. Thippeswamy S.
 
Cloud applications - Protein Structure Predication and gene expression data...
Cloud applications - Protein Structure Predication  and  gene expression data...Cloud applications - Protein Structure Predication  and  gene expression data...
Cloud applications - Protein Structure Predication and gene expression data...
Pushpendra Singh Dangi
 
Bioinformatics tools for development, analysis, and preclinical testing of in...
Bioinformatics tools for development, analysis, and preclinical testing of in...Bioinformatics tools for development, analysis, and preclinical testing of in...
Bioinformatics tools for development, analysis, and preclinical testing of in...
Malachi Griffith
 
16
1616

Similar to DREAM Challenge (20)

Bioinformatics-R program의 실례
Bioinformatics-R program의 실례Bioinformatics-R program의 실례
Bioinformatics-R program의 실례
 
Predicting Life Expectancy of Hepatitis B Patients
Predicting Life Expectancy of Hepatitis B PatientsPredicting Life Expectancy of Hepatitis B Patients
Predicting Life Expectancy of Hepatitis B Patients
 
BFRG AI Investor Aug 2023
BFRG AI Investor Aug 2023BFRG AI Investor Aug 2023
BFRG AI Investor Aug 2023
 
tranSMART Community Meeting 5-7 Nov 13 - Session 3: tranSMART a Data Warehous...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: tranSMART a Data Warehous...tranSMART Community Meeting 5-7 Nov 13 - Session 3: tranSMART a Data Warehous...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: tranSMART a Data Warehous...
 
Cancer Care using Quahog Health Decision System
Cancer Care using Quahog Health Decision SystemCancer Care using Quahog Health Decision System
Cancer Care using Quahog Health Decision System
 
coad_machine_learning
coad_machine_learningcoad_machine_learning
coad_machine_learning
 
Datamining in BreastCancer.pptx
Datamining in BreastCancer.pptxDatamining in BreastCancer.pptx
Datamining in BreastCancer.pptx
 
[DSC Europe 23][DigiHealth] Vesna Pajic - Machine Learning Techniques for omi...
[DSC Europe 23][DigiHealth] Vesna Pajic - Machine Learning Techniques for omi...[DSC Europe 23][DigiHealth] Vesna Pajic - Machine Learning Techniques for omi...
[DSC Europe 23][DigiHealth] Vesna Pajic - Machine Learning Techniques for omi...
 
COMPUTER SIMULATIONS IN PHARMACOKINETICS AND PHARMACODYNAMICS
COMPUTER SIMULATIONS INPHARMACOKINETICS ANDPHARMACODYNAMICSCOMPUTER SIMULATIONS INPHARMACOKINETICS ANDPHARMACODYNAMICS
COMPUTER SIMULATIONS IN PHARMACOKINETICS AND PHARMACODYNAMICS
 
Pathomics, Clinical Studies, and Cancer Surveillance
Pathomics, Clinical Studies, and Cancer SurveillancePathomics, Clinical Studies, and Cancer Surveillance
Pathomics, Clinical Studies, and Cancer Surveillance
 
Artificial Intelligence in pathology
Artificial Intelligence in pathologyArtificial Intelligence in pathology
Artificial Intelligence in pathology
 
DataMining Techniques in BreastCancer.pptx
DataMining Techniques in BreastCancer.pptxDataMining Techniques in BreastCancer.pptx
DataMining Techniques in BreastCancer.pptx
 
computer simulations.pptx
computer simulations.pptxcomputer simulations.pptx
computer simulations.pptx
 
Genomic Big Data Management, Integration and Mining - Emanuel Weitschek
Genomic Big Data Management, Integration and Mining - Emanuel WeitschekGenomic Big Data Management, Integration and Mining - Emanuel Weitschek
Genomic Big Data Management, Integration and Mining - Emanuel Weitschek
 
Lisa_Grimm
Lisa_GrimmLisa_Grimm
Lisa_Grimm
 
PSN for Precision Medicine
PSN for Precision MedicinePSN for Precision Medicine
PSN for Precision Medicine
 
cloudapplications.pptx
cloudapplications.pptxcloudapplications.pptx
cloudapplications.pptx
 
Cloud applications - Protein Structure Predication and gene expression data...
Cloud applications - Protein Structure Predication  and  gene expression data...Cloud applications - Protein Structure Predication  and  gene expression data...
Cloud applications - Protein Structure Predication and gene expression data...
 
Bioinformatics tools for development, analysis, and preclinical testing of in...
Bioinformatics tools for development, analysis, and preclinical testing of in...Bioinformatics tools for development, analysis, and preclinical testing of in...
Bioinformatics tools for development, analysis, and preclinical testing of in...
 
16
1616
16
 

Recently uploaded

MDP on air pollution of class 8 year 2024-2025
MDP on air pollution of class 8 year 2024-2025MDP on air pollution of class 8 year 2024-2025
MDP on air pollution of class 8 year 2024-2025
khuleseema60
 
Haunted Houses by H W Longfellow for class 10
Haunted Houses by H W Longfellow for class 10Haunted Houses by H W Longfellow for class 10
Haunted Houses by H W Longfellow for class 10
nitinpv4ai
 
Gender and Mental Health - Counselling and Family Therapy Applications and In...
Gender and Mental Health - Counselling and Family Therapy Applications and In...Gender and Mental Health - Counselling and Family Therapy Applications and In...
Gender and Mental Health - Counselling and Family Therapy Applications and In...
PsychoTech Services
 
BÀI TẬP DẠY THÊM TIẾNG ANH LỚP 7 CẢ NĂM FRIENDS PLUS SÁCH CHÂN TRỜI SÁNG TẠO ...
BÀI TẬP DẠY THÊM TIẾNG ANH LỚP 7 CẢ NĂM FRIENDS PLUS SÁCH CHÂN TRỜI SÁNG TẠO ...BÀI TẬP DẠY THÊM TIẾNG ANH LỚP 7 CẢ NĂM FRIENDS PLUS SÁCH CHÂN TRỜI SÁNG TẠO ...
BÀI TẬP DẠY THÊM TIẾNG ANH LỚP 7 CẢ NĂM FRIENDS PLUS SÁCH CHÂN TRỜI SÁNG TẠO ...
Nguyen Thanh Tu Collection
 
Bossa N’ Roll Records by Ismael Vazquez.
Bossa N’ Roll Records by Ismael Vazquez.Bossa N’ Roll Records by Ismael Vazquez.
Bossa N’ Roll Records by Ismael Vazquez.
IsmaelVazquez38
 
Educational Technology in the Health Sciences
Educational Technology in the Health SciencesEducational Technology in the Health Sciences
Educational Technology in the Health Sciences
Iris Thiele Isip-Tan
 
Elevate Your Nonprofit's Online Presence_ A Guide to Effective SEO Strategies...
Elevate Your Nonprofit's Online Presence_ A Guide to Effective SEO Strategies...Elevate Your Nonprofit's Online Presence_ A Guide to Effective SEO Strategies...
Elevate Your Nonprofit's Online Presence_ A Guide to Effective SEO Strategies...
TechSoup
 
Standardized tool for Intelligence test.
Standardized tool for Intelligence test.Standardized tool for Intelligence test.
Standardized tool for Intelligence test.
deepaannamalai16
 
Chapter wise All Notes of First year Basic Civil Engineering.pptx
Chapter wise All Notes of First year Basic Civil Engineering.pptxChapter wise All Notes of First year Basic Civil Engineering.pptx
Chapter wise All Notes of First year Basic Civil Engineering.pptx
Denish Jangid
 
Benner "Expanding Pathways to Publishing Careers"
Benner "Expanding Pathways to Publishing Careers"Benner "Expanding Pathways to Publishing Careers"
Benner "Expanding Pathways to Publishing Careers"
National Information Standards Organization (NISO)
 
Andreas Schleicher presents PISA 2022 Volume III - Creative Thinking - 18 Jun...
Andreas Schleicher presents PISA 2022 Volume III - Creative Thinking - 18 Jun...Andreas Schleicher presents PISA 2022 Volume III - Creative Thinking - 18 Jun...
Andreas Schleicher presents PISA 2022 Volume III - Creative Thinking - 18 Jun...
EduSkills OECD
 
Geography as a Discipline Chapter 1 __ Class 11 Geography NCERT _ Class Notes...
Geography as a Discipline Chapter 1 __ Class 11 Geography NCERT _ Class Notes...Geography as a Discipline Chapter 1 __ Class 11 Geography NCERT _ Class Notes...
Geography as a Discipline Chapter 1 __ Class 11 Geography NCERT _ Class Notes...
ImMuslim
 
The basics of sentences session 7pptx.pptx
The basics of sentences session 7pptx.pptxThe basics of sentences session 7pptx.pptx
The basics of sentences session 7pptx.pptx
heathfieldcps1
 
Oliver Asks for More by Charles Dickens (9)
Oliver Asks for More by Charles Dickens (9)Oliver Asks for More by Charles Dickens (9)
Oliver Asks for More by Charles Dickens (9)
nitinpv4ai
 
Wound healing PPT
Wound healing PPTWound healing PPT
Wound healing PPT
Jyoti Chand
 
REASIGNACION 2024 UGEL CHUPACA 2024 UGEL CHUPACA.pdf
REASIGNACION 2024 UGEL CHUPACA 2024 UGEL CHUPACA.pdfREASIGNACION 2024 UGEL CHUPACA 2024 UGEL CHUPACA.pdf
REASIGNACION 2024 UGEL CHUPACA 2024 UGEL CHUPACA.pdf
giancarloi8888
 
Présentationvvvvvvvvvvvvvvvvvvvvvvvvvvvv2.pptx
Présentationvvvvvvvvvvvvvvvvvvvvvvvvvvvv2.pptxPrésentationvvvvvvvvvvvvvvvvvvvvvvvvvvvv2.pptx
Présentationvvvvvvvvvvvvvvvvvvvvvvvvvvvv2.pptx
siemaillard
 
How Barcodes Can Be Leveraged Within Odoo 17
How Barcodes Can Be Leveraged Within Odoo 17How Barcodes Can Be Leveraged Within Odoo 17
How Barcodes Can Be Leveraged Within Odoo 17
Celine George
 
Electric Fetus - Record Store Scavenger Hunt
Electric Fetus - Record Store Scavenger HuntElectric Fetus - Record Store Scavenger Hunt
Electric Fetus - Record Store Scavenger Hunt
RamseyBerglund
 
مصحف القراءات العشر أعد أحرف الخلاف سمير بسيوني.pdf
مصحف القراءات العشر   أعد أحرف الخلاف سمير بسيوني.pdfمصحف القراءات العشر   أعد أحرف الخلاف سمير بسيوني.pdf
مصحف القراءات العشر أعد أحرف الخلاف سمير بسيوني.pdf
سمير بسيوني
 

Recently uploaded (20)

MDP on air pollution of class 8 year 2024-2025
MDP on air pollution of class 8 year 2024-2025MDP on air pollution of class 8 year 2024-2025
MDP on air pollution of class 8 year 2024-2025
 
Haunted Houses by H W Longfellow for class 10
Haunted Houses by H W Longfellow for class 10Haunted Houses by H W Longfellow for class 10
Haunted Houses by H W Longfellow for class 10
 
Gender and Mental Health - Counselling and Family Therapy Applications and In...
Gender and Mental Health - Counselling and Family Therapy Applications and In...Gender and Mental Health - Counselling and Family Therapy Applications and In...
Gender and Mental Health - Counselling and Family Therapy Applications and In...
 
BÀI TẬP DẠY THÊM TIẾNG ANH LỚP 7 CẢ NĂM FRIENDS PLUS SÁCH CHÂN TRỜI SÁNG TẠO ...
BÀI TẬP DẠY THÊM TIẾNG ANH LỚP 7 CẢ NĂM FRIENDS PLUS SÁCH CHÂN TRỜI SÁNG TẠO ...BÀI TẬP DẠY THÊM TIẾNG ANH LỚP 7 CẢ NĂM FRIENDS PLUS SÁCH CHÂN TRỜI SÁNG TẠO ...
BÀI TẬP DẠY THÊM TIẾNG ANH LỚP 7 CẢ NĂM FRIENDS PLUS SÁCH CHÂN TRỜI SÁNG TẠO ...
 
Bossa N’ Roll Records by Ismael Vazquez.
Bossa N’ Roll Records by Ismael Vazquez.Bossa N’ Roll Records by Ismael Vazquez.
Bossa N’ Roll Records by Ismael Vazquez.
 
Educational Technology in the Health Sciences
Educational Technology in the Health SciencesEducational Technology in the Health Sciences
Educational Technology in the Health Sciences
 
Elevate Your Nonprofit's Online Presence_ A Guide to Effective SEO Strategies...
Elevate Your Nonprofit's Online Presence_ A Guide to Effective SEO Strategies...Elevate Your Nonprofit's Online Presence_ A Guide to Effective SEO Strategies...
Elevate Your Nonprofit's Online Presence_ A Guide to Effective SEO Strategies...
 
Standardized tool for Intelligence test.
Standardized tool for Intelligence test.Standardized tool for Intelligence test.
Standardized tool for Intelligence test.
 
Chapter wise All Notes of First year Basic Civil Engineering.pptx
Chapter wise All Notes of First year Basic Civil Engineering.pptxChapter wise All Notes of First year Basic Civil Engineering.pptx
Chapter wise All Notes of First year Basic Civil Engineering.pptx
 
Benner "Expanding Pathways to Publishing Careers"
Benner "Expanding Pathways to Publishing Careers"Benner "Expanding Pathways to Publishing Careers"
Benner "Expanding Pathways to Publishing Careers"
 
Andreas Schleicher presents PISA 2022 Volume III - Creative Thinking - 18 Jun...
Andreas Schleicher presents PISA 2022 Volume III - Creative Thinking - 18 Jun...Andreas Schleicher presents PISA 2022 Volume III - Creative Thinking - 18 Jun...
Andreas Schleicher presents PISA 2022 Volume III - Creative Thinking - 18 Jun...
 
Geography as a Discipline Chapter 1 __ Class 11 Geography NCERT _ Class Notes...
Geography as a Discipline Chapter 1 __ Class 11 Geography NCERT _ Class Notes...Geography as a Discipline Chapter 1 __ Class 11 Geography NCERT _ Class Notes...
Geography as a Discipline Chapter 1 __ Class 11 Geography NCERT _ Class Notes...
 
The basics of sentences session 7pptx.pptx
The basics of sentences session 7pptx.pptxThe basics of sentences session 7pptx.pptx
The basics of sentences session 7pptx.pptx
 
Oliver Asks for More by Charles Dickens (9)
Oliver Asks for More by Charles Dickens (9)Oliver Asks for More by Charles Dickens (9)
Oliver Asks for More by Charles Dickens (9)
 
Wound healing PPT
Wound healing PPTWound healing PPT
Wound healing PPT
 
REASIGNACION 2024 UGEL CHUPACA 2024 UGEL CHUPACA.pdf
REASIGNACION 2024 UGEL CHUPACA 2024 UGEL CHUPACA.pdfREASIGNACION 2024 UGEL CHUPACA 2024 UGEL CHUPACA.pdf
REASIGNACION 2024 UGEL CHUPACA 2024 UGEL CHUPACA.pdf
 
Présentationvvvvvvvvvvvvvvvvvvvvvvvvvvvv2.pptx
Présentationvvvvvvvvvvvvvvvvvvvvvvvvvvvv2.pptxPrésentationvvvvvvvvvvvvvvvvvvvvvvvvvvvv2.pptx
Présentationvvvvvvvvvvvvvvvvvvvvvvvvvvvv2.pptx
 
How Barcodes Can Be Leveraged Within Odoo 17
How Barcodes Can Be Leveraged Within Odoo 17How Barcodes Can Be Leveraged Within Odoo 17
How Barcodes Can Be Leveraged Within Odoo 17
 
Electric Fetus - Record Store Scavenger Hunt
Electric Fetus - Record Store Scavenger HuntElectric Fetus - Record Store Scavenger Hunt
Electric Fetus - Record Store Scavenger Hunt
 
مصحف القراءات العشر أعد أحرف الخلاف سمير بسيوني.pdf
مصحف القراءات العشر   أعد أحرف الخلاف سمير بسيوني.pdfمصحف القراءات العشر   أعد أحرف الخلاف سمير بسيوني.pdf
مصحف القراءات العشر أعد أحرف الخلاف سمير بسيوني.pdf
 

DREAM Challenge

  • 1. DREAM Challenge Gustavo Stolovitzky, IBM Computational Biology Center Andrea Califano, Columbia University
  • 2. DREAM • DREAM is a Dialogue for Reverse Engineering Assessments and Methods. • The main objective is to catalyze the interaction between experiment and theory in the area of cellular network inference and quantitative model building in systems biology
  • 3. Challenges • Network Topology and Parameter Inference Challenge • Sage Bionetworks - DREAM Breast Cancer Prognosis C • The DREAM-Phil Bowen ALS Prediction Prize4Life • NCI-DREAM Drug Sensitivity Prediction Challenge
  • 4. Project/Challenges • Network Topology and Parameter Inference challenge • Develop/apply optimization methods including the selection of the most informative experiments, to accurately estimate parameters and predict outcomes of perturbations in Systems Biology models given • In Model 1 the complete structure of the model (including expressions for the kinetic rate laws) for a gene regulatory network composed of 9 genes. Protein and mRNA are explicitly modeled. • In Model 2 an incomplete structure of the model, with missing regulatory links, for a gene regulatory network composed of 11 genes. Here, participants will also have to find the missing links. Only proteins are explicitly modeled.
  • 5. DREAM Phil Bowen ALS Prediction Prize4Life • Goal • Challenge is to predict the progression of disease in ALS patients based on the patient’s current disease status • Specifically develop an approach to predict a given patient’s disease status within a year’s time based on 3 months of data • Data • Includes demographics, medical and family history data, functional measures, vital signs, and lab data (blood chemistry/hematology/urinalysis) collected at multiple times • Disease progression will be calculated as the average change in (ALSFRS) Amyotrophic Lateral Sclerosis Functional Rating Scale over a year’s time from enrollment in a clinical trial • At the end of the challenge, the prediction submitted (based on 3 months of data) will be compared against the actual ALSFRS slope experienced by the patient over a year
  • 6. Output • Improve disease prediction beyond the current capabilities by • Developing more accurate (sensitive and specific) methods of predicting progression, • Identifying markers (variables) that would enable a determination of expected future disease progression earlier on in the course of the disease • Validation • Validate the model against a subset of patients that are neither part of the training set nor the final validation (test) set. • Submit the actual code written in R language (“Validation Code”) and InnoCentive will run the code against the interim validation data set
  • 7. NCI-DREAM Drug Sensitivity Prediction Challenge • Use genomic information to build models capable of ranking the sensitivity of cancer cell lines to a set of small molecule compounds or their combinations
  • 8. Sub Challenges • Sub Challenge 1 • Predict the sensitivity of breast cancer cell lines to previously untested compounds • Model capable of ranking the sensitivity of 18 breast cancer cell lines to 31 compounds • Challenge in this case is to link the drug effects to the underlying genetics of the 53 cell lines. • Sub Challenge 2 • Predicting compound combinations that have a synergistic effect in reducing viability of a DLBCL cell line • Predict the activity of pairs of compounds in the DLBCL LY3 cell line from expression profiles acquired after treatment of the cell line with each of 14 individual compounds
  • 9. Sage Bionetworks - DREAM Breast Cancer Prognosis Challenge • Background • Molecular diagnostics for cancer therapeutic decision making are among the most promising applications of genomic technology • Molecular profiles have proved particularly powerful in adding prognosis information to standard clinical practice in breast cancer • Trends emerging • Genes defining predictive signatures of the same phenotype often do not overlap across studies • Predictive signatures are not very robust • No consensus regarding the most accurate signatures or computational methods for inferring predictive signatures • No consensus regarding the added value of incorporating molecular data in addition to or instead of traditionally used clinical covariates
  • 10. Goal/Challenge • Goal • To assess the accuracy of computational models designed to predict breast cancer survival, based on clinical information about the patient's tumor as well as genome-wide molecular profiling data including gene expression and copy number profiles • Challenge • Create a community-based effort to provide an unbiased assessment of models and methodologies for the prediction of breast cancer survival • Common dataset will be provided to all participants, with a validation dataset held out for model evaluation • Novel dataset will be generated at the end of the Challenge and used to provide a final, unbiased score for each model
  • 11. DATA • Training data set from METABRIC cohort of 2000 breast cancer samples • Include detailed clinical annotations • 10 median year survival time, gene expression and copy number data • Additional breast cancer datasets curated by Sage Bionetworks • Can be use in the model development • Web based platform called Synapse • Enable transparent reproducible model building and analysis workflows as well as sharing of data, tools and models with the challenge community • Validation dataset • Derived from 300- 500 fresh frozen primary tumors with the same clinical annotations and survival data as the METABRIC cohort
  • 12. DATA • Survival data  • Survival data is loaded into R as a Surv object as defined in the R survival package. • This object is simply a 2 column matrix with sample names on the rows and columns: • time – time from diagnosis to last follow up. • status – weather the patient was alive at last follow up time • Feature data • Gene expression data. • Performed on the Illumina HT 12v3 platform • Loaded as Bioconductor ExpressionSet object • Data normalized • Copy number data. • Performed on the Affymetrix SNP 6.0 platform • Loaded as Bioconductor ExpressionSet object • Data normalized • Clinical covariates • Loaded as a data.frame  object with features
  • 15. Submission • Models built for this Challenge will be constructed using the R programming language and uploaded to a common platform (Synapse) provided by Sage Bionetworks • Models will be uploaded as R objects implementing a function called customPredict() that returns a vector of survival predictions when given a set of feature data as input • customPredict() function will be run by a validation script for each submitted model and resulting predictions will be scored • Phase 3 submissions must be accompanied by a write-up that includes a short description of the approach used in the final model
  • 16. Scoring • Challenge models will be scored by calculating the concordance index between the predicted survival and the true survival information in the validation dataset (accounting for the censor variable indicating whether the patient was alive at last follow-up) • Final assessment of models and the determination of the best performer will be based on the concordance index of predictions on the test dataset in Phase 3 of the Challenge. • In addition, other scoring metrics will be considered depending on the suggestions of the community throughout the Challenge
  • 17. Time Line • Deadline for submitting models for the Breast Cancer Prognosis Challenge is 5PM EST October 15th • Best performers will be announced at the DREAM 7 Conference taking place in San Francisco on November 12 to 16 • Final assessment of all models in newly generated data • Additional cohort of 350 breast cancer samples with archived fresh frozen tumor samples has been identified by Anne-Lise Borresen-Dale of Oslo University Hospital and a generous donation has been made by the Avon Foundation to obtain gene expression and copy number data on these samples • Currently curation of the clinical records of this patient cohort to harmonize with the current METABRIC dataset and generation of the genomic profiling data for these samples is being carried on • Aim is to generate these data by the November 12 DREAM conference

Editor's Notes

  1. reverse engineering of gene networks has been on the identification of causal interaction topologies between genes. how can we decide between models having very similar topologies and how do we characterize the actual kinetics of these networks in a way that accurately reflects the causal relationships implied in the proposed topology?
  2. Amyotrophic Lateral Sclerosis (ALS)–also known as Lou Gehrig’s disease (in the US) or Motor Neurone disease (outside the US)–is a fatal neurological disease causing death of the nerve cells in the brain and spinal cord which control voluntary muscle movements.  In the early stages of the disease, it is currently very difficult to determine whether a given patient will experience slow or fast disease progression. lack of specific and reliable predictors.
  3. challenge 1 - genomic characterization of 53 cell lines;GI50 concentrations for 31 compounds on 35 cell lines; 18 cell lines for which GI 50 concentrations are not given DLBCL - diffuse large B cell lymphoma cell lines
  4. covariate - may be of direct interest or it may be a confounding or interacting variable.
  5. METABRIC - Molecular taxonomy of Breast Cancer International Consortium