SlideShare a Scribd company logo
1 / 27
What Can Community Challenges
do for You?
Iddo Friedberg
Iowa State University
2 / 27
What Can Community Challenges
do for You?
Iddo Friedberg
Iowa State University
3 / 27
Science is Symbiotic
Discovery
Methods
4 / 27
Science is Symbiotic
5 / 27
There are many methods, which is best?
6 / 27
What is “best”?
7 / 27
Example: CASP
Critical Assessment of Structure Prediction
Problem: protein folding, or, predicting structure from sequence
Community: Structural biologists & bioinformaticians
Challenge set: proteins whose structures were experimentally determined. Varying difficulties
Assessment metrics: Root Mean Square Deviation, Q3, etc.
8 / 27
Example: CAFA
Critical Assessment of Function Annotation
Predict sequence from Function
Community: function predictors, ontologists, experimental biologists
Problem: given a protein, which ontology terms are associated with it?
Challenge set: proteins that organizers know their functions, but predictors don’t
9 / 27
Biomedical prediction problems
●
Protein Sequence to Structure
●
Protein Sequence to Function
●
Image to Phenotype
●
Phenotype to Genotype (disease)
●
Genotype to Phenotype
●
Symptoms to Disease
10 / 27
A History of Critical Assessments
TREC: Text Retrieval Conference run by NIST
CASP: structure prediction
CAPRI: Protein Interaction
DREAM: umbrella – many biomedical
challenges
CAGI: genotype / phenotype
CACAO: educational
11 / 27
Precision Recall on Ontologies
DNA binding
True function
Predicted function
12 / 27
Precision Recall on Ontologies
Nucleic acid binding
DNA binding
Binding
True function
Predicted function
13 / 27
Precision Recall on Ontologies
Nucleic acid binding
DNA binding
Binding
rRNA binding
RNA binding
True Positives : 2
False Positives: 2
False Negatives: 1 True function
Predicted function
14 / 27
Precision Recall
00 1
1
Recall
Precision
2/4
2/3
15 / 27
Adding Confidence as Threshold
AUTHOR Dr. Natalia Alianovna
MODEL 1
KEYWORDS sequence alignment.
T96060020120 GO:0008270 0.80
T96060020120 GO:0003700 0.80
T96060020120 GO:0006351 0.80
T96060020119 GO:0005730 0.01
T96060020119 GO:0003676 0.07
T96060020119 GO:0005622 0.07
T96060020119 GO:0046872 0.07
T96060020118 GO:0008270 0.75
T96060020118 GO:0006351 0.68
T96060020118 GO:0003677 0.67
T96060020118 GO:0005634 0.67
T96060020118 GO:0006355 0.55
T96060020118 GO:0003700 0.34
Protein
ID
GO
term
Confidence
16 / 27
DREAM Digital Mammography
Challenge (2016-2017)
17 / 27
DREAM Digital Mammography
Challenge (2016-2017)
640K
mammography
images
Patient
metadata
0 1
Sensitivity: TP/(TP+FN)
Specificity: TN/(TN+FP)
TP: patient sick, predicted sick
FP: patient healthy, predicted sick
TN: patient healthy, predicted
healthy
FN: patient sick, predicted healthy
Probability of cancer one year after?
18 / 27
Are we improving?
https://www.ibm.com/blogs/research/2017/06/dream-challenge-results/
https://predictioncenter.org/
https://biofunctionprediction.org/
19 / 27
Are we improving?
https://www.ibm.com/blogs/research/2017/06/dream-challenge-results/
https://predictioncenter.org/
https://biofunctionprediction.org/
20 / 27
Goodhart’s Law
21 / 27
Gaming Metrics
22 / 27
Gaming Metrics
Metric Claims to
measure
Used for How to game
h-index Individual
publication
impact
Promotion and
Tenure
Multi-author
papers
IQ test Intelligence Bragging rights Be born in the
right culture
SAT Success in
college
College
admissions
Study
Precision recall CAFA
performance
Assessing
function
predictions
Not telling you
23 / 27
Gaming Metrics
Metric Claims to
measure
Used for How to game
h-index Individual
publication
impact
Promotion and
Tenure
Multi-author
papers
IQ test Intelligence Bragging rights Be born in the
right culture
SAT Success in
college
College
admissions
Study
Precision recall CAFA
performance
Assessing
function
predictions
Not telling you
24 / 27
Gaming Metrics
Metric Claims to
measure
Used for How to game
h-index Individual
publication
impact
Promotion and
Tenure
Multi-author
papers
IQ test Intelligence Bragging rights Be born in the
right culture
SAT Success in
college
College
admissions
Study
Semantic
similarity
CAFA
performance
Assessing
function
predictions
Not telling you
25 / 27
Gaming Metrics
Metric Claims to
measure
Used for How to game
h-index Individual
publication
impact
Promotion and
Tenure
Multi-author
papers
IQ test Intelligence Bragging rights Be born in the
right culture
SAT Success in
college
College
admissions
Study
Precision recall CAFA
performance
Assessing
function
predictions
Not telling you
26 / 27
Gaming Metrics
Metric Claims to
measure
Used for How to game
h-index Individual
publication
impact
Promotion and
Tenure
Multi-author
papers
IQ test Intelligence Bragging rights Be born in the
right culture
SAT Success in
college
College
admissions
Study
Precision recall CAFA
performance
Assessing
function
predictions
Not telling you
27 / 27
Thank you
●
Sage Bionetworks
– Lara Mangravite
●
CAFA
– Wyatt Clark, Indiana University (1,2)
– Yuxiang Jiang, Indiana University (2-4)
– Naihui Zhou, Iowa State University (3, 3.14)
– Tim Bergquist University of Washington (2-4)
– Predrag Radivojac (Northeastern University)
– Sean Mooney (University of Washington)
– Casey Greene (University of Colorado)
– Mark Wass (University of Kent)
– Kim Reynolds (University of Texas
Southwestern)
●
Sandra Orchard (EMBL-EBI)
●
Maria Martin (EMBL-EBI)
●
> 250 co-authors over the years

More Related Content

What's hot

Data Mining and Big Data Analytics in Pharma
Data Mining and Big Data Analytics in Pharma Data Mining and Big Data Analytics in Pharma
Data Mining and Big Data Analytics in Pharma Ankur Khanna
 
Artificial intelligence in drug discovery
Artificial intelligence in drug discoveryArtificial intelligence in drug discovery
Artificial intelligence in drug discoveryRAVINDRABABUKOPPERA
 
How Artificial Intelligence in Transforming Pharma
How Artificial Intelligence in Transforming PharmaHow Artificial Intelligence in Transforming Pharma
How Artificial Intelligence in Transforming PharmaTyrone Systems
 
neutralresumemergui17
neutralresumemergui17neutralresumemergui17
neutralresumemergui17Sylvia Mergui
 
2012 DIA EMRs for clinical research
2012 DIA  EMRs for clinical research2012 DIA  EMRs for clinical research
2012 DIA EMRs for clinical researchEd Seguine
 

What's hot (6)

Data Mining and Big Data Analytics in Pharma
Data Mining and Big Data Analytics in Pharma Data Mining and Big Data Analytics in Pharma
Data Mining and Big Data Analytics in Pharma
 
Artificial intelligence in drug discovery
Artificial intelligence in drug discoveryArtificial intelligence in drug discovery
Artificial intelligence in drug discovery
 
How Artificial Intelligence in Transforming Pharma
How Artificial Intelligence in Transforming PharmaHow Artificial Intelligence in Transforming Pharma
How Artificial Intelligence in Transforming Pharma
 
SaMD Lastest Trend
SaMD Lastest TrendSaMD Lastest Trend
SaMD Lastest Trend
 
neutralresumemergui17
neutralresumemergui17neutralresumemergui17
neutralresumemergui17
 
2012 DIA EMRs for clinical research
2012 DIA  EMRs for clinical research2012 DIA  EMRs for clinical research
2012 DIA EMRs for clinical research
 

Similar to What can Community Challenges do for You?

Evolution of Knowledge Discovery and Management
Evolution of Knowledge Discovery and Management Evolution of Knowledge Discovery and Management
Evolution of Knowledge Discovery and Management inscit2006
 
Healthcare Predicitive Analytics for Risk Profiling in Chronic Care: A Bayesi...
Healthcare Predicitive Analytics for Risk Profiling in Chronic Care: A Bayesi...Healthcare Predicitive Analytics for Risk Profiling in Chronic Care: A Bayesi...
Healthcare Predicitive Analytics for Risk Profiling in Chronic Care: A Bayesi...MIS Quarterly
 
Bias in covid 19 models
Bias in covid 19 modelsBias in covid 19 models
Bias in covid 19 modelsLaure Wynants
 
Exposome data challenge - ISGlobal hub prez July 2022.pptx
Exposome data challenge - ISGlobal hub prez July 2022.pptxExposome data challenge - ISGlobal hub prez July 2022.pptx
Exposome data challenge - ISGlobal hub prez July 2022.pptxLeaMaitre1
 
FAIR as a Working Principle for Cancer Genomic Data
FAIR as a Working Principle for Cancer Genomic DataFAIR as a Working Principle for Cancer Genomic Data
FAIR as a Working Principle for Cancer Genomic DataIan Fore
 
Math, Stats and CS in Public Health and Medical Research
Math, Stats and CS in Public Health and Medical ResearchMath, Stats and CS in Public Health and Medical Research
Math, Stats and CS in Public Health and Medical ResearchJessica Minnier
 
The state of the art in behavioral machine learning for healthcare
The state of the art in behavioral machine learning for healthcareThe state of the art in behavioral machine learning for healthcare
The state of the art in behavioral machine learning for healthcareAfrica Perianez
 
Computational Challenges in Biological Data Science: an Optimistically Cautio...
Computational Challenges in Biological Data Science: an Optimistically Cautio...Computational Challenges in Biological Data Science: an Optimistically Cautio...
Computational Challenges in Biological Data Science: an Optimistically Cautio...Iddo
 
Bioinformatics Strategies for Exposome 100416
Bioinformatics Strategies for Exposome 100416Bioinformatics Strategies for Exposome 100416
Bioinformatics Strategies for Exposome 100416Chirag Patel
 
ai-in-healthcare-202011-201117103639.pptx
ai-in-healthcare-202011-201117103639.pptxai-in-healthcare-202011-201117103639.pptx
ai-in-healthcare-202011-201117103639.pptxssuser6b571f
 
AI at GSK_Kim Branson_mHealth Israel
AI at GSK_Kim Branson_mHealth IsraelAI at GSK_Kim Branson_mHealth Israel
AI at GSK_Kim Branson_mHealth IsraelLevi Shapiro
 
Quantitative Medicine Feb 2009
Quantitative Medicine Feb 2009Quantitative Medicine Feb 2009
Quantitative Medicine Feb 2009Ian Foster
 
Stephen Friend Dana Farber Cancer Institute 2011-10-24
Stephen Friend Dana Farber Cancer Institute 2011-10-24Stephen Friend Dana Farber Cancer Institute 2011-10-24
Stephen Friend Dana Farber Cancer Institute 2011-10-24Sage Base
 
Enabling the Computational Future of Biology.pdf
Enabling the Computational Future of Biology.pdfEnabling the Computational Future of Biology.pdf
Enabling the Computational Future of Biology.pdfVaticle
 
AI: The Future is So Bright: Part Deux
AI: The Future is So Bright: Part DeuxAI: The Future is So Bright: Part Deux
AI: The Future is So Bright: Part DeuxIsaac Kohane
 
Introduction to Gene Mining Part A: BLASTn-off!
Introduction to Gene Mining Part A: BLASTn-off!Introduction to Gene Mining Part A: BLASTn-off!
Introduction to Gene Mining Part A: BLASTn-off!adcobb
 
Building Biomedical Knowledge Graphs for In-Silico Drug Discovery
Building Biomedical Knowledge Graphs for In-Silico Drug DiscoveryBuilding Biomedical Knowledge Graphs for In-Silico Drug Discovery
Building Biomedical Knowledge Graphs for In-Silico Drug DiscoveryVaticle
 
Stephen Friend Institute of Development, Aging and Cancer 2011-11-29
Stephen Friend Institute of Development, Aging and Cancer 2011-11-29Stephen Friend Institute of Development, Aging and Cancer 2011-11-29
Stephen Friend Institute of Development, Aging and Cancer 2011-11-29Sage Base
 

Similar to What can Community Challenges do for You? (20)

Evolution of Knowledge Discovery and Management
Evolution of Knowledge Discovery and Management Evolution of Knowledge Discovery and Management
Evolution of Knowledge Discovery and Management
 
Healthcare Predicitive Analytics for Risk Profiling in Chronic Care: A Bayesi...
Healthcare Predicitive Analytics for Risk Profiling in Chronic Care: A Bayesi...Healthcare Predicitive Analytics for Risk Profiling in Chronic Care: A Bayesi...
Healthcare Predicitive Analytics for Risk Profiling in Chronic Care: A Bayesi...
 
Bias in covid 19 models
Bias in covid 19 modelsBias in covid 19 models
Bias in covid 19 models
 
Exposome data challenge - ISGlobal hub prez July 2022.pptx
Exposome data challenge - ISGlobal hub prez July 2022.pptxExposome data challenge - ISGlobal hub prez July 2022.pptx
Exposome data challenge - ISGlobal hub prez July 2022.pptx
 
FAIR as a Working Principle for Cancer Genomic Data
FAIR as a Working Principle for Cancer Genomic DataFAIR as a Working Principle for Cancer Genomic Data
FAIR as a Working Principle for Cancer Genomic Data
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Math, Stats and CS in Public Health and Medical Research
Math, Stats and CS in Public Health and Medical ResearchMath, Stats and CS in Public Health and Medical Research
Math, Stats and CS in Public Health and Medical Research
 
The state of the art in behavioral machine learning for healthcare
The state of the art in behavioral machine learning for healthcareThe state of the art in behavioral machine learning for healthcare
The state of the art in behavioral machine learning for healthcare
 
Computational Challenges in Biological Data Science: an Optimistically Cautio...
Computational Challenges in Biological Data Science: an Optimistically Cautio...Computational Challenges in Biological Data Science: an Optimistically Cautio...
Computational Challenges in Biological Data Science: an Optimistically Cautio...
 
Bioinformatics Strategies for Exposome 100416
Bioinformatics Strategies for Exposome 100416Bioinformatics Strategies for Exposome 100416
Bioinformatics Strategies for Exposome 100416
 
ai-in-healthcare-202011-201117103639.pptx
ai-in-healthcare-202011-201117103639.pptxai-in-healthcare-202011-201117103639.pptx
ai-in-healthcare-202011-201117103639.pptx
 
AI at GSK_Kim Branson_mHealth Israel
AI at GSK_Kim Branson_mHealth IsraelAI at GSK_Kim Branson_mHealth Israel
AI at GSK_Kim Branson_mHealth Israel
 
Quantitative Medicine Feb 2009
Quantitative Medicine Feb 2009Quantitative Medicine Feb 2009
Quantitative Medicine Feb 2009
 
Stephen Friend Dana Farber Cancer Institute 2011-10-24
Stephen Friend Dana Farber Cancer Institute 2011-10-24Stephen Friend Dana Farber Cancer Institute 2011-10-24
Stephen Friend Dana Farber Cancer Institute 2011-10-24
 
Biostatistics and its importance to Biologist
Biostatistics and its importance to BiologistBiostatistics and its importance to Biologist
Biostatistics and its importance to Biologist
 
Enabling the Computational Future of Biology.pdf
Enabling the Computational Future of Biology.pdfEnabling the Computational Future of Biology.pdf
Enabling the Computational Future of Biology.pdf
 
AI: The Future is So Bright: Part Deux
AI: The Future is So Bright: Part DeuxAI: The Future is So Bright: Part Deux
AI: The Future is So Bright: Part Deux
 
Introduction to Gene Mining Part A: BLASTn-off!
Introduction to Gene Mining Part A: BLASTn-off!Introduction to Gene Mining Part A: BLASTn-off!
Introduction to Gene Mining Part A: BLASTn-off!
 
Building Biomedical Knowledge Graphs for In-Silico Drug Discovery
Building Biomedical Knowledge Graphs for In-Silico Drug DiscoveryBuilding Biomedical Knowledge Graphs for In-Silico Drug Discovery
Building Biomedical Knowledge Graphs for In-Silico Drug Discovery
 
Stephen Friend Institute of Development, Aging and Cancer 2011-11-29
Stephen Friend Institute of Development, Aging and Cancer 2011-11-29Stephen Friend Institute of Development, Aging and Cancer 2011-11-29
Stephen Friend Institute of Development, Aging and Cancer 2011-11-29
 

More from Iddo

Surviving Scientific Presentations
Surviving Scientific PresentationsSurviving Scientific Presentations
Surviving Scientific PresentationsIddo
 
Friedberg lab-overview-grad-students-2019-nr
Friedberg lab-overview-grad-students-2019-nrFriedberg lab-overview-grad-students-2019-nr
Friedberg lab-overview-grad-students-2019-nrIddo
 
The roles communities play in improving bioinformatics: better software, bett...
The roles communities play in improving bioinformatics: better software, bett...The roles communities play in improving bioinformatics: better software, bett...
The roles communities play in improving bioinformatics: better software, bett...Iddo
 
Why Your Microbiome Analysis is Wrong
Why Your Microbiome Analysis is WrongWhy Your Microbiome Analysis is Wrong
Why Your Microbiome Analysis is WrongIddo
 
Tracing the Ancestry of Genomes in Bacteria
Tracing the Ancestry of Genomes in BacteriaTracing the Ancestry of Genomes in Bacteria
Tracing the Ancestry of Genomes in BacteriaIddo
 
Friedberg lab-overview-grad-students
Friedberg lab-overview-grad-studentsFriedberg lab-overview-grad-students
Friedberg lab-overview-grad-studentsIddo
 
Understanding Biological Function in Times of High Throughput and Low Output
Understanding Biological Function in Times of High Throughput and Low OutputUnderstanding Biological Function in Times of High Throughput and Low Output
Understanding Biological Function in Times of High Throughput and Low OutputIddo
 
Random Musings on Fixing Data Shambles in Science
Random Musings on Fixing Data Shambles in ScienceRandom Musings on Fixing Data Shambles in Science
Random Musings on Fixing Data Shambles in ScienceIddo
 
Genome Informatics 2015 Bacteriocin Discovery
Genome Informatics 2015 Bacteriocin DiscoveryGenome Informatics 2015 Bacteriocin Discovery
Genome Informatics 2015 Bacteriocin DiscoveryIddo
 
Convergent divergent
Convergent divergentConvergent divergent
Convergent divergentIddo
 
Some US Science Funding sources
Some US Science Funding sourcesSome US Science Funding sources
Some US Science Funding sourcesIddo
 
CAFA poster presented at CSHL Genome Informatics 2013
CAFA poster presented at CSHL Genome Informatics 2013CAFA poster presented at CSHL Genome Informatics 2013
CAFA poster presented at CSHL Genome Informatics 2013Iddo
 
Ewan Birney Biocuration 2013
Ewan Birney Biocuration 2013Ewan Birney Biocuration 2013
Ewan Birney Biocuration 2013Iddo
 
Metagenomics Biocuration 2013
Metagenomics Biocuration 2013Metagenomics Biocuration 2013
Metagenomics Biocuration 2013Iddo
 
Ismb grant-writing-2012
Ismb grant-writing-2012Ismb grant-writing-2012
Ismb grant-writing-2012Iddo
 
David Jones AFP/CAFA2011
David Jones AFP/CAFA2011David Jones AFP/CAFA2011
David Jones AFP/CAFA2011Iddo
 
Vienna afp2011
Vienna afp2011Vienna afp2011
Vienna afp2011Iddo
 
Afp cafa djuric
Afp cafa djuricAfp cafa djuric
Afp cafa djuricIddo
 
Go camp 2010_cacao
Go camp 2010_cacaoGo camp 2010_cacao
Go camp 2010_cacaoIddo
 
Ignobel2010
Ignobel2010Ignobel2010
Ignobel2010Iddo
 

More from Iddo (20)

Surviving Scientific Presentations
Surviving Scientific PresentationsSurviving Scientific Presentations
Surviving Scientific Presentations
 
Friedberg lab-overview-grad-students-2019-nr
Friedberg lab-overview-grad-students-2019-nrFriedberg lab-overview-grad-students-2019-nr
Friedberg lab-overview-grad-students-2019-nr
 
The roles communities play in improving bioinformatics: better software, bett...
The roles communities play in improving bioinformatics: better software, bett...The roles communities play in improving bioinformatics: better software, bett...
The roles communities play in improving bioinformatics: better software, bett...
 
Why Your Microbiome Analysis is Wrong
Why Your Microbiome Analysis is WrongWhy Your Microbiome Analysis is Wrong
Why Your Microbiome Analysis is Wrong
 
Tracing the Ancestry of Genomes in Bacteria
Tracing the Ancestry of Genomes in BacteriaTracing the Ancestry of Genomes in Bacteria
Tracing the Ancestry of Genomes in Bacteria
 
Friedberg lab-overview-grad-students
Friedberg lab-overview-grad-studentsFriedberg lab-overview-grad-students
Friedberg lab-overview-grad-students
 
Understanding Biological Function in Times of High Throughput and Low Output
Understanding Biological Function in Times of High Throughput and Low OutputUnderstanding Biological Function in Times of High Throughput and Low Output
Understanding Biological Function in Times of High Throughput and Low Output
 
Random Musings on Fixing Data Shambles in Science
Random Musings on Fixing Data Shambles in ScienceRandom Musings on Fixing Data Shambles in Science
Random Musings on Fixing Data Shambles in Science
 
Genome Informatics 2015 Bacteriocin Discovery
Genome Informatics 2015 Bacteriocin DiscoveryGenome Informatics 2015 Bacteriocin Discovery
Genome Informatics 2015 Bacteriocin Discovery
 
Convergent divergent
Convergent divergentConvergent divergent
Convergent divergent
 
Some US Science Funding sources
Some US Science Funding sourcesSome US Science Funding sources
Some US Science Funding sources
 
CAFA poster presented at CSHL Genome Informatics 2013
CAFA poster presented at CSHL Genome Informatics 2013CAFA poster presented at CSHL Genome Informatics 2013
CAFA poster presented at CSHL Genome Informatics 2013
 
Ewan Birney Biocuration 2013
Ewan Birney Biocuration 2013Ewan Birney Biocuration 2013
Ewan Birney Biocuration 2013
 
Metagenomics Biocuration 2013
Metagenomics Biocuration 2013Metagenomics Biocuration 2013
Metagenomics Biocuration 2013
 
Ismb grant-writing-2012
Ismb grant-writing-2012Ismb grant-writing-2012
Ismb grant-writing-2012
 
David Jones AFP/CAFA2011
David Jones AFP/CAFA2011David Jones AFP/CAFA2011
David Jones AFP/CAFA2011
 
Vienna afp2011
Vienna afp2011Vienna afp2011
Vienna afp2011
 
Afp cafa djuric
Afp cafa djuricAfp cafa djuric
Afp cafa djuric
 
Go camp 2010_cacao
Go camp 2010_cacaoGo camp 2010_cacao
Go camp 2010_cacao
 
Ignobel2010
Ignobel2010Ignobel2010
Ignobel2010
 

Recently uploaded

RNA INTERFERENCE: UNRAVELING GENETIC SILENCING
RNA INTERFERENCE: UNRAVELING GENETIC SILENCINGRNA INTERFERENCE: UNRAVELING GENETIC SILENCING
RNA INTERFERENCE: UNRAVELING GENETIC SILENCINGAADYARAJPANDEY1
 
Hemoglobin metabolism: C Kalyan & E. Muralinath
Hemoglobin metabolism: C Kalyan & E. MuralinathHemoglobin metabolism: C Kalyan & E. Muralinath
Hemoglobin metabolism: C Kalyan & E. Muralinathmuralinath2
 
ESR_factors_affect-clinic significance-Pathysiology.pptx
ESR_factors_affect-clinic significance-Pathysiology.pptxESR_factors_affect-clinic significance-Pathysiology.pptx
ESR_factors_affect-clinic significance-Pathysiology.pptxmuralinath2
 
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATIONPRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATIONChetanK57
 
THYROID-PARATHYROID medical surgical nursing
THYROID-PARATHYROID medical surgical nursingTHYROID-PARATHYROID medical surgical nursing
THYROID-PARATHYROID medical surgical nursingJocelyn Atis
 
The importance of continents, oceans and plate tectonics for the evolution of...
The importance of continents, oceans and plate tectonics for the evolution of...The importance of continents, oceans and plate tectonics for the evolution of...
The importance of continents, oceans and plate tectonics for the evolution of...Sérgio Sacani
 
Richard's entangled aventures in wonderland
Richard's entangled aventures in wonderlandRichard's entangled aventures in wonderland
Richard's entangled aventures in wonderlandRichard Gill
 
GBSN - Biochemistry (Unit 5) Chemistry of Lipids
GBSN - Biochemistry (Unit 5) Chemistry of LipidsGBSN - Biochemistry (Unit 5) Chemistry of Lipids
GBSN - Biochemistry (Unit 5) Chemistry of LipidsAreesha Ahmad
 
Lab report on liquid viscosity of glycerin
Lab report on liquid viscosity of glycerinLab report on liquid viscosity of glycerin
Lab report on liquid viscosity of glycerinossaicprecious19
 
NuGOweek 2024 full programme - hosted by Ghent University
NuGOweek 2024 full programme - hosted by Ghent UniversityNuGOweek 2024 full programme - hosted by Ghent University
NuGOweek 2024 full programme - hosted by Ghent Universitypablovgd
 
Multi-source connectivity as the driver of solar wind variability in the heli...
Multi-source connectivity as the driver of solar wind variability in the heli...Multi-source connectivity as the driver of solar wind variability in the heli...
Multi-source connectivity as the driver of solar wind variability in the heli...Sérgio Sacani
 
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...muralinath2
 
FAIRSpectra - Towards a common data file format for SIMS images
FAIRSpectra - Towards a common data file format for SIMS imagesFAIRSpectra - Towards a common data file format for SIMS images
FAIRSpectra - Towards a common data file format for SIMS imagesAlex Henderson
 
Climate extremes likely to drive land mammal extinction during next supercont...
Climate extremes likely to drive land mammal extinction during next supercont...Climate extremes likely to drive land mammal extinction during next supercont...
Climate extremes likely to drive land mammal extinction during next supercont...Sérgio Sacani
 
Transport in plants G1.pptx Cambridge IGCSE
Transport in plants G1.pptx Cambridge IGCSETransport in plants G1.pptx Cambridge IGCSE
Transport in plants G1.pptx Cambridge IGCSEjordanparish425
 
Hemoglobin metabolism_pathophysiology.pptx
Hemoglobin metabolism_pathophysiology.pptxHemoglobin metabolism_pathophysiology.pptx
Hemoglobin metabolism_pathophysiology.pptxmuralinath2
 
biotech-regenration of plants, pharmaceutical applications.pptx
biotech-regenration of plants, pharmaceutical applications.pptxbiotech-regenration of plants, pharmaceutical applications.pptx
biotech-regenration of plants, pharmaceutical applications.pptxANONYMOUS
 
Aerodynamics. flippatterncn5tm5ttnj6nmnynyppt
Aerodynamics. flippatterncn5tm5ttnj6nmnynypptAerodynamics. flippatterncn5tm5ttnj6nmnynyppt
Aerodynamics. flippatterncn5tm5ttnj6nmnynypptsreddyrahul
 
Structures and textures of metamorphic rocks
Structures and textures of metamorphic rocksStructures and textures of metamorphic rocks
Structures and textures of metamorphic rockskumarmathi863
 
Detectability of Solar Panels as a Technosignature
Detectability of Solar Panels as a TechnosignatureDetectability of Solar Panels as a Technosignature
Detectability of Solar Panels as a TechnosignatureSérgio Sacani
 

Recently uploaded (20)

RNA INTERFERENCE: UNRAVELING GENETIC SILENCING
RNA INTERFERENCE: UNRAVELING GENETIC SILENCINGRNA INTERFERENCE: UNRAVELING GENETIC SILENCING
RNA INTERFERENCE: UNRAVELING GENETIC SILENCING
 
Hemoglobin metabolism: C Kalyan & E. Muralinath
Hemoglobin metabolism: C Kalyan & E. MuralinathHemoglobin metabolism: C Kalyan & E. Muralinath
Hemoglobin metabolism: C Kalyan & E. Muralinath
 
ESR_factors_affect-clinic significance-Pathysiology.pptx
ESR_factors_affect-clinic significance-Pathysiology.pptxESR_factors_affect-clinic significance-Pathysiology.pptx
ESR_factors_affect-clinic significance-Pathysiology.pptx
 
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATIONPRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
 
THYROID-PARATHYROID medical surgical nursing
THYROID-PARATHYROID medical surgical nursingTHYROID-PARATHYROID medical surgical nursing
THYROID-PARATHYROID medical surgical nursing
 
The importance of continents, oceans and plate tectonics for the evolution of...
The importance of continents, oceans and plate tectonics for the evolution of...The importance of continents, oceans and plate tectonics for the evolution of...
The importance of continents, oceans and plate tectonics for the evolution of...
 
Richard's entangled aventures in wonderland
Richard's entangled aventures in wonderlandRichard's entangled aventures in wonderland
Richard's entangled aventures in wonderland
 
GBSN - Biochemistry (Unit 5) Chemistry of Lipids
GBSN - Biochemistry (Unit 5) Chemistry of LipidsGBSN - Biochemistry (Unit 5) Chemistry of Lipids
GBSN - Biochemistry (Unit 5) Chemistry of Lipids
 
Lab report on liquid viscosity of glycerin
Lab report on liquid viscosity of glycerinLab report on liquid viscosity of glycerin
Lab report on liquid viscosity of glycerin
 
NuGOweek 2024 full programme - hosted by Ghent University
NuGOweek 2024 full programme - hosted by Ghent UniversityNuGOweek 2024 full programme - hosted by Ghent University
NuGOweek 2024 full programme - hosted by Ghent University
 
Multi-source connectivity as the driver of solar wind variability in the heli...
Multi-source connectivity as the driver of solar wind variability in the heli...Multi-source connectivity as the driver of solar wind variability in the heli...
Multi-source connectivity as the driver of solar wind variability in the heli...
 
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
 
FAIRSpectra - Towards a common data file format for SIMS images
FAIRSpectra - Towards a common data file format for SIMS imagesFAIRSpectra - Towards a common data file format for SIMS images
FAIRSpectra - Towards a common data file format for SIMS images
 
Climate extremes likely to drive land mammal extinction during next supercont...
Climate extremes likely to drive land mammal extinction during next supercont...Climate extremes likely to drive land mammal extinction during next supercont...
Climate extremes likely to drive land mammal extinction during next supercont...
 
Transport in plants G1.pptx Cambridge IGCSE
Transport in plants G1.pptx Cambridge IGCSETransport in plants G1.pptx Cambridge IGCSE
Transport in plants G1.pptx Cambridge IGCSE
 
Hemoglobin metabolism_pathophysiology.pptx
Hemoglobin metabolism_pathophysiology.pptxHemoglobin metabolism_pathophysiology.pptx
Hemoglobin metabolism_pathophysiology.pptx
 
biotech-regenration of plants, pharmaceutical applications.pptx
biotech-regenration of plants, pharmaceutical applications.pptxbiotech-regenration of plants, pharmaceutical applications.pptx
biotech-regenration of plants, pharmaceutical applications.pptx
 
Aerodynamics. flippatterncn5tm5ttnj6nmnynyppt
Aerodynamics. flippatterncn5tm5ttnj6nmnynypptAerodynamics. flippatterncn5tm5ttnj6nmnynyppt
Aerodynamics. flippatterncn5tm5ttnj6nmnynyppt
 
Structures and textures of metamorphic rocks
Structures and textures of metamorphic rocksStructures and textures of metamorphic rocks
Structures and textures of metamorphic rocks
 
Detectability of Solar Panels as a Technosignature
Detectability of Solar Panels as a TechnosignatureDetectability of Solar Panels as a Technosignature
Detectability of Solar Panels as a Technosignature
 

What can Community Challenges do for You?

  • 1. 1 / 27 What Can Community Challenges do for You? Iddo Friedberg Iowa State University
  • 2. 2 / 27 What Can Community Challenges do for You? Iddo Friedberg Iowa State University
  • 3. 3 / 27 Science is Symbiotic Discovery Methods
  • 4. 4 / 27 Science is Symbiotic
  • 5. 5 / 27 There are many methods, which is best?
  • 6. 6 / 27 What is “best”?
  • 7. 7 / 27 Example: CASP Critical Assessment of Structure Prediction Problem: protein folding, or, predicting structure from sequence Community: Structural biologists & bioinformaticians Challenge set: proteins whose structures were experimentally determined. Varying difficulties Assessment metrics: Root Mean Square Deviation, Q3, etc.
  • 8. 8 / 27 Example: CAFA Critical Assessment of Function Annotation Predict sequence from Function Community: function predictors, ontologists, experimental biologists Problem: given a protein, which ontology terms are associated with it? Challenge set: proteins that organizers know their functions, but predictors don’t
  • 9. 9 / 27 Biomedical prediction problems ● Protein Sequence to Structure ● Protein Sequence to Function ● Image to Phenotype ● Phenotype to Genotype (disease) ● Genotype to Phenotype ● Symptoms to Disease
  • 10. 10 / 27 A History of Critical Assessments TREC: Text Retrieval Conference run by NIST CASP: structure prediction CAPRI: Protein Interaction DREAM: umbrella – many biomedical challenges CAGI: genotype / phenotype CACAO: educational
  • 11. 11 / 27 Precision Recall on Ontologies DNA binding True function Predicted function
  • 12. 12 / 27 Precision Recall on Ontologies Nucleic acid binding DNA binding Binding True function Predicted function
  • 13. 13 / 27 Precision Recall on Ontologies Nucleic acid binding DNA binding Binding rRNA binding RNA binding True Positives : 2 False Positives: 2 False Negatives: 1 True function Predicted function
  • 14. 14 / 27 Precision Recall 00 1 1 Recall Precision 2/4 2/3
  • 15. 15 / 27 Adding Confidence as Threshold AUTHOR Dr. Natalia Alianovna MODEL 1 KEYWORDS sequence alignment. T96060020120 GO:0008270 0.80 T96060020120 GO:0003700 0.80 T96060020120 GO:0006351 0.80 T96060020119 GO:0005730 0.01 T96060020119 GO:0003676 0.07 T96060020119 GO:0005622 0.07 T96060020119 GO:0046872 0.07 T96060020118 GO:0008270 0.75 T96060020118 GO:0006351 0.68 T96060020118 GO:0003677 0.67 T96060020118 GO:0005634 0.67 T96060020118 GO:0006355 0.55 T96060020118 GO:0003700 0.34 Protein ID GO term Confidence
  • 16. 16 / 27 DREAM Digital Mammography Challenge (2016-2017)
  • 17. 17 / 27 DREAM Digital Mammography Challenge (2016-2017) 640K mammography images Patient metadata 0 1 Sensitivity: TP/(TP+FN) Specificity: TN/(TN+FP) TP: patient sick, predicted sick FP: patient healthy, predicted sick TN: patient healthy, predicted healthy FN: patient sick, predicted healthy Probability of cancer one year after?
  • 18. 18 / 27 Are we improving? https://www.ibm.com/blogs/research/2017/06/dream-challenge-results/ https://predictioncenter.org/ https://biofunctionprediction.org/
  • 19. 19 / 27 Are we improving? https://www.ibm.com/blogs/research/2017/06/dream-challenge-results/ https://predictioncenter.org/ https://biofunctionprediction.org/
  • 21. 21 / 27 Gaming Metrics
  • 22. 22 / 27 Gaming Metrics Metric Claims to measure Used for How to game h-index Individual publication impact Promotion and Tenure Multi-author papers IQ test Intelligence Bragging rights Be born in the right culture SAT Success in college College admissions Study Precision recall CAFA performance Assessing function predictions Not telling you
  • 23. 23 / 27 Gaming Metrics Metric Claims to measure Used for How to game h-index Individual publication impact Promotion and Tenure Multi-author papers IQ test Intelligence Bragging rights Be born in the right culture SAT Success in college College admissions Study Precision recall CAFA performance Assessing function predictions Not telling you
  • 24. 24 / 27 Gaming Metrics Metric Claims to measure Used for How to game h-index Individual publication impact Promotion and Tenure Multi-author papers IQ test Intelligence Bragging rights Be born in the right culture SAT Success in college College admissions Study Semantic similarity CAFA performance Assessing function predictions Not telling you
  • 25. 25 / 27 Gaming Metrics Metric Claims to measure Used for How to game h-index Individual publication impact Promotion and Tenure Multi-author papers IQ test Intelligence Bragging rights Be born in the right culture SAT Success in college College admissions Study Precision recall CAFA performance Assessing function predictions Not telling you
  • 26. 26 / 27 Gaming Metrics Metric Claims to measure Used for How to game h-index Individual publication impact Promotion and Tenure Multi-author papers IQ test Intelligence Bragging rights Be born in the right culture SAT Success in college College admissions Study Precision recall CAFA performance Assessing function predictions Not telling you
  • 27. 27 / 27 Thank you ● Sage Bionetworks – Lara Mangravite ● CAFA – Wyatt Clark, Indiana University (1,2) – Yuxiang Jiang, Indiana University (2-4) – Naihui Zhou, Iowa State University (3, 3.14) – Tim Bergquist University of Washington (2-4) – Predrag Radivojac (Northeastern University) – Sean Mooney (University of Washington) – Casey Greene (University of Colorado) – Mark Wass (University of Kent) – Kim Reynolds (University of Texas Southwestern) ● Sandra Orchard (EMBL-EBI) ● Maria Martin (EMBL-EBI) ● > 250 co-authors over the years