1 / 27
What Can Community Challenges do for You?
Iddo Friedberg
Iowa State University
3 / 27
Science is Symbiotic
Discovery
Methods
5 / 27
There are many methods. Which is best?
6 / 27
What is “best”?
7 / 27
Example: CASP
Critical Assessment of Structure Prediction
Problem: protein folding, or, predicting structure from sequence
Community: Structural biologists & bioinformaticians
Challenge set: proteins whose structures were experimentally determined. Varying difficulties
Assessment metrics: Root Mean Square Deviation, Q3, etc.
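As an illustration of the first metric named above, here is a minimal Python sketch of Root Mean Square Deviation. It assumes the predicted and experimental structures are already superimposed and that atoms are paired by position in the list; the coordinates are hypothetical toy values, not CASP data, and the function name is chosen for this example only.

```python
import math

def rmsd(coords_a, coords_b):
    """Root mean square deviation between two equal-length lists of (x, y, z) points.

    Assumes the structures are already superimposed and that atom i in
    coords_a corresponds to atom i in coords_b.
    """
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("need two non-empty coordinate lists of equal length")
    squared = sum(
        (ax - bx) ** 2 + (ay - by) ** 2 + (az - bz) ** 2
        for (ax, ay, az), (bx, by, bz) in zip(coords_a, coords_b)
    )
    return math.sqrt(squared / len(coords_a))

# Hypothetical toy coordinates (in angstroms), not real CASP targets.
experimental = [(0.0, 0.0, 0.0), (1.5, 0.0, 0.0), (3.0, 0.0, 0.0)]
predicted = [(0.0, 0.0, 0.0), (1.5, 1.0, 0.0), (3.0, 0.0, 2.0)]
print(round(rmsd(experimental, predicted), 3))
```

A lower value means the predicted structure is closer to the experimental one; identical coordinate sets give 0.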
8 / 27
Example: CAFA
Critical Assessment of Function Annotation
Predict function from sequence
Community: function predictors, ontologists, experimental biologists
Problem: given a protein, which ontology terms are associated with it?
Challenge set: proteins whose functions are known to the organizers but not to the predictors
9 / 27
Biomedical prediction problems
● Protein Sequence to Structure
● Protein Sequence to Function
● Image to Phenotype
● Phenotype to Genotype (disease)
● Genotype to Phenotype
● Symptoms to Disease
10 / 27
A History of Critical Assessments
TREC: Text Retrieval Conference run by NIST
CASP: structure prediction
CAPRI: Protein Interaction
DREAM: umbrella – many biomedical challenges
CAGI: genotype / phenotype
CACAO: educational
11 / 27
Precision Recall on Ontologies
DNA binding
True function
Predicted function
12 / 27
Precision Recall on Ontologies
Nucleic acid binding
DNA binding
Binding
True function
Predicted function
13 / 27
Precision Recall on Ontologies
Nucleic acid binding
DNA binding
Binding
rRNA binding
RNA binding
True Positives: 2
False Positives: 2
False Negatives: 1
True function
Predicted function
14 / 27
Precision Recall
[Plot: precision (y-axis) against recall (x-axis), both from 0 to 1]
Precision: 2/4
Recall: 2/3
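The counts in the ontology example can be reproduced with a small Python sketch. It assumes, as the counts imply, that the true function is "DNA binding" and the predicted function is "rRNA binding", and that each annotation is propagated to all of its ancestors before comparison. The single-parent dictionary is a simplification for this toy case; the Gene Ontology itself is a graph in which a term can have several parents.

```python
# Toy is_a hierarchy built from the terms in the example (child -> parent).
# Simplifying assumption: one parent per term.
PARENT = {
    "rRNA binding": "RNA binding",
    "RNA binding": "Nucleic acid binding",
    "DNA binding": "Nucleic acid binding",
    "Nucleic acid binding": "Binding",
}

def with_ancestors(term):
    """Return the term together with every ancestor up to the root."""
    terms = {term}
    while term in PARENT:
        term = PARENT[term]
        terms.add(term)
    return terms

def precision_recall(true_term, predicted_term):
    """Compare the ancestor-propagated term sets of truth and prediction."""
    truth = with_ancestors(true_term)
    predicted = with_ancestors(predicted_term)
    tp = len(truth & predicted)   # terms in both sets
    fp = len(predicted - truth)   # predicted but not true
    fn = len(truth - predicted)   # true but not predicted
    return tp, fp, fn, tp / (tp + fp), tp / (tp + fn)

tp, fp, fn, precision, recall = precision_recall("DNA binding", "rRNA binding")
print(tp, fp, fn, precision, round(recall, 3))  # 2 2 1 0.5 0.667
```

The shared ancestors ("Nucleic acid binding", "Binding") are the true positives, which is why a prediction in the wrong branch still earns partial credit.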
15 / 27
Adding Confidence as Threshold
AUTHOR Dr. Natalia Alianovna
MODEL 1
KEYWORDS sequence alignment.
T96060020120 GO:0008270 0.80
T96060020120 GO:0003700 0.80
T96060020120 GO:0006351 0.80
T96060020119 GO:0005730 0.01
T96060020119 GO:0003676 0.07
T96060020119 GO:0005622 0.07
T96060020119 GO:0046872 0.07
T96060020118 GO:0008270 0.75
T96060020118 GO:0006351 0.68
T96060020118 GO:0003677 0.67
T96060020118 GO:0005634 0.67
T96060020118 GO:0006355 0.55
T96060020118 GO:0003700 0.34
Columns: Protein ID, GO term, Confidence
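A minimal Python sketch of how a confidence threshold turns the submission above into a set of predicted terms per protein. The header keywords and prediction rows are copied from the slide; the function names and the choice to keep a term when its confidence is greater than or equal to the threshold are assumptions made for this example, not a statement of the official CAFA evaluation code.

```python
from collections import defaultdict

# Submission text as shown on the slide.
SUBMISSION = """\
AUTHOR Dr. Natalia Alianovna
MODEL 1
KEYWORDS sequence alignment.
T96060020120 GO:0008270 0.80
T96060020120 GO:0003700 0.80
T96060020120 GO:0006351 0.80
T96060020119 GO:0005730 0.01
T96060020119 GO:0003676 0.07
T96060020119 GO:0005622 0.07
T96060020119 GO:0046872 0.07
T96060020118 GO:0008270 0.75
T96060020118 GO:0006351 0.68
T96060020118 GO:0003677 0.67
T96060020118 GO:0005634 0.67
T96060020118 GO:0006355 0.55
T96060020118 GO:0003700 0.34
"""

HEADER_KEYWORDS = ("AUTHOR", "MODEL", "KEYWORDS")

def parse(text):
    """Return (protein ID, GO term, confidence) tuples, skipping header lines."""
    rows = []
    for line in text.splitlines():
        fields = line.split()
        if not fields or fields[0] in HEADER_KEYWORDS:
            continue
        protein, term, confidence = fields
        rows.append((protein, term, float(confidence)))
    return rows

def predictions_at(rows, threshold):
    """Keep only terms whose confidence is at least the threshold."""
    kept = defaultdict(set)
    for protein, term, confidence in rows:
        if confidence >= threshold:
            kept[protein].add(term)
    return dict(kept)

rows = parse(SUBMISSION)
for protein, terms in sorted(predictions_at(rows, 0.6).items()):
    print(protein, sorted(terms))
```

Raising the threshold keeps fewer, more confident terms; lowering it keeps more. Computing precision and recall at each threshold would trace out a precision-recall curve.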
17 / 27
DREAM Digital Mammography Challenge (2016-2017)
Input: 640K mammography images, patient metadata
Output scale: 0 to 1
Sensitivity: TP/(TP+FN)
Specificity: TN/(TN+FP)
TP: patient sick, predicted sick
FP: patient healthy, predicted sick
TN: patient healthy, predicted healthy
FN: patient sick, predicted healthy
Probability of cancer one year after?
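The sensitivity and specificity definitions above can be sketched in Python as follows. The labels and scores are hypothetical toy values, not challenge data, and the rule "predicted sick when the score is at or above the threshold" is an assumption made for this example.

```python
def sensitivity_specificity(has_cancer, scores, threshold):
    """Return (sensitivity, specificity) for scores cut at the given threshold.

    Assumes at least one sick and one healthy patient, so that neither
    denominator is zero.
    """
    tp = fp = tn = fn = 0
    for sick, score in zip(has_cancer, scores):
        predicted_sick = score >= threshold
        if sick and predicted_sick:
            tp += 1   # patient sick, predicted sick
        elif not sick and predicted_sick:
            fp += 1   # patient healthy, predicted sick
        elif not sick:
            tn += 1   # patient healthy, predicted healthy
        else:
            fn += 1   # patient sick, predicted healthy
    return tp / (tp + fn), tn / (tn + fp)

# Hypothetical patients: True means cancer was found within a year.
has_cancer = [True, True, False, False, False, True]
scores = [0.9, 0.4, 0.2, 0.7, 0.1, 0.8]

sens, spec = sensitivity_specificity(has_cancer, scores, 0.5)
print(round(sens, 3), round(spec, 3))
```

Lowering the threshold raises sensitivity at the cost of specificity; at a threshold of 0 every patient is predicted sick, giving sensitivity 1 and specificity 0.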
18 / 27
Are we improving?
https://www.ibm.com/blogs/research/2017/06/dream-challenge-results/
https://predictioncenter.org/
https://biofunctionprediction.org/
20 / 27
Goodhart’s Law
22 / 27
Gaming Metrics
Metric           | Claims to measure             | Used for                       | How to game
h-index          | Individual publication impact | Promotion and Tenure           | Multi-author papers
IQ test          | Intelligence                  | Bragging rights                | Be born in the right culture
SAT              | Success in college            | College admissions             | Study
Precision recall | CAFA performance              | Assessing function predictions | Not telling you
24 / 27
Gaming Metrics
Metric              | Claims to measure             | Used for                       | How to game
h-index             | Individual publication impact | Promotion and Tenure           | Multi-author papers
IQ test             | Intelligence                  | Bragging rights                | Be born in the right culture
SAT                 | Success in college            | College admissions             | Study
Semantic similarity | CAFA performance              | Assessing function predictions | Not telling you
27 / 27
Thank you
● Sage Bionetworks
– Lara Mangravite
● CAFA
– Wyatt Clark, Indiana University (1,2)
– Yuxiang Jiang, Indiana University (2-4)
– Naihui Zhou, Iowa State University (3, 3.14)
– Tim Bergquist, University of Washington (2-4)
– Predrag Radivojac (Northeastern University)
– Sean Mooney (University of Washington)
– Casey Greene (University of Colorado)
– Mark Wass (University of Kent)
– Kim Reynolds (University of Texas Southwestern)
● Sandra Orchard (EMBL-EBI)
● Maria Martin (EMBL-EBI)
● > 250 co-authors over the years

More Related Content

What's hot

Data Mining and Big Data Analytics in Pharma
Data Mining and Big Data Analytics in Pharma Data Mining and Big Data Analytics in Pharma
Data Mining and Big Data Analytics in Pharma Ankur Khanna
 
Artificial intelligence in drug discovery
Artificial intelligence in drug discoveryArtificial intelligence in drug discovery
Artificial intelligence in drug discoveryRAVINDRABABUKOPPERA
 
How Artificial Intelligence in Transforming Pharma
How Artificial Intelligence in Transforming PharmaHow Artificial Intelligence in Transforming Pharma
How Artificial Intelligence in Transforming PharmaTyrone Systems
 
neutralresumemergui17
neutralresumemergui17neutralresumemergui17
neutralresumemergui17Sylvia Mergui
 
2012 DIA EMRs for clinical research
2012 DIA  EMRs for clinical research2012 DIA  EMRs for clinical research
2012 DIA EMRs for clinical researchEd Seguine
 

What's hot (6)

Data Mining and Big Data Analytics in Pharma
Data Mining and Big Data Analytics in Pharma Data Mining and Big Data Analytics in Pharma
Data Mining and Big Data Analytics in Pharma
 
Artificial intelligence in drug discovery
Artificial intelligence in drug discoveryArtificial intelligence in drug discovery
Artificial intelligence in drug discovery
 
How Artificial Intelligence in Transforming Pharma
How Artificial Intelligence in Transforming PharmaHow Artificial Intelligence in Transforming Pharma
How Artificial Intelligence in Transforming Pharma
 
SaMD Lastest Trend
SaMD Lastest TrendSaMD Lastest Trend
SaMD Lastest Trend
 
neutralresumemergui17
neutralresumemergui17neutralresumemergui17
neutralresumemergui17
 
2012 DIA EMRs for clinical research
2012 DIA  EMRs for clinical research2012 DIA  EMRs for clinical research
2012 DIA EMRs for clinical research
 

Similar to What can Community Challenges do for You?

Evolution of Knowledge Discovery and Management
Evolution of Knowledge Discovery and Management Evolution of Knowledge Discovery and Management
Evolution of Knowledge Discovery and Management inscit2006
 
Healthcare Predicitive Analytics for Risk Profiling in Chronic Care: A Bayesi...
Healthcare Predicitive Analytics for Risk Profiling in Chronic Care: A Bayesi...Healthcare Predicitive Analytics for Risk Profiling in Chronic Care: A Bayesi...
Healthcare Predicitive Analytics for Risk Profiling in Chronic Care: A Bayesi...MIS Quarterly
 
Bias in covid 19 models
Bias in covid 19 modelsBias in covid 19 models
Bias in covid 19 modelsLaure Wynants
 
Exposome data challenge - ISGlobal hub prez July 2022.pptx
Exposome data challenge - ISGlobal hub prez July 2022.pptxExposome data challenge - ISGlobal hub prez July 2022.pptx
Exposome data challenge - ISGlobal hub prez July 2022.pptxLeaMaitre1
 
FAIR as a Working Principle for Cancer Genomic Data
FAIR as a Working Principle for Cancer Genomic DataFAIR as a Working Principle for Cancer Genomic Data
FAIR as a Working Principle for Cancer Genomic DataIan Fore
 
Math, Stats and CS in Public Health and Medical Research
Math, Stats and CS in Public Health and Medical ResearchMath, Stats and CS in Public Health and Medical Research
Math, Stats and CS in Public Health and Medical ResearchJessica Minnier
 
The state of the art in behavioral machine learning for healthcare
The state of the art in behavioral machine learning for healthcareThe state of the art in behavioral machine learning for healthcare
The state of the art in behavioral machine learning for healthcareAfrica Perianez
 
Computational Challenges in Biological Data Science: an Optimistically Cautio...
Computational Challenges in Biological Data Science: an Optimistically Cautio...Computational Challenges in Biological Data Science: an Optimistically Cautio...
Computational Challenges in Biological Data Science: an Optimistically Cautio...Iddo
 
Bioinformatics Strategies for Exposome 100416
Bioinformatics Strategies for Exposome 100416Bioinformatics Strategies for Exposome 100416
Bioinformatics Strategies for Exposome 100416Chirag Patel
 
ai-in-healthcare-202011-201117103639.pptx
ai-in-healthcare-202011-201117103639.pptxai-in-healthcare-202011-201117103639.pptx
ai-in-healthcare-202011-201117103639.pptxssuser6b571f
 
AI at GSK_Kim Branson_mHealth Israel
AI at GSK_Kim Branson_mHealth IsraelAI at GSK_Kim Branson_mHealth Israel
AI at GSK_Kim Branson_mHealth IsraelLevi Shapiro
 
Quantitative Medicine Feb 2009
Quantitative Medicine Feb 2009Quantitative Medicine Feb 2009
Quantitative Medicine Feb 2009Ian Foster
 
Stephen Friend Dana Farber Cancer Institute 2011-10-24
Stephen Friend Dana Farber Cancer Institute 2011-10-24Stephen Friend Dana Farber Cancer Institute 2011-10-24
Stephen Friend Dana Farber Cancer Institute 2011-10-24Sage Base
 
Enabling the Computational Future of Biology.pdf
Enabling the Computational Future of Biology.pdfEnabling the Computational Future of Biology.pdf
Enabling the Computational Future of Biology.pdfVaticle
 
AI: The Future is So Bright: Part Deux
AI: The Future is So Bright: Part DeuxAI: The Future is So Bright: Part Deux
AI: The Future is So Bright: Part DeuxIsaac Kohane
 
Introduction to Gene Mining Part A: BLASTn-off!
Introduction to Gene Mining Part A: BLASTn-off!Introduction to Gene Mining Part A: BLASTn-off!
Introduction to Gene Mining Part A: BLASTn-off!adcobb
 
Building Biomedical Knowledge Graphs for In-Silico Drug Discovery
Building Biomedical Knowledge Graphs for In-Silico Drug DiscoveryBuilding Biomedical Knowledge Graphs for In-Silico Drug Discovery
Building Biomedical Knowledge Graphs for In-Silico Drug DiscoveryVaticle
 
Stephen Friend Institute of Development, Aging and Cancer 2011-11-29
Stephen Friend Institute of Development, Aging and Cancer 2011-11-29Stephen Friend Institute of Development, Aging and Cancer 2011-11-29
Stephen Friend Institute of Development, Aging and Cancer 2011-11-29Sage Base
 

Similar to What can Community Challenges do for You? (20)

Evolution of Knowledge Discovery and Management
Evolution of Knowledge Discovery and Management Evolution of Knowledge Discovery and Management
Evolution of Knowledge Discovery and Management
 
Healthcare Predicitive Analytics for Risk Profiling in Chronic Care: A Bayesi...
Healthcare Predicitive Analytics for Risk Profiling in Chronic Care: A Bayesi...Healthcare Predicitive Analytics for Risk Profiling in Chronic Care: A Bayesi...
Healthcare Predicitive Analytics for Risk Profiling in Chronic Care: A Bayesi...
 
Bias in covid 19 models
Bias in covid 19 modelsBias in covid 19 models
Bias in covid 19 models
 
Exposome data challenge - ISGlobal hub prez July 2022.pptx
Exposome data challenge - ISGlobal hub prez July 2022.pptxExposome data challenge - ISGlobal hub prez July 2022.pptx
Exposome data challenge - ISGlobal hub prez July 2022.pptx
 
FAIR as a Working Principle for Cancer Genomic Data
FAIR as a Working Principle for Cancer Genomic DataFAIR as a Working Principle for Cancer Genomic Data
FAIR as a Working Principle for Cancer Genomic Data
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Math, Stats and CS in Public Health and Medical Research
Math, Stats and CS in Public Health and Medical ResearchMath, Stats and CS in Public Health and Medical Research
Math, Stats and CS in Public Health and Medical Research
 
The state of the art in behavioral machine learning for healthcare
The state of the art in behavioral machine learning for healthcareThe state of the art in behavioral machine learning for healthcare
The state of the art in behavioral machine learning for healthcare
 
Computational Challenges in Biological Data Science: an Optimistically Cautio...
Computational Challenges in Biological Data Science: an Optimistically Cautio...Computational Challenges in Biological Data Science: an Optimistically Cautio...
Computational Challenges in Biological Data Science: an Optimistically Cautio...
 
Bioinformatics Strategies for Exposome 100416
Bioinformatics Strategies for Exposome 100416Bioinformatics Strategies for Exposome 100416
Bioinformatics Strategies for Exposome 100416
 
ai-in-healthcare-202011-201117103639.pptx
ai-in-healthcare-202011-201117103639.pptxai-in-healthcare-202011-201117103639.pptx
ai-in-healthcare-202011-201117103639.pptx
 
AI at GSK_Kim Branson_mHealth Israel
AI at GSK_Kim Branson_mHealth IsraelAI at GSK_Kim Branson_mHealth Israel
AI at GSK_Kim Branson_mHealth Israel
 
Quantitative Medicine Feb 2009
Quantitative Medicine Feb 2009Quantitative Medicine Feb 2009
Quantitative Medicine Feb 2009
 
Stephen Friend Dana Farber Cancer Institute 2011-10-24
Stephen Friend Dana Farber Cancer Institute 2011-10-24Stephen Friend Dana Farber Cancer Institute 2011-10-24
Stephen Friend Dana Farber Cancer Institute 2011-10-24
 
Biostatistics and its importance to Biologist
Biostatistics and its importance to BiologistBiostatistics and its importance to Biologist
Biostatistics and its importance to Biologist
 
Enabling the Computational Future of Biology.pdf
Enabling the Computational Future of Biology.pdfEnabling the Computational Future of Biology.pdf
Enabling the Computational Future of Biology.pdf
 
AI: The Future is So Bright: Part Deux
AI: The Future is So Bright: Part DeuxAI: The Future is So Bright: Part Deux
AI: The Future is So Bright: Part Deux
 
Introduction to Gene Mining Part A: BLASTn-off!
Introduction to Gene Mining Part A: BLASTn-off!Introduction to Gene Mining Part A: BLASTn-off!
Introduction to Gene Mining Part A: BLASTn-off!
 
Building Biomedical Knowledge Graphs for In-Silico Drug Discovery
Building Biomedical Knowledge Graphs for In-Silico Drug DiscoveryBuilding Biomedical Knowledge Graphs for In-Silico Drug Discovery
Building Biomedical Knowledge Graphs for In-Silico Drug Discovery
 
Stephen Friend Institute of Development, Aging and Cancer 2011-11-29
Stephen Friend Institute of Development, Aging and Cancer 2011-11-29Stephen Friend Institute of Development, Aging and Cancer 2011-11-29
Stephen Friend Institute of Development, Aging and Cancer 2011-11-29
 

More from Iddo

Surviving Scientific Presentations
Surviving Scientific PresentationsSurviving Scientific Presentations
Surviving Scientific PresentationsIddo
 
Friedberg lab-overview-grad-students-2019-nr
Friedberg lab-overview-grad-students-2019-nrFriedberg lab-overview-grad-students-2019-nr
Friedberg lab-overview-grad-students-2019-nrIddo
 
The roles communities play in improving bioinformatics: better software, bett...
The roles communities play in improving bioinformatics: better software, bett...The roles communities play in improving bioinformatics: better software, bett...
The roles communities play in improving bioinformatics: better software, bett...Iddo
 
Why Your Microbiome Analysis is Wrong
Why Your Microbiome Analysis is WrongWhy Your Microbiome Analysis is Wrong
Why Your Microbiome Analysis is WrongIddo
 
Tracing the Ancestry of Genomes in Bacteria
Tracing the Ancestry of Genomes in BacteriaTracing the Ancestry of Genomes in Bacteria
Tracing the Ancestry of Genomes in BacteriaIddo
 
Friedberg lab-overview-grad-students
Friedberg lab-overview-grad-studentsFriedberg lab-overview-grad-students
Friedberg lab-overview-grad-studentsIddo
 
Understanding Biological Function in Times of High Throughput and Low Output
Understanding Biological Function in Times of High Throughput and Low OutputUnderstanding Biological Function in Times of High Throughput and Low Output
Understanding Biological Function in Times of High Throughput and Low OutputIddo
 
Random Musings on Fixing Data Shambles in Science
Random Musings on Fixing Data Shambles in ScienceRandom Musings on Fixing Data Shambles in Science
Random Musings on Fixing Data Shambles in ScienceIddo
 
Genome Informatics 2015 Bacteriocin Discovery
Genome Informatics 2015 Bacteriocin DiscoveryGenome Informatics 2015 Bacteriocin Discovery
Genome Informatics 2015 Bacteriocin DiscoveryIddo
 
Convergent divergent
Convergent divergentConvergent divergent
Convergent divergentIddo
 
Some US Science Funding sources
Some US Science Funding sourcesSome US Science Funding sources
Some US Science Funding sourcesIddo
 
CAFA poster presented at CSHL Genome Informatics 2013
CAFA poster presented at CSHL Genome Informatics 2013CAFA poster presented at CSHL Genome Informatics 2013
CAFA poster presented at CSHL Genome Informatics 2013Iddo
 
Ewan Birney Biocuration 2013
Ewan Birney Biocuration 2013Ewan Birney Biocuration 2013
Ewan Birney Biocuration 2013Iddo
 
Metagenomics Biocuration 2013
Metagenomics Biocuration 2013Metagenomics Biocuration 2013
Metagenomics Biocuration 2013Iddo
 
Ismb grant-writing-2012
Ismb grant-writing-2012Ismb grant-writing-2012
Ismb grant-writing-2012Iddo
 
David Jones AFP/CAFA2011
David Jones AFP/CAFA2011David Jones AFP/CAFA2011
David Jones AFP/CAFA2011Iddo
 
Vienna afp2011
Vienna afp2011Vienna afp2011
Vienna afp2011Iddo
 
Afp cafa djuric
Afp cafa djuricAfp cafa djuric
Afp cafa djuricIddo
 
Go camp 2010_cacao
Go camp 2010_cacaoGo camp 2010_cacao
Go camp 2010_cacaoIddo
 
Ignobel2010
Ignobel2010Ignobel2010
Ignobel2010Iddo
 

More from Iddo (20)

Surviving Scientific Presentations
Surviving Scientific PresentationsSurviving Scientific Presentations
Surviving Scientific Presentations
 
Friedberg lab-overview-grad-students-2019-nr
Friedberg lab-overview-grad-students-2019-nrFriedberg lab-overview-grad-students-2019-nr
Friedberg lab-overview-grad-students-2019-nr
 
The roles communities play in improving bioinformatics: better software, bett...
The roles communities play in improving bioinformatics: better software, bett...The roles communities play in improving bioinformatics: better software, bett...
The roles communities play in improving bioinformatics: better software, bett...
 
Why Your Microbiome Analysis is Wrong
Why Your Microbiome Analysis is WrongWhy Your Microbiome Analysis is Wrong
Why Your Microbiome Analysis is Wrong
 
Tracing the Ancestry of Genomes in Bacteria
Tracing the Ancestry of Genomes in BacteriaTracing the Ancestry of Genomes in Bacteria
Tracing the Ancestry of Genomes in Bacteria
 
Friedberg lab-overview-grad-students
Friedberg lab-overview-grad-studentsFriedberg lab-overview-grad-students
Friedberg lab-overview-grad-students
 
Understanding Biological Function in Times of High Throughput and Low Output
Understanding Biological Function in Times of High Throughput and Low OutputUnderstanding Biological Function in Times of High Throughput and Low Output
Understanding Biological Function in Times of High Throughput and Low Output
 
Random Musings on Fixing Data Shambles in Science
Random Musings on Fixing Data Shambles in ScienceRandom Musings on Fixing Data Shambles in Science
Random Musings on Fixing Data Shambles in Science
 
Genome Informatics 2015 Bacteriocin Discovery
Genome Informatics 2015 Bacteriocin DiscoveryGenome Informatics 2015 Bacteriocin Discovery
Genome Informatics 2015 Bacteriocin Discovery
 
Convergent divergent
Convergent divergentConvergent divergent
Convergent divergent
 
Some US Science Funding sources
Some US Science Funding sourcesSome US Science Funding sources
Some US Science Funding sources
 
CAFA poster presented at CSHL Genome Informatics 2013
CAFA poster presented at CSHL Genome Informatics 2013CAFA poster presented at CSHL Genome Informatics 2013
CAFA poster presented at CSHL Genome Informatics 2013
 
Ewan Birney Biocuration 2013
Ewan Birney Biocuration 2013Ewan Birney Biocuration 2013
Ewan Birney Biocuration 2013
 
Metagenomics Biocuration 2013
Metagenomics Biocuration 2013Metagenomics Biocuration 2013
Metagenomics Biocuration 2013
 
Ismb grant-writing-2012
Ismb grant-writing-2012Ismb grant-writing-2012
Ismb grant-writing-2012
 
David Jones AFP/CAFA2011
David Jones AFP/CAFA2011David Jones AFP/CAFA2011
David Jones AFP/CAFA2011
 
Vienna afp2011
Vienna afp2011Vienna afp2011
Vienna afp2011
 
Afp cafa djuric
Afp cafa djuricAfp cafa djuric
Afp cafa djuric
 
Go camp 2010_cacao
Go camp 2010_cacaoGo camp 2010_cacao
Go camp 2010_cacao
 
Ignobel2010
Ignobel2010Ignobel2010
Ignobel2010
 

Recently uploaded

GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)Areesha Ahmad
 
Chromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATINChromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATINsankalpkumarsahoo174
 
DIFFERENCE IN BACK CROSS AND TEST CROSS
DIFFERENCE IN  BACK CROSS AND TEST CROSSDIFFERENCE IN  BACK CROSS AND TEST CROSS
DIFFERENCE IN BACK CROSS AND TEST CROSSLeenakshiTyagi
 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Lokesh Kothari
 
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPirithiRaju
 
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...ssifa0344
 
Formation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksFormation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksSérgio Sacani
 
VIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PVIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PPRINCE C P
 
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...anilsa9823
 
Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )aarthirajkumar25
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTSérgio Sacani
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxUmerFayaz5
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...RohitNehra6
 
Zoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdfZoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdfSumit Kumar yadav
 
Biological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfBiological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfmuntazimhurra
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPirithiRaju
 
Botany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questionsBotany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questionsSumit Kumar yadav
 
Orientation, design and principles of polyhouse
Orientation, design and principles of polyhouseOrientation, design and principles of polyhouse
Orientation, design and principles of polyhousejana861314
 
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSpermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSarthak Sekhar Mondal
 

Recently uploaded (20)

GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)
 
Chromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATINChromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATIN
 
DIFFERENCE IN BACK CROSS AND TEST CROSS
DIFFERENCE IN  BACK CROSS AND TEST CROSSDIFFERENCE IN  BACK CROSS AND TEST CROSS
DIFFERENCE IN BACK CROSS AND TEST CROSS
 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
 
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
 
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
 
Formation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksFormation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disks
 
VIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PVIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C P
 
Engler and Prantl system of classification in plant taxonomy
Engler and Prantl system of classification in plant taxonomyEngler and Prantl system of classification in plant taxonomy
Engler and Prantl system of classification in plant taxonomy
 
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
 
Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOST
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptx
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...
 
Zoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdfZoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdf
 
Biological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfBiological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdf
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
 
Botany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questionsBotany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questions
 
Orientation, design and principles of polyhouse
Orientation, design and principles of polyhouseOrientation, design and principles of polyhouse
Orientation, design and principles of polyhouse
 
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSpermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
 

What can Community Challenges do for You?

  • 1. 1 / 27 What Can Community Challenges do for You? Iddo Friedberg Iowa State University
  • 2. 2 / 27 What Can Community Challenges do for You? Iddo Friedberg Iowa State University
  • 3. 3 / 27 Science is Symbiotic Discovery Methods
  • 4. 4 / 27 Science is Symbiotic
  • 5. 5 / 27 There are many methods, which is best?
  • 6. 6 / 27 What is “best”?
  • 7. 7 / 27 Example: CASP Critical Assessment of Structure Prediction Problem: protein folding, or, predicting structure from sequence Community: Structural biologists & bioinformaticians Challenge set: proteins whose structures were experimentally determined. Varying difficulties Assessment metrics: Root Mean Square Deviation, Q3, etc.
  • 8. 8 / 27 Example: CAFA Critical Assessment of Function Annotation Predict sequence from Function Community: function predictors, ontologists, experimental biologists Problem: given a protein, which ontology terms are associated with it? Challenge set: proteins that organizers know their functions, but predictors don’t
  • 9. 9 / 27 Biomedical prediction problems ● Protein Sequence to Structure ● Protein Sequence to Function ● Image to Phenotype ● Phenotype to Genotype (disease) ● Genotype to Phenotype ● Symptoms to Disease
  • 10. 10 / 27 A History of Critical Assessments TREC: Text Retrieval Conference run by NIST CASP: structure prediction CAPRI: Protein Interaction DREAM: umbrella – many biomedical challenges CAGI: genotype / phenotype CACAO: educational
  • 11. Precision Recall on Ontologies
      [Figure: ontology graph showing DNA binding; legend: True function, Predicted function]
  • 12. Precision Recall on Ontologies
      [Figure: ontology graph showing Binding, Nucleic acid binding, DNA binding; legend: True function, Predicted function]
  • 13. Precision Recall on Ontologies
      [Figure: ontology graph showing Binding, Nucleic acid binding, DNA binding, RNA binding, rRNA binding; legend: True function, Predicted function]
      True Positives: 2
      False Positives: 2
      False Negatives: 1
  • 14. Precision Recall
      [Figure: plot of Precision against Recall, both axes from 0 to 1]
      Precision: 2/4
      Recall: 2/3
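The counts on slides 13 and 14 can be reproduced with a short sketch. It assumes, based on the order in which the slides build up the graph, that the true function is DNA binding and the predicted function is rRNA binding, and that both are propagated to their ancestors before comparison. The `PARENTS` edges and the function names are illustrative assumptions, not part of the talk or of any CAFA tool.

```python
# Toy ontology, child term -> parent terms. The edges are ASSUMED from the
# term names shown on the slide, not taken from the Gene Ontology itself.
PARENTS = {
    "DNA binding": {"Nucleic acid binding"},
    "RNA binding": {"Nucleic acid binding"},
    "rRNA binding": {"RNA binding"},
    "Nucleic acid binding": {"Binding"},
    "Binding": set(),
}


def propagate(terms):
    """Return the given terms together with all of their ancestors."""
    closed = set()
    stack = list(terms)
    while stack:
        term = stack.pop()
        if term not in closed:
            closed.add(term)
            stack.extend(PARENTS[term])
    return closed


def precision_recall(true_terms, predicted_terms):
    """Compare propagated term sets; return (tp, fp, fn, precision, recall)."""
    truth = propagate(true_terms)
    pred = propagate(predicted_terms)
    tp = len(truth & pred)   # terms both true and predicted
    fp = len(pred - truth)   # predicted but not true
    fn = len(truth - pred)   # true but not predicted
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return tp, fp, fn, precision, recall


# Assumed reading of the slide: truth = DNA binding, prediction = rRNA binding.
tp, fp, fn, precision, recall = precision_recall({"DNA binding"}, {"rRNA binding"})
print(tp, fp, fn, precision, recall)
```

Under these assumptions the shared ancestors (Nucleic acid binding, Binding) are the two true positives, rRNA binding and RNA binding are the two false positives, and DNA binding is the single false negative, which gives precision 2/4 and recall 2/3.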
  • 15. Adding Confidence as Threshold
      AUTHOR Dr. Natalia Alianovna
      MODEL 1
      KEYWORDS sequence alignment.
      Protein ID     GO term      Confidence
      T96060020120   GO:0008270   0.80
      T96060020120   GO:0003700   0.80
      T96060020120   GO:0006351   0.80
      T96060020119   GO:0005730   0.01
      T96060020119   GO:0003676   0.07
      T96060020119   GO:0005622   0.07
      T96060020119   GO:0046872   0.07
      T96060020118   GO:0008270   0.75
      T96060020118   GO:0006351   0.68
      T96060020118   GO:0003677   0.67
      T96060020118   GO:0005634   0.67
      T96060020118   GO:0006355   0.55
      T96060020118   GO:0003700   0.34
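A rough sketch of how a confidence column like the one on slide 15 can act as a threshold: only predictions at or above the threshold are kept, and precision and recall are recomputed for each threshold. The predictions are the slide's rows for target T96060020118. The `TRUE_TERMS` set is invented purely for illustration, and the function names are hypothetical. Ontology propagation is left out to keep the example short.

```python
# Predictions for target T96060020118, copied from the slide: (GO term, confidence).
PREDICTIONS = [
    ("GO:0008270", 0.75),
    ("GO:0006351", 0.68),
    ("GO:0003677", 0.67),
    ("GO:0005634", 0.67),
    ("GO:0006355", 0.55),
    ("GO:0003700", 0.34),
]

# HYPOTHETICAL ground truth, invented for this example only.
# It is NOT the real annotation of this protein.
TRUE_TERMS = {"GO:0008270", "GO:0003677", "GO:0003700"}


def pr_at_threshold(predictions, true_terms, threshold):
    """Keep predictions with confidence >= threshold; return (precision, recall)."""
    kept = {term for term, conf in predictions if conf >= threshold}
    tp = len(kept & true_terms)
    precision = tp / len(kept) if kept else 0.0
    recall = tp / len(true_terms) if true_terms else 0.0
    return precision, recall


def pr_curve(predictions, true_terms, thresholds):
    """Return one (threshold, precision, recall) tuple per threshold."""
    return [(t, *pr_at_threshold(predictions, true_terms, t)) for t in thresholds]


curve = pr_curve(PREDICTIONS, TRUE_TERMS, [0.3, 0.6, 0.7])
for threshold, precision, recall in curve:
    print(f"t={threshold:.2f} precision={precision:.2f} recall={recall:.2f}")
```

Raising the threshold keeps fewer terms, so with this made-up truth set recall falls while precision can rise. Sweeping the threshold traces out a precision-recall curve rather than a single point.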
  • 16. DREAM Digital Mammography Challenge (2016-2017)
  • 17. DREAM Digital Mammography Challenge (2016-2017)
      640K mammography images
      Patient metadata
      Probability of cancer one year after? (0 to 1)
      Sensitivity: TP/(TP+FN)
      Specificity: TN/(TN+FP)
      TP: patient sick, predicted sick
      FP: patient healthy, predicted sick
      TN: patient healthy, predicted healthy
      FN: patient sick, predicted healthy
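The sensitivity and specificity definitions on slide 17 translate directly into code. The sketch below uses a made-up list of eight patients (1 = sick, 0 = healthy). The data and function names are illustrative assumptions and have nothing to do with the actual challenge data or its scoring code.

```python
def confusion_counts(actual, predicted):
    """Count TP, FP, TN, FN for binary labels (1 = sick, 0 = healthy)."""
    tp = sum(1 for a, p in zip(actual, predicted) if a == 1 and p == 1)
    fp = sum(1 for a, p in zip(actual, predicted) if a == 0 and p == 1)
    tn = sum(1 for a, p in zip(actual, predicted) if a == 0 and p == 0)
    fn = sum(1 for a, p in zip(actual, predicted) if a == 1 and p == 0)
    return tp, fp, tn, fn


def sensitivity_specificity(actual, predicted):
    """Sensitivity = TP/(TP+FN); specificity = TN/(TN+FP), as on the slide."""
    tp, fp, tn, fn = confusion_counts(actual, predicted)
    sensitivity = tp / (tp + fn) if tp + fn else 0.0
    specificity = tn / (tn + fp) if tn + fp else 0.0
    return sensitivity, specificity


# HYPOTHETICAL toy data: three sick patients, five healthy ones.
actual = [1, 1, 1, 0, 0, 0, 0, 0]
predicted = [1, 1, 0, 1, 0, 0, 0, 0]

sensitivity, specificity = sensitivity_specificity(actual, predicted)
print(f"sensitivity={sensitivity:.3f} specificity={specificity:.3f}")
```

If a method outputs a probability between 0 and 1 instead of a hard label, the `predicted` list would come from applying a cutoff to that probability, and both numbers would change with the cutoff.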
  • 18. Are we improving?
      https://www.ibm.com/blogs/research/2017/06/dream-challenge-results/
      https://predictioncenter.org/
      https://biofunctionprediction.org/
  • 21. Gaming Metrics
  • 22. Gaming Metrics
      Metric            | Claims to measure              | Used for                       | How to game
      h-index           | Individual publication impact  | Promotion and Tenure           | Multi-author papers
      IQ test           | Intelligence                   | Bragging rights                | Be born in the right culture
      SAT               | Success in college             | College admissions             | Study
      Precision recall  | CAFA performance               | Assessing function predictions | Not telling you
  • 24. Gaming Metrics
      Metric              | Claims to measure              | Used for                       | How to game
      h-index             | Individual publication impact  | Promotion and Tenure           | Multi-author papers
      IQ test             | Intelligence                   | Bragging rights                | Be born in the right culture
      SAT                 | Success in college             | College admissions             | Study
      Semantic similarity | CAFA performance               | Assessing function predictions | Not telling you
  • 27. Thank you
      ● Sage Bionetworks
          – Lara Mangravite
      ● CAFA
          – Wyatt Clark, Indiana University (1, 2)
          – Yuxiang Jiang, Indiana University (2-4)
          – Naihui Zhou, Iowa State University (3, 3.14)
          – Tim Bergquist, University of Washington (2-4)
          – Predrag Radivojac, Northeastern University
          – Sean Mooney, University of Washington
          – Casey Greene, University of Colorado
          – Mark Wass, University of Kent
          – Kim Reynolds, University of Texas Southwestern
      ● Sandra Orchard (EMBL-EBI)
      ● Maria Martin (EMBL-EBI)
      ● More than 250 co-authors over the years