SlideShare a Scribd company logo
1 of 38
http://cancerimagingarchive.net
Justin Kirby – justin.kirby@nih.gov
Frederick National Laboratory for Cancer Research
Leidos Biomedical Research, Inc.
Informatics in Cancer Imaging (ICI) Team
Support to: Cancer Imaging Program/DCTD/NCI
Crowds Cure Cancer: Annotating data from
Citizen Science WG Meeting: Oct 2018
2
The Cancer Imaging Archive: Brief intro
• 87 data sets (20 terabytes) consisting of
41,000 subjects (33 million radiology images)
• Covers radiology modalities (CT/MR/PET/RT)
and digitized pathology slides
• Wide variety of cancers + phantoms
• Patient populations vary from a handful to
>26,000 (NLST)
• Many have associated meta-data
 Demographics/outcomes/therapy
 Radiologist expert and automated
computational analyses (segmentations,
features)
• ‘Omics ties to GDC/TCGA, CPTAC, and GEO
http://www.cancerimagingarchive.net
3
Tackling the de-identification challenge
PHI can appear in hundreds
of places in DICOM
• Dates
• Identifiers
• Descriptions
Potential legal risks are a
significant barrier to data
sharing for research
4
TCIA services (not just software)
Relieves PI of majority of data sharing burden/risks
• Data hosting with >99% uptime
• De-identification using pre-configured RSNA’s Clinical Trials Processor (CTP) and
DICOM PS 3.15 Annex E standards
• Multi-tiered QC process inspects both DICOM headers and pixels for PHI and
integrity of data set
Phone/email support available for end users and submitters
Extensive documentation throughout the site
Exposure to a large community of researchers
• Increase visibility of your work, get more citations!
5
TCIA Services: Staffing
TCIA Management at UAMS
Prior, Smith
Data Collection:
Programs
Bennett, Berryman, Billelo
Data Collection:
Research Collections
Jarosz, Stockton, Honomichl
Data Collection:
Pathology
Sharma, Birminam, TBD
APOLLO
(clinical/backlog)
Levine, Angelus, TBD
Clinical Trials
TBD
Infrastructure Support
Smith, Nolan, Dobbins, Tarbox, Tobler, Frund, Utecht, Brown
CIP ICI
Management
Freymann
Kirby
Sullivan
Cordeiro
Hill
6
Organization of TCIA ecosystem
The
Cancer
Imaging
Archive
Data Collection Center
•Tools and staffing to support data
collection, curation, and de-
identification
Data Access
•Browse (home page)
•Filter/Search (Data Portal)
•REST API
•Analysis Data
Data Analysis Centers
•3rd party web sites or tools which
connect to TCIA’s API or mirror its
data
7
Data Collection Center: Publish Your Data
Primary Data (radiology, pathology, clinical, etc) Analysis Results (derived from primary data)
Image credit: Hugo Aerts
8
Data Collection Center:
Publishing data in addition to manuscripts
Data citations for both primary and analysis data to enable reproducible research
Analysis Dataset Citation (derived image features)
Gutman DA, Cooper LA, Hwang SN, Holder CA, Gao J, Aurora TD, Dunn WD Jr, Scarpace L,
Mikkelsen T, Jain R, Wintermark M, Jilwan M, Raghavan P, Huang E, Clifford RJ, Mongkolwat
P, Kleper V, Freymann J, Kirby J, Zinn PO, Moreno CS, Jaffe C, Colen R, Rubin DL, Saltz J,
Flanders A, Brat DJ. (2014). MR Imaging Predictors of Molecular Profile and Survival: Multi-
institutional Study of the TCGA Glioblastoma Data Set. The Cancer Imaging Archive.
http://doi.org/10.7937/K9/TCIA.2014.4HTXYRCN
Publication Citation (cites specific data used)
MR imaging predictors of molecular profile and survival: multi-
institutional study of the TCGA glioblastoma data set. Radiology.
2013 May;267(2):560-9. doi: 10.1148/radiol.13120118. Epub
2013 Feb 7. PubMed PMID: 23392431; PubMed Central PMCID:
PMC3632807.
Primary Data Citation (TCIA images used for study)
Scarpace, L., Mikkelsen, T., Cha, soonmee, Rao, S., Tekchandani, S.,
Gutman, D., … Pierce, L. J. (2016). Radiology Data from The Cancer
Genome Atlas Glioblastoma Multiforme [TCGA-GBM] collection. The
Cancer Imaging Archive. http://doi.org/10.7937/K9/TCIA.2016.RNYFUYE9
9
Data Descriptor Journals
Journal Recommended Repositories
Nature Scientific Data https://www.nature.com/sdata/policies/repositories#imaging
Medical Physics http://aapm.onlinelibrary.wiley.com/hub/journal/10.1002/(ISSN)2473-4209/about/author-
guidelines.html (see section 13-Medical Physics Dataset Articles)
Elsevier Data in Brief http://www.elsevier.com/authors/author-services/research-data/data-base-linking/supported-
data-repositories#Health
PLOS ONE http://journals.plos.org/plosone/s/data-availability#loc-recommended-repositories
Research Data Support https://www.springernature.com/gp/authors/research-data-policy/repositories-bio/12327160
Publish detailed descriptions about how to use your TCIA data to gain
academic credit (publication/citations) in addition to the novel scientific
findings you might publish in traditional journals.
10
Researchers want to share data – 38 data set queue
Community Proposed Data Sets
GBM-DSC-MRI-DRO
ASCC TNM Consensus
Colorectal Liver Metastases
QIN-BREAST-02
MyelomaTT3a
Low Dose CT Liver Metastases
Lung Fused-CT-Pathology
HNSCC Oropharyngeal Radiomics
OPC-Radiomics
Oropharynx Phantoms
HNSCC 3D CT RT
MSK Pancreatic Cancer Repository
Program Data Sets Notes
TCGA 2 collection//sites
CPTAC 9 cancer types, 14
sites
Exceptional
Responders
24 of 58 subjects
remaining
Immunotherapy 2 cancer types
PDX mouse Not started
NCTN integration RTOG 0617 pilot in
process
QIN ECOG-ACRIN 10 trials
11
New funding to collect imaging from NCTN
Clinical
Imaging
12
A significantly growing community!
38 incoming data sets in varying stages of curation
Over 10,000 active users per month
• Up from ~3,000/month in 2015
Downloads of 40-50TB per month
• Up from ~2TB/month in 2015
613 publications based on TCIA data
• 134 new publications in 2017
13
Crowds Cure Cancer: The motivation
Continued growth: Engage researchers outside the medical
community
Computer science researchers often lack disease understanding
or sufficient contact with medical experts
Labeling basic information in the images such as tumor locations
to create “training data” can enable others to apply new methods
to our data
TCIA has this kind of annotation for some data sets, but not all
Performing such annotations/labeling using a small number of
radiologists is extremely time consuming and expensive
14
Stay tuned!
15
TCIA Acknowledgements
University of
Arkansas
Dr. Fred Prior
Dr. Lawrence Tarbox
Kirk Smith
Bill Bennett
Tracy Nolan
Julie Frund
Sean Berryman
Dwayne Dobbins
Quasar Jarosz
Jeff Tobler
Sonya Utecht
Diana Stockton
Betty Levine
Erica Bilello
Geri Blake
Robert Brown
Leidos Biomedical Research, Contract 16X011
for NCI, Maintenance and Extension of The
Cancer Imaging Archive (TCIA ) (Prior)
NBIA Team
(NCI/FNLCR/Ellumen)
Ed Helton
Ulli Wagner
Scott Gustafson
Qinyan Pan
Russ Rieling
Carolyn Klinger
Martin Lerner
Tin Tran
Contractor
John Perry
Frederick National Laboratory
for Cancer Research (FNLCR)
John Freymann
Justin Kirby
Brenda Fevrier-Sullivan
Pam Angelus
Carl Jaffe
Luis Cordeiro
Craig Hill
Emory University
Dr. Ashish Sharma
Ryan Birmingham
Crowds Cure
Cancer
October 1, 2018
Jayashree Kalpathy-Cramer
Agenda
 Background
 Collaborators
 System Development
 General impressions from RSNA of the
experiment
• Feedback from participants
 Current status/results
 Features to be added
 Next steps
Background
Annotated data is key to improving
performance of machine learning algorithms
Inter-rater agreement, even among experts,
can be less than optimal
• Helpful to have multiple annotations per case
Getting annotations is expensive
Hypothesis: crowd-sourced annotations, even
derived from non-experts, can be used for
machine learning
Collaborators  Erik Ziegler/Gordon Harris
(OHIF/MGH/ITCR)
 Steve Pieper (ITCR)
 Lawrence Tarbox, Jeff Tobler, Fred Prior
(UAMS/ITCR)
 Ashish Sharma (Emory/ITCR)
 Jayashree Kalpathy-Cramer/Artem
Mamonov (MGH/ITCR)
 Justin Kirby, Brenda Fevrier-Sullivan, John
Freyman (FNL)
 Erich Huang, Paula Jacobs (NCI)
 RSNA (Informatics)
System
Architecture
 TCIA
• Used images with known “truth”
from TCGA studies/Carl Jaffe
 Azure VM
 Cornerstone Viewer
• Lightweight version, mobile friendly!!
 Chronicle DB (CouchDB) backend
• DICOM aware!
 Registration system
• Logic for next case
 JS/D3 for results (in progress)
• Charts
• Summary tables
General
Impressions
 Overall seemed to be a success
 Many participants ended up spending
more time than planned
 Viewer was responsive
 No major complaints about system
performance
 Results are (very) promising
Logistics
 Better location?
• Near case of the day
 Coffee, water, cookies, chocolate
 Wipes or alcohol wash for hands
 Integration with RSNA app
Sociology
 Leaderboard needs to be prominently
displayed
• Maybe award stars?
Annotators
 Over 250 signed up
• 211 had at least 1 annotation
• 112 were radiologists
 Selected
• Liver: 189
• Lung: 211
• Renal: 165
• Ovarian: 133
 Selected
• 4: 120
• 3: 27
• 2: 29
• 1: 78
Dashboard
Annotations per
annotator
Annotations per annotator
 Misty_mandrill clear winner
• Nocturnal sparrow catching up!
• Rubbery_hamster close behind
Time it takes to make an annotation
• Typically 10-30 seconds
Are we done?
Are some cases
more difficult?
Lung results y = 0.8574x + 5.0806
R² = 0.8379
0
10
20
30
40
50
60
70
80
0 10 20 30 40 50 60 70 80
Averagemeasurment
ground truth measurement
Comparisons to ground truth
Feedback
on interface
 Need to start with a click through tutorial
 Window/Level presets
 Ability to change category mid session
• Also important when multiple cancer types are chosen.
 Mobile support?
 Flag button for quality with text box for comments
 No log in when username is empty
 Logout button
 Need metrics - time per session, scores per session
 People asked to see the number of cases they annotated somewhere on
the screen.
 Add sagittal and coronal views for measurement accuracy
 A few did not like the randomization of the images if they selected 3 or
4 cancer types – they complained about getting a bunch of all liver or all
lung cases at a time; and said they would have preferred being given
the options of which to do next.
 System stalled if images for only one cancer type was selected and all
the cases were completed.
 Reorder buttons – keep essentials only for mobile
 Teaching interface – provide immediate feedback compared to
“consensus”?
 Key up/down
Features to
be added
 Annotators by type
 Time histogram by type
 Update charts to remove skips
 Ability to all annotations for each case
• screen shots of all annotations
 Statistics on variability of measurement
 Ability to filter out by type??
• Can’t filter out our best annotator (misty_mandrill) but can’t
keep in junk?
 Filter out by number of cases annotated
• If <5, probably just trying to see what it is all about
 Compare to ground truth
• Add data for all cases
• Convert AIM measurements to cornerstone
 Length, approximate slice
 Ability to overlay all annotations including GT
Potential
next steps
for system • Permanently linked from TCIA
• Road show to various meetings?
• (Link to challenge platform)

More Related Content

What's hot

WEBINAR: The Yosemite Project PART 6 -- Data-Driven Biomedical Research with ...
WEBINAR: The Yosemite Project PART 6 -- Data-Driven Biomedical Research with ...WEBINAR: The Yosemite Project PART 6 -- Data-Driven Biomedical Research with ...
WEBINAR: The Yosemite Project PART 6 -- Data-Driven Biomedical Research with ...DATAVERSITY
 
Universal access to virtual colonoscopy may be on the horizon
Universal access to virtual colonoscopy may be on the horizonUniversal access to virtual colonoscopy may be on the horizon
Universal access to virtual colonoscopy may be on the horizonYael Waknine
 
NCI Cancer Imaging Program - Cancer Research Data Ecosystem
NCI Cancer Imaging Program - Cancer Research Data EcosystemNCI Cancer Imaging Program - Cancer Research Data Ecosystem
NCI Cancer Imaging Program - Cancer Research Data EcosystemWarren Kibbe
 
Radiology Appropriateness Criteria - American College of Radiology (ACR)
Radiology Appropriateness Criteria - American College of Radiology (ACR)Radiology Appropriateness Criteria - American College of Radiology (ACR)
Radiology Appropriateness Criteria - American College of Radiology (ACR)Frank Rybicki
 
Next Generation Companion Diagnostics; Adoption, Drivers, and Moderators of N...
Next Generation Companion Diagnostics; Adoption, Drivers, and Moderators of N...Next Generation Companion Diagnostics; Adoption, Drivers, and Moderators of N...
Next Generation Companion Diagnostics; Adoption, Drivers, and Moderators of N...Andrew Aijian
 
GenomeTrakr: Perspectives on linking internationally - Canada and IRIDA.ca
GenomeTrakr: Perspectives on linking internationally - Canada and IRIDA.caGenomeTrakr: Perspectives on linking internationally - Canada and IRIDA.ca
GenomeTrakr: Perspectives on linking internationally - Canada and IRIDA.cafionabrinkman
 
Towards Digitally Enabled Genomic Medicine: the Patient of The Future
Towards Digitally Enabled Genomic Medicine: the Patient of The FutureTowards Digitally Enabled Genomic Medicine: the Patient of The Future
Towards Digitally Enabled Genomic Medicine: the Patient of The FutureLarry Smarr
 
Automated image analysis: rescue for diffusion-MRI of threat to radiologists?
Automated image analysis: rescue for diffusion-MRI of threat to radiologists?Automated image analysis: rescue for diffusion-MRI of threat to radiologists?
Automated image analysis: rescue for diffusion-MRI of threat to radiologists?Erik R. Ranschaert, MD, PhD
 
ContentMine + EPMC: Finding Zika!
ContentMine + EPMC: Finding Zika! ContentMine + EPMC: Finding Zika!
ContentMine + EPMC: Finding Zika! TheContentMine
 
Data sharing - Data management - The SysMO-SEEK Story
Data sharing - Data management - The SysMO-SEEK StoryData sharing - Data management - The SysMO-SEEK Story
Data sharing - Data management - The SysMO-SEEK StoryResearch Information Network
 
Prostate Cancer Diagnosis using Deep Learning with 3D Multiparametric MRI
Prostate Cancer Diagnosis using Deep Learning with 3D Multiparametric MRIProstate Cancer Diagnosis using Deep Learning with 3D Multiparametric MRI
Prostate Cancer Diagnosis using Deep Learning with 3D Multiparametric MRISaifeng (Aaron) Liu
 
Using Public Access Clinical Databases to Interpret NGS Variants
Using Public Access Clinical Databases to Interpret NGS VariantsUsing Public Access Clinical Databases to Interpret NGS Variants
Using Public Access Clinical Databases to Interpret NGS VariantsGolden Helix Inc
 
Computational Pathology Workshop July 8 2014
Computational Pathology Workshop July 8 2014Computational Pathology Workshop July 8 2014
Computational Pathology Workshop July 8 2014Joel Saltz
 
ICBO 2014, October 8, 2014
ICBO 2014, October 8, 2014ICBO 2014, October 8, 2014
ICBO 2014, October 8, 2014Warren Kibbe
 

What's hot (16)

WEBINAR: The Yosemite Project PART 6 -- Data-Driven Biomedical Research with ...
WEBINAR: The Yosemite Project PART 6 -- Data-Driven Biomedical Research with ...WEBINAR: The Yosemite Project PART 6 -- Data-Driven Biomedical Research with ...
WEBINAR: The Yosemite Project PART 6 -- Data-Driven Biomedical Research with ...
 
Theralase Photodynamic Compounds Destroy Cancer
Theralase Photodynamic Compounds Destroy CancerTheralase Photodynamic Compounds Destroy Cancer
Theralase Photodynamic Compounds Destroy Cancer
 
Universal access to virtual colonoscopy may be on the horizon
Universal access to virtual colonoscopy may be on the horizonUniversal access to virtual colonoscopy may be on the horizon
Universal access to virtual colonoscopy may be on the horizon
 
NCI Cancer Imaging Program - Cancer Research Data Ecosystem
NCI Cancer Imaging Program - Cancer Research Data EcosystemNCI Cancer Imaging Program - Cancer Research Data Ecosystem
NCI Cancer Imaging Program - Cancer Research Data Ecosystem
 
Radiology Appropriateness Criteria - American College of Radiology (ACR)
Radiology Appropriateness Criteria - American College of Radiology (ACR)Radiology Appropriateness Criteria - American College of Radiology (ACR)
Radiology Appropriateness Criteria - American College of Radiology (ACR)
 
Next Generation Companion Diagnostics; Adoption, Drivers, and Moderators of N...
Next Generation Companion Diagnostics; Adoption, Drivers, and Moderators of N...Next Generation Companion Diagnostics; Adoption, Drivers, and Moderators of N...
Next Generation Companion Diagnostics; Adoption, Drivers, and Moderators of N...
 
23and me
23and me23and me
23and me
 
GenomeTrakr: Perspectives on linking internationally - Canada and IRIDA.ca
GenomeTrakr: Perspectives on linking internationally - Canada and IRIDA.caGenomeTrakr: Perspectives on linking internationally - Canada and IRIDA.ca
GenomeTrakr: Perspectives on linking internationally - Canada and IRIDA.ca
 
Towards Digitally Enabled Genomic Medicine: the Patient of The Future
Towards Digitally Enabled Genomic Medicine: the Patient of The FutureTowards Digitally Enabled Genomic Medicine: the Patient of The Future
Towards Digitally Enabled Genomic Medicine: the Patient of The Future
 
Automated image analysis: rescue for diffusion-MRI of threat to radiologists?
Automated image analysis: rescue for diffusion-MRI of threat to radiologists?Automated image analysis: rescue for diffusion-MRI of threat to radiologists?
Automated image analysis: rescue for diffusion-MRI of threat to radiologists?
 
ContentMine + EPMC: Finding Zika!
ContentMine + EPMC: Finding Zika! ContentMine + EPMC: Finding Zika!
ContentMine + EPMC: Finding Zika!
 
Data sharing - Data management - The SysMO-SEEK Story
Data sharing - Data management - The SysMO-SEEK StoryData sharing - Data management - The SysMO-SEEK Story
Data sharing - Data management - The SysMO-SEEK Story
 
Prostate Cancer Diagnosis using Deep Learning with 3D Multiparametric MRI
Prostate Cancer Diagnosis using Deep Learning with 3D Multiparametric MRIProstate Cancer Diagnosis using Deep Learning with 3D Multiparametric MRI
Prostate Cancer Diagnosis using Deep Learning with 3D Multiparametric MRI
 
Using Public Access Clinical Databases to Interpret NGS Variants
Using Public Access Clinical Databases to Interpret NGS VariantsUsing Public Access Clinical Databases to Interpret NGS Variants
Using Public Access Clinical Databases to Interpret NGS Variants
 
Computational Pathology Workshop July 8 2014
Computational Pathology Workshop July 8 2014Computational Pathology Workshop July 8 2014
Computational Pathology Workshop July 8 2014
 
ICBO 2014, October 8, 2014
ICBO 2014, October 8, 2014ICBO 2014, October 8, 2014
ICBO 2014, October 8, 2014
 

Similar to Crowds Cure Canver: Annotating Data from The Cancer Imaging Archive

Quantitative Medicine Feb 2009
Quantitative Medicine Feb 2009Quantitative Medicine Feb 2009
Quantitative Medicine Feb 2009Ian Foster
 
The Future of Personalized Medicine
The Future of Personalized MedicineThe Future of Personalized Medicine
The Future of Personalized MedicineEdgewater
 
tranSMART Community Meeting 5-7 Nov 13 - Session 3: The TraIT user stories fo...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: The TraIT user stories fo...tranSMART Community Meeting 5-7 Nov 13 - Session 3: The TraIT user stories fo...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: The TraIT user stories fo...David Peyruc
 
tranSMART Community Meeting 5-7 Nov 13 - Session 3: Characterization of the c...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: Characterization of the c...tranSMART Community Meeting 5-7 Nov 13 - Session 3: Characterization of the c...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: Characterization of the c...David Peyruc
 
A practical guide to using The Cancer Imaging Archive for QIN Challenges and ...
A practical guide to using The Cancer Imaging Archive for QIN Challenges and ...A practical guide to using The Cancer Imaging Archive for QIN Challenges and ...
A practical guide to using The Cancer Imaging Archive for QIN Challenges and ...CancerImagingInforma
 
AI and whole slide imaging biomarkers
AI and whole slide imaging biomarkersAI and whole slide imaging biomarkers
AI and whole slide imaging biomarkersJoel Saltz
 
Next generation electronic medical records and search a test implementation i...
Next generation electronic medical records and search a test implementation i...Next generation electronic medical records and search a test implementation i...
Next generation electronic medical records and search a test implementation i...lucenerevolution
 
Advancing Innovation and Convergence in Cancer Research: US Federal Cancer Mo...
Advancing Innovation and Convergence in Cancer Research: US Federal Cancer Mo...Advancing Innovation and Convergence in Cancer Research: US Federal Cancer Mo...
Advancing Innovation and Convergence in Cancer Research: US Federal Cancer Mo...Jerry Lee
 
University of Toronto - Radiomics for Oncology - 2017
University of Toronto  - Radiomics for Oncology - 2017University of Toronto  - Radiomics for Oncology - 2017
University of Toronto - Radiomics for Oncology - 2017Andre Dekker
 
Generation and Use of Quantitative Pathology Phenotype
Generation and Use of Quantitative Pathology PhenotypeGeneration and Use of Quantitative Pathology Phenotype
Generation and Use of Quantitative Pathology PhenotypeJoel Saltz
 
AAPM Foster July 2009
AAPM Foster July 2009AAPM Foster July 2009
AAPM Foster July 2009Ian Foster
 
NCI Cancer Genomic Data Commons for NCAB September 2016
NCI Cancer Genomic Data Commons for NCAB September 2016NCI Cancer Genomic Data Commons for NCAB September 2016
NCI Cancer Genomic Data Commons for NCAB September 2016Warren Kibbe
 
Radiomics Data Management, Computation, and Analysis for QIN F2F 2016
Radiomics Data Management, Computation, and Analysis for QIN F2F 2016Radiomics Data Management, Computation, and Analysis for QIN F2F 2016
Radiomics Data Management, Computation, and Analysis for QIN F2F 2016Ashish Sharma
 
Precision Oncology - using Genomics, Proteomics and Imaging to inform biology...
Precision Oncology - using Genomics, Proteomics and Imaging to inform biology...Precision Oncology - using Genomics, Proteomics and Imaging to inform biology...
Precision Oncology - using Genomics, Proteomics and Imaging to inform biology...Warren Kibbe
 
Clinical & Molecular Epidemiology
Clinical & Molecular EpidemiologyClinical & Molecular Epidemiology
Clinical & Molecular Epidemiologygumccomm
 
Ontology-Driven Clinical Intelligence: A Path from the Biobank to Cross-Disea...
Ontology-Driven Clinical Intelligence: A Path from the Biobank to Cross-Disea...Ontology-Driven Clinical Intelligence: A Path from the Biobank to Cross-Disea...
Ontology-Driven Clinical Intelligence: A Path from the Biobank to Cross-Disea...Remedy Informatics
 
Workshop finding and accessing data - fiona - lunteren april 18 2016
Workshop   finding and accessing data - fiona - lunteren april 18 2016Workshop   finding and accessing data - fiona - lunteren april 18 2016
Workshop finding and accessing data - fiona - lunteren april 18 2016Fiona Nielsen
 

Similar to Crowds Cure Canver: Annotating Data from The Cancer Imaging Archive (20)

Quantitative Medicine Feb 2009
Quantitative Medicine Feb 2009Quantitative Medicine Feb 2009
Quantitative Medicine Feb 2009
 
The Future of Personalized Medicine
The Future of Personalized MedicineThe Future of Personalized Medicine
The Future of Personalized Medicine
 
tranSMART Community Meeting 5-7 Nov 13 - Session 3: The TraIT user stories fo...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: The TraIT user stories fo...tranSMART Community Meeting 5-7 Nov 13 - Session 3: The TraIT user stories fo...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: The TraIT user stories fo...
 
tranSMART Community Meeting 5-7 Nov 13 - Session 3: Characterization of the c...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: Characterization of the c...tranSMART Community Meeting 5-7 Nov 13 - Session 3: Characterization of the c...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: Characterization of the c...
 
A practical guide to using The Cancer Imaging Archive for QIN Challenges and ...
A practical guide to using The Cancer Imaging Archive for QIN Challenges and ...A practical guide to using The Cancer Imaging Archive for QIN Challenges and ...
A practical guide to using The Cancer Imaging Archive for QIN Challenges and ...
 
AI and whole slide imaging biomarkers
AI and whole slide imaging biomarkersAI and whole slide imaging biomarkers
AI and whole slide imaging biomarkers
 
Next generation electronic medical records and search a test implementation i...
Next generation electronic medical records and search a test implementation i...Next generation electronic medical records and search a test implementation i...
Next generation electronic medical records and search a test implementation i...
 
Advancing Innovation and Convergence in Cancer Research: US Federal Cancer Mo...
Advancing Innovation and Convergence in Cancer Research: US Federal Cancer Mo...Advancing Innovation and Convergence in Cancer Research: US Federal Cancer Mo...
Advancing Innovation and Convergence in Cancer Research: US Federal Cancer Mo...
 
University of Toronto - Radiomics for Oncology - 2017
University of Toronto  - Radiomics for Oncology - 2017University of Toronto  - Radiomics for Oncology - 2017
University of Toronto - Radiomics for Oncology - 2017
 
Kaleidoscope
KaleidoscopeKaleidoscope
Kaleidoscope
 
Generation and Use of Quantitative Pathology Phenotype
Generation and Use of Quantitative Pathology PhenotypeGeneration and Use of Quantitative Pathology Phenotype
Generation and Use of Quantitative Pathology Phenotype
 
AAPM Foster July 2009
AAPM Foster July 2009AAPM Foster July 2009
AAPM Foster July 2009
 
Dalton
DaltonDalton
Dalton
 
Dalton presentation
Dalton presentationDalton presentation
Dalton presentation
 
NCI Cancer Genomic Data Commons for NCAB September 2016
NCI Cancer Genomic Data Commons for NCAB September 2016NCI Cancer Genomic Data Commons for NCAB September 2016
NCI Cancer Genomic Data Commons for NCAB September 2016
 
Radiomics Data Management, Computation, and Analysis for QIN F2F 2016
Radiomics Data Management, Computation, and Analysis for QIN F2F 2016Radiomics Data Management, Computation, and Analysis for QIN F2F 2016
Radiomics Data Management, Computation, and Analysis for QIN F2F 2016
 
Precision Oncology - using Genomics, Proteomics and Imaging to inform biology...
Precision Oncology - using Genomics, Proteomics and Imaging to inform biology...Precision Oncology - using Genomics, Proteomics and Imaging to inform biology...
Precision Oncology - using Genomics, Proteomics and Imaging to inform biology...
 
Clinical & Molecular Epidemiology
Clinical & Molecular EpidemiologyClinical & Molecular Epidemiology
Clinical & Molecular Epidemiology
 
Ontology-Driven Clinical Intelligence: A Path from the Biobank to Cross-Disea...
Ontology-Driven Clinical Intelligence: A Path from the Biobank to Cross-Disea...Ontology-Driven Clinical Intelligence: A Path from the Biobank to Cross-Disea...
Ontology-Driven Clinical Intelligence: A Path from the Biobank to Cross-Disea...
 
Workshop finding and accessing data - fiona - lunteren april 18 2016
Workshop   finding and accessing data - fiona - lunteren april 18 2016Workshop   finding and accessing data - fiona - lunteren april 18 2016
Workshop finding and accessing data - fiona - lunteren april 18 2016
 

Recently uploaded

Aspirin presentation slides by Dr. Rewas Ali
Aspirin presentation slides by Dr. Rewas AliAspirin presentation slides by Dr. Rewas Ali
Aspirin presentation slides by Dr. Rewas AliRewAs ALI
 
Asthma Review - GINA guidelines summary 2024
Asthma Review - GINA guidelines summary 2024Asthma Review - GINA guidelines summary 2024
Asthma Review - GINA guidelines summary 2024Gabriel Guevara MD
 
Call Girls Electronic City Just Call 7001305949 Top Class Call Girl Service A...
Call Girls Electronic City Just Call 7001305949 Top Class Call Girl Service A...Call Girls Electronic City Just Call 7001305949 Top Class Call Girl Service A...
Call Girls Electronic City Just Call 7001305949 Top Class Call Girl Service A...narwatsonia7
 
Call Girls Frazer Town Just Call 7001305949 Top Class Call Girl Service Avail...
Call Girls Frazer Town Just Call 7001305949 Top Class Call Girl Service Avail...Call Girls Frazer Town Just Call 7001305949 Top Class Call Girl Service Avail...
Call Girls Frazer Town Just Call 7001305949 Top Class Call Girl Service Avail...narwatsonia7
 
Call Girl Lucknow Mallika 7001305949 Independent Escort Service Lucknow
Call Girl Lucknow Mallika 7001305949 Independent Escort Service LucknowCall Girl Lucknow Mallika 7001305949 Independent Escort Service Lucknow
Call Girl Lucknow Mallika 7001305949 Independent Escort Service Lucknownarwatsonia7
 
Call Girls ITPL Just Call 7001305949 Top Class Call Girl Service Available
Call Girls ITPL Just Call 7001305949 Top Class Call Girl Service AvailableCall Girls ITPL Just Call 7001305949 Top Class Call Girl Service Available
Call Girls ITPL Just Call 7001305949 Top Class Call Girl Service Availablenarwatsonia7
 
Call Girls Service in Bommanahalli - 7001305949 with real photos and phone nu...
Call Girls Service in Bommanahalli - 7001305949 with real photos and phone nu...Call Girls Service in Bommanahalli - 7001305949 with real photos and phone nu...
Call Girls Service in Bommanahalli - 7001305949 with real photos and phone nu...narwatsonia7
 
Call Girls Service Chennai Jiya 7001305949 Independent Escort Service Chennai
Call Girls Service Chennai Jiya 7001305949 Independent Escort Service ChennaiCall Girls Service Chennai Jiya 7001305949 Independent Escort Service Chennai
Call Girls Service Chennai Jiya 7001305949 Independent Escort Service ChennaiNehru place Escorts
 
Call Girl Service Bidadi - For 7001305949 Cheap & Best with original Photos
Call Girl Service Bidadi - For 7001305949 Cheap & Best with original PhotosCall Girl Service Bidadi - For 7001305949 Cheap & Best with original Photos
Call Girl Service Bidadi - For 7001305949 Cheap & Best with original Photosnarwatsonia7
 
Call Girls Jp Nagar Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Jp Nagar Just Call 7001305949 Top Class Call Girl Service AvailableCall Girls Jp Nagar Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Jp Nagar Just Call 7001305949 Top Class Call Girl Service Availablenarwatsonia7
 
Call Girl Surat Madhuri 7001305949 Independent Escort Service Surat
Call Girl Surat Madhuri 7001305949 Independent Escort Service SuratCall Girl Surat Madhuri 7001305949 Independent Escort Service Surat
Call Girl Surat Madhuri 7001305949 Independent Escort Service Suratnarwatsonia7
 
Russian Call Girls in Pune Riya 9907093804 Short 1500 Night 6000 Best call gi...
Russian Call Girls in Pune Riya 9907093804 Short 1500 Night 6000 Best call gi...Russian Call Girls in Pune Riya 9907093804 Short 1500 Night 6000 Best call gi...
Russian Call Girls in Pune Riya 9907093804 Short 1500 Night 6000 Best call gi...Miss joya
 
Call Girls Hsr Layout Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Hsr Layout Just Call 7001305949 Top Class Call Girl Service AvailableCall Girls Hsr Layout Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Hsr Layout Just Call 7001305949 Top Class Call Girl Service Availablenarwatsonia7
 
Housewife Call Girls Hoskote | 7001305949 At Low Cost Cash Payment Booking
Housewife Call Girls Hoskote | 7001305949 At Low Cost Cash Payment BookingHousewife Call Girls Hoskote | 7001305949 At Low Cost Cash Payment Booking
Housewife Call Girls Hoskote | 7001305949 At Low Cost Cash Payment Bookingnarwatsonia7
 
Vip Call Girls Anna Salai Chennai 👉 8250192130 ❣️💯 Top Class Girls Available
Vip Call Girls Anna Salai Chennai 👉 8250192130 ❣️💯 Top Class Girls AvailableVip Call Girls Anna Salai Chennai 👉 8250192130 ❣️💯 Top Class Girls Available
Vip Call Girls Anna Salai Chennai 👉 8250192130 ❣️💯 Top Class Girls AvailableNehru place Escorts
 
Russian Call Girl Brookfield - 7001305949 Escorts Service 50% Off with Cash O...
Russian Call Girl Brookfield - 7001305949 Escorts Service 50% Off with Cash O...Russian Call Girl Brookfield - 7001305949 Escorts Service 50% Off with Cash O...
Russian Call Girl Brookfield - 7001305949 Escorts Service 50% Off with Cash O...narwatsonia7
 
Ahmedabad Call Girls CG Road 🔝9907093804 Short 1500 💋 Night 6000
Ahmedabad Call Girls CG Road 🔝9907093804  Short 1500  💋 Night 6000Ahmedabad Call Girls CG Road 🔝9907093804  Short 1500  💋 Night 6000
Ahmedabad Call Girls CG Road 🔝9907093804 Short 1500 💋 Night 6000aliya bhat
 

Recently uploaded (20)

Aspirin presentation slides by Dr. Rewas Ali
Aspirin presentation slides by Dr. Rewas AliAspirin presentation slides by Dr. Rewas Ali
Aspirin presentation slides by Dr. Rewas Ali
 
Asthma Review - GINA guidelines summary 2024
Asthma Review - GINA guidelines summary 2024Asthma Review - GINA guidelines summary 2024
Asthma Review - GINA guidelines summary 2024
 
Call Girls Electronic City Just Call 7001305949 Top Class Call Girl Service A...
Call Girls Electronic City Just Call 7001305949 Top Class Call Girl Service A...Call Girls Electronic City Just Call 7001305949 Top Class Call Girl Service A...
Call Girls Electronic City Just Call 7001305949 Top Class Call Girl Service A...
 
Call Girls Frazer Town Just Call 7001305949 Top Class Call Girl Service Avail...
Call Girls Frazer Town Just Call 7001305949 Top Class Call Girl Service Avail...Call Girls Frazer Town Just Call 7001305949 Top Class Call Girl Service Avail...
Call Girls Frazer Town Just Call 7001305949 Top Class Call Girl Service Avail...
 
Call Girl Lucknow Mallika 7001305949 Independent Escort Service Lucknow
Call Girl Lucknow Mallika 7001305949 Independent Escort Service LucknowCall Girl Lucknow Mallika 7001305949 Independent Escort Service Lucknow
Call Girl Lucknow Mallika 7001305949 Independent Escort Service Lucknow
 
sauth delhi call girls in Bhajanpura 🔝 9953056974 🔝 escort Service
sauth delhi call girls in Bhajanpura 🔝 9953056974 🔝 escort Servicesauth delhi call girls in Bhajanpura 🔝 9953056974 🔝 escort Service
sauth delhi call girls in Bhajanpura 🔝 9953056974 🔝 escort Service
 
Call Girls ITPL Just Call 7001305949 Top Class Call Girl Service Available
Call Girls ITPL Just Call 7001305949 Top Class Call Girl Service AvailableCall Girls ITPL Just Call 7001305949 Top Class Call Girl Service Available
Call Girls ITPL Just Call 7001305949 Top Class Call Girl Service Available
 
Call Girls Service in Bommanahalli - 7001305949 with real photos and phone nu...
Call Girls Service in Bommanahalli - 7001305949 with real photos and phone nu...Call Girls Service in Bommanahalli - 7001305949 with real photos and phone nu...
Call Girls Service in Bommanahalli - 7001305949 with real photos and phone nu...
 
Call Girls Service Chennai Jiya 7001305949 Independent Escort Service Chennai
Call Girls Service Chennai Jiya 7001305949 Independent Escort Service ChennaiCall Girls Service Chennai Jiya 7001305949 Independent Escort Service Chennai
Call Girls Service Chennai Jiya 7001305949 Independent Escort Service Chennai
 
Call Girl Service Bidadi - For 7001305949 Cheap & Best with original Photos
Call Girl Service Bidadi - For 7001305949 Cheap & Best with original PhotosCall Girl Service Bidadi - For 7001305949 Cheap & Best with original Photos
Call Girl Service Bidadi - For 7001305949 Cheap & Best with original Photos
 
Russian Call Girls in Delhi Tanvi ➡️ 9711199012 💋📞 Independent Escort Service...
Russian Call Girls in Delhi Tanvi ➡️ 9711199012 💋📞 Independent Escort Service...Russian Call Girls in Delhi Tanvi ➡️ 9711199012 💋📞 Independent Escort Service...
Russian Call Girls in Delhi Tanvi ➡️ 9711199012 💋📞 Independent Escort Service...
 
Call Girls Jp Nagar Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Jp Nagar Just Call 7001305949 Top Class Call Girl Service AvailableCall Girls Jp Nagar Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Jp Nagar Just Call 7001305949 Top Class Call Girl Service Available
 
Call Girl Surat Madhuri 7001305949 Independent Escort Service Surat
Call Girl Surat Madhuri 7001305949 Independent Escort Service SuratCall Girl Surat Madhuri 7001305949 Independent Escort Service Surat
Call Girl Surat Madhuri 7001305949 Independent Escort Service Surat
 
Escort Service Call Girls In Sarita Vihar,, 99530°56974 Delhi NCR
Escort Service Call Girls In Sarita Vihar,, 99530°56974 Delhi NCREscort Service Call Girls In Sarita Vihar,, 99530°56974 Delhi NCR
Escort Service Call Girls In Sarita Vihar,, 99530°56974 Delhi NCR
 
Russian Call Girls in Pune Riya 9907093804 Short 1500 Night 6000 Best call gi...
Russian Call Girls in Pune Riya 9907093804 Short 1500 Night 6000 Best call gi...Russian Call Girls in Pune Riya 9907093804 Short 1500 Night 6000 Best call gi...
Russian Call Girls in Pune Riya 9907093804 Short 1500 Night 6000 Best call gi...
 
Call Girls Hsr Layout Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Hsr Layout Just Call 7001305949 Top Class Call Girl Service AvailableCall Girls Hsr Layout Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Hsr Layout Just Call 7001305949 Top Class Call Girl Service Available
 
Housewife Call Girls Hoskote | 7001305949 At Low Cost Cash Payment Booking
Housewife Call Girls Hoskote | 7001305949 At Low Cost Cash Payment BookingHousewife Call Girls Hoskote | 7001305949 At Low Cost Cash Payment Booking
Housewife Call Girls Hoskote | 7001305949 At Low Cost Cash Payment Booking
 
Vip Call Girls Anna Salai Chennai 👉 8250192130 ❣️💯 Top Class Girls Available
Vip Call Girls Anna Salai Chennai 👉 8250192130 ❣️💯 Top Class Girls AvailableVip Call Girls Anna Salai Chennai 👉 8250192130 ❣️💯 Top Class Girls Available
Vip Call Girls Anna Salai Chennai 👉 8250192130 ❣️💯 Top Class Girls Available
 
Russian Call Girl Brookfield - 7001305949 Escorts Service 50% Off with Cash O...
Russian Call Girl Brookfield - 7001305949 Escorts Service 50% Off with Cash O...Russian Call Girl Brookfield - 7001305949 Escorts Service 50% Off with Cash O...
Russian Call Girl Brookfield - 7001305949 Escorts Service 50% Off with Cash O...
 
Ahmedabad Call Girls CG Road 🔝9907093804 Short 1500 💋 Night 6000
Ahmedabad Call Girls CG Road 🔝9907093804  Short 1500  💋 Night 6000Ahmedabad Call Girls CG Road 🔝9907093804  Short 1500  💋 Night 6000
Ahmedabad Call Girls CG Road 🔝9907093804 Short 1500 💋 Night 6000
 

Crowds Cure Canver: Annotating Data from The Cancer Imaging Archive

  • 1. http://cancerimagingarchive.net Justin Kirby – justin.kirby@nih.gov Frederick National Laboratory for Cancer Research Leidos Biomedical Research, Inc. Informatics in Cancer Imaging (ICI) Team Support to: Cancer Imaging Program/DCTD/NCI Crowds Cure Cancer: Annotating data from Citizen Science WG Meeting: Oct 2018
  • 2. 2 The Cancer Imaging Archive: Brief intro • 87 data sets (20 terabytes) consisting of 41,000 subjects (33 million radiology images) • Covers radiology modalities (CT/MR/PET/RT) and digitized pathology slides • Wide variety of cancers + phantoms • Patient populations vary from a handful to >26,000 (NLST) • Many have associated meta-data  Demographics/outcomes/therapy  Radiologist expert and automated computational analyses (segmentations, features) • ‘Omics ties to GDC/TCGA, CPTAC, and GEO http://www.cancerimagingarchive.net
  • 3. 3 Tackling the de-identification challenge PHI can appear in hundreds of places in DICOM • Dates • Identifiers • Descriptions Potential legal risks are a significant barrier to data sharing for research
  • 4. 4 TCIA services (not just software) Relieves PI of majority of data sharing burden/risks • Data hosting with >99% uptime • De-identification using pre-configured RSNA’s Clinical Trials Processor (CTP) and DICOM PS 3.15 Annex E standards • Multi-tiered QC process inspects both DICOM headers and pixels for PHI and integrity of data set Phone/email support available for end users and submitters Extensive documentation throughout the site Exposure to a large community of researchers • Increase visibility of your work, get more citations!
  • 5. 5 TCIA Services: Staffing TCIA Management at UAMS Prior, Smith Data Collection: Programs Bennett, Berryman, Billelo Data Collection: Research Collections Jarosz, Stockton, Honomichl Data Collection: Pathology Sharma, Birminam, TBD APOLLO (clinical/backlog) Levine, Angelus, TBD Clinical Trials TBD Infrastructure Support Smith, Nolan, Dobbins, Tarbox, Tobler, Frund, Utecht, Brown CIP ICI Management Freymann Kirby Sullivan Cordeiro Hill
  • 6. 6 Organization of TCIA ecosystem The Cancer Imaging Archive Data Collection Center •Tools and staffing to support data collection, curation, and de- identification Data Access •Browse (home page) •Filter/Search (Data Portal) •REST API •Analysis Data Data Analysis Centers •3rd party web sites or tools which connect to TCIA’s API or mirror its data
  • 7. 7 Data Collection Center: Publish Your Data Primary Data (radiology, pathology, clinical, etc) Analysis Results (derived from primary data) Image credit: Hugo Aerts
  • 8. 8 Data Collection Center: Publishing data in addition to manuscripts Data citations for both primary and analysis data to enable reproducible research Analysis Dataset Citation (derived image features) Gutman DA, Cooper LA, Hwang SN, Holder CA, Gao J, Aurora TD, Dunn WD Jr, Scarpace L, Mikkelsen T, Jain R, Wintermark M, Jilwan M, Raghavan P, Huang E, Clifford RJ, Mongkolwat P, Kleper V, Freymann J, Kirby J, Zinn PO, Moreno CS, Jaffe C, Colen R, Rubin DL, Saltz J, Flanders A, Brat DJ. (2014). MR Imaging Predictors of Molecular Profile and Survival: Multi- institutional Study of the TCGA Glioblastoma Data Set. The Cancer Imaging Archive. http://doi.org/10.7937/K9/TCIA.2014.4HTXYRCN Publication Citation (cites specific data used) MR imaging predictors of molecular profile and survival: multi- institutional study of the TCGA glioblastoma data set. Radiology. 2013 May;267(2):560-9. doi: 10.1148/radiol.13120118. Epub 2013 Feb 7. PubMed PMID: 23392431; PubMed Central PMCID: PMC3632807. Primary Data Citation (TCIA images used for study) Scarpace, L., Mikkelsen, T., Cha, soonmee, Rao, S., Tekchandani, S., Gutman, D., … Pierce, L. J. (2016). Radiology Data from The Cancer Genome Atlas Glioblastoma Multiforme [TCGA-GBM] collection. The Cancer Imaging Archive. http://doi.org/10.7937/K9/TCIA.2016.RNYFUYE9
  • 9. 9 Data Descriptor Journals Journal Recommended Repositories Nature Scientific Data https://www.nature.com/sdata/policies/repositories#imaging Medical Physics http://aapm.onlinelibrary.wiley.com/hub/journal/10.1002/(ISSN)2473-4209/about/author- guidelines.html (see section 13-Medical Physics Dataset Articles) Elsevier Data in Brief http://www.elsevier.com/authors/author-services/research-data/data-base-linking/supported- data-repositories#Health PLOS ONE http://journals.plos.org/plosone/s/data-availability#loc-recommended-repositories Research Data Support https://www.springernature.com/gp/authors/research-data-policy/repositories-bio/12327160 Publish detailed descriptions about how to use your TCIA data to gain academic credit (publication/citations) in addition to the novel scientific findings you might publish in traditional journals.
  • 10. 10 Researchers want to share data – 38 data set queue Community Proposed Data Sets GBM-DSC-MRI-DRO ASCC TNM Consensus Colorectal Liver Metastases QIN-BREAST-02 MyelomaTT3a Low Dose CT Liver Metastases Lung Fused-CT-Pathology HNSCC Oropharyngeal Radiomics OPC-Radiomics Oropharynx Phantoms HNSCC 3D CT RT MSK Pancreatic Cancer Repository Program Data Sets Notes TCGA 2 collection//sites CPTAC 9 cancer types, 14 sites Exceptional Responders 24 of 58 subjects remaining Immunotherapy 2 cancer types PDX mouse Not started NCTN integration RTOG 0617 pilot in process QIN ECOG-ACRIN 10 trials
  • 11. 11 New funding to collect imaging from NCTN Clinical Imaging
  • 12. 12 A significantly growing community! 38 incoming data sets in varying stages of curation Over 10,000 active users per month • Up from ~3,000/month in 2015 Downloads of 40-50TB per month • Up from ~2TB/month in 2015 613 publications based on TCIA data • 134 new publications in 2017
  • 13. 13 Crowds Cure Cancer: The motivation Continued growth: Engage researchers outside the medical community Computer science researchers often lack disease understanding or sufficient contact with medical experts Labeling basic information in the images such as tumor locations to create “training data” can enable others to apply new methods to our data TCIA has this kind of annotation for some data sets, but not all Performing such annotations/labeling using a small number of radiologists is extremely time consuming and expensive
  • 15. 15 TCIA Acknowledgements University of Arkansas Dr. Fred Prior Dr. Lawrence Tarbox Kirk Smith Bill Bennett Tracy Nolan Julie Frund Sean Berryman Dwayne Dobbins Quasar Jarosz Jeff Tobler Sonya Utecht Diana Stockton Betty Levine Erica Bilello Geri Blake Robert Brown Leidos Biomedical Research, Contract 16X011 for NCI, Maintenance and Extension of The Cancer Imaging Archive (TCIA ) (Prior) NBIA Team (NCI/FNLCR/Ellumen) Ed Helton Ulli Wagner Scott Gustafson Qinyan Pan Russ Rieling Carolyn Klinger Martin Lerner Tin Tran Contractor John Perry Frederick National Laboratory for Cancer Research (FNLCR) John Freymann Justin Kirby Brenda Fevrier-Sullivan Pam Angelus Carl Jaffe Luis Cordeiro Craig Hill Emory University Dr. Ashish Sharma Ryan Birmingham
  • 16. Crowds Cure Cancer October 1, 2018 Jayashree Kalpathy-Cramer
  • 17. Agenda  Background  Collaborators  System Development  General impressions from RSNA of the experiment • Feedback from participants  Current status/results  Features to be added  Next steps
  • 18. Background Annotated data is key to improving performance of machine learning algorithms Inter-rater agreement, even among experts, can be less than optimal • Helpful to have multiple annotations per case Getting annotations is expensive Hypothesis: crowd-sourced annotations, even derived from non-experts, can be used for machine learning
  • 19.
  • 20.
  • 21.
  • 22.
  • 23.
  • 24. Collaborators  Erik Ziegler/Gordon Harris (OHIF/MGH/ITCR)  Steve Pieper (ITCR)  Lawrence Tarbox, Jeff Tobler, Fred Prior (UAMS/ITCR)  Ashish Sharma (Emory/ITCR)  Jayashree Kalpathy-Cramer/Artem Mamonov (MGH/ITCR)  Justin Kirby, Brenda Fevrier-Sullivan, John Freyman (FNL)  Erich Huang, Paula Jacobs (NCI)  RSNA (Informatics)
  • 25. System Architecture  TCIA • Used images with known “truth” from TCGA studies/Carl Jaffe  Azure VM  Cornerstone Viewer • Lightweight version, mobile friendly!!  Chronicle DB (CouchDB) backend • DICOM aware!  Registration system • Logic for next case  JS/D3 for results (in progress) • Charts • Summary tables
  • 26. General Impressions  Overall seemed to be a success  Many participants ended up spending more time than planned  Viewer was responsive  No major complaints about system performance  Results are (very) promising
  • 27. Logistics  Better location? • Near case of the day  Coffee, water, cookies, chocolate  Wipes or alcohol wash for hands  Integration with RSNA app
  • 28. Sociology  Leaderboard needs to be prominently displayed • Maybe award stars?
  • 29. Annotators  Over 250 signed up • 211 had at least 1 annotation • 112 were radiologists  Selected • Liver: 189 • Lung: 211 • Renal: 165 • Ovarian: 133  Selected • 4: 120 • 3: 27 • 2: 29 • 1: 78
  • 31. Annotations per annotator Annotations per annotator  Misty_mandrill clear winner • Nocturnal sparrow catching up! • Rubbery_hamster close behind
  • 32. Time it takes to make an annotation • Typically 10-30 seconds
  • 34. Are some cases more difficult?
  • 35. Lung results y = 0.8574x + 5.0806 R² = 0.8379 0 10 20 30 40 50 60 70 80 0 10 20 30 40 50 60 70 80 Averagemeasurment ground truth measurement Comparisons to ground truth
  • 36. Feedback on interface  Need to start with a click through tutorial  Window/Level presets  Ability to change category mid session • Also important when multiple cancer types are chosen.  Mobile support?  Flag button for quality with text box for comments  No log in when username is empty  Logout button  Need metrics - time per session, scores per session  People asked to see the number of cases they annotated somewhere on the screen.  Add sagittal and coronal views for measurement accuracy  A few did not like the randomization of the images if they selected 3 or 4 cancer types – they complained about getting a bunch of all liver or all lung cases at a time; and said they would have preferred being given the options of which to do next.  System stalled if images for only one cancer type was selected and all the cases were completed.  Reorder buttons – keep essentials only for mobile  Teaching interface – provide immediate feedback compared to “consensus”?  Key up/down
  • 37. Features to be added  Annotators by type  Time histogram by type  Update charts to remove skips  Ability to all annotations for each case • screen shots of all annotations  Statistics on variability of measurement  Ability to filter out by type?? • Can’t filter out our best annotator (misty_mandrill) but can’t keep in junk?  Filter out by number of cases annotated • If <5, probably just trying to see what it is all about  Compare to ground truth • Add data for all cases • Convert AIM measurements to cornerstone  Length, approximate slice  Ability to overlay all annotations including GT
  • 38. Potential next steps for system • Permanently linked from TCIA • Road show to various meetings? • (Link to challenge platform)

Editor's Notes

  1. Provide DOIs to collections and meta-collections (article’s analysis) Publication can refer to the specific data sets used via the DOIs in the data citations Currently working with NLM, collaborating with Nature Scientific Data and other publications