SlideShare a Scribd company logo
Data Standards in Radiomics
Research
Andrey Fedorov, PhD
Molecular Medicine Tri-Conference
February 13, 2018
https://fedorov.github.io
Disclosures
Grant support from NIH/NCI under the Informatics Technology for Cancer
Research (ITCR) program:
● U24 CA180918 Quantitative Image Informatics for Cancer Research (QIICR) -
PIs Fedorov, Kikinis - http://qiicr.org
● U24 CA194354 Quantitative Radiomics System Decoding the Tumor
Phenotype - PI Aerts - https://radiomics.io
2
Radiomics analysis workflow
3
Lambin P, Rios-Velazquez E, Leijenaar R, Carvalho S, van Stiphout RGPM, Granton P, et al. Radiomics:
extracting more information from medical images using advanced feature analysis. Eur J Cancer. 2012;48:
441–446. doi:10.1016/j.ejca.2011.11.036
Kumar V, Gu Y, Basu S, Berglund A, Eschrich S a., Schabath MB, et al. Radiomics: the process and the
challenges. Magn Reson Imaging. Elsevier Inc.; 2012;30: 1234–1248. doi:10.1016/j.mri.2012.06.010
Images Regions Features Classes
Radiomics analysis “data view”
4
Images Regions Features Classes
“Swap” pipeline components
5
Images
Features Classes
Aggregate results
Images
Images
Regions
6
Images Regions Features Classes
Integration with patient record
7
Weber GM, Mandl KD, Kohane IS. Finding the
missing link for big biomedical data. JAMA.
2014;311(24):2479–2480
8
https://www.forbes.com/sites/gilpress/2016/03/23/dat
a-preparation-most-time-consuming-least-enjoyable-d
ata-science-task-survey-says/#5bbab7d56f63
“Cleaning Big Data: Most Time-Consuming, Least
Enjoyable Data Science Task, Survey Says”
9
Data science is more about data than methods
TCIA image directory
10
Standard for images
Digital Imaging and Communication in Medicine (DICOM) is the standard for
communication of medical imaging information and related data
● Compatibility with acquisition and archival tools
● Harmonized with other standards (HL7 CDA, JSON, XML, REST, FHIR)
● History of development and adoption since 1983
● Adopted by virtually all manufacturers of medical imaging equipment
● Open international community of stakeholders
● Continuously evolving standard
11
DICOM
Clinical images
12
DICOM Image
Pixel data:
● Sparse measurements sampling a 2d plane or 3d
volume
Non pixel data (a.k.a. “metadata”, or “header”):
● Patient identification, dates, image acquisition details,
unique identifiers of the dataset, pointers to related
evidence, body part imaged, …
13
TCIA image directory
14
TCIA analysis results directory
15
Example: Image Segmentations - Research formats
16
● encoding of pixel data
● common tasks
○ visualization, comparison of the results, image
fusion, quantitative measurements
● image processing domain-specific formats
○ “just enough” for typical image analysis tasks
○ image spacing, origin and direction cosines
○ NIFTI, NRRD, MHD, Analyze, …
● non-volumetric formats
○ planar
○ voxel intensity values
○ JPG, PNG, TIFF, …
Segmentations Composite context
● Patient identification, dates, image acquisition
details, unique identifiers of the dataset,
pointers to related evidence, annotation of the
body part imaged
Structured metadata about analysis result
● Structure segmented, tissue type, body
location, segmentation approach, references to
the source images
Research formats
17
DICOM - preparing for the unknown, since 1983
● Standard for images and image-related evidence
● Object definitions
○ Object type defines required and optional attributes
● For all object types: Composite context is formalized and required
○ Dates, patient IDs, study, series - for every object
● Unique identifiers
● References to related evidence
○ Provenance of data acquisition
○ Provenance of data analysis
● Fixed syntax
○ (hierarchical) list of attribute/value pairs
● Normalized semantics
○ Common data elements / lexicons / ontologies 18
Example: Image Segmentations - DICOM Segmentation image
19
In addition to the spatial labeling of
the voxels of interest:
● composite context
● semantics
● segmentation approach
● single file, efficient storage,
multiple occupancy voxels
● overlay color
No guessing about the meaning
20
Measurements (e.g., radiomics features)
● DICOM structured reporting
template
● Hierarchy of measurement groups
assigned per segmentation
● Coded and structured
○ Source image and segmentation
○ Anatomy
○ Quantity
○ Units
● Composite context
21
Radiomics feature extraction
● Free open source implementation of
radiomics features in Python
● Preprocessing filters
● Integrated with 3D Slicer
● Integration with MeV underway
22
Head and neck cancer use case
● Images and related evidence in DICOM:
○ PET/CT image data
○ segmentations
○ PET quantitative features
○ Clinical data (demographics, Dx, Tx)
● Data: TCIA QIN-HEADNECK collection
● Tools: public in dcmqi and 3D Slicer
● Demo: http://qiicr.org/dicom4miccai/
Beichel et al. 2016. Semiautomated segmentation of head and neck
cancers in 18F-FDG PET scans: A just-enough-interaction approach.
Medical physics 43:2948. DOI: 10.1118/1.4948679.
23
non-DICOM
Conversion to DICOM
DICOM
DICOM
dcmqi
JSON
Composite context is propagated from the
source image data
Analysis-specific metadata is
defined by the user and
parametrized by a JSON-Schema
Segmentation image volume
in any format readable by ITK
(NRRD, NIfTI, Analyze, MHD)
24
https://github.com/qiicr/dcmqi
Free open source, binaries for
Win/Mac/Linux, Docker image
Conversion from DICOM - free open source tools available
25
XML, JSON
SQL tables,
CSV, Excel
Volumetric research
formats
(NRRD, NIfTI,
MHD)
PDF, HTML
(human-readable
reports)
DICOM4QI
● Connectathon and demonstration of interoperability of standardized
analysis results at the RSNA, since 2015
○ 4 types of QI DICOM objects (segmentations, parametric maps, volumetric measurements,
tractography)
○ 11 platforms participated (including 5 commercial)
● Free open source tools that support the standard for QI analysis results
○ 3D Slicer (desktop platform and application)
○ ePAD (browser-based application)
○ MITK (desktop platform and application)
○ OHIF LesionTracker / Cornerstone (browser-based application and toolkit)
○ vtk-dicom (desktop)
26
https://qiicr.gitbooks.io/dicom4qi
27
NIH Cancer Research Data Commons
Response to Blue Ribbon panel
recommendation: “to collect, share, and
interconnect a broad array of large
datasets so that researchers, clinicians,
and patients will be able to both
contribute and analyze data, facilitating
discovery that will ultimately improve
patient care and outcomes”
● Domain-specialized nodes
● Containerized standardized tools
● Node-specific consistent identifiers
and semantics
Hinkson IV, Davidsen TM, Klemm JD, Kerlavage AR, Kibbe WA. A
Comprehensive Infrastructure for Big Data in Cancer Research: Accelerating
Cancer Research and Precision Medicine. Front Cell Dev Biol. 2017;5:83 28
Few of the many challenges that need to be addressed
● Radiomics data sharing is in its infancy
● No tools to query/discover/interpret standardized data
● Support in commercial tools is very limited
○ —> data collected during routine clinical care is not standardized
● Clinical trial datasets are not standardized
○ Annotations often are not even machine readable
● Adherence to the standards is inconsistent
● No ontologies to describe research outputs (radiomics, filters, parameters)
● Standardization/harmonization of clinical data
29
In conclusion
● Data standardization is required
○ Cloud computing
○ Data-thirsty AI methods
○ Funders, publishers, regulators are increasingly requiring data sharing
● DICOM can help standardize radiomics outputs
○ Capabilities demonstrated
○ Supporting open source tools are emerging
○ Adoption is growing
● Joint effort is required to address standardization
○ Multi-domain expertise is needed
○ Community building
○ Aggregate datasets for analysis
30
Thank you!
Questions? Comments? More info?
andrey.fedorov@gmail.com
http://qiicr.org
31

More Related Content

What's hot

Big Databases and Outcome Research - Opportunities and Challenges for Radiati...
Big Databases and Outcome Research - Opportunities and Challenges for Radiati...Big Databases and Outcome Research - Opportunities and Challenges for Radiati...
Big Databases and Outcome Research - Opportunities and Challenges for Radiati...
Andre Dekker
 
Twenty Years of Whole Slide Imaging - the Coming Phase Change
Twenty Years of Whole Slide Imaging - the Coming Phase ChangeTwenty Years of Whole Slide Imaging - the Coming Phase Change
Twenty Years of Whole Slide Imaging - the Coming Phase Change
Joel Saltz
 
Dekker trog - big data for radiation oncology - 2017
Dekker   trog  - big data for radiation oncology - 2017Dekker   trog  - big data for radiation oncology - 2017
Dekker trog - big data for radiation oncology - 2017
Andre Dekker
 
Artificial Intelligence in pathology
Artificial Intelligence in pathologyArtificial Intelligence in pathology
Artificial Intelligence in pathology
nehaSingh1543
 
Digital Pathology, FDA Approval and Precision Medicine
Digital Pathology, FDA Approval and Precision MedicineDigital Pathology, FDA Approval and Precision Medicine
Digital Pathology, FDA Approval and Precision Medicine
Joel Saltz
 
Machine Learning and Deep Contemplation of Data
Machine Learning and Deep Contemplation of DataMachine Learning and Deep Contemplation of Data
Machine Learning and Deep Contemplation of Data
Joel Saltz
 
Digital pathology in developing country
Digital pathology in developing countryDigital pathology in developing country
Digital pathology in developing country
Dr. Ashish lakhey
 
Generation and Use of Quantitative Pathology Phenotype
Generation and Use of Quantitative Pathology PhenotypeGeneration and Use of Quantitative Pathology Phenotype
Generation and Use of Quantitative Pathology Phenotype
Joel Saltz
 
Dekker trog - learning outcome prediction models from cancer data - 2017
Dekker   trog  - learning outcome prediction models from cancer data - 2017Dekker   trog  - learning outcome prediction models from cancer data - 2017
Dekker trog - learning outcome prediction models from cancer data - 2017
Andre Dekker
 
Ph.D. Defense Presentation Slides (Changhee Han) カリスの東大博論審査会(公聴会)発表スライド Patho...
Ph.D. Defense Presentation Slides (Changhee Han) カリスの東大博論審査会(公聴会)発表スライド Patho...Ph.D. Defense Presentation Slides (Changhee Han) カリスの東大博論審査会(公聴会)発表スライド Patho...
Ph.D. Defense Presentation Slides (Changhee Han) カリスの東大博論審査会(公聴会)発表スライド Patho...
カリス 東大AI博士
 
High Dimensional Fused-Informatics
High Dimensional Fused-InformaticsHigh Dimensional Fused-Informatics
High Dimensional Fused-Informatics
Joel Saltz
 
Presentation clinical applications of Artificial Intelligence for radiation o...
Presentation clinical applications of Artificial Intelligence for radiation o...Presentation clinical applications of Artificial Intelligence for radiation o...
Presentation clinical applications of Artificial Intelligence for radiation o...
Jean-Emmanuel Bibault Bibault, MD, PhD
 
Twenty Years of Whole Slide Imaging - the Coming Phase Change
Twenty Years of Whole Slide Imaging - the Coming Phase ChangeTwenty Years of Whole Slide Imaging - the Coming Phase Change
Twenty Years of Whole Slide Imaging - the Coming Phase Change
Joel Saltz
 
USING DATA MINING TECHNIQUES FOR DIAGNOSIS AND PROGNOSIS OF CANCER DISEASE
USING DATA MINING TECHNIQUES FOR DIAGNOSIS AND PROGNOSIS OF CANCER DISEASEUSING DATA MINING TECHNIQUES FOR DIAGNOSIS AND PROGNOSIS OF CANCER DISEASE
USING DATA MINING TECHNIQUES FOR DIAGNOSIS AND PROGNOSIS OF CANCER DISEASE
IJCSEIT Journal
 
Integrative Multi-Scale Analysis in Biomedical Data Science: Tools, Methods a...
Integrative Multi-Scale Analysis in Biomedical Data Science: Tools, Methods a...Integrative Multi-Scale Analysis in Biomedical Data Science: Tools, Methods a...
Integrative Multi-Scale Analysis in Biomedical Data Science: Tools, Methods a...
Joel Saltz
 
Model guided therapy and the role of dicom in surgery
Model guided therapy and the role of dicom in surgeryModel guided therapy and the role of dicom in surgery
Model guided therapy and the role of dicom in surgery
Klaus19
 
Definiens In Digital Pathology Hr
Definiens In Digital Pathology HrDefiniens In Digital Pathology Hr
Definiens In Digital Pathology Hr
Daniel Nicolson
 
Trampleasure VR [P1]
Trampleasure VR [P1]Trampleasure VR [P1]
Trampleasure VR [P1]
Oliver Trampleasure
 
Tools to Analyze Morphology and Spatially Mapped Molecular Data - Informatio...
Tools to Analyze Morphology and Spatially Mapped Molecular Data -  Informatio...Tools to Analyze Morphology and Spatially Mapped Molecular Data -  Informatio...
Tools to Analyze Morphology and Spatially Mapped Molecular Data - Informatio...
Joel Saltz
 
Computer Aided Diagnosis in Pathology: Pros & Cons by Dr. Liron Pantanowitz
Computer Aided Diagnosis in Pathology: Pros & Cons by Dr. Liron PantanowitzComputer Aided Diagnosis in Pathology: Pros & Cons by Dr. Liron Pantanowitz
Computer Aided Diagnosis in Pathology: Pros & Cons by Dr. Liron Pantanowitz
Cirdan
 

What's hot (20)

Big Databases and Outcome Research - Opportunities and Challenges for Radiati...
Big Databases and Outcome Research - Opportunities and Challenges for Radiati...Big Databases and Outcome Research - Opportunities and Challenges for Radiati...
Big Databases and Outcome Research - Opportunities and Challenges for Radiati...
 
Twenty Years of Whole Slide Imaging - the Coming Phase Change
Twenty Years of Whole Slide Imaging - the Coming Phase ChangeTwenty Years of Whole Slide Imaging - the Coming Phase Change
Twenty Years of Whole Slide Imaging - the Coming Phase Change
 
Dekker trog - big data for radiation oncology - 2017
Dekker   trog  - big data for radiation oncology - 2017Dekker   trog  - big data for radiation oncology - 2017
Dekker trog - big data for radiation oncology - 2017
 
Artificial Intelligence in pathology
Artificial Intelligence in pathologyArtificial Intelligence in pathology
Artificial Intelligence in pathology
 
Digital Pathology, FDA Approval and Precision Medicine
Digital Pathology, FDA Approval and Precision MedicineDigital Pathology, FDA Approval and Precision Medicine
Digital Pathology, FDA Approval and Precision Medicine
 
Machine Learning and Deep Contemplation of Data
Machine Learning and Deep Contemplation of DataMachine Learning and Deep Contemplation of Data
Machine Learning and Deep Contemplation of Data
 
Digital pathology in developing country
Digital pathology in developing countryDigital pathology in developing country
Digital pathology in developing country
 
Generation and Use of Quantitative Pathology Phenotype
Generation and Use of Quantitative Pathology PhenotypeGeneration and Use of Quantitative Pathology Phenotype
Generation and Use of Quantitative Pathology Phenotype
 
Dekker trog - learning outcome prediction models from cancer data - 2017
Dekker   trog  - learning outcome prediction models from cancer data - 2017Dekker   trog  - learning outcome prediction models from cancer data - 2017
Dekker trog - learning outcome prediction models from cancer data - 2017
 
Ph.D. Defense Presentation Slides (Changhee Han) カリスの東大博論審査会(公聴会)発表スライド Patho...
Ph.D. Defense Presentation Slides (Changhee Han) カリスの東大博論審査会(公聴会)発表スライド Patho...Ph.D. Defense Presentation Slides (Changhee Han) カリスの東大博論審査会(公聴会)発表スライド Patho...
Ph.D. Defense Presentation Slides (Changhee Han) カリスの東大博論審査会(公聴会)発表スライド Patho...
 
High Dimensional Fused-Informatics
High Dimensional Fused-InformaticsHigh Dimensional Fused-Informatics
High Dimensional Fused-Informatics
 
Presentation clinical applications of Artificial Intelligence for radiation o...
Presentation clinical applications of Artificial Intelligence for radiation o...Presentation clinical applications of Artificial Intelligence for radiation o...
Presentation clinical applications of Artificial Intelligence for radiation o...
 
Twenty Years of Whole Slide Imaging - the Coming Phase Change
Twenty Years of Whole Slide Imaging - the Coming Phase ChangeTwenty Years of Whole Slide Imaging - the Coming Phase Change
Twenty Years of Whole Slide Imaging - the Coming Phase Change
 
USING DATA MINING TECHNIQUES FOR DIAGNOSIS AND PROGNOSIS OF CANCER DISEASE
USING DATA MINING TECHNIQUES FOR DIAGNOSIS AND PROGNOSIS OF CANCER DISEASEUSING DATA MINING TECHNIQUES FOR DIAGNOSIS AND PROGNOSIS OF CANCER DISEASE
USING DATA MINING TECHNIQUES FOR DIAGNOSIS AND PROGNOSIS OF CANCER DISEASE
 
Integrative Multi-Scale Analysis in Biomedical Data Science: Tools, Methods a...
Integrative Multi-Scale Analysis in Biomedical Data Science: Tools, Methods a...Integrative Multi-Scale Analysis in Biomedical Data Science: Tools, Methods a...
Integrative Multi-Scale Analysis in Biomedical Data Science: Tools, Methods a...
 
Model guided therapy and the role of dicom in surgery
Model guided therapy and the role of dicom in surgeryModel guided therapy and the role of dicom in surgery
Model guided therapy and the role of dicom in surgery
 
Definiens In Digital Pathology Hr
Definiens In Digital Pathology HrDefiniens In Digital Pathology Hr
Definiens In Digital Pathology Hr
 
Trampleasure VR [P1]
Trampleasure VR [P1]Trampleasure VR [P1]
Trampleasure VR [P1]
 
Tools to Analyze Morphology and Spatially Mapped Molecular Data - Informatio...
Tools to Analyze Morphology and Spatially Mapped Molecular Data -  Informatio...Tools to Analyze Morphology and Spatially Mapped Molecular Data -  Informatio...
Tools to Analyze Morphology and Spatially Mapped Molecular Data - Informatio...
 
Computer Aided Diagnosis in Pathology: Pros & Cons by Dr. Liron Pantanowitz
Computer Aided Diagnosis in Pathology: Pros & Cons by Dr. Liron PantanowitzComputer Aided Diagnosis in Pathology: Pros & Cons by Dr. Liron Pantanowitz
Computer Aided Diagnosis in Pathology: Pros & Cons by Dr. Liron Pantanowitz
 

Similar to Data Standards in Radiomics Research

TCIA Update
TCIA UpdateTCIA Update
TCIA Update
imgcommcall
 
Standardized representation of the LIDC annotations using DICOM
Standardized representation of the LIDC annotations using DICOMStandardized representation of the LIDC annotations using DICOM
Standardized representation of the LIDC annotations using DICOM
imgcommcall
 
Pathomics, Clinical Studies, and Cancer Surveillance
Pathomics, Clinical Studies, and Cancer SurveillancePathomics, Clinical Studies, and Cancer Surveillance
Pathomics, Clinical Studies, and Cancer Surveillance
Joel Saltz
 
Imaging Community Call - Introduction
Imaging Community Call - IntroductionImaging Community Call - Introduction
Imaging Community Call - Introduction
imgcommcall
 
Imaging Data Commons (IDC) - Introduction and intital approach
Imaging Data Commons (IDC) - Introduction and intital approachImaging Data Commons (IDC) - Introduction and intital approach
Imaging Data Commons (IDC) - Introduction and intital approach
imgcommcall
 
P.Surendar - VIVA PPT.pptx
P.Surendar - VIVA PPT.pptxP.Surendar - VIVA PPT.pptx
P.Surendar - VIVA PPT.pptx
surendar1989
 
TCIA Update
TCIA UpdateTCIA Update
TCIA Update
imgcommcall
 
Clinical Data Models - The Hyve - Bio IT World April 2019
Clinical Data Models - The Hyve - Bio IT World April 2019Clinical Data Models - The Hyve - Bio IT World April 2019
Clinical Data Models - The Hyve - Bio IT World April 2019
Kees van Bochove
 
Electronic health records and machine learning
Electronic health records and machine learningElectronic health records and machine learning
Electronic health records and machine learning
Eman Abdelrazik
 
Fundamentals and Innovations in medical imaging.pptx
Fundamentals and Innovations in medical imaging.pptxFundamentals and Innovations in medical imaging.pptx
Fundamentals and Innovations in medical imaging.pptx
Hemant Rakesh
 
Medical Imaging at UPC - Elisa Sayrol - UPC Barcelona 2018
Medical Imaging at UPC - Elisa Sayrol - UPC Barcelona 2018Medical Imaging at UPC - Elisa Sayrol - UPC Barcelona 2018
Medical Imaging at UPC - Elisa Sayrol - UPC Barcelona 2018
Universitat Politècnica de Catalunya
 
2014 Medical Imaging
2014 Medical Imaging2014 Medical Imaging
2014 Medical Imaging
Engku Fahmi
 
A practical guide to using The Cancer Imaging Archive for QIN Challenges and ...
A practical guide to using The Cancer Imaging Archive for QIN Challenges and ...A practical guide to using The Cancer Imaging Archive for QIN Challenges and ...
A practical guide to using The Cancer Imaging Archive for QIN Challenges and ...
CancerImagingInforma
 
The Cancer imaging Phenomics Toolkit (CaPTk)
The Cancer imaging Phenomics Toolkit (CaPTk)The Cancer imaging Phenomics Toolkit (CaPTk)
The Cancer imaging Phenomics Toolkit (CaPTk)
imgcommcall
 
Artificial Neural Networks applications in Computer Aided Diagnosis. System d...
Artificial Neural Networks applications in Computer Aided Diagnosis. System d...Artificial Neural Networks applications in Computer Aided Diagnosis. System d...
Artificial Neural Networks applications in Computer Aided Diagnosis. System d...
Technological Ecosystems for Enhancing Multiculturality
 
NCATS CTSA N3C
NCATS CTSA N3C NCATS CTSA N3C
NCATS CTSA N3C
Warren Kibbe
 
IMAGING BIOMARKER PANELS AND MULTI-OMICS AI MODELS FOR OUTCOMES PREDICTION
IMAGING BIOMARKER PANELS AND MULTI-OMICS AI MODELS FOR OUTCOMES PREDICTIONIMAGING BIOMARKER PANELS AND MULTI-OMICS AI MODELS FOR OUTCOMES PREDICTION
IMAGING BIOMARKER PANELS AND MULTI-OMICS AI MODELS FOR OUTCOMES PREDICTION
iQHub
 
radiotherapy.pptx
radiotherapy.pptxradiotherapy.pptx
radiotherapy.pptx
Behgalcancer Hospital
 
Data Mining Techniques In Computer Aided Cancer Diagnosis
Data Mining Techniques In Computer Aided Cancer DiagnosisData Mining Techniques In Computer Aided Cancer Diagnosis
Data Mining Techniques In Computer Aided Cancer Diagnosis
Datamining Tools
 
Data Mining Techniques In Computer Aided Cancer Diagnosis
Data Mining Techniques In Computer Aided Cancer DiagnosisData Mining Techniques In Computer Aided Cancer Diagnosis
Data Mining Techniques In Computer Aided Cancer Diagnosis
DataminingTools Inc
 

Similar to Data Standards in Radiomics Research (20)

TCIA Update
TCIA UpdateTCIA Update
TCIA Update
 
Standardized representation of the LIDC annotations using DICOM
Standardized representation of the LIDC annotations using DICOMStandardized representation of the LIDC annotations using DICOM
Standardized representation of the LIDC annotations using DICOM
 
Pathomics, Clinical Studies, and Cancer Surveillance
Pathomics, Clinical Studies, and Cancer SurveillancePathomics, Clinical Studies, and Cancer Surveillance
Pathomics, Clinical Studies, and Cancer Surveillance
 
Imaging Community Call - Introduction
Imaging Community Call - IntroductionImaging Community Call - Introduction
Imaging Community Call - Introduction
 
Imaging Data Commons (IDC) - Introduction and intital approach
Imaging Data Commons (IDC) - Introduction and intital approachImaging Data Commons (IDC) - Introduction and intital approach
Imaging Data Commons (IDC) - Introduction and intital approach
 
P.Surendar - VIVA PPT.pptx
P.Surendar - VIVA PPT.pptxP.Surendar - VIVA PPT.pptx
P.Surendar - VIVA PPT.pptx
 
TCIA Update
TCIA UpdateTCIA Update
TCIA Update
 
Clinical Data Models - The Hyve - Bio IT World April 2019
Clinical Data Models - The Hyve - Bio IT World April 2019Clinical Data Models - The Hyve - Bio IT World April 2019
Clinical Data Models - The Hyve - Bio IT World April 2019
 
Electronic health records and machine learning
Electronic health records and machine learningElectronic health records and machine learning
Electronic health records and machine learning
 
Fundamentals and Innovations in medical imaging.pptx
Fundamentals and Innovations in medical imaging.pptxFundamentals and Innovations in medical imaging.pptx
Fundamentals and Innovations in medical imaging.pptx
 
Medical Imaging at UPC - Elisa Sayrol - UPC Barcelona 2018
Medical Imaging at UPC - Elisa Sayrol - UPC Barcelona 2018Medical Imaging at UPC - Elisa Sayrol - UPC Barcelona 2018
Medical Imaging at UPC - Elisa Sayrol - UPC Barcelona 2018
 
2014 Medical Imaging
2014 Medical Imaging2014 Medical Imaging
2014 Medical Imaging
 
A practical guide to using The Cancer Imaging Archive for QIN Challenges and ...
A practical guide to using The Cancer Imaging Archive for QIN Challenges and ...A practical guide to using The Cancer Imaging Archive for QIN Challenges and ...
A practical guide to using The Cancer Imaging Archive for QIN Challenges and ...
 
The Cancer imaging Phenomics Toolkit (CaPTk)
The Cancer imaging Phenomics Toolkit (CaPTk)The Cancer imaging Phenomics Toolkit (CaPTk)
The Cancer imaging Phenomics Toolkit (CaPTk)
 
Artificial Neural Networks applications in Computer Aided Diagnosis. System d...
Artificial Neural Networks applications in Computer Aided Diagnosis. System d...Artificial Neural Networks applications in Computer Aided Diagnosis. System d...
Artificial Neural Networks applications in Computer Aided Diagnosis. System d...
 
NCATS CTSA N3C
NCATS CTSA N3C NCATS CTSA N3C
NCATS CTSA N3C
 
IMAGING BIOMARKER PANELS AND MULTI-OMICS AI MODELS FOR OUTCOMES PREDICTION
IMAGING BIOMARKER PANELS AND MULTI-OMICS AI MODELS FOR OUTCOMES PREDICTIONIMAGING BIOMARKER PANELS AND MULTI-OMICS AI MODELS FOR OUTCOMES PREDICTION
IMAGING BIOMARKER PANELS AND MULTI-OMICS AI MODELS FOR OUTCOMES PREDICTION
 
radiotherapy.pptx
radiotherapy.pptxradiotherapy.pptx
radiotherapy.pptx
 
Data Mining Techniques In Computer Aided Cancer Diagnosis
Data Mining Techniques In Computer Aided Cancer DiagnosisData Mining Techniques In Computer Aided Cancer Diagnosis
Data Mining Techniques In Computer Aided Cancer Diagnosis
 
Data Mining Techniques In Computer Aided Cancer Diagnosis
Data Mining Techniques In Computer Aided Cancer DiagnosisData Mining Techniques In Computer Aided Cancer Diagnosis
Data Mining Techniques In Computer Aided Cancer Diagnosis
 

Recently uploaded

THEMATIC APPERCEPTION TEST(TAT) cognitive abilities, creativity, and critic...
THEMATIC  APPERCEPTION  TEST(TAT) cognitive abilities, creativity, and critic...THEMATIC  APPERCEPTION  TEST(TAT) cognitive abilities, creativity, and critic...
THEMATIC APPERCEPTION TEST(TAT) cognitive abilities, creativity, and critic...
Abdul Wali Khan University Mardan,kP,Pakistan
 
Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...
Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...
Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...
Ana Luísa Pinho
 
ESR spectroscopy in liquid food and beverages.pptx
ESR spectroscopy in liquid food and beverages.pptxESR spectroscopy in liquid food and beverages.pptx
ESR spectroscopy in liquid food and beverages.pptx
PRIYANKA PATEL
 
Nucleophilic Addition of carbonyl compounds.pptx
Nucleophilic Addition of carbonyl  compounds.pptxNucleophilic Addition of carbonyl  compounds.pptx
Nucleophilic Addition of carbonyl compounds.pptx
SSR02
 
Oedema_types_causes_pathophysiology.pptx
Oedema_types_causes_pathophysiology.pptxOedema_types_causes_pathophysiology.pptx
Oedema_types_causes_pathophysiology.pptx
muralinath2
 
aziz sancar nobel prize winner: from mardin to nobel
aziz sancar nobel prize winner: from mardin to nobelaziz sancar nobel prize winner: from mardin to nobel
aziz sancar nobel prize winner: from mardin to nobel
İsa Badur
 
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
Sérgio Sacani
 
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...
University of Maribor
 
Shallowest Oil Discovery of Turkiye.pptx
Shallowest Oil Discovery of Turkiye.pptxShallowest Oil Discovery of Turkiye.pptx
Shallowest Oil Discovery of Turkiye.pptx
Gokturk Mehmet Dilci
 
Sharlene Leurig - Enabling Onsite Water Use with Net Zero Water
Sharlene Leurig - Enabling Onsite Water Use with Net Zero WaterSharlene Leurig - Enabling Onsite Water Use with Net Zero Water
Sharlene Leurig - Enabling Onsite Water Use with Net Zero Water
Texas Alliance of Groundwater Districts
 
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
yqqaatn0
 
The debris of the ‘last major merger’ is dynamically young
The debris of the ‘last major merger’ is dynamically youngThe debris of the ‘last major merger’ is dynamically young
The debris of the ‘last major merger’ is dynamically young
Sérgio Sacani
 
Equivariant neural networks and representation theory
Equivariant neural networks and representation theoryEquivariant neural networks and representation theory
Equivariant neural networks and representation theory
Daniel Tubbenhauer
 
20240520 Planning a Circuit Simulator in JavaScript.pptx
20240520 Planning a Circuit Simulator in JavaScript.pptx20240520 Planning a Circuit Simulator in JavaScript.pptx
20240520 Planning a Circuit Simulator in JavaScript.pptx
Sharon Liu
 
Thornton ESPP slides UK WW Network 4_6_24.pdf
Thornton ESPP slides UK WW Network 4_6_24.pdfThornton ESPP slides UK WW Network 4_6_24.pdf
Thornton ESPP slides UK WW Network 4_6_24.pdf
European Sustainable Phosphorus Platform
 
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
University of Maribor
 
8.Isolation of pure cultures and preservation of cultures.pdf
8.Isolation of pure cultures and preservation of cultures.pdf8.Isolation of pure cultures and preservation of cultures.pdf
8.Isolation of pure cultures and preservation of cultures.pdf
by6843629
 
Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.
Nistarini College, Purulia (W.B) India
 
如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样
如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样
如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样
yqqaatn0
 
What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.
moosaasad1975
 

Recently uploaded (20)

THEMATIC APPERCEPTION TEST(TAT) cognitive abilities, creativity, and critic...
THEMATIC  APPERCEPTION  TEST(TAT) cognitive abilities, creativity, and critic...THEMATIC  APPERCEPTION  TEST(TAT) cognitive abilities, creativity, and critic...
THEMATIC APPERCEPTION TEST(TAT) cognitive abilities, creativity, and critic...
 
Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...
Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...
Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...
 
ESR spectroscopy in liquid food and beverages.pptx
ESR spectroscopy in liquid food and beverages.pptxESR spectroscopy in liquid food and beverages.pptx
ESR spectroscopy in liquid food and beverages.pptx
 
Nucleophilic Addition of carbonyl compounds.pptx
Nucleophilic Addition of carbonyl  compounds.pptxNucleophilic Addition of carbonyl  compounds.pptx
Nucleophilic Addition of carbonyl compounds.pptx
 
Oedema_types_causes_pathophysiology.pptx
Oedema_types_causes_pathophysiology.pptxOedema_types_causes_pathophysiology.pptx
Oedema_types_causes_pathophysiology.pptx
 
aziz sancar nobel prize winner: from mardin to nobel
aziz sancar nobel prize winner: from mardin to nobelaziz sancar nobel prize winner: from mardin to nobel
aziz sancar nobel prize winner: from mardin to nobel
 
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
 
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...
 
Shallowest Oil Discovery of Turkiye.pptx
Shallowest Oil Discovery of Turkiye.pptxShallowest Oil Discovery of Turkiye.pptx
Shallowest Oil Discovery of Turkiye.pptx
 
Sharlene Leurig - Enabling Onsite Water Use with Net Zero Water
Sharlene Leurig - Enabling Onsite Water Use with Net Zero WaterSharlene Leurig - Enabling Onsite Water Use with Net Zero Water
Sharlene Leurig - Enabling Onsite Water Use with Net Zero Water
 
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
 
The debris of the ‘last major merger’ is dynamically young
The debris of the ‘last major merger’ is dynamically youngThe debris of the ‘last major merger’ is dynamically young
The debris of the ‘last major merger’ is dynamically young
 
Equivariant neural networks and representation theory
Equivariant neural networks and representation theoryEquivariant neural networks and representation theory
Equivariant neural networks and representation theory
 
20240520 Planning a Circuit Simulator in JavaScript.pptx
20240520 Planning a Circuit Simulator in JavaScript.pptx20240520 Planning a Circuit Simulator in JavaScript.pptx
20240520 Planning a Circuit Simulator in JavaScript.pptx
 
Thornton ESPP slides UK WW Network 4_6_24.pdf
Thornton ESPP slides UK WW Network 4_6_24.pdfThornton ESPP slides UK WW Network 4_6_24.pdf
Thornton ESPP slides UK WW Network 4_6_24.pdf
 
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
 
8.Isolation of pure cultures and preservation of cultures.pdf
8.Isolation of pure cultures and preservation of cultures.pdf8.Isolation of pure cultures and preservation of cultures.pdf
8.Isolation of pure cultures and preservation of cultures.pdf
 
Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.
 
如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样
如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样
如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样
 
What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.
 

Data Standards in Radiomics Research

  • 1. Data Standards in Radiomics Research Andrey Fedorov, PhD Molecular Medicine Tri-Conference February 13, 2018 https://fedorov.github.io
  • 2. Disclosures Grant support from NIH/NCI under the Informatics Technology for Cancer Research (ITCR) program: ● U24 CA180918 Quantitative Image Informatics for Cancer Research (QIICR) - PIs Fedorov, Kikinis - http://qiicr.org ● U24 CA194354 Quantitative Radiomics System Decoding the Tumor Phenotype - PI Aerts - https://radiomics.io 2
  • 3. Radiomics analysis workflow 3 Lambin P, Rios-Velazquez E, Leijenaar R, Carvalho S, van Stiphout RGPM, Granton P, et al. Radiomics: extracting more information from medical images using advanced feature analysis. Eur J Cancer. 2012;48: 441–446. doi:10.1016/j.ejca.2011.11.036 Kumar V, Gu Y, Basu S, Berglund A, Eschrich S a., Schabath MB, et al. Radiomics: the process and the challenges. Magn Reson Imaging. Elsevier Inc.; 2012;30: 1234–1248. doi:10.1016/j.mri.2012.06.010
  • 4. Images Regions Features Classes Radiomics analysis “data view” 4
  • 5. Images Regions Features Classes “Swap” pipeline components 5
  • 7. Images Regions Features Classes Integration with patient record 7
  • 8. Weber GM, Mandl KD, Kohane IS. Finding the missing link for big biomedical data. JAMA. 2014;311(24):2479–2480 8
  • 9. https://www.forbes.com/sites/gilpress/2016/03/23/dat a-preparation-most-time-consuming-least-enjoyable-d ata-science-task-survey-says/#5bbab7d56f63 “Cleaning Big Data: Most Time-Consuming, Least Enjoyable Data Science Task, Survey Says” 9 Data science is more about data than methods
  • 11. Standard for images Digital Imaging and Communication in Medicine (DICOM) is the standard for communication of medical imaging information and related data ● Compatibility with acquisition and archival tools ● Harmonized with other standards (HL7 CDA, JSON, XML, REST, FHIR) ● History of development and adoption since 1983 ● Adopted by virtually all manufacturers of medical imaging equipment ● Open international community of stakeholders ● Continuously evolving standard 11
  • 13. DICOM Image Pixel data: ● Sparse measurements sampling a 2d plane or 3d volume Non pixel data (a.k.a. “metadata”, or “header”): ● Patient identification, dates, image acquisition details, unique identifiers of the dataset, pointers to related evidence, body part imaged, … 13
  • 15. TCIA analysis results directory 15
  • 16. Example: Image Segmentations - Research formats 16 ● encoding of pixel data ● common tasks ○ visualization, comparison of the results, image fusion, quantitative measurements ● image processing domain-specific formats ○ “just enough” for typical image analysis tasks ○ image spacing, origin and direction cosines ○ NIFTI, NRRD, MHD, Analyze, … ● non-volumetric formats ○ planar ○ voxel intensity values ○ JPG, PNG, TIFF, …
  • 17. Segmentations Composite context ● Patient identification, dates, image acquisition details, unique identifiers of the dataset, pointers to related evidence, annotation of the body part imaged Structured metadata about analysis result ● Structure segmented, tissue type, body location, segmentation approach, references to the source images Research formats 17
  • 18. DICOM - preparing for the unknown, since 1983 ● Standard for images and image-related evidence ● Object definitions ○ Object type defines required and optional attributes ● For all object types: Composite context is formalized and required ○ Dates, patient IDs, study, series - for every object ● Unique identifiers ● References to related evidence ○ Provenance of data acquisition ○ Provenance of data analysis ● Fixed syntax ○ (hierarchical) list of attribute/value pairs ● Normalized semantics ○ Common data elements / lexicons / ontologies 18
  • 19. Example: Image Segmentations - DICOM Segmentation image 19 In addition to the spatial labeling of the voxels of interest: ● composite context ● semantics ● segmentation approach ● single file, efficient storage, multiple occupancy voxels ● overlay color
  • 20. No guessing about the meaning 20
  • 21. Measurements (e.g., radiomics features) ● DICOM structured reporting template ● Hierarchy of measurement groups assigned per segmentation ● Coded and structured ○ Source image and segmentation ○ Anatomy ○ Quantity ○ Units ● Composite context 21
  • 22. Radiomics feature extraction ● Free open source implementation of radiomics features in Python ● Preprocessing filters ● Integrated with 3D Slicer ● Integration with MeV underway 22
  • 23. Head and neck cancer use case ● Images and related evidence in DICOM: ○ PET/CT image data ○ segmentations ○ PET quantitative features ○ Clinical data (demographics, Dx, Tx) ● Data: TCIA QIN-HEADNECK collection ● Tools: public in dcmqi and 3D Slicer ● Demo: http://qiicr.org/dicom4miccai/ Beichel et al. 2016. Semiautomated segmentation of head and neck cancers in 18F-FDG PET scans: A just-enough-interaction approach. Medical physics 43:2948. DOI: 10.1118/1.4948679. 23
  • 24. non-DICOM Conversion to DICOM DICOM DICOM dcmqi JSON Composite context is propagated from the source image data Analysis-specific metadata is defined by the user and parametrized by a JSON-Schema Segmentation image volume in any format readable by ITK (NRRD, NIfTI, Analyze, MHD) 24 https://github.com/qiicr/dcmqi Free open source, binaries for Win/Mac/Linux, Docker image
  • 25. Conversion from DICOM - free open source tools available 25 XML, JSON SQL tables, CSV, Excel Volumetric research formats (NRRD, NIfTI, MHD) PDF, HTML (human-readable reports)
  • 26. DICOM4QI ● Connectathon and demonstration of interoperability of standardized analysis results at the RSNA, since 2015 ○ 4 types of QI DICOM objects (segmentations, parametric maps, volumetric measurements, tractography) ○ 11 platforms participated (including 5 commercial) ● Free open source tools that support the standard for QI analysis results ○ 3D Slicer (desktop platform and application) ○ ePAD (browser-based application) ○ MITK (desktop platform and application) ○ OHIF LesionTracker / Cornerstone (browser-based application and toolkit) ○ vtk-dicom (desktop) 26 https://qiicr.gitbooks.io/dicom4qi
  • 27. 27
  • 28. NIH Cancer Research Data Commons Response to Blue Ribbon panel recommendation: “to collect, share, and interconnect a broad array of large datasets so that researchers, clinicians, and patients will be able to both contribute and analyze data, facilitating discovery that will ultimately improve patient care and outcomes” ● Domain-specialized nodes ● Containerized standardized tools ● Node-specific consistent identifiers and semantics Hinkson IV, Davidsen TM, Klemm JD, Kerlavage AR, Kibbe WA. A Comprehensive Infrastructure for Big Data in Cancer Research: Accelerating Cancer Research and Precision Medicine. Front Cell Dev Biol. 2017;5:83 28
  • 29. Few of the many challenges that need to be addressed ● Radiomics data sharing is in its infancy ● No tools to query/discover/interpret standardized data ● Support in commercial tools is very limited ○ —> data collected during routine clinical care is not standardized ● Clinical trial datasets are not standardized ○ Annotations often are not even machine readable ● Adherence to the standards is inconsistent ● No ontologies to describe research outputs (radiomics, filters, parameters) ● Standardization/harmonization of clinical data 29
  • 30. In conclusion ● Data standardization is required ○ Cloud computing ○ Data-thirsty AI methods ○ Funders, publishers, regulators are increasingly requiring data sharing ● DICOM can help standardize radiomics outputs ○ Capabilities demonstrated ○ Supporting open source tools are emerging ○ Adoption is growing ● Joint effort is required to address standardization ○ Multi-domain expertise is needed ○ Community building ○ Aggregate datasets for analysis 30
  • 31. Thank you! Questions? Comments? More info? andrey.fedorov@gmail.com http://qiicr.org 31