Niffler: A DICOM Framework for Machine Learning and Processing Pipelines.

•Download as PPTX, PDF•

0 likes•593 views

Niffler is an efficient DICOM Framework for machine learning pipelines and processing workflows on metadata. It facilitates efficient transfer of DICOM images on-demand and real-time from PACS to the research environments, to run processing workflows and machine learning pipelines. https://github.com/Emory-HITI/Niffler/

Healthcare

Niffler: A DICOM Framework for
Machine Learning and Processing Pipelines.
Pradeeban Kathiravelu, PhD
Emory University,
Atlanta, GA.
2021, October 6th

Introduction
2
• Real-time execution of machine learning (ML)
pipelines on radiology images is hard
• limited computing resources in clinical
environments
• running them in research clusters?
• efficient data transfer
• processing capabilities.

Our Proposal – Niffler
3
• An open-source ML framework: https://github.com/Emory-HITI/Niffler
• DICOM Images: PACS ⇨ to process in Research Clusters.
• Real-time and retrospectively on-demand
• Extracts the textual metadata from DICOM headers
• in real-time
• stores in a scalable database.
• Workflows and ML pipelines on images and metadata.

Niffler Modules
4
• Real-time DICOM extractions (a data stream).
• On-demand DICOM extractions (simplified and
expanded C-FIND, C-MOVE, and C-FIND+C-MOVE).
• DICOM  PNG conversion.
• Workflows module (Niffler service chaining).
• RTA extraction (fetch data from RIS).
• DICOM  NifTi conversion (experimental).
• DICOM Anonymization (experimental).

Niffler On-demand queries
5
• Niffler provides a flexible one-command retrieval of
several studies.
• all at once at bulk, with pause-and-resume capabilities.
• A CSV file with DICOM headers.
• A config file indicating up to 3 DICOM C-FIND headers to
query from.
• Niffler internally issues several C-FIND and C-MOVE
combinations for each line in the CSV file.

Sample configuration (json)
6
• {
• "NifflerSystem": "system.json",
• "StorageFolder": "/opt/data/new-study",
• "FilePath": "{00100020}/{0020000D}/{0020000E}/{00080018}.dcm",
• "CsvFile": "csv/empi.csv",
• "NumberOfQueryAttributes": 1,
• "FirstAttr": "PatientID",
• "FirstIndex": 0,
• "SecondAttr": "AccessionNumber",
• "SecondIndex": 5,
• "ThirdAttr": "StudyDate",
• "ThirdIndex": 3,
• "DateFormat": "%Y%m%d",
• "SendEmail": true,
• "YourEmail": "test@test.test"
• }

Capabilities
7
• Flexible queries that were not possible or easy before.
• “Retrieve all the CT Images of the female patients from
August 2021” can be a one Niffler extraction based on
StudyDate, Modality, and PatientSex.

Architecture and Prototype Deployment
8
• Queries to multiple
PACS.
• A Real-Time Listener
and a Retrospective
Extractor.
• ML pipelines and
analytics as containers.

Evaluation
9
• 715 Scanners
• 350 GB/day
• A few practical use cases
• Real-time ML Pipelines on DICOM images.
• IVC Filter detection.
• Real-time processing of metadata.
• Calculating Scanner utilization.
• Scanner clock calibration.

Evaluation – Use case 1: IVC Filter Detection
10
• Pre-trained
models.
• Real-time
Execution.
• 96% accuracy.

Evaluation – Use case 2: Understanding
Operational Efficiency of MRI Systems
11
• Based on calculated metrics from exam timestamps.
• Evaluate against a clinical data warehouse (CDW).
• Timestamps from CDW more prone to human errors.
• False depiction of exam overlaps.

Conclusion
12
• Running the ML pipelines from a research cluster
• Feasibility and efficiency
• On images and their metadata
• Received in real-time and retrospectively from PACS.
Future work
• Niffler + real-time clinical feed ⇨ Live AI inference.
Thank you. Questions?
pradeeban.kathiravelu@emory.edu
linkedin.com/in/kpradeeban/

What's hot

OBA: An Ontology-Based Framework for Creating REST APIs for Knowledge Graphsdgarijo

What's New in CytoscapeKeiichiro Ono

SDCSB CYTOSCAPE AND NETWORK ANALYSIS WORKSHOP at Sanford ConsortiumKeiichiro Ono

Introduction to Biological Network Analysis and Visualization with Cytoscape ...Keiichiro Ono

A Controlled Crowdsourcing Approach for Practical Ontology Extensions and Met...dgarijo

Cytoscape and External Data Analysis ToolsKeiichiro Ono

IASSIST identifiers By Joan StarrCarly Strasser

What's hot (7)

OBA: An Ontology-Based Framework for Creating REST APIs for Knowledge Graphs

What's New in Cytoscape

SDCSB CYTOSCAPE AND NETWORK ANALYSIS WORKSHOP at Sanford Consortium

Introduction to Biological Network Analysis and Visualization with Cytoscape ...

A Controlled Crowdsourcing Approach for Practical Ontology Extensions and Met...

Cytoscape and External Data Analysis Tools

IASSIST identifiers By Joan Starr

Similar to Niffler: A DICOM Framework for Machine Learning and Processing Pipelines.

Leveraging Machine Learning Techniques Predictive Analytics for Knowledge Dis...Kevin Mader

01-pengantar.pdfssuseradaf5f

Automated Colony Counting - New Methods Can Save Researchers Time and CostScintica Instrumentation

Is424 g1 t9_proposal_slidesJing WANG

陸永祥/全球網路攝影機帶來的機會與挑戰台灣資料科學年會

Main principles of Data Science and Machine LearningNikolay Karelin

8th semester syllabus b sc csit-pawan kaflePAWAN KAFLE

Structure of DICOM Imageisha saxena

OpenStack in Action 4! Susheel Varma - VPH-Share: Patient-Centred Multi-scale...eNovance

Activity Monitoring Using Wearable Sensors and Smart PhoneDrAhmedZoha

Seminar nov2017Ahmed Youssef Ali Amer

Tsinghua University: Two Exemplary Applications in ChinaDataStax Academy

A Predictive Analytics Workflow on DICOM Images using Apache Spark with Anahi...Databricks

Bachelor-ThesisCristi Schuszter

Digital Document Preservation Simulation - Boston Python User's GroupMicah Altman

Machine Learning for Malware Classification and ClusteringAshwini Almad

Machine Learning for Malware Classification and ClusteringEndgameInc

Analytics Zoo: Building Analytics and AI Pipeline for Apache Spark and BigDL ...Databricks

2019 GDRR: Blockchain Data Analytics - QuTrack: Model Life Cycle Management f...The Statistical and Applied Mathematical Sciences Institute

lecture1.pptbayhehua

Similar to Niffler: A DICOM Framework for Machine Learning and Processing Pipelines. (20)

Leveraging Machine Learning Techniques Predictive Analytics for Knowledge Dis...

01-pengantar.pdf

Automated Colony Counting - New Methods Can Save Researchers Time and Cost

Is424 g1 t9_proposal_slides

陸永祥/全球網路攝影機帶來的機會與挑戰

Main principles of Data Science and Machine Learning

8th semester syllabus b sc csit-pawan kafle

Structure of DICOM Image

OpenStack in Action 4! Susheel Varma - VPH-Share: Patient-Centred Multi-scale...

Activity Monitoring Using Wearable Sensors and Smart Phone

Seminar nov2017

Tsinghua University: Two Exemplary Applications in China

A Predictive Analytics Workflow on DICOM Images using Apache Spark with Anahi...

Bachelor-Thesis

Digital Document Preservation Simulation - Boston Python User's Group

Machine Learning for Malware Classification and Clustering

Analytics Zoo: Building Analytics and AI Pipeline for Apache Spark and BigDL ...

2019 GDRR: Blockchain Data Analytics - QuTrack: Model Life Cycle Management f...

lecture1.ppt

Recently uploaded

Leading transformational change: inner and outer skillsHelenBevan4

Gurgaon Sector 90 Call Girls ( 9873940964 ) Book Hot And Sexy Girls In A Few ...ggsonu500

Call Girl Chandigarh Mallika ❤️🍑 9907093804 👄🫦 Independent Escort Service Cha...High Profile Call Girls Chandigarh Aarushi

Russian Call Girls in Raipur 9873940964 Book Hot And Sexy Girlsddev2574

Russian Escorts Aishbagh Road * 9548273370 Naughty Call Girls Service in Lucknowgragteena

Call Girls in Lucknow Esha 🔝 8923113531 🔝 🎶 Independent Escort Service LucknowCall Girls Lucknow Aaradhya Independent Escort Service Lucknow

Russian Call Girls in Goa Samaira 7001305949 Independent Escort Service Goanarwatsonia7

College Call Girls Hyderabad Sakshi 9907093804 Independent Escort Service Hyd...delhimodelshub1

Russian Call Girls in Hyderabad Ishita 9907093804 Independent Escort Service ...delhimodelshub1

Call Girls Kukatpally 7001305949 all area service COD available Any Timedelhimodelshub1

Russian Call Girls Hyderabad Saloni 9907093804 Independent Escort Service Hyd...delhimodelshub1

Russian Call Girls in Dehradun Komal 🔝 7001305949 🔝 📍 Independent Escort Serv...Call Girls Lucknow Aaradhya Independent Escort Service Lucknow

Call Girl Hyderabad Madhuri 9907093804 Independent Escort Service Hyderabaddelhimodelshub1

VIP Call Girl Sector 25 Gurgaon Just Call Me 9899900591adityaroy0215

Basics of Anatomy- Language of Anatomy.pptxAyush Gupta

Dehradun Call Girls Service ❤️🍑 9675010100 👄🫦Independent Escort Service DehradunNiamh verma

Low Rate Call Girls In Bommanahalli Just Call 7001305949ps5894268

Call Girls in Mohali Surbhi ❤️🍑 9907093804 👄🫦 Independent Escort Service MohaliHigh Profile Call Girls Chandigarh Aarushi

No Advance 9053900678 Chandigarh Call Girls , Indian Call Girls For Full Ni...Vip call girls In Chandigarh

Call Girls LB Nagar 7001305949 all area service COD available Any Timedelhimodelshub1

Recently uploaded (20)

Leading transformational change: inner and outer skills

Gurgaon Sector 90 Call Girls ( 9873940964 ) Book Hot And Sexy Girls In A Few ...

Call Girl Chandigarh Mallika ❤️🍑 9907093804 👄🫦 Independent Escort Service Cha...

Russian Call Girls in Raipur 9873940964 Book Hot And Sexy Girls

Russian Escorts Aishbagh Road * 9548273370 Naughty Call Girls Service in Lucknow

Call Girls in Lucknow Esha 🔝 8923113531 🔝 🎶 Independent Escort Service Lucknow

Russian Call Girls in Goa Samaira 7001305949 Independent Escort Service Goa

College Call Girls Hyderabad Sakshi 9907093804 Independent Escort Service Hyd...

Russian Call Girls in Hyderabad Ishita 9907093804 Independent Escort Service ...

Call Girls Kukatpally 7001305949 all area service COD available Any Time

Russian Call Girls Hyderabad Saloni 9907093804 Independent Escort Service Hyd...

Russian Call Girls in Dehradun Komal 🔝 7001305949 🔝 📍 Independent Escort Serv...

Call Girl Hyderabad Madhuri 9907093804 Independent Escort Service Hyderabad

VIP Call Girl Sector 25 Gurgaon Just Call Me 9899900591

Basics of Anatomy- Language of Anatomy.pptx

Dehradun Call Girls Service ❤️🍑 9675010100 👄🫦Independent Escort Service Dehradun

Low Rate Call Girls In Bommanahalli Just Call 7001305949

Call Girls in Mohali Surbhi ❤️🍑 9907093804 👄🫦 Independent Escort Service Mohali

No Advance 9053900678 Chandigarh Call Girls , Indian Call Girls For Full Ni...

Call Girls LB Nagar 7001305949 all area service COD available Any Time

Niffler: A DICOM Framework for Machine Learning and Processing Pipelines.

1. Niffler: A DICOM Framework for Machine Learning and Processing Pipelines. Pradeeban Kathiravelu, PhD Emory University, Atlanta, GA. 2021, October 6th

2. Introduction 2 • Real-time execution of machine learning (ML) pipelines on radiology images is hard • limited computing resources in clinical environments • running them in research clusters? • efficient data transfer • processing capabilities.

3. Our Proposal – Niffler 3 • An open-source ML framework: https://github.com/Emory-HITI/Niffler • DICOM Images: PACS ⇨ to process in Research Clusters. • Real-time and retrospectively on-demand • Extracts the textual metadata from DICOM headers • in real-time • stores in a scalable database. • Workflows and ML pipelines on images and metadata.

4. Niffler Modules 4 • Real-time DICOM extractions (a data stream). • On-demand DICOM extractions (simplified and expanded C-FIND, C-MOVE, and C-FIND+C-MOVE). • DICOM  PNG conversion. • Workflows module (Niffler service chaining). • RTA extraction (fetch data from RIS). • DICOM  NifTi conversion (experimental). • DICOM Anonymization (experimental).

5. Niffler On-demand queries 5 • Niffler provides a flexible one-command retrieval of several studies. • all at once at bulk, with pause-and-resume capabilities. • A CSV file with DICOM headers. • A config file indicating up to 3 DICOM C-FIND headers to query from. • Niffler internally issues several C-FIND and C-MOVE combinations for each line in the CSV file.

6. Sample configuration (json) 6 • { • "NifflerSystem": "system.json", • "StorageFolder": "/opt/data/new-study", • "FilePath": "{00100020}/{0020000D}/{0020000E}/{00080018}.dcm", • "CsvFile": "csv/empi.csv", • "NumberOfQueryAttributes": 1, • "FirstAttr": "PatientID", • "FirstIndex": 0, • "SecondAttr": "AccessionNumber", • "SecondIndex": 5, • "ThirdAttr": "StudyDate", • "ThirdIndex": 3, • "DateFormat": "%Y%m%d", • "SendEmail": true, • "YourEmail": "test@test.test" • }

7. Capabilities 7 • Flexible queries that were not possible or easy before. • “Retrieve all the CT Images of the female patients from August 2021” can be a one Niffler extraction based on StudyDate, Modality, and PatientSex.

8. Architecture and Prototype Deployment 8 • Queries to multiple PACS. • A Real-Time Listener and a Retrospective Extractor. • ML pipelines and analytics as containers.

9. Evaluation 9 • 715 Scanners • 350 GB/day • A few practical use cases • Real-time ML Pipelines on DICOM images. • IVC Filter detection. • Real-time processing of metadata. • Calculating Scanner utilization. • Scanner clock calibration.

10. Evaluation – Use case 1: IVC Filter Detection 10 • Pre-trained models. • Real-time Execution. • 96% accuracy.

11. Evaluation – Use case 2: Understanding Operational Efficiency of MRI Systems 11 • Based on calculated metrics from exam timestamps. • Evaluate against a clinical data warehouse (CDW). • Timestamps from CDW more prone to human errors. • False depiction of exam overlaps.

12. Conclusion 12 • Running the ML pipelines from a research cluster • Feasibility and efficiency • On images and their metadata • Received in real-time and retrospectively from PACS. Future work • Niffler + real-time clinical feed ⇨ Live AI inference. Thank you. Questions? pradeeban.kathiravelu@emory.edu linkedin.com/in/kpradeeban/

Editor's Notes

Hi everyone, I am Pradeeban Kathiravelu from Emory University. Today I present our research “A DICOM Framework for Machine Learning Pipelines against Real-Time Radiology Images”.
Real-time execution of machine learning pipelines on radiology images is challenging, due to limited computing resources in clinical environments. On the other hand, running them in research clusters requires efficient data transfer and processing capabilities.
We propose Niffler, an open-source ML framework that runs in research clusters by receiving images in real-time and retrospectively on-demand, using DICOM networking protocol from hospitals' PACS. Niffler extracts the metadata from the DICOM headers in real-time and stores the metadata in a scalable database. Niffler enables efficient execution of processing workflows and ML pipelines on images and their metadata. Niffler consists of DICOM listeners for receiving images in real-time, and retrospective DICOM extractors to query and retrieve images on-demand. It also consists of a Metadata Extractor that extracts the textual metadata from the retrieved DICOM images. Niffler stores the images in its storage and the metadata in a Metadata Store.
The Figure depicts the Niffler architecture and prototype deployment. In the standard healthcare system, the radiology department may consist of several PACS, each receiving radiology images from scanners of various modalities. Our current deployment environment consists of 2 PACS from our institutional radiology department, configured to accept DICOM retrieval queries from Niffler. Niffler can receive DICOM data from more PACS systems at once, with minor configuration changes to the PACS and Niffler. In the deployment at our institution, the primary PACS receives data in real-time from the scanners. The radiology department has configured an archival process that periodically copies the images from the primary PACS to a shadow PACS and then cleans up the primary PACS every week. Hence, the shadow PACS stores imaging data of several years, supporting retrospective queries. Niffler enables the execution of ML and real-time analytics pipelines as Docker containers on radiology images retrieved from the PACS and the textual metadata of the images. The Real-Time DICOM Listener receives images from the primary PACS continuously as a DICOM imaging stream. The Retrospective DICOM Extractor performs on-demand queries issued by the users on the retrospective data. Niffler executes multiple StoreSCP processes, one for each PACS. It stores the images from each PACS separately in an encrypted storage, in a hierarchical folder structure. The Metadata Extractor traverses and queries all the images in the storage, extracts the relevant metadata from the DICOM headers, and stores the PHI-free metadata in a NoSQL database, which we call the Metadata Store. The Application Layer facilitates access to the DICOM images from the storage and the respective metadata from the metadata store. Thus, it provides a unified data explorer access to both data and metadata. It further provides utility functions such as de-identification, image conversion, and scripts for scanner utilization computation and scanner clock calibration. The ML pipelines run either directly or via the application layer, on the images and the metadata. Niffler deletes the images from its storage periodically once the metadata extraction and the ML pipelines on the images complete their execution. Subsets of images relevant for a study can be shared with the other researchers, typically after processing them, including de-identifying images, converting DICOM images into PNG, or the image output and associated ML inference result. Since the framework supports queries to extract images meeting specific criteria, it limits the amount of information stored in the research clusters that are duplicated. Without Niffler, a researcher would have to submit multiple queries to the PACS and a clinical data warehouse -- a CDW, work on anonymizing the data collected, merge the data, and then run the model inference. Niffler supports prospective dynamic cohort and subcohort creation, eliminating the need for duplicate data storage and aggregation, with anonymized model output. Through its native support for the ML pipeline execution as containers, Niffler provides an infrastructure-agnostic execution with seamless scaling and migration. Thus, Niffler minimizes the repetitive and complicated configuration steps while automating the end-to-end process of an ML pipeline.
Niffler retrieved data from 715 scanners, up to 350 GB/day continuously over the past 19 months. We look into a few practical use cases that highlight the performance of Niffler. First, real-time ML pipelines on DICOM images in a research cluster. Second, real-time processing of metadata for operational efficiency, such as computing scanner utilization and calibrating scanner clock.
First, we built an IVC filter detection pipeline as a container to execute on the images retrieved in real-time with Niffler. The pipeline uses the Keras RetinaNet object detection and pre-trained models to determine whether an IVC filter is detected in the subcategories of the images. The Niffler Metadata Extractor applies the filters on modality and body parts to create a DICOM subset. The IVC Filter detection container runs its inference on the identified images, including chest Xray, abdomen radiographs, and Spine Xrays. The pipeline draws a bounding box around the filter and outputs a PNG image with the detection box, as the Figure shows. The IVC filter detection algorithm classified the test images with high accuracy of 96% on the images retrieved in real-time with Niffler.
As the second use case, we applied the Niffler Metadata Extractor to understand the operational efficiency of individual MRI systems, based on calculated metrics from exam timestamps. These metadata allow measurement of exam duration and system idle time. The figure indicates the calculated exam time windows from one scanner on a particular day, according to Niffler and CDW. Identifying scanner utilization from CDW is more prone to human errors, leading to a false depiction of exam overlap. Niffler accurately identified examination timeframes and idling times of the scanners.
We observe that our evaluations on Niffler highlight the feasibility and efficiency in running the ML pipelines from a research cluster on the images and metadata received in real-time and retrospectively from PACS. By merging Niffler with a real-time HL7 clinical feed, we can create a live AI inference pipeline that accelerates the development of clinically useful AI algorithms. To our knowledge, this will be the first AI inference pipeline that combines real-time image and clinical data information during AI validation. Thank you for your attention. Please feel free to reach out to me if you have questions or comments.

Niffler: A DICOM Framework for Machine Learning and Processing Pipelines.

Recommended

Recommended

More Related Content

What's hot

What's hot (7)

Similar to Niffler: A DICOM Framework for Machine Learning and Processing Pipelines.

Similar to Niffler: A DICOM Framework for Machine Learning and Processing Pipelines. (20)

More from Pradeeban Kathiravelu, Ph.D.

More from Pradeeban Kathiravelu, Ph.D. (20)

Recently uploaded

Recently uploaded (20)

Niffler: A DICOM Framework for Machine Learning and Processing Pipelines.

Editor's Notes