SlideShare a Scribd company logo
1 of 32
Download to read offline
Automatic Radiology Report
Generation using Deep Learning
Presented by:
Chandresh Kumar Maurya
Assistant professor at CSE, IIT Indore
https://chandreshiit.github.io/about/
Outline of the Talk
1. Introduction
2. Motivation
3. The problem
4. Solution Approach
5. Results
6. Discussion
7. Future direction
Introduction
● Maintaining digital record of patient's history is crucial for diagnosing the
problem.
● Traditional way: carry hard copies of reports and prescription such as x-rays,
CT scan and related reports
● The Problem: losing reports, forgetting to carry it is quite common.
● The Solution: Past years have seen rapid growth in the volume of the digital
health record also called Electronic Health Record (EHR).
● Previously, it was used mostly for administrative tasks such as billing,
insurance etc.
● However, researchers have now realized that EHR has more potential than
just billing etc.
Tasks on EHR data
Shickel, B., Tighe, P. J., Bihorac, A., & Rashidi, P. (2017). Deep EHR: a survey of recent advances in deep learning techniques for electronic health
record (EHR) analysis. IEEE journal of biomedical and health informatics, 22(5), 1589-1604.
Methods on EHR data
Tasks on EHR data
Clinical Information Extraction
Representation learning
Patient Outcome Prediction
Radiology Report Generation [2]
● Radiology report generation is automatic generation of report through the use
of machine learning techniques.
● This offers potential to accelerate the report generation process which is
time-consuming, repetitive, and error-prone.
● Current techniques suffer from incomplete and inconsistent report generation
problem.
It is incomplete
since it neglects a
critical observation
of right pleural
effusion for bilateral
pleural effusions
It is also inconsistent
since atelectasis is
seen in left lung base
along with right
pleural effusion.
it includes pulmonary
edema which is not
present in the image.
Highlights of the Paper
● The author present two new metrics for image-to-text radiology report
generation, which focus on evaluating the factual completeness and
consistency of generated reports, and a weak supervision-based approach for
training a radiology-domain NLI model that realizes the metrics.
● They present a new radiology report generation model that directly optimizes
the two new metrics with RL, and show its improved performance against
existing models on two publicly available datasets.
Image-to-Text Radiology Report Generation with
Meshed-Memory Transformer
● Given K individual images x1...K of a patient, our task involves generating a
sequence of words to form a textual report yˆ, which describes the clinical
observations in the images.
Image-to-Text Radiology Report Generation with
Meshed-Memory Transformer
● Given K individual images x1...K of a patient, the task involves generating a
sequence of words to form a textual report yˆ, which describes the clinical
observations in the images.
● This task has close resemblance to the image captioning task, with the
difference that the input involves multiple images and the generated
sequences are usually longer in the task.
Image-to-Text Radiology Report Generation with
Meshed-Memory Transformer
Meshed Memory Transformer
● Given an image x, image regions are first extracted with a CNN as X =
CNN(x).
● X is then encoded with a memory-augmented attention process Mmem
(X) as
where Wq, Wk, Wv are weights, Mk,Mv
are memory matrices, d is a scaling
factor, and [∗; ∗] is concatenation
operation.
Attention in Transformer
Source: http://jalammar.github.io/illustrated-transformer/
Attention in Transformer- Matrix way
Attention in Transformer- Matrix way
Attention in Transformer- Matrix way
The Transformer
Why Attention?
Source: see ref [3]
Back to Meshed-Memory Transformer
Where is element-wise multiplication, maxK is
maxpooling over K images, σ is sigmoid function,
Wn is a weight, and bn is a bias. The weighted
summation in Mmesh(X˜ N,K,Y¨ ) exploits both
low-level and high-level information from the N
stacked encoder.
Optimization with Factual Completeness and
Consistency
Exact Entity Match Score : designed to measure factual completeness. A named
entity recognizer is applied against yˆ and the corresponding reference report y.
Given entities Egen and Eref recognized from ygen and yref respectively,
precision (pr) and recall (rc) of entity match are calculated as
The harmonic mean of precision and recall is taken as factENT
Contd...
factENTNLI Entailing Entity Match Score: an F-score style metric that expand
factENT with NLI to evaluate factual consistency. δ is expanded to
The Loss function
A loss L is minimized as the negative expectation of the reward r and its gradient
is estimated with a single Monte Carlo sample as
where yˆsp is a sampled text and yˆgd is a greedy decoded text. They combine a
factual metric loss with a language model loss and an NLG loss as
Results
Dataset: MIMIC-CXR dataset
References
● Shickel, B., Tighe, P. J., Bihorac, A., & Rashidi, P. (2017). Deep EHR: a survey of recent advances in deep learning
techniques for electronic health record (EHR) analysis. IEEE journal of biomedical and health informatics, 22(5),
1589-1604.
● Miura, Y., Zhang, Y., Langlotz, C. P., & Jurafsky, D. (2020). Improving Factual Completeness and Consistency of
Image-to-Text Radiology Report Generation. arXiv preprint arXiv:2010.10042.
● Guan, Qingji, et al. “Diagnose like a radiologist: Attention guided convolutional neural network for thorax disease
classification.” arXiv preprint arXiv:1801.09927 (2018).

More Related Content

Similar to Automatic Radiology Report Generation.pdf

COMPARISON OF CHANNEL ESTIMATION AND EQUALIZATION TECHNIQUES FOR OFDM SYSTEMS
COMPARISON OF CHANNEL ESTIMATION AND EQUALIZATION TECHNIQUES FOR OFDM SYSTEMSCOMPARISON OF CHANNEL ESTIMATION AND EQUALIZATION TECHNIQUES FOR OFDM SYSTEMS
COMPARISON OF CHANNEL ESTIMATION AND EQUALIZATION TECHNIQUES FOR OFDM SYSTEMScsijjournal
 
IEEE Bio medical engineering 2016 Title and Abstract
IEEE Bio medical engineering  2016 Title and Abstract IEEE Bio medical engineering  2016 Title and Abstract
IEEE Bio medical engineering 2016 Title and Abstract tsysglobalsolutions
 
Mr image compression based on selection of mother wavelet and lifting based w...
Mr image compression based on selection of mother wavelet and lifting based w...Mr image compression based on selection of mother wavelet and lifting based w...
Mr image compression based on selection of mother wavelet and lifting based w...ijma
 
MICCS: A Novel Framework for Medical Image Compression Using Compressive Sens...
MICCS: A Novel Framework for Medical Image Compression Using Compressive Sens...MICCS: A Novel Framework for Medical Image Compression Using Compressive Sens...
MICCS: A Novel Framework for Medical Image Compression Using Compressive Sens...IJECEIAES
 
Predicting an Applicant Status Using Principal Component, Discriminant and Lo...
Predicting an Applicant Status Using Principal Component, Discriminant and Lo...Predicting an Applicant Status Using Principal Component, Discriminant and Lo...
Predicting an Applicant Status Using Principal Component, Discriminant and Lo...inventionjournals
 
Biomedical Image Retrieval using LBWP
Biomedical Image Retrieval using LBWPBiomedical Image Retrieval using LBWP
Biomedical Image Retrieval using LBWPIRJET Journal
 
Mutual Information for Registration of Monomodal Brain Images using Modified ...
Mutual Information for Registration of Monomodal Brain Images using Modified ...Mutual Information for Registration of Monomodal Brain Images using Modified ...
Mutual Information for Registration of Monomodal Brain Images using Modified ...IDES Editor
 
A framework for mining signatures from event sequences and its applications i...
A framework for mining signatures from event sequences and its applications i...A framework for mining signatures from event sequences and its applications i...
A framework for mining signatures from event sequences and its applications i...JPINFOTECH JAYAPRAKASH
 
Gesture Recognition using Principle Component Analysis & Viola-Jones Algorithm
Gesture Recognition using Principle Component Analysis &  Viola-Jones AlgorithmGesture Recognition using Principle Component Analysis &  Viola-Jones Algorithm
Gesture Recognition using Principle Component Analysis & Viola-Jones AlgorithmIJMER
 
Towards reproducibility and maximally-open data
Towards reproducibility and maximally-open dataTowards reproducibility and maximally-open data
Towards reproducibility and maximally-open dataPablo Bernabeu
 
Conference_paper.pdf
Conference_paper.pdfConference_paper.pdf
Conference_paper.pdfNarenRajVivek
 
Top Cited Articles in Signal & Image Processing 2021-2022
Top Cited Articles in Signal & Image Processing 2021-2022Top Cited Articles in Signal & Image Processing 2021-2022
Top Cited Articles in Signal & Image Processing 2021-2022sipij
 
Vol 13 No 1 - May 2014
Vol 13 No 1 - May 2014Vol 13 No 1 - May 2014
Vol 13 No 1 - May 2014ijcsbi
 
Frontiers of Computational Journalism week 1 - Introduction and High Dimensio...
Frontiers of Computational Journalism week 1 - Introduction and High Dimensio...Frontiers of Computational Journalism week 1 - Introduction and High Dimensio...
Frontiers of Computational Journalism week 1 - Introduction and High Dimensio...Jonathan Stray
 
NUMBER PLATE IMAGE DETECTION FOR FAST MOTION VEHICLES USING BLUR KERNEL ESTIM...
NUMBER PLATE IMAGE DETECTION FOR FAST MOTION VEHICLES USING BLUR KERNEL ESTIM...NUMBER PLATE IMAGE DETECTION FOR FAST MOTION VEHICLES USING BLUR KERNEL ESTIM...
NUMBER PLATE IMAGE DETECTION FOR FAST MOTION VEHICLES USING BLUR KERNEL ESTIM...paperpublications3
 

Similar to Automatic Radiology Report Generation.pdf (20)

COMPARISON OF CHANNEL ESTIMATION AND EQUALIZATION TECHNIQUES FOR OFDM SYSTEMS
COMPARISON OF CHANNEL ESTIMATION AND EQUALIZATION TECHNIQUES FOR OFDM SYSTEMSCOMPARISON OF CHANNEL ESTIMATION AND EQUALIZATION TECHNIQUES FOR OFDM SYSTEMS
COMPARISON OF CHANNEL ESTIMATION AND EQUALIZATION TECHNIQUES FOR OFDM SYSTEMS
 
IEEE Bio medical engineering 2016 Title and Abstract
IEEE Bio medical engineering  2016 Title and Abstract IEEE Bio medical engineering  2016 Title and Abstract
IEEE Bio medical engineering 2016 Title and Abstract
 
Mr image compression based on selection of mother wavelet and lifting based w...
Mr image compression based on selection of mother wavelet and lifting based w...Mr image compression based on selection of mother wavelet and lifting based w...
Mr image compression based on selection of mother wavelet and lifting based w...
 
MICCS: A Novel Framework for Medical Image Compression Using Compressive Sens...
MICCS: A Novel Framework for Medical Image Compression Using Compressive Sens...MICCS: A Novel Framework for Medical Image Compression Using Compressive Sens...
MICCS: A Novel Framework for Medical Image Compression Using Compressive Sens...
 
Predicting an Applicant Status Using Principal Component, Discriminant and Lo...
Predicting an Applicant Status Using Principal Component, Discriminant and Lo...Predicting an Applicant Status Using Principal Component, Discriminant and Lo...
Predicting an Applicant Status Using Principal Component, Discriminant and Lo...
 
Madhavi
MadhaviMadhavi
Madhavi
 
Biomedical Image Retrieval using LBWP
Biomedical Image Retrieval using LBWPBiomedical Image Retrieval using LBWP
Biomedical Image Retrieval using LBWP
 
Ijecet 06 10_002
Ijecet 06 10_002Ijecet 06 10_002
Ijecet 06 10_002
 
Ijecet 06 10_002
Ijecet 06 10_002Ijecet 06 10_002
Ijecet 06 10_002
 
Image Fusion
Image FusionImage Fusion
Image Fusion
 
Mutual Information for Registration of Monomodal Brain Images using Modified ...
Mutual Information for Registration of Monomodal Brain Images using Modified ...Mutual Information for Registration of Monomodal Brain Images using Modified ...
Mutual Information for Registration of Monomodal Brain Images using Modified ...
 
A framework for mining signatures from event sequences and its applications i...
A framework for mining signatures from event sequences and its applications i...A framework for mining signatures from event sequences and its applications i...
A framework for mining signatures from event sequences and its applications i...
 
Gesture Recognition using Principle Component Analysis & Viola-Jones Algorithm
Gesture Recognition using Principle Component Analysis &  Viola-Jones AlgorithmGesture Recognition using Principle Component Analysis &  Viola-Jones Algorithm
Gesture Recognition using Principle Component Analysis & Viola-Jones Algorithm
 
Towards reproducibility and maximally-open data
Towards reproducibility and maximally-open dataTowards reproducibility and maximally-open data
Towards reproducibility and maximally-open data
 
Conference_paper.pdf
Conference_paper.pdfConference_paper.pdf
Conference_paper.pdf
 
Top Cited Articles in Signal & Image Processing 2021-2022
Top Cited Articles in Signal & Image Processing 2021-2022Top Cited Articles in Signal & Image Processing 2021-2022
Top Cited Articles in Signal & Image Processing 2021-2022
 
Vol 13 No 1 - May 2014
Vol 13 No 1 - May 2014Vol 13 No 1 - May 2014
Vol 13 No 1 - May 2014
 
1.pdf
1.pdf1.pdf
1.pdf
 
Frontiers of Computational Journalism week 1 - Introduction and High Dimensio...
Frontiers of Computational Journalism week 1 - Introduction and High Dimensio...Frontiers of Computational Journalism week 1 - Introduction and High Dimensio...
Frontiers of Computational Journalism week 1 - Introduction and High Dimensio...
 
NUMBER PLATE IMAGE DETECTION FOR FAST MOTION VEHICLES USING BLUR KERNEL ESTIM...
NUMBER PLATE IMAGE DETECTION FOR FAST MOTION VEHICLES USING BLUR KERNEL ESTIM...NUMBER PLATE IMAGE DETECTION FOR FAST MOTION VEHICLES USING BLUR KERNEL ESTIM...
NUMBER PLATE IMAGE DETECTION FOR FAST MOTION VEHICLES USING BLUR KERNEL ESTIM...
 

Recently uploaded

From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessPixlogix Infotech
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 

Recently uploaded (20)

From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 

Automatic Radiology Report Generation.pdf

  • 1. Automatic Radiology Report Generation using Deep Learning Presented by: Chandresh Kumar Maurya Assistant professor at CSE, IIT Indore https://chandreshiit.github.io/about/
  • 2. Outline of the Talk 1. Introduction 2. Motivation 3. The problem 4. Solution Approach 5. Results 6. Discussion 7. Future direction
  • 3. Introduction ● Maintaining digital record of patient's history is crucial for diagnosing the problem. ● Traditional way: carry hard copies of reports and prescription such as x-rays, CT scan and related reports ● The Problem: losing reports, forgetting to carry it is quite common. ● The Solution: Past years have seen rapid growth in the volume of the digital health record also called Electronic Health Record (EHR). ● Previously, it was used mostly for administrative tasks such as billing, insurance etc. ● However, researchers have now realized that EHR has more potential than just billing etc.
  • 4. Tasks on EHR data Shickel, B., Tighe, P. J., Bihorac, A., & Rashidi, P. (2017). Deep EHR: a survey of recent advances in deep learning techniques for electronic health record (EHR) analysis. IEEE journal of biomedical and health informatics, 22(5), 1589-1604.
  • 10. Radiology Report Generation [2] ● Radiology report generation is automatic generation of report through the use of machine learning techniques. ● This offers potential to accelerate the report generation process which is time-consuming, repetitive, and error-prone. ● Current techniques suffer from incomplete and inconsistent report generation problem.
  • 11.
  • 12. It is incomplete since it neglects a critical observation of right pleural effusion for bilateral pleural effusions
  • 13. It is also inconsistent since atelectasis is seen in left lung base along with right pleural effusion.
  • 14. it includes pulmonary edema which is not present in the image.
  • 15. Highlights of the Paper ● The author present two new metrics for image-to-text radiology report generation, which focus on evaluating the factual completeness and consistency of generated reports, and a weak supervision-based approach for training a radiology-domain NLI model that realizes the metrics. ● They present a new radiology report generation model that directly optimizes the two new metrics with RL, and show its improved performance against existing models on two publicly available datasets.
  • 16. Image-to-Text Radiology Report Generation with Meshed-Memory Transformer ● Given K individual images x1...K of a patient, our task involves generating a sequence of words to form a textual report yˆ, which describes the clinical observations in the images.
  • 17. Image-to-Text Radiology Report Generation with Meshed-Memory Transformer ● Given K individual images x1...K of a patient, the task involves generating a sequence of words to form a textual report yˆ, which describes the clinical observations in the images. ● This task has close resemblance to the image captioning task, with the difference that the input involves multiple images and the generated sequences are usually longer in the task.
  • 18. Image-to-Text Radiology Report Generation with Meshed-Memory Transformer
  • 19. Meshed Memory Transformer ● Given an image x, image regions are first extracted with a CNN as X = CNN(x). ● X is then encoded with a memory-augmented attention process Mmem (X) as where Wq, Wk, Wv are weights, Mk,Mv are memory matrices, d is a scaling factor, and [∗; ∗] is concatenation operation.
  • 20. Attention in Transformer Source: http://jalammar.github.io/illustrated-transformer/
  • 26. Back to Meshed-Memory Transformer Where is element-wise multiplication, maxK is maxpooling over K images, σ is sigmoid function, Wn is a weight, and bn is a bias. The weighted summation in Mmesh(X˜ N,K,Y¨ ) exploits both low-level and high-level information from the N stacked encoder.
  • 27. Optimization with Factual Completeness and Consistency Exact Entity Match Score : designed to measure factual completeness. A named entity recognizer is applied against yˆ and the corresponding reference report y. Given entities Egen and Eref recognized from ygen and yref respectively, precision (pr) and recall (rc) of entity match are calculated as The harmonic mean of precision and recall is taken as factENT
  • 28. Contd... factENTNLI Entailing Entity Match Score: an F-score style metric that expand factENT with NLI to evaluate factual consistency. δ is expanded to
  • 29. The Loss function A loss L is minimized as the negative expectation of the reward r and its gradient is estimated with a single Monte Carlo sample as where yˆsp is a sampled text and yˆgd is a greedy decoded text. They combine a factual metric loss with a language model loss and an NLG loss as
  • 31.
  • 32. References ● Shickel, B., Tighe, P. J., Bihorac, A., & Rashidi, P. (2017). Deep EHR: a survey of recent advances in deep learning techniques for electronic health record (EHR) analysis. IEEE journal of biomedical and health informatics, 22(5), 1589-1604. ● Miura, Y., Zhang, Y., Langlotz, C. P., & Jurafsky, D. (2020). Improving Factual Completeness and Consistency of Image-to-Text Radiology Report Generation. arXiv preprint arXiv:2010.10042. ● Guan, Qingji, et al. “Diagnose like a radiologist: Attention guided convolutional neural network for thorax disease classification.” arXiv preprint arXiv:1801.09927 (2018).