SlideShare a Scribd company logo
1 of 16
Download to read offline
Institute of
                                   Information Systems




  Separating compound figures in journal
articles to allow for subfigure classification

                           Ajad Chhatkuli
            Antonio Foncubierta-Rodríguez
                        Dimitrios Markonis
                           Henning Müller
Motivation                                 Institute of
                                           Information Systems

•   Figures in biomedical journals contain a lot of
    information
•   CBIR has been proposed for accessing medical
    literature
•   Modality classification
    •   Improves accessibility
    •   Allows result filtering
    •   But 50% of figures are compound or multipanel
Aim                                               Institute of
                                                  Information Systems

•   Develop a system that separates compound figures
    in the biomedical literature
    •       Visual-information only
        •     Textual information is discarded
    •       Modality-independent
        •     One method for many images types
        •     Many methods for few images types
    •       Tunable according to the dataset
•   Large-scale tested
    •       Approximately 250 open access journals
Compound figure examples   Institute of
                           Information Systems
Methods. Dataset                       Institute of
                                       Information Systems

•   2982 manually classified figures from ImageCLEF
    2012 dataset
•   Ground truth:
    •   Image subclass: 2x1,1x2,
    •   Position of separators
Methods. Overview                                 Institute of
                                                  Information Systems

•   Problem is separated in two
    •       Find subfigure separator candidates
        •     Preprocessing if required
    •       Analyze candidates
        •     Remove false positives
        •     Rule-based decisions
Methods. Separator detection           Institute of
                                       Information Systems

 •       Based on minimum
         pixel projection for
         white-space separated
         figures
 •       Horizontal  Vertical
         detection
     •     Inverse order by rotation
           according to aspect ratio
     •     Recursive
Methods. Separator detection                       Institute of
                                                   Information Systems

 •       Rule-based processing
     •     Progressive truncation to remove labels if no
           separators are found
     •     Text removal based on connected commponents if no
           separators are found
     •     Complement image for black-space separations
     •     Standard deviation image for subtle separations
     •     Binarization of non-graph figures:
           •   Less than 40% of the image is white or almost white
Methods. Separator analysis                     Institute of
                                                Information Systems

 •       Classification problem
     •     True/false separator
 •       Features used:
     •     Closeness to border, division ratio, standard
           deviation, text removal analysis, histogram, gap
           comparison
 •       Classifiers:
     •     SVM
     •     Rule-based classifier
Results   Institute of
          Information Systems
Successful examples   Institute of
                      Information Systems
Successful examples   Institute of
                      Information Systems
Unsuccessful examples           Institute of
                                Information Systems

                    Not horizontal/vertical
No separation gap   separation
Conclusions future work                         Institute of
                                                Information Systems

 •       Good results for a wide range of images
 •       Using purely visual information
 •       Separation problem: detection and analysis
 •       Rule weights can be fine-tuned according to dataset
     •     What would be the impact of a larger training set?
     •     What would be the impact in existing modality
           classification accuracy?
Conclusions future work                         Institute of
                                                Information Systems

 •       Good results for a wide range of images
 •       Using purely visual information
 •       Separation problem: detection and analysis
 •       Rule weights can be fine-tuned according to dataset
     •     What would be the impact of a larger training set?
     •     What would be the impact in existing modality
           classification accuracy?
Institute of
                                                                     Information Systems




  Thanks for your attention!

                        More information at http://medgift.hevs.ch



Ajad Chhatkuli, Dimitrios Markonis, Antonio Foncubierta-Rodríguez, Fabrice Meriaudeau
and Henning Müller, Separating compound figures in journal articles to allow for subfigure
          classification, in: SPIE, Medical Imaging, Orlando, FL, USA, 2013

More Related Content

What's hot

An Efficient Brain Tumor Detection Algorithm based on Segmentation for MRI Sy...
An Efficient Brain Tumor Detection Algorithm based on Segmentation for MRI Sy...An Efficient Brain Tumor Detection Algorithm based on Segmentation for MRI Sy...
An Efficient Brain Tumor Detection Algorithm based on Segmentation for MRI Sy...ijtsrd
 
IRJET - A Review on Identification and Disease Detection in Plants using Mach...
IRJET - A Review on Identification and Disease Detection in Plants using Mach...IRJET - A Review on Identification and Disease Detection in Plants using Mach...
IRJET - A Review on Identification and Disease Detection in Plants using Mach...IRJET Journal
 
IRJET- Nail based Disease Analysis at Earlier Stage using Median Filter i...
IRJET-  	  Nail based Disease Analysis at Earlier Stage using Median Filter i...IRJET-  	  Nail based Disease Analysis at Earlier Stage using Median Filter i...
IRJET- Nail based Disease Analysis at Earlier Stage using Median Filter i...IRJET Journal
 
IRJET- Supervised Learning Approach for Flower Images using Color, Shape and ...
IRJET- Supervised Learning Approach for Flower Images using Color, Shape and ...IRJET- Supervised Learning Approach for Flower Images using Color, Shape and ...
IRJET- Supervised Learning Approach for Flower Images using Color, Shape and ...IRJET Journal
 
Skin Lesion Classification using Supervised Algorithm in Data Mining
Skin Lesion Classification using Supervised Algorithm in Data MiningSkin Lesion Classification using Supervised Algorithm in Data Mining
Skin Lesion Classification using Supervised Algorithm in Data Miningijtsrd
 
Identifying brain tumour from mri image using modified fcm and support
Identifying brain tumour from mri image using modified fcm and supportIdentifying brain tumour from mri image using modified fcm and support
Identifying brain tumour from mri image using modified fcm and supportIAEME Publication
 
IRJET- Retinal Fundus Image Segmentation using Watershed Algorithm
IRJET-  	  Retinal Fundus Image Segmentation using Watershed AlgorithmIRJET-  	  Retinal Fundus Image Segmentation using Watershed Algorithm
IRJET- Retinal Fundus Image Segmentation using Watershed AlgorithmIRJET Journal
 
thesis_Jerzy_Zielinski_2012-08-27
thesis_Jerzy_Zielinski_2012-08-27thesis_Jerzy_Zielinski_2012-08-27
thesis_Jerzy_Zielinski_2012-08-27Jerzy Zielinski
 
Paper id 25201482
Paper id 25201482Paper id 25201482
Paper id 25201482IJRAT
 
A survey early detection of
A survey early detection ofA survey early detection of
A survey early detection ofijcsa
 

What's hot (12)

An Efficient Brain Tumor Detection Algorithm based on Segmentation for MRI Sy...
An Efficient Brain Tumor Detection Algorithm based on Segmentation for MRI Sy...An Efficient Brain Tumor Detection Algorithm based on Segmentation for MRI Sy...
An Efficient Brain Tumor Detection Algorithm based on Segmentation for MRI Sy...
 
IRJET - A Review on Identification and Disease Detection in Plants using Mach...
IRJET - A Review on Identification and Disease Detection in Plants using Mach...IRJET - A Review on Identification and Disease Detection in Plants using Mach...
IRJET - A Review on Identification and Disease Detection in Plants using Mach...
 
Breast legion
Breast legionBreast legion
Breast legion
 
IRJET- Nail based Disease Analysis at Earlier Stage using Median Filter i...
IRJET-  	  Nail based Disease Analysis at Earlier Stage using Median Filter i...IRJET-  	  Nail based Disease Analysis at Earlier Stage using Median Filter i...
IRJET- Nail based Disease Analysis at Earlier Stage using Median Filter i...
 
IRJET- Supervised Learning Approach for Flower Images using Color, Shape and ...
IRJET- Supervised Learning Approach for Flower Images using Color, Shape and ...IRJET- Supervised Learning Approach for Flower Images using Color, Shape and ...
IRJET- Supervised Learning Approach for Flower Images using Color, Shape and ...
 
Skin Lesion Classification using Supervised Algorithm in Data Mining
Skin Lesion Classification using Supervised Algorithm in Data MiningSkin Lesion Classification using Supervised Algorithm in Data Mining
Skin Lesion Classification using Supervised Algorithm in Data Mining
 
Identifying brain tumour from mri image using modified fcm and support
Identifying brain tumour from mri image using modified fcm and supportIdentifying brain tumour from mri image using modified fcm and support
Identifying brain tumour from mri image using modified fcm and support
 
IRJET- Retinal Fundus Image Segmentation using Watershed Algorithm
IRJET-  	  Retinal Fundus Image Segmentation using Watershed AlgorithmIRJET-  	  Retinal Fundus Image Segmentation using Watershed Algorithm
IRJET- Retinal Fundus Image Segmentation using Watershed Algorithm
 
C1103041623
C1103041623C1103041623
C1103041623
 
thesis_Jerzy_Zielinski_2012-08-27
thesis_Jerzy_Zielinski_2012-08-27thesis_Jerzy_Zielinski_2012-08-27
thesis_Jerzy_Zielinski_2012-08-27
 
Paper id 25201482
Paper id 25201482Paper id 25201482
Paper id 25201482
 
A survey early detection of
A survey early detection ofA survey early detection of
A survey early detection of
 

Similar to Separating compound figures in journal articles to allow for subfigure classification

Artificial intelligence Pattern recognition system
Artificial intelligence Pattern recognition systemArtificial intelligence Pattern recognition system
Artificial intelligence Pattern recognition systemREHMAT ULLAH
 
Application of machine learning in industrial applications
Application of machine learning in industrial applicationsApplication of machine learning in industrial applications
Application of machine learning in industrial applicationsAnish Das
 
7-1 ARTIFICIAL INTELLIGENCE IN PATHOLOGY semiar 2.pptx
7-1 ARTIFICIAL INTELLIGENCE IN PATHOLOGY  semiar 2.pptx7-1 ARTIFICIAL INTELLIGENCE IN PATHOLOGY  semiar 2.pptx
7-1 ARTIFICIAL INTELLIGENCE IN PATHOLOGY semiar 2.pptxHarishankarSharma27
 
Advanced Technologies in Medicine and Allied.ppt
Advanced Technologies in Medicine and Allied.pptAdvanced Technologies in Medicine and Allied.ppt
Advanced Technologies in Medicine and Allied.pptWAQARULHASSAN48
 
nnU-Net: a self-configuring method for deep learning-based biomedical image s...
nnU-Net: a self-configuring method for deep learning-based biomedical image s...nnU-Net: a self-configuring method for deep learning-based biomedical image s...
nnU-Net: a self-configuring method for deep learning-based biomedical image s...ivaderivader
 
AI IN PATH final PPT.pptx
AI IN PATH final PPT.pptxAI IN PATH final PPT.pptx
AI IN PATH final PPT.pptxDivyaGaurav4
 
Developing high content image analysis software for biologists
Developing high content image analysis software for biologistsDeveloping high content image analysis software for biologists
Developing high content image analysis software for biologistsClaire McQuin
 
Practical aspects of medical image ai for hospital (IRB course)
Practical aspects of medical image ai for hospital (IRB course)Practical aspects of medical image ai for hospital (IRB course)
Practical aspects of medical image ai for hospital (IRB course)Sean Yu
 
Face recognition: A Comparison of Appearance Based Approaches
Face recognition: A Comparison of Appearance Based ApproachesFace recognition: A Comparison of Appearance Based Approaches
Face recognition: A Comparison of Appearance Based Approachessadique_ghitm
 
Anomaly Detection - Real World Scenarios, Approaches and Live Implementation
Anomaly Detection - Real World Scenarios, Approaches and Live ImplementationAnomaly Detection - Real World Scenarios, Approaches and Live Implementation
Anomaly Detection - Real World Scenarios, Approaches and Live ImplementationImpetus Technologies
 
Supervised Learning
Supervised LearningSupervised Learning
Supervised LearningFEG
 
Wearable Computing - Part IV: Ensemble classifiers & Insight into ongoing res...
Wearable Computing - Part IV: Ensemble classifiers & Insight into ongoing res...Wearable Computing - Part IV: Ensemble classifiers & Insight into ongoing res...
Wearable Computing - Part IV: Ensemble classifiers & Insight into ongoing res...Daniel Roggen
 
Basic image analysis(processing and classification) and visualization using m...
Basic image analysis(processing and classification) and visualization using m...Basic image analysis(processing and classification) and visualization using m...
Basic image analysis(processing and classification) and visualization using m...Vishwas N
 
Fcv core liu
Fcv core liuFcv core liu
Fcv core liuzukun
 
ITS 832 CHAPTER 5FROM BUILDING A MODEL TO ADAPTIVE ROBUST DE.docx
ITS 832 CHAPTER 5FROM BUILDING A MODEL TO ADAPTIVE ROBUST DE.docxITS 832 CHAPTER 5FROM BUILDING A MODEL TO ADAPTIVE ROBUST DE.docx
ITS 832 CHAPTER 5FROM BUILDING A MODEL TO ADAPTIVE ROBUST DE.docxvrickens
 
Machine Learning in Pathology Diagnostics with Simagis Live
Machine Learning in Pathology Diagnostics with Simagis LiveMachine Learning in Pathology Diagnostics with Simagis Live
Machine Learning in Pathology Diagnostics with Simagis Livekhvatkov
 
Diagnosis Support by Machine Learning Using Posturography Data
Diagnosis Support by Machine Learning Using Posturography DataDiagnosis Support by Machine Learning Using Posturography Data
Diagnosis Support by Machine Learning Using Posturography DataTeruKamogashira
 
LAS - System Biology Lesson
LAS - System Biology LessonLAS - System Biology Lesson
LAS - System Biology LessonLASircc
 
Semantic enrichment and similarity approximation for biomedical sequence images
Semantic enrichment and similarity approximation for biomedical sequence imagesSemantic enrichment and similarity approximation for biomedical sequence images
Semantic enrichment and similarity approximation for biomedical sequence imagesSyed Ahmad Chan Bukhari, PhD
 

Similar to Separating compound figures in journal articles to allow for subfigure classification (20)

Region-based volumetric medical image retrieval
Region-based volumetric medical image retrievalRegion-based volumetric medical image retrieval
Region-based volumetric medical image retrieval
 
Artificial intelligence Pattern recognition system
Artificial intelligence Pattern recognition systemArtificial intelligence Pattern recognition system
Artificial intelligence Pattern recognition system
 
Application of machine learning in industrial applications
Application of machine learning in industrial applicationsApplication of machine learning in industrial applications
Application of machine learning in industrial applications
 
7-1 ARTIFICIAL INTELLIGENCE IN PATHOLOGY semiar 2.pptx
7-1 ARTIFICIAL INTELLIGENCE IN PATHOLOGY  semiar 2.pptx7-1 ARTIFICIAL INTELLIGENCE IN PATHOLOGY  semiar 2.pptx
7-1 ARTIFICIAL INTELLIGENCE IN PATHOLOGY semiar 2.pptx
 
Advanced Technologies in Medicine and Allied.ppt
Advanced Technologies in Medicine and Allied.pptAdvanced Technologies in Medicine and Allied.ppt
Advanced Technologies in Medicine and Allied.ppt
 
nnU-Net: a self-configuring method for deep learning-based biomedical image s...
nnU-Net: a self-configuring method for deep learning-based biomedical image s...nnU-Net: a self-configuring method for deep learning-based biomedical image s...
nnU-Net: a self-configuring method for deep learning-based biomedical image s...
 
AI IN PATH final PPT.pptx
AI IN PATH final PPT.pptxAI IN PATH final PPT.pptx
AI IN PATH final PPT.pptx
 
Developing high content image analysis software for biologists
Developing high content image analysis software for biologistsDeveloping high content image analysis software for biologists
Developing high content image analysis software for biologists
 
Practical aspects of medical image ai for hospital (IRB course)
Practical aspects of medical image ai for hospital (IRB course)Practical aspects of medical image ai for hospital (IRB course)
Practical aspects of medical image ai for hospital (IRB course)
 
Face recognition: A Comparison of Appearance Based Approaches
Face recognition: A Comparison of Appearance Based ApproachesFace recognition: A Comparison of Appearance Based Approaches
Face recognition: A Comparison of Appearance Based Approaches
 
Anomaly Detection - Real World Scenarios, Approaches and Live Implementation
Anomaly Detection - Real World Scenarios, Approaches and Live ImplementationAnomaly Detection - Real World Scenarios, Approaches and Live Implementation
Anomaly Detection - Real World Scenarios, Approaches and Live Implementation
 
Supervised Learning
Supervised LearningSupervised Learning
Supervised Learning
 
Wearable Computing - Part IV: Ensemble classifiers & Insight into ongoing res...
Wearable Computing - Part IV: Ensemble classifiers & Insight into ongoing res...Wearable Computing - Part IV: Ensemble classifiers & Insight into ongoing res...
Wearable Computing - Part IV: Ensemble classifiers & Insight into ongoing res...
 
Basic image analysis(processing and classification) and visualization using m...
Basic image analysis(processing and classification) and visualization using m...Basic image analysis(processing and classification) and visualization using m...
Basic image analysis(processing and classification) and visualization using m...
 
Fcv core liu
Fcv core liuFcv core liu
Fcv core liu
 
ITS 832 CHAPTER 5FROM BUILDING A MODEL TO ADAPTIVE ROBUST DE.docx
ITS 832 CHAPTER 5FROM BUILDING A MODEL TO ADAPTIVE ROBUST DE.docxITS 832 CHAPTER 5FROM BUILDING A MODEL TO ADAPTIVE ROBUST DE.docx
ITS 832 CHAPTER 5FROM BUILDING A MODEL TO ADAPTIVE ROBUST DE.docx
 
Machine Learning in Pathology Diagnostics with Simagis Live
Machine Learning in Pathology Diagnostics with Simagis LiveMachine Learning in Pathology Diagnostics with Simagis Live
Machine Learning in Pathology Diagnostics with Simagis Live
 
Diagnosis Support by Machine Learning Using Posturography Data
Diagnosis Support by Machine Learning Using Posturography DataDiagnosis Support by Machine Learning Using Posturography Data
Diagnosis Support by Machine Learning Using Posturography Data
 
LAS - System Biology Lesson
LAS - System Biology LessonLAS - System Biology Lesson
LAS - System Biology Lesson
 
Semantic enrichment and similarity approximation for biomedical sequence images
Semantic enrichment and similarity approximation for biomedical sequence imagesSemantic enrichment and similarity approximation for biomedical sequence images
Semantic enrichment and similarity approximation for biomedical sequence images
 

More from Institute of Information Systems (HES-SO)

Classification of noisy free-text prostate cancer pathology reports using nat...
Classification of noisy free-text prostate cancer pathology reports using nat...Classification of noisy free-text prostate cancer pathology reports using nat...
Classification of noisy free-text prostate cancer pathology reports using nat...Institute of Information Systems (HES-SO)
 
Machine learning assisted citation screening for Systematic Reviews - Anjani ...
Machine learning assisted citation screening for Systematic Reviews - Anjani ...Machine learning assisted citation screening for Systematic Reviews - Anjani ...
Machine learning assisted citation screening for Systematic Reviews - Anjani ...Institute of Information Systems (HES-SO)
 
Exploiting biomedical literature to mine out a large multimodal dataset of ra...
Exploiting biomedical literature to mine out a large multimodal dataset of ra...Exploiting biomedical literature to mine out a large multimodal dataset of ra...
Exploiting biomedical literature to mine out a large multimodal dataset of ra...Institute of Information Systems (HES-SO)
 
Studying Public Medical Images from Open Access Literature and Social Network...
Studying Public Medical Images from Open Access Literature and Social Network...Studying Public Medical Images from Open Access Literature and Social Network...
Studying Public Medical Images from Open Access Literature and Social Network...Institute of Information Systems (HES-SO)
 
Risques opérationnels et le système de contrôle interne : les limites d’un te...
Risques opérationnels et le système de contrôle interne : les limites d’un te...Risques opérationnels et le système de contrôle interne : les limites d’un te...
Risques opérationnels et le système de contrôle interne : les limites d’un te...Institute of Information Systems (HES-SO)
 
Le contrôle interne dans les administrations publiques tient-il toutes ses pr...
Le contrôle interne dans les administrations publiques tient-il toutes ses pr...Le contrôle interne dans les administrations publiques tient-il toutes ses pr...
Le contrôle interne dans les administrations publiques tient-il toutes ses pr...Institute of Information Systems (HES-SO)
 
Le système de contrôle interne : Présentation générale, enjeux et méthodes
Le système de contrôle interne : Présentation générale, enjeux et méthodesLe système de contrôle interne : Présentation générale, enjeux et méthodes
Le système de contrôle interne : Présentation générale, enjeux et méthodesInstitute of Information Systems (HES-SO)
 
A 3-D Riesz-Covariance Texture Model for the Prediction of Nodule Recurrence ...
A 3-D Riesz-Covariance Texture Model for the Prediction of Nodule Recurrence ...A 3-D Riesz-Covariance Texture Model for the Prediction of Nodule Recurrence ...
A 3-D Riesz-Covariance Texture Model for the Prediction of Nodule Recurrence ...Institute of Information Systems (HES-SO)
 
NOSE: une approche Smart-City pour les zones périphériques et extra-urbaines
NOSE: une approche Smart-City pour les zones périphériques et extra-urbainesNOSE: une approche Smart-City pour les zones périphériques et extra-urbaines
NOSE: une approche Smart-City pour les zones périphériques et extra-urbainesInstitute of Information Systems (HES-SO)
 

More from Institute of Information Systems (HES-SO) (20)

MIE20232.pptx
MIE20232.pptxMIE20232.pptx
MIE20232.pptx
 
Classification of noisy free-text prostate cancer pathology reports using nat...
Classification of noisy free-text prostate cancer pathology reports using nat...Classification of noisy free-text prostate cancer pathology reports using nat...
Classification of noisy free-text prostate cancer pathology reports using nat...
 
Machine learning assisted citation screening for Systematic Reviews - Anjani ...
Machine learning assisted citation screening for Systematic Reviews - Anjani ...Machine learning assisted citation screening for Systematic Reviews - Anjani ...
Machine learning assisted citation screening for Systematic Reviews - Anjani ...
 
Exploiting biomedical literature to mine out a large multimodal dataset of ra...
Exploiting biomedical literature to mine out a large multimodal dataset of ra...Exploiting biomedical literature to mine out a large multimodal dataset of ra...
Exploiting biomedical literature to mine out a large multimodal dataset of ra...
 
L'IoT dans les usines. Quels avantages ?
L'IoT dans les usines. Quels avantages ?L'IoT dans les usines. Quels avantages ?
L'IoT dans les usines. Quels avantages ?
 
Studying Public Medical Images from Open Access Literature and Social Network...
Studying Public Medical Images from Open Access Literature and Social Network...Studying Public Medical Images from Open Access Literature and Social Network...
Studying Public Medical Images from Open Access Literature and Social Network...
 
Risques opérationnels et le système de contrôle interne : les limites d’un te...
Risques opérationnels et le système de contrôle interne : les limites d’un te...Risques opérationnels et le système de contrôle interne : les limites d’un te...
Risques opérationnels et le système de contrôle interne : les limites d’un te...
 
Le contrôle interne dans les administrations publiques tient-il toutes ses pr...
Le contrôle interne dans les administrations publiques tient-il toutes ses pr...Le contrôle interne dans les administrations publiques tient-il toutes ses pr...
Le contrôle interne dans les administrations publiques tient-il toutes ses pr...
 
Le système de contrôle interne : Présentation générale, enjeux et méthodes
Le système de contrôle interne : Présentation générale, enjeux et méthodesLe système de contrôle interne : Présentation générale, enjeux et méthodes
Le système de contrôle interne : Présentation générale, enjeux et méthodes
 
Crowdsourcing-based Mobile Application for Wheelchair Accessibility
Crowdsourcing-based Mobile Application for Wheelchair AccessibilityCrowdsourcing-based Mobile Application for Wheelchair Accessibility
Crowdsourcing-based Mobile Application for Wheelchair Accessibility
 
Quelle(s) valeur(s) pour le leadership stratégique ?
Quelle(s) valeur(s) pour le leadership stratégique ?Quelle(s) valeur(s) pour le leadership stratégique ?
Quelle(s) valeur(s) pour le leadership stratégique ?
 
A 3-D Riesz-Covariance Texture Model for the Prediction of Nodule Recurrence ...
A 3-D Riesz-Covariance Texture Model for the Prediction of Nodule Recurrence ...A 3-D Riesz-Covariance Texture Model for the Prediction of Nodule Recurrence ...
A 3-D Riesz-Covariance Texture Model for the Prediction of Nodule Recurrence ...
 
Challenges in medical imaging and the VISCERAL model
Challenges in medical imaging and the VISCERAL modelChallenges in medical imaging and the VISCERAL model
Challenges in medical imaging and the VISCERAL model
 
NOSE: une approche Smart-City pour les zones périphériques et extra-urbaines
NOSE: une approche Smart-City pour les zones périphériques et extra-urbainesNOSE: une approche Smart-City pour les zones périphériques et extra-urbaines
NOSE: une approche Smart-City pour les zones périphériques et extra-urbaines
 
Medical image analysis and big data evaluation infrastructures
Medical image analysis and big data evaluation infrastructuresMedical image analysis and big data evaluation infrastructures
Medical image analysis and big data evaluation infrastructures
 
Medical image analysis, retrieval and evaluation infrastructures
Medical image analysis, retrieval and evaluation infrastructuresMedical image analysis, retrieval and evaluation infrastructures
Medical image analysis, retrieval and evaluation infrastructures
 
How to detect soft falls on devices
How to detect soft falls on devicesHow to detect soft falls on devices
How to detect soft falls on devices
 
FUNDAMENTALS OF TEXTURE PROCESSING FOR BIOMEDICAL IMAGE ANALYSIS
FUNDAMENTALS OF TEXTURE PROCESSING FOR BIOMEDICAL IMAGE ANALYSISFUNDAMENTALS OF TEXTURE PROCESSING FOR BIOMEDICAL IMAGE ANALYSIS
FUNDAMENTALS OF TEXTURE PROCESSING FOR BIOMEDICAL IMAGE ANALYSIS
 
MOBILE COLLECTION AND DISSEMINATION OF SENIORS’ SKILLS
MOBILE COLLECTION AND DISSEMINATION OF SENIORS’ SKILLSMOBILE COLLECTION AND DISSEMINATION OF SENIORS’ SKILLS
MOBILE COLLECTION AND DISSEMINATION OF SENIORS’ SKILLS
 
Enhanced Students Laboratory The GET project
Enhanced Students Laboratory The GET projectEnhanced Students Laboratory The GET project
Enhanced Students Laboratory The GET project
 

Recently uploaded

Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demoHarshalMandlekar2
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxLoriGlavin3
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfMounikaPolabathina
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxLoriGlavin3
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rick Flair
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfLoriGlavin3
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsNathaniel Shimoni
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxBkGupta21
 

Recently uploaded (20)

Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demo
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdf
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdf
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directions
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptx
 

Separating compound figures in journal articles to allow for subfigure classification

  • 1. Institute of Information Systems Separating compound figures in journal articles to allow for subfigure classification Ajad Chhatkuli Antonio Foncubierta-Rodríguez Dimitrios Markonis Henning Müller
  • 2. Motivation Institute of Information Systems • Figures in biomedical journals contain a lot of information • CBIR has been proposed for accessing medical literature • Modality classification • Improves accessibility • Allows result filtering • But 50% of figures are compound or multipanel
  • 3. Aim Institute of Information Systems • Develop a system that separates compound figures in the biomedical literature • Visual-information only • Textual information is discarded • Modality-independent • One method for many images types • Many methods for few images types • Tunable according to the dataset • Large-scale tested • Approximately 250 open access journals
  • 4. Compound figure examples Institute of Information Systems
  • 5. Methods. Dataset Institute of Information Systems • 2982 manually classified figures from ImageCLEF 2012 dataset • Ground truth: • Image subclass: 2x1,1x2, • Position of separators
  • 6. Methods. Overview Institute of Information Systems • Problem is separated in two • Find subfigure separator candidates • Preprocessing if required • Analyze candidates • Remove false positives • Rule-based decisions
  • 7. Methods. Separator detection Institute of Information Systems • Based on minimum pixel projection for white-space separated figures • Horizontal  Vertical detection • Inverse order by rotation according to aspect ratio • Recursive
  • 8. Methods. Separator detection Institute of Information Systems • Rule-based processing • Progressive truncation to remove labels if no separators are found • Text removal based on connected commponents if no separators are found • Complement image for black-space separations • Standard deviation image for subtle separations • Binarization of non-graph figures: • Less than 40% of the image is white or almost white
  • 9. Methods. Separator analysis Institute of Information Systems • Classification problem • True/false separator • Features used: • Closeness to border, division ratio, standard deviation, text removal analysis, histogram, gap comparison • Classifiers: • SVM • Rule-based classifier
  • 10. Results Institute of Information Systems
  • 11. Successful examples Institute of Information Systems
  • 12. Successful examples Institute of Information Systems
  • 13. Unsuccessful examples Institute of Information Systems Not horizontal/vertical No separation gap separation
  • 14. Conclusions future work Institute of Information Systems • Good results for a wide range of images • Using purely visual information • Separation problem: detection and analysis • Rule weights can be fine-tuned according to dataset • What would be the impact of a larger training set? • What would be the impact in existing modality classification accuracy?
  • 15. Conclusions future work Institute of Information Systems • Good results for a wide range of images • Using purely visual information • Separation problem: detection and analysis • Rule weights can be fine-tuned according to dataset • What would be the impact of a larger training set? • What would be the impact in existing modality classification accuracy?
  • 16. Institute of Information Systems Thanks for your attention! More information at http://medgift.hevs.ch Ajad Chhatkuli, Dimitrios Markonis, Antonio Foncubierta-Rodríguez, Fabrice Meriaudeau and Henning Müller, Separating compound figures in journal articles to allow for subfigure classification, in: SPIE, Medical Imaging, Orlando, FL, USA, 2013