In this work, a content-based image retrieval (CBIR) - computer aided diagnosis (CADx) scheme is proposed that aims to contribute on the decision-making process of a radiologist in the assessment of a mammographic case. The proposed scheme comprises two stages: The first one concerns a margin-specific CBIR scheme that utilizes a support vector machine (SVM) ensemble to produce vectors of likelihood for each margin type in a mass. In the second stage, a decision process is employed that is based on the top K retrieved images. Performance evaluation is addressed by using standard measures on a widely adopted digital database for screening mammography (DDSM).
Iterative Visual Recognition for Learning Based Randomized Bin-pickingKensuke Harada
This paper proposes a iterative visual recognition system
for learning based randomized bin-picking. Since the configuration on randomly stacked objects while executing the current picking trial is just partially different from the configuration while executing the previous picking trial, we consider detecting the poses of objects just by using a part of visual image taken at the current picking trial where it is different from the visual image taken at the previous picking trial. By using this method, we do not need to try to detect the poses of all objects included in the pile at every picking trial.
Assuming the 3D vision sensor attached at the wrist of a manipulator, we first explain a method to determine the pose of a 3D vision sensor maximizing the visibility of randomly stacked objects. Then, we explain a method for detecting the poses of randomly stacked objects. Effectiveness of our proposed approach is confirmed by experiments using a dual-arm manipulator where a 3D vision sensor and the two-fingered hand attached at the right and the left wrists, respectively.
2014 IEEE Int. Symposium on System Integration (SII) : Project on Development...Kensuke Harada
This paper discusses the vision/manipulation technology where a dual-arm manipulator first picks up an object from the pile, then regrasps it from the right hand to the left hand, and finally places it to the fixture. We especially focus on the grasp/manipulation planner for a dual-arm manipulator. Here, our grasp planner is well applied for objects which can be approximated by multiple cylinders. Our pick-and-place planner can quickly find a robot motion with regrasp. We also explain our vision technology to measure the position/orientation of an object. To show the effectiveness of our proposed approach, we show an experimental result of a dual-arm manipulator.
Microcalcification oriented content based mammogram retrieval for breast canc...Lazaros Tsochatzidis
Microcalcifications (MCs) provide a significant
early indication of breast malignancy. This work introduces a
supervised scheme for malignancy risk assessment of mammograms containing MCs. The proposed scheme employs shape and
textural features as input to a support vector machine (SVM)
ensemble, in order to perform content-based image retrieval
(CBIR) of mammograms. The retrieval performance of the
proposed scheme has been evaluated by taking into account
the variation of MCs morphology as defined in BI-RADS. In
our experiments, we use a set of 87 mammograms containing
MCs, obtained from the widely adopted DDSM database for
screening mammography. The experimental results demonstrate
that the proposed supervised CBIR scheme addresses effective
retrieval of MCs mammograms outperforming relevant unsupervised schemes.
This paper tackles the problem of the user’s incapability to describe the image that he seeks by introducing an innovative image engine called TsoKaDo. Until now the traditional web image was based only on the comparison between metadata of the webpage the user’s textual description. In the method proposed images various search engines are classified based on visual content new tags are proposed to the user. Recursively the results get to the user’s desire. The aim of this paper is to present new way of searching especially in case with less query generality greater weight in visual content rather than in metadata.
Mammography is currently the dominant imaging modality for the early detection of breast cancer. However, its robustness in distinguishing malignancy is relatively low, resulting in a large number of unnecessary biopsies. A computer-aided diagnosis (CAD) scheme, capable of visually justifying its results, is expected to aid the decision made by radiologists. Content-based image retrieval (CBIR) accounts for a promising paradigm in this direction. Facing this challenge, we introduce a CBIR scheme that utilizes the extracted features as input to a support vector machine (SVM) ensemble. The final features used for CBIR comprise the participation value of each SVM. The retrieval performance of the proposed scheme has been evaluated quantitatively on the basis of the standard measures. In the experiments, a set of 90 mammograms is used, derived from a widely adopted digital database for screening mammography. The experimental results show the improved performance of the proposed scheme.
Iterative Visual Recognition for Learning Based Randomized Bin-pickingKensuke Harada
This paper proposes a iterative visual recognition system
for learning based randomized bin-picking. Since the configuration on randomly stacked objects while executing the current picking trial is just partially different from the configuration while executing the previous picking trial, we consider detecting the poses of objects just by using a part of visual image taken at the current picking trial where it is different from the visual image taken at the previous picking trial. By using this method, we do not need to try to detect the poses of all objects included in the pile at every picking trial.
Assuming the 3D vision sensor attached at the wrist of a manipulator, we first explain a method to determine the pose of a 3D vision sensor maximizing the visibility of randomly stacked objects. Then, we explain a method for detecting the poses of randomly stacked objects. Effectiveness of our proposed approach is confirmed by experiments using a dual-arm manipulator where a 3D vision sensor and the two-fingered hand attached at the right and the left wrists, respectively.
2014 IEEE Int. Symposium on System Integration (SII) : Project on Development...Kensuke Harada
This paper discusses the vision/manipulation technology where a dual-arm manipulator first picks up an object from the pile, then regrasps it from the right hand to the left hand, and finally places it to the fixture. We especially focus on the grasp/manipulation planner for a dual-arm manipulator. Here, our grasp planner is well applied for objects which can be approximated by multiple cylinders. Our pick-and-place planner can quickly find a robot motion with regrasp. We also explain our vision technology to measure the position/orientation of an object. To show the effectiveness of our proposed approach, we show an experimental result of a dual-arm manipulator.
Microcalcification oriented content based mammogram retrieval for breast canc...Lazaros Tsochatzidis
Microcalcifications (MCs) provide a significant
early indication of breast malignancy. This work introduces a
supervised scheme for malignancy risk assessment of mammograms containing MCs. The proposed scheme employs shape and
textural features as input to a support vector machine (SVM)
ensemble, in order to perform content-based image retrieval
(CBIR) of mammograms. The retrieval performance of the
proposed scheme has been evaluated by taking into account
the variation of MCs morphology as defined in BI-RADS. In
our experiments, we use a set of 87 mammograms containing
MCs, obtained from the widely adopted DDSM database for
screening mammography. The experimental results demonstrate
that the proposed supervised CBIR scheme addresses effective
retrieval of MCs mammograms outperforming relevant unsupervised schemes.
This paper tackles the problem of the user’s incapability to describe the image that he seeks by introducing an innovative image engine called TsoKaDo. Until now the traditional web image was based only on the comparison between metadata of the webpage the user’s textual description. In the method proposed images various search engines are classified based on visual content new tags are proposed to the user. Recursively the results get to the user’s desire. The aim of this paper is to present new way of searching especially in case with less query generality greater weight in visual content rather than in metadata.
Mammography is currently the dominant imaging modality for the early detection of breast cancer. However, its robustness in distinguishing malignancy is relatively low, resulting in a large number of unnecessary biopsies. A computer-aided diagnosis (CAD) scheme, capable of visually justifying its results, is expected to aid the decision made by radiologists. Content-based image retrieval (CBIR) accounts for a promising paradigm in this direction. Facing this challenge, we introduce a CBIR scheme that utilizes the extracted features as input to a support vector machine (SVM) ensemble. The final features used for CBIR comprise the participation value of each SVM. The retrieval performance of the proposed scheme has been evaluated quantitatively on the basis of the standard measures. In the experiments, a set of 90 mammograms is used, derived from a widely adopted digital database for screening mammography. The experimental results show the improved performance of the proposed scheme.
Product defect detection based on convolutional autoencoder and one-class cla...IAESIJAI
To meet customer expectations and remain competitive, industrials try constantly to improve their quality control systems. There is hence increasing demand for adopting automatic defect detection solutions. However, the biggest issue in addressing such systems is the imbalanced aspect of industrial datasets. Often, defect-free samples far exceed the defected ones, due to continuous improvement approaches adopted by manufacturing companies. In this sense, we propose an automatic defect detection system based on one-class classification (OCC) since it involves only normal samples during training. It consists of three sub-models, first, a convolutional autoencoder serves as latent features extractor, the extracted features vectors are subsequently fed into the dimensionality reduction process by performing principal component analysis (PCA), then the reduced-dimensional data are used to train the one-class classifier support vector data description (SVDD). During the test phase, both normal and defected images are used. The first two stages of the trained model generate a low-dimensional features vector, whereas the SVDD classifies the new input, whether it is defect-free or defected. This approach is evaluated on the carpet images from the industrial inspection dataset MVTec anomaly detection (MVTec AD). During training, only normal images were used. The results showed that the proposed method outperforms the state-of-the-art methods.
This slide represents topics on PCA (Principal Component Analysis) and SVD (Singular Value Decomposition) where I tried to cover basic PCA, application, and use of PCA and SVD, Important keywords to know about PCA briefly, PCA algorithm and implementation, Basic SVD, SVD calculation, SVD implementation, Performance comparison of SVD and PCA regarding one publicly available dataset.
N.B. Information in this slide are gathered from
1. Machine Learning course by Andrew NG,
2. Mining of Massive Dataset | Stanford University | Artificial Intelligence - All in One (youtube channel)
3. and many more they are described in the slide.
PR-159 : Synergistic Image and Feature Adaptation: Towards Cross-Modality Dom...Sunghoon Joo
Paper review slide.
Title : Synergistic Image and Feature Adaptation: Towards Cross-Modality Domain Adaptation for Medical Image Segmentation
Paper url : https://arxiv.org/pdf/1901.08211
video url : https://youtu.be/sR7hBJGpwQo
FREE- REFERENCE IMAGE QUALITY ASSESSMENT FRAMEWORK USING METRICS FUSION AND D...sipij
This paper focuses on no-reference image quality assessment(NR-IQA)metrics. In the literature, a wide range of algorithms are proposed to automatically estimate the perceived quality of visual data. However, most of them are not able to effectively quantify the various degradations and artifacts that the image may undergo. Thus, merging of diverse metrics operating in different information domains is hoped to yield better performances, which is the main theme of the proposed work. In particular, the metric proposed in this paper is based on three well-known NR-IQA objective metrics that depend on natural scene statistical attributes from three different domains to extract a vector of image features. Then, Singular Value Decomposition (SVD) based dominant eigenvectors method is used to select the most relevant image quality attributes. These latter are used as input to Relevance Vector Machine (RVM) to derive the overall quality index. Validation experiments are divided into two groups; in the first group, learning process (training and test phases) is applied on one single image quality database whereas in the second group of simulations, training and test phases are separated on two distinct datasets. Obtained results demonstrate that the proposed metric performs very well in terms of correlation, monotonicity and accuracy in both the two scenarios.
The German Credit data provides variables that help classify observations as good credit vs bad credit. Multiple algorithms such as Logistic Regression, Classification tree, GAM, Neural Net and Linear Discriminant Analysis were used to compare the classification power of the models built.
For the full video of this presentation, please visit:
https://www.edge-ai-vision.com/2021/02/introduction-to-simultaneous-localization-and-mapping-slam-a-presentation-from-skydio/
Gareth Cross, Technical Lead for State Estimation at Skydio, presents the “Introduction to Simultaneous Localization and Mapping (SLAM)” tutorial at the September 2020 Embedded Vision Summit.
This talk provides an introduction to the fundamentals of simultaneous localization and mapping (SLAM). Cross provides foundational knowledge; viewers are not expected to have any prerequisite experience in the field. The talk consists of an introduction to the concept of SLAM, as well as practical design considerations in formulating SLAM problems. Visual inertial odometry is introduced as a motivating example of SLAM, and Cross reviews how the problem is structured and solved.
Lane and Object Detection for Autonomous Vehicle using Advanced Computer VisionYogeshIJTSRD
The vision of this project is to develop lane and object detection in Autonomous Vehicle system to run efficiently in normal road condition and to eliminate the use of high cost Light based LiDAR system to implement high resolution cameras with advanced computer vision and deep learning technology to provide an Advanced Driver Assistance System ADAS . Detecting lane lines could be a crucial task for any self driving autonomous vehicle. Hence, this project was focused to identify lane lines on the road using OpenCV. The OpenCV tools such as colour selection, the region of interest selection, grey scaling, canny edge detection and perspective transformation are being employed. This project is modelled as an integration of two systems to solve the real time implementation problem in autonomous vehicles. The first part of the system is lane detection by advanced computer vision techniques to detect the lane lines to command the vehicle to stay inside the lane marking. The second part of the system is object detection and tracking is to detect and track the vehicle and pedestrians on the road to get a clear understanding of the environment to plan and generate a trajectory to navigate the autonomous vehicle safely to its destination without any crashes, this is done by a special deep learning method called transfer learning with Single Shot multibox Detection SSD algorithm and Mobile Net architecture. G. Monika | S. Bhavani | L. Azim Jahan Siana | N. Meenakshi "Lane and Object Detection for Autonomous Vehicle using Advanced Computer Vision" Published in International Journal of Trend in Scientific Research and Development (ijtsrd), ISSN: 2456-6470, Volume-5 | Issue-3 , April 2021, URL: https://www.ijtsrd.com/papers/ijtsrd39952.pdf Paper URL: https://www.ijtsrd.com/engineering/electronics-and-communication-engineering/39952/lane-and-object-detection-for-autonomous-vehicle-using-advanced-computer-vision/g-monika
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...Ramesh Iyer
In today's fast-changing business world, Companies that adapt and embrace new ideas often need help to keep up with the competition. However, fostering a culture of innovation takes much work. It takes vision, leadership and willingness to take risks in the right proportion. Sachin Dev Duggal, co-founder of Builder.ai, has perfected the art of this balance, creating a company culture where creativity and growth are nurtured at each stage.
More Related Content
Similar to A two stage svm-based mammographic cbir for ca dx
Product defect detection based on convolutional autoencoder and one-class cla...IAESIJAI
To meet customer expectations and remain competitive, industrials try constantly to improve their quality control systems. There is hence increasing demand for adopting automatic defect detection solutions. However, the biggest issue in addressing such systems is the imbalanced aspect of industrial datasets. Often, defect-free samples far exceed the defected ones, due to continuous improvement approaches adopted by manufacturing companies. In this sense, we propose an automatic defect detection system based on one-class classification (OCC) since it involves only normal samples during training. It consists of three sub-models, first, a convolutional autoencoder serves as latent features extractor, the extracted features vectors are subsequently fed into the dimensionality reduction process by performing principal component analysis (PCA), then the reduced-dimensional data are used to train the one-class classifier support vector data description (SVDD). During the test phase, both normal and defected images are used. The first two stages of the trained model generate a low-dimensional features vector, whereas the SVDD classifies the new input, whether it is defect-free or defected. This approach is evaluated on the carpet images from the industrial inspection dataset MVTec anomaly detection (MVTec AD). During training, only normal images were used. The results showed that the proposed method outperforms the state-of-the-art methods.
This slide represents topics on PCA (Principal Component Analysis) and SVD (Singular Value Decomposition) where I tried to cover basic PCA, application, and use of PCA and SVD, Important keywords to know about PCA briefly, PCA algorithm and implementation, Basic SVD, SVD calculation, SVD implementation, Performance comparison of SVD and PCA regarding one publicly available dataset.
N.B. Information in this slide are gathered from
1. Machine Learning course by Andrew NG,
2. Mining of Massive Dataset | Stanford University | Artificial Intelligence - All in One (youtube channel)
3. and many more they are described in the slide.
PR-159 : Synergistic Image and Feature Adaptation: Towards Cross-Modality Dom...Sunghoon Joo
Paper review slide.
Title : Synergistic Image and Feature Adaptation: Towards Cross-Modality Domain Adaptation for Medical Image Segmentation
Paper url : https://arxiv.org/pdf/1901.08211
video url : https://youtu.be/sR7hBJGpwQo
FREE- REFERENCE IMAGE QUALITY ASSESSMENT FRAMEWORK USING METRICS FUSION AND D...sipij
This paper focuses on no-reference image quality assessment(NR-IQA)metrics. In the literature, a wide range of algorithms are proposed to automatically estimate the perceived quality of visual data. However, most of them are not able to effectively quantify the various degradations and artifacts that the image may undergo. Thus, merging of diverse metrics operating in different information domains is hoped to yield better performances, which is the main theme of the proposed work. In particular, the metric proposed in this paper is based on three well-known NR-IQA objective metrics that depend on natural scene statistical attributes from three different domains to extract a vector of image features. Then, Singular Value Decomposition (SVD) based dominant eigenvectors method is used to select the most relevant image quality attributes. These latter are used as input to Relevance Vector Machine (RVM) to derive the overall quality index. Validation experiments are divided into two groups; in the first group, learning process (training and test phases) is applied on one single image quality database whereas in the second group of simulations, training and test phases are separated on two distinct datasets. Obtained results demonstrate that the proposed metric performs very well in terms of correlation, monotonicity and accuracy in both the two scenarios.
The German Credit data provides variables that help classify observations as good credit vs bad credit. Multiple algorithms such as Logistic Regression, Classification tree, GAM, Neural Net and Linear Discriminant Analysis were used to compare the classification power of the models built.
For the full video of this presentation, please visit:
https://www.edge-ai-vision.com/2021/02/introduction-to-simultaneous-localization-and-mapping-slam-a-presentation-from-skydio/
Gareth Cross, Technical Lead for State Estimation at Skydio, presents the “Introduction to Simultaneous Localization and Mapping (SLAM)” tutorial at the September 2020 Embedded Vision Summit.
This talk provides an introduction to the fundamentals of simultaneous localization and mapping (SLAM). Cross provides foundational knowledge; viewers are not expected to have any prerequisite experience in the field. The talk consists of an introduction to the concept of SLAM, as well as practical design considerations in formulating SLAM problems. Visual inertial odometry is introduced as a motivating example of SLAM, and Cross reviews how the problem is structured and solved.
Lane and Object Detection for Autonomous Vehicle using Advanced Computer VisionYogeshIJTSRD
The vision of this project is to develop lane and object detection in Autonomous Vehicle system to run efficiently in normal road condition and to eliminate the use of high cost Light based LiDAR system to implement high resolution cameras with advanced computer vision and deep learning technology to provide an Advanced Driver Assistance System ADAS . Detecting lane lines could be a crucial task for any self driving autonomous vehicle. Hence, this project was focused to identify lane lines on the road using OpenCV. The OpenCV tools such as colour selection, the region of interest selection, grey scaling, canny edge detection and perspective transformation are being employed. This project is modelled as an integration of two systems to solve the real time implementation problem in autonomous vehicles. The first part of the system is lane detection by advanced computer vision techniques to detect the lane lines to command the vehicle to stay inside the lane marking. The second part of the system is object detection and tracking is to detect and track the vehicle and pedestrians on the road to get a clear understanding of the environment to plan and generate a trajectory to navigate the autonomous vehicle safely to its destination without any crashes, this is done by a special deep learning method called transfer learning with Single Shot multibox Detection SSD algorithm and Mobile Net architecture. G. Monika | S. Bhavani | L. Azim Jahan Siana | N. Meenakshi "Lane and Object Detection for Autonomous Vehicle using Advanced Computer Vision" Published in International Journal of Trend in Scientific Research and Development (ijtsrd), ISSN: 2456-6470, Volume-5 | Issue-3 , April 2021, URL: https://www.ijtsrd.com/papers/ijtsrd39952.pdf Paper URL: https://www.ijtsrd.com/engineering/electronics-and-communication-engineering/39952/lane-and-object-detection-for-autonomous-vehicle-using-advanced-computer-vision/g-monika
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...Ramesh Iyer
In today's fast-changing business world, Companies that adapt and embrace new ideas often need help to keep up with the competition. However, fostering a culture of innovation takes much work. It takes vision, leadership and willingness to take risks in the right proportion. Sachin Dev Duggal, co-founder of Builder.ai, has perfected the art of this balance, creating a company culture where creativity and growth are nurtured at each stage.
GraphRAG is All You need? LLM & Knowledge GraphGuy Korland
Guy Korland, CEO and Co-founder of FalkorDB, will review two articles on the integration of language models with knowledge graphs.
1. Unifying Large Language Models and Knowledge Graphs: A Roadmap.
https://arxiv.org/abs/2306.08302
2. Microsoft Research's GraphRAG paper and a review paper on various uses of knowledge graphs:
https://www.microsoft.com/en-us/research/blog/graphrag-unlocking-llm-discovery-on-narrative-private-data/
Search and Society: Reimagining Information Access for Radical FuturesBhaskar Mitra
The field of Information retrieval (IR) is currently undergoing a transformative shift, at least partly due to the emerging applications of generative AI to information access. In this talk, we will deliberate on the sociotechnical implications of generative AI for information access. We will argue that there is both a critical necessity and an exciting opportunity for the IR community to re-center our research agendas on societal needs while dismantling the artificial separation between the work on fairness, accountability, transparency, and ethics in IR and the rest of IR research. Instead of adopting a reactionary strategy of trying to mitigate potential social harms from emerging technologies, the community should aim to proactively set the research agenda for the kinds of systems we should build inspired by diverse explicitly stated sociotechnical imaginaries. The sociotechnical imaginaries that underpin the design and development of information access technologies needs to be explicitly articulated, and we need to develop theories of change in context of these diverse perspectives. Our guiding future imaginaries must be informed by other academic fields, such as democratic theory and critical theory, and should be co-developed with social science scholars, legal scholars, civil rights and social justice activists, and artists, among others.
"Impact of front-end architecture on development cost", Viktor TurskyiFwdays
I have heard many times that architecture is not important for the front-end. Also, many times I have seen how developers implement features on the front-end just following the standard rules for a framework and think that this is enough to successfully launch the project, and then the project fails. How to prevent this and what approach to choose? I have launched dozens of complex projects and during the talk we will analyze which approaches have worked for me and which have not.
Accelerate your Kubernetes clusters with Varnish CachingThijs Feryn
A presentation about the usage and availability of Varnish on Kubernetes. This talk explores the capabilities of Varnish caching and shows how to use the Varnish Helm chart to deploy it to Kubernetes.
This presentation was delivered at K8SUG Singapore. See https://feryn.eu/presentations/accelerate-your-kubernetes-clusters-with-varnish-caching-k8sug-singapore-28-2024 for more details.
Key Trends Shaping the Future of Infrastructure.pdfCheryl Hung
Keynote at DIGIT West Expo, Glasgow on 29 May 2024.
Cheryl Hung, ochery.com
Sr Director, Infrastructure Ecosystem, Arm.
The key trends across hardware, cloud and open-source; exploring how these areas are likely to mature and develop over the short and long-term, and then considering how organisations can position themselves to adapt and thrive.
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...James Anderson
Effective Application Security in Software Delivery lifecycle using Deployment Firewall and DBOM
The modern software delivery process (or the CI/CD process) includes many tools, distributed teams, open-source code, and cloud platforms. Constant focus on speed to release software to market, along with the traditional slow and manual security checks has caused gaps in continuous security as an important piece in the software supply chain. Today organizations feel more susceptible to external and internal cyber threats due to the vast attack surface in their applications supply chain and the lack of end-to-end governance and risk management.
The software team must secure its software delivery process to avoid vulnerability and security breaches. This needs to be achieved with existing tool chains and without extensive rework of the delivery processes. This talk will present strategies and techniques for providing visibility into the true risk of the existing vulnerabilities, preventing the introduction of security issues in the software, resolving vulnerabilities in production environments quickly, and capturing the deployment bill of materials (DBOM).
Speakers:
Bob Boule
Robert Boule is a technology enthusiast with PASSION for technology and making things work along with a knack for helping others understand how things work. He comes with around 20 years of solution engineering experience in application security, software continuous delivery, and SaaS platforms. He is known for his dynamic presentations in CI/CD and application security integrated in software delivery lifecycle.
Gopinath Rebala
Gopinath Rebala is the CTO of OpsMx, where he has overall responsibility for the machine learning and data processing architectures for Secure Software Delivery. Gopi also has a strong connection with our customers, leading design and architecture for strategic implementations. Gopi is a frequent speaker and well-known leader in continuous delivery and integrating security into software delivery.
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...UiPathCommunity
💥 Speed, accuracy, and scaling – discover the superpowers of GenAI in action with UiPath Document Understanding and Communications Mining™:
See how to accelerate model training and optimize model performance with active learning
Learn about the latest enhancements to out-of-the-box document processing – with little to no training required
Get an exclusive demo of the new family of UiPath LLMs – GenAI models specialized for processing different types of documents and messages
This is a hands-on session specifically designed for automation developers and AI enthusiasts seeking to enhance their knowledge in leveraging the latest intelligent document processing capabilities offered by UiPath.
Speakers:
👨🏫 Andras Palfi, Senior Product Manager, UiPath
👩🏫 Lenka Dulovicova, Product Program Manager, UiPath
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
A two stage svm-based mammographic cbir for ca dx
1. A two-stage SVM-based
mammographic CBIR for CADx
L. TSOCHATZIDIS1
A. KARAHALIOU2
K. ZAGORIS1
S. SKIADOPOULOS2
N. ARIKIDIS2
L. COSTARIDOU2
I. PRATIKAKIS1
University of Patras 2
School of Medicine
Department of Medical Physics
Democritus University of Thrace 1
Department of Electrical and Computer Engineering
Visual Computing Group
TSOCHATZIDIS ET AL. A TWO-STAGE SVM-BASED MAMMOGRAPHIC CBIR FOR CADX, MICCAI-BIA 201519/10/2015 1
2. CADx in Mammography
Mammography is a dominant imaging
modality for early detection of breast cancer
Often, diagnosis leads to unnecessary
biopsies
Two types of CADx:
• Single-stage: Classification schemes for benign-
malignant discrimination
• Two-stage: Content-Based Image Retrieval that
feeds the diagnosis step which discriminates
between benign and malignant
TSOCHATZIDIS ET AL. A TWO-STAGE SVM-BASED MAMMOGRAPHIC CBIR FOR CADX, MICCAI-BIA 201519/10/2015 2
3. Proposed CBIR-CAD System
CAD system that incorporates a CBIR step and a decision step
Retrieves similar images based on low-level image features
Margin specific CBIR
Diagnosis is based on the ranked lists produced by CBIR
Provides visual aid and enables consultation of previous cases, leading to increased confidence
into incorporating CAD-cued results
TSOCHATZIDIS ET AL. A TWO-STAGE SVM-BASED MAMMOGRAPHIC CBIR FOR CADX, MICCAI-BIA 201519/10/2015 3
5. CBIR-CAD’s pipeline
BENIGN / MALIGNANT
TSOCHATZIDIS ET AL. A TWO-STAGE SVM-BASED MAMMOGRAPHIC CBIR FOR CADX, MICCAI-BIA 201519/10/2015 5
6. Semi-automatic Segmentation
The Dijkstra’s shortest path algorithm is exploited to obtain the optimal path between
sequential pairs of landmark points upon mass boundaries
A new cost function is proposed to avoid background correction techniques that may deform
mass contour and introduce additional adjustment parameters
𝑐𝑜𝑠𝑡 𝑃 𝑥, 𝑦 = 𝐼 𝑝 − 𝑔 𝐴𝐵 𝑃 𝑔 𝐴𝐵 𝑃 = 𝐼𝐴 + 𝐼 𝐵 − 𝐼𝐴
𝑑(𝐴, 𝑃)
𝑑(𝐴, 𝐵)
TSOCHATZIDIS ET AL. A TWO-STAGE SVM-BASED MAMMOGRAPHIC CBIR FOR CADX, MICCAI-BIA 201519/10/2015 6
Arikidis, N., Skiadopoulos, S., Karahaliou, A., Kazantzi, A., Vassiou, K., Tsochatzidis, L., Pratikakis, I.,
Costaridou, L.: Shortest paths of mass contour estimates in mammography. In: MICCAI-BIA 2015
8. Feature Extraction – Global Shape
Solidity factor: The degree that the shape deviates from its convex hull
𝑆𝑜𝑙𝑖𝑑𝑖𝑡𝑦 =
𝐴𝑟𝑒𝑎 𝑜𝑓 𝑚𝑎𝑠𝑠 (𝐴)
𝐴𝑟𝑒𝑎 𝑜𝑓 𝑖𝑡𝑠 𝑐𝑜𝑛𝑣𝑒𝑥 ℎ𝑢𝑙𝑙 (𝐻)
Compactness factor: The degree that a shape deviates from a perfect circle
𝐶𝑜𝑚𝑝𝑎𝑐𝑡𝑛𝑒𝑠𝑠 = 1 −
4𝜋𝐴2
𝑃2
TSOCHATZIDIS ET AL. A TWO-STAGE SVM-BASED MAMMOGRAPHIC CBIR FOR CADX, MICCAI-BIA 201519/10/2015 8
9. Feature Extraction – Global Shape
Circumscribed Microlobulated Spiculated Ill-defined
Compactness=0.008
Solidity=0.99
Compactness=0.20
Solidity=0.92
Compactness=0.83
Solidity=0.32
Compactness=0.17
Solidity=0.93
TSOCHATZIDIS ET AL. A TWO-STAGE SVM-BASED MAMMOGRAPHIC CBIR FOR CADX, MICCAI-BIA 201519/10/2015 9
10. Feature Extraction – DFT of NRL
Normalized Radial Length Function
1. The distance of each contour point to the
shape’s center of gravity
2. Normalized by the average radial length
3. Computation of Discrete Fourier Transform
coefficients
TSOCHATZIDIS ET AL. A TWO-STAGE SVM-BASED MAMMOGRAPHIC CBIR FOR CADX, MICCAI-BIA 201519/10/2015 10
11. Feature Extraction - Texture-based
Rubber Band Straightening Transform (RBST)
•Unfolding the ribbon around the contour as a
flat image
•RBST Column → line segment normal to the
contour
•RBST Row → iso-distant to the contour paths
•Intensity profiles at every contour point along
a line segment normal to the contour
TSOCHATZIDIS ET AL. A TWO-STAGE SVM-BASED MAMMOGRAPHIC CBIR FOR CADX, MICCAI-BIA 201519/10/2015 11
12. Feature Extraction - Texture-based
RBST Image
Sobel gradient magnitude operator
Detected edge points
TSOCHATZIDIS ET AL. A TWO-STAGE SVM-BASED MAMMOGRAPHIC CBIR FOR CADX, MICCAI-BIA 201519/10/2015 12
13. Feature Extraction - Texture-based
TSOCHATZIDIS ET AL. A TWO-STAGE SVM-BASED MAMMOGRAPHIC CBIR FOR CADX, MICCAI-BIA 201519/10/2015 13
14. Feature Extraction - Texture-based
Extracted Features
•Distance between edge points of consecutive
columns
•Distance between edge points and middle row of
RBST image
•Magnitude of gradient on y-axis
•Gradient orientation divergence from vertical
direction
•Acutance (The sum of the difference of gray-level
values between pixels that are iso-distant from
either sides of the contour)
Mean and SD value of the above functions
TSOCHATZIDIS ET AL. A TWO-STAGE SVM-BASED MAMMOGRAPHIC CBIR FOR CADX, MICCAI-BIA 201519/10/2015 14
15. Feature Extraction - Texture-based
Feature Name Circumscribed Ill-defined
Avg. dist. edge points 0.036 0.747
Avg. dist center row 0.323 0.865
SD dist. edge points 0.105 0.879
SD dist center row 0.077 0.952
TSOCHATZIDIS ET AL. A TWO-STAGE SVM-BASED MAMMOGRAPHIC CBIR FOR CADX, MICCAI-BIA 201519/10/2015 15
17. The SVM Layer – Support Vector
Machines
Binary Linear Classifiers
For non-linear problems: Projection of
samples to a higher dimensionality space.
Finds a hyper-plane that optimally separates
the two classes
Decision function:
𝑓 𝑥 = 𝑠𝑖𝑔𝑛(𝑤 ∙ 𝑥 + 𝑏)
TSOCHATZIDIS ET AL. A TWO-STAGE SVM-BASED MAMMOGRAPHIC CBIR FOR CADX, MICCAI-BIA 201519/10/2015 17
18. The SVM Layer – Structure
An ensemble of binary SVM classifiers is
employed
One SVM for each class – Four SVMs in total
Each SVM outputs the participation level of a
sample in the corresponding class
TSOCHATZIDIS ET AL. A TWO-STAGE SVM-BASED MAMMOGRAPHIC CBIR FOR CADX, MICCAI-BIA 201519/10/2015 18
20. The Diagnosis Stage
GOAL: Provide the likelihood of malignancy for a query case.
•Based on the K most similar ROIs retrieved
•Similarity between query and an item: 𝑆 𝑞, 𝑥𝑖 =
1
𝑑 𝑞,𝑥 𝑖 +1
, 𝑑 𝑞, 𝑥𝑖 → 𝐸𝑢𝑐𝑙𝑖𝑑𝑒𝑎𝑛 𝑑𝑖𝑠𝑡𝑎𝑛𝑐𝑒
•Two decisions indices investigated:
1. 𝐷1 𝑞 = 𝑖=1
𝑁
𝑆(𝑞,𝑥 𝑖
𝑇𝑃
)
𝑖=1
𝑁 𝑆(𝑞,𝑥 𝑖
𝑇𝑃)+ 𝑖=1
𝑀 𝑆(𝑞,𝑥 𝑗
𝐹𝑃)
2. 𝐷2 q =
1
𝐾 𝑖=1
𝑁
𝑆 𝑞, 𝑥𝑖
𝑇𝑃
−
1
𝐾 𝑗=1
𝑀
𝑆 𝑞, 𝑥𝑗
𝐹𝑃
TSOCHATZIDIS ET AL. A TWO-STAGE SVM-BASED MAMMOGRAPHIC CBIR FOR CADX, MICCAI-BIA 201519/10/2015 20
21. Experimental Results
Experiments on a dataset of total 400 mammograms from DDSM
Precise contour delineation from expert radiologist
CC and MLO views are treated independently
5-fold cross validation
Grid search for SVM and kernel parameters
TSOCHATZIDIS ET AL. A TWO-STAGE SVM-BASED MAMMOGRAPHIC CBIR FOR CADX, MICCAI-BIA 201519/10/2015 21
22. Experimental Results – Evaluation
metrics
Precision at N (P@R): The percentage of correct images at the top-R places of the rank list
Mean Average Precision (MAP): Measures the overall performance of a query
𝐴𝑃 =
𝑘=1
𝑛
𝑃@𝑘 ∗ 𝑟𝑒𝑙 𝑘
𝑛𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑟𝑒𝑙𝑒𝑣𝑎𝑛𝑡 𝑑𝑜𝑐𝑢𝑚𝑒𝑛𝑡𝑠
𝑟𝑒𝑙 𝑘 =
1, 𝑖𝑓 𝑘_th 𝑖𝑚𝑎𝑔𝑒 𝑖𝑠 𝑐𝑜𝑟𝑟𝑒𝑐𝑡
0, 𝑒𝑙𝑠𝑒
𝑀𝐴𝑃 =
1
𝑁
𝑁
𝐴𝑃
TSOCHATZIDIS ET AL. A TWO-STAGE SVM-BASED MAMMOGRAPHIC CBIR FOR CADX, MICCAI-BIA 201519/10/2015 22
24. Area under ROC curve (AUC)
The Receiver Operating Characteristic (ROC) curve illustrates the performance of a binary
classifier as its discrimination threshold is varied
The curve is created by plotting the true positive rate (sensitivity) against the false positive rate
(1-specificity) at various threshold settings
The AUC of a classifier is equivalent to the probability that the classifier will rank a randomly
chosen positive instance higher than a randomly chosen negative instance
TSOCHATZIDIS ET AL. A TWO-STAGE SVM-BASED MAMMOGRAPHIC CBIR FOR CADX, MICCAI-BIA 201519/10/2015 24
25. Experimental Results - Decision
0.74
0.75
0.76
0.77
0.78
0.79
0.8
0.81
0.82
3 4 5 6 7 8 9 10 11 12 13 14 15
Classification performance of D1 and D2 in terms of Az index.
D1 D2
Maximum 𝐴 𝑧 = 0,815 using D2 for K=13 ranked items
TSOCHATZIDIS ET AL. A TWO-STAGE SVM-BASED MAMMOGRAPHIC CBIR FOR CADX, MICCAI-BIA 201519/10/2015 25
26. Conclusions
Two-stage CBIR-CAD:
• Margin-specific CBIR stage
• Diagnosis stage
Incorporation of training into the feature extraction (SVM ensemble)
High-performance for spiculated and microlobulated masses
Lack of standard datasets leads to difficulty in comparison between methods
Future efforts:
• Performance improvement for ill-defined masses
• Feature selection for each SVM independently
• Introduction of weights in decision calculation modifying the significance of each retrieved ROIs
• Use of relevance feedback mechanism to improve performance
TSOCHATZIDIS ET AL. A TWO-STAGE SVM-BASED MAMMOGRAPHIC CBIR FOR CADX, MICCAI-BIA 201519/10/2015 26