SlideShare a Scribd company logo
Image Segmentation Using Machine Learning
Presented by
Shreshtha Aggarwal
(1BM19CS155)
Under the guidance of
Prof. Sunayana S
Assistant Professor
Department of Computer Science and Engineering
BMS College and Engineering, Bengaluru
3-01-2023 Image Segmentation Using Machine Learning Department of CSE, BMSCE
Introduction
3-01-2023 Image Segmentation Using Machine Learning Department of CSE, BMSCE
1.1. Overview
As the connectivity increasing so though the amount of data transferred is increasing, this has led to sharing
all type of data like images, voice etc. As we look at the images, we know that computer can only understand
numbers only and also as for human they might not understand all the hidden aspect of images. So to help
computers understand images the process of image segmentation was developed which is the process of
partitioning a digital image into multiple segments.
The goal of segmentation is to simplify and/or change the representation of an image into something that is
more meaningful and easier to analyze. This helps in computer in understanding the image and also help
human analyzing the images. In this one we will use two architectures of it known as Unet and mask-RCNN.
3-01-2023 Image Segmentation Using Machine Learning Department of CSE, BMSCE
1.2. Motivation
3-01-2023 Image Segmentation Using Machine Learning Department of CSE, BMSCE
With the increase of the technology and modernization, there has been increase in the data. This data contains text,
images, and many more. But as we know that computer can only understand the numbers only so understanding of
the images by the computer was a hefty thing. But with the booming of the Machine learning and deep learning
this task has become easier for computer to understand the images and even analyze and classify them.
Few of the tools and architectures have helped such as U-net architecture and Mask-RCNN has helped in our task.
How these architectures work and classify images is the motivation here.
1.3. Objectives
3-01-2023 Image Segmentation Using Machine Learning Department of CSE, BMSCE
The objective of our project is simple. There are two of them firstly since we know that computer can only
understand the numbers so our first objective is to make computer understand the images. With the release
of the Pytorch and tensorflow it has helped us in achieving.
This is done by understanding what exactly the images are made up of and how computer perceive them. It
is one of the technologies that help digital world interact with the physical world. Second Objective is to
understand how these technologies work and various architecture used in these. Thus, we will demonstrate
the following architectures and show how they work.
Literature Survey
3-01-2023 Image Segmentation Using Machine Learning Department of CSE, BMSCE
U-net and its variants for medical image segmentation: A
review of theoryand applications
3-01-2023 Image Segmentation Using Machine Learning Department of CSE, BMSCE
Authors: NAHIAN SIDDIQUE, SIDIKE PAHEDING, COLIN P. ELKIN and VIJAY
DEVABHAKTUNI
In this paper, it was aimed to provide a starting point for researchers who wish to explore U- net, which is a
powerful deep learning model used extensively for medical image segmentation. To do so, firstly we started
with the definition of Unet and what it is and then we explored the many variants of Unet and its diverse
applications on a multitude of image modalities. We also examined the major deep learning methods and
their application areas for all of the papers in this survey. Indeed U-net based architecture is quite
groundbreaking and valuable in medical image analysis. We also saw that the growth of U-net papers since
2017 lends credence to its status as a premier deep learning technique in medical image diagnosis. Thus,
despite the many challenges and limitations remaining in deep learning- based image analysis, we expect U-
net to be one of the major paths forward.
U-Net: Convolutional Networks for Biomedical Image
Segmentation
3-01-2023 Image Segmentation Using Machine Learning Department of CSE, BMSCE
Author: Olaf Ronneberger, Philipp Fischer, and Thomas Brox
This paper introduces the Unet architecture and talks about its architecture and design when it was designed
for first time. It also shows us the experimental details which show it has high accuracy than another model
despite using less dataset. It also shows how it is beneficiary for medical field. The u-net architecture
achieves very good performance on very different biomedical segmentation applications. Thanks to data
augmentation with elastic deformations, it only needs very few annotated images and has a very reasonable
training time of only 10 hours on a NVidia Titan GPU (6 GB). We are sure that the u-net architecture can be
applied easily to many more tasks.
Mask R-CNN
3-01-2023 Image Segmentation Using Machine Learning Department of CSE, BMSCE
Author: Kaiming He Georgia Gkioxari Piotr Dollar Ross Girshick
This paper talks about Mask-RCNN and also about its application and working. It is faster RCNN and also
instance segmentation and talks about various of the Mask – RCNN applications and also talks about the
implementation details and also compare them to many other architectures like FCIS+++ and have shown to
outperform them. The experiment was done on the coco dataset 2015-2016 and also. But one thing is clear
from it is that it can used for many more things.
Classification of Image using CNN
3-01-2023 Image Segmentation Using Machine Learning Department of CSE, BMSCE
Authors: Md. Anwar Hossain and Md. Shahriar Alam Sajib
This paper focuses on Convolutional Neural Networks which currently is the state-of-art technique for image
classification. The authors have used the CIFAR-10 dataset which has around 60,000RGBimages. It has ten
classes, and they are an airplane, automobile, bird, cat, deer, dog, frog, horse, ship, truck. The images are of
size 32x32 pixels. The dataset consists of 50,000training and 10,000 testing examples. Optimization
algorithm used is the Stochastic Gradient Descent algorithm (SGD) which is a variation of Gradient Descent
algorithm (GD). Upon successful training of the images, among 10,118 test cases, the model misclassified a
total of661 images after three hundred epochs which corresponds to 93.47%recognitionrate. Authors also
found that higher the number of epochs the higher would be the accuracy.
Methodology/Techniques or Algorithms used
3-01-2023 Image Segmentation Using Machine Learning Department of CSE, BMSCE
3-01-2023 Image Segmentation Using Machine Learning Department of CSE, BMSCE
Algorithms and techniques were learnt and used in the implementation.
Some of the techniques are listed below.
● Image processing
● Data Augmentation
● Plotting images using matplotlib
● Transfer Learning
Algorithms that were learnt are listed below:
3-01-2023 Image Segmentation Using Machine Learning Department of CSE, BMSCE
● Gradient Descent
● Stochastic Gradient Descent
● Backpropagation
The algorithm and the techniques go hand in hand for efficient and fast execution. Techniques help reduce the
complexity of data on which algorithms work on. The techniques make sure to get rid of some extreme edge
cases with which our program might crash or with which development time would increase. Thus techniques
are equally important to algorithms.
4. Tools used
3-01-2023 Image Segmentation Using Machine Learning Department of CSE, BMSCE
Tools that were used to learn and implement CNN model were mainly frameworks and packages that were
open source and implemented in python. These packages helped us get a faster grasp of the field and eased the
task of understanding and solving the problem, also to solve some trivial problems. Yet one should not just use
the packages but must also understand the working of the packages. Under the hood mechanism must also be
learnt so that any future problems which require deeper understanding of the field could be solved.
Tools that were used were:
● TensorFlow
TensorFlow is an end-to-end open-source platform for machine learning. It has a comprehensive, flexible
ecosystem of tools, libraries and community resources that lets researchers push the state-of-the-art in ML
and developers easily build and deploy MLpowered applications.
3-01-2023 Image Segmentation Using Machine Learning Department of CSE, BMSCE
● Matplotlib
Matplotlib is a comprehensive library for creating static, animated, and interactivevisualizations in
Python. Matplotlib makes easy things easy and hard things possible.
● NumPy
This is a package that is used for making efficient and multi-dimensional arrays that are efficient. This
package also includes many features like Random number generation, generation of values between a range
with equal difference between them and so on. Basically this package is used to play with numbers and arrays.
These were the tools that were selected for the implementation.
The modules were implemented on Kaggle.
3-01-2023 Image Segmentation Using Machine Learning Department of CSE, BMSCE
Modules implementation and results
3-01-2023 Image Segmentation Using Machine Learning Department of CSE, BMSCE
MODULE 1: UNET – ARCHITECTURE
U-Net architecture is a deep learning image segmentation architecture introduced by Olaf Ronneberger,
Philipp Fischer and Thomas Brox in 2015. It’s an encoder-decoder architecture convolutional neural network
that was specifically designed tobe used in Biomedical Imaging.
The main goal for this architecture was to tackle two important issues in the field of medical imaging which
are:
1. Lack of large training datasets –
Traditional convolutional neural networks with fully connected layers, require large datasets because of the
large number of parameters needed to learn. Since medical imaging has small datasets, this architecture ensures
maximum learning from the information provided because a fully connected layer is replaced with series of up
convolutions on the decoder side.
3-01-2023 Image Segmentation Using Machine Learning Department of CSE, BMSCE
2. Capturing context accurately at different resolutions and scales.
Its U shape design consists of two parts. The left side is known as the contracting path or encoder path,
where repeated typical convolutions are applied followed by ReLU and max pooling operations. The right side
is known as the expansive path which has transposed 2D convolutional layers where upsampling technique is
performed.
3-01-2023 Image Segmentation Using Machine Learning Department of CSE, BMSCE
Input:
Fig.1 Input Image with its actual mask
Output:
3-01-2023 Image Segmentation Using Machine Learning Department of CSE, BMSCE
Fig.2 Input Image with its actual mask and predicted mask by using U-Net Architecture
Graph output:
3-01-2023 Image Segmentation Using Machine Learning Department of CSE, BMSCE
Fig.3 Graph Output of Training and Variation Loss
MODULE 2: MASK-RCNN
3-01-2023 Image Segmentation Using Machine Learning Department of CSE, BMSCE
Modern methods used to perform image segmentation use dilated convolutions at the core to extract high-
resolution features. This architecture is used for instance image segmentation which extends Faster R-CNN
(an architecture proposed by Shaoqing Renetal to eliminate selective search and allow the network to learn
region proposals) by adding an object mask predictor as a parallel branch to bounding box recognition.
Input:
3-01-2023 Image Segmentation Using Machine Learning Department of CSE, BMSCE
Fig.4 Input Image with various objects to identify
Output:
3-01-2023 Image Segmentation Using Machine Learning Department of CSE, BMSCE
Fig.5 Image with objects segmented
3-01-2023 Image Segmentation Using Machine Learning Department of CSE, BMSCE
References
1. N. Siddique, S. Paheding, C. P. Elkin and V. Devabhaktuni, "U-Net and Its Variants for Medical Image
Segmentation: A Review of Theory and Applications," in IEEE Access, vol. 9, pp. 82031-82057, 2021.
2. Ronneberger, O., Fischer, P., Brox, T. (2015). U-Net: Convolutional Networks for Biomedical Image
Segmentation. In: Navab, N., Hornegger, J., Wells, W., Frangi, A. (eds) Medical Image Computing and
Computer-Assisted Intervention – MICCAI 2015. MICCAI 2015. Lecture Notes in Computer Science(), vol
9351. Springer, Cham
3. K. He, G. Gkioxari, P. Dollar, and R. Girshick, “Mask R-CNN,” 2017 IEEE International Conference on
Computer Vision (ICCV), 2017.
4. Md. Anwar Hossain, & Md. Shahriar Alam Sajib. (2019). Classification of Image using Convolutional
Neural Network (CNN). Global Journal of Computer Science and Technology, 19(D2), 13–18. Retrieved
from https://computerresearch.org/index.php/computer/article/view/1821

More Related Content

Similar to 1BM19CS155_ShreshthaAggarwal_Technical Seminar_PPT.pptx

IMAGE CONTENT DESCRIPTION USING LSTM APPROACH
IMAGE CONTENT DESCRIPTION USING LSTM APPROACHIMAGE CONTENT DESCRIPTION USING LSTM APPROACH
IMAGE CONTENT DESCRIPTION USING LSTM APPROACH
csandit
 
From Pixels to Understanding: Deep Learning's Impact on Image Classification ...
From Pixels to Understanding: Deep Learning's Impact on Image Classification ...From Pixels to Understanding: Deep Learning's Impact on Image Classification ...
From Pixels to Understanding: Deep Learning's Impact on Image Classification ...
IRJET Journal
 
Image Recognition Expert System based on deep learning
Image Recognition Expert System based on deep learningImage Recognition Expert System based on deep learning
Image Recognition Expert System based on deep learning
PRATHAMESH REGE
 
ATTENTION BASED IMAGE CAPTIONING USING DEEP LEARNING
ATTENTION BASED IMAGE CAPTIONING USING DEEP LEARNINGATTENTION BASED IMAGE CAPTIONING USING DEEP LEARNING
ATTENTION BASED IMAGE CAPTIONING USING DEEP LEARNING
Nathan Mathis
 
IRJET-MText Extraction from Images using Convolutional Neural Network
IRJET-MText Extraction from Images using Convolutional Neural NetworkIRJET-MText Extraction from Images using Convolutional Neural Network
IRJET-MText Extraction from Images using Convolutional Neural Network
IRJET Journal
 
CNN FEATURES ARE ALSO GREAT AT UNSUPERVISED CLASSIFICATION
CNN FEATURES ARE ALSO GREAT AT UNSUPERVISED CLASSIFICATION CNN FEATURES ARE ALSO GREAT AT UNSUPERVISED CLASSIFICATION
CNN FEATURES ARE ALSO GREAT AT UNSUPERVISED CLASSIFICATION
cscpconf
 
IRJET - Deep Learning Approach to Inpainting and Outpainting System
IRJET -  	  Deep Learning Approach to Inpainting and Outpainting SystemIRJET -  	  Deep Learning Approach to Inpainting and Outpainting System
IRJET - Deep Learning Approach to Inpainting and Outpainting System
IRJET Journal
 
IRJET - Reversible Texture Combination Steganography
IRJET - Reversible Texture Combination SteganographyIRJET - Reversible Texture Combination Steganography
IRJET - Reversible Texture Combination Steganography
IRJET Journal
 
Image Captioning Generator using Deep Machine Learning
Image Captioning Generator using Deep Machine LearningImage Captioning Generator using Deep Machine Learning
Image Captioning Generator using Deep Machine Learning
ijtsrd
 
IRJET- Face Recognition using Machine Learning
IRJET- Face Recognition using Machine LearningIRJET- Face Recognition using Machine Learning
IRJET- Face Recognition using Machine Learning
IRJET Journal
 
Real time Traffic Signs Recognition using Deep Learning
Real time Traffic Signs Recognition using Deep LearningReal time Traffic Signs Recognition using Deep Learning
Real time Traffic Signs Recognition using Deep Learning
IRJET Journal
 
CONTENT RECOVERY AND IMAGE RETRIVAL IN IMAGE DATABASE CONTENT RETRIVING IN TE...
CONTENT RECOVERY AND IMAGE RETRIVAL IN IMAGE DATABASE CONTENT RETRIVING IN TE...CONTENT RECOVERY AND IMAGE RETRIVAL IN IMAGE DATABASE CONTENT RETRIVING IN TE...
CONTENT RECOVERY AND IMAGE RETRIVAL IN IMAGE DATABASE CONTENT RETRIVING IN TE...
Editor IJMTER
 
Image segmentation using advanced fuzzy c-mean algorithm [FYP @ IITR, obtaine...
Image segmentation using advanced fuzzy c-mean algorithm [FYP @ IITR, obtaine...Image segmentation using advanced fuzzy c-mean algorithm [FYP @ IITR, obtaine...
Image segmentation using advanced fuzzy c-mean algorithm [FYP @ IITR, obtaine...
Koteswar Rao Jerripothula
 
IMAGE RECOGNITION USING MATLAB SIMULINK BLOCKSET
IMAGE RECOGNITION USING MATLAB SIMULINK BLOCKSETIMAGE RECOGNITION USING MATLAB SIMULINK BLOCKSET
IMAGE RECOGNITION USING MATLAB SIMULINK BLOCKSET
IJCSEA Journal
 
Search engine based on image and name for campus
Search engine based on image and name for campusSearch engine based on image and name for campus
Search engine based on image and name for campus
IRJET Journal
 
IMAGE CAPTIONING USING TRANSFORMER: VISIONAID
IMAGE CAPTIONING USING TRANSFORMER: VISIONAIDIMAGE CAPTIONING USING TRANSFORMER: VISIONAID
IMAGE CAPTIONING USING TRANSFORMER: VISIONAID
IRJET Journal
 
Minor Project Synopsis on Data Structure Visualizer
Minor Project Synopsis on Data Structure VisualizerMinor Project Synopsis on Data Structure Visualizer
Minor Project Synopsis on Data Structure Visualizer
RonitShrivastava057
 
Image recognition
Image recognitionImage recognition
Image recognition
Joel Jose
 
FACE EXPRESSION IDENTIFICATION USING IMAGE FEATURE CLUSTRING AND QUERY SCHEME...
FACE EXPRESSION IDENTIFICATION USING IMAGE FEATURE CLUSTRING AND QUERY SCHEME...FACE EXPRESSION IDENTIFICATION USING IMAGE FEATURE CLUSTRING AND QUERY SCHEME...
FACE EXPRESSION IDENTIFICATION USING IMAGE FEATURE CLUSTRING AND QUERY SCHEME...
Editor IJMTER
 
IRJET - Visual Question Answering – Implementation using Keras
IRJET -  	  Visual Question Answering – Implementation using KerasIRJET -  	  Visual Question Answering – Implementation using Keras
IRJET - Visual Question Answering – Implementation using Keras
IRJET Journal
 

Similar to 1BM19CS155_ShreshthaAggarwal_Technical Seminar_PPT.pptx (20)

IMAGE CONTENT DESCRIPTION USING LSTM APPROACH
IMAGE CONTENT DESCRIPTION USING LSTM APPROACHIMAGE CONTENT DESCRIPTION USING LSTM APPROACH
IMAGE CONTENT DESCRIPTION USING LSTM APPROACH
 
From Pixels to Understanding: Deep Learning's Impact on Image Classification ...
From Pixels to Understanding: Deep Learning's Impact on Image Classification ...From Pixels to Understanding: Deep Learning's Impact on Image Classification ...
From Pixels to Understanding: Deep Learning's Impact on Image Classification ...
 
Image Recognition Expert System based on deep learning
Image Recognition Expert System based on deep learningImage Recognition Expert System based on deep learning
Image Recognition Expert System based on deep learning
 
ATTENTION BASED IMAGE CAPTIONING USING DEEP LEARNING
ATTENTION BASED IMAGE CAPTIONING USING DEEP LEARNINGATTENTION BASED IMAGE CAPTIONING USING DEEP LEARNING
ATTENTION BASED IMAGE CAPTIONING USING DEEP LEARNING
 
IRJET-MText Extraction from Images using Convolutional Neural Network
IRJET-MText Extraction from Images using Convolutional Neural NetworkIRJET-MText Extraction from Images using Convolutional Neural Network
IRJET-MText Extraction from Images using Convolutional Neural Network
 
CNN FEATURES ARE ALSO GREAT AT UNSUPERVISED CLASSIFICATION
CNN FEATURES ARE ALSO GREAT AT UNSUPERVISED CLASSIFICATION CNN FEATURES ARE ALSO GREAT AT UNSUPERVISED CLASSIFICATION
CNN FEATURES ARE ALSO GREAT AT UNSUPERVISED CLASSIFICATION
 
IRJET - Deep Learning Approach to Inpainting and Outpainting System
IRJET -  	  Deep Learning Approach to Inpainting and Outpainting SystemIRJET -  	  Deep Learning Approach to Inpainting and Outpainting System
IRJET - Deep Learning Approach to Inpainting and Outpainting System
 
IRJET - Reversible Texture Combination Steganography
IRJET - Reversible Texture Combination SteganographyIRJET - Reversible Texture Combination Steganography
IRJET - Reversible Texture Combination Steganography
 
Image Captioning Generator using Deep Machine Learning
Image Captioning Generator using Deep Machine LearningImage Captioning Generator using Deep Machine Learning
Image Captioning Generator using Deep Machine Learning
 
IRJET- Face Recognition using Machine Learning
IRJET- Face Recognition using Machine LearningIRJET- Face Recognition using Machine Learning
IRJET- Face Recognition using Machine Learning
 
Real time Traffic Signs Recognition using Deep Learning
Real time Traffic Signs Recognition using Deep LearningReal time Traffic Signs Recognition using Deep Learning
Real time Traffic Signs Recognition using Deep Learning
 
CONTENT RECOVERY AND IMAGE RETRIVAL IN IMAGE DATABASE CONTENT RETRIVING IN TE...
CONTENT RECOVERY AND IMAGE RETRIVAL IN IMAGE DATABASE CONTENT RETRIVING IN TE...CONTENT RECOVERY AND IMAGE RETRIVAL IN IMAGE DATABASE CONTENT RETRIVING IN TE...
CONTENT RECOVERY AND IMAGE RETRIVAL IN IMAGE DATABASE CONTENT RETRIVING IN TE...
 
Image segmentation using advanced fuzzy c-mean algorithm [FYP @ IITR, obtaine...
Image segmentation using advanced fuzzy c-mean algorithm [FYP @ IITR, obtaine...Image segmentation using advanced fuzzy c-mean algorithm [FYP @ IITR, obtaine...
Image segmentation using advanced fuzzy c-mean algorithm [FYP @ IITR, obtaine...
 
IMAGE RECOGNITION USING MATLAB SIMULINK BLOCKSET
IMAGE RECOGNITION USING MATLAB SIMULINK BLOCKSETIMAGE RECOGNITION USING MATLAB SIMULINK BLOCKSET
IMAGE RECOGNITION USING MATLAB SIMULINK BLOCKSET
 
Search engine based on image and name for campus
Search engine based on image and name for campusSearch engine based on image and name for campus
Search engine based on image and name for campus
 
IMAGE CAPTIONING USING TRANSFORMER: VISIONAID
IMAGE CAPTIONING USING TRANSFORMER: VISIONAIDIMAGE CAPTIONING USING TRANSFORMER: VISIONAID
IMAGE CAPTIONING USING TRANSFORMER: VISIONAID
 
Minor Project Synopsis on Data Structure Visualizer
Minor Project Synopsis on Data Structure VisualizerMinor Project Synopsis on Data Structure Visualizer
Minor Project Synopsis on Data Structure Visualizer
 
Image recognition
Image recognitionImage recognition
Image recognition
 
FACE EXPRESSION IDENTIFICATION USING IMAGE FEATURE CLUSTRING AND QUERY SCHEME...
FACE EXPRESSION IDENTIFICATION USING IMAGE FEATURE CLUSTRING AND QUERY SCHEME...FACE EXPRESSION IDENTIFICATION USING IMAGE FEATURE CLUSTRING AND QUERY SCHEME...
FACE EXPRESSION IDENTIFICATION USING IMAGE FEATURE CLUSTRING AND QUERY SCHEME...
 
IRJET - Visual Question Answering – Implementation using Keras
IRJET -  	  Visual Question Answering – Implementation using KerasIRJET -  	  Visual Question Answering – Implementation using Keras
IRJET - Visual Question Answering – Implementation using Keras
 

Recently uploaded

学校原版美国波士顿大学毕业证学历学位证书原版一模一样
学校原版美国波士顿大学毕业证学历学位证书原版一模一样学校原版美国波士顿大学毕业证学历学位证书原版一模一样
学校原版美国波士顿大学毕业证学历学位证书原版一模一样
171ticu
 
Introduction to AI Safety (public presentation).pptx
Introduction to AI Safety (public presentation).pptxIntroduction to AI Safety (public presentation).pptx
Introduction to AI Safety (public presentation).pptx
MiscAnnoy1
 
Engineering Drawings Lecture Detail Drawings 2014.pdf
Engineering Drawings Lecture Detail Drawings 2014.pdfEngineering Drawings Lecture Detail Drawings 2014.pdf
Engineering Drawings Lecture Detail Drawings 2014.pdf
abbyasa1014
 
一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理
一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理
一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理
ecqow
 
BRAIN TUMOR DETECTION for seminar ppt.pdf
BRAIN TUMOR DETECTION for seminar ppt.pdfBRAIN TUMOR DETECTION for seminar ppt.pdf
BRAIN TUMOR DETECTION for seminar ppt.pdf
LAXMAREDDY22
 
KuberTENes Birthday Bash Guadalajara - K8sGPT first impressions
KuberTENes Birthday Bash Guadalajara - K8sGPT first impressionsKuberTENes Birthday Bash Guadalajara - K8sGPT first impressions
KuberTENes Birthday Bash Guadalajara - K8sGPT first impressions
Victor Morales
 
UNLOCKING HEALTHCARE 4.0: NAVIGATING CRITICAL SUCCESS FACTORS FOR EFFECTIVE I...
UNLOCKING HEALTHCARE 4.0: NAVIGATING CRITICAL SUCCESS FACTORS FOR EFFECTIVE I...UNLOCKING HEALTHCARE 4.0: NAVIGATING CRITICAL SUCCESS FACTORS FOR EFFECTIVE I...
UNLOCKING HEALTHCARE 4.0: NAVIGATING CRITICAL SUCCESS FACTORS FOR EFFECTIVE I...
amsjournal
 
spirit beverages ppt without graphics.pptx
spirit beverages ppt without graphics.pptxspirit beverages ppt without graphics.pptx
spirit beverages ppt without graphics.pptx
Madan Karki
 
Generative AI leverages algorithms to create various forms of content
Generative AI leverages algorithms to create various forms of contentGenerative AI leverages algorithms to create various forms of content
Generative AI leverages algorithms to create various forms of content
Hitesh Mohapatra
 
Certificates - Mahmoud Mohamed Moursi Ahmed
Certificates - Mahmoud Mohamed Moursi AhmedCertificates - Mahmoud Mohamed Moursi Ahmed
Certificates - Mahmoud Mohamed Moursi Ahmed
Mahmoud Morsy
 
哪里办理(csu毕业证书)查尔斯特大学毕业证硕士学历原版一模一样
哪里办理(csu毕业证书)查尔斯特大学毕业证硕士学历原版一模一样哪里办理(csu毕业证书)查尔斯特大学毕业证硕士学历原版一模一样
哪里办理(csu毕业证书)查尔斯特大学毕业证硕士学历原版一模一样
insn4465
 
NATURAL DEEP EUTECTIC SOLVENTS AS ANTI-FREEZING AGENT
NATURAL DEEP EUTECTIC SOLVENTS AS ANTI-FREEZING AGENTNATURAL DEEP EUTECTIC SOLVENTS AS ANTI-FREEZING AGENT
NATURAL DEEP EUTECTIC SOLVENTS AS ANTI-FREEZING AGENT
Addu25809
 
ACEP Magazine edition 4th launched on 05.06.2024
ACEP Magazine edition 4th launched on 05.06.2024ACEP Magazine edition 4th launched on 05.06.2024
ACEP Magazine edition 4th launched on 05.06.2024
Rahul
 
2008 BUILDING CONSTRUCTION Illustrated - Ching Chapter 02 The Building.pdf
2008 BUILDING CONSTRUCTION Illustrated - Ching Chapter 02 The Building.pdf2008 BUILDING CONSTRUCTION Illustrated - Ching Chapter 02 The Building.pdf
2008 BUILDING CONSTRUCTION Illustrated - Ching Chapter 02 The Building.pdf
Yasser Mahgoub
 
Hematology Analyzer Machine - Complete Blood Count
Hematology Analyzer Machine - Complete Blood CountHematology Analyzer Machine - Complete Blood Count
Hematology Analyzer Machine - Complete Blood Count
shahdabdulbaset
 
Understanding Inductive Bias in Machine Learning
Understanding Inductive Bias in Machine LearningUnderstanding Inductive Bias in Machine Learning
Understanding Inductive Bias in Machine Learning
SUTEJAS
 
gray level transformation unit 3(image processing))
gray level transformation unit 3(image processing))gray level transformation unit 3(image processing))
gray level transformation unit 3(image processing))
shivani5543
 
Engine Lubrication performance System.pdf
Engine Lubrication performance System.pdfEngine Lubrication performance System.pdf
Engine Lubrication performance System.pdf
mamamaam477
 
CHINA’S GEO-ECONOMIC OUTREACH IN CENTRAL ASIAN COUNTRIES AND FUTURE PROSPECT
CHINA’S GEO-ECONOMIC OUTREACH IN CENTRAL ASIAN COUNTRIES AND FUTURE PROSPECTCHINA’S GEO-ECONOMIC OUTREACH IN CENTRAL ASIAN COUNTRIES AND FUTURE PROSPECT
CHINA’S GEO-ECONOMIC OUTREACH IN CENTRAL ASIAN COUNTRIES AND FUTURE PROSPECT
jpsjournal1
 
Properties Railway Sleepers and Test.pptx
Properties Railway Sleepers and Test.pptxProperties Railway Sleepers and Test.pptx
Properties Railway Sleepers and Test.pptx
MDSABBIROJJAMANPAYEL
 

Recently uploaded (20)

学校原版美国波士顿大学毕业证学历学位证书原版一模一样
学校原版美国波士顿大学毕业证学历学位证书原版一模一样学校原版美国波士顿大学毕业证学历学位证书原版一模一样
学校原版美国波士顿大学毕业证学历学位证书原版一模一样
 
Introduction to AI Safety (public presentation).pptx
Introduction to AI Safety (public presentation).pptxIntroduction to AI Safety (public presentation).pptx
Introduction to AI Safety (public presentation).pptx
 
Engineering Drawings Lecture Detail Drawings 2014.pdf
Engineering Drawings Lecture Detail Drawings 2014.pdfEngineering Drawings Lecture Detail Drawings 2014.pdf
Engineering Drawings Lecture Detail Drawings 2014.pdf
 
一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理
一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理
一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理
 
BRAIN TUMOR DETECTION for seminar ppt.pdf
BRAIN TUMOR DETECTION for seminar ppt.pdfBRAIN TUMOR DETECTION for seminar ppt.pdf
BRAIN TUMOR DETECTION for seminar ppt.pdf
 
KuberTENes Birthday Bash Guadalajara - K8sGPT first impressions
KuberTENes Birthday Bash Guadalajara - K8sGPT first impressionsKuberTENes Birthday Bash Guadalajara - K8sGPT first impressions
KuberTENes Birthday Bash Guadalajara - K8sGPT first impressions
 
UNLOCKING HEALTHCARE 4.0: NAVIGATING CRITICAL SUCCESS FACTORS FOR EFFECTIVE I...
UNLOCKING HEALTHCARE 4.0: NAVIGATING CRITICAL SUCCESS FACTORS FOR EFFECTIVE I...UNLOCKING HEALTHCARE 4.0: NAVIGATING CRITICAL SUCCESS FACTORS FOR EFFECTIVE I...
UNLOCKING HEALTHCARE 4.0: NAVIGATING CRITICAL SUCCESS FACTORS FOR EFFECTIVE I...
 
spirit beverages ppt without graphics.pptx
spirit beverages ppt without graphics.pptxspirit beverages ppt without graphics.pptx
spirit beverages ppt without graphics.pptx
 
Generative AI leverages algorithms to create various forms of content
Generative AI leverages algorithms to create various forms of contentGenerative AI leverages algorithms to create various forms of content
Generative AI leverages algorithms to create various forms of content
 
Certificates - Mahmoud Mohamed Moursi Ahmed
Certificates - Mahmoud Mohamed Moursi AhmedCertificates - Mahmoud Mohamed Moursi Ahmed
Certificates - Mahmoud Mohamed Moursi Ahmed
 
哪里办理(csu毕业证书)查尔斯特大学毕业证硕士学历原版一模一样
哪里办理(csu毕业证书)查尔斯特大学毕业证硕士学历原版一模一样哪里办理(csu毕业证书)查尔斯特大学毕业证硕士学历原版一模一样
哪里办理(csu毕业证书)查尔斯特大学毕业证硕士学历原版一模一样
 
NATURAL DEEP EUTECTIC SOLVENTS AS ANTI-FREEZING AGENT
NATURAL DEEP EUTECTIC SOLVENTS AS ANTI-FREEZING AGENTNATURAL DEEP EUTECTIC SOLVENTS AS ANTI-FREEZING AGENT
NATURAL DEEP EUTECTIC SOLVENTS AS ANTI-FREEZING AGENT
 
ACEP Magazine edition 4th launched on 05.06.2024
ACEP Magazine edition 4th launched on 05.06.2024ACEP Magazine edition 4th launched on 05.06.2024
ACEP Magazine edition 4th launched on 05.06.2024
 
2008 BUILDING CONSTRUCTION Illustrated - Ching Chapter 02 The Building.pdf
2008 BUILDING CONSTRUCTION Illustrated - Ching Chapter 02 The Building.pdf2008 BUILDING CONSTRUCTION Illustrated - Ching Chapter 02 The Building.pdf
2008 BUILDING CONSTRUCTION Illustrated - Ching Chapter 02 The Building.pdf
 
Hematology Analyzer Machine - Complete Blood Count
Hematology Analyzer Machine - Complete Blood CountHematology Analyzer Machine - Complete Blood Count
Hematology Analyzer Machine - Complete Blood Count
 
Understanding Inductive Bias in Machine Learning
Understanding Inductive Bias in Machine LearningUnderstanding Inductive Bias in Machine Learning
Understanding Inductive Bias in Machine Learning
 
gray level transformation unit 3(image processing))
gray level transformation unit 3(image processing))gray level transformation unit 3(image processing))
gray level transformation unit 3(image processing))
 
Engine Lubrication performance System.pdf
Engine Lubrication performance System.pdfEngine Lubrication performance System.pdf
Engine Lubrication performance System.pdf
 
CHINA’S GEO-ECONOMIC OUTREACH IN CENTRAL ASIAN COUNTRIES AND FUTURE PROSPECT
CHINA’S GEO-ECONOMIC OUTREACH IN CENTRAL ASIAN COUNTRIES AND FUTURE PROSPECTCHINA’S GEO-ECONOMIC OUTREACH IN CENTRAL ASIAN COUNTRIES AND FUTURE PROSPECT
CHINA’S GEO-ECONOMIC OUTREACH IN CENTRAL ASIAN COUNTRIES AND FUTURE PROSPECT
 
Properties Railway Sleepers and Test.pptx
Properties Railway Sleepers and Test.pptxProperties Railway Sleepers and Test.pptx
Properties Railway Sleepers and Test.pptx
 

1BM19CS155_ShreshthaAggarwal_Technical Seminar_PPT.pptx

  • 1. Image Segmentation Using Machine Learning Presented by Shreshtha Aggarwal (1BM19CS155) Under the guidance of Prof. Sunayana S Assistant Professor Department of Computer Science and Engineering BMS College and Engineering, Bengaluru 3-01-2023 Image Segmentation Using Machine Learning Department of CSE, BMSCE
  • 2. Introduction 3-01-2023 Image Segmentation Using Machine Learning Department of CSE, BMSCE
  • 3. 1.1. Overview As the connectivity increasing so though the amount of data transferred is increasing, this has led to sharing all type of data like images, voice etc. As we look at the images, we know that computer can only understand numbers only and also as for human they might not understand all the hidden aspect of images. So to help computers understand images the process of image segmentation was developed which is the process of partitioning a digital image into multiple segments. The goal of segmentation is to simplify and/or change the representation of an image into something that is more meaningful and easier to analyze. This helps in computer in understanding the image and also help human analyzing the images. In this one we will use two architectures of it known as Unet and mask-RCNN. 3-01-2023 Image Segmentation Using Machine Learning Department of CSE, BMSCE
  • 4. 1.2. Motivation 3-01-2023 Image Segmentation Using Machine Learning Department of CSE, BMSCE With the increase of the technology and modernization, there has been increase in the data. This data contains text, images, and many more. But as we know that computer can only understand the numbers only so understanding of the images by the computer was a hefty thing. But with the booming of the Machine learning and deep learning this task has become easier for computer to understand the images and even analyze and classify them. Few of the tools and architectures have helped such as U-net architecture and Mask-RCNN has helped in our task. How these architectures work and classify images is the motivation here.
  • 5. 1.3. Objectives 3-01-2023 Image Segmentation Using Machine Learning Department of CSE, BMSCE The objective of our project is simple. There are two of them firstly since we know that computer can only understand the numbers so our first objective is to make computer understand the images. With the release of the Pytorch and tensorflow it has helped us in achieving. This is done by understanding what exactly the images are made up of and how computer perceive them. It is one of the technologies that help digital world interact with the physical world. Second Objective is to understand how these technologies work and various architecture used in these. Thus, we will demonstrate the following architectures and show how they work.
  • 6. Literature Survey 3-01-2023 Image Segmentation Using Machine Learning Department of CSE, BMSCE
  • 7. U-net and its variants for medical image segmentation: A review of theoryand applications 3-01-2023 Image Segmentation Using Machine Learning Department of CSE, BMSCE Authors: NAHIAN SIDDIQUE, SIDIKE PAHEDING, COLIN P. ELKIN and VIJAY DEVABHAKTUNI In this paper, it was aimed to provide a starting point for researchers who wish to explore U- net, which is a powerful deep learning model used extensively for medical image segmentation. To do so, firstly we started with the definition of Unet and what it is and then we explored the many variants of Unet and its diverse applications on a multitude of image modalities. We also examined the major deep learning methods and their application areas for all of the papers in this survey. Indeed U-net based architecture is quite groundbreaking and valuable in medical image analysis. We also saw that the growth of U-net papers since 2017 lends credence to its status as a premier deep learning technique in medical image diagnosis. Thus, despite the many challenges and limitations remaining in deep learning- based image analysis, we expect U- net to be one of the major paths forward.
  • 8. U-Net: Convolutional Networks for Biomedical Image Segmentation 3-01-2023 Image Segmentation Using Machine Learning Department of CSE, BMSCE Author: Olaf Ronneberger, Philipp Fischer, and Thomas Brox This paper introduces the Unet architecture and talks about its architecture and design when it was designed for first time. It also shows us the experimental details which show it has high accuracy than another model despite using less dataset. It also shows how it is beneficiary for medical field. The u-net architecture achieves very good performance on very different biomedical segmentation applications. Thanks to data augmentation with elastic deformations, it only needs very few annotated images and has a very reasonable training time of only 10 hours on a NVidia Titan GPU (6 GB). We are sure that the u-net architecture can be applied easily to many more tasks.
  • 9. Mask R-CNN 3-01-2023 Image Segmentation Using Machine Learning Department of CSE, BMSCE Author: Kaiming He Georgia Gkioxari Piotr Dollar Ross Girshick This paper talks about Mask-RCNN and also about its application and working. It is faster RCNN and also instance segmentation and talks about various of the Mask – RCNN applications and also talks about the implementation details and also compare them to many other architectures like FCIS+++ and have shown to outperform them. The experiment was done on the coco dataset 2015-2016 and also. But one thing is clear from it is that it can used for many more things.
  • 10. Classification of Image using CNN 3-01-2023 Image Segmentation Using Machine Learning Department of CSE, BMSCE Authors: Md. Anwar Hossain and Md. Shahriar Alam Sajib This paper focuses on Convolutional Neural Networks which currently is the state-of-art technique for image classification. The authors have used the CIFAR-10 dataset which has around 60,000RGBimages. It has ten classes, and they are an airplane, automobile, bird, cat, deer, dog, frog, horse, ship, truck. The images are of size 32x32 pixels. The dataset consists of 50,000training and 10,000 testing examples. Optimization algorithm used is the Stochastic Gradient Descent algorithm (SGD) which is a variation of Gradient Descent algorithm (GD). Upon successful training of the images, among 10,118 test cases, the model misclassified a total of661 images after three hundred epochs which corresponds to 93.47%recognitionrate. Authors also found that higher the number of epochs the higher would be the accuracy.
  • 11. Methodology/Techniques or Algorithms used 3-01-2023 Image Segmentation Using Machine Learning Department of CSE, BMSCE
  • 12. 3-01-2023 Image Segmentation Using Machine Learning Department of CSE, BMSCE Algorithms and techniques were learnt and used in the implementation. Some of the techniques are listed below. ● Image processing ● Data Augmentation ● Plotting images using matplotlib ● Transfer Learning
  • 13. Algorithms that were learnt are listed below: 3-01-2023 Image Segmentation Using Machine Learning Department of CSE, BMSCE ● Gradient Descent ● Stochastic Gradient Descent ● Backpropagation The algorithm and the techniques go hand in hand for efficient and fast execution. Techniques help reduce the complexity of data on which algorithms work on. The techniques make sure to get rid of some extreme edge cases with which our program might crash or with which development time would increase. Thus techniques are equally important to algorithms.
  • 14. 4. Tools used 3-01-2023 Image Segmentation Using Machine Learning Department of CSE, BMSCE Tools that were used to learn and implement CNN model were mainly frameworks and packages that were open source and implemented in python. These packages helped us get a faster grasp of the field and eased the task of understanding and solving the problem, also to solve some trivial problems. Yet one should not just use the packages but must also understand the working of the packages. Under the hood mechanism must also be learnt so that any future problems which require deeper understanding of the field could be solved. Tools that were used were: ● TensorFlow TensorFlow is an end-to-end open-source platform for machine learning. It has a comprehensive, flexible ecosystem of tools, libraries and community resources that lets researchers push the state-of-the-art in ML and developers easily build and deploy MLpowered applications.
  • 15. 3-01-2023 Image Segmentation Using Machine Learning Department of CSE, BMSCE ● Matplotlib Matplotlib is a comprehensive library for creating static, animated, and interactivevisualizations in Python. Matplotlib makes easy things easy and hard things possible. ● NumPy This is a package that is used for making efficient and multi-dimensional arrays that are efficient. This package also includes many features like Random number generation, generation of values between a range with equal difference between them and so on. Basically this package is used to play with numbers and arrays. These were the tools that were selected for the implementation. The modules were implemented on Kaggle.
  • 16. 3-01-2023 Image Segmentation Using Machine Learning Department of CSE, BMSCE Modules implementation and results
  • 17. 3-01-2023 Image Segmentation Using Machine Learning Department of CSE, BMSCE MODULE 1: UNET – ARCHITECTURE U-Net architecture is a deep learning image segmentation architecture introduced by Olaf Ronneberger, Philipp Fischer and Thomas Brox in 2015. It’s an encoder-decoder architecture convolutional neural network that was specifically designed tobe used in Biomedical Imaging. The main goal for this architecture was to tackle two important issues in the field of medical imaging which are: 1. Lack of large training datasets – Traditional convolutional neural networks with fully connected layers, require large datasets because of the large number of parameters needed to learn. Since medical imaging has small datasets, this architecture ensures maximum learning from the information provided because a fully connected layer is replaced with series of up convolutions on the decoder side.
  • 18. 3-01-2023 Image Segmentation Using Machine Learning Department of CSE, BMSCE 2. Capturing context accurately at different resolutions and scales. Its U shape design consists of two parts. The left side is known as the contracting path or encoder path, where repeated typical convolutions are applied followed by ReLU and max pooling operations. The right side is known as the expansive path which has transposed 2D convolutional layers where upsampling technique is performed.
  • 19. 3-01-2023 Image Segmentation Using Machine Learning Department of CSE, BMSCE Input: Fig.1 Input Image with its actual mask
  • 20. Output: 3-01-2023 Image Segmentation Using Machine Learning Department of CSE, BMSCE Fig.2 Input Image with its actual mask and predicted mask by using U-Net Architecture
  • 21. Graph output: 3-01-2023 Image Segmentation Using Machine Learning Department of CSE, BMSCE Fig.3 Graph Output of Training and Variation Loss
  • 22. MODULE 2: MASK-RCNN 3-01-2023 Image Segmentation Using Machine Learning Department of CSE, BMSCE Modern methods used to perform image segmentation use dilated convolutions at the core to extract high- resolution features. This architecture is used for instance image segmentation which extends Faster R-CNN (an architecture proposed by Shaoqing Renetal to eliminate selective search and allow the network to learn region proposals) by adding an object mask predictor as a parallel branch to bounding box recognition.
  • 23. Input: 3-01-2023 Image Segmentation Using Machine Learning Department of CSE, BMSCE Fig.4 Input Image with various objects to identify
  • 24. Output: 3-01-2023 Image Segmentation Using Machine Learning Department of CSE, BMSCE Fig.5 Image with objects segmented
  • 25. 3-01-2023 Image Segmentation Using Machine Learning Department of CSE, BMSCE References 1. N. Siddique, S. Paheding, C. P. Elkin and V. Devabhaktuni, "U-Net and Its Variants for Medical Image Segmentation: A Review of Theory and Applications," in IEEE Access, vol. 9, pp. 82031-82057, 2021. 2. Ronneberger, O., Fischer, P., Brox, T. (2015). U-Net: Convolutional Networks for Biomedical Image Segmentation. In: Navab, N., Hornegger, J., Wells, W., Frangi, A. (eds) Medical Image Computing and Computer-Assisted Intervention – MICCAI 2015. MICCAI 2015. Lecture Notes in Computer Science(), vol 9351. Springer, Cham 3. K. He, G. Gkioxari, P. Dollar, and R. Girshick, “Mask R-CNN,” 2017 IEEE International Conference on Computer Vision (ICCV), 2017. 4. Md. Anwar Hossain, & Md. Shahriar Alam Sajib. (2019). Classification of Image using Convolutional Neural Network (CNN). Global Journal of Computer Science and Technology, 19(D2), 13–18. Retrieved from https://computerresearch.org/index.php/computer/article/view/1821