Semantic segmentation is a dense prediction task that labels each pixel of an image with a class. It has applications in autonomous vehicles, medical imaging, and image-guided surgery. Popular architectures for semantic segmentation include U-Net, which uses an encoder-decoder structure with skip connections, and Tiramisu, which uses dense blocks. The loss function commonly used is pixel-wise cross-entropy loss, which examines predictions at each pixel.
2. Hello!
I am Frederick Apina, Machine Learning Engineer @ParrotAI. I am here because I love to give presentations.
3. “When I think about strong innovations in terms of automation, cognitive computing, and artificial intelligence, they will be coming a lot from Tanzania as well.”
8. Semantic Segmentation
Semantic segmentation is the task of labeling each pixel of an image with a corresponding class of what is being represented. It is commonly referred to as dense prediction.
15. Our goal is to take either an RGB color image or a grayscale image and output a segmentation map where each pixel contains a class label represented as an integer.
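As a concrete shape-level sketch (the sizes and class names below are arbitrary, chosen only for illustration):

```python
import numpy as np

H, W = 256, 256

# Input: an RGB image (H, W, 3); a grayscale image would be (H, W) or (H, W, 1).
image = np.random.randint(0, 256, size=(H, W, 3), dtype=np.uint8)

# Output: a segmentation map of the same spatial size, where each pixel
# holds an integer class label, e.g. 0 = background, 1 = person, 2 = car.
seg_map = np.random.randint(0, 3, size=(H, W), dtype=np.int64)

print(image.shape, seg_map.shape)   # (256, 256, 3) (256, 256)
```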
16. We create our target by one-hot encoding the class labels, essentially creating an output channel for each of the possible classes.
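A minimal sketch of that one-hot encoding with NumPy (the tiny 4x4 label map and the three class names are made up for this example):

```python
import numpy as np

num_classes = 3

# Integer label map (H, W); e.g. 0 = background, 1 = person, 2 = car.
seg_map = np.array([
    [0, 0, 1, 1],
    [0, 1, 1, 1],
    [2, 2, 0, 0],
    [2, 2, 0, 0],
])

# One-hot encode: (H, W) -> (H, W, num_classes), one channel per class.
one_hot = np.eye(num_classes, dtype=np.float32)[seg_map]

print(one_hot.shape)   # (4, 4, 3)
print(one_hot[0, 2])   # pixel labeled 1 -> [0. 1. 0.]
```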
17. We can easily inspect a target by overlaying it onto the observation. When we overlay a single channel of our target (or prediction), we refer to this as a mask, which illuminates the regions of an image where a specific class is present.
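One way to build such an overlay, sketched with Matplotlib (the image, label map, and class choice are placeholders; the alpha value is just a reasonable default):

```python
import numpy as np
import matplotlib.pyplot as plt

# Placeholder observation and integer label map.
image = np.random.randint(0, 256, size=(256, 256, 3), dtype=np.uint8)
seg_map = np.random.randint(0, 3, size=(256, 256))

class_id = 1                           # e.g. the "person" class
mask = (seg_map == class_id)           # binary mask for that single class

plt.imshow(image)                                # the observation
plt.imshow(mask, cmap="Reds", alpha=0.4)         # translucent single-class mask
plt.axis("off")
plt.show()
```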
20. Recall that for deep convolutional networks, earlier layers tend to learn low-level concepts while later layers develop more high-level (and specialized) feature mappings. In order to maintain expressiveness, we typically need to increase the number of feature maps (channels) as we get deeper in the network.
22. Luckily for us...
One popular approach for image segmentation models is to follow an encoder/decoder structure.
23. U-Net Architecture
Consists of a contracting path to capture context and a symmetric expanding path that enables precise localization.
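A minimal PyTorch sketch of this idea (not the exact architecture from the original U-Net paper; the channel sizes and depth are trimmed for brevity, and input height/width are assumed divisible by 4):

```python
import torch
import torch.nn as nn

def conv_block(in_ch, out_ch):
    # Two 3x3 convolutions, as in each U-Net "block".
    return nn.Sequential(
        nn.Conv2d(in_ch, out_ch, 3, padding=1), nn.ReLU(inplace=True),
        nn.Conv2d(out_ch, out_ch, 3, padding=1), nn.ReLU(inplace=True),
    )

class TinyUNet(nn.Module):
    def __init__(self, n_classes):
        super().__init__()
        self.enc1 = conv_block(3, 16)                       # contracting path
        self.enc2 = conv_block(16, 32)
        self.pool = nn.MaxPool2d(2)
        self.bottleneck = conv_block(32, 64)
        self.up2 = nn.ConvTranspose2d(64, 32, 2, stride=2)  # expanding path
        self.dec2 = conv_block(64, 32)                      # 64 = 32 upsampled + 32 skip
        self.up1 = nn.ConvTranspose2d(32, 16, 2, stride=2)
        self.dec1 = conv_block(32, 16)
        self.head = nn.Conv2d(16, n_classes, 1)             # per-pixel class scores

    def forward(self, x):
        e1 = self.enc1(x)
        e2 = self.enc2(self.pool(e1))
        b = self.bottleneck(self.pool(e2))
        d2 = self.dec2(torch.cat([self.up2(b), e2], dim=1))  # long skip connection
        d1 = self.dec1(torch.cat([self.up1(d2), e1], dim=1))
        return self.head(d1)   # (N, n_classes, H, W) logits
```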
24. Advanced U-Net variants
The standard U-Net model consists of a series of convolution operations for each "block" in the architecture. One proposed variant swaps out the basic stacked convolution blocks in favor of residual blocks. The residual block introduces short skip connections (within the block) alongside the existing long skip connections (between the corresponding feature maps of encoder and decoder modules) found in the standard U-Net structure.
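A sketch of one such residual block (the naming and channel handling are illustrative rather than taken from a specific paper):

```python
import torch.nn as nn

class ResidualBlock(nn.Module):
    """Replacement for a plain stacked-convolution block: adds a short
    skip connection around the two convolutions inside the block."""
    def __init__(self, channels):
        super().__init__()
        self.conv1 = nn.Conv2d(channels, channels, 3, padding=1)
        self.bn1 = nn.BatchNorm2d(channels)
        self.conv2 = nn.Conv2d(channels, channels, 3, padding=1)
        self.bn2 = nn.BatchNorm2d(channels)
        self.relu = nn.ReLU(inplace=True)

    def forward(self, x):
        out = self.relu(self.bn1(self.conv1(x)))
        out = self.bn2(self.conv2(out))
        return self.relu(out + x)   # short skip connection within the block
```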
25. Tiramisu: Fully Convolutional DenseNet
Tiramisu adopts the U-Net design with downsampling, bottleneck, and upsampling paths and skip connections, but replaces the plain convolution blocks with dense blocks from the DenseNet architecture. Within a dense block, each layer receives the concatenated feature maps of all preceding layers in the block, rather than summing them as residual connections do.
26. Defining the loss function
The most commonly used loss function for the task of image segmentation is a pixel-wise cross-entropy loss. This loss examines each pixel individually, comparing the class predictions (a depth-wise pixel vector) to our one-hot encoded target vector.
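In PyTorch this pixel-wise cross-entropy can be sketched as follows (assuming the model outputs per-pixel logits; `CrossEntropyLoss` takes the integer label map directly rather than its one-hot form, handling the encoding internally):

```python
import torch
import torch.nn as nn

criterion = nn.CrossEntropyLoss()

logits = torch.randn(2, 3, 64, 64)            # (N, num_classes, H, W) predictions
target = torch.randint(0, 3, (2, 64, 64))     # (N, H, W) integer class labels

# Cross entropy is computed at each pixel and averaged over all pixels.
loss = criterion(logits, target)
print(loss.item())
```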
27. Conclusion
Deep learning is a continuously growing and relatively new field, and the vast amount of resources can be a touch overwhelming both for those looking to get into the field and for those already immersed in it. A good way of coping is to build a solid general knowledge of machine learning and then find a good structured path to follow (be it a project or research).