Enhanced real time semantic segmentation

•Download as PPTX, PDF•

0 likes•45 views

AkankshaRawat42

Enhanced Real-time Semantic Segmentation of Road Scenes — DDRNets

Technology

Enhanced Real-time Semantic Segmentation
of Road Scenes with DDRNets Name: Akanksha Rawat

The Fundamentals:
A. Computer Vision
B. Computer Vision Tasks
C. Convolutional neural network - basics of Modern Computer Vision
D. Semantic Segmentation and Autonomous Vehicle

A. Computer Vision
● Computer vision is a field of artificial intelligence (AI) that enables computers
and systems to derive meaningful information from digital images, videos, and
other visual inputs.
● The agenda of this field is to enable machines to view the world as humans do.

B. Computer Vision Tasks
● The most general computer vision tasks that we frequently encounter in AI
jargon include:
○ Image classification
○ Object detection
○ Image segmentation- Semantic Segmentation and Instance Segmentation.

C. Convolutional neural network - basics of
Modern Computer Vision
● In Deep learning, CNN is most commonly applied to analyze the visual imagery.
● Most computer vision algorithms are based on convolution neural network.
● CNNs are able to treat images like matrices as they exist and extract spatial
features from them, like texture, edges and depth. They do this by using
convolutional layers and pooling.

Convolutional neural network - basics of Modern
Computer Vision

D. Semantic Segmentation and Autonomous
Vehicle
● For fully autonomous vehicles, semantic Segmentation is a core element where
neural networks need to output high-resolution feature maps.
● Semantic segmentation is a key technology for autonomous vehicles to
understand the surrounding scenes.
● Semantic segmentation is a fundamental task in which each pixel of the input
image should be assigned to the corresponding label.
● It plays a vital role in many practical applications, such as medical image
segmentation, navigation of autonomous vehicles, and robots.

Semantic Segmentation and Autonomous
Vehicle

DDRNets- Introduction
● Real-time semantic segmentation is the task of achieving
computationally efficient semantic segmentation.
● The appealing performances of contemporary models usually come at
the expense of heavy computations and lengthy inference time, which
is intolerable for self-driving.
● Mostly methods are very time-consuming in the inference stage and
can not be directly deployed on the actual autonomous vehicles.
● DDRnets tackle this problem and achieves a new state-of-the-art
trade-off between accuracy and speed on both Cityscapes and CamVid
datasets.

DDRNets
● Semantic segmentation is a kind of dense prediction task which is
computationally expensive. This problem is especially critical for scene parsing
of autonomous driving.
● DDRnets consists of two main components:
the Deep Dual-resolution Network and the Deep Aggregation Pyramid Pooling
Module.

Architecture
Deep Dual-resolution Network- A family of novel bilateral networks with deep dual-
resolution branches and multiple bilateral fusions is proposed for real-time semantic
segmentation as efficient backbones.
Deep Aggregation Pyramid Pooling Module- A novel module is designed to harvest rich
context information by combining feature aggregation with pyramid pooling. When
executed on low-resolution feature maps, it leads to little increase in inference time.

Conclusion
● With widely used test augmentation, this method is superior to most state-of-
the-art models and requires much less computation.
● Due to the simplicity and efficiency of method, it can be seen as a strong
baseline for unifying real-time and high-accuracy semantic segmentation.

Visualized segmentation results on CamVid
test set

What's hot

IRJET- A Vision based Hand Gesture Recognition System using Convolutional...IRJET Journal

Computer Vision IntroductionCamera Culture Group, MIT Media Lab

Computer Vision with Deep LearningCapgemini

Image, Modelling & Computingmathgear

Image processing pptRaviteja Chowdary Adusumalli

«Design and purpose of convolutional layers in neural networks», Andrii Latysh.Provectus

HARDWARE SOFTWARE CO-SIMULATION FOR TRAFFIC LOAD COMPUTATION USING MATLAB SIM...ijcsity

Artificial intelligence at the edgeJameson Toole

Meetup 18/10/2018 - Artificiële intelligentie en mobiliteitDigipolis Antwerpen

What's hot (9)

IRJET- A Vision based Hand Gesture Recognition System using Convolutional...

Computer Vision Introduction

Computer Vision with Deep Learning

Image, Modelling & Computing

Image processing ppt

«Design and purpose of convolutional layers in neural networks», Andrii Latysh.

HARDWARE SOFTWARE CO-SIMULATION FOR TRAFFIC LOAD COMPUTATION USING MATLAB SIM...

Artificial intelligence at the edge

Meetup 18/10/2018 - Artificiële intelligentie en mobiliteit

Similar to Enhanced real time semantic segmentation

Automatism System Using Faster R-CNN and SVMIRJET Journal

A deep learning based stereo matching model for autonomous vehicleIAESIJAI

Car Steering Angle Prediction Using Deep LearningIRJET Journal

GP_Slides_V3 .pptxAhmedEldairy

Object Detection for Autonomous Cars using AI/MLIRJET Journal

Understanding the world in 3D with AI.pdfQualcomm Research

CAR DAMAGE DETECTION USING DEEP LEARNINGIRJET Journal

IRJET - Autonomous Navigation System using Deep LearningIRJET Journal

Major PRC-1 ppt.pptxJagruthiDARAPUNENI1

Design of Image Segmentation Algorithm for Autonomous Vehicle Navigationusing...IJEEE

Computer vesionAdil Mehmoood

Self Driving CarIRJET Journal

Data Annotation_Cars.pptxssuserfb92ae

Computer visionAnkitKamal6

Identifying Parking Spots from Surveillance Cameras using CNNIRJET Journal

AUTONOMOUS SELF DRIVING CARSIRJET Journal

Intelligent image processingAndrew Stewart

Residual balanced attention network for real-time traffic scene semantic segm...IJECEIAES

IRJET- A Deep Learning based Approach for Automatic Detection of Bike Rid...IRJET Journal

10.1109@ICCMC48092.2020.ICCMC-000167.pdfmokamojah

Similar to Enhanced real time semantic segmentation (20)

Automatism System Using Faster R-CNN and SVM

A deep learning based stereo matching model for autonomous vehicle

Car Steering Angle Prediction Using Deep Learning

GP_Slides_V3 .pptx

Object Detection for Autonomous Cars using AI/ML

Understanding the world in 3D with AI.pdf

CAR DAMAGE DETECTION USING DEEP LEARNING

IRJET - Autonomous Navigation System using Deep Learning

Major PRC-1 ppt.pptx

Design of Image Segmentation Algorithm for Autonomous Vehicle Navigationusing...

Computer vesion

Self Driving Car

Data Annotation_Cars.pptx

Computer vision

Identifying Parking Spots from Surveillance Cameras using CNN

AUTONOMOUS SELF DRIVING CARS

Intelligent image processing

Residual balanced attention network for real-time traffic scene semantic segm...

IRJET- A Deep Learning based Approach for Automatic Detection of Bike Rid...

10.1109@ICCMC48092.2020.ICCMC-000167.pdf

Recently uploaded

Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55

Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik

08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls

08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls

#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada

GenCyber Cyber Security Day PresentationMichael W. Hawkins

Benefits Of Flutter Compared To Other FrameworksSoftradix Technologies

IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge

Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsHyundai Motor Group

08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls

CloudStudio User manual (basic edition):comworks

Next-generation AAM aircraft unveiled by Supernal, S-A2Hyundai Motor Group

Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada

The transition to renewables in India.pdfCompetition Advisory Services (India) LLP

Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent

08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls

E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxnull - The Open Security Community

How to Remove Document Management Hurdles with X-Docs?XfilesPro

Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j

SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren

Recently uploaded (20)

Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...

Injustice - Developers Among Us (SciFiDevCon 2024)

08448380779 Call Girls In Diplomatic Enclave Women Seeking Men

08448380779 Call Girls In Friends Colony Women Seeking Men

#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024

GenCyber Cyber Security Day Presentation

Benefits Of Flutter Compared To Other Frameworks

IAC 2024 - IA Fast Track to Search Focused AI Solutions

Snow Chain-Integrated Tire for a Safe Drive on Winter Roads

08448380779 Call Girls In Greater Kailash - I Women Seeking Men

CloudStudio User manual (basic edition):

Next-generation AAM aircraft unveiled by Supernal, S-A2

Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024

The transition to renewables in India.pdf

Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...

08448380779 Call Girls In Civil Lines Women Seeking Men

E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx

How to Remove Document Management Hurdles with X-Docs?

Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...

SQL Database Design For Developers at php[tek] 2024

Enhanced real time semantic segmentation

1. Enhanced Real-time Semantic Segmentation of Road Scenes with DDRNets Name: Akanksha Rawat

2. The Fundamentals: A. Computer Vision B. Computer Vision Tasks C. Convolutional neural network - basics of Modern Computer Vision D. Semantic Segmentation and Autonomous Vehicle

3. A. Computer Vision ● Computer vision is a field of artificial intelligence (AI) that enables computers and systems to derive meaningful information from digital images, videos, and other visual inputs. ● The agenda of this field is to enable machines to view the world as humans do.

4. Computer Vision Source

5. B. Computer Vision Tasks ● The most general computer vision tasks that we frequently encounter in AI jargon include: ○ Image classification ○ Object detection ○ Image segmentation- Semantic Segmentation and Instance Segmentation.

6. Computer Vision Tasks

7. C. Convolutional neural network - basics of Modern Computer Vision ● In Deep learning, CNN is most commonly applied to analyze the visual imagery. ● Most computer vision algorithms are based on convolution neural network. ● CNNs are able to treat images like matrices as they exist and extract spatial features from them, like texture, edges and depth. They do this by using convolutional layers and pooling.

8. Convolutional neural network - basics of Modern Computer Vision

9. D. Semantic Segmentation and Autonomous Vehicle ● For fully autonomous vehicles, semantic Segmentation is a core element where neural networks need to output high-resolution feature maps. ● Semantic segmentation is a key technology for autonomous vehicles to understand the surrounding scenes. ● Semantic segmentation is a fundamental task in which each pixel of the input image should be assigned to the corresponding label. ● It plays a vital role in many practical applications, such as medical image segmentation, navigation of autonomous vehicles, and robots.

10. Semantic Segmentation and Autonomous Vehicle

11. DDRNets- Introduction ● Real-time semantic segmentation is the task of achieving computationally efficient semantic segmentation. ● The appealing performances of contemporary models usually come at the expense of heavy computations and lengthy inference time, which is intolerable for self-driving. ● Mostly methods are very time-consuming in the inference stage and can not be directly deployed on the actual autonomous vehicles. ● DDRnets tackle this problem and achieves a new state-of-the-art trade-off between accuracy and speed on both Cityscapes and CamVid datasets.

12. Performance of model

13. DDRNets ● Semantic segmentation is a kind of dense prediction task which is computationally expensive. This problem is especially critical for scene parsing of autonomous driving. ● DDRnets consists of two main components: the Deep Dual-resolution Network and the Deep Aggregation Pyramid Pooling Module.

14. Architecture Deep Dual-resolution Network- A family of novel bilateral networks with deep dual- resolution branches and multiple bilateral fusions is proposed for real-time semantic segmentation as efficient backbones. Deep Aggregation Pyramid Pooling Module- A novel module is designed to harvest rich context information by combining feature aggregation with pyramid pooling. When executed on low-resolution feature maps, it leads to little increase in inference time.

15. Architecture

16. Comparison of different backbones:

17. Conclusion ● With widely used test augmentation, this method is superior to most state-of- the-art models and requires much less computation. ● Due to the simplicity and efficiency of method, it can be seen as a strong baseline for unifying real-time and high-accuracy semantic segmentation.

18. Visualized segmentation results on CamVid test set

Enhanced real time semantic segmentation

Recommended

Recommended

More Related Content

What's hot

What's hot (9)

Similar to Enhanced real time semantic segmentation

Similar to Enhanced real time semantic segmentation (20)

Recently uploaded

Recently uploaded (20)

Enhanced real time semantic segmentation