Deep convolutional neural networks and their many uses for computer vision

•

3 likes•775 views

This document discusses deep convolutional neural networks and their many uses for computer vision. It provides an overview of computer vision, conventional computer vision techniques, and deep learning. It then describes the key components of convolutional neural networks, including convolutional layers, pooling layers, and fully connected layers. The document traces the evolution of convolutional neural networks from early models in 1980 to more recent state-of-the-art models from 2012 onward. It highlights several applications of convolutional neural networks like object detection, segmentation, recognition, style transfer, and image generation. In conclusion, the document demonstrates convolutional neural networks have many uses in computer vision tasks.

Data & Analytics

Deep Convolutional Neural
Networks and their Many Uses
for Computer Vision
Dr. Fares Al-Qunaieer
Lead Data Scientist
Saudi Information Technology Company (SITE)

Computer Vision
A field to develop algorithms that make machines and
computers “understand” the content of images and videos
Machine
Learning
Image
Processing
Computer
Vision

Conventional Computer Vision
Hand crafted operations and features

What is Convolution?
Multiply and sum kernel/filter across the image

Deep Learning
Machine Learning Algorithms that use Neural Networks,
with many layers

Convolutional Neural Networks (ConvNets)
Image by Aphex34 - Own work, CC BY-SA 4.0, https://commons.wikimedia.org/w/index.php?curid=45679374
• Neural Networks with convolution layers (and some more)
• Learn best kernels/filters from data instead of manually selected

Components and Layers of ConvNets
• Convolutional layers
• Pooling layers
• Fully connected layers
• Activation functions (e.g., ReLU)

Convolutional Layers
Image by Aphex34 - Own work, CC BY-SA 4.0, https://commons.wikimedia.org/w/index.php?curid=45659236

Pooling Layers
Image by Aphex34 - Own work, CC BY-SA 4.0, https://commons.wikimedia.org/w/index.php?curid=45673581
Max Pooling

Evolution of ConvNets
(Historical Journey)

Neocognitron (1980)
K. Fukushima: "Neocognitron: A self-organizing neural network model for a mechanism of pattern recognition unaffected by shift in position", 1980

LeNet-5 (1998)
Y. LeCun, L. Bottou, Y. Bengio, and P. Haffner. Gradient-based learning applied to document recognition, 1998

AlexNet (2012)
Krizhevsky, Alex; Sutskever, Ilya; Hinton, Geoffrey E. "ImageNet classification with deep convolutional neural networks". 2012

VGG Net (2014)
Simonyan, Karen, and Andrew Zisserman. "Very deep convolutional networks for large-scale image recognition." 2014

GoogleNet – Inception (2014)
C. Szegedy et al., "Going deeper with convolutions," 2015

ResNet (2016)
K. He, X. Zhang, S. Ren and J. Sun, " Deep Residual Learning for Image Recognition," 2016

Objects Detection and Localization
Images source: https://towardsdatascience.com/r-cnn-fast-r-cnn-faster-r-cnn-yolo-object-detection-algorithms-36d53571365e
YOLO (You Only Look Once)
Faster RCNN (Region Based CNN)
https://www.youtube.com/watch?v=MPU2HistivI

Objects Segmentation
V. Badrinarayanan, A. Kendall and R. Cipolla, "SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation," 2017
Ronneberger, O., Fischer, P., & Brox, T. “U-Net: Convolutional
Networks for Biomedical Image Segmentation” 2015

Face Recognition (FaceNet)
F. Schroff, D. Kalenichenko and J. Philbin, "FaceNet: A unified embedding for face recognition and clustering," 2015
Image source: https://omoindrot.github.io/triplet-loss

Style Transfer
L. A. Gatys, A. S. Ecker and M. Bethge, "Image Style Transfer Using Convolutional Neural Networks," 2016

Image Generation - GAN (Generative Adversarial Network)
Image source: Goodfellow et al; Karras, Laine, Aila / Nvidia

Other Applications
• Scene labelling
• Action recognition
• Human Pose estimation
• Document analysis
• Medical diagnosis
• And many more …

What's hot

Introduction talk to Computer Vision Chen Sagiv

Learning with Videos (D4L4 2017 UPC Deep Learning for Computer Vision)Universitat Politècnica de Catalunya

Tele immersionsagarpg

Meetup 18/10/2018 - Artificiële intelligentie en mobiliteitDigipolis Antwerpen

Image classification on Imagenet (D1L4 2017 UPC Deep Learning for Computer Vi...Universitat Politècnica de Catalunya

An Introduction to Computer Visionguestd1b1b5

Neural networks and deep learningJörgen Sandig

“Maintaining DNN Accuracy When the Real World is Changing,” a Presentation fr...Edge AI and Vision Alliance

Computer Vision IntroductionCamera Culture Group, MIT Media Lab

Tele-immersionVrindha sajikumar

Open Cv – An Introduction To The VisionHemanth Haridas

Image Processing pptOECLIB Odisha Electronics Control Library

Deep Learning: a birds eye viewRoelof Pieters

Promises of Deep LearningDavid Khosid

Tele immersionronak patel

Tele immersionhimnshu16

Creating smaller, faster, production-ready mobile machine learning models.Jameson Toole

Digital Image ProcessingSamir Sabry

Mobile Projections in Urban SpacesDenis Perevalov

Tele immersionAnup Dere

What's hot (20)

Introduction talk to Computer Vision

Learning with Videos (D4L4 2017 UPC Deep Learning for Computer Vision)

Tele immersion

Meetup 18/10/2018 - Artificiële intelligentie en mobiliteit

Image classification on Imagenet (D1L4 2017 UPC Deep Learning for Computer Vi...

An Introduction to Computer Vision

Neural networks and deep learning

“Maintaining DNN Accuracy When the Real World is Changing,” a Presentation fr...

Computer Vision Introduction

Tele-immersion

Open Cv – An Introduction To The Vision

Image Processing ppt

Deep Learning: a birds eye view

Promises of Deep Learning

Tele immersion

Creating smaller, faster, production-ready mobile machine learning models.

Digital Image Processing

Mobile Projections in Urban Spaces

Tele immersion

Similar to Deep convolutional neural networks and their many uses for computer vision

Scene recognition using Convolutional Neural NetworkDhirajGidde

Object Detetcion using SSD-MobileNetIRJET Journal

Makine Öğrenmesi ile Görüntü Tanıma | Image Recognition using Machine LearningAli Alkan

Deep Learning AtoC with Image PerspectiveDong Heon Cho

Real Time Object Detection with Audio Feedback using Yolo v3ijtsrd

IRJET- Real-Time Object Detection using Deep Learning: A SurveyIRJET Journal

Deep Learning for X ray Image to Text Generationijtsrd

Computer vision introduction Wael Badawy

Introduction to Deep learningleopauly

Deep Learning For Computer Vision- Day 3 Study Jams GDSC Unsri.pptxpmgdscunsri

UNSUPERVISED LEARNING MODELS OF INVARIANT FEATURES IN IMAGES: RECENT DEVELOPM...ijscai

Unsupervised learning models of invariant features in images: Recent developm...IJSCAI Journal

Anomaly Detection with Azure and .NETMarco Parenzan

kanimozhi2019.pdfAshrafDabbas1

An introduction to computer vision with Hugging FaceJulien SIMON

TRANSFER LEARNING WITH CONVOLUTIONAL NEURAL NETWORKS FOR IRIS RECOGNITIONijaia

Transfer Learning with Convolutional Neural Networks for IRIS Recognitiongerogepatton

TRANSFER LEARNING WITH CONVOLUTIONAL NEURAL NETWORKS FOR IRIS RECOGNITION gerogepatton

Deep learning Rajgupta258

Similar to Deep convolutional neural networks and their many uses for computer vision (20)

Scene recognition using Convolutional Neural Network

Object Detetcion using SSD-MobileNet

Makine Öğrenmesi ile Görüntü Tanıma | Image Recognition using Machine Learning

Deep Learning AtoC with Image Perspective

Real Time Object Detection with Audio Feedback using Yolo v3

IRJET- Real-Time Object Detection using Deep Learning: A Survey

Deep Learning for X ray Image to Text Generation

Computer vision introduction

Introduction to Deep learning

Deep Learning For Computer Vision- Day 3 Study Jams GDSC Unsri.pptx

UNSUPERVISED LEARNING MODELS OF INVARIANT FEATURES IN IMAGES: RECENT DEVELOPM...

Unsupervised learning models of invariant features in images: Recent developm...

Anomaly Detection with Azure and .NET

kanimozhi2019.pdf

An introduction to computer vision with Hugging Face

TRANSFER LEARNING WITH CONVOLUTIONAL NEURAL NETWORKS FOR IRIS RECOGNITION

Transfer Learning with Convolutional Neural Networks for IRIS Recognition

TRANSFER LEARNING WITH CONVOLUTIONAL NEURAL NETWORKS FOR IRIS RECOGNITION

Deep learning

Recently uploaded

GA4 Without Cookies [Measure Camp AMS]📊 Markus Baersch

VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130Suhani Kapoor

Call Girls In Mahipalpur O9654467111 Escorts ServiceSapana Sha

VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...Suhani Kapoor

办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一F sss

办理学位证纽约大学毕业证(NYU毕业证书）原版一比一fhwihughh

Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Sapana Sha

꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Callshivangimorya083

Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024thyngster

Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...ThinkInnovation

Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptxAbdelrhman abooda

Data Science Jobs and Salaries Analysis.pptxFurkanTasci3

Deep Generative Learning for All - The Gen AI Hype (Spring 2024)Universitat Politècnica de Catalunya

Call Girls In Dwarka 9654467111 Escorts ServiceSapana Sha

9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort servicejennyeacort

Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)jennyeacort

办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一F La

9654467111 Call Girls In Munirka Hotel And Home ServiceSapana Sha

Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsappssapnasaifi408

DBA Basics: Getting Started with Performance Tuning.pdfJohn Sterrett

Recently uploaded (20)

GA4 Without Cookies [Measure Camp AMS]

VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130

Call Girls In Mahipalpur O9654467111 Escorts Service

VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...

办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一

办理学位证纽约大学毕业证(NYU毕业证书）原版一比一

Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...

꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call

Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024

Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...

Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx

Data Science Jobs and Salaries Analysis.pptx

Deep Generative Learning for All - The Gen AI Hype (Spring 2024)

Call Girls In Dwarka 9654467111 Escorts Service

9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service

Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)

办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一

9654467111 Call Girls In Munirka Hotel And Home Service

Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps

DBA Basics: Getting Started with Performance Tuning.pdf

Deep convolutional neural networks and their many uses for computer vision

1. Deep Convolutional Neural Networks and their Many Uses for Computer Vision Dr. Fares Al-Qunaieer Lead Data Scientist Saudi Information Technology Company (SITE)

2. Computer Vision A field to develop algorithms that make machines and computers “understand” the content of images and videos Machine Learning Image Processing Computer Vision

3. Conventional Computer Vision Hand crafted operations and features

4. What is Convolution? Multiply and sum kernel/filter across the image

5. Convolution Sobel Operators

6. Deep Learning Machine Learning Algorithms that use Neural Networks, with many layers

7. Convolutional Neural Networks (ConvNets) Image by Aphex34 - Own work, CC BY-SA 4.0, https://commons.wikimedia.org/w/index.php?curid=45679374 • Neural Networks with convolution layers (and some more) • Learn best kernels/filters from data instead of manually selected

8. Components and Layers of ConvNets • Convolutional layers • Pooling layers • Fully connected layers • Activation functions (e.g., ReLU)

9. Convolutional Layers Image by Aphex34 - Own work, CC BY-SA 4.0, https://commons.wikimedia.org/w/index.php?curid=45659236

10. Pooling Layers Image by Aphex34 - Own work, CC BY-SA 4.0, https://commons.wikimedia.org/w/index.php?curid=45673581 Max Pooling

11. Fully Connected Layers

12. Activation Functions

13. Visualizing ConvNets

14. Evolution of ConvNets (Historical Journey)

15. Neocognitron (1980) K. Fukushima: "Neocognitron: A self-organizing neural network model for a mechanism of pattern recognition unaffected by shift in position", 1980

16. LeNet-5 (1998) Y. LeCun, L. Bottou, Y. Bengio, and P. Haffner. Gradient-based learning applied to document recognition, 1998

17. AlexNet (2012) Krizhevsky, Alex; Sutskever, Ilya; Hinton, Geoffrey E. "ImageNet classification with deep convolutional neural networks". 2012

18. VGG Net (2014) Simonyan, Karen, and Andrew Zisserman. "Very deep convolutional networks for large-scale image recognition." 2014

19. GoogleNet – Inception (2014) C. Szegedy et al., "Going deeper with convolutions," 2015

20. ResNet (2016) K. He, X. Zhang, S. Ren and J. Sun, " Deep Residual Learning for Image Recognition," 2016

21. Applications

22. Objects Detection and Localization Images source: https://towardsdatascience.com/r-cnn-fast-r-cnn-faster-r-cnn-yolo-object-detection-algorithms-36d53571365e YOLO (You Only Look Once) Faster RCNN (Region Based CNN) https://www.youtube.com/watch?v=MPU2HistivI

23. Objects Segmentation V. Badrinarayanan, A. Kendall and R. Cipolla, "SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation," 2017 Ronneberger, O., Fischer, P., & Brox, T. “U-Net: Convolutional Networks for Biomedical Image Segmentation” 2015

24. Face Recognition (FaceNet) F. Schroff, D. Kalenichenko and J. Philbin, "FaceNet: A unified embedding for face recognition and clustering," 2015 Image source: https://omoindrot.github.io/triplet-loss

25. Style Transfer L. A. Gatys, A. S. Ecker and M. Bethge, "Image Style Transfer Using Convolutional Neural Networks," 2016

26. Image Generation - GAN (Generative Adversarial Network) Image source: Goodfellow et al; Karras, Laine, Aila / Nvidia

27. Other Applications • Scene labelling • Action recognition • Human Pose estimation • Document analysis • Medical diagnosis • And many more …

28. DEMO

29. Thank You

Deep convolutional neural networks and their many uses for computer vision

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Similar to Deep convolutional neural networks and their many uses for computer vision

Similar to Deep convolutional neural networks and their many uses for computer vision (20)

More from Fares Al-Qunaieer

More from Fares Al-Qunaieer (12)

Recently uploaded

Recently uploaded (20)

Deep convolutional neural networks and their many uses for computer vision