Intro to Deep Learning for Computer Vision

  1. Applications of Deep Learning in Computer Vision (Christoph Körner)
  2. Outline: 1) Introduction to Neural Networks 2) Deep Learning 3) Applications in Computer Vision 4) Conclusion
  3. Why Deep Learning? ● Wins nearly every computer vision challenge (classification, segmentation, etc.) ● Can be applied in many domains (speech recognition, game prediction, computer vision, etc.) ● Beats human accuracy on some benchmarks ● Big communities and plentiful resources ● Dedicated hardware for deep learning
  4. Perceptron (1958) ● Weighted sum of inputs ● Threshold operator
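
As an aside (not on the slide): a minimal numpy sketch of that computation, with hand-picked (not learned) weights so the perceptron acts as an AND gate:

    import numpy as np

    def perceptron(x, w, b):
        # Weighted sum of inputs, followed by a hard threshold.
        return 1 if np.dot(w, x) + b > 0 else 0

    # Hand-picked weights (illustrative): this perceptron computes AND.
    w, b = np.array([1.0, 1.0]), -1.5
    for x in [(0, 0), (0, 1), (1, 0), (1, 1)]:
        print(x, perceptron(np.array(x, dtype=float), w, b))
    # -> 0, 0, 0, 1
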
  5. Artificial Neural Network (1960) ● Universal function approximator ● Can solve the XOR problem
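
A single perceptron cannot represent XOR because the classes are not linearly separable; one hidden layer fixes that. A sketch with hand-chosen (not learned) weights:

    import numpy as np

    def step(z):
        return (z > 0).astype(int)

    def xor_net(x):
        # Hidden layer: an OR-like unit and a NAND-like unit.
        W1 = np.array([[ 1.0,  1.0],
                       [-1.0, -1.0]])
        b1 = np.array([-0.5, 1.5])
        h = step(W1 @ x + b1)
        # Output unit: AND of the two hidden units gives XOR.
        return int(h @ np.array([1.0, 1.0]) - 1.5 > 0)

    for x in [(0, 0), (0, 1), (1, 0), (1, 1)]:
        print(x, xor_net(np.array(x, dtype=float)))
    # -> 0, 1, 1, 0
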
  6. Backpropagation (1982) ● Propagates the error backwards through the network ● Allows gradient-based optimization (SGD, etc.) ● Enables training of multi-layer networks
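
To make the idea concrete, a toy numpy sketch (assumed details: squared-error loss, sigmoid units, 4 hidden neurons) that trains a 2-4-1 network on XOR with plain gradient descent; with this seed the outputs should approach [0, 1, 1, 0]:

    import numpy as np

    rng = np.random.default_rng(0)
    X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]], dtype=float)
    y = np.array([[0], [1], [1], [0]], dtype=float)

    W1, b1 = rng.normal(size=(2, 4)), np.zeros(4)   # input -> hidden
    W2, b2 = rng.normal(size=(4, 1)), np.zeros(1)   # hidden -> output
    sigmoid = lambda z: 1 / (1 + np.exp(-z))

    for _ in range(20000):
        # Forward pass.
        h = sigmoid(X @ W1 + b1)
        out = sigmoid(h @ W2 + b2)
        # Backward pass: propagate the error through the network.
        d_out = (out - y) * out * (1 - out)   # chain rule at the output
        d_h = (d_out @ W2.T) * h * (1 - h)    # error pushed back to the hidden layer
        # Gradient-descent step (learning rate 1).
        W2 -= h.T @ d_out; b2 -= d_out.sum(axis=0)
        W1 -= X.T @ d_h;   b1 -= d_h.sum(axis=0)

    print(out.round(2).ravel())  # close to [0, 1, 1, 0]
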
  7. Convolution and Pooling (1989) ● Fewer parameters than fully connected (hidden) layers ● More efficient training
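
To illustrate the parameter savings (numbers are illustrative): a fully connected layer from a 28x28 input to a 24x24 output would need 28*28*24*24 ≈ 452k weights, while a 5x5 filter slides the same 25 weights across every position. A plain numpy sketch:

    import numpy as np

    def conv2d(img, kernel):
        # Valid convolution: slide the kernel and take dot products.
        kh, kw = kernel.shape
        H, W = img.shape
        out = np.empty((H - kh + 1, W - kw + 1))
        for i in range(out.shape[0]):
            for j in range(out.shape[1]):
                out[i, j] = (img[i:i + kh, j:j + kw] * kernel).sum()
        return out

    def max_pool2x2(x):
        # Keep the maximum of each non-overlapping 2x2 block.
        H, W = x.shape
        return x.reshape(H // 2, 2, W // 2, 2).max(axis=(1, 3))

    img = np.random.default_rng(0).normal(size=(28, 28))
    feat = max_pool2x2(conv2d(img, np.ones((5, 5)) / 25))  # 25 shared weights
    print(feat.shape)  # (12, 12)
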
  8. Handwritten ZIP Codes (1989) ● 30 training passes ● Achieved 92% accuracy
  9. What changed between 1989 and 2011? ● Better initialization ● Better non-linearities: ReLU ● 1,000 times more training data ● More computing power ● A factor-of-a-million speedup in training time through parallelization on GPUs
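
One reason ReLU mattered (a sketch of the standard argument, not from the slides): the sigmoid derivative is at most 0.25, so error signals shrink geometrically through many sigmoid layers, while ReLU passes a gradient of 1 wherever its input is positive:

    import numpy as np

    # Worst-case gradient scaling through 20 sigmoid layers:
    print(0.25 ** 20)                        # ~9.1e-13, i.e. vanishing gradients
    # ReLU: gradient is 1 on the positive side, 0 otherwise.
    relu_grad = lambda z: (z > 0).astype(float)
    print(relu_grad(np.array([-2.0, 3.0])))  # [0. 1.]
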
  10. Deep Learning ● Conv, pool, and fully connected layers ● ReLU activations ● Deeply nested models with many parameters ● New layer types and structures ● New techniques to reduce overfitting ● Loads of training data and compute power ● 10,000,000 images ● Weeks of training on multi-GPU machines
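
A minimal sketch of that layer recipe in PyTorch (assumptions: 28x28 grayscale inputs, 10 classes, illustrative channel counts):

    import torch
    import torch.nn as nn

    model = nn.Sequential(
        nn.Conv2d(1, 16, kernel_size=3, padding=1),  # conv layer
        nn.ReLU(),
        nn.MaxPool2d(2),                             # pool: 28x28 -> 14x14
        nn.Conv2d(16, 32, kernel_size=3, padding=1),
        nn.ReLU(),
        nn.MaxPool2d(2),                             # pool: 14x14 -> 7x7
        nn.Flatten(),
        nn.Linear(32 * 7 * 7, 10),                   # fully connected classifier
    )
    print(model(torch.zeros(1, 1, 28, 28)).shape)    # torch.Size([1, 10])
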
  11. AlexNet (2012) ● 62,378,344 parameters (250 MB) ● 24 layers
  12. VGGNet (2014) ● 102,908,520 parameters (412 MB) ● 23 layers
  13. GoogLeNet (2014) ● 6,998,552 parameters (28 MB) ● 143 layers
  14. Inception Module ● Heavy use of 1x1 convolutions (applied along the depth dimension) ● Very efficient
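
A hedged sketch of an Inception-style module in PyTorch: the 1x1 convolutions mix channels (the depth dimension) to cheaply reduce it before the expensive 3x3 and 5x5 branches. Channel counts are illustrative, not GoogLeNet's exact ones:

    import torch
    import torch.nn as nn

    class InceptionModule(nn.Module):
        def __init__(self, in_ch):
            super().__init__()
            # Four parallel branches; 1x1 convs reduce depth before 3x3/5x5.
            self.b1 = nn.Conv2d(in_ch, 64, 1)
            self.b2 = nn.Sequential(nn.Conv2d(in_ch, 32, 1), nn.ReLU(),
                                    nn.Conv2d(32, 64, 3, padding=1))
            self.b3 = nn.Sequential(nn.Conv2d(in_ch, 16, 1), nn.ReLU(),
                                    nn.Conv2d(16, 32, 5, padding=2))
            self.b4 = nn.Sequential(nn.MaxPool2d(3, stride=1, padding=1),
                                    nn.Conv2d(in_ch, 32, 1))

        def forward(self, x):
            # Concatenate branch outputs along the channel dimension.
            return torch.cat([self.b1(x), self.b2(x), self.b3(x), self.b4(x)], dim=1)

    print(InceptionModule(192)(torch.zeros(1, 192, 28, 28)).shape)
    # torch.Size([1, 192, 28, 28])  (64 + 64 + 32 + 32 output channels)
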
  15. ResNet (2015) ● Residual learning ● 152 layers
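
The key idea, residual learning, sketched in PyTorch (simplified: no batch normalization or projection shortcuts): each block learns a correction F(x) and outputs x + F(x), which keeps very deep stacks trainable:

    import torch
    import torch.nn as nn

    class ResidualBlock(nn.Module):
        def __init__(self, ch):
            super().__init__()
            self.conv1 = nn.Conv2d(ch, ch, 3, padding=1)
            self.conv2 = nn.Conv2d(ch, ch, 3, padding=1)
            self.relu = nn.ReLU()

        def forward(self, x):
            # Learn the residual F(x), then add the identity shortcut.
            return self.relu(x + self.conv2(self.relu(self.conv1(x))))

    x = torch.zeros(1, 64, 56, 56)
    print(ResidualBlock(64)(x).shape)  # torch.Size([1, 64, 56, 56])
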
  16. Applications in Computer Vision
  17. Classification ● One class per image ● Softmax layer at the end
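
For reference, the softmax layer turns the network's final scores into one probability per class; a numerically stable numpy sketch:

    import numpy as np

    def softmax(scores):
        # Subtracting the max keeps exp() numerically stable.
        e = np.exp(scores - scores.max())
        return e / e.sum()

    print(softmax(np.array([2.0, 1.0, 0.1])))
    # -> [0.659 0.242 0.099]; the image gets the highest-probability class
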
  18. Localization ● Bounding-box regression ● Sigmoid layer with 4 outputs at the end ● Can also be approached via classification
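
A sketch of such a localization head in PyTorch (the shapes and the 512-dim feature vector are assumptions for illustration): four sigmoid outputs interpreted as a box in normalized image coordinates:

    import torch
    import torch.nn as nn

    features = torch.zeros(1, 512)  # hypothetical output of a conv feature extractor

    bbox_head = nn.Sequential(
        nn.Linear(512, 4),  # 4 outputs: x, y, width, height
        nn.Sigmoid(),       # squash to [0, 1] = coordinates relative to the image
    )
    print(bbox_head(features).shape)  # torch.Size([1, 4])
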
  19. Detection ● Multiple objects, multiple classes ● Solved using multiple networks
  20. Segmentation
  21. More Applications ● Compression (auto-encoders, self-organizing maps) ● Image captioning (solved with recurrent architectures) ● Image stylization ● Clustering ● Many more...
  22. Conclusion ● Powerful: features are learned from data instead of hand-crafted ● Better than humans on some benchmarks ● Deeper is usually better, but watch for overfitting ● More data is usually better, but data quality and ground truth matter
  23. Thank you! Christoph Körner
