The document discusses computer vision with deep learning. It provides an overview of convolutional neural networks and their use in computer vision applications like image classification and object detection. Specifically, it discusses how CNNs use convolutional layers to learn visual features from images and provide examples of CNNs being used for pipeline defect classification and filler cap quality control.
5. Computer Vision with Deep Learning | Andreas Eßbaumer
Why the hype? – The
ImageNet competition
ImageNet:
Over 10 Mio labeled images.
Over 10.000 different labels.
ImageNet Competition:
Restrict the number of labels to 1000.
Algorithms have to produce up to five labels per
image.
8. Computer Vision with Deep Learning | Andreas Eßbaumer
Visualization of convolution layers learned(!) by the network
From millions of training images, the network learns that the most basic features to
look for in an image are basic forms like edges, circles or smooth color transitions.
Source: Visualizing and Understanding Convolutional Networks by Zeiler/Fergus arXiv:1311.2901
10. Computer Vision with Deep Learning | Andreas Eßbaumer
Visualization of convolution layers learned(!) by the network
Working with the most basic features extracted by the first layers, the network is able
to combine them in a larger area of the picture and find more complex
structures like certain patterns and written text pieces.
Source: Visualizing and Understanding Convolutional Networks by Zeiler/Fergus arXiv:1311.2901
19. Computer Vision with Deep Learning | Andreas Eßbaumer
Deep Learning gave better results with little effort
0,00%
10,00%
20,00%
30,00%
40,00%
50,00%
60,00%
70,00%
80,00%
90,00%
100,00%
CV InceptionV3 - 172+ InceptionV3 - 0+ Custom-V1
Defect classification results
Recall Specificity
• After the successful POC, industrialization is currently ongoing
• Human experts have a strong assistance tool now
• Lowering risks by double-checks (Human + AI)
22. Computer Vision with Deep Learning | Andreas Eßbaumer
Andreas Eßbaumer
Senior Data Scientist
Capgemini Munich
Olof-Palme-Straße 14
81829 Munich
andreas.essbaumer@capgemini.com
M: +49 151 4025 2219
Summary
Convolutional Neural Networks (CNNs)
• are the State-of-the-Art for Computer Vision
• run via Open-Source-Frameworks
• require GPUs and labelled Images
• are (relatively) easy to run