Ever thought of going beyond TensorFlow, GPU or TPU to solve your image classification problems?
From a deep learning standpoint, image classification problems can often be solved far more effectively with transfer learning, a computer vision technique that helps you build accurate models while saving a great deal of time. This presentation will show you why it is so beneficial.
Agenda:
The history of image processing
What is Transfer Learning?
Introduction to Convolutional Neural Networks (CNNs)
Different types of CNN architectures like AlexNet, VGG, Inception, and ResNet
Performance of various CNN architectures
Solving a medical image diagnosis problem with the above-discussed architectures
About Knoldus MachineX
MachineX is a group of data wizards.
We are a team of data scientists and engineers with a product mindset who deliver competitive business advantage.
● Innovation Labs: Enable organizations to capture new value and business capabilities
● Blogs: Consistently blogging to share our knowledge and research
● Certifications: Deeplearning, Coursera, and Stanford certified professionals
● TOK Sessions: Insight and perspective to help you make the right business decisions
● Open Source Contribution: It’s great to contribute back to the community. We continuously advance open source technologies to meet demanding business requirements.
Problems
The problem with this pipeline:
● Feature extraction cannot be tweaked according to the classes and images at hand
● It is completely different from how we humans learn to recognize things
Transfer learning is the application of skills, knowledge, and/or attitudes that were learned in one situation to another learning situation.
In deep learning, transfer learning is usually expressed through the use of pre-trained models.
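In practice this means loading a network such as VGG or ResNet with ImageNet weights, freezing its feature-extraction layers, and training only a new classifier head on your own data. A minimal numpy sketch of the idea, with a fixed random projection standing in for the frozen pre-trained extractor (purely illustrative, not a real model):

```python
import numpy as np

rng = np.random.default_rng(0)

# "Pre-trained" feature extractor: in a real setting this would be a CNN
# with weights learned on ImageNet; here a fixed random projection + ReLU
# stands in for the frozen layers (an illustrative assumption).
W_frozen = rng.normal(size=(20, 8))

def extract_features(x):
    return np.maximum(0.0, x @ W_frozen)  # frozen: never updated

# Toy binary classification data.
X = rng.normal(size=(200, 20))
y = (X[:, 0] + X[:, 1] > 0).astype(float)

feats = extract_features(X)

# Only the new classification head is trained (logistic regression).
w = np.zeros(8)
b = 0.0

def loss():
    p = 1.0 / (1.0 + np.exp(-(feats @ w + b)))
    return -np.mean(y * np.log(p + 1e-12) + (1 - y) * np.log(1 - p + 1e-12))

initial = loss()
for _ in range(500):
    p = 1.0 / (1.0 + np.exp(-(feats @ w + b)))
    grad = p - y
    w -= 0.05 * (feats.T @ grad) / len(y)
    b -= 0.05 * grad.mean()
final = loss()
```

Because only the small head is optimized while the extractor stays fixed, training is fast and needs far less data than learning the whole network from scratch.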
AlexNet
● Data augmentation is carried out to reduce overfitting
● Used ReLU, which reached a 25% training error rate about six times faster than the same network with tanh nonlinearity
● Introduced Local Response Normalization (LRN), which aids generalization
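The speed-up from ReLU comes from its non-saturating gradient; a small numpy sketch contrasting it with tanh:

```python
import numpy as np

def relu(z):
    return np.maximum(0.0, z)

def relu_grad(z):
    # Gradient is exactly 1 for any positive input, 0 otherwise.
    return np.where(np.asarray(z) > 0, 1.0, 0.0)

def tanh_grad(z):
    # d/dz tanh(z) = 1 - tanh(z)^2: collapses toward 0 for large |z|.
    return 1.0 - np.tanh(z) ** 2

# For a large activation, tanh saturates and learning slows, while
# ReLU passes the gradient through unchanged.
z = 5.0
print(float(relu_grad(z)))   # 1.0
print(float(tanh_grad(z)))   # ~0.00018
```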
VGGNet
● VGG16 has a total of about 138 million parameters
● Conv kernels are 3x3 and max-pool kernels are 2x2 with stride 2
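The 138 million figure follows directly from the uniform 3x3-kernel layout; a short sketch that recomputes it, assuming the standard 13-conv / 3-FC VGG16 configuration with a 224x224 input:

```python
# Each conv layer has k*k*c_in*c_out weights plus c_out biases; the
# 2x2/stride-2 max-pools halve the spatial size and add no parameters.
def conv_params(c_in, c_out, k=3):
    return k * k * c_in * c_out + c_out

def fc_params(n_in, n_out):
    return n_in * n_out + n_out

# Channel progression of the 13 conv layers in VGG16.
channels = [3, 64, 64, 128, 128, 256, 256, 256, 512, 512, 512, 512, 512, 512]
total = sum(conv_params(channels[i], channels[i + 1]) for i in range(13))

# After five 2x2 pools, a 224x224 input is reduced to a 7x7x512 volume,
# which feeds three fully connected layers.
total += fc_params(7 * 7 * 512, 4096) + fc_params(4096, 4096) + fc_params(4096, 1000)
print(total)  # 138357544 (~138 million)
```

Note that the three fully connected layers alone account for roughly 124 million of those parameters.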
Hierarchical Features and role of Depth
● Low-, mid-, and high-level features
● More layers enrich the “levels” of the features
● Previous ImageNet models had depths of 16 to 30 layers
Construction Insight
● Consider a shallow architecture and its deeper counterpart
● The deeper model need only copy the shallower model and use identity mappings for its extra layers
● This construction implies that a deeper model should produce no higher training error than its shallower counterpart
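A toy numpy sketch of the construction argument: padding a shallow network with identity layers leaves its outputs, and hence its training error, unchanged (the random weights here are purely illustrative):

```python
import numpy as np

rng = np.random.default_rng(0)

def layer(x, W):
    return np.maximum(0.0, x @ W)  # simple ReLU layer

def forward(x, weights):
    for W in weights:
        x = layer(x, W)
    return x

x = rng.normal(size=(4, 16))
shallow_weights = [rng.normal(size=(16, 16)) for _ in range(3)]

# Deeper counterpart: the same learned layers plus extra identity mappings.
deep_weights = shallow_weights + [np.eye(16)] * 2

out_shallow = forward(x, shallow_weights)
out_deep = forward(x, deep_weights)
# Identical outputs, so the deeper model can be no worse in training error.
print(np.allclose(out_shallow, out_deep))  # True
```

In practice, however, plain deep networks fail to find even this trivial solution by optimization, which is the degradation problem ResNet addresses.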
Residual Functions
● ResNet explicitly reformulates the layers as learning residual functions with reference to the layer inputs, instead of learning unreferenced functions
● H(x) = F(x) + x
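A minimal numpy sketch of a residual block built on H(x) = F(x) + x, where two linear layers with a ReLU stand in for ResNet's conv stack:

```python
import numpy as np

rng = np.random.default_rng(0)

def residual_block(x, W1, W2):
    # H(x) = F(x) + x: F is the residual function (here two linear
    # layers with a ReLU in between, a simplified stand-in for
    # ResNet's conv-BN-ReLU stack), and x is the skip connection.
    f = np.maximum(0.0, x @ W1) @ W2
    return f + x

x = rng.normal(size=(4, 8))

# If the optimal mapping is the identity, the block only needs to drive
# F toward zero, which is easier than fitting an identity with stacked
# nonlinear layers.
W_zero = np.zeros((8, 8))
print(np.allclose(residual_block(x, W_zero, W_zero), x))  # True
```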
Experiment
● 152 layers on ImageNet
○ 8× deeper than VGGNet
○ Fewer parameters
● ResNet achieved 3.57% top-5 error on the ImageNet test set
○ 1st place in ILSVRC 2015
Results
● AlexNet and ResNet-152 both have about 60M parameters, but there is about a 10% difference in their top-5 accuracy
● VGGNet not only has more parameters and FLOPs than ResNet-152, but also lower accuracy
● Training AlexNet takes about the same time as training Inception, which has roughly 10 times lower memory requirements
References
● [1] A. Krizhevsky, I. Sutskever, and G. E. Hinton. ImageNet classification with deep convolutional neural networks. In Advances in Neural Information Processing Systems, pages 1097–1105, 2012.
● [2] K. He, X. Zhang, S. Ren, and J. Sun. Deep residual learning for image recognition. arXiv preprint arXiv:1512.03385, 2015.
● [3] K. Simonyan and A. Zisserman. Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556, 2014.
● [4] C. Szegedy, W. Liu, Y. Jia, P. Sermanet, S. Reed, D. Anguelov, D. Erhan, V. Vanhoucke, and A. Rabinovich. Going deeper with convolutions. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 1–9, 2015.
● [5] https://arxiv.org/pdf/1901.06032.pdf