Introduction to Machine Learning, Hands-on Deep Learning with Tensroflow 2.0

https://ntg.ai/gdgbaku/
Introduction to Machine
Learning & Hands on Deep
Learning with Tensorflow 2.0

scientia potentia est
GDG Baku 2019 - Natig Vahabov 2
“Knowledge is power”
Sir Francis Bacon, 16th century
English philosopher

Agenda
• AI & ML & DL
• Theory of CNN
• Hands on Tensorflow 2.0
• Building CNN with Keras
• Django + Tensorflow (trained model)

AI & ML & DL

What is AI?
Academic term:
As the study of "intelligent agents": any device that
perceives its environment and takes actions that
maximize its chance of successfully achieving its
goals
Simple term:
A.I. is the study of how to make computers do
things at which, at the moment, people are better.

Weak AI vs Strong AI
Weak or Narrow AI is a type of artificial intelligence that
is focused on one narrow task
- Siri, Alexa, Sophia
On the other hand, Artificial General Intelligence (AGI) is
the intelligence of an intelligent agent that can
understand or learn any intellectual task that
a human being can (Strong AI)
- Samantha (‘Her’ movie)

Good, Old-Fashioned AI(GOFAI)
The Intelligent Agent must inertly duplicate the
human mind in a such manner that the
synthetic mind
• can be studied
• can be animated within the memory of the
Intelligent Agent

Expert System as GOFAI
‘An expert system is a computer system
that emulates, or acts in all respects, with
the decision-making capabilities of a human
expert’.
Prof Edward Feigenbaum

Blood Disorder with ES

MYCIN
Diagnosing and treating patients with infectious blood
diseases
• a rule-based expert system
• developed at Stanford University – 1976
• uses backward chaining for reasoning
• incorporates about 500 rules
• written in INTERLISP (a dialect of LISP)
• a correct diagnosis rate of about 65% even though it
was worse than real physicians who had average
correct diagnosis rates of about 80%

Machine Learning
Machine Learning is the field of study that gives
computers the ability to learn without being
explicitly programmed.
Arthur Samuel, 1959

Machine Learning
A computer program is said to learn from
experience E with respect to some task T and some
performance measure P, if its performance on T, as
measured by P, improves with experience E
Tom Mitchell, 1997

Blood Disorder with ML
University of Colorado, Nov 6th 2019

ML Engineer Interview question

Machine Learning vs ES
Task: Spam/ Ham detection(T) (Supervised
Learning, Classification problem)
• ES – writing rules (with knowledge
representation - FirstOrderLogic) which
classifies an email as a spam
• ML – give previous spam/ham emails(E), train
the model, evaluate the solution(P), analyze
errors, update the model weights

Machine Learning vs ES
Machine Learning is great for:
• Problems for which existing solutions require a lot of hand-
tuning or long lists of rules: one Machine Learning algorithm can
often simplify code and perform better.
• Complex problems for which there is no good solution at all
using a traditional approach: the best Machine Learning
techniques can find a solution.
• Fluctuating environments: a Machine Learning system can adapt
to new data.
• Getting insights about complex problems and large amounts of
data.

Machine Learning
Systems

Machine Learning Systems
• Whether or not they are trained with human
supervision (supervised, unsupervised, semi-
supervised, and Reinforcement Learning)
• Whether or not they can learn incrementally on the fly
(online versus batch learning)
• Whether they work by simply comparing new data
points to known data points, or instead detect
patterns in the training data and build a predictive
model, much like scientists do (instance-based
versus model-based learning)

Supervised MLSs
• In supervised learning, the training data you feed to
the algorithm includes the desired solutions, called
labels
• Two main categories: Regression and Classification
• Main Algorithms:
– k-Nearest Neighbors (recommender system, similar users)
– Linear Regression (house price prediction)
– Logistic Regression (spam/ham filter)
– Support Vector Machines (face detection)
– Decision Trees and Random Forests (google photos)
– Neural networks

Logistic Regression

Logistic Regression in practice
• Political campaigns try to
predict the chances that a
voter will vote for their
candidate
• Bankers use it to predict
the chances that a loan
applicant will default on the
loan
• Marketers use it to predict
whether a customer will
respond to a particular ad

Unsupervised MLSs
• In unsupervised learning, the training data is unlabeled
• Clustering
– K-Means, DBSCAN, Hierarchical Cluster Analysis (HCA)
• Anomaly detection and novelty detection
– One-class SVM, Isolation Forest
• Visualization and dimensionality reduction
– Principal Component Analysis (PCA), Kernel PCA
– Locally-Linear Embedding (LLE)
– t-distributed Stochastic Neighbor Embedding (t-SNE)
• Association rule learning
– Apriori
– Eclat

Apriori in practice
Wal-Mart actually used the Apriori
algorithm to increase sales of
beer. Wal-Mart studied their data
to find that American males who
bought diapers on Friday
afternoons also frequently bought
beer. They moved the beer next to
the diapers, and sales increased
(Tesco in UK did the same)

Semi-supervised MLSs
• Some algorithms can deal with partially labeled
training data, usually a lot of unlabeled data and a
little bit of labeled data. This is called semisupervised
learning
• Google Photos (detecting faces, assigns them to a
person)
• Webpage classification (news, educational, shopping,
blog ..)

Reinforcement Learning

Reinforcement Learning
• The Man vs. The Machine / Deep Blue defeated Garry
Kasparov in 1997
• At the 2017 Future of Go Summit, DeepMind’s
successor AlphaGo Master beat Ke Jie, the world No.1
ranked player at the time, in a three-game match
• Robotics that mimics real animals
• Alibaba Group published a paper “Real-Time Bidding
with Multi-Agent Reinforcement Learning in Display
Advertising”

What exactly does a
machine learning
engineer do?

Machine Learning Engineer
• Running machine learning experiments using a
programming language with machine learning
libraries.
• Deploying machine learning solutions into
production.
• Optimizing solutions for performance and scalability.
• Data engineering, i.e. ensuring a good data flow
between database and backend systems.
• Implementing custom machine learning code.
• Data science, i.e. analyzing data and coming up with
use cases

ML Engineer sample jobs

ML Engineer should know

ML Engineer’s nightmare

If ML solves our problem,
why do we need DL?

Reason1: Massive Data
Amount of data that are generated in a single day of
2019
• 500 million tweets are sent
• 294 billion emails are sent
• 4 petabytes of data are created on Facebook
• 4 terabytes of data are created from each connected car
• 65 billion messages are sent on WhatsApp
• 5 billion searches are made
• By 2025, it’s estimated that 463 exabytes of data will be created
each day globally – that’s the equivalent of 212,765,957 DVDs per
day!

Reason2: Moore’s Law

Reason2: Moore’s Law
Moore's law is the observation that the
number of transistors in a dense
integrated circuit doubles about every 18-
24 months
TPU >> GPU >> CPU

Deep Learning

Neural Network

Artificial Neural Network

NN Types
Perceptron, CNN, RNN,
Boltzman Machine, DBN etc.
Article: https://towardsdatascience.com/the-
mostly-complete-chart-of-neural-networks-
explained-3fb6f2367464

Activation Functions

Deep Learning
Frameworks

DL Frameworks

Why we should use framework?
• High level abstracted API
• Code friendly environment with engineers
• Easy hands-on adaptation with newbies
• Advance visualization of inside NN
(Tensorboard)
• Single tool for both development and serving
(TFServing)
• Option to run of recent academic papers
(paperswithcode.com)

Bones of DLF
• Components of any DL framework
– Tensors
– Operations
– Computation Graph
– Auto-differentiation
– Fast and Efficient floating pt. Operations
– GPU support
• BLAS, cuBLAS, cuDNN

Tensorflow
Google’s Tensorflow — arguably the most popular
Deep Learning framework today. Gmail, Uber,
Airbnb, Nvidia and lots of other prominent brands
using it.
• Python is the most convenient client language for
working with TensorFlow. However, there are also
experimental interfaces available in JavaScript, C
++, Java and Go, C # and Julia
• Ability to run models on mobile platforms like iOS
and Android
• TF needs a lot of coding
• TF operates with a static computation graph
GDG Baku 2019- Natig Vahabov 47

Keras
It’s the most minimalist approach to using
TensorFlow, Theano, or CNTK in the high-level
• Creating massive models of deep learning in
Keras is reduced to single-line functions. But this
strategy makes Keras a less configurable
environment than low-level frameworks
• Keras model Serialization/Deserialization APIs,
callbacks, and data streaming using Python
generators are very mature
• Keras results in a much more readable and
succinct code

PyTorch
The PyTorch framework was developed for Facebook
services but is already used for its own tasks by
companies like Twitter and Salesforce.
• Unlike TensorFlow, the PyTorch library operates
with a dynamically updated graph. This means that
it allows you to make changes to the architecture
in the process
• In PyTorch, standard debuggers, for example, pdb
or PyCharm can be used
• PyTorch is much better suited for small projects
and prototyping. When it comes to cross-platform
solutions, TensorFlow looks like a more suitable
choice

Caffe2
Caffe supports many different types of deep learning
architectures geared towards image
classification and image segmentation such as CNN,
RCNN, LSTM and fully connected neural network
designs
• Caffe is being used in academic research projects,
startup prototypes, and even large-scale
industrial applications in vision, speech, and
multimedia
• Yahoo! has also integrated caffe with Apache
Spark to create CaffeOnSpark, a distributed deep
learning framework
• At the end of March 2018, Caffe2 was merged
into PyTorch

Sonnet
Sonnet deep learning framework built on top of
TensorFlow. It is designed to create neural networks
with a complex architecture by the world famous
company DeepMind.
• High-level object-oriented libraries that bring
about abstraction when developing neural
networks (NN) or other machine learning (ML)
algorithms
• The main advantage of Sonnet, is you can use it
to reproduce the research demonstrated in
DeepMind’s papers with greater ease than Keras,
since DeepMind will be using Sonnet themselves

MXNet
MXNet, as an Apache product, is very effective
framework for parallel on multiple GPUs and many
machines. This, in particular, has been demonstrated
by his work on Amazon Web Services
• The framework initially supports a large number
of languages (C ++, Python, R, Julia, JavaScript,
Scala, Go, and even Perl)
• Support of multiple GPUs (with optimized
computations and fast context switching)
• Fast problem-solving ability

Gluon
the Gluon supports work with a dynamic graph,
combining this with high-performance MXNet. From
this perspective, Gluon looks like an extremely
interesting alternative to Keras for distributed
computing
• Gluon is based on MXNet and offers a simple API
that simplifies the creation of deep learning
models
• Gluon enables to define neural network models
that are dynamic, meaning they can be built on
the fly, with any structure, and using any of
Python’s native control flow

CNTK
CNTK is one of the most widely known machine
learning frameworks in the market, which is
developed by Microsoft that features great
compatibility and effective use of computational
resources
• Microsoft Cognitive Toolkit (previously CNTK) is
a deep learning framework developed
by Microsoft Research
• CNTK support for CUDA 10
• CNTK contributes to ONNX development and
runtime.

Chainer
Until the advent of DyNet at CMU, and PyTorch at
Facebook, Chainer was the leading neural network
framework for dynamic computation graphs or nets
that allowed for input of varying length, a popular
feature for NLP tasks.
• Chainer is the first framework to use a dynamic
architecture model
• Better GPU & GPU data center performance than
TensorFlow. Recently, Chainer became the world
champion for GPU data center performance
• OOP like programming style

DL4J
Those who are on a short leg with Java or Scala
should pay attention to DL4J
• The process is supported by Hadoop and
Spark architectures
• Using Java allows you to use the library in the
development cycle of programs for Android
devices
• Training of neural networks in DL4J is carried out
in parallel through iterations through clusters

ONNX
The ONNX project was born from the collaboration
of Microsoft and Facebook as a search for an open
format for the presentation of deep learning models.
ONNX simplifies the process of transferring models
between different means of working with artificial
intelligence
• ONNX enables models to be trained in one
framework and transferred to another for
inference. ONNX models are currently supported
in Caffe2, Microsoft Cognitive Toolkit, MXNet, and
PyTorch, and there are connectors for many other
common frameworks and libraries

Which DLF Should You Use
• If you are just starting out and want to figure out
what’s what, the best choice is Keras
• For research purposes, choose PyTorch
• For production, you need to focus on the
environment. So, for Google Cloud, the best choice
is TensorFlow, for AWS — MXNet and Gluon.
• Android developers should pay attention to D4LJ, for
iOS, a similar range of tasks is compromised by Core
ML.
• Finally, ONNX will help with questions of interaction
between different frameworks.

DLF Statistics

Convolutional Neural
Network

Agenda
• Fashion MNIST Dataset
• CNN
• Softmax & Cross-entropy Loss
• Dropout
• Batch Normalization

ConvNet
• CNN is a class of deep neural networks, most
commonly applied to analyzing visual imagery
• Layers:
– Convolution layer + ReLu layer
– Pooling layer
– Flattening layer
– Full Connection layer
• Applications:
– Image and video recognition
– Recommender systems
– Image classification, medical image analysis
– Natural language processing

Fashion MNIST
It is a dataset comprised of 60,000
small square 28×28 pixel grayscale
images of items of 10 types of clothing,
such as:
0: T-shirt/top
1: Trouser
2: Pullover
3: Dress
4: Coat
5: Sandal
6: Shirt
7: Sneaker
8: Bag
9: Ankle boot

Fashion MNIST

ConvNet

Convolution Layer
Convolution Function:

Convolution Layer

Convolution Layer
For increasing non-linearity we are adding ReLu function

Convolution Layer

Pooling
• Average, Min, Max, Sum Pooling Layers

Max Pooling

Flattening

Full Connected Layer

Softmax & Cross-Entropy
• Softmax Activation Function
• Cross-Entropy Loss Function (Binary, Sparse)

Dropout Layer
• We don’t want model learn too much on training set

Batch Normalization

MiniBatch Normalization

Hands on Tensorflow

Tensorflow 2.0 and Building
CNN
https://colab.research.google.com/drive/18UGEAgkAFa
dNPK0f2b7t3UzrKlusnTxp

Django + Tensorflow

TF deploy with Django
https://github.com/ntgai/django-tensorflow-fashion/

Resources

Machine Learning Video Courses
• Coursera — Machine Learning (Andrew Ng)
• Coursera — Neural Networks for Machine Learning (Geoffrey Hinton)
• Udacity — Intro to Machine Learning (Sebastian Thrun)
• Udacity — Machine Learning (Georgia Tech)
• Udacity — Deep Learning (Vincent Vanhoucke)
• Machine Learning (mathematicalmonk)
• Practical Deep Learning For Coders (Jeremy Howard & Rachel Thomas)
• Stanford CS231n — Convolutional Neural Networks for Visual
Recognition (Winter 2016) (class link)
• Stanford CS224n — Natural Language Processing with Deep Learning
(Winter 2017) (class link)
• Oxford Deep NLP 2017 (Phil Blunsom et al.)
• Reinforcement Learning (David Silver)
• Practical Machine Learning Tutorial with Python (sentdex)

Machine Learning Blogs
• Andrej Karpathy
• i am trask
• Christopher Olah
• Top Bots
• WildML
• Distill
• Machine Learning Mastery
• FastML
• Adventures in NI
• Sebastian Ruder
• Unsupervised Methods
• Explosion
• Tim Dettmers
• When trees fall…
• ML@B

Machine Learning Theory
• Machine Learning, Stanford University
• Machine Learning, Carnegie Mellon University
• Machine Learning, MIT
• Machine Learning, California Institute of
Technology
• Machine Learning, Oxford University
• Machine Learning, Data School

Deep Learning Theory
• Deep Learning, Ian Goodfellow
• Neural Networks and Deep Learning
• Understanding LSTM Networks
• Deep Residual Learning

References

References
1. https://www.colorado.edu/chbe/2019/11/06/machine-learning-
technology-may-help-doctors-identify-and-treat-infections-
newborns
2. https://towardsdatascience.com/top-10-best-deep-learning-
frameworks-in-2019-5ccb90ea6de
3. https://towardsdatascience.com/deep-learning-framework-
power-scores-2018-23607ddf297a

Introduction to Machine Learning, Hands-on Deep Learning with Tensroflow 2.0

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Similar to Introduction to Machine Learning, Hands-on Deep Learning with Tensroflow 2.0

Similar to Introduction to Machine Learning, Hands-on Deep Learning with Tensroflow 2.0 (20)

Recently uploaded

Recently uploaded (20)

Introduction to Machine Learning, Hands-on Deep Learning with Tensroflow 2.0