Practical Introduction to AI, Deep Learning, and Large Scale Image Analytics

PRACTICAL INTRODUCTION TO
ARTIFICIAL INTELLIGENCE
DEEP LEARNING
LARGE-SCALE IMAGE ANALYTICS
KEVIN MADER / FLAVIO TROLESE
4QUANT | BIG IMAGE ANALYTICS
PANTALK
TUESDAY, MARCH 19 2016 / IMPACTHUB GARAGE ZURICH

Die Länder, die Österreich
umgeben.
↓
Was sind Schweiz, Italien,
Slowenien, Ungarn,
Tschechische Republik,
Deutschland, Slovakei?

http://4quant.com/javascript-breakout/
4Quant | BIG IMAGE
ANALYTICS

HOW?
4Quant | BIG IMAGE
ANALYTICS

STANDARD MACHINE LEARNING
4Quant | BIG IMAGE
ANALYTICS

CORE IDEAS
What is an image?
What a human sees What a machine sees
4Quant | BIG IMAGE
ANALYTICS

CORE IDEAS
Feature Generation → Making the computer see more
4Quant | BIG IMAGE
ANALYTICS

CORE IDEAS
Training / Validation
With all machine learning techniques it
is critical to divide data into training
and validation sets.
The algorithm can then be tested
(validated) on data it has never seen
before to ensure it generalizes
4Quant | BIG IMAGE
ANALYTICS

OUTPUT / LOSS FUNCTION
4Quant | BIG IMAGE
ANALYTICS
The in order for machine learning to work there has to be a
single output for the system which quantifies how well it is
working
- the number of correctly identified structures (true-
positives)
- the number of correct letters in a sentences
- the score of a game

CORE IDEAS
Learning from tagged data (supervised)
4Quant | BIG IMAGE
ANALYTICS
What is this?

PROBLEMS
Features can be very difficult
to ‘engineer’.
What makes a person a
person?
More data doesn’t always
lead to better results.
4Quant | BIG IMAGE
ANALYTICS

DEEP LEARNING
4Quant | BIG IMAGE
ANALYTICS

One of these can recognize without any
programming by just experiencing and
getting feedback.
THE IDEA
4Quant | BIG IMAGE
ANALYTICS
https://flic.kr/p/2eryEj

The human brain is a large, layered,
connected network of neurons.
THE IDEA
4Quant | BIG IMAGE
ANALYTICS
https://flic.kr/p/5J4uci

We understand how some of these
layers work and can make
computationally fast models for
simulating their behavior
THE IDEA
4Quant | BIG IMAGE
ANALYTICS

DEEP LEARNING
Deep learning is a set of algorithms in machine learning that
attempt to learn in multiple levels, corresponding to different
levels of abstraction.
4Quant | BIG IMAGE
ANALYTICS

THE IDEA
4Quant | BIG IMAGE
ANALYTICS
A machine learning system with millions of inputs
And 1 output

THE IDEA
4Quant | BIG IMAGE
ANALYTICS
The networks can get very large (hence the deep)
Here is the Inception Network from Google

TYPES OF ARTIFICIAL NEURONS
4Quant | BIG IMAGE
ANALYTICS
Fully-connected → everything connected to everything
Convolutional (CNN) → mix things together
Recurring (RNN) → remember parts of sequences

Recurring Networks
4Quant | BIG IMAGE
ANALYTICS
http://karpathy.github.io/2015/05/21/rnn-effectiveness/

Recurring Networks
4Quant | BIG IMAGE
ANALYTICS
http://karpathy.github.io/2015/05/21/rnn-effectiveness/
Given the starting letter h
Predict the rest of the letters

SCHWIIZERDÜTSCH
4Quant | BIG IMAGE
ANALYTICS
Goes through hundreds of
pages of text character by
character and trains
neurons to predict the
correct output

The text shows the algorithm learning to complete the
sentence.
The curve shows how confident it is in each guess
SCHWIIZERDÜTSCH
4Quant | BIG IMAGE
ANALYTICS

100 gu sisxt n eigeiua a esSWctaicobemhat,E out?s v t t uew
10K Uhe uf Hountigm don d’Bomura fürsyn al jerisim Sbeour Rucch
65K Übschamt wiänä wo und ebs haGscham, üblart uls zä flusch, zänsert.
De Unner sindämzalagsel
100K Totatwärt. Dischtä Tittä vo dä ues und erwiä Gsacht agä
schtüswongeilä. Beterischtiongehärne vordä em Verbichunt. Diä
Mieräng ader h d Zientlichnig vu CHF
4Quant | BIG IMAGE
ANALYTICS
SCHWIIZERDÜTSCH

Spell/Grammar Check (for a language with ‘no rules’)
Dialect Detector
Autocomplete
APPLICATIONS
4Quant | BIG IMAGE
ANALYTICS

Automatic C code
Wikipedia Text
BEYOND SCHWIIZERDÜTSCH
4Quant | BIG IMAGE
ANALYTICS

CONVOLUTIONAL NETWORKS
4Quant | BIG IMAGE
ANALYTICS
Pixels Edges Object parts Object models
→ → →

4Quant | BIG IMAGE
ANALYTICS

Street, Trees, Fence, Bicycle
UNDERSTANDING
COMPLEX SCENES
4Quant | BIG IMAGE
ANALYTICS

Self-driving cars
need to be able to
identify walkways
automatically
All point geo-
referenced
IDENTIFY WALKWAYS
4Quant | BIG IMAGE
ANALYTICS

Understanding what is happening
inside of these complex networks
DREAMING
4Quant | BIG IMAGE
ANALYTICS

4Quant | BIG IMAGE
ANALYTICS
DREAMING

Applying parts of trained networks to other types of images.
TRANSFER
4Quant | BIG IMAGE
ANALYTICS

A challenging field
- noisy
- highly variable
- many tissues / diseases look
the same
MEDICAL IMAGES
4Quant | BIG IMAGE
ANALYTICS

Red are lungs
Yellow are bones
Blue are the other organs
MEDICAL IMAGES
4Quant | BIG IMAGE
ANALYTICS

FINDING CANCER
4Quant | BIG IMAGE
ANALYTICS

MEDICAL IMAGES
4Quant | BIG IMAGE
ANALYTICS
Convolutional neurons act on the image and learn to
extract the relevant information

MEDICAL IMAGES
4Quant | BIG IMAGE
ANALYTICS
These representations can then be used to automatically
find organs like the heart and measure blood flow
→ →

Open Challenges
4Quant | BIG IMAGE
ANALYTICS
You need a lot of data to identify (1K-100M)
Some networks learn well, others do not
Parameters can make a huge difference
Intermediate layers can be difficult to interpret

RESOURCES
Google Cloud Vision API
IBM Watson Vision Service
Microsoft Project Oxford
TensorFlow
Caffe
Moodstocks
4Quant

PRACTICAL INTRODUCTION TO
ARTIFICIAL INTELLIGENCE
DEEP LEARNING
LARGE-SCALE IMAGE ANALYTICS
THANK YOU
KEVIN MADER / FLAVIO TROLESE
4Quant Ltd.
PANTALK
THURSDAY MARCH 19 2016 / IMPACTHUB GARAGE ZURICH
4Quant| BIG IMAGE ANALYTICS

Practical Introduction to AI, Deep Learning, and Large Scale Image Analytics

Recommended

Recommended

More Related Content

Viewers also liked

Viewers also liked (17)

Similar to Practical Introduction to AI, Deep Learning, and Large Scale Image Analytics

Similar to Practical Introduction to AI, Deep Learning, and Large Scale Image Analytics (20)

Recently uploaded

Recently uploaded (20)

Practical Introduction to AI, Deep Learning, and Large Scale Image Analytics

Editor's Notes