The State of ML for iOS: On the Advent of WWDC 2018 🕯

The State of ML for iOS
On the Advent of WWDC 2018
Meghan Kane, @meghafon
NSLondon
May 2018

!
Hey, I'm Meghan!
@meghafon
iOS Engineer @ Novoda Berlin

!
big picture
"
when is it practical to use ML for iOS?
#
what's available to us?
$
end-to-end examples
!

barriers to entry?
1. A large dataset
2. Access to high end compute power
3. PhD in machine learning
4. All the time in the world
...nope!

Is it practical for my app?
image classification
audio classification
speech recognition
text classification
gesture recognition
optical character recognition (OCR)
translation
voice synthesis

embrace idea generation
& experimentation

machine learning is a powerful tool
but, it is still just another tool

how can we think about
ML as
!
developers?

Can this be solved without ML?
if so, choose that

ML vs not ML
basic unit of solving problem = function ("model")
ML: enabling a machine to learn function on its own
classify sign language alphabet images
not ML: explicitly deﬁning function
determining if a number is even/odd

If you decide to use ML
still go with the simplest solution

Why do ML (predictions) on mobile?
→ low latency user experience
→ user privacy

What's available from Apple?
image classiﬁcation of 1000 common categories
→ trees, animals, food, vehicles, people
→ SqueezeNet (5 MB), MobileNet (17 MB), Inception
V3 (95 MB), ResNet50 (103 MB), VGG16 (554 MB)
scene classiﬁcation of 205 categories
→ airport terminal, bedroom, forest, coast
→ Places205-GoogLeNet (25 MB)

If not, train custom ML model
step 1: use framework for training
TensorFlow, keras, Turi Create , Caffe, etc
⚠
warning, there are a lot of them
step 2: convert to .mlmodel format (OSS)
→  coremltools github.com/apple/coremltools
→ tf-coreml github.com/tf-coreml

beyond the cat/dog classiﬁer (TM)

End-to-end Process as a developer?
0. Deﬁne problem
1. Collect data
2. Train ML model
3. Convert to coreml .mlmodel
4. Import into Xcode project
5. Predict using Core ML (+Vision) framework

Mobile speciﬁc concerns
size of model
time it takes to run predictions
supported layers

0.Deﬁne problem
American Sign Language (ASL) alphabet classiﬁer

Quick Review: Deep Learning
neural network model with many layers
deep = many layers
-> deep neural network
Mobile Machine Learning 101: Glossary Jameson
Toole on Heartbeat blog

sometime way back in B.C.
people used to train deep neural network from
scratch

still some (more recent) time in B.C.
people stand on the shoulders of giants' work
utilizing transfer learning

enter.. transfer learning
!
use knowledge learned from source task (MobileNet)
--> to train target task (ASL classiﬁer)

Why Transfer Learning works
neural networks are universal approximators
in theory, they can approximate any function

how much data do i need?
depends on problem
just 100s images per category

where can i get it?
kaggle
google for them...
record video + extract frames (using e.g. FFmpeg)

what if i don't have enough?
data augmentation!
Deeplearning.ai: C4W2L10 Data Augmentation (~10
min video)

Let's start
training...
→ can we use Swift for Tensor
Flow?
→ for now, stick with regular
Tensor Flow

Performance
so how did our training go?
~20 min to run on my MacBook
95.3% accuracy on the test data

3. Convert
to .mlmodel
tf-coreml

4. Import into Xcode project
drag + drop

5. Predict using Core ML (+Vision)
framework
vision + core ml

audio classiﬁcation
0.Deﬁne problem
1. Gather data
2. Train ML model

0.Deﬁne problem
Audio classiﬁer of urban sounds
air conditioner, car horn, children playing, drilling,
siren, etc

1.Gather data
UrbanSound 8K open dataset
Urban Sound Datasets, NYU CUSP

should we use raw audio (.wav)?
no, it's too computationally expensive

convert wav to spectrogram
represent audio as image (3 dimensions)
1st dimension: time (x-axis)
2nd dimension: frequency (y-axis)
3rd dimension: sound intensity (color)

Performance
so how did our training go?
~1 hour to run on my MacBook
77.1% accuracy on the test data

Where to ﬁnd inspiration
look at open datasets
read research papers!
follow heartbeat blog, openAI

Reproduce results
research papers often include this
make sure to do the same if you publish
check licensing + attribute proper credit

Looking forward to the future
ML interpretability
swift for TensorFlow

Review
!
big picture
"
when is it practical to use ML for iOS?
#
what's available to us?
$
end-to-end examples

Attributions & Mentions (1/4)
Apple Machine Learning
WWDC 2017 Videos
TensorFlow for Poets Google codelabs tutorial
Apple coremltools GitHub repo
tf-coreml GitHub repo: TensorFlow->core ml
converter

Heartbeat by fritz.ai blog: Machine Learning at the
edge
ASL Datasets
Kaggle Sign Language MNIST
Urban Sound Datasets, NYU CUSP
deeplearning.ai course: Data Augmentation

Swift for TensorFlow GitHub repo
Dockerized Swift for TF GitHub repo, Alexis Gallager
themorningpaper by Adrian Colyer
OpenAI Research
"The Building Blocks of Interpretability" Google: C.
Olah et al

"Strategically Ignorant" Devon Zuegel
"Transfer Learning of Temporal Information for Driver
Action Classiﬁcation" J. Lemley et al
"Transfer Learning for Sound Classiﬁcation"
TataLab

Further Learning (1/3)
fast.ai Deep Learning course
My Udacity Core ML course
machinethink,
!
ML for iOS blog by Matthijs
Hollemans
TensorFlow Dev Summit 2018 Videos
TensorFlow playground

Building Mobile Apps w/ Tensor Flow Pete Warden
Neural Networks & Deep Learning Michael
Nielsen
Stanford's Computer Vision course (CS231n)

"Distilling the Knowledge in a Neural Network"
Geoffrey Hinton et al.
"Transfer Learning - Machine Learning's Next
Frontier"
!
Sebastian Ruder
"Transfer learning for music classiﬁcation and
regression tasks"
!
Keunwoo Choi et al.

Thank you
Keep in touch!
twitter: @meghafon

The State of ML for iOS: On the Advent of WWDC 2018 🕯

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Similar to The State of ML for iOS: On the Advent of WWDC 2018 🕯

Similar to The State of ML for iOS: On the Advent of WWDC 2018 🕯 (20)

Recently uploaded

Recently uploaded (20)

The State of ML for iOS: On the Advent of WWDC 2018 🕯