Implementing Vision DL Solution using Open Source Tools

Unearth the Journey of
Implementing Vision based
Deep Learning Solution using
Open Source Tools
Neethu Elizabeth Simon
Software Engineer
Intel Corporation
Phoenix, AZ

Java
Developer
C#
Developer
IoT Solutions Engineer
My Journey

https://www.youtube.com/watch?v=nwPtcqcqz00

Dried Chili Peppers Cinnamon Sticks
Solution Data Set

Implementation
• Build Model
• Improve Models
to meet
performance &
accuracy
objectives
• Deploy Model
• Make Predictions
by applying the
Trained Model
on new Data
• Label Data• Data
Acquisition &
Organization
• Preprocessing
Collect Annotation
TrainingInference

Implementation – Data Collection
Collect Annotation
TrainingInference
 Data Acquisition & Organization
• Web Crawling/Scraping - Download pictures &
videos
Python Packages Request, Parcel, Beautiful Soup
Frameworks/Tools Scrapy, Pyspider, Mechanical Soup

Collect Annotation
TrainingInference
 Data Acquisition & Organization
• Web Crawling/Scraping - Download pictures &
videos
• Live Video Recording/Image Capture from actual
retail store
Python Packages Request, Parcel, Beautiful Soup
Frameworks/Tools Scrapy, Pyspider, Mechanical Soup

Collect Annotation
TrainingInference
 Pre-processing
• Data Cleanup
• Remove Bad/Unrelated Data

Implementation – Data Annotation
Collect Annotation
TrainingInference
 Label Data
• Establish ground truth

Implementation – Data Annotation
Collect Annotation
TrainingInference
 Open Source Tools
• Bbox - https://github.com/puzzledqs/BBox-Label-Tool
• LabelImg - https://github.com/tzutalin/labelImg
• VGG Image Annotator (VIA)
http://www.robots.ox.ac.uk/~vgg/software/via/
• Computer Vision Annotation Tool (CVAT)
https://github.com/opencv/cvat
 Paid Tools – Labelbox, Dataloop, RectLabel

Implementation – Training
Collect Annotation
TrainingInference
dried_chilli_pepper
cinnamon_stick
Lots of
Labeled Data !!!
Model Weights
Forward
Backward

Collect Annotation
TrainingInference
Data
• Type/Amount – Visual, Sensors
• Labeled Datasets – PASCAL VOC, COCO, KITTI, ImageNet or
ILSVRC, SUN
• Custom Data

Collect Annotation
TrainingInference
Compute Platform
• Complexity Determined by Data
• Cloud vs Edge
• Intel® Xeon vs Core vs Atom
Data
ILSVRC, SUN
• Custom Data
Intel
Core +
GPU

Collect Annotation
TrainingInference
Object Detection
Algorithm
• YOLO – You Only Look Once
• SSD, R-CNN, Fast R-CNN, Faster RCNN, R-FCN
Compute Platform
• Cloud vs Edge
Data
ILSVRC, SUN
• Custom Data
Intel
Core +
GPU
YOLO

Collect Annotation
TrainingInference
Deep Learning
Framework
• DarkNet - C & CUDA - https://pjreddie.com/darknet/
• DarkFlow - Tensorflow implementation of DarkNet
https://github.com/thtrieu/darkflow
• Others – PyTorch, Caffe
Object Detection
Algorithm
Compute Platform
• Cloud vs Edge
Data
ILSVRC, SUN
• Custom Data
YOLO
Intel
Core +
GPU
DarkFlow

Collect Annotation
TrainingInference
Deep Learning
Framework
• DarkNet - C & CUDA - https://pjreddie.com/darknet/
• DarkFlow - Tensorflow implementation of DarkNet
https://github.com/thtrieu/darkflow
• Others – PyTorch, Caffe
Object Detection
Algorithm
Compute Platform
• Cloud vs Edge
Data
ILSVRC, SUN
• Custom Data
YOLO
Intel
Core +
GPU
DarkFlow
Images - 1000 Images
Training Time - 10 hrs
Model Size – 800 MB
Average loss – ~0.9

Collect Annotation
TrainingInferencedried_chilli_pepper
cinnamon_stick
Lots of
Labeled Data !!!
Model Weights
Forward
Backward
python flow --train --model cfg/yolov2-2c.cfg --load bin/yolov2.weights --annotation all_Spice_Annotations/ --dataset all_Spice_Images/
DarkFlow

Implementation – Inference
Collect Annotation
TrainingInference
Model Weights
Forward
dried_chilli_pepper
Edge Device – Laptop/NUC/Mobile
????
Intel® OpenVINO Toolkit
https://software.intel.com/en-
us/openvino-toolkit
Open Source Tool for Inference
Support Different Models
(Caffe/Tensorflow/MXNet/ONNX/Kaldi)
Model Optimization
Speed Deploymentpython flow --imgdir all_test_images/ --model cfg/yolov2-2c.cfg --load new_model_weights
DarkFlow

Results
[{"label": "cinnamon_stick", "confidence": 0.87, "topleft":
{"x": 42, "y": 22}, "bottomright": {"x": 208, "y": 113}}]
[{"label": "cinnamon_stick", "confidence": 0.06, "topleft":
{"x": 93, "y": 216}, "bottomright": {"x": 384, "y": 402}}]

Results
[{"label": "dried_chilli_pepper", "confidence": 0.95, "topleft": {"x": 0,
"y": 25}, "bottomright": {"x": 761, "y": 352}}]
[{"label": "dried_chilli_pepper", "confidence": 0.02, "topleft":
{"x": 357, "y": 93}, "bottomright": {"x": 513, "y": 446}}, {"label":
"cinnamon_stick", "confidence": 0.03, "topleft": {"x": 21, "y":
6}, "bottomright": {"x": 568, "y": 472}}]

Implementation Challenges
• More is Better
• Object Orientation, angle,
lighting
• Remove Bad/Unrelated
Data
• Require SME knowledge
to define bad data &
remove unrelated data
Collect Annotation
TrainingInference

[{"label": "cinnamon_stick", "confidence": 0.87, "topleft": {"x": 42, "y": 22}, "bottomright": {"x": 208, "y": 113}}]

• Manual vs
Automation
• Time consuming
• Expensive
• Difficult to Scale
• More is Better
• Object Orientation,
angle, lighting
Data
Collect Annotation
TrainingInference

• Iterative Learning
• Varied Data
parameters &
tuning
• Hyperparamter
optimization & data
overfitting
• High Compute
• Manual vs
Automation
• Time consuming
• Expensive
• More is Better
angle, lighting
Data
Collect Annotation
TrainingInference

• Iterative Learning
• Varied Data
parameters &
tuning
• Hyperparamter
optimization & data
overfitting
• High Compute
• Constantly Update
Models
• Complexity in Hardware
design – combination of
CPU/GPU/Accelerators
• Data Privacy & Security
• Manual vs
Automation
• Time consuming
• Expensive
• More is Better
angle, lighting
Data
Collect Annotation
TrainingInference

Neethu Elizabeth Simon
Software Engineer
Intel Corporation
Phoenix, AZ

Implementing Vision DL Solution using Open Source Tools

Recommended

Recommended

More Related Content

What's hot

What's hot (19)

Similar to Implementing Vision DL Solution using Open Source Tools

Similar to Implementing Vision DL Solution using Open Source Tools (20)

More from Women in Analytics Conference

More from Women in Analytics Conference (10)

Recently uploaded

Recently uploaded (20)

Implementing Vision DL Solution using Open Source Tools