Image Recognition App Dataset Preparation

Image Recognition
Applications & Dataset Preparation

Have you seen this before?
• Facebook app: covers violent or nude images
AppChief

• Moments app: suggests sharing photos with recognized Facebook friends

• Photos (iOS and Android): search for photo content

• Sticky app: cuts out a person from an image

What is image recognition?
It is the ability of software to identify objects, places, people, writing, and actions in images.

Why do I need it?
Labeling the content of images with meta-tags, performing image content search, and guiding autonomous robots, self-driving cars, accident avoidance systems, etc.

How do I build my own app?
Next slides

How do I build my own app?
1. PREPARE DATASET
2. TRAIN THE MODEL
3. BUILD AND RUN
…
…

PREPARE DATASET
What do you need?
Classify an image? or Detect multiple objects inside an image?

PREPARE DATASET
Image recognition types
IMAGE CLASSIFICATION & OBJECT DETECTION

PREPARE DATASET
IMAGE CLASSIFICATION vs OBJECT DETECTION

IMAGE CLASSIFICATION
input : Image
output : Class label
types :
• CLASSIFICATION (class label only)
• CLASSIFICATION + LOCALIZATION (+ bounding box)

OBJECT DETECTION
input : Image
output : Class labels
types :
• OBJECT DETECTION (+ bounding boxes)
• INSTANCE SEGMENTATION (+ segmentation)
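The difference between the two outputs can be sketched as plain data structures. This is only an illustration: the class names, the box layout (x, y, width, height), and the sample values are assumptions made for the example, not part of any particular library.

```python
from dataclasses import dataclass, field
from typing import List, Tuple


@dataclass
class ClassificationResult:
    """Image classification: one class label for the whole image."""
    label: str


@dataclass
class DetectedObject:
    """One detected object: a class label plus its bounding box."""
    label: str
    box: Tuple[float, float, float, float]  # assumed layout: x, y, width, height


@dataclass
class DetectionResult:
    """Object detection: any number of labeled boxes per image."""
    objects: List[DetectedObject] = field(default_factory=list)


# Hypothetical results for the same photo.
classified = ClassificationResult(label="banknote")
detected = DetectionResult(objects=[
    DetectedObject(label="banknote", box=(6, 120, 150, 370)),
    DetectedObject(label="banknote", box=(200, 40, 120, 60)),
])

print(classified.label)
print([obj.box for obj in detected.objects])
```

A classifier can only say what the image is, while a detector also says where each object is, which is why detection datasets need box annotations.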


We will continue with OBJECT DETECTION.

Let's create a money reader model
STEP 1. Naming objects (object labels)
PREPARE DATASET / OBJECT DETECTION
1 IQD_50000_ar
2 IQD_50000_en
3 IQD_25000_ar
4 IQD_25000_en
5 IQD_10000_ar
6 IQD_10000_en
7 IQD_5000_ar
8 IQD_5000_en
9 IQD_1000_ar
10 IQD_1000_en
11 IQD_500_ar
12 IQD_500_en
13 IQD_250_ar
14 IQD_250_en
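A naming scheme like this is easy to generate instead of typing by hand. The sketch below rebuilds the 14 labels from the denominations and the two faces; the helper name `build_labels` and its parameters are assumptions made for illustration.

```python
# Denominations and faces taken from the label list above.
DENOMINATIONS = [50000, 25000, 10000, 5000, 1000, 500, 250]
FACES = ["ar", "en"]


def build_labels(denominations, faces, currency="IQD"):
    """Return one label per (denomination, face) pair, e.g. IQD_50000_ar."""
    return [
        f"{currency}_{value}_{face}"
        for value in denominations
        for face in faces
    ]


labels = build_labels(DENOMINATIONS, FACES)
for number, label in enumerate(labels, start=1):
    print(number, label)
```

Generating the labels keeps spelling consistent between the capture step, the annotations, and the trained model's class names.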

STEP 2. Take as many photos of each money face AS YOU CAN
For better accuracy, take hundreds of photos (about 300 for each money face) with
• Different backgrounds
• Different positions
• Different light conditions
• Different orientations

STEP 3. Labeling objects inside images
Objects :
Object #1
Label : IQD_50000_en
x : 6
y : 120
width : 150
height : 370

STEP 3. Labeling objects inside images
Objects :
Object #1
Label : IQD_50000_en
x : 90
y : 125
width : 313
height : 313

CAUTION : Training libraries differ in the coordinate system they expect for labels:
• X, Y, Width, Height
• midX, midY, Width, Height
• minX, minY, maxX, maxY
It's recommended to check the requirements of the library you want to use for training before you start labeling.
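Converting between the three systems is simple arithmetic. The sketch below assumes that in the first system (X, Y) is the top-left corner of the box and that the Y axis grows downward; the function names are made up for this example.

```python
def corner_to_center(x, y, width, height):
    """(X, Y, Width, Height) -> (midX, midY, Width, Height)."""
    return (x + width / 2, y + height / 2, width, height)


def center_to_corner(mid_x, mid_y, width, height):
    """(midX, midY, Width, Height) -> (X, Y, Width, Height)."""
    return (mid_x - width / 2, mid_y - height / 2, width, height)


def corner_to_min_max(x, y, width, height):
    """(X, Y, Width, Height) -> (minX, minY, maxX, maxY)."""
    return (x, y, x + width, y + height)


# The second labeling example above: x=90, y=125, width=313, height=313.
box = (90, 125, 313, 313)
print(corner_to_center(*box))
print(corner_to_min_max(*box))
```

If labels are stored in one system, a conversion like this can be applied at export time, so the photos do not have to be labeled again for a different library.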

How many photos do you think we need for each label?
10 ?
20 ?
50 ?
100 ?
200 ?
300 ?

Assuming 300 photos per label
is good for our model,
let's calculate the time required
300 images x 14 labels x (5 sec taking photo + 30 sec labeling)
= 147,000 sec, about 41 hours !!! Almost 2 days of non-stop work !
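An estimate like this can be checked with a few lines of code. The sketch assumes that every image costs the capture time plus the labeling time, so the two per-image times are added; the function name is hypothetical.

```python
PHOTOS_PER_LABEL = 300
LABEL_COUNT = 14
CAPTURE_SECONDS = 5     # time to take one photo
LABELING_SECONDS = 30   # time to label one photo


def manual_effort_hours(photos_per_label, label_count,
                        capture_seconds, labeling_seconds):
    """Total hands-on time in hours for capturing and labeling every image."""
    total_images = photos_per_label * label_count
    total_seconds = total_images * (capture_seconds + labeling_seconds)
    return total_seconds / 3600


hours = manual_effort_hours(PHOTOS_PER_LABEL, LABEL_COUNT,
                            CAPTURE_SECONDS, LABELING_SECONDS)
print(f"{hours:.1f} hours for {PHOTOS_PER_LABEL * LABEL_COUNT} images")
```

Changing the constants shows how quickly the effort grows with more labels or more photos per label.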

We made a time-saving app
Only 1 hour

Easy Dataset
1. Create labels
2. Capture
3. Generate / Transfer

Demo time
Easy Dataset

Final live app
Money Reader - قارئ العملة

Final dataset
Iraqi Money (العملة العراقية)
Object detection dataset for Iraqi currency
kaggle.com/husamaamer/iraqi-currency-
~1 GB

MODEL TRAINING
import os
import turicreate as tc

# Define the annotations of all images with bounding box details (only 1 is shown)
annotations = tc.SArray([
    [{
        "label": "5000ar",
        "type": "rectangle",
        "coordinates": {"y": 188.5, "x": 207, "width": 304, "height": 152}
    }],
    # … , … … (annotations for the remaining images)
])
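Turi Create's object detector documents `x` and `y` in `coordinates` as the center of the box, not its top-left corner. If the labeling tool stores a top-left corner instead (an assumption about the STEP 3 values, used here only for illustration), each box has to be shifted by half its size first. A minimal sketch with a hypothetical helper name:

```python
def make_annotation(label, left, top, width, height):
    """Build one rectangle annotation dict from a top-left based box.

    The output layout mirrors the annotation shown above; the input
    layout (left, top, width, height) is an assumption of this sketch.
    """
    return {
        "label": label,
        "type": "rectangle",
        "coordinates": {
            "x": left + width / 2,   # center x
            "y": top + height / 2,   # center y
            "width": width,
            "height": height,
        },
    }


# Box values from the second STEP 3 example.
example = make_annotation("IQD_50000_en", 90, 125, 313, 313)
print(example["coordinates"])
```

Each image then gets a list of such dicts, one per object, which is the shape the `annotations` array above expects.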

# 1. Load images (Note: you can ignore 'Not a JPEG file' errors)
data = tc.image_analysis.load_images('mr_turi_ic', with_path=True)
data['label'] = data['path'].apply(lambda path: os.path.basename(os.path.dirname(path)))
data['annotations'] = tc.SArray(data=annotations, dtype=list)

# Make a train-test split
train_data, test_data = data.random_split(0.8)
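A random split changes on every run unless it is seeded, which makes results hard to compare. `random_split` accepts a `seed` argument for this. The idea can be sketched without Turi Create; the version below is a plain-Python stand-in, not the library's implementation, and `split_indices` is a hypothetical helper.

```python
import random


def split_indices(n_items, train_fraction=0.8, seed=42):
    """Shuffle item indices with a fixed seed and cut them into train/test."""
    rng = random.Random(seed)          # private generator, repeatable per seed
    indices = list(range(n_items))
    rng.shuffle(indices)
    cut = int(n_items * train_fraction)
    return indices[:cut], indices[cut:]


# 300 photos x 14 labels = 4200 images.
train_idx, test_idx = split_indices(4200)
print(len(train_idx), len(test_idx))
```

With the same seed the same images end up in the test set, so a change in accuracy reflects the model and not the split.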

# Create a model using Turi Create's object detector API
model = tc.object_detector.create(train_data, max_iterations=1000)
# Save the predictions to an SArray
predictions = model.predict(test_data)
# Evaluate the model and save the results into a dictionary
metrics = model.evaluate(test_data)
print('Precision' , metrics['mean_average_precision'])
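Mean average precision is built on intersection over union (IoU): a predicted box counts as correct only if it overlaps a ground-truth box of the same label strongly enough. The sketch below computes IoU for center-based boxes in the same layout as the annotations; the predicted box is a made-up value for illustration.

```python
def to_min_max(box):
    """Convert a center-based box dict to (min_x, min_y, max_x, max_y)."""
    half_w = box["width"] / 2
    half_h = box["height"] / 2
    return (box["x"] - half_w, box["y"] - half_h,
            box["x"] + half_w, box["y"] + half_h)


def iou(box_a, box_b):
    """Intersection over union of two center-based boxes (0.0 to 1.0)."""
    a_min_x, a_min_y, a_max_x, a_max_y = to_min_max(box_a)
    b_min_x, b_min_y, b_max_x, b_max_y = to_min_max(box_b)
    # Overlap along each axis, clamped to zero when the boxes do not touch.
    inter_w = max(0.0, min(a_max_x, b_max_x) - max(a_min_x, b_min_x))
    inter_h = max(0.0, min(a_max_y, b_max_y) - max(a_min_y, b_min_y))
    intersection = inter_w * inter_h
    union = (box_a["width"] * box_a["height"]
             + box_b["width"] * box_b["height"] - intersection)
    return intersection / union if union > 0 else 0.0


truth = {"x": 207, "y": 188.5, "width": 304, "height": 152}
predicted = {"x": 217, "y": 188.5, "width": 304, "height": 152}  # hypothetical
print(round(iou(truth, predicted), 3))
```

A common convention is to accept a detection when IoU is at least 0.5, so a box that is off by a few pixels still counts as a hit.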

# Save the model for later use in Turi Create
model.save('turi_ic.model')
# Export a Core ML model file to the current directory
model.export_coreml('turi_ic.mlmodel')
