What is the problem?
What is RTIP (Real-Time Image Processing)?
Why are we making RTIP?
What are we going to do?
 Introduction
 Literature survey
 Requirements
 Difference between real-time and non-real-time processing
 General detection system
 System design
 Advantages
 Disadvantages
 Applications
 Future scope
 Definitions of RTIP in different senses
 References
Introduction
• What is real-time image processing?
• How does it differ from ordinary image processing?
• What is the need for real-time image processing?
Real-time processing
 Is continuous
 Also called interactive processing
 Has deadlines
 Has a predictable response time
 With soft real-time constraints, missing a deadline indicates that the system is not working at its peak
Non-real-time processing
 Is non-continuous
 Also called batch processing
 Does not have deadlines
 Is extended over a period of time
 In batch processing, missed deadlines might mean that the computer needs more processing capacity to finish its tasks
When the processing of sound is first calculated entirely and saved to an audio file (which can be listened to later), one speaks of non-real-time or offline synthesis.
When the stream of data goes directly to the audio interface as it is processed, so that there are only a few milliseconds between the processing and the listening of the synthesized sound, one speaks of real-time synthesis.
Real-Time Image Processing Platform
Requirements:
• High resolution, high frame rate video
input
• Low latency video input
• Low latency operating system scheduling
• High processing performance
 Image processing attempts to extract information from the outside world through its visual appearance. Therefore, adequate information must be provided to the processing algorithm by the video input hardware.
 Broadcast video provides a practical reference point, as most cameras provide images in formats derived from broadcast standards regardless of their computer interface (analog, USB, etc.).
1. Customers can see the results immediately.
2. It allows you to automate your business. This is especially important if your time is very limited.
3. Real-time image processing also helps eliminate customer errors.
4. Real-time image processing is fast.
5. Real-time image processing is continuous.
[System design block diagram: processing SYSTEM connected to a GSM MODULE]
• Binary images: pixel values in {0,1}
• Greyscale images: pixel values in [0,1]
• RGB (truecolour) images: an m-by-n-by-3 array, one channel each for red, green and blue (a short MATLAB sketch follows)
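A minimal MATLAB sketch of these three representations, assuming the sample image peppers.png that ships with MATLAB is available:

% The three basic image representations in MATLAB (sample image assumed available).
rgb = imread('peppers.png');       % truecolour: m-by-n-by-3 array
g   = rgb2gray(rgb);               % greyscale: m-by-n, single intensity channel
bw  = imbinarize(g);               % binary: m-by-n logical, values in {0,1} (im2bw on older releases)
figure;
subplot(1,3,1); imshow(rgb); title('RGB (truecolour)');
subplot(1,3,2); imshow(g);   title('Greyscale');
subplot(1,3,3); imshow(bw);  title('Binary');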
 IMAGE ACQUISITION TOOLBOX
 IMAGE PROCESSING TOOLBOX
 COMPUTER VISION TOOLBOX
 Introduction
 Image Representation – Integral Image
 Learning Algorithm – AdaBoost
 Cascade Classifier
 Experiment and Result
Object Detection
Detection System
Introduction
The Viola-Jones object detection framework was proposed by Paul Viola and Michael J. Jones.
Introduction
The detector scans the image with a 24x24 detection window; the size of the detection window is enlarged by a certain scale at each pass.
Image Representation: Integral Image and Integral Image Features
• Image representation
• Feature
• The integral image at location (x,y) contains the sum of the pixels above and to the left of (x,y), i.e. the sum of the pixel values in the area whose bottom-right corner is (x,y).
How do we calculate the sum of the pixels in region D?
Consider four adjacent rectangular regions A, B, C and D, and the integral image values 1, 2, 3 and 4 taken at their bottom-right corners: value 1 is the sum over A, value 2 the sum over A+B, value 3 the sum over A+C, and value 4 the sum over A+B+C+D. Therefore
D = (1 + 4) - (2 + 3),
so the sum over any rectangle can be computed with just four array references (see the sketch below).
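A minimal MATLAB sketch of the integral image and the four-corner region sum described above; the helper name regionSum and the example coordinates are illustrative, not taken from the slides:

% Integral image and constant-time rectangle sums (helper name/coordinates illustrative).
I  = double(imread('cameraman.tif'));        % any greyscale sample image
ii = zeros(size(I) + 1);                     % pad with a leading row/column of zeros
ii(2:end, 2:end) = cumsum(cumsum(I, 1), 2);  % ii(y+1,x+1) = sum of pixels above and left of (x,y)
% Sum over the rectangle covering rows r1..r2 and columns c1..c2,
% using the four-corner rule D = (1 + 4) - (2 + 3):
regionSum = @(r1, c1, r2, c2) ii(r2+1, c2+1) + ii(r1, c1) - ii(r1, c2+1) - ii(r2+1, c1);
s = regionSum(10, 10, 33, 33);               % sum over a 24x24 window with four array reads
disp(s)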
Integral image features
Within the detection window, a feature consists of two (or more) adjacent rectangular regions; the figure shows four example feature layouts, labelled A, B, C and D, inside the detection window.
The value of an integral image feature is the difference between the sums of the pixels within the two rectangular regions: |S1 - S2|.
The size and position of a feature can differ (a short sketch of evaluating such a feature follows).
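Reusing the integral image and the regionSum helper from the previous sketch, a two-rectangle Haar-like feature is simply the difference of two adjacent region sums; the coordinates here are illustrative:

% Two-rectangle feature: difference of the pixel sums of two stacked rectangles.
S1 = regionSum(10, 10, 21, 33);      % upper rectangle
S2 = regionSum(22, 10, 33, 33);      % lower rectangle
featureValue = abs(S1 - S2);         % |S1 - S2| as on the slide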
• The eye region is darker than the upper cheeks.
• The nose-bridge region is brighter than the eyes.
• The many possible sizes and positions of the rectangles give an enormous number of candidate features.
• How do we obtain the best representative features?
• How can we avoid wasting time on the image background (i.e. non-object regions)?
A feature that separates faces from non-faces well is a good feature; one that does not is a poor feature.
AdaBoost: building a strong classifier from weak classifiers
Weak classifier
• A weak classifier consists of just one feature (with a threshold θ).
• It is a feature that can separate the image data with an error rate of less than 0.5.
• Example: three candidate features with thresholds θ1, θ2 and θ3 give error rates of 1/10 = 0.1, 6/10 = 0.6 and 3/10 = 0.3 on ten samples; features 1 and 3 therefore qualify as weak classifiers, while feature 2 does not.
• A set of features/weak classifiers is combined into one strong classifier:

h_{\text{strong}}(\mathbf{x}) =
\begin{cases}
1 & \text{if } \sum_{n=1}^{K} \alpha_n h_n(\mathbf{x}) \ge \tfrac{1}{2} \sum_{n=1}^{K} \alpha_n \\
0 & \text{otherwise}
\end{cases}

where h_n is the n-th weak classifier and α_n its voting weight.
• How do we build the strong classifier? With AdaBoost.
AdaBoost notation
• T = the number of features (boosting rounds) we choose
• h(x, f, p, θ) = a weak classifier
• f = feature
• θ = threshold
• p = polarity indicating the direction of the inequality (less than or greater than θ)
(A small decision-stump sketch follows.)
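A minimal MATLAB sketch of how such a weak classifier h(x, f, p, θ) could be evaluated; the variable names and the example numbers are illustrative, not from the slides:

% Decision stump: h = 1 if p*f(x) < p*theta, otherwise 0 (names/values illustrative).
weakClassify = @(fvals, p, theta) double(p .* fvals < p .* theta);
fvals = [0.2 0.9 0.4 0.7];            % feature f evaluated on four examples
p     = 1;                            % polarity: "less than theta" counts as positive
theta = 0.5;                          % threshold
h     = weakClassify(fvals, p, theta) % -> [1 0 1 0]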
 AdaBoost starts with a uniform distribution of "weights" over the training examples.
 Select the classifier with the lowest weighted error (i.e. a "weak" classifier).
 Increase the weights on the training examples that were misclassified.
 (Repeat.)
 At the end, carefully make a linear combination of the weak classifiers obtained at all iterations (a sketch of this loop follows).
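A compact MATLAB sketch of this boosting loop, assuming a precomputed matrix of feature values and an assumed stump-fitting helper bestStump; all names are illustrative, not the authors' implementation:

% Minimal discrete AdaBoost sketch (as used by Viola-Jones).
function [alpha, stumps] = adaboostTrain(F, y, T)
    % F: (numExamples x numFeatures) matrix of Haar feature values
    % y: column vector of labels in {0,1};  T: number of boosting rounds
    m = size(F, 1);
    w = ones(m, 1) / m;                 % start with uniform weights
    alpha  = zeros(T, 1);
    stumps = cell(T, 1);
    for t = 1:T
        w = w / sum(w);                 % normalise the weights
        % bestStump is an assumed helper: it searches features, thresholds and
        % polarities, returning the stump with lowest weighted error 'err' and
        % its predictions 'h' on the training set.
        [stumps{t}, err, h] = bestStump(F, y, w);
        beta     = err / (1 - err);
        alpha(t) = log(1 / beta);       % voting weight of this weak classifier
        correct  = double(h == y);
        w = w .* beta .^ correct;       % shrink weights of correctly classified
    end                                 % examples, i.e. up-weight the mistakes
end
% Final strong classifier (see the formula above):
% H(x) = 1  if  sum_t alpha_t * h_t(x) >= 0.5 * sum(alpha),  0 otherwise.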
 1 1 1
strong
1
1 ( ) ( )
( ) 2
0 otherwise
n n nh h
h

        
 

x x
x
K K
Slide taken from a presentation by Qing Chen, Discover Lab,
University of Ottawa
• By increasing the number of features per classifier, we:
◦ increase detection accuracy;
◦ decrease detection speed.
• Experiments showed that a 200-feature classifier makes a good face detector:
◦ it takes 0.7 seconds to scan a 384 by 288 pixel image.
• Problem: not real time! (At most about 0.067 seconds per frame is available.)
Cascade of Classifiers
Cascade Classifier
• The classifier is structured as several layers (a hierarchy).
• Each layer consists of one strong classifier (a set of weak classifiers/features).
• Every input window is evaluated, starting at the first layer.
• Non-face windows are rejected immediately.
• Only "suspected" face windows continue to the next layer for further processing.
• The number of features/weak classifiers in each layer is based on the false positive rate and detection rate targets determined by the user.
• Earlier layers have simpler classifiers than later layers.
[Cascade flow diagram: an input image window passes through Stage 1, Stage 2, Stage 3, ...; a "Y" (pass) at each stage sends the window on, eventually to further processing, while an "N" (fail) at any stage means the input is ignored/rejected immediately.]
[Example: Stage 1 uses 2 features; windows that pass (Y) go on to Stage 2, which uses more features; windows that fail (N) are rejected.]
A sketch of this cascaded evaluation follows.
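A minimal MATLAB sketch of the cascade idea: each stage is a strong classifier and a window is rejected as soon as any stage says "no". The stage representation and the evalStump helper are assumptions, not the authors' implementation:

% Cascaded evaluation: reject a window as soon as one stage fails.
% 'stages' is an assumed cell array; stages{k} holds the alphas and stumps of
% the k-th strong classifier (e.g. from the AdaBoost sketch above).
function isFace = cascadeClassify(window, stages)
    isFace = true;
    for k = 1:numel(stages)
        s = stages{k};
        score = 0;
        for t = 1:numel(s.alpha)
            % evaluate the t-th weak classifier on this window (assumed helper)
            score = score + s.alpha(t) * evalStump(window, s.stumps{t});
        end
        if score < 0.5 * sum(s.alpha)   % stage threshold (strong-classifier rule)
            isFace = false;             % early rejection: most windows stop here
            return;
        end
    end
end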
• Training set
• Faces: 4916 labelled frontal faces
• Testing / real-time test
• Faces: MIT + CMU frontal face test set
• Contains 130 images with 507 labelled frontal faces
Cascade classifier vs. non-cascade classifier: the cascade classifier is nearly 10x faster.
They use the MIT+CMU dataset with 5 images removed.
Voting: three Viola-Jones classifiers with different parameters are run and the majority vote decides, e.g.
+ + - : Face
- - + : Non-face
(A short voting sketch follows.)
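A minimal MATLAB sketch of this majority vote over three detector decisions; the example votes are illustrative:

% Majority vote over three detectors run on the same window.
votes  = [true true false];          % e.g. "+ + -" from the three classifiers
isFace = sum(votes) >= 2;            % two or more positive votes -> face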
• Informal observation suggests that the face detector can detect faces tilted up to about ±15 degrees in plane and about ±45 degrees out of plane.
• Harsh backlighting, in which the faces are very dark while the background is relatively light, sometimes causes failures.
• The proposed algorithm will also fail on significantly occluded faces.
 imaqfind
 imaqhwinfo
 imaqreset
 stop
 videoinput
 imaq.VideoDevice
(A short acquisition example using these functions follows.)
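A minimal sketch of the Image Acquisition Toolbox workflow; the adaptor name 'winvideo' and device ID 1 are assumptions that depend on the installed camera:

imaqhwinfo                            % list the available adaptors/devices
vid = videoinput('winvideo', 1);      % create a video input object
preview(vid);                         % open a live preview window
frame = getsnapshot(vid);             % grab a single frame
closepreview(vid);                    % close the preview
delete(vid); clear vid;               % release the device
% imaqreset can be used to reset the toolbox if a device gets stuck
imshow(frame);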
 imread      Read image from graphics file
 imwrite     Write image to graphics file
 imfinfo     Information about graphics file
 gray2ind    Convert grayscale or binary image to indexed image
 ind2gray    Convert indexed image to grayscale image
 mat2gray    Convert matrix to grayscale image
 rgb2gray    Convert RGB image or colormap to grayscale
 imshow      Display image
(A short sketch using several of these functions follows.)
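A minimal sketch tying these functions together; it uses the peppers.png sample image shipped with MATLAB, so the filename is an assumption:

info = imfinfo('peppers.png')          % metadata about the file
rgb  = imread('peppers.png');          % read an RGB image
g    = rgb2gray(rgb);                  % RGB -> grayscale
[xi, map] = gray2ind(g, 256);          % grayscale -> indexed image + colormap
g2   = ind2gray(xi, map);              % indexed -> grayscale again
m    = mat2gray(double(g));            % scale an arbitrary matrix into [0,1]
imwrite(g, 'peppers_gray.png');        % write the grayscale image to disk
imshow(m);                             % display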
 detector = vision.CascadeObjectDetector
 imread - Read an image into MATLAB.
 imshow - Show an image in a figure.
 imsave - Save the image (interactively).
 imwrite(image,'filename.type') - Save the image to a file.
 imageinfo - Show image information.
 imaqhwinfo - Information about available image acquisition hardware.
 videoinput - Create a video input object.
 vision.CascadeObjectDetector - Detect objects using the Viola-Jones algorithm.
 preview - Show the camera preview.
 getsnapshot - Take a snapshot.
 step - Apply the algorithm (System object) to an image.
 insertObjectAnnotation - Insert an annotation on the detected object.
a = videoinput('winvideo', 1);                 % create a video input object (webcam 1)
preview(a);                                    % open a live preview window
i = getsnapshot(a);                            % grab a single frame
imshow(i);                                     % display the captured frame
FaceDetector = vision.CascadeObjectDetector(); % Viola-Jones face detector
BBox = step(FaceDetector, i);                  % bounding boxes of detected faces
qw = insertObjectAnnotation(i, 'rectangle', BBox, 'face');  % draw labelled boxes
imshow(qw)                                     % show the annotated image
% Show only the blue component: zero the red and green columns of the colormap
[a, b] = imread('kids.tif');   % indexed image 'a' and its colormap 'b'
b(:, 1:2) = 0;
imshow(a, b)
impixelinfo
figure

% Show only the green component: zero the red and blue columns
[a, b] = imread('kids.tif');
b(:, 1) = 0;
b(:, 3) = 0;
imshow(a, b)
impixelinfo
figure

% Show only the red component: zero the blue and green columns
[a, b] = imread('kids.tif');
b(:, 3) = 0;
b(:, 2) = 0;
imshow(a, b)
impixelinfo
figure

% Show the image with its original colormap
[a, b] = imread('kids.tif');
imshow(a, b)
impixelinfo
clc
clear
a = videoinput('winvideo', 2);           % second webcam (device ID depends on the system)
fd = vision.CascadeObjectDetector;       % Viola-Jones face detector
i = getsnapshot(a);                      % grab one frame
b = step(fd, i);                         % bounding boxes of detected faces
ia = insertObjectAnnotation(i, 'rectangle', b, 'detected faces');
imshow(ia);
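The snippet above processes a single snapshot. A hedged sketch of a continuous, real-time version could look like this; the device ID, loop length and figure handling are assumptions:

% Continuous (real-time) face detection: repeatedly grab a frame, run the
% detector, and update the display. Close the figure or press Ctrl+C to stop.
vid = videoinput('winvideo', 1);
fd  = vision.CascadeObjectDetector;
for k = 1:200                                          % roughly a few seconds of video
    frame = getsnapshot(vid);                          % latest frame from the camera
    bbox  = step(fd, frame);                           % detect faces in this frame
    out   = insertObjectAnnotation(frame, 'rectangle', bbox, 'face');
    imshow(out);                                       % update the display
    drawnow;                                           % flush graphics so it feels live
end
delete(vid); clear vid;                                % release the camera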