利用影像匹配進行物件辨識與對位

Image Matching for Object Recognition
and Alignment
Chin-Sheng Chen

2015/10/19 2
Commercial Machine Vision Library
MIL
VisionPro

 Image retrieval
 Industrial inspection
 Target search
 Automatic pick and place
2015/10/19 3
2D/ 3D Image Matching for Object Recognition
“perfect” object recognition :
Real-time, High accuracy, General, Robustness,
Multi-object detection

VIDEO – Single object
2015/10/19 4

VIDEO – Two objects
2015/10/19 5

VIDEO – 3D object recognition
2015/10/19 6

1
2
x
y
θ
(x1, y1)
(x2, y2)
Template A
Template A
Search image S
Template image P
T
Recognition of 2D rigid objects
2015/10/19 7
Template
Search image

Similarity measure
1 Simialrity coefficient 1  
Perfect matching  Similarity coefficient =1
minimum difference
2015/10/19 8
 Sum of Absolute Differences (SAD)
 Sum of Squared Differences (SSD)
 Normalized cross correlation (NCC)
 Shape-based matching (SBM)

 Sum of Absolute Differences (SAD)
 Sum of Squared Differences (SSD)
 Matching result : Minimum difference
Similarity measure
2015/10/19 9
/2 /2
/2 /2
( , ) ( , ) ( , )
w h
i w j h
SAD x y I x i y j T i j
 
    
/2 /2
2
/2 /2
( , ) ( , ) ( , )
w h
i w j h
SAD x y I x i y j T i j
 
    
Image I
Image T

The size of the template image
and are the gray-level averages of the template image and the
windowed compared image.
Similarity measure
/2 /2
/2 /2
/2 /2 /2 /2
2 2 2 2
/2 /2 /2 /2
[ ( , ) ( , )]
( , ) ,
( , ) ( , )
w h
I T
i w j h
g
w h w h
I T
i w j h i w j h
I x i y j T i j w h
x y
I x i y j w h T i j w h
 

 
 
   
      

   
           
   
 
   
w h
Tu Iu
/2 /2
/2 /2
1
( , )
w h
T
i w j h
T i j
w h

 


 
/2 /2
/2 /2
1
( , )
w h
I
i w j h
I x i y j
w h

 
  

 
2015/10/19 10

The size of the template image
and are the color-level averages of the template image and the
windowed compared image.
Similarity measure
w h
Tu Iu
/2 /2
/2 /2
/2 /2 /2 /2
2 22 2
/2 /2 /2 /2
[ ( , ) ( , )] 3
( , ) ,
( , ) 3 ( , ) 3
w h
I T
i w j h
c
w h w h
I T
i w j h i w j h
I x i y j T i j w h
x y
I x i y j w h T i j w h
 

 
 
   
       

   
             
   
 
   
 R G B( , ) ( , ), ( , ), ( , )T i j T i j T i j T i j
 R G B( , ) ( , ), ( , ), ( , )I x i y j I x i y j I x i y j I x i y j        
/2 /2
R G B
/2 /2
1
[ ( , ) ( , ) ( , )]
3
w h
T
i w j h
T i j T i j T i j
w h

 
  
 
 
/2 /2
R G B
/2 /2
1
[ ( , ) ( , ) ( , )]
3
w h
I
i w j h
I x i x j I x i x j I x i x j
w h

 
        
 
 
2015/10/19 11

 Shape-based matching (SBM)
Similarity measure
  , ,
2 2 2 2
1 1 , ,
1 1
, ,i i i i
i i i i
n n
i x r y c i x r y ci i
eg
i ii i i i x r y c x r y c
t v u w
x y
n n t u v w

   
     

 
 
 
d e
d e
( , )T
i i it ud
, ,( , )i i i i
T
i r c r cv we
( , )T
i i ir cq
gradients direction vector of template image
gradients direction vector of scene image
edge points, these points are relative to the center of gravity
and the edge points of the object.
2015/10/19 12
Picture Ref: Halcon

Category Features Approaches Disadvantages
Feature-based
Edge maps
Interest point
Invariant descriptors
Orientation code
Hausdorff distance
Feature correspondence
Zernike moment
Dissimilarity measurement
Ring-projection
Inaccurate
feature
extraction
Area-based Intensity Cross correlation
Excessive
computation
time
Table. Summary of image alignment methods.
2015/10/19 13

Category Strategy Improving methods
Area-based
Computation
enhancement
Integral image
GPU
SIMD
Skipping
unnecessary
computation
Coarse-to-fine
Bounded conditions
Elimination strategy
Winner update
Walsh-Hadamard kernels
Feature-based
Computation
enhancement
SIMD
Dominant gradients orientation
Skipping
unnecessary
computation
Branch and bound
Invariant descriptor
Bounded conditions
Table. Summary of efficiency improving methods in image alignment methods.
2015/10/19 14

Optimal (Coarse-to-fine) :
 Image pyramid technique
 SIMD
 Search strategy
Template Image Input Image
Matching Results
Threshold of
Similarity Score
User
Defined
Detected Range of
Rotation Angle
Offline Model Generation
Image Pyramid Technique
Model Gengeration
Online Matching Process
Object Selection Process
, 0, 1,..., 1m l
lT l n 
l
n
m
lT
Sub-pixel Accuracy Estimation
minTh
min maxand 
Optimal Matching
Process
Detected Range of
Scaling
min maxandS S
, 0, 1,..., 1m l
lI l n 
T I
Detected Number
of objects
max
Obj
and
training data
2015/10/19 15
pyramid level

 Pyramid levels: as high as possible.
 Top level: recognizable.
Level 0 1 2 3 4
2015/10/19 16
Inspection
image
Level 0
Level 1
Level 2
Search region

 Search strategy
Inspection
image
Level 0
Level 1
Level 2
Search region
Level 2
Level 1
Level 0
2,1obj
1,1obj 1,2obj 1,3obj 1,4obj
0,{1,...,4}obj
0,5obj 0,6obj 0,7obj 0,8obj
0,{13,...,16}obj0,{9,...,12}obj
2,1obj
1,{1, , ,42 3 }obj
0,{5, , ,86 7 }obj
Yellow: search candidates
Red: with maximum similarity score and it exceeds the
threshold of similarity minTh
2015/10/19 17

 SIMD (Single Instruction Multiple Data )
• SSE2 (Streaming SIMD Extensions 2)
• 8 registers, XMM0 to XMM7, each with a width of 16 bytes
__m128i _T
__m128i _I
T0 T1
0 16 32
...
12811296
T6 T7
I0 I1
0 16 32
...
12811296
I6 I7
__m128i _sum T0+I0
0 32
...
12896
...
16

T1+I1
 
T6+I6 T7+I7
112
__m128i _T
__m128i _I
T0 T1
0 16 32
...
12811296
T6 T7
I0 I1
0 16 32
...
12811296
I6 I7
__m128i _mul T0 I0
0 32
...
12896
...
16
T1 I1
 
T6 I6 T7 I7
112

   
__m128i _T
__m128i _I
T0 T1
0 16 32
...
12811296
T6 T7
I0 I1
0 16 32
...
12811296
I6 I7
__m128i _madd T0× I0+T1× I1
0 32
...
12896
T6× I6+T7× I7
...

   

 _ _ 16 _ , __ _ mm add epiu Ts Im   _ _ 1 __ 6_ , _mm mullo epi T Imul 
 _ _ _ 16_ _ , _mm madd epi Td Imad 
For instance, the size of the template and
compared windows images are pixel8 8
Instruction
cycles
SSE2 469
Without SSE2 3736
8 times
2015/10/19 18

Template Image Input Image
Matching Results
Threshold of
Similarity Score
User
Defined
Detected Range of
Rotation Angle
Model Gengeration
, 0, 1,..., 1m l
lT l n 
l
n
m
lT
Sub-pixel Accuracy Estimation
minTh
min maxand 
Optimal Matching
Process
Detected Range of
Scaling
min maxandS S
, 0, 1,..., 1m l
lI l n 
T I
Detected Number
of objects
max
Obj
and
training data
2015/10/19 19

↓2
Level
Level
Level 0
Search Image
↓2 Similarity
Coefficient
Calculation
2D Rigid
Transformation
2D Rigid
Transformation
Exhaustive
Search
Candidate Extraction and Ranking
Detected Range of
Rotation Angle
and Scaling
Threshold of
Similarity Score
Track Track
2D Rigid
Transformation
2D Rigid
Transformation
First
Tracking
Track Track
2D Rigid
Transformation
2D Rigid
Transformation
Final
Tracking
Sub-pixel Pose Estimation
Candidate Ranking
Threshold of
Similarity Score
Matching
Results
Local Maximum Local Maximum
Results
Similarity
Coefficient
Calculation
Detected Range of
Rotation Angle
and Scaling
Detected Range of
Rotation Angle
and Scaling
max max max max
( , , , )ix y S   
1l
n 
2l
n 
Inspection
image
Level 0
Level 1
Level 2
Search region
2015/10/19 20

Tracking
Model Pyramid
Similarity
Coefficient
Calculation
Similarity
Coefficient
Calculation
Candidate
Transformed
Image Pyramid
Maximum Extraction
Candidate
max max max max , , ,
( , , , ) arg max ( , , , ),
[ , ], [ , ],
[ ' 2 , ' 2 ]
[ , ]
x y s
x x y y
x y s x y s
x x t x t y y t y t
s s s s s
   

  
    

        
    
     
 Tracking
2015/10/19
Inspection
image
Level 0
Level 1
Level 2
Search region
21

↓2
Level
Level
Level 0
Search Image
↓2 Similarity
Coefficient
Calculation
2D Rigid
Transformation
2D Rigid
Transformation
Exhaustive
Search
Detected Range of
Rotation Angle
and Scaling
Threshold of
Similarity Score
Track Track
2D Rigid
Transformation
2D Rigid
Transformation
First
Tracking
Track Track
2D Rigid
Transformation
2D Rigid
Transformation
Final
Tracking
Sub-pixel Pose Estimation
Candidate Ranking
Threshold of
Similarity Score
Matching
Results
Local Maximum Local Maximum
Results
Similarity
Coefficient
Calculation
Detected Range of
Rotation Angle
and Scaling
Detected Range of
Rotation Angle
and Scaling
max max max max
( , , , )ix y S   
1l
n 
2l
n 
Inspection
image
Level 0
Level 1
Level 2
Search region
2015/10/19 22

 Sub-pixel estimation for NCC and SBM
2 2 2
0 1 2 3 4 5 6 7 8 9( , , )x y k x k y k k xy k x k y k x k y k k              
x
y

1
0
1
101
1
0
1 x
y

Maximum
of * * *
( , , )x y 
3 3 3  neighborhood space
0 0
0 0
0 0 0 0
[ 1, 1],
[ 1, 1],
[ ' , ' ]
x x x
y y y
    
   
   
    
1*
0 3 4 6
*
3 1 5 7
*
4 5 2 8
2
2
2
x k k k k
y k k k k
k k k k

     
           
         
2015/10/19 23

VIDEO - APPLICATION
2015/10/19 24

Methods Strength weakness
Stereo vision
Simple and inexpensive
High accuracy on well-defined targets
Computational burden
Limited to well defined scenes
Low data acquisition rata
Laser triangulation
Relative simplicity
High data acquisition rate
High accuracy
Safety constrains, if laser based
Limited range
Missing data in correspondence
with occlusions and shadows
High cost
Structured light
High accuracy
Safety constrains, if laser based
Missing data in correspondence
with occlusions and shadows
High cost
Time of Flight
Medium to large measurement range
High cost
Recognition of 3D objects
2015/10/19 25

Methods Data Strength weakness
View based
CAD model
or
model image
Efficiency
Robust to occlusions, Clutter
and contrast changes
Low cost
Limited accuracy on
viewpoints
Point Cloud based Point clouds
High accuracy
Robust to occlusions and
clutter
3D sensor required
High Computation complexity
High cost
Table. Summarized of 3D pose estimation techniques.
2015/10/19 26

 Perspective shape-based matching
Recognition of perspective objects
Image coordinate
Camerea coordinate
Object plane
coordinate
Point on plane
P
Q
extM
projectionM
R
C
1
r
c
 
 
 
  
Pixel coordinate
Pixel point
affineM
0
1
p
q
 
 
 
 
 
 
X
Y
Z
x
y
U
V
W
,
1
1
affine projection ext
u
r
v
c
w
 
   
      
    
 
M M M
11 12 13
21 22 23
31 32 33
11 12
21 22
31 32
0 0 0
0 0 0
0
1 0 0 1 0
0 0 0 1 1
0 0
0 0 ,
0 0 1 1
x
y
z
x
y
z
r r r t p
x f
r r r t q
y f
r r r t
f r r t p
f r r t q
r r t
   
       
              
          
   
     
           
          
𝐑 𝐓
11
21
31
x
r
r
r
 
   
  
r
12
22
32
y
r
r
r
 
   
  
r
( , , ) ( ) ( ) ( )x y z x x y y z z     R R R R
2015/10/19 27

 Perspective shape-based matching
, ,
2 2 2 2
1 , ,
1
( , , , , ) ,i i i i
i i i i
n
i x r y c i x r y c
perspective x y z
i i i x r y c x r y c
t v u w
x y
n t u v w
   
      
       
 

  

cos( ) sin( )
,
sin( ) cos( )
i iz z
i iz z
t t
u u
 
 
     
         
cos cos cos sin sin cos sin1 0
,
cos sin cos cos sin sin sin0 1
y z z x y x zi i
y z x z x y zi i
r r
c c
      
      
      
             
( , )T
i i it ud
, ,( , )i i i i
T
i r c r cv we
( , )T
i i ir cq
gradients direction vector of template image
gradients direction vector of scene image
edge points, these points are relative to the center of gravity
and the edge points of the object.
, ,
2 2 2 2
1 , ,
1
( , , , ) ,i i i i
i i i i
n
i x r y c i x r y c
eg
i i i x r y c x r y c
t v u w
x y s
n t u v w
 
      
       
 

  

cos( ) sin( )
sin( ) cos( )
i i
i i
t ts s
u us s
 
 
       
          
cos( ) sin( )
sin( ) cos( )
i i
i i
r rs s
c cs s
 
 
       
          
2015/10/19 28
orthogonal projection
weak-perspective projection

Optimal (Coarse-to-fine) :
 SIMD
 Search strategy
Template Image
Input Image
Matching Results
Threshold of
Similarity Score
User
Defined
Detected Range of
Rotation Angle
Model Gengeration
, 0, 1,..., 1m l
lT l n 
l
n
minTh
_ min _ max
_ min _ max
_ min _ max
and ,
and ,
and
x x
y y
z z
 
 
 
Optimal Matching
Process
, 0, 1,..., 1m l
lI l n 
T
I
Detected Number
of objects max
Obj
, ,,l i l ip d
,l ie
2015/10/19 29

 Rotation requirements
(roll)x
(pitch)y
(yaw)z
Image Plane x
y
z
Camera
(pitch)y
(yaw)z
x
Original Image Plane
Rotated Image Plane
d
l
Camera
(roll)x
(yaw)z
yOriginal Image Plane
Rotated Image Plane
d
l
Camera
1 1
cosx y
l
l
    
     
 
1
2
1
cos 1
2
z
l
   
   
 
2015/10/19 30

Inspection
image
Level 0
Level 1
Level 2
Search region
↓2
Level
Level
Level 0
Search Image
↓2
Similarity
Coefficient
Calculation
3D Rigid
Transformation
3D Rigid
Transformation
Exhaustive
Search
Detected Range of
Rotation Angle
Threshold of
Similarity Score
Track Track
3D Rigid
Transformation
3D Rigid
Transformation
First
Tracking
Track Track
3D Rigid
Transformation
3D Rigid
Transformation
Final
Tracking
Candidate Ranking
Threshold of
Similarity Score
Matching
Results
Results
Similarity
Coefficient
Calculation
Detected Range of
Rotation Angle
Detected Range of
Rotation Angle
max max max max max_ _ _( , , , , )x y z ix y      
1l
n 
2l
n 
2015/10/19 31

max max max max max_ _ _
, , ,
( , , , , ) arg max ( , , , , ),
[ , ], [ , ],
[ ' 2 , ' 2 ],
[ ' 2 , ' 2 ],
[ ' 2 , ' 2 ]
x y z x y z
x y s
x x y y
x x x x x
y y y y y
z z z z z
x y x y
x x t x t y y t y t
    

      
    
    
    

        
    
    
    
 Tracking
Tracking
Model Pyramid
Similarity
Coefficient
Calculation
Similarity
Coefficient
Calculation
Candidate
3D Transformed
Image Pyramid
Maximum Extraction
Candidate
2015/10/19 32

2015/10/19 33

 Stereo vision
2015/10/19 34
l r
fb
Z
x x


Ref: Stefano Mattoccia - http://vision.deis.unibo.it/~smatt/Seminars/StereoVision.pdf

 Stereo vision
2015/10/19 35
Camera Calibration
(offline)
Rectification
Stereo Correspondence
Triangulation
Calibration is carried out acquiring and
processing 10+ stereo pairs of a known
pattern (typically a checkerboard)
 Intrinsic parameters of the two cameras
(focal length, image center, parameters
of lenses distortion, etc)
 Extrinsic parameters
(R and T that aligns the two cameras)
Left
i
j
i
j
..(i,j) (i,j)
Right

 Laser triangulation
2015/10/19 36
Ref: http://www.vision-doctor.co.uk/laser-illumination/principle-of-triangulation.html

 Structured light
2015/10/19 37

VIDEO-Structured light
2015/10/19 38

 Kinect
2015/10/19 39

 Random pattern projection
2015/10/19 40
Camera
Laser RPP

 Point Cloud
2015/10/19 41

 Point Cloud Library (PCL)
2015/10/19 42
•Module common
•Module features
•Module filters
•Module geometry
•Module io
•Module kdtree
•Module keypoints
•Module octree
•Module outofcore
•Module recognition
•Module registration
•Module sample_consensus
•Module search
•Module segmentation
•Module surface
•Module visualization
The Point Cloud Library (or PCL) is a large scale,
open project for point cloud processing, started by
Willow Garage, with the purpose to accelerate 3D
algorithmic work in perception for use in robotic
applications.
Ref: http://www.pointclouds.org/

 Point Cloud Library (PCL) – Point Feature Histogram (PFH)
2015/10/19 43
1D
2D
3D
4D
5D
6D
9D
7D
8D
qD
[D D ] Ut sV   
W U V 
sD
tD
U
W
V
tN
sN U



t sD D
sU N
( )t s
t s
D D U
V
D D
 


W U V 
acos( )tV N  
t sd D D 
acos( )t sD D
U
d


 
atan( , )t tW N U N   

 Point Cloud Library (PCL) – Point Feature Histogram (PFH)
2015/10/19 44
1
2 4
1
13
4
=
(s , ) 2
atan( , )
t
t s i
i
i it s
i
t t
f V N
f D D
idx step fD D
f U
d
f W N U N



 

  
 
  

   

0, if >
( , )
1, if
f s
step s f
f s



1D
2D
3D
4D
5D
6D
9D
7D
8D
qD
0 1 2 3 … 15
count

 Point Cloud Library (PCL) – Fast Point Feature Histogram (FPFH)
2015/10/19 45
1D
4D
6D
7D
9D
qD
5D
3D
8D
2D
1
1 1
( ) ( ) ( )
k
q q k
i k
FPFH D SPFH D SPFH D
k w
  
1
2 4
1
13
4
=
(s , ) 2
atan( , )
t
t s i
i
i it s
i
t t
f V N
f D D
idx step fD D
f U
d
f W N U N



 

  
 
  

   


 Point Cloud Library (PCL) – PFH- FPFH
2015/10/19 46
1D
4D
6D
7D
9D
qD
5D
3D
8D
2D
1D
2D
3D
4D
5D
6D
9D
7D
8D
qD
PFH FPFH

 Point Cloud Library (PCL) – registration
2015/10/19 47
R
T
R
T
Phase1 SAC-IA >>>
Phase2 ICP >>>
Template
pose
Search
Sample Consensus Initial Alignment (SAC-IA)
Iteration Closest Point (ICP)

利用影像匹配進行物件辨識與對位

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Similar to 利用影像匹配進行物件辨識與對位

Similar to 利用影像匹配進行物件辨識與對位 (20)

More from CHENHuiMei

More from CHENHuiMei (20)

Recently uploaded

Recently uploaded (20)

利用影像匹配進行物件辨識與對位