SlideShare a Scribd company logo
1 of 48
Image Matching for Object Recognition
and Alignment
Chin-Sheng Chen
2015/10/19 2
Commercial Machine Vision Library
MIL
VisionPro
 Image retrieval
 Industrial inspection
 Target search
 Automatic pick and place
2015/10/19 3
2D/ 3D Image Matching for Object Recognition
“perfect” object recognition :
Real-time, High accuracy, General, Robustness,
Multi-object detection
VIDEO – Single object
2015/10/19 4
VIDEO – Two objects
2015/10/19 5
VIDEO – 3D object recognition
2015/10/19 6
1
2
x
y
θ
(x1, y1)
(x2, y2)
Template A
Template A
Search image S
Template image P
T
Recognition of 2D rigid objects
2015/10/19 7
Template
Search image
Similarity measure
1 Simialrity coefficient 1  
Perfect matching  Similarity coefficient =1
minimum difference
2015/10/19 8
 Sum of Absolute Differences (SAD)
 Sum of Squared Differences (SSD)
 Normalized cross correlation (NCC)
 Shape-based matching (SBM)
 Sum of Absolute Differences (SAD)
 Sum of Squared Differences (SSD)
 Matching result : Minimum difference
Similarity measure
2015/10/19 9
/2 /2
/2 /2
( , ) ( , ) ( , )
w h
i w j h
SAD x y I x i y j T i j
 
    
/2 /2
2
/2 /2
( , ) ( , ) ( , )
w h
i w j h
SAD x y I x i y j T i j
 
    
Image I
Image T
 Normalized cross correlation (NCC)
The size of the template image
and are the gray-level averages of the template image and the
windowed compared image.
Similarity measure
/2 /2
/2 /2
/2 /2 /2 /2
2 2 2 2
/2 /2 /2 /2
[ ( , ) ( , )]
( , ) ,
( , ) ( , )
w h
I T
i w j h
g
w h w h
I T
i w j h i w j h
I x i y j T i j w h
x y
I x i y j w h T i j w h
 

 
 
   
      

   
           
   
 
   
w h
Tu Iu
/2 /2
/2 /2
1
( , )
w h
T
i w j h
T i j
w h

 


 
/2 /2
/2 /2
1
( , )
w h
I
i w j h
I x i y j
w h

 
  

 
2015/10/19 10
 Normalized cross correlation (NCC)
The size of the template image
and are the color-level averages of the template image and the
windowed compared image.
Similarity measure
w h
Tu Iu
/2 /2
/2 /2
/2 /2 /2 /2
2 22 2
/2 /2 /2 /2
[ ( , ) ( , )] 3
( , ) ,
( , ) 3 ( , ) 3
w h
I T
i w j h
c
w h w h
I T
i w j h i w j h
I x i y j T i j w h
x y
I x i y j w h T i j w h
 

 
 
   
       

   
             
   
 
   
 R G B( , ) ( , ), ( , ), ( , )T i j T i j T i j T i j
 R G B( , ) ( , ), ( , ), ( , )I x i y j I x i y j I x i y j I x i y j        
/2 /2
R G B
/2 /2
1
[ ( , ) ( , ) ( , )]
3
w h
T
i w j h
T i j T i j T i j
w h

 
  
 
 
/2 /2
R G B
/2 /2
1
[ ( , ) ( , ) ( , )]
3
w h
I
i w j h
I x i x j I x i x j I x i x j
w h

 
        
 
 
2015/10/19 11
 Shape-based matching (SBM)
Similarity measure
  , ,
2 2 2 2
1 1 , ,
1 1
, ,i i i i
i i i i
n n
i x r y c i x r y ci i
eg
i ii i i i x r y c x r y c
t v u w
x y
n n t u v w

   
     

 
 
 
d e
d e
( , )T
i i it ud
, ,( , )i i i i
T
i r c r cv we
( , )T
i i ir cq
gradients direction vector of template image
gradients direction vector of scene image
edge points, these points are relative to the center of gravity
and the edge points of the object.
2015/10/19 12
Picture Ref: Halcon
Category Features Approaches Disadvantages
Feature-based
Edge maps
Interest point
Invariant descriptors
Orientation code
Hausdorff distance
Feature correspondence
Zernike moment
Dissimilarity measurement
Ring-projection
Inaccurate
feature
extraction
Area-based Intensity Cross correlation
Excessive
computation
time
Recognition of 2D rigid objects
Table. Summary of image alignment methods.
2015/10/19 13
Recognition of 2D rigid objects
Category Strategy Improving methods
Area-based
Computation
enhancement
Integral image
GPU
SIMD
Skipping
unnecessary
computation
Coarse-to-fine
Bounded conditions
Elimination strategy
Winner update
Walsh-Hadamard kernels
Feature-based
Computation
enhancement
SIMD
Dominant gradients orientation
Skipping
unnecessary
computation
Branch and bound
Invariant descriptor
Bounded conditions
Table. Summary of efficiency improving methods in image alignment methods.
2015/10/19 14
Optimal (Coarse-to-fine) :
 Image pyramid technique
 SIMD
 Search strategy
Recognition of 2D rigid objects
Template Image Input Image
Matching Results
Threshold of
Similarity Score
User
Defined
Detected Range of
Rotation Angle
Offline Model Generation
Image Pyramid Technique
Model Gengeration
Online Matching Process
Image Pyramid Technique
Object Selection Process
, 0, 1,..., 1m l
lT l n 
l
n
m
lT
Sub-pixel Accuracy Estimation
minTh
min maxand 
Optimal Matching
Process
Detected Range of
Scaling
min maxandS S
, 0, 1,..., 1m l
lI l n 
T I
Detected Number
of objects
max
Obj
and
training data
2015/10/19 15
pyramid level
 Image pyramid technique
 Pyramid levels: as high as possible.
 Top level: recognizable.
Recognition of 2D rigid objects
Level 0 1 2 3 4
2015/10/19 16
Inspection
image
Level 0
Level 1
Level 2
Search region
 Search strategy
Recognition of 2D rigid objects
Inspection
image
Level 0
Level 1
Level 2
Search region
Level 2
Level 1
Level 0
2,1obj
1,1obj 1,2obj 1,3obj 1,4obj
0,{1,...,4}obj
0,5obj 0,6obj 0,7obj 0,8obj
0,{13,...,16}obj0,{9,...,12}obj
2,1obj
1,{1, , ,42 3 }obj
0,{5, , ,86 7 }obj
Yellow: search candidates
Red: with maximum similarity score and it exceeds the
threshold of similarity minTh
2015/10/19 17
 SIMD (Single Instruction Multiple Data )
• SSE2 (Streaming SIMD Extensions 2)
• 8 registers, XMM0 to XMM7, each with a width of 16 bytes
Recognition of 2D rigid objects
__m128i _T
__m128i _I
T0 T1
0 16 32
...
12811296
T6 T7
I0 I1
0 16 32
...
12811296
I6 I7
__m128i _sum T0+I0
0 32
...
12896
...
16

T1+I1
 
T6+I6 T7+I7
112
__m128i _T
__m128i _I
T0 T1
0 16 32
...
12811296
T6 T7
I0 I1
0 16 32
...
12811296
I6 I7
__m128i _mul T0 I0
0 32
...
12896
...
16
T1 I1
 
T6 I6 T7 I7
112

   
__m128i _T
__m128i _I
T0 T1
0 16 32
...
12811296
T6 T7
I0 I1
0 16 32
...
12811296
I6 I7
__m128i _madd T0× I0+T1× I1
0 32
...
12896
T6× I6+T7× I7
...

   

 _ _ 16 _ , __ _ mm add epiu Ts Im   _ _ 1 __ 6_ , _mm mullo epi T Imul 
 _ _ _ 16_ _ , _mm madd epi Td Imad 
For instance, the size of the template and
compared windows images are pixel8 8
Instruction
cycles
SSE2 469
Without SSE2 3736
8 times
2015/10/19 18
Recognition of 2D rigid objects
Template Image Input Image
Matching Results
Threshold of
Similarity Score
User
Defined
Detected Range of
Rotation Angle
Offline Model Generation
Image Pyramid Technique
Model Gengeration
Online Matching Process
Image Pyramid Technique
Object Selection Process
, 0, 1,..., 1m l
lT l n 
l
n
m
lT
Sub-pixel Accuracy Estimation
minTh
min maxand 
Optimal Matching
Process
Detected Range of
Scaling
min maxandS S
, 0, 1,..., 1m l
lI l n 
T I
Detected Number
of objects
max
Obj
and
training data
2015/10/19 19
Recognition of 2D rigid objects
↓2
Level
Level
Level 0
Search Image
↓2 Similarity
Coefficient
Calculation
2D Rigid
Transformation
2D Rigid
Transformation
Exhaustive
Search
Candidate Extraction and Ranking
Detected Range of
Rotation Angle
and Scaling
Threshold of
Similarity Score
Track Track
2D Rigid
Transformation
2D Rigid
Transformation
First
Tracking
Track Track
2D Rigid
Transformation
2D Rigid
Transformation
Final
Tracking
Sub-pixel Pose Estimation
Candidate Ranking
Threshold of
Similarity Score
Matching
Results
Local Maximum Local Maximum
Results
Similarity
Coefficient
Calculation
Detected Range of
Rotation Angle
and Scaling
Detected Range of
Rotation Angle
and Scaling
max max max max
( , , , )ix y S   
1l
n 
2l
n 
Inspection
image
Level 0
Level 1
Level 2
Search region
2015/10/19 20
Recognition of 2D rigid objects
Tracking
Model Pyramid
Similarity
Coefficient
Calculation
Similarity
Coefficient
Calculation
Candidate
Transformed
Image Pyramid
Maximum Extraction
Candidate
max max max max , , ,
( , , , ) arg max ( , , , ),
[ , ], [ , ],
[ ' 2 , ' 2 ]
[ , ]
x y s
x x y y
x y s x y s
x x t x t y y t y t
s s s s s
   

  
    

        
    
     
 Tracking
2015/10/19
Inspection
image
Level 0
Level 1
Level 2
Search region
21
Recognition of 2D rigid objects
↓2
Level
Level
Level 0
Search Image
↓2 Similarity
Coefficient
Calculation
2D Rigid
Transformation
2D Rigid
Transformation
Exhaustive
Search
Candidate Extraction and Ranking
Detected Range of
Rotation Angle
and Scaling
Threshold of
Similarity Score
Track Track
2D Rigid
Transformation
2D Rigid
Transformation
First
Tracking
Track Track
2D Rigid
Transformation
2D Rigid
Transformation
Final
Tracking
Sub-pixel Pose Estimation
Candidate Ranking
Threshold of
Similarity Score
Matching
Results
Local Maximum Local Maximum
Results
Similarity
Coefficient
Calculation
Detected Range of
Rotation Angle
and Scaling
Detected Range of
Rotation Angle
and Scaling
max max max max
( , , , )ix y S   
1l
n 
2l
n 
Inspection
image
Level 0
Level 1
Level 2
Search region
2015/10/19 22
 Sub-pixel estimation for NCC and SBM
Recognition of 2D rigid objects
2 2 2
0 1 2 3 4 5 6 7 8 9( , , )x y k x k y k k xy k x k y k x k y k k              
x
y

1
0
1
101
1
0
1 x
y

Maximum
of * * *
( , , )x y 
3 3 3  neighborhood space
0 0
0 0
0 0 0 0
[ 1, 1],
[ 1, 1],
[ ' , ' ]
x x x
y y y
    
   
   
    
1*
0 3 4 6
*
3 1 5 7
*
4 5 2 8
2
2
2
x k k k k
y k k k k
k k k k

     
           
         
2015/10/19 23
VIDEO - APPLICATION
2015/10/19 24
Methods Strength weakness
Stereo vision
Simple and inexpensive
High accuracy on well-defined targets
Computational burden
Limited to well defined scenes
Low data acquisition rata
Laser triangulation
Relative simplicity
High data acquisition rate
High accuracy
Safety constrains, if laser based
Limited range
Missing data in correspondence
with occlusions and shadows
High cost
Structured light
High data acquisition rate
High accuracy
Safety constrains, if laser based
Missing data in correspondence
with occlusions and shadows
High cost
Time of Flight
Medium to large measurement range
High data acquisition rate
High cost
Recognition of 3D objects
2015/10/19 25
Methods Data Strength weakness
View based
CAD model
or
model image
Efficiency
Robust to occlusions, Clutter
and contrast changes
Low cost
Limited accuracy on
viewpoints
Point Cloud based Point clouds
High accuracy
Robust to occlusions and
clutter
3D sensor required
High Computation complexity
High cost
Recognition of 3D objects
Table. Summarized of 3D pose estimation techniques.
2015/10/19 26
 Perspective shape-based matching
Recognition of perspective objects
Image coordinate
Camerea coordinate
Object plane
coordinate
Point on plane
P
Q
extM
projectionM
R
C
1
r
c
 
 
 
  
Pixel coordinate
Pixel point
affineM
0
1
p
q
 
 
 
 
 
 
X
Y
Z
x
y
U
V
W
,
1
1
affine projection ext
u
r
v
c
w
 
   
      
    
 
M M M
11 12 13
21 22 23
31 32 33
11 12
21 22
31 32
0 0 0
0 0 0
0
1 0 0 1 0
0 0 0 1 1
0 0
0 0 ,
0 0 1 1
x
y
z
x
y
z
r r r t p
x f
r r r t q
y f
r r r t
f r r t p
f r r t q
r r t
   
       
              
          
   
     
           
          
𝐑 𝐓
11
21
31
x
r
r
r
 
   
  
r
12
22
32
y
r
r
r
 
   
  
r
( , , ) ( ) ( ) ( )x y z x x y y z z     R R R R
2015/10/19 27
 Perspective shape-based matching
Recognition of perspective objects
, ,
2 2 2 2
1 , ,
1
( , , , , ) ,i i i i
i i i i
n
i x r y c i x r y c
perspective x y z
i i i x r y c x r y c
t v u w
x y
n t u v w
   
      
       
 

  

cos( ) sin( )
,
sin( ) cos( )
i iz z
i iz z
t t
u u
 
 
     
         
cos cos cos sin sin cos sin1 0
,
cos sin cos cos sin sin sin0 1
y z z x y x zi i
y z x z x y zi i
r r
c c
      
      
      
             
( , )T
i i it ud
, ,( , )i i i i
T
i r c r cv we
( , )T
i i ir cq
gradients direction vector of template image
gradients direction vector of scene image
edge points, these points are relative to the center of gravity
and the edge points of the object.
, ,
2 2 2 2
1 , ,
1
( , , , ) ,i i i i
i i i i
n
i x r y c i x r y c
eg
i i i x r y c x r y c
t v u w
x y s
n t u v w
 
      
       
 

  

cos( ) sin( )
sin( ) cos( )
i i
i i
t ts s
u us s
 
 
       
          
cos( ) sin( )
sin( ) cos( )
i i
i i
r rs s
c cs s
 
 
       
          
2015/10/19 28
orthogonal projection
weak-perspective projection
Optimal (Coarse-to-fine) :
 Image pyramid technique
 SIMD
 Search strategy
Recognition of perspective objects
Template Image
Input Image
Matching Results
Threshold of
Similarity Score
User
Defined
Detected Range of
Rotation Angle
Offline Model Generation
Image Pyramid Technique
Model Gengeration
Online Matching Process
Image Pyramid Technique
Object Selection Process
, 0, 1,..., 1m l
lT l n 
l
n
minTh
_ min _ max
_ min _ max
_ min _ max
and ,
and ,
and
x x
y y
z z
 
 
 
Optimal Matching
Process
, 0, 1,..., 1m l
lI l n 
T
I
Detected Number
of objects max
Obj
, ,,l i l ip d
,l ie
2015/10/19 29
 Rotation requirements
Recognition of perspective objects
(roll)x
(pitch)y
(yaw)z
Image Plane x
y
z
Camera
(pitch)y
(yaw)z
x
Original Image Plane
Rotated Image Plane
d
l
Camera
(roll)x
(yaw)z
yOriginal Image Plane
Rotated Image Plane
d
l
Camera
1 1
cosx y
l
l
    
     
 
1
2
1
cos 1
2
z
l
   
   
 
2015/10/19 30
Recognition of perspective objects
Inspection
image
Level 0
Level 1
Level 2
Search region
↓2
Level
Level
Level 0
Search Image
↓2
Similarity
Coefficient
Calculation
3D Rigid
Transformation
3D Rigid
Transformation
Exhaustive
Search
Candidate Extraction and Ranking
Detected Range of
Rotation Angle
Threshold of
Similarity Score
Track Track
3D Rigid
Transformation
3D Rigid
Transformation
First
Tracking
Track Track
3D Rigid
Transformation
3D Rigid
Transformation
Final
Tracking
Candidate Ranking
Threshold of
Similarity Score
Matching
Results
Results
Similarity
Coefficient
Calculation
Detected Range of
Rotation Angle
Detected Range of
Rotation Angle
max max max max max_ _ _( , , , , )x y z ix y      
1l
n 
2l
n 
2015/10/19 31
Recognition of perspective objects
max max max max max_ _ _
, , ,
( , , , , ) arg max ( , , , , ),
[ , ], [ , ],
[ ' 2 , ' 2 ],
[ ' 2 , ' 2 ],
[ ' 2 , ' 2 ]
x y z x y z
x y s
x x y y
x x x x x
y y y y y
z z z z z
x y x y
x x t x t y y t y t
    

      
    
    
    

        
    
    
    
 Tracking
Tracking
Model Pyramid
Similarity
Coefficient
Calculation
Similarity
Coefficient
Calculation
Candidate
3D Transformed
Image Pyramid
Maximum Extraction
Candidate
2015/10/19 32
Recognition of perspective objects
2015/10/19 33
 Stereo vision
Recognition of 3D objects
2015/10/19 34
l r
fb
Z
x x


Ref: Stefano Mattoccia - http://vision.deis.unibo.it/~smatt/Seminars/StereoVision.pdf
 Stereo vision
Recognition of 3D objects
2015/10/19 35
Camera Calibration
(offline)
Rectification
Stereo Correspondence
Triangulation
Calibration is carried out acquiring and
processing 10+ stereo pairs of a known
pattern (typically a checkerboard)
 Intrinsic parameters of the two cameras
(focal length, image center, parameters
of lenses distortion, etc)
 Extrinsic parameters
(R and T that aligns the two cameras)
Left
i
j
i
j
..(i,j) (i,j)
Right
 Laser triangulation
Recognition of 3D objects
2015/10/19 36
Ref: http://www.vision-doctor.co.uk/laser-illumination/principle-of-triangulation.html
 Structured light
Recognition of 3D objects
2015/10/19 37
VIDEO-Structured light
2015/10/19 38
 Kinect
Recognition of 3D objects
2015/10/19 39
 Random pattern projection
Recognition of 3D objects
2015/10/19 40
Camera
Laser RPP
 Point Cloud
Recognition of 3D objects
2015/10/19 41
 Point Cloud Library (PCL)
Recognition of 3D objects
2015/10/19 42
•Module common
•Module features
•Module filters
•Module geometry
•Module io
•Module kdtree
•Module keypoints
•Module octree
•Module outofcore
•Module recognition
•Module registration
•Module sample_consensus
•Module search
•Module segmentation
•Module surface
•Module visualization
The Point Cloud Library (or PCL) is a large scale,
open project for point cloud processing, started by
Willow Garage, with the purpose to accelerate 3D
algorithmic work in perception for use in robotic
applications.
Ref: http://www.pointclouds.org/
 Point Cloud Library (PCL) – Point Feature Histogram (PFH)
Recognition of 3D objects
2015/10/19 43
1D
2D
3D
4D
5D
6D
9D
7D
8D
qD
[D D ] Ut sV   
W U V 
sD
tD
U
W
V
tN
sN U



t sD D
sU N
( )t s
t s
D D U
V
D D
 


W U V 
acos( )tV N  
t sd D D 
acos( )t sD D
U
d


 
atan( , )t tW N U N   
 Point Cloud Library (PCL) – Point Feature Histogram (PFH)
Recognition of 3D objects
2015/10/19 44
1
2 4
1
13
4
=
(s , ) 2
atan( , )
t
t s i
i
i it s
i
t t
f V N
f D D
idx step fD D
f U
d
f W N U N



 

  
 
  

   

0, if >
( , )
1, if
f s
step s f
f s



1D
2D
3D
4D
5D
6D
9D
7D
8D
qD
0 1 2 3 … 15
count
 Point Cloud Library (PCL) – Fast Point Feature Histogram (FPFH)
Recognition of 3D objects
2015/10/19 45
1D
4D
6D
7D
9D
qD
5D
3D
8D
2D
1
1 1
( ) ( ) ( )
k
q q k
i k
FPFH D SPFH D SPFH D
k w
  
1
2 4
1
13
4
=
(s , ) 2
atan( , )
t
t s i
i
i it s
i
t t
f V N
f D D
idx step fD D
f U
d
f W N U N



 

  
 
  

   

 Point Cloud Library (PCL) – PFH- FPFH
Recognition of 3D objects
2015/10/19 46
1D
4D
6D
7D
9D
qD
5D
3D
8D
2D
1D
2D
3D
4D
5D
6D
9D
7D
8D
qD
PFH FPFH
 Point Cloud Library (PCL) – registration
Recognition of 3D objects
2015/10/19 47
R
T
R
T
Phase1 SAC-IA >>>
Phase2 ICP >>>
Template
pose
Search
Sample Consensus Initial Alignment (SAC-IA)
Iteration Closest Point (ICP)
2015/10/19 48

More Related Content

What's hot

OIT to Volumetric Shadow Mapping, 101 Uses for Raster-Ordered Views using Dir...
OIT to Volumetric Shadow Mapping, 101 Uses for Raster-Ordered Views using Dir...OIT to Volumetric Shadow Mapping, 101 Uses for Raster-Ordered Views using Dir...
OIT to Volumetric Shadow Mapping, 101 Uses for Raster-Ordered Views using Dir...Gael Hofemeier
 
Understanding cnn
Understanding cnnUnderstanding cnn
Understanding cnnRucha Gole
 
Tutorial on Object Detection (Faster R-CNN)
Tutorial on Object Detection (Faster R-CNN)Tutorial on Object Detection (Faster R-CNN)
Tutorial on Object Detection (Faster R-CNN)Hwa Pyung Kim
 
You Only Look Once: Unified, Real-Time Object Detection
You Only Look Once: Unified, Real-Time Object DetectionYou Only Look Once: Unified, Real-Time Object Detection
You Only Look Once: Unified, Real-Time Object DetectionDADAJONJURAKUZIEV
 
Siggraph2016 - The Devil is in the Details: idTech 666
Siggraph2016 - The Devil is in the Details: idTech 666Siggraph2016 - The Devil is in the Details: idTech 666
Siggraph2016 - The Devil is in the Details: idTech 666Tiago Sousa
 
Yolo v2 ai_tech_20190421
Yolo v2 ai_tech_20190421Yolo v2 ai_tech_20190421
Yolo v2 ai_tech_20190421穗碧 陳
 
Rate-Distortion Function for Gamma Sources under Absolute-Log Distortion
Rate-Distortion Function for Gamma Sources under Absolute-Log DistortionRate-Distortion Function for Gamma Sources under Absolute-Log Distortion
Rate-Distortion Function for Gamma Sources under Absolute-Log Distortion奈良先端大 情報科学研究科
 
A completed modeling of local binary pattern operator
A completed modeling of local binary pattern operatorA completed modeling of local binary pattern operator
A completed modeling of local binary pattern operatorWin Yu
 
Image-to-Image Translation
Image-to-Image TranslationImage-to-Image Translation
Image-to-Image TranslationJunho Kim
 
Content-Based Image Retrieval (D2L6 Insight@DCU Machine Learning Workshop 2017)
Content-Based Image Retrieval (D2L6 Insight@DCU Machine Learning Workshop 2017)Content-Based Image Retrieval (D2L6 Insight@DCU Machine Learning Workshop 2017)
Content-Based Image Retrieval (D2L6 Insight@DCU Machine Learning Workshop 2017)Universitat Politècnica de Catalunya
 
Object detection and Instance Segmentation
Object detection and Instance SegmentationObject detection and Instance Segmentation
Object detection and Instance SegmentationHichem Felouat
 
The Transformer in Vision | Xavier Giro | Master in Computer Vision Barcelona...
The Transformer in Vision | Xavier Giro | Master in Computer Vision Barcelona...The Transformer in Vision | Xavier Giro | Master in Computer Vision Barcelona...
The Transformer in Vision | Xavier Giro | Master in Computer Vision Barcelona...Universitat Politècnica de Catalunya
 
Deep learning based object detection basics
Deep learning based object detection basicsDeep learning based object detection basics
Deep learning based object detection basicsBrodmann17
 
DTAM: Dense Tracking and Mapping in Real-Time, Robot vision Group
DTAM: Dense Tracking and Mapping in Real-Time, Robot vision GroupDTAM: Dense Tracking and Mapping in Real-Time, Robot vision Group
DTAM: Dense Tracking and Mapping in Real-Time, Robot vision GroupLihang Li
 
Sliced Wasserstein距離と生成モデル
Sliced Wasserstein距離と生成モデルSliced Wasserstein距離と生成モデル
Sliced Wasserstein距離と生成モデルohken
 
マハラノビス距離とユークリッド距離の違い
マハラノビス距離とユークリッド距離の違いマハラノビス距離とユークリッド距離の違い
マハラノビス距離とユークリッド距離の違いwada, kazumi
 

What's hot (20)

OIT to Volumetric Shadow Mapping, 101 Uses for Raster-Ordered Views using Dir...
OIT to Volumetric Shadow Mapping, 101 Uses for Raster-Ordered Views using Dir...OIT to Volumetric Shadow Mapping, 101 Uses for Raster-Ordered Views using Dir...
OIT to Volumetric Shadow Mapping, 101 Uses for Raster-Ordered Views using Dir...
 
Understanding cnn
Understanding cnnUnderstanding cnn
Understanding cnn
 
Tutorial on Object Detection (Faster R-CNN)
Tutorial on Object Detection (Faster R-CNN)Tutorial on Object Detection (Faster R-CNN)
Tutorial on Object Detection (Faster R-CNN)
 
Mask R-CNN
Mask R-CNNMask R-CNN
Mask R-CNN
 
You Only Look Once: Unified, Real-Time Object Detection
You Only Look Once: Unified, Real-Time Object DetectionYou Only Look Once: Unified, Real-Time Object Detection
You Only Look Once: Unified, Real-Time Object Detection
 
Siggraph2016 - The Devil is in the Details: idTech 666
Siggraph2016 - The Devil is in the Details: idTech 666Siggraph2016 - The Devil is in the Details: idTech 666
Siggraph2016 - The Devil is in the Details: idTech 666
 
Yolo v2 ai_tech_20190421
Yolo v2 ai_tech_20190421Yolo v2 ai_tech_20190421
Yolo v2 ai_tech_20190421
 
Rate-Distortion Function for Gamma Sources under Absolute-Log Distortion
Rate-Distortion Function for Gamma Sources under Absolute-Log DistortionRate-Distortion Function for Gamma Sources under Absolute-Log Distortion
Rate-Distortion Function for Gamma Sources under Absolute-Log Distortion
 
A completed modeling of local binary pattern operator
A completed modeling of local binary pattern operatorA completed modeling of local binary pattern operator
A completed modeling of local binary pattern operator
 
Image-to-Image Translation
Image-to-Image TranslationImage-to-Image Translation
Image-to-Image Translation
 
Content-Based Image Retrieval (D2L6 Insight@DCU Machine Learning Workshop 2017)
Content-Based Image Retrieval (D2L6 Insight@DCU Machine Learning Workshop 2017)Content-Based Image Retrieval (D2L6 Insight@DCU Machine Learning Workshop 2017)
Content-Based Image Retrieval (D2L6 Insight@DCU Machine Learning Workshop 2017)
 
Object detection and Instance Segmentation
Object detection and Instance SegmentationObject detection and Instance Segmentation
Object detection and Instance Segmentation
 
The Transformer in Vision | Xavier Giro | Master in Computer Vision Barcelona...
The Transformer in Vision | Xavier Giro | Master in Computer Vision Barcelona...The Transformer in Vision | Xavier Giro | Master in Computer Vision Barcelona...
The Transformer in Vision | Xavier Giro | Master in Computer Vision Barcelona...
 
Deep learning based object detection basics
Deep learning based object detection basicsDeep learning based object detection basics
Deep learning based object detection basics
 
Object detection
Object detectionObject detection
Object detection
 
Cours SDL2 (partie 1)
Cours SDL2 (partie 1)Cours SDL2 (partie 1)
Cours SDL2 (partie 1)
 
DTAM: Dense Tracking and Mapping in Real-Time, Robot vision Group
DTAM: Dense Tracking and Mapping in Real-Time, Robot vision GroupDTAM: Dense Tracking and Mapping in Real-Time, Robot vision Group
DTAM: Dense Tracking and Mapping in Real-Time, Robot vision Group
 
MIRU2014 tutorial deeplearning
MIRU2014 tutorial deeplearningMIRU2014 tutorial deeplearning
MIRU2014 tutorial deeplearning
 
Sliced Wasserstein距離と生成モデル
Sliced Wasserstein距離と生成モデルSliced Wasserstein距離と生成モデル
Sliced Wasserstein距離と生成モデル
 
マハラノビス距離とユークリッド距離の違い
マハラノビス距離とユークリッド距離の違いマハラノビス距離とユークリッド距離の違い
マハラノビス距離とユークリッド距離の違い
 

Similar to 利用影像匹配進行物件辨識與對位

Image Quality Feature Based Detection Algorithm for Forgery in Images
Image Quality Feature Based Detection Algorithm for Forgery in Images  Image Quality Feature Based Detection Algorithm for Forgery in Images
Image Quality Feature Based Detection Algorithm for Forgery in Images ijcga
 
Introducing New Parameters to Compare the Accuracy and Reliability of Mean-Sh...
Introducing New Parameters to Compare the Accuracy and Reliability of Mean-Sh...Introducing New Parameters to Compare the Accuracy and Reliability of Mean-Sh...
Introducing New Parameters to Compare the Accuracy and Reliability of Mean-Sh...sipij
 
COMPARISON OF RENDERING PROCESSES ON 3D MODEL
COMPARISON OF RENDERING PROCESSES ON 3D MODELCOMPARISON OF RENDERING PROCESSES ON 3D MODEL
COMPARISON OF RENDERING PROCESSES ON 3D MODELijcsit
 
JPEG XR objective and subjective evaluations
JPEG XR objective and subjective evaluationsJPEG XR objective and subjective evaluations
JPEG XR objective and subjective evaluationsTouradj Ebrahimi
 
When Discrete Optimization Meets Multimedia Security (and Beyond)
When Discrete Optimization Meets Multimedia Security (and Beyond)When Discrete Optimization Meets Multimedia Security (and Beyond)
When Discrete Optimization Meets Multimedia Security (and Beyond)Shujun Li
 
IRJET- Real Time Implementation of Bi-Histogram Equalization Method on Androi...
IRJET- Real Time Implementation of Bi-Histogram Equalization Method on Androi...IRJET- Real Time Implementation of Bi-Histogram Equalization Method on Androi...
IRJET- Real Time Implementation of Bi-Histogram Equalization Method on Androi...IRJET Journal
 
IRJET- Design and Implementation of ATM Security System using Vibration Senso...
IRJET- Design and Implementation of ATM Security System using Vibration Senso...IRJET- Design and Implementation of ATM Security System using Vibration Senso...
IRJET- Design and Implementation of ATM Security System using Vibration Senso...IRJET Journal
 
Build Your Own 3D Scanner: 3D Scanning with Swept-Planes
Build Your Own 3D Scanner: 3D Scanning with Swept-PlanesBuild Your Own 3D Scanner: 3D Scanning with Swept-Planes
Build Your Own 3D Scanner: 3D Scanning with Swept-PlanesDouglas Lanman
 
Secret-Fragment Visible Mosaic Images by Genetic Algorithm
Secret-Fragment Visible Mosaic Images by Genetic AlgorithmSecret-Fragment Visible Mosaic Images by Genetic Algorithm
Secret-Fragment Visible Mosaic Images by Genetic AlgorithmIRJET Journal
 
Stereo Correspondence Estimation by Two Dimensional Real Time Spiral Search A...
Stereo Correspondence Estimation by Two Dimensional Real Time Spiral Search A...Stereo Correspondence Estimation by Two Dimensional Real Time Spiral Search A...
Stereo Correspondence Estimation by Two Dimensional Real Time Spiral Search A...MDABDULMANNANMONDAL
 
Flow Trajectory Approach for Human Action Recognition
Flow Trajectory Approach for Human Action RecognitionFlow Trajectory Approach for Human Action Recognition
Flow Trajectory Approach for Human Action RecognitionIRJET Journal
 
Advanced Multimedia
Advanced MultimediaAdvanced Multimedia
Advanced Multimediakadalrocker
 
Canny Edge Detection Algorithm on FPGA
Canny Edge Detection Algorithm on FPGA Canny Edge Detection Algorithm on FPGA
Canny Edge Detection Algorithm on FPGA IOSR Journals
 
Canny Edge Detection Algorithm on FPGA
Canny Edge Detection Algorithm on FPGA Canny Edge Detection Algorithm on FPGA
Canny Edge Detection Algorithm on FPGA IOSR Journals
 
Design and implementation of video tracking system based on camera field of view
Design and implementation of video tracking system based on camera field of viewDesign and implementation of video tracking system based on camera field of view
Design and implementation of video tracking system based on camera field of viewsipij
 
Enhancing Security and Privacy Issue in Airport by Biometric based Iris Recog...
Enhancing Security and Privacy Issue in Airport by Biometric based Iris Recog...Enhancing Security and Privacy Issue in Airport by Biometric based Iris Recog...
Enhancing Security and Privacy Issue in Airport by Biometric based Iris Recog...idescitation
 
Trajectory Based Unusual Human Movement Identification for ATM System
	 Trajectory Based Unusual Human Movement Identification for ATM System	 Trajectory Based Unusual Human Movement Identification for ATM System
Trajectory Based Unusual Human Movement Identification for ATM SystemIRJET Journal
 

Similar to 利用影像匹配進行物件辨識與對位 (20)

Image Quality Feature Based Detection Algorithm for Forgery in Images
Image Quality Feature Based Detection Algorithm for Forgery in Images  Image Quality Feature Based Detection Algorithm for Forgery in Images
Image Quality Feature Based Detection Algorithm for Forgery in Images
 
Comparison of Rendering Processes on 3D Model
Comparison of Rendering Processes on 3D ModelComparison of Rendering Processes on 3D Model
Comparison of Rendering Processes on 3D Model
 
Introducing New Parameters to Compare the Accuracy and Reliability of Mean-Sh...
Introducing New Parameters to Compare the Accuracy and Reliability of Mean-Sh...Introducing New Parameters to Compare the Accuracy and Reliability of Mean-Sh...
Introducing New Parameters to Compare the Accuracy and Reliability of Mean-Sh...
 
COMPARISON OF RENDERING PROCESSES ON 3D MODEL
COMPARISON OF RENDERING PROCESSES ON 3D MODELCOMPARISON OF RENDERING PROCESSES ON 3D MODEL
COMPARISON OF RENDERING PROCESSES ON 3D MODEL
 
Comparison of Rendering Processes on 3D Model
Comparison of Rendering Processes on 3D ModelComparison of Rendering Processes on 3D Model
Comparison of Rendering Processes on 3D Model
 
JPEG XR objective and subjective evaluations
JPEG XR objective and subjective evaluationsJPEG XR objective and subjective evaluations
JPEG XR objective and subjective evaluations
 
When Discrete Optimization Meets Multimedia Security (and Beyond)
When Discrete Optimization Meets Multimedia Security (and Beyond)When Discrete Optimization Meets Multimedia Security (and Beyond)
When Discrete Optimization Meets Multimedia Security (and Beyond)
 
IRJET- Real Time Implementation of Bi-Histogram Equalization Method on Androi...
IRJET- Real Time Implementation of Bi-Histogram Equalization Method on Androi...IRJET- Real Time Implementation of Bi-Histogram Equalization Method on Androi...
IRJET- Real Time Implementation of Bi-Histogram Equalization Method on Androi...
 
IRJET- Design and Implementation of ATM Security System using Vibration Senso...
IRJET- Design and Implementation of ATM Security System using Vibration Senso...IRJET- Design and Implementation of ATM Security System using Vibration Senso...
IRJET- Design and Implementation of ATM Security System using Vibration Senso...
 
Build Your Own 3D Scanner: 3D Scanning with Swept-Planes
Build Your Own 3D Scanner: 3D Scanning with Swept-PlanesBuild Your Own 3D Scanner: 3D Scanning with Swept-Planes
Build Your Own 3D Scanner: 3D Scanning with Swept-Planes
 
Secret-Fragment Visible Mosaic Images by Genetic Algorithm
Secret-Fragment Visible Mosaic Images by Genetic AlgorithmSecret-Fragment Visible Mosaic Images by Genetic Algorithm
Secret-Fragment Visible Mosaic Images by Genetic Algorithm
 
Stereo Correspondence Estimation by Two Dimensional Real Time Spiral Search A...
Stereo Correspondence Estimation by Two Dimensional Real Time Spiral Search A...Stereo Correspondence Estimation by Two Dimensional Real Time Spiral Search A...
Stereo Correspondence Estimation by Two Dimensional Real Time Spiral Search A...
 
Flow Trajectory Approach for Human Action Recognition
Flow Trajectory Approach for Human Action RecognitionFlow Trajectory Approach for Human Action Recognition
Flow Trajectory Approach for Human Action Recognition
 
Advanced Multimedia
Advanced MultimediaAdvanced Multimedia
Advanced Multimedia
 
Canny Edge Detection Algorithm on FPGA
Canny Edge Detection Algorithm on FPGA Canny Edge Detection Algorithm on FPGA
Canny Edge Detection Algorithm on FPGA
 
C010111519
C010111519C010111519
C010111519
 
Canny Edge Detection Algorithm on FPGA
Canny Edge Detection Algorithm on FPGA Canny Edge Detection Algorithm on FPGA
Canny Edge Detection Algorithm on FPGA
 
Design and implementation of video tracking system based on camera field of view
Design and implementation of video tracking system based on camera field of viewDesign and implementation of video tracking system based on camera field of view
Design and implementation of video tracking system based on camera field of view
 
Enhancing Security and Privacy Issue in Airport by Biometric based Iris Recog...
Enhancing Security and Privacy Issue in Airport by Biometric based Iris Recog...Enhancing Security and Privacy Issue in Airport by Biometric based Iris Recog...
Enhancing Security and Privacy Issue in Airport by Biometric based Iris Recog...
 
Trajectory Based Unusual Human Movement Identification for ATM System
	 Trajectory Based Unusual Human Movement Identification for ATM System	 Trajectory Based Unusual Human Movement Identification for ATM System
Trajectory Based Unusual Human Movement Identification for ATM System
 

More from CHENHuiMei

小數據如何實現電腦視覺,微軟AI研究首席剖析關鍵
小數據如何實現電腦視覺,微軟AI研究首席剖析關鍵小數據如何實現電腦視覺,微軟AI研究首席剖析關鍵
小數據如何實現電腦視覺,微軟AI研究首席剖析關鍵CHENHuiMei
 
QIF對AOI設備業之衝擊與機會
QIF對AOI設備業之衝擊與機會QIF對AOI設備業之衝擊與機會
QIF對AOI設備業之衝擊與機會CHENHuiMei
 
產研融合推手-台大AOI設備研發聯盟_台大陳亮嘉
產研融合推手-台大AOI設備研發聯盟_台大陳亮嘉產研融合推手-台大AOI設備研發聯盟_台大陳亮嘉
產研融合推手-台大AOI設備研發聯盟_台大陳亮嘉CHENHuiMei
 
基於少樣本深度學習之橡膠墊片檢測系統
基於少樣本深度學習之橡膠墊片檢測系統基於少樣本深度學習之橡膠墊片檢測系統
基於少樣本深度學習之橡膠墊片檢測系統CHENHuiMei
 
AOI智慧升級─AI訓練師在地養成計畫_台灣人工智慧學校
AOI智慧升級─AI訓練師在地養成計畫_台灣人工智慧學校AOI智慧升級─AI訓練師在地養成計畫_台灣人工智慧學校
AOI智慧升級─AI訓練師在地養成計畫_台灣人工智慧學校CHENHuiMei
 
使用人工智慧檢測三維錫球瑕疵_台大傅楸善
使用人工智慧檢測三維錫球瑕疵_台大傅楸善使用人工智慧檢測三維錫球瑕疵_台大傅楸善
使用人工智慧檢測三維錫球瑕疵_台大傅楸善CHENHuiMei
 
IIoT發展趨勢及設備業者因應之_微軟葉怡君
IIoT發展趨勢及設備業者因應之_微軟葉怡君IIoT發展趨勢及設備業者因應之_微軟葉怡君
IIoT發展趨勢及設備業者因應之_微軟葉怡君CHENHuiMei
 
精密機械的空間軌跡精度光學檢測法_台大范光照
精密機械的空間軌跡精度光學檢測法_台大范光照精密機械的空間軌跡精度光學檢測法_台大范光照
精密機械的空間軌跡精度光學檢測法_台大范光照CHENHuiMei
 
When AOI meets AI
When AOI meets AIWhen AOI meets AI
When AOI meets AICHENHuiMei
 
2018AOI論壇_基於生成對抗網路之非監督式AOI技術_工研院蔡雅惠
2018AOI論壇_基於生成對抗網路之非監督式AOI技術_工研院蔡雅惠2018AOI論壇_基於生成對抗網路之非監督式AOI技術_工研院蔡雅惠
2018AOI論壇_基於生成對抗網路之非監督式AOI技術_工研院蔡雅惠CHENHuiMei
 
2018AOIEA論壇Keynote_眺望趨勢 量測設備未來10年發展重點_致茂曾一士
2018AOIEA論壇Keynote_眺望趨勢 量測設備未來10年發展重點_致茂曾一士2018AOIEA論壇Keynote_眺望趨勢 量測設備未來10年發展重點_致茂曾一士
2018AOIEA論壇Keynote_眺望趨勢 量測設備未來10年發展重點_致茂曾一士CHENHuiMei
 
2018AOI論壇Keynote_AI入魂製造領域現況與趨勢_工研院熊治民
2018AOI論壇Keynote_AI入魂製造領域現況與趨勢_工研院熊治民2018AOI論壇Keynote_AI入魂製造領域現況與趨勢_工研院熊治民
2018AOI論壇Keynote_AI入魂製造領域現況與趨勢_工研院熊治民CHENHuiMei
 
2018AOI論壇_AOI and IoT產線應用_工研院周森益
2018AOI論壇_AOI and IoT產線應用_工研院周森益2018AOI論壇_AOI and IoT產線應用_工研院周森益
2018AOI論壇_AOI and IoT產線應用_工研院周森益CHENHuiMei
 
2018AOI論壇_AOI參與整廠協作之實務建議_達明機器人黃鐘賢
2018AOI論壇_AOI參與整廠協作之實務建議_達明機器人黃鐘賢2018AOI論壇_AOI參與整廠協作之實務建議_達明機器人黃鐘賢
2018AOI論壇_AOI參與整廠協作之實務建議_達明機器人黃鐘賢CHENHuiMei
 
2018AOI論壇_深度學習在電腦視覺應用上的疑問_中央大學曾定章
2018AOI論壇_深度學習在電腦視覺應用上的疑問_中央大學曾定章2018AOI論壇_深度學習在電腦視覺應用上的疑問_中央大學曾定章
2018AOI論壇_深度學習在電腦視覺應用上的疑問_中央大學曾定章CHENHuiMei
 
2018AOI論壇_深度學習於表面瑕疪檢測_元智大學蔡篤銘
2018AOI論壇_深度學習於表面瑕疪檢測_元智大學蔡篤銘2018AOI論壇_深度學習於表面瑕疪檢測_元智大學蔡篤銘
2018AOI論壇_深度學習於表面瑕疪檢測_元智大學蔡篤銘CHENHuiMei
 
2018AOI論壇_時機已到 AOI導入邊緣運算_SAS林育宏
2018AOI論壇_時機已到 AOI導入邊緣運算_SAS林育宏2018AOI論壇_時機已到 AOI導入邊緣運算_SAS林育宏
2018AOI論壇_時機已到 AOI導入邊緣運算_SAS林育宏CHENHuiMei
 
2018AOI論壇_如何導入深度學習來提升工業瑕疵檢測技術_工研院賴璟皓
2018AOI論壇_如何導入深度學習來提升工業瑕疵檢測技術_工研院賴璟皓2018AOI論壇_如何導入深度學習來提升工業瑕疵檢測技術_工研院賴璟皓
2018AOI論壇_如何導入深度學習來提升工業瑕疵檢測技術_工研院賴璟皓CHENHuiMei
 

More from CHENHuiMei (20)

小數據如何實現電腦視覺,微軟AI研究首席剖析關鍵
小數據如何實現電腦視覺,微軟AI研究首席剖析關鍵小數據如何實現電腦視覺,微軟AI研究首席剖析關鍵
小數據如何實現電腦視覺,微軟AI研究首席剖析關鍵
 
QIF對AOI設備業之衝擊與機會
QIF對AOI設備業之衝擊與機會QIF對AOI設備業之衝擊與機會
QIF對AOI設備業之衝擊與機會
 
產研融合推手-台大AOI設備研發聯盟_台大陳亮嘉
產研融合推手-台大AOI設備研發聯盟_台大陳亮嘉產研融合推手-台大AOI設備研發聯盟_台大陳亮嘉
產研融合推手-台大AOI設備研發聯盟_台大陳亮嘉
 
基於少樣本深度學習之橡膠墊片檢測系統
基於少樣本深度學習之橡膠墊片檢測系統基於少樣本深度學習之橡膠墊片檢測系統
基於少樣本深度學習之橡膠墊片檢測系統
 
AOI智慧升級─AI訓練師在地養成計畫_台灣人工智慧學校
AOI智慧升級─AI訓練師在地養成計畫_台灣人工智慧學校AOI智慧升級─AI訓練師在地養成計畫_台灣人工智慧學校
AOI智慧升級─AI訓練師在地養成計畫_台灣人工智慧學校
 
使用人工智慧檢測三維錫球瑕疵_台大傅楸善
使用人工智慧檢測三維錫球瑕疵_台大傅楸善使用人工智慧檢測三維錫球瑕疵_台大傅楸善
使用人工智慧檢測三維錫球瑕疵_台大傅楸善
 
IIoT發展趨勢及設備業者因應之_微軟葉怡君
IIoT發展趨勢及設備業者因應之_微軟葉怡君IIoT發展趨勢及設備業者因應之_微軟葉怡君
IIoT發展趨勢及設備業者因應之_微軟葉怡君
 
精密機械的空間軌跡精度光學檢測法_台大范光照
精密機械的空間軌跡精度光學檢測法_台大范光照精密機械的空間軌跡精度光學檢測法_台大范光照
精密機械的空間軌跡精度光學檢測法_台大范光照
 
Report
ReportReport
Report
 
Deep learning
Deep learningDeep learning
Deep learning
 
When AOI meets AI
When AOI meets AIWhen AOI meets AI
When AOI meets AI
 
2018AOI論壇_基於生成對抗網路之非監督式AOI技術_工研院蔡雅惠
2018AOI論壇_基於生成對抗網路之非監督式AOI技術_工研院蔡雅惠2018AOI論壇_基於生成對抗網路之非監督式AOI技術_工研院蔡雅惠
2018AOI論壇_基於生成對抗網路之非監督式AOI技術_工研院蔡雅惠
 
2018AOIEA論壇Keynote_眺望趨勢 量測設備未來10年發展重點_致茂曾一士
2018AOIEA論壇Keynote_眺望趨勢 量測設備未來10年發展重點_致茂曾一士2018AOIEA論壇Keynote_眺望趨勢 量測設備未來10年發展重點_致茂曾一士
2018AOIEA論壇Keynote_眺望趨勢 量測設備未來10年發展重點_致茂曾一士
 
2018AOI論壇Keynote_AI入魂製造領域現況與趨勢_工研院熊治民
2018AOI論壇Keynote_AI入魂製造領域現況與趨勢_工研院熊治民2018AOI論壇Keynote_AI入魂製造領域現況與趨勢_工研院熊治民
2018AOI論壇Keynote_AI入魂製造領域現況與趨勢_工研院熊治民
 
2018AOI論壇_AOI and IoT產線應用_工研院周森益
2018AOI論壇_AOI and IoT產線應用_工研院周森益2018AOI論壇_AOI and IoT產線應用_工研院周森益
2018AOI論壇_AOI and IoT產線應用_工研院周森益
 
2018AOI論壇_AOI參與整廠協作之實務建議_達明機器人黃鐘賢
2018AOI論壇_AOI參與整廠協作之實務建議_達明機器人黃鐘賢2018AOI論壇_AOI參與整廠協作之實務建議_達明機器人黃鐘賢
2018AOI論壇_AOI參與整廠協作之實務建議_達明機器人黃鐘賢
 
2018AOI論壇_深度學習在電腦視覺應用上的疑問_中央大學曾定章
2018AOI論壇_深度學習在電腦視覺應用上的疑問_中央大學曾定章2018AOI論壇_深度學習在電腦視覺應用上的疑問_中央大學曾定章
2018AOI論壇_深度學習在電腦視覺應用上的疑問_中央大學曾定章
 
2018AOI論壇_深度學習於表面瑕疪檢測_元智大學蔡篤銘
2018AOI論壇_深度學習於表面瑕疪檢測_元智大學蔡篤銘2018AOI論壇_深度學習於表面瑕疪檢測_元智大學蔡篤銘
2018AOI論壇_深度學習於表面瑕疪檢測_元智大學蔡篤銘
 
2018AOI論壇_時機已到 AOI導入邊緣運算_SAS林育宏
2018AOI論壇_時機已到 AOI導入邊緣運算_SAS林育宏2018AOI論壇_時機已到 AOI導入邊緣運算_SAS林育宏
2018AOI論壇_時機已到 AOI導入邊緣運算_SAS林育宏
 
2018AOI論壇_如何導入深度學習來提升工業瑕疵檢測技術_工研院賴璟皓
2018AOI論壇_如何導入深度學習來提升工業瑕疵檢測技術_工研院賴璟皓2018AOI論壇_如何導入深度學習來提升工業瑕疵檢測技術_工研院賴璟皓
2018AOI論壇_如何導入深度學習來提升工業瑕疵檢測技術_工研院賴璟皓
 

Recently uploaded

The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphNeo4j
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 

Recently uploaded (20)

The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptx
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 

利用影像匹配進行物件辨識與對位

  • 1. Image Matching for Object Recognition and Alignment Chin-Sheng Chen
  • 2. 2015/10/19 2 Commercial Machine Vision Library MIL VisionPro
  • 3.  Image retrieval  Industrial inspection  Target search  Automatic pick and place 2015/10/19 3 2D/ 3D Image Matching for Object Recognition “perfect” object recognition : Real-time, High accuracy, General, Robustness, Multi-object detection
  • 4. VIDEO – Single object 2015/10/19 4
  • 5. VIDEO – Two objects 2015/10/19 5
  • 6. VIDEO – 3D object recognition 2015/10/19 6
  • 7. 1 2 x y θ (x1, y1) (x2, y2) Template A Template A Search image S Template image P T Recognition of 2D rigid objects 2015/10/19 7 Template Search image
  • 8. Similarity measure 1 Simialrity coefficient 1   Perfect matching  Similarity coefficient =1 minimum difference 2015/10/19 8  Sum of Absolute Differences (SAD)  Sum of Squared Differences (SSD)  Normalized cross correlation (NCC)  Shape-based matching (SBM)
  • 9.  Sum of Absolute Differences (SAD)  Sum of Squared Differences (SSD)  Matching result : Minimum difference Similarity measure 2015/10/19 9 /2 /2 /2 /2 ( , ) ( , ) ( , ) w h i w j h SAD x y I x i y j T i j        /2 /2 2 /2 /2 ( , ) ( , ) ( , ) w h i w j h SAD x y I x i y j T i j        Image I Image T
  • 10.  Normalized cross correlation (NCC) The size of the template image and are the gray-level averages of the template image and the windowed compared image. Similarity measure /2 /2 /2 /2 /2 /2 /2 /2 2 2 2 2 /2 /2 /2 /2 [ ( , ) ( , )] ( , ) , ( , ) ( , ) w h I T i w j h g w h w h I T i w j h i w j h I x i y j T i j w h x y I x i y j w h T i j w h                                              w h Tu Iu /2 /2 /2 /2 1 ( , ) w h T i w j h T i j w h        /2 /2 /2 /2 1 ( , ) w h I i w j h I x i y j w h          2015/10/19 10
  • 11.  Normalized cross correlation (NCC) The size of the template image and are the color-level averages of the template image and the windowed compared image. Similarity measure w h Tu Iu /2 /2 /2 /2 /2 /2 /2 /2 2 22 2 /2 /2 /2 /2 [ ( , ) ( , )] 3 ( , ) , ( , ) 3 ( , ) 3 w h I T i w j h c w h w h I T i w j h i w j h I x i y j T i j w h x y I x i y j w h T i j w h                                                  R G B( , ) ( , ), ( , ), ( , )T i j T i j T i j T i j  R G B( , ) ( , ), ( , ), ( , )I x i y j I x i y j I x i y j I x i y j         /2 /2 R G B /2 /2 1 [ ( , ) ( , ) ( , )] 3 w h T i w j h T i j T i j T i j w h           /2 /2 R G B /2 /2 1 [ ( , ) ( , ) ( , )] 3 w h I i w j h I x i x j I x i x j I x i x j w h                 2015/10/19 11
  • 12.  Shape-based matching (SBM) Similarity measure   , , 2 2 2 2 1 1 , , 1 1 , ,i i i i i i i i n n i x r y c i x r y ci i eg i ii i i i x r y c x r y c t v u w x y n n t u v w                   d e d e ( , )T i i it ud , ,( , )i i i i T i r c r cv we ( , )T i i ir cq gradients direction vector of template image gradients direction vector of scene image edge points, these points are relative to the center of gravity and the edge points of the object. 2015/10/19 12 Picture Ref: Halcon
  • 13. Category Features Approaches Disadvantages Feature-based Edge maps Interest point Invariant descriptors Orientation code Hausdorff distance Feature correspondence Zernike moment Dissimilarity measurement Ring-projection Inaccurate feature extraction Area-based Intensity Cross correlation Excessive computation time Recognition of 2D rigid objects Table. Summary of image alignment methods. 2015/10/19 13
  • 14. Recognition of 2D rigid objects Category Strategy Improving methods Area-based Computation enhancement Integral image GPU SIMD Skipping unnecessary computation Coarse-to-fine Bounded conditions Elimination strategy Winner update Walsh-Hadamard kernels Feature-based Computation enhancement SIMD Dominant gradients orientation Skipping unnecessary computation Branch and bound Invariant descriptor Bounded conditions Table. Summary of efficiency improving methods in image alignment methods. 2015/10/19 14
  • 15. Optimal (Coarse-to-fine) :  Image pyramid technique  SIMD  Search strategy Recognition of 2D rigid objects Template Image Input Image Matching Results Threshold of Similarity Score User Defined Detected Range of Rotation Angle Offline Model Generation Image Pyramid Technique Model Gengeration Online Matching Process Image Pyramid Technique Object Selection Process , 0, 1,..., 1m l lT l n  l n m lT Sub-pixel Accuracy Estimation minTh min maxand  Optimal Matching Process Detected Range of Scaling min maxandS S , 0, 1,..., 1m l lI l n  T I Detected Number of objects max Obj and training data 2015/10/19 15 pyramid level
  • 16.  Image pyramid technique  Pyramid levels: as high as possible.  Top level: recognizable. Recognition of 2D rigid objects Level 0 1 2 3 4 2015/10/19 16 Inspection image Level 0 Level 1 Level 2 Search region
  • 17.  Search strategy Recognition of 2D rigid objects Inspection image Level 0 Level 1 Level 2 Search region Level 2 Level 1 Level 0 2,1obj 1,1obj 1,2obj 1,3obj 1,4obj 0,{1,...,4}obj 0,5obj 0,6obj 0,7obj 0,8obj 0,{13,...,16}obj0,{9,...,12}obj 2,1obj 1,{1, , ,42 3 }obj 0,{5, , ,86 7 }obj Yellow: search candidates Red: with maximum similarity score and it exceeds the threshold of similarity minTh 2015/10/19 17
  • 18.  SIMD (Single Instruction Multiple Data ) • SSE2 (Streaming SIMD Extensions 2) • 8 registers, XMM0 to XMM7, each with a width of 16 bytes Recognition of 2D rigid objects __m128i _T __m128i _I T0 T1 0 16 32 ... 12811296 T6 T7 I0 I1 0 16 32 ... 12811296 I6 I7 __m128i _sum T0+I0 0 32 ... 12896 ... 16  T1+I1   T6+I6 T7+I7 112 __m128i _T __m128i _I T0 T1 0 16 32 ... 12811296 T6 T7 I0 I1 0 16 32 ... 12811296 I6 I7 __m128i _mul T0 I0 0 32 ... 12896 ... 16 T1 I1   T6 I6 T7 I7 112      __m128i _T __m128i _I T0 T1 0 16 32 ... 12811296 T6 T7 I0 I1 0 16 32 ... 12811296 I6 I7 __m128i _madd T0× I0+T1× I1 0 32 ... 12896 T6× I6+T7× I7 ...        _ _ 16 _ , __ _ mm add epiu Ts Im   _ _ 1 __ 6_ , _mm mullo epi T Imul   _ _ _ 16_ _ , _mm madd epi Td Imad  For instance, the size of the template and compared windows images are pixel8 8 Instruction cycles SSE2 469 Without SSE2 3736 8 times 2015/10/19 18
  • 19. Recognition of 2D rigid objects Template Image Input Image Matching Results Threshold of Similarity Score User Defined Detected Range of Rotation Angle Offline Model Generation Image Pyramid Technique Model Gengeration Online Matching Process Image Pyramid Technique Object Selection Process , 0, 1,..., 1m l lT l n  l n m lT Sub-pixel Accuracy Estimation minTh min maxand  Optimal Matching Process Detected Range of Scaling min maxandS S , 0, 1,..., 1m l lI l n  T I Detected Number of objects max Obj and training data 2015/10/19 19
  • 20. Recognition of 2D rigid objects ↓2 Level Level Level 0 Search Image ↓2 Similarity Coefficient Calculation 2D Rigid Transformation 2D Rigid Transformation Exhaustive Search Candidate Extraction and Ranking Detected Range of Rotation Angle and Scaling Threshold of Similarity Score Track Track 2D Rigid Transformation 2D Rigid Transformation First Tracking Track Track 2D Rigid Transformation 2D Rigid Transformation Final Tracking Sub-pixel Pose Estimation Candidate Ranking Threshold of Similarity Score Matching Results Local Maximum Local Maximum Results Similarity Coefficient Calculation Detected Range of Rotation Angle and Scaling Detected Range of Rotation Angle and Scaling max max max max ( , , , )ix y S    1l n  2l n  Inspection image Level 0 Level 1 Level 2 Search region 2015/10/19 20
  • 21. Recognition of 2D rigid objects Tracking Model Pyramid Similarity Coefficient Calculation Similarity Coefficient Calculation Candidate Transformed Image Pyramid Maximum Extraction Candidate max max max max , , , ( , , , ) arg max ( , , , ), [ , ], [ , ], [ ' 2 , ' 2 ] [ , ] x y s x x y y x y s x y s x x t x t y y t y t s s s s s                                    Tracking 2015/10/19 Inspection image Level 0 Level 1 Level 2 Search region 21
  • 22. Recognition of 2D rigid objects ↓2 Level Level Level 0 Search Image ↓2 Similarity Coefficient Calculation 2D Rigid Transformation 2D Rigid Transformation Exhaustive Search Candidate Extraction and Ranking Detected Range of Rotation Angle and Scaling Threshold of Similarity Score Track Track 2D Rigid Transformation 2D Rigid Transformation First Tracking Track Track 2D Rigid Transformation 2D Rigid Transformation Final Tracking Sub-pixel Pose Estimation Candidate Ranking Threshold of Similarity Score Matching Results Local Maximum Local Maximum Results Similarity Coefficient Calculation Detected Range of Rotation Angle and Scaling Detected Range of Rotation Angle and Scaling max max max max ( , , , )ix y S    1l n  2l n  Inspection image Level 0 Level 1 Level 2 Search region 2015/10/19 22
  • 23.  Sub-pixel estimation for NCC and SBM Recognition of 2D rigid objects 2 2 2 0 1 2 3 4 5 6 7 8 9( , , )x y k x k y k k xy k x k y k x k y k k               x y  1 0 1 101 1 0 1 x y  Maximum of * * * ( , , )x y  3 3 3  neighborhood space 0 0 0 0 0 0 0 0 [ 1, 1], [ 1, 1], [ ' , ' ] x x x y y y                   1* 0 3 4 6 * 3 1 5 7 * 4 5 2 8 2 2 2 x k k k k y k k k k k k k k                              2015/10/19 23
  • 25. Methods Strength weakness Stereo vision Simple and inexpensive High accuracy on well-defined targets Computational burden Limited to well defined scenes Low data acquisition rata Laser triangulation Relative simplicity High data acquisition rate High accuracy Safety constrains, if laser based Limited range Missing data in correspondence with occlusions and shadows High cost Structured light High data acquisition rate High accuracy Safety constrains, if laser based Missing data in correspondence with occlusions and shadows High cost Time of Flight Medium to large measurement range High data acquisition rate High cost Recognition of 3D objects 2015/10/19 25
  • 26. Methods Data Strength weakness View based CAD model or model image Efficiency Robust to occlusions, Clutter and contrast changes Low cost Limited accuracy on viewpoints Point Cloud based Point clouds High accuracy Robust to occlusions and clutter 3D sensor required High Computation complexity High cost Recognition of 3D objects Table. Summarized of 3D pose estimation techniques. 2015/10/19 26
  • 27.  Perspective shape-based matching Recognition of perspective objects Image coordinate Camerea coordinate Object plane coordinate Point on plane P Q extM projectionM R C 1 r c          Pixel coordinate Pixel point affineM 0 1 p q             X Y Z x y U V W , 1 1 affine projection ext u r v c w                     M M M 11 12 13 21 22 23 31 32 33 11 12 21 22 31 32 0 0 0 0 0 0 0 1 0 0 1 0 0 0 0 1 1 0 0 0 0 , 0 0 1 1 x y z x y z r r r t p x f r r r t q y f r r r t f r r t p f r r t q r r t                                                                        𝐑 𝐓 11 21 31 x r r r          r 12 22 32 y r r r          r ( , , ) ( ) ( ) ( )x y z x x y y z z     R R R R 2015/10/19 27
  • 28.  Perspective shape-based matching Recognition of perspective objects , , 2 2 2 2 1 , , 1 ( , , , , ) ,i i i i i i i i n i x r y c i x r y c perspective x y z i i i x r y c x r y c t v u w x y n t u v w                           cos( ) sin( ) , sin( ) cos( ) i iz z i iz z t t u u                     cos cos cos sin sin cos sin1 0 , cos sin cos cos sin sin sin0 1 y z z x y x zi i y z x z x y zi i r r c c                                    ( , )T i i it ud , ,( , )i i i i T i r c r cv we ( , )T i i ir cq gradients direction vector of template image gradients direction vector of scene image edge points, these points are relative to the center of gravity and the edge points of the object. , , 2 2 2 2 1 , , 1 ( , , , ) ,i i i i i i i i n i x r y c i x r y c eg i i i x r y c x r y c t v u w x y s n t u v w                         cos( ) sin( ) sin( ) cos( ) i i i i t ts s u us s                        cos( ) sin( ) sin( ) cos( ) i i i i r rs s c cs s                        2015/10/19 28 orthogonal projection weak-perspective projection
  • 29. Optimal (Coarse-to-fine) :  Image pyramid technique  SIMD  Search strategy Recognition of perspective objects Template Image Input Image Matching Results Threshold of Similarity Score User Defined Detected Range of Rotation Angle Offline Model Generation Image Pyramid Technique Model Gengeration Online Matching Process Image Pyramid Technique Object Selection Process , 0, 1,..., 1m l lT l n  l n minTh _ min _ max _ min _ max _ min _ max and , and , and x x y y z z       Optimal Matching Process , 0, 1,..., 1m l lI l n  T I Detected Number of objects max Obj , ,,l i l ip d ,l ie 2015/10/19 29
  • 30.  Rotation requirements Recognition of perspective objects (roll)x (pitch)y (yaw)z Image Plane x y z Camera (pitch)y (yaw)z x Original Image Plane Rotated Image Plane d l Camera (roll)x (yaw)z yOriginal Image Plane Rotated Image Plane d l Camera 1 1 cosx y l l              1 2 1 cos 1 2 z l           2015/10/19 30
  • 31. Recognition of perspective objects Inspection image Level 0 Level 1 Level 2 Search region ↓2 Level Level Level 0 Search Image ↓2 Similarity Coefficient Calculation 3D Rigid Transformation 3D Rigid Transformation Exhaustive Search Candidate Extraction and Ranking Detected Range of Rotation Angle Threshold of Similarity Score Track Track 3D Rigid Transformation 3D Rigid Transformation First Tracking Track Track 3D Rigid Transformation 3D Rigid Transformation Final Tracking Candidate Ranking Threshold of Similarity Score Matching Results Results Similarity Coefficient Calculation Detected Range of Rotation Angle Detected Range of Rotation Angle max max max max max_ _ _( , , , , )x y z ix y       1l n  2l n  2015/10/19 31
  • 32. Recognition of perspective objects max max max max max_ _ _ , , , ( , , , , ) arg max ( , , , , ), [ , ], [ , ], [ ' 2 , ' 2 ], [ ' 2 , ' 2 ], [ ' 2 , ' 2 ] x y z x y z x y s x x y y x x x x x y y y y y z z z z z x y x y x x t x t y y t y t                                                       Tracking Tracking Model Pyramid Similarity Coefficient Calculation Similarity Coefficient Calculation Candidate 3D Transformed Image Pyramid Maximum Extraction Candidate 2015/10/19 32
  • 33. Recognition of perspective objects 2015/10/19 33
  • 34.  Stereo vision Recognition of 3D objects 2015/10/19 34 l r fb Z x x   Ref: Stefano Mattoccia - http://vision.deis.unibo.it/~smatt/Seminars/StereoVision.pdf
  • 35.  Stereo vision Recognition of 3D objects 2015/10/19 35 Camera Calibration (offline) Rectification Stereo Correspondence Triangulation Calibration is carried out acquiring and processing 10+ stereo pairs of a known pattern (typically a checkerboard)  Intrinsic parameters of the two cameras (focal length, image center, parameters of lenses distortion, etc)  Extrinsic parameters (R and T that aligns the two cameras) Left i j i j ..(i,j) (i,j) Right
  • 36.  Laser triangulation Recognition of 3D objects 2015/10/19 36 Ref: http://www.vision-doctor.co.uk/laser-illumination/principle-of-triangulation.html
  • 37.  Structured light Recognition of 3D objects 2015/10/19 37
  • 39.  Kinect Recognition of 3D objects 2015/10/19 39
  • 40.  Random pattern projection Recognition of 3D objects 2015/10/19 40 Camera Laser RPP
  • 41.  Point Cloud Recognition of 3D objects 2015/10/19 41
  • 42.  Point Cloud Library (PCL) Recognition of 3D objects 2015/10/19 42 •Module common •Module features •Module filters •Module geometry •Module io •Module kdtree •Module keypoints •Module octree •Module outofcore •Module recognition •Module registration •Module sample_consensus •Module search •Module segmentation •Module surface •Module visualization The Point Cloud Library (or PCL) is a large scale, open project for point cloud processing, started by Willow Garage, with the purpose to accelerate 3D algorithmic work in perception for use in robotic applications. Ref: http://www.pointclouds.org/
  • 43.  Point Cloud Library (PCL) – Point Feature Histogram (PFH) Recognition of 3D objects 2015/10/19 43 1D 2D 3D 4D 5D 6D 9D 7D 8D qD [D D ] Ut sV    W U V  sD tD U W V tN sN U    t sD D sU N ( )t s t s D D U V D D     W U V  acos( )tV N   t sd D D  acos( )t sD D U d     atan( , )t tW N U N   
  • 44.  Point Cloud Library (PCL) – Point Feature Histogram (PFH) Recognition of 3D objects 2015/10/19 44 1 2 4 1 13 4 = (s , ) 2 atan( , ) t t s i i i it s i t t f V N f D D idx step fD D f U d f W N U N                     0, if > ( , ) 1, if f s step s f f s    1D 2D 3D 4D 5D 6D 9D 7D 8D qD 0 1 2 3 … 15 count
  • 45.  Point Cloud Library (PCL) – Fast Point Feature Histogram (FPFH) Recognition of 3D objects 2015/10/19 45 1D 4D 6D 7D 9D qD 5D 3D 8D 2D 1 1 1 ( ) ( ) ( ) k q q k i k FPFH D SPFH D SPFH D k w    1 2 4 1 13 4 = (s , ) 2 atan( , ) t t s i i i it s i t t f V N f D D idx step fD D f U d f W N U N                    
  • 46.  Point Cloud Library (PCL) – PFH- FPFH Recognition of 3D objects 2015/10/19 46 1D 4D 6D 7D 9D qD 5D 3D 8D 2D 1D 2D 3D 4D 5D 6D 9D 7D 8D qD PFH FPFH
  • 47.  Point Cloud Library (PCL) – registration Recognition of 3D objects 2015/10/19 47 R T R T Phase1 SAC-IA >>> Phase2 ICP >>> Template pose Search Sample Consensus Initial Alignment (SAC-IA) Iteration Closest Point (ICP)