SlideShare a Scribd company logo
http://www.lsr.ei.tum.d
e
www.lsr.ei.tum.de
Lehrstuhl für Steuerungs- und Regelungstechnik
Technische Universität München
Forschungspraxis : Object
recognition and pose estimation
from an RGB-D image
Chiraz Nafouki Supervisor: Shile Li
chiraz.nafouki@tum.de li.shile@mytum.de
Motivation
2
Why 3D based object recognition and 6-DOF
pose estimation?
●
Many robotic applications rely on object recognition
and 6-DOF pose estimation.
●
Emergence of low-cost RGB-D devices.
3
PointCloudObjectP={p1,p2, p3,...,pN},withpi=(x,y,z,r,g,b)T
P
Globalfeaturedescriptor ⃗f=(f1, f 2, ...,fm)
Problem : Develop a global feature descriptor that combines
geometry and color information for object recognition and pose estimation.
Problem Formulation
P
Keypointsextraction
Localfeaturedescriptors
⃗f1 ,⃗f2 ,...⃗f N
Related Work
4
Viewpoint Feature Histogram (VFH):
●
Global descriptor of size 308.
●
Surflet-pair relation between centroid and other points.
●
Drawbacks: Only geometric information is used.
[Radu et. al. 2010]
Two objects with similar shapes
and different colors
β=arccos(
n5 .c
∥c∥
)
α=v.n5
Φ=u.
(p5−c)
d
θ=arctan(w.n5 ,u.n5)
d=∥p5−c∥
5
Pipeline of object recognition and pose estimation
Object modeling Feature extraction Database
Segmentation
& clustering
Feature extraction
Recognition
(matching)
Pose
optimization
Synthetic views
generation
Object recognition pipeline
Real scene
Point cloud
Testing data
Training data
6
Pipeline of object recognition and pose estimation
Object modeling Feature extraction Database
Segmentation
& clustering
Feature extraction
Recognition
(matching)
Pose
optimization
Synthetic views
generation
Object recognition pipeline
Real scene
Point cloud
Object Modeling and synthetic
views generation
7
- Generate 1260 synthetic views for each 3D object model.
- Compute a feature descriptor for each synthetic view.
3D Object models used in the
training dataset
Example of synthetic views of
two object models
Synthetic views generation: Naïve
Sampling and uniform sampling
Naïve Sampling Uniform Sampling
Naïve Sampling:
● Uniformly sampling each Euler angle independetly.
● Results in oversampling near polar regions.
Uniform sampling:
● Uniformly sampling the pitch angle.
● Use the inverse cosine for the yaw angle to avoid oversampling.
[James J.Kuffner 2004]
9
Pipeline of object recognition and pose estimation
Object modeling Feature extraction Database
Segmentation
& clustering
Feature extraction
Recognition
(matching)
Pose
optimization
Synthetic views
generation
Object recognition pipeline
Real scene
Point cloud
Segmentation and clustering
10
●
Extract the horizontal plane on which the objects lie using RANSAC algorithm.
●
Cluster the remaining points into individual objects.
Segmentation &
clustering
11
Pipeline of object recognition and pose estimation
Object modeling Feature extraction Database
Segmentation
& clustering
Feature extraction
Recognition
(matching)
Pose
optimization
Synthetic views
generation
Object recognition pipeline
Real scene
Point cloud
12
Geometry
Information
Correlation between
Color & geometry
⃗f
Feature extraction
The feature descriptor is composed of two parts:
Part one : three features between random point pairs
●
Distance :
●
Angle with normals : ,
●
Angle with the viewpoint direction :
Each of the three features is encoded
in a 30-bin histogram.
d(pi , pj)=∥pi−pj∥
(pi , pj).
a1=^(ni ,⃗pi pj) a2=^(nj ,⃗pj pi)
a3=^(⃗v ,⃗pi pj)
Feature extraction
13
Part two :
●
Points are classified into 10 subregions
based on their distance to the centroid.
●
Color information is encoded using CIELab
color space in a 30-bin histogram.
●
Color and geometry are correlated.
⃗f
Definition of regions for an object
14
Pipeline of object recognition and pose estimation
Object modeling Feature extraction Database
Segmentation
& clustering
Feature extraction
Recognition
(matching)
Pose
optimization
Synthetic views
generation
Object recognition pipeline
Real scene
Point cloud
Object Recognition and pose
estimation
15
●
Given an object with a global descriptor :
Find the k-nearest neighbours to in the database.
●
Chi-squared distance is used :
Object from
real scene
⃗fFeature
extraction
⃗f
Database
Find k-nearest
neighbours
⃗f
⃗f 1
⃗f 2
⃗f k
...
Object recognition
and pose
estimation
16
Pipeline of object recognition and pose estimation
Object modeling Feature extraction Database
Segmentation
& clustering
Feature extraction
Recognition
(matching)
Pose
optimization
Synthetic views
generation
Object recognition pipeline
Real scene
Point cloud
Pose Optimization
17
●
Using Iterative Closest Point (ICP).
●
ICP aims at minmizing the distance between two point clouds.
●
Find optimal rotation and translation to apply on the source point
cloud to match the reference.
Object from
real scene
(reference)
Retrieved nearest
neighbour from database
(source)
ICP
Optimize the pose of the
source point cloud
Experimental Results
18
Experimental setup
● Online recognition (integration into Ros node).
● Recognition rate of 94% for online objects.
● Recognition rate of 100% for synthetic views.
Superposition of the real object with the
recognized object in its estimated pose
Experimental Results
19
Occlusion handling
20
Summary
Feature part:
● Combining color and geometry information in a feature descriptor.
Training part:
● Uniformly distributed synthetic views generation (uniform sampling).
Testing part:
● Segmentation and clustering.
● Recognition (matching).
● Pose refinement after recognition (ICP).
Evaluation:
● Recognition rate.
● Pose accuracy.
● Occlusion handling.
21
Future Work
●
Make the second part of the descriptor independent of the centroid.
●
Use Locality-Sensitive-Hashing (LSH) algorithm instead of kd-tree.
●
Test the descriptor with uniformly sampled training data.
●
Estimate quantitatively the pose accuracy.
References
23
Radu Bogdan Rusu, Gary Bradski, Romain Thibaux and John Hsu.
Fast 3D Recognition and Pose Using the Viewpoint Feature Histogram.
In Proc. of the Int. Conf. on Intelligent Robot Systems (IROS), 2010, pp. 3467–3474.
Wei Wang , Shile Li , Lili Chen , Dongming Chen and Kolja Kühnlenz.
Fast object recognition and 6d pose estimation using viewpoint oriented color-shape
histogram.
In Multimedia and Expo (ICME), 2013 IEEE International Conference on, pages 1–6. IEEE, 2013.
James J.Kuffner
Effective Sampling and Distance Metrics for 3D Rigid Body Path Planning.
In Proc. of the Int. Conf. on Robotics and Automation (ICRA), 2004.
Walter Wohlkinger and Markus Vincze
Ensemble of Shape Functions for 3D Object Classification.
In Proc. of the Int. Conf. on Robotics and Biomimetics, 2011.

More Related Content

What's hot

Computer Graphics and Multimedia Techniques Paper (RTU VI Semester)
Computer Graphics and Multimedia Techniques Paper (RTU VI Semester)Computer Graphics and Multimedia Techniques Paper (RTU VI Semester)
Computer Graphics and Multimedia Techniques Paper (RTU VI Semester)
FellowBuddy.com
 
Low-Resolution Contour Recognition for Hexagonal Grid Images
Low-Resolution Contour Recognition for Hexagonal Grid Images Low-Resolution Contour Recognition for Hexagonal Grid Images
Low-Resolution Contour Recognition for Hexagonal Grid Images
Kazi Mostafa
 
I. AO* SEARCH ALGORITHM
I. AO* SEARCH ALGORITHMI. AO* SEARCH ALGORITHM
I. AO* SEARCH ALGORITHM
vikas dhakane
 
Computer Graphics - Hidden Line Removal Algorithm
Computer Graphics - Hidden Line Removal AlgorithmComputer Graphics - Hidden Line Removal Algorithm
Computer Graphics - Hidden Line Removal Algorithm
Jyotiraman De
 
Computer Graphics (Hidden surfaces and line removal, Curves and surfaces, Sur...
Computer Graphics (Hidden surfaces and line removal, Curves and surfaces, Sur...Computer Graphics (Hidden surfaces and line removal, Curves and surfaces, Sur...
Computer Graphics (Hidden surfaces and line removal, Curves and surfaces, Sur...
rohitjasudkar
 
Visible surface detection methods
Visible surface detection methodsVisible surface detection methods
Visible surface detection methods
KABILESH RAMAR
 
Weakly supervised semantic segmentation of 3D point cloud
Weakly supervised semantic segmentation of 3D point cloudWeakly supervised semantic segmentation of 3D point cloud
Weakly supervised semantic segmentation of 3D point cloud
Arithmer Inc.
 
Hidden line removal algorithm
Hidden line removal algorithmHidden line removal algorithm
Hidden line removal algorithm
KKARUNKARTHIK
 
2018 IMSM: MIT Lincoln Laboratories Problem Presentation - Light Reflections...
2018 IMSM: MIT Lincoln Laboratories  Problem Presentation - Light Reflections...2018 IMSM: MIT Lincoln Laboratories  Problem Presentation - Light Reflections...
2018 IMSM: MIT Lincoln Laboratories Problem Presentation - Light Reflections...
The Statistical and Applied Mathematical Sciences Institute
 
07 a70110 remotesensingandgisapplications
07 a70110 remotesensingandgisapplications07 a70110 remotesensingandgisapplications
07 a70110 remotesensingandgisapplications
imaduddin91
 
Lecture 21 problem reduction search ao star search
Lecture 21 problem reduction search ao star searchLecture 21 problem reduction search ao star search
Lecture 21 problem reduction search ao star search
Hema Kashyap
 
Test
TestTest
Test
Kinni MEW
 
Hidden surface removal algorithm
Hidden surface removal algorithmHidden surface removal algorithm
Hidden surface removal algorithm
KKARUNKARTHIK
 
Ivan Sahumbaiev "Deep Learning approaches meet 3D data"
Ivan Sahumbaiev "Deep Learning approaches meet 3D data"Ivan Sahumbaiev "Deep Learning approaches meet 3D data"
Ivan Sahumbaiev "Deep Learning approaches meet 3D data"
Fwdays
 
CLIM: Transition Workshop - Statistical Emulation with Dimension Reduction fo...
CLIM: Transition Workshop - Statistical Emulation with Dimension Reduction fo...CLIM: Transition Workshop - Statistical Emulation with Dimension Reduction fo...
CLIM: Transition Workshop - Statistical Emulation with Dimension Reduction fo...
The Statistical and Applied Mathematical Sciences Institute
 
Polygon filling
Polygon fillingPolygon filling
Polygon filling
Ankit Garg
 
Discrete Math Lab Cheminformatics Joint Project
Discrete Math Lab Cheminformatics Joint ProjectDiscrete Math Lab Cheminformatics Joint Project
Discrete Math Lab Cheminformatics Joint Project
京都大学大学院情報学研究科数理工学専攻
 
Relief Clipping Planes (SIGGRAPH ASIA 2008)
Relief Clipping Planes (SIGGRAPH ASIA 2008)Relief Clipping Planes (SIGGRAPH ASIA 2008)
Relief Clipping Planes (SIGGRAPH ASIA 2008)
Matthias Trapp
 
Compound Structure Detection
Compound Structure DetectionCompound Structure Detection
Compound Structure Detection
Huseyin Gokhan Akcay
 
aip shape detection and tracking using contours
aip shape detection and tracking using contoursaip shape detection and tracking using contours
aip shape detection and tracking using contours
Saeed Ullah
 

What's hot (20)

Computer Graphics and Multimedia Techniques Paper (RTU VI Semester)
Computer Graphics and Multimedia Techniques Paper (RTU VI Semester)Computer Graphics and Multimedia Techniques Paper (RTU VI Semester)
Computer Graphics and Multimedia Techniques Paper (RTU VI Semester)
 
Low-Resolution Contour Recognition for Hexagonal Grid Images
Low-Resolution Contour Recognition for Hexagonal Grid Images Low-Resolution Contour Recognition for Hexagonal Grid Images
Low-Resolution Contour Recognition for Hexagonal Grid Images
 
I. AO* SEARCH ALGORITHM
I. AO* SEARCH ALGORITHMI. AO* SEARCH ALGORITHM
I. AO* SEARCH ALGORITHM
 
Computer Graphics - Hidden Line Removal Algorithm
Computer Graphics - Hidden Line Removal AlgorithmComputer Graphics - Hidden Line Removal Algorithm
Computer Graphics - Hidden Line Removal Algorithm
 
Computer Graphics (Hidden surfaces and line removal, Curves and surfaces, Sur...
Computer Graphics (Hidden surfaces and line removal, Curves and surfaces, Sur...Computer Graphics (Hidden surfaces and line removal, Curves and surfaces, Sur...
Computer Graphics (Hidden surfaces and line removal, Curves and surfaces, Sur...
 
Visible surface detection methods
Visible surface detection methodsVisible surface detection methods
Visible surface detection methods
 
Weakly supervised semantic segmentation of 3D point cloud
Weakly supervised semantic segmentation of 3D point cloudWeakly supervised semantic segmentation of 3D point cloud
Weakly supervised semantic segmentation of 3D point cloud
 
Hidden line removal algorithm
Hidden line removal algorithmHidden line removal algorithm
Hidden line removal algorithm
 
2018 IMSM: MIT Lincoln Laboratories Problem Presentation - Light Reflections...
2018 IMSM: MIT Lincoln Laboratories  Problem Presentation - Light Reflections...2018 IMSM: MIT Lincoln Laboratories  Problem Presentation - Light Reflections...
2018 IMSM: MIT Lincoln Laboratories Problem Presentation - Light Reflections...
 
07 a70110 remotesensingandgisapplications
07 a70110 remotesensingandgisapplications07 a70110 remotesensingandgisapplications
07 a70110 remotesensingandgisapplications
 
Lecture 21 problem reduction search ao star search
Lecture 21 problem reduction search ao star searchLecture 21 problem reduction search ao star search
Lecture 21 problem reduction search ao star search
 
Test
TestTest
Test
 
Hidden surface removal algorithm
Hidden surface removal algorithmHidden surface removal algorithm
Hidden surface removal algorithm
 
Ivan Sahumbaiev "Deep Learning approaches meet 3D data"
Ivan Sahumbaiev "Deep Learning approaches meet 3D data"Ivan Sahumbaiev "Deep Learning approaches meet 3D data"
Ivan Sahumbaiev "Deep Learning approaches meet 3D data"
 
CLIM: Transition Workshop - Statistical Emulation with Dimension Reduction fo...
CLIM: Transition Workshop - Statistical Emulation with Dimension Reduction fo...CLIM: Transition Workshop - Statistical Emulation with Dimension Reduction fo...
CLIM: Transition Workshop - Statistical Emulation with Dimension Reduction fo...
 
Polygon filling
Polygon fillingPolygon filling
Polygon filling
 
Discrete Math Lab Cheminformatics Joint Project
Discrete Math Lab Cheminformatics Joint ProjectDiscrete Math Lab Cheminformatics Joint Project
Discrete Math Lab Cheminformatics Joint Project
 
Relief Clipping Planes (SIGGRAPH ASIA 2008)
Relief Clipping Planes (SIGGRAPH ASIA 2008)Relief Clipping Planes (SIGGRAPH ASIA 2008)
Relief Clipping Planes (SIGGRAPH ASIA 2008)
 
Compound Structure Detection
Compound Structure DetectionCompound Structure Detection
Compound Structure Detection
 
aip shape detection and tracking using contours
aip shape detection and tracking using contoursaip shape detection and tracking using contours
aip shape detection and tracking using contours
 

Similar to final_presentation

RWDA
RWDARWDA
[DL輪読会]ClearGrasp
[DL輪読会]ClearGrasp[DL輪読会]ClearGrasp
[DL輪読会]ClearGrasp
Deep Learning JP
 
3-d interpretation from single 2-d image for autonomous driving II
3-d interpretation from single 2-d image for autonomous driving II3-d interpretation from single 2-d image for autonomous driving II
3-d interpretation from single 2-d image for autonomous driving II
Yu Huang
 
Lidar for Autonomous Driving II (via Deep Learning)
Lidar for Autonomous Driving II (via Deep Learning)Lidar for Autonomous Driving II (via Deep Learning)
Lidar for Autonomous Driving II (via Deep Learning)
Yu Huang
 
fusion of Camera and lidar for autonomous driving II
fusion of Camera and lidar for autonomous driving IIfusion of Camera and lidar for autonomous driving II
fusion of Camera and lidar for autonomous driving II
Yu Huang
 
final_report
final_reportfinal_report
final_report
Chiraz Nafouki
 
Point-GNN: Graph Neural Network for 3D Object Detection in a Point Cloud
Point-GNN: Graph Neural Network for 3D Object Detection in a Point CloudPoint-GNN: Graph Neural Network for 3D Object Detection in a Point Cloud
Point-GNN: Graph Neural Network for 3D Object Detection in a Point Cloud
Nuwan Sriyantha Bandara
 
Object Pose Estimation
Object Pose EstimationObject Pose Estimation
Object Pose Estimation
Arithmer Inc.
 
APPLYING R-SPATIOGRAM IN OBJECT TRACKING FOR OCCLUSION HANDLING
APPLYING R-SPATIOGRAM IN OBJECT TRACKING FOR OCCLUSION HANDLINGAPPLYING R-SPATIOGRAM IN OBJECT TRACKING FOR OCCLUSION HANDLING
APPLYING R-SPATIOGRAM IN OBJECT TRACKING FOR OCCLUSION HANDLING
sipij
 
3-d interpretation from stereo images for autonomous driving
3-d interpretation from stereo images for autonomous driving3-d interpretation from stereo images for autonomous driving
3-d interpretation from stereo images for autonomous driving
Yu Huang
 
Orb feature by nitin
Orb feature by nitinOrb feature by nitin
Orb feature by nitin
NitinMauryaKashipur
 
Tracking Robustness and Green View Index Estimation of Augmented and Diminish...
Tracking Robustness and Green View Index Estimation of Augmented and Diminish...Tracking Robustness and Green View Index Estimation of Augmented and Diminish...
Tracking Robustness and Green View Index Estimation of Augmented and Diminish...
Tomohiro Fukuda
 
fusion of Camera and lidar for autonomous driving I
fusion of Camera and lidar for autonomous driving Ifusion of Camera and lidar for autonomous driving I
fusion of Camera and lidar for autonomous driving I
Yu Huang
 
superglue_slides.pdf
superglue_slides.pdfsuperglue_slides.pdf
superglue_slides.pdf
XiaoguangHan3
 
Presentation Object Recognition And Tracking Project
Presentation Object Recognition And Tracking ProjectPresentation Object Recognition And Tracking Project
Presentation Object Recognition And Tracking Project
Prathamesh Joshi
 
3-d interpretation from single 2-d image IV
3-d interpretation from single 2-d image IV3-d interpretation from single 2-d image IV
3-d interpretation from single 2-d image IV
Yu Huang
 
Pose estimation from RGB images by deep learning
Pose estimation from RGB images by deep learningPose estimation from RGB images by deep learning
Pose estimation from RGB images by deep learning
Yu Huang
 
DEEP LEARNING TECHNIQUES POWER POINT PRESENTATION
DEEP LEARNING TECHNIQUES POWER POINT PRESENTATIONDEEP LEARNING TECHNIQUES POWER POINT PRESENTATION
DEEP LEARNING TECHNIQUES POWER POINT PRESENTATION
SelvaLakshmi63
 
PCA-SIFT: A More Distinctive Representation for Local Image Descriptors
PCA-SIFT: A More Distinctive Representation for Local Image DescriptorsPCA-SIFT: A More Distinctive Representation for Local Image Descriptors
PCA-SIFT: A More Distinctive Representation for Local Image Descriptors
wolf
 
Udacity-Didi Challenge Finalists
Udacity-Didi Challenge FinalistsUdacity-Didi Challenge Finalists
Udacity-Didi Challenge Finalists
David Silver
 

Similar to final_presentation (20)

RWDA
RWDARWDA
RWDA
 
[DL輪読会]ClearGrasp
[DL輪読会]ClearGrasp[DL輪読会]ClearGrasp
[DL輪読会]ClearGrasp
 
3-d interpretation from single 2-d image for autonomous driving II
3-d interpretation from single 2-d image for autonomous driving II3-d interpretation from single 2-d image for autonomous driving II
3-d interpretation from single 2-d image for autonomous driving II
 
Lidar for Autonomous Driving II (via Deep Learning)
Lidar for Autonomous Driving II (via Deep Learning)Lidar for Autonomous Driving II (via Deep Learning)
Lidar for Autonomous Driving II (via Deep Learning)
 
fusion of Camera and lidar for autonomous driving II
fusion of Camera and lidar for autonomous driving IIfusion of Camera and lidar for autonomous driving II
fusion of Camera and lidar for autonomous driving II
 
final_report
final_reportfinal_report
final_report
 
Point-GNN: Graph Neural Network for 3D Object Detection in a Point Cloud
Point-GNN: Graph Neural Network for 3D Object Detection in a Point CloudPoint-GNN: Graph Neural Network for 3D Object Detection in a Point Cloud
Point-GNN: Graph Neural Network for 3D Object Detection in a Point Cloud
 
Object Pose Estimation
Object Pose EstimationObject Pose Estimation
Object Pose Estimation
 
APPLYING R-SPATIOGRAM IN OBJECT TRACKING FOR OCCLUSION HANDLING
APPLYING R-SPATIOGRAM IN OBJECT TRACKING FOR OCCLUSION HANDLINGAPPLYING R-SPATIOGRAM IN OBJECT TRACKING FOR OCCLUSION HANDLING
APPLYING R-SPATIOGRAM IN OBJECT TRACKING FOR OCCLUSION HANDLING
 
3-d interpretation from stereo images for autonomous driving
3-d interpretation from stereo images for autonomous driving3-d interpretation from stereo images for autonomous driving
3-d interpretation from stereo images for autonomous driving
 
Orb feature by nitin
Orb feature by nitinOrb feature by nitin
Orb feature by nitin
 
Tracking Robustness and Green View Index Estimation of Augmented and Diminish...
Tracking Robustness and Green View Index Estimation of Augmented and Diminish...Tracking Robustness and Green View Index Estimation of Augmented and Diminish...
Tracking Robustness and Green View Index Estimation of Augmented and Diminish...
 
fusion of Camera and lidar for autonomous driving I
fusion of Camera and lidar for autonomous driving Ifusion of Camera and lidar for autonomous driving I
fusion of Camera and lidar for autonomous driving I
 
superglue_slides.pdf
superglue_slides.pdfsuperglue_slides.pdf
superglue_slides.pdf
 
Presentation Object Recognition And Tracking Project
Presentation Object Recognition And Tracking ProjectPresentation Object Recognition And Tracking Project
Presentation Object Recognition And Tracking Project
 
3-d interpretation from single 2-d image IV
3-d interpretation from single 2-d image IV3-d interpretation from single 2-d image IV
3-d interpretation from single 2-d image IV
 
Pose estimation from RGB images by deep learning
Pose estimation from RGB images by deep learningPose estimation from RGB images by deep learning
Pose estimation from RGB images by deep learning
 
DEEP LEARNING TECHNIQUES POWER POINT PRESENTATION
DEEP LEARNING TECHNIQUES POWER POINT PRESENTATIONDEEP LEARNING TECHNIQUES POWER POINT PRESENTATION
DEEP LEARNING TECHNIQUES POWER POINT PRESENTATION
 
PCA-SIFT: A More Distinctive Representation for Local Image Descriptors
PCA-SIFT: A More Distinctive Representation for Local Image DescriptorsPCA-SIFT: A More Distinctive Representation for Local Image Descriptors
PCA-SIFT: A More Distinctive Representation for Local Image Descriptors
 
Udacity-Didi Challenge Finalists
Udacity-Didi Challenge FinalistsUdacity-Didi Challenge Finalists
Udacity-Didi Challenge Finalists
 

final_presentation

  • 1. http://www.lsr.ei.tum.d e www.lsr.ei.tum.de Lehrstuhl für Steuerungs- und Regelungstechnik Technische Universität München Forschungspraxis : Object recognition and pose estimation from an RGB-D image Chiraz Nafouki Supervisor: Shile Li chiraz.nafouki@tum.de li.shile@mytum.de
  • 2. Motivation 2 Why 3D based object recognition and 6-DOF pose estimation? ● Many robotic applications rely on object recognition and 6-DOF pose estimation. ● Emergence of low-cost RGB-D devices.
  • 3. 3 PointCloudObjectP={p1,p2, p3,...,pN},withpi=(x,y,z,r,g,b)T P Globalfeaturedescriptor ⃗f=(f1, f 2, ...,fm) Problem : Develop a global feature descriptor that combines geometry and color information for object recognition and pose estimation. Problem Formulation P Keypointsextraction Localfeaturedescriptors ⃗f1 ,⃗f2 ,...⃗f N
  • 4. Related Work 4 Viewpoint Feature Histogram (VFH): ● Global descriptor of size 308. ● Surflet-pair relation between centroid and other points. ● Drawbacks: Only geometric information is used. [Radu et. al. 2010] Two objects with similar shapes and different colors β=arccos( n5 .c ∥c∥ ) α=v.n5 Φ=u. (p5−c) d θ=arctan(w.n5 ,u.n5) d=∥p5−c∥
  • 5. 5 Pipeline of object recognition and pose estimation Object modeling Feature extraction Database Segmentation & clustering Feature extraction Recognition (matching) Pose optimization Synthetic views generation Object recognition pipeline Real scene Point cloud Testing data Training data
  • 6. 6 Pipeline of object recognition and pose estimation Object modeling Feature extraction Database Segmentation & clustering Feature extraction Recognition (matching) Pose optimization Synthetic views generation Object recognition pipeline Real scene Point cloud
  • 7. Object Modeling and synthetic views generation 7 - Generate 1260 synthetic views for each 3D object model. - Compute a feature descriptor for each synthetic view. 3D Object models used in the training dataset Example of synthetic views of two object models
  • 8. Synthetic views generation: Naïve Sampling and uniform sampling Naïve Sampling Uniform Sampling Naïve Sampling: ● Uniformly sampling each Euler angle independetly. ● Results in oversampling near polar regions. Uniform sampling: ● Uniformly sampling the pitch angle. ● Use the inverse cosine for the yaw angle to avoid oversampling. [James J.Kuffner 2004]
  • 9. 9 Pipeline of object recognition and pose estimation Object modeling Feature extraction Database Segmentation & clustering Feature extraction Recognition (matching) Pose optimization Synthetic views generation Object recognition pipeline Real scene Point cloud
  • 10. Segmentation and clustering 10 ● Extract the horizontal plane on which the objects lie using RANSAC algorithm. ● Cluster the remaining points into individual objects. Segmentation & clustering
  • 11. 11 Pipeline of object recognition and pose estimation Object modeling Feature extraction Database Segmentation & clustering Feature extraction Recognition (matching) Pose optimization Synthetic views generation Object recognition pipeline Real scene Point cloud
  • 12. 12 Geometry Information Correlation between Color & geometry ⃗f Feature extraction The feature descriptor is composed of two parts: Part one : three features between random point pairs ● Distance : ● Angle with normals : , ● Angle with the viewpoint direction : Each of the three features is encoded in a 30-bin histogram. d(pi , pj)=∥pi−pj∥ (pi , pj). a1=^(ni ,⃗pi pj) a2=^(nj ,⃗pj pi) a3=^(⃗v ,⃗pi pj)
  • 13. Feature extraction 13 Part two : ● Points are classified into 10 subregions based on their distance to the centroid. ● Color information is encoded using CIELab color space in a 30-bin histogram. ● Color and geometry are correlated. ⃗f Definition of regions for an object
  • 14. 14 Pipeline of object recognition and pose estimation Object modeling Feature extraction Database Segmentation & clustering Feature extraction Recognition (matching) Pose optimization Synthetic views generation Object recognition pipeline Real scene Point cloud
  • 15. Object Recognition and pose estimation 15 ● Given an object with a global descriptor : Find the k-nearest neighbours to in the database. ● Chi-squared distance is used : Object from real scene ⃗fFeature extraction ⃗f Database Find k-nearest neighbours ⃗f ⃗f 1 ⃗f 2 ⃗f k ... Object recognition and pose estimation
  • 16. 16 Pipeline of object recognition and pose estimation Object modeling Feature extraction Database Segmentation & clustering Feature extraction Recognition (matching) Pose optimization Synthetic views generation Object recognition pipeline Real scene Point cloud
  • 17. Pose Optimization 17 ● Using Iterative Closest Point (ICP). ● ICP aims at minmizing the distance between two point clouds. ● Find optimal rotation and translation to apply on the source point cloud to match the reference. Object from real scene (reference) Retrieved nearest neighbour from database (source) ICP Optimize the pose of the source point cloud
  • 18. Experimental Results 18 Experimental setup ● Online recognition (integration into Ros node). ● Recognition rate of 94% for online objects. ● Recognition rate of 100% for synthetic views. Superposition of the real object with the recognized object in its estimated pose
  • 21. Summary Feature part: ● Combining color and geometry information in a feature descriptor. Training part: ● Uniformly distributed synthetic views generation (uniform sampling). Testing part: ● Segmentation and clustering. ● Recognition (matching). ● Pose refinement after recognition (ICP). Evaluation: ● Recognition rate. ● Pose accuracy. ● Occlusion handling. 21
  • 22. Future Work ● Make the second part of the descriptor independent of the centroid. ● Use Locality-Sensitive-Hashing (LSH) algorithm instead of kd-tree. ● Test the descriptor with uniformly sampled training data. ● Estimate quantitatively the pose accuracy.
  • 23. References 23 Radu Bogdan Rusu, Gary Bradski, Romain Thibaux and John Hsu. Fast 3D Recognition and Pose Using the Viewpoint Feature Histogram. In Proc. of the Int. Conf. on Intelligent Robot Systems (IROS), 2010, pp. 3467–3474. Wei Wang , Shile Li , Lili Chen , Dongming Chen and Kolja Kühnlenz. Fast object recognition and 6d pose estimation using viewpoint oriented color-shape histogram. In Multimedia and Expo (ICME), 2013 IEEE International Conference on, pages 1–6. IEEE, 2013. James J.Kuffner Effective Sampling and Distance Metrics for 3D Rigid Body Path Planning. In Proc. of the Int. Conf. on Robotics and Automation (ICRA), 2004. Walter Wohlkinger and Markus Vincze Ensemble of Shape Functions for 3D Object Classification. In Proc. of the Int. Conf. on Robotics and Biomimetics, 2011.