RWDA

Abraham Monrroy

Project RWDA

Hypothesis
Objects in three-dimensional space can be classified
accurately through the analysis of Point Cloud data
acquired by LiDARs

What is a LiDAR?
A LiDAR (Light Detection and Ranging) is a sensor that measures distance by
illuminating a target with a laser and analyzing the reflected light.
X Y Z R
77.465 12.068 2.860 0.00
77.502 12.323 2.863 0.00
45.525 7.448 1.769 0.00
45.517 7.594 1.770 0.00
77.819 13.253 2.878 0.00
77.885 13.516 2.882 0.00
77.945 13.779 2.886 0.00
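
Each row of the sample above is one LiDAR return. As a minimal sketch, assuming X, Y, Z are metres in the sensor frame and R is the reflectance of the return (the slide does not define R), the measured range of each point is its Euclidean distance from the sensor origin:

```python
import math

# Sample returns copied from the slide: (x, y, z, r).
points = [
    (77.465, 12.068, 2.860, 0.00),
    (77.502, 12.323, 2.863, 0.00),
    (45.525, 7.448, 1.769, 0.00),
    (45.517, 7.594, 1.770, 0.00),
]


def point_range(point):
    """Distance from the sensor origin to a point, ignoring the R channel."""
    x, y, z, _r = point
    return math.sqrt(x * x + y * y + z * z)


ranges = [point_range(p) for p in points]
for p, d in zip(points, ranges):
    print(f"x={p[0]:.3f} y={p[1]:.3f} z={p[2]:.3f} -> range {d:.2f}")
```

The first two rows lie roughly 78 units from the sensor and the next two roughly 46, which suggests returns from two different surfaces.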

What is a PointCloud?
• A point cloud is a set of data points in some coordinate system.
• The points are usually intended to represent the external surface of an object.

Analyzed Dataset
The KITTI dataset is a collection of:
• Velodyne points (point clouds)
• Stereo images
• GPS localization points
It is free and available for academic use only.

Workflow
Data structure → Analyze → Train → Test → Result

Data structure
Input
• Velodyne Points in binary format
• JPEG Images (used only for reference visualization)
• Labels in XML format for each frame, given in the image coordinate system
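
A minimal sketch of reading the binary Velodyne input, assuming each point is stored as four consecutive little-endian float32 values (x, y, z, reflectance). The file name `frame.bin` and the helper `read_velodyne_bin` are hypothetical; the demo writes a tiny file first so that it runs on its own.

```python
import os
import struct
import tempfile

RECORD = struct.Struct("<4f")  # assumed layout: x, y, z, reflectance as float32


def read_velodyne_bin(path):
    """Return a list of (x, y, z, r) tuples from a flat float32 binary file."""
    with open(path, "rb") as f:
        data = f.read()
    usable = len(data) - len(data) % RECORD.size  # ignore a trailing partial record
    return list(RECORD.iter_unpack(data[:usable]))


# Round-trip demo with a hypothetical two-point scan.
sample = [(77.465, 12.068, 2.860, 0.0), (45.525, 7.448, 1.769, 0.0)]
with tempfile.TemporaryDirectory() as tmp:
    path = os.path.join(tmp, "frame.bin")
    with open(path, "wb") as f:
        for p in sample:
            f.write(RECORD.pack(*p))
    cloud = read_velodyne_bin(path)

print(len(cloud), "points; first:", cloud[0])
```

With NumPy available, `numpy.fromfile(path, dtype=numpy.float32).reshape(-1, 4)` is a common equivalent for the same layout.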
My Processed Output
• Labeled Velodyne points stored in a Matlab structure array.
• Velodyne points extracted for each labeled object (mapped from the 2D labels to the 3D points).
• 2D images generated from the 3D points.
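
The slides do not show how the 2D labels were mapped to 3D points. One plausible approach, sketched here under stated assumptions, is to project every point into the image and keep those that land inside a labeled box. The pinhole intrinsics (`fx`, `fy`, `cx`, `cy`), the box, and the assumption that points are already in the camera frame are all hypothetical; a real pipeline would use the dataset's calibration files.

```python
def project(point, fx, fy, cx, cy):
    """Pinhole projection of a camera-frame point (x right, y down, z forward)."""
    x, y, z = point
    return (fx * x / z + cx, fy * y / z + cy)


def points_in_box(points, box, fx=700.0, fy=700.0, cx=600.0, cy=180.0):
    """Keep the 3D points whose projection lies inside a 2D label box.

    box is (u_min, v_min, u_max, v_max) in pixels; the intrinsics are made up.
    """
    u_min, v_min, u_max, v_max = box
    kept = []
    for p in points:
        if p[2] <= 0.0:  # behind the camera, cannot appear in the image
            continue
        u, v = project(p, fx, fy, cx, cy)
        if u_min <= u <= u_max and v_min <= v <= v_max:
            kept.append(p)
    return kept


cloud = [
    (0.0, 0.0, 10.0),   # projects to the principal point (600, 180)
    (5.0, 0.0, 10.0),   # projects to u = 950, outside the box
    (0.5, 0.2, 10.0),   # projects to (635, 194)
    (0.0, 0.0, -5.0),   # behind the camera
]
car_box = (550.0, 150.0, 700.0, 250.0)  # hypothetical label box
car_points = points_in_box(cloud, car_box)
print(car_points)
```

The same `project` function, applied to every point, also gives the pixel coordinates needed to render a 2D image from the 3D points.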

Analysis
Approaches:
1. Project the 3D points into a 2D image and extract SURF, ORB,
and HOG features.
• Result: Failed. The projected image is not complex
enough, so too few features were found to perform
data analysis.
2. Use the 3D points directly to obtain descriptive
features.
• Result: Success. Several feature descriptors
exist for 3D point clouds.

Features
• Extract local features from the now segmented Velodyne points.

Feature  Texture/Color  Scope     Best use case
PFH      No             Local     -
FPFH     No             Local     2.5D scans (pseudo single-position range images)
VFH      No             Global    Object detection with basic pose estimation
CVFH     No             Regional  Object detection with basic pose estimation; detection of partial objects
RIFT     Yes            Local     Real-world 3D scans with no mirror effects; RIFT is vulnerable to flipping
RSD      No             Local     -
NARF     No             Local     2.5D (range images)

FPFH (Fast Point Feature Histogram)
Input Format
• A point cloud consisting of a set of oriented points P. Oriented means
that all points have a normal n.
• This feature does not make use of color information.
Output
• A 33-bin histogram, stored as a vector, for each point.
Radu Bogdan Rusu, Nico Blodow, Michael Beetz
Technische Universität München
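
The 33 values per point can be read as three angular features with 11 bins each. The sketch below is a simplified illustration, not the PCL implementation: it computes only the per-point building block (often called SPFH) from Darboux-frame angles between a query point and its neighbors, and omits the neighbor-weighted combination step of full FPFH. The helper names and the normalisation to 100 per sub-histogram are choices made here.

```python
import math

BINS = 11  # 3 angular features x 11 bins = 33 values per point


def sub(a, b):
    return (a[0] - b[0], a[1] - b[1], a[2] - b[2])


def dot(a, b):
    return a[0] * b[0] + a[1] * b[1] + a[2] * b[2]


def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def pair_features(p_s, n_s, p_t, n_t):
    """Angles (alpha, phi, theta) for one oriented point pair, or None if degenerate."""
    d = sub(p_t, p_s)
    dist = math.sqrt(dot(d, d))
    if dist == 0.0:
        return None
    d = (d[0] / dist, d[1] / dist, d[2] / dist)
    u = n_s                      # first axis: the source normal
    v = cross(u, d)              # second axis: perpendicular to normal and line
    v_len = math.sqrt(dot(v, v))
    if v_len == 0.0:
        return None              # normal is parallel to the connecting line
    v = (v[0] / v_len, v[1] / v_len, v[2] / v_len)
    w = cross(u, v)              # third axis completes the frame
    alpha = dot(v, n_t)                            # in [-1, 1]
    phi = dot(u, d)                                # in [-1, 1]
    theta = math.atan2(dot(w, n_t), dot(u, n_t))   # in [-pi, pi]
    return alpha, phi, theta


def bin_index(value, lo, hi):
    i = int((value - lo) / (hi - lo) * BINS)
    return min(max(i, 0), BINS - 1)


def spfh(query, query_normal, neighbors):
    """33-value histogram for one point; neighbors is a list of (point, normal)."""
    hist = [0.0] * (3 * BINS)
    used = 0
    for p, n in neighbors:
        feats = pair_features(query, query_normal, p, n)
        if feats is None:
            continue
        alpha, phi, theta = feats
        hist[bin_index(alpha, -1.0, 1.0)] += 1
        hist[BINS + bin_index(phi, -1.0, 1.0)] += 1
        hist[2 * BINS + bin_index(theta, -math.pi, math.pi)] += 1
        used += 1
    if used:
        hist = [100.0 * h / used for h in hist]  # each sub-histogram sums to 100
    return hist


# Flat patch: every neighbor lies in the plane z = 0 and shares the normal +z.
up = (0.0, 0.0, 1.0)
patch = [((1.0, 0.0, 0.0), up), ((0.0, 1.0, 0.0), up),
         ((-1.0, 0.0, 0.0), up), ((0.0, -1.0, 0.0), up)]
hist = spfh((0.0, 0.0, 0.0), up, patch)
print(len(hist), [i for i, h in enumerate(hist) if h > 0])
```

On a perfectly flat patch all three angles are zero, so each sub-histogram puts all of its mass in its middle bin; curved or cornered surfaces spread the mass across bins, which is what makes the descriptor discriminative.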
[Diagram: Matlab (MAT files), CSV files, and the PointCloud Library (PCL, C++ API; PCD files)]
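
The slide names MAT, CSV, and PCD files alongside PCL but does not state the exact conversion route. As one hedged possibility, assuming points were exported as CSV rows of `x,y,z,r`, the sketch below writes them as an ASCII PCD file that PCL can load. Mapping R to a field called `intensity` is an assumption, and `csv_to_pcd` is a hypothetical helper.

```python
import csv
import io


def csv_to_pcd(csv_text):
    """Convert CSV rows 'x,y,z,r' into the text of an ASCII PCD file."""
    rows = [tuple(float(v) for v in row)
            for row in csv.reader(io.StringIO(csv_text)) if row]
    header = [
        "# .PCD v0.7 - Point Cloud Data file format",
        "VERSION 0.7",
        "FIELDS x y z intensity",   # assumed name for the R column
        "SIZE 4 4 4 4",
        "TYPE F F F F",
        "COUNT 1 1 1 1",
        f"WIDTH {len(rows)}",
        "HEIGHT 1",                 # unorganized cloud
        "VIEWPOINT 0 0 0 1 0 0 0",
        f"POINTS {len(rows)}",
        "DATA ascii",
    ]
    body = [" ".join(f"{v:.3f}" for v in row) for row in rows]
    return "\n".join(header + body) + "\n"


csv_text = "77.465,12.068,2.860,0.00\n45.525,7.448,1.769,0.00\n"
pcd_text = csv_to_pcd(csv_text)
print(pcd_text)
```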

SVM Training / Tests / Results
SVM type                 Accuracy
Multiclass model         72.8627%
Car binary model         85.8886%
Pedestrian binary model  96.3387%
Van binary model         90.1602%

Training set  #Objects
1             572
2             56
3             705
4             1754
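
The slides do not say which SVM implementation or kernel produced these accuracies. As an illustration only, the sketch below turns multiclass labels into one-vs-rest labels, as a "Car binary model" would need, and trains a linear SVM with a Pegasos-style stochastic subgradient solver. That solver is a swapped-in technique, not necessarily the author's, and the 2-D toy vectors stand in for real 33-value descriptors. All names and data are hypothetical.

```python
import random


def binarize(labels, positive):
    """One-vs-rest labels: +1 for the chosen class, -1 for everything else."""
    return [1 if label == positive else -1 for label in labels]


def train_linear_svm(samples, labels, lam=0.01, epochs=20, seed=0):
    """Pegasos-style subgradient descent on the hinge loss (no bias term)."""
    rng = random.Random(seed)
    w = [0.0] * len(samples[0])
    order = list(range(len(samples)))
    t = 0
    for _ in range(epochs):
        rng.shuffle(order)
        for i in order:
            t += 1
            eta = 1.0 / (lam * t)  # decaying step size
            x, y = samples[i], labels[i]
            margin = y * sum(wj * xj for wj, xj in zip(w, x))
            # Shrink w (regularisation), then step towards margin violators.
            w = [(1.0 - eta * lam) * wj for wj in w]
            if margin < 1.0:
                w = [wj + eta * y * xj for wj, xj in zip(w, x)]
    return w


def predict(w, x):
    return 1 if sum(wj * xj for wj, xj in zip(w, x)) >= 0.0 else -1


def accuracy(w, samples, labels):
    hits = sum(predict(w, x) == y for x, y in zip(samples, labels))
    return 100.0 * hits / len(samples)


# Toy, well-separated data (hypothetical stand-ins for per-object descriptors).
samples = [(2.0, 1.5), (1.5, 2.5), (2.5, 2.0), (3.0, 1.0),
           (-2.0, -1.5), (-1.5, -2.5), (-2.5, -2.0), (-1.0, -3.0)]
classes = ["car"] * 4 + ["pedestrian", "van", "pedestrian", "van"]

binary = binarize(classes, "car")
w = train_linear_svm(samples, binary)
print(f"car-vs-rest accuracy on the toy set: {accuracy(w, samples, binary):.4f}%")
```

The toy accuracy is measured on the training points only to keep the example short; figures like those in the table would normally come from a held-out test set.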

Todo Work
• Test other feature extractors.
• Train with more observations.
• Develop an object detector that works directly on the point cloud; it would be
useful for other purposes such as perception systems.

1. Real World Data Analysis Final Project: Object Classification on 3D Environments. Abraham Monrroy Cano, 2014/01/29

13. Thank you for your attention.
