VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
ย
Stixel based real time object detection for ADAS using surface normal
1. Stixel-based Real Time Object detection for
ADAS using Surface Normal Vectors
CVLab. at Inha Univ.
Tae-Kang Woo
2016.12.
Keywords : Surface vector, Detection validation, Disparity confidence(Middle level representation confidence),
Stixel, Real-time ADAS, 3D reconstruction, Extrinsic parameter estimation, Object detection
2. Contents
I. Problem Definition
1. Problem
2. Goal
3. Related work
II. System Design
III. Surface Normal Vector
1. SNV map using integral image
2. Local SNV computation
IV. Super-SNV
1. S-SNV computation
2. Parametric issue
3. Adaptive mean shift
V. Evaluation
1. Test scenario & Database
2. Evaluation method
3. Experimental result & discussion
VI. Conclusion
2
Chap.1, 2
Chap.4
Chap.3
Chap.3
Chap.4
3. Introduction
โ Flow chart of ADAS Stereo vision
3
disparity V-disparity Ground remove
In disparity map
Remove the sky that is over 2.5m based on ground
Ground line Remove below ground plane(v-disparity line)
Height constraint
Left image
Stixel Stixel
segmentation
5. Purpose of Stereo vision
5
โ Goal
โช Stixel-based stereo vision module for real time ADAS
โข 15fps on TX1 or 30fps on PC
โช Stable hypothesis ROI for recognition module
โข Precision rate 10% improvement by removing error ROI
โช Object geometry feature analysis & classification using SNV
โข Propose 3 classes of forward situation based on Surface Normal Vector
โข Hypothesis ROI validation using surface vector object classification
โช Extrinsic parameter output between camera and ROI(ground, object)
โข The representative vector of the surface vectors in the ROI is selected
6. Previous approach โ object detection
6
โ A disparity map refinement to enhance weakly-textured urban
environment data (2013)
โช Research to overcome disparity error is most active.
โช Define refinement term using segmentation based on edge
โช Performance improves but takes more than 1,700ms
Result
7. Previous approach โ object detection
7
โ Disparity confidence map (2010)
โช Confidence map based on matching cost to enable disparity validation
โช There is a disadvantage that the reliability of the object unit is not provided
โช Reliability is not provided for interpolated disparity estimates
Image Disparity
Confidence
8. Previous approach โ object detection
8
โ U-V Disparity Map Analysis (2010, 2015)
โช Super pixel method based on 2D projection
โช It is assumed that the disparity exists in the object, and the object is detected by fitting the line
of each axis after projection.
โช Disparity Multiple errors occur when estimating to improve performance.
Test image
Result
disparity
V-disparity
U-disparity
9. Problem โ object detection
โ Hypothesis Error
9
Error Out-Noc Out-All
2 pixels 24.83 % 28.39 %
3 pixels 17.14 % 20.78 %
4 pixels 13.18 % 16.70 %
5 pixels 10.79 % 14.14 %
Deep Embed alg.
There are inevitable errors on reflective regions in spite of state-of-the-art method.
Z. Chen, X. Sun, Y. Yu, L. Wang and C. Huang: A Deep Visual Correspondence Embedding Model for Stereo Matching Costs. ICCV 2015.
10. System design
10
โ INNOVATION : Develop validation method with physical meaning
Left image
Right image
Distance
3D position
Multiple ROI
Bounding box
NERV Object Detection
Normal-based Efficient Re-Validation
Stixel
Estimation
Stixel
Segmentation
(objectness)stixels
Stereo
Matching
Disparity
map
Hypothesis
ValidationSurface Normal
map
BB with 3D
position
Surface
normal
computation
Depth feature for
RGB-D processing
Extrinsic parameter
Camera pose
11. Hypothesis ROI validation
โ Surface normal vector
11
A
B
C
๐ด
๐ต
๐ถ
๐ = ๐ด๐ต ร ๐ด๐ถ
A
B
C๐ = ๐ข ร ิฆ๐ฃ
๐ = {๐ฆ ๐ข ๐ง ๐ฃ โ ๐ง ๐ข ๐ฆ๐ฃ, ๐ฅ ๐ข ๐ง ๐ฃ โ ๐ง ๐ข ๐ฅ ๐ฃ, ๐ฅ ๐ข ๐ฆ๐ฃ โ ๐ฆ ๐ข ๐ฅ ๐ฃ}
๐ข
ิฆ๐ฃ
Real meter Surface Normal vector
12. Hypothesis ROI validation
โ Surface normal at Hypothesis Error
12
A
B
C
Surface Normal vector of Error area
Disparity error
13. Hypothesis ROI validation
โ Assumption
โช Normal vectors can represent a difference of object attributes.
โช Each normal vector has information : {position, direction, scale}
โช Surface normal can be divided into 3 part ( i.e. object, ground, error )
โ Goal
โช Find the differences among each part of surface normal
13
Object
Surface normal
Position : {x, y, z}
Direction : {i, j, k}
Scale : s
Ground
Surface normal
Position : {x, y, z}
Direction : {i, j, k}
Scale : s
Error
Surface normal
Position : {x, y, z}
Direction : {i, j, k}
Scale : s
Class of Surface Normal vector
14. Hypothesis ROI validation
โ Feature of normal in error region
14
High Density than others positionโs
Their normal donโt have
horizontal component
15. SNV Map Computation
โ How to compute normal vector? โ Naรฏve SNV
15
3D point cloud from disparity
Time : 29 ms
Surface normal
โข Generally, it has been considered that calculating a surface vector in an image is difficult to operate in real time.
16. SNV Map Computation
โ How to compute normal vectors efficiently? โ Integral image
16
๐ฎ(๐ผ๐, ๐, ๐, ๐) =
1
4๐2
ยท ( ๐ผ๐(๐ + ๐, ๐ + ๐) โ ๐ผ๐(๐ โ ๐, ๐ + ๐) โ ๐ผ๐(๐ + ๐, ๐ โ ๐) + ๐ผ๐(๐ โ ๐, ๐ โ ๐) )
๐ข ๐ฅ =
๐ซ๐ฅ ๐ + ๐, ๐ โ ๐ซ๐ฅ ๐ โ ๐, ๐
2
๐ข ๐ฆ =
๐ซ๐ฆ ๐ + ๐, ๐ โ ๐ซ๐ฆ ๐ โ ๐, ๐
2
๐ข ๐ง =
๐ฎ(๐ผ ๐ซ๐ง
, ๐ + 1, ๐, ๐ โ 1) โ ๐ฎ(๐ผ ๐ซ๐ง
, ๐ โ 1, ๐, ๐ โ 1)
2
๐ข ๐ฅ =
๐ซ๐ฅ ๐, ๐ + ๐ โ ๐ซ๐ฅ ๐, ๐ โ ๐
2
๐ข ๐ฆ =
๐ซ๐ฆ ๐, ๐ + ๐ โ ๐ซ๐ฆ ๐, ๐ โ ๐
2
๐ข ๐ง =
๐ฎ(๐ผ ๐ซ๐ง
, ๐, ๐ + 1, ๐ โ 1) โ ๐ฎ(๐ผ ๐ซ๐ง
, ๐, ๐ โ 1, ๐ โ 1)
2
โข where ๐ซ๐ฅ, ๐ซ๐ฆ, and ๐ซ๐ง are two-dimensional maps storing the x-, y-, and z-coordinates of the organized point cloud.
๐ผ ๐ซ๐ง
is the integral image of the z-components of the point cloud.
๐ means, โ ๐, ๐ = min(โฌ ๐, ๐ ,
๐ฏ ๐,๐
2
). ๏ง smoothing function depending on depth and depth change
๐ = ๐ข ร ิฆ๐ฃ
Holzer, S., Rusu, R. B., Dixon, M., Gedikli, S., & Navab, N. (2012, October). Adaptive neighborhood selection for real-time surface normal estimation from organized point cloud data using
integral images. In Intelligent Robots and Systems (IROS), 2012 IEEE/RSJ International Conference on (pp. 2684-2689). IEEE.
Processing time : 28 ms
At 2.26 GHz Intel(R) Core(TM)2 Quad CPU and 4 GB of RAM
VGA image 307200 pixels
Processing time : 12 ms
At 2.7 GHz Intel Core i7 CPU and 16 GB of RAM
VGA image 307200 pixels
17. SNV Map Computation
โ How to compute normal vectors in real time? โ Local SNV
17
๐ = ๐ข ร ิฆ๐ฃ
PROCESS
1. Compute SNV in stixel ROI
2. Convert coordinate to angle
3. Find the mode angle using
histogram
4. Remove outlier in adaptive
mean shift
5. Select the convergence value to
main extrinsic parameter
18. Ground information in surface vector
โ How to compute normal vector in real time? โ Local SNV
18
Original image Full search
2 pixel ground 5 pixel ground
Time: 28ms
Time: 5ms
19. Ground information in surface vector
โ How to compute normal vector efficiently? โ Local SNV + Super-SNV
19
X
Y
Z
-0.0~0.0
-1.0~-0.9
-0.1~-0.0
The angle of the surface of the object can be calculated. ๏จ Automatic calculation of external parameters is possible.
Pitch angle : -1.89ยฐ
20. Direction of ground and object
20
Pitch angle : -1.89ยฐโ Stixel area based SNV
Direction of ground vector Direction of object vector
y
x
z
y
z
x
21. Super Surface Normal vector
โ SSNV Selection method
21
โข Although the resolution of the Cartesian coordinate reference is 0.1, the range of resolution varies by
cosโ1
๐ฅ. Therefore, the calculation of the histogram itself will have a significant effect on the reliability.
โข Therefore, the histogram is calculated by changing the x-axis in units of ๐.
๐
y
-2.5 0 2.5 5-5
0
1
Interval : 0.1ยฐ
(x, y, z) Cartesian coordinate ๏ pitch, yaw, roll angle
๐๐๐๐๐ ยฐ = ๐๐ยฐ โ ๐๐จ๐ฌโ๐
๐
๐ ๐ + ๐ ๐
22. Issue of vector interval
โ How to set the interval of the vectors?
โช The optimal solution from the trade-off between execution time and accuracy
โช Super surface normal vector confidence
22
Interval : 5 pixel
Setting standard?
23. Issue of vector interval
โ How to set the interval of the vectors? โ Processing time
23
10.71
6.53
5.36
4.48
4.12
4.01
6.83
1.7
0.97
0.71
0.52
0.41
0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19
1
2
3
4
5
6
Time(ms)
Interval(pixel)
1 2 3 4 5 6
SNV cal time 10.71 6.53 5.36 4.48 4.12 4.01
Super-SNV time 6.83 1.7 0.97 0.71 0.52 0.41
Suface normal processing time
SNV cal time Super-SNV time
โข Compute the surface vector for the entire image while maintaining real time.
โข Since the surface vector calculation module is a sub-module of the entire module, it can not be costly.
โข It is difficult to use less than three pixels because it is available only within a vector interval of 6.6ms which
is 10% of real time.
โข Will it be possible to guarantee reliability in intervals of 3 pixels or more?
Real time
boundary
6.6ms
24. Issue of vector interval
โ How to set the interval of the vectors? โ Accuracy
โช Histogram mode
24
โข The range of ยฑ ฮฑ is defined as Inlier based on the bin having the maximum value of the histogram. The
range should be 95% of the total number. Find the mean and standard deviation of the Inlier vectors.
โข The mean is -1.94ยฐ, The variation is (64.06?).
mode: -1.3ยฐ Inlier: 95% range
based on mode
Inlier mean:
-1.94ยฐ
25. Issue of vector interval
โ How to set the interval of the vectors? โ Accuracy
โช Histogram Distribution
25
โข Due to the physical phenomenon of the camera, it is not possible to extract the opposite parallax and vector
on the curved surface.
๏จ Therefore, a skewness occurs in the distribution and the entire distribution is biased in one direction
โข The vector of the ideal ground rather than the camera observation is expected to follow the normal
distribution, but the specimen is distorted due to camera observation.
โข In this case, the representative value of the distribution is known as the average < mean value < mode, and
the median value is known to be located near the average at the point where the interval between the mean
and the mode is divided into three equal parts.
๐ ๐
Optical axis
๐ฝ
V-FOV
Ground
Camera
26. Issue of vector interval
โ How to set the interval of the vectors? โ Accuracy
โช Advanced mean-shift
26
1. Use the initial value of the mode to find the mean value of the inliers within the surrounding ๐.
In this case, ๐ is determined as a range including 50% of the total number. Ex) ๋ณธ ์์ ์์ ์ฝ ยฑ3.5ยฐ
2. Perform STEP 1. again based on the average value.
3. Repeat STEP 1. and 2. until the average converges to 0.01ยฐ or less.
Init value: -1.3ยฐ Inlier: 50% range
based on center
r
First step:-1.13ยฐ
27. Issue of vector interval
โ How to set the interval of the vectors? โ Accuracy
โช Advanced mean-shift
27
โข Confidence measure: entropy
๐ป ๐ = ๐ธ ๐ผ ๐ = เท
1
๐พ
๐ ๐ = ๐ ln(
1
๐ ๐ = ๐
) = โ เท
1
๐พ
๐ ๐ = ๐ ln ๐ ๐ = ๐
Init value: -1.3ยฐ Inlier: 50% range
based on center
r
First step:-1.13ยฐ
๐ป ๐๐๐๐ โฅ ๐ป ๐๐๐๐ > ๐ป ๐๐๐๐๐ โ๐๐๐ก
28. Experiment Result
โ Surface Normal Result
28
Point cloud
Pitch histogram Ground dir
Normal vector
Disparity
Left image
32. Experiment result
โ Result on KITTI Dataset
32
Stixel only Stixel with SNV
Number
of object
9873 9873
True positive 9579 9562
False positive 1805 396
False negative 294 311
Precision 0.841 0.960
Recall 0.970 0.968
๐น1 measure 0.901 0.964
True Positive
False Negative
False Positive
Removed
SNV
True Positive
True Positive
False Positive
False Positive
STIXEL
Aver time: 24ms
33. Discussion about Experiment result
โ Discussion on KITTI Dataset
33
Stixel only Stixel with SNV
Number
of object
9873 9873
True positive 9579 9562
False positive 1805 396
False negative 294 311
Precision 0.841 0.960
Recall 0.970 0.968
๐น1 measure 0.901 0.964
True Positive
False Negative
False Positive
Removed
SNV
True Positive
True Positive
False Positive
False Positive
STIXEL
It should not be
removed, but it
was removed
because there is
a lot of ground
vector
It should be removed there is a lot of ground vector
It should be removed, but it was not removed
because there is a lot of object vector
36. Conclusion
36
โHypothesis ROI validation
โข It use โSurface normalโ to find the difference direction between other object.
โข Surface normal can be computed by Global or Local method.
โข This method depends on only two inputs that are โDisparity mapโ and โBounding boxโ.
โข So It can be utilized to any 3D recognition system for their result validation.
โข This method appears to solve the disparity error on reflective region.
โข Also, In global method, Surface normal map can be used to recognition module.
โFuture work
โข I will develop a 3D ROI for ADAS based on collision risk analysis.
pixel
distance 5m