SlideShare a Scribd company logo
1 of 27
Download to read offline
이명규A Versatile Learning-based 3D Temporal Tracker: Scalable, Robust, Online (1/29) 이명규
확장성, 견고성, 온라인
세 가지 장점을 갖춘 다용도
학습 기반 3D 트래커
A Versatile Learning based
3D Temporal Tracker -
Scalable, Robust, Online
이명규A Versatile Learning-based 3D Temporal Tracker: Scalable, Robust, Online (2/29)
AGENDA
01
02
03
04
05
06
Introduction
Contributions
Main Idea
Evaluation
Conclusions & Limitations
Additional Slides
이명규A Versatile Learning-based 3D Temporal Tracker: Scalable, Robust, Online (3/29)
Introduction
Part 01
이명규A Versatile Learning-based 3D Temporal Tracker: Scalable, Robust, Online (4/29)
↳
• Published in 2015 IEEE International Conference on
Computer Vision
• Publisher: IEEE (Electronic ISSN: 2380-7504)
• Keyword (INSPEC) : target tracking, computational complexity,
learning (artificial intelligence), object tracking, pose estimation
• Authors : David Joseph Tan, Federico Tombari,
Slobodan Ilic, Nassir Navab
Introduction
Paper Info
Part 01
이명규A Versatile Learning-based 3D Temporal Tracker: Scalable, Robust, Online (5/29)
Contributions
Part 02
이명규A Versatile Learning-based 3D Temporal Tracker: Scalable, Robust, Online (6/29)
↳
Contributions
Related Works
Part 02
• Using solely depth images
• Energy minimization(ICP)[3, 5]
• learning-based algorithm[23]
• Using RGB-D data
• hand-held object tracking[10]
• particle filter approaches[6, 13]
• Using level-set optimization[17]
이명규A Versatile Learning-based 3D Temporal Tracker: Scalable, Robust, Online (7/29)
↳
Contributions
Contributions
Part 02
• Novel occlusion handling strategy
• Increases the overall robustness
• Use only one depth image to create the entire learning
dataset (‘3D Online Learning’)
• Low Tracking time, Memory consumption, and computational cost.
• Scalable to track a hundred objects in real-time
• Less learning time than previous studies
이명규A Versatile Learning-based 3D Temporal Tracker: Scalable, Robust, Online (8/29)
Main Idea
Part 03
A Versatile Learning-based 3D Temporal Tracker: Scalable, Robust, Online (9/29) 이명규
↳ Random Forest (1/2)
• One of the ensemble techniques that improves
learning performance.
• Derive the results through voting among
multiple decision trees.
• Configure the forest by bootstrapping the data.
• Variables are used randomly without giving preference.
Main IdeaPart 03
A Versatile Learning-based 3D Temporal Tracker: Scalable, Robust, Online (10/29) 이명규
↳ Random Forest (2/2)
• Advantages
• Voting through multiple trees prevent for overfitting.
• Both classification and regression problems can be applied.
• Disadvantages
• Learning speed is slow and prediction speed is slow in real-time.
• Do not make predictions beyond the range of learning data.
Main IdeaPart 03
A Versatile Learning-based 3D Temporal Tracker: Scalable, Robust, Online (11/29) 이명규
↳ Object temporal tracking (overall idea)
• The objective is solving the registration problem between the
‘3D points on the object’ and the ‘3D points’.
• Predicting the 𝐓𝐓𝒕𝒕 by taking the individual values of 𝜖𝜖𝑗𝑗
𝑣𝑣
.
• Transform X𝑗𝑗 to �T𝑡𝑡= ∏𝑖𝑖=0
𝑡𝑡
T𝑖𝑖.
Main IdeaPart 03
Indivisual displacement
𝝐𝝐𝒋𝒋
𝒗𝒗 �𝑻𝑻𝐭𝐭−𝟏𝟏; 𝑫𝑫𝒕𝒕
Object Transformation
from �𝑻𝑻𝐭𝐭−𝟏𝟏
Current Frame 𝑫𝑫𝒕𝒕 at time 𝒕𝒕
Learned forest
with 6𝒏𝒏𝒗𝒗 trees
Predicted
transform
parameters of 𝑻𝑻𝒕𝒕
A Versatile Learning-based 3D Temporal Tracker: Scalable, Robust, Online (12/29) 이명규
↳
1. Among the pixels 𝒙𝒙𝒊𝒊 𝒋𝒋=𝟏𝟏
𝒏𝒏𝒋𝒋
from 𝐃𝐃𝒗𝒗 that are on the object, 𝒏𝒏𝒋𝒋 points are selected,
back-projected and transformed to the object coordinate system.
• Transformed set of points X𝑣𝑣 = 𝑥𝑥𝑖𝑖 𝑗𝑗=1
𝑛𝑛𝑗𝑗
are used to compute the displacements.
Learning from one viewpoint – Dataset
Main IdeaPart 03
A Versatile Learning-based 3D Temporal Tracker: Scalable, Robust, Online (13/29) 이명규
↳
2. 𝛕𝛕𝐫𝐫 are randomly parametrized to compose 𝐓𝐓𝐫𝐫 and formulate �𝐓𝐓𝐫𝐫.
• Convert 𝑿𝑿𝒋𝒋 to �𝑻𝑻𝒓𝒓 and calculate displacement vector 𝝐𝝐𝒓𝒓
𝒗𝒗
= 𝝐𝝐𝒋𝒋
𝒗𝒗 �𝑻𝑻𝒓𝒓; 𝑫𝑫𝒗𝒗 𝒋𝒋=𝟏𝟏
𝒏𝒏𝒋𝒋
.
3. Construct a learning data set 𝓢𝓢 = 𝝐𝝐𝒓𝒓
𝒗𝒗
, 𝝉𝝉𝒓𝒓 𝒓𝒓=𝟏𝟏
𝒏𝒏𝒓𝒓
by accumulating
𝝐𝝐𝒓𝒓
𝒗𝒗
and 𝝉𝝉𝒓𝒓 with random parameter 𝒏𝒏𝒓𝒓.
Learning from one viewpoint - Dataset
Main IdeaPart 03
A Versatile Learning-based 3D Temporal Tracker: Scalable, Robust, Online (14/29) 이명규
↳
4. The objective is to split using ϵ while optimizing a parameter in τ
to make the values more coherent
• ϵ < Threshold → 𝒮𝒮𝑙𝑙, ϵ > Threshold → 𝒮𝒮𝑟𝑟
5. Testing best split using Information Gain function.
• 𝐺𝐺 = ϵ 𝒮𝒮𝑁𝑁 − ∑𝑖𝑖∈ 𝑙𝑙,𝑟𝑟
𝒮𝒮𝑖𝑖
𝒮𝒮 𝑁𝑁
𝜖𝜖 𝒮𝒮𝑖𝑖 , highest information gain gives the best split
Learning from one viewpoint – Dividing Tree
Main IdeaPart 03
A Versatile Learning-based 3D Temporal Tracker: Scalable, Robust, Online (15/29) 이명규
↳
6. Tree stops growing
• Tree stops growing the size of the inherited learning dataset is too small or
• Standard deviation of the parameter is less than a threshold.
• The node set to be a leaf and stores the mean and standard deviation of the
parameter.
7. Iteration with 𝑛𝑛𝑣𝑣 views of the object
Learning from one viewpoint – Dividing Tree
Main IdeaPart 03
이명규A Versatile Learning-based 3D Temporal Tracker: Scalable, Robust, Online (16/29)
Evaluation
Part 04
A Versatile Learning-based 3D Temporal Tracker: Scalable, Robust, Online (17/29) 이명규
↳
• Using three benchmark datasets [6, 11, 23] to evaluate the
robustness of algorithm
1. Using data set [11] to compare ‘CT[23]’ and ‘Section 3’ of this paper.
2. Comparing the results of ‘this paper’ with
‘RGB-D filter approach [6, 13, 20]’ with data set [6].
3. Using the benchmark data set of CT[23] to compare the
robustness of using ‘depth image only’.
Robustness
EvaluationPart 04
A Versatile Learning-based 3D Temporal Tracker: Scalable, Robust, Online (18/29) 이명규
↳
• Optimal parameters for learning include 642 camera views labeled as
2500 pairs of samples
• It shows the lowest error value when 10 iteration rounds.
Robustness – Optimum Parameters
EvaluationPart 04
A Versatile Learning-based 3D Temporal Tracker: Scalable, Robust, Online (19/29) 이명규
↳
• Evaluation via publicly available Synthetic
Dataset [6]
• Each object consists of 1000 RGB-D images with
ground truth results along with the model.
• 0.01mm better translation and 1.01 better rotation,
but notice that this study uses only depth image.
• Only use the object's model without any
prior knowledge of the environment.
Robustness – Synthetic Dataset
EvaluationPart 04
A Versatile Learning-based 3D Temporal Tracker: Scalable, Robust, Online (20/29) 이명규
↳
• Evaluate robustness using only actual depth image
• Each sequence consists of 400 RGB-D images and a
ground truth pose tracked using a marker board.
• ICP is trapped in a Local Minimum in (c), but CT [23]
and the results of this paper track the cat well.
• CT [23] fails to track in severe occlusion as in (e),
but this paper succeeds in tracking successfully.
Robustness – Real Dataset
EvaluationPart 04
A Versatile Learning-based 3D Temporal Tracker: Scalable, Robust, Online (21/29) 이명규
↳
• Sec 3.1 runs at 1.5ms per frame on a single core Intel I7 Core CPU.
• The memory increases linearly with the number of camera views and the
size of the training data set.
• CT [23] requires 821.3 MB per forest, but only 7.4 MB is needed in this paper.
• Tracking 108 objects shows 33.7ms(30fps) on an 8-core CPU with
799MB of memory usage.
Computational Cost & Learning Time (1/2)
EvaluationPart 04
A Versatile Learning-based 3D Temporal Tracker: Scalable, Robust, Online (22/29) 이명규
↳
• The learning time is linearly related to the number of camera views and
the size of the training data set.
• Sec 3.1 with Optimum parameters takes 31.8 seconds. (run on 8-core CPU)
• Learning on 2500 pairs of samples and labels with 642 camera views.
• CT [23] takes 12.3 hours.
Computational Cost & Learning Time (1/2)
EvaluationPart 04
A Versatile Learning-based 3D Temporal Tracker: Scalable, Robust, Online (23/29) 이명규
↳
• Evaluate online learning through data set in [6].
• The initial frame starts learning through the ground truth transformation.
• It takes 1.3 seconds to learn with 50 trees per parameter.
• Subsequent frames likewise continue to learn one tree per parameter,
taking 25.6 ms per frame.
• With 8 Core CPU, both learning and tracking take 26.8ms
per frame.
• Tracking is possible even the object's model is not at hand
(e.g. head pose estimation in Fig. 1(c).)
Learning Time – Online Learning
EvaluationPart 04
이명규A Versatile Learning-based 3D Temporal Tracker: Scalable, Robust, Online (24/29)
Conclusions & Limitations
Part 05
이명규A Versatile Learning-based 3D Temporal Tracker: Scalable, Robust, Online (25/29)
↳
• Real-time, scalable and robust 3D tracking algorithm
• Can be employed both in model-based as well as in
online 3D tracking.
• Flexible and versatile so to adapt to a variety of
3D tracking applications.
Conclusions
Conclusions & LimitationsPart 05
이명규A Versatile Learning-based 3D Temporal Tracker: Scalable, Robust, Online (26/29)
↳
• Highly symmetric objects loses some degrees of freedom.
• Rotation around its axis of symmetry is ambiguous
when viewed from depth image
• Fails to estimate the full 3D pose with six degrees of freedom
• Tree is not learned enough if there is a problem with the initial
frame.
• Large holes or occlusions in the initial frames create problems.
Limitations
Conclusions & LimitationsPart 05
이명규A Versatile Learning-based 3D Temporal Tracker: Scalable, Robust, Online (27/29)
Thank you for Listening.
Email : brstar96@naver.com (or brstar96@soongsil.ac.kr)
Mobile : +82-10-8234-3179

More Related Content

What's hot

Neural Networks: Least Mean Square (LSM) Algorithm
Neural Networks: Least Mean Square (LSM) AlgorithmNeural Networks: Least Mean Square (LSM) Algorithm
Neural Networks: Least Mean Square (LSM) AlgorithmMostafa G. M. Mostafa
 
Scalable Learning in Computer Vision
Scalable Learning in Computer VisionScalable Learning in Computer Vision
Scalable Learning in Computer Visionbutest
 
Deep Learning for Computer Vision: A comparision between Convolutional Neural...
Deep Learning for Computer Vision: A comparision between Convolutional Neural...Deep Learning for Computer Vision: A comparision between Convolutional Neural...
Deep Learning for Computer Vision: A comparision between Convolutional Neural...Vincenzo Lomonaco
 
An Efficient Block Matching Algorithm Using Logical Image
An Efficient Block Matching Algorithm Using Logical ImageAn Efficient Block Matching Algorithm Using Logical Image
An Efficient Block Matching Algorithm Using Logical ImageIJERA Editor
 
Clustering introduction
Clustering introductionClustering introduction
Clustering introductionYan Xu
 
SkyStitch: a Cooperative Multi-UAV-based Real-time Video Surveillance System ...
SkyStitch: a Cooperative Multi-UAV-based Real-time Video Surveillance System ...SkyStitch: a Cooperative Multi-UAV-based Real-time Video Surveillance System ...
SkyStitch: a Cooperative Multi-UAV-based Real-time Video Surveillance System ...Kitsukawa Yuki
 
Scene classification using Convolutional Neural Networks - Jayani Withanawasam
Scene classification using Convolutional Neural Networks - Jayani WithanawasamScene classification using Convolutional Neural Networks - Jayani Withanawasam
Scene classification using Convolutional Neural Networks - Jayani WithanawasamWithTheBest
 
An enhanced fireworks algorithm to generate prime key for multiple users in f...
An enhanced fireworks algorithm to generate prime key for multiple users in f...An enhanced fireworks algorithm to generate prime key for multiple users in f...
An enhanced fireworks algorithm to generate prime key for multiple users in f...journalBEEI
 
Object Detection using Deep Neural Networks
Object Detection using Deep Neural NetworksObject Detection using Deep Neural Networks
Object Detection using Deep Neural NetworksUsman Qayyum
 
MSShin-Machine_Learning_Algorithm_in_Period_Estimation.ppt
MSShin-Machine_Learning_Algorithm_in_Period_Estimation.pptMSShin-Machine_Learning_Algorithm_in_Period_Estimation.ppt
MSShin-Machine_Learning_Algorithm_in_Period_Estimation.pptbutest
 
A Literature Survey: Neural Networks for object detection
A Literature Survey: Neural Networks for object detectionA Literature Survey: Neural Networks for object detection
A Literature Survey: Neural Networks for object detectionvivatechijri
 
Real Time Human Posture Detection with Multiple Depth Sensors
Real Time Human Posture Detection with Multiple Depth SensorsReal Time Human Posture Detection with Multiple Depth Sensors
Real Time Human Posture Detection with Multiple Depth SensorsWassim Filali
 
Online video object segmentation via convolutional trident network
Online video object segmentation via convolutional trident networkOnline video object segmentation via convolutional trident network
Online video object segmentation via convolutional trident networkNAVER Engineering
 
Deep LearningフレームワークChainerと最近の技術動向
Deep LearningフレームワークChainerと最近の技術動向Deep LearningフレームワークChainerと最近の技術動向
Deep LearningフレームワークChainerと最近の技術動向Shunta Saito
 
Robust foreground modelling to segment and detect multiple moving objects in ...
Robust foreground modelling to segment and detect multiple moving objects in ...Robust foreground modelling to segment and detect multiple moving objects in ...
Robust foreground modelling to segment and detect multiple moving objects in ...IJECEIAES
 
Moving object detection in video surveillance
Moving object detection in video surveillanceMoving object detection in video surveillance
Moving object detection in video surveillanceAshfaqul Haque John
 

What's hot (20)

Neural Networks: Least Mean Square (LSM) Algorithm
Neural Networks: Least Mean Square (LSM) AlgorithmNeural Networks: Least Mean Square (LSM) Algorithm
Neural Networks: Least Mean Square (LSM) Algorithm
 
Scalable Learning in Computer Vision
Scalable Learning in Computer VisionScalable Learning in Computer Vision
Scalable Learning in Computer Vision
 
Deep Learning for Computer Vision: A comparision between Convolutional Neural...
Deep Learning for Computer Vision: A comparision between Convolutional Neural...Deep Learning for Computer Vision: A comparision between Convolutional Neural...
Deep Learning for Computer Vision: A comparision between Convolutional Neural...
 
An Efficient Block Matching Algorithm Using Logical Image
An Efficient Block Matching Algorithm Using Logical ImageAn Efficient Block Matching Algorithm Using Logical Image
An Efficient Block Matching Algorithm Using Logical Image
 
Clustering introduction
Clustering introductionClustering introduction
Clustering introduction
 
SkyStitch: a Cooperative Multi-UAV-based Real-time Video Surveillance System ...
SkyStitch: a Cooperative Multi-UAV-based Real-time Video Surveillance System ...SkyStitch: a Cooperative Multi-UAV-based Real-time Video Surveillance System ...
SkyStitch: a Cooperative Multi-UAV-based Real-time Video Surveillance System ...
 
Bol.com
Bol.comBol.com
Bol.com
 
Scene classification using Convolutional Neural Networks - Jayani Withanawasam
Scene classification using Convolutional Neural Networks - Jayani WithanawasamScene classification using Convolutional Neural Networks - Jayani Withanawasam
Scene classification using Convolutional Neural Networks - Jayani Withanawasam
 
An enhanced fireworks algorithm to generate prime key for multiple users in f...
An enhanced fireworks algorithm to generate prime key for multiple users in f...An enhanced fireworks algorithm to generate prime key for multiple users in f...
An enhanced fireworks algorithm to generate prime key for multiple users in f...
 
Gi3511181122
Gi3511181122Gi3511181122
Gi3511181122
 
Object Detection using Deep Neural Networks
Object Detection using Deep Neural NetworksObject Detection using Deep Neural Networks
Object Detection using Deep Neural Networks
 
MSShin-Machine_Learning_Algorithm_in_Period_Estimation.ppt
MSShin-Machine_Learning_Algorithm_in_Period_Estimation.pptMSShin-Machine_Learning_Algorithm_in_Period_Estimation.ppt
MSShin-Machine_Learning_Algorithm_in_Period_Estimation.ppt
 
A Literature Survey: Neural Networks for object detection
A Literature Survey: Neural Networks for object detectionA Literature Survey: Neural Networks for object detection
A Literature Survey: Neural Networks for object detection
 
Real Time Human Posture Detection with Multiple Depth Sensors
Real Time Human Posture Detection with Multiple Depth SensorsReal Time Human Posture Detection with Multiple Depth Sensors
Real Time Human Posture Detection with Multiple Depth Sensors
 
Online video object segmentation via convolutional trident network
Online video object segmentation via convolutional trident networkOnline video object segmentation via convolutional trident network
Online video object segmentation via convolutional trident network
 
Moving object detection
Moving object detectionMoving object detection
Moving object detection
 
Deep LearningフレームワークChainerと最近の技術動向
Deep LearningフレームワークChainerと最近の技術動向Deep LearningフレームワークChainerと最近の技術動向
Deep LearningフレームワークChainerと最近の技術動向
 
Robust foreground modelling to segment and detect multiple moving objects in ...
Robust foreground modelling to segment and detect multiple moving objects in ...Robust foreground modelling to segment and detect multiple moving objects in ...
Robust foreground modelling to segment and detect multiple moving objects in ...
 
Presentation of Visual Tracking
Presentation of Visual TrackingPresentation of Visual Tracking
Presentation of Visual Tracking
 
Moving object detection in video surveillance
Moving object detection in video surveillanceMoving object detection in video surveillance
Moving object detection in video surveillance
 

Similar to (Paper Review)A versatile learning based 3D temporal tracker - scalable, robust, online

Deep learning fundamental and Research project on IBM POWER9 system from NUS
Deep learning fundamental and Research project on IBM POWER9 system from NUSDeep learning fundamental and Research project on IBM POWER9 system from NUS
Deep learning fundamental and Research project on IBM POWER9 system from NUSGanesan Narayanasamy
 
Learning to Perceive the 3D World
Learning to Perceive the 3D WorldLearning to Perceive the 3D World
Learning to Perceive the 3D WorldNAVER Engineering
 
Real Time Object Dectection using machine learning
Real Time Object Dectection using machine learningReal Time Object Dectection using machine learning
Real Time Object Dectection using machine learningpratik pratyay
 
PhD Thesis Proposal
PhD Thesis Proposal PhD Thesis Proposal
PhD Thesis Proposal Ziqiang Feng
 
Garuda Robotics x DataScience SG Meetup (Sep 2015)
Garuda Robotics x DataScience SG Meetup (Sep 2015)Garuda Robotics x DataScience SG Meetup (Sep 2015)
Garuda Robotics x DataScience SG Meetup (Sep 2015)Eugene Yan Ziyou
 
[RSS2023] Local Object Crop Collision Network for Efficient Simulation
[RSS2023] Local Object Crop Collision Network for Efficient Simulation[RSS2023] Local Object Crop Collision Network for Efficient Simulation
[RSS2023] Local Object Crop Collision Network for Efficient SimulationDongwonSon1
 
Cvpr 2018 papers review (efficient computing)
Cvpr 2018 papers review (efficient computing)Cvpr 2018 papers review (efficient computing)
Cvpr 2018 papers review (efficient computing)DonghyunKang12
 
181123 asynchronous method for deep reinforcement learning seunghyeok back
181123 asynchronous method for deep reinforcement learning seunghyeok back181123 asynchronous method for deep reinforcement learning seunghyeok back
181123 asynchronous method for deep reinforcement learning seunghyeok backSeungHyeok Baek
 
Fast object re detection and localization in video for spatio-temporal fragme...
Fast object re detection and localization in video for spatio-temporal fragme...Fast object re detection and localization in video for spatio-temporal fragme...
Fast object re detection and localization in video for spatio-temporal fragme...MediaMixerCommunity
 
[AAAI2018] Multispectral Transfer Network: Unsupervised Depth Estimation for ...
[AAAI2018] Multispectral Transfer Network: Unsupervised Depth Estimation for ...[AAAI2018] Multispectral Transfer Network: Unsupervised Depth Estimation for ...
[AAAI2018] Multispectral Transfer Network: Unsupervised Depth Estimation for ...KAIST
 
[AAAI2018] Multispectral Transfer Network: Unsupervised Depth Estimation for ...
[AAAI2018] Multispectral Transfer Network: Unsupervised Depth Estimation for ...[AAAI2018] Multispectral Transfer Network: Unsupervised Depth Estimation for ...
[AAAI2018] Multispectral Transfer Network: Unsupervised Depth Estimation for ...KAIST
 
"Energy-efficient Hardware for Embedded Vision and Deep Convolutional Neural ...
"Energy-efficient Hardware for Embedded Vision and Deep Convolutional Neural ..."Energy-efficient Hardware for Embedded Vision and Deep Convolutional Neural ...
"Energy-efficient Hardware for Embedded Vision and Deep Convolutional Neural ...Edge AI and Vision Alliance
 
Using SigOpt to Tune Deep Learning Models with Nervana Cloud
Using SigOpt to Tune Deep Learning Models with Nervana CloudUsing SigOpt to Tune Deep Learning Models with Nervana Cloud
Using SigOpt to Tune Deep Learning Models with Nervana CloudSigOpt
 
Unsupervised/Self-supervvised visual object tracking
Unsupervised/Self-supervvised visual object trackingUnsupervised/Self-supervvised visual object tracking
Unsupervised/Self-supervvised visual object trackingYu Huang
 
Rapid object detection using boosted cascade of simple features
Rapid object detection using boosted  cascade of simple featuresRapid object detection using boosted  cascade of simple features
Rapid object detection using boosted cascade of simple featuresHirantha Pradeep
 
Improving computer vision models at scale presentation
Improving computer vision models at scale presentationImproving computer vision models at scale presentation
Improving computer vision models at scale presentationDr. Mirko Kämpf
 
Improving computer vision models at scale presentation
Improving computer vision models at scale presentationImproving computer vision models at scale presentation
Improving computer vision models at scale presentationJan Kunigk
 
Extend Your Journey: Introducing Signal Strength into Location-based Applicat...
Extend Your Journey: Introducing Signal Strength into Location-based Applicat...Extend Your Journey: Introducing Signal Strength into Location-based Applicat...
Extend Your Journey: Introducing Signal Strength into Location-based Applicat...Chih-Chuan Cheng
 

Similar to (Paper Review)A versatile learning based 3D temporal tracker - scalable, robust, online (20)

Deep learning fundamental and Research project on IBM POWER9 system from NUS
Deep learning fundamental and Research project on IBM POWER9 system from NUSDeep learning fundamental and Research project on IBM POWER9 system from NUS
Deep learning fundamental and Research project on IBM POWER9 system from NUS
 
slide-171212080528.pptx
slide-171212080528.pptxslide-171212080528.pptx
slide-171212080528.pptx
 
Learning to Perceive the 3D World
Learning to Perceive the 3D WorldLearning to Perceive the 3D World
Learning to Perceive the 3D World
 
Real Time Object Dectection using machine learning
Real Time Object Dectection using machine learningReal Time Object Dectection using machine learning
Real Time Object Dectection using machine learning
 
PhD Thesis Proposal
PhD Thesis Proposal PhD Thesis Proposal
PhD Thesis Proposal
 
Garuda Robotics x DataScience SG Meetup (Sep 2015)
Garuda Robotics x DataScience SG Meetup (Sep 2015)Garuda Robotics x DataScience SG Meetup (Sep 2015)
Garuda Robotics x DataScience SG Meetup (Sep 2015)
 
[RSS2023] Local Object Crop Collision Network for Efficient Simulation
[RSS2023] Local Object Crop Collision Network for Efficient Simulation[RSS2023] Local Object Crop Collision Network for Efficient Simulation
[RSS2023] Local Object Crop Collision Network for Efficient Simulation
 
Cvpr 2018 papers review (efficient computing)
Cvpr 2018 papers review (efficient computing)Cvpr 2018 papers review (efficient computing)
Cvpr 2018 papers review (efficient computing)
 
181123 asynchronous method for deep reinforcement learning seunghyeok back
181123 asynchronous method for deep reinforcement learning seunghyeok back181123 asynchronous method for deep reinforcement learning seunghyeok back
181123 asynchronous method for deep reinforcement learning seunghyeok back
 
Fast object re detection and localization in video for spatio-temporal fragme...
Fast object re detection and localization in video for spatio-temporal fragme...Fast object re detection and localization in video for spatio-temporal fragme...
Fast object re detection and localization in video for spatio-temporal fragme...
 
[AAAI2018] Multispectral Transfer Network: Unsupervised Depth Estimation for ...
[AAAI2018] Multispectral Transfer Network: Unsupervised Depth Estimation for ...[AAAI2018] Multispectral Transfer Network: Unsupervised Depth Estimation for ...
[AAAI2018] Multispectral Transfer Network: Unsupervised Depth Estimation for ...
 
[AAAI2018] Multispectral Transfer Network: Unsupervised Depth Estimation for ...
[AAAI2018] Multispectral Transfer Network: Unsupervised Depth Estimation for ...[AAAI2018] Multispectral Transfer Network: Unsupervised Depth Estimation for ...
[AAAI2018] Multispectral Transfer Network: Unsupervised Depth Estimation for ...
 
"Energy-efficient Hardware for Embedded Vision and Deep Convolutional Neural ...
"Energy-efficient Hardware for Embedded Vision and Deep Convolutional Neural ..."Energy-efficient Hardware for Embedded Vision and Deep Convolutional Neural ...
"Energy-efficient Hardware for Embedded Vision and Deep Convolutional Neural ...
 
Using SigOpt to Tune Deep Learning Models with Nervana Cloud
Using SigOpt to Tune Deep Learning Models with Nervana CloudUsing SigOpt to Tune Deep Learning Models with Nervana Cloud
Using SigOpt to Tune Deep Learning Models with Nervana Cloud
 
Unsupervised/Self-supervvised visual object tracking
Unsupervised/Self-supervvised visual object trackingUnsupervised/Self-supervvised visual object tracking
Unsupervised/Self-supervvised visual object tracking
 
Icbai 2018 ver_1
Icbai 2018 ver_1Icbai 2018 ver_1
Icbai 2018 ver_1
 
Rapid object detection using boosted cascade of simple features
Rapid object detection using boosted  cascade of simple featuresRapid object detection using boosted  cascade of simple features
Rapid object detection using boosted cascade of simple features
 
Improving computer vision models at scale presentation
Improving computer vision models at scale presentationImproving computer vision models at scale presentation
Improving computer vision models at scale presentation
 
Improving computer vision models at scale presentation
Improving computer vision models at scale presentationImproving computer vision models at scale presentation
Improving computer vision models at scale presentation
 
Extend Your Journey: Introducing Signal Strength into Location-based Applicat...
Extend Your Journey: Introducing Signal Strength into Location-based Applicat...Extend Your Journey: Introducing Signal Strength into Location-based Applicat...
Extend Your Journey: Introducing Signal Strength into Location-based Applicat...
 

More from MYEONGGYU LEE

(Paper Review) Abnormal Event Detection in Videos using Generative Adversaria...
(Paper Review) Abnormal Event Detection in Videos using Generative Adversaria...(Paper Review) Abnormal Event Detection in Videos using Generative Adversaria...
(Paper Review) Abnormal Event Detection in Videos using Generative Adversaria...MYEONGGYU LEE
 
Survey of HDR & Tone Mapping Task
Survey of HDR & Tone Mapping TaskSurvey of HDR & Tone Mapping Task
Survey of HDR & Tone Mapping TaskMYEONGGYU LEE
 
Survey of Super Resolution Task (SISR Only)
Survey of Super Resolution Task (SISR Only)Survey of Super Resolution Task (SISR Only)
Survey of Super Resolution Task (SISR Only)MYEONGGYU LEE
 
Simple Review of Single Image Super Resolution Task
Simple Review of Single Image Super Resolution TaskSimple Review of Single Image Super Resolution Task
Simple Review of Single Image Super Resolution TaskMYEONGGYU LEE
 
(Paper Review)Few-Shot Adversarial Learning of Realistic Neural Talking Head ...
(Paper Review)Few-Shot Adversarial Learning of Realistic Neural Talking Head ...(Paper Review)Few-Shot Adversarial Learning of Realistic Neural Talking Head ...
(Paper Review)Few-Shot Adversarial Learning of Realistic Neural Talking Head ...MYEONGGYU LEE
 
(Paper Review)U-GAT-IT: unsupervised generative attentional networks with ada...
(Paper Review)U-GAT-IT: unsupervised generative attentional networks with ada...(Paper Review)U-GAT-IT: unsupervised generative attentional networks with ada...
(Paper Review)U-GAT-IT: unsupervised generative attentional networks with ada...MYEONGGYU LEE
 
(Book Summary) Classification and ensemble(book review)
(Book Summary) Classification and ensemble(book review)(Book Summary) Classification and ensemble(book review)
(Book Summary) Classification and ensemble(book review)MYEONGGYU LEE
 
(Paper Review) Reconstruction of Monte Carlo Image Sequences using a Recurren...
(Paper Review) Reconstruction of Monte Carlo Image Sequences using a Recurren...(Paper Review) Reconstruction of Monte Carlo Image Sequences using a Recurren...
(Paper Review) Reconstruction of Monte Carlo Image Sequences using a Recurren...MYEONGGYU LEE
 
(Paper Review)Neural 3D mesh renderer
(Paper Review)Neural 3D mesh renderer(Paper Review)Neural 3D mesh renderer
(Paper Review)Neural 3D mesh rendererMYEONGGYU LEE
 
(Paper Review)3D shape reconstruction from sketches via multi view convolutio...
(Paper Review)3D shape reconstruction from sketches via multi view convolutio...(Paper Review)3D shape reconstruction from sketches via multi view convolutio...
(Paper Review)3D shape reconstruction from sketches via multi view convolutio...MYEONGGYU LEE
 
(Paper Review)Image to image translation with conditional adversarial network...
(Paper Review)Image to image translation with conditional adversarial network...(Paper Review)Image to image translation with conditional adversarial network...
(Paper Review)Image to image translation with conditional adversarial network...MYEONGGYU LEE
 
(Book summary) Ensemble method 2018summerml_study
(Book summary) Ensemble method 2018summerml_study(Book summary) Ensemble method 2018summerml_study
(Book summary) Ensemble method 2018summerml_studyMYEONGGYU LEE
 
(Paper Review)Towards foveated rendering for gaze tracked virtual reality
(Paper Review)Towards foveated rendering for gaze tracked virtual reality(Paper Review)Towards foveated rendering for gaze tracked virtual reality
(Paper Review)Towards foveated rendering for gaze tracked virtual realityMYEONGGYU LEE
 
(Paper Review)Geometrically correct projection-based texture mapping onto a d...
(Paper Review)Geometrically correct projection-based texture mapping onto a d...(Paper Review)Geometrically correct projection-based texture mapping onto a d...
(Paper Review)Geometrically correct projection-based texture mapping onto a d...MYEONGGYU LEE
 
(Papers Review)CNN for sentence classification
(Papers Review)CNN for sentence classification(Papers Review)CNN for sentence classification
(Papers Review)CNN for sentence classificationMYEONGGYU LEE
 
(Paper Review)Kernel predicting-convolutional-networks-for-denoising-monte-ca...
(Paper Review)Kernel predicting-convolutional-networks-for-denoising-monte-ca...(Paper Review)Kernel predicting-convolutional-networks-for-denoising-monte-ca...
(Paper Review)Kernel predicting-convolutional-networks-for-denoising-monte-ca...MYEONGGYU LEE
 

More from MYEONGGYU LEE (17)

(Paper Review) Abnormal Event Detection in Videos using Generative Adversaria...
(Paper Review) Abnormal Event Detection in Videos using Generative Adversaria...(Paper Review) Abnormal Event Detection in Videos using Generative Adversaria...
(Paper Review) Abnormal Event Detection in Videos using Generative Adversaria...
 
Survey of HDR & Tone Mapping Task
Survey of HDR & Tone Mapping TaskSurvey of HDR & Tone Mapping Task
Survey of HDR & Tone Mapping Task
 
Survey of Super Resolution Task (SISR Only)
Survey of Super Resolution Task (SISR Only)Survey of Super Resolution Task (SISR Only)
Survey of Super Resolution Task (SISR Only)
 
Simple Review of Single Image Super Resolution Task
Simple Review of Single Image Super Resolution TaskSimple Review of Single Image Super Resolution Task
Simple Review of Single Image Super Resolution Task
 
ICCV 2019 Review
ICCV 2019 ReviewICCV 2019 Review
ICCV 2019 Review
 
(Paper Review)Few-Shot Adversarial Learning of Realistic Neural Talking Head ...
(Paper Review)Few-Shot Adversarial Learning of Realistic Neural Talking Head ...(Paper Review)Few-Shot Adversarial Learning of Realistic Neural Talking Head ...
(Paper Review)Few-Shot Adversarial Learning of Realistic Neural Talking Head ...
 
(Paper Review)U-GAT-IT: unsupervised generative attentional networks with ada...
(Paper Review)U-GAT-IT: unsupervised generative attentional networks with ada...(Paper Review)U-GAT-IT: unsupervised generative attentional networks with ada...
(Paper Review)U-GAT-IT: unsupervised generative attentional networks with ada...
 
(Book Summary) Classification and ensemble(book review)
(Book Summary) Classification and ensemble(book review)(Book Summary) Classification and ensemble(book review)
(Book Summary) Classification and ensemble(book review)
 
(Paper Review) Reconstruction of Monte Carlo Image Sequences using a Recurren...
(Paper Review) Reconstruction of Monte Carlo Image Sequences using a Recurren...(Paper Review) Reconstruction of Monte Carlo Image Sequences using a Recurren...
(Paper Review) Reconstruction of Monte Carlo Image Sequences using a Recurren...
 
(Paper Review)Neural 3D mesh renderer
(Paper Review)Neural 3D mesh renderer(Paper Review)Neural 3D mesh renderer
(Paper Review)Neural 3D mesh renderer
 
(Paper Review)3D shape reconstruction from sketches via multi view convolutio...
(Paper Review)3D shape reconstruction from sketches via multi view convolutio...(Paper Review)3D shape reconstruction from sketches via multi view convolutio...
(Paper Review)3D shape reconstruction from sketches via multi view convolutio...
 
(Paper Review)Image to image translation with conditional adversarial network...
(Paper Review)Image to image translation with conditional adversarial network...(Paper Review)Image to image translation with conditional adversarial network...
(Paper Review)Image to image translation with conditional adversarial network...
 
(Book summary) Ensemble method 2018summerml_study
(Book summary) Ensemble method 2018summerml_study(Book summary) Ensemble method 2018summerml_study
(Book summary) Ensemble method 2018summerml_study
 
(Paper Review)Towards foveated rendering for gaze tracked virtual reality
(Paper Review)Towards foveated rendering for gaze tracked virtual reality(Paper Review)Towards foveated rendering for gaze tracked virtual reality
(Paper Review)Towards foveated rendering for gaze tracked virtual reality
 
(Paper Review)Geometrically correct projection-based texture mapping onto a d...
(Paper Review)Geometrically correct projection-based texture mapping onto a d...(Paper Review)Geometrically correct projection-based texture mapping onto a d...
(Paper Review)Geometrically correct projection-based texture mapping onto a d...
 
(Papers Review)CNN for sentence classification
(Papers Review)CNN for sentence classification(Papers Review)CNN for sentence classification
(Papers Review)CNN for sentence classification
 
(Paper Review)Kernel predicting-convolutional-networks-for-denoising-monte-ca...
(Paper Review)Kernel predicting-convolutional-networks-for-denoising-monte-ca...(Paper Review)Kernel predicting-convolutional-networks-for-denoising-monte-ca...
(Paper Review)Kernel predicting-convolutional-networks-for-denoising-monte-ca...
 

Recently uploaded

[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessPixlogix Infotech
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?Antenna Manufacturer Coco
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 

Recently uploaded (20)

[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 

(Paper Review)A versatile learning based 3D temporal tracker - scalable, robust, online

  • 1. 이명규A Versatile Learning-based 3D Temporal Tracker: Scalable, Robust, Online (1/29) 이명규 확장성, 견고성, 온라인 세 가지 장점을 갖춘 다용도 학습 기반 3D 트래커 A Versatile Learning based 3D Temporal Tracker - Scalable, Robust, Online
  • 2. 이명규A Versatile Learning-based 3D Temporal Tracker: Scalable, Robust, Online (2/29) AGENDA 01 02 03 04 05 06 Introduction Contributions Main Idea Evaluation Conclusions & Limitations Additional Slides
  • 3. 이명규A Versatile Learning-based 3D Temporal Tracker: Scalable, Robust, Online (3/29) Introduction Part 01
  • 4. 이명규A Versatile Learning-based 3D Temporal Tracker: Scalable, Robust, Online (4/29) ↳ • Published in 2015 IEEE International Conference on Computer Vision • Publisher: IEEE (Electronic ISSN: 2380-7504) • Keyword (INSPEC) : target tracking, computational complexity, learning (artificial intelligence), object tracking, pose estimation • Authors : David Joseph Tan, Federico Tombari, Slobodan Ilic, Nassir Navab Introduction Paper Info Part 01
  • 5. 이명규A Versatile Learning-based 3D Temporal Tracker: Scalable, Robust, Online (5/29) Contributions Part 02
  • 6. 이명규A Versatile Learning-based 3D Temporal Tracker: Scalable, Robust, Online (6/29) ↳ Contributions Related Works Part 02 • Using solely depth images • Energy minimization(ICP)[3, 5] • learning-based algorithm[23] • Using RGB-D data • hand-held object tracking[10] • particle filter approaches[6, 13] • Using level-set optimization[17]
  • 7. 이명규A Versatile Learning-based 3D Temporal Tracker: Scalable, Robust, Online (7/29) ↳ Contributions Contributions Part 02 • Novel occlusion handling strategy • Increases the overall robustness • Use only one depth image to create the entire learning dataset (‘3D Online Learning’) • Low Tracking time, Memory consumption, and computational cost. • Scalable to track a hundred objects in real-time • Less learning time than previous studies
  • 8. 이명규A Versatile Learning-based 3D Temporal Tracker: Scalable, Robust, Online (8/29) Main Idea Part 03
  • 9. A Versatile Learning-based 3D Temporal Tracker: Scalable, Robust, Online (9/29) 이명규 ↳ Random Forest (1/2) • One of the ensemble techniques that improves learning performance. • Derive the results through voting among multiple decision trees. • Configure the forest by bootstrapping the data. • Variables are used randomly without giving preference. Main IdeaPart 03
  • 10. A Versatile Learning-based 3D Temporal Tracker: Scalable, Robust, Online (10/29) 이명규 ↳ Random Forest (2/2) • Advantages • Voting through multiple trees prevent for overfitting. • Both classification and regression problems can be applied. • Disadvantages • Learning speed is slow and prediction speed is slow in real-time. • Do not make predictions beyond the range of learning data. Main IdeaPart 03
  • 11. A Versatile Learning-based 3D Temporal Tracker: Scalable, Robust, Online (11/29) 이명규 ↳ Object temporal tracking (overall idea) • The objective is solving the registration problem between the ‘3D points on the object’ and the ‘3D points’. • Predicting the 𝐓𝐓𝒕𝒕 by taking the individual values of 𝜖𝜖𝑗𝑗 𝑣𝑣 . • Transform X𝑗𝑗 to �T𝑡𝑡= ∏𝑖𝑖=0 𝑡𝑡 T𝑖𝑖. Main IdeaPart 03 Indivisual displacement 𝝐𝝐𝒋𝒋 𝒗𝒗 �𝑻𝑻𝐭𝐭−𝟏𝟏; 𝑫𝑫𝒕𝒕 Object Transformation from �𝑻𝑻𝐭𝐭−𝟏𝟏 Current Frame 𝑫𝑫𝒕𝒕 at time 𝒕𝒕 Learned forest with 6𝒏𝒏𝒗𝒗 trees Predicted transform parameters of 𝑻𝑻𝒕𝒕
  • 12. A Versatile Learning-based 3D Temporal Tracker: Scalable, Robust, Online (12/29) 이명규 ↳ 1. Among the pixels 𝒙𝒙𝒊𝒊 𝒋𝒋=𝟏𝟏 𝒏𝒏𝒋𝒋 from 𝐃𝐃𝒗𝒗 that are on the object, 𝒏𝒏𝒋𝒋 points are selected, back-projected and transformed to the object coordinate system. • Transformed set of points X𝑣𝑣 = 𝑥𝑥𝑖𝑖 𝑗𝑗=1 𝑛𝑛𝑗𝑗 are used to compute the displacements. Learning from one viewpoint – Dataset Main IdeaPart 03
  • 13. A Versatile Learning-based 3D Temporal Tracker: Scalable, Robust, Online (13/29) 이명규 ↳ 2. 𝛕𝛕𝐫𝐫 are randomly parametrized to compose 𝐓𝐓𝐫𝐫 and formulate �𝐓𝐓𝐫𝐫. • Convert 𝑿𝑿𝒋𝒋 to �𝑻𝑻𝒓𝒓 and calculate displacement vector 𝝐𝝐𝒓𝒓 𝒗𝒗 = 𝝐𝝐𝒋𝒋 𝒗𝒗 �𝑻𝑻𝒓𝒓; 𝑫𝑫𝒗𝒗 𝒋𝒋=𝟏𝟏 𝒏𝒏𝒋𝒋 . 3. Construct a learning data set 𝓢𝓢 = 𝝐𝝐𝒓𝒓 𝒗𝒗 , 𝝉𝝉𝒓𝒓 𝒓𝒓=𝟏𝟏 𝒏𝒏𝒓𝒓 by accumulating 𝝐𝝐𝒓𝒓 𝒗𝒗 and 𝝉𝝉𝒓𝒓 with random parameter 𝒏𝒏𝒓𝒓. Learning from one viewpoint - Dataset Main IdeaPart 03
  • 14. A Versatile Learning-based 3D Temporal Tracker: Scalable, Robust, Online (14/29) 이명규 ↳ 4. The objective is to split using ϵ while optimizing a parameter in τ to make the values more coherent • ϵ < Threshold → 𝒮𝒮𝑙𝑙, ϵ > Threshold → 𝒮𝒮𝑟𝑟 5. Testing best split using Information Gain function. • 𝐺𝐺 = ϵ 𝒮𝒮𝑁𝑁 − ∑𝑖𝑖∈ 𝑙𝑙,𝑟𝑟 𝒮𝒮𝑖𝑖 𝒮𝒮 𝑁𝑁 𝜖𝜖 𝒮𝒮𝑖𝑖 , highest information gain gives the best split Learning from one viewpoint – Dividing Tree Main IdeaPart 03
  • 15. A Versatile Learning-based 3D Temporal Tracker: Scalable, Robust, Online (15/29) 이명규 ↳ 6. Tree stops growing • Tree stops growing the size of the inherited learning dataset is too small or • Standard deviation of the parameter is less than a threshold. • The node set to be a leaf and stores the mean and standard deviation of the parameter. 7. Iteration with 𝑛𝑛𝑣𝑣 views of the object Learning from one viewpoint – Dividing Tree Main IdeaPart 03
  • 16. 이명규A Versatile Learning-based 3D Temporal Tracker: Scalable, Robust, Online (16/29) Evaluation Part 04
  • 17. A Versatile Learning-based 3D Temporal Tracker: Scalable, Robust, Online (17/29) 이명규 ↳ • Using three benchmark datasets [6, 11, 23] to evaluate the robustness of algorithm 1. Using data set [11] to compare ‘CT[23]’ and ‘Section 3’ of this paper. 2. Comparing the results of ‘this paper’ with ‘RGB-D filter approach [6, 13, 20]’ with data set [6]. 3. Using the benchmark data set of CT[23] to compare the robustness of using ‘depth image only’. Robustness EvaluationPart 04
  • 18. A Versatile Learning-based 3D Temporal Tracker: Scalable, Robust, Online (18/29) 이명규 ↳ • Optimal parameters for learning include 642 camera views labeled as 2500 pairs of samples • It shows the lowest error value when 10 iteration rounds. Robustness – Optimum Parameters EvaluationPart 04
  • 19. A Versatile Learning-based 3D Temporal Tracker: Scalable, Robust, Online (19/29) 이명규 ↳ • Evaluation via publicly available Synthetic Dataset [6] • Each object consists of 1000 RGB-D images with ground truth results along with the model. • 0.01mm better translation and 1.01 better rotation, but notice that this study uses only depth image. • Only use the object's model without any prior knowledge of the environment. Robustness – Synthetic Dataset EvaluationPart 04
  • 20. A Versatile Learning-based 3D Temporal Tracker: Scalable, Robust, Online (20/29) 이명규 ↳ • Evaluate robustness using only actual depth image • Each sequence consists of 400 RGB-D images and a ground truth pose tracked using a marker board. • ICP is trapped in a Local Minimum in (c), but CT [23] and the results of this paper track the cat well. • CT [23] fails to track in severe occlusion as in (e), but this paper succeeds in tracking successfully. Robustness – Real Dataset EvaluationPart 04
  • 21. A Versatile Learning-based 3D Temporal Tracker: Scalable, Robust, Online (21/29) 이명규 ↳ • Sec 3.1 runs at 1.5ms per frame on a single core Intel I7 Core CPU. • The memory increases linearly with the number of camera views and the size of the training data set. • CT [23] requires 821.3 MB per forest, but only 7.4 MB is needed in this paper. • Tracking 108 objects shows 33.7ms(30fps) on an 8-core CPU with 799MB of memory usage. Computational Cost & Learning Time (1/2) EvaluationPart 04
  • 22. A Versatile Learning-based 3D Temporal Tracker: Scalable, Robust, Online (22/29) 이명규 ↳ • The learning time is linearly related to the number of camera views and the size of the training data set. • Sec 3.1 with Optimum parameters takes 31.8 seconds. (run on 8-core CPU) • Learning on 2500 pairs of samples and labels with 642 camera views. • CT [23] takes 12.3 hours. Computational Cost & Learning Time (1/2) EvaluationPart 04
  • 23. A Versatile Learning-based 3D Temporal Tracker: Scalable, Robust, Online (23/29) 이명규 ↳ • Evaluate online learning through data set in [6]. • The initial frame starts learning through the ground truth transformation. • It takes 1.3 seconds to learn with 50 trees per parameter. • Subsequent frames likewise continue to learn one tree per parameter, taking 25.6 ms per frame. • With 8 Core CPU, both learning and tracking take 26.8ms per frame. • Tracking is possible even the object's model is not at hand (e.g. head pose estimation in Fig. 1(c).) Learning Time – Online Learning EvaluationPart 04
  • 24. 이명규A Versatile Learning-based 3D Temporal Tracker: Scalable, Robust, Online (24/29) Conclusions & Limitations Part 05
  • 25. 이명규A Versatile Learning-based 3D Temporal Tracker: Scalable, Robust, Online (25/29) ↳ • Real-time, scalable and robust 3D tracking algorithm • Can be employed both in model-based as well as in online 3D tracking. • Flexible and versatile so to adapt to a variety of 3D tracking applications. Conclusions Conclusions & LimitationsPart 05
  • 26. 이명규A Versatile Learning-based 3D Temporal Tracker: Scalable, Robust, Online (26/29) ↳ • Highly symmetric objects loses some degrees of freedom. • Rotation around its axis of symmetry is ambiguous when viewed from depth image • Fails to estimate the full 3D pose with six degrees of freedom • Tree is not learned enough if there is a problem with the initial frame. • Large holes or occlusions in the initial frames create problems. Limitations Conclusions & LimitationsPart 05
  • 27. 이명규A Versatile Learning-based 3D Temporal Tracker: Scalable, Robust, Online (27/29) Thank you for Listening. Email : brstar96@naver.com (or brstar96@soongsil.ac.kr) Mobile : +82-10-8234-3179