SlideShare a Scribd company logo
1 of 15
Download to read offline
Speeding up probabilistic inference of
                       camera orientation by function
                      approximation and grid masking


 LTI–PCS–EPUSP                      Nicolau L. Werneck
     nic-wscg2011

     N. Werneck

                                           Doctoral candidate
1–Introduction
                              Supervisor: Prof. Anna Helena Reali Costa
2–Methodology           Intelligent Techniques Laboratory, LTI — PCS — Poli
                                Universidade de S˜o Paulo (USP), Brazil
                                                 a
3–Results

References

Referˆncias
     e
                                     WSCG’2011, Plzen
                                              Feb/2011
 c    N. Werneck


                                                                              1 / 15
Introduction
                    The problem — camera orientation estimation


                     Environment edges are assumed
                     to be in the three directions of
                     the reference frame.
                     (Lego Land, Manhattan World)
 LTI–PCS–EPUSP

     nic-wscg2011    We want to calculate the
     N. Werneck
                     camera orientation in relation
1–Introduction
                     to this reference frame, in
2–Methodology
                     real-time.
3–Results

References
                     Technique based on continuous
Referˆncias
     e
                     optimization. No edge extrac-
                     tion or matching involved.
      N. Werneck
                     (Maximum likelihood)
 c


                                                                  2 / 15
Introduction
                    Geometrical constraints

                     Knowing the camera orientation from a picture we can
                     predict the directions of image edges.



 LTI–PCS–EPUSP

     nic-wscg2011

     N. Werneck


1–Introduction

2–Methodology

3–Results

References

Referˆncias
     e



 c    N. Werneck


                                                                            3 / 15
Introduction
                    Bayesian camera orientation estimation

                     The data analized is the gradient of the input image.




 LTI–PCS–EPUSP

     nic-wscg2011

     N. Werneck


1–Introduction

2–Methodology

3–Results

References

Referˆncias
     e



 c    N. Werneck


                                                                             5 / 15
Introduction
                    Bayesian camera orientation estimation


                     The Bayesian camera orientation estimation works by
                     defining an objective function L(Ψ) to be optimized. The
                     solution is Ψ∗ = argmax L(Ψ).

 LTI–PCS–EPUSP
                         The function L tells how well the arguments
     nic-wscg2011

     N. Werneck
                         “explain” the evidences. (Likelihood function)
1–Introduction           In this problem Ψ is a set of arguments that model
2–Methodology            the camera orientation.
3–Results
                         L tells how much the edges in the images are
References
                         aligned to the directions expected from the
Referˆncias
     e
                         vanishing points produced by Ψ.

 c    N. Werneck


                                                                          6 / 15
Existing techniques

                    This work is based on previous research by Coughlan and
                    Yuille [2003], Deutscher et al. [2002], Schindler and
                    Dellaert [2004], Denis et al. [2008].

                    They are all based on likelihood maximization. The
 LTI–PCS–EPUSP
                    differences lie in:
     nic-wscg2011

     N. Werneck
                        What parameters are estimated.
1–Introduction          (Other than orientation).
2–Methodology
                        What optimization algorithm is employed.
3–Results
                        Expression of the likelihood function.
References

Referˆncias
     e
                        (Specially what PDF models are used).
                        Subsampling technique.
 c    N. Werneck


                                                                         7 / 15
Original expression
                    In Coughlan and Yuille [2003] the image likelihood is a
                    product of the likelihoods of gradients Eu at each pixel u.
                    Observation model


 LTI–PCS–EPUSP

     nic-wscg2011

     N. Werneck
                       Lik. pixel is edge
1–Introduction         Lik. orientation match
2–Methodology

3–Results
                    The expression built is a Maximum a posteriori estimator.
References

Referˆncias
     e
                    Using M k for P(mu = k), Φk for P(φu |mu = k, Ψ, u) and
                    taking the log we arrive at the objective function...
 c    N. Werneck


                                                                             8 / 15
Proposed expression

                    L Ψ = ∑ log Poff (Eu )Φ1 M 1 +Pon (Eu )Φ5 M 5 +Pon (Eu ) ∑4 Φk M k
                                                                             k=2
                             u

                                       a
                    Using log(b + a) ≈ b + log(b), we arrive at


 LTI–PCS–EPUSP



                    Lik. pixel is edge
     nic-wscg2011

     N. Werneck


1–Introduction
                    Lik. orientation match
2–Methodology

3–Results
                    There is a weighting coefficient based on the gradient
References
                    norm multiplied by something that depends on the
Referˆncias
     e
                    gradient directions and camera orientation.
 c    N. Werneck


                                                                                 9 / 15
Gradient norm masking
                    The mask generating function
                                                              −1
                                          Poff (Eu ) 1
                              W (Eu ) =             M + M5
                                          Pon (Eu )



 LTI–PCS–EPUSP
                                                   Also...
     nic-wscg2011

     N. Werneck                                    We replaced W for
1–Introduction
                                                   W , based on the
2–Methodology                                      logistic function.
3–Results

References                                         We also used vector
Referˆncias
     e                                             dot products instead
                                                   of calculating arctan.
 c    N. Werneck


                                                                            10 / 15
Grid masking
                    We select one from every few lines and columns.

                        Images edges are sampled regularly.
                        Minimally long lines are necessarily sampled.
                        Better strategy for high resolution images, where
 LTI–PCS–EPUSP          edge pixels are “rare”.
     nic-wscg2011

     N. Werneck


1–Introduction

2–Methodology

3–Results

References

Referˆncias
     e



 c    N. Werneck


                                                                            11 / 15
Results
                    Expression evaluation
                     Speed
                     Expressions were implemented in Cython, using SIMD
                     instructions, and tested on c1.xlarge AWS computers.
                     A speedup of 50–64× was detected.

 LTI–PCS–EPUSP
                                   Original 1100.0      ±60ms
     nic-wscg2011                  Proposed   18.9      ±2.4ms
     N. Werneck


1–Introduction       (4s per image with the proposal, without subsampling.)
2–Methodology

3–Results            Quality
References
                     From 102 tests, the original expression “fixed” the
Referˆncias
     e
                     solution in 5 occasions, but ruined 6 good solutions.
                     Mean error went from 4.7◦ to 5.5◦ . (Large outliers)
 c    N. Werneck


                                                                             12 / 15
Results
                    Grid masking evaluation

                     Speed increases as solution quality drops.




 LTI–PCS–EPUSP

     nic-wscg2011

     N. Werneck


1–Introduction

2–Methodology

3–Results

References

Referˆncias
     e



 c    N. Werneck


                                                                  13 / 15
Conclusion
                    The proposed expression is simpler, faster, intuitive and
                    justifies selecting pixels from gradient norm.

                    The grid masking technique proved to be a good
                    alternative for subsampling images deterministically.
                    Future work
 LTI–PCS–EPUSP

     nic-wscg2011
                        Develop a complete pixel selection method.
     N. Werneck
                        Find best parameters.
1–Introduction
                        Try to use gradient-based optimization.
2–Methodology

3–Results

References          Thanks!                               THE END
Referˆncias
     e


                    http://nwerneck.sdf.org
 c    N. Werneck


                                                                            14 / 15
References
                    James M. Coughlan and A. L. Yuille. Manhattan world: orientation
                       and outlier detection by bayesian inference. Neural Comput.,
                       15(5):1063–1088, 2003. ISSN 0899-7667. URL
                       doi:10.1162/089976603765202668.
                    Patrick Denis, James H. Elder, and Francisco J. Estrada. Efficient
                        edge-based methods for estimating manhattan frames in urban
                        imagery. In David A. Forsyth, Philip H. S. Torr, and Andrew
                        Zisserman, editors, ECCV (2), volume 5303 of Lecture Notes
 LTI–PCS–EPUSP          in Computer Science, pages 197–210. Springer, 2008. ISBN
     nic-wscg2011       978-3-540-88685-3.
     N. Werneck
                    Jonathan Deutscher, Michael Isard, and John Maccormick.
1–Introduction         Automatic camera calibration from a single manhattan image.
2–Methodology          In Eur. Conf. on Computer Vision (ECCV, pages 175–205,
3–Results
                       2002.
References          Grant Schindler and Frank Dellaert. Atlanta world: An expectation
Referˆncias
     e
                       maximization framework for simultaneous low-level edge
                       grouping and camera calibration in complex man-made
                       environments. In CVPR (1), pages 203–209, 2004.
 c    N. Werneck


                                                                                  15 / 15

More Related Content

What's hot

Tro07 sparse-solutions-talk
Tro07 sparse-solutions-talkTro07 sparse-solutions-talk
Tro07 sparse-solutions-talkmpbchina
 
A Wavelet - Based Object Watermarking System for MPEG4 Video
A Wavelet - Based Object Watermarking System for MPEG4 VideoA Wavelet - Based Object Watermarking System for MPEG4 Video
A Wavelet - Based Object Watermarking System for MPEG4 VideoCSCJournals
 
Action unit detection with segment based sv ms-cvpr2010
Action unit detection with segment based sv ms-cvpr2010Action unit detection with segment based sv ms-cvpr2010
Action unit detection with segment based sv ms-cvpr2010cvwpamisjtu
 
Chebyshev Functional Link Artificial Neural Networks for Denoising of Image C...
Chebyshev Functional Link Artificial Neural Networks for Denoising of Image C...Chebyshev Functional Link Artificial Neural Networks for Denoising of Image C...
Chebyshev Functional Link Artificial Neural Networks for Denoising of Image C...IDES Editor
 
IFKSA-ESPRIT - Estimating the Direction of Arrival under the Element Failures...
IFKSA-ESPRIT - Estimating the Direction of Arrival under the Element Failures...IFKSA-ESPRIT - Estimating the Direction of Arrival under the Element Failures...
IFKSA-ESPRIT - Estimating the Direction of Arrival under the Element Failures...IDES Editor
 
cvpr2011: game theory in CVPR part 2
cvpr2011: game theory in CVPR part 2cvpr2011: game theory in CVPR part 2
cvpr2011: game theory in CVPR part 2zukun
 
Applications of Structural Optimization: Corso di dottorato INTRODUZIONE ALL'...
Applications of Structural Optimization: Corso di dottorato INTRODUZIONE ALL'...Applications of Structural Optimization: Corso di dottorato INTRODUZIONE ALL'...
Applications of Structural Optimization: Corso di dottorato INTRODUZIONE ALL'...Franco Bontempi
 
Dct,gibbs phen,oversampled adc,polyphase decomposition
Dct,gibbs phen,oversampled adc,polyphase decompositionDct,gibbs phen,oversampled adc,polyphase decomposition
Dct,gibbs phen,oversampled adc,polyphase decompositionMuhammad Younas
 
ElectroencephalographySignalClassification based on Sub-Band Common Spatial P...
ElectroencephalographySignalClassification based on Sub-Band Common Spatial P...ElectroencephalographySignalClassification based on Sub-Band Common Spatial P...
ElectroencephalographySignalClassification based on Sub-Band Common Spatial P...IOSRJVSP
 
Review on cs231 part-2
Review on cs231 part-2Review on cs231 part-2
Review on cs231 part-2Jeong Choi
 

What's hot (14)

Tro07 sparse-solutions-talk
Tro07 sparse-solutions-talkTro07 sparse-solutions-talk
Tro07 sparse-solutions-talk
 
A Wavelet - Based Object Watermarking System for MPEG4 Video
A Wavelet - Based Object Watermarking System for MPEG4 VideoA Wavelet - Based Object Watermarking System for MPEG4 Video
A Wavelet - Based Object Watermarking System for MPEG4 Video
 
Action unit detection with segment based sv ms-cvpr2010
Action unit detection with segment based sv ms-cvpr2010Action unit detection with segment based sv ms-cvpr2010
Action unit detection with segment based sv ms-cvpr2010
 
Chebyshev Functional Link Artificial Neural Networks for Denoising of Image C...
Chebyshev Functional Link Artificial Neural Networks for Denoising of Image C...Chebyshev Functional Link Artificial Neural Networks for Denoising of Image C...
Chebyshev Functional Link Artificial Neural Networks for Denoising of Image C...
 
IFKSA-ESPRIT - Estimating the Direction of Arrival under the Element Failures...
IFKSA-ESPRIT - Estimating the Direction of Arrival under the Element Failures...IFKSA-ESPRIT - Estimating the Direction of Arrival under the Element Failures...
IFKSA-ESPRIT - Estimating the Direction of Arrival under the Element Failures...
 
ICPR 2012
ICPR 2012ICPR 2012
ICPR 2012
 
cvpr2011: game theory in CVPR part 2
cvpr2011: game theory in CVPR part 2cvpr2011: game theory in CVPR part 2
cvpr2011: game theory in CVPR part 2
 
Applications of Structural Optimization: Corso di dottorato INTRODUZIONE ALL'...
Applications of Structural Optimization: Corso di dottorato INTRODUZIONE ALL'...Applications of Structural Optimization: Corso di dottorato INTRODUZIONE ALL'...
Applications of Structural Optimization: Corso di dottorato INTRODUZIONE ALL'...
 
Ct31628631
Ct31628631Ct31628631
Ct31628631
 
Class01
Class01Class01
Class01
 
Back propagation
Back propagation Back propagation
Back propagation
 
Dct,gibbs phen,oversampled adc,polyphase decomposition
Dct,gibbs phen,oversampled adc,polyphase decompositionDct,gibbs phen,oversampled adc,polyphase decomposition
Dct,gibbs phen,oversampled adc,polyphase decomposition
 
ElectroencephalographySignalClassification based on Sub-Band Common Spatial P...
ElectroencephalographySignalClassification based on Sub-Band Common Spatial P...ElectroencephalographySignalClassification based on Sub-Band Common Spatial P...
ElectroencephalographySignalClassification based on Sub-Band Common Spatial P...
 
Review on cs231 part-2
Review on cs231 part-2Review on cs231 part-2
Review on cs231 part-2
 

Viewers also liked

Cs221 lecture4-fall11
Cs221 lecture4-fall11Cs221 lecture4-fall11
Cs221 lecture4-fall11darwinrlo
 
Artificial Neural Networks fuzzy logic
Artificial Neural Networks fuzzy logicArtificial Neural Networks fuzzy logic
Artificial Neural Networks fuzzy logicKrishna Reddy
 
Expert system neural fuzzy system
Expert system neural fuzzy systemExpert system neural fuzzy system
Expert system neural fuzzy systemJamil S. Alagha
 
Application of expert system
Application of expert systemApplication of expert system
Application of expert systemDinkar DP
 
Lecture5 Expert Systems And Artificial Intelligence
Lecture5 Expert Systems And Artificial IntelligenceLecture5 Expert Systems And Artificial Intelligence
Lecture5 Expert Systems And Artificial IntelligenceKodok Ngorex
 
Expert Systems
Expert SystemsExpert Systems
Expert Systemsosmancikk
 
Artificial neural network
Artificial neural networkArtificial neural network
Artificial neural networkDEEPASHRI HK
 

Viewers also liked (14)

Cs221 lecture4-fall11
Cs221 lecture4-fall11Cs221 lecture4-fall11
Cs221 lecture4-fall11
 
Artificial Neural Networks fuzzy logic
Artificial Neural Networks fuzzy logicArtificial Neural Networks fuzzy logic
Artificial Neural Networks fuzzy logic
 
Fuzzy and nn
Fuzzy and nnFuzzy and nn
Fuzzy and nn
 
Expert System - Examples
Expert System - ExamplesExpert System - Examples
Expert System - Examples
 
Expert Systems
Expert SystemsExpert Systems
Expert Systems
 
Expert system neural fuzzy system
Expert system neural fuzzy systemExpert system neural fuzzy system
Expert system neural fuzzy system
 
Expert Systems
Expert SystemsExpert Systems
Expert Systems
 
Application of expert system
Application of expert systemApplication of expert system
Application of expert system
 
Lecture5 Expert Systems And Artificial Intelligence
Lecture5 Expert Systems And Artificial IntelligenceLecture5 Expert Systems And Artificial Intelligence
Lecture5 Expert Systems And Artificial Intelligence
 
Topic 8 expert system
Topic 8 expert systemTopic 8 expert system
Topic 8 expert system
 
Expert Systems
Expert SystemsExpert Systems
Expert Systems
 
6.expert systems
6.expert systems6.expert systems
6.expert systems
 
Expert Systems
Expert SystemsExpert Systems
Expert Systems
 
Artificial neural network
Artificial neural networkArtificial neural network
Artificial neural network
 

Similar to Speeding up probabilistic inference of camera orientation by function approximation and grid masking - WSCG2011 presentation

Localized Characterization of GaAs/AlGaAs Quantum Well Devices
Localized Characterization of GaAs/AlGaAs Quantum Well DevicesLocalized Characterization of GaAs/AlGaAs Quantum Well Devices
Localized Characterization of GaAs/AlGaAs Quantum Well DevicesImee Rose Tagaca
 
Image Compression Using Wavelet Packet Tree
Image Compression Using Wavelet Packet TreeImage Compression Using Wavelet Packet Tree
Image Compression Using Wavelet Packet TreeIDES Editor
 
Reading_0413_var_Transformers.pptx
Reading_0413_var_Transformers.pptxReading_0413_var_Transformers.pptx
Reading_0413_var_Transformers.pptxcongtran88
 
3680-NoCA.pptx
3680-NoCA.pptx3680-NoCA.pptx
3680-NoCA.pptxgrssieee
 
Noise resistance territorial intensity-based optical flow using inverse confi...
Noise resistance territorial intensity-based optical flow using inverse confi...Noise resistance territorial intensity-based optical flow using inverse confi...
Noise resistance territorial intensity-based optical flow using inverse confi...journalBEEI
 
論文紹介"DynamicFusion: Reconstruction and Tracking of Non-­‐rigid Scenes in Real...
論文紹介"DynamicFusion: Reconstruction and Tracking of Non-­‐rigid Scenes in Real...論文紹介"DynamicFusion: Reconstruction and Tracking of Non-­‐rigid Scenes in Real...
論文紹介"DynamicFusion: Reconstruction and Tracking of Non-­‐rigid Scenes in Real...Ken Sakurada
 
NUMBER PLATE IMAGE DETECTION FOR FAST MOTION VEHICLES USING BLUR KERNEL ESTIM...
NUMBER PLATE IMAGE DETECTION FOR FAST MOTION VEHICLES USING BLUR KERNEL ESTIM...NUMBER PLATE IMAGE DETECTION FOR FAST MOTION VEHICLES USING BLUR KERNEL ESTIM...
NUMBER PLATE IMAGE DETECTION FOR FAST MOTION VEHICLES USING BLUR KERNEL ESTIM...paperpublications3
 
International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER)International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER)ijceronline
 
International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER)International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER)ijceronline
 
Object Distance Detection using a Joint Transform Correlator
Object Distance Detection using a Joint Transform CorrelatorObject Distance Detection using a Joint Transform Correlator
Object Distance Detection using a Joint Transform CorrelatorAlexander Layton
 
Presentation_Guccione.pptx
Presentation_Guccione.pptxPresentation_Guccione.pptx
Presentation_Guccione.pptxgrssieee
 
Expert system design for elastic scattering neutrons optical model using bpnn
Expert system design for elastic scattering neutrons optical model using bpnnExpert system design for elastic scattering neutrons optical model using bpnn
Expert system design for elastic scattering neutrons optical model using bpnnijcsa
 
Shift Invarient and Eigen Feature Based Image Fusion
Shift Invarient and Eigen Feature Based Image Fusion Shift Invarient and Eigen Feature Based Image Fusion
Shift Invarient and Eigen Feature Based Image Fusion ijcisjournal
 
Optimal neural network models for wind speed prediction
Optimal neural network models for wind speed predictionOptimal neural network models for wind speed prediction
Optimal neural network models for wind speed predictionIAEME Publication
 
Optimal neural network models for wind speed prediction
Optimal neural network models for wind speed predictionOptimal neural network models for wind speed prediction
Optimal neural network models for wind speed predictionIAEME Publication
 
Optimal neural network models for wind speed prediction
Optimal neural network models for wind speed predictionOptimal neural network models for wind speed prediction
Optimal neural network models for wind speed predictionIAEME Publication
 

Similar to Speeding up probabilistic inference of camera orientation by function approximation and grid masking - WSCG2011 presentation (20)

Localized Characterization of GaAs/AlGaAs Quantum Well Devices
Localized Characterization of GaAs/AlGaAs Quantum Well DevicesLocalized Characterization of GaAs/AlGaAs Quantum Well Devices
Localized Characterization of GaAs/AlGaAs Quantum Well Devices
 
Image Compression Using Wavelet Packet Tree
Image Compression Using Wavelet Packet TreeImage Compression Using Wavelet Packet Tree
Image Compression Using Wavelet Packet Tree
 
SPIE99_1B.DOC
SPIE99_1B.DOCSPIE99_1B.DOC
SPIE99_1B.DOC
 
Reading_0413_var_Transformers.pptx
Reading_0413_var_Transformers.pptxReading_0413_var_Transformers.pptx
Reading_0413_var_Transformers.pptx
 
Dx25751756
Dx25751756Dx25751756
Dx25751756
 
3680-NoCA.pptx
3680-NoCA.pptx3680-NoCA.pptx
3680-NoCA.pptx
 
Noise resistance territorial intensity-based optical flow using inverse confi...
Noise resistance territorial intensity-based optical flow using inverse confi...Noise resistance territorial intensity-based optical flow using inverse confi...
Noise resistance territorial intensity-based optical flow using inverse confi...
 
Final Poster
Final PosterFinal Poster
Final Poster
 
論文紹介"DynamicFusion: Reconstruction and Tracking of Non-­‐rigid Scenes in Real...
論文紹介"DynamicFusion: Reconstruction and Tracking of Non-­‐rigid Scenes in Real...論文紹介"DynamicFusion: Reconstruction and Tracking of Non-­‐rigid Scenes in Real...
論文紹介"DynamicFusion: Reconstruction and Tracking of Non-­‐rigid Scenes in Real...
 
NUMBER PLATE IMAGE DETECTION FOR FAST MOTION VEHICLES USING BLUR KERNEL ESTIM...
NUMBER PLATE IMAGE DETECTION FOR FAST MOTION VEHICLES USING BLUR KERNEL ESTIM...NUMBER PLATE IMAGE DETECTION FOR FAST MOTION VEHICLES USING BLUR KERNEL ESTIM...
NUMBER PLATE IMAGE DETECTION FOR FAST MOTION VEHICLES USING BLUR KERNEL ESTIM...
 
International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER)International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER)
 
International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER)International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER)
 
Object Distance Detection using a Joint Transform Correlator
Object Distance Detection using a Joint Transform CorrelatorObject Distance Detection using a Joint Transform Correlator
Object Distance Detection using a Joint Transform Correlator
 
Presentation_Guccione.pptx
Presentation_Guccione.pptxPresentation_Guccione.pptx
Presentation_Guccione.pptx
 
Expert system design for elastic scattering neutrons optical model using bpnn
Expert system design for elastic scattering neutrons optical model using bpnnExpert system design for elastic scattering neutrons optical model using bpnn
Expert system design for elastic scattering neutrons optical model using bpnn
 
Shift Invarient and Eigen Feature Based Image Fusion
Shift Invarient and Eigen Feature Based Image Fusion Shift Invarient and Eigen Feature Based Image Fusion
Shift Invarient and Eigen Feature Based Image Fusion
 
Optimal neural network models for wind speed prediction
Optimal neural network models for wind speed predictionOptimal neural network models for wind speed prediction
Optimal neural network models for wind speed prediction
 
Optimal neural network models for wind speed prediction
Optimal neural network models for wind speed predictionOptimal neural network models for wind speed prediction
Optimal neural network models for wind speed prediction
 
Optimal neural network models for wind speed prediction
Optimal neural network models for wind speed predictionOptimal neural network models for wind speed prediction
Optimal neural network models for wind speed prediction
 
73.1 s
73.1 s73.1 s
73.1 s
 

More from Nicolau Werneck

Introdução a Scala [GeekieTalk]
Introdução a Scala [GeekieTalk]Introdução a Scala [GeekieTalk]
Introdução a Scala [GeekieTalk]Nicolau Werneck
 
Expectation Maximization: o básico do básico
Expectation Maximization: o básico do básicoExpectation Maximization: o básico do básico
Expectation Maximization: o básico do básicoNicolau Werneck
 
Nicolau Werneck professional experience
Nicolau Werneck professional experienceNicolau Werneck professional experience
Nicolau Werneck professional experienceNicolau Werneck
 
Estimação de orientação de câmera em ambientes antrópicos a partir de edgels
Estimação de orientação de câmera em ambientes antrópicos a partir de edgelsEstimação de orientação de câmera em ambientes antrópicos a partir de edgels
Estimação de orientação de câmera em ambientes antrópicos a partir de edgelsNicolau Werneck
 
Detecção de quedas em robôs móveis com visão monocular linear
Detecção de quedas em robôs móveis com visão monocular linearDetecção de quedas em robôs móveis com visão monocular linear
Detecção de quedas em robôs móveis com visão monocular linearNicolau Werneck
 
Mapeamento Visual Monocular com a Transformada Rápida de Hough
Mapeamento Visual Monocular com a Transformada Rápida de HoughMapeamento Visual Monocular com a Transformada Rápida de Hough
Mapeamento Visual Monocular com a Transformada Rápida de HoughNicolau Werneck
 

More from Nicolau Werneck (8)

Corisco - 2015
Corisco - 2015Corisco - 2015
Corisco - 2015
 
Introdução a Scala [GeekieTalk]
Introdução a Scala [GeekieTalk]Introdução a Scala [GeekieTalk]
Introdução a Scala [GeekieTalk]
 
Expectation Maximization: o básico do básico
Expectation Maximization: o básico do básicoExpectation Maximization: o básico do básico
Expectation Maximization: o básico do básico
 
Nicolau Werneck professional experience
Nicolau Werneck professional experienceNicolau Werneck professional experience
Nicolau Werneck professional experience
 
Agrupamento espectral
Agrupamento espectralAgrupamento espectral
Agrupamento espectral
 
Estimação de orientação de câmera em ambientes antrópicos a partir de edgels
Estimação de orientação de câmera em ambientes antrópicos a partir de edgelsEstimação de orientação de câmera em ambientes antrópicos a partir de edgels
Estimação de orientação de câmera em ambientes antrópicos a partir de edgels
 
Detecção de quedas em robôs móveis com visão monocular linear
Detecção de quedas em robôs móveis com visão monocular linearDetecção de quedas em robôs móveis com visão monocular linear
Detecção de quedas em robôs móveis com visão monocular linear
 
Mapeamento Visual Monocular com a Transformada Rápida de Hough
Mapeamento Visual Monocular com a Transformada Rápida de HoughMapeamento Visual Monocular com a Transformada Rápida de Hough
Mapeamento Visual Monocular com a Transformada Rápida de Hough
 

Recently uploaded

Azure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAzure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAndikSusilo4
 
Artificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraArtificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraDeakin University
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsSnow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsHyundai Motor Group
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptxLBM Solutions
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
Next-generation AAM aircraft unveiled by Supernal, S-A2
Next-generation AAM aircraft unveiled by Supernal, S-A2Next-generation AAM aircraft unveiled by Supernal, S-A2
Next-generation AAM aircraft unveiled by Supernal, S-A2Hyundai Motor Group
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your Budget
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your BudgetHyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your Budget
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your BudgetEnjoy Anytime
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 

Recently uploaded (20)

Azure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAzure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & Application
 
Artificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraArtificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning era
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsSnow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptxVulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptx
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
Next-generation AAM aircraft unveiled by Supernal, S-A2
Next-generation AAM aircraft unveiled by Supernal, S-A2Next-generation AAM aircraft unveiled by Supernal, S-A2
Next-generation AAM aircraft unveiled by Supernal, S-A2
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your Budget
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your BudgetHyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your Budget
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your Budget
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 

Speeding up probabilistic inference of camera orientation by function approximation and grid masking - WSCG2011 presentation

  • 1. Speeding up probabilistic inference of camera orientation by function approximation and grid masking LTI–PCS–EPUSP Nicolau L. Werneck nic-wscg2011 N. Werneck Doctoral candidate 1–Introduction Supervisor: Prof. Anna Helena Reali Costa 2–Methodology Intelligent Techniques Laboratory, LTI — PCS — Poli Universidade de S˜o Paulo (USP), Brazil a 3–Results References Referˆncias e WSCG’2011, Plzen Feb/2011 c N. Werneck 1 / 15
  • 2. Introduction The problem — camera orientation estimation Environment edges are assumed to be in the three directions of the reference frame. (Lego Land, Manhattan World) LTI–PCS–EPUSP nic-wscg2011 We want to calculate the N. Werneck camera orientation in relation 1–Introduction to this reference frame, in 2–Methodology real-time. 3–Results References Technique based on continuous Referˆncias e optimization. No edge extrac- tion or matching involved. N. Werneck (Maximum likelihood) c 2 / 15
  • 3. Introduction Geometrical constraints Knowing the camera orientation from a picture we can predict the directions of image edges. LTI–PCS–EPUSP nic-wscg2011 N. Werneck 1–Introduction 2–Methodology 3–Results References Referˆncias e c N. Werneck 3 / 15
  • 4.
  • 5. Introduction Bayesian camera orientation estimation The data analized is the gradient of the input image. LTI–PCS–EPUSP nic-wscg2011 N. Werneck 1–Introduction 2–Methodology 3–Results References Referˆncias e c N. Werneck 5 / 15
  • 6. Introduction Bayesian camera orientation estimation The Bayesian camera orientation estimation works by defining an objective function L(Ψ) to be optimized. The solution is Ψ∗ = argmax L(Ψ). LTI–PCS–EPUSP The function L tells how well the arguments nic-wscg2011 N. Werneck “explain” the evidences. (Likelihood function) 1–Introduction In this problem Ψ is a set of arguments that model 2–Methodology the camera orientation. 3–Results L tells how much the edges in the images are References aligned to the directions expected from the Referˆncias e vanishing points produced by Ψ. c N. Werneck 6 / 15
  • 7. Existing techniques This work is based on previous research by Coughlan and Yuille [2003], Deutscher et al. [2002], Schindler and Dellaert [2004], Denis et al. [2008]. They are all based on likelihood maximization. The LTI–PCS–EPUSP differences lie in: nic-wscg2011 N. Werneck What parameters are estimated. 1–Introduction (Other than orientation). 2–Methodology What optimization algorithm is employed. 3–Results Expression of the likelihood function. References Referˆncias e (Specially what PDF models are used). Subsampling technique. c N. Werneck 7 / 15
  • 8. Original expression In Coughlan and Yuille [2003] the image likelihood is a product of the likelihoods of gradients Eu at each pixel u. Observation model LTI–PCS–EPUSP nic-wscg2011 N. Werneck Lik. pixel is edge 1–Introduction Lik. orientation match 2–Methodology 3–Results The expression built is a Maximum a posteriori estimator. References Referˆncias e Using M k for P(mu = k), Φk for P(φu |mu = k, Ψ, u) and taking the log we arrive at the objective function... c N. Werneck 8 / 15
  • 9. Proposed expression L Ψ = ∑ log Poff (Eu )Φ1 M 1 +Pon (Eu )Φ5 M 5 +Pon (Eu ) ∑4 Φk M k k=2 u a Using log(b + a) ≈ b + log(b), we arrive at LTI–PCS–EPUSP Lik. pixel is edge nic-wscg2011 N. Werneck 1–Introduction Lik. orientation match 2–Methodology 3–Results There is a weighting coefficient based on the gradient References norm multiplied by something that depends on the Referˆncias e gradient directions and camera orientation. c N. Werneck 9 / 15
  • 10. Gradient norm masking The mask generating function −1 Poff (Eu ) 1 W (Eu ) = M + M5 Pon (Eu ) LTI–PCS–EPUSP Also... nic-wscg2011 N. Werneck We replaced W for 1–Introduction W , based on the 2–Methodology logistic function. 3–Results References We also used vector Referˆncias e dot products instead of calculating arctan. c N. Werneck 10 / 15
  • 11. Grid masking We select one from every few lines and columns. Images edges are sampled regularly. Minimally long lines are necessarily sampled. Better strategy for high resolution images, where LTI–PCS–EPUSP edge pixels are “rare”. nic-wscg2011 N. Werneck 1–Introduction 2–Methodology 3–Results References Referˆncias e c N. Werneck 11 / 15
  • 12. Results Expression evaluation Speed Expressions were implemented in Cython, using SIMD instructions, and tested on c1.xlarge AWS computers. A speedup of 50–64× was detected. LTI–PCS–EPUSP Original 1100.0 ±60ms nic-wscg2011 Proposed 18.9 ±2.4ms N. Werneck 1–Introduction (4s per image with the proposal, without subsampling.) 2–Methodology 3–Results Quality References From 102 tests, the original expression “fixed” the Referˆncias e solution in 5 occasions, but ruined 6 good solutions. Mean error went from 4.7◦ to 5.5◦ . (Large outliers) c N. Werneck 12 / 15
  • 13. Results Grid masking evaluation Speed increases as solution quality drops. LTI–PCS–EPUSP nic-wscg2011 N. Werneck 1–Introduction 2–Methodology 3–Results References Referˆncias e c N. Werneck 13 / 15
  • 14. Conclusion The proposed expression is simpler, faster, intuitive and justifies selecting pixels from gradient norm. The grid masking technique proved to be a good alternative for subsampling images deterministically. Future work LTI–PCS–EPUSP nic-wscg2011 Develop a complete pixel selection method. N. Werneck Find best parameters. 1–Introduction Try to use gradient-based optimization. 2–Methodology 3–Results References Thanks! THE END Referˆncias e http://nwerneck.sdf.org c N. Werneck 14 / 15
  • 15. References James M. Coughlan and A. L. Yuille. Manhattan world: orientation and outlier detection by bayesian inference. Neural Comput., 15(5):1063–1088, 2003. ISSN 0899-7667. URL doi:10.1162/089976603765202668. Patrick Denis, James H. Elder, and Francisco J. Estrada. Efficient edge-based methods for estimating manhattan frames in urban imagery. In David A. Forsyth, Philip H. S. Torr, and Andrew Zisserman, editors, ECCV (2), volume 5303 of Lecture Notes LTI–PCS–EPUSP in Computer Science, pages 197–210. Springer, 2008. ISBN nic-wscg2011 978-3-540-88685-3. N. Werneck Jonathan Deutscher, Michael Isard, and John Maccormick. 1–Introduction Automatic camera calibration from a single manhattan image. 2–Methodology In Eur. Conf. on Computer Vision (ECCV, pages 175–205, 3–Results 2002. References Grant Schindler and Frank Dellaert. Atlanta world: An expectation Referˆncias e maximization framework for simultaneous low-level edge grouping and camera calibration in complex man-made environments. In CVPR (1), pages 203–209, 2004. c N. Werneck 15 / 15