SlideShare a Scribd company logo
1 of 23
Download to read offline
Semantically Integrating Laser and Vision
        in Pedestrian Detection




                               Luciano Oliveira

                                          Advisors:
                                Prof. Urbano Nunes
                                Prof. Paulo Peixoto
Motivation
                                     Where is the pedestrian
                   Clustering        in the scene?
                   methods

  Segmentation
                                    Recognition



                 Kalman                         Efficient sub-
                  Filter                      window searching
                                                   (image)



    Tracking
                                Searching
Goals
          Object detection
        using laser/vision
          Proof-of-concept:
        pedestrian detection,
        but can be applied to
        several other objects
          Recover object
        localization
          DO NOT entirely
        rely on laser, as
        previous methods do
          Perform the fusion in
        a context-aware mode
Overview of the proposed method
   Laser points                                                                                         Images


                                                                       Laser-image registration                HLSM-FINT



      Coarse               3D sliding window                        Sensor                            Parts-based
   segmentation                searching               For each   registration        For each     ensemble detector
                                                      3D window                      2D window



          {cn }n=1
               N                                                                                   (object, confidence)

                                                                         Templating matching



       Fine                                         Procrustes                                    Semantic/contextual
   segmentation
                        { f m }m=1 for each c
                               M
                                                n
                                                     analysis        (label, confidence) for         interpretation
                                                                             each fm

   Laser segmentation                                                                                                 MLN
      and labeling
                                                                                                      Ground MRF



                                                    Reference                                       Inference and
                                                     shapes                                        decision outputs
Experimental setup
                 Pointgrey camera


                          Sick LMS200 laser




      Odometry
Sensor-driven detectors
   Laser points                                                                                         Images


                                                                       Laser-image registration                HLSM-FINT



      Coarse               3D sliding window                        Sensor                            Parts-based
   segmentation                searching               For each   registration        For each     ensemble detector
                                                      3D window                      2D window



          {cn }n=1
               N                                                                                   (object, confidence)

                                                                         Templating matching



       Fine                                         Procrustes                                    Semantic/contextual
   segmentation
                        { f m }m=1 for each c
                               M
                                                n
                                                     analysis        (label, confidence) for         interpretation
                                                                             each fm

   Laser segmentation                                                                                                 MLN
      and labeling
                                                                                                      Ground MRF



                                                    Reference                                       Inference and
                                                     shapes                                        decision outputs
Ensemble of classifiers HFI
         Fuzzy inputs            Hierarchical Fuzzy Integration

          Perimeter
            rate
                             Fuzzy
                            System




                                     Intersection
                                         rate




                                                             Final confidence
               Distance /
               max(w,w´)                             Fuzzy
                                                    System

    C1
                                     confidence




   C2
                                     Joint




    C1 scaled score          Fuzzy
                            System
    C2 scaled score
Drawbacks

 Initially evaluated on Haar-like features /
 Adaboost and HOG / SVM classification
 systems


   It suffers from exponential growing of
   rules and low overall performance over
   challenging situations
Ensemble of classifiers HLSM-FINT
HLSM-FINT – Rationale
• CNN – expert in background (BG)
(60% of hit rate in NiSIS competition)
                                         BG   BG   OB
• HOG/SVM – expert in objects (OB)
(70% of hit rate in NiSIS competition)
• Fuzzy integral (Sugeno) – provides
a comprehensive framework and
great synergism
• 95.67% of hit rate in NiSIS
competition over 6125 cropped
images (ped + non-ped), using
Heuristic Majority Vote method           BG   BG   OB
• 96.4% of hit rate over full
DaimlerChrysler datasets : ~15.000
images
Parts-based HLSM-FINT

                 Upper HLSM-FINT
                 (shoulder + header)




                 Lower HLSM-FINT
                 (waist)
Laser detector
Laser detector

                                                    arm
                                            arm torso



                                                    partial             arm           arm
                                                                              torso
                                                   segment



                                                                torso
                                                          arm




• Featureless approach
• Coarse-to-fine segmentation
• Relative Neighboorhood Graph (RNG) clustering + clustering index
• Procrustes Analysis (PA) labeling procedure
Laser detector
Laser detector


Occlusion problem:
• z-buffer analysis +
• angle between start and end
point (proportional to laser
angle resolution)
Laser-image registration
   Laser points                                                                                         Images


                                                                       Laser-image registration                HLSM-FINT



      Coarse               3D sliding window                        Sensor                            Parts-based
   segmentation                searching               For each   registration        For each     ensemble detector
                                                      3D window                      2D window



          {cn }n=1
               N                                                                                   (object, confidence)

                                                                         Templating matching



       Fine                                         Procrustes                                    Semantic/contextual
   segmentation
                        { f m }m=1 for each c
                               M
                                                n
                                                     analysis        (label, confidence) for         interpretation
                                                                             each fm

   Laser segmentation                                                                                                 MLN
      and labeling
                                                                                                      Ground MRF



                                                    Reference                                       Inference and
                                                     shapes                                        decision outputs
Laser-image registration




Zhang and Pless’ calibration method
(with an error of 6 mm in the calibration)
Semantic Fusion
  Laser points                                                                                         Images


                                                                      Laser-image registration                HLSM-FINT



     Coarse               3D sliding window                        Sensor                            Parts-based
  segmentation                searching               For each   registration        For each     ensemble detector
                                                     3D window                      2D window



         {cn }n=1
              N                                                                                   (object, confidence)

                                                                        Templating matching



      Fine                                         Procrustes                                    Semantic/contextual
  segmentation
                       { f m }m=1 for each c
                              M
                                               n
                                                    analysis        (label, confidence) for         interpretation
                                                                            each fm

  Laser segmentation                                                                                                 MLN
     and labeling
                                                                                                     Ground MRF



                                                   Reference                                       Inference and
                                                    shapes                                        decision outputs
Semantic fusion
Semantic fusion

                         Wi

                                                                        MRF




 • MRFs given by FOL formulas
 • Weights given by the MRF training (gradient ascent method over the
 conditonal log-likelihood)
Semantic fusion – Examples
Conclusions
 HFI has achieved better performance than its components, but failed
 to get the gist of the fusion
 HLSM-FINT has succeeded to capture the aimed synergism of the
 fusion, but has had difficulties on hard situations (e.g. occlusion).
 Parts-based occlusion has improved this issue.
 The introduction of the laser sensor has brought significant
 improvement
 The proposed fusion method offers two main advantages:
         Contextual and spatial relationship among the parts of the
         object, dropping the false alarm rate
         It is able to detect the object in spite of laser failing
 The whole system is not able to run on-the-fly, although there is no
 code optimization. Nevertheless, parallel hardware can provide
 interesting plataform to make the system faster. It will be subject of
 future research.
Publications and awards
 Journals
       OLIVEIRA, L.; NUNES, U.; PEIXOTO, P.; SILVA, M. and MOITA, F. Semantic
       Fusion of Laser and Vision in Pedestrian Detection, Journal of Pattern
       Recognition, Elsevier, accepted for publication (ISI impact factor: 3.279).
       OLIVEIRA, L.; NUNES, U. and PEIXOTO, P. On Exploration of Classifier
       Ensemble Synergism in Pedestrian Detection, IEEE Transactions on
       Intelligent Transportation Systems, pp. 16-21, 2010 (ISI impact factor:
       2.844).


 Awards
       3rd place in Intel/GV Entrepreneurship and Venture Capital Competition
       (2008)
       1st place in NiSIS Competition - Best accuracy model over Daimler Chrysler
       image dataset. Scheme of Primate's Visual Cortex Cells for Pedestrian
       Recognition (2007)


 5 international conferences

More Related Content

What's hot

Contribution of Non-Scrambled Chroma Information in Privacy-Protected Face Im...
Contribution of Non-Scrambled Chroma Information in Privacy-Protected Face Im...Contribution of Non-Scrambled Chroma Information in Privacy-Protected Face Im...
Contribution of Non-Scrambled Chroma Information in Privacy-Protected Face Im...Wesley De Neve
 
Poster Toward a realistic retinal simulator
Poster Toward a realistic retinal simulatorPoster Toward a realistic retinal simulator
Poster Toward a realistic retinal simulatorHassan Nasser
 
Robust wavelet denoising
Robust wavelet denoisingRobust wavelet denoising
Robust wavelet denoisinggulanglhs
 
Tsmf Methodology
Tsmf MethodologyTsmf Methodology
Tsmf Methodologykangaxx84
 
Reinhardt - Adaptive Combinatorial Multimodal Sensing Physics & Methods - Spr...
Reinhardt - Adaptive Combinatorial Multimodal Sensing Physics & Methods - Spr...Reinhardt - Adaptive Combinatorial Multimodal Sensing Physics & Methods - Spr...
Reinhardt - Adaptive Combinatorial Multimodal Sensing Physics & Methods - Spr...The Air Force Office of Scientific Research
 
IDS Survey on Entropy
IDS Survey  on Entropy IDS Survey  on Entropy
IDS Survey on Entropy Raj Kamal
 
Graded Patterns in Attractor Networks
Graded Patterns in Attractor NetworksGraded Patterns in Attractor Networks
Graded Patterns in Attractor Networkstristanjwebb
 
Quantum Information with Continuous Variable systems
Quantum Information with Continuous Variable systemsQuantum Information with Continuous Variable systems
Quantum Information with Continuous Variable systemskarl3s
 
44 i9 advanced-speaker-recognition
44 i9 advanced-speaker-recognition44 i9 advanced-speaker-recognition
44 i9 advanced-speaker-recognitionsunnysyed
 
CCNxCon2012: Session 5: Distributed Cooperative Caching Scheme in CCN
CCNxCon2012: Session 5: Distributed Cooperative Caching Scheme in CCNCCNxCon2012: Session 5: Distributed Cooperative Caching Scheme in CCN
CCNxCon2012: Session 5: Distributed Cooperative Caching Scheme in CCNPARC, a Xerox company
 
SSN2012 Deriving Semantic Sensor Metadata from Raw Measurements
SSN2012 Deriving Semantic Sensor Metadata from Raw MeasurementsSSN2012 Deriving Semantic Sensor Metadata from Raw Measurements
SSN2012 Deriving Semantic Sensor Metadata from Raw MeasurementsJean-Paul Calbimonte
 
iDiff 2008 conference #04 IP-Racine FSSG
iDiff 2008 conference #04 IP-Racine   FSSGiDiff 2008 conference #04 IP-Racine   FSSG
iDiff 2008 conference #04 IP-Racine FSSGBenoit Michel
 
Nishimoto Interspeech 2010 v3
Nishimoto Interspeech 2010 v3Nishimoto Interspeech 2010 v3
Nishimoto Interspeech 2010 v3Takuya Nishimoto
 

What's hot (20)

Contribution of Non-Scrambled Chroma Information in Privacy-Protected Face Im...
Contribution of Non-Scrambled Chroma Information in Privacy-Protected Face Im...Contribution of Non-Scrambled Chroma Information in Privacy-Protected Face Im...
Contribution of Non-Scrambled Chroma Information in Privacy-Protected Face Im...
 
Omd dibg 2
Omd dibg 2Omd dibg 2
Omd dibg 2
 
Poster Toward a realistic retinal simulator
Poster Toward a realistic retinal simulatorPoster Toward a realistic retinal simulator
Poster Toward a realistic retinal simulator
 
Robust wavelet denoising
Robust wavelet denoisingRobust wavelet denoising
Robust wavelet denoising
 
Tsmf Methodology
Tsmf MethodologyTsmf Methodology
Tsmf Methodology
 
Reinhardt - Adaptive Combinatorial Multimodal Sensing Physics & Methods - Spr...
Reinhardt - Adaptive Combinatorial Multimodal Sensing Physics & Methods - Spr...Reinhardt - Adaptive Combinatorial Multimodal Sensing Physics & Methods - Spr...
Reinhardt - Adaptive Combinatorial Multimodal Sensing Physics & Methods - Spr...
 
An01 dws
An01 dwsAn01 dws
An01 dws
 
Compressed Sensing In Spectral Imaging
Compressed Sensing In Spectral Imaging  Compressed Sensing In Spectral Imaging
Compressed Sensing In Spectral Imaging
 
IDS Survey on Entropy
IDS Survey  on Entropy IDS Survey  on Entropy
IDS Survey on Entropy
 
Mobile Networking Solutions for First Responders
Mobile Networking Solutions for First RespondersMobile Networking Solutions for First Responders
Mobile Networking Solutions for First Responders
 
Graded Patterns in Attractor Networks
Graded Patterns in Attractor NetworksGraded Patterns in Attractor Networks
Graded Patterns in Attractor Networks
 
Quantum Information with Continuous Variable systems
Quantum Information with Continuous Variable systemsQuantum Information with Continuous Variable systems
Quantum Information with Continuous Variable systems
 
Sparse representation and compressive sensing
Sparse representation and compressive sensingSparse representation and compressive sensing
Sparse representation and compressive sensing
 
05503983
0550398305503983
05503983
 
44 i9 advanced-speaker-recognition
44 i9 advanced-speaker-recognition44 i9 advanced-speaker-recognition
44 i9 advanced-speaker-recognition
 
CCNxCon2012: Session 5: Distributed Cooperative Caching Scheme in CCN
CCNxCon2012: Session 5: Distributed Cooperative Caching Scheme in CCNCCNxCon2012: Session 5: Distributed Cooperative Caching Scheme in CCN
CCNxCon2012: Session 5: Distributed Cooperative Caching Scheme in CCN
 
SSN2012 Deriving Semantic Sensor Metadata from Raw Measurements
SSN2012 Deriving Semantic Sensor Metadata from Raw MeasurementsSSN2012 Deriving Semantic Sensor Metadata from Raw Measurements
SSN2012 Deriving Semantic Sensor Metadata from Raw Measurements
 
iDiff 2008 conference #04 IP-Racine FSSG
iDiff 2008 conference #04 IP-Racine   FSSGiDiff 2008 conference #04 IP-Racine   FSSG
iDiff 2008 conference #04 IP-Racine FSSG
 
Nishimoto Interspeech 2010 v3
Nishimoto Interspeech 2010 v3Nishimoto Interspeech 2010 v3
Nishimoto Interspeech 2010 v3
 
Fdmu pro step_2010_final
Fdmu pro step_2010_finalFdmu pro step_2010_final
Fdmu pro step_2010_final
 

Viewers also liked

An Ontology-Based Intelligent Speed Adaptation System for Autonomous Cars
An Ontology-Based Intelligent Speed Adaptation System for Autonomous CarsAn Ontology-Based Intelligent Speed Adaptation System for Autonomous Cars
An Ontology-Based Intelligent Speed Adaptation System for Autonomous CarsLihua Zhao
 
Global NCAP highlights pedestrian safety at the UN
Global NCAP highlights pedestrian safety at the UNGlobal NCAP highlights pedestrian safety at the UN
Global NCAP highlights pedestrian safety at the UNGlobal NCAP
 
Machine vision based pedestrian detection and lane departure
Machine vision based pedestrian detection and lane departureMachine vision based pedestrian detection and lane departure
Machine vision based pedestrian detection and lane departureSanket Borhade
 
Pedestrian Detection Technology - Brochure
Pedestrian Detection Technology - BrochurePedestrian Detection Technology - Brochure
Pedestrian Detection Technology - BrochureMobileye
 
Real time pedestrian detection, tracking, and distance estimation
Real time pedestrian detection, tracking, and distance estimationReal time pedestrian detection, tracking, and distance estimation
Real time pedestrian detection, tracking, and distance estimationomid Asudeh
 

Viewers also liked (7)

Safe pedestrian
Safe pedestrianSafe pedestrian
Safe pedestrian
 
An Ontology-Based Intelligent Speed Adaptation System for Autonomous Cars
An Ontology-Based Intelligent Speed Adaptation System for Autonomous CarsAn Ontology-Based Intelligent Speed Adaptation System for Autonomous Cars
An Ontology-Based Intelligent Speed Adaptation System for Autonomous Cars
 
[IJET V2I3P2] Authors: Shraddha Kallappa Walikar, Dr. Aswatha Kumar M
[IJET V2I3P2] Authors: Shraddha Kallappa Walikar,  Dr. Aswatha Kumar M[IJET V2I3P2] Authors: Shraddha Kallappa Walikar,  Dr. Aswatha Kumar M
[IJET V2I3P2] Authors: Shraddha Kallappa Walikar, Dr. Aswatha Kumar M
 
Global NCAP highlights pedestrian safety at the UN
Global NCAP highlights pedestrian safety at the UNGlobal NCAP highlights pedestrian safety at the UN
Global NCAP highlights pedestrian safety at the UN
 
Machine vision based pedestrian detection and lane departure
Machine vision based pedestrian detection and lane departureMachine vision based pedestrian detection and lane departure
Machine vision based pedestrian detection and lane departure
 
Pedestrian Detection Technology - Brochure
Pedestrian Detection Technology - BrochurePedestrian Detection Technology - Brochure
Pedestrian Detection Technology - Brochure
 
Real time pedestrian detection, tracking, and distance estimation
Real time pedestrian detection, tracking, and distance estimationReal time pedestrian detection, tracking, and distance estimation
Real time pedestrian detection, tracking, and distance estimation
 

Recently uploaded

SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Azure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAzure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAndikSusilo4
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphNeo4j
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptxLBM Solutions
 

Recently uploaded (20)

SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food Manufacturing
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Azure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAzure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & Application
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptx
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptx
 

Thesis presentation

  • 1. Semantically Integrating Laser and Vision in Pedestrian Detection Luciano Oliveira Advisors: Prof. Urbano Nunes Prof. Paulo Peixoto
  • 2. Motivation Where is the pedestrian Clustering in the scene? methods Segmentation Recognition Kalman Efficient sub- Filter window searching (image) Tracking Searching
  • 3. Goals Object detection using laser/vision Proof-of-concept: pedestrian detection, but can be applied to several other objects Recover object localization DO NOT entirely rely on laser, as previous methods do Perform the fusion in a context-aware mode
  • 4. Overview of the proposed method Laser points Images Laser-image registration HLSM-FINT Coarse 3D sliding window Sensor Parts-based segmentation searching For each registration For each ensemble detector 3D window 2D window {cn }n=1 N (object, confidence) Templating matching Fine Procrustes Semantic/contextual segmentation { f m }m=1 for each c M n analysis (label, confidence) for interpretation each fm Laser segmentation MLN and labeling Ground MRF Reference Inference and shapes decision outputs
  • 5. Experimental setup Pointgrey camera Sick LMS200 laser Odometry
  • 6. Sensor-driven detectors Laser points Images Laser-image registration HLSM-FINT Coarse 3D sliding window Sensor Parts-based segmentation searching For each registration For each ensemble detector 3D window 2D window {cn }n=1 N (object, confidence) Templating matching Fine Procrustes Semantic/contextual segmentation { f m }m=1 for each c M n analysis (label, confidence) for interpretation each fm Laser segmentation MLN and labeling Ground MRF Reference Inference and shapes decision outputs
  • 7. Ensemble of classifiers HFI Fuzzy inputs Hierarchical Fuzzy Integration Perimeter rate Fuzzy System Intersection rate Final confidence Distance / max(w,w´) Fuzzy System C1 confidence C2 Joint C1 scaled score Fuzzy System C2 scaled score
  • 8. Drawbacks Initially evaluated on Haar-like features / Adaboost and HOG / SVM classification systems It suffers from exponential growing of rules and low overall performance over challenging situations
  • 10. HLSM-FINT – Rationale • CNN – expert in background (BG) (60% of hit rate in NiSIS competition) BG BG OB • HOG/SVM – expert in objects (OB) (70% of hit rate in NiSIS competition) • Fuzzy integral (Sugeno) – provides a comprehensive framework and great synergism • 95.67% of hit rate in NiSIS competition over 6125 cropped images (ped + non-ped), using Heuristic Majority Vote method BG BG OB • 96.4% of hit rate over full DaimlerChrysler datasets : ~15.000 images
  • 11. Parts-based HLSM-FINT Upper HLSM-FINT (shoulder + header) Lower HLSM-FINT (waist)
  • 13. Laser detector arm arm torso partial arm arm torso segment torso arm • Featureless approach • Coarse-to-fine segmentation • Relative Neighboorhood Graph (RNG) clustering + clustering index • Procrustes Analysis (PA) labeling procedure
  • 15. Laser detector Occlusion problem: • z-buffer analysis + • angle between start and end point (proportional to laser angle resolution)
  • 16. Laser-image registration Laser points Images Laser-image registration HLSM-FINT Coarse 3D sliding window Sensor Parts-based segmentation searching For each registration For each ensemble detector 3D window 2D window {cn }n=1 N (object, confidence) Templating matching Fine Procrustes Semantic/contextual segmentation { f m }m=1 for each c M n analysis (label, confidence) for interpretation each fm Laser segmentation MLN and labeling Ground MRF Reference Inference and shapes decision outputs
  • 17. Laser-image registration Zhang and Pless’ calibration method (with an error of 6 mm in the calibration)
  • 18. Semantic Fusion Laser points Images Laser-image registration HLSM-FINT Coarse 3D sliding window Sensor Parts-based segmentation searching For each registration For each ensemble detector 3D window 2D window {cn }n=1 N (object, confidence) Templating matching Fine Procrustes Semantic/contextual segmentation { f m }m=1 for each c M n analysis (label, confidence) for interpretation each fm Laser segmentation MLN and labeling Ground MRF Reference Inference and shapes decision outputs
  • 20. Semantic fusion Wi MRF • MRFs given by FOL formulas • Weights given by the MRF training (gradient ascent method over the conditonal log-likelihood)
  • 22. Conclusions HFI has achieved better performance than its components, but failed to get the gist of the fusion HLSM-FINT has succeeded to capture the aimed synergism of the fusion, but has had difficulties on hard situations (e.g. occlusion). Parts-based occlusion has improved this issue. The introduction of the laser sensor has brought significant improvement The proposed fusion method offers two main advantages: Contextual and spatial relationship among the parts of the object, dropping the false alarm rate It is able to detect the object in spite of laser failing The whole system is not able to run on-the-fly, although there is no code optimization. Nevertheless, parallel hardware can provide interesting plataform to make the system faster. It will be subject of future research.
  • 23. Publications and awards Journals OLIVEIRA, L.; NUNES, U.; PEIXOTO, P.; SILVA, M. and MOITA, F. Semantic Fusion of Laser and Vision in Pedestrian Detection, Journal of Pattern Recognition, Elsevier, accepted for publication (ISI impact factor: 3.279). OLIVEIRA, L.; NUNES, U. and PEIXOTO, P. On Exploration of Classifier Ensemble Synergism in Pedestrian Detection, IEEE Transactions on Intelligent Transportation Systems, pp. 16-21, 2010 (ISI impact factor: 2.844). Awards 3rd place in Intel/GV Entrepreneurship and Venture Capital Competition (2008) 1st place in NiSIS Competition - Best accuracy model over Daimler Chrysler image dataset. Scheme of Primate's Visual Cortex Cells for Pedestrian Recognition (2007) 5 international conferences

Editor's Notes

  1. In the beginning of my phd, the aim was to conceive a pedestrian detection system using monocular vision. As the work was going on, we realized that using a unique sensor with a unique method would be cumbersome, if not impossible, mainly applied in ITS. Our goal has changed, then, to propose inovative synergistic methods using multiple sensors. Our work is intitled (read...), and was supported by the following organizations...
  2. An object detecion system is usually comprised of 4 modules. Each one of them forms a field of research, and can be subject of deep investigation within a thesis work. Therefore, we focused our attention on object detection itself.
  3. Our first proposed ensemble of classifiers.
  4. Our second proposed ensemble of classifiers.
  5. The rationale of the method at a glance is to explore the synergism of high performance detection system. Therefore, what we want is to find synergism between the representation of background and object.
  6. This is our parts-based HLSM-FINT. We use the more hinted parts, while avoiding representing the limbs, which are hard to detect at certain distances.