SlideShare a Scribd company logo
1 of 1
Download to read offline
Moving Cast Shadow Detection using Physics-based Features
                                                                                         Jia-Bin Huang and Chu-Song Chen
                                                                        Institute of Information Science, Academia Sinica, Taipei, Taiwan
                                                                                E-Mail: jbhuang@ieee.org, song@iis.sinica.edu.tw

 Goal                                                                         Physics-Based Shadow Model                                                                     Results
• Detecting   moving objects in a scene                                       • Bi-illuminant   Dichromatic Reflection Model                                                 • Qualitative   Results
                                                                                Decompose reflected light into four types: body and surface reflection for direct light
                                                                                sources and ambient illumination (Maxwell et al. CVPR 2008)
                                                                                              I (λ) = cb (λ)[mb ld (λ) + Mab (λ)] + cs (λ)[ms ld (λ) + Mas (λ)].
                                                                                Consider only matte surfaces, the camera sensor response
                                                                                                                          i i        i  i
                                                                                                             gi = αFi mb cb ld + Fi cb Mab , i ∈ {R , G , B }
                                                                                defines a line segment in the RGB color space: shadowed pixel to fully lit pixel.
                                                                              • Extracting    Useful Features
                                                                                Spectral ratio
       Input: Video sequence               Output: Label fields                               SD i         α         i
                                                                                                                  Mab
                                                                                  Si =       i − SD i
                                                                                                      =      +           i
                                                                                         BG             1 − α (1 − α)mb ld
                                                                                Subtract the bias term
 Challenges                                                                            γi = Si −
                                                                                                        α
                                                                                                         =
                                                                                                               Mabi

                                                                                                 1−α                  i
                                                                                                           (1 − α)mb ld
                                                                                Normalized spectral ratio
• Shadows   may be misclassified as foreground because:                                                 i
                                                                                                     Mab        1
• 1) they are typically significantly different from background                             γi =
                                                                                           ˆ                 i |γ|
                                                                                                                    ,
                                                                                                (1 − α)mb ld
• 2) they have the same motion as foreground objects                                                                                                                                 (a)              (b)               (c)                (d)               (e)
                                                                                                                R            G            B
                                                                                                 1           Mab 2        Mab 2        Mab 2
• 3) they are usually attached to foreground objects                            where |γ| =   (1−α)mb
                                                                                                            ( lR )   +   ( lG )   +   ( lB )                                 Figure: (a) Input frame. (b) Background posterior probability P (BG |xp ). (c) Confidence
                                                                                                               d            d            d
                                                                                                                                                                             map by the global shadow model. (d) The shadow posterior probability P (SD |xp ). (e)
                                                                                                                                                                             The foreground posterior probability P (FG |xp ).

 Related Works                                                                                                                                                              • Quantitative   Results
                                                                              Learning Cast Shadows                                                                           Evaluation metrics: shadow detection and discrimination rate:
                                                                                                                                                                                                                TPS              TPF
• Shadow    detection with static parameter settings                                                                                                                                                    η=              ;ξ =
                                                                                                                                                                                                            TPS + FNS        TPF + FNF
  Require significant parameter tuning                                         • Weak   Shadow Detector
  Cannot adapt to environment changes                                                                                                                                                       Table: Quantitative results on surveillance sequences
• Shadow    detection using statistical learning methods                                                                                                                                       Sequence Highway I Highway II             Hallway
  [1] Shadow flow (Porikli et al., ICCV 2005)                                    Evaluate every moving pixels detected by the                                                                    Methods η % ξ% η % ξ% η % ξ%
  [2] Gaussian mixture shadow model (Martel-Brisson et al., CVPR 2005)          background model to filter out impossible ones                                                                  Proposed 70.83 82.37 76.50 74.51 82.05 90.47
  [3] Local and global features (Liu et al., CVPR 2007)                                                                                                                                        Kernel [4] 70.50 84.40 68.40 71.20 72.40 86.70
  [4] Physical model of cast shadows (Martel-Brisson et al., CVPR 2008)                                                                                                                          LGf [3] 72.10 79.70 -             -     -     -
• Major   Drawback:                                                                                                                                                                            GMSM [2] 63.30 71.30 58.51 44.40 60.50 87.00
  All are pixel-based methods: need numerous foreground activities to learn   • Global   and Local Shadow Model                                                                                                            Handling Scene with Few
                                                                                                                                                                              Effect of Confidence-Rated Gaussian
  the parameters.                                                               Learn a global shadow model for the whole scene using Gaussian Mixture Model                  Mixture Learning                             Foreground Activities
                                                                                (GMM) with the Expectation-Maximization (EM) algorithm:
                                                                                                                                       K
                                                                                                                     p (ˆ|µ, Σ) =
                                                                                                                        r                    πk Gk (ˆ, µk , Σk )
                                                                                                                                                    r
 Contributions                                                                                                                        k =1
                                                                                Cast shadows are expected to form a compact cluster (with large mixing weight and
                                                                                small variance)
•A  global shadow model learned from physics-based features.                    Build pixel-based GMM to learn local features (gradient density distortion)
• Does not require numerous foreground activities or high frame               • Confidence-Rated             Gaussian Mixture Learning
  rates to learn the shadow model parameters                                    Adaptive learning rate for mixing weights and Gaussian parameters:                                  (a)          (b)          (c)
• Can be used for fast learning of local features in pixel-based                                                                                                              Figure: (a)Background image. (b) The            Figure: Quantitative results of the
                                                                                                                               1 − ρdefault
  models                                                                                                        ρπ = C (ˆ) ∗ (
                                                                                                                        γ                   ) + ρdefault                      mean map with confidence-rated learning.         Intelligent Room sequence under
                                                                                                                                  K
                                                                                                                                  j =1 cj                                     (c) The mean map w/o using                      different frame rates settings (10, 5,
                                                                                                                               1 − ρdefault                                   confidence-rated learning.                       3.33, and 2.5 frame/sec)
                                                                                                                ρG = C (ˆ) ∗ (
                                                                                                                        γ                   ) + ρdefault
                                                                                                                                   ck

Jia-Bin Huang and Chu-Song Chen (IIS, Academia Sinica, Taiwan)                                                                                          IEEE Conference on Computer Vision and Pattern Recognition 2009 (CVPR 2009)

More Related Content

What's hot

2_IGARSS2011_BPT_v2.pdf
2_IGARSS2011_BPT_v2.pdf2_IGARSS2011_BPT_v2.pdf
2_IGARSS2011_BPT_v2.pdfgrssieee
 
Realism of image composits
Realism of image compositsRealism of image composits
Realism of image compositsAliya Ibragimova
 
Tuto part2
Tuto part2Tuto part2
Tuto part2Bo Li
 
Lecture6
Lecture6Lecture6
Lecture6voracle
 
Fcv poster bao
Fcv poster baoFcv poster bao
Fcv poster baozukun
 
Reduced rcs uhf rfid gosper and hilbert tag antennas
Reduced rcs uhf rfid gosper and hilbert tag antennasReduced rcs uhf rfid gosper and hilbert tag antennas
Reduced rcs uhf rfid gosper and hilbert tag antennasiaemedu
 
Reduced rcs uhf rfid gosper and hilbert tag antennas no restriction
Reduced rcs uhf rfid gosper and hilbert tag antennas no restrictionReduced rcs uhf rfid gosper and hilbert tag antennas no restriction
Reduced rcs uhf rfid gosper and hilbert tag antennas no restrictioniaemedu
 
Video Copy Detection Using Inclined Video Tomography and Bag-of-Visual-Words
Video Copy Detection Using Inclined Video Tomography and Bag-of-Visual-WordsVideo Copy Detection Using Inclined Video Tomography and Bag-of-Visual-Words
Video Copy Detection Using Inclined Video Tomography and Bag-of-Visual-WordsWesley De Neve
 
L. Perivolaropoulos, Topological Quintessence
L. Perivolaropoulos, Topological QuintessenceL. Perivolaropoulos, Topological Quintessence
L. Perivolaropoulos, Topological QuintessenceSEENET-MTP
 
Pc12 sol c06_6-3
Pc12 sol c06_6-3Pc12 sol c06_6-3
Pc12 sol c06_6-3Garden City
 
Electrovariable Devices
Electrovariable DevicesElectrovariable Devices
Electrovariable Devicesjackpaget
 
CVPR2010: Advanced ITinCVPR in a Nutshell: part 7: Future Trend
CVPR2010: Advanced ITinCVPR in a Nutshell: part 7: Future TrendCVPR2010: Advanced ITinCVPR in a Nutshell: part 7: Future Trend
CVPR2010: Advanced ITinCVPR in a Nutshell: part 7: Future Trendzukun
 
Notes and-formulae-mathematics
Notes and-formulae-mathematicsNotes and-formulae-mathematics
Notes and-formulae-mathematicsRagulan Dev
 
Lect5 filling color_models
Lect5 filling color_modelsLect5 filling color_models
Lect5 filling color_modelsBCET
 

What's hot (15)

2_IGARSS2011_BPT_v2.pdf
2_IGARSS2011_BPT_v2.pdf2_IGARSS2011_BPT_v2.pdf
2_IGARSS2011_BPT_v2.pdf
 
Realism of image composits
Realism of image compositsRealism of image composits
Realism of image composits
 
Tuto part2
Tuto part2Tuto part2
Tuto part2
 
Lecture6
Lecture6Lecture6
Lecture6
 
OOPSLA04.ppt
OOPSLA04.pptOOPSLA04.ppt
OOPSLA04.ppt
 
Fcv poster bao
Fcv poster baoFcv poster bao
Fcv poster bao
 
Reduced rcs uhf rfid gosper and hilbert tag antennas
Reduced rcs uhf rfid gosper and hilbert tag antennasReduced rcs uhf rfid gosper and hilbert tag antennas
Reduced rcs uhf rfid gosper and hilbert tag antennas
 
Reduced rcs uhf rfid gosper and hilbert tag antennas no restriction
Reduced rcs uhf rfid gosper and hilbert tag antennas no restrictionReduced rcs uhf rfid gosper and hilbert tag antennas no restriction
Reduced rcs uhf rfid gosper and hilbert tag antennas no restriction
 
Video Copy Detection Using Inclined Video Tomography and Bag-of-Visual-Words
Video Copy Detection Using Inclined Video Tomography and Bag-of-Visual-WordsVideo Copy Detection Using Inclined Video Tomography and Bag-of-Visual-Words
Video Copy Detection Using Inclined Video Tomography and Bag-of-Visual-Words
 
L. Perivolaropoulos, Topological Quintessence
L. Perivolaropoulos, Topological QuintessenceL. Perivolaropoulos, Topological Quintessence
L. Perivolaropoulos, Topological Quintessence
 
Pc12 sol c06_6-3
Pc12 sol c06_6-3Pc12 sol c06_6-3
Pc12 sol c06_6-3
 
Electrovariable Devices
Electrovariable DevicesElectrovariable Devices
Electrovariable Devices
 
CVPR2010: Advanced ITinCVPR in a Nutshell: part 7: Future Trend
CVPR2010: Advanced ITinCVPR in a Nutshell: part 7: Future TrendCVPR2010: Advanced ITinCVPR in a Nutshell: part 7: Future Trend
CVPR2010: Advanced ITinCVPR in a Nutshell: part 7: Future Trend
 
Notes and-formulae-mathematics
Notes and-formulae-mathematicsNotes and-formulae-mathematics
Notes and-formulae-mathematics
 
Lect5 filling color_models
Lect5 filling color_modelsLect5 filling color_models
Lect5 filling color_models
 

Viewers also liked

A Physical Approach to Moving Cast Shadow Detection (ICASSP 2009)
A Physical Approach to Moving Cast Shadow Detection (ICASSP 2009)A Physical Approach to Moving Cast Shadow Detection (ICASSP 2009)
A Physical Approach to Moving Cast Shadow Detection (ICASSP 2009)Jia-Bin Huang
 
Learning Moving Cast Shadows for Foreground Detection (VS 2008)
Learning Moving Cast Shadows for Foreground Detection (VS 2008)Learning Moving Cast Shadows for Foreground Detection (VS 2008)
Learning Moving Cast Shadows for Foreground Detection (VS 2008)Jia-Bin Huang
 
Transformation Guided Image Completion ICCP 2013
Transformation Guided Image Completion ICCP 2013Transformation Guided Image Completion ICCP 2013
Transformation Guided Image Completion ICCP 2013Jia-Bin Huang
 
What Makes a Creative Photograph?
What Makes a Creative Photograph?What Makes a Creative Photograph?
What Makes a Creative Photograph?Jia-Bin Huang
 
Three Reasons to Join FVE at uiuc
Three Reasons to Join FVE at uiucThree Reasons to Join FVE at uiuc
Three Reasons to Join FVE at uiucJia-Bin Huang
 
Saliency Detection via Divergence Analysis: A Unified Perspective ICPR 2012
Saliency Detection via Divergence Analysis: A Unified Perspective ICPR 2012Saliency Detection via Divergence Analysis: A Unified Perspective ICPR 2012
Saliency Detection via Divergence Analysis: A Unified Perspective ICPR 2012Jia-Bin Huang
 
Enhancing Color Representation for the Color Vision Impaired (CVAVI 2008)
Enhancing Color Representation for the Color Vision Impaired (CVAVI 2008)Enhancing Color Representation for the Color Vision Impaired (CVAVI 2008)
Enhancing Color Representation for the Color Vision Impaired (CVAVI 2008)Jia-Bin Huang
 
UIUC CS 498 - Computational Photography - Final project presentation
UIUC CS 498 - Computational Photography - Final project presentation UIUC CS 498 - Computational Photography - Final project presentation
UIUC CS 498 - Computational Photography - Final project presentation Jia-Bin Huang
 
Estimating Human Pose from Occluded Images (ACCV 2009)
Estimating Human Pose from Occluded Images (ACCV 2009)Estimating Human Pose from Occluded Images (ACCV 2009)
Estimating Human Pose from Occluded Images (ACCV 2009)Jia-Bin Huang
 
How to come up with new research ideas
How to come up with new research ideasHow to come up with new research ideas
How to come up with new research ideasJia-Bin Huang
 
How to Read Academic Papers
How to Read Academic PapersHow to Read Academic Papers
How to Read Academic PapersJia-Bin Huang
 
Toward Accurate and Robust Cross-Ratio based Gaze Trackers Through Learning F...
Toward Accurate and Robust Cross-Ratio based Gaze Trackers Through Learning F...Toward Accurate and Robust Cross-Ratio based Gaze Trackers Through Learning F...
Toward Accurate and Robust Cross-Ratio based Gaze Trackers Through Learning F...Jia-Bin Huang
 
Lecture 21 - Image Categorization - Computer Vision Spring2015
Lecture 21 - Image Categorization -  Computer Vision Spring2015Lecture 21 - Image Categorization -  Computer Vision Spring2015
Lecture 21 - Image Categorization - Computer Vision Spring2015Jia-Bin Huang
 
Image Completion using Planar Structure Guidance (SIGGRAPH 2014)
Image Completion using Planar Structure Guidance (SIGGRAPH 2014)Image Completion using Planar Structure Guidance (SIGGRAPH 2014)
Image Completion using Planar Structure Guidance (SIGGRAPH 2014)Jia-Bin Huang
 
Single Image Super-Resolution from Transformed Self-Exemplars (CVPR 2015)
Single Image Super-Resolution from Transformed Self-Exemplars (CVPR 2015)Single Image Super-Resolution from Transformed Self-Exemplars (CVPR 2015)
Single Image Super-Resolution from Transformed Self-Exemplars (CVPR 2015)Jia-Bin Huang
 
Applying for Graduate School in S.T.E.M.
Applying for Graduate School in S.T.E.M.Applying for Graduate School in S.T.E.M.
Applying for Graduate School in S.T.E.M.Jia-Bin Huang
 
Computer Vision Crash Course
Computer Vision Crash CourseComputer Vision Crash Course
Computer Vision Crash CourseJia-Bin Huang
 
Writing Fast MATLAB Code
Writing Fast MATLAB CodeWriting Fast MATLAB Code
Writing Fast MATLAB CodeJia-Bin Huang
 
Linear Algebra and Matlab tutorial
Linear Algebra and Matlab tutorialLinear Algebra and Matlab tutorial
Linear Algebra and Matlab tutorialJia-Bin Huang
 
Lecture 29 Convolutional Neural Networks - Computer Vision Spring2015
Lecture 29 Convolutional Neural Networks -  Computer Vision Spring2015Lecture 29 Convolutional Neural Networks -  Computer Vision Spring2015
Lecture 29 Convolutional Neural Networks - Computer Vision Spring2015Jia-Bin Huang
 

Viewers also liked (20)

A Physical Approach to Moving Cast Shadow Detection (ICASSP 2009)
A Physical Approach to Moving Cast Shadow Detection (ICASSP 2009)A Physical Approach to Moving Cast Shadow Detection (ICASSP 2009)
A Physical Approach to Moving Cast Shadow Detection (ICASSP 2009)
 
Learning Moving Cast Shadows for Foreground Detection (VS 2008)
Learning Moving Cast Shadows for Foreground Detection (VS 2008)Learning Moving Cast Shadows for Foreground Detection (VS 2008)
Learning Moving Cast Shadows for Foreground Detection (VS 2008)
 
Transformation Guided Image Completion ICCP 2013
Transformation Guided Image Completion ICCP 2013Transformation Guided Image Completion ICCP 2013
Transformation Guided Image Completion ICCP 2013
 
What Makes a Creative Photograph?
What Makes a Creative Photograph?What Makes a Creative Photograph?
What Makes a Creative Photograph?
 
Three Reasons to Join FVE at uiuc
Three Reasons to Join FVE at uiucThree Reasons to Join FVE at uiuc
Three Reasons to Join FVE at uiuc
 
Saliency Detection via Divergence Analysis: A Unified Perspective ICPR 2012
Saliency Detection via Divergence Analysis: A Unified Perspective ICPR 2012Saliency Detection via Divergence Analysis: A Unified Perspective ICPR 2012
Saliency Detection via Divergence Analysis: A Unified Perspective ICPR 2012
 
Enhancing Color Representation for the Color Vision Impaired (CVAVI 2008)
Enhancing Color Representation for the Color Vision Impaired (CVAVI 2008)Enhancing Color Representation for the Color Vision Impaired (CVAVI 2008)
Enhancing Color Representation for the Color Vision Impaired (CVAVI 2008)
 
UIUC CS 498 - Computational Photography - Final project presentation
UIUC CS 498 - Computational Photography - Final project presentation UIUC CS 498 - Computational Photography - Final project presentation
UIUC CS 498 - Computational Photography - Final project presentation
 
Estimating Human Pose from Occluded Images (ACCV 2009)
Estimating Human Pose from Occluded Images (ACCV 2009)Estimating Human Pose from Occluded Images (ACCV 2009)
Estimating Human Pose from Occluded Images (ACCV 2009)
 
How to come up with new research ideas
How to come up with new research ideasHow to come up with new research ideas
How to come up with new research ideas
 
How to Read Academic Papers
How to Read Academic PapersHow to Read Academic Papers
How to Read Academic Papers
 
Toward Accurate and Robust Cross-Ratio based Gaze Trackers Through Learning F...
Toward Accurate and Robust Cross-Ratio based Gaze Trackers Through Learning F...Toward Accurate and Robust Cross-Ratio based Gaze Trackers Through Learning F...
Toward Accurate and Robust Cross-Ratio based Gaze Trackers Through Learning F...
 
Lecture 21 - Image Categorization - Computer Vision Spring2015
Lecture 21 - Image Categorization -  Computer Vision Spring2015Lecture 21 - Image Categorization -  Computer Vision Spring2015
Lecture 21 - Image Categorization - Computer Vision Spring2015
 
Image Completion using Planar Structure Guidance (SIGGRAPH 2014)
Image Completion using Planar Structure Guidance (SIGGRAPH 2014)Image Completion using Planar Structure Guidance (SIGGRAPH 2014)
Image Completion using Planar Structure Guidance (SIGGRAPH 2014)
 
Single Image Super-Resolution from Transformed Self-Exemplars (CVPR 2015)
Single Image Super-Resolution from Transformed Self-Exemplars (CVPR 2015)Single Image Super-Resolution from Transformed Self-Exemplars (CVPR 2015)
Single Image Super-Resolution from Transformed Self-Exemplars (CVPR 2015)
 
Applying for Graduate School in S.T.E.M.
Applying for Graduate School in S.T.E.M.Applying for Graduate School in S.T.E.M.
Applying for Graduate School in S.T.E.M.
 
Computer Vision Crash Course
Computer Vision Crash CourseComputer Vision Crash Course
Computer Vision Crash Course
 
Writing Fast MATLAB Code
Writing Fast MATLAB CodeWriting Fast MATLAB Code
Writing Fast MATLAB Code
 
Linear Algebra and Matlab tutorial
Linear Algebra and Matlab tutorialLinear Algebra and Matlab tutorial
Linear Algebra and Matlab tutorial
 
Lecture 29 Convolutional Neural Networks - Computer Vision Spring2015
Lecture 29 Convolutional Neural Networks -  Computer Vision Spring2015Lecture 29 Convolutional Neural Networks -  Computer Vision Spring2015
Lecture 29 Convolutional Neural Networks - Computer Vision Spring2015
 

More from Jia-Bin Huang

How to write a clear paper
How to write a clear paperHow to write a clear paper
How to write a clear paperJia-Bin Huang
 
Research 101 - Paper Writing with LaTeX
Research 101 - Paper Writing with LaTeXResearch 101 - Paper Writing with LaTeX
Research 101 - Paper Writing with LaTeXJia-Bin Huang
 
Real-time Face Detection and Recognition
Real-time Face Detection and RecognitionReal-time Face Detection and Recognition
Real-time Face Detection and RecognitionJia-Bin Huang
 
Jia-Bin Huang's Curriculum Vitae
Jia-Bin Huang's Curriculum VitaeJia-Bin Huang's Curriculum Vitae
Jia-Bin Huang's Curriculum VitaeJia-Bin Huang
 
Pose aware online visual tracking
Pose aware online visual trackingPose aware online visual tracking
Pose aware online visual trackingJia-Bin Huang
 
Face Expression Enhancement
Face Expression EnhancementFace Expression Enhancement
Face Expression EnhancementJia-Bin Huang
 
Image Smoothing for Structure Extraction
Image Smoothing for Structure ExtractionImage Smoothing for Structure Extraction
Image Smoothing for Structure ExtractionJia-Bin Huang
 
Static and Dynamic Hand Gesture Recognition
Static and Dynamic Hand Gesture RecognitionStatic and Dynamic Hand Gesture Recognition
Static and Dynamic Hand Gesture RecognitionJia-Bin Huang
 
Real-Time Face Detection, Tracking, and Attributes Recognition
Real-Time Face Detection, Tracking, and Attributes RecognitionReal-Time Face Detection, Tracking, and Attributes Recognition
Real-Time Face Detection, Tracking, and Attributes RecognitionJia-Bin Huang
 
Estimating Human Pose from Occluded Images (ACCV 2009)
Estimating Human Pose from Occluded Images (ACCV 2009)Estimating Human Pose from Occluded Images (ACCV 2009)
Estimating Human Pose from Occluded Images (ACCV 2009)Jia-Bin Huang
 
Information Preserving Color Transformation for Protanopia and Deuteranopia (...
Information Preserving Color Transformation for Protanopia and Deuteranopia (...Information Preserving Color Transformation for Protanopia and Deuteranopia (...
Information Preserving Color Transformation for Protanopia and Deuteranopia (...Jia-Bin Huang
 
Enhancing Color Representation for the Color Vision Impaired (CVAVI 2008)
Enhancing Color Representation for the Color Vision Impaired (CVAVI 2008)Enhancing Color Representation for the Color Vision Impaired (CVAVI 2008)
Enhancing Color Representation for the Color Vision Impaired (CVAVI 2008)Jia-Bin Huang
 
Learning Moving Cast Shadows for Foreground Detection (VS 2008)
Learning Moving Cast Shadows for Foreground Detection (VS 2008)Learning Moving Cast Shadows for Foreground Detection (VS 2008)
Learning Moving Cast Shadows for Foreground Detection (VS 2008)Jia-Bin Huang
 

More from Jia-Bin Huang (13)

How to write a clear paper
How to write a clear paperHow to write a clear paper
How to write a clear paper
 
Research 101 - Paper Writing with LaTeX
Research 101 - Paper Writing with LaTeXResearch 101 - Paper Writing with LaTeX
Research 101 - Paper Writing with LaTeX
 
Real-time Face Detection and Recognition
Real-time Face Detection and RecognitionReal-time Face Detection and Recognition
Real-time Face Detection and Recognition
 
Jia-Bin Huang's Curriculum Vitae
Jia-Bin Huang's Curriculum VitaeJia-Bin Huang's Curriculum Vitae
Jia-Bin Huang's Curriculum Vitae
 
Pose aware online visual tracking
Pose aware online visual trackingPose aware online visual tracking
Pose aware online visual tracking
 
Face Expression Enhancement
Face Expression EnhancementFace Expression Enhancement
Face Expression Enhancement
 
Image Smoothing for Structure Extraction
Image Smoothing for Structure ExtractionImage Smoothing for Structure Extraction
Image Smoothing for Structure Extraction
 
Static and Dynamic Hand Gesture Recognition
Static and Dynamic Hand Gesture RecognitionStatic and Dynamic Hand Gesture Recognition
Static and Dynamic Hand Gesture Recognition
 
Real-Time Face Detection, Tracking, and Attributes Recognition
Real-Time Face Detection, Tracking, and Attributes RecognitionReal-Time Face Detection, Tracking, and Attributes Recognition
Real-Time Face Detection, Tracking, and Attributes Recognition
 
Estimating Human Pose from Occluded Images (ACCV 2009)
Estimating Human Pose from Occluded Images (ACCV 2009)Estimating Human Pose from Occluded Images (ACCV 2009)
Estimating Human Pose from Occluded Images (ACCV 2009)
 
Information Preserving Color Transformation for Protanopia and Deuteranopia (...
Information Preserving Color Transformation for Protanopia and Deuteranopia (...Information Preserving Color Transformation for Protanopia and Deuteranopia (...
Information Preserving Color Transformation for Protanopia and Deuteranopia (...
 
Enhancing Color Representation for the Color Vision Impaired (CVAVI 2008)
Enhancing Color Representation for the Color Vision Impaired (CVAVI 2008)Enhancing Color Representation for the Color Vision Impaired (CVAVI 2008)
Enhancing Color Representation for the Color Vision Impaired (CVAVI 2008)
 
Learning Moving Cast Shadows for Foreground Detection (VS 2008)
Learning Moving Cast Shadows for Foreground Detection (VS 2008)Learning Moving Cast Shadows for Foreground Detection (VS 2008)
Learning Moving Cast Shadows for Foreground Detection (VS 2008)
 

Moving Shadow Detection using Physics-based Features

  • 1. Moving Cast Shadow Detection using Physics-based Features Jia-Bin Huang and Chu-Song Chen Institute of Information Science, Academia Sinica, Taipei, Taiwan E-Mail: jbhuang@ieee.org, song@iis.sinica.edu.tw Goal Physics-Based Shadow Model Results • Detecting moving objects in a scene • Bi-illuminant Dichromatic Reflection Model • Qualitative Results Decompose reflected light into four types: body and surface reflection for direct light sources and ambient illumination (Maxwell et al. CVPR 2008) I (λ) = cb (λ)[mb ld (λ) + Mab (λ)] + cs (λ)[ms ld (λ) + Mas (λ)]. Consider only matte surfaces, the camera sensor response i i i i gi = αFi mb cb ld + Fi cb Mab , i ∈ {R , G , B } defines a line segment in the RGB color space: shadowed pixel to fully lit pixel. • Extracting Useful Features Spectral ratio Input: Video sequence Output: Label fields SD i α i Mab Si = i − SD i = + i BG 1 − α (1 − α)mb ld Subtract the bias term Challenges γi = Si − α = Mabi 1−α i (1 − α)mb ld Normalized spectral ratio • Shadows may be misclassified as foreground because: i Mab 1 • 1) they are typically significantly different from background γi = ˆ i |γ| , (1 − α)mb ld • 2) they have the same motion as foreground objects (a) (b) (c) (d) (e) R G B 1 Mab 2 Mab 2 Mab 2 • 3) they are usually attached to foreground objects where |γ| = (1−α)mb ( lR ) + ( lG ) + ( lB ) Figure: (a) Input frame. (b) Background posterior probability P (BG |xp ). (c) Confidence d d d map by the global shadow model. (d) The shadow posterior probability P (SD |xp ). (e) The foreground posterior probability P (FG |xp ). Related Works • Quantitative Results Learning Cast Shadows Evaluation metrics: shadow detection and discrimination rate: TPS TPF • Shadow detection with static parameter settings η= ;ξ = TPS + FNS TPF + FNF Require significant parameter tuning • Weak Shadow Detector Cannot adapt to environment changes Table: Quantitative results on surveillance sequences • Shadow detection using statistical learning methods Sequence Highway I Highway II Hallway [1] Shadow flow (Porikli et al., ICCV 2005) Evaluate every moving pixels detected by the Methods η % ξ% η % ξ% η % ξ% [2] Gaussian mixture shadow model (Martel-Brisson et al., CVPR 2005) background model to filter out impossible ones Proposed 70.83 82.37 76.50 74.51 82.05 90.47 [3] Local and global features (Liu et al., CVPR 2007) Kernel [4] 70.50 84.40 68.40 71.20 72.40 86.70 [4] Physical model of cast shadows (Martel-Brisson et al., CVPR 2008) LGf [3] 72.10 79.70 - - - - • Major Drawback: GMSM [2] 63.30 71.30 58.51 44.40 60.50 87.00 All are pixel-based methods: need numerous foreground activities to learn • Global and Local Shadow Model Handling Scene with Few Effect of Confidence-Rated Gaussian the parameters. Learn a global shadow model for the whole scene using Gaussian Mixture Model Mixture Learning Foreground Activities (GMM) with the Expectation-Maximization (EM) algorithm: K p (ˆ|µ, Σ) = r πk Gk (ˆ, µk , Σk ) r Contributions k =1 Cast shadows are expected to form a compact cluster (with large mixing weight and small variance) •A global shadow model learned from physics-based features. Build pixel-based GMM to learn local features (gradient density distortion) • Does not require numerous foreground activities or high frame • Confidence-Rated Gaussian Mixture Learning rates to learn the shadow model parameters Adaptive learning rate for mixing weights and Gaussian parameters: (a) (b) (c) • Can be used for fast learning of local features in pixel-based Figure: (a)Background image. (b) The Figure: Quantitative results of the 1 − ρdefault models ρπ = C (ˆ) ∗ ( γ ) + ρdefault mean map with confidence-rated learning. Intelligent Room sequence under K j =1 cj (c) The mean map w/o using different frame rates settings (10, 5, 1 − ρdefault confidence-rated learning. 3.33, and 2.5 frame/sec) ρG = C (ˆ) ∗ ( γ ) + ρdefault ck Jia-Bin Huang and Chu-Song Chen (IIS, Academia Sinica, Taiwan) IEEE Conference on Computer Vision and Pattern Recognition 2009 (CVPR 2009)