SlideShare a Scribd company logo
1 of 43
Download to read offline
Sparse Coding and Dictionary Learning
                              for Image Analysis
                       Part II: Dictionary Learning for signal reconstruction


           Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro


                        ICCV’09 tutorial, Kyoto, 28th September 2009




Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro   Dictionary Learning for signal reconstruction   1/43
What this part is about
           The learning of compact representations of images
           adapted to restoration tasks.
           A fast online algorithm for learning dictionaries and
           factorizing matrices in general.
           Various formulations for image and video processing.




Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro   Dictionary Learning for signal reconstruction   2/43
The Image Denoising Problem




                               y                  =            xorig                  + w
                 measurements                            original image                        noise


Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro   Dictionary Learning for signal reconstruction   3/43
Sparse representations for image restoration

                                             y          =          xorig        + w
                                     measurements              original image       noise


   Energy minimization problem - MAP estimation
                                 E (x) =               ||y − x||2
                                                                2                + Pr (x)
                                              relation to measurements                 prior


   Some classical priors
           Smoothness λ||Lx||2
                             2
           Total variation λ|| x||2
                                  1
           Wavelet sparsity λ||Wx||1
           ...

Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro       Dictionary Learning for signal reconstruction   4/43
Sparse representations for image restoration
   Sparsity and redundancy
                 Pr (x) = λ||α||0 for x ≈ Dα
                                               
                                      α[1]
                                          α[2]
            x =  d1 d2 · · · dp   .  . 
                                           . 
                                            α[p]
                      x∈Rm                             D∈Rm×p
                                                                                   α∈Rp ,sparse




Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro   Dictionary Learning for signal reconstruction   5/43
Sparse representations for image restoration
   Designed dictionaries
   [Haar, 1910], [Zweig, Morlet, Grossman ∼70s], [Meyer, Mallat,
   Daubechies, Coifman, Donoho, Candes ∼80s-today]. . . (see [Mallat,
   1999])
   Wavelets, Curvelets, Wedgelets, Bandlets, . . . lets

   Learned dictionaries of patches
   [Olshausen and Field, 1997], [Engan et al., 1999], [Lewicki and
   Sejnowski, 2000], [Aharon et al., 2006] , [Roth and Black, 2005], [Lee
   et al., 2007]
                                1
                       min        ||xi − Dαi ||2 + λψ(αi )
                                               2
                      αi ,D∈C   2
                                               i
                                                                               sparsity
                                                      reconstruction
           ψ(α) = ||α||0 (“               0   pseudo-norm”)
           ψ(α) = ||α||1 (            1   norm)

Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro   Dictionary Learning for signal reconstruction   6/43
Sparse representations for image restoration
   Solving the denoising problem
   [Elad and Aharon, 2006]
       Extract all overlapping 8 × 8 patches yi .
       Solve a matrix factorization problem:
                                             n
                                                   1
                               min                   ||yi − Dαi ||2 + λψ(αi ),
                                                                  2
                             αi ,D∈C               2
                                           i=1                                          sparsity
                                                        reconstruction

           with n > 100, 000
           Average the reconstruction of each patch.


Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro   Dictionary Learning for signal reconstruction   7/43
Sparse representations for image restoration
K-SVD: [Elad and Aharon, 2006]




   Figure:     Dictionary trained on a noisy version of the image
   boat.

Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro   Dictionary Learning for signal reconstruction   8/43
Sparse representations for image restoration
   Inpainting, Demosaicking
                                             1
                             min               ||β ⊗ (yi − Dαi )||2 + λi ψ(αi )
                           D∈C,α             2 i                  2
                                        i

   RAW Image Processing (see our poster)

                                               White
                                              balance.
                                                                                   Denoising
                                               Black
                                            substraction.
                                            Conversion
                                             to sRGB.
                                                                               Demosaicking
                                              Gamma
                                            correction.

Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro   Dictionary Learning for signal reconstruction   9/43
Sparse representations for image restoration
[Mairal, Bach, Ponce, Sapiro, and Zisserman, 2009c]




Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro   Dictionary Learning for signal reconstruction   10/43
Sparse representations for image restoration
[Mairal, Sapiro, and Elad, 2008b]




Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro   Dictionary Learning for signal reconstruction   11/43
Sparse representations for image restoration
Inpainting, [Mairal, Elad, and Sapiro, 2008a]




Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro   Dictionary Learning for signal reconstruction   12/43
Sparse representations for image restoration
Inpainting, [Mairal, Elad, and Sapiro, 2008a]




Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro   Dictionary Learning for signal reconstruction   13/43
Sparse representations for video restoration
   Key ideas for video processing
   [Protter and Elad, 2009]
       Using a 3D dictionary.
       Processing of many frames at the same time.
       Dictionary propagation.




Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro   Dictionary Learning for signal reconstruction   14/43
Sparse representations for image restoration
Inpainting, [Mairal, Sapiro, and Elad, 2008b]




                                      Figure:     Inpainting results.




Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro   Dictionary Learning for signal reconstruction   15/43
Sparse representations for image restoration
Inpainting, [Mairal, Sapiro, and Elad, 2008b]




                                      Figure:     Inpainting results.




Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro   Dictionary Learning for signal reconstruction   16/43
Sparse representations for image restoration
Inpainting, [Mairal, Sapiro, and Elad, 2008b]




                                      Figure:     Inpainting results.




Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro   Dictionary Learning for signal reconstruction   17/43
Sparse representations for image restoration
Inpainting, [Mairal, Sapiro, and Elad, 2008b]




                                      Figure:     Inpainting results.




Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro   Dictionary Learning for signal reconstruction   18/43
Sparse representations for image restoration
Inpainting, [Mairal, Sapiro, and Elad, 2008b]




                                      Figure:     Inpainting results.




Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro   Dictionary Learning for signal reconstruction   19/43
Sparse representations for image restoration
Color video denoising, [Mairal, Sapiro, and Elad, 2008b]




                              Figure:     Denoising results. σ = 25




Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro   Dictionary Learning for signal reconstruction   20/43
Sparse representations for image restoration
Color video denoising, [Mairal, Sapiro, and Elad, 2008b]




                              Figure:     Denoising results. σ = 25




Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro   Dictionary Learning for signal reconstruction   21/43
Sparse representations for image restoration
Color video denoising, [Mairal, Sapiro, and Elad, 2008b]




                              Figure:     Denoising results. σ = 25




Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro   Dictionary Learning for signal reconstruction   22/43
Sparse representations for image restoration
Color video denoising, [Mairal, Sapiro, and Elad, 2008b]




                              Figure:     Denoising results. σ = 25




Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro   Dictionary Learning for signal reconstruction   23/43
Sparse representations for image restoration
Color video denoising, [Mairal, Sapiro, and Elad, 2008b]




                              Figure:     Denoising results. σ = 25




Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro   Dictionary Learning for signal reconstruction   24/43
Optimization for Dictionary Learning

                                          n
                                                1
                            min                   ||xi − Dαi ||2 + λ||αi ||1
                                                               2
                         α∈R  p×n               2
                          D∈C           i=1

         C = {D ∈ Rm×p s.t. ∀j = 1, . . . , p, ||dj ||2 ≤ 1}.


           Classical optimization alternates between D and α.
           Good results, but very slow!




Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro   Dictionary Learning for signal reconstruction   25/43
Optimization for Dictionary Learning
[Mairal, Bach, Ponce, and Sapiro, 2009a]

   Classical formulation of dictionary learning
                                                                     n
                                                                1
                                     min fn (D) = min                     l(xi , D),
                                     D∈C                  D∈C   n
                                                                    i=1

   where
                                             1
                               l(x, D) = minp ||x − Dα||2 + λ||α||1 .
                                                        2
                                         α∈R 2

   Which formulation are we interested in?
                                                                                    n
                                                     1
                      min f (D) = Ex [l(x, D)] ≈ lim                                    l(xi , D)
                      D∈C                       n→+∞ n
                                                                                  i=1

   [Bottou and Bousquet, 2008]: Online learning can
           handle potentially infinite or dynamic datasets,
           be dramatically faster than batch algorithms.
Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro    Dictionary Learning for signal reconstruction   26/43
Optimization for Dictionary Learning
   Require: D0 ∈ Rm×p (initial dictionary); λ ∈ R
    1: A0 = 0, B0 = 0.
    2: for t=1,. . . ,T do
    3: Draw xt
    4: Sparse Coding

                                             1
                                 αt ← arg min ||xt − Dt−1 α||2 + λ||α||1 ,
                                                             2
                                       α∈Rp 2

     5:     Aggregate sufficient statistics
            At ← At−1 + αt αT , Bt ← Bt−1 + xt αT
                              t                    t
     6:     Dictionary Update (block-coordinate descent)
                                                        t
                                                   1           1
                           Dt ← arg min                          ||xi − Dαi ||2 + λ||αi ||1 .
                                                                              2
                                         D∈C       t           2
                                                       i=1

     7:   end for
Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro    Dictionary Learning for signal reconstruction   27/43
Optimization for Dictionary Learning
   Which guarantees do we have?
   Under a few reasonable assumptions,
                                     ˆ
       we build a surrogate function ft of the expected cost f verifying
                                                   ˆ
                                               lim ft (Dt ) − f (Dt ) = 0,
                                             t→+∞

           Dt is asymptotically close to a stationary point.

   Extensions (all implemented in SPAMS)
           non-negative matrix decompositions.
           sparse PCA (sparse dictionaries).
           fused-lasso regularizations (piecewise constant dictionaries)




Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro   Dictionary Learning for signal reconstruction   28/43
Optimization for Dictionary Learning
Experimental results, batch vs online




                                             m = 8 × 8, p = 256
Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro   Dictionary Learning for signal reconstruction   29/43
Optimization for Dictionary Learning
Experimental results, batch vs online




                                       m = 12 × 12 × 3, p = 512
Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro   Dictionary Learning for signal reconstruction   30/43
Optimization for Dictionary Learning
Inpainting a 12-Mpixel photograph




Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro   Dictionary Learning for signal reconstruction   31/43
Optimization for Dictionary Learning
Inpainting a 12-Mpixel photograph




Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro   Dictionary Learning for signal reconstruction   32/43
Optimization for Dictionary Learning
Inpainting a 12-Mpixel photograph




Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro   Dictionary Learning for signal reconstruction   33/43
Optimization for Dictionary Learning
Inpainting a 12-Mpixel photograph




Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro   Dictionary Learning for signal reconstruction   34/43
Extension to NMF and sparse PCA
[Mairal, Bach, Ponce, and Sapiro, 2009b]

   NMF extension
                                    n
                                         1
                         min               ||xi − Dαi ||2 s.t. αi ≥ 0,
                                                        2                                  D ≥ 0.
                      α∈Rp×n             2
                       D∈C i=1


   SPCA extension
                                               n
                                                   1
                                   min               ||xi − Dαi ||2 + λ||α1 ||1
                                                                  2
                                 α∈Rp×n            2
                                   D∈C       i=1

                      C = {D ∈ Rm×p s.t. ∀j ||dj ||2 + γ||dj ||1 ≤ 1}.
                                                   2




Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro   Dictionary Learning for signal reconstruction   35/43
Extension to NMF and sparse PCA
Faces: Extended Yale Database B




                 (a) PCA                               (b) NNMF                                  (c) DL




Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro   Dictionary Learning for signal reconstruction   36/43
Extension to NMF and sparse PCA
Faces: Extended Yale Database B




         (d) SPCA, τ = 70%                       (e) SPCA, τ = 30%                     (f) SPCA, τ = 10%




Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro   Dictionary Learning for signal reconstruction   37/43
Extension to NMF and sparse PCA
Natural Patches




                 (a) PCA                               (b) NNMF                                  (c) DL




Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro   Dictionary Learning for signal reconstruction   38/43
Extension to NMF and sparse PCA
Natural Patches




         (d) SPCA, τ = 70%                       (e) SPCA, τ = 30%                     (f) SPCA, τ = 10%




Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro   Dictionary Learning for signal reconstruction   39/43
Summary of this part
           The dictionary learning framework leads to
           state-of-the-art results for many image . . .
           . . . and video processing tasks.
           Online learning techniques are well-suited for this
           problem and allows training sets with millions of
           patches.




Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro   Dictionary Learning for signal reconstruction   40/43
References I
   M. Aharon, M. Elad, and A. M. Bruckstein. The K-SVD: An algorithm for designing
     of overcomplete dictionaries for sparse representations. IEEE Transactions on
     Signal Processing, 54(11):4311–4322, November 2006.
   L. Bottou and O. Bousquet. The trade-offs of large scale learning. In J.C. Platt,
      D. Koller, Y. Singer, and S. Roweis, editors, Advances in Neural Information
      Processing Systems, volume 20, pages 161–168. MIT Press, Cambridge, MA, 2008.
   M. Elad and M. Aharon. Image denoising via sparse and redundant representations
     over learned dictionaries. IEEE Transactions on Image Processing, 54(12):
     3736–3745, December 2006.
   K. Engan, S. O. Aase, and J. H. Husoy. Frame based signal compression using
      method of optimal directions (MOD). In Proceedings of the 1999 IEEE
      International Symposium on Circuits Systems, volume 4, 1999.
   A. Haar. Zur theorie der orthogonalen funktionensysteme. Mathematische Annalen,
      69:331–371, 1910.
   H. Lee, A. Battle, R. Raina, and A. Y. Ng. Efficient sparse coding algorithms. In
      B. Sch¨lkopf, J. Platt, and T. Hoffman, editors, Advances in Neural Information
            o
      Processing Systems, volume 19, pages 801–808. MIT Press, Cambridge, MA, 2007.


Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro   Dictionary Learning for signal reconstruction   41/43
References II
   M. S. Lewicki and T. J. Sejnowski. Learning overcomplete representations. Neural
     Computation, 12(2):337–365, 2000.
   J. Mairal, M. Elad, and G. Sapiro. Sparse representation for color image restoration.
      IEEE Transactions on Image Processing, 17(1):53–69, January 2008a.
   J. Mairal, G. Sapiro, and M. Elad. Learning multiscale sparse representations for
      image and video restoration. SIAM Multiscale Modelling and Simulation, 7(1):
      214–241, April 2008b.
   J. Mairal, F. Bach, J. Ponce, and G. Sapiro. Online dictionary learning for sparse
      coding. In Proceedings of the International Conference on Machine Learning
      (ICML), 2009a.
   J. Mairal, F. Bach, J. Ponce, and G. Sapiro. Online learning for matrix factorization
      and sparse coding. ArXiv:0908.0050v1, 2009b. submitted.
   J. Mairal, F. Bach, J. Ponce, G. Sapiro, and A. Zisserman. Non-local sparse models
      for image restoration. In Proceedings of the IEEE International Conference on
      Computer Vision (ICCV), 2009c.
   S. Mallat. A Wavelet Tour of Signal Processing, Second Edition. Academic Press,
      New York, September 1999.



Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro   Dictionary Learning for signal reconstruction   42/43
References III
   B. A. Olshausen and D. J. Field. Sparse coding with an overcomplete basis set: A
      strategy employed by V1? Vision Research, 37:3311–3325, 1997.
   M. Protter and M. Elad. Image sequence denoising via sparse and redundant
     representations. IEEE Transactions on Image Processing, 18(1):27–36, 2009.
   S. Roth and M. J. Black. Fields of experts: A framework for learning image priors. In
      Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition
      (CVPR), 2005.




Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro   Dictionary Learning for signal reconstruction   43/43

More Related Content

What's hot

Sparsity and Compressed Sensing
Sparsity and Compressed SensingSparsity and Compressed Sensing
Sparsity and Compressed SensingGabriel Peyré
 
A Physical Approach to Moving Cast Shadow Detection (ICASSP 2009)
A Physical Approach to Moving Cast Shadow Detection (ICASSP 2009)A Physical Approach to Moving Cast Shadow Detection (ICASSP 2009)
A Physical Approach to Moving Cast Shadow Detection (ICASSP 2009)Jia-Bin Huang
 
Object Detection with Discrmininatively Trained Part based Models
Object Detection with Discrmininatively Trained Part based ModelsObject Detection with Discrmininatively Trained Part based Models
Object Detection with Discrmininatively Trained Part based Modelszukun
 
Signal Processing Course : Compressed Sensing
Signal Processing Course : Compressed SensingSignal Processing Course : Compressed Sensing
Signal Processing Course : Compressed SensingGabriel Peyré
 
Estimating Human Pose from Occluded Images (ACCV 2009)
Estimating Human Pose from Occluded Images (ACCV 2009)Estimating Human Pose from Occluded Images (ACCV 2009)
Estimating Human Pose from Occluded Images (ACCV 2009)Jia-Bin Huang
 
Object Recognition with Deformable Models
Object Recognition with Deformable ModelsObject Recognition with Deformable Models
Object Recognition with Deformable Modelszukun
 
Digital image processing - Image Enhancement (MATERIAL)
Digital image processing  - Image Enhancement (MATERIAL)Digital image processing  - Image Enhancement (MATERIAL)
Digital image processing - Image Enhancement (MATERIAL)Mathankumar S
 
Compressed Sensing and Tomography
Compressed Sensing and TomographyCompressed Sensing and Tomography
Compressed Sensing and TomographyGabriel Peyré
 
Principal component analysis and matrix factorizations for learning (part 2) ...
Principal component analysis and matrix factorizations for learning (part 2) ...Principal component analysis and matrix factorizations for learning (part 2) ...
Principal component analysis and matrix factorizations for learning (part 2) ...zukun
 
Image pre processing-restoration
Image pre processing-restorationImage pre processing-restoration
Image pre processing-restorationAshish Kumar
 
Introduction to digital image processing
Introduction to digital image processingIntroduction to digital image processing
Introduction to digital image processingHossain Md Shakhawat
 
Tchebichef image watermarking along the edge using YCoCg-R color space for co...
Tchebichef image watermarking along the edge using YCoCg-R color space for co...Tchebichef image watermarking along the edge using YCoCg-R color space for co...
Tchebichef image watermarking along the edge using YCoCg-R color space for co...IJECEIAES
 
Digital Image Processing_ ch2 enhancement spatial-domain
Digital Image Processing_ ch2 enhancement spatial-domainDigital Image Processing_ ch2 enhancement spatial-domain
Digital Image Processing_ ch2 enhancement spatial-domainMalik obeisat
 
RP BASED OPTIMIZED IMAGE COMPRESSING TECHNIQUE
RP BASED OPTIMIZED IMAGE COMPRESSING TECHNIQUERP BASED OPTIMIZED IMAGE COMPRESSING TECHNIQUE
RP BASED OPTIMIZED IMAGE COMPRESSING TECHNIQUEprj_publication
 

What's hot (18)

Sparsity and Compressed Sensing
Sparsity and Compressed SensingSparsity and Compressed Sensing
Sparsity and Compressed Sensing
 
A Physical Approach to Moving Cast Shadow Detection (ICASSP 2009)
A Physical Approach to Moving Cast Shadow Detection (ICASSP 2009)A Physical Approach to Moving Cast Shadow Detection (ICASSP 2009)
A Physical Approach to Moving Cast Shadow Detection (ICASSP 2009)
 
Object Detection with Discrmininatively Trained Part based Models
Object Detection with Discrmininatively Trained Part based ModelsObject Detection with Discrmininatively Trained Part based Models
Object Detection with Discrmininatively Trained Part based Models
 
Signal Processing Course : Compressed Sensing
Signal Processing Course : Compressed SensingSignal Processing Course : Compressed Sensing
Signal Processing Course : Compressed Sensing
 
It 603
It 603It 603
It 603
 
Estimating Human Pose from Occluded Images (ACCV 2009)
Estimating Human Pose from Occluded Images (ACCV 2009)Estimating Human Pose from Occluded Images (ACCV 2009)
Estimating Human Pose from Occluded Images (ACCV 2009)
 
Object Recognition with Deformable Models
Object Recognition with Deformable ModelsObject Recognition with Deformable Models
Object Recognition with Deformable Models
 
Digital image processing - Image Enhancement (MATERIAL)
Digital image processing  - Image Enhancement (MATERIAL)Digital image processing  - Image Enhancement (MATERIAL)
Digital image processing - Image Enhancement (MATERIAL)
 
ICPR 2012
ICPR 2012ICPR 2012
ICPR 2012
 
Compressed Sensing and Tomography
Compressed Sensing and TomographyCompressed Sensing and Tomography
Compressed Sensing and Tomography
 
Principal component analysis and matrix factorizations for learning (part 2) ...
Principal component analysis and matrix factorizations for learning (part 2) ...Principal component analysis and matrix factorizations for learning (part 2) ...
Principal component analysis and matrix factorizations for learning (part 2) ...
 
Image pre processing-restoration
Image pre processing-restorationImage pre processing-restoration
Image pre processing-restoration
 
In2414961500
In2414961500In2414961500
In2414961500
 
Introduction to digital image processing
Introduction to digital image processingIntroduction to digital image processing
Introduction to digital image processing
 
Tchebichef image watermarking along the edge using YCoCg-R color space for co...
Tchebichef image watermarking along the edge using YCoCg-R color space for co...Tchebichef image watermarking along the edge using YCoCg-R color space for co...
Tchebichef image watermarking along the edge using YCoCg-R color space for co...
 
Digital Image Processing_ ch2 enhancement spatial-domain
Digital Image Processing_ ch2 enhancement spatial-domainDigital Image Processing_ ch2 enhancement spatial-domain
Digital Image Processing_ ch2 enhancement spatial-domain
 
Gg2411291135
Gg2411291135Gg2411291135
Gg2411291135
 
RP BASED OPTIMIZED IMAGE COMPRESSING TECHNIQUE
RP BASED OPTIMIZED IMAGE COMPRESSING TECHNIQUERP BASED OPTIMIZED IMAGE COMPRESSING TECHNIQUE
RP BASED OPTIMIZED IMAGE COMPRESSING TECHNIQUE
 

Similar to Dictionary Learning for Image Reconstruction

Tuto cvpr part4
Tuto cvpr part4Tuto cvpr part4
Tuto cvpr part4Bo Li
 
01 graphical models
01 graphical models01 graphical models
01 graphical modelszukun
 
CVPR2010: Advanced ITinCVPR in a Nutshell: part 4: additional slides
CVPR2010: Advanced ITinCVPR in a Nutshell: part 4: additional slidesCVPR2010: Advanced ITinCVPR in a Nutshell: part 4: additional slides
CVPR2010: Advanced ITinCVPR in a Nutshell: part 4: additional slideszukun
 
Image Restitution Using Non-Locally Centralized Sparse Representation Model
Image Restitution Using Non-Locally Centralized Sparse Representation ModelImage Restitution Using Non-Locally Centralized Sparse Representation Model
Image Restitution Using Non-Locally Centralized Sparse Representation ModelIJERA Editor
 
Digital Image Processing - Image Restoration
Digital Image Processing - Image RestorationDigital Image Processing - Image Restoration
Digital Image Processing - Image RestorationMathankumar S
 
Gi2429352937
Gi2429352937Gi2429352937
Gi2429352937IJMER
 
Habilitation à diriger des recherches
Habilitation à diriger des recherchesHabilitation à diriger des recherches
Habilitation à diriger des recherchesPierre Pudlo
 
Robust Super-Resolution by minimizing a Gaussian-weighted L2 error norm
Robust Super-Resolution by minimizing a Gaussian-weighted L2 error normRobust Super-Resolution by minimizing a Gaussian-weighted L2 error norm
Robust Super-Resolution by minimizing a Gaussian-weighted L2 error normTuan Q. Pham
 
No Slide Title
No Slide TitleNo Slide Title
No Slide Titlebutest
 
No Slide Title
No Slide TitleNo Slide Title
No Slide Titlebutest
 
Speckle Noise Reduction in Ultrasound Images using Adaptive and Anisotropic D...
Speckle Noise Reduction in Ultrasound Images using Adaptive and Anisotropic D...Speckle Noise Reduction in Ultrasound Images using Adaptive and Anisotropic D...
Speckle Noise Reduction in Ultrasound Images using Adaptive and Anisotropic D...Md. Shohel Rana
 
Talk icml
Talk icmlTalk icml
Talk icmlBo Li
 
Macrocanonical models for texture synthesis
Macrocanonical models for texture synthesisMacrocanonical models for texture synthesis
Macrocanonical models for texture synthesisValentin De Bortoli
 
Ip lectures 1 and 2
Ip lectures 1 and 2 Ip lectures 1 and 2
Ip lectures 1 and 2 samarthgec
 
Global illumination techniques for the computation of hight quality images in...
Global illumination techniques for the computation of hight quality images in...Global illumination techniques for the computation of hight quality images in...
Global illumination techniques for the computation of hight quality images in...Frederic Perez
 

Similar to Dictionary Learning for Image Reconstruction (19)

Tuto cvpr part4
Tuto cvpr part4Tuto cvpr part4
Tuto cvpr part4
 
01 graphical models
01 graphical models01 graphical models
01 graphical models
 
CVPR2010: Advanced ITinCVPR in a Nutshell: part 4: additional slides
CVPR2010: Advanced ITinCVPR in a Nutshell: part 4: additional slidesCVPR2010: Advanced ITinCVPR in a Nutshell: part 4: additional slides
CVPR2010: Advanced ITinCVPR in a Nutshell: part 4: additional slides
 
BMC 2012
BMC 2012BMC 2012
BMC 2012
 
Image Restitution Using Non-Locally Centralized Sparse Representation Model
Image Restitution Using Non-Locally Centralized Sparse Representation ModelImage Restitution Using Non-Locally Centralized Sparse Representation Model
Image Restitution Using Non-Locally Centralized Sparse Representation Model
 
Digital Image Processing - Image Restoration
Digital Image Processing - Image RestorationDigital Image Processing - Image Restoration
Digital Image Processing - Image Restoration
 
Gi2429352937
Gi2429352937Gi2429352937
Gi2429352937
 
Habilitation à diriger des recherches
Habilitation à diriger des recherchesHabilitation à diriger des recherches
Habilitation à diriger des recherches
 
Lec-3 DIP.pptx
Lec-3 DIP.pptxLec-3 DIP.pptx
Lec-3 DIP.pptx
 
Robust Super-Resolution by minimizing a Gaussian-weighted L2 error norm
Robust Super-Resolution by minimizing a Gaussian-weighted L2 error normRobust Super-Resolution by minimizing a Gaussian-weighted L2 error norm
Robust Super-Resolution by minimizing a Gaussian-weighted L2 error norm
 
No Slide Title
No Slide TitleNo Slide Title
No Slide Title
 
No Slide Title
No Slide TitleNo Slide Title
No Slide Title
 
Speckle Noise Reduction in Ultrasound Images using Adaptive and Anisotropic D...
Speckle Noise Reduction in Ultrasound Images using Adaptive and Anisotropic D...Speckle Noise Reduction in Ultrasound Images using Adaptive and Anisotropic D...
Speckle Noise Reduction in Ultrasound Images using Adaptive and Anisotropic D...
 
Talk icml
Talk icmlTalk icml
Talk icml
 
Macrocanonical models for texture synthesis
Macrocanonical models for texture synthesisMacrocanonical models for texture synthesis
Macrocanonical models for texture synthesis
 
Lecture 11
Lecture 11Lecture 11
Lecture 11
 
Ip lectures 1 and 2
Ip lectures 1 and 2 Ip lectures 1 and 2
Ip lectures 1 and 2
 
Global illumination techniques for the computation of hight quality images in...
Global illumination techniques for the computation of hight quality images in...Global illumination techniques for the computation of hight quality images in...
Global illumination techniques for the computation of hight quality images in...
 
DIP-Questions.pdf
DIP-Questions.pdfDIP-Questions.pdf
DIP-Questions.pdf
 

Recently uploaded

Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfsanyamsingh5019
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfAyushMahapatra5
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13Steve Thomason
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Sapana Sha
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introductionMaksud Ahmed
 
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...fonyou31
 
Disha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdfDisha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdfchloefrazer622
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdfQucHHunhnh
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Krashi Coaching
 
fourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writingfourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writingTeacherCyreneCayanan
 
Student login on Anyboli platform.helpin
Student login on Anyboli platform.helpinStudent login on Anyboli platform.helpin
Student login on Anyboli platform.helpinRaunakKeshri1
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...EduSkills OECD
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxVishalSingh1417
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdfQucHHunhnh
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...christianmathematics
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactdawncurless
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeThiyagu K
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformChameera Dedduwage
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityGeoBlogs
 

Recently uploaded (20)

Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdf
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdf
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
 
Disha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdfDisha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdf
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
Advance Mobile Application Development class 07
Advance Mobile Application Development class 07Advance Mobile Application Development class 07
Advance Mobile Application Development class 07
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
 
fourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writingfourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writing
 
Student login on Anyboli platform.helpin
Student login on Anyboli platform.helpinStudent login on Anyboli platform.helpin
Student login on Anyboli platform.helpin
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptx
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impact
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and Mode
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy Reform
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 

Dictionary Learning for Image Reconstruction

  • 1. Sparse Coding and Dictionary Learning for Image Analysis Part II: Dictionary Learning for signal reconstruction Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro ICCV’09 tutorial, Kyoto, 28th September 2009 Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro Dictionary Learning for signal reconstruction 1/43
  • 2. What this part is about The learning of compact representations of images adapted to restoration tasks. A fast online algorithm for learning dictionaries and factorizing matrices in general. Various formulations for image and video processing. Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro Dictionary Learning for signal reconstruction 2/43
  • 3. The Image Denoising Problem y = xorig + w measurements original image noise Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro Dictionary Learning for signal reconstruction 3/43
  • 4. Sparse representations for image restoration y = xorig + w measurements original image noise Energy minimization problem - MAP estimation E (x) = ||y − x||2 2 + Pr (x) relation to measurements prior Some classical priors Smoothness λ||Lx||2 2 Total variation λ|| x||2 1 Wavelet sparsity λ||Wx||1 ... Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro Dictionary Learning for signal reconstruction 4/43
  • 5. Sparse representations for image restoration Sparsity and redundancy Pr (x) = λ||α||0 for x ≈ Dα       α[1] α[2] x =  d1 d2 · · · dp   .  .   .  α[p] x∈Rm D∈Rm×p α∈Rp ,sparse Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro Dictionary Learning for signal reconstruction 5/43
  • 6. Sparse representations for image restoration Designed dictionaries [Haar, 1910], [Zweig, Morlet, Grossman ∼70s], [Meyer, Mallat, Daubechies, Coifman, Donoho, Candes ∼80s-today]. . . (see [Mallat, 1999]) Wavelets, Curvelets, Wedgelets, Bandlets, . . . lets Learned dictionaries of patches [Olshausen and Field, 1997], [Engan et al., 1999], [Lewicki and Sejnowski, 2000], [Aharon et al., 2006] , [Roth and Black, 2005], [Lee et al., 2007] 1 min ||xi − Dαi ||2 + λψ(αi ) 2 αi ,D∈C 2 i sparsity reconstruction ψ(α) = ||α||0 (“ 0 pseudo-norm”) ψ(α) = ||α||1 ( 1 norm) Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro Dictionary Learning for signal reconstruction 6/43
  • 7. Sparse representations for image restoration Solving the denoising problem [Elad and Aharon, 2006] Extract all overlapping 8 × 8 patches yi . Solve a matrix factorization problem: n 1 min ||yi − Dαi ||2 + λψ(αi ), 2 αi ,D∈C 2 i=1 sparsity reconstruction with n > 100, 000 Average the reconstruction of each patch. Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro Dictionary Learning for signal reconstruction 7/43
  • 8. Sparse representations for image restoration K-SVD: [Elad and Aharon, 2006] Figure: Dictionary trained on a noisy version of the image boat. Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro Dictionary Learning for signal reconstruction 8/43
  • 9. Sparse representations for image restoration Inpainting, Demosaicking 1 min ||β ⊗ (yi − Dαi )||2 + λi ψ(αi ) D∈C,α 2 i 2 i RAW Image Processing (see our poster) White balance. Denoising Black substraction. Conversion to sRGB. Demosaicking Gamma correction. Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro Dictionary Learning for signal reconstruction 9/43
  • 10. Sparse representations for image restoration [Mairal, Bach, Ponce, Sapiro, and Zisserman, 2009c] Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro Dictionary Learning for signal reconstruction 10/43
  • 11. Sparse representations for image restoration [Mairal, Sapiro, and Elad, 2008b] Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro Dictionary Learning for signal reconstruction 11/43
  • 12. Sparse representations for image restoration Inpainting, [Mairal, Elad, and Sapiro, 2008a] Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro Dictionary Learning for signal reconstruction 12/43
  • 13. Sparse representations for image restoration Inpainting, [Mairal, Elad, and Sapiro, 2008a] Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro Dictionary Learning for signal reconstruction 13/43
  • 14. Sparse representations for video restoration Key ideas for video processing [Protter and Elad, 2009] Using a 3D dictionary. Processing of many frames at the same time. Dictionary propagation. Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro Dictionary Learning for signal reconstruction 14/43
  • 15. Sparse representations for image restoration Inpainting, [Mairal, Sapiro, and Elad, 2008b] Figure: Inpainting results. Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro Dictionary Learning for signal reconstruction 15/43
  • 16. Sparse representations for image restoration Inpainting, [Mairal, Sapiro, and Elad, 2008b] Figure: Inpainting results. Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro Dictionary Learning for signal reconstruction 16/43
  • 17. Sparse representations for image restoration Inpainting, [Mairal, Sapiro, and Elad, 2008b] Figure: Inpainting results. Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro Dictionary Learning for signal reconstruction 17/43
  • 18. Sparse representations for image restoration Inpainting, [Mairal, Sapiro, and Elad, 2008b] Figure: Inpainting results. Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro Dictionary Learning for signal reconstruction 18/43
  • 19. Sparse representations for image restoration Inpainting, [Mairal, Sapiro, and Elad, 2008b] Figure: Inpainting results. Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro Dictionary Learning for signal reconstruction 19/43
  • 20. Sparse representations for image restoration Color video denoising, [Mairal, Sapiro, and Elad, 2008b] Figure: Denoising results. σ = 25 Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro Dictionary Learning for signal reconstruction 20/43
  • 21. Sparse representations for image restoration Color video denoising, [Mairal, Sapiro, and Elad, 2008b] Figure: Denoising results. σ = 25 Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro Dictionary Learning for signal reconstruction 21/43
  • 22. Sparse representations for image restoration Color video denoising, [Mairal, Sapiro, and Elad, 2008b] Figure: Denoising results. σ = 25 Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro Dictionary Learning for signal reconstruction 22/43
  • 23. Sparse representations for image restoration Color video denoising, [Mairal, Sapiro, and Elad, 2008b] Figure: Denoising results. σ = 25 Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro Dictionary Learning for signal reconstruction 23/43
  • 24. Sparse representations for image restoration Color video denoising, [Mairal, Sapiro, and Elad, 2008b] Figure: Denoising results. σ = 25 Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro Dictionary Learning for signal reconstruction 24/43
  • 25. Optimization for Dictionary Learning n 1 min ||xi − Dαi ||2 + λ||αi ||1 2 α∈R p×n 2 D∈C i=1 C = {D ∈ Rm×p s.t. ∀j = 1, . . . , p, ||dj ||2 ≤ 1}. Classical optimization alternates between D and α. Good results, but very slow! Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro Dictionary Learning for signal reconstruction 25/43
  • 26. Optimization for Dictionary Learning [Mairal, Bach, Ponce, and Sapiro, 2009a] Classical formulation of dictionary learning n 1 min fn (D) = min l(xi , D), D∈C D∈C n i=1 where 1 l(x, D) = minp ||x − Dα||2 + λ||α||1 . 2 α∈R 2 Which formulation are we interested in? n 1 min f (D) = Ex [l(x, D)] ≈ lim l(xi , D) D∈C n→+∞ n i=1 [Bottou and Bousquet, 2008]: Online learning can handle potentially infinite or dynamic datasets, be dramatically faster than batch algorithms. Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro Dictionary Learning for signal reconstruction 26/43
  • 27. Optimization for Dictionary Learning Require: D0 ∈ Rm×p (initial dictionary); λ ∈ R 1: A0 = 0, B0 = 0. 2: for t=1,. . . ,T do 3: Draw xt 4: Sparse Coding 1 αt ← arg min ||xt − Dt−1 α||2 + λ||α||1 , 2 α∈Rp 2 5: Aggregate sufficient statistics At ← At−1 + αt αT , Bt ← Bt−1 + xt αT t t 6: Dictionary Update (block-coordinate descent) t 1 1 Dt ← arg min ||xi − Dαi ||2 + λ||αi ||1 . 2 D∈C t 2 i=1 7: end for Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro Dictionary Learning for signal reconstruction 27/43
  • 28. Optimization for Dictionary Learning Which guarantees do we have? Under a few reasonable assumptions, ˆ we build a surrogate function ft of the expected cost f verifying ˆ lim ft (Dt ) − f (Dt ) = 0, t→+∞ Dt is asymptotically close to a stationary point. Extensions (all implemented in SPAMS) non-negative matrix decompositions. sparse PCA (sparse dictionaries). fused-lasso regularizations (piecewise constant dictionaries) Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro Dictionary Learning for signal reconstruction 28/43
  • 29. Optimization for Dictionary Learning Experimental results, batch vs online m = 8 × 8, p = 256 Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro Dictionary Learning for signal reconstruction 29/43
  • 30. Optimization for Dictionary Learning Experimental results, batch vs online m = 12 × 12 × 3, p = 512 Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro Dictionary Learning for signal reconstruction 30/43
  • 31. Optimization for Dictionary Learning Inpainting a 12-Mpixel photograph Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro Dictionary Learning for signal reconstruction 31/43
  • 32. Optimization for Dictionary Learning Inpainting a 12-Mpixel photograph Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro Dictionary Learning for signal reconstruction 32/43
  • 33. Optimization for Dictionary Learning Inpainting a 12-Mpixel photograph Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro Dictionary Learning for signal reconstruction 33/43
  • 34. Optimization for Dictionary Learning Inpainting a 12-Mpixel photograph Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro Dictionary Learning for signal reconstruction 34/43
  • 35. Extension to NMF and sparse PCA [Mairal, Bach, Ponce, and Sapiro, 2009b] NMF extension n 1 min ||xi − Dαi ||2 s.t. αi ≥ 0, 2 D ≥ 0. α∈Rp×n 2 D∈C i=1 SPCA extension n 1 min ||xi − Dαi ||2 + λ||α1 ||1 2 α∈Rp×n 2 D∈C i=1 C = {D ∈ Rm×p s.t. ∀j ||dj ||2 + γ||dj ||1 ≤ 1}. 2 Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro Dictionary Learning for signal reconstruction 35/43
  • 36. Extension to NMF and sparse PCA Faces: Extended Yale Database B (a) PCA (b) NNMF (c) DL Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro Dictionary Learning for signal reconstruction 36/43
  • 37. Extension to NMF and sparse PCA Faces: Extended Yale Database B (d) SPCA, τ = 70% (e) SPCA, τ = 30% (f) SPCA, τ = 10% Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro Dictionary Learning for signal reconstruction 37/43
  • 38. Extension to NMF and sparse PCA Natural Patches (a) PCA (b) NNMF (c) DL Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro Dictionary Learning for signal reconstruction 38/43
  • 39. Extension to NMF and sparse PCA Natural Patches (d) SPCA, τ = 70% (e) SPCA, τ = 30% (f) SPCA, τ = 10% Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro Dictionary Learning for signal reconstruction 39/43
  • 40. Summary of this part The dictionary learning framework leads to state-of-the-art results for many image . . . . . . and video processing tasks. Online learning techniques are well-suited for this problem and allows training sets with millions of patches. Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro Dictionary Learning for signal reconstruction 40/43
  • 41. References I M. Aharon, M. Elad, and A. M. Bruckstein. The K-SVD: An algorithm for designing of overcomplete dictionaries for sparse representations. IEEE Transactions on Signal Processing, 54(11):4311–4322, November 2006. L. Bottou and O. Bousquet. The trade-offs of large scale learning. In J.C. Platt, D. Koller, Y. Singer, and S. Roweis, editors, Advances in Neural Information Processing Systems, volume 20, pages 161–168. MIT Press, Cambridge, MA, 2008. M. Elad and M. Aharon. Image denoising via sparse and redundant representations over learned dictionaries. IEEE Transactions on Image Processing, 54(12): 3736–3745, December 2006. K. Engan, S. O. Aase, and J. H. Husoy. Frame based signal compression using method of optimal directions (MOD). In Proceedings of the 1999 IEEE International Symposium on Circuits Systems, volume 4, 1999. A. Haar. Zur theorie der orthogonalen funktionensysteme. Mathematische Annalen, 69:331–371, 1910. H. Lee, A. Battle, R. Raina, and A. Y. Ng. Efficient sparse coding algorithms. In B. Sch¨lkopf, J. Platt, and T. Hoffman, editors, Advances in Neural Information o Processing Systems, volume 19, pages 801–808. MIT Press, Cambridge, MA, 2007. Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro Dictionary Learning for signal reconstruction 41/43
  • 42. References II M. S. Lewicki and T. J. Sejnowski. Learning overcomplete representations. Neural Computation, 12(2):337–365, 2000. J. Mairal, M. Elad, and G. Sapiro. Sparse representation for color image restoration. IEEE Transactions on Image Processing, 17(1):53–69, January 2008a. J. Mairal, G. Sapiro, and M. Elad. Learning multiscale sparse representations for image and video restoration. SIAM Multiscale Modelling and Simulation, 7(1): 214–241, April 2008b. J. Mairal, F. Bach, J. Ponce, and G. Sapiro. Online dictionary learning for sparse coding. In Proceedings of the International Conference on Machine Learning (ICML), 2009a. J. Mairal, F. Bach, J. Ponce, and G. Sapiro. Online learning for matrix factorization and sparse coding. ArXiv:0908.0050v1, 2009b. submitted. J. Mairal, F. Bach, J. Ponce, G. Sapiro, and A. Zisserman. Non-local sparse models for image restoration. In Proceedings of the IEEE International Conference on Computer Vision (ICCV), 2009c. S. Mallat. A Wavelet Tour of Signal Processing, Second Edition. Academic Press, New York, September 1999. Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro Dictionary Learning for signal reconstruction 42/43
  • 43. References III B. A. Olshausen and D. J. Field. Sparse coding with an overcomplete basis set: A strategy employed by V1? Vision Research, 37:3311–3325, 1997. M. Protter and M. Elad. Image sequence denoising via sparse and redundant representations. IEEE Transactions on Image Processing, 18(1):27–36, 2009. S. Roth and M. J. Black. Fields of experts: A framework for learning image priors. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2005. Francis Bach, Julien Mairal, Jean Ponce and Guillermo Sapiro Dictionary Learning for signal reconstruction 43/43