SlideShare a Scribd company logo
1 of 40
Download to read offline
Saliency Detection via
 Divergence Analysis

Jia-Bin Huang        Narendra Ahuja

 Electrical and Computer Engineering
University of Illinois, Urbana-Champaign
Jia-Bin Huang         Narendra Ahuja
jbhuang1@illinois.edu
   n-ahuja@illinois.edu
What this talk is about?
1.  The saliency detection problem

2.  A unifying framework for bottom-up
    saliency detection algorithms

3.  Ways to improve the performance
Look for the unique item
 Clap when you find it
What is Saliency?
•  Visual salience (or visual saliency) is the
    distinct subjective perceptual quality
    which makes some items in the world
    stand out from their neighbors and
    immediately grab our attention

                Where to look?
Example of Stimulus



 Color       Orientation




Motion     Natural scene
Why We Care?
•    Adaptive image compression
•    Object-of-interest image segmentation
•    Automatic image thumbnail
•    (Class-independent) Object detection and recognition
•    Visual tracking
•    Automatic Image collage
•    Content-aware image resizing
•    Non-photorealistic rendering
•    Understanding mechanism of human visual attention
Problem Setting
•  Input: Image -> Output: Saliency map
Design Principles
•  Rarity
   –  [Bruce NIPS 05] [Zhang JOV 08] [Rahtu ECCV 10] [Klein ICCV 11] [Borji
       CVPR 12]
•  Local complexity
   –  [Kadir IJCV 01]
•  Contrast
   –  [Itti PAMI 98] [Harel NIPS 05] [Ma MM 03] [Achanta CVPR 09] [Achanta
       ICIP 10] [Cheng CVPR 11] [Goferman CVPR 10] [Perazzi CVPR 12]
•  Spectral
   –  [Hou CVPR 07] [Guo CVPR 08] [Hou PAMI 12] [Li PAMI 12]
•  Learning
   –  [Liu CVPR 07] [Judd ICCV 09] [Oliva ICIP 03] [Torralba Psycho.Rev 06]
Complexity!                         Contrast!
Surprise!                                Spectral!                Learning!
            Information!       Rarity!                     WTH?
Which one is better?
Main Result
•  Most of the bottom-up saliency detection
   algorithms can be rewritten in the form of
   divergence between probabilistic
   distributions learned from center and
   surround
Center-Surround Divergence
Kullback-Leibler Divergence
From Center to Surround
From Center to Surround
From Surround to Center
From Surround to Center
From Surround to Center
Symmetrised Divergence
Symmetrised Divergence
How to Choose Support?
Experimental Results
•  Datasets
  –  MSRA salient object detection dataset
  –  1000 groundtruth binary mask are available from
      [Achanta CVPR 09]
•  Evaluation metric
  –  Precision and recall curves
State-of-the-art
Saliency Detection Methods
–  Center to surround
   •  AIM [Bruce NIPS 05] SUN [Zhang JOV 08], SW [Rahtu ECCV
       10]
–  Surround to center
   •  CA [Goferman CVPR 10], AC [Achanta ICIP 10], FT [Achanta
       CVPR 09], LC [Zhai-ACMMM 06]
–  Symmetrised divergence
   •  HC, RC, [Cheng CVPR 11], IT [Itti PAMI 98], GB [Harel NIPS
      07]
–  Spectrum-based
   •  SR [Hou CVPR 07]
•  Rarity-based
   –  AIM: image specific
   –  SUN: image indep.
•  Rarity-based
   –  SW: Patch/window
       based surround +
       scale space analysis
•  Contrast-based
   –  GB: spatial weighting
      for adaptive surround
•  Contrast-based
   –  AC: adaptive surround
      support
•  Contrast-based
   –  RC: region-based
      center support
•  Contrast-based
   –  FT: color feature
   –  LC: intensity feature
Lessons Learned
•  Most of the bottom-up saliency detection
   algorithms are in fact closely related
  –  Not exhaustive, e.g., spectral-based methods
•  How to improve the performance?
  –  Richer features
  –  Less approximation
  –  Adaptive center/surround support
Future Work
Input image     Saliency Map      Error Map




              Method parameters
Questions?
•  Comments or questions:

Jia-Bin Huang
jbhuang1@illinois.edu
https://netfiles.uiuc.edu/jbhuang1/www/
References
•  Saliency Detection via Divergence Analysis: An Unified Perspective
   J.B. Huang and N. Ahuja. ICPR, 2012
•  Frequency-tuned salient region detection
   R. Achanta, S. Hemami, F. Estrada, and S. Susstrunk., CVPR 2009
•  Saliency detection using maximum symmetric surround.
   R. Achanta and S. Susstrunk, ICIP, 2010.
•  State-of-the-art in visual attention modeling
   A. Borji and L. Itti. PAMI 2012
•  Saliency based on information maximization.
   N. Bruce and J. Tsotsos. NIPS, 2005
•  Global contrast based salient region detection.
   M. M. Cheng, G. X. Zhang, N. J. Mitra, X. Huang, and S. M. Hu. CVPR, 2011
•  The discriminant center-surround hypothesis for bottom-up saliency.
•  D. Gao, V. Mahadevan, and N. Vasconcelos. NIPS 2007
References
•  Context-aware saliency detection.
   S. Goferman, L. Zelnik-Manor, and A. Tal. CVPR, 2010.
•  Graph-based visual saliency.
   J. Harel, C. Koch, and P. Perona. NIPS, 2006
•  Saliency detection: A spectral residual approach.
   X. Hou and L. Zhang. CVPR, 2007
•  A model of saliency based visual attention for rapid scene analysis.
   L. Itti, C. Koch, and E. Niebur. PAMI 1998
•  Learning to predict where humans look.
   T. Judd, K. Ehinger, F. Durand, and A. Torralba. ICCV, 2009
•  Center-surround divergence of feature statistics for salient object detection.
   D. A. Klein and S. Frintrop. ICCV, 2011
•  Saliency detection based on frequency and spatial domain analyses
   J. Li, M. Levine, X. An, and H. He. BMVC 2011
References
•  Learning to detect a salient object
   T. Liu, J. Sun, N. N. Zheng, X. Tang, and H. Y. Shum, CVPR 2007
•  Contrast-based image attention analysis by using fuzzy growing
   Y. Ma and H. Zhang, ACM MM 2003
•  Top-down control of visual attention in object detection
   A. Oliva, A. Torralba, M. Castelhano, and J. Henderson. ICIP 2003
•  Segmenting salient objects from images and videos
   E. Rahtu, J. Kannala, M. Salo, and J. Heikkil., ECCV 2010
•  Computational versus psychophysical image saliency: A comparative
    evaluation study, A. Toet. PAMI 2011
•  Visual attention detection in video sequences using spatiotemporal cues
   Y. Zhai and M. Shah. ACM MM 2006
•  Sun: A bayesian framework for saliency using natural statistics
   L. Zhang, M. Tong, T. Marks, H. Shan, and G. Cottrell. JOV 2008

More Related Content

What's hot

Comparative study on image segmentation techniques
Comparative study on image segmentation techniquesComparative study on image segmentation techniques
Comparative study on image segmentation techniquesgmidhubala
 
Image segmentation 2
Image segmentation 2 Image segmentation 2
Image segmentation 2 Rumah Belajar
 
Video object tracking with classification and recognition of objects
Video object tracking with classification and recognition of objectsVideo object tracking with classification and recognition of objects
Video object tracking with classification and recognition of objectsManish Khare
 
Line Detection using Hough transform .pptx
Line Detection using Hough transform .pptxLine Detection using Hough transform .pptx
Line Detection using Hough transform .pptxshubham loni
 
Final presentation optical flow estimation with DL
Final presentation  optical flow estimation with DLFinal presentation  optical flow estimation with DL
Final presentation optical flow estimation with DLLeapMind Inc
 
Depth Fusion from RGB and Depth Sensors II
Depth Fusion from RGB and Depth Sensors IIDepth Fusion from RGB and Depth Sensors II
Depth Fusion from RGB and Depth Sensors IIYu Huang
 
Real Time Object Dectection using machine learning
Real Time Object Dectection using machine learningReal Time Object Dectection using machine learning
Real Time Object Dectection using machine learningpratik pratyay
 
CV_Chap 6 Motion Representation
CV_Chap 6 Motion RepresentationCV_Chap 6 Motion Representation
CV_Chap 6 Motion RepresentationKhushali Kathiriya
 
Designing more efficient convolution neural network
Designing more efficient convolution neural networkDesigning more efficient convolution neural network
Designing more efficient convolution neural networkNAVER Engineering
 
Object Detection Methods using Deep Learning
Object Detection Methods using Deep LearningObject Detection Methods using Deep Learning
Object Detection Methods using Deep LearningSungjoon Choi
 
fusion of Camera and lidar for autonomous driving II
fusion of Camera and lidar for autonomous driving IIfusion of Camera and lidar for autonomous driving II
fusion of Camera and lidar for autonomous driving IIYu Huang
 
You only look once: Unified, real-time object detection (UPC Reading Group)
You only look once: Unified, real-time object detection (UPC Reading Group)You only look once: Unified, real-time object detection (UPC Reading Group)
You only look once: Unified, real-time object detection (UPC Reading Group)Universitat Politècnica de Catalunya
 
Emerging Properties in Self-Supervised Vision Transformers
Emerging Properties in Self-Supervised Vision TransformersEmerging Properties in Self-Supervised Vision Transformers
Emerging Properties in Self-Supervised Vision TransformersSungchul Kim
 
CNNの可視化手法Grad-CAMの紹介~CNNさん、あなたはどこを見ているの?~ | OHS勉強会#6
CNNの可視化手法Grad-CAMの紹介~CNNさん、あなたはどこを見ているの?~ | OHS勉強会#6CNNの可視化手法Grad-CAMの紹介~CNNさん、あなたはどこを見ているの?~ | OHS勉強会#6
CNNの可視化手法Grad-CAMの紹介~CNNさん、あなたはどこを見ているの?~ | OHS勉強会#6Toshinori Hanya
 
Deep learning based object detection basics
Deep learning based object detection basicsDeep learning based object detection basics
Deep learning based object detection basicsBrodmann17
 
Deep learning based object detection
Deep learning based object detectionDeep learning based object detection
Deep learning based object detectionchettykulkarni
 

What's hot (20)

Comparative study on image segmentation techniques
Comparative study on image segmentation techniquesComparative study on image segmentation techniques
Comparative study on image segmentation techniques
 
Image segmentation 2
Image segmentation 2 Image segmentation 2
Image segmentation 2
 
Video object tracking with classification and recognition of objects
Video object tracking with classification and recognition of objectsVideo object tracking with classification and recognition of objects
Video object tracking with classification and recognition of objects
 
Line Detection using Hough transform .pptx
Line Detection using Hough transform .pptxLine Detection using Hough transform .pptx
Line Detection using Hough transform .pptx
 
Segmentation
SegmentationSegmentation
Segmentation
 
Final presentation optical flow estimation with DL
Final presentation  optical flow estimation with DLFinal presentation  optical flow estimation with DL
Final presentation optical flow estimation with DL
 
Defocus magnification
Defocus magnificationDefocus magnification
Defocus magnification
 
Depth Fusion from RGB and Depth Sensors II
Depth Fusion from RGB and Depth Sensors IIDepth Fusion from RGB and Depth Sensors II
Depth Fusion from RGB and Depth Sensors II
 
Real Time Object Dectection using machine learning
Real Time Object Dectection using machine learningReal Time Object Dectection using machine learning
Real Time Object Dectection using machine learning
 
CV_Chap 6 Motion Representation
CV_Chap 6 Motion RepresentationCV_Chap 6 Motion Representation
CV_Chap 6 Motion Representation
 
Designing more efficient convolution neural network
Designing more efficient convolution neural networkDesigning more efficient convolution neural network
Designing more efficient convolution neural network
 
Object Detection Methods using Deep Learning
Object Detection Methods using Deep LearningObject Detection Methods using Deep Learning
Object Detection Methods using Deep Learning
 
fusion of Camera and lidar for autonomous driving II
fusion of Camera and lidar for autonomous driving IIfusion of Camera and lidar for autonomous driving II
fusion of Camera and lidar for autonomous driving II
 
You only look once: Unified, real-time object detection (UPC Reading Group)
You only look once: Unified, real-time object detection (UPC Reading Group)You only look once: Unified, real-time object detection (UPC Reading Group)
You only look once: Unified, real-time object detection (UPC Reading Group)
 
Yolo
YoloYolo
Yolo
 
Emerging Properties in Self-Supervised Vision Transformers
Emerging Properties in Self-Supervised Vision TransformersEmerging Properties in Self-Supervised Vision Transformers
Emerging Properties in Self-Supervised Vision Transformers
 
Cv_Chap 4 Segmentation
Cv_Chap 4 SegmentationCv_Chap 4 Segmentation
Cv_Chap 4 Segmentation
 
CNNの可視化手法Grad-CAMの紹介~CNNさん、あなたはどこを見ているの?~ | OHS勉強会#6
CNNの可視化手法Grad-CAMの紹介~CNNさん、あなたはどこを見ているの?~ | OHS勉強会#6CNNの可視化手法Grad-CAMの紹介~CNNさん、あなたはどこを見ているの?~ | OHS勉強会#6
CNNの可視化手法Grad-CAMの紹介~CNNさん、あなたはどこを見ているの?~ | OHS勉強会#6
 
Deep learning based object detection basics
Deep learning based object detection basicsDeep learning based object detection basics
Deep learning based object detection basics
 
Deep learning based object detection
Deep learning based object detectionDeep learning based object detection
Deep learning based object detection
 

Viewers also liked

Transformation Guided Image Completion ICCP 2013
Transformation Guided Image Completion ICCP 2013Transformation Guided Image Completion ICCP 2013
Transformation Guided Image Completion ICCP 2013Jia-Bin Huang
 
Toward Accurate and Robust Cross-Ratio based Gaze Trackers Through Learning F...
Toward Accurate and Robust Cross-Ratio based Gaze Trackers Through Learning F...Toward Accurate and Robust Cross-Ratio based Gaze Trackers Through Learning F...
Toward Accurate and Robust Cross-Ratio based Gaze Trackers Through Learning F...Jia-Bin Huang
 
Enhancing Color Representation for the Color Vision Impaired (CVAVI 2008)
Enhancing Color Representation for the Color Vision Impaired (CVAVI 2008)Enhancing Color Representation for the Color Vision Impaired (CVAVI 2008)
Enhancing Color Representation for the Color Vision Impaired (CVAVI 2008)Jia-Bin Huang
 
How to Read Academic Papers
How to Read Academic PapersHow to Read Academic Papers
How to Read Academic PapersJia-Bin Huang
 
Image Completion using Planar Structure Guidance (SIGGRAPH 2014)
Image Completion using Planar Structure Guidance (SIGGRAPH 2014)Image Completion using Planar Structure Guidance (SIGGRAPH 2014)
Image Completion using Planar Structure Guidance (SIGGRAPH 2014)Jia-Bin Huang
 
Estimating Human Pose from Occluded Images (ACCV 2009)
Estimating Human Pose from Occluded Images (ACCV 2009)Estimating Human Pose from Occluded Images (ACCV 2009)
Estimating Human Pose from Occluded Images (ACCV 2009)Jia-Bin Huang
 
Lecture 21 - Image Categorization - Computer Vision Spring2015
Lecture 21 - Image Categorization -  Computer Vision Spring2015Lecture 21 - Image Categorization -  Computer Vision Spring2015
Lecture 21 - Image Categorization - Computer Vision Spring2015Jia-Bin Huang
 
Linear Algebra and Matlab tutorial
Linear Algebra and Matlab tutorialLinear Algebra and Matlab tutorial
Linear Algebra and Matlab tutorialJia-Bin Huang
 
Writing Fast MATLAB Code
Writing Fast MATLAB CodeWriting Fast MATLAB Code
Writing Fast MATLAB CodeJia-Bin Huang
 
Three Reasons to Join FVE at uiuc
Three Reasons to Join FVE at uiucThree Reasons to Join FVE at uiuc
Three Reasons to Join FVE at uiucJia-Bin Huang
 
Applying for Graduate School in S.T.E.M.
Applying for Graduate School in S.T.E.M.Applying for Graduate School in S.T.E.M.
Applying for Graduate School in S.T.E.M.Jia-Bin Huang
 
Jia-Bin Huang's Curriculum Vitae
Jia-Bin Huang's Curriculum VitaeJia-Bin Huang's Curriculum Vitae
Jia-Bin Huang's Curriculum VitaeJia-Bin Huang
 
A Physical Approach to Moving Cast Shadow Detection (ICASSP 2009)
A Physical Approach to Moving Cast Shadow Detection (ICASSP 2009)A Physical Approach to Moving Cast Shadow Detection (ICASSP 2009)
A Physical Approach to Moving Cast Shadow Detection (ICASSP 2009)Jia-Bin Huang
 
UIUC CS 498 - Computational Photography - Final project presentation
UIUC CS 498 - Computational Photography - Final project presentation UIUC CS 498 - Computational Photography - Final project presentation
UIUC CS 498 - Computational Photography - Final project presentation Jia-Bin Huang
 
美國研究所申請流程 (A Guide for Applying Graduate Schools in USA)
美國研究所申請流程 (A Guide for Applying Graduate Schools in USA)美國研究所申請流程 (A Guide for Applying Graduate Schools in USA)
美國研究所申請流程 (A Guide for Applying Graduate Schools in USA)Jia-Bin Huang
 
Single Image Super-Resolution from Transformed Self-Exemplars (CVPR 2015)
Single Image Super-Resolution from Transformed Self-Exemplars (CVPR 2015)Single Image Super-Resolution from Transformed Self-Exemplars (CVPR 2015)
Single Image Super-Resolution from Transformed Self-Exemplars (CVPR 2015)Jia-Bin Huang
 
What Makes a Creative Photograph?
What Makes a Creative Photograph?What Makes a Creative Photograph?
What Makes a Creative Photograph?Jia-Bin Huang
 
Computer Vision Crash Course
Computer Vision Crash CourseComputer Vision Crash Course
Computer Vision Crash CourseJia-Bin Huang
 
How to come up with new research ideas
How to come up with new research ideasHow to come up with new research ideas
How to come up with new research ideasJia-Bin Huang
 
Research 101 - Paper Writing with LaTeX
Research 101 - Paper Writing with LaTeXResearch 101 - Paper Writing with LaTeX
Research 101 - Paper Writing with LaTeXJia-Bin Huang
 

Viewers also liked (20)

Transformation Guided Image Completion ICCP 2013
Transformation Guided Image Completion ICCP 2013Transformation Guided Image Completion ICCP 2013
Transformation Guided Image Completion ICCP 2013
 
Toward Accurate and Robust Cross-Ratio based Gaze Trackers Through Learning F...
Toward Accurate and Robust Cross-Ratio based Gaze Trackers Through Learning F...Toward Accurate and Robust Cross-Ratio based Gaze Trackers Through Learning F...
Toward Accurate and Robust Cross-Ratio based Gaze Trackers Through Learning F...
 
Enhancing Color Representation for the Color Vision Impaired (CVAVI 2008)
Enhancing Color Representation for the Color Vision Impaired (CVAVI 2008)Enhancing Color Representation for the Color Vision Impaired (CVAVI 2008)
Enhancing Color Representation for the Color Vision Impaired (CVAVI 2008)
 
How to Read Academic Papers
How to Read Academic PapersHow to Read Academic Papers
How to Read Academic Papers
 
Image Completion using Planar Structure Guidance (SIGGRAPH 2014)
Image Completion using Planar Structure Guidance (SIGGRAPH 2014)Image Completion using Planar Structure Guidance (SIGGRAPH 2014)
Image Completion using Planar Structure Guidance (SIGGRAPH 2014)
 
Estimating Human Pose from Occluded Images (ACCV 2009)
Estimating Human Pose from Occluded Images (ACCV 2009)Estimating Human Pose from Occluded Images (ACCV 2009)
Estimating Human Pose from Occluded Images (ACCV 2009)
 
Lecture 21 - Image Categorization - Computer Vision Spring2015
Lecture 21 - Image Categorization -  Computer Vision Spring2015Lecture 21 - Image Categorization -  Computer Vision Spring2015
Lecture 21 - Image Categorization - Computer Vision Spring2015
 
Linear Algebra and Matlab tutorial
Linear Algebra and Matlab tutorialLinear Algebra and Matlab tutorial
Linear Algebra and Matlab tutorial
 
Writing Fast MATLAB Code
Writing Fast MATLAB CodeWriting Fast MATLAB Code
Writing Fast MATLAB Code
 
Three Reasons to Join FVE at uiuc
Three Reasons to Join FVE at uiucThree Reasons to Join FVE at uiuc
Three Reasons to Join FVE at uiuc
 
Applying for Graduate School in S.T.E.M.
Applying for Graduate School in S.T.E.M.Applying for Graduate School in S.T.E.M.
Applying for Graduate School in S.T.E.M.
 
Jia-Bin Huang's Curriculum Vitae
Jia-Bin Huang's Curriculum VitaeJia-Bin Huang's Curriculum Vitae
Jia-Bin Huang's Curriculum Vitae
 
A Physical Approach to Moving Cast Shadow Detection (ICASSP 2009)
A Physical Approach to Moving Cast Shadow Detection (ICASSP 2009)A Physical Approach to Moving Cast Shadow Detection (ICASSP 2009)
A Physical Approach to Moving Cast Shadow Detection (ICASSP 2009)
 
UIUC CS 498 - Computational Photography - Final project presentation
UIUC CS 498 - Computational Photography - Final project presentation UIUC CS 498 - Computational Photography - Final project presentation
UIUC CS 498 - Computational Photography - Final project presentation
 
美國研究所申請流程 (A Guide for Applying Graduate Schools in USA)
美國研究所申請流程 (A Guide for Applying Graduate Schools in USA)美國研究所申請流程 (A Guide for Applying Graduate Schools in USA)
美國研究所申請流程 (A Guide for Applying Graduate Schools in USA)
 
Single Image Super-Resolution from Transformed Self-Exemplars (CVPR 2015)
Single Image Super-Resolution from Transformed Self-Exemplars (CVPR 2015)Single Image Super-Resolution from Transformed Self-Exemplars (CVPR 2015)
Single Image Super-Resolution from Transformed Self-Exemplars (CVPR 2015)
 
What Makes a Creative Photograph?
What Makes a Creative Photograph?What Makes a Creative Photograph?
What Makes a Creative Photograph?
 
Computer Vision Crash Course
Computer Vision Crash CourseComputer Vision Crash Course
Computer Vision Crash Course
 
How to come up with new research ideas
How to come up with new research ideasHow to come up with new research ideas
How to come up with new research ideas
 
Research 101 - Paper Writing with LaTeX
Research 101 - Paper Writing with LaTeXResearch 101 - Paper Writing with LaTeX
Research 101 - Paper Writing with LaTeX
 

Similar to Saliency Detection via Divergence Analysis: A Unified Perspective ICPR 2012

Lecture 10 ming yang - face recognition systems
Lecture 10   ming yang - face recognition systemsLecture 10   ming yang - face recognition systems
Lecture 10 ming yang - face recognition systemsmustafa sarac
 
Sample selection and measuring instruments basic consideration of.pptx
Sample selection and measuring instruments basic consideration of.pptxSample selection and measuring instruments basic consideration of.pptx
Sample selection and measuring instruments basic consideration of.pptxaamnaemir
 
"How Image Sensor and Video Compression Parameters Impact Vision Algorithms,"...
"How Image Sensor and Video Compression Parameters Impact Vision Algorithms,"..."How Image Sensor and Video Compression Parameters Impact Vision Algorithms,"...
"How Image Sensor and Video Compression Parameters Impact Vision Algorithms,"...Edge AI and Vision Alliance
 
Passive stereo vision with deep learning
Passive stereo vision with deep learningPassive stereo vision with deep learning
Passive stereo vision with deep learningYu Huang
 
Human Pose Estimation by Deep Learning
Human Pose Estimation by Deep LearningHuman Pose Estimation by Deep Learning
Human Pose Estimation by Deep LearningWei Yang
 
HiPEAC 2019 Workshop - Real-Time Modelling Visual Scenes with Biological Insp...
HiPEAC 2019 Workshop - Real-Time Modelling Visual Scenes with Biological Insp...HiPEAC 2019 Workshop - Real-Time Modelling Visual Scenes with Biological Insp...
HiPEAC 2019 Workshop - Real-Time Modelling Visual Scenes with Biological Insp...Tulipp. Eu
 
A Comprehensive Analysis on Co-Saliency Detection on Learning Approaches in 3...
A Comprehensive Analysis on Co-Saliency Detection on Learning Approaches in 3...A Comprehensive Analysis on Co-Saliency Detection on Learning Approaches in 3...
A Comprehensive Analysis on Co-Saliency Detection on Learning Approaches in 3...AnuragVijayAgrawal
 
Stereoscopic field of view guidelines
Stereoscopic field of view guidelinesStereoscopic field of view guidelines
Stereoscopic field of view guidelinesBoris Greenberg
 
When Remote Sensing Meets Artificial Intelligence
When Remote Sensing Meets Artificial IntelligenceWhen Remote Sensing Meets Artificial Intelligence
When Remote Sensing Meets Artificial IntelligenceWahyuRahmaniar2
 
Image registraiton in Radiation Therapy
Image registraiton in Radiation TherapyImage registraiton in Radiation Therapy
Image registraiton in Radiation TherapyTalluri Anil Kumar
 
“Designing Assistive Technology: A Personal Experience of Trial and Error “
“Designing Assistive Technology: A Personal Experience of Trial and Error ““Designing Assistive Technology: A Personal Experience of Trial and Error “
“Designing Assistive Technology: A Personal Experience of Trial and Error “diannepatricia
 
Automatic System for Detection and Classification of Brain Tumors
Automatic System for Detection and Classification of Brain TumorsAutomatic System for Detection and Classification of Brain Tumors
Automatic System for Detection and Classification of Brain TumorsFatma Sayed Ibrahim
 

Similar to Saliency Detection via Divergence Analysis: A Unified Perspective ICPR 2012 (20)

Lecture 10 ming yang - face recognition systems
Lecture 10   ming yang - face recognition systemsLecture 10   ming yang - face recognition systems
Lecture 10 ming yang - face recognition systems
 
Sample selection and measuring instruments basic consideration of.pptx
Sample selection and measuring instruments basic consideration of.pptxSample selection and measuring instruments basic consideration of.pptx
Sample selection and measuring instruments basic consideration of.pptx
 
CSU_comp
CSU_compCSU_comp
CSU_comp
 
Aberrometry
AberrometryAberrometry
Aberrometry
 
"How Image Sensor and Video Compression Parameters Impact Vision Algorithms,"...
"How Image Sensor and Video Compression Parameters Impact Vision Algorithms,"..."How Image Sensor and Video Compression Parameters Impact Vision Algorithms,"...
"How Image Sensor and Video Compression Parameters Impact Vision Algorithms,"...
 
Anits dip
Anits dipAnits dip
Anits dip
 
Passive stereo vision with deep learning
Passive stereo vision with deep learningPassive stereo vision with deep learning
Passive stereo vision with deep learning
 
Human Pose Estimation by Deep Learning
Human Pose Estimation by Deep LearningHuman Pose Estimation by Deep Learning
Human Pose Estimation by Deep Learning
 
Dip
DipDip
Dip
 
HiPEAC 2019 Workshop - Real-Time Modelling Visual Scenes with Biological Insp...
HiPEAC 2019 Workshop - Real-Time Modelling Visual Scenes with Biological Insp...HiPEAC 2019 Workshop - Real-Time Modelling Visual Scenes with Biological Insp...
HiPEAC 2019 Workshop - Real-Time Modelling Visual Scenes with Biological Insp...
 
A Comprehensive Analysis on Co-Saliency Detection on Learning Approaches in 3...
A Comprehensive Analysis on Co-Saliency Detection on Learning Approaches in 3...A Comprehensive Analysis on Co-Saliency Detection on Learning Approaches in 3...
A Comprehensive Analysis on Co-Saliency Detection on Learning Approaches in 3...
 
Stereoscopic field of view guidelines
Stereoscopic field of view guidelinesStereoscopic field of view guidelines
Stereoscopic field of view guidelines
 
phase2 FINAL
phase2 FINALphase2 FINAL
phase2 FINAL
 
When Remote Sensing Meets Artificial Intelligence
When Remote Sensing Meets Artificial IntelligenceWhen Remote Sensing Meets Artificial Intelligence
When Remote Sensing Meets Artificial Intelligence
 
Image registraiton in Radiation Therapy
Image registraiton in Radiation TherapyImage registraiton in Radiation Therapy
Image registraiton in Radiation Therapy
 
“Designing Assistive Technology: A Personal Experience of Trial and Error “
“Designing Assistive Technology: A Personal Experience of Trial and Error ““Designing Assistive Technology: A Personal Experience of Trial and Error “
“Designing Assistive Technology: A Personal Experience of Trial and Error “
 
Automatic System for Detection and Classification of Brain Tumors
Automatic System for Detection and Classification of Brain TumorsAutomatic System for Detection and Classification of Brain Tumors
Automatic System for Detection and Classification of Brain Tumors
 
Part2
Part2Part2
Part2
 
Corneal topography
Corneal topographyCorneal topography
Corneal topography
 
Digital image processing 2
Digital image processing   2Digital image processing   2
Digital image processing 2
 

More from Jia-Bin Huang

How to write a clear paper
How to write a clear paperHow to write a clear paper
How to write a clear paperJia-Bin Huang
 
Real-time Face Detection and Recognition
Real-time Face Detection and RecognitionReal-time Face Detection and Recognition
Real-time Face Detection and RecognitionJia-Bin Huang
 
Pose aware online visual tracking
Pose aware online visual trackingPose aware online visual tracking
Pose aware online visual trackingJia-Bin Huang
 
Face Expression Enhancement
Face Expression EnhancementFace Expression Enhancement
Face Expression EnhancementJia-Bin Huang
 
Image Smoothing for Structure Extraction
Image Smoothing for Structure ExtractionImage Smoothing for Structure Extraction
Image Smoothing for Structure ExtractionJia-Bin Huang
 
Static and Dynamic Hand Gesture Recognition
Static and Dynamic Hand Gesture RecognitionStatic and Dynamic Hand Gesture Recognition
Static and Dynamic Hand Gesture RecognitionJia-Bin Huang
 
Real-Time Face Detection, Tracking, and Attributes Recognition
Real-Time Face Detection, Tracking, and Attributes RecognitionReal-Time Face Detection, Tracking, and Attributes Recognition
Real-Time Face Detection, Tracking, and Attributes RecognitionJia-Bin Huang
 
Estimating Human Pose from Occluded Images (ACCV 2009)
Estimating Human Pose from Occluded Images (ACCV 2009)Estimating Human Pose from Occluded Images (ACCV 2009)
Estimating Human Pose from Occluded Images (ACCV 2009)Jia-Bin Huang
 
Information Preserving Color Transformation for Protanopia and Deuteranopia (...
Information Preserving Color Transformation for Protanopia and Deuteranopia (...Information Preserving Color Transformation for Protanopia and Deuteranopia (...
Information Preserving Color Transformation for Protanopia and Deuteranopia (...Jia-Bin Huang
 
Enhancing Color Representation for the Color Vision Impaired (CVAVI 2008)
Enhancing Color Representation for the Color Vision Impaired (CVAVI 2008)Enhancing Color Representation for the Color Vision Impaired (CVAVI 2008)
Enhancing Color Representation for the Color Vision Impaired (CVAVI 2008)Jia-Bin Huang
 
Learning Moving Cast Shadows for Foreground Detection (VS 2008)
Learning Moving Cast Shadows for Foreground Detection (VS 2008)Learning Moving Cast Shadows for Foreground Detection (VS 2008)
Learning Moving Cast Shadows for Foreground Detection (VS 2008)Jia-Bin Huang
 
Learning Moving Cast Shadows for Foreground Detection (VS 2008)
Learning Moving Cast Shadows for Foreground Detection (VS 2008)Learning Moving Cast Shadows for Foreground Detection (VS 2008)
Learning Moving Cast Shadows for Foreground Detection (VS 2008)Jia-Bin Huang
 

More from Jia-Bin Huang (12)

How to write a clear paper
How to write a clear paperHow to write a clear paper
How to write a clear paper
 
Real-time Face Detection and Recognition
Real-time Face Detection and RecognitionReal-time Face Detection and Recognition
Real-time Face Detection and Recognition
 
Pose aware online visual tracking
Pose aware online visual trackingPose aware online visual tracking
Pose aware online visual tracking
 
Face Expression Enhancement
Face Expression EnhancementFace Expression Enhancement
Face Expression Enhancement
 
Image Smoothing for Structure Extraction
Image Smoothing for Structure ExtractionImage Smoothing for Structure Extraction
Image Smoothing for Structure Extraction
 
Static and Dynamic Hand Gesture Recognition
Static and Dynamic Hand Gesture RecognitionStatic and Dynamic Hand Gesture Recognition
Static and Dynamic Hand Gesture Recognition
 
Real-Time Face Detection, Tracking, and Attributes Recognition
Real-Time Face Detection, Tracking, and Attributes RecognitionReal-Time Face Detection, Tracking, and Attributes Recognition
Real-Time Face Detection, Tracking, and Attributes Recognition
 
Estimating Human Pose from Occluded Images (ACCV 2009)
Estimating Human Pose from Occluded Images (ACCV 2009)Estimating Human Pose from Occluded Images (ACCV 2009)
Estimating Human Pose from Occluded Images (ACCV 2009)
 
Information Preserving Color Transformation for Protanopia and Deuteranopia (...
Information Preserving Color Transformation for Protanopia and Deuteranopia (...Information Preserving Color Transformation for Protanopia and Deuteranopia (...
Information Preserving Color Transformation for Protanopia and Deuteranopia (...
 
Enhancing Color Representation for the Color Vision Impaired (CVAVI 2008)
Enhancing Color Representation for the Color Vision Impaired (CVAVI 2008)Enhancing Color Representation for the Color Vision Impaired (CVAVI 2008)
Enhancing Color Representation for the Color Vision Impaired (CVAVI 2008)
 
Learning Moving Cast Shadows for Foreground Detection (VS 2008)
Learning Moving Cast Shadows for Foreground Detection (VS 2008)Learning Moving Cast Shadows for Foreground Detection (VS 2008)
Learning Moving Cast Shadows for Foreground Detection (VS 2008)
 
Learning Moving Cast Shadows for Foreground Detection (VS 2008)
Learning Moving Cast Shadows for Foreground Detection (VS 2008)Learning Moving Cast Shadows for Foreground Detection (VS 2008)
Learning Moving Cast Shadows for Foreground Detection (VS 2008)
 

Saliency Detection via Divergence Analysis: A Unified Perspective ICPR 2012

  • 1. Saliency Detection via Divergence Analysis Jia-Bin Huang Narendra Ahuja Electrical and Computer Engineering University of Illinois, Urbana-Champaign
  • 2. Jia-Bin Huang Narendra Ahuja jbhuang1@illinois.edu n-ahuja@illinois.edu
  • 3. What this talk is about? 1.  The saliency detection problem 2.  A unifying framework for bottom-up saliency detection algorithms 3.  Ways to improve the performance
  • 4. Look for the unique item Clap when you find it
  • 5.
  • 6.
  • 7.
  • 8. What is Saliency? •  Visual salience (or visual saliency) is the distinct subjective perceptual quality which makes some items in the world stand out from their neighbors and immediately grab our attention Where to look?
  • 9. Example of Stimulus Color Orientation Motion Natural scene
  • 10. Why We Care? •  Adaptive image compression •  Object-of-interest image segmentation •  Automatic image thumbnail •  (Class-independent) Object detection and recognition •  Visual tracking •  Automatic Image collage •  Content-aware image resizing •  Non-photorealistic rendering •  Understanding mechanism of human visual attention
  • 11. Problem Setting •  Input: Image -> Output: Saliency map
  • 12. Design Principles •  Rarity –  [Bruce NIPS 05] [Zhang JOV 08] [Rahtu ECCV 10] [Klein ICCV 11] [Borji CVPR 12] •  Local complexity –  [Kadir IJCV 01] •  Contrast –  [Itti PAMI 98] [Harel NIPS 05] [Ma MM 03] [Achanta CVPR 09] [Achanta ICIP 10] [Cheng CVPR 11] [Goferman CVPR 10] [Perazzi CVPR 12] •  Spectral –  [Hou CVPR 07] [Guo CVPR 08] [Hou PAMI 12] [Li PAMI 12] •  Learning –  [Liu CVPR 07] [Judd ICCV 09] [Oliva ICIP 03] [Torralba Psycho.Rev 06]
  • 13. Complexity! Contrast! Surprise! Spectral! Learning! Information! Rarity! WTH?
  • 14. Which one is better?
  • 15. Main Result •  Most of the bottom-up saliency detection algorithms can be rewritten in the form of divergence between probabilistic distributions learned from center and surround
  • 18. From Center to Surround
  • 19. From Center to Surround
  • 25. How to Choose Support?
  • 26. Experimental Results •  Datasets –  MSRA salient object detection dataset –  1000 groundtruth binary mask are available from [Achanta CVPR 09] •  Evaluation metric –  Precision and recall curves
  • 27. State-of-the-art Saliency Detection Methods –  Center to surround •  AIM [Bruce NIPS 05] SUN [Zhang JOV 08], SW [Rahtu ECCV 10] –  Surround to center •  CA [Goferman CVPR 10], AC [Achanta ICIP 10], FT [Achanta CVPR 09], LC [Zhai-ACMMM 06] –  Symmetrised divergence •  HC, RC, [Cheng CVPR 11], IT [Itti PAMI 98], GB [Harel NIPS 07] –  Spectrum-based •  SR [Hou CVPR 07]
  • 28. •  Rarity-based –  AIM: image specific –  SUN: image indep.
  • 29. •  Rarity-based –  SW: Patch/window based surround + scale space analysis
  • 30. •  Contrast-based –  GB: spatial weighting for adaptive surround
  • 31. •  Contrast-based –  AC: adaptive surround support
  • 32. •  Contrast-based –  RC: region-based center support
  • 33. •  Contrast-based –  FT: color feature –  LC: intensity feature
  • 34. Lessons Learned •  Most of the bottom-up saliency detection algorithms are in fact closely related –  Not exhaustive, e.g., spectral-based methods •  How to improve the performance? –  Richer features –  Less approximation –  Adaptive center/surround support
  • 35. Future Work Input image Saliency Map Error Map Method parameters
  • 36.
  • 37. Questions? •  Comments or questions: Jia-Bin Huang jbhuang1@illinois.edu https://netfiles.uiuc.edu/jbhuang1/www/
  • 38. References •  Saliency Detection via Divergence Analysis: An Unified Perspective J.B. Huang and N. Ahuja. ICPR, 2012 •  Frequency-tuned salient region detection R. Achanta, S. Hemami, F. Estrada, and S. Susstrunk., CVPR 2009 •  Saliency detection using maximum symmetric surround. R. Achanta and S. Susstrunk, ICIP, 2010. •  State-of-the-art in visual attention modeling A. Borji and L. Itti. PAMI 2012 •  Saliency based on information maximization. N. Bruce and J. Tsotsos. NIPS, 2005 •  Global contrast based salient region detection. M. M. Cheng, G. X. Zhang, N. J. Mitra, X. Huang, and S. M. Hu. CVPR, 2011 •  The discriminant center-surround hypothesis for bottom-up saliency. •  D. Gao, V. Mahadevan, and N. Vasconcelos. NIPS 2007
  • 39. References •  Context-aware saliency detection. S. Goferman, L. Zelnik-Manor, and A. Tal. CVPR, 2010. •  Graph-based visual saliency. J. Harel, C. Koch, and P. Perona. NIPS, 2006 •  Saliency detection: A spectral residual approach. X. Hou and L. Zhang. CVPR, 2007 •  A model of saliency based visual attention for rapid scene analysis. L. Itti, C. Koch, and E. Niebur. PAMI 1998 •  Learning to predict where humans look. T. Judd, K. Ehinger, F. Durand, and A. Torralba. ICCV, 2009 •  Center-surround divergence of feature statistics for salient object detection. D. A. Klein and S. Frintrop. ICCV, 2011 •  Saliency detection based on frequency and spatial domain analyses J. Li, M. Levine, X. An, and H. He. BMVC 2011
  • 40. References •  Learning to detect a salient object T. Liu, J. Sun, N. N. Zheng, X. Tang, and H. Y. Shum, CVPR 2007 •  Contrast-based image attention analysis by using fuzzy growing Y. Ma and H. Zhang, ACM MM 2003 •  Top-down control of visual attention in object detection A. Oliva, A. Torralba, M. Castelhano, and J. Henderson. ICIP 2003 •  Segmenting salient objects from images and videos E. Rahtu, J. Kannala, M. Salo, and J. Heikkil., ECCV 2010 •  Computational versus psychophysical image saliency: A comparative evaluation study, A. Toet. PAMI 2011 •  Visual attention detection in video sequences using spatiotemporal cues Y. Zhai and M. Shah. ACM MM 2006 •  Sun: A bayesian framework for saliency using natural statistics L. Zhang, M. Tong, T. Marks, H. Shan, and G. Cottrell. JOV 2008

Editor's Notes

  1. Thanks for the introduction. I am Jia-Bin Huang. Today I am going to talk about some interesting facts about saliency detection.
  2. This is a joint work with Prof. Narendra Ahuja at the University of Illinois, Urbana-Champaign
  3. I will start with an overview of the problem. Then, introducing a unifying framework for bottom-up saliency detection algorithms. Later, we suggest ways to improve the performance.
  4. Let’s play a game first. Please clap when you see something different.
  5. Saliency is the distinct subjective quality which makes some items pop out from their neighbors and grab our attention. It guides us where to look.
  6. Here are some examples of stimulus.
  7. The reasons that this problem is important because it can facilitate numerous applications in computer vision and computer graphics.
  8. Here is the problem setting. Given an input image, our goal is to produce a saliency map locating the salient regions.
  9. There have been many design principles that guide the development of saliency detection algorithms. For example, rarity, complexity, contrast, spectral methods, and learning-based approaches.
  10. However, since some of the principles are inspired from psychological experiments, some are from intuitions, and others are purely computational, it’s often difficult to see their relations and more practically, which one is better?
  11. In this paper, we provide the bridges between these methods. Our claim is that most of the bottom-up saliency detection algorithms can be rewritten in the form of divergence between the probabilistic distributions learned from center and surround.
  12. Let’s start with KL-divergence. KL divergence is a contrast function between two probabilistic distributions. It has the following forms. It has many important operational meanings in detection, estimation, and information theory. In the following few slides, I will explicitly show the connections between various models.
  13. If we assume that the center support is a single pixel x_i, the KL-divergence from center to surround will become the so-called Shannon’s self-information. This class of saliency is called rarity-based saliency.
  14. If we take the difference between the self-information of center and surround, it yields the saliency in Rahtu’s paper at ECCV. If we make the assumption of feature channel independence, we can derive the saliency measure in Klein at ICCV.
  15. So how about switch the order? From the likelihood theory we know that the divergence can be rewritten as likelihood function. This forms the basics of contrast-based saliency methods.
  16. Assume that the distribution of the center follows Laplacian distributions. The saliency measure has this form.
  17. Similarly, assuming that the distribution is Gaussian. Using various approximation approaches gives rises to various measures.
  18. Now let’s look at some symmetrised divergence measure. The first one is the symmetric KL divergence. This measure has been widely used from feature selection. In the context of saliency detection, it corresponds to combine local and global rarity. Lambda divergence is another symmetric measure. It corresponds to the mutual information-based saliency.