SlideShare a Scribd company logo
1 of 15
Evaluation of Color Descriptors for Object and Scene Recognition Authors: Koen E. A. van de Sande, Theo Gevers, and Cees G. M. Snoek @ University of Amsterdam Presenter: Shao-Chuan Wang
Evaluation of Color Descriptors for Object and Scene Recognition Focus: Color features/descriptors on obj. and scene recognition Summary: The invariance of photometric transform aces and its effect on discriminative power. Conclusion: The usefulness of invariance is category-specific!
Photometric transforms (1/2) Light intensity scale invariant Light intensity shift invariant
Photometric transforms (1/2) Light intensity scale and shift invariant Light color change Light color change and shift
Color Descriptors (1/1) Histograms  RGB, Hue, Saturation, rgHistogram, … Color Moments: contain spatial info. Color SIFT: combined color and SIFT HSV-SIFT, Hue-SIFT, …
Color Histograms (1/2) RGB-histogram Hue-histogram H and S are scale-invariant and shift-invariant w.r.t light intensity rg-histogram Scale-invariant Not shift-invariant Not that b is redudant Image from wikipedia
Color Histograms (2/2) Transformed color Scale and shift-invariant w.r.t light intensity. Opponent color histogram O1,O2 shift invariant O3: intensity, no invariant
Color SIFT Descriptors (1/3) HSV-SIFT SIFT over HSV channels Hue is unstable in gray axis Hue-SIFT (Van de Weijer 2006) Used Hue histogram weighing  	by its saturation Concatenate Hue histogram with SIFT Only SIFT is invariant; Hue histogram is not! (partial invariance) Hue Instability
Color SIFT Descriptors (2/3) OpponentSIFT SIFT over all channels in the opponent color space. Shift-invariant to light intensity. W-SIFT Eliminate O1 and O2’s intensity information Scale-invariant to light intensity rg-SIFT SIFT over r,g spaces Scale and shift invariant, but not invariant to light color changes/shifts
Color SIFT Descriptors (3/3) Transformed color SIFT SIFT over normalized transformed channels. Scale- and shift-invariant to light color changes and shift.
Experiments Implementation: Scale-invariants points are detected by Harris-Laplace point detectors Color descriptors are computed over the area around the points; all regions are proportionally re-sampled to a uniform 60x60 patch. Cluster descriptors with k = 40 (images) k = 4000 (video) SVM classifier with EMD/chi-square kernel
Benchmark (1/3) Image: PASCAL VOC 2007, over 20 object categories
Benchmark (2/3) Most objs were categorized better under scale- and shift- invariant to light intensity Some, such as car and dining table, do not benefit from such invariance.
Benchmark (3/3) Video: Mediamill Challenge, 39 object and scene categories
Evaluation of Color Descriptors for Object and Scene Recognition Conclusion: W-SIFT and rgSIFT, in general, outperform other color descriptors. Light intensity info. Is important for some categories Usefulness of invariance is category-specific.

More Related Content

Similar to Evaluation of how color descriptors recognize objects and scenes

Optimizing Seismic Attributes Interpretation using HSV Rotation_PPT.pdf
Optimizing Seismic Attributes Interpretation using HSV Rotation_PPT.pdfOptimizing Seismic Attributes Interpretation using HSV Rotation_PPT.pdf
Optimizing Seismic Attributes Interpretation using HSV Rotation_PPT.pdfAwal72
 
Colour models
Colour modelsColour models
Colour modelsBCET
 
K Space in MRI
K Space in MRIK Space in MRI
K Space in MRIKajal Jha
 
LISSAJOUS PATTERNS Experiment 3
LISSAJOUS PATTERNS Experiment 3LISSAJOUS PATTERNS Experiment 3
LISSAJOUS PATTERNS Experiment 3Karimi LordRamza
 
Paper id 252014111
Paper id 252014111Paper id 252014111
Paper id 252014111IJRAT
 
The Importance of Terminology and sRGB Uncertainty - Notes - 0.4
The Importance of Terminology and sRGB Uncertainty - Notes - 0.4The Importance of Terminology and sRGB Uncertainty - Notes - 0.4
The Importance of Terminology and sRGB Uncertainty - Notes - 0.4Thomas Mansencal
 
Survey on Local Color Image Descriptors
Survey on Local Color Image DescriptorsSurvey on Local Color Image Descriptors
Survey on Local Color Image DescriptorsIRJET Journal
 
Iisrt anjana francis (ec)
Iisrt anjana francis (ec)Iisrt anjana francis (ec)
Iisrt anjana francis (ec)IISRT
 
SAR image interpretation
SAR image interpretationSAR image interpretation
SAR image interpretationElise Colin
 
Analysis and Simulation of Pseudo Ranging Noise codes for Geo-Stationary Sate...
Analysis and Simulation of Pseudo Ranging Noise codes for Geo-Stationary Sate...Analysis and Simulation of Pseudo Ranging Noise codes for Geo-Stationary Sate...
Analysis and Simulation of Pseudo Ranging Noise codes for Geo-Stationary Sate...IDES Editor
 
SLOPE 1st workshop - presentation 2
SLOPE 1st workshop - presentation 2SLOPE 1st workshop - presentation 2
SLOPE 1st workshop - presentation 2SLOPE Project
 
Sac ki anh mai - anh mai
Sac ki   anh mai - anh maiSac ki   anh mai - anh mai
Sac ki anh mai - anh maiThuy Dung Pham
 
GEOMETRIC TAMPERING ESTIMATION BY MEANS OF A SIFT-BASED FORENSIC ANALYSIS
GEOMETRIC TAMPERING ESTIMATION  BY MEANS OF A SIFT-BASED FORENSIC ANALYSISGEOMETRIC TAMPERING ESTIMATION  BY MEANS OF A SIFT-BASED FORENSIC ANALYSIS
GEOMETRIC TAMPERING ESTIMATION BY MEANS OF A SIFT-BASED FORENSIC ANALYSISICL - Image Communication Laboratory
 
RTOS BASED SECURE SHORTEST PATH ROUTING ALGORITHM IN MOBILE AD- HOC NETWORKS
RTOS BASED SECURE SHORTEST PATH ROUTING ALGORITHM IN MOBILE AD- HOC NETWORKSRTOS BASED SECURE SHORTEST PATH ROUTING ALGORITHM IN MOBILE AD- HOC NETWORKS
RTOS BASED SECURE SHORTEST PATH ROUTING ALGORITHM IN MOBILE AD- HOC NETWORKScscpconf
 
REVERSIBLE WAVELET AND SPECTRAL TRANSFORMS FOR LOSSLESS COMPRESSION OF COLOR ...
REVERSIBLE WAVELET AND SPECTRAL TRANSFORMS FOR LOSSLESS COMPRESSION OF COLOR ...REVERSIBLE WAVELET AND SPECTRAL TRANSFORMS FOR LOSSLESS COMPRESSION OF COLOR ...
REVERSIBLE WAVELET AND SPECTRAL TRANSFORMS FOR LOSSLESS COMPRESSION OF COLOR ...cscpconf
 
Visual thinking colin_ware_lectures_2013_5_color theory and color for informa...
Visual thinking colin_ware_lectures_2013_5_color theory and color for informa...Visual thinking colin_ware_lectures_2013_5_color theory and color for informa...
Visual thinking colin_ware_lectures_2013_5_color theory and color for informa...Elsa von Licy
 
Topology-Preserving Ordering of the RGB Space with an Evolutionary Algorithm
Topology-Preserving Ordering of the RGB Space with an Evolutionary AlgorithmTopology-Preserving Ordering of the RGB Space with an Evolutionary Algorithm
Topology-Preserving Ordering of the RGB Space with an Evolutionary AlgorithmFrancisco (Paco) Florez-Revuelta
 
Image Denoising Using Wavelet
Image Denoising Using WaveletImage Denoising Using Wavelet
Image Denoising Using WaveletAsim Qureshi
 

Similar to Evaluation of how color descriptors recognize objects and scenes (20)

Optimizing Seismic Attributes Interpretation using HSV Rotation_PPT.pdf
Optimizing Seismic Attributes Interpretation using HSV Rotation_PPT.pdfOptimizing Seismic Attributes Interpretation using HSV Rotation_PPT.pdf
Optimizing Seismic Attributes Interpretation using HSV Rotation_PPT.pdf
 
Colour models
Colour modelsColour models
Colour models
 
K Space in MRI
K Space in MRIK Space in MRI
K Space in MRI
 
LISSAJOUS PATTERNS Experiment 3
LISSAJOUS PATTERNS Experiment 3LISSAJOUS PATTERNS Experiment 3
LISSAJOUS PATTERNS Experiment 3
 
Paper id 252014111
Paper id 252014111Paper id 252014111
Paper id 252014111
 
Urban areas
Urban areasUrban areas
Urban areas
 
The Importance of Terminology and sRGB Uncertainty - Notes - 0.4
The Importance of Terminology and sRGB Uncertainty - Notes - 0.4The Importance of Terminology and sRGB Uncertainty - Notes - 0.4
The Importance of Terminology and sRGB Uncertainty - Notes - 0.4
 
Survey on Local Color Image Descriptors
Survey on Local Color Image DescriptorsSurvey on Local Color Image Descriptors
Survey on Local Color Image Descriptors
 
Iisrt anjana francis (ec)
Iisrt anjana francis (ec)Iisrt anjana francis (ec)
Iisrt anjana francis (ec)
 
SAR image interpretation
SAR image interpretationSAR image interpretation
SAR image interpretation
 
Analysis and Simulation of Pseudo Ranging Noise codes for Geo-Stationary Sate...
Analysis and Simulation of Pseudo Ranging Noise codes for Geo-Stationary Sate...Analysis and Simulation of Pseudo Ranging Noise codes for Geo-Stationary Sate...
Analysis and Simulation of Pseudo Ranging Noise codes for Geo-Stationary Sate...
 
SLOPE 1st workshop - presentation 2
SLOPE 1st workshop - presentation 2SLOPE 1st workshop - presentation 2
SLOPE 1st workshop - presentation 2
 
Sac ki anh mai - anh mai
Sac ki   anh mai - anh maiSac ki   anh mai - anh mai
Sac ki anh mai - anh mai
 
Lec11.ppt
Lec11.pptLec11.ppt
Lec11.ppt
 
GEOMETRIC TAMPERING ESTIMATION BY MEANS OF A SIFT-BASED FORENSIC ANALYSIS
GEOMETRIC TAMPERING ESTIMATION  BY MEANS OF A SIFT-BASED FORENSIC ANALYSISGEOMETRIC TAMPERING ESTIMATION  BY MEANS OF A SIFT-BASED FORENSIC ANALYSIS
GEOMETRIC TAMPERING ESTIMATION BY MEANS OF A SIFT-BASED FORENSIC ANALYSIS
 
RTOS BASED SECURE SHORTEST PATH ROUTING ALGORITHM IN MOBILE AD- HOC NETWORKS
RTOS BASED SECURE SHORTEST PATH ROUTING ALGORITHM IN MOBILE AD- HOC NETWORKSRTOS BASED SECURE SHORTEST PATH ROUTING ALGORITHM IN MOBILE AD- HOC NETWORKS
RTOS BASED SECURE SHORTEST PATH ROUTING ALGORITHM IN MOBILE AD- HOC NETWORKS
 
REVERSIBLE WAVELET AND SPECTRAL TRANSFORMS FOR LOSSLESS COMPRESSION OF COLOR ...
REVERSIBLE WAVELET AND SPECTRAL TRANSFORMS FOR LOSSLESS COMPRESSION OF COLOR ...REVERSIBLE WAVELET AND SPECTRAL TRANSFORMS FOR LOSSLESS COMPRESSION OF COLOR ...
REVERSIBLE WAVELET AND SPECTRAL TRANSFORMS FOR LOSSLESS COMPRESSION OF COLOR ...
 
Visual thinking colin_ware_lectures_2013_5_color theory and color for informa...
Visual thinking colin_ware_lectures_2013_5_color theory and color for informa...Visual thinking colin_ware_lectures_2013_5_color theory and color for informa...
Visual thinking colin_ware_lectures_2013_5_color theory and color for informa...
 
Topology-Preserving Ordering of the RGB Space with an Evolutionary Algorithm
Topology-Preserving Ordering of the RGB Space with an Evolutionary AlgorithmTopology-Preserving Ordering of the RGB Space with an Evolutionary Algorithm
Topology-Preserving Ordering of the RGB Space with an Evolutionary Algorithm
 
Image Denoising Using Wavelet
Image Denoising Using WaveletImage Denoising Using Wavelet
Image Denoising Using Wavelet
 

More from Shao-Chuan Wang

Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine LearningShao-Chuan Wang
 
Beyond The Euclidean Distance: Creating effective visual codebooks using the ...
Beyond The Euclidean Distance: Creating effective visual codebooks using the ...Beyond The Euclidean Distance: Creating effective visual codebooks using the ...
Beyond The Euclidean Distance: Creating effective visual codebooks using the ...Shao-Chuan Wang
 
A Friendly Guide To Sparse Coding
A Friendly Guide To Sparse CodingA Friendly Guide To Sparse Coding
A Friendly Guide To Sparse CodingShao-Chuan Wang
 
An Exemplar Model For Learning Object Classes
An Exemplar Model For Learning Object ClassesAn Exemplar Model For Learning Object Classes
An Exemplar Model For Learning Object ClassesShao-Chuan Wang
 
Spatially Coherent Latent Topic Model For Concurrent Object Segmentation and ...
Spatially Coherent Latent Topic Model For Concurrent Object Segmentation and ...Spatially Coherent Latent Topic Model For Concurrent Object Segmentation and ...
Spatially Coherent Latent Topic Model For Concurrent Object Segmentation and ...Shao-Chuan Wang
 
Image Classification And Support Vector Machine
Image Classification And Support Vector MachineImage Classification And Support Vector Machine
Image Classification And Support Vector MachineShao-Chuan Wang
 

More from Shao-Chuan Wang (10)

Book Cover Recognition
Book Cover RecognitionBook Cover Recognition
Book Cover Recognition
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine Learning
 
Beyond The Euclidean Distance: Creating effective visual codebooks using the ...
Beyond The Euclidean Distance: Creating effective visual codebooks using the ...Beyond The Euclidean Distance: Creating effective visual codebooks using the ...
Beyond The Euclidean Distance: Creating effective visual codebooks using the ...
 
Self Taught Learning
Self Taught LearningSelf Taught Learning
Self Taught Learning
 
A Friendly Guide To Sparse Coding
A Friendly Guide To Sparse CodingA Friendly Guide To Sparse Coding
A Friendly Guide To Sparse Coding
 
An Exemplar Model For Learning Object Classes
An Exemplar Model For Learning Object ClassesAn Exemplar Model For Learning Object Classes
An Exemplar Model For Learning Object Classes
 
Spatially Coherent Latent Topic Model For Concurrent Object Segmentation and ...
Spatially Coherent Latent Topic Model For Concurrent Object Segmentation and ...Spatially Coherent Latent Topic Model For Concurrent Object Segmentation and ...
Spatially Coherent Latent Topic Model For Concurrent Object Segmentation and ...
 
Support Vector Machine
Support Vector MachineSupport Vector Machine
Support Vector Machine
 
About Python
About PythonAbout Python
About Python
 
Image Classification And Support Vector Machine
Image Classification And Support Vector MachineImage Classification And Support Vector Machine
Image Classification And Support Vector Machine
 

Recently uploaded

Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions
 

Recently uploaded (20)

Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptx
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food Manufacturing
 

Evaluation of how color descriptors recognize objects and scenes

  • 1. Evaluation of Color Descriptors for Object and Scene Recognition Authors: Koen E. A. van de Sande, Theo Gevers, and Cees G. M. Snoek @ University of Amsterdam Presenter: Shao-Chuan Wang
  • 2. Evaluation of Color Descriptors for Object and Scene Recognition Focus: Color features/descriptors on obj. and scene recognition Summary: The invariance of photometric transform aces and its effect on discriminative power. Conclusion: The usefulness of invariance is category-specific!
  • 3. Photometric transforms (1/2) Light intensity scale invariant Light intensity shift invariant
  • 4. Photometric transforms (1/2) Light intensity scale and shift invariant Light color change Light color change and shift
  • 5. Color Descriptors (1/1) Histograms RGB, Hue, Saturation, rgHistogram, … Color Moments: contain spatial info. Color SIFT: combined color and SIFT HSV-SIFT, Hue-SIFT, …
  • 6. Color Histograms (1/2) RGB-histogram Hue-histogram H and S are scale-invariant and shift-invariant w.r.t light intensity rg-histogram Scale-invariant Not shift-invariant Not that b is redudant Image from wikipedia
  • 7. Color Histograms (2/2) Transformed color Scale and shift-invariant w.r.t light intensity. Opponent color histogram O1,O2 shift invariant O3: intensity, no invariant
  • 8. Color SIFT Descriptors (1/3) HSV-SIFT SIFT over HSV channels Hue is unstable in gray axis Hue-SIFT (Van de Weijer 2006) Used Hue histogram weighing by its saturation Concatenate Hue histogram with SIFT Only SIFT is invariant; Hue histogram is not! (partial invariance) Hue Instability
  • 9. Color SIFT Descriptors (2/3) OpponentSIFT SIFT over all channels in the opponent color space. Shift-invariant to light intensity. W-SIFT Eliminate O1 and O2’s intensity information Scale-invariant to light intensity rg-SIFT SIFT over r,g spaces Scale and shift invariant, but not invariant to light color changes/shifts
  • 10. Color SIFT Descriptors (3/3) Transformed color SIFT SIFT over normalized transformed channels. Scale- and shift-invariant to light color changes and shift.
  • 11. Experiments Implementation: Scale-invariants points are detected by Harris-Laplace point detectors Color descriptors are computed over the area around the points; all regions are proportionally re-sampled to a uniform 60x60 patch. Cluster descriptors with k = 40 (images) k = 4000 (video) SVM classifier with EMD/chi-square kernel
  • 12. Benchmark (1/3) Image: PASCAL VOC 2007, over 20 object categories
  • 13. Benchmark (2/3) Most objs were categorized better under scale- and shift- invariant to light intensity Some, such as car and dining table, do not benefit from such invariance.
  • 14. Benchmark (3/3) Video: Mediamill Challenge, 39 object and scene categories
  • 15. Evaluation of Color Descriptors for Object and Scene Recognition Conclusion: W-SIFT and rgSIFT, in general, outperform other color descriptors. Light intensity info. Is important for some categories Usefulness of invariance is category-specific.