SlideShare a Scribd company logo
1 of 39
study Image and Deoth from a Conventional Camera with a Coded Apertrue Anat Levin, Rob Fergus,    Frédo Durand, William Freeman MIT CSAIL
Single input image real objects Coded Aperture output #1: Depth map output #2: all-infocused  image
Conventional aperture and depth of field Big aperture Object Focal plane Small aperture
Depth from defocus Camera sensor Lens Point spread function Focal plane http://groups.csail.mit.edu/graphics/CodedAperture/CodedAperture-LevinEtAl-SIGGRAPH07.ppt
Depth from defocus Camera sensor Lens Object Point spread function Focal plane http://groups.csail.mit.edu/graphics/CodedAperture/CodedAperture-LevinEtAl-SIGGRAPH07.ppt
Depth from defocus Camera sensor Lens Object Point spread function Focal plane http://groups.csail.mit.edu/graphics/CodedAperture/CodedAperture-LevinEtAl-SIGGRAPH07.ppt
Defocus as local convolution Calibrated  blur kernels at depth K Local observed  sub-window Sharp  sub-window Input defocused image Depth k=1 Depth k=2 Depth k=3
Introduction Estimation of depth – a branch of Computational Photography Most challenges of  y = fk * x ,[object Object],Input Ringing with the traditional Richardson-Lucyalgorithm ,[object Object],? Larger scale  ? Correct scale  ? Smaller scale
Related work – depth estimation Active methods – additional illumination sources ,[object Object],Nayar et al. ICCV 95 Zhang and Nayar, SIGGRAPH 06 Projection Defocus Analysis for Capture and Image Display, Zhang and Nayar, 06
Related work – depth estimation Passive methods – changes of focus  ,[object Object],Pentland, IEEE 87 Chaudhuri, Favaro et al. , 99 ,[object Object],Kundur and Hatzinakos , IEEE 96 		Levin,  NIPS 06 ,[object Object],Fenimore and Cannon, Optics 78
Related work – depth estimation ,[object Object]
Plenoptic /light field cameraAdelson and Wang, IEEE 92 	Ng et al., 05 ,[object Object],Cathey & Dowski, Optics 94, 95 1.Rays don't converge anymore 2.Image blur is the same for all depth 3.Blur spectrum does not have too many zeros CompPhoto06/html/lecturenotes/25_LightField_6.pdf
Overview Try deconvolving local input windows with different scaled filters: ? Larger scale  ? Correct scale  ? Smaller scale  Somehow: select best scale
Challenges & contributions Hard to de-convolve even when kernel is known 	IDEA 1: Natural images prior Hard to identify correct scale 	IDEA 2: Coded Aperture
Deconvolution is ill posed Solution 1: = ? Solution 2: = ?
IDEA 1: Natural images prior What makes images special? Natural Unnatural Image gradient Natural images have sparse gradients put a penalty on gradients
Deconvolution with prior Convolution error Derivatives prior 2 ? Low  Equal convolution error 2 ? High
Comparing deconvolution algorithms Richardson-Lucy Input “spread” gradients “localizes” gradients Gaussian prior Sparse prior
Statistical Model of Images “Deconvolution using natural image priors”, Levin et. al., ETAI 07 Spatial domain Frequency domain
Maximum a-posteriori P(x|y) likelyhood Image prior  (gradient here)  Gradient operator For Gaussian priors For sparse priors
Minimize deconvolution error
Deconvolution using a Gaussian prior Note: solved in the frequency domain in a few seconds for MB size file
Deconvolution using a sparse prior Using an iterative reweighted least squares process (IRLS) [Meer 2004; Levin and Weiss to appear] Cannot solve in frequency domain Note: solved in the frequency domain  around  1 hour on 2.4Ghz CPR for 2MB file
Iterative reweighted least squares process (IRLS)
Recall: Overview Try deconvolving local input windows with different scaled filters: ? Larger scale  ? Correct scale  ? Smaller scale  Somehow: select best scale Challenge: smaller scale not so different than correct
IDEA 2: Coded Aperture Mask (code) in aperture plane Make defocus patterns different from natural images and easier to discriminate Conventional aperture Our coded aperture
Lens with coded aperture Image of a defocused point light source Aperture pattern Camera sensor Lens with coded aperture Object Point spread function Focal plane
Why coded ? Coded aperture- reduce uncertainty in scale identification Conventional Coded Larger scale Correct scale Smaller scale
Why coded ? Coded aperture- reduce uncertainty in scale identification Conventional Coded Larger scale Correct scale Smaller scale
Fourier transforms of 1D slide through the blur pattern
Coded aperture: Scale estimation and division by zero spectrum spectrum spectrum spectrum spectrum Frequency Frequency Frequency Frequency Frequency Estimated image ? Observed image  = Filter, correct scale Division by zero Estimated image ?        spatial ringing = Filter, wrong scale
Division by zero with a conventional aperture ? spectrum spectrum spectrum spectrum spectrum Frequency Frequency Frequency Frequency Frequency Estimated image ? No zero at ω ! Observed image  = Filter, correct scale No zero at ω ! Tiny value at ω no spatial ringing Estimated image ? = Filter, wrong scale ω is zero !
Filter Selection Criterion The filter f has good depth discrimination - blurry image distributions Pk1(y) and Pk2(y) at depths k1 and k2 should  not be similar KL-divergence scores
Filter Design Practical constrains Binary filter to construct accurately Cut the filter from a single piece Avoid excessive radial distortion Avoid using the full aperture Diffraction impose a min size on the holes in the file Spec. 13x13 patterns with 1 mm holes Each pattern, 8 different  scales  Varying between 5~15 pixels in width
Filter Design Conventional  Conventional
Blur scale identification Not robust at high-frequency noise Un-normalized energy term λk  learn to minimize  the scale misclassification error on a set of traning images Ek is approximate by the reconstruction error by ML solution x* is the deblurred image
Regularizing depth estimation
Results
Applications Digital refocusing from a single image e.g.  Synthesis an all-focus image e.g.  Post-exposure

More Related Content

What's hot

Object Detection & Tracking
Object Detection & TrackingObject Detection & Tracking
Object Detection & TrackingAkshay Gujarathi
 
Moving Object Detection And Tracking Using CNN
Moving Object Detection And Tracking Using CNNMoving Object Detection And Tracking Using CNN
Moving Object Detection And Tracking Using CNNNITISHKUMAR1401
 
Image denoising algorithms
Image denoising algorithmsImage denoising algorithms
Image denoising algorithmsMohammad Sunny
 
Introduction to multiple object tracking
Introduction to multiple object trackingIntroduction to multiple object tracking
Introduction to multiple object trackingFan Yang
 
Object detection
Object detectionObject detection
Object detectionSomesh Vyas
 
Image Restoration (Digital Image Processing)
Image Restoration (Digital Image Processing)Image Restoration (Digital Image Processing)
Image Restoration (Digital Image Processing)Shajun Nisha
 
Motion estimation overview
Motion estimation overviewMotion estimation overview
Motion estimation overviewYoss Cohen
 
Object tracking presentation
Object tracking  presentationObject tracking  presentation
Object tracking presentationMrsShwetaBanait1
 
Digital Image Processing
Digital Image ProcessingDigital Image Processing
Digital Image ProcessingAzharo7
 
A Brief History of Object Detection / Tommi Kerola
A Brief History of Object Detection / Tommi KerolaA Brief History of Object Detection / Tommi Kerola
A Brief History of Object Detection / Tommi KerolaPreferred Networks
 
Faster R-CNN: Towards real-time object detection with region proposal network...
Faster R-CNN: Towards real-time object detection with region proposal network...Faster R-CNN: Towards real-time object detection with region proposal network...
Faster R-CNN: Towards real-time object detection with region proposal network...Universitat Politècnica de Catalunya
 
PR-132: SSD: Single Shot MultiBox Detector
PR-132: SSD: Single Shot MultiBox DetectorPR-132: SSD: Single Shot MultiBox Detector
PR-132: SSD: Single Shot MultiBox DetectorJinwon Lee
 
Anchor free object detection by deep learning
Anchor free object detection by deep learningAnchor free object detection by deep learning
Anchor free object detection by deep learningYu Huang
 
Digital Image Processing: Digital Image Fundamentals
Digital Image Processing: Digital Image FundamentalsDigital Image Processing: Digital Image Fundamentals
Digital Image Processing: Digital Image FundamentalsMostafa G. M. Mostafa
 
Introduction to object detection
Introduction to object detectionIntroduction to object detection
Introduction to object detectionBrodmann17
 
CVPR 2018 Paper Reading MobileNet V2
CVPR 2018 Paper Reading MobileNet V2CVPR 2018 Paper Reading MobileNet V2
CVPR 2018 Paper Reading MobileNet V2Khang Pham
 

What's hot (20)

Object Detection & Tracking
Object Detection & TrackingObject Detection & Tracking
Object Detection & Tracking
 
Moving Object Detection And Tracking Using CNN
Moving Object Detection And Tracking Using CNNMoving Object Detection And Tracking Using CNN
Moving Object Detection And Tracking Using CNN
 
Image denoising algorithms
Image denoising algorithmsImage denoising algorithms
Image denoising algorithms
 
Lzw coding technique for image compression
Lzw coding technique for image compressionLzw coding technique for image compression
Lzw coding technique for image compression
 
Object detection
Object detectionObject detection
Object detection
 
Introduction to multiple object tracking
Introduction to multiple object trackingIntroduction to multiple object tracking
Introduction to multiple object tracking
 
Object detection
Object detectionObject detection
Object detection
 
Image Restoration (Digital Image Processing)
Image Restoration (Digital Image Processing)Image Restoration (Digital Image Processing)
Image Restoration (Digital Image Processing)
 
Motion estimation overview
Motion estimation overviewMotion estimation overview
Motion estimation overview
 
Medical Image Processing
Medical Image ProcessingMedical Image Processing
Medical Image Processing
 
Compressed Sensing In Spectral Imaging
Compressed Sensing In Spectral Imaging  Compressed Sensing In Spectral Imaging
Compressed Sensing In Spectral Imaging
 
Object tracking presentation
Object tracking  presentationObject tracking  presentation
Object tracking presentation
 
Digital Image Processing
Digital Image ProcessingDigital Image Processing
Digital Image Processing
 
A Brief History of Object Detection / Tommi Kerola
A Brief History of Object Detection / Tommi KerolaA Brief History of Object Detection / Tommi Kerola
A Brief History of Object Detection / Tommi Kerola
 
Faster R-CNN: Towards real-time object detection with region proposal network...
Faster R-CNN: Towards real-time object detection with region proposal network...Faster R-CNN: Towards real-time object detection with region proposal network...
Faster R-CNN: Towards real-time object detection with region proposal network...
 
PR-132: SSD: Single Shot MultiBox Detector
PR-132: SSD: Single Shot MultiBox DetectorPR-132: SSD: Single Shot MultiBox Detector
PR-132: SSD: Single Shot MultiBox Detector
 
Anchor free object detection by deep learning
Anchor free object detection by deep learningAnchor free object detection by deep learning
Anchor free object detection by deep learning
 
Digital Image Processing: Digital Image Fundamentals
Digital Image Processing: Digital Image FundamentalsDigital Image Processing: Digital Image Fundamentals
Digital Image Processing: Digital Image Fundamentals
 
Introduction to object detection
Introduction to object detectionIntroduction to object detection
Introduction to object detection
 
CVPR 2018 Paper Reading MobileNet V2
CVPR 2018 Paper Reading MobileNet V2CVPR 2018 Paper Reading MobileNet V2
CVPR 2018 Paper Reading MobileNet V2
 

Viewers also liked

Multi Aperture Photography
Multi Aperture PhotographyMulti Aperture Photography
Multi Aperture Photographymakrofajj
 
My presentation Jose M. Escalante Fernandez
My presentation Jose M. Escalante FernandezMy presentation Jose M. Escalante Fernandez
My presentation Jose M. Escalante FernandezEscalante Supertramp
 
Lytro Light Field Camera: from scientific research to a $50-million business
Lytro Light Field Camera: from scientific research to a $50-million businessLytro Light Field Camera: from scientific research to a $50-million business
Lytro Light Field Camera: from scientific research to a $50-million businessWeili Shi
 
IMAGE ENHANCEMENT IN CASE OF UNEVEN ILLUMINATION USING VARIABLE THRESHOLDING ...
IMAGE ENHANCEMENT IN CASE OF UNEVEN ILLUMINATION USING VARIABLE THRESHOLDING ...IMAGE ENHANCEMENT IN CASE OF UNEVEN ILLUMINATION USING VARIABLE THRESHOLDING ...
IMAGE ENHANCEMENT IN CASE OF UNEVEN ILLUMINATION USING VARIABLE THRESHOLDING ...ijsrd.com
 
A novel approach for denoising and enhancement of extremely low light video
A novel approach for denoising and enhancement of extremely low light videoA novel approach for denoising and enhancement of extremely low light video
A novel approach for denoising and enhancement of extremely low light videoI3E Technologies
 
Optical Biometry Measurements For Future Iol’S
Optical Biometry Measurements For Future Iol’SOptical Biometry Measurements For Future Iol’S
Optical Biometry Measurements For Future Iol’Smeikocat
 
Aperture presentation 1
Aperture presentation 1Aperture presentation 1
Aperture presentation 1laroos0815
 
Digital Image Processing_ ch2 enhancement spatial-domain
Digital Image Processing_ ch2 enhancement spatial-domainDigital Image Processing_ ch2 enhancement spatial-domain
Digital Image Processing_ ch2 enhancement spatial-domainMalik obeisat
 
Introduction to image contrast and enhancement method
Introduction to image contrast and enhancement methodIntroduction to image contrast and enhancement method
Introduction to image contrast and enhancement methodAbhishekvb
 
Image enhancement ppt nal2
Image enhancement ppt nal2Image enhancement ppt nal2
Image enhancement ppt nal2Surabhi Ks
 
Image enhancement
Image enhancementImage enhancement
Image enhancementAyaelshiwi
 
Optical rotatory dispersion
Optical rotatory dispersionOptical rotatory dispersion
Optical rotatory dispersionSujit Patel
 
The Low Vision Examination
The Low Vision ExaminationThe Low Vision Examination
The Low Vision ExaminationHossein Mirzaie
 
Digital Image Processing Fundamental
Digital Image Processing FundamentalDigital Image Processing Fundamental
Digital Image Processing FundamentalThuong Nguyen Canh
 
Final Project Report on Image processing based intelligent traffic control sy...
Final Project Report on Image processing based intelligent traffic control sy...Final Project Report on Image processing based intelligent traffic control sy...
Final Project Report on Image processing based intelligent traffic control sy...Louise Antonio
 

Viewers also liked (20)

Multi Aperture Photography
Multi Aperture PhotographyMulti Aperture Photography
Multi Aperture Photography
 
How to come up with new Ideas Raskar Feb09
How to come up with new Ideas Raskar Feb09How to come up with new Ideas Raskar Feb09
How to come up with new Ideas Raskar Feb09
 
My presentation Jose M. Escalante Fernandez
My presentation Jose M. Escalante FernandezMy presentation Jose M. Escalante Fernandez
My presentation Jose M. Escalante Fernandez
 
Lytro Light Field Camera: from scientific research to a $50-million business
Lytro Light Field Camera: from scientific research to a $50-million businessLytro Light Field Camera: from scientific research to a $50-million business
Lytro Light Field Camera: from scientific research to a $50-million business
 
IMAGE ENHANCEMENT IN CASE OF UNEVEN ILLUMINATION USING VARIABLE THRESHOLDING ...
IMAGE ENHANCEMENT IN CASE OF UNEVEN ILLUMINATION USING VARIABLE THRESHOLDING ...IMAGE ENHANCEMENT IN CASE OF UNEVEN ILLUMINATION USING VARIABLE THRESHOLDING ...
IMAGE ENHANCEMENT IN CASE OF UNEVEN ILLUMINATION USING VARIABLE THRESHOLDING ...
 
A novel approach for denoising and enhancement of extremely low light video
A novel approach for denoising and enhancement of extremely low light videoA novel approach for denoising and enhancement of extremely low light video
A novel approach for denoising and enhancement of extremely low light video
 
White balance Task
White balance TaskWhite balance Task
White balance Task
 
Optical Biometry Measurements For Future Iol’S
Optical Biometry Measurements For Future Iol’SOptical Biometry Measurements For Future Iol’S
Optical Biometry Measurements For Future Iol’S
 
Aperture presentation 1
Aperture presentation 1Aperture presentation 1
Aperture presentation 1
 
Digital Image Processing_ ch2 enhancement spatial-domain
Digital Image Processing_ ch2 enhancement spatial-domainDigital Image Processing_ ch2 enhancement spatial-domain
Digital Image Processing_ ch2 enhancement spatial-domain
 
Introduction to image contrast and enhancement method
Introduction to image contrast and enhancement methodIntroduction to image contrast and enhancement method
Introduction to image contrast and enhancement method
 
The Light Field Stereoscope | SIGGRAPH 2015
The Light Field Stereoscope | SIGGRAPH 2015The Light Field Stereoscope | SIGGRAPH 2015
The Light Field Stereoscope | SIGGRAPH 2015
 
Image enhancement ppt nal2
Image enhancement ppt nal2Image enhancement ppt nal2
Image enhancement ppt nal2
 
Image enhancement
Image enhancementImage enhancement
Image enhancement
 
Optical rotatory dispersion
Optical rotatory dispersionOptical rotatory dispersion
Optical rotatory dispersion
 
The Low Vision Examination
The Low Vision ExaminationThe Low Vision Examination
The Low Vision Examination
 
Optical Computing
Optical ComputingOptical Computing
Optical Computing
 
Digital Image Processing Fundamental
Digital Image Processing FundamentalDigital Image Processing Fundamental
Digital Image Processing Fundamental
 
Final Project Report on Image processing based intelligent traffic control sy...
Final Project Report on Image processing based intelligent traffic control sy...Final Project Report on Image processing based intelligent traffic control sy...
Final Project Report on Image processing based intelligent traffic control sy...
 
Quantum dots ppt
Quantum dots pptQuantum dots ppt
Quantum dots ppt
 

Similar to study Coded Aperture

Human-Computer Interactive Systems
Human-Computer Interactive SystemsHuman-Computer Interactive Systems
Human-Computer Interactive SystemsVertigo4
 
Depth Fusion from RGB and Depth Sensors by Deep Learning
Depth Fusion from RGB and Depth Sensors by Deep LearningDepth Fusion from RGB and Depth Sensors by Deep Learning
Depth Fusion from RGB and Depth Sensors by Deep LearningYu Huang
 
EC4160-lect 1,2.ppt
EC4160-lect 1,2.pptEC4160-lect 1,2.ppt
EC4160-lect 1,2.pptssuser812128
 
Exploring Methods to Improve Edge Detection with Canny Algorithm
Exploring Methods to Improve Edge Detection with Canny AlgorithmExploring Methods to Improve Edge Detection with Canny Algorithm
Exploring Methods to Improve Edge Detection with Canny AlgorithmPrasad Thakur
 
UIUC CS 498 - Computational Photography - Final project presentation
UIUC CS 498 - Computational Photography - Final project presentation UIUC CS 498 - Computational Photography - Final project presentation
UIUC CS 498 - Computational Photography - Final project presentation Jia-Bin Huang
 
ADAPTIVE FILTER FOR DENOISING 3D DATA CAPTURED BY DEPTH SENSORS
ADAPTIVE FILTER FOR DENOISING 3D DATA CAPTURED BY DEPTH SENSORSADAPTIVE FILTER FOR DENOISING 3D DATA CAPTURED BY DEPTH SENSORS
ADAPTIVE FILTER FOR DENOISING 3D DATA CAPTURED BY DEPTH SENSORSSoma Boubou
 
Efficient LDI Representation (TPCG 2008)
Efficient LDI Representation (TPCG 2008)Efficient LDI Representation (TPCG 2008)
Efficient LDI Representation (TPCG 2008)Matthias Trapp
 
Virtual Reality 3D home applications
Virtual Reality 3D home applicationsVirtual Reality 3D home applications
Virtual Reality 3D home applicationsslebrun
 
End-to-end Optimization of Cameras and Image Processing - SIGGRAPH 2018
End-to-end Optimization of Cameras and Image Processing - SIGGRAPH 2018End-to-end Optimization of Cameras and Image Processing - SIGGRAPH 2018
End-to-end Optimization of Cameras and Image Processing - SIGGRAPH 2018StanfordComputationalImaging
 
TOWARDS OPTIMALITY OF IMAGE SEGMENTATION PART- I
TOWARDS OPTIMALITY OF IMAGE SEGMENTATION PART- ITOWARDS OPTIMALITY OF IMAGE SEGMENTATION PART- I
TOWARDS OPTIMALITY OF IMAGE SEGMENTATION PART- IAnish Acharya
 
Ee 417 Senior Design
Ee 417 Senior DesignEe 417 Senior Design
Ee 417 Senior Designcrouchj1
 

Similar to study Coded Aperture (20)

Defocus magnification
Defocus magnificationDefocus magnification
Defocus magnification
 
Human-Computer Interactive Systems
Human-Computer Interactive SystemsHuman-Computer Interactive Systems
Human-Computer Interactive Systems
 
Raskar Banff
Raskar BanffRaskar Banff
Raskar Banff
 
Raskar Coded Opto Charlotte
Raskar Coded Opto CharlotteRaskar Coded Opto Charlotte
Raskar Coded Opto Charlotte
 
Raskar Paris Nov08
Raskar Paris Nov08Raskar Paris Nov08
Raskar Paris Nov08
 
Depth Fusion from RGB and Depth Sensors by Deep Learning
Depth Fusion from RGB and Depth Sensors by Deep LearningDepth Fusion from RGB and Depth Sensors by Deep Learning
Depth Fusion from RGB and Depth Sensors by Deep Learning
 
EC4160-lect 1,2.ppt
EC4160-lect 1,2.pptEC4160-lect 1,2.ppt
EC4160-lect 1,2.ppt
 
Exploring Methods to Improve Edge Detection with Canny Algorithm
Exploring Methods to Improve Edge Detection with Canny AlgorithmExploring Methods to Improve Edge Detection with Canny Algorithm
Exploring Methods to Improve Edge Detection with Canny Algorithm
 
Raskar Ilp Oct08 Web
Raskar Ilp Oct08 WebRaskar Ilp Oct08 Web
Raskar Ilp Oct08 Web
 
Dr,system abhishek
Dr,system abhishekDr,system abhishek
Dr,system abhishek
 
PhD_ppt_2012
PhD_ppt_2012PhD_ppt_2012
PhD_ppt_2012
 
UIUC CS 498 - Computational Photography - Final project presentation
UIUC CS 498 - Computational Photography - Final project presentation UIUC CS 498 - Computational Photography - Final project presentation
UIUC CS 498 - Computational Photography - Final project presentation
 
WT in IP.ppt
WT in IP.pptWT in IP.ppt
WT in IP.ppt
 
ADAPTIVE FILTER FOR DENOISING 3D DATA CAPTURED BY DEPTH SENSORS
ADAPTIVE FILTER FOR DENOISING 3D DATA CAPTURED BY DEPTH SENSORSADAPTIVE FILTER FOR DENOISING 3D DATA CAPTURED BY DEPTH SENSORS
ADAPTIVE FILTER FOR DENOISING 3D DATA CAPTURED BY DEPTH SENSORS
 
Efficient LDI Representation (TPCG 2008)
Efficient LDI Representation (TPCG 2008)Efficient LDI Representation (TPCG 2008)
Efficient LDI Representation (TPCG 2008)
 
Virtual Reality 3D home applications
Virtual Reality 3D home applicationsVirtual Reality 3D home applications
Virtual Reality 3D home applications
 
End-to-end Optimization of Cameras and Image Processing - SIGGRAPH 2018
End-to-end Optimization of Cameras and Image Processing - SIGGRAPH 2018End-to-end Optimization of Cameras and Image Processing - SIGGRAPH 2018
End-to-end Optimization of Cameras and Image Processing - SIGGRAPH 2018
 
TOWARDS OPTIMALITY OF IMAGE SEGMENTATION PART- I
TOWARDS OPTIMALITY OF IMAGE SEGMENTATION PART- ITOWARDS OPTIMALITY OF IMAGE SEGMENTATION PART- I
TOWARDS OPTIMALITY OF IMAGE SEGMENTATION PART- I
 
project_final
project_finalproject_final
project_final
 
Ee 417 Senior Design
Ee 417 Senior DesignEe 417 Senior Design
Ee 417 Senior Design
 

More from Chiamin Hsu

study Domain Transform for Edge-Aware Image and Video Processing
study Domain Transform for Edge-Aware Image and Video Processingstudy Domain Transform for Edge-Aware Image and Video Processing
study Domain Transform for Edge-Aware Image and Video ProcessingChiamin Hsu
 
study Image and video abstraction by multi scale anisotropic kuwahara
study  Image and video abstraction by multi scale anisotropic kuwaharastudy  Image and video abstraction by multi scale anisotropic kuwahara
study Image and video abstraction by multi scale anisotropic kuwaharaChiamin Hsu
 
study Accelerating Spatially Varying Gaussian Filters
study Accelerating Spatially Varying Gaussian Filtersstudy Accelerating Spatially Varying Gaussian Filters
study Accelerating Spatially Varying Gaussian FiltersChiamin Hsu
 
stduy Edge-Based Image Coarsening
stduy Edge-Based Image Coarseningstduy Edge-Based Image Coarsening
stduy Edge-Based Image CoarseningChiamin Hsu
 
study Streaming Multigrid For Gradient Domain Operations On Large Images
study Streaming Multigrid For Gradient Domain Operations On Large Imagesstudy Streaming Multigrid For Gradient Domain Operations On Large Images
study Streaming Multigrid For Gradient Domain Operations On Large ImagesChiamin Hsu
 
study Shading Based Surface Editing
study Shading Based Surface Editingstudy Shading Based Surface Editing
study Shading Based Surface EditingChiamin Hsu
 
study Diffusion Curves: A Vector Representation for Smooth-Shaded Images
study Diffusion Curves: A Vector Representation for Smooth-Shaded Imagesstudy Diffusion Curves: A Vector Representation for Smooth-Shaded Images
study Diffusion Curves: A Vector Representation for Smooth-Shaded ImagesChiamin Hsu
 
study Image Vectorization using Optimized Gradeint Meshes
study Image Vectorization using Optimized Gradeint Meshesstudy Image Vectorization using Optimized Gradeint Meshes
study Image Vectorization using Optimized Gradeint MeshesChiamin Hsu
 
study Seam Carving For Content Aware Image Resizing
study Seam Carving For Content Aware Image Resizingstudy Seam Carving For Content Aware Image Resizing
study Seam Carving For Content Aware Image ResizingChiamin Hsu
 
study Latent Doodle Space
study Latent Doodle Spacestudy Latent Doodle Space
study Latent Doodle SpaceChiamin Hsu
 
study Active Refocusing Of Images And Videos
study Active Refocusing Of Images And Videosstudy Active Refocusing Of Images And Videos
study Active Refocusing Of Images And VideosChiamin Hsu
 
study Dappled Photography
study Dappled Photographystudy Dappled Photography
study Dappled PhotographyChiamin Hsu
 

More from Chiamin Hsu (12)

study Domain Transform for Edge-Aware Image and Video Processing
study Domain Transform for Edge-Aware Image and Video Processingstudy Domain Transform for Edge-Aware Image and Video Processing
study Domain Transform for Edge-Aware Image and Video Processing
 
study Image and video abstraction by multi scale anisotropic kuwahara
study  Image and video abstraction by multi scale anisotropic kuwaharastudy  Image and video abstraction by multi scale anisotropic kuwahara
study Image and video abstraction by multi scale anisotropic kuwahara
 
study Accelerating Spatially Varying Gaussian Filters
study Accelerating Spatially Varying Gaussian Filtersstudy Accelerating Spatially Varying Gaussian Filters
study Accelerating Spatially Varying Gaussian Filters
 
stduy Edge-Based Image Coarsening
stduy Edge-Based Image Coarseningstduy Edge-Based Image Coarsening
stduy Edge-Based Image Coarsening
 
study Streaming Multigrid For Gradient Domain Operations On Large Images
study Streaming Multigrid For Gradient Domain Operations On Large Imagesstudy Streaming Multigrid For Gradient Domain Operations On Large Images
study Streaming Multigrid For Gradient Domain Operations On Large Images
 
study Shading Based Surface Editing
study Shading Based Surface Editingstudy Shading Based Surface Editing
study Shading Based Surface Editing
 
study Diffusion Curves: A Vector Representation for Smooth-Shaded Images
study Diffusion Curves: A Vector Representation for Smooth-Shaded Imagesstudy Diffusion Curves: A Vector Representation for Smooth-Shaded Images
study Diffusion Curves: A Vector Representation for Smooth-Shaded Images
 
study Image Vectorization using Optimized Gradeint Meshes
study Image Vectorization using Optimized Gradeint Meshesstudy Image Vectorization using Optimized Gradeint Meshes
study Image Vectorization using Optimized Gradeint Meshes
 
study Seam Carving For Content Aware Image Resizing
study Seam Carving For Content Aware Image Resizingstudy Seam Carving For Content Aware Image Resizing
study Seam Carving For Content Aware Image Resizing
 
study Latent Doodle Space
study Latent Doodle Spacestudy Latent Doodle Space
study Latent Doodle Space
 
study Active Refocusing Of Images And Videos
study Active Refocusing Of Images And Videosstudy Active Refocusing Of Images And Videos
study Active Refocusing Of Images And Videos
 
study Dappled Photography
study Dappled Photographystudy Dappled Photography
study Dappled Photography
 

Recently uploaded

08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsRoshan Dwivedi
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 

Recently uploaded (20)

08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 

study Coded Aperture

  • 1. study Image and Deoth from a Conventional Camera with a Coded Apertrue Anat Levin, Rob Fergus, Frédo Durand, William Freeman MIT CSAIL
  • 2. Single input image real objects Coded Aperture output #1: Depth map output #2: all-infocused image
  • 3. Conventional aperture and depth of field Big aperture Object Focal plane Small aperture
  • 4. Depth from defocus Camera sensor Lens Point spread function Focal plane http://groups.csail.mit.edu/graphics/CodedAperture/CodedAperture-LevinEtAl-SIGGRAPH07.ppt
  • 5. Depth from defocus Camera sensor Lens Object Point spread function Focal plane http://groups.csail.mit.edu/graphics/CodedAperture/CodedAperture-LevinEtAl-SIGGRAPH07.ppt
  • 6. Depth from defocus Camera sensor Lens Object Point spread function Focal plane http://groups.csail.mit.edu/graphics/CodedAperture/CodedAperture-LevinEtAl-SIGGRAPH07.ppt
  • 7. Defocus as local convolution Calibrated blur kernels at depth K Local observed sub-window Sharp sub-window Input defocused image Depth k=1 Depth k=2 Depth k=3
  • 8.
  • 9.
  • 10.
  • 11.
  • 12.
  • 13. Overview Try deconvolving local input windows with different scaled filters: ? Larger scale ? Correct scale ? Smaller scale Somehow: select best scale
  • 14. Challenges & contributions Hard to de-convolve even when kernel is known IDEA 1: Natural images prior Hard to identify correct scale IDEA 2: Coded Aperture
  • 15. Deconvolution is ill posed Solution 1: = ? Solution 2: = ?
  • 16. IDEA 1: Natural images prior What makes images special? Natural Unnatural Image gradient Natural images have sparse gradients put a penalty on gradients
  • 17. Deconvolution with prior Convolution error Derivatives prior 2 ? Low Equal convolution error 2 ? High
  • 18. Comparing deconvolution algorithms Richardson-Lucy Input “spread” gradients “localizes” gradients Gaussian prior Sparse prior
  • 19. Statistical Model of Images “Deconvolution using natural image priors”, Levin et. al., ETAI 07 Spatial domain Frequency domain
  • 20. Maximum a-posteriori P(x|y) likelyhood Image prior (gradient here) Gradient operator For Gaussian priors For sparse priors
  • 22. Deconvolution using a Gaussian prior Note: solved in the frequency domain in a few seconds for MB size file
  • 23. Deconvolution using a sparse prior Using an iterative reweighted least squares process (IRLS) [Meer 2004; Levin and Weiss to appear] Cannot solve in frequency domain Note: solved in the frequency domain around 1 hour on 2.4Ghz CPR for 2MB file
  • 24. Iterative reweighted least squares process (IRLS)
  • 25. Recall: Overview Try deconvolving local input windows with different scaled filters: ? Larger scale ? Correct scale ? Smaller scale Somehow: select best scale Challenge: smaller scale not so different than correct
  • 26. IDEA 2: Coded Aperture Mask (code) in aperture plane Make defocus patterns different from natural images and easier to discriminate Conventional aperture Our coded aperture
  • 27. Lens with coded aperture Image of a defocused point light source Aperture pattern Camera sensor Lens with coded aperture Object Point spread function Focal plane
  • 28. Why coded ? Coded aperture- reduce uncertainty in scale identification Conventional Coded Larger scale Correct scale Smaller scale
  • 29. Why coded ? Coded aperture- reduce uncertainty in scale identification Conventional Coded Larger scale Correct scale Smaller scale
  • 30. Fourier transforms of 1D slide through the blur pattern
  • 31. Coded aperture: Scale estimation and division by zero spectrum spectrum spectrum spectrum spectrum Frequency Frequency Frequency Frequency Frequency Estimated image ? Observed image = Filter, correct scale Division by zero Estimated image ? spatial ringing = Filter, wrong scale
  • 32. Division by zero with a conventional aperture ? spectrum spectrum spectrum spectrum spectrum Frequency Frequency Frequency Frequency Frequency Estimated image ? No zero at ω ! Observed image = Filter, correct scale No zero at ω ! Tiny value at ω no spatial ringing Estimated image ? = Filter, wrong scale ω is zero !
  • 33. Filter Selection Criterion The filter f has good depth discrimination - blurry image distributions Pk1(y) and Pk2(y) at depths k1 and k2 should not be similar KL-divergence scores
  • 34. Filter Design Practical constrains Binary filter to construct accurately Cut the filter from a single piece Avoid excessive radial distortion Avoid using the full aperture Diffraction impose a min size on the holes in the file Spec. 13x13 patterns with 1 mm holes Each pattern, 8 different scales Varying between 5~15 pixels in width
  • 36. Blur scale identification Not robust at high-frequency noise Un-normalized energy term λk learn to minimize the scale misclassification error on a set of traning images Ek is approximate by the reconstruction error by ML solution x* is the deblurred image
  • 39. Applications Digital refocusing from a single image e.g. Synthesis an all-focus image e.g. Post-exposure
  • 40. Conclusion Pros. All-infocus image and depth at a single shot No loss of image resolution (compared with Plenoptic camera) Simple modification Coded aperture Conventional aperture Cons. 50 % light is blocked Depth is coarse May need manual correction