SlideShare a Scribd company logo
1 of 48
Seam Carving for Content-Aware Image Resizing Shai  Aidan (Mitsubishi Electric Research Labs) Ariel Shamir (The Interdisciplinary Center & MERL) ACM SIGGRAPH 2007
Resize Seam carving & insertion
Abstract Seams are optimal 8-connected paths of pixels cross the image Carving out or inserting seams to achieve content-aware resizing
Outline Introduction Background Seam-carving operator Discrete image resizing Multi-size images Limitations Conclusions and future work
Introduction
Motivation HTML can support dynamic changes of page layout and text.  Why can not an image deform to fit different layout automatically ? iGoogle How about aspect ratio of an image , such as fitting photo into PDA or phone cells ? Solution ? Resize – content independent Crop – remove pixels from the image periphery only
Basic Idea of Seam-Carving Use energy function to define the importanceof pixels Define seam-carving image operator Image reduction Carving out seams - the connected low energy pixels crossing the image  Preserving the image structure Image enlarging Insert seams on low energy area The order of seam insertion ensures a balance between the  original image content and the artificially inserted pixels
Application Discrete image resizing Aspect Ration Change,  Image Retarget,  Image Enlarging,  Content Amplification,  Seam Carving in gradient domain, Object Removal Multi-size images An image can continuously change their size in a content-aware manner Storing the order of seam removal and insertion
background
Image Retarget Seek to change the size of the image while maintaining the important features  Face detector An automatic thumbnail creation [Suh03]   ROI Fisheye-View warp [Liu and Gleicher 05, 06] Visual saliency []  [Suh 03] [Selur 04,  decompose image to foreground obj and background origin
Saliency map [Itti IEEE99] Simulate neuroscience of human visual system Pyramid tech. to compute 3 feature maps, color, intensity and orientation [Suh 03],  an automatic thumbnail creation, based on either a saliency map or the output of a face detector [Chen 03],  adapting most important region of images to mobile devices.
[Liu 03],  suggesting to trade time for space. Given a collection of regions of interest, they construct an optimal path through these regions and display them serially.
[Santella et al. 06] use eye tracking, in addition to composition rules to crop images intelligently.
ROI (Region-Of-Interest) Such a method was proposed by [Liu and Gleicher 05, 06] for image and video retargeting. For image retargeting they find ROI and construct a novel Fisheye-View warp that essentially applies a piecewise linear scaling function in each dimension to the image. This way the ROI is maintained while the rest of the image is warped. The retargeting can be done in interactive rates, once the ROI is found, so the user can control the desired size of the image by moving a slider. In their video retargeting work they use a combination of image and saliency maps to find the ROI. Then they use a combination of cropping, virtual pan and shot cuts to retarget the video frames.
Feature-aware warping The first solution to the general problem of warping an image into an arbitrary shape while preserving user-specified features was recently proposed by [Gal et al. 06].  The feature-aware warping is achieved by a particular formulation of the Laplacian editing technique, suited to accommodate similarity constraints on parts of the domain.  Since local constraints are propagated by the global optimization process, not all the constraints can always be satisfied at once
Seam Perfect seams to combine parts of a set of photo into a single composite picture [Agarwala et al. 04]  Drag-and-Drop Pasting that extends the Poisson Image Editing to computer an optimal boundary (seam) between the source picture and target images [Jia et al. 06] AutoCollage,  a program that automatically creates a collage image from a collection of images.  [Rother et al. 06] Simultaneously solve matting and compositing. They allow the user to scale the size of the foreground object and paste it back on the original background. [Wang , Cohen 06] evaluated several cost functions for seamless image stitching and concluded that minimizing an L1error norm between the gradients of the stitched image and the gradients of the input images performed well in general [Zomet et al. 05]
Sear Optimal Seam Dijkstra’s shortest path algorithm [98] Dynamic programming [Efros 01] Graph cuts [Kwatra 03]
Seam-carving operator
Strategies of Image Reduction Original e1 energy Optimal  global remove the lowest energy pixels Pixel remove the least energy in each row
Strategies of Image Reduction Original e1 energy Column removing columns with minimal energy Crop find a sub-win with the highest energy
Vertical Seam
Horizontal Seam
Optimal Seam Search
Optimal Seam Search S Dynamic Programming G
e1 energy
Image Energy Preservation The average energy of all pixels during resizing
Energy Functions L1 and L2-norm of the gradient,  saliency measure [Itti 99]
Histogram of Gradient  (HoG) ,[object Object],Dividing the image window into cells For each cell accumulating a local 1-D histogram of gradient directions Normalize cells by the measure of local histogram energy over larger blocks The average gradient image Weighted R-HOG descriptor R-HOG descriptor
Energy Functions Histogram of Gradient  (HoG) [Dalal and Triggs 95] max(HoG(I(x,y)) makes sure the seams run parallel to the edge of objects and not cross them
Energy Functions Entropy  Compute the entropy over a 9 x 9 window and add it to e1 eEntropy(x,y) = +  e1 (x,y)
Energy Functions Segmentation and L1 Image segmentation [Christoudias 02]  Apply e1 on the results
No single e function performs well across all images Similar range for resizing e1 or eHoG works well
Discrete image resizing  Aspect Ratio Change, Retargeting with Optimal Seams-Order,  Image Enlarging, Content Amplification,
Aspect Ratio Change Carving-out /insert seams Original Original Original 1D aspect ratio changing
2D aspect ratio changing Optimal Seams-Order Search + DynamicProgramming =  + min
Retargeting with Optimal Seams-Order h-first alternate v-first Transport map Original optimal
Image Enlarging Find first k seams for removal Duplicate them in order to arrive at I(-k) origin I(t): smaller image after t seam-carving I(-k): enlarged image after k seam insertion t I(-1) I(-k) I(t) I(-k) enlarged image insert seams in order of removal
Image Enlarging (>50%) origin Break into several steps Each step does not enlarge the size of image more than a fraction
Content Amplification Amplified Original
Seam Carving in the Gradient Domain Seam + Poisson Reconstruction [Perez 03] Compute e function Work on the gradient domain Remove seams from the x and y derivatives of the original image Use Poisson Reconstruction original retarget retarget in  Gradient Domain
Object Removal Mark the removing target Remove seams until all the marked pixels are gone * Employ seam insertion to maintain the original size
Object Removal Origin
Multi-size images Store the pre-computed  representation that encodes, for each pixel in V/H map The index of the seam that removed it The negative index of the seam that inserted it Blue (first seam)  Red (last seam) origin V(i,j)=t : pixel (i,j) removed by t-th vertical seam H(i,j)=t : pixel (i,j) removed by t-th horizontal seam
Limitations Seam-Carving  does not work well on all images 	Ex:  face Origin Crop Scale Constraint the face Face the flower Bottom up feature detection
Limitations The amount of content Too density, no “less” important area The layout of the image content origin origin
Conclusions Present a content-aware resizing using the seam-carving image operator Seams are the optimal paths on a single image Carve-out seams Insert seams Application of seam-carving operator Aspect ratio change, image retargeting, content amplification, object removal Multi-size images that support continuous resizingin real-time
Future Work Video resizing Combination of scaling and seam-carving Define more robust multi-size image Better solution to combine horizontal and vertical seams in multi-size image
END

More Related Content

What's hot

Modern face recognition with deep learning
Modern face recognition with deep learningModern face recognition with deep learning
Modern face recognition with deep learningmarada0033
 
Introduction to asp.net
Introduction to asp.netIntroduction to asp.net
Introduction to asp.netshan km
 
HTML5 Canvas - Basics.pptx
HTML5 Canvas - Basics.pptxHTML5 Canvas - Basics.pptx
HTML5 Canvas - Basics.pptxAhmadAbba6
 
android layouts
android layoutsandroid layouts
android layoutsDeepa Rani
 
Back face detection
Back face detectionBack face detection
Back face detectionPooja Dixit
 
CSS Day: CSS Grid Layout
CSS Day: CSS Grid Layout CSS Day: CSS Grid Layout
CSS Day: CSS Grid Layout Rachel Andrew
 
Decision making and loop in C#
Decision making and loop in C#Decision making and loop in C#
Decision making and loop in C#Prasanna Kumar SM
 
Knowledge discovery process
Knowledge discovery process Knowledge discovery process
Knowledge discovery process Shuvra Ghosh
 

What's hot (20)

Java socket programming
Java socket programmingJava socket programming
Java socket programming
 
Characteristics of c#
Characteristics of c#Characteristics of c#
Characteristics of c#
 
Modern face recognition with deep learning
Modern face recognition with deep learningModern face recognition with deep learning
Modern face recognition with deep learning
 
Final year ppt
Final year pptFinal year ppt
Final year ppt
 
Depth Buffer Method
Depth Buffer MethodDepth Buffer Method
Depth Buffer Method
 
Controls
ControlsControls
Controls
 
Introduction to .NET Framework
Introduction to .NET FrameworkIntroduction to .NET Framework
Introduction to .NET Framework
 
03 Data Mining Techniques
03 Data Mining Techniques03 Data Mining Techniques
03 Data Mining Techniques
 
Introduction to asp.net
Introduction to asp.netIntroduction to asp.net
Introduction to asp.net
 
Sq lite database
Sq lite databaseSq lite database
Sq lite database
 
OpenGL Basics
OpenGL BasicsOpenGL Basics
OpenGL Basics
 
Spline representations
Spline representationsSpline representations
Spline representations
 
Of Lambdas and LINQ
Of Lambdas and LINQOf Lambdas and LINQ
Of Lambdas and LINQ
 
HTML5 Canvas - Basics.pptx
HTML5 Canvas - Basics.pptxHTML5 Canvas - Basics.pptx
HTML5 Canvas - Basics.pptx
 
android layouts
android layoutsandroid layouts
android layouts
 
Back face detection
Back face detectionBack face detection
Back face detection
 
Web usage mining
Web usage miningWeb usage mining
Web usage mining
 
CSS Day: CSS Grid Layout
CSS Day: CSS Grid Layout CSS Day: CSS Grid Layout
CSS Day: CSS Grid Layout
 
Decision making and loop in C#
Decision making and loop in C#Decision making and loop in C#
Decision making and loop in C#
 
Knowledge discovery process
Knowledge discovery process Knowledge discovery process
Knowledge discovery process
 

Similar to study Seam Carving For Content Aware Image Resizing

Medial Axis Transformation based Skeletonzation of Image Patterns using Image...
Medial Axis Transformation based Skeletonzation of Image Patterns using Image...Medial Axis Transformation based Skeletonzation of Image Patterns using Image...
Medial Axis Transformation based Skeletonzation of Image Patterns using Image...IOSR Journals
 
A Detailed Analysis on Feature Extraction Techniques of Panoramic Image Stitc...
A Detailed Analysis on Feature Extraction Techniques of Panoramic Image Stitc...A Detailed Analysis on Feature Extraction Techniques of Panoramic Image Stitc...
A Detailed Analysis on Feature Extraction Techniques of Panoramic Image Stitc...IJEACS
 
F0255046056
F0255046056F0255046056
F0255046056theijes
 
Research Paper v2.0
Research Paper v2.0Research Paper v2.0
Research Paper v2.0Kapil Tiwari
 
Design and Implementation of VLSI Architecture for Image Scaling Processor
Design and Implementation of VLSI Architecture for Image  Scaling ProcessorDesign and Implementation of VLSI Architecture for Image  Scaling Processor
Design and Implementation of VLSI Architecture for Image Scaling ProcessorIJMER
 
paper writing
paper writingpaper writing
paper writingMukesh M
 
Performance analysis on color image mosaicing techniques on FPGA
Performance analysis on color image mosaicing techniques on FPGAPerformance analysis on color image mosaicing techniques on FPGA
Performance analysis on color image mosaicing techniques on FPGAIJECEIAES
 
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...ijceronline
 
Image Stitching Algorithm: An Optimization between Correlation-Based and Feat...
Image Stitching Algorithm: An Optimization between Correlation-Based and Feat...Image Stitching Algorithm: An Optimization between Correlation-Based and Feat...
Image Stitching Algorithm: An Optimization between Correlation-Based and Feat...IJCSIS Research Publications
 
IRJET- Saliency based Image Co-Segmentation
IRJET- Saliency based Image Co-SegmentationIRJET- Saliency based Image Co-Segmentation
IRJET- Saliency based Image Co-SegmentationIRJET Journal
 
Face Detection System on Ada boost Algorithm Using Haar Classifiers
Face Detection System on Ada boost Algorithm Using Haar ClassifiersFace Detection System on Ada boost Algorithm Using Haar Classifiers
Face Detection System on Ada boost Algorithm Using Haar ClassifiersIJMER
 
IRJET- Real Time Implementation of Bi-Histogram Equalization Method on Androi...
IRJET- Real Time Implementation of Bi-Histogram Equalization Method on Androi...IRJET- Real Time Implementation of Bi-Histogram Equalization Method on Androi...
IRJET- Real Time Implementation of Bi-Histogram Equalization Method on Androi...IRJET Journal
 
IRJET- Design and Implementation of ATM Security System using Vibration Senso...
IRJET- Design and Implementation of ATM Security System using Vibration Senso...IRJET- Design and Implementation of ATM Security System using Vibration Senso...
IRJET- Design and Implementation of ATM Security System using Vibration Senso...IRJET Journal
 
IRJET- An Approach to FPGA based Implementation of Image Mosaicing using Neur...
IRJET- An Approach to FPGA based Implementation of Image Mosaicing using Neur...IRJET- An Approach to FPGA based Implementation of Image Mosaicing using Neur...
IRJET- An Approach to FPGA based Implementation of Image Mosaicing using Neur...IRJET Journal
 
EXTENDED WAVELET TRANSFORM BASED IMAGE INPAINTING ALGORITHM FOR NATURAL SCENE...
EXTENDED WAVELET TRANSFORM BASED IMAGE INPAINTING ALGORITHM FOR NATURAL SCENE...EXTENDED WAVELET TRANSFORM BASED IMAGE INPAINTING ALGORITHM FOR NATURAL SCENE...
EXTENDED WAVELET TRANSFORM BASED IMAGE INPAINTING ALGORITHM FOR NATURAL SCENE...cscpconf
 

Similar to study Seam Carving For Content Aware Image Resizing (20)

Medial Axis Transformation based Skeletonzation of Image Patterns using Image...
Medial Axis Transformation based Skeletonzation of Image Patterns using Image...Medial Axis Transformation based Skeletonzation of Image Patterns using Image...
Medial Axis Transformation based Skeletonzation of Image Patterns using Image...
 
Medial axis transformation based skeletonzation of image patterns using image...
Medial axis transformation based skeletonzation of image patterns using image...Medial axis transformation based skeletonzation of image patterns using image...
Medial axis transformation based skeletonzation of image patterns using image...
 
A Detailed Analysis on Feature Extraction Techniques of Panoramic Image Stitc...
A Detailed Analysis on Feature Extraction Techniques of Panoramic Image Stitc...A Detailed Analysis on Feature Extraction Techniques of Panoramic Image Stitc...
A Detailed Analysis on Feature Extraction Techniques of Panoramic Image Stitc...
 
Oc2423022305
Oc2423022305Oc2423022305
Oc2423022305
 
F0255046056
F0255046056F0255046056
F0255046056
 
Research Paper v2.0
Research Paper v2.0Research Paper v2.0
Research Paper v2.0
 
Design and Implementation of VLSI Architecture for Image Scaling Processor
Design and Implementation of VLSI Architecture for Image  Scaling ProcessorDesign and Implementation of VLSI Architecture for Image  Scaling Processor
Design and Implementation of VLSI Architecture for Image Scaling Processor
 
paper writing
paper writingpaper writing
paper writing
 
Performance analysis on color image mosaicing techniques on FPGA
Performance analysis on color image mosaicing techniques on FPGAPerformance analysis on color image mosaicing techniques on FPGA
Performance analysis on color image mosaicing techniques on FPGA
 
A10 avidan
A10 avidanA10 avidan
A10 avidan
 
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
 
Image inpainting
Image inpaintingImage inpainting
Image inpainting
 
Image Stitching Algorithm: An Optimization between Correlation-Based and Feat...
Image Stitching Algorithm: An Optimization between Correlation-Based and Feat...Image Stitching Algorithm: An Optimization between Correlation-Based and Feat...
Image Stitching Algorithm: An Optimization between Correlation-Based and Feat...
 
IRJET- Saliency based Image Co-Segmentation
IRJET- Saliency based Image Co-SegmentationIRJET- Saliency based Image Co-Segmentation
IRJET- Saliency based Image Co-Segmentation
 
Face Detection System on Ada boost Algorithm Using Haar Classifiers
Face Detection System on Ada boost Algorithm Using Haar ClassifiersFace Detection System on Ada boost Algorithm Using Haar Classifiers
Face Detection System on Ada boost Algorithm Using Haar Classifiers
 
IRJET- Real Time Implementation of Bi-Histogram Equalization Method on Androi...
IRJET- Real Time Implementation of Bi-Histogram Equalization Method on Androi...IRJET- Real Time Implementation of Bi-Histogram Equalization Method on Androi...
IRJET- Real Time Implementation of Bi-Histogram Equalization Method on Androi...
 
IRJET- Design and Implementation of ATM Security System using Vibration Senso...
IRJET- Design and Implementation of ATM Security System using Vibration Senso...IRJET- Design and Implementation of ATM Security System using Vibration Senso...
IRJET- Design and Implementation of ATM Security System using Vibration Senso...
 
Log polar coordinates
Log polar coordinatesLog polar coordinates
Log polar coordinates
 
IRJET- An Approach to FPGA based Implementation of Image Mosaicing using Neur...
IRJET- An Approach to FPGA based Implementation of Image Mosaicing using Neur...IRJET- An Approach to FPGA based Implementation of Image Mosaicing using Neur...
IRJET- An Approach to FPGA based Implementation of Image Mosaicing using Neur...
 
EXTENDED WAVELET TRANSFORM BASED IMAGE INPAINTING ALGORITHM FOR NATURAL SCENE...
EXTENDED WAVELET TRANSFORM BASED IMAGE INPAINTING ALGORITHM FOR NATURAL SCENE...EXTENDED WAVELET TRANSFORM BASED IMAGE INPAINTING ALGORITHM FOR NATURAL SCENE...
EXTENDED WAVELET TRANSFORM BASED IMAGE INPAINTING ALGORITHM FOR NATURAL SCENE...
 

More from Chiamin Hsu

study Domain Transform for Edge-Aware Image and Video Processing
study Domain Transform for Edge-Aware Image and Video Processingstudy Domain Transform for Edge-Aware Image and Video Processing
study Domain Transform for Edge-Aware Image and Video ProcessingChiamin Hsu
 
study Image and video abstraction by multi scale anisotropic kuwahara
study  Image and video abstraction by multi scale anisotropic kuwaharastudy  Image and video abstraction by multi scale anisotropic kuwahara
study Image and video abstraction by multi scale anisotropic kuwaharaChiamin Hsu
 
study Accelerating Spatially Varying Gaussian Filters
study Accelerating Spatially Varying Gaussian Filtersstudy Accelerating Spatially Varying Gaussian Filters
study Accelerating Spatially Varying Gaussian FiltersChiamin Hsu
 
stduy Edge-Based Image Coarsening
stduy Edge-Based Image Coarseningstduy Edge-Based Image Coarsening
stduy Edge-Based Image CoarseningChiamin Hsu
 
study Streaming Multigrid For Gradient Domain Operations On Large Images
study Streaming Multigrid For Gradient Domain Operations On Large Imagesstudy Streaming Multigrid For Gradient Domain Operations On Large Images
study Streaming Multigrid For Gradient Domain Operations On Large ImagesChiamin Hsu
 
study Shading Based Surface Editing
study Shading Based Surface Editingstudy Shading Based Surface Editing
study Shading Based Surface EditingChiamin Hsu
 
study Diffusion Curves: A Vector Representation for Smooth-Shaded Images
study Diffusion Curves: A Vector Representation for Smooth-Shaded Imagesstudy Diffusion Curves: A Vector Representation for Smooth-Shaded Images
study Diffusion Curves: A Vector Representation for Smooth-Shaded ImagesChiamin Hsu
 
study Image Vectorization using Optimized Gradeint Meshes
study Image Vectorization using Optimized Gradeint Meshesstudy Image Vectorization using Optimized Gradeint Meshes
study Image Vectorization using Optimized Gradeint MeshesChiamin Hsu
 
study Latent Doodle Space
study Latent Doodle Spacestudy Latent Doodle Space
study Latent Doodle SpaceChiamin Hsu
 
study Coded Aperture
study Coded Aperturestudy Coded Aperture
study Coded ApertureChiamin Hsu
 
study Active Refocusing Of Images And Videos
study Active Refocusing Of Images And Videosstudy Active Refocusing Of Images And Videos
study Active Refocusing Of Images And VideosChiamin Hsu
 
study Dappled Photography
study Dappled Photographystudy Dappled Photography
study Dappled PhotographyChiamin Hsu
 

More from Chiamin Hsu (12)

study Domain Transform for Edge-Aware Image and Video Processing
study Domain Transform for Edge-Aware Image and Video Processingstudy Domain Transform for Edge-Aware Image and Video Processing
study Domain Transform for Edge-Aware Image and Video Processing
 
study Image and video abstraction by multi scale anisotropic kuwahara
study  Image and video abstraction by multi scale anisotropic kuwaharastudy  Image and video abstraction by multi scale anisotropic kuwahara
study Image and video abstraction by multi scale anisotropic kuwahara
 
study Accelerating Spatially Varying Gaussian Filters
study Accelerating Spatially Varying Gaussian Filtersstudy Accelerating Spatially Varying Gaussian Filters
study Accelerating Spatially Varying Gaussian Filters
 
stduy Edge-Based Image Coarsening
stduy Edge-Based Image Coarseningstduy Edge-Based Image Coarsening
stduy Edge-Based Image Coarsening
 
study Streaming Multigrid For Gradient Domain Operations On Large Images
study Streaming Multigrid For Gradient Domain Operations On Large Imagesstudy Streaming Multigrid For Gradient Domain Operations On Large Images
study Streaming Multigrid For Gradient Domain Operations On Large Images
 
study Shading Based Surface Editing
study Shading Based Surface Editingstudy Shading Based Surface Editing
study Shading Based Surface Editing
 
study Diffusion Curves: A Vector Representation for Smooth-Shaded Images
study Diffusion Curves: A Vector Representation for Smooth-Shaded Imagesstudy Diffusion Curves: A Vector Representation for Smooth-Shaded Images
study Diffusion Curves: A Vector Representation for Smooth-Shaded Images
 
study Image Vectorization using Optimized Gradeint Meshes
study Image Vectorization using Optimized Gradeint Meshesstudy Image Vectorization using Optimized Gradeint Meshes
study Image Vectorization using Optimized Gradeint Meshes
 
study Latent Doodle Space
study Latent Doodle Spacestudy Latent Doodle Space
study Latent Doodle Space
 
study Coded Aperture
study Coded Aperturestudy Coded Aperture
study Coded Aperture
 
study Active Refocusing Of Images And Videos
study Active Refocusing Of Images And Videosstudy Active Refocusing Of Images And Videos
study Active Refocusing Of Images And Videos
 
study Dappled Photography
study Dappled Photographystudy Dappled Photography
study Dappled Photography
 

Recently uploaded

Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024The Digital Insurer
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 

Recently uploaded (20)

Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 

study Seam Carving For Content Aware Image Resizing

  • 1. Seam Carving for Content-Aware Image Resizing Shai Aidan (Mitsubishi Electric Research Labs) Ariel Shamir (The Interdisciplinary Center & MERL) ACM SIGGRAPH 2007
  • 2. Resize Seam carving & insertion
  • 3. Abstract Seams are optimal 8-connected paths of pixels cross the image Carving out or inserting seams to achieve content-aware resizing
  • 4. Outline Introduction Background Seam-carving operator Discrete image resizing Multi-size images Limitations Conclusions and future work
  • 6. Motivation HTML can support dynamic changes of page layout and text. Why can not an image deform to fit different layout automatically ? iGoogle How about aspect ratio of an image , such as fitting photo into PDA or phone cells ? Solution ? Resize – content independent Crop – remove pixels from the image periphery only
  • 7. Basic Idea of Seam-Carving Use energy function to define the importanceof pixels Define seam-carving image operator Image reduction Carving out seams - the connected low energy pixels crossing the image Preserving the image structure Image enlarging Insert seams on low energy area The order of seam insertion ensures a balance between the original image content and the artificially inserted pixels
  • 8. Application Discrete image resizing Aspect Ration Change, Image Retarget, Image Enlarging, Content Amplification, Seam Carving in gradient domain, Object Removal Multi-size images An image can continuously change their size in a content-aware manner Storing the order of seam removal and insertion
  • 10. Image Retarget Seek to change the size of the image while maintaining the important features Face detector An automatic thumbnail creation [Suh03] ROI Fisheye-View warp [Liu and Gleicher 05, 06] Visual saliency [] [Suh 03] [Selur 04, decompose image to foreground obj and background origin
  • 11. Saliency map [Itti IEEE99] Simulate neuroscience of human visual system Pyramid tech. to compute 3 feature maps, color, intensity and orientation [Suh 03], an automatic thumbnail creation, based on either a saliency map or the output of a face detector [Chen 03], adapting most important region of images to mobile devices.
  • 12. [Liu 03], suggesting to trade time for space. Given a collection of regions of interest, they construct an optimal path through these regions and display them serially.
  • 13. [Santella et al. 06] use eye tracking, in addition to composition rules to crop images intelligently.
  • 14. ROI (Region-Of-Interest) Such a method was proposed by [Liu and Gleicher 05, 06] for image and video retargeting. For image retargeting they find ROI and construct a novel Fisheye-View warp that essentially applies a piecewise linear scaling function in each dimension to the image. This way the ROI is maintained while the rest of the image is warped. The retargeting can be done in interactive rates, once the ROI is found, so the user can control the desired size of the image by moving a slider. In their video retargeting work they use a combination of image and saliency maps to find the ROI. Then they use a combination of cropping, virtual pan and shot cuts to retarget the video frames.
  • 15. Feature-aware warping The first solution to the general problem of warping an image into an arbitrary shape while preserving user-specified features was recently proposed by [Gal et al. 06]. The feature-aware warping is achieved by a particular formulation of the Laplacian editing technique, suited to accommodate similarity constraints on parts of the domain. Since local constraints are propagated by the global optimization process, not all the constraints can always be satisfied at once
  • 16. Seam Perfect seams to combine parts of a set of photo into a single composite picture [Agarwala et al. 04] Drag-and-Drop Pasting that extends the Poisson Image Editing to computer an optimal boundary (seam) between the source picture and target images [Jia et al. 06] AutoCollage, a program that automatically creates a collage image from a collection of images. [Rother et al. 06] Simultaneously solve matting and compositing. They allow the user to scale the size of the foreground object and paste it back on the original background. [Wang , Cohen 06] evaluated several cost functions for seamless image stitching and concluded that minimizing an L1error norm between the gradients of the stitched image and the gradients of the input images performed well in general [Zomet et al. 05]
  • 17. Sear Optimal Seam Dijkstra’s shortest path algorithm [98] Dynamic programming [Efros 01] Graph cuts [Kwatra 03]
  • 19. Strategies of Image Reduction Original e1 energy Optimal global remove the lowest energy pixels Pixel remove the least energy in each row
  • 20. Strategies of Image Reduction Original e1 energy Column removing columns with minimal energy Crop find a sub-win with the highest energy
  • 24. Optimal Seam Search S Dynamic Programming G
  • 26. Image Energy Preservation The average energy of all pixels during resizing
  • 27. Energy Functions L1 and L2-norm of the gradient, saliency measure [Itti 99]
  • 28.
  • 29. Energy Functions Histogram of Gradient (HoG) [Dalal and Triggs 95] max(HoG(I(x,y)) makes sure the seams run parallel to the edge of objects and not cross them
  • 30. Energy Functions Entropy Compute the entropy over a 9 x 9 window and add it to e1 eEntropy(x,y) = + e1 (x,y)
  • 31. Energy Functions Segmentation and L1 Image segmentation [Christoudias 02] Apply e1 on the results
  • 32. No single e function performs well across all images Similar range for resizing e1 or eHoG works well
  • 33. Discrete image resizing Aspect Ratio Change, Retargeting with Optimal Seams-Order, Image Enlarging, Content Amplification,
  • 34. Aspect Ratio Change Carving-out /insert seams Original Original Original 1D aspect ratio changing
  • 35. 2D aspect ratio changing Optimal Seams-Order Search + DynamicProgramming = + min
  • 36. Retargeting with Optimal Seams-Order h-first alternate v-first Transport map Original optimal
  • 37. Image Enlarging Find first k seams for removal Duplicate them in order to arrive at I(-k) origin I(t): smaller image after t seam-carving I(-k): enlarged image after k seam insertion t I(-1) I(-k) I(t) I(-k) enlarged image insert seams in order of removal
  • 38. Image Enlarging (>50%) origin Break into several steps Each step does not enlarge the size of image more than a fraction
  • 40. Seam Carving in the Gradient Domain Seam + Poisson Reconstruction [Perez 03] Compute e function Work on the gradient domain Remove seams from the x and y derivatives of the original image Use Poisson Reconstruction original retarget retarget in Gradient Domain
  • 41. Object Removal Mark the removing target Remove seams until all the marked pixels are gone * Employ seam insertion to maintain the original size
  • 43. Multi-size images Store the pre-computed representation that encodes, for each pixel in V/H map The index of the seam that removed it The negative index of the seam that inserted it Blue (first seam)  Red (last seam) origin V(i,j)=t : pixel (i,j) removed by t-th vertical seam H(i,j)=t : pixel (i,j) removed by t-th horizontal seam
  • 44. Limitations Seam-Carving does not work well on all images Ex: face Origin Crop Scale Constraint the face Face the flower Bottom up feature detection
  • 45. Limitations The amount of content Too density, no “less” important area The layout of the image content origin origin
  • 46. Conclusions Present a content-aware resizing using the seam-carving image operator Seams are the optimal paths on a single image Carve-out seams Insert seams Application of seam-carving operator Aspect ratio change, image retargeting, content amplification, object removal Multi-size images that support continuous resizingin real-time
  • 47. Future Work Video resizing Combination of scaling and seam-carving Define more robust multi-size image Better solution to combine horizontal and vertical seams in multi-size image
  • 48. END

Editor's Notes

  1. Shai Aidan 以色列人MERL(Mitsubishi Electric Research Labs)沒想到以色列人投效到日本研究中心