SlideShare a Scribd company logo
1 of 43
Comparative Analysis of Novel
Boundary Detection Methods
ANIL ULAŞ KOÇAK - OSMAN BUĞRA SARICA
Image Processing Course, 2016, Ankara, Turkey
1
Plan
What is Boundary Detection?
Related Works
Boundary Detection Methods
 Sketch Tokens
 Structured Random Forest
 Oriented Edge Forest
 Crisp Boundary Detection
Comparison
Results
2
Boundary detection
3
 Reduce dimensionality of data
 Preserve content information
 Useful in applications such as:
Boundary detection
4
 Boundary detection is usually formulated as a per-pixel
classification problem
 How to extract discriminative boundary features?
 How to learn a efficient boundary classifier?
Boundary detection
5
Related Works
6
Related Works
7
Image Optimized Cues Boundary Strength
Brightness
Color
Texture
Benchmark
Human Segmentations
Cue Combination
Model
r
(x,y)
Related Works
8
Related Works
9
Related Works
10
Related Works
11
Boundary detection methods
Learning based contour detection:
 Sketch Tokens: A Learned Mid-Level Representation for
Contour and Object Detection
 Structured Forests for Fast Edge Detection
 Oriented Edge Forest for Boundary Detection
Measuring rarity based on pointwise mutual information:
 Crisp Boundary Detection Using Pointwise Mutual Information
12
Sketch tokens
13
Sketch tokens
A novel approach to both learning and detecting local edge-
based mid-level features. Having 2 main parts;
◦ Defining Sketch Tokens Classes
◦ defining clasess by clustering of the patches sampled from the human generated hand drawn
images
◦ Detecting Sketch Tokens
◦ Training a random forest classifier with feature information from training images
14
Sketch tokens
Defining Sketch Tokens Classes
Goal is to define a set of token classes that represent the wide variety of local
edge structures that may exist in an image.
15
Sketch tokens
Defining Sketch Tokens Classes
◦ It is defined by clustering of the patches sampled from the human generated
hand drawn images (novel approach)
◦ Only patches that contain a labeled contour at the center pixel are used
(35x35)
16
Sketch tokens
Detecting Sketch Tokens
◦ Feature Extraction
◦ Integral Channel Features
◦ Self Similarity Features
◦ Classification
◦ Using Random Forest Classifier to predict sketch tokens
17
Sketch tokens
Integral Channel Features
 Color (3 channels)
 Gradients (3 unoriented + 8 oriented channels)
• Sigma = 0, Theta = 0, pi/2, pi, 3pi/2
• Sigma = 1.5, Theta = 0, pi/2, pi, 3pi/2
• Sigma = 5
18
Sketch tokens
19
Self-similarity features: The L1 distance from the yellow box to the
other 5x5 cells are shown for color and gradient magnitude channels.
Sketch tokens
20
Learning
 Random Forest Classifiers.
Advantages:
 Fast at test time, especially for
a nonlinear classifier.
 Dont have to explicitly compute
independent descriptors for
every patch. Just look up what
the decision tree wants to
know at each branch.
Sketch tokens
Detections of individual sketch tokens
21
Combining sketch token
detections:
Random forest classifier predicts the
probability that an image patch
belongs to each token class or the
negative set.
Crisp boundary detection
Main Idea: pixels belonging to the same object show higher statistical
dependencies than pixels belonging to the different object.
Crisp boundary detection
black-next-to-white occurs over and over again. This pattern shows up
in the image’s statistics as a suspicious coincidence — these colors must
be part of the same object!
23
Crisp boundary detection
Pointwise Mutual Information (PMI) is used to obtain
statistical association between two pixel and get affinity
measure, so can be predicted whether or not two pixel lie on
same object.
24
Crisp boundary detection
Method measures how often each color A occurs next to
each color B within the image.
25
Crisp boundary detection
Accurate Result
◦ Thanks to good prediction of PMI distinguishing between boundary
and non boundary is provided.
Crisp Result
◦ Highly localized features (only color and color variance information is
used in 3x3 window)
26
Structured forests
27
Idea
 It is an enhancement from Sketch token
 Using the edge structure directly instead of using predefined label
 It might allows it to learn more subtle variations in edge structure and leads to
a more accurate and efficient algorithm.
Structured learning with random forest
 It did not cluster the edges before training the tree
 If the splitting function could work with the edge structure directly,
then the random forest algorithm could work in original way
Structured forests
How to represent the edge structure?
Difference 2
Difference 1
Structured forests
29
Structured random forest
 Slightly modified from normal random forest
Clustering the incoming patches into 2 groups
based on the Z representation.
Now the patches have labels! Thus we could
still calculate the information gain to choose
the feature f and threshold t.
The representative edge in the leaf node is
chosen by:
 The edge structure whose Z representation is the
medoid
Structured forests
30
Generate the edge map.
 Each prediction is a voting. The pixel with higher votes has
higher confidence of being an edge
Oriented edge forest
31
Oriented edge forest
32
Once a forest is trained to recognize oriented edge
patterns, it is applied over the input image in scanning-
window fashion. From a test patch x each tree produces a
distribution which are then combined either by
averaging (accuracy) or voting (speed, sparsity).
 where 1 is the indicator function.
Oriented edge forest
33
 Method provide a simple procedure for calibrating forest-
generated posterior probabilities.
Oriented edge forest
34
The forest produces calibrated distributions at every
spatial position. To derive edge strengths we composite
sharpened edge masks into the image, weighted by their
posterior probabilities.
Experiments
IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015 5
10 test images from BSDS 500
P-R graph from BSDS Bechmark
Low powered PC
Evaluation Metric
IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015 5
Precision-Recall Curve
 Precision =
𝑇𝑝
𝑇𝑝+𝐹𝑝
How many output results are relevant ?
 Recall =
𝑇𝑝
𝑇𝑝+𝐹𝑛
How many relevant output results are selected ?
Time cost
 Measuring time during selected scenario.
Complexity
 Having complex or simple understanding
Performance
Precision-Recall Curve
What did we expect ?
6
Performance
Precision-Recall Curve
6
Speed
7
What did we expect ?
Speed
7
Comparison
Statistical
Learning
Based
Contribution Complexity
Novel
Boundary
Detectors
Sketch Tokens Random Forest Simple
Crisp Boundary
Detection
PMI Simple
Structured Forests
Struct Output
Domain
Complex
Oriented Edge
Forest
A simple Output
Space
Complex
IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015 4
Results
23
Original Sketch Tokens Crisp Boundary Structured Forest OEF
Conclusion
IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015 24
Method Pros Cons
Sketch Tokens Having Inovative
approach
Relatively Slow and
Average Success
Crisp B. D. Great Success,
Simple approach
Slow
Structured Good success on
performance and
time cost
Complex
Oriented Having relatively
good time cost
Not successful by
year , Complex

More Related Content

What's hot

Image feature extraction
Image feature extractionImage feature extraction
Image feature extractionRushin Shah
 
Image segmentation
Image segmentationImage segmentation
Image segmentationRania H
 
SCENE TEXT RECOGNITION IN MOBILE APPLICATION BY CHARACTER DESCRIPTOR AND STRU...
SCENE TEXT RECOGNITION IN MOBILE APPLICATION BY CHARACTER DESCRIPTOR AND STRU...SCENE TEXT RECOGNITION IN MOBILE APPLICATION BY CHARACTER DESCRIPTOR AND STRU...
SCENE TEXT RECOGNITION IN MOBILE APPLICATION BY CHARACTER DESCRIPTOR AND STRU...Cheriyan K M
 
Beginning direct3d gameprogramming01_thehistoryofdirect3dgraphics_20160407_ji...
Beginning direct3d gameprogramming01_thehistoryofdirect3dgraphics_20160407_ji...Beginning direct3d gameprogramming01_thehistoryofdirect3dgraphics_20160407_ji...
Beginning direct3d gameprogramming01_thehistoryofdirect3dgraphics_20160407_ji...JinTaek Seo
 
Real Time Myanmar Traffic Sign Recognition System using HOG and SVM
Real Time Myanmar Traffic Sign Recognition System using HOG and SVMReal Time Myanmar Traffic Sign Recognition System using HOG and SVM
Real Time Myanmar Traffic Sign Recognition System using HOG and SVMijtsrd
 
Image to text Converter
Image to text ConverterImage to text Converter
Image to text ConverterDhiraj Raj
 
Methods of Manifold Learning for Dimension Reduction of Large Data Sets
Methods of Manifold Learning for Dimension Reduction of Large Data SetsMethods of Manifold Learning for Dimension Reduction of Large Data Sets
Methods of Manifold Learning for Dimension Reduction of Large Data SetsRyan B Harvey, CSDP, CSM
 
Color Particle Filter Tracking using Frame Segmentation based on JND Color an...
Color Particle Filter Tracking using Frame Segmentation based on JND Color an...Color Particle Filter Tracking using Frame Segmentation based on JND Color an...
Color Particle Filter Tracking using Frame Segmentation based on JND Color an...IOSRJVSP
 
Color vs texture feature extraction and matching in visual content retrieval ...
Color vs texture feature extraction and matching in visual content retrieval ...Color vs texture feature extraction and matching in visual content retrieval ...
Color vs texture feature extraction and matching in visual content retrieval ...IAEME Publication
 
Contextless Object Recognition with Shape-enriched SIFT and Bags of Features
Contextless Object Recognition with Shape-enriched SIFT and Bags of FeaturesContextless Object Recognition with Shape-enriched SIFT and Bags of Features
Contextless Object Recognition with Shape-enriched SIFT and Bags of FeaturesUniversitat Politècnica de Catalunya
 
Developing 3D Viewing Model from 2D Stereo Pair with its Occlusion Ratio
Developing 3D Viewing Model from 2D Stereo Pair with its Occlusion RatioDeveloping 3D Viewing Model from 2D Stereo Pair with its Occlusion Ratio
Developing 3D Viewing Model from 2D Stereo Pair with its Occlusion RatioCSCJournals
 
Digit recognition using neural network
Digit recognition using neural networkDigit recognition using neural network
Digit recognition using neural networkshachibattar
 
Identification system of characters in vehicular plates
Identification system of characters in vehicular platesIdentification system of characters in vehicular plates
Identification system of characters in vehicular platesIJRES Journal
 
GRPHICS01 - Introduction to 3D Graphics
GRPHICS01 - Introduction to 3D GraphicsGRPHICS01 - Introduction to 3D Graphics
GRPHICS01 - Introduction to 3D GraphicsMichael Heron
 

What's hot (20)

Image feature extraction
Image feature extractionImage feature extraction
Image feature extraction
 
Image Inpainting
Image InpaintingImage Inpainting
Image Inpainting
 
object 3d(1)
object 3d(1)object 3d(1)
object 3d(1)
 
Self Organizing Maps
Self Organizing MapsSelf Organizing Maps
Self Organizing Maps
 
Image segmentation
Image segmentationImage segmentation
Image segmentation
 
SCENE TEXT RECOGNITION IN MOBILE APPLICATION BY CHARACTER DESCRIPTOR AND STRU...
SCENE TEXT RECOGNITION IN MOBILE APPLICATION BY CHARACTER DESCRIPTOR AND STRU...SCENE TEXT RECOGNITION IN MOBILE APPLICATION BY CHARACTER DESCRIPTOR AND STRU...
SCENE TEXT RECOGNITION IN MOBILE APPLICATION BY CHARACTER DESCRIPTOR AND STRU...
 
Beginning direct3d gameprogramming01_thehistoryofdirect3dgraphics_20160407_ji...
Beginning direct3d gameprogramming01_thehistoryofdirect3dgraphics_20160407_ji...Beginning direct3d gameprogramming01_thehistoryofdirect3dgraphics_20160407_ji...
Beginning direct3d gameprogramming01_thehistoryofdirect3dgraphics_20160407_ji...
 
Lec10 matching
Lec10 matchingLec10 matching
Lec10 matching
 
Real Time Myanmar Traffic Sign Recognition System using HOG and SVM
Real Time Myanmar Traffic Sign Recognition System using HOG and SVMReal Time Myanmar Traffic Sign Recognition System using HOG and SVM
Real Time Myanmar Traffic Sign Recognition System using HOG and SVM
 
Image to text Converter
Image to text ConverterImage to text Converter
Image to text Converter
 
Methods of Manifold Learning for Dimension Reduction of Large Data Sets
Methods of Manifold Learning for Dimension Reduction of Large Data SetsMethods of Manifold Learning for Dimension Reduction of Large Data Sets
Methods of Manifold Learning for Dimension Reduction of Large Data Sets
 
Image segmentation
Image segmentation Image segmentation
Image segmentation
 
Color Particle Filter Tracking using Frame Segmentation based on JND Color an...
Color Particle Filter Tracking using Frame Segmentation based on JND Color an...Color Particle Filter Tracking using Frame Segmentation based on JND Color an...
Color Particle Filter Tracking using Frame Segmentation based on JND Color an...
 
Histogram Equalization
Histogram EqualizationHistogram Equalization
Histogram Equalization
 
Color vs texture feature extraction and matching in visual content retrieval ...
Color vs texture feature extraction and matching in visual content retrieval ...Color vs texture feature extraction and matching in visual content retrieval ...
Color vs texture feature extraction and matching in visual content retrieval ...
 
Contextless Object Recognition with Shape-enriched SIFT and Bags of Features
Contextless Object Recognition with Shape-enriched SIFT and Bags of FeaturesContextless Object Recognition with Shape-enriched SIFT and Bags of Features
Contextless Object Recognition with Shape-enriched SIFT and Bags of Features
 
Developing 3D Viewing Model from 2D Stereo Pair with its Occlusion Ratio
Developing 3D Viewing Model from 2D Stereo Pair with its Occlusion RatioDeveloping 3D Viewing Model from 2D Stereo Pair with its Occlusion Ratio
Developing 3D Viewing Model from 2D Stereo Pair with its Occlusion Ratio
 
Digit recognition using neural network
Digit recognition using neural networkDigit recognition using neural network
Digit recognition using neural network
 
Identification system of characters in vehicular plates
Identification system of characters in vehicular platesIdentification system of characters in vehicular plates
Identification system of characters in vehicular plates
 
GRPHICS01 - Introduction to 3D Graphics
GRPHICS01 - Introduction to 3D GraphicsGRPHICS01 - Introduction to 3D Graphics
GRPHICS01 - Introduction to 3D Graphics
 

Viewers also liked

sketch to photo matching in criminal investigations
sketch to photo matching in criminal investigationssketch to photo matching in criminal investigations
sketch to photo matching in criminal investigationsChippy Thomas
 
sketch to photo matching
sketch to photo matchingsketch to photo matching
sketch to photo matchingChippy Thomas
 
The “EUROPOLIS” residential complex: a fantasy in Palladian style traditions ...
The “EUROPOLIS” residential complex: a fantasy in Palladian style traditions ...The “EUROPOLIS” residential complex: a fantasy in Palladian style traditions ...
The “EUROPOLIS” residential complex: a fantasy in Palladian style traditions ...Konstantin Ivanovich Samoilov
 
Composite Material With Ansys
Composite Material With AnsysComposite Material With Ansys
Composite Material With AnsysJafar Hasanpoor
 
Psychometric instrument development
Psychometric instrument developmentPsychometric instrument development
Psychometric instrument developmentJames Neill
 
Tutorial para aprender hacer una unidad tematica carlos leon
Tutorial para aprender hacer una unidad tematica carlos leonTutorial para aprender hacer una unidad tematica carlos leon
Tutorial para aprender hacer una unidad tematica carlos leonCarlos Jose Leon Polanco
 
Анель Хасенова + аренда квартир + клиенты
Анель Хасенова + аренда квартир + клиентыАнель Хасенова + аренда квартир + клиенты
Анель Хасенова + аренда квартир + клиентыAnel Khassenova
 
Artesanos
ArtesanosArtesanos
ArtesanosElly97
 
RJPenick Resume4
RJPenick Resume4RJPenick Resume4
RJPenick Resume4Ron Penick
 
Osama Shawky Abbas new1 co
Osama Shawky Abbas new1 coOsama Shawky Abbas new1 co
Osama Shawky Abbas new1 coOsama Shawky
 
Шунтукские эльфы
Шунтукские эльфыШунтукские эльфы
Шунтукские эльфыcdoarg01
 
Curriculum 2016_Eng
Curriculum 2016_EngCurriculum 2016_Eng
Curriculum 2016_EngCarlo Ceresa
 
SACHIN HARBA CV [MBA]
SACHIN HARBA CV [MBA]SACHIN HARBA CV [MBA]
SACHIN HARBA CV [MBA]sachin harba
 

Viewers also liked (20)

S01761131137
S01761131137S01761131137
S01761131137
 
sketch to photo matching in criminal investigations
sketch to photo matching in criminal investigationssketch to photo matching in criminal investigations
sketch to photo matching in criminal investigations
 
Ada-Sketch and friends
Ada-Sketch and friendsAda-Sketch and friends
Ada-Sketch and friends
 
sketch to photo matching
sketch to photo matchingsketch to photo matching
sketch to photo matching
 
The “EUROPOLIS” residential complex: a fantasy in Palladian style traditions ...
The “EUROPOLIS” residential complex: a fantasy in Palladian style traditions ...The “EUROPOLIS” residential complex: a fantasy in Palladian style traditions ...
The “EUROPOLIS” residential complex: a fantasy in Palladian style traditions ...
 
Composite Material With Ansys
Composite Material With AnsysComposite Material With Ansys
Composite Material With Ansys
 
Psychometric instrument development
Psychometric instrument developmentPsychometric instrument development
Psychometric instrument development
 
Презентация на День рождения
Презентация на День рожденияПрезентация на День рождения
Презентация на День рождения
 
Tutorial para aprender hacer una unidad tematica carlos leon
Tutorial para aprender hacer una unidad tematica carlos leonTutorial para aprender hacer una unidad tematica carlos leon
Tutorial para aprender hacer una unidad tematica carlos leon
 
Анель Хасенова + аренда квартир + клиенты
Анель Хасенова + аренда квартир + клиентыАнель Хасенова + аренда квартир + клиенты
Анель Хасенова + аренда квартир + клиенты
 
Artesanos
ArtesanosArtesanos
Artesanos
 
Reseña Contable
Reseña ContableReseña Contable
Reseña Contable
 
MATTS VISUAL AID
MATTS VISUAL AIDMATTS VISUAL AID
MATTS VISUAL AID
 
RJPenick Resume4
RJPenick Resume4RJPenick Resume4
RJPenick Resume4
 
Osama Shawky Abbas new1 co
Osama Shawky Abbas new1 coOsama Shawky Abbas new1 co
Osama Shawky Abbas new1 co
 
Шунтукские эльфы
Шунтукские эльфыШунтукские эльфы
Шунтукские эльфы
 
Curriculum 2016_Eng
Curriculum 2016_EngCurriculum 2016_Eng
Curriculum 2016_Eng
 
SACHIN HARBA CV [MBA]
SACHIN HARBA CV [MBA]SACHIN HARBA CV [MBA]
SACHIN HARBA CV [MBA]
 
FinalReport
FinalReportFinalReport
FinalReport
 
Syed_Khaja_Nooruddin.04042016
Syed_Khaja_Nooruddin.04042016Syed_Khaja_Nooruddin.04042016
Syed_Khaja_Nooruddin.04042016
 

Similar to Sunum

Image segmentation ajal
Image segmentation ajalImage segmentation ajal
Image segmentation ajalAJAL A J
 
Detection of leaf diseases and classification using digital image processing
Detection of leaf diseases and classification using digital image processingDetection of leaf diseases and classification using digital image processing
Detection of leaf diseases and classification using digital image processingNaeem Shehzad
 
image segmentation by ppres.pptx
image segmentation by ppres.pptximage segmentation by ppres.pptx
image segmentation by ppres.pptxmohan134666
 
Automatic microaneurysm detection walter et. al.
Automatic microaneurysm detection walter et. al.Automatic microaneurysm detection walter et. al.
Automatic microaneurysm detection walter et. al.Ritu Lahoti
 
Semi-automatic ground truth generation using unsupervised clustering and limi...
Semi-automatic ground truth generation using unsupervised clustering and limi...Semi-automatic ground truth generation using unsupervised clustering and limi...
Semi-automatic ground truth generation using unsupervised clustering and limi...SOYEON KIM
 
Computer vision series
Computer vision seriesComputer vision series
Computer vision seriesPerry Lea
 
Structured Forests for Fast Edge Detection [Paper Presentation]
Structured Forests for Fast Edge Detection [Paper Presentation]Structured Forests for Fast Edge Detection [Paper Presentation]
Structured Forests for Fast Edge Detection [Paper Presentation]Mohammad Shaker
 
International Journal of Pharmaceutical Science Invention (IJPSI)
International Journal of Pharmaceutical Science Invention (IJPSI)International Journal of Pharmaceutical Science Invention (IJPSI)
International Journal of Pharmaceutical Science Invention (IJPSI)inventionjournals
 
Conjugate Gradient method for Brain Magnetic Resonance Images Segmentation
Conjugate Gradient method for Brain Magnetic Resonance Images SegmentationConjugate Gradient method for Brain Magnetic Resonance Images Segmentation
Conjugate Gradient method for Brain Magnetic Resonance Images SegmentationEL-Hachemi Guerrout
 
edge Linking dip.pptx
  edge Linking dip.pptx  edge Linking dip.pptx
edge Linking dip.pptxMruthyunjayaS
 
Dimensionality Reduction and feature extraction.pptx
Dimensionality Reduction and feature extraction.pptxDimensionality Reduction and feature extraction.pptx
Dimensionality Reduction and feature extraction.pptxSivam Chinna
 
08 cie552 image_segmentation
08 cie552 image_segmentation08 cie552 image_segmentation
08 cie552 image_segmentationElsayed Hemayed
 
Locating texture boundaries using a fast unsupervised approach based on clust...
Locating texture boundaries using a fast unsupervised approach based on clust...Locating texture boundaries using a fast unsupervised approach based on clust...
Locating texture boundaries using a fast unsupervised approach based on clust...Mehryar (Mike) E., Ph.D.
 
Comparison of image segmentation
Comparison of image segmentationComparison of image segmentation
Comparison of image segmentationHaitham Ahmed
 

Similar to Sunum (20)

Image segmentation ajal
Image segmentation ajalImage segmentation ajal
Image segmentation ajal
 
regions
regionsregions
regions
 
Detection of leaf diseases and classification using digital image processing
Detection of leaf diseases and classification using digital image processingDetection of leaf diseases and classification using digital image processing
Detection of leaf diseases and classification using digital image processing
 
Understandig PCA and LDA
Understandig PCA and LDAUnderstandig PCA and LDA
Understandig PCA and LDA
 
image segmentation by ppres.pptx
image segmentation by ppres.pptximage segmentation by ppres.pptx
image segmentation by ppres.pptx
 
Automatic microaneurysm detection walter et. al.
Automatic microaneurysm detection walter et. al.Automatic microaneurysm detection walter et. al.
Automatic microaneurysm detection walter et. al.
 
Semi-automatic ground truth generation using unsupervised clustering and limi...
Semi-automatic ground truth generation using unsupervised clustering and limi...Semi-automatic ground truth generation using unsupervised clustering and limi...
Semi-automatic ground truth generation using unsupervised clustering and limi...
 
IJET-V2I6P17
IJET-V2I6P17IJET-V2I6P17
IJET-V2I6P17
 
RBF2.ppt
RBF2.pptRBF2.ppt
RBF2.ppt
 
Computer vision series
Computer vision seriesComputer vision series
Computer vision series
 
Structured Forests for Fast Edge Detection [Paper Presentation]
Structured Forests for Fast Edge Detection [Paper Presentation]Structured Forests for Fast Edge Detection [Paper Presentation]
Structured Forests for Fast Edge Detection [Paper Presentation]
 
International Journal of Pharmaceutical Science Invention (IJPSI)
International Journal of Pharmaceutical Science Invention (IJPSI)International Journal of Pharmaceutical Science Invention (IJPSI)
International Journal of Pharmaceutical Science Invention (IJPSI)
 
Conjugate Gradient method for Brain Magnetic Resonance Images Segmentation
Conjugate Gradient method for Brain Magnetic Resonance Images SegmentationConjugate Gradient method for Brain Magnetic Resonance Images Segmentation
Conjugate Gradient method for Brain Magnetic Resonance Images Segmentation
 
edge Linking dip.pptx
  edge Linking dip.pptx  edge Linking dip.pptx
edge Linking dip.pptx
 
Extended fuzzy c means clustering algorithm in segmentation of noisy images
Extended fuzzy c means clustering algorithm in segmentation of noisy imagesExtended fuzzy c means clustering algorithm in segmentation of noisy images
Extended fuzzy c means clustering algorithm in segmentation of noisy images
 
Dimensionality Reduction and feature extraction.pptx
Dimensionality Reduction and feature extraction.pptxDimensionality Reduction and feature extraction.pptx
Dimensionality Reduction and feature extraction.pptx
 
08 cie552 image_segmentation
08 cie552 image_segmentation08 cie552 image_segmentation
08 cie552 image_segmentation
 
PPT s08-machine vision-s2
PPT s08-machine vision-s2PPT s08-machine vision-s2
PPT s08-machine vision-s2
 
Locating texture boundaries using a fast unsupervised approach based on clust...
Locating texture boundaries using a fast unsupervised approach based on clust...Locating texture boundaries using a fast unsupervised approach based on clust...
Locating texture boundaries using a fast unsupervised approach based on clust...
 
Comparison of image segmentation
Comparison of image segmentationComparison of image segmentation
Comparison of image segmentation
 

Sunum

  • 1. Comparative Analysis of Novel Boundary Detection Methods ANIL ULAŞ KOÇAK - OSMAN BUĞRA SARICA Image Processing Course, 2016, Ankara, Turkey 1
  • 2. Plan What is Boundary Detection? Related Works Boundary Detection Methods  Sketch Tokens  Structured Random Forest  Oriented Edge Forest  Crisp Boundary Detection Comparison Results 2
  • 3. Boundary detection 3  Reduce dimensionality of data  Preserve content information  Useful in applications such as:
  • 4. Boundary detection 4  Boundary detection is usually formulated as a per-pixel classification problem  How to extract discriminative boundary features?  How to learn a efficient boundary classifier?
  • 7. Related Works 7 Image Optimized Cues Boundary Strength Brightness Color Texture Benchmark Human Segmentations Cue Combination Model r (x,y)
  • 12. Boundary detection methods Learning based contour detection:  Sketch Tokens: A Learned Mid-Level Representation for Contour and Object Detection  Structured Forests for Fast Edge Detection  Oriented Edge Forest for Boundary Detection Measuring rarity based on pointwise mutual information:  Crisp Boundary Detection Using Pointwise Mutual Information 12
  • 14. Sketch tokens A novel approach to both learning and detecting local edge- based mid-level features. Having 2 main parts; ◦ Defining Sketch Tokens Classes ◦ defining clasess by clustering of the patches sampled from the human generated hand drawn images ◦ Detecting Sketch Tokens ◦ Training a random forest classifier with feature information from training images 14
  • 15. Sketch tokens Defining Sketch Tokens Classes Goal is to define a set of token classes that represent the wide variety of local edge structures that may exist in an image. 15
  • 16. Sketch tokens Defining Sketch Tokens Classes ◦ It is defined by clustering of the patches sampled from the human generated hand drawn images (novel approach) ◦ Only patches that contain a labeled contour at the center pixel are used (35x35) 16
  • 17. Sketch tokens Detecting Sketch Tokens ◦ Feature Extraction ◦ Integral Channel Features ◦ Self Similarity Features ◦ Classification ◦ Using Random Forest Classifier to predict sketch tokens 17
  • 18. Sketch tokens Integral Channel Features  Color (3 channels)  Gradients (3 unoriented + 8 oriented channels) • Sigma = 0, Theta = 0, pi/2, pi, 3pi/2 • Sigma = 1.5, Theta = 0, pi/2, pi, 3pi/2 • Sigma = 5 18
  • 19. Sketch tokens 19 Self-similarity features: The L1 distance from the yellow box to the other 5x5 cells are shown for color and gradient magnitude channels.
  • 20. Sketch tokens 20 Learning  Random Forest Classifiers. Advantages:  Fast at test time, especially for a nonlinear classifier.  Dont have to explicitly compute independent descriptors for every patch. Just look up what the decision tree wants to know at each branch.
  • 21. Sketch tokens Detections of individual sketch tokens 21 Combining sketch token detections: Random forest classifier predicts the probability that an image patch belongs to each token class or the negative set.
  • 22. Crisp boundary detection Main Idea: pixels belonging to the same object show higher statistical dependencies than pixels belonging to the different object.
  • 23. Crisp boundary detection black-next-to-white occurs over and over again. This pattern shows up in the image’s statistics as a suspicious coincidence — these colors must be part of the same object! 23
  • 24. Crisp boundary detection Pointwise Mutual Information (PMI) is used to obtain statistical association between two pixel and get affinity measure, so can be predicted whether or not two pixel lie on same object. 24
  • 25. Crisp boundary detection Method measures how often each color A occurs next to each color B within the image. 25
  • 26. Crisp boundary detection Accurate Result ◦ Thanks to good prediction of PMI distinguishing between boundary and non boundary is provided. Crisp Result ◦ Highly localized features (only color and color variance information is used in 3x3 window) 26
  • 27. Structured forests 27 Idea  It is an enhancement from Sketch token  Using the edge structure directly instead of using predefined label  It might allows it to learn more subtle variations in edge structure and leads to a more accurate and efficient algorithm. Structured learning with random forest  It did not cluster the edges before training the tree  If the splitting function could work with the edge structure directly, then the random forest algorithm could work in original way
  • 28. Structured forests How to represent the edge structure?
  • 29. Difference 2 Difference 1 Structured forests 29 Structured random forest  Slightly modified from normal random forest Clustering the incoming patches into 2 groups based on the Z representation. Now the patches have labels! Thus we could still calculate the information gain to choose the feature f and threshold t. The representative edge in the leaf node is chosen by:  The edge structure whose Z representation is the medoid
  • 30. Structured forests 30 Generate the edge map.  Each prediction is a voting. The pixel with higher votes has higher confidence of being an edge
  • 32. Oriented edge forest 32 Once a forest is trained to recognize oriented edge patterns, it is applied over the input image in scanning- window fashion. From a test patch x each tree produces a distribution which are then combined either by averaging (accuracy) or voting (speed, sparsity).  where 1 is the indicator function.
  • 33. Oriented edge forest 33  Method provide a simple procedure for calibrating forest- generated posterior probabilities.
  • 34. Oriented edge forest 34 The forest produces calibrated distributions at every spatial position. To derive edge strengths we composite sharpened edge masks into the image, weighted by their posterior probabilities.
  • 35. Experiments IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015 5 10 test images from BSDS 500 P-R graph from BSDS Bechmark Low powered PC
  • 36. Evaluation Metric IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015 5 Precision-Recall Curve  Precision = 𝑇𝑝 𝑇𝑝+𝐹𝑝 How many output results are relevant ?  Recall = 𝑇𝑝 𝑇𝑝+𝐹𝑛 How many relevant output results are selected ? Time cost  Measuring time during selected scenario. Complexity  Having complex or simple understanding
  • 41. Comparison Statistical Learning Based Contribution Complexity Novel Boundary Detectors Sketch Tokens Random Forest Simple Crisp Boundary Detection PMI Simple Structured Forests Struct Output Domain Complex Oriented Edge Forest A simple Output Space Complex IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015 4
  • 42. Results 23 Original Sketch Tokens Crisp Boundary Structured Forest OEF
  • 43. Conclusion IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015 24 Method Pros Cons Sketch Tokens Having Inovative approach Relatively Slow and Average Success Crisp B. D. Great Success, Simple approach Slow Structured Good success on performance and time cost Complex Oriented Having relatively good time cost Not successful by year , Complex

Editor's Notes

  1. Canny : Muhtemelen computer visionda en çok kullanılan edge detector. Filter image with x, y derivatives of Gaussian Find magnitude and orientation of gradient Non maximum suppression: Thin multi-pixel wide «ridges» down to single width Thresholding and linking(hysteresis): Define two thresholds: low and high Use the high threshold to start edge curves and the low threshold to continue them.
  2. Goal: “use features extracted from such an image patch to estimate the posterior probability of a boundary passing through the center point” Use cues such as intensity, brightness, color and texture to get a measure of boundary strength How to combine cues? It’s a supervised learning problem. Learn an optimal local boundary model from labeled images Approach: look at each pixel for local discontinuities in several feature channels, over a range of orientations and scales
  3. To compute saliency, several bottom-up cues have been used in previous literature: One of them is the contrast/rarity cue, that assumes rare appearance tends to be more saliency Another one, called Image Boundary Connectivity cue, has been used in many recent methods. It assumes that background elements are usually extends beyond the image boundary, and thus connected to the image boundary. In contrast, salient objects are often centered and have closed boundaries.
  4. To compute saliency, several bottom-up cues have been used in previous literature: One of them is the contrast/rarity cue, that assumes rare appearance tends to be more saliency Another one, called Image Boundary Connectivity cue, has been used in many recent methods. It assumes that background elements are usually extends beyond the image boundary, and thus connected to the image boundary. In contrast, salient objects are often centered and have closed boundaries.
  5. To compute saliency, several bottom-up cues have been used in previous literature: One of them is the contrast/rarity cue, that assumes rare appearance tends to be more saliency Another one, called Image Boundary Connectivity cue, has been used in many recent methods. It assumes that background elements are usually extends beyond the image boundary, and thus connected to the image boundary. In contrast, salient objects are often centered and have closed boundaries.
  6. To compute saliency, several bottom-up cues have been used in previous literature: One of them is the contrast/rarity cue, that assumes rare appearance tends to be more saliency Another one, called Image Boundary Connectivity cue, has been used in many recent methods. It assumes that background elements are usually extends beyond the image boundary, and thus connected to the image boundary. In contrast, salient objects are often centered and have closed boundaries.
  7. To compute saliency, several bottom-up cues have been used in previous literature: One of them is the contrast/rarity cue, that assumes rare appearance tends to be more saliency Another one, called Image Boundary Connectivity cue, has been used in many recent methods. It assumes that background elements are usually extends beyond the image boundary, and thus connected to the image boundary. In contrast, salient objects are often centered and have closed boundaries.
  8. To compute saliency, several bottom-up cues have been used in previous literature: One of them is the contrast/rarity cue, that assumes rare appearance tends to be more saliency Another one, called Image Boundary Connectivity cue, has been used in many recent methods. It assumes that background elements are usually extends beyond the image boundary, and thus connected to the image boundary. In contrast, salient objects are often centered and have closed boundaries.
  9. To compute saliency, several bottom-up cues have been used in previous literature: One of them is the contrast/rarity cue, that assumes rare appearance tends to be more saliency Another one, called Image Boundary Connectivity cue, has been used in many recent methods. It assumes that background elements are usually extends beyond the image boundary, and thus connected to the image boundary. In contrast, salient objects are often centered and have closed boundaries.
  10. To compute saliency, several bottom-up cues have been used in previous literature: One of them is the contrast/rarity cue, that assumes rare appearance tends to be more saliency Another one, called Image Boundary Connectivity cue, has been used in many recent methods. It assumes that background elements are usually extends beyond the image boundary, and thus connected to the image boundary. In contrast, salient objects are often centered and have closed boundaries.
  11. To compute saliency, several bottom-up cues have been used in previous literature: One of them is the contrast/rarity cue, that assumes rare appearance tends to be more saliency Another one, called Image Boundary Connectivity cue, has been used in many recent methods. It assumes that background elements are usually extends beyond the image boundary, and thus connected to the image boundary. In contrast, salient objects are often centered and have closed boundaries.
  12. To compute saliency, several bottom-up cues have been used in previous literature: One of them is the contrast/rarity cue, that assumes rare appearance tends to be more saliency Another one, called Image Boundary Connectivity cue, has been used in many recent methods. It assumes that background elements are usually extends beyond the image boundary, and thus connected to the image boundary. In contrast, salient objects are often centered and have closed boundaries.
  13. To compute saliency, several bottom-up cues have been used in previous literature: One of them is the contrast/rarity cue, that assumes rare appearance tends to be more saliency Another one, called Image Boundary Connectivity cue, has been used in many recent methods. It assumes that background elements are usually extends beyond the image boundary, and thus connected to the image boundary. In contrast, salient objects are often centered and have closed boundaries.
  14. To compute saliency, several bottom-up cues have been used in previous literature: One of them is the contrast/rarity cue, that assumes rare appearance tends to be more saliency Another one, called Image Boundary Connectivity cue, has been used in many recent methods. It assumes that background elements are usually extends beyond the image boundary, and thus connected to the image boundary. In contrast, salient objects are often centered and have closed boundaries.
  15. To compute saliency, several bottom-up cues have been used in previous literature: One of them is the contrast/rarity cue, that assumes rare appearance tends to be more saliency Another one, called Image Boundary Connectivity cue, has been used in many recent methods. It assumes that background elements are usually extends beyond the image boundary, and thus connected to the image boundary. In contrast, salient objects are often centered and have closed boundaries.
  16. To compute saliency, several bottom-up cues have been used in previous literature: One of them is the contrast/rarity cue, that assumes rare appearance tends to be more saliency Another one, called Image Boundary Connectivity cue, has been used in many recent methods. It assumes that background elements are usually extends beyond the image boundary, and thus connected to the image boundary. In contrast, salient objects are often centered and have closed boundaries.
  17. To compute saliency, several bottom-up cues have been used in previous literature: One of them is the contrast/rarity cue, that assumes rare appearance tends to be more saliency Another one, called Image Boundary Connectivity cue, has been used in many recent methods. It assumes that background elements are usually extends beyond the image boundary, and thus connected to the image boundary. In contrast, salient objects are often centered and have closed boundaries.
  18. To compute saliency, several bottom-up cues have been used in previous literature: One of them is the contrast/rarity cue, that assumes rare appearance tends to be more saliency Another one, called Image Boundary Connectivity cue, has been used in many recent methods. It assumes that background elements are usually extends beyond the image boundary, and thus connected to the image boundary. In contrast, salient objects are often centered and have closed boundaries.
  19. To compute saliency, several bottom-up cues have been used in previous literature: One of them is the contrast/rarity cue, that assumes rare appearance tends to be more saliency Another one, called Image Boundary Connectivity cue, has been used in many recent methods. It assumes that background elements are usually extends beyond the image boundary, and thus connected to the image boundary. In contrast, salient objects are often centered and have closed boundaries.
  20. To compute saliency, several bottom-up cues have been used in previous literature: One of them is the contrast/rarity cue, that assumes rare appearance tends to be more saliency Another one, called Image Boundary Connectivity cue, has been used in many recent methods. It assumes that background elements are usually extends beyond the image boundary, and thus connected to the image boundary. In contrast, salient objects are often centered and have closed boundaries.
  21. To compute saliency, several bottom-up cues have been used in previous literature: One of them is the contrast/rarity cue, that assumes rare appearance tends to be more saliency Another one, called Image Boundary Connectivity cue, has been used in many recent methods. It assumes that background elements are usually extends beyond the image boundary, and thus connected to the image boundary. In contrast, salient objects are often centered and have closed boundaries.
  22. The image boundary connectivity cue has been proved very effective through several state-of-the-art methods
  23. To compute saliency, several bottom-up cues have been used in previous literature: One of them is the contrast/rarity cue, that assumes rare appearance tends to be more saliency Another one, called Image Boundary Connectivity cue, has been used in many recent methods. It assumes that background elements are usually extends beyond the image boundary, and thus connected to the image boundary. In contrast, salient objects are often centered and have closed boundaries.
  24. The image boundary connectivity cue has been proved very effective through several state-of-the-art methods
  25. The image boundary connectivity cue has been proved very effective through several state-of-the-art methods
  26. The image boundary connectivity cue has been proved very effective through several state-of-the-art methods
  27. The image boundary connectivity cue has been proved very effective through several state-of-the-art methods
  28. The image boundary connectivity cue has been proved very effective through several state-of-the-art methods
  29. The image boundary connectivity cue has been proved very effective through several state-of-the-art methods
  30. Given an input image: For each channel in the Lab color space: Apply Minimum Barrier Distance transform to compute MBD maps, which measure image boundary connectivity of each pixel Average the MBD maps of the color channels Apply postprocessing to improve the saliency map quality for object segmentation Optionally, we can further enhance the saliency map by leveraging the Backgroundness cue at a moderately increased cost. Backgroundness assumes image boundary regions are mostly background.
  31. Given an input image: For each channel in the Lab color space: Apply Minimum Barrier Distance transform to compute MBD maps, which measure image boundary connectivity of each pixel Average the MBD maps of the color channels Apply postprocessing to improve the saliency map quality for object segmentation Optionally, we can further enhance the saliency map by leveraging the Backgroundness cue at a moderately increased cost. Backgroundness assumes image boundary regions are mostly background.
  32. How to measure image boundary connectivity by distance transform: Set image boundary pixels as seed set (show in red) For each pixel (show in greed), find the shortest path (show in grey) to the seed set, according to the given path cost function The cost of the shortest path is the distance between green and red.
  33. How to measure image boundary connectivity by distance transform: Set image boundary pixels as seed set (show in red) For each pixel (show in greed), find the shortest path (show in grey) to the seed set, according to the given path cost function The cost of the shortest path is the distance between green and red.
  34. Most of the state-of-the-art methods requires super-pixel representation of the image. Computation of the super-pixel becomes the speed bottleneck.
  35. Some sample saliency maps The baseline GD, which uses Geodesic Distance, often produces fuzzy central area.