SlideShare a Scribd company logo
1 of 11
Download to read offline
Object Detection using HoG Features for Visual Situation
Recognition
Efsun Sarioglu Kayi, Melanie Mitchell
Portland State University
February 5, 2018
Abstract
This project investigates a novel approach to building computer systems that can recognize
visual situations. While much effort in computer vision has focused on identifying isolated
objects in images, what people actually do is recognize coherent situations — collections of
objects and their interrelations that, taken together, correspond to a known concept, such as
"dog-walking", or "a fight breaking out", or "a blind person crossing the street". Situation
recognition by humans may appear on the surface to be effortless, but it relies on a complex
dynamic interplay among human abilities to perceive objects, systems of relationships among
objects, and analogies with stored knowledge and memories. Enabling computers to flexibly
recognize visual situations would create a flood of important applications in fields as diverse as
autonomous vehicles, medical diagnosis, interpretation of scientific imagery, enhanced human-
computer interaction, and personal information organization.
1 Introduction
Two previously studied approaches, brain-inspired neural networks for lower-level vision and cognitive-
level models of concepts and analogy-making can be integrated for recognizing situations. System
architecture is depicted in Figure 1. For Dog-Walking situation, there are three objects: Person,
Dog and Leash. The relationships between them can be analyzed through Concept Network. A
concept network for Dog-Walking situation is given in Figure 2. In this project, we focus on lower-
level vision for efficient and effective object detection which would feed into cognitive-models for
visual situation recognition.
2 Dataset
Dog-walking dataset consists of photographs, each of which is an instance of the visual concept (or
situation) "Dog-Walking”. The dog-walkers, dogs, and leashes labeled with ground truth bounding
boxes. The number of files for each class is listed in Table 3.
2.1 Preprocessing
Label file of each image has the location of objects in that image. The negative crops are randomly
selected from the remaining parts of the image with similar size of the positive crop. Cropped
images are resized to a fixed width and height to make them comparable. For each class, 10%,
20%, 30%, 40%, 50% percentiles of width and height are used as resize options.
3 Experiments
Histograms of Oriented Gradients (HOG) [DT05] are edge orientation histograms computed on a
dense grid of uniformly spaced cells. They are computed using MATLAB’s extractHOGFeatures
[ext] function on overlapping local contrast normalizations for improved performance. Figure 4
shows an image and its corresponding HOG features. Tables 5,6,7 show number of HOG features
for different resize options for Dog, Leash and Person classes respectively. Tables 5, 6, and 7
1
Figure 1: Situate Architecture
Figure 2: Concept Network
Figure 3: Dog-Walking dataset
2
Figure 4: HOG Features [ext]
Figure 5: HOG features for Dog class
lists resize options and the corresponding feature numbers for Dog, Leash and Person classes
respectively.
Library for Support Vector Machines (LIBSVM) MATLAB extension is used [CL11]. Using
the images resized to their average width and height, SVM is trained using 50% of the data and
tested on the remaining 50%. The ROC graphs using decision values are displayed in Figures 8, 9
and 10 corresponding to Dog, Leash and Person classes. Corresponding AUC values are listed in
Table 11.
3.1 SVM Feature Selection
Once a classifier is built using SVM, its features can be selected based on their weights w (Recursive
Feature Elimination (RFE) [GWBV02]. Since number of features is high, instead of removing
features one by one in a recursive fashion, its top features were selected based on w2
. A new classifier
was trained using the selected features and ROC and AUC results were computed for the test
dataset. Using 50% training and 50% testing data and resized images of size 10%. Corresponding
AUC values are displayed in Figures 12, 13 and 14 and listed in Table 15. ROC plots with feature
selections are shown in Figures 16, 17, and 18.
3
Figure 6: HOG features for Leash class
Figure 7: HOG features for Person class
Figure 8: ROC plot for Dog class
4
Figure 9: ROC plot for Leash class
Figure 10: ROC plot for Person class
Figure 11: AUC values
5
Figure 12: AUC values for Dog class with different number of features
Figure 13: AUC values for Leash class with different number of features
6
Figure 14: AUC values for Person class with different number of features
Figure 15: AUC values with feature selection
Figure 16: ROC graph for Dog class using top 20% features
7
Figure 17: ROC graph for Leash class using top 20% features
Figure 18: ROC graph for Person class using top 20% features
8
Figure 19: AUC values using top HOG features with different cell sizes for Dog Class
Figure 20: AUC values using top HOG features with different cell sizes for Leash Class
3.2 Effect of HOG Cell Size
HOG features are computed on given cell sizes in pixels. The default size is 8×8. We also explored
2×2 and 4×4 cell sizes to capture small-scale details. In addition, we evaluated combination of
them as below:
• 2×2 + 4×4
• 2×2 + 8×8
• 4×4 + 8×8
• 2×2 + 4×4 + 8×8
Smaller cell sizes also correspond to bigger feature sizes. Accordingly, we used the same SVM
feature selection mechanism explained in Section 3.1. Using 50% training and 50% testing data
with images resized to 10% of their class sizes, SVM was trained with HOG features with different
cell sizes. Figures 19, 20, and 21 show the AUC values for Dog, Leash and Person classes for
different cell sizes and top selected features. Tables 22, 23, and 24 lists the AUC values for Dog,
Leash and Person classes using different cell sizes and top selected features.
9
Figure 21: AUC values using top HOG features with different cell sizes for Person Class
Figure 22: AUC values for Dog class for different cell sizes and top selected features
Figure 23: AUC values for Leash class for different cell sizes and top selected features
Figure 24: AUC values for Person class for different cell sizes and top selected features
10
4 Conclusion
We have shown effective and efficient object detection using HOG features and SVM for Dog-
Walking situation. Using SVM feature selection techniques, top 10 to 20% of top features performed
as well as using the entire dataset. However, using different cell sizes for HOG did not make much
difference in classification performance.
References
[CL11] Chih-Chung Chang and Chih-Jen Lin. LIBSVM: A library for support vector ma-
chines. ACM Transactions on Intelligent Systems and Technology, 2:27:1–27:27, 2011.
Software available at http://www.csie.ntu.edu.tw/~cjlin/libsvm.
[DT05] Navneet Dalal and Bill Triggs. Histograms of oriented gradients for human detection.
In Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision
and Pattern Recognition (CVPR’05) - Volume 1 - Volume 01, CVPR ’05, pages 886–
893, Washington, DC, USA, 2005. IEEE Computer Society.
[ext] MATLAB extracthogfeatures. https://www.mathworks.com/help/vision/ref/extracthogfeatures.html.
[GWBV02] Isabelle Guyon, Jason Weston, Stephen Barnhill, and Vladimir Vapnik. Gene selection
for cancer classification using support vector machines. Mach. Learn., 46(1-3):389–422,
March 2002.
11

More Related Content

What's hot

IRJET- Proposed System for Animal Recognition using Image Processing
IRJET-  	  Proposed System for Animal Recognition using Image ProcessingIRJET-  	  Proposed System for Animal Recognition using Image Processing
IRJET- Proposed System for Animal Recognition using Image ProcessingIRJET Journal
 
Avanced Image Classification
Avanced Image ClassificationAvanced Image Classification
Avanced Image ClassificationBayes Ahmed
 
IMDB Movie Dataset Analysis
IMDB Movie Dataset AnalysisIMDB Movie Dataset Analysis
IMDB Movie Dataset AnalysisAaron McClellan
 
Exploration of Imputation Methods for Missingness in Image Segmentation
Exploration of Imputation Methods for Missingness in Image SegmentationExploration of Imputation Methods for Missingness in Image Segmentation
Exploration of Imputation Methods for Missingness in Image SegmentationChristopher Peter Makris
 
Textural Feature Extraction of Natural Objects for Image Classification
Textural Feature Extraction of Natural Objects for Image ClassificationTextural Feature Extraction of Natural Objects for Image Classification
Textural Feature Extraction of Natural Objects for Image ClassificationCSCJournals
 
Energy minimization based spatially
Energy minimization based spatiallyEnergy minimization based spatially
Energy minimization based spatiallysipij
 
CROWD ANALYSIS WITH FISH EYE CAMERA
CROWD ANALYSIS WITH FISH EYE CAMERACROWD ANALYSIS WITH FISH EYE CAMERA
CROWD ANALYSIS WITH FISH EYE CAMERAijaceeejournal
 
Classification of remotely sensed images using the gene sis fuzzy segmentatio...
Classification of remotely sensed images using the gene sis fuzzy segmentatio...Classification of remotely sensed images using the gene sis fuzzy segmentatio...
Classification of remotely sensed images using the gene sis fuzzy segmentatio...I3E Technologies
 
Face Recognition Based Intelligent Door Control System
Face Recognition Based Intelligent Door Control SystemFace Recognition Based Intelligent Door Control System
Face Recognition Based Intelligent Door Control Systemijtsrd
 
Can computers count bacteria? Using macro-programming as a tool to improve sp...
Can computers count bacteria? Using macro-programming as a tool to improve sp...Can computers count bacteria? Using macro-programming as a tool to improve sp...
Can computers count bacteria? Using macro-programming as a tool to improve sp...MACE Lab
 
Quantitative Analysis of 3D Refractive Index Maps
Quantitative Analysis of 3D Refractive Index MapsQuantitative Analysis of 3D Refractive Index Maps
Quantitative Analysis of 3D Refractive Index MapsMathieuFRECHIN
 
Data analysis_PredictingActivity_SamsungSensorData
Data analysis_PredictingActivity_SamsungSensorDataData analysis_PredictingActivity_SamsungSensorData
Data analysis_PredictingActivity_SamsungSensorDataKaren Yang
 
An automated approach for the recognition of bengali license plates presentation
An automated approach for the recognition of bengali license plates presentationAn automated approach for the recognition of bengali license plates presentation
An automated approach for the recognition of bengali license plates presentationMD Abdullah Al Nasim
 
[Paper Review] MisGAN: Learning from Incomplete Data with Generative Adversar...
[Paper Review] MisGAN: Learning from Incomplete Data with Generative Adversar...[Paper Review] MisGAN: Learning from Incomplete Data with Generative Adversar...
[Paper Review] MisGAN: Learning from Incomplete Data with Generative Adversar...Jihoo Kim
 
Detection of medical instruments project- PART 2
Detection of medical instruments project- PART 2Detection of medical instruments project- PART 2
Detection of medical instruments project- PART 2Sairam Adithya
 
A Review on Label Image Constrained Multiatlas Selection
A Review on Label Image Constrained Multiatlas SelectionA Review on Label Image Constrained Multiatlas Selection
A Review on Label Image Constrained Multiatlas SelectionIRJET Journal
 

What's hot (18)

Second subjective assignment
Second  subjective assignmentSecond  subjective assignment
Second subjective assignment
 
IRJET- Proposed System for Animal Recognition using Image Processing
IRJET-  	  Proposed System for Animal Recognition using Image ProcessingIRJET-  	  Proposed System for Animal Recognition using Image Processing
IRJET- Proposed System for Animal Recognition using Image Processing
 
Avanced Image Classification
Avanced Image ClassificationAvanced Image Classification
Avanced Image Classification
 
IMDB Movie Dataset Analysis
IMDB Movie Dataset AnalysisIMDB Movie Dataset Analysis
IMDB Movie Dataset Analysis
 
Exploration of Imputation Methods for Missingness in Image Segmentation
Exploration of Imputation Methods for Missingness in Image SegmentationExploration of Imputation Methods for Missingness in Image Segmentation
Exploration of Imputation Methods for Missingness in Image Segmentation
 
Textural Feature Extraction of Natural Objects for Image Classification
Textural Feature Extraction of Natural Objects for Image ClassificationTextural Feature Extraction of Natural Objects for Image Classification
Textural Feature Extraction of Natural Objects for Image Classification
 
Energy minimization based spatially
Energy minimization based spatiallyEnergy minimization based spatially
Energy minimization based spatially
 
CROWD ANALYSIS WITH FISH EYE CAMERA
CROWD ANALYSIS WITH FISH EYE CAMERACROWD ANALYSIS WITH FISH EYE CAMERA
CROWD ANALYSIS WITH FISH EYE CAMERA
 
Classification of remotely sensed images using the gene sis fuzzy segmentatio...
Classification of remotely sensed images using the gene sis fuzzy segmentatio...Classification of remotely sensed images using the gene sis fuzzy segmentatio...
Classification of remotely sensed images using the gene sis fuzzy segmentatio...
 
Face Recognition Based Intelligent Door Control System
Face Recognition Based Intelligent Door Control SystemFace Recognition Based Intelligent Door Control System
Face Recognition Based Intelligent Door Control System
 
Can computers count bacteria? Using macro-programming as a tool to improve sp...
Can computers count bacteria? Using macro-programming as a tool to improve sp...Can computers count bacteria? Using macro-programming as a tool to improve sp...
Can computers count bacteria? Using macro-programming as a tool to improve sp...
 
An Image Segmentation and Classification for Brain Tumor Detection using Pill...
An Image Segmentation and Classification for Brain Tumor Detection using Pill...An Image Segmentation and Classification for Brain Tumor Detection using Pill...
An Image Segmentation and Classification for Brain Tumor Detection using Pill...
 
Quantitative Analysis of 3D Refractive Index Maps
Quantitative Analysis of 3D Refractive Index MapsQuantitative Analysis of 3D Refractive Index Maps
Quantitative Analysis of 3D Refractive Index Maps
 
Data analysis_PredictingActivity_SamsungSensorData
Data analysis_PredictingActivity_SamsungSensorDataData analysis_PredictingActivity_SamsungSensorData
Data analysis_PredictingActivity_SamsungSensorData
 
An automated approach for the recognition of bengali license plates presentation
An automated approach for the recognition of bengali license plates presentationAn automated approach for the recognition of bengali license plates presentation
An automated approach for the recognition of bengali license plates presentation
 
[Paper Review] MisGAN: Learning from Incomplete Data with Generative Adversar...
[Paper Review] MisGAN: Learning from Incomplete Data with Generative Adversar...[Paper Review] MisGAN: Learning from Incomplete Data with Generative Adversar...
[Paper Review] MisGAN: Learning from Incomplete Data with Generative Adversar...
 
Detection of medical instruments project- PART 2
Detection of medical instruments project- PART 2Detection of medical instruments project- PART 2
Detection of medical instruments project- PART 2
 
A Review on Label Image Constrained Multiatlas Selection
A Review on Label Image Constrained Multiatlas SelectionA Review on Label Image Constrained Multiatlas Selection
A Review on Label Image Constrained Multiatlas Selection
 

Similar to Object Detection using HoG Features for Visual Situation Recognition

A NOVEL IMAGE SEGMENTATION ENHANCEMENT TECHNIQUE BASED ON ACTIVE CONTOUR AND...
A NOVEL IMAGE SEGMENTATION ENHANCEMENT TECHNIQUE BASED ON  ACTIVE CONTOUR AND...A NOVEL IMAGE SEGMENTATION ENHANCEMENT TECHNIQUE BASED ON  ACTIVE CONTOUR AND...
A NOVEL IMAGE SEGMENTATION ENHANCEMENT TECHNIQUE BASED ON ACTIVE CONTOUR AND...acijjournal
 
A Novel GA-SVM Model For Vehicles And Pedestrial Classification In Videos
A Novel GA-SVM Model For Vehicles And Pedestrial Classification In VideosA Novel GA-SVM Model For Vehicles And Pedestrial Classification In Videos
A Novel GA-SVM Model For Vehicles And Pedestrial Classification In Videosijtsrd
 
IRJET- Digital Image Forgery Detection using Local Binary Patterns (LBP) and ...
IRJET- Digital Image Forgery Detection using Local Binary Patterns (LBP) and ...IRJET- Digital Image Forgery Detection using Local Binary Patterns (LBP) and ...
IRJET- Digital Image Forgery Detection using Local Binary Patterns (LBP) and ...IRJET Journal
 
自然方策勾配法の基礎と応用
自然方策勾配法の基礎と応用自然方策勾配法の基礎と応用
自然方策勾配法の基礎と応用Ryo Iwaki
 
Paper Presentation on CATTLE BREED DETECTION AND CATEGORIZATION USING IMAGE P...
Paper Presentation on CATTLE BREED DETECTION AND CATEGORIZATION USING IMAGE P...Paper Presentation on CATTLE BREED DETECTION AND CATEGORIZATION USING IMAGE P...
Paper Presentation on CATTLE BREED DETECTION AND CATEGORIZATION USING IMAGE P...NeilDuraiswami
 
GROUPING OBJECTS BASED ON THEIR APPEARANCE
GROUPING OBJECTS BASED ON THEIR APPEARANCEGROUPING OBJECTS BASED ON THEIR APPEARANCE
GROUPING OBJECTS BASED ON THEIR APPEARANCEijaia
 
researchpaper_2023_Skin_Csdbjsjvnvsdnfvancer.pdf
researchpaper_2023_Skin_Csdbjsjvnvsdnfvancer.pdfresearchpaper_2023_Skin_Csdbjsjvnvsdnfvancer.pdf
researchpaper_2023_Skin_Csdbjsjvnvsdnfvancer.pdfAvijitChaudhuri3
 
MediaEval 2015 - UNED-UV @ Retrieving Diverse Social Images Task - Poster
MediaEval 2015 - UNED-UV @ Retrieving Diverse Social Images Task - PosterMediaEval 2015 - UNED-UV @ Retrieving Diverse Social Images Task - Poster
MediaEval 2015 - UNED-UV @ Retrieving Diverse Social Images Task - Postermultimediaeval
 
Gesture Recognition Review: A Survey of Various Gesture Recognition Algorithms
Gesture Recognition Review: A Survey of Various Gesture Recognition AlgorithmsGesture Recognition Review: A Survey of Various Gesture Recognition Algorithms
Gesture Recognition Review: A Survey of Various Gesture Recognition AlgorithmsIJRES Journal
 
Brain Tumor Classification using Support Vector Machine
Brain Tumor Classification using Support Vector MachineBrain Tumor Classification using Support Vector Machine
Brain Tumor Classification using Support Vector MachineIRJET Journal
 
A Novel Efficient Medical Image Segmentation Methodology
A Novel Efficient Medical Image Segmentation MethodologyA Novel Efficient Medical Image Segmentation Methodology
A Novel Efficient Medical Image Segmentation Methodologyaciijournal
 
AUTOMATIC FRUIT RECOGNITION BASED ON DCNN FOR COMMERCIAL SOURCE TRACE SYSTEM
AUTOMATIC FRUIT RECOGNITION BASED ON DCNN FOR COMMERCIAL SOURCE TRACE SYSTEMAUTOMATIC FRUIT RECOGNITION BASED ON DCNN FOR COMMERCIAL SOURCE TRACE SYSTEM
AUTOMATIC FRUIT RECOGNITION BASED ON DCNN FOR COMMERCIAL SOURCE TRACE SYSTEMijcsa
 
Multimodal Biometrics at Feature Level Fusion using Texture Features
Multimodal Biometrics at Feature Level Fusion using Texture FeaturesMultimodal Biometrics at Feature Level Fusion using Texture Features
Multimodal Biometrics at Feature Level Fusion using Texture FeaturesCSCJournals
 
CELL TRACKING QUALITY COMPARISON BETWEEN ACTIVE SHAPE MODEL (ASM) AND ACTIVE ...
CELL TRACKING QUALITY COMPARISON BETWEEN ACTIVE SHAPE MODEL (ASM) AND ACTIVE ...CELL TRACKING QUALITY COMPARISON BETWEEN ACTIVE SHAPE MODEL (ASM) AND ACTIVE ...
CELL TRACKING QUALITY COMPARISON BETWEEN ACTIVE SHAPE MODEL (ASM) AND ACTIVE ...ijitcs
 
ON THE PREDICTION ACCURACIES OF THREE MOST KNOWN REGULARIZERS : RIDGE REGRESS...
ON THE PREDICTION ACCURACIES OF THREE MOST KNOWN REGULARIZERS : RIDGE REGRESS...ON THE PREDICTION ACCURACIES OF THREE MOST KNOWN REGULARIZERS : RIDGE REGRESS...
ON THE PREDICTION ACCURACIES OF THREE MOST KNOWN REGULARIZERS : RIDGE REGRESS...ijaia
 
IRJET - Movie Genre Prediction from Plot Summaries by Comparing Various C...
IRJET -  	  Movie Genre Prediction from Plot Summaries by Comparing Various C...IRJET -  	  Movie Genre Prediction from Plot Summaries by Comparing Various C...
IRJET - Movie Genre Prediction from Plot Summaries by Comparing Various C...IRJET Journal
 
UNSUPERVISED ROBOTIC SORTING: TOWARDS AUTONOMOUS DECISION MAKING ROBOTS
UNSUPERVISED ROBOTIC SORTING: TOWARDS AUTONOMOUS DECISION MAKING ROBOTSUNSUPERVISED ROBOTIC SORTING: TOWARDS AUTONOMOUS DECISION MAKING ROBOTS
UNSUPERVISED ROBOTIC SORTING: TOWARDS AUTONOMOUS DECISION MAKING ROBOTSijaia
 
Application of optimization algorithms for classification problem
Application of optimization algorithms for classification  problemApplication of optimization algorithms for classification  problem
Application of optimization algorithms for classification problemIJECEIAES
 
A Modified KS-test for Feature Selection
A Modified KS-test for Feature SelectionA Modified KS-test for Feature Selection
A Modified KS-test for Feature SelectionIOSR Journals
 
MR Image Compression Based on Selection of Mother Wavelet and Lifting Based W...
MR Image Compression Based on Selection of Mother Wavelet and Lifting Based W...MR Image Compression Based on Selection of Mother Wavelet and Lifting Based W...
MR Image Compression Based on Selection of Mother Wavelet and Lifting Based W...ijma
 

Similar to Object Detection using HoG Features for Visual Situation Recognition (20)

A NOVEL IMAGE SEGMENTATION ENHANCEMENT TECHNIQUE BASED ON ACTIVE CONTOUR AND...
A NOVEL IMAGE SEGMENTATION ENHANCEMENT TECHNIQUE BASED ON  ACTIVE CONTOUR AND...A NOVEL IMAGE SEGMENTATION ENHANCEMENT TECHNIQUE BASED ON  ACTIVE CONTOUR AND...
A NOVEL IMAGE SEGMENTATION ENHANCEMENT TECHNIQUE BASED ON ACTIVE CONTOUR AND...
 
A Novel GA-SVM Model For Vehicles And Pedestrial Classification In Videos
A Novel GA-SVM Model For Vehicles And Pedestrial Classification In VideosA Novel GA-SVM Model For Vehicles And Pedestrial Classification In Videos
A Novel GA-SVM Model For Vehicles And Pedestrial Classification In Videos
 
IRJET- Digital Image Forgery Detection using Local Binary Patterns (LBP) and ...
IRJET- Digital Image Forgery Detection using Local Binary Patterns (LBP) and ...IRJET- Digital Image Forgery Detection using Local Binary Patterns (LBP) and ...
IRJET- Digital Image Forgery Detection using Local Binary Patterns (LBP) and ...
 
自然方策勾配法の基礎と応用
自然方策勾配法の基礎と応用自然方策勾配法の基礎と応用
自然方策勾配法の基礎と応用
 
Paper Presentation on CATTLE BREED DETECTION AND CATEGORIZATION USING IMAGE P...
Paper Presentation on CATTLE BREED DETECTION AND CATEGORIZATION USING IMAGE P...Paper Presentation on CATTLE BREED DETECTION AND CATEGORIZATION USING IMAGE P...
Paper Presentation on CATTLE BREED DETECTION AND CATEGORIZATION USING IMAGE P...
 
GROUPING OBJECTS BASED ON THEIR APPEARANCE
GROUPING OBJECTS BASED ON THEIR APPEARANCEGROUPING OBJECTS BASED ON THEIR APPEARANCE
GROUPING OBJECTS BASED ON THEIR APPEARANCE
 
researchpaper_2023_Skin_Csdbjsjvnvsdnfvancer.pdf
researchpaper_2023_Skin_Csdbjsjvnvsdnfvancer.pdfresearchpaper_2023_Skin_Csdbjsjvnvsdnfvancer.pdf
researchpaper_2023_Skin_Csdbjsjvnvsdnfvancer.pdf
 
MediaEval 2015 - UNED-UV @ Retrieving Diverse Social Images Task - Poster
MediaEval 2015 - UNED-UV @ Retrieving Diverse Social Images Task - PosterMediaEval 2015 - UNED-UV @ Retrieving Diverse Social Images Task - Poster
MediaEval 2015 - UNED-UV @ Retrieving Diverse Social Images Task - Poster
 
Gesture Recognition Review: A Survey of Various Gesture Recognition Algorithms
Gesture Recognition Review: A Survey of Various Gesture Recognition AlgorithmsGesture Recognition Review: A Survey of Various Gesture Recognition Algorithms
Gesture Recognition Review: A Survey of Various Gesture Recognition Algorithms
 
Brain Tumor Classification using Support Vector Machine
Brain Tumor Classification using Support Vector MachineBrain Tumor Classification using Support Vector Machine
Brain Tumor Classification using Support Vector Machine
 
A Novel Efficient Medical Image Segmentation Methodology
A Novel Efficient Medical Image Segmentation MethodologyA Novel Efficient Medical Image Segmentation Methodology
A Novel Efficient Medical Image Segmentation Methodology
 
AUTOMATIC FRUIT RECOGNITION BASED ON DCNN FOR COMMERCIAL SOURCE TRACE SYSTEM
AUTOMATIC FRUIT RECOGNITION BASED ON DCNN FOR COMMERCIAL SOURCE TRACE SYSTEMAUTOMATIC FRUIT RECOGNITION BASED ON DCNN FOR COMMERCIAL SOURCE TRACE SYSTEM
AUTOMATIC FRUIT RECOGNITION BASED ON DCNN FOR COMMERCIAL SOURCE TRACE SYSTEM
 
Multimodal Biometrics at Feature Level Fusion using Texture Features
Multimodal Biometrics at Feature Level Fusion using Texture FeaturesMultimodal Biometrics at Feature Level Fusion using Texture Features
Multimodal Biometrics at Feature Level Fusion using Texture Features
 
CELL TRACKING QUALITY COMPARISON BETWEEN ACTIVE SHAPE MODEL (ASM) AND ACTIVE ...
CELL TRACKING QUALITY COMPARISON BETWEEN ACTIVE SHAPE MODEL (ASM) AND ACTIVE ...CELL TRACKING QUALITY COMPARISON BETWEEN ACTIVE SHAPE MODEL (ASM) AND ACTIVE ...
CELL TRACKING QUALITY COMPARISON BETWEEN ACTIVE SHAPE MODEL (ASM) AND ACTIVE ...
 
ON THE PREDICTION ACCURACIES OF THREE MOST KNOWN REGULARIZERS : RIDGE REGRESS...
ON THE PREDICTION ACCURACIES OF THREE MOST KNOWN REGULARIZERS : RIDGE REGRESS...ON THE PREDICTION ACCURACIES OF THREE MOST KNOWN REGULARIZERS : RIDGE REGRESS...
ON THE PREDICTION ACCURACIES OF THREE MOST KNOWN REGULARIZERS : RIDGE REGRESS...
 
IRJET - Movie Genre Prediction from Plot Summaries by Comparing Various C...
IRJET -  	  Movie Genre Prediction from Plot Summaries by Comparing Various C...IRJET -  	  Movie Genre Prediction from Plot Summaries by Comparing Various C...
IRJET - Movie Genre Prediction from Plot Summaries by Comparing Various C...
 
UNSUPERVISED ROBOTIC SORTING: TOWARDS AUTONOMOUS DECISION MAKING ROBOTS
UNSUPERVISED ROBOTIC SORTING: TOWARDS AUTONOMOUS DECISION MAKING ROBOTSUNSUPERVISED ROBOTIC SORTING: TOWARDS AUTONOMOUS DECISION MAKING ROBOTS
UNSUPERVISED ROBOTIC SORTING: TOWARDS AUTONOMOUS DECISION MAKING ROBOTS
 
Application of optimization algorithms for classification problem
Application of optimization algorithms for classification  problemApplication of optimization algorithms for classification  problem
Application of optimization algorithms for classification problem
 
A Modified KS-test for Feature Selection
A Modified KS-test for Feature SelectionA Modified KS-test for Feature Selection
A Modified KS-test for Feature Selection
 
MR Image Compression Based on Selection of Mother Wavelet and Lifting Based W...
MR Image Compression Based on Selection of Mother Wavelet and Lifting Based W...MR Image Compression Based on Selection of Mother Wavelet and Lifting Based W...
MR Image Compression Based on Selection of Mother Wavelet and Lifting Based W...
 

More from Efsun Kayi

Answer Span Correction in Machine Reading Comprehension
Answer Span Correction in Machine Reading ComprehensionAnswer Span Correction in Machine Reading Comprehension
Answer Span Correction in Machine Reading ComprehensionEfsun Kayi
 
Transfer Learning for Low Resource Languages and Domains
Transfer Learning for Low Resource Languages and DomainsTransfer Learning for Low Resource Languages and Domains
Transfer Learning for Low Resource Languages and DomainsEfsun Kayi
 
Detecting Urgency Status of Crisis Tweets: A Transfer Learning Approach for L...
Detecting Urgency Status of Crisis Tweets: A Transfer Learning Approach for L...Detecting Urgency Status of Crisis Tweets: A Transfer Learning Approach for L...
Detecting Urgency Status of Crisis Tweets: A Transfer Learning Approach for L...Efsun Kayi
 
Effective Classification of Clinical Reports: Natural Language Processing-Bas...
Effective Classification of Clinical Reports: Natural Language Processing-Bas...Effective Classification of Clinical Reports: Natural Language Processing-Bas...
Effective Classification of Clinical Reports: Natural Language Processing-Bas...Efsun Kayi
 
Predictive Linguistic Features of Schizophrenia
Predictive Linguistic Features of SchizophreniaPredictive Linguistic Features of Schizophrenia
Predictive Linguistic Features of SchizophreniaEfsun Kayi
 

More from Efsun Kayi (6)

Answer Span Correction in Machine Reading Comprehension
Answer Span Correction in Machine Reading ComprehensionAnswer Span Correction in Machine Reading Comprehension
Answer Span Correction in Machine Reading Comprehension
 
MultiSeg
MultiSegMultiSeg
MultiSeg
 
Transfer Learning for Low Resource Languages and Domains
Transfer Learning for Low Resource Languages and DomainsTransfer Learning for Low Resource Languages and Domains
Transfer Learning for Low Resource Languages and Domains
 
Detecting Urgency Status of Crisis Tweets: A Transfer Learning Approach for L...
Detecting Urgency Status of Crisis Tweets: A Transfer Learning Approach for L...Detecting Urgency Status of Crisis Tweets: A Transfer Learning Approach for L...
Detecting Urgency Status of Crisis Tweets: A Transfer Learning Approach for L...
 
Effective Classification of Clinical Reports: Natural Language Processing-Bas...
Effective Classification of Clinical Reports: Natural Language Processing-Bas...Effective Classification of Clinical Reports: Natural Language Processing-Bas...
Effective Classification of Clinical Reports: Natural Language Processing-Bas...
 
Predictive Linguistic Features of Schizophrenia
Predictive Linguistic Features of SchizophreniaPredictive Linguistic Features of Schizophrenia
Predictive Linguistic Features of Schizophrenia
 

Recently uploaded

Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998YohFuh
 
定制英国白金汉大学毕业证(UCB毕业证书) 成绩单原版一比一
定制英国白金汉大学毕业证(UCB毕业证书)																			成绩单原版一比一定制英国白金汉大学毕业证(UCB毕业证书)																			成绩单原版一比一
定制英国白金汉大学毕业证(UCB毕业证书) 成绩单原版一比一ffjhghh
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% SecurePooja Nehwal
 
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Serviceranjana rawat
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationshipsccctableauusergroup
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionfulawalesam
 
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Delhi Call girls
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFxolyaivanovalion
 
Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxolyaivanovalion
 
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiLow Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiSuhani Kapoor
 
Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxolyaivanovalion
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfLars Albertsson
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxolyaivanovalion
 
Ukraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSUkraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSAishani27
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxolyaivanovalion
 
Unveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystUnveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystSamantha Rae Coolbeth
 

Recently uploaded (20)

Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998
 
定制英国白金汉大学毕业证(UCB毕业证书) 成绩单原版一比一
定制英国白金汉大学毕业证(UCB毕业证书)																			成绩单原版一比一定制英国白金汉大学毕业证(UCB毕业证书)																			成绩单原版一比一
定制英国白金汉大学毕业证(UCB毕业证书) 成绩单原版一比一
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
 
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interaction
 
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFx
 
Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptx
 
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiLow Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
 
Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptx
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdf
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptx
 
Ukraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSUkraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICS
 
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in  KishangarhDelhi 99530 vip 56974 Genuine Escort Service Call Girls in  Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
 
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptx
 
Unveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystUnveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data Analyst
 
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
 

Object Detection using HoG Features for Visual Situation Recognition

  • 1. Object Detection using HoG Features for Visual Situation Recognition Efsun Sarioglu Kayi, Melanie Mitchell Portland State University February 5, 2018 Abstract This project investigates a novel approach to building computer systems that can recognize visual situations. While much effort in computer vision has focused on identifying isolated objects in images, what people actually do is recognize coherent situations — collections of objects and their interrelations that, taken together, correspond to a known concept, such as "dog-walking", or "a fight breaking out", or "a blind person crossing the street". Situation recognition by humans may appear on the surface to be effortless, but it relies on a complex dynamic interplay among human abilities to perceive objects, systems of relationships among objects, and analogies with stored knowledge and memories. Enabling computers to flexibly recognize visual situations would create a flood of important applications in fields as diverse as autonomous vehicles, medical diagnosis, interpretation of scientific imagery, enhanced human- computer interaction, and personal information organization. 1 Introduction Two previously studied approaches, brain-inspired neural networks for lower-level vision and cognitive- level models of concepts and analogy-making can be integrated for recognizing situations. System architecture is depicted in Figure 1. For Dog-Walking situation, there are three objects: Person, Dog and Leash. The relationships between them can be analyzed through Concept Network. A concept network for Dog-Walking situation is given in Figure 2. In this project, we focus on lower- level vision for efficient and effective object detection which would feed into cognitive-models for visual situation recognition. 2 Dataset Dog-walking dataset consists of photographs, each of which is an instance of the visual concept (or situation) "Dog-Walking”. The dog-walkers, dogs, and leashes labeled with ground truth bounding boxes. The number of files for each class is listed in Table 3. 2.1 Preprocessing Label file of each image has the location of objects in that image. The negative crops are randomly selected from the remaining parts of the image with similar size of the positive crop. Cropped images are resized to a fixed width and height to make them comparable. For each class, 10%, 20%, 30%, 40%, 50% percentiles of width and height are used as resize options. 3 Experiments Histograms of Oriented Gradients (HOG) [DT05] are edge orientation histograms computed on a dense grid of uniformly spaced cells. They are computed using MATLAB’s extractHOGFeatures [ext] function on overlapping local contrast normalizations for improved performance. Figure 4 shows an image and its corresponding HOG features. Tables 5,6,7 show number of HOG features for different resize options for Dog, Leash and Person classes respectively. Tables 5, 6, and 7 1
  • 2. Figure 1: Situate Architecture Figure 2: Concept Network Figure 3: Dog-Walking dataset 2
  • 3. Figure 4: HOG Features [ext] Figure 5: HOG features for Dog class lists resize options and the corresponding feature numbers for Dog, Leash and Person classes respectively. Library for Support Vector Machines (LIBSVM) MATLAB extension is used [CL11]. Using the images resized to their average width and height, SVM is trained using 50% of the data and tested on the remaining 50%. The ROC graphs using decision values are displayed in Figures 8, 9 and 10 corresponding to Dog, Leash and Person classes. Corresponding AUC values are listed in Table 11. 3.1 SVM Feature Selection Once a classifier is built using SVM, its features can be selected based on their weights w (Recursive Feature Elimination (RFE) [GWBV02]. Since number of features is high, instead of removing features one by one in a recursive fashion, its top features were selected based on w2 . A new classifier was trained using the selected features and ROC and AUC results were computed for the test dataset. Using 50% training and 50% testing data and resized images of size 10%. Corresponding AUC values are displayed in Figures 12, 13 and 14 and listed in Table 15. ROC plots with feature selections are shown in Figures 16, 17, and 18. 3
  • 4. Figure 6: HOG features for Leash class Figure 7: HOG features for Person class Figure 8: ROC plot for Dog class 4
  • 5. Figure 9: ROC plot for Leash class Figure 10: ROC plot for Person class Figure 11: AUC values 5
  • 6. Figure 12: AUC values for Dog class with different number of features Figure 13: AUC values for Leash class with different number of features 6
  • 7. Figure 14: AUC values for Person class with different number of features Figure 15: AUC values with feature selection Figure 16: ROC graph for Dog class using top 20% features 7
  • 8. Figure 17: ROC graph for Leash class using top 20% features Figure 18: ROC graph for Person class using top 20% features 8
  • 9. Figure 19: AUC values using top HOG features with different cell sizes for Dog Class Figure 20: AUC values using top HOG features with different cell sizes for Leash Class 3.2 Effect of HOG Cell Size HOG features are computed on given cell sizes in pixels. The default size is 8×8. We also explored 2×2 and 4×4 cell sizes to capture small-scale details. In addition, we evaluated combination of them as below: • 2×2 + 4×4 • 2×2 + 8×8 • 4×4 + 8×8 • 2×2 + 4×4 + 8×8 Smaller cell sizes also correspond to bigger feature sizes. Accordingly, we used the same SVM feature selection mechanism explained in Section 3.1. Using 50% training and 50% testing data with images resized to 10% of their class sizes, SVM was trained with HOG features with different cell sizes. Figures 19, 20, and 21 show the AUC values for Dog, Leash and Person classes for different cell sizes and top selected features. Tables 22, 23, and 24 lists the AUC values for Dog, Leash and Person classes using different cell sizes and top selected features. 9
  • 10. Figure 21: AUC values using top HOG features with different cell sizes for Person Class Figure 22: AUC values for Dog class for different cell sizes and top selected features Figure 23: AUC values for Leash class for different cell sizes and top selected features Figure 24: AUC values for Person class for different cell sizes and top selected features 10
  • 11. 4 Conclusion We have shown effective and efficient object detection using HOG features and SVM for Dog- Walking situation. Using SVM feature selection techniques, top 10 to 20% of top features performed as well as using the entire dataset. However, using different cell sizes for HOG did not make much difference in classification performance. References [CL11] Chih-Chung Chang and Chih-Jen Lin. LIBSVM: A library for support vector ma- chines. ACM Transactions on Intelligent Systems and Technology, 2:27:1–27:27, 2011. Software available at http://www.csie.ntu.edu.tw/~cjlin/libsvm. [DT05] Navneet Dalal and Bill Triggs. Histograms of oriented gradients for human detection. In Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR’05) - Volume 1 - Volume 01, CVPR ’05, pages 886– 893, Washington, DC, USA, 2005. IEEE Computer Society. [ext] MATLAB extracthogfeatures. https://www.mathworks.com/help/vision/ref/extracthogfeatures.html. [GWBV02] Isabelle Guyon, Jason Weston, Stephen Barnhill, and Vladimir Vapnik. Gene selection for cancer classification using support vector machines. Mach. Learn., 46(1-3):389–422, March 2002. 11