Survey of Face Detection Approaches
Yurii Pashchenko
DataScience Lab, Odessa, 2017
Classification vs. Detection
http://tutorial.caffe.berkeleyvision.org/caffe-cvpr15-detection.pdf 2
Evaluation
3
Evaluation metric. Receiver Operating
Characteristic (ROC)
4
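As a rough illustration of how a detection ROC curve is traced, the sketch below sorts detections by confidence and accumulates true and false positives as the score threshold is lowered. The function name `roc_points` and the toy inputs are hypothetical, and it assumes each detection has already been marked as matched or unmatched to a ground-truth face.

```python
def roc_points(scores, is_true_positive, num_ground_truth):
    """Return (false_positives, true_positive_rate) pairs, one per detection.

    Detections are visited from highest to lowest score, which is the same
    as sweeping the decision threshold downwards.
    """
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    tp = 0
    fp = 0
    points = []
    for i in order:
        if is_true_positive[i]:
            tp += 1
        else:
            fp += 1
        points.append((fp, tp / num_ground_truth))
    return points


# Toy example (made-up numbers): 4 detections, 4 annotated faces.
curve = roc_points([0.9, 0.8, 0.7, 0.6], [True, False, True, True], 4)
```

Plotting the second element against the first gives the usual "true positive rate vs. number of false positives" curve.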
Benchmarks
● FDDB
● AFW
● PascalFace
● IJB-A
● MALF
● WIDER Face
5
FDDB: A Benchmark for Face Detection in
Unconstrained Settings
● 2 845 images with a total of 5 171 faces;
● a wide range of difficulties:
○ occlusions
○ different poses
○ low resolution
○ out-of-focus faces
● the specification of face regions as
elliptical regions
● both grayscale and color images.
http://vis-www.cs.umass.edu/fddb/ 6
FDDB. Annotation
http://vis-www.cs.umass.edu/fddb/fddb.pdf 7
FDDB. Evaluation
8
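Matching a detection to an annotation is usually based on the overlap ratio (intersection over union). The sketch below is a simplification: FDDB annotates ellipses, while this hypothetical `iou` helper works on axis-aligned boxes given as `(x1, y1, x2, y2)`. The 0.5 matching threshold is stated here as an assumption about the discrete score.

```python
def iou(box_a, box_b):
    """Intersection over union of two (x1, y1, x2, y2) boxes."""
    ix1 = max(box_a[0], box_b[0])
    iy1 = max(box_a[1], box_b[1])
    ix2 = min(box_a[2], box_b[2])
    iy2 = min(box_a[3], box_b[3])
    # Width and height of the overlap are clamped at zero for disjoint boxes.
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def is_match(detection, annotation, threshold=0.5):
    """Assumed discrete criterion: a detection counts if IoU exceeds the threshold."""
    return iou(detection, annotation) > threshold
```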
IARPA Janus Benchmark A (IJB-A)
• 5 712 images and 2 085 videos,
with an average of 11.4 images
and 4.2 videos per subject
• full pose variation
• joint use for face recognition and
face detection benchmarking
• a mix of images and videos
• wider geographic variation of
subjects
• landmark locations
Brendan F Klare, Emma Taborsky, Austin Blanton, Jordan Cheney, Kristen Allen, Patrick Grother, Alan Mah, Mark Burge,
and Anil K Jain. 2015. Pushing the frontiers of unconstrained face detection and recognition: IARPA Janus Benchmark A. In
Proc. IEEE Conf. Comput. Vis. Pattern Recognit. 1931–1939 9
IJB-A. Evaluation
* False Accept and Detection Rate are computed per image
10
WIDER FACE: A Face Detection Benchmark
• It consists of 32 203 images with 393 703
labeled faces, which is 10 times larger
than the current largest face detection
dataset
• The faces vary largely in appearance,
pose, and scale
• Multiple annotated attributes: occlusion,
pose, and event categories, which allow
in-depth analysis of existing algorithms.
http://mmlab.ie.cuhk.edu.hk/projects/WIDERFace/ 11
WIDER FACE. Annotations
https://arxiv.org/pdf/1511.06523.pdf 12
WIDER FACE. Evaluation results
13
Comparison of Face Detection Datasets
https://arxiv.org/pdf/1511.06523.pdf 14
Viola-Jones Object Detector
• Very popular for Human Face Detection
• May be trained for Cat and Dog Face detection
• Freely available in the OpenCV library (http://opencv.org)
O. Parkhi, A. Vedaldi, C. V. Jawahar, and A. Zisserman. The Truth about Cats and Dogs // Proceedings of the International Conference on Computer Vision (ICCV), 2011.
J. Liu, A. Kanazawa, D. Jacobs, P. Belhumeur. Dog Breed Classification Using Part Localization // Lecture Notes in Computer Science, Volume 7572, 2012, pp. 172-185.
Main Principles
● Scanning window
● Features
● Integral image
● Boosted feature selection
● Cascaded classifier
P.A. Viola, M.J. Jones, Rapid object detection using a boosted cascade of simple features, in: CVPR, issue 1,
2001, pp. 511–518.
16
Scanning window
17
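A scanning (sliding) window detector evaluates a fixed-size classifier at every position and at several scales. The generator below is a minimal sketch of that enumeration. The 24-pixel base size follows the 24 x 24 window mentioned in this deck, while the scale step of 1.25 and the half-window stride are assumptions chosen only for illustration.

```python
def sliding_windows(image_w, image_h, base=24, scale_step=1.25, stride_frac=0.5):
    """Yield (x, y, size) square windows over all positions and scales."""
    size = float(base)
    while size <= min(image_w, image_h):
        s = int(size)
        # The stride grows with the window so larger scales are sampled more coarsely.
        stride = max(1, int(s * stride_frac))
        for y in range(0, image_h - s + 1, stride):
            for x in range(0, image_w - s + 1, stride):
                yield (x, y, s)
        size *= scale_step


windows = list(sliding_windows(48, 48))
```

Even for this tiny 48 x 48 image several scales are visited, which is why the per-window classifier has to be very cheap.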
Integral Image
18
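The integral image lets the sum of any rectangle be read with four lookups, which is what makes rectangle (Haar-like) features cheap. The following is a small pure-Python sketch; the function names are hypothetical, and the two-rectangle feature shown is just one example of the feature types.

```python
def integral_image(img):
    """Build a padded integral image.

    ii[y][x] holds the sum of img over all rows < y and columns < x,
    so the first row and column are zeros.
    """
    h, w = len(img), len(img[0])
    ii = [[0] * (w + 1) for _ in range(h + 1)]
    for y in range(h):
        row_sum = 0
        for x in range(w):
            row_sum += img[y][x]
            ii[y + 1][x + 1] = ii[y][x + 1] + row_sum
    return ii


def rect_sum(ii, x, y, w, h):
    """Sum of the w x h rectangle with top-left corner (x, y), using four lookups."""
    return ii[y + h][x + w] - ii[y][x + w] - ii[y + h][x] + ii[y][x]


def haar_two_rect_horizontal(ii, x, y, w, h):
    """Two-rectangle feature: left half minus right half (w is assumed even)."""
    half = w // 2
    return rect_sum(ii, x, y, half, h) - rect_sum(ii, x + half, y, half, h)
```

Once the integral image is built, the cost of a feature no longer depends on the rectangle size.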
Features
⚫Available features:
⚫ HAAR
⚫ LBP
⚫ HOG
⚫Too many features!
⚫ location, scale, type
⚫ 180,000+ possible features
associated with each 24 x 24
window
⚫Not all of them are useful!
19
Feature selection
⚫Idea: Combining several weak classifiers to generate a strong
classifier
$$\alpha_1 h_1 + \alpha_2 h_2 + \alpha_3 h_3 + \dots + \alpha_T h_T \gtrless T_{\mathrm{threshold}}$$
⚫Each h_t is a weak classifier (feature, threshold) with output h_t = 1 or 0
20
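The weighted vote of weak classifiers can be sketched as below. This is an illustration rather than the trained detector: the weights and outputs are made up, the polarity parameter and the default threshold of half the total weight are assumptions borrowed from the usual AdaBoost formulation.

```python
def weak_classify(feature_value, threshold, polarity=1):
    """Decision stump on one feature: returns 1 or 0.

    Assumed convention: output 1 when polarity * value < polarity * threshold.
    """
    return 1 if polarity * feature_value < polarity * threshold else 0


def strong_classify(weak_outputs, alphas, threshold=None):
    """Weighted vote: 1 if sum(alpha_t * h_t) reaches the threshold, else 0."""
    if threshold is None:
        # Assumed default: half of the total weight.
        threshold = 0.5 * sum(alphas)
    score = sum(a * h for a, h in zip(alphas, weak_outputs))
    return 1 if score >= threshold else 0
```

Lowering the threshold of a stage raises its detection rate at the price of more false positives, which is the knob used when stages are chained into a cascade.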
Cascaded Classifier
● A 1-feature classifier achieves a 100% detection rate and about a 50% false
positive rate.
● A 5-feature classifier achieves a 100% detection rate and a 40% false
positive rate (20% cumulative), using data from the previous stage.
● A 20-feature classifier achieves a 100% detection rate with a 10% false
positive rate (2% cumulative).
21
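The cumulative figures quoted for the cascade are products of the per-stage false positive rates (0.5, then 0.5 x 0.4 = 0.2, then 0.2 x 0.1 = 0.02). The sketch below reproduces that arithmetic and shows the early-rejection loop. The stage callables are hypothetical stand-ins for boosted stage classifiers.

```python
def cumulative_rates(stage_rates):
    """Running product of per-stage rates, one value per stage."""
    out = []
    cumulative = 1.0
    for rate in stage_rates:
        cumulative *= rate
        out.append(cumulative)
    return out


def cascade_accepts(window, stages):
    """Run stages in order; reject as soon as any stage says no.

    `stages` is a list of callables returning True (pass on) or False (reject).
    """
    for stage in stages:
        if not stage(window):
            return False
    return True


# Per-stage false positive rates from the three example stages.
false_positive_rates = cumulative_rates([0.5, 0.4, 0.1])
```

Because most windows are rejected by the first cheap stages, the expensive later stages run on only a small fraction of the image.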
Viola Jones Pipeline
https://habrahabr.ru/post/133826/ 22
Viola Jones. Evaluation Results on FDDB
23
A Convolutional Neural Network
Cascade for Face Detection
● 12-net
● 12-calibration-net
● 24-net
● 24-calibration-net
● 48-net
● 48-calibration-net
http://www.cv-foundation.org/openaccess/content_cvpr_2015/papers/Li_A_Convolutional_Neural_2015_CVPR_paper.pdf 24
Cascade CNN. Calibration Net
The calibration pattern adjusts the window to be
N = 45 patterns, formed by all combinations of
http://www.cv-foundation.org/openaccess/content_cvpr_2015/papers/Li_A_Convolutional_Neural_2015_CVPR_paper.pdf 25
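The calibration step can be sketched as follows. Only the count of 45 patterns comes from the slide. The concrete scale and offset sets below, and the window-adjustment formula, are assumptions recalled from the cited paper and chosen so that 5 x 3 x 3 = 45 combinations result; they should be checked against the paper before use.

```python
from itertools import product

# Assumed pattern sets (not given on the slide): 5 scales x 3 x-offsets x 3 y-offsets.
SCALES = (0.83, 0.91, 1.0, 1.10, 1.21)
X_OFFSETS = (-0.17, 0.0, 0.17)
Y_OFFSETS = (-0.17, 0.0, 0.17)

PATTERNS = list(product(SCALES, X_OFFSETS, Y_OFFSETS))


def calibrate(window, pattern):
    """Adjust an (x, y, w, h) window with one (scale, x_offset, y_offset) pattern.

    Assumed formula: shift by a fraction of the rescaled size, then rescale.
    """
    x, y, w, h = window
    s, dx, dy = pattern
    return (x - dx * w / s, y - dy * h / s, w / s, h / s)
```

In this reading, the calibration net is a 45-way classifier that picks (or averages) the patterns best aligning the window with the face.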
Cascade CNN. Evaluation Results on FDDB
~14 fps on CPU, ~100 fps on GPU
http://www.cv-foundation.org/openaccess/content_cvpr_2015/papers/Li_A_Convolutional_Neural_2015_CVPR_paper.pdf 26
Joint Face Detection and Alignment using Multi-task
Cascaded Convolutional Networks (MTCNN)
• Improves on the previous approach (Cascade CNN)
• Joint face detection and alignment
• Online Hard sample mining
• Multi-source training
https://arxiv.org/pdf/1604.02878.pdf
27
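Online hard sample mining can be illustrated with a few lines: within each mini-batch, only the samples with the largest loss contribute to the update. This is a sketch under assumptions; the function names are hypothetical and the default keep ratio of 0.7 is an assumed value, not one stated on the slide.

```python
def hard_sample_indices(losses, keep_ratio=0.7):
    """Indices of the highest-loss samples in a mini-batch, in ascending order."""
    keep = max(1, int(len(losses) * keep_ratio))
    order = sorted(range(len(losses)), key=lambda i: losses[i], reverse=True)
    return sorted(order[:keep])


def mined_loss(losses, keep_ratio=0.7):
    """Mean loss over the hard samples only; easy samples are ignored."""
    idx = hard_sample_indices(losses, keep_ratio)
    return sum(losses[i] for i in idx) / len(idx)
```

Because the selection happens per batch during training, no separate offline mining pass over the dataset is needed.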
MTCNN. Evaluation on FDDB and WIDER
https://arxiv.org/pdf/1604.02878.pdf
28
Faster R-CNN
29
Region proposal network
30
Bootstrapping Face Detection with
Hard Negative Examples
• ResNet-50
• Foreground RoI: IoU threshold >= 0.5
• Background RoI: IoU in the interval [0.1, 0.5)
• Balancing bg:fg RoIs at 3:1
• Hard Negative mining
https://arxiv.org/pdf/1608.02236.pdf 31
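The RoI labelling and balancing rules above can be sketched as below, assuming the thresholds refer to each RoI's maximum IoU with any ground-truth face. The mini-batch size of 128 and the helper names are assumptions for illustration only.

```python
import random


def label_roi(max_iou, fg_thresh=0.5, bg_low=0.1):
    """Label an RoI by its best IoU with any ground-truth face."""
    if max_iou >= fg_thresh:
        return "fg"
    if bg_low <= max_iou < fg_thresh:
        return "bg"
    return "ignore"


def sample_rois(max_ious, batch_size=128, bg_per_fg=3, rng=None):
    """Pick RoI indices so that background outnumbers foreground about 3:1.

    The target ratio is met when enough foreground RoIs exist; otherwise
    the remaining slots are filled with background RoIs.
    """
    rng = rng or random.Random(0)
    fg = [i for i, v in enumerate(max_ious) if label_roi(v) == "fg"]
    bg = [i for i, v in enumerate(max_ious) if label_roi(v) == "bg"]
    num_fg = min(len(fg), batch_size // (bg_per_fg + 1))
    num_bg = min(len(bg), batch_size - num_fg)
    return rng.sample(fg, num_fg), rng.sample(bg, num_bg)
```

Hard negative mining would then re-feed the background RoIs that the current model scores highly, rather than sampling them uniformly as done here.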
Face Detection using Deep Learning: An
Improved Faster RCNN Approach (DeepIR)
• VGG16 architecture
• Hard negative mining
• Feature concatenation
• Multi-scale training
https://arxiv.org/pdf/1701.08289.pdf
32
DeepIR. Evaluation on FDDB
https://arxiv.org/pdf/1701.08289.pdf
33
Finding Tiny Faces (HR-ER)
https://arxiv.org/pdf/1612.04402.pdf 34
HR-ER. Approach
What about context?
https://arxiv.org/pdf/1612.04402.pdf 35
HR-ER. Evaluation on WIDER and FDDB
https://arxiv.org/pdf/1612.04402.pdf 36
THANK YOU FOR YOUR
ATTENTION!
e-mail: yurii.pashchenko@ring.com
skype: george.pashchenko
37

Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024
 

DataScience Lab 2017: Survey of Face Detection Methods for Images

  • 1. Survey of Face Detection Approaches Yurii Pashchenko DataScience Lab, Odessa, 2017
  • 4. Evaluation metric: Receiver Operating Characteristic (ROC)
  • 5. Benchmarks: FDDB, AFW, PascalFace, IJB-A, MALF, WIDER Face
  • 6. FDDB: A Benchmark for Face Detection in Unconstrained Settings
      ● 2,845 images with a total of 5,171 faces
      ● a wide range of difficulties: occlusions, different poses, low resolution, out-of-focus faces
      ● face regions specified as elliptical regions
      ● both grayscale and color images
      http://vis-www.cs.umass.edu/fddb/
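Because FDDB marks faces as ellipses while most detectors output rectangles, a conversion step is usually needed before comparing the two. The sketch below computes the tightest axis-aligned box around a rotated ellipse. The parameter order (major radius, minor radius, angle, center x, center y) and the use of radians are assumptions about the annotation layout, and `ellipse_to_bbox` is a hypothetical helper name, not part of any FDDB tooling.

```python
import math


def ellipse_to_bbox(major_radius, minor_radius, angle, center_x, center_y):
    """Return (x_min, y_min, x_max, y_max) of the tightest axis-aligned
    box enclosing an ellipse whose major axis is rotated by `angle`
    radians from the x-axis (assumed convention)."""
    cos_a, sin_a = math.cos(angle), math.sin(angle)
    # Half-extents of a rotated ellipse along the x and y axes.
    half_w = math.hypot(major_radius * cos_a, minor_radius * sin_a)
    half_h = math.hypot(major_radius * sin_a, minor_radius * cos_a)
    return (center_x - half_w, center_y - half_h,
            center_x + half_w, center_y + half_h)


if __name__ == "__main__":
    # An upright face ellipse: the major axis is vertical (angle = pi / 2).
    print(ellipse_to_bbox(60.0, 40.0, math.pi / 2, 100.0, 120.0))
```

With an angle of zero the box is simply `2 * major_radius` wide and `2 * minor_radius` tall; rotating by a quarter turn swaps the two extents.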
  • 9. IARPA Janus Benchmark A (IJB-A)
      ● 5,712 images and 2,085 videos, with an average of 11.4 images and 4.2 videos per subject
      ● full pose variation
      ● joint use for face recognition and face detection benchmarking
      ● a mix of images and videos
      ● wider geographic variation of subjects
      ● landmark locations
      Brendan F. Klare, Emma Taborsky, Austin Blanton, Jordan Cheney, Kristen Allen, Patrick Grother, Alan Mah, Mark Burge, and Anil K. Jain. 2015. Pushing the frontiers of unconstrained face detection and recognition: IARPA Janus Benchmark A. In Proc. IEEE Conf. Comput. Vis. Pattern Recognit., 1931–1939.
  • 10. IJB-A. Evaluation (False Accept and Detection Rate are computed per image)
  • 11. WIDER FACE: A Face Detection Benchmark
      ● 32,203 images with 393,703 labeled faces, 10 times larger than the largest face detection dataset at the time
      ● faces vary widely in appearance, pose, and scale
      ● multiple annotated attributes (occlusion, pose, and event categories) allow in-depth analysis of existing algorithms
      http://mmlab.ie.cuhk.edu.hk/projects/WIDERFace/
  • 14. Comparison of Face Detection Datasets https://arxiv.org/pdf/1511.06523.pdf
  • 15. Viola-Jones Object Detector
      ● very popular for human face detection
      ● can also be trained for cat and dog face detection
      ● freely available in the OpenCV library (http://opencv.org)
      O. Parkhi, A. Vedaldi, C. V. Jawahar, and A. Zisserman. The Truth about Cats and Dogs. Proceedings of the International Conference on Computer Vision (ICCV), 2011.
      J. Liu, A. Kanazawa, D. Jacobs, P. Belhumeur. Dog Breed Classification Using Part Localization. Lecture Notes in Computer Science, vol. 7572, 2012, pp. 172–185.
  • 16. Main Principles
      ● scanning window
      ● features
      ● integral image
      ● boosted feature selection
      ● cascaded classifier
      P. A. Viola, M. J. Jones. Rapid object detection using a boosted cascade of simple features. In: CVPR, issue 1, 2001, pp. 511–518.
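The integral image is what makes rectangle features cheap: once it is built, the sum over any rectangle takes four lookups regardless of the rectangle's size. The following is a minimal pure-Python sketch of that idea, not the OpenCV implementation; the function names and the simple left-minus-right two-rectangle feature are illustrative assumptions.

```python
def integral_image(img):
    """Build a (h + 1) x (w + 1) table where ii[y][x] is the sum of all
    pixels above and to the left of (x, y), exclusive."""
    h, w = len(img), len(img[0])
    ii = [[0] * (w + 1) for _ in range(h + 1)]
    for y in range(h):
        row_sum = 0
        for x in range(w):
            row_sum += img[y][x]
            ii[y + 1][x + 1] = ii[y][x + 1] + row_sum
    return ii


def rect_sum(ii, x, y, w, h):
    """Sum of the w x h rectangle with top-left corner (x, y): four lookups."""
    return ii[y + h][x + w] - ii[y][x + w] - ii[y + h][x] + ii[y][x]


def two_rect_feature(ii, x, y, w, h):
    """Haar-like feature: left half minus right half (w must be even)."""
    half = w // 2
    return rect_sum(ii, x, y, half, h) - rect_sum(ii, x + half, y, half, h)


if __name__ == "__main__":
    image = [[1, 2, 3], [4, 5, 6], [7, 8, 9]]
    table = integral_image(image)
    print(rect_sum(table, 1, 1, 2, 2), two_rect_feature(table, 0, 0, 2, 3))
```

The extra row and column of zeros avoids special-casing rectangles that touch the top or left border.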
  • 19. Features
      ● Available features: HAAR, LBP, HOG
      ● Too many features: they vary by location, scale, and type, giving 180,000+ possible features for each 24 x 24 window
      ● Not all of them are useful!
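To see where a number of this size comes from, one can enumerate every position and scale of each Haar-like base shape inside a 24 x 24 window. The sketch below assumes five common base shapes (two-rectangle horizontal and vertical, three-rectangle horizontal and vertical, and four-rectangle); the exact total depends on which shapes are counted, so it is the order of magnitude that matters here rather than an exact match with the slide's figure.

```python
def count_haar_features(window=24,
                        base_shapes=((2, 1), (1, 2), (3, 1), (1, 3), (2, 2))):
    """Count all placements of each base shape, scaled by integer
    multiples in width and height, that fit inside the window."""
    total = 0
    for base_w, base_h in base_shapes:
        for w in range(base_w, window + 1, base_w):
            for h in range(base_h, window + 1, base_h):
                # Number of top-left positions where a w x h feature fits.
                total += (window - w + 1) * (window - h + 1)
    return total


if __name__ == "__main__":
    print(count_haar_features())  # 162336 for the five assumed shapes
```

A 24 x 24 window has only 576 pixels, so the feature set is heavily overcomplete, which is why a selection step is needed.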
  • 20. Feature selection
      ● Idea: combine several weak classifiers to generate a strong classifier
      ● Strong classifier: $\alpha_1 h_1 + \alpha_2 h_2 + \alpha_3 h_3 + \dots + \alpha_T h_T \gtrless T_{\text{threshold}}$
      ● Each weak classifier $h_t$ is a (feature, threshold) pair whose output is 1 or 0
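The weighted vote can be sketched in a few lines. This is an illustration of the decision rule only, not of the boosting procedure that learns the weights. The polarity term and the default threshold of half the total weight follow the usual AdaBoost formulation and are assumptions here, as are the function names.

```python
def weak_classify(feature_value, threshold, polarity=1):
    """A decision stump on one feature: returns 1 or 0."""
    return 1 if polarity * feature_value < polarity * threshold else 0


def strong_classify(feature_values, weak_classifiers, strong_threshold=None):
    """weak_classifiers is a list of (alpha, threshold, polarity) tuples,
    one per feature value. Returns 1 (face) or 0 (non-face)."""
    score = sum(
        alpha * weak_classify(value, threshold, polarity)
        for value, (alpha, threshold, polarity)
        in zip(feature_values, weak_classifiers)
    )
    if strong_threshold is None:
        # Assumed default: half of the total vote weight.
        strong_threshold = 0.5 * sum(alpha for alpha, _, _ in weak_classifiers)
    return 1 if score >= strong_threshold else 0


if __name__ == "__main__":
    stumps = [(1.0, 10, 1), (0.5, 5, 1), (2.0, 0, -1)]
    print(strong_classify([3, 7, 4], stumps))
```

Lowering `strong_threshold` trades more false positives for a higher detection rate, which is how the stages of a cascade are tuned.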
  • 21. Cascaded Classifier
      ● A 1-feature classifier achieves a 100% detection rate and about a 50% false positive rate.
      ● A 5-feature classifier achieves a 100% detection rate and a 40% false positive rate (20% cumulative), using data from the previous stage.
      ● A 20-feature classifier achieves a 100% detection rate with a 10% false positive rate (2% cumulative).
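The cumulative figures follow from multiplying the per-stage rates, since a window must pass every earlier stage to reach the next one. A small sketch of that arithmetic, assuming the stage rates are independent (the function name is illustrative):

```python
def cumulative_rates(stage_rates):
    """Running product of per-stage rates through a cascade."""
    cumulative, running = [], 1.0
    for rate in stage_rates:
        running *= rate
        cumulative.append(running)
    return cumulative


if __name__ == "__main__":
    # Per-stage false positive rates from the slide: 50%, 40%, 10%.
    for stage, rate in enumerate(cumulative_rates([0.5, 0.4, 0.1]), start=1):
        print(f"after stage {stage}: {rate:.0%} of non-faces remain")
```

The same product applies to the detection rate, which is why each stage has to keep its own detection rate very close to 100%.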
  • 23. Viola-Jones. Evaluation Results on FDDB
  • 24. A Convolutional Neural Network Cascade for Face Detection
      ● Networks: 12-net, 12-calibration-net, 24-net, 24-calibration-net, 48-net, 48-calibration-net
      http://www.cv-foundation.org/openaccess/content_cvpr_2015/papers/Li_A_Convolutional_Neural_2015_CVPR_paper.pdf
  • 25. Cascade CNN. Calibration Net: the calibration net adjusts the detection window using one of N = 45 calibration patterns, formed by all combinations of [value sets not preserved in the transcript] http://www.cv-foundation.org/openaccess/content_cvpr_2015/papers/Li_A_Convolutional_Neural_2015_CVPR_paper.pdf
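The count of 45 is consistent with combining five scale changes with three horizontal and three vertical offsets (5 x 3 x 3). The sketch below enumerates such a pattern set and applies one pattern to a window. The specific values and the adjustment formula are recalled from the cited paper rather than taken from the slide, so treat them as assumptions and check them against the paper before relying on them.

```python
from itertools import product

# Assumed value sets (not present in the slide transcript).
SCALES = (0.83, 0.91, 1.0, 1.10, 1.21)
X_OFFSETS = (-0.17, 0.0, 0.17)
Y_OFFSETS = (-0.17, 0.0, 0.17)

# Every (scale, x offset, y offset) combination is one calibration pattern.
PATTERNS = list(product(SCALES, X_OFFSETS, Y_OFFSETS))


def calibrate(window, pattern):
    """Adjust an (x, y, w, h) window by one (s, dx, dy) pattern.
    Assumed formula: shift by a fraction of the size, then rescale."""
    x, y, w, h = window
    s, dx, dy = pattern
    return (x - dx * w / s, y - dy * h / s, w / s, h / s)


if __name__ == "__main__":
    print(len(PATTERNS))
    print(calibrate((10, 20, 100, 100), (1.21, 0.17, 0.0)))
```

Framing calibration as classification over a small fixed set of patterns keeps the calibration network tiny compared with regressing box coordinates directly.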
  • 26. Cascade CNN. Evaluation Results on FDDB: ~14 fps on CPU, ~100 fps on GPU http://www.cv-foundation.org/openaccess/content_cvpr_2015/papers/Li_A_Convolutional_Neural_2015_CVPR_paper.pdf
  • 27. Joint Face Detection and Alignment using Multi-task Cascaded Convolutional Networks (MTCNN)
      ● improves on the previous approach
      ● joint face detection and alignment
      ● online hard sample mining
      ● multi-source training
      https://arxiv.org/pdf/1604.02878.pdf
  • 28. MTCNN. Evaluation on FDDB and WIDER https://arxiv.org/pdf/1604.02878.pdf
  • 31. Bootstrapping Face Detection with Hard Negative Examples
      ● ResNet-50
      ● Foreground RoI threshold: >= 0.5
      ● Background RoI: in the interval [0.1, 0.5)
      ● Background-to-foreground RoI balance: 3:1
      ● Hard negative mining
      https://arxiv.org/pdf/1608.02236.pdf
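The thresholds above can be read as a labelling rule for region proposals during training. The sketch below assumes they refer to intersection over union (IoU) with the best-matching ground-truth box, as is usual in Faster R-CNN-style training, and that boxes are `(x1, y1, x2, y2)` tuples. The function names and the random background subsampling are illustrative, not the paper's code.

```python
import random


def iou(a, b):
    """Intersection over union of two (x1, y1, x2, y2) boxes."""
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    union = ((a[2] - a[0]) * (a[3] - a[1])
             + (b[2] - b[0]) * (b[3] - b[1]) - inter)
    return inter / union if union > 0 else 0.0


def label_roi(roi, gt_boxes, fg_thr=0.5, bg_lo=0.1):
    """'fg' if best IoU >= fg_thr, 'bg' if in [bg_lo, fg_thr), else 'ignore'."""
    best = max((iou(roi, gt) for gt in gt_boxes), default=0.0)
    if best >= fg_thr:
        return "fg"
    return "bg" if best >= bg_lo else "ignore"


def sample_rois(rois, gt_boxes, bg_per_fg=3, rng=None):
    """Keep all foreground RoIs and at most bg_per_fg backgrounds per foreground."""
    rng = rng or random.Random(0)
    fg = [r for r in rois if label_roi(r, gt_boxes) == "fg"]
    bg = [r for r in rois if label_roi(r, gt_boxes) == "bg"]
    rng.shuffle(bg)
    return fg, bg[: bg_per_fg * len(fg)]


if __name__ == "__main__":
    faces = [(0, 0, 10, 10)]
    proposals = [(0, 0, 10, 10), (5, 0, 15, 10), (100, 100, 110, 110)]
    print([label_roi(p, faces) for p in proposals])
```

Hard negative mining would replace the random shuffle with a ranking of background RoIs by how confidently the current model mistakes them for faces.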
  • 32. Face Detection using Deep Learning: An Improved Faster RCNN Approach (DeepIR)
      ● VGG16 architecture
      ● hard negative mining
      ● feature concatenation
      ● multi-scale training
      https://arxiv.org/pdf/1701.08289.pdf
  • 33. DeepIR. Evaluation on FDDB https://arxiv.org/pdf/1701.08289.pdf
  • 34. Finding Tiny Faces (HR-ER) https://arxiv.org/pdf/1612.04402.pdf
  • 35. HR-ER. Approach: what about context? https://arxiv.org/pdf/1612.04402.pdf
  • 36. HR-ER. Evaluation on WIDER and FDDB https://arxiv.org/pdf/1612.04402.pdf
  • 37. THANK YOU FOR YOUR ATTENTION! e-mail: yurii.pashchenko@ring.com skype: george.pashchenko