SlideShare a Scribd company logo
VIDEO RETRIEVAL OF SPECIFIC
PERSONS IN SPECIFIC LOCATIONS
AUTHOR: ADVISORS:
Xavier Giró-i-Nieto
Eva Mohedano Kevin McGuinness
Andrea Calafell
Noel E. O’Connor
1
1. Motivation
2. State of the art
3. Framework for TRECVID
4. Face detection
5. Face representation
6. Query expansion
7. Annotation Tool
8. Fusion and normalization strategies
9. Conclusions and future work
OUTLINE
2
MOTIVATION
SURVEILLANCE PERSONAL VIDEO ORGANIZATION
3
TRECVID INSTANCE SEARCH 2016
PEOPLE AND LOCATION QUERY SET
Person
visual
examples
Binary
masks
Location
visual
examples
TARGET
DATABASE
1.5M keyframes
244 video files
(300GB)
4
MOTIVATION: goals
● Obtain a baseline to participate in
TRECVID Instance Search 2016 (July, 1).
● Improve the results obtained in TRECVID
using the baseline.
5
1. Motivation
2. State of the art
3. Framework for TRECVID
4. Face detection
5. Face representation
6. Query expansion
7. Annotation Tool
8. Fusion and normalization strategies
9. Conclusions and future work
OUTLINE
6
STATE OF THE ART
Image of Eva Mohedano, D3L6 Image Retrieval, Deep Learning for Computer Vision (UPC 2016)
BASIC RETRIEVAL PIPELINE:
7
STATE OF THE ART
Image of Eva Mohedano, D3L6 Image Retrieval, Deep Learning for Computer Vision (UPC 2016)
BAG OF VISUAL WORDS:
8
STATE OF THE ART
Image: Alex Krizhevsky , Ilya Sutskever , Geoffrey E. Hinton, Imagenet classification with deep convolutional neural networks, 2012
Ali Sharif Razavian, Josephine Sullivan, Atsuto Maki, and Stefan Carlsson. A baseline for visual instance retrieval with deep convolutional networks. ICLR 2015.
CNN REPRESENTATION:
9
STATE OF THE ART
Eva Mohedano, Amaia Salvador, Kevin McGuinness, Ferran Marqués, Noel E. O’Connor, and Xavier Giró i Nieto. Bags of local convolutional features for scalable
instance search. ICMR 2016.
BAG OF LOCAL CONVOLUTIONAL FEATURES:
10
1. Motivation
2. State of the art
3. Framework for TRECVID
4. Face detection
5. Face representation
6. Query expansion
7. Annotation Tool
8. Fusion and normalization strategies
9. Conclusions and future work
OUTLINE
11
FRAMEWORK FOR TRECVID
12
FRAMEWORK FOR TRECVID
13
Mohedano, et al. ICMR 2016
1. Motivation
2. State of the art
3. Framework for TRECVID
4. Face detection
5. Face representation
6. Query expansion
7. Annotation Tool
8. Fusion and normalization strategies
9. Conclusions and future work
OUTLINE
14
FRAMEWORK FOR TRECVID
15
FACE DETECTION: ReInspect
Russell Stewart, Mykhaylo Andriluka, and Andrew Y. Ng. End-to-end people detection in crowded scenes. CVPR 2016.
16
FACE DETECTION: ReInspect
QUALITATIVE RESULTS OF REINSPECT:
Changing both the input size of the network and the image size
Changing only the image size
17
Bad detections
False negatives
FACE DETECTION: ReInspect
PROBLEM: Images used to train ReInspect
18
FACE DETECTION: Menpo
1
https://github.com/menpo/menpodetect
Python wrapper for face detectors1
:
● DLIB
● OPENCV
● Pixel Intensity Comparison-based
Object detection (PICO)
● FFLD2
:
○ Based on Deformable Part
Models (DPM)
○ Use LUV color space
2
M. Mathias, R. Benenson, M. Pedersoli, and L. Van Gool. Face detection without bells and whistles. ECCV, 2014.
Examples of FFLD results
19
QUALITATIVE RESULTS OF MENPO:
DLIB
FACE DETECTION: Menpo
OPENCV
PICO 20
False negatives
QUALITATIVE RESULTS OF MENPO:
FACE DETECTION: Menpo
FFLD
Still some false negatives Solution: Equalize image
21
1. Motivation
2. State of the art
3. Framework for TRECVID
4. Face detection
5. Face representation
6. Query expansion
7. Annotation Tool
8. Fusion and normalization strategies
9. Conclusions and future work
OUTLINE
22
DEEP FACE RECOGNITION
FACE REPRESENTATION
Image: Simonyan, Karen, and Andrew Zisserman. Very deep convolutional networks for large-scale image recognition. ICLR 2015.
O. M. Parkhi, A. Vedaldi, and A. Zisserman. Deep face recognition. BMVC 2015
VGG 16-layer
23
1. Motivation
2. State of the art
3. Framework for TRECVID
4. Face detection
5. Face representation
6. Query expansion
7. Annotation Tool
8. Fusion and normalization strategies
9. Conclusions and future work
OUTLINE
24
QUERY EXPANSION
Sequence of keyframes of one shot
dilate
Mask creation pipeline
25
TEMPORAL QUERY EXPANSION:
Results of temporal query expansion
26
QUERY EXPANSION
TEMPORAL QUERY EXPANSION:
27
QUERY EXPANSION
PSEUDO-RELEVANCE FEEDBACK QUERY EXPANSION:
Top 20 retrieved keyframes
1. Motivation
2. State of the art
3. Framework for TRECVID
4. Face detection
5. Face representation
6. Query expansion
7. Annotation Tool
8. Fusion and normalization strategies
9. Conclusions and future work
OUTLINE
28
ANNOTATION TOOL
3.991 shots
for persons
1.528 shots
for locations
794 shots
in common
29
1. Motivation
2. State of the art
3. Framework for TRECVID
4. Face detection
5. Face representation
6. Query expansion
7. Annotation Tool
8. Fusion and normalization strategies
9. Conclusions and future work
OUTLINE
30
FRAMEWORK FOR TRECVID
31
FUSION AND NORMALIZATION STRATEGIES
NORMALIZATION:
● Z-score:
● Max-min:
● Extreme Value Theory:
FUSION:
Linear combination, maximum, minimum.
32
FUSION AND NORMALIZATION STRATEGIES
RESULTS OF APPLYING DIFFERENT NORMALIZATIONS:
BASELINE 33
FUSION AND NORMALIZATION STRATEGIES
Brad Person distribution Laundrette Location distribution
34
EXAMPLE DISTRIBUTION:
FUSION AND NORMALIZATION STRATEGIES
RESULTS OF APPLYING MAXIMUM OR MINIMUM FUSION
35
FUSION AND NORMALIZATION STRATEGIES
RESULTS OF WEIGHTING LINEAR COMBINATION
HIGHER THAN THE BASELINE
36
1. Motivation
2. State of the art
3. Framework for TRECVID
4. Face detection
5. Face representation
6. Query expansion
7. Annotation Tool
8. Fusion and normalization strategies
9. Conclusions and future work
OUTLINE
37
CONCLUSIONS
● FFLD, a simple approach using vanilla DPM combined with image equalization is the best option for TRECVID
dataset
38
CONCLUSIONS
● The temporal query expansion proposed works well, but the faces are very similar between them
However, using the top 20 faces in the ranking as new queries gives more diverse faces.
39
CONCLUSIONS
● An annotation tool is needed in order to obtain quantitative results.
3.991 shots
for persons
1.528 shots
for locations
794 shots
in common
TOTAL OF RELEVANT ANNOTATED SHOTS
● The best configuration is without applying normalization and combining the scores by weighting higher the
location ranking
40
FUTURE WORK
● Analyze deeper the location part
● Try to improve the location rankings
41
QUESTIONS?
42
FUSION AND NORMALIZATION STRATEGIES
RESULTS OF THE PARTS SEPARATELY OVER 50 KEYFRAMES
43
FUSION AND NORMALIZATION STRATEGIES
RESULTS OF APPLYING MAXIMUM, MINIMUM AND PRODUCT FUSION
44

More Related Content

What's hot

Brain tumour segmentation based on local independent projection based classif...
Brain tumour segmentation based on local independent projection based classif...Brain tumour segmentation based on local independent projection based classif...
Brain tumour segmentation based on local independent projection based classif...
eSAT Journals
 
IRJET- Brain Tumor Detection using Image Processing and MATLAB Application
IRJET-  	  Brain Tumor Detection using Image Processing and MATLAB ApplicationIRJET-  	  Brain Tumor Detection using Image Processing and MATLAB Application
IRJET- Brain Tumor Detection using Image Processing and MATLAB Application
IRJET Journal
 
Brain Tumor Segmentation in MRI Images
Brain Tumor Segmentation in MRI ImagesBrain Tumor Segmentation in MRI Images
Brain Tumor Segmentation in MRI Images
IJRAT
 
Brain Tumor Detection and Classification using Adaptive Boosting
Brain Tumor Detection and Classification using Adaptive BoostingBrain Tumor Detection and Classification using Adaptive Boosting
Brain Tumor Detection and Classification using Adaptive Boosting
IRJET Journal
 
Techniques of Brain Cancer Detection from MRI using Machine Learning
Techniques of Brain Cancer Detection from MRI using Machine LearningTechniques of Brain Cancer Detection from MRI using Machine Learning
Techniques of Brain Cancer Detection from MRI using Machine Learning
IRJET Journal
 
Comparitive study of brain tumor detection using morphological operators
Comparitive study of brain tumor detection using morphological operatorsComparitive study of brain tumor detection using morphological operators
Comparitive study of brain tumor detection using morphological operators
eSAT Journals
 
Identifying brain tumour from mri image using modified fcm and support
Identifying brain tumour from mri image using modified fcm and supportIdentifying brain tumour from mri image using modified fcm and support
Identifying brain tumour from mri image using modified fcm and supportIAEME Publication
 
A survey of deep learning approaches to medical applications
A survey of deep learning approaches to medical applicationsA survey of deep learning approaches to medical applications
A survey of deep learning approaches to medical applications
Joseph Paul Cohen PhD
 
BRAIN TUMOR MRI IMAGE SEGMENTATION AND DETECTION IN IMAGE PROCESSING
BRAIN TUMOR MRI IMAGE SEGMENTATION AND DETECTION IN IMAGE PROCESSINGBRAIN TUMOR MRI IMAGE SEGMENTATION AND DETECTION IN IMAGE PROCESSING
BRAIN TUMOR MRI IMAGE SEGMENTATION AND DETECTION IN IMAGE PROCESSING
Dharshika Shreeganesh
 
Brain Tumor Detection Using Artificial Neural Network Fuzzy Inference System ...
Brain Tumor Detection Using Artificial Neural Network Fuzzy Inference System ...Brain Tumor Detection Using Artificial Neural Network Fuzzy Inference System ...
Brain Tumor Detection Using Artificial Neural Network Fuzzy Inference System ...
Editor IJCATR
 
Neuroendoscopy Adapter Module Development for Better Brain Tumor Image Visual...
Neuroendoscopy Adapter Module Development for Better Brain Tumor Image Visual...Neuroendoscopy Adapter Module Development for Better Brain Tumor Image Visual...
Neuroendoscopy Adapter Module Development for Better Brain Tumor Image Visual...
IJECEIAES
 
11.segmentation and feature extraction of tumors from digital mammograms
11.segmentation and feature extraction of tumors from digital mammograms11.segmentation and feature extraction of tumors from digital mammograms
11.segmentation and feature extraction of tumors from digital mammogramsAlexander Decker
 
Automatic MRI brain segmentation using local features, Self-Organizing Maps, ...
Automatic MRI brain segmentation using local features, Self-Organizing Maps, ...Automatic MRI brain segmentation using local features, Self-Organizing Maps, ...
Automatic MRI brain segmentation using local features, Self-Organizing Maps, ...
Mehryar (Mike) E., Ph.D.
 
Brain tumor detection using image segmentation ppt
Brain tumor detection using image segmentation pptBrain tumor detection using image segmentation ppt
Brain tumor detection using image segmentation ppt
Roshini Vijayakumar
 
3D Segmentation of Brain Tumor Imaging
3D Segmentation of Brain Tumor Imaging3D Segmentation of Brain Tumor Imaging
3D Segmentation of Brain Tumor Imaging
IJAEMSJORNAL
 
Automatic grading of diabetic retinopathy through machine learning
Automatic grading of diabetic retinopathy through machine learning   Automatic grading of diabetic retinopathy through machine learning
Automatic grading of diabetic retinopathy through machine learning
SupriyaKamatgi
 
Zeeshan.ali.presentations
Zeeshan.ali.presentationsZeeshan.ali.presentations
Zeeshan.ali.presentations
Zeeshan Ali
 
Medical image analysis
Medical image analysisMedical image analysis
Medical image analysis
Aboul Ella Hassanien
 

What's hot (20)

Brain tumour segmentation based on local independent projection based classif...
Brain tumour segmentation based on local independent projection based classif...Brain tumour segmentation based on local independent projection based classif...
Brain tumour segmentation based on local independent projection based classif...
 
IRJET- Brain Tumor Detection using Image Processing and MATLAB Application
IRJET-  	  Brain Tumor Detection using Image Processing and MATLAB ApplicationIRJET-  	  Brain Tumor Detection using Image Processing and MATLAB Application
IRJET- Brain Tumor Detection using Image Processing and MATLAB Application
 
Brain Tumor Segmentation in MRI Images
Brain Tumor Segmentation in MRI ImagesBrain Tumor Segmentation in MRI Images
Brain Tumor Segmentation in MRI Images
 
Brain Tumor Detection and Classification using Adaptive Boosting
Brain Tumor Detection and Classification using Adaptive BoostingBrain Tumor Detection and Classification using Adaptive Boosting
Brain Tumor Detection and Classification using Adaptive Boosting
 
Techniques of Brain Cancer Detection from MRI using Machine Learning
Techniques of Brain Cancer Detection from MRI using Machine LearningTechniques of Brain Cancer Detection from MRI using Machine Learning
Techniques of Brain Cancer Detection from MRI using Machine Learning
 
Qt7355g8v8
Qt7355g8v8Qt7355g8v8
Qt7355g8v8
 
Comparitive study of brain tumor detection using morphological operators
Comparitive study of brain tumor detection using morphological operatorsComparitive study of brain tumor detection using morphological operators
Comparitive study of brain tumor detection using morphological operators
 
Report (1)
Report (1)Report (1)
Report (1)
 
Identifying brain tumour from mri image using modified fcm and support
Identifying brain tumour from mri image using modified fcm and supportIdentifying brain tumour from mri image using modified fcm and support
Identifying brain tumour from mri image using modified fcm and support
 
A survey of deep learning approaches to medical applications
A survey of deep learning approaches to medical applicationsA survey of deep learning approaches to medical applications
A survey of deep learning approaches to medical applications
 
BRAIN TUMOR MRI IMAGE SEGMENTATION AND DETECTION IN IMAGE PROCESSING
BRAIN TUMOR MRI IMAGE SEGMENTATION AND DETECTION IN IMAGE PROCESSINGBRAIN TUMOR MRI IMAGE SEGMENTATION AND DETECTION IN IMAGE PROCESSING
BRAIN TUMOR MRI IMAGE SEGMENTATION AND DETECTION IN IMAGE PROCESSING
 
Brain Tumor Detection Using Artificial Neural Network Fuzzy Inference System ...
Brain Tumor Detection Using Artificial Neural Network Fuzzy Inference System ...Brain Tumor Detection Using Artificial Neural Network Fuzzy Inference System ...
Brain Tumor Detection Using Artificial Neural Network Fuzzy Inference System ...
 
Neuroendoscopy Adapter Module Development for Better Brain Tumor Image Visual...
Neuroendoscopy Adapter Module Development for Better Brain Tumor Image Visual...Neuroendoscopy Adapter Module Development for Better Brain Tumor Image Visual...
Neuroendoscopy Adapter Module Development for Better Brain Tumor Image Visual...
 
11.segmentation and feature extraction of tumors from digital mammograms
11.segmentation and feature extraction of tumors from digital mammograms11.segmentation and feature extraction of tumors from digital mammograms
11.segmentation and feature extraction of tumors from digital mammograms
 
Automatic MRI brain segmentation using local features, Self-Organizing Maps, ...
Automatic MRI brain segmentation using local features, Self-Organizing Maps, ...Automatic MRI brain segmentation using local features, Self-Organizing Maps, ...
Automatic MRI brain segmentation using local features, Self-Organizing Maps, ...
 
Brain tumor detection using image segmentation ppt
Brain tumor detection using image segmentation pptBrain tumor detection using image segmentation ppt
Brain tumor detection using image segmentation ppt
 
3D Segmentation of Brain Tumor Imaging
3D Segmentation of Brain Tumor Imaging3D Segmentation of Brain Tumor Imaging
3D Segmentation of Brain Tumor Imaging
 
Automatic grading of diabetic retinopathy through machine learning
Automatic grading of diabetic retinopathy through machine learning   Automatic grading of diabetic retinopathy through machine learning
Automatic grading of diabetic retinopathy through machine learning
 
Zeeshan.ali.presentations
Zeeshan.ali.presentationsZeeshan.ali.presentations
Zeeshan.ali.presentations
 
Medical image analysis
Medical image analysisMedical image analysis
Medical image analysis
 

Similar to Video Retrieval of Specific Persons in Specific Locations

Face and Eye Detection Varying Scenarios With Haar Classifier_2015
Face and Eye Detection Varying Scenarios With Haar Classifier_2015Face and Eye Detection Varying Scenarios With Haar Classifier_2015
Face and Eye Detection Varying Scenarios With Haar Classifier_2015
Showrav Mazumder
 
Diving deep into sentiment: Understanding fine-tuned CNNs for visual sentimen...
Diving deep into sentiment: Understanding fine-tuned CNNs for visual sentimen...Diving deep into sentiment: Understanding fine-tuned CNNs for visual sentimen...
Diving deep into sentiment: Understanding fine-tuned CNNs for visual sentimen...
Universitat Politècnica de Catalunya
 
[DSC Europe 22] Face Spoofing Detection: Theory and Practice - Pavle Milosevic
[DSC Europe 22] Face Spoofing Detection: Theory and Practice - Pavle Milosevic[DSC Europe 22] Face Spoofing Detection: Theory and Practice - Pavle Milosevic
[DSC Europe 22] Face Spoofing Detection: Theory and Practice - Pavle Milosevic
DataScienceConferenc1
 
Long-term Face Tracking in the Wild using Deep Learning
Long-term Face Tracking in the Wild using Deep LearningLong-term Face Tracking in the Wild using Deep Learning
Long-term Face Tracking in the Wild using Deep Learning
Elaheh Rashedi
 
Artificial intelligence use cases for International Dating Apps. iDate 2018. ...
Artificial intelligence use cases for International Dating Apps. iDate 2018. ...Artificial intelligence use cases for International Dating Apps. iDate 2018. ...
Artificial intelligence use cases for International Dating Apps. iDate 2018. ...
Lluis Carreras
 
Layer-wise CNN Surgery for Visual Sentiment Prediction
Layer-wise CNN Surgery for Visual Sentiment PredictionLayer-wise CNN Surgery for Visual Sentiment Prediction
Layer-wise CNN Surgery for Visual Sentiment Prediction
Universitat Politècnica de Catalunya
 
Skin melanoma stage detection - CNN.pptx
Skin melanoma stage detection - CNN.pptxSkin melanoma stage detection - CNN.pptx
Skin melanoma stage detection - CNN.pptx
VishalLabde
 
DeepDRImageGuidedDiabeticRetinopathyDetectionUsingAttentionBasedDeepLearningS...
DeepDRImageGuidedDiabeticRetinopathyDetectionUsingAttentionBasedDeepLearningS...DeepDRImageGuidedDiabeticRetinopathyDetectionUsingAttentionBasedDeepLearningS...
DeepDRImageGuidedDiabeticRetinopathyDetectionUsingAttentionBasedDeepLearningS...
RamithaDevi
 
Deep learning for understanding faces
Deep learning for understanding facesDeep learning for understanding faces
Deep learning for understanding faces
sieubebu
 
Person Recognition
Person RecognitionPerson Recognition
Person Recognition
Fahmeen Mazhar
 
DSLR Image Quality Analysis using Minitab
DSLR Image Quality Analysis using MinitabDSLR Image Quality Analysis using Minitab
DSLR Image Quality Analysis using Minitab
Karthik Murali
 
Face Recognition using PCA and MSNN
Face Recognition using PCA and MSNNFace Recognition using PCA and MSNN
Face Recognition using PCA and MSNN
RABI GAYAN
 
IRJET- Survey on Face Detection Methods
IRJET- Survey on Face Detection MethodsIRJET- Survey on Face Detection Methods
IRJET- Survey on Face Detection Methods
IRJET Journal
 
PR-185: RetinaFace: Single-stage Dense Face Localisation in the Wild
PR-185: RetinaFace: Single-stage Dense Face Localisation in the WildPR-185: RetinaFace: Single-stage Dense Face Localisation in the Wild
PR-185: RetinaFace: Single-stage Dense Face Localisation in the Wild
jaewon lee
 
Methodology (DLAI D6L2 2017 UPC Deep Learning for Artificial Intelligence)
Methodology (DLAI D6L2 2017 UPC Deep Learning for Artificial Intelligence)Methodology (DLAI D6L2 2017 UPC Deep Learning for Artificial Intelligence)
Methodology (DLAI D6L2 2017 UPC Deep Learning for Artificial Intelligence)
Universitat Politècnica de Catalunya
 
Analysis_of_deep_learning_algorithms_for_diabetic_retinopathy.pptx
Analysis_of_deep_learning_algorithms_for_diabetic_retinopathy.pptxAnalysis_of_deep_learning_algorithms_for_diabetic_retinopathy.pptx
Analysis_of_deep_learning_algorithms_for_diabetic_retinopathy.pptx
DebrajBanerjee22
 
Robust Tracking Via Feature Mapping Method and Support Vector Machine
Robust Tracking Via Feature Mapping Method and Support Vector MachineRobust Tracking Via Feature Mapping Method and Support Vector Machine
Robust Tracking Via Feature Mapping Method and Support Vector Machine
IRJET Journal
 
FACE MASK DETECTION USING ML
FACE MASK DETECTION USING MLFACE MASK DETECTION USING ML
FACE MASK DETECTION USING ML
IRJET Journal
 
Face Recognition Methods based on Convolutional Neural Networks
Face Recognition Methods based on Convolutional Neural NetworksFace Recognition Methods based on Convolutional Neural Networks
Face Recognition Methods based on Convolutional Neural Networks
Elaheh Rashedi
 
Smriti's research paper
Smriti's research paperSmriti's research paper
Smriti's research paperSmriti Tikoo
 

Similar to Video Retrieval of Specific Persons in Specific Locations (20)

Face and Eye Detection Varying Scenarios With Haar Classifier_2015
Face and Eye Detection Varying Scenarios With Haar Classifier_2015Face and Eye Detection Varying Scenarios With Haar Classifier_2015
Face and Eye Detection Varying Scenarios With Haar Classifier_2015
 
Diving deep into sentiment: Understanding fine-tuned CNNs for visual sentimen...
Diving deep into sentiment: Understanding fine-tuned CNNs for visual sentimen...Diving deep into sentiment: Understanding fine-tuned CNNs for visual sentimen...
Diving deep into sentiment: Understanding fine-tuned CNNs for visual sentimen...
 
[DSC Europe 22] Face Spoofing Detection: Theory and Practice - Pavle Milosevic
[DSC Europe 22] Face Spoofing Detection: Theory and Practice - Pavle Milosevic[DSC Europe 22] Face Spoofing Detection: Theory and Practice - Pavle Milosevic
[DSC Europe 22] Face Spoofing Detection: Theory and Practice - Pavle Milosevic
 
Long-term Face Tracking in the Wild using Deep Learning
Long-term Face Tracking in the Wild using Deep LearningLong-term Face Tracking in the Wild using Deep Learning
Long-term Face Tracking in the Wild using Deep Learning
 
Artificial intelligence use cases for International Dating Apps. iDate 2018. ...
Artificial intelligence use cases for International Dating Apps. iDate 2018. ...Artificial intelligence use cases for International Dating Apps. iDate 2018. ...
Artificial intelligence use cases for International Dating Apps. iDate 2018. ...
 
Layer-wise CNN Surgery for Visual Sentiment Prediction
Layer-wise CNN Surgery for Visual Sentiment PredictionLayer-wise CNN Surgery for Visual Sentiment Prediction
Layer-wise CNN Surgery for Visual Sentiment Prediction
 
Skin melanoma stage detection - CNN.pptx
Skin melanoma stage detection - CNN.pptxSkin melanoma stage detection - CNN.pptx
Skin melanoma stage detection - CNN.pptx
 
DeepDRImageGuidedDiabeticRetinopathyDetectionUsingAttentionBasedDeepLearningS...
DeepDRImageGuidedDiabeticRetinopathyDetectionUsingAttentionBasedDeepLearningS...DeepDRImageGuidedDiabeticRetinopathyDetectionUsingAttentionBasedDeepLearningS...
DeepDRImageGuidedDiabeticRetinopathyDetectionUsingAttentionBasedDeepLearningS...
 
Deep learning for understanding faces
Deep learning for understanding facesDeep learning for understanding faces
Deep learning for understanding faces
 
Person Recognition
Person RecognitionPerson Recognition
Person Recognition
 
DSLR Image Quality Analysis using Minitab
DSLR Image Quality Analysis using MinitabDSLR Image Quality Analysis using Minitab
DSLR Image Quality Analysis using Minitab
 
Face Recognition using PCA and MSNN
Face Recognition using PCA and MSNNFace Recognition using PCA and MSNN
Face Recognition using PCA and MSNN
 
IRJET- Survey on Face Detection Methods
IRJET- Survey on Face Detection MethodsIRJET- Survey on Face Detection Methods
IRJET- Survey on Face Detection Methods
 
PR-185: RetinaFace: Single-stage Dense Face Localisation in the Wild
PR-185: RetinaFace: Single-stage Dense Face Localisation in the WildPR-185: RetinaFace: Single-stage Dense Face Localisation in the Wild
PR-185: RetinaFace: Single-stage Dense Face Localisation in the Wild
 
Methodology (DLAI D6L2 2017 UPC Deep Learning for Artificial Intelligence)
Methodology (DLAI D6L2 2017 UPC Deep Learning for Artificial Intelligence)Methodology (DLAI D6L2 2017 UPC Deep Learning for Artificial Intelligence)
Methodology (DLAI D6L2 2017 UPC Deep Learning for Artificial Intelligence)
 
Analysis_of_deep_learning_algorithms_for_diabetic_retinopathy.pptx
Analysis_of_deep_learning_algorithms_for_diabetic_retinopathy.pptxAnalysis_of_deep_learning_algorithms_for_diabetic_retinopathy.pptx
Analysis_of_deep_learning_algorithms_for_diabetic_retinopathy.pptx
 
Robust Tracking Via Feature Mapping Method and Support Vector Machine
Robust Tracking Via Feature Mapping Method and Support Vector MachineRobust Tracking Via Feature Mapping Method and Support Vector Machine
Robust Tracking Via Feature Mapping Method and Support Vector Machine
 
FACE MASK DETECTION USING ML
FACE MASK DETECTION USING MLFACE MASK DETECTION USING ML
FACE MASK DETECTION USING ML
 
Face Recognition Methods based on Convolutional Neural Networks
Face Recognition Methods based on Convolutional Neural NetworksFace Recognition Methods based on Convolutional Neural Networks
Face Recognition Methods based on Convolutional Neural Networks
 
Smriti's research paper
Smriti's research paperSmriti's research paper
Smriti's research paper
 

More from Universitat Politècnica de Catalunya

Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Universitat Politècnica de Catalunya
 
Deep Generative Learning for All
Deep Generative Learning for AllDeep Generative Learning for All
Deep Generative Learning for All
Universitat Politècnica de Catalunya
 
The Transformer in Vision | Xavier Giro | Master in Computer Vision Barcelona...
The Transformer in Vision | Xavier Giro | Master in Computer Vision Barcelona...The Transformer in Vision | Xavier Giro | Master in Computer Vision Barcelona...
The Transformer in Vision | Xavier Giro | Master in Computer Vision Barcelona...
Universitat Politècnica de Catalunya
 
Towards Sign Language Translation & Production | Xavier Giro-i-Nieto
Towards Sign Language Translation & Production | Xavier Giro-i-NietoTowards Sign Language Translation & Production | Xavier Giro-i-Nieto
Towards Sign Language Translation & Production | Xavier Giro-i-Nieto
Universitat Politècnica de Catalunya
 
The Transformer - Xavier Giró - UPC Barcelona 2021
The Transformer - Xavier Giró - UPC Barcelona 2021The Transformer - Xavier Giró - UPC Barcelona 2021
The Transformer - Xavier Giró - UPC Barcelona 2021
Universitat Politècnica de Catalunya
 
Learning Representations for Sign Language Videos - Xavier Giro - NIST TRECVI...
Learning Representations for Sign Language Videos - Xavier Giro - NIST TRECVI...Learning Representations for Sign Language Videos - Xavier Giro - NIST TRECVI...
Learning Representations for Sign Language Videos - Xavier Giro - NIST TRECVI...
Universitat Politècnica de Catalunya
 
Open challenges in sign language translation and production
Open challenges in sign language translation and productionOpen challenges in sign language translation and production
Open challenges in sign language translation and production
Universitat Politècnica de Catalunya
 
Generation of Synthetic Referring Expressions for Object Segmentation in Videos
Generation of Synthetic Referring Expressions for Object Segmentation in VideosGeneration of Synthetic Referring Expressions for Object Segmentation in Videos
Generation of Synthetic Referring Expressions for Object Segmentation in Videos
Universitat Politècnica de Catalunya
 
Discovery and Learning of Navigation Goals from Pixels in Minecraft
Discovery and Learning of Navigation Goals from Pixels in MinecraftDiscovery and Learning of Navigation Goals from Pixels in Minecraft
Discovery and Learning of Navigation Goals from Pixels in Minecraft
Universitat Politècnica de Catalunya
 
Learn2Sign : Sign language recognition and translation using human keypoint e...
Learn2Sign : Sign language recognition and translation using human keypoint e...Learn2Sign : Sign language recognition and translation using human keypoint e...
Learn2Sign : Sign language recognition and translation using human keypoint e...
Universitat Politècnica de Catalunya
 
Intepretability / Explainable AI for Deep Neural Networks
Intepretability / Explainable AI for Deep Neural NetworksIntepretability / Explainable AI for Deep Neural Networks
Intepretability / Explainable AI for Deep Neural Networks
Universitat Politècnica de Catalunya
 
Convolutional Neural Networks - Xavier Giro - UPC TelecomBCN Barcelona 2020
Convolutional Neural Networks - Xavier Giro - UPC TelecomBCN Barcelona 2020Convolutional Neural Networks - Xavier Giro - UPC TelecomBCN Barcelona 2020
Convolutional Neural Networks - Xavier Giro - UPC TelecomBCN Barcelona 2020
Universitat Politècnica de Catalunya
 
Self-Supervised Audio-Visual Learning - Xavier Giro - UPC TelecomBCN Barcelon...
Self-Supervised Audio-Visual Learning - Xavier Giro - UPC TelecomBCN Barcelon...Self-Supervised Audio-Visual Learning - Xavier Giro - UPC TelecomBCN Barcelon...
Self-Supervised Audio-Visual Learning - Xavier Giro - UPC TelecomBCN Barcelon...
Universitat Politècnica de Catalunya
 
Attention for Deep Learning - Xavier Giro - UPC TelecomBCN Barcelona 2020
Attention for Deep Learning - Xavier Giro - UPC TelecomBCN Barcelona 2020Attention for Deep Learning - Xavier Giro - UPC TelecomBCN Barcelona 2020
Attention for Deep Learning - Xavier Giro - UPC TelecomBCN Barcelona 2020
Universitat Politècnica de Catalunya
 
Generative Adversarial Networks GAN - Xavier Giro - UPC TelecomBCN Barcelona ...
Generative Adversarial Networks GAN - Xavier Giro - UPC TelecomBCN Barcelona ...Generative Adversarial Networks GAN - Xavier Giro - UPC TelecomBCN Barcelona ...
Generative Adversarial Networks GAN - Xavier Giro - UPC TelecomBCN Barcelona ...
Universitat Politècnica de Catalunya
 
Q-Learning with a Neural Network - Xavier Giró - UPC Barcelona 2020
Q-Learning with a Neural Network - Xavier Giró - UPC Barcelona 2020Q-Learning with a Neural Network - Xavier Giró - UPC Barcelona 2020
Q-Learning with a Neural Network - Xavier Giró - UPC Barcelona 2020
Universitat Politècnica de Catalunya
 
Language and Vision with Deep Learning - Xavier Giró - ACM ICMR 2020 (Tutorial)
Language and Vision with Deep Learning - Xavier Giró - ACM ICMR 2020 (Tutorial)Language and Vision with Deep Learning - Xavier Giró - ACM ICMR 2020 (Tutorial)
Language and Vision with Deep Learning - Xavier Giró - ACM ICMR 2020 (Tutorial)
Universitat Politècnica de Catalunya
 
Image Segmentation with Deep Learning - Xavier Giro & Carles Ventura - ISSonD...
Image Segmentation with Deep Learning - Xavier Giro & Carles Ventura - ISSonD...Image Segmentation with Deep Learning - Xavier Giro & Carles Ventura - ISSonD...
Image Segmentation with Deep Learning - Xavier Giro & Carles Ventura - ISSonD...
Universitat Politècnica de Catalunya
 
Curriculum Learning for Recurrent Video Object Segmentation
Curriculum Learning for Recurrent Video Object SegmentationCurriculum Learning for Recurrent Video Object Segmentation
Curriculum Learning for Recurrent Video Object Segmentation
Universitat Politècnica de Catalunya
 
Deep Self-supervised Learning for All - Xavier Giro - X-Europe 2020
Deep Self-supervised Learning for All - Xavier Giro - X-Europe 2020Deep Self-supervised Learning for All - Xavier Giro - X-Europe 2020
Deep Self-supervised Learning for All - Xavier Giro - X-Europe 2020
Universitat Politècnica de Catalunya
 

More from Universitat Politècnica de Catalunya (20)

Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
 
Deep Generative Learning for All
Deep Generative Learning for AllDeep Generative Learning for All
Deep Generative Learning for All
 
The Transformer in Vision | Xavier Giro | Master in Computer Vision Barcelona...
The Transformer in Vision | Xavier Giro | Master in Computer Vision Barcelona...The Transformer in Vision | Xavier Giro | Master in Computer Vision Barcelona...
The Transformer in Vision | Xavier Giro | Master in Computer Vision Barcelona...
 
Towards Sign Language Translation & Production | Xavier Giro-i-Nieto
Towards Sign Language Translation & Production | Xavier Giro-i-NietoTowards Sign Language Translation & Production | Xavier Giro-i-Nieto
Towards Sign Language Translation & Production | Xavier Giro-i-Nieto
 
The Transformer - Xavier Giró - UPC Barcelona 2021
The Transformer - Xavier Giró - UPC Barcelona 2021The Transformer - Xavier Giró - UPC Barcelona 2021
The Transformer - Xavier Giró - UPC Barcelona 2021
 
Learning Representations for Sign Language Videos - Xavier Giro - NIST TRECVI...
Learning Representations for Sign Language Videos - Xavier Giro - NIST TRECVI...Learning Representations for Sign Language Videos - Xavier Giro - NIST TRECVI...
Learning Representations for Sign Language Videos - Xavier Giro - NIST TRECVI...
 
Open challenges in sign language translation and production
Open challenges in sign language translation and productionOpen challenges in sign language translation and production
Open challenges in sign language translation and production
 
Generation of Synthetic Referring Expressions for Object Segmentation in Videos
Generation of Synthetic Referring Expressions for Object Segmentation in VideosGeneration of Synthetic Referring Expressions for Object Segmentation in Videos
Generation of Synthetic Referring Expressions for Object Segmentation in Videos
 
Discovery and Learning of Navigation Goals from Pixels in Minecraft
Discovery and Learning of Navigation Goals from Pixels in MinecraftDiscovery and Learning of Navigation Goals from Pixels in Minecraft
Discovery and Learning of Navigation Goals from Pixels in Minecraft
 
Learn2Sign : Sign language recognition and translation using human keypoint e...
Learn2Sign : Sign language recognition and translation using human keypoint e...Learn2Sign : Sign language recognition and translation using human keypoint e...
Learn2Sign : Sign language recognition and translation using human keypoint e...
 
Intepretability / Explainable AI for Deep Neural Networks
Intepretability / Explainable AI for Deep Neural NetworksIntepretability / Explainable AI for Deep Neural Networks
Intepretability / Explainable AI for Deep Neural Networks
 
Convolutional Neural Networks - Xavier Giro - UPC TelecomBCN Barcelona 2020
Convolutional Neural Networks - Xavier Giro - UPC TelecomBCN Barcelona 2020Convolutional Neural Networks - Xavier Giro - UPC TelecomBCN Barcelona 2020
Convolutional Neural Networks - Xavier Giro - UPC TelecomBCN Barcelona 2020
 
Self-Supervised Audio-Visual Learning - Xavier Giro - UPC TelecomBCN Barcelon...
Self-Supervised Audio-Visual Learning - Xavier Giro - UPC TelecomBCN Barcelon...Self-Supervised Audio-Visual Learning - Xavier Giro - UPC TelecomBCN Barcelon...
Self-Supervised Audio-Visual Learning - Xavier Giro - UPC TelecomBCN Barcelon...
 
Attention for Deep Learning - Xavier Giro - UPC TelecomBCN Barcelona 2020
Attention for Deep Learning - Xavier Giro - UPC TelecomBCN Barcelona 2020Attention for Deep Learning - Xavier Giro - UPC TelecomBCN Barcelona 2020
Attention for Deep Learning - Xavier Giro - UPC TelecomBCN Barcelona 2020
 
Generative Adversarial Networks GAN - Xavier Giro - UPC TelecomBCN Barcelona ...
Generative Adversarial Networks GAN - Xavier Giro - UPC TelecomBCN Barcelona ...Generative Adversarial Networks GAN - Xavier Giro - UPC TelecomBCN Barcelona ...
Generative Adversarial Networks GAN - Xavier Giro - UPC TelecomBCN Barcelona ...
 
Q-Learning with a Neural Network - Xavier Giró - UPC Barcelona 2020
Q-Learning with a Neural Network - Xavier Giró - UPC Barcelona 2020Q-Learning with a Neural Network - Xavier Giró - UPC Barcelona 2020
Q-Learning with a Neural Network - Xavier Giró - UPC Barcelona 2020
 
Language and Vision with Deep Learning - Xavier Giró - ACM ICMR 2020 (Tutorial)
Language and Vision with Deep Learning - Xavier Giró - ACM ICMR 2020 (Tutorial)Language and Vision with Deep Learning - Xavier Giró - ACM ICMR 2020 (Tutorial)
Language and Vision with Deep Learning - Xavier Giró - ACM ICMR 2020 (Tutorial)
 
Image Segmentation with Deep Learning - Xavier Giro & Carles Ventura - ISSonD...
Image Segmentation with Deep Learning - Xavier Giro & Carles Ventura - ISSonD...Image Segmentation with Deep Learning - Xavier Giro & Carles Ventura - ISSonD...
Image Segmentation with Deep Learning - Xavier Giro & Carles Ventura - ISSonD...
 
Curriculum Learning for Recurrent Video Object Segmentation
Curriculum Learning for Recurrent Video Object SegmentationCurriculum Learning for Recurrent Video Object Segmentation
Curriculum Learning for Recurrent Video Object Segmentation
 
Deep Self-supervised Learning for All - Xavier Giro - X-Europe 2020
Deep Self-supervised Learning for All - Xavier Giro - X-Europe 2020Deep Self-supervised Learning for All - Xavier Giro - X-Europe 2020
Deep Self-supervised Learning for All - Xavier Giro - X-Europe 2020
 

Recently uploaded

Global Situational Awareness of A.I. and where its headed
Global Situational Awareness of A.I. and where its headedGlobal Situational Awareness of A.I. and where its headed
Global Situational Awareness of A.I. and where its headed
vikram sood
 
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdfEnhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
GetInData
 
Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)
TravisMalana
 
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
axoqas
 
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
u86oixdj
 
Learn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queriesLearn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queries
manishkhaire30
 
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptxData_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
AnirbanRoy608946
 
Influence of Marketing Strategy and Market Competition on Business Plan
Influence of Marketing Strategy and Market Competition on Business PlanInfluence of Marketing Strategy and Market Competition on Business Plan
Influence of Marketing Strategy and Market Competition on Business Plan
jerlynmaetalle
 
My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.
rwarrenll
 
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
74nqk8xf
 
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
g4dpvqap0
 
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
u86oixdj
 
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdfCh03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
haila53
 
The Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series DatabaseThe Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series Database
javier ramirez
 
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Subhajit Sahu
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
jerlynmaetalle
 
Everything you wanted to know about LIHTC
Everything you wanted to know about LIHTCEverything you wanted to know about LIHTC
Everything you wanted to know about LIHTC
Roger Valdez
 
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
sameer shah
 
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
slg6lamcq
 
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
v3tuleee
 

Recently uploaded (20)

Global Situational Awareness of A.I. and where its headed
Global Situational Awareness of A.I. and where its headedGlobal Situational Awareness of A.I. and where its headed
Global Situational Awareness of A.I. and where its headed
 
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdfEnhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
 
Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)
 
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
 
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
 
Learn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queriesLearn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queries
 
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptxData_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
 
Influence of Marketing Strategy and Market Competition on Business Plan
Influence of Marketing Strategy and Market Competition on Business PlanInfluence of Marketing Strategy and Market Competition on Business Plan
Influence of Marketing Strategy and Market Competition on Business Plan
 
My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.
 
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
 
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
 
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
 
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdfCh03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
 
The Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series DatabaseThe Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series Database
 
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
 
Everything you wanted to know about LIHTC
Everything you wanted to know about LIHTCEverything you wanted to know about LIHTC
Everything you wanted to know about LIHTC
 
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
 
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
 
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
 

Video Retrieval of Specific Persons in Specific Locations

  • 1. VIDEO RETRIEVAL OF SPECIFIC PERSONS IN SPECIFIC LOCATIONS AUTHOR: ADVISORS: Xavier Giró-i-Nieto Eva Mohedano Kevin McGuinness Andrea Calafell Noel E. O’Connor 1
  • 2. 1. Motivation 2. State of the art 3. Framework for TRECVID 4. Face detection 5. Face representation 6. Query expansion 7. Annotation Tool 8. Fusion and normalization strategies 9. Conclusions and future work OUTLINE 2
  • 4. TRECVID INSTANCE SEARCH 2016 PEOPLE AND LOCATION QUERY SET Person visual examples Binary masks Location visual examples TARGET DATABASE 1.5M keyframes 244 video files (300GB) 4
  • 5. MOTIVATION: goals ● Obtain a baseline to participate in TRECVID Instance Search 2016 (July, 1). ● Improve the results obtained in TRECVID using the baseline. 5
  • 6. 1. Motivation 2. State of the art 3. Framework for TRECVID 4. Face detection 5. Face representation 6. Query expansion 7. Annotation Tool 8. Fusion and normalization strategies 9. Conclusions and future work OUTLINE 6
  • 7. STATE OF THE ART Image of Eva Mohedano, D3L6 Image Retrieval, Deep Learning for Computer Vision (UPC 2016) BASIC RETRIEVAL PIPELINE: 7
  • 8. STATE OF THE ART Image of Eva Mohedano, D3L6 Image Retrieval, Deep Learning for Computer Vision (UPC 2016) BAG OF VISUAL WORDS: 8
  • 9. STATE OF THE ART Image: Alex Krizhevsky , Ilya Sutskever , Geoffrey E. Hinton, Imagenet classification with deep convolutional neural networks, 2012 Ali Sharif Razavian, Josephine Sullivan, Atsuto Maki, and Stefan Carlsson. A baseline for visual instance retrieval with deep convolutional networks. ICLR 2015. CNN REPRESENTATION: 9
  • 10. STATE OF THE ART Eva Mohedano, Amaia Salvador, Kevin McGuinness, Ferran Marqués, Noel E. O’Connor, and Xavier Giró i Nieto. Bags of local convolutional features for scalable instance search. ICMR 2016. BAG OF LOCAL CONVOLUTIONAL FEATURES: 10
  • 11. 1. Motivation 2. State of the art 3. Framework for TRECVID 4. Face detection 5. Face representation 6. Query expansion 7. Annotation Tool 8. Fusion and normalization strategies 9. Conclusions and future work OUTLINE 11
  • 14. 1. Motivation 2. State of the art 3. Framework for TRECVID 4. Face detection 5. Face representation 6. Query expansion 7. Annotation Tool 8. Fusion and normalization strategies 9. Conclusions and future work OUTLINE 14
  • 16. FACE DETECTION: ReInspect Russell Stewart, Mykhaylo Andriluka, and Andrew Y. Ng. End-to-end people detection in crowded scenes. CVPR 2016. 16
  • 17. FACE DETECTION: ReInspect QUALITATIVE RESULTS OF REINSPECT: Changing both the input size of the network and the image size Changing only the image size 17 Bad detections False negatives
  • 18. FACE DETECTION: ReInspect PROBLEM: Images used to train ReInspect 18
  • 19. FACE DETECTION: Menpo 1 https://github.com/menpo/menpodetect Python wrapper for face detectors1 : ● DLIB ● OPENCV ● Pixel Intensity Comparison-based Object detection (PICO) ● FFLD2 : ○ Based on Deformable Part Models (DPM) ○ Use LUV color space 2 M. Mathias, R. Benenson, M. Pedersoli, and L. Van Gool. Face detection without bells and whistles. ECCV, 2014. Examples of FFLD results 19
  • 20. QUALITATIVE RESULTS OF MENPO: DLIB FACE DETECTION: Menpo OPENCV PICO 20 False negatives
  • 21. QUALITATIVE RESULTS OF MENPO: FACE DETECTION: Menpo FFLD Still some false negatives Solution: Equalize image 21
  • 22. 1. Motivation 2. State of the art 3. Framework for TRECVID 4. Face detection 5. Face representation 6. Query expansion 7. Annotation Tool 8. Fusion and normalization strategies 9. Conclusions and future work OUTLINE 22
  • 23. DEEP FACE RECOGNITION FACE REPRESENTATION Image: Simonyan, Karen, and Andrew Zisserman. Very deep convolutional networks for large-scale image recognition. ICLR 2015. O. M. Parkhi, A. Vedaldi, and A. Zisserman. Deep face recognition. BMVC 2015 VGG 16-layer 23
  • 24. 1. Motivation 2. State of the art 3. Framework for TRECVID 4. Face detection 5. Face representation 6. Query expansion 7. Annotation Tool 8. Fusion and normalization strategies 9. Conclusions and future work OUTLINE 24
  • 25. QUERY EXPANSION Sequence of keyframes of one shot dilate Mask creation pipeline 25 TEMPORAL QUERY EXPANSION:
  • 26. Results of temporal query expansion 26 QUERY EXPANSION TEMPORAL QUERY EXPANSION:
  • 27. 27 QUERY EXPANSION PSEUDO-RELEVANCE FEEDBACK QUERY EXPANSION: Top 20 retrieved keyframes
  • 28. 1. Motivation 2. State of the art 3. Framework for TRECVID 4. Face detection 5. Face representation 6. Query expansion 7. Annotation Tool 8. Fusion and normalization strategies 9. Conclusions and future work OUTLINE 28
  • 29. ANNOTATION TOOL 3.991 shots for persons 1.528 shots for locations 794 shots in common 29
  • 30. 1. Motivation 2. State of the art 3. Framework for TRECVID 4. Face detection 5. Face representation 6. Query expansion 7. Annotation Tool 8. Fusion and normalization strategies 9. Conclusions and future work OUTLINE 30
  • 32. FUSION AND NORMALIZATION STRATEGIES NORMALIZATION: ● Z-score: ● Max-min: ● Extreme Value Theory: FUSION: Linear combination, maximum, minimum. 32
  • 33. FUSION AND NORMALIZATION STRATEGIES RESULTS OF APPLYING DIFFERENT NORMALIZATIONS: BASELINE 33
  • 34. FUSION AND NORMALIZATION STRATEGIES Brad Person distribution Laundrette Location distribution 34 EXAMPLE DISTRIBUTION:
  • 35. FUSION AND NORMALIZATION STRATEGIES RESULTS OF APPLYING MAXIMUM OR MINIMUM FUSION 35
  • 36. FUSION AND NORMALIZATION STRATEGIES RESULTS OF WEIGHTING LINEAR COMBINATION HIGHER THAN THE BASELINE 36
  • 37. 1. Motivation 2. State of the art 3. Framework for TRECVID 4. Face detection 5. Face representation 6. Query expansion 7. Annotation Tool 8. Fusion and normalization strategies 9. Conclusions and future work OUTLINE 37
  • 38. CONCLUSIONS ● FFLD, a simple approach using vanilla DPM combined with image equalization is the best option for TRECVID dataset 38
  • 39. CONCLUSIONS ● The temporal query expansion proposed works well, but the faces are very similar between them However, using the top 20 faces in the ranking as new queries gives more diverse faces. 39
  • 40. CONCLUSIONS ● An annotation tool is needed in order to obtain quantitative results. 3.991 shots for persons 1.528 shots for locations 794 shots in common TOTAL OF RELEVANT ANNOTATED SHOTS ● The best configuration is without applying normalization and combining the scores by weighting higher the location ranking 40
  • 41. FUTURE WORK ● Analyze deeper the location part ● Try to improve the location rankings 41
  • 43. FUSION AND NORMALIZATION STRATEGIES RESULTS OF THE PARTS SEPARATELY OVER 50 KEYFRAMES 43
  • 44. FUSION AND NORMALIZATION STRATEGIES RESULTS OF APPLYING MAXIMUM, MINIMUM AND PRODUCT FUSION 44