SlideShare a Scribd company logo
ETH-CVL @ MediaEval 2016: Textual-Visual
Embeddings and Video2GIF for Video Interestingness
Michael Gygli, PhD student @ Computer Vision Lab, ETH Zurich
Work with Arun Balajee Vasudevan, Anna Volokitin and Luc Van Gool
1
Content
▪ Overview
▪ Textual-visual embedding
▪ Video2GIF
▪ Results
2Michael Gygli, PhD student @ ETH Zurich06/21/2016
Finding the most interesting and relevant content
▪ Two dominant approaches
▪ Use of textual information (title, category) to obtain a video specific
model, often using web image priors
E.g. [Khosla et al. CVPR 13], [Liu et al. IJCAI 09], Song et al. CVPR 15]
▪ Supervised learning with generic features and large training set
E.g. [Potapov et al. ECCV 2014], [Sun et al. ECCV 2014], [Gygli et al.
CVPR 16]
▪ Some methods use both
E.g. [Liu et al. CVPR 15]
3
Textual visual embedding
Multi-task deep visual-semantic embedding for video
thumbnail selection [Liu et al. CVPR 15]
▪ Use Bing image search data (query, image, # of clicks) to learn a
joint embedding space for images and text
▪ Compute frame relevance as cosine similarity between the query or
title embedding and the frame embedding
[Liu et al. CVPR 2015]
5Michael Gygli, PhD student @ ETH Zurich06/21/2016
Our contribution with improvements over Liu et al.
● Siamese network
● Text Embedding Model
○ Words encoded through word2vec
○ LSTM to obtain fixed length embedding
● Convolutional Neural Network model
○ Fine-tuned VGG-19
● Training Data based on learning a ranking of: (query+, image+, image-
(“cat”, , )
6
Video2GIF
Video2GIF: Automatic Generation of Animated GIFs
from Video [Gygli et al. CVPR 16]
Approach
▪ Work with segments as units
▪ Obtained through change-point detection [Song et al. CVPR 15]
▪ Train a deep neural network for ranking segments
...
8
Example video
Highest
scoring
Lowest
scoring
Michael Gygli, PhD student @ ETH Zurich06/21/2016
Video2GIF: Automatic Generation of Animated GIFs
from Video
Approach
▪ Train a deep neural network for ranking
segments
▪ Built on C3D network [Tran et al. ICCV
2015]
▪ Objective: score positives higher than
negatives
h: scoring function
s+
: positive segment
s-
: negative segment
9Michael Gygli, PhD student @ ETH Zurich06/21/2016
Video2GIF: Dataset
10Michael Gygli, PhD student @ ETH Zurich06/21/2016
Available on github.com/gyglim/video2gif_dataset
▪ Large-scale training data: GIFs created from YouTube videos
▪ Align GIF back to video
▪ This part defines a positive example
▪ Assume non-selected parts are less interesting than selected part
Results
Task Run mAP
Image
1 0.1866
2 0.1952
3 0.1858
Video
1 0.1362
2 0.1574
Frame-based:
• Run 1: Visual-semantic embedding trained on Clickture dataset
• Run 2: As Run 1, with fine-tuning on development set
• Run 3: As Run 1, but trained on a larger subset of Clickture
Segment-based:
• Run 1: Video2GIF
• Run 2: Averaged score of Visual-semantic embedding and Video2GIF
Qualitative Results - Run 2
Title: Captives
Title: After Earth
Predicted best frame True best frame
Predicted best frame True best frame
More information
▪ Paper Video2GIF: Automatic Generation of Animated GIFs from Video.
M. Gygli, Y. Song, L. Cao, CVPR 2016
▪ Slides on Video Summarization as Subset Selection @ Tutorial on
Optimization Algorithms for Subset Selection and Summarization in Large
Data Sets, CVPR 2016
https://t.co/mQIpxMab3v
▪ Demo website for Video2GIF: http://video2gif.info/autogif
▪ Paper Multi-task deep visual-semantic embedding for video thumbnail
selection. W. Liu, T. Mei, Y. Zhang, C. Che, and J. Luo, CVPR 2015​
Questions?

More Related Content

Viewers also liked

MediaEval 2016 - LAPI @ 2016 Retrieving Diverse Social Images Task: A Pseudo-...
MediaEval 2016 - LAPI @ 2016 Retrieving Diverse Social Images Task: A Pseudo-...MediaEval 2016 - LAPI @ 2016 Retrieving Diverse Social Images Task: A Pseudo-...
MediaEval 2016 - LAPI @ 2016 Retrieving Diverse Social Images Task: A Pseudo-...
multimediaeval
 
MediaEval 2016 - BUT Zero-Cost Speech Recognition
MediaEval 2016 - BUT Zero-Cost Speech RecognitionMediaEval 2016 - BUT Zero-Cost Speech Recognition
MediaEval 2016 - BUT Zero-Cost Speech Recognition
multimediaeval
 
MediaEval 2016 - Simula Team @ Context of Experience Task
MediaEval 2016 - Simula Team @ Context of Experience TaskMediaEval 2016 - Simula Team @ Context of Experience Task
MediaEval 2016 - Simula Team @ Context of Experience Task
multimediaeval
 
MediaEval 2016 - ININ Submission to Zero Cost ASR Task
MediaEval 2016 - ININ Submission to Zero Cost ASR TaskMediaEval 2016 - ININ Submission to Zero Cost ASR Task
MediaEval 2016 - ININ Submission to Zero Cost ASR Task
multimediaeval
 
MediaEval 2016 - IR Evaluation: Putting the User Back in the Loop
MediaEval 2016 - IR Evaluation: Putting the User Back in the LoopMediaEval 2016 - IR Evaluation: Putting the User Back in the Loop
MediaEval 2016 - IR Evaluation: Putting the User Back in the Loop
multimediaeval
 
MediaEval 2016 - UPMC at MediaEval2016 Retrieving Diverse Social Images Task
MediaEval 2016 - UPMC at MediaEval2016 Retrieving Diverse Social Images TaskMediaEval 2016 - UPMC at MediaEval2016 Retrieving Diverse Social Images Task
MediaEval 2016 - UPMC at MediaEval2016 Retrieving Diverse Social Images Task
multimediaeval
 
MediaEval 2016 - UNIFESP Predicting Media Interestingness Task
MediaEval 2016 - UNIFESP Predicting Media Interestingness TaskMediaEval 2016 - UNIFESP Predicting Media Interestingness Task
MediaEval 2016 - UNIFESP Predicting Media Interestingness Task
multimediaeval
 
MediaEval 2016 - Emotion in Music Task: Lessons Learned
MediaEval 2016 - Emotion in Music Task: Lessons LearnedMediaEval 2016 - Emotion in Music Task: Lessons Learned
MediaEval 2016 - Emotion in Music Task: Lessons Learned
multimediaeval
 
MediaEval 2016 - TUD-MMC Predicting media Interestingness Task
MediaEval 2016 - TUD-MMC Predicting media Interestingness TaskMediaEval 2016 - TUD-MMC Predicting media Interestingness Task
MediaEval 2016 - TUD-MMC Predicting media Interestingness Task
multimediaeval
 
MediaEval 2016 - Tag Propagation in Talking Face Graphs
MediaEval 2016 - Tag Propagation in Talking Face GraphsMediaEval 2016 - Tag Propagation in Talking Face Graphs
MediaEval 2016 - Tag Propagation in Talking Face Graphs
multimediaeval
 
MediaEval 2016 - COSMIR and the OpenMIC Challenge: A Plan for Sustainable Mus...
MediaEval 2016 - COSMIR and the OpenMIC Challenge: A Plan for Sustainable Mus...MediaEval 2016 - COSMIR and the OpenMIC Challenge: A Plan for Sustainable Mus...
MediaEval 2016 - COSMIR and the OpenMIC Challenge: A Plan for Sustainable Mus...
multimediaeval
 
MediaEval 2016 - UVigo System for Multimodal Person Discovery in Broadcast TV...
MediaEval 2016 - UVigo System for Multimodal Person Discovery in Broadcast TV...MediaEval 2016 - UVigo System for Multimodal Person Discovery in Broadcast TV...
MediaEval 2016 - UVigo System for Multimodal Person Discovery in Broadcast TV...
multimediaeval
 
MediaEval 2016 - EUMSSI Team at the MediaEval Person Discovery Challenge
MediaEval 2016 - EUMSSI Team at the MediaEval Person Discovery ChallengeMediaEval 2016 - EUMSSI Team at the MediaEval Person Discovery Challenge
MediaEval 2016 - EUMSSI Team at the MediaEval Person Discovery Challenge
multimediaeval
 
MediaEval 2016 - RECOD at Placing Task
MediaEval 2016 - RECOD at Placing TaskMediaEval 2016 - RECOD at Placing Task
MediaEval 2016 - RECOD at Placing Task
multimediaeval
 
MediaEval 2016 - Retrieving Diverse Social Images Task Overview
MediaEval 2016 - Retrieving Diverse Social Images Task OverviewMediaEval 2016 - Retrieving Diverse Social Images Task Overview
MediaEval 2016 - Retrieving Diverse Social Images Task Overview
multimediaeval
 
MediaEval 2016 - the C@merata task: Natural Language Queries Derived from Exa...
MediaEval 2016 - the C@merata task: Natural Language Queries Derived from Exa...MediaEval 2016 - the C@merata task: Natural Language Queries Derived from Exa...
MediaEval 2016 - the C@merata task: Natural Language Queries Derived from Exa...
multimediaeval
 

Viewers also liked (16)

MediaEval 2016 - LAPI @ 2016 Retrieving Diverse Social Images Task: A Pseudo-...
MediaEval 2016 - LAPI @ 2016 Retrieving Diverse Social Images Task: A Pseudo-...MediaEval 2016 - LAPI @ 2016 Retrieving Diverse Social Images Task: A Pseudo-...
MediaEval 2016 - LAPI @ 2016 Retrieving Diverse Social Images Task: A Pseudo-...
 
MediaEval 2016 - BUT Zero-Cost Speech Recognition
MediaEval 2016 - BUT Zero-Cost Speech RecognitionMediaEval 2016 - BUT Zero-Cost Speech Recognition
MediaEval 2016 - BUT Zero-Cost Speech Recognition
 
MediaEval 2016 - Simula Team @ Context of Experience Task
MediaEval 2016 - Simula Team @ Context of Experience TaskMediaEval 2016 - Simula Team @ Context of Experience Task
MediaEval 2016 - Simula Team @ Context of Experience Task
 
MediaEval 2016 - ININ Submission to Zero Cost ASR Task
MediaEval 2016 - ININ Submission to Zero Cost ASR TaskMediaEval 2016 - ININ Submission to Zero Cost ASR Task
MediaEval 2016 - ININ Submission to Zero Cost ASR Task
 
MediaEval 2016 - IR Evaluation: Putting the User Back in the Loop
MediaEval 2016 - IR Evaluation: Putting the User Back in the LoopMediaEval 2016 - IR Evaluation: Putting the User Back in the Loop
MediaEval 2016 - IR Evaluation: Putting the User Back in the Loop
 
MediaEval 2016 - UPMC at MediaEval2016 Retrieving Diverse Social Images Task
MediaEval 2016 - UPMC at MediaEval2016 Retrieving Diverse Social Images TaskMediaEval 2016 - UPMC at MediaEval2016 Retrieving Diverse Social Images Task
MediaEval 2016 - UPMC at MediaEval2016 Retrieving Diverse Social Images Task
 
MediaEval 2016 - UNIFESP Predicting Media Interestingness Task
MediaEval 2016 - UNIFESP Predicting Media Interestingness TaskMediaEval 2016 - UNIFESP Predicting Media Interestingness Task
MediaEval 2016 - UNIFESP Predicting Media Interestingness Task
 
MediaEval 2016 - Emotion in Music Task: Lessons Learned
MediaEval 2016 - Emotion in Music Task: Lessons LearnedMediaEval 2016 - Emotion in Music Task: Lessons Learned
MediaEval 2016 - Emotion in Music Task: Lessons Learned
 
MediaEval 2016 - TUD-MMC Predicting media Interestingness Task
MediaEval 2016 - TUD-MMC Predicting media Interestingness TaskMediaEval 2016 - TUD-MMC Predicting media Interestingness Task
MediaEval 2016 - TUD-MMC Predicting media Interestingness Task
 
MediaEval 2016 - Tag Propagation in Talking Face Graphs
MediaEval 2016 - Tag Propagation in Talking Face GraphsMediaEval 2016 - Tag Propagation in Talking Face Graphs
MediaEval 2016 - Tag Propagation in Talking Face Graphs
 
MediaEval 2016 - COSMIR and the OpenMIC Challenge: A Plan for Sustainable Mus...
MediaEval 2016 - COSMIR and the OpenMIC Challenge: A Plan for Sustainable Mus...MediaEval 2016 - COSMIR and the OpenMIC Challenge: A Plan for Sustainable Mus...
MediaEval 2016 - COSMIR and the OpenMIC Challenge: A Plan for Sustainable Mus...
 
MediaEval 2016 - UVigo System for Multimodal Person Discovery in Broadcast TV...
MediaEval 2016 - UVigo System for Multimodal Person Discovery in Broadcast TV...MediaEval 2016 - UVigo System for Multimodal Person Discovery in Broadcast TV...
MediaEval 2016 - UVigo System for Multimodal Person Discovery in Broadcast TV...
 
MediaEval 2016 - EUMSSI Team at the MediaEval Person Discovery Challenge
MediaEval 2016 - EUMSSI Team at the MediaEval Person Discovery ChallengeMediaEval 2016 - EUMSSI Team at the MediaEval Person Discovery Challenge
MediaEval 2016 - EUMSSI Team at the MediaEval Person Discovery Challenge
 
MediaEval 2016 - RECOD at Placing Task
MediaEval 2016 - RECOD at Placing TaskMediaEval 2016 - RECOD at Placing Task
MediaEval 2016 - RECOD at Placing Task
 
MediaEval 2016 - Retrieving Diverse Social Images Task Overview
MediaEval 2016 - Retrieving Diverse Social Images Task OverviewMediaEval 2016 - Retrieving Diverse Social Images Task Overview
MediaEval 2016 - Retrieving Diverse Social Images Task Overview
 
MediaEval 2016 - the C@merata task: Natural Language Queries Derived from Exa...
MediaEval 2016 - the C@merata task: Natural Language Queries Derived from Exa...MediaEval 2016 - the C@merata task: Natural Language Queries Derived from Exa...
MediaEval 2016 - the C@merata task: Natural Language Queries Derived from Exa...
 

Similar to MediaEval 2016 - ETH-CVL: Textual-Visual Embeddings and Video2GIF for Video Interestingness

Using educational technology to convey complex IL topics: animating OSCOLA re...
Using educational technology to convey complex IL topics: animating OSCOLA re...Using educational technology to convey complex IL topics: animating OSCOLA re...
Using educational technology to convey complex IL topics: animating OSCOLA re...
IL Group (CILIP Information Literacy Group)
 
A Ensemble Learning-based No Reference QoE Model for User Generated Contents
A Ensemble Learning-based No Reference QoE Model for User Generated ContentsA Ensemble Learning-based No Reference QoE Model for User Generated Contents
A Ensemble Learning-based No Reference QoE Model for User Generated Contents
Duc Nguyen
 
Icme2020 tutorial video_summarization_part1
Icme2020 tutorial video_summarization_part1Icme2020 tutorial video_summarization_part1
Icme2020 tutorial video_summarization_part1
VasileiosMezaris
 
Interactive Video and Adult Education
Interactive Video and Adult EducationInteractive Video and Adult Education
Interactive Video and Adult Education
Carolyn Guertin
 
YouTube Trending Video Dashboard
YouTube Trending Video DashboardYouTube Trending Video Dashboard
YouTube Trending Video Dashboard
IRJET Journal
 
Jing
JingJing
Jing
MRMR77
 
Jing
JingJing
Jing
MRMR77
 
EDUC5101G Session 5 Presentation (March 8, 2016)
EDUC5101G Session 5 Presentation (March 8, 2016)EDUC5101G Session 5 Presentation (March 8, 2016)
EDUC5101G Session 5 Presentation (March 8, 2016)
Robert Power
 
Eye Tracking for Predicting ADHD
Eye Tracking for Predicting ADHDEye Tracking for Predicting ADHD
Eye Tracking for Predicting ADHD
Gavindya Jayawardena
 
軒銘Icalt.pptx
軒銘Icalt.pptx軒銘Icalt.pptx
軒銘Icalt.pptx
軒銘 張
 
OSGi Alliance and its Technology - Where Are We Now, and What is Your Vision ...
OSGi Alliance and its Technology - Where Are We Now, and What is Your Vision ...OSGi Alliance and its Technology - Where Are We Now, and What is Your Vision ...
OSGi Alliance and its Technology - Where Are We Now, and What is Your Vision ...
mfrancis
 
SUMMARY GENERATION FOR LECTURING VIDEOS
SUMMARY GENERATION FOR LECTURING VIDEOSSUMMARY GENERATION FOR LECTURING VIDEOS
SUMMARY GENERATION FOR LECTURING VIDEOS
IRJET Journal
 
The LEGO Strategy: Guidelines for a Profitable Deployment
The LEGO Strategy: Guidelines for a Profitable DeploymentThe LEGO Strategy: Guidelines for a Profitable Deployment
The LEGO Strategy: Guidelines for a Profitable Deployment
Luigi Buglione
 
IGoogle presentation
IGoogle presentationIGoogle presentation
IGoogle presentation
Suhe Ndri
 
IGoogle presentation
IGoogle presentationIGoogle presentation
IGoogle presentation
Suhe0004
 
IGoogle presentation
IGoogle presentationIGoogle presentation
IGoogle presentation
Suhe0004
 
OSGeo INSPIRE Ping-Pong Match
OSGeo INSPIRE Ping-Pong MatchOSGeo INSPIRE Ping-Pong Match
OSGeo INSPIRE Ping-Pong Match
Arnulf Christl
 
Jenkins and visual regression – Exove
Jenkins and visual regression – ExoveJenkins and visual regression – Exove
Jenkins and visual regression – Exove
Exove
 
How to start WebGL easily?
How to start WebGL easily?How to start WebGL easily?
How to start WebGL easily?
誠人 堀口
 
Modeling and Performance Analysis of Scrumban with Test-Driven Development us...
Modeling and Performance Analysis of Scrumban with Test-Driven Development us...Modeling and Performance Analysis of Scrumban with Test-Driven Development us...
Modeling and Performance Analysis of Scrumban with Test-Driven Development us...
Fernando Sambinelli, MSc
 

Similar to MediaEval 2016 - ETH-CVL: Textual-Visual Embeddings and Video2GIF for Video Interestingness (20)

Using educational technology to convey complex IL topics: animating OSCOLA re...
Using educational technology to convey complex IL topics: animating OSCOLA re...Using educational technology to convey complex IL topics: animating OSCOLA re...
Using educational technology to convey complex IL topics: animating OSCOLA re...
 
A Ensemble Learning-based No Reference QoE Model for User Generated Contents
A Ensemble Learning-based No Reference QoE Model for User Generated ContentsA Ensemble Learning-based No Reference QoE Model for User Generated Contents
A Ensemble Learning-based No Reference QoE Model for User Generated Contents
 
Icme2020 tutorial video_summarization_part1
Icme2020 tutorial video_summarization_part1Icme2020 tutorial video_summarization_part1
Icme2020 tutorial video_summarization_part1
 
Interactive Video and Adult Education
Interactive Video and Adult EducationInteractive Video and Adult Education
Interactive Video and Adult Education
 
YouTube Trending Video Dashboard
YouTube Trending Video DashboardYouTube Trending Video Dashboard
YouTube Trending Video Dashboard
 
Jing
JingJing
Jing
 
Jing
JingJing
Jing
 
EDUC5101G Session 5 Presentation (March 8, 2016)
EDUC5101G Session 5 Presentation (March 8, 2016)EDUC5101G Session 5 Presentation (March 8, 2016)
EDUC5101G Session 5 Presentation (March 8, 2016)
 
Eye Tracking for Predicting ADHD
Eye Tracking for Predicting ADHDEye Tracking for Predicting ADHD
Eye Tracking for Predicting ADHD
 
軒銘Icalt.pptx
軒銘Icalt.pptx軒銘Icalt.pptx
軒銘Icalt.pptx
 
OSGi Alliance and its Technology - Where Are We Now, and What is Your Vision ...
OSGi Alliance and its Technology - Where Are We Now, and What is Your Vision ...OSGi Alliance and its Technology - Where Are We Now, and What is Your Vision ...
OSGi Alliance and its Technology - Where Are We Now, and What is Your Vision ...
 
SUMMARY GENERATION FOR LECTURING VIDEOS
SUMMARY GENERATION FOR LECTURING VIDEOSSUMMARY GENERATION FOR LECTURING VIDEOS
SUMMARY GENERATION FOR LECTURING VIDEOS
 
The LEGO Strategy: Guidelines for a Profitable Deployment
The LEGO Strategy: Guidelines for a Profitable DeploymentThe LEGO Strategy: Guidelines for a Profitable Deployment
The LEGO Strategy: Guidelines for a Profitable Deployment
 
IGoogle presentation
IGoogle presentationIGoogle presentation
IGoogle presentation
 
IGoogle presentation
IGoogle presentationIGoogle presentation
IGoogle presentation
 
IGoogle presentation
IGoogle presentationIGoogle presentation
IGoogle presentation
 
OSGeo INSPIRE Ping-Pong Match
OSGeo INSPIRE Ping-Pong MatchOSGeo INSPIRE Ping-Pong Match
OSGeo INSPIRE Ping-Pong Match
 
Jenkins and visual regression – Exove
Jenkins and visual regression – ExoveJenkins and visual regression – Exove
Jenkins and visual regression – Exove
 
How to start WebGL easily?
How to start WebGL easily?How to start WebGL easily?
How to start WebGL easily?
 
Modeling and Performance Analysis of Scrumban with Test-Driven Development us...
Modeling and Performance Analysis of Scrumban with Test-Driven Development us...Modeling and Performance Analysis of Scrumban with Test-Driven Development us...
Modeling and Performance Analysis of Scrumban with Test-Driven Development us...
 

More from multimediaeval

Classification of Strokes in Table Tennis with a Three Stream Spatio-Temporal...
Classification of Strokes in Table Tennis with a Three Stream Spatio-Temporal...Classification of Strokes in Table Tennis with a Three Stream Spatio-Temporal...
Classification of Strokes in Table Tennis with a Three Stream Spatio-Temporal...
multimediaeval
 
HCMUS at MediaEval 2020: Ensembles of Temporal Deep Neural Networks for Table...
HCMUS at MediaEval 2020: Ensembles of Temporal Deep Neural Networks for Table...HCMUS at MediaEval 2020: Ensembles of Temporal Deep Neural Networks for Table...
HCMUS at MediaEval 2020: Ensembles of Temporal Deep Neural Networks for Table...
multimediaeval
 
Sports Video Classification: Classification of Strokes in Table Tennis for Me...
Sports Video Classification: Classification of Strokes in Table Tennis for Me...Sports Video Classification: Classification of Strokes in Table Tennis for Me...
Sports Video Classification: Classification of Strokes in Table Tennis for Me...
multimediaeval
 
Predicting Media Memorability from a Multimodal Late Fusion of Self-Attention...
Predicting Media Memorability from a Multimodal Late Fusion of Self-Attention...Predicting Media Memorability from a Multimodal Late Fusion of Self-Attention...
Predicting Media Memorability from a Multimodal Late Fusion of Self-Attention...
multimediaeval
 
Essex-NLIP at MediaEval Predicting Media Memorability 2020 Task
Essex-NLIP at MediaEval Predicting Media Memorability 2020 TaskEssex-NLIP at MediaEval Predicting Media Memorability 2020 Task
Essex-NLIP at MediaEval Predicting Media Memorability 2020 Task
multimediaeval
 
Overview of MediaEval 2020 Predicting Media Memorability task: What Makes a V...
Overview of MediaEval 2020 Predicting Media Memorability task: What Makes a V...Overview of MediaEval 2020 Predicting Media Memorability task: What Makes a V...
Overview of MediaEval 2020 Predicting Media Memorability task: What Makes a V...
multimediaeval
 
Fooling an Automatic Image Quality Estimator
Fooling an Automatic Image Quality EstimatorFooling an Automatic Image Quality Estimator
Fooling an Automatic Image Quality Estimator
multimediaeval
 
Fooling Blind Image Quality Assessment by Optimizing a Human-Understandable C...
Fooling Blind Image Quality Assessment by Optimizing a Human-Understandable C...Fooling Blind Image Quality Assessment by Optimizing a Human-Understandable C...
Fooling Blind Image Quality Assessment by Optimizing a Human-Understandable C...
multimediaeval
 
Pixel Privacy: Quality Camouflage for Social Images
Pixel Privacy: Quality Camouflage for Social ImagesPixel Privacy: Quality Camouflage for Social Images
Pixel Privacy: Quality Camouflage for Social Images
multimediaeval
 
HCMUS at MediaEval 2020:Image-Text Fusion for Automatic News-Images Re-Matching
HCMUS at MediaEval 2020:Image-Text Fusion for Automatic News-Images Re-MatchingHCMUS at MediaEval 2020:Image-Text Fusion for Automatic News-Images Re-Matching
HCMUS at MediaEval 2020:Image-Text Fusion for Automatic News-Images Re-Matching
multimediaeval
 
Efficient Supervision Net: Polyp Segmentation using EfficientNet and Attentio...
Efficient Supervision Net: Polyp Segmentation using EfficientNet and Attentio...Efficient Supervision Net: Polyp Segmentation using EfficientNet and Attentio...
Efficient Supervision Net: Polyp Segmentation using EfficientNet and Attentio...
multimediaeval
 
HCMUS at Medico Automatic Polyp Segmentation Task 2020: PraNet and ResUnet++ ...
HCMUS at Medico Automatic Polyp Segmentation Task 2020: PraNet and ResUnet++ ...HCMUS at Medico Automatic Polyp Segmentation Task 2020: PraNet and ResUnet++ ...
HCMUS at Medico Automatic Polyp Segmentation Task 2020: PraNet and ResUnet++ ...
multimediaeval
 
Depth-wise Separable Atrous Convolution for Polyps Segmentation in Gastro-Int...
Depth-wise Separable Atrous Convolution for Polyps Segmentation in Gastro-Int...Depth-wise Separable Atrous Convolution for Polyps Segmentation in Gastro-Int...
Depth-wise Separable Atrous Convolution for Polyps Segmentation in Gastro-Int...
multimediaeval
 
Deep Conditional Adversarial learning for polyp Segmentation
Deep Conditional Adversarial learning for polyp SegmentationDeep Conditional Adversarial learning for polyp Segmentation
Deep Conditional Adversarial learning for polyp Segmentation
multimediaeval
 
A Temporal-Spatial Attention Model for Medical Image Detection
A Temporal-Spatial Attention Model for Medical Image DetectionA Temporal-Spatial Attention Model for Medical Image Detection
A Temporal-Spatial Attention Model for Medical Image Detection
multimediaeval
 
HCMUS-Juniors 2020 at Medico Task in MediaEval 2020: Refined Deep Neural Netw...
HCMUS-Juniors 2020 at Medico Task in MediaEval 2020: Refined Deep Neural Netw...HCMUS-Juniors 2020 at Medico Task in MediaEval 2020: Refined Deep Neural Netw...
HCMUS-Juniors 2020 at Medico Task in MediaEval 2020: Refined Deep Neural Netw...
multimediaeval
 
Fine-tuning for Polyp Segmentation with Attention
Fine-tuning for Polyp Segmentation with AttentionFine-tuning for Polyp Segmentation with Attention
Fine-tuning for Polyp Segmentation with Attention
multimediaeval
 
Bigger Networks are not Always Better: Deep Convolutional Neural Networks for...
Bigger Networks are not Always Better: Deep Convolutional Neural Networks for...Bigger Networks are not Always Better: Deep Convolutional Neural Networks for...
Bigger Networks are not Always Better: Deep Convolutional Neural Networks for...
multimediaeval
 
Insights for wellbeing: Predicting Personal Air Quality Index using Regressio...
Insights for wellbeing: Predicting Personal Air Quality Index using Regressio...Insights for wellbeing: Predicting Personal Air Quality Index using Regressio...
Insights for wellbeing: Predicting Personal Air Quality Index using Regressio...
multimediaeval
 
Use Visual Features From Surrounding Scenes to Improve Personal Air Quality ...
 Use Visual Features From Surrounding Scenes to Improve Personal Air Quality ... Use Visual Features From Surrounding Scenes to Improve Personal Air Quality ...
Use Visual Features From Surrounding Scenes to Improve Personal Air Quality ...
multimediaeval
 

More from multimediaeval (20)

Classification of Strokes in Table Tennis with a Three Stream Spatio-Temporal...
Classification of Strokes in Table Tennis with a Three Stream Spatio-Temporal...Classification of Strokes in Table Tennis with a Three Stream Spatio-Temporal...
Classification of Strokes in Table Tennis with a Three Stream Spatio-Temporal...
 
HCMUS at MediaEval 2020: Ensembles of Temporal Deep Neural Networks for Table...
HCMUS at MediaEval 2020: Ensembles of Temporal Deep Neural Networks for Table...HCMUS at MediaEval 2020: Ensembles of Temporal Deep Neural Networks for Table...
HCMUS at MediaEval 2020: Ensembles of Temporal Deep Neural Networks for Table...
 
Sports Video Classification: Classification of Strokes in Table Tennis for Me...
Sports Video Classification: Classification of Strokes in Table Tennis for Me...Sports Video Classification: Classification of Strokes in Table Tennis for Me...
Sports Video Classification: Classification of Strokes in Table Tennis for Me...
 
Predicting Media Memorability from a Multimodal Late Fusion of Self-Attention...
Predicting Media Memorability from a Multimodal Late Fusion of Self-Attention...Predicting Media Memorability from a Multimodal Late Fusion of Self-Attention...
Predicting Media Memorability from a Multimodal Late Fusion of Self-Attention...
 
Essex-NLIP at MediaEval Predicting Media Memorability 2020 Task
Essex-NLIP at MediaEval Predicting Media Memorability 2020 TaskEssex-NLIP at MediaEval Predicting Media Memorability 2020 Task
Essex-NLIP at MediaEval Predicting Media Memorability 2020 Task
 
Overview of MediaEval 2020 Predicting Media Memorability task: What Makes a V...
Overview of MediaEval 2020 Predicting Media Memorability task: What Makes a V...Overview of MediaEval 2020 Predicting Media Memorability task: What Makes a V...
Overview of MediaEval 2020 Predicting Media Memorability task: What Makes a V...
 
Fooling an Automatic Image Quality Estimator
Fooling an Automatic Image Quality EstimatorFooling an Automatic Image Quality Estimator
Fooling an Automatic Image Quality Estimator
 
Fooling Blind Image Quality Assessment by Optimizing a Human-Understandable C...
Fooling Blind Image Quality Assessment by Optimizing a Human-Understandable C...Fooling Blind Image Quality Assessment by Optimizing a Human-Understandable C...
Fooling Blind Image Quality Assessment by Optimizing a Human-Understandable C...
 
Pixel Privacy: Quality Camouflage for Social Images
Pixel Privacy: Quality Camouflage for Social ImagesPixel Privacy: Quality Camouflage for Social Images
Pixel Privacy: Quality Camouflage for Social Images
 
HCMUS at MediaEval 2020:Image-Text Fusion for Automatic News-Images Re-Matching
HCMUS at MediaEval 2020:Image-Text Fusion for Automatic News-Images Re-MatchingHCMUS at MediaEval 2020:Image-Text Fusion for Automatic News-Images Re-Matching
HCMUS at MediaEval 2020:Image-Text Fusion for Automatic News-Images Re-Matching
 
Efficient Supervision Net: Polyp Segmentation using EfficientNet and Attentio...
Efficient Supervision Net: Polyp Segmentation using EfficientNet and Attentio...Efficient Supervision Net: Polyp Segmentation using EfficientNet and Attentio...
Efficient Supervision Net: Polyp Segmentation using EfficientNet and Attentio...
 
HCMUS at Medico Automatic Polyp Segmentation Task 2020: PraNet and ResUnet++ ...
HCMUS at Medico Automatic Polyp Segmentation Task 2020: PraNet and ResUnet++ ...HCMUS at Medico Automatic Polyp Segmentation Task 2020: PraNet and ResUnet++ ...
HCMUS at Medico Automatic Polyp Segmentation Task 2020: PraNet and ResUnet++ ...
 
Depth-wise Separable Atrous Convolution for Polyps Segmentation in Gastro-Int...
Depth-wise Separable Atrous Convolution for Polyps Segmentation in Gastro-Int...Depth-wise Separable Atrous Convolution for Polyps Segmentation in Gastro-Int...
Depth-wise Separable Atrous Convolution for Polyps Segmentation in Gastro-Int...
 
Deep Conditional Adversarial learning for polyp Segmentation
Deep Conditional Adversarial learning for polyp SegmentationDeep Conditional Adversarial learning for polyp Segmentation
Deep Conditional Adversarial learning for polyp Segmentation
 
A Temporal-Spatial Attention Model for Medical Image Detection
A Temporal-Spatial Attention Model for Medical Image DetectionA Temporal-Spatial Attention Model for Medical Image Detection
A Temporal-Spatial Attention Model for Medical Image Detection
 
HCMUS-Juniors 2020 at Medico Task in MediaEval 2020: Refined Deep Neural Netw...
HCMUS-Juniors 2020 at Medico Task in MediaEval 2020: Refined Deep Neural Netw...HCMUS-Juniors 2020 at Medico Task in MediaEval 2020: Refined Deep Neural Netw...
HCMUS-Juniors 2020 at Medico Task in MediaEval 2020: Refined Deep Neural Netw...
 
Fine-tuning for Polyp Segmentation with Attention
Fine-tuning for Polyp Segmentation with AttentionFine-tuning for Polyp Segmentation with Attention
Fine-tuning for Polyp Segmentation with Attention
 
Bigger Networks are not Always Better: Deep Convolutional Neural Networks for...
Bigger Networks are not Always Better: Deep Convolutional Neural Networks for...Bigger Networks are not Always Better: Deep Convolutional Neural Networks for...
Bigger Networks are not Always Better: Deep Convolutional Neural Networks for...
 
Insights for wellbeing: Predicting Personal Air Quality Index using Regressio...
Insights for wellbeing: Predicting Personal Air Quality Index using Regressio...Insights for wellbeing: Predicting Personal Air Quality Index using Regressio...
Insights for wellbeing: Predicting Personal Air Quality Index using Regressio...
 
Use Visual Features From Surrounding Scenes to Improve Personal Air Quality ...
 Use Visual Features From Surrounding Scenes to Improve Personal Air Quality ... Use Visual Features From Surrounding Scenes to Improve Personal Air Quality ...
Use Visual Features From Surrounding Scenes to Improve Personal Air Quality ...
 

Recently uploaded

BREEDING METHODS FOR DISEASE RESISTANCE.pptx
BREEDING METHODS FOR DISEASE RESISTANCE.pptxBREEDING METHODS FOR DISEASE RESISTANCE.pptx
BREEDING METHODS FOR DISEASE RESISTANCE.pptx
RASHMI M G
 
Randomised Optimisation Algorithms in DAPHNE
Randomised Optimisation Algorithms in DAPHNERandomised Optimisation Algorithms in DAPHNE
Randomised Optimisation Algorithms in DAPHNE
University of Maribor
 
Phenomics assisted breeding in crop improvement
Phenomics assisted breeding in crop improvementPhenomics assisted breeding in crop improvement
Phenomics assisted breeding in crop improvement
IshaGoswami9
 
Chapter 12 - climate change and the energy crisis
Chapter 12 - climate change and the energy crisisChapter 12 - climate change and the energy crisis
Chapter 12 - climate change and the energy crisis
tonzsalvador2222
 
Oedema_types_causes_pathophysiology.pptx
Oedema_types_causes_pathophysiology.pptxOedema_types_causes_pathophysiology.pptx
Oedema_types_causes_pathophysiology.pptx
muralinath2
 
THEMATIC APPERCEPTION TEST(TAT) cognitive abilities, creativity, and critic...
THEMATIC  APPERCEPTION  TEST(TAT) cognitive abilities, creativity, and critic...THEMATIC  APPERCEPTION  TEST(TAT) cognitive abilities, creativity, and critic...
THEMATIC APPERCEPTION TEST(TAT) cognitive abilities, creativity, and critic...
Abdul Wali Khan University Mardan,kP,Pakistan
 
Deep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless ReproducibilityDeep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless Reproducibility
University of Rennes, INSA Rennes, Inria/IRISA, CNRS
 
mô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốt
mô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốtmô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốt
mô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốt
HongcNguyn6
 
8.Isolation of pure cultures and preservation of cultures.pdf
8.Isolation of pure cultures and preservation of cultures.pdf8.Isolation of pure cultures and preservation of cultures.pdf
8.Isolation of pure cultures and preservation of cultures.pdf
by6843629
 
Medical Orthopedic PowerPoint Templates.pptx
Medical Orthopedic PowerPoint Templates.pptxMedical Orthopedic PowerPoint Templates.pptx
Medical Orthopedic PowerPoint Templates.pptx
terusbelajar5
 
Nucleophilic Addition of carbonyl compounds.pptx
Nucleophilic Addition of carbonyl  compounds.pptxNucleophilic Addition of carbonyl  compounds.pptx
Nucleophilic Addition of carbonyl compounds.pptx
SSR02
 
ESR spectroscopy in liquid food and beverages.pptx
ESR spectroscopy in liquid food and beverages.pptxESR spectroscopy in liquid food and beverages.pptx
ESR spectroscopy in liquid food and beverages.pptx
PRIYANKA PATEL
 
Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.
Nistarini College, Purulia (W.B) India
 
Eukaryotic Transcription Presentation.pptx
Eukaryotic Transcription Presentation.pptxEukaryotic Transcription Presentation.pptx
Eukaryotic Transcription Presentation.pptx
RitabrataSarkar3
 
Unlocking the mysteries of reproduction: Exploring fecundity and gonadosomati...
Unlocking the mysteries of reproduction: Exploring fecundity and gonadosomati...Unlocking the mysteries of reproduction: Exploring fecundity and gonadosomati...
Unlocking the mysteries of reproduction: Exploring fecundity and gonadosomati...
AbdullaAlAsif1
 
ANAMOLOUS SECONDARY GROWTH IN DICOT ROOTS.pptx
ANAMOLOUS SECONDARY GROWTH IN DICOT ROOTS.pptxANAMOLOUS SECONDARY GROWTH IN DICOT ROOTS.pptx
ANAMOLOUS SECONDARY GROWTH IN DICOT ROOTS.pptx
RASHMI M G
 
Applied Science: Thermodynamics, Laws & Methodology.pdf
Applied Science: Thermodynamics, Laws & Methodology.pdfApplied Science: Thermodynamics, Laws & Methodology.pdf
Applied Science: Thermodynamics, Laws & Methodology.pdf
University of Hertfordshire
 
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
University of Maribor
 
The use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptx
The use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptxThe use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptx
The use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptx
MAGOTI ERNEST
 
molar-distalization in orthodontics-seminar.pptx
molar-distalization in orthodontics-seminar.pptxmolar-distalization in orthodontics-seminar.pptx
molar-distalization in orthodontics-seminar.pptx
Anagha Prasad
 

Recently uploaded (20)

BREEDING METHODS FOR DISEASE RESISTANCE.pptx
BREEDING METHODS FOR DISEASE RESISTANCE.pptxBREEDING METHODS FOR DISEASE RESISTANCE.pptx
BREEDING METHODS FOR DISEASE RESISTANCE.pptx
 
Randomised Optimisation Algorithms in DAPHNE
Randomised Optimisation Algorithms in DAPHNERandomised Optimisation Algorithms in DAPHNE
Randomised Optimisation Algorithms in DAPHNE
 
Phenomics assisted breeding in crop improvement
Phenomics assisted breeding in crop improvementPhenomics assisted breeding in crop improvement
Phenomics assisted breeding in crop improvement
 
Chapter 12 - climate change and the energy crisis
Chapter 12 - climate change and the energy crisisChapter 12 - climate change and the energy crisis
Chapter 12 - climate change and the energy crisis
 
Oedema_types_causes_pathophysiology.pptx
Oedema_types_causes_pathophysiology.pptxOedema_types_causes_pathophysiology.pptx
Oedema_types_causes_pathophysiology.pptx
 
THEMATIC APPERCEPTION TEST(TAT) cognitive abilities, creativity, and critic...
THEMATIC  APPERCEPTION  TEST(TAT) cognitive abilities, creativity, and critic...THEMATIC  APPERCEPTION  TEST(TAT) cognitive abilities, creativity, and critic...
THEMATIC APPERCEPTION TEST(TAT) cognitive abilities, creativity, and critic...
 
Deep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless ReproducibilityDeep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless Reproducibility
 
mô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốt
mô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốtmô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốt
mô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốt
 
8.Isolation of pure cultures and preservation of cultures.pdf
8.Isolation of pure cultures and preservation of cultures.pdf8.Isolation of pure cultures and preservation of cultures.pdf
8.Isolation of pure cultures and preservation of cultures.pdf
 
Medical Orthopedic PowerPoint Templates.pptx
Medical Orthopedic PowerPoint Templates.pptxMedical Orthopedic PowerPoint Templates.pptx
Medical Orthopedic PowerPoint Templates.pptx
 
Nucleophilic Addition of carbonyl compounds.pptx
Nucleophilic Addition of carbonyl  compounds.pptxNucleophilic Addition of carbonyl  compounds.pptx
Nucleophilic Addition of carbonyl compounds.pptx
 
ESR spectroscopy in liquid food and beverages.pptx
ESR spectroscopy in liquid food and beverages.pptxESR spectroscopy in liquid food and beverages.pptx
ESR spectroscopy in liquid food and beverages.pptx
 
Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.
 
Eukaryotic Transcription Presentation.pptx
Eukaryotic Transcription Presentation.pptxEukaryotic Transcription Presentation.pptx
Eukaryotic Transcription Presentation.pptx
 
Unlocking the mysteries of reproduction: Exploring fecundity and gonadosomati...
Unlocking the mysteries of reproduction: Exploring fecundity and gonadosomati...Unlocking the mysteries of reproduction: Exploring fecundity and gonadosomati...
Unlocking the mysteries of reproduction: Exploring fecundity and gonadosomati...
 
ANAMOLOUS SECONDARY GROWTH IN DICOT ROOTS.pptx
ANAMOLOUS SECONDARY GROWTH IN DICOT ROOTS.pptxANAMOLOUS SECONDARY GROWTH IN DICOT ROOTS.pptx
ANAMOLOUS SECONDARY GROWTH IN DICOT ROOTS.pptx
 
Applied Science: Thermodynamics, Laws & Methodology.pdf
Applied Science: Thermodynamics, Laws & Methodology.pdfApplied Science: Thermodynamics, Laws & Methodology.pdf
Applied Science: Thermodynamics, Laws & Methodology.pdf
 
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
 
The use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptx
The use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptxThe use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptx
The use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptx
 
molar-distalization in orthodontics-seminar.pptx
molar-distalization in orthodontics-seminar.pptxmolar-distalization in orthodontics-seminar.pptx
molar-distalization in orthodontics-seminar.pptx
 

MediaEval 2016 - ETH-CVL: Textual-Visual Embeddings and Video2GIF for Video Interestingness

  • 1. ETH-CVL @ MediaEval 2016: Textual-Visual Embeddings and Video2GIF for Video Interestingness Michael Gygli, PhD student @ Computer Vision Lab, ETH Zurich Work with Arun Balajee Vasudevan, Anna Volokitin and Luc Van Gool 1
  • 2. Content ▪ Overview ▪ Textual-visual embedding ▪ Video2GIF ▪ Results 2Michael Gygli, PhD student @ ETH Zurich06/21/2016
  • 3. Finding the most interesting and relevant content ▪ Two dominant approaches ▪ Use of textual information (title, category) to obtain a video specific model, often using web image priors E.g. [Khosla et al. CVPR 13], [Liu et al. IJCAI 09], Song et al. CVPR 15] ▪ Supervised learning with generic features and large training set E.g. [Potapov et al. ECCV 2014], [Sun et al. ECCV 2014], [Gygli et al. CVPR 16] ▪ Some methods use both E.g. [Liu et al. CVPR 15] 3
  • 5. Multi-task deep visual-semantic embedding for video thumbnail selection [Liu et al. CVPR 15] ▪ Use Bing image search data (query, image, # of clicks) to learn a joint embedding space for images and text ▪ Compute frame relevance as cosine similarity between the query or title embedding and the frame embedding [Liu et al. CVPR 2015] 5Michael Gygli, PhD student @ ETH Zurich06/21/2016
  • 6. Our contribution with improvements over Liu et al. ● Siamese network ● Text Embedding Model ○ Words encoded through word2vec ○ LSTM to obtain fixed length embedding ● Convolutional Neural Network model ○ Fine-tuned VGG-19 ● Training Data based on learning a ranking of: (query+, image+, image- (“cat”, , ) 6
  • 8. Video2GIF: Automatic Generation of Animated GIFs from Video [Gygli et al. CVPR 16] Approach ▪ Work with segments as units ▪ Obtained through change-point detection [Song et al. CVPR 15] ▪ Train a deep neural network for ranking segments ... 8 Example video Highest scoring Lowest scoring Michael Gygli, PhD student @ ETH Zurich06/21/2016
  • 9. Video2GIF: Automatic Generation of Animated GIFs from Video Approach ▪ Train a deep neural network for ranking segments ▪ Built on C3D network [Tran et al. ICCV 2015] ▪ Objective: score positives higher than negatives h: scoring function s+ : positive segment s- : negative segment 9Michael Gygli, PhD student @ ETH Zurich06/21/2016
  • 10. Video2GIF: Dataset 10Michael Gygli, PhD student @ ETH Zurich06/21/2016 Available on github.com/gyglim/video2gif_dataset ▪ Large-scale training data: GIFs created from YouTube videos ▪ Align GIF back to video ▪ This part defines a positive example ▪ Assume non-selected parts are less interesting than selected part
  • 11. Results Task Run mAP Image 1 0.1866 2 0.1952 3 0.1858 Video 1 0.1362 2 0.1574 Frame-based: • Run 1: Visual-semantic embedding trained on Clickture dataset • Run 2: As Run 1, with fine-tuning on development set • Run 3: As Run 1, but trained on a larger subset of Clickture Segment-based: • Run 1: Video2GIF • Run 2: Averaged score of Visual-semantic embedding and Video2GIF
  • 12. Qualitative Results - Run 2 Title: Captives Title: After Earth Predicted best frame True best frame Predicted best frame True best frame
  • 13. More information ▪ Paper Video2GIF: Automatic Generation of Animated GIFs from Video. M. Gygli, Y. Song, L. Cao, CVPR 2016 ▪ Slides on Video Summarization as Subset Selection @ Tutorial on Optimization Algorithms for Subset Selection and Summarization in Large Data Sets, CVPR 2016 https://t.co/mQIpxMab3v ▪ Demo website for Video2GIF: http://video2gif.info/autogif ▪ Paper Multi-task deep visual-semantic embedding for video thumbnail selection. W. Liu, T. Mei, Y. Zhang, C. Che, and J. Luo, CVPR 2015​