SlideShare a Scribd company logo
Reverse Video Search on
Large-scale Media Collections
MeVer
Giorgos Kordopatis-Zilos
Reverse Video Search
Query video Retrieved videos
Video database
Video-level methods
Limited retrieval
performance
Very fast
retrieval time
Frame-level methods
Slow retrieval
time
High retrieval
performance
Aim & solution
Our Aim
• Development of a system for reverse video search
• Very high retrieval performance
• Fast retrieval speed
Our Solution
• Two main components
• Video indexing & filtering
• Video-level method
• Efficient video indexing and filtering
• Video similarity calculation
• Frame-level method
• Video similarity learning
Video indexing & filtering (1/2)
Layer Bag-of-Word (LBoW)
• Extract a number of L visual words from each video frame
• Index videos based on the extracted words
Kordopatis-Zilos et al. “Near-Duplicate Video Retrieval by Aggregating Intermediate CNN Layers”. MMM, 2017.
Video indexing & filtering (2/2)
Similarity calculation
• Video-level representations with tf-idf weighting
• Cosine similarity
Video filtering
• Rank videos based on their similarity
• Select the top N videos (set to N = 5,000)
• Select videos with similarity greater than t
Kordopatis-Zilos et al. “Near-Duplicate Video Retrieval by Aggregating Intermediate CNN Layers”. MMM, 2017.
Video similarity calculation (1/2)
Video Similarity Learning (ViSiL)
• Learn a video similarity function that considers:
• Spatial structure of video frames (intra-frame relations)
• Temporal structure of videos (inter-frame relations)
Kordopatis-Zilos et al. “ViSiL: Fine-grained Spatio-Temporal Video Similarity Learning”. ICCV, 2019.
Video similarity calculation (2/2)
Video Similarity Learning network
• 4-layer CNN
• Captures the temporal structures
in the similarity matrix
Kordopatis-Zilos et al. “ViSiL: Fine-grained Spatio-Temporal Video Similarity Learning”. ICCV, 2019.
Experimental setup
FIVR-200K dataset
• 225,960 videos from 4,687 news events
• 100 query videos
• Three retrieval tasks
• Simulate different scenarios
Evaluation metrics
• mean Average Precision (mAP)
Kordopatis-Zilos et al. “FIVR: Fine-grained Incident Video Retrieval”. IEEE TMM, 2019.
Video examples
Query Video
Complementary
Scene Video
Duplicate
Scene Video
Incident
Scene Video
Kordopatis-Zilos et al. “FIVR: Fine-grained Incident Video Retrieval”. IEEE TMM, 2019.
Experiments
Experiments
Experiments
Experiments
Experiments
Kordopatis-Zilos et al. “ViSiL: Fine-grained Spatio-Temporal Video Similarity Learning”. ICCV, 2019.
Additional experimental results
• Evaluation on four video retrieval problems
• Achieves state-of-the-art performance
Video verification
query video database video
frame-to-frame
similarity matrix
ViSiL output video
similarity
0.8
0.5
near-duplicate
videos
same event
videos
Copyright management (1/2)
original video reaction video
ViSiL output video similarity
0.82
Copyright management (2/2)
original video reaction video
ViSiL output video similarity
0.84
Online demo
Tips
• Video-level methods offer a fast video retrieval solution but with limited
performance
• Frame-level methods achieve high retrieval performance, but with very high
computation cost
• Combination of the two method types with a carefully selected similarity
threshold according to the application scenario
Thank you!
Get in touch:
Giorgos Kordopatis-Zilos: georgekordopatis@iti.gr / @g_kordo
Team info:
https://mever.iti.gr/
https://twitter.com/meverteam
Code & models:
https://github.com/MKLab-ITI/FIVR-200K
https://github.com/MKLab-ITI/visil
With the support of:
MeVer

More Related Content

Similar to Reverse Video Search Large-scale Media Collections | Presentation@Q3-AIDI

ViSiL: Fine-grained Spatio-Temporal Video Similarity Learning
ViSiL: Fine-grained Spatio-Temporal Video Similarity LearningViSiL: Fine-grained Spatio-Temporal Video Similarity Learning
ViSiL: Fine-grained Spatio-Temporal Video Similarity Learning
gkordo
 
Near-Duplicate Video Retrieval by Aggregating Intermediate CNN Layers
Near-Duplicate Video Retrieval by Aggregating Intermediate CNN LayersNear-Duplicate Video Retrieval by Aggregating Intermediate CNN Layers
Near-Duplicate Video Retrieval by Aggregating Intermediate CNN Layers
Symeon Papadopoulos
 
Preliminary Exploration of the Use of Geographical Information for Content-ba...
Preliminary Exploration of the Use of Geographical Information for Content-ba...Preliminary Exploration of the Use of Geographical Information for Content-ba...
Preliminary Exploration of the Use of Geographical Information for Content-ba...
MediaEval2012
 
LIvRE: A Video Extension to the LIRE Content-Based Image Retrieval System
LIvRE: A Video Extension to the LIRE Content-Based Image Retrieval SystemLIvRE: A Video Extension to the LIRE Content-Based Image Retrieval System
LIvRE: A Video Extension to the LIRE Content-Based Image Retrieval System
Universitat Politècnica de Catalunya
 
Content based video retrieval using discrete cosine transform
Content based video retrieval using discrete cosine transformContent based video retrieval using discrete cosine transform
Content based video retrieval using discrete cosine transform
nooriasukmaningtyas
 
Research Proposal Presentation Pitch
Research Proposal Presentation PitchResearch Proposal Presentation Pitch
Research Proposal Presentation Pitch
tchoonyong
 
Query clip genre recognition using tree pruning technique for video retrieval
Query clip genre recognition using tree pruning technique for video retrievalQuery clip genre recognition using tree pruning technique for video retrieval
Query clip genre recognition using tree pruning technique for video retrieval
IAEME Publication
 
Query clip genre recognition using tree pruning technique for video retrieval
Query clip genre recognition using tree pruning technique for video retrievalQuery clip genre recognition using tree pruning technique for video retrieval
Query clip genre recognition using tree pruning technique for video retrieval
IAEME Publication
 
Paul Wang SOED 2016
Paul Wang SOED 2016Paul Wang SOED 2016
Paul Wang SOED 2016
Colleen Ganley
 
Fast object re-detection and localization in video for spatio-temporal fragme...
Fast object re-detection and localization in video for spatio-temporal fragme...Fast object re-detection and localization in video for spatio-temporal fragme...
Fast object re-detection and localization in video for spatio-temporal fragme...
LinkedTV
 
A framework for visual search in broadcasting companies' multimedia archives
A framework for visual search in broadcasting companies' multimedia archives A framework for visual search in broadcasting companies' multimedia archives
A framework for visual search in broadcasting companies' multimedia archives
FIAT/IFTA
 
Delay Analysis of Layered Video Caching in Crowdsourced Heterogeneous Wireles...
Delay Analysis of Layered Video Caching in Crowdsourced Heterogeneous Wireles...Delay Analysis of Layered Video Caching in Crowdsourced Heterogeneous Wireles...
Delay Analysis of Layered Video Caching in Crowdsourced Heterogeneous Wireles...
Behrouz Jedari
 
ADS Team 8 Final Presentation
ADS Team 8 Final PresentationADS Team 8 Final Presentation
ADS Team 8 Final Presentation
Pranay Mankad
 
Recent advances in content based video copy detection (IEEE)
Recent advances in content based video copy detection (IEEE)Recent advances in content based video copy detection (IEEE)
Recent advances in content based video copy detection (IEEE)
PACE 2.0
 
Parking Surveillance Footage Summarization
Parking Surveillance Footage SummarizationParking Surveillance Footage Summarization
Parking Surveillance Footage Summarization
IRJET Journal
 
Fast object re detection and localization in video for spatio-temporal fragme...
Fast object re detection and localization in video for spatio-temporal fragme...Fast object re detection and localization in video for spatio-temporal fragme...
Fast object re detection and localization in video for spatio-temporal fragme...
MediaMixerCommunity
 
Video processing.pptx
Video processing.pptxVideo processing.pptx
Video processing.pptx
MohamedRiyaz115278
 
Key Frame Extraction in Video Stream using Two Stage Method with Colour and S...
Key Frame Extraction in Video Stream using Two Stage Method with Colour and S...Key Frame Extraction in Video Stream using Two Stage Method with Colour and S...
Key Frame Extraction in Video Stream using Two Stage Method with Colour and S...
ijtsrd
 
Content Based Video Retrieval Using Integrated Feature Extraction and Persona...
Content Based Video Retrieval Using Integrated Feature Extraction and Persona...Content Based Video Retrieval Using Integrated Feature Extraction and Persona...
Content Based Video Retrieval Using Integrated Feature Extraction and Persona...
IJERD Editor
 
Human Activity Recognition (HAR) using HMM based Intermediate matching kernel...
Human Activity Recognition (HAR) using HMM based Intermediate matching kernel...Human Activity Recognition (HAR) using HMM based Intermediate matching kernel...
Human Activity Recognition (HAR) using HMM based Intermediate matching kernel...
Rupali Bhatnagar
 

Similar to Reverse Video Search Large-scale Media Collections | Presentation@Q3-AIDI (20)

ViSiL: Fine-grained Spatio-Temporal Video Similarity Learning
ViSiL: Fine-grained Spatio-Temporal Video Similarity LearningViSiL: Fine-grained Spatio-Temporal Video Similarity Learning
ViSiL: Fine-grained Spatio-Temporal Video Similarity Learning
 
Near-Duplicate Video Retrieval by Aggregating Intermediate CNN Layers
Near-Duplicate Video Retrieval by Aggregating Intermediate CNN LayersNear-Duplicate Video Retrieval by Aggregating Intermediate CNN Layers
Near-Duplicate Video Retrieval by Aggregating Intermediate CNN Layers
 
Preliminary Exploration of the Use of Geographical Information for Content-ba...
Preliminary Exploration of the Use of Geographical Information for Content-ba...Preliminary Exploration of the Use of Geographical Information for Content-ba...
Preliminary Exploration of the Use of Geographical Information for Content-ba...
 
LIvRE: A Video Extension to the LIRE Content-Based Image Retrieval System
LIvRE: A Video Extension to the LIRE Content-Based Image Retrieval SystemLIvRE: A Video Extension to the LIRE Content-Based Image Retrieval System
LIvRE: A Video Extension to the LIRE Content-Based Image Retrieval System
 
Content based video retrieval using discrete cosine transform
Content based video retrieval using discrete cosine transformContent based video retrieval using discrete cosine transform
Content based video retrieval using discrete cosine transform
 
Research Proposal Presentation Pitch
Research Proposal Presentation PitchResearch Proposal Presentation Pitch
Research Proposal Presentation Pitch
 
Query clip genre recognition using tree pruning technique for video retrieval
Query clip genre recognition using tree pruning technique for video retrievalQuery clip genre recognition using tree pruning technique for video retrieval
Query clip genre recognition using tree pruning technique for video retrieval
 
Query clip genre recognition using tree pruning technique for video retrieval
Query clip genre recognition using tree pruning technique for video retrievalQuery clip genre recognition using tree pruning technique for video retrieval
Query clip genre recognition using tree pruning technique for video retrieval
 
Paul Wang SOED 2016
Paul Wang SOED 2016Paul Wang SOED 2016
Paul Wang SOED 2016
 
Fast object re-detection and localization in video for spatio-temporal fragme...
Fast object re-detection and localization in video for spatio-temporal fragme...Fast object re-detection and localization in video for spatio-temporal fragme...
Fast object re-detection and localization in video for spatio-temporal fragme...
 
A framework for visual search in broadcasting companies' multimedia archives
A framework for visual search in broadcasting companies' multimedia archives A framework for visual search in broadcasting companies' multimedia archives
A framework for visual search in broadcasting companies' multimedia archives
 
Delay Analysis of Layered Video Caching in Crowdsourced Heterogeneous Wireles...
Delay Analysis of Layered Video Caching in Crowdsourced Heterogeneous Wireles...Delay Analysis of Layered Video Caching in Crowdsourced Heterogeneous Wireles...
Delay Analysis of Layered Video Caching in Crowdsourced Heterogeneous Wireles...
 
ADS Team 8 Final Presentation
ADS Team 8 Final PresentationADS Team 8 Final Presentation
ADS Team 8 Final Presentation
 
Recent advances in content based video copy detection (IEEE)
Recent advances in content based video copy detection (IEEE)Recent advances in content based video copy detection (IEEE)
Recent advances in content based video copy detection (IEEE)
 
Parking Surveillance Footage Summarization
Parking Surveillance Footage SummarizationParking Surveillance Footage Summarization
Parking Surveillance Footage Summarization
 
Fast object re detection and localization in video for spatio-temporal fragme...
Fast object re detection and localization in video for spatio-temporal fragme...Fast object re detection and localization in video for spatio-temporal fragme...
Fast object re detection and localization in video for spatio-temporal fragme...
 
Video processing.pptx
Video processing.pptxVideo processing.pptx
Video processing.pptx
 
Key Frame Extraction in Video Stream using Two Stage Method with Colour and S...
Key Frame Extraction in Video Stream using Two Stage Method with Colour and S...Key Frame Extraction in Video Stream using Two Stage Method with Colour and S...
Key Frame Extraction in Video Stream using Two Stage Method with Colour and S...
 
Content Based Video Retrieval Using Integrated Feature Extraction and Persona...
Content Based Video Retrieval Using Integrated Feature Extraction and Persona...Content Based Video Retrieval Using Integrated Feature Extraction and Persona...
Content Based Video Retrieval Using Integrated Feature Extraction and Persona...
 
Human Activity Recognition (HAR) using HMM based Intermediate matching kernel...
Human Activity Recognition (HAR) using HMM based Intermediate matching kernel...Human Activity Recognition (HAR) using HMM based Intermediate matching kernel...
Human Activity Recognition (HAR) using HMM based Intermediate matching kernel...
 

Recently uploaded

Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Speck&Tech
 
Best 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERPBest 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERP
Pixlogix Infotech
 
Programming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup SlidesProgramming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup Slides
Zilliz
 
UI5 Controls simplified - UI5con2024 presentation
UI5 Controls simplified - UI5con2024 presentationUI5 Controls simplified - UI5con2024 presentation
UI5 Controls simplified - UI5con2024 presentation
Wouter Lemaire
 
Building Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and MilvusBuilding Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and Milvus
Zilliz
 
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy SurveyTrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc
 
UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6
DianaGray10
 
How to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For FlutterHow to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For Flutter
Daiki Mogmet Ito
 
Mariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceXMariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceX
Mariano Tinti
 
Artificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopmentArtificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopment
Octavian Nadolu
 
Digital Marketing Trends in 2024 | Guide for Staying Ahead
Digital Marketing Trends in 2024 | Guide for Staying AheadDigital Marketing Trends in 2024 | Guide for Staying Ahead
Digital Marketing Trends in 2024 | Guide for Staying Ahead
Wask
 
WeTestAthens: Postman's AI & Automation Techniques
WeTestAthens: Postman's AI & Automation TechniquesWeTestAthens: Postman's AI & Automation Techniques
WeTestAthens: Postman's AI & Automation Techniques
Postman
 
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAUHCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
panagenda
 
Energy Efficient Video Encoding for Cloud and Edge Computing Instances
Energy Efficient Video Encoding for Cloud and Edge Computing InstancesEnergy Efficient Video Encoding for Cloud and Edge Computing Instances
Energy Efficient Video Encoding for Cloud and Edge Computing Instances
Alpen-Adria-Universität
 
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
Edge AI and Vision Alliance
 
20240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 202420240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 2024
Matthew Sinclair
 
GenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizationsGenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizations
kumardaparthi1024
 
Recommendation System using RAG Architecture
Recommendation System using RAG ArchitectureRecommendation System using RAG Architecture
Recommendation System using RAG Architecture
fredae14
 
Skybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoptionSkybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoption
Tatiana Kojar
 
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development ProvidersYour One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
akankshawande
 

Recently uploaded (20)

Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
 
Best 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERPBest 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERP
 
Programming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup SlidesProgramming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup Slides
 
UI5 Controls simplified - UI5con2024 presentation
UI5 Controls simplified - UI5con2024 presentationUI5 Controls simplified - UI5con2024 presentation
UI5 Controls simplified - UI5con2024 presentation
 
Building Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and MilvusBuilding Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and Milvus
 
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy SurveyTrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy Survey
 
UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6
 
How to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For FlutterHow to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For Flutter
 
Mariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceXMariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceX
 
Artificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopmentArtificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopment
 
Digital Marketing Trends in 2024 | Guide for Staying Ahead
Digital Marketing Trends in 2024 | Guide for Staying AheadDigital Marketing Trends in 2024 | Guide for Staying Ahead
Digital Marketing Trends in 2024 | Guide for Staying Ahead
 
WeTestAthens: Postman's AI & Automation Techniques
WeTestAthens: Postman's AI & Automation TechniquesWeTestAthens: Postman's AI & Automation Techniques
WeTestAthens: Postman's AI & Automation Techniques
 
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAUHCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
 
Energy Efficient Video Encoding for Cloud and Edge Computing Instances
Energy Efficient Video Encoding for Cloud and Edge Computing InstancesEnergy Efficient Video Encoding for Cloud and Edge Computing Instances
Energy Efficient Video Encoding for Cloud and Edge Computing Instances
 
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
 
20240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 202420240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 2024
 
GenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizationsGenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizations
 
Recommendation System using RAG Architecture
Recommendation System using RAG ArchitectureRecommendation System using RAG Architecture
Recommendation System using RAG Architecture
 
Skybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoptionSkybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoption
 
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development ProvidersYour One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
 

Reverse Video Search Large-scale Media Collections | Presentation@Q3-AIDI

  • 1. Reverse Video Search on Large-scale Media Collections MeVer Giorgos Kordopatis-Zilos
  • 2. Reverse Video Search Query video Retrieved videos Video database
  • 5. Aim & solution Our Aim • Development of a system for reverse video search • Very high retrieval performance • Fast retrieval speed Our Solution • Two main components • Video indexing & filtering • Video-level method • Efficient video indexing and filtering • Video similarity calculation • Frame-level method • Video similarity learning
  • 6. Video indexing & filtering (1/2) Layer Bag-of-Word (LBoW) • Extract a number of L visual words from each video frame • Index videos based on the extracted words Kordopatis-Zilos et al. “Near-Duplicate Video Retrieval by Aggregating Intermediate CNN Layers”. MMM, 2017.
  • 7. Video indexing & filtering (2/2) Similarity calculation • Video-level representations with tf-idf weighting • Cosine similarity Video filtering • Rank videos based on their similarity • Select the top N videos (set to N = 5,000) • Select videos with similarity greater than t Kordopatis-Zilos et al. “Near-Duplicate Video Retrieval by Aggregating Intermediate CNN Layers”. MMM, 2017.
  • 8. Video similarity calculation (1/2) Video Similarity Learning (ViSiL) • Learn a video similarity function that considers: • Spatial structure of video frames (intra-frame relations) • Temporal structure of videos (inter-frame relations) Kordopatis-Zilos et al. “ViSiL: Fine-grained Spatio-Temporal Video Similarity Learning”. ICCV, 2019.
  • 9. Video similarity calculation (2/2) Video Similarity Learning network • 4-layer CNN • Captures the temporal structures in the similarity matrix Kordopatis-Zilos et al. “ViSiL: Fine-grained Spatio-Temporal Video Similarity Learning”. ICCV, 2019.
  • 10. Experimental setup FIVR-200K dataset • 225,960 videos from 4,687 news events • 100 query videos • Three retrieval tasks • Simulate different scenarios Evaluation metrics • mean Average Precision (mAP) Kordopatis-Zilos et al. “FIVR: Fine-grained Incident Video Retrieval”. IEEE TMM, 2019.
  • 11. Video examples Query Video Complementary Scene Video Duplicate Scene Video Incident Scene Video Kordopatis-Zilos et al. “FIVR: Fine-grained Incident Video Retrieval”. IEEE TMM, 2019.
  • 16. Experiments Kordopatis-Zilos et al. “ViSiL: Fine-grained Spatio-Temporal Video Similarity Learning”. ICCV, 2019. Additional experimental results • Evaluation on four video retrieval problems • Achieves state-of-the-art performance
  • 17. Video verification query video database video frame-to-frame similarity matrix ViSiL output video similarity 0.8 0.5 near-duplicate videos same event videos
  • 18. Copyright management (1/2) original video reaction video ViSiL output video similarity 0.82
  • 19. Copyright management (2/2) original video reaction video ViSiL output video similarity 0.84
  • 21. Tips • Video-level methods offer a fast video retrieval solution but with limited performance • Frame-level methods achieve high retrieval performance, but with very high computation cost • Combination of the two method types with a carefully selected similarity threshold according to the application scenario
  • 22. Thank you! Get in touch: Giorgos Kordopatis-Zilos: georgekordopatis@iti.gr / @g_kordo Team info: https://mever.iti.gr/ https://twitter.com/meverteam Code & models: https://github.com/MKLab-ITI/FIVR-200K https://github.com/MKLab-ITI/visil With the support of: MeVer