SlideShare a Scribd company logo
1 of 1
Exploiting Collective Knowledge in an Image Folksonomy
                                                                                                            for Semantic-based Near-duplicate Video Detection
                                                                                                                                                                                                                                                                Hyun-seok Min, Wesley De Neve, and Yong Man Ro
                                                                                                                                                                                                                                                                             Image and Video Systems Lab
                                                                                                                                                                                                                                                                Korea Advanced Institute of Science and Technology (KAIST)
                                                                                                                                                                                                                                                                                  Daejeon, South Korea
                                                                                                                                                        e-mail: hsmin@kaist.ac.kr                                                                                                                                                                           website: http://ivylab.kaist.ac.kr
I. INTRODUCTION                                                                                                                                                                                                                                                                                                                   IV. DETECTION OF NEAR-DUPLICATES
- Increasing number of duplicates and near-duplicates on websites for
                                                                                                                                                                                                                                                                                                                                  Video matching aims at determining whether a given query video
  video sharing
                                                                                                                                                                                                                                                                                                                                  sequence Vq appears in a target or reference video sequence Vt
   - need for efficient and effective near-duplicate detection techniques
- Conventional video signatures are based on low-level visual features                                                                                                                                                                                                                                                            - The semantic dissimilarity between two video sequences Vq and Vt:
   - highly sensitive to spatiotemporal transformations                                                                                                                                                                                                                                                                                                                   N
- This paper proposes a novel technique for semantic-based near-                                                                                                                                                                                                                                                                                                     1
  duplicate video detection
                                                                                                                                                                                                                                                                                                                                           d video ( U q , Ut ) =
                                                                                                                                                                                                                                                                                                                                                                     N   ∑d
                                                                                                                                                                                                                                                                                                                                                                          i =1
                                                                                                                                                                                                                                                                                                                                                                                          q     t
                                                                                                                                                                                                                                                                                                                                                                                 shot ( A i , A i + p ),

   - based on the observation that near-duplicates still convey the same
     semantic information                                                                                                                                                                                                                                                                                                                              U q , U t : the semantic video signatures of Vq and Vt
   - takes advantage of the wide variety of user-supplied tags present in                                                                                                                                                                                                                                                                                 p      : the video shot in the reference video sequence
     a set of user-contributed images (i.e., an image folksonomy)                                                                                                                                                                                                                                                                                                  at which similarity measurement starts
                                                                                                                                                                                                                                                                                                                                  - The semantic distance between two video shots:
II. SYSTEM ARCHITECTURE
                                                                                                                            Query video sequence
                                                                                                                                                                                                                                                                                                                                                                         A iq ∩ A tj
                                                                                                                                                                                                                                                                                                                                            d shot ( A iq , A tj )   =                   ,            A : the cardinality of A
                              Pre-processing                                                                                                                                                                                                                                                                                                                             A iq × A tj
                                                                                                                                     Shot segmentation
                                                                                                                                                                                                                                                                                                                                  V. EXPERIMENTS
                                                                                                           Low-level feature extraction
                                                                                                                                                                                                                                                                                                                                  1. Experimental setup
                              Creation of a semantic video signature                                                                                                                                                                                                                                                               - Our experiments made use of the MUSCLE-VCD-2007 dataset
                                                                                                                                                                                                                                                                                                                                   - To construct an image folksonomy, 3000 images with at least one or
                                                                                                     Detection of semantic concepts                                                                                                                                                     Image
                                                                                                                                                                                                                                                                                     folksonomy                                      more relevant tags were retrieved from Flickr
                                                                                                     Creation of semantic signature                                                                                                                                                                                               2. Experimental results
                                                                                                                                                                                                                                                                                                                                   - The proposed method misclassified only two out of 15 spatially
                             Video matching using semantic video signatures                                                                                                                                                                                                                                                          transformed query video sequences
                                                                                                                                                                                                                                                                                     Reference
                                                                                                                   Semantic video matching                                                                                                                                             video
                                                                                                                                                                                                                                                                                                                                   - For the 1,604 query video shots, the total number of detected semantic
                                                                                                                                                                                                                                                                                      database                                       concepts is 7,927
                                                                                                                Computation of similarity                                                                                                                                                                                             - five semantic concepts were predicted on average for a video shot
                                                                                                                                                                                                                                                                                                                                      - among the 7,927 detected semantic concepts, 272 different concepts
                             Near-duplicate detection                                                                                                                                                                                                                                                                                   could be identified
                                    Decide whether the query video is a near-
                                                  duplicate or not                                                                                                                                                                                                                                                                 3. Visual results
                      Fig. 1. Semantic-based near-duplicate detection using an image folksonomy                                                                                                                                                                                                                                                             Reference video sequence                 Query video sequence

III. MODEL-FREE SEMANTIC CONCEPT DETECTION
The image cannot be display ed. Your computer may not hav e enough memory to open the image, or the image may hav e been corrupted. Restart y our computer, and then open the file again. If the red x still appears, y ou may hav e to delete the image and then insert it again.




                                                                                                                                                                                                                     Folksonomy images (strongly tagged images)                                                                               Key
                                                                                                                                                                                                  I1                                                                           I2        …             IF
                                                                                                                                                                                                                                                                                                                                             frame



                                                                                                                                                                                                                                                              Visual similarity measurement
                                                                        si                                                                                          Nearest neighbor images
                                                                                                                                                                                                                                                                                                                                            Nearest
                     ith shot of a query video                                                                                                                                                                                                               I1                      …            IK                                        neighbor
                              sequence
                                                                                                                                                                                                                                                                                                                                             images
                               If                                    : folksonomy image
                                                                                                                                                                            Folksonomy-based semantic concept detection                                                                                                                                                          …
                                                                                                                                                                                                                                                                                                                                                                                 …
                                                                                                                                                                                                                                                                                                                                                                                 …
                                                                                                                                                                                                                                                                                                                                                                                 …                                   …
                                                                                                                                                                                                                                                                                                                                                                                                                     …
                                                                                                                                                                                                                                                                                                                                                                                                                     …
                                                                                                                                                                                                                                                                                                                                                                                                                     …
                                                                 : tag                                                                                                                     Set of tags                                                                                                      The frequency of
                                                                                                                                                                                                                                                                                                            tag t in the set of             Detected
                                                               : tag frequency & the                                                                                                                                                                                                 …                      visual neighbors                               interior, home, inside, night,      home, house, interior, inside, style,
                                                                                                                                                                                                                                                                                                                                            semantic
                                                               number of images                                                                                                                                                                                                                                reflects the                                            sunset                              cottage
                                                                                                                                                                                            Semantic concepts                                                                                                                               concepts
                                                               labeled with t in the                                                                                                                                                                                                                        relevance of tag t
                                                               image folksonomy                                                                                                                                                                                                                              with respect to        Fig. 3. Example key frames with visual neighbors and detected semantic concepts
                                                                                                                                                                                                                               …
                                                                                                                                                                                                                               …
                                                                                                                                                                                                                               …
                                                                                                                                                                                                                               …                                                                            the content of si .                (underlined semantic concepts are considered to be correct)
                                                                                         Fig. 2. Folksonomy-based semantic concept detection                                                                                                                                                                                      VI. CONCLUSIONS
- Metric for measuring the relevance of a tag t:                                                                                                                                                                                                                                                                                   - This paper discussed a novel technique for semantic-based near-
                                                                                                                                                                                                                                                                                                                                     duplicate video detection
                                                                    c Lt                                                                                           c : neighbor images tag t in the set of K nearest
                                                                                                                                                                        the frequency of                                                                                                                                              - near-duplicates still convey the same semantic information
           J (t ) =                                                  − ,                                                                                                                                                                                                                                                              - takes advantage of the wide variety of user-supplied tags present in
                                                                    K F                                                                                            Lt : the number of images labeled with tag t in the                                                                                                                  an image folksonomy (i.e., collective knowledge)
                                                                                                                                                                                                  image folksonomy (containing F images)
                                                                                                                                                                                                                                                                                                                                   - Semantic video signatures are constructed by detecting semantic
- The semantic signature U of V, with V = {S1, S2, …, SN}:                                                                                                                                                                                                                                                                           concepts along the temporal axis of video sequences
                                                                                                                                                                                                                                                                                                                                      - our model-free approach is able to exploit an unrestricted tag
         U = {A1, A2,K, AN }. Ai : the set of semantic concepts for Sj                                                                                                                                                                                                                                                                  vocabulary (unlike model-based semantic concept detection)
                                                                                                                                                                                                                                                                                                                                   - Preliminary experimental results look encouraging

                                                                                                                                                                                 IEEE International Conference on Image Processing (ICIP), September 2010, Hong Kong

More Related Content

More from Wesley De Neve

Towards diagnosis of rotator cuff tears in 3-D MRI using 3-D convolutional ne...
Towards diagnosis of rotator cuff tears in 3-D MRI using 3-D convolutional ne...Towards diagnosis of rotator cuff tears in 3-D MRI using 3-D convolutional ne...
Towards diagnosis of rotator cuff tears in 3-D MRI using 3-D convolutional ne...Wesley De Neve
 
Investigating the biological relevance in trained embedding representations o...
Investigating the biological relevance in trained embedding representations o...Investigating the biological relevance in trained embedding representations o...
Investigating the biological relevance in trained embedding representations o...Wesley De Neve
 
Impact of adversarial examples on deep learning models for biomedical image s...
Impact of adversarial examples on deep learning models for biomedical image s...Impact of adversarial examples on deep learning models for biomedical image s...
Impact of adversarial examples on deep learning models for biomedical image s...Wesley De Neve
 
Learning Biologically Relevant Features Using Convolutional Neural Networks f...
Learning Biologically Relevant Features Using Convolutional Neural Networks f...Learning Biologically Relevant Features Using Convolutional Neural Networks f...
Learning Biologically Relevant Features Using Convolutional Neural Networks f...Wesley De Neve
 
The 5th Aslla Symposium
The 5th Aslla SymposiumThe 5th Aslla Symposium
The 5th Aslla SymposiumWesley De Neve
 
Ghent University Global Campus 101
Ghent University Global Campus 101Ghent University Global Campus 101
Ghent University Global Campus 101Wesley De Neve
 
Booklet for the First GUGC Research Symposium
Booklet for the First GUGC Research SymposiumBooklet for the First GUGC Research Symposium
Booklet for the First GUGC Research SymposiumWesley De Neve
 
Center for Biotech Data Science at Ghent University Global Campus
Center for Biotech Data Science at Ghent University Global CampusCenter for Biotech Data Science at Ghent University Global Campus
Center for Biotech Data Science at Ghent University Global CampusWesley De Neve
 
Center for Biotech Data Science at Ghent University Global Campus
Center for Biotech Data Science at Ghent University Global CampusCenter for Biotech Data Science at Ghent University Global Campus
Center for Biotech Data Science at Ghent University Global CampusWesley De Neve
 
Learning biologically relevant features using convolutional neural networks f...
Learning biologically relevant features using convolutional neural networks f...Learning biologically relevant features using convolutional neural networks f...
Learning biologically relevant features using convolutional neural networks f...Wesley De Neve
 
Towards reading genomic data using deep learning-driven NLP techniques
Towards reading genomic data using deep learning-driven NLP techniquesTowards reading genomic data using deep learning-driven NLP techniques
Towards reading genomic data using deep learning-driven NLP techniquesWesley De Neve
 
Deep Machine Learning for Making Sense of Biotech Data - From Clean Energy to...
Deep Machine Learning for Making Sense of Biotech Data - From Clean Energy to...Deep Machine Learning for Making Sense of Biotech Data - From Clean Energy to...
Deep Machine Learning for Making Sense of Biotech Data - From Clean Energy to...Wesley De Neve
 
GUGC Info Session - Informatics and Bioinformatics
GUGC Info Session - Informatics and BioinformaticsGUGC Info Session - Informatics and Bioinformatics
GUGC Info Session - Informatics and BioinformaticsWesley De Neve
 
Ghent University Global Campus - Sungkyunkwan University: Workshop on Researc...
Ghent University Global Campus - Sungkyunkwan University: Workshop on Researc...Ghent University Global Campus - Sungkyunkwan University: Workshop on Researc...
Ghent University Global Campus - Sungkyunkwan University: Workshop on Researc...Wesley De Neve
 
Ghent University and GUGC-K: Overview of Teaching and Research Activities
Ghent University and GUGC-K: Overview of Teaching and Research ActivitiesGhent University and GUGC-K: Overview of Teaching and Research Activities
Ghent University and GUGC-K: Overview of Teaching and Research ActivitiesWesley De Neve
 
Biotech Data Science @ GUGC in Korea: Deep Learning for Prediction of Drug-Ta...
Biotech Data Science @ GUGC in Korea: Deep Learning for Prediction of Drug-Ta...Biotech Data Science @ GUGC in Korea: Deep Learning for Prediction of Drug-Ta...
Biotech Data Science @ GUGC in Korea: Deep Learning for Prediction of Drug-Ta...Wesley De Neve
 
Exploring Deep Machine Learning for Automatic Right Whale Recognition and No...
 Exploring Deep Machine Learning for Automatic Right Whale Recognition and No... Exploring Deep Machine Learning for Automatic Right Whale Recognition and No...
Exploring Deep Machine Learning for Automatic Right Whale Recognition and No...Wesley De Neve
 
Deep Machine Learning for Automating Biotech Tasks Through Self-Learning Expe...
Deep Machine Learning for Automating Biotech Tasks Through Self-Learning Expe...Deep Machine Learning for Automating Biotech Tasks Through Self-Learning Expe...
Deep Machine Learning for Automating Biotech Tasks Through Self-Learning Expe...Wesley De Neve
 
Towards using multimedia technology for biological data processing
Towards using multimedia technology for biological data processingTowards using multimedia technology for biological data processing
Towards using multimedia technology for biological data processingWesley De Neve
 
Multimedia Lab @ Ghent University - iMinds - Organizational Overview & Outlin...
Multimedia Lab @ Ghent University - iMinds - Organizational Overview & Outlin...Multimedia Lab @ Ghent University - iMinds - Organizational Overview & Outlin...
Multimedia Lab @ Ghent University - iMinds - Organizational Overview & Outlin...Wesley De Neve
 

More from Wesley De Neve (20)

Towards diagnosis of rotator cuff tears in 3-D MRI using 3-D convolutional ne...
Towards diagnosis of rotator cuff tears in 3-D MRI using 3-D convolutional ne...Towards diagnosis of rotator cuff tears in 3-D MRI using 3-D convolutional ne...
Towards diagnosis of rotator cuff tears in 3-D MRI using 3-D convolutional ne...
 
Investigating the biological relevance in trained embedding representations o...
Investigating the biological relevance in trained embedding representations o...Investigating the biological relevance in trained embedding representations o...
Investigating the biological relevance in trained embedding representations o...
 
Impact of adversarial examples on deep learning models for biomedical image s...
Impact of adversarial examples on deep learning models for biomedical image s...Impact of adversarial examples on deep learning models for biomedical image s...
Impact of adversarial examples on deep learning models for biomedical image s...
 
Learning Biologically Relevant Features Using Convolutional Neural Networks f...
Learning Biologically Relevant Features Using Convolutional Neural Networks f...Learning Biologically Relevant Features Using Convolutional Neural Networks f...
Learning Biologically Relevant Features Using Convolutional Neural Networks f...
 
The 5th Aslla Symposium
The 5th Aslla SymposiumThe 5th Aslla Symposium
The 5th Aslla Symposium
 
Ghent University Global Campus 101
Ghent University Global Campus 101Ghent University Global Campus 101
Ghent University Global Campus 101
 
Booklet for the First GUGC Research Symposium
Booklet for the First GUGC Research SymposiumBooklet for the First GUGC Research Symposium
Booklet for the First GUGC Research Symposium
 
Center for Biotech Data Science at Ghent University Global Campus
Center for Biotech Data Science at Ghent University Global CampusCenter for Biotech Data Science at Ghent University Global Campus
Center for Biotech Data Science at Ghent University Global Campus
 
Center for Biotech Data Science at Ghent University Global Campus
Center for Biotech Data Science at Ghent University Global CampusCenter for Biotech Data Science at Ghent University Global Campus
Center for Biotech Data Science at Ghent University Global Campus
 
Learning biologically relevant features using convolutional neural networks f...
Learning biologically relevant features using convolutional neural networks f...Learning biologically relevant features using convolutional neural networks f...
Learning biologically relevant features using convolutional neural networks f...
 
Towards reading genomic data using deep learning-driven NLP techniques
Towards reading genomic data using deep learning-driven NLP techniquesTowards reading genomic data using deep learning-driven NLP techniques
Towards reading genomic data using deep learning-driven NLP techniques
 
Deep Machine Learning for Making Sense of Biotech Data - From Clean Energy to...
Deep Machine Learning for Making Sense of Biotech Data - From Clean Energy to...Deep Machine Learning for Making Sense of Biotech Data - From Clean Energy to...
Deep Machine Learning for Making Sense of Biotech Data - From Clean Energy to...
 
GUGC Info Session - Informatics and Bioinformatics
GUGC Info Session - Informatics and BioinformaticsGUGC Info Session - Informatics and Bioinformatics
GUGC Info Session - Informatics and Bioinformatics
 
Ghent University Global Campus - Sungkyunkwan University: Workshop on Researc...
Ghent University Global Campus - Sungkyunkwan University: Workshop on Researc...Ghent University Global Campus - Sungkyunkwan University: Workshop on Researc...
Ghent University Global Campus - Sungkyunkwan University: Workshop on Researc...
 
Ghent University and GUGC-K: Overview of Teaching and Research Activities
Ghent University and GUGC-K: Overview of Teaching and Research ActivitiesGhent University and GUGC-K: Overview of Teaching and Research Activities
Ghent University and GUGC-K: Overview of Teaching and Research Activities
 
Biotech Data Science @ GUGC in Korea: Deep Learning for Prediction of Drug-Ta...
Biotech Data Science @ GUGC in Korea: Deep Learning for Prediction of Drug-Ta...Biotech Data Science @ GUGC in Korea: Deep Learning for Prediction of Drug-Ta...
Biotech Data Science @ GUGC in Korea: Deep Learning for Prediction of Drug-Ta...
 
Exploring Deep Machine Learning for Automatic Right Whale Recognition and No...
 Exploring Deep Machine Learning for Automatic Right Whale Recognition and No... Exploring Deep Machine Learning for Automatic Right Whale Recognition and No...
Exploring Deep Machine Learning for Automatic Right Whale Recognition and No...
 
Deep Machine Learning for Automating Biotech Tasks Through Self-Learning Expe...
Deep Machine Learning for Automating Biotech Tasks Through Self-Learning Expe...Deep Machine Learning for Automating Biotech Tasks Through Self-Learning Expe...
Deep Machine Learning for Automating Biotech Tasks Through Self-Learning Expe...
 
Towards using multimedia technology for biological data processing
Towards using multimedia technology for biological data processingTowards using multimedia technology for biological data processing
Towards using multimedia technology for biological data processing
 
Multimedia Lab @ Ghent University - iMinds - Organizational Overview & Outlin...
Multimedia Lab @ Ghent University - iMinds - Organizational Overview & Outlin...Multimedia Lab @ Ghent University - iMinds - Organizational Overview & Outlin...
Multimedia Lab @ Ghent University - iMinds - Organizational Overview & Outlin...
 

Recently uploaded

AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
Unlocking the Potential of the Cloud for IBM Power Systems
Unlocking the Potential of the Cloud for IBM Power SystemsUnlocking the Potential of the Cloud for IBM Power Systems
Unlocking the Potential of the Cloud for IBM Power SystemsPrecisely
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024BookNet Canada
 
Artificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraArtificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraDeakin University
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptxLBM Solutions
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr LapshynFwdays
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksSoftradix Technologies
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 

Recently uploaded (20)

AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
Unlocking the Potential of the Cloud for IBM Power Systems
Unlocking the Potential of the Cloud for IBM Power SystemsUnlocking the Potential of the Cloud for IBM Power Systems
Unlocking the Potential of the Cloud for IBM Power Systems
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
 
Artificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraArtificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning era
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptx
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptx
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
 
The transition to renewables in India.pdf
The transition to renewables in India.pdfThe transition to renewables in India.pdf
The transition to renewables in India.pdf
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other Frameworks
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptxVulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 

Exploiting collective knowledge in an image folksonomy for semantic-based near-duplicate video detection

  • 1. Exploiting Collective Knowledge in an Image Folksonomy for Semantic-based Near-duplicate Video Detection Hyun-seok Min, Wesley De Neve, and Yong Man Ro Image and Video Systems Lab Korea Advanced Institute of Science and Technology (KAIST) Daejeon, South Korea e-mail: hsmin@kaist.ac.kr website: http://ivylab.kaist.ac.kr I. INTRODUCTION IV. DETECTION OF NEAR-DUPLICATES - Increasing number of duplicates and near-duplicates on websites for Video matching aims at determining whether a given query video video sharing sequence Vq appears in a target or reference video sequence Vt - need for efficient and effective near-duplicate detection techniques - Conventional video signatures are based on low-level visual features - The semantic dissimilarity between two video sequences Vq and Vt: - highly sensitive to spatiotemporal transformations N - This paper proposes a novel technique for semantic-based near- 1 duplicate video detection d video ( U q , Ut ) = N ∑d i =1 q t shot ( A i , A i + p ), - based on the observation that near-duplicates still convey the same semantic information U q , U t : the semantic video signatures of Vq and Vt - takes advantage of the wide variety of user-supplied tags present in p : the video shot in the reference video sequence a set of user-contributed images (i.e., an image folksonomy) at which similarity measurement starts - The semantic distance between two video shots: II. SYSTEM ARCHITECTURE Query video sequence A iq ∩ A tj d shot ( A iq , A tj ) = , A : the cardinality of A Pre-processing A iq × A tj Shot segmentation V. EXPERIMENTS Low-level feature extraction 1. Experimental setup Creation of a semantic video signature - Our experiments made use of the MUSCLE-VCD-2007 dataset - To construct an image folksonomy, 3000 images with at least one or Detection of semantic concepts Image folksonomy more relevant tags were retrieved from Flickr Creation of semantic signature 2. Experimental results - The proposed method misclassified only two out of 15 spatially Video matching using semantic video signatures transformed query video sequences Reference Semantic video matching video - For the 1,604 query video shots, the total number of detected semantic database concepts is 7,927 Computation of similarity - five semantic concepts were predicted on average for a video shot - among the 7,927 detected semantic concepts, 272 different concepts Near-duplicate detection could be identified Decide whether the query video is a near- duplicate or not 3. Visual results Fig. 1. Semantic-based near-duplicate detection using an image folksonomy Reference video sequence Query video sequence III. MODEL-FREE SEMANTIC CONCEPT DETECTION The image cannot be display ed. Your computer may not hav e enough memory to open the image, or the image may hav e been corrupted. Restart y our computer, and then open the file again. If the red x still appears, y ou may hav e to delete the image and then insert it again. Folksonomy images (strongly tagged images) Key I1 I2 … IF frame Visual similarity measurement si Nearest neighbor images Nearest ith shot of a query video I1 … IK neighbor sequence images If : folksonomy image Folksonomy-based semantic concept detection … … … … … … … … : tag Set of tags The frequency of tag t in the set of Detected : tag frequency & the … visual neighbors interior, home, inside, night, home, house, interior, inside, style, semantic number of images reflects the sunset cottage Semantic concepts concepts labeled with t in the relevance of tag t image folksonomy with respect to Fig. 3. Example key frames with visual neighbors and detected semantic concepts … … … … the content of si . (underlined semantic concepts are considered to be correct) Fig. 2. Folksonomy-based semantic concept detection VI. CONCLUSIONS - Metric for measuring the relevance of a tag t: - This paper discussed a novel technique for semantic-based near- duplicate video detection c Lt c : neighbor images tag t in the set of K nearest the frequency of - near-duplicates still convey the same semantic information J (t ) = − , - takes advantage of the wide variety of user-supplied tags present in K F Lt : the number of images labeled with tag t in the an image folksonomy (i.e., collective knowledge) image folksonomy (containing F images) - Semantic video signatures are constructed by detecting semantic - The semantic signature U of V, with V = {S1, S2, …, SN}: concepts along the temporal axis of video sequences - our model-free approach is able to exploit an unrestricted tag U = {A1, A2,K, AN }. Ai : the set of semantic concepts for Sj vocabulary (unlike model-based semantic concept detection) - Preliminary experimental results look encouraging IEEE International Conference on Image Processing (ICIP), September 2010, Hong Kong