SlideShare a Scribd company logo
1 of 13
Download to read offline
Jorge Proença 1,2
Arlindo Veiga 1,2
Fernando Perdigão 1,2
The SPL-IT Query by Example Search on Speech
system for MediaEval 2014
The 2014 Query by Example Search on Speech (QUESST)
1 Instituto de Telecomunicações,
Coimbra, Portugal
2 Electrical and Computer Eng.
Department,
University of Coimbra, Portugal
2
SPL-IT system
MediaEval 2014
| October 16-17 2014, Barcelona, SPAIN
Overview of the system:
 Fuses Dynamic Time Warping (DTW) modifications
 Fuses results from systems with phonetic recognizers for 3
languages
3
Phonetic Recognizer
MediaEval 2014
| October 16-17 2014, Barcelona, SPAIN
 Hard to extract good posteriorgrams with an HMM system (our in-
house system).
 Used 3 systems/languages (for 8 kHz) based on long temporal context
and neural networks from Brnu University of Technology (BUT):
 Czech
 Hungarian
 Russian
 Output: posteriorgrams (3 states per phoneme).
 Leading and trailing silence/noise removed
PhonemeState
Frame
State Posteriorgram example for one query
4
Dynamic Time Warping
MediaEval 2014
| October 16-17 2014, Barcelona, SPAIN
 Local Distance matrix:
 Dot Product of Query and Audio posterior probability vectors;
 Back-off with l =10-4
   , logD q x q x  
Distance Matrix of Query vs Audio
5
Dynamic Time Warping
MediaEval 2014
| October 16-17 2014, Barcelona, SPAIN
 Basic DTW strategy (A1):
 Smallest distance in identically
weighted unitary jumps:
Distance Matrix (top) and accumulated Distance matrix (bottom) of Query vs Audio
6
DTW Modifications
MediaEval 2014
| October 16-17 2014, Barcelona, SPAIN
 4 additional approaches:
(A2) – Cutting up to 250ms at the end of the query,
keeping the segment above 500ms
(A3) – Cutting up to 250ms at the beginning of the query,
keeping the segment above 500ms
QueryQuery
Audio
Query vs. Audio posterior distance matrix (top) and the best path from A2 (bottom)
7
DTW Modifications
MediaEval 2014
| October 16-17 2014, Barcelona, SPAIN
(A4) – Allowing one jump in the path up to ½ Query’s length,
can’t occur at initial and final 250ms of the query
can’t occur for queries shorter than 800ms
QueryQuery
Audio
Query vs. Audio posterior distance matrix (top) and the best path from A4 (bottom)
8
DTW Modifications
MediaEval 2014
| October 16-17 2014, Barcelona, SPAIN
(A5) – Swaps: accounting for re-ordering of words.
Backtrack the best 5 candidates from (A1) from the end,
Find the best path for the beginning of the query, ahead of the
end of the first one, with restrictions similar to (A4).QueryQuery
Audio
Query vs. Audio posterior distance matrix (top) and the best path from A5 (bottom)
9
Fusing systems
MediaEval 2014
| October 16-17 2014, Barcelona, SPAIN
 Different approaches:
 Minimum of the approaches – not the best.
 Harmonic mean found to be a good compromise.
 Per-query normalization (standard score):
 Different languages:
 Arithmetic mean of the 3 scores.

X
10
Submissions and Results
MediaEval 2014
| October 16-17 2014, Barcelona, SPAIN
 Primary: fusing (A1) and (A2) (basic and cutting the end)
 Late: fusing the 5 approaches.
 Late provided worse overall results
primary late
Cnxe, MinCnxe - Dev 0.6797, 0.5438 0.7106, 0.5881
Cnxe, MinCnxe - Eval 0.6588, 0.5080 0.6708, 0.5240
ATWV, MTWV - Dev 0.4494, 0.4494 0.4051, 0.4052
ATWV, MTWV - Eval 0.4399, 0.4423 0.3918, 0.4218
11
Submissions and Results (cont.)
MediaEval 2014
| October 16-17 2014, Barcelona, SPAIN
 Primary: fusing (A1) and (A2) (basic and cutting the end)
 Late: fusing the 5 approaches.
Cnxe for isolated approaches on Eval:
 A1: 0.6823, A2: 0.6721, A3: 0.6947, A4: 0.6957 A5: 0.6999
 For Type 3 queries, late system was better:
 0.8049 Cnxe on primary to 0.7865 Cnxe on late
primary late
Cnxe, MinCnxe - Eval 0.6588, 0.5080 0.6708, 0.5240
12
Conclusions
MediaEval 2014
| October 16-17 2014, Barcelona, SPAIN
 Although this year’s task has an added difficulty, a simple DTW still works
well for most cases.
 Cutting queries at the end revealed to be the best strategy, and fusing it
with A1 was even better.
 Including the possibility of jumps and re-orders increased False Positives
overall, since these special cases are a small part of the database.
 We lacked an optimization method for Cnxe
 Which would greatly improve the results.
13
END – Thank You
MediaEval 2014
| October 16-17 2014, Barcelona, SPAIN
Processing Speed:
 Hardware – CRAY CX1 Cluster, running windows server 2008 HPC, using 16 of 56
cores (7 nodes with double Intel Xeon 5520 2.27GHz quad-core and 24GB RAM per
node).
 Indexing Speed Factor – 1.4
 Searching Speed Factor – 0.0029 per sec and per language
 Peak Memory – 0.098 GB

More Related Content

Viewers also liked

Emotion in Music Task at MediaEval 2014
Emotion in Music Task at MediaEval 2014Emotion in Music Task at MediaEval 2014
Emotion in Music Task at MediaEval 2014multimediaeval
 
The Search and Hyperlinking Task at MediaEval 2014
The Search and Hyperlinking Task at MediaEval 2014The Search and Hyperlinking Task at MediaEval 2014
The Search and Hyperlinking Task at MediaEval 2014multimediaeval
 
TALP-UPC at MediaEval 2014 Placing Task: Combining Geographical Knowledge Bas...
TALP-UPC at MediaEval 2014 Placing Task: Combining Geographical Knowledge Bas...TALP-UPC at MediaEval 2014 Placing Task: Combining Geographical Knowledge Bas...
TALP-UPC at MediaEval 2014 Placing Task: Combining Geographical Knowledge Bas...multimediaeval
 
MediaEval 2014: THU-HCSIL Approach to Emotion in Music Task using Multi-level...
MediaEval 2014: THU-HCSIL Approach to Emotion in Music Task using Multi-level...MediaEval 2014: THU-HCSIL Approach to Emotion in Music Task using Multi-level...
MediaEval 2014: THU-HCSIL Approach to Emotion in Music Task using Multi-level...multimediaeval
 
UPC at MediaEval 2014 Social Event Detection Task
UPC at MediaEval 2014 Social Event Detection TaskUPC at MediaEval 2014 Social Event Detection Task
UPC at MediaEval 2014 Social Event Detection Taskmultimediaeval
 
Synchronizing Multi-User Photo Galleries with MRF
Synchronizing Multi-User Photo Galleries with MRFSynchronizing Multi-User Photo Galleries with MRF
Synchronizing Multi-User Photo Galleries with MRFmultimediaeval
 
Stravinsqi/De Montfort University at the MediaEval 2014 C@merata Task
Stravinsqi/De Montfort University at the MediaEval 2014 C@merata TaskStravinsqi/De Montfort University at the MediaEval 2014 C@merata Task
Stravinsqi/De Montfort University at the MediaEval 2014 C@merata Taskmultimediaeval
 
4845 Programa de Embajadores Rotarios 2016 2017
4845 Programa de Embajadores Rotarios 2016 20174845 Programa de Embajadores Rotarios 2016 2017
4845 Programa de Embajadores Rotarios 2016 2017Miguel DE PAOLI
 
04 sem cert hparticipation_final
04 sem cert hparticipation_final04 sem cert hparticipation_final
04 sem cert hparticipation_finalmultimediaeval
 
RECOD at MediaEval 2014: Violent Scenes Detection Task
RECOD at MediaEval 2014: Violent Scenes Detection TaskRECOD at MediaEval 2014: Violent Scenes Detection Task
RECOD at MediaEval 2014: Violent Scenes Detection Taskmultimediaeval
 
The Munich LSTM-RNN Approach to the MediaEval 2014 “Emotion in Music” Task
The Munich LSTM-RNN Approach to the MediaEval 2014 “Emotion in Music” TaskThe Munich LSTM-RNN Approach to the MediaEval 2014 “Emotion in Music” Task
The Munich LSTM-RNN Approach to the MediaEval 2014 “Emotion in Music” Taskmultimediaeval
 

Viewers also liked (11)

Emotion in Music Task at MediaEval 2014
Emotion in Music Task at MediaEval 2014Emotion in Music Task at MediaEval 2014
Emotion in Music Task at MediaEval 2014
 
The Search and Hyperlinking Task at MediaEval 2014
The Search and Hyperlinking Task at MediaEval 2014The Search and Hyperlinking Task at MediaEval 2014
The Search and Hyperlinking Task at MediaEval 2014
 
TALP-UPC at MediaEval 2014 Placing Task: Combining Geographical Knowledge Bas...
TALP-UPC at MediaEval 2014 Placing Task: Combining Geographical Knowledge Bas...TALP-UPC at MediaEval 2014 Placing Task: Combining Geographical Knowledge Bas...
TALP-UPC at MediaEval 2014 Placing Task: Combining Geographical Knowledge Bas...
 
MediaEval 2014: THU-HCSIL Approach to Emotion in Music Task using Multi-level...
MediaEval 2014: THU-HCSIL Approach to Emotion in Music Task using Multi-level...MediaEval 2014: THU-HCSIL Approach to Emotion in Music Task using Multi-level...
MediaEval 2014: THU-HCSIL Approach to Emotion in Music Task using Multi-level...
 
UPC at MediaEval 2014 Social Event Detection Task
UPC at MediaEval 2014 Social Event Detection TaskUPC at MediaEval 2014 Social Event Detection Task
UPC at MediaEval 2014 Social Event Detection Task
 
Synchronizing Multi-User Photo Galleries with MRF
Synchronizing Multi-User Photo Galleries with MRFSynchronizing Multi-User Photo Galleries with MRF
Synchronizing Multi-User Photo Galleries with MRF
 
Stravinsqi/De Montfort University at the MediaEval 2014 C@merata Task
Stravinsqi/De Montfort University at the MediaEval 2014 C@merata TaskStravinsqi/De Montfort University at the MediaEval 2014 C@merata Task
Stravinsqi/De Montfort University at the MediaEval 2014 C@merata Task
 
4845 Programa de Embajadores Rotarios 2016 2017
4845 Programa de Embajadores Rotarios 2016 20174845 Programa de Embajadores Rotarios 2016 2017
4845 Programa de Embajadores Rotarios 2016 2017
 
04 sem cert hparticipation_final
04 sem cert hparticipation_final04 sem cert hparticipation_final
04 sem cert hparticipation_final
 
RECOD at MediaEval 2014: Violent Scenes Detection Task
RECOD at MediaEval 2014: Violent Scenes Detection TaskRECOD at MediaEval 2014: Violent Scenes Detection Task
RECOD at MediaEval 2014: Violent Scenes Detection Task
 
The Munich LSTM-RNN Approach to the MediaEval 2014 “Emotion in Music” Task
The Munich LSTM-RNN Approach to the MediaEval 2014 “Emotion in Music” TaskThe Munich LSTM-RNN Approach to the MediaEval 2014 “Emotion in Music” Task
The Munich LSTM-RNN Approach to the MediaEval 2014 “Emotion in Music” Task
 

Similar to T he SPL - IT Query by Example Search on Speech system for MediaEval 2014

2014 CISAP6 Atmospheric Turbulent Dispersion Modeling Methods using Machine L...
2014 CISAP6 Atmospheric Turbulent Dispersion Modeling Methods using Machine L...2014 CISAP6 Atmospheric Turbulent Dispersion Modeling Methods using Machine L...
2014 CISAP6 Atmospheric Turbulent Dispersion Modeling Methods using Machine L...Pierre Lauret
 
Mediaeval 2013 Spoken Web Search results slides
Mediaeval 2013 Spoken Web Search results slidesMediaeval 2013 Spoken Web Search results slides
Mediaeval 2013 Spoken Web Search results slidesXavier Anguera
 
MediaEval2015 - The SPL-IT-UC Query by Example Search on Speech system for Me...
MediaEval2015 - The SPL-IT-UC Query by Example Search on Speech system for Me...MediaEval2015 - The SPL-IT-UC Query by Example Search on Speech system for Me...
MediaEval2015 - The SPL-IT-UC Query by Example Search on Speech system for Me...multimediaeval
 
Performance Analysis of MIMO–OFDM for PCHBF , RELAY Technique with MMSE For T...
Performance Analysis of MIMO–OFDM for PCHBF , RELAY Technique with MMSE For T...Performance Analysis of MIMO–OFDM for PCHBF , RELAY Technique with MMSE For T...
Performance Analysis of MIMO–OFDM for PCHBF , RELAY Technique with MMSE For T...Sri Manakula Vinayagar Engineering College
 
Master Thesis of Computer Engineering: OpenTranslator
Master Thesis of Computer Engineering: OpenTranslatorMaster Thesis of Computer Engineering: OpenTranslator
Master Thesis of Computer Engineering: OpenTranslatorGiuseppe D'Onofrio
 
Convolutional Neural Network to Model Articulation Impairments in Patients wi...
Convolutional Neural Network to Model Articulation Impairments in Patients wi...Convolutional Neural Network to Model Articulation Impairments in Patients wi...
Convolutional Neural Network to Model Articulation Impairments in Patients wi...Juan Camilo Vasquez
 
Spectral Efficient Blind Channel Estimation Technique for MIMO-OFDM Communica...
Spectral Efficient Blind Channel Estimation Technique for MIMO-OFDM Communica...Spectral Efficient Blind Channel Estimation Technique for MIMO-OFDM Communica...
Spectral Efficient Blind Channel Estimation Technique for MIMO-OFDM Communica...IJAAS Team
 
MediaEval 2015 - The SPL-IT-UC Query by Example Search on Speech system for M...
MediaEval 2015 - The SPL-IT-UC Query by Example Search on Speech system for M...MediaEval 2015 - The SPL-IT-UC Query by Example Search on Speech system for M...
MediaEval 2015 - The SPL-IT-UC Query by Example Search on Speech system for M...multimediaeval
 
VOICE PASSWORD BASED SPEAKER VERIFICATION SYSTEM USING VOWEL AND NON VOWEL RE...
VOICE PASSWORD BASED SPEAKER VERIFICATION SYSTEM USING VOWEL AND NON VOWEL RE...VOICE PASSWORD BASED SPEAKER VERIFICATION SYSTEM USING VOWEL AND NON VOWEL RE...
VOICE PASSWORD BASED SPEAKER VERIFICATION SYSTEM USING VOWEL AND NON VOWEL RE...niranjan kumar
 
A Summative Comparison of Blind Channel Estimation Techniques for Orthogonal ...
A Summative Comparison of Blind Channel Estimation Techniques for Orthogonal ...A Summative Comparison of Blind Channel Estimation Techniques for Orthogonal ...
A Summative Comparison of Blind Channel Estimation Techniques for Orthogonal ...IJECEIAES
 
USRP Project Final Report
USRP Project Final ReportUSRP Project Final Report
USRP Project Final ReportArjan Gupta
 
Peak detection using wavelet transform
Peak detection using wavelet transformPeak detection using wavelet transform
Peak detection using wavelet transformIJCNCJournal
 
Modeling Text Independent Speaker Identification with Vector Quantization
Modeling Text Independent Speaker Identification with Vector QuantizationModeling Text Independent Speaker Identification with Vector Quantization
Modeling Text Independent Speaker Identification with Vector QuantizationTELKOMNIKA JOURNAL
 
ICT-GroupProject-Report2-NguyenDangHoa_2
ICT-GroupProject-Report2-NguyenDangHoa_2ICT-GroupProject-Report2-NguyenDangHoa_2
ICT-GroupProject-Report2-NguyenDangHoa_2Minh Tuan Nguyen
 
J03502050055
J03502050055J03502050055
J03502050055theijes
 
An experience on empirical research about rdf stream
An experience on empirical research about rdf streamAn experience on empirical research about rdf stream
An experience on empirical research about rdf streamDaniele Dell'Aglio
 
Semantics, Automatic Metadata and Audiovisual Contents. A case of study: the ...
Semantics, Automatic Metadata and Audiovisual Contents. A case of study: the ...Semantics, Automatic Metadata and Audiovisual Contents. A case of study: the ...
Semantics, Automatic Metadata and Audiovisual Contents. A case of study: the ...FIAT/IFTA
 

Similar to T he SPL - IT Query by Example Search on Speech system for MediaEval 2014 (20)

2014 CISAP6 Atmospheric Turbulent Dispersion Modeling Methods using Machine L...
2014 CISAP6 Atmospheric Turbulent Dispersion Modeling Methods using Machine L...2014 CISAP6 Atmospheric Turbulent Dispersion Modeling Methods using Machine L...
2014 CISAP6 Atmospheric Turbulent Dispersion Modeling Methods using Machine L...
 
Mediaeval 2013 Spoken Web Search results slides
Mediaeval 2013 Spoken Web Search results slidesMediaeval 2013 Spoken Web Search results slides
Mediaeval 2013 Spoken Web Search results slides
 
MediaEval2015 - The SPL-IT-UC Query by Example Search on Speech system for Me...
MediaEval2015 - The SPL-IT-UC Query by Example Search on Speech system for Me...MediaEval2015 - The SPL-IT-UC Query by Example Search on Speech system for Me...
MediaEval2015 - The SPL-IT-UC Query by Example Search on Speech system for Me...
 
Performance Analysis of MIMO–OFDM for PCHBF , RELAY Technique with MMSE For T...
Performance Analysis of MIMO–OFDM for PCHBF , RELAY Technique with MMSE For T...Performance Analysis of MIMO–OFDM for PCHBF , RELAY Technique with MMSE For T...
Performance Analysis of MIMO–OFDM for PCHBF , RELAY Technique with MMSE For T...
 
Master Thesis of Computer Engineering: OpenTranslator
Master Thesis of Computer Engineering: OpenTranslatorMaster Thesis of Computer Engineering: OpenTranslator
Master Thesis of Computer Engineering: OpenTranslator
 
Convolutional Neural Network to Model Articulation Impairments in Patients wi...
Convolutional Neural Network to Model Articulation Impairments in Patients wi...Convolutional Neural Network to Model Articulation Impairments in Patients wi...
Convolutional Neural Network to Model Articulation Impairments in Patients wi...
 
Spectral Efficient Blind Channel Estimation Technique for MIMO-OFDM Communica...
Spectral Efficient Blind Channel Estimation Technique for MIMO-OFDM Communica...Spectral Efficient Blind Channel Estimation Technique for MIMO-OFDM Communica...
Spectral Efficient Blind Channel Estimation Technique for MIMO-OFDM Communica...
 
MediaEval 2015 - The SPL-IT-UC Query by Example Search on Speech system for M...
MediaEval 2015 - The SPL-IT-UC Query by Example Search on Speech system for M...MediaEval 2015 - The SPL-IT-UC Query by Example Search on Speech system for M...
MediaEval 2015 - The SPL-IT-UC Query by Example Search on Speech system for M...
 
VOICE PASSWORD BASED SPEAKER VERIFICATION SYSTEM USING VOWEL AND NON VOWEL RE...
VOICE PASSWORD BASED SPEAKER VERIFICATION SYSTEM USING VOWEL AND NON VOWEL RE...VOICE PASSWORD BASED SPEAKER VERIFICATION SYSTEM USING VOWEL AND NON VOWEL RE...
VOICE PASSWORD BASED SPEAKER VERIFICATION SYSTEM USING VOWEL AND NON VOWEL RE...
 
A Summative Comparison of Blind Channel Estimation Techniques for Orthogonal ...
A Summative Comparison of Blind Channel Estimation Techniques for Orthogonal ...A Summative Comparison of Blind Channel Estimation Techniques for Orthogonal ...
A Summative Comparison of Blind Channel Estimation Techniques for Orthogonal ...
 
USRP Project Final Report
USRP Project Final ReportUSRP Project Final Report
USRP Project Final Report
 
Speaker Recognition
Speaker RecognitionSpeaker Recognition
Speaker Recognition
 
Peak detection using wavelet transform
Peak detection using wavelet transformPeak detection using wavelet transform
Peak detection using wavelet transform
 
Modeling Text Independent Speaker Identification with Vector Quantization
Modeling Text Independent Speaker Identification with Vector QuantizationModeling Text Independent Speaker Identification with Vector Quantization
Modeling Text Independent Speaker Identification with Vector Quantization
 
Nni v7
Nni v7Nni v7
Nni v7
 
ICT-GroupProject-Report2-NguyenDangHoa_2
ICT-GroupProject-Report2-NguyenDangHoa_2ICT-GroupProject-Report2-NguyenDangHoa_2
ICT-GroupProject-Report2-NguyenDangHoa_2
 
J03502050055
J03502050055J03502050055
J03502050055
 
An experience on empirical research about rdf stream
An experience on empirical research about rdf streamAn experience on empirical research about rdf stream
An experience on empirical research about rdf stream
 
CV_English_Fernando_Andres_Sanchez_Gonzalez_17-Feb-2017
CV_English_Fernando_Andres_Sanchez_Gonzalez_17-Feb-2017CV_English_Fernando_Andres_Sanchez_Gonzalez_17-Feb-2017
CV_English_Fernando_Andres_Sanchez_Gonzalez_17-Feb-2017
 
Semantics, Automatic Metadata and Audiovisual Contents. A case of study: the ...
Semantics, Automatic Metadata and Audiovisual Contents. A case of study: the ...Semantics, Automatic Metadata and Audiovisual Contents. A case of study: the ...
Semantics, Automatic Metadata and Audiovisual Contents. A case of study: the ...
 

More from multimediaeval

Classification of Strokes in Table Tennis with a Three Stream Spatio-Temporal...
Classification of Strokes in Table Tennis with a Three Stream Spatio-Temporal...Classification of Strokes in Table Tennis with a Three Stream Spatio-Temporal...
Classification of Strokes in Table Tennis with a Three Stream Spatio-Temporal...multimediaeval
 
HCMUS at MediaEval 2020: Ensembles of Temporal Deep Neural Networks for Table...
HCMUS at MediaEval 2020: Ensembles of Temporal Deep Neural Networks for Table...HCMUS at MediaEval 2020: Ensembles of Temporal Deep Neural Networks for Table...
HCMUS at MediaEval 2020: Ensembles of Temporal Deep Neural Networks for Table...multimediaeval
 
Sports Video Classification: Classification of Strokes in Table Tennis for Me...
Sports Video Classification: Classification of Strokes in Table Tennis for Me...Sports Video Classification: Classification of Strokes in Table Tennis for Me...
Sports Video Classification: Classification of Strokes in Table Tennis for Me...multimediaeval
 
Predicting Media Memorability from a Multimodal Late Fusion of Self-Attention...
Predicting Media Memorability from a Multimodal Late Fusion of Self-Attention...Predicting Media Memorability from a Multimodal Late Fusion of Self-Attention...
Predicting Media Memorability from a Multimodal Late Fusion of Self-Attention...multimediaeval
 
Essex-NLIP at MediaEval Predicting Media Memorability 2020 Task
Essex-NLIP at MediaEval Predicting Media Memorability 2020 TaskEssex-NLIP at MediaEval Predicting Media Memorability 2020 Task
Essex-NLIP at MediaEval Predicting Media Memorability 2020 Taskmultimediaeval
 
Overview of MediaEval 2020 Predicting Media Memorability task: What Makes a V...
Overview of MediaEval 2020 Predicting Media Memorability task: What Makes a V...Overview of MediaEval 2020 Predicting Media Memorability task: What Makes a V...
Overview of MediaEval 2020 Predicting Media Memorability task: What Makes a V...multimediaeval
 
Fooling an Automatic Image Quality Estimator
Fooling an Automatic Image Quality EstimatorFooling an Automatic Image Quality Estimator
Fooling an Automatic Image Quality Estimatormultimediaeval
 
Fooling Blind Image Quality Assessment by Optimizing a Human-Understandable C...
Fooling Blind Image Quality Assessment by Optimizing a Human-Understandable C...Fooling Blind Image Quality Assessment by Optimizing a Human-Understandable C...
Fooling Blind Image Quality Assessment by Optimizing a Human-Understandable C...multimediaeval
 
Pixel Privacy: Quality Camouflage for Social Images
Pixel Privacy: Quality Camouflage for Social ImagesPixel Privacy: Quality Camouflage for Social Images
Pixel Privacy: Quality Camouflage for Social Imagesmultimediaeval
 
HCMUS at MediaEval 2020:Image-Text Fusion for Automatic News-Images Re-Matching
HCMUS at MediaEval 2020:Image-Text Fusion for Automatic News-Images Re-MatchingHCMUS at MediaEval 2020:Image-Text Fusion for Automatic News-Images Re-Matching
HCMUS at MediaEval 2020:Image-Text Fusion for Automatic News-Images Re-Matchingmultimediaeval
 
Efficient Supervision Net: Polyp Segmentation using EfficientNet and Attentio...
Efficient Supervision Net: Polyp Segmentation using EfficientNet and Attentio...Efficient Supervision Net: Polyp Segmentation using EfficientNet and Attentio...
Efficient Supervision Net: Polyp Segmentation using EfficientNet and Attentio...multimediaeval
 
HCMUS at Medico Automatic Polyp Segmentation Task 2020: PraNet and ResUnet++ ...
HCMUS at Medico Automatic Polyp Segmentation Task 2020: PraNet and ResUnet++ ...HCMUS at Medico Automatic Polyp Segmentation Task 2020: PraNet and ResUnet++ ...
HCMUS at Medico Automatic Polyp Segmentation Task 2020: PraNet and ResUnet++ ...multimediaeval
 
Depth-wise Separable Atrous Convolution for Polyps Segmentation in Gastro-Int...
Depth-wise Separable Atrous Convolution for Polyps Segmentation in Gastro-Int...Depth-wise Separable Atrous Convolution for Polyps Segmentation in Gastro-Int...
Depth-wise Separable Atrous Convolution for Polyps Segmentation in Gastro-Int...multimediaeval
 
Deep Conditional Adversarial learning for polyp Segmentation
Deep Conditional Adversarial learning for polyp SegmentationDeep Conditional Adversarial learning for polyp Segmentation
Deep Conditional Adversarial learning for polyp Segmentationmultimediaeval
 
A Temporal-Spatial Attention Model for Medical Image Detection
A Temporal-Spatial Attention Model for Medical Image DetectionA Temporal-Spatial Attention Model for Medical Image Detection
A Temporal-Spatial Attention Model for Medical Image Detectionmultimediaeval
 
HCMUS-Juniors 2020 at Medico Task in MediaEval 2020: Refined Deep Neural Netw...
HCMUS-Juniors 2020 at Medico Task in MediaEval 2020: Refined Deep Neural Netw...HCMUS-Juniors 2020 at Medico Task in MediaEval 2020: Refined Deep Neural Netw...
HCMUS-Juniors 2020 at Medico Task in MediaEval 2020: Refined Deep Neural Netw...multimediaeval
 
Fine-tuning for Polyp Segmentation with Attention
Fine-tuning for Polyp Segmentation with AttentionFine-tuning for Polyp Segmentation with Attention
Fine-tuning for Polyp Segmentation with Attentionmultimediaeval
 
Bigger Networks are not Always Better: Deep Convolutional Neural Networks for...
Bigger Networks are not Always Better: Deep Convolutional Neural Networks for...Bigger Networks are not Always Better: Deep Convolutional Neural Networks for...
Bigger Networks are not Always Better: Deep Convolutional Neural Networks for...multimediaeval
 
Insights for wellbeing: Predicting Personal Air Quality Index using Regressio...
Insights for wellbeing: Predicting Personal Air Quality Index using Regressio...Insights for wellbeing: Predicting Personal Air Quality Index using Regressio...
Insights for wellbeing: Predicting Personal Air Quality Index using Regressio...multimediaeval
 
Use Visual Features From Surrounding Scenes to Improve Personal Air Quality ...
 Use Visual Features From Surrounding Scenes to Improve Personal Air Quality ... Use Visual Features From Surrounding Scenes to Improve Personal Air Quality ...
Use Visual Features From Surrounding Scenes to Improve Personal Air Quality ...multimediaeval
 

More from multimediaeval (20)

Classification of Strokes in Table Tennis with a Three Stream Spatio-Temporal...
Classification of Strokes in Table Tennis with a Three Stream Spatio-Temporal...Classification of Strokes in Table Tennis with a Three Stream Spatio-Temporal...
Classification of Strokes in Table Tennis with a Three Stream Spatio-Temporal...
 
HCMUS at MediaEval 2020: Ensembles of Temporal Deep Neural Networks for Table...
HCMUS at MediaEval 2020: Ensembles of Temporal Deep Neural Networks for Table...HCMUS at MediaEval 2020: Ensembles of Temporal Deep Neural Networks for Table...
HCMUS at MediaEval 2020: Ensembles of Temporal Deep Neural Networks for Table...
 
Sports Video Classification: Classification of Strokes in Table Tennis for Me...
Sports Video Classification: Classification of Strokes in Table Tennis for Me...Sports Video Classification: Classification of Strokes in Table Tennis for Me...
Sports Video Classification: Classification of Strokes in Table Tennis for Me...
 
Predicting Media Memorability from a Multimodal Late Fusion of Self-Attention...
Predicting Media Memorability from a Multimodal Late Fusion of Self-Attention...Predicting Media Memorability from a Multimodal Late Fusion of Self-Attention...
Predicting Media Memorability from a Multimodal Late Fusion of Self-Attention...
 
Essex-NLIP at MediaEval Predicting Media Memorability 2020 Task
Essex-NLIP at MediaEval Predicting Media Memorability 2020 TaskEssex-NLIP at MediaEval Predicting Media Memorability 2020 Task
Essex-NLIP at MediaEval Predicting Media Memorability 2020 Task
 
Overview of MediaEval 2020 Predicting Media Memorability task: What Makes a V...
Overview of MediaEval 2020 Predicting Media Memorability task: What Makes a V...Overview of MediaEval 2020 Predicting Media Memorability task: What Makes a V...
Overview of MediaEval 2020 Predicting Media Memorability task: What Makes a V...
 
Fooling an Automatic Image Quality Estimator
Fooling an Automatic Image Quality EstimatorFooling an Automatic Image Quality Estimator
Fooling an Automatic Image Quality Estimator
 
Fooling Blind Image Quality Assessment by Optimizing a Human-Understandable C...
Fooling Blind Image Quality Assessment by Optimizing a Human-Understandable C...Fooling Blind Image Quality Assessment by Optimizing a Human-Understandable C...
Fooling Blind Image Quality Assessment by Optimizing a Human-Understandable C...
 
Pixel Privacy: Quality Camouflage for Social Images
Pixel Privacy: Quality Camouflage for Social ImagesPixel Privacy: Quality Camouflage for Social Images
Pixel Privacy: Quality Camouflage for Social Images
 
HCMUS at MediaEval 2020:Image-Text Fusion for Automatic News-Images Re-Matching
HCMUS at MediaEval 2020:Image-Text Fusion for Automatic News-Images Re-MatchingHCMUS at MediaEval 2020:Image-Text Fusion for Automatic News-Images Re-Matching
HCMUS at MediaEval 2020:Image-Text Fusion for Automatic News-Images Re-Matching
 
Efficient Supervision Net: Polyp Segmentation using EfficientNet and Attentio...
Efficient Supervision Net: Polyp Segmentation using EfficientNet and Attentio...Efficient Supervision Net: Polyp Segmentation using EfficientNet and Attentio...
Efficient Supervision Net: Polyp Segmentation using EfficientNet and Attentio...
 
HCMUS at Medico Automatic Polyp Segmentation Task 2020: PraNet and ResUnet++ ...
HCMUS at Medico Automatic Polyp Segmentation Task 2020: PraNet and ResUnet++ ...HCMUS at Medico Automatic Polyp Segmentation Task 2020: PraNet and ResUnet++ ...
HCMUS at Medico Automatic Polyp Segmentation Task 2020: PraNet and ResUnet++ ...
 
Depth-wise Separable Atrous Convolution for Polyps Segmentation in Gastro-Int...
Depth-wise Separable Atrous Convolution for Polyps Segmentation in Gastro-Int...Depth-wise Separable Atrous Convolution for Polyps Segmentation in Gastro-Int...
Depth-wise Separable Atrous Convolution for Polyps Segmentation in Gastro-Int...
 
Deep Conditional Adversarial learning for polyp Segmentation
Deep Conditional Adversarial learning for polyp SegmentationDeep Conditional Adversarial learning for polyp Segmentation
Deep Conditional Adversarial learning for polyp Segmentation
 
A Temporal-Spatial Attention Model for Medical Image Detection
A Temporal-Spatial Attention Model for Medical Image DetectionA Temporal-Spatial Attention Model for Medical Image Detection
A Temporal-Spatial Attention Model for Medical Image Detection
 
HCMUS-Juniors 2020 at Medico Task in MediaEval 2020: Refined Deep Neural Netw...
HCMUS-Juniors 2020 at Medico Task in MediaEval 2020: Refined Deep Neural Netw...HCMUS-Juniors 2020 at Medico Task in MediaEval 2020: Refined Deep Neural Netw...
HCMUS-Juniors 2020 at Medico Task in MediaEval 2020: Refined Deep Neural Netw...
 
Fine-tuning for Polyp Segmentation with Attention
Fine-tuning for Polyp Segmentation with AttentionFine-tuning for Polyp Segmentation with Attention
Fine-tuning for Polyp Segmentation with Attention
 
Bigger Networks are not Always Better: Deep Convolutional Neural Networks for...
Bigger Networks are not Always Better: Deep Convolutional Neural Networks for...Bigger Networks are not Always Better: Deep Convolutional Neural Networks for...
Bigger Networks are not Always Better: Deep Convolutional Neural Networks for...
 
Insights for wellbeing: Predicting Personal Air Quality Index using Regressio...
Insights for wellbeing: Predicting Personal Air Quality Index using Regressio...Insights for wellbeing: Predicting Personal Air Quality Index using Regressio...
Insights for wellbeing: Predicting Personal Air Quality Index using Regressio...
 
Use Visual Features From Surrounding Scenes to Improve Personal Air Quality ...
 Use Visual Features From Surrounding Scenes to Improve Personal Air Quality ... Use Visual Features From Surrounding Scenes to Improve Personal Air Quality ...
Use Visual Features From Surrounding Scenes to Improve Personal Air Quality ...
 

Recently uploaded

Auto Affiliate AI Earns First Commission in 3 Hours..pdf
Auto Affiliate  AI Earns First Commission in 3 Hours..pdfAuto Affiliate  AI Earns First Commission in 3 Hours..pdf
Auto Affiliate AI Earns First Commission in 3 Hours..pdfSelfMade bd
 
Abortion Pills For Sale WhatsApp[[+27737758557]] In Birch Acres, Abortion Pil...
Abortion Pills For Sale WhatsApp[[+27737758557]] In Birch Acres, Abortion Pil...Abortion Pills For Sale WhatsApp[[+27737758557]] In Birch Acres, Abortion Pil...
Abortion Pills For Sale WhatsApp[[+27737758557]] In Birch Acres, Abortion Pil...drm1699
 
UNI DI NAPOLI FEDERICO II - Il ruolo dei grafi nell'AI Conversazionale Ibrida
UNI DI NAPOLI FEDERICO II - Il ruolo dei grafi nell'AI Conversazionale IbridaUNI DI NAPOLI FEDERICO II - Il ruolo dei grafi nell'AI Conversazionale Ibrida
UNI DI NAPOLI FEDERICO II - Il ruolo dei grafi nell'AI Conversazionale IbridaNeo4j
 
Workshop - Architecting Innovative Graph Applications- GraphSummit Milan
Workshop -  Architecting Innovative Graph Applications- GraphSummit MilanWorkshop -  Architecting Innovative Graph Applications- GraphSummit Milan
Workshop - Architecting Innovative Graph Applications- GraphSummit MilanNeo4j
 
CERVED e Neo4j su una nuvola, migrazione ed evoluzione di un grafo mission cr...
CERVED e Neo4j su una nuvola, migrazione ed evoluzione di un grafo mission cr...CERVED e Neo4j su una nuvola, migrazione ed evoluzione di un grafo mission cr...
CERVED e Neo4j su una nuvola, migrazione ed evoluzione di un grafo mission cr...Neo4j
 
Community is Just as Important as Code by Andrea Goulet
Community is Just as Important as Code by Andrea GouletCommunity is Just as Important as Code by Andrea Goulet
Community is Just as Important as Code by Andrea GouletAndrea Goulet
 
A Deep Dive into Secure Product Development Frameworks.pdf
A Deep Dive into Secure Product Development Frameworks.pdfA Deep Dive into Secure Product Development Frameworks.pdf
A Deep Dive into Secure Product Development Frameworks.pdfICS
 
Alluxio Monthly Webinar | Simplify Data Access for AI in Multi-Cloud
Alluxio Monthly Webinar | Simplify Data Access for AI in Multi-CloudAlluxio Monthly Webinar | Simplify Data Access for AI in Multi-Cloud
Alluxio Monthly Webinar | Simplify Data Access for AI in Multi-CloudAlluxio, Inc.
 
Software Engineering - Introduction + Process Models + Requirements Engineering
Software Engineering - Introduction + Process Models + Requirements EngineeringSoftware Engineering - Introduction + Process Models + Requirements Engineering
Software Engineering - Introduction + Process Models + Requirements EngineeringPrakhyath Rai
 
Navigation in flutter – how to add stack, tab, and drawer navigators to your ...
Navigation in flutter – how to add stack, tab, and drawer navigators to your ...Navigation in flutter – how to add stack, tab, and drawer navigators to your ...
Navigation in flutter – how to add stack, tab, and drawer navigators to your ...Flutter Agency
 
Evolving Data Governance for the Real-time Streaming and AI Era
Evolving Data Governance for the Real-time Streaming and AI EraEvolving Data Governance for the Real-time Streaming and AI Era
Evolving Data Governance for the Real-time Streaming and AI Eraconfluent
 
Incident handling is a clearly defined set of procedures to manage and respon...
Incident handling is a clearly defined set of procedures to manage and respon...Incident handling is a clearly defined set of procedures to manage and respon...
Incident handling is a clearly defined set of procedures to manage and respon...Varun Mithran
 
From Knowledge Graphs via Lego Bricks to scientific conversations.pptx
From Knowledge Graphs via Lego Bricks to scientific conversations.pptxFrom Knowledge Graphs via Lego Bricks to scientific conversations.pptx
From Knowledge Graphs via Lego Bricks to scientific conversations.pptxNeo4j
 
Novo Nordisk: When Knowledge Graphs meet LLMs
Novo Nordisk: When Knowledge Graphs meet LLMsNovo Nordisk: When Knowledge Graphs meet LLMs
Novo Nordisk: When Knowledge Graphs meet LLMsNeo4j
 
Anypoint Code Builder - Munich MuleSoft Meetup - 16th May 2024
Anypoint Code Builder - Munich MuleSoft Meetup - 16th May 2024Anypoint Code Builder - Munich MuleSoft Meetup - 16th May 2024
Anypoint Code Builder - Munich MuleSoft Meetup - 16th May 2024MulesoftMunichMeetup
 
Automate your OpenSIPS config tests - OpenSIPS Summit 2024
Automate your OpenSIPS config tests - OpenSIPS Summit 2024Automate your OpenSIPS config tests - OpenSIPS Summit 2024
Automate your OpenSIPS config tests - OpenSIPS Summit 2024Andreas Granig
 

Recently uploaded (20)

Auto Affiliate AI Earns First Commission in 3 Hours..pdf
Auto Affiliate  AI Earns First Commission in 3 Hours..pdfAuto Affiliate  AI Earns First Commission in 3 Hours..pdf
Auto Affiliate AI Earns First Commission in 3 Hours..pdf
 
Abortion Pills For Sale WhatsApp[[+27737758557]] In Birch Acres, Abortion Pil...
Abortion Pills For Sale WhatsApp[[+27737758557]] In Birch Acres, Abortion Pil...Abortion Pills For Sale WhatsApp[[+27737758557]] In Birch Acres, Abortion Pil...
Abortion Pills For Sale WhatsApp[[+27737758557]] In Birch Acres, Abortion Pil...
 
UNI DI NAPOLI FEDERICO II - Il ruolo dei grafi nell'AI Conversazionale Ibrida
UNI DI NAPOLI FEDERICO II - Il ruolo dei grafi nell'AI Conversazionale IbridaUNI DI NAPOLI FEDERICO II - Il ruolo dei grafi nell'AI Conversazionale Ibrida
UNI DI NAPOLI FEDERICO II - Il ruolo dei grafi nell'AI Conversazionale Ibrida
 
Workshop - Architecting Innovative Graph Applications- GraphSummit Milan
Workshop -  Architecting Innovative Graph Applications- GraphSummit MilanWorkshop -  Architecting Innovative Graph Applications- GraphSummit Milan
Workshop - Architecting Innovative Graph Applications- GraphSummit Milan
 
CERVED e Neo4j su una nuvola, migrazione ed evoluzione di un grafo mission cr...
CERVED e Neo4j su una nuvola, migrazione ed evoluzione di un grafo mission cr...CERVED e Neo4j su una nuvola, migrazione ed evoluzione di un grafo mission cr...
CERVED e Neo4j su una nuvola, migrazione ed evoluzione di un grafo mission cr...
 
Community is Just as Important as Code by Andrea Goulet
Community is Just as Important as Code by Andrea GouletCommunity is Just as Important as Code by Andrea Goulet
Community is Just as Important as Code by Andrea Goulet
 
A Deep Dive into Secure Product Development Frameworks.pdf
A Deep Dive into Secure Product Development Frameworks.pdfA Deep Dive into Secure Product Development Frameworks.pdf
A Deep Dive into Secure Product Development Frameworks.pdf
 
Alluxio Monthly Webinar | Simplify Data Access for AI in Multi-Cloud
Alluxio Monthly Webinar | Simplify Data Access for AI in Multi-CloudAlluxio Monthly Webinar | Simplify Data Access for AI in Multi-Cloud
Alluxio Monthly Webinar | Simplify Data Access for AI in Multi-Cloud
 
Software Engineering - Introduction + Process Models + Requirements Engineering
Software Engineering - Introduction + Process Models + Requirements EngineeringSoftware Engineering - Introduction + Process Models + Requirements Engineering
Software Engineering - Introduction + Process Models + Requirements Engineering
 
Abortion Clinic In Pretoria ](+27832195400*)[ 🏥 Safe Abortion Pills in Pretor...
Abortion Clinic In Pretoria ](+27832195400*)[ 🏥 Safe Abortion Pills in Pretor...Abortion Clinic In Pretoria ](+27832195400*)[ 🏥 Safe Abortion Pills in Pretor...
Abortion Clinic In Pretoria ](+27832195400*)[ 🏥 Safe Abortion Pills in Pretor...
 
Abortion Clinic In Springs ](+27832195400*)[ 🏥 Safe Abortion Pills in Springs...
Abortion Clinic In Springs ](+27832195400*)[ 🏥 Safe Abortion Pills in Springs...Abortion Clinic In Springs ](+27832195400*)[ 🏥 Safe Abortion Pills in Springs...
Abortion Clinic In Springs ](+27832195400*)[ 🏥 Safe Abortion Pills in Springs...
 
Navigation in flutter – how to add stack, tab, and drawer navigators to your ...
Navigation in flutter – how to add stack, tab, and drawer navigators to your ...Navigation in flutter – how to add stack, tab, and drawer navigators to your ...
Navigation in flutter – how to add stack, tab, and drawer navigators to your ...
 
Evolving Data Governance for the Real-time Streaming and AI Era
Evolving Data Governance for the Real-time Streaming and AI EraEvolving Data Governance for the Real-time Streaming and AI Era
Evolving Data Governance for the Real-time Streaming and AI Era
 
Abortion Pill Prices Turfloop ](+27832195400*)[ 🏥 Women's Abortion Clinic in ...
Abortion Pill Prices Turfloop ](+27832195400*)[ 🏥 Women's Abortion Clinic in ...Abortion Pill Prices Turfloop ](+27832195400*)[ 🏥 Women's Abortion Clinic in ...
Abortion Pill Prices Turfloop ](+27832195400*)[ 🏥 Women's Abortion Clinic in ...
 
Incident handling is a clearly defined set of procedures to manage and respon...
Incident handling is a clearly defined set of procedures to manage and respon...Incident handling is a clearly defined set of procedures to manage and respon...
Incident handling is a clearly defined set of procedures to manage and respon...
 
Abortion Pill Prices Mthatha (@](+27832195400*)[ 🏥 Women's Abortion Clinic In...
Abortion Pill Prices Mthatha (@](+27832195400*)[ 🏥 Women's Abortion Clinic In...Abortion Pill Prices Mthatha (@](+27832195400*)[ 🏥 Women's Abortion Clinic In...
Abortion Pill Prices Mthatha (@](+27832195400*)[ 🏥 Women's Abortion Clinic In...
 
From Knowledge Graphs via Lego Bricks to scientific conversations.pptx
From Knowledge Graphs via Lego Bricks to scientific conversations.pptxFrom Knowledge Graphs via Lego Bricks to scientific conversations.pptx
From Knowledge Graphs via Lego Bricks to scientific conversations.pptx
 
Novo Nordisk: When Knowledge Graphs meet LLMs
Novo Nordisk: When Knowledge Graphs meet LLMsNovo Nordisk: When Knowledge Graphs meet LLMs
Novo Nordisk: When Knowledge Graphs meet LLMs
 
Anypoint Code Builder - Munich MuleSoft Meetup - 16th May 2024
Anypoint Code Builder - Munich MuleSoft Meetup - 16th May 2024Anypoint Code Builder - Munich MuleSoft Meetup - 16th May 2024
Anypoint Code Builder - Munich MuleSoft Meetup - 16th May 2024
 
Automate your OpenSIPS config tests - OpenSIPS Summit 2024
Automate your OpenSIPS config tests - OpenSIPS Summit 2024Automate your OpenSIPS config tests - OpenSIPS Summit 2024
Automate your OpenSIPS config tests - OpenSIPS Summit 2024
 

T he SPL - IT Query by Example Search on Speech system for MediaEval 2014

  • 1. Jorge Proença 1,2 Arlindo Veiga 1,2 Fernando Perdigão 1,2 The SPL-IT Query by Example Search on Speech system for MediaEval 2014 The 2014 Query by Example Search on Speech (QUESST) 1 Instituto de Telecomunicações, Coimbra, Portugal 2 Electrical and Computer Eng. Department, University of Coimbra, Portugal
  • 2. 2 SPL-IT system MediaEval 2014 | October 16-17 2014, Barcelona, SPAIN Overview of the system:  Fuses Dynamic Time Warping (DTW) modifications  Fuses results from systems with phonetic recognizers for 3 languages
  • 3. 3 Phonetic Recognizer MediaEval 2014 | October 16-17 2014, Barcelona, SPAIN  Hard to extract good posteriorgrams with an HMM system (our in- house system).  Used 3 systems/languages (for 8 kHz) based on long temporal context and neural networks from Brnu University of Technology (BUT):  Czech  Hungarian  Russian  Output: posteriorgrams (3 states per phoneme).  Leading and trailing silence/noise removed PhonemeState Frame State Posteriorgram example for one query
  • 4. 4 Dynamic Time Warping MediaEval 2014 | October 16-17 2014, Barcelona, SPAIN  Local Distance matrix:  Dot Product of Query and Audio posterior probability vectors;  Back-off with l =10-4    , logD q x q x   Distance Matrix of Query vs Audio
  • 5. 5 Dynamic Time Warping MediaEval 2014 | October 16-17 2014, Barcelona, SPAIN  Basic DTW strategy (A1):  Smallest distance in identically weighted unitary jumps: Distance Matrix (top) and accumulated Distance matrix (bottom) of Query vs Audio
  • 6. 6 DTW Modifications MediaEval 2014 | October 16-17 2014, Barcelona, SPAIN  4 additional approaches: (A2) – Cutting up to 250ms at the end of the query, keeping the segment above 500ms (A3) – Cutting up to 250ms at the beginning of the query, keeping the segment above 500ms QueryQuery Audio Query vs. Audio posterior distance matrix (top) and the best path from A2 (bottom)
  • 7. 7 DTW Modifications MediaEval 2014 | October 16-17 2014, Barcelona, SPAIN (A4) – Allowing one jump in the path up to ½ Query’s length, can’t occur at initial and final 250ms of the query can’t occur for queries shorter than 800ms QueryQuery Audio Query vs. Audio posterior distance matrix (top) and the best path from A4 (bottom)
  • 8. 8 DTW Modifications MediaEval 2014 | October 16-17 2014, Barcelona, SPAIN (A5) – Swaps: accounting for re-ordering of words. Backtrack the best 5 candidates from (A1) from the end, Find the best path for the beginning of the query, ahead of the end of the first one, with restrictions similar to (A4).QueryQuery Audio Query vs. Audio posterior distance matrix (top) and the best path from A5 (bottom)
  • 9. 9 Fusing systems MediaEval 2014 | October 16-17 2014, Barcelona, SPAIN  Different approaches:  Minimum of the approaches – not the best.  Harmonic mean found to be a good compromise.  Per-query normalization (standard score):  Different languages:  Arithmetic mean of the 3 scores.  X
  • 10. 10 Submissions and Results MediaEval 2014 | October 16-17 2014, Barcelona, SPAIN  Primary: fusing (A1) and (A2) (basic and cutting the end)  Late: fusing the 5 approaches.  Late provided worse overall results primary late Cnxe, MinCnxe - Dev 0.6797, 0.5438 0.7106, 0.5881 Cnxe, MinCnxe - Eval 0.6588, 0.5080 0.6708, 0.5240 ATWV, MTWV - Dev 0.4494, 0.4494 0.4051, 0.4052 ATWV, MTWV - Eval 0.4399, 0.4423 0.3918, 0.4218
  • 11. 11 Submissions and Results (cont.) MediaEval 2014 | October 16-17 2014, Barcelona, SPAIN  Primary: fusing (A1) and (A2) (basic and cutting the end)  Late: fusing the 5 approaches. Cnxe for isolated approaches on Eval:  A1: 0.6823, A2: 0.6721, A3: 0.6947, A4: 0.6957 A5: 0.6999  For Type 3 queries, late system was better:  0.8049 Cnxe on primary to 0.7865 Cnxe on late primary late Cnxe, MinCnxe - Eval 0.6588, 0.5080 0.6708, 0.5240
  • 12. 12 Conclusions MediaEval 2014 | October 16-17 2014, Barcelona, SPAIN  Although this year’s task has an added difficulty, a simple DTW still works well for most cases.  Cutting queries at the end revealed to be the best strategy, and fusing it with A1 was even better.  Including the possibility of jumps and re-orders increased False Positives overall, since these special cases are a small part of the database.  We lacked an optimization method for Cnxe  Which would greatly improve the results.
  • 13. 13 END – Thank You MediaEval 2014 | October 16-17 2014, Barcelona, SPAIN Processing Speed:  Hardware – CRAY CX1 Cluster, running windows server 2008 HPC, using 16 of 56 cores (7 nodes with double Intel Xeon 5520 2.27GHz quad-core and 24GB RAM per node).  Indexing Speed Factor – 1.4  Searching Speed Factor – 0.0029 per sec and per language  Peak Memory – 0.098 GB