SlideShare a Scribd company logo
RECOD @ Placing Task of
MediaEval 2015
L. T. Li1
, J. A. V. Muñoz1
, J. Almeida1,3
,
R. T. Calumby1,4
, O. A. B. Penatti1,2
, I. C. Dourado1
,
K. Nogueira6
, P. R. Mendes Júnior1
,
D. C. G. Pedronette1,5
, J. A. dos Santos6
,
M. A. Gonçalves6
, and R. S. Torres1
Daniel Moreira
On behalf of the authors.
1. UNICAMP – 2. UNIFESP – 3. SAMSUNG – 4. UEFS – 5. UNESP – 6. UFMG
– BRAZIL –
2015 Participation
● We focused on the Localization subtask;
● Innovations concerning the last year (2014)
– Rank agreggation based on Genetic
Programming;
– Geocoding improvement with Ranked List Density
Analysis (RLDA).
Submitted Runs
● Run 1
Textual Features with Genetic Programming
– Image and Video
Descriptors
BM25*
TF-IDF*
IBS*
LMD*
Ranks
Ranked list 1
Ranked list 2
Ranked list 3
Ranked list 4
Rank Aggregation
Single ranked list
(GP-Agg - based)
* All from Lucene package (http://lucene.apache.org/core/)
GP-Agg Framework
Parameter Value
Number of
generations
30
Genetics
operators
Reproduction, Mutation,
Crossover
Fitness
functions
FFP1, WAS, MAP, NDCG
Rank Agg.
methods
CombMAX, CombMIN,
CombSUM, CombMED,
CombANZ, CombMNZ,
RLSim, BordaCount, RRF,
MRA
Submitted Runs
● Run 1
Textual Features with Genetic Programming
– Image and Video
Descriptors Ranks
Ranked list 1
Ranked list 2
Ranked list 3
Ranked list 4
Rank Aggregation
Single ranked list
(GP-Agg - based)
BM25*
TF-IDF*
IBS*
LMD*
* All from Lucene package (http://lucene.apache.org/core/)
Submitted Runs
● Run 2
Visual Features with RLDA
– Image
Descriptor
BIC
Rank
BIC
Ranked list
Geocoding
Improvement
Improved geocode
(top 100 RLDA-based)
Ranked List Density Analysis
Ranked list
1~N
ID1 LAT1 LONG1
ID2 LAT2 LONG2
… … ...
IDX LATX LONGX
Top X items'
lat/long
defined as
points of a
OPF cluster
Node v:
(lat1,long1)
edge (v, u):
connect k-nn d(v, u)
d(v, u): geo-
distance between v
and u
k=3
u: (lat2,long2)
Submitted Runs
● Run 2
Visual Features with RLDA
– Image
Descriptor
BIC
Rank
BIC
Ranked list
Improved geocode
(top 100 RLDA-based)
Geocoding
Improvement
Submitted Runs
● Run 2
Visual Features with RLDA
– Video
Descriptors
LIRE 1
...
LIRE n
HMP
Ranks
Ranked list 1
...
Ranked list n
Ranked list n+1
Rank Aggregation
Single ranked list
(GP-Agg - based)
Improved geocode
(top 100 RLDA-based)
Geocoding
Improvement
Submitted Runs
● Run 3
Multimodal Solution with RLDA
– Image
Descriptors
BM25
TF-IDF
BS
LMD
BIC
...
SCD
Ranks
Ranked list 1
Ranked list 2
Ranked list 3
Ranked list 4
Ranked list 5
…
Ranked list 8
Rank Aggregation
Single ranked list
(GP-Agg - based)
Submitted Runs
● Run 3
Multimodal Solution with RLDA
– Video
Descriptors
BM25
TF-IDF
BS
LMD
HMP
...
MFCC
Ranks
Ranked list 1
Ranked list 2
Ranked list 3
Ranked list 4
Ranked list 5
…
Ranked list n
Rank Aggregation
Single ranked list
(GP-Agg - based)
Submitted Runs
● Run 4
Textual with RLDA
– Image and Video
Descriptors
BM25
TF-IDF
BS
LMD
Ranks
Ranked list 1
Ranked list 2
Ranked list 3
Ranked list 4
Rank Aggregation
and Improvement
Single ranked list
(top 5 RLDA-based)
Results – 2015 Global
Run 1
Run 2
Run 3
Run 4
0 5 10 15 20 25 30 35 40 45 50
0,15
0
0,14
0,12
0,54
0,01
0,53
0,62
5,49
0,09
5,35
6,44
19,75
0,44
19,11
21,74
36,6
1,99
35,31
38,38
44,89
3,57
43,26
46,91
58,97
20,38
57,67
63,22
1m
10m
100m
1km
10km
100km
1,000km
GP-Agg Combined Textual
GP-Agg Combined Non-textual
GP-Agg Multimodal
Textual RLDL
Results – Median Distance
With Metadata Runs
Run4
Run3
Run1
50 150 250 350 450 550 650 750
309.86
394.89
196.01
Conclusions
● GP-Agg automatically combines lists and
aggregation functions.
● Top-N RLDA improves even GP-Agg results.
● RLDA was better (see Run 4) than using GP-
Agg alone.
● There is room for improving the GP-Agg
approach.
Future Work
● Develop other fitness fuctions in the GP-Agg
approach.
● Use more visual descriptors.
● Evaluate different clustering strategies.
Acknowledgments
● MediaEval 2015
● FAPESP
● CNPq
● CAPES
● Samsung
Thank You!
{lintzyli,pedro.mendes,luis.pereira,rtorres}@ic.unicamp.br,
jurandy.almeida@unifesp.br,rtcalumby@ecomp.uefs.br,
o.penatti@samsung.com,jalvarm.acm@gmail.com,
icaro.dourado@students.ic.unicamp.br,daniel@rc.unesp.br,
{keiller.nogueira,jefersson, mgoncalv}@dcc.ufmg.br
L. T. Li, J. A. V. Muñoz, J. Almeida,
R. T. Calumby, O. A. B. Penatti, I. C. Dourado,
K. Nogueira, P. R. Mendes Júnior,
D. C. G. Pedronette, J. A. dos Santos,
M. A. Gonçalves, and R. S. Torres
Support Slides
Run1 Run3 Run4
Min. : 0.000 Min. : 0.000 Min. : 0.000
1st Qu.: 1.897 1st Qu.: 2.124 1st Qu.: 1.482
Median : 309.865 Median : 394.889 Median : 196.008
Mean : 2913.598 Mean : 2976.632 Mean : 2483.614
3rd Qu.: 5573.930 3rd Qu.: 5766.894 3rd Qu.: 3798.622
Max. :19959.808 Max. :19959.808 Max. :19954.130
Basic analysis of distances (km) in Test set:
from Predicted to Expected lat/long
Run2
Min. : 0
1st Qu.: 1240
Median : 5883
Mean : 5597
3rd Qu.: 8637
Max. :19972
Videos-only Test Results (%)
Run 1: GP-Agg Textual
Run 2: GP-Agg only visual (HMP+ all Lire) + RLDA (top100)
Run 3: GP-Agg multimodal (text, visual, audio)
Run 4: BM25_RLDA (top 5)
Run 1 Run 2 Run 3 Run 4
1m 0.08 0 0.08 0.06
10m 0.4 0 0.37 0.41
100m 5.46 0.01 5.13 5.79
1km 17.62 0.02 16.74 17.89
10km 32.44 0.11 32.1 32.24
100km 40.27 3.8 39.69 39.92
1,000km 54.13 20.39 53.67 55.68
10,000km 90.57 91.97 91.5 93.16
Video-only Summary
Run1 Run2 Run3
Min. : 0.000 Min. : 0.05 Min. : 0.000
1st Qu.: 2.914 1st Qu.: 1304.13 1st Qu.: 3.138
Median : 619.747 Median : 6351.71 Median : 660.864
Mean : 3191.163 Mean : 5709.17 Mean : 3158.724
3rd Qu.: 6035.939 3rd Qu.: 8463.49 3rd Qu.: 6154.147
Max. :19656.624 Max. :19596.38 Max. :19524.628
Run4
Min. : 0.000
1st Qu.: 2.992
Median : 547.648
Mean : 2879.899
3rd Qu.: 5548.780
Max. :19656.624
GP-Agg Individual example – Run 1
Input: bm25 (description, fusion, tags, title), tf-
idf(description, fusion, tags, title), lmd_fusion, ibs_fusion
Individual:
CombSUM( MRA( CombMNZ( RRF(ibs_fusion,lmd_fusion),
CombMNZ(td-idf_fusion, bm25_fusion), RLSim(lmd_fusion, bm25_tags,
tf-idf_fusion)), CombMNZ(CombSUM(tf-idf_tags, tf-idf_fusion, tf-
idf_tags), CombMIN(tf-idf_description, bm25_description, tf-idf_tags),
RLSim(bm25_title, tf-idf_title, tf-idf_title)), RRF(CombMAX(bm25_fusion,
bm25_tags), RRF(tf-idf_fusion, ibs_fusion, tf-idf_fusion), BordaCount(tf-
idf_fusion, bm25_fusion, tf-idf_tags))), RLSim(CombSUM(CombSUM(tf-
idf_tags, tf-idf_tags, lmd_fusion), bm25_fusion), bm25_fusion))
Visual Features (HMP): Extracting
●
Histograms of Motion Patterns
●
Keyframes: Not used
●
Applying an algorithm to compare video
sequence
(1)partial decoding;
(2)feature extraction;
(3)signature generation.
“Comparison of video sequences with histograms of
motion patterns”, J. Almeida et al. ICIP, 2011.
Visual Features (HMP): overview
[Almeida et al., Comparison of video sequences with histograms of motion patterns. ICIP 2011]
HMP: Comparing Video
OPF Density formula
● d(s,t) distânce from s to t (used haversine dist.)
● A(s): list of adjacency of s.
● directed graph

More Related Content

Similar to MediaEval 2015 - RECOD@Placing Task of MediaEval 2015

Multimedia Geocoding: The RECOD 2014 Approach
Multimedia Geocoding: The RECOD 2014 ApproachMultimedia Geocoding: The RECOD 2014 Approach
Multimedia Geocoding: The RECOD 2014 Approach
multimediaeval
 
MediaEval 2016 - RECOD at Placing Task
MediaEval 2016 - RECOD at Placing TaskMediaEval 2016 - RECOD at Placing Task
MediaEval 2016 - RECOD at Placing Task
multimediaeval
 
Real-time Non-Intrusive Speech Quality Estimation of VoIP Using Genetic Progr...
Real-time Non-Intrusive Speech Quality Estimation of VoIP Using Genetic Progr...Real-time Non-Intrusive Speech Quality Estimation of VoIP Using Genetic Progr...
Real-time Non-Intrusive Speech Quality Estimation of VoIP Using Genetic Progr...
adil raja
 
IIBMP2019 講演資料「オープンソースで始める深層学習」
IIBMP2019 講演資料「オープンソースで始める深層学習」IIBMP2019 講演資料「オープンソースで始める深層学習」
IIBMP2019 講演資料「オープンソースで始める深層学習」
Preferred Networks
 
An Evolutionary Approach to Speech Quality Estimation Using Genetic Programming
An Evolutionary Approach to Speech Quality Estimation Using Genetic ProgrammingAn Evolutionary Approach to Speech Quality Estimation Using Genetic Programming
An Evolutionary Approach to Speech Quality Estimation Using Genetic Programming
adil raja
 
An Evolutionary Approach to Speech Quality Estimation
An Evolutionary Approach to Speech Quality EstimationAn Evolutionary Approach to Speech Quality Estimation
An Evolutionary Approach to Speech Quality Estimation
adil raja
 
Icsm19.ppt
Icsm19.pptIcsm19.ppt
Realtime, Non-Intrusive Evaluation of VoIP Using Genetic Programming
Realtime, Non-Intrusive Evaluation of VoIP Using Genetic ProgrammingRealtime, Non-Intrusive Evaluation of VoIP Using Genetic Programming
Realtime, Non-Intrusive Evaluation of VoIP Using Genetic Programming
adil raja
 
Mastering Machine Learning with Competitions
Mastering Machine Learning with CompetitionsMastering Machine Learning with Competitions
Mastering Machine Learning with Competitions
Jeong-Yoon Lee
 
DCU Search Runs at MediaEval 2014 Search and Hyperlinking
DCU Search Runs at MediaEval 2014 Search and HyperlinkingDCU Search Runs at MediaEval 2014 Search and Hyperlinking
DCU Search Runs at MediaEval 2014 Search and Hyperlinking
multimediaeval
 
InstructGPT: Follow instructions with human feedback
InstructGPT: Follow instructions with human feedbackInstructGPT: Follow instructions with human feedback
InstructGPT: Follow instructions with human feedback
Yan Xu
 

Similar to MediaEval 2015 - RECOD@Placing Task of MediaEval 2015 (11)

Multimedia Geocoding: The RECOD 2014 Approach
Multimedia Geocoding: The RECOD 2014 ApproachMultimedia Geocoding: The RECOD 2014 Approach
Multimedia Geocoding: The RECOD 2014 Approach
 
MediaEval 2016 - RECOD at Placing Task
MediaEval 2016 - RECOD at Placing TaskMediaEval 2016 - RECOD at Placing Task
MediaEval 2016 - RECOD at Placing Task
 
Real-time Non-Intrusive Speech Quality Estimation of VoIP Using Genetic Progr...
Real-time Non-Intrusive Speech Quality Estimation of VoIP Using Genetic Progr...Real-time Non-Intrusive Speech Quality Estimation of VoIP Using Genetic Progr...
Real-time Non-Intrusive Speech Quality Estimation of VoIP Using Genetic Progr...
 
IIBMP2019 講演資料「オープンソースで始める深層学習」
IIBMP2019 講演資料「オープンソースで始める深層学習」IIBMP2019 講演資料「オープンソースで始める深層学習」
IIBMP2019 講演資料「オープンソースで始める深層学習」
 
An Evolutionary Approach to Speech Quality Estimation Using Genetic Programming
An Evolutionary Approach to Speech Quality Estimation Using Genetic ProgrammingAn Evolutionary Approach to Speech Quality Estimation Using Genetic Programming
An Evolutionary Approach to Speech Quality Estimation Using Genetic Programming
 
An Evolutionary Approach to Speech Quality Estimation
An Evolutionary Approach to Speech Quality EstimationAn Evolutionary Approach to Speech Quality Estimation
An Evolutionary Approach to Speech Quality Estimation
 
Icsm19.ppt
Icsm19.pptIcsm19.ppt
Icsm19.ppt
 
Realtime, Non-Intrusive Evaluation of VoIP Using Genetic Programming
Realtime, Non-Intrusive Evaluation of VoIP Using Genetic ProgrammingRealtime, Non-Intrusive Evaluation of VoIP Using Genetic Programming
Realtime, Non-Intrusive Evaluation of VoIP Using Genetic Programming
 
Mastering Machine Learning with Competitions
Mastering Machine Learning with CompetitionsMastering Machine Learning with Competitions
Mastering Machine Learning with Competitions
 
DCU Search Runs at MediaEval 2014 Search and Hyperlinking
DCU Search Runs at MediaEval 2014 Search and HyperlinkingDCU Search Runs at MediaEval 2014 Search and Hyperlinking
DCU Search Runs at MediaEval 2014 Search and Hyperlinking
 
InstructGPT: Follow instructions with human feedback
InstructGPT: Follow instructions with human feedbackInstructGPT: Follow instructions with human feedback
InstructGPT: Follow instructions with human feedback
 

More from multimediaeval

Classification of Strokes in Table Tennis with a Three Stream Spatio-Temporal...
Classification of Strokes in Table Tennis with a Three Stream Spatio-Temporal...Classification of Strokes in Table Tennis with a Three Stream Spatio-Temporal...
Classification of Strokes in Table Tennis with a Three Stream Spatio-Temporal...
multimediaeval
 
HCMUS at MediaEval 2020: Ensembles of Temporal Deep Neural Networks for Table...
HCMUS at MediaEval 2020: Ensembles of Temporal Deep Neural Networks for Table...HCMUS at MediaEval 2020: Ensembles of Temporal Deep Neural Networks for Table...
HCMUS at MediaEval 2020: Ensembles of Temporal Deep Neural Networks for Table...
multimediaeval
 
Sports Video Classification: Classification of Strokes in Table Tennis for Me...
Sports Video Classification: Classification of Strokes in Table Tennis for Me...Sports Video Classification: Classification of Strokes in Table Tennis for Me...
Sports Video Classification: Classification of Strokes in Table Tennis for Me...
multimediaeval
 
Predicting Media Memorability from a Multimodal Late Fusion of Self-Attention...
Predicting Media Memorability from a Multimodal Late Fusion of Self-Attention...Predicting Media Memorability from a Multimodal Late Fusion of Self-Attention...
Predicting Media Memorability from a Multimodal Late Fusion of Self-Attention...
multimediaeval
 
Essex-NLIP at MediaEval Predicting Media Memorability 2020 Task
Essex-NLIP at MediaEval Predicting Media Memorability 2020 TaskEssex-NLIP at MediaEval Predicting Media Memorability 2020 Task
Essex-NLIP at MediaEval Predicting Media Memorability 2020 Task
multimediaeval
 
Overview of MediaEval 2020 Predicting Media Memorability task: What Makes a V...
Overview of MediaEval 2020 Predicting Media Memorability task: What Makes a V...Overview of MediaEval 2020 Predicting Media Memorability task: What Makes a V...
Overview of MediaEval 2020 Predicting Media Memorability task: What Makes a V...
multimediaeval
 
Fooling an Automatic Image Quality Estimator
Fooling an Automatic Image Quality EstimatorFooling an Automatic Image Quality Estimator
Fooling an Automatic Image Quality Estimator
multimediaeval
 
Fooling Blind Image Quality Assessment by Optimizing a Human-Understandable C...
Fooling Blind Image Quality Assessment by Optimizing a Human-Understandable C...Fooling Blind Image Quality Assessment by Optimizing a Human-Understandable C...
Fooling Blind Image Quality Assessment by Optimizing a Human-Understandable C...
multimediaeval
 
Pixel Privacy: Quality Camouflage for Social Images
Pixel Privacy: Quality Camouflage for Social ImagesPixel Privacy: Quality Camouflage for Social Images
Pixel Privacy: Quality Camouflage for Social Images
multimediaeval
 
HCMUS at MediaEval 2020:Image-Text Fusion for Automatic News-Images Re-Matching
HCMUS at MediaEval 2020:Image-Text Fusion for Automatic News-Images Re-MatchingHCMUS at MediaEval 2020:Image-Text Fusion for Automatic News-Images Re-Matching
HCMUS at MediaEval 2020:Image-Text Fusion for Automatic News-Images Re-Matching
multimediaeval
 
Efficient Supervision Net: Polyp Segmentation using EfficientNet and Attentio...
Efficient Supervision Net: Polyp Segmentation using EfficientNet and Attentio...Efficient Supervision Net: Polyp Segmentation using EfficientNet and Attentio...
Efficient Supervision Net: Polyp Segmentation using EfficientNet and Attentio...
multimediaeval
 
HCMUS at Medico Automatic Polyp Segmentation Task 2020: PraNet and ResUnet++ ...
HCMUS at Medico Automatic Polyp Segmentation Task 2020: PraNet and ResUnet++ ...HCMUS at Medico Automatic Polyp Segmentation Task 2020: PraNet and ResUnet++ ...
HCMUS at Medico Automatic Polyp Segmentation Task 2020: PraNet and ResUnet++ ...
multimediaeval
 
Depth-wise Separable Atrous Convolution for Polyps Segmentation in Gastro-Int...
Depth-wise Separable Atrous Convolution for Polyps Segmentation in Gastro-Int...Depth-wise Separable Atrous Convolution for Polyps Segmentation in Gastro-Int...
Depth-wise Separable Atrous Convolution for Polyps Segmentation in Gastro-Int...
multimediaeval
 
Deep Conditional Adversarial learning for polyp Segmentation
Deep Conditional Adversarial learning for polyp SegmentationDeep Conditional Adversarial learning for polyp Segmentation
Deep Conditional Adversarial learning for polyp Segmentation
multimediaeval
 
A Temporal-Spatial Attention Model for Medical Image Detection
A Temporal-Spatial Attention Model for Medical Image DetectionA Temporal-Spatial Attention Model for Medical Image Detection
A Temporal-Spatial Attention Model for Medical Image Detection
multimediaeval
 
HCMUS-Juniors 2020 at Medico Task in MediaEval 2020: Refined Deep Neural Netw...
HCMUS-Juniors 2020 at Medico Task in MediaEval 2020: Refined Deep Neural Netw...HCMUS-Juniors 2020 at Medico Task in MediaEval 2020: Refined Deep Neural Netw...
HCMUS-Juniors 2020 at Medico Task in MediaEval 2020: Refined Deep Neural Netw...
multimediaeval
 
Fine-tuning for Polyp Segmentation with Attention
Fine-tuning for Polyp Segmentation with AttentionFine-tuning for Polyp Segmentation with Attention
Fine-tuning for Polyp Segmentation with Attention
multimediaeval
 
Bigger Networks are not Always Better: Deep Convolutional Neural Networks for...
Bigger Networks are not Always Better: Deep Convolutional Neural Networks for...Bigger Networks are not Always Better: Deep Convolutional Neural Networks for...
Bigger Networks are not Always Better: Deep Convolutional Neural Networks for...
multimediaeval
 
Insights for wellbeing: Predicting Personal Air Quality Index using Regressio...
Insights for wellbeing: Predicting Personal Air Quality Index using Regressio...Insights for wellbeing: Predicting Personal Air Quality Index using Regressio...
Insights for wellbeing: Predicting Personal Air Quality Index using Regressio...
multimediaeval
 
Use Visual Features From Surrounding Scenes to Improve Personal Air Quality ...
 Use Visual Features From Surrounding Scenes to Improve Personal Air Quality ... Use Visual Features From Surrounding Scenes to Improve Personal Air Quality ...
Use Visual Features From Surrounding Scenes to Improve Personal Air Quality ...
multimediaeval
 

More from multimediaeval (20)

Classification of Strokes in Table Tennis with a Three Stream Spatio-Temporal...
Classification of Strokes in Table Tennis with a Three Stream Spatio-Temporal...Classification of Strokes in Table Tennis with a Three Stream Spatio-Temporal...
Classification of Strokes in Table Tennis with a Three Stream Spatio-Temporal...
 
HCMUS at MediaEval 2020: Ensembles of Temporal Deep Neural Networks for Table...
HCMUS at MediaEval 2020: Ensembles of Temporal Deep Neural Networks for Table...HCMUS at MediaEval 2020: Ensembles of Temporal Deep Neural Networks for Table...
HCMUS at MediaEval 2020: Ensembles of Temporal Deep Neural Networks for Table...
 
Sports Video Classification: Classification of Strokes in Table Tennis for Me...
Sports Video Classification: Classification of Strokes in Table Tennis for Me...Sports Video Classification: Classification of Strokes in Table Tennis for Me...
Sports Video Classification: Classification of Strokes in Table Tennis for Me...
 
Predicting Media Memorability from a Multimodal Late Fusion of Self-Attention...
Predicting Media Memorability from a Multimodal Late Fusion of Self-Attention...Predicting Media Memorability from a Multimodal Late Fusion of Self-Attention...
Predicting Media Memorability from a Multimodal Late Fusion of Self-Attention...
 
Essex-NLIP at MediaEval Predicting Media Memorability 2020 Task
Essex-NLIP at MediaEval Predicting Media Memorability 2020 TaskEssex-NLIP at MediaEval Predicting Media Memorability 2020 Task
Essex-NLIP at MediaEval Predicting Media Memorability 2020 Task
 
Overview of MediaEval 2020 Predicting Media Memorability task: What Makes a V...
Overview of MediaEval 2020 Predicting Media Memorability task: What Makes a V...Overview of MediaEval 2020 Predicting Media Memorability task: What Makes a V...
Overview of MediaEval 2020 Predicting Media Memorability task: What Makes a V...
 
Fooling an Automatic Image Quality Estimator
Fooling an Automatic Image Quality EstimatorFooling an Automatic Image Quality Estimator
Fooling an Automatic Image Quality Estimator
 
Fooling Blind Image Quality Assessment by Optimizing a Human-Understandable C...
Fooling Blind Image Quality Assessment by Optimizing a Human-Understandable C...Fooling Blind Image Quality Assessment by Optimizing a Human-Understandable C...
Fooling Blind Image Quality Assessment by Optimizing a Human-Understandable C...
 
Pixel Privacy: Quality Camouflage for Social Images
Pixel Privacy: Quality Camouflage for Social ImagesPixel Privacy: Quality Camouflage for Social Images
Pixel Privacy: Quality Camouflage for Social Images
 
HCMUS at MediaEval 2020:Image-Text Fusion for Automatic News-Images Re-Matching
HCMUS at MediaEval 2020:Image-Text Fusion for Automatic News-Images Re-MatchingHCMUS at MediaEval 2020:Image-Text Fusion for Automatic News-Images Re-Matching
HCMUS at MediaEval 2020:Image-Text Fusion for Automatic News-Images Re-Matching
 
Efficient Supervision Net: Polyp Segmentation using EfficientNet and Attentio...
Efficient Supervision Net: Polyp Segmentation using EfficientNet and Attentio...Efficient Supervision Net: Polyp Segmentation using EfficientNet and Attentio...
Efficient Supervision Net: Polyp Segmentation using EfficientNet and Attentio...
 
HCMUS at Medico Automatic Polyp Segmentation Task 2020: PraNet and ResUnet++ ...
HCMUS at Medico Automatic Polyp Segmentation Task 2020: PraNet and ResUnet++ ...HCMUS at Medico Automatic Polyp Segmentation Task 2020: PraNet and ResUnet++ ...
HCMUS at Medico Automatic Polyp Segmentation Task 2020: PraNet and ResUnet++ ...
 
Depth-wise Separable Atrous Convolution for Polyps Segmentation in Gastro-Int...
Depth-wise Separable Atrous Convolution for Polyps Segmentation in Gastro-Int...Depth-wise Separable Atrous Convolution for Polyps Segmentation in Gastro-Int...
Depth-wise Separable Atrous Convolution for Polyps Segmentation in Gastro-Int...
 
Deep Conditional Adversarial learning for polyp Segmentation
Deep Conditional Adversarial learning for polyp SegmentationDeep Conditional Adversarial learning for polyp Segmentation
Deep Conditional Adversarial learning for polyp Segmentation
 
A Temporal-Spatial Attention Model for Medical Image Detection
A Temporal-Spatial Attention Model for Medical Image DetectionA Temporal-Spatial Attention Model for Medical Image Detection
A Temporal-Spatial Attention Model for Medical Image Detection
 
HCMUS-Juniors 2020 at Medico Task in MediaEval 2020: Refined Deep Neural Netw...
HCMUS-Juniors 2020 at Medico Task in MediaEval 2020: Refined Deep Neural Netw...HCMUS-Juniors 2020 at Medico Task in MediaEval 2020: Refined Deep Neural Netw...
HCMUS-Juniors 2020 at Medico Task in MediaEval 2020: Refined Deep Neural Netw...
 
Fine-tuning for Polyp Segmentation with Attention
Fine-tuning for Polyp Segmentation with AttentionFine-tuning for Polyp Segmentation with Attention
Fine-tuning for Polyp Segmentation with Attention
 
Bigger Networks are not Always Better: Deep Convolutional Neural Networks for...
Bigger Networks are not Always Better: Deep Convolutional Neural Networks for...Bigger Networks are not Always Better: Deep Convolutional Neural Networks for...
Bigger Networks are not Always Better: Deep Convolutional Neural Networks for...
 
Insights for wellbeing: Predicting Personal Air Quality Index using Regressio...
Insights for wellbeing: Predicting Personal Air Quality Index using Regressio...Insights for wellbeing: Predicting Personal Air Quality Index using Regressio...
Insights for wellbeing: Predicting Personal Air Quality Index using Regressio...
 
Use Visual Features From Surrounding Scenes to Improve Personal Air Quality ...
 Use Visual Features From Surrounding Scenes to Improve Personal Air Quality ... Use Visual Features From Surrounding Scenes to Improve Personal Air Quality ...
Use Visual Features From Surrounding Scenes to Improve Personal Air Quality ...
 

Recently uploaded

Acetabularia Information For Class 9 .docx
Acetabularia Information For Class 9  .docxAcetabularia Information For Class 9  .docx
Acetabularia Information For Class 9 .docx
vaibhavrinwa19
 
Francesca Gottschalk - How can education support child empowerment.pptx
Francesca Gottschalk - How can education support child empowerment.pptxFrancesca Gottschalk - How can education support child empowerment.pptx
Francesca Gottschalk - How can education support child empowerment.pptx
EduSkills OECD
 
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup   New Member Orientation and Q&A (May 2024).pdfWelcome to TechSoup   New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
TechSoup
 
Chapter 3 - Islamic Banking Products and Services.pptx
Chapter 3 - Islamic Banking Products and Services.pptxChapter 3 - Islamic Banking Products and Services.pptx
Chapter 3 - Islamic Banking Products and Services.pptx
Mohd Adib Abd Muin, Senior Lecturer at Universiti Utara Malaysia
 
"Protectable subject matters, Protection in biotechnology, Protection of othe...
"Protectable subject matters, Protection in biotechnology, Protection of othe..."Protectable subject matters, Protection in biotechnology, Protection of othe...
"Protectable subject matters, Protection in biotechnology, Protection of othe...
SACHIN R KONDAGURI
 
Multithreading_in_C++ - std::thread, race condition
Multithreading_in_C++ - std::thread, race conditionMultithreading_in_C++ - std::thread, race condition
Multithreading_in_C++ - std::thread, race condition
Mohammed Sikander
 
2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...
Sandy Millin
 
A Strategic Approach: GenAI in Education
A Strategic Approach: GenAI in EducationA Strategic Approach: GenAI in Education
A Strategic Approach: GenAI in Education
Peter Windle
 
Unit 8 - Information and Communication Technology (Paper I).pdf
Unit 8 - Information and Communication Technology (Paper I).pdfUnit 8 - Information and Communication Technology (Paper I).pdf
Unit 8 - Information and Communication Technology (Paper I).pdf
Thiyagu K
 
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
MysoreMuleSoftMeetup
 
Thesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.pptThesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.ppt
EverAndrsGuerraGuerr
 
Guidance_and_Counselling.pdf B.Ed. 4th Semester
Guidance_and_Counselling.pdf B.Ed. 4th SemesterGuidance_and_Counselling.pdf B.Ed. 4th Semester
Guidance_and_Counselling.pdf B.Ed. 4th Semester
Atul Kumar Singh
 
Unit 2- Research Aptitude (UGC NET Paper I).pdf
Unit 2- Research Aptitude (UGC NET Paper I).pdfUnit 2- Research Aptitude (UGC NET Paper I).pdf
Unit 2- Research Aptitude (UGC NET Paper I).pdf
Thiyagu K
 
Embracing GenAI - A Strategic Imperative
Embracing GenAI - A Strategic ImperativeEmbracing GenAI - A Strategic Imperative
Embracing GenAI - A Strategic Imperative
Peter Windle
 
Overview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with MechanismOverview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with Mechanism
DeeptiGupta154
 
1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx
JosvitaDsouza2
 
special B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdfspecial B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdf
Special education needs
 
Home assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdfHome assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdf
Tamralipta Mahavidyalaya
 
TESDA TM1 REVIEWER FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
TESDA TM1 REVIEWER  FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...TESDA TM1 REVIEWER  FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
TESDA TM1 REVIEWER FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
EugeneSaldivar
 
The Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official PublicationThe Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official Publication
Delapenabediema
 

Recently uploaded (20)

Acetabularia Information For Class 9 .docx
Acetabularia Information For Class 9  .docxAcetabularia Information For Class 9  .docx
Acetabularia Information For Class 9 .docx
 
Francesca Gottschalk - How can education support child empowerment.pptx
Francesca Gottschalk - How can education support child empowerment.pptxFrancesca Gottschalk - How can education support child empowerment.pptx
Francesca Gottschalk - How can education support child empowerment.pptx
 
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup   New Member Orientation and Q&A (May 2024).pdfWelcome to TechSoup   New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
 
Chapter 3 - Islamic Banking Products and Services.pptx
Chapter 3 - Islamic Banking Products and Services.pptxChapter 3 - Islamic Banking Products and Services.pptx
Chapter 3 - Islamic Banking Products and Services.pptx
 
"Protectable subject matters, Protection in biotechnology, Protection of othe...
"Protectable subject matters, Protection in biotechnology, Protection of othe..."Protectable subject matters, Protection in biotechnology, Protection of othe...
"Protectable subject matters, Protection in biotechnology, Protection of othe...
 
Multithreading_in_C++ - std::thread, race condition
Multithreading_in_C++ - std::thread, race conditionMultithreading_in_C++ - std::thread, race condition
Multithreading_in_C++ - std::thread, race condition
 
2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...
 
A Strategic Approach: GenAI in Education
A Strategic Approach: GenAI in EducationA Strategic Approach: GenAI in Education
A Strategic Approach: GenAI in Education
 
Unit 8 - Information and Communication Technology (Paper I).pdf
Unit 8 - Information and Communication Technology (Paper I).pdfUnit 8 - Information and Communication Technology (Paper I).pdf
Unit 8 - Information and Communication Technology (Paper I).pdf
 
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
 
Thesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.pptThesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.ppt
 
Guidance_and_Counselling.pdf B.Ed. 4th Semester
Guidance_and_Counselling.pdf B.Ed. 4th SemesterGuidance_and_Counselling.pdf B.Ed. 4th Semester
Guidance_and_Counselling.pdf B.Ed. 4th Semester
 
Unit 2- Research Aptitude (UGC NET Paper I).pdf
Unit 2- Research Aptitude (UGC NET Paper I).pdfUnit 2- Research Aptitude (UGC NET Paper I).pdf
Unit 2- Research Aptitude (UGC NET Paper I).pdf
 
Embracing GenAI - A Strategic Imperative
Embracing GenAI - A Strategic ImperativeEmbracing GenAI - A Strategic Imperative
Embracing GenAI - A Strategic Imperative
 
Overview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with MechanismOverview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with Mechanism
 
1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx
 
special B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdfspecial B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdf
 
Home assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdfHome assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdf
 
TESDA TM1 REVIEWER FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
TESDA TM1 REVIEWER  FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...TESDA TM1 REVIEWER  FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
TESDA TM1 REVIEWER FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
 
The Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official PublicationThe Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official Publication
 

MediaEval 2015 - RECOD@Placing Task of MediaEval 2015

  • 1. RECOD @ Placing Task of MediaEval 2015 L. T. Li1 , J. A. V. Muñoz1 , J. Almeida1,3 , R. T. Calumby1,4 , O. A. B. Penatti1,2 , I. C. Dourado1 , K. Nogueira6 , P. R. Mendes Júnior1 , D. C. G. Pedronette1,5 , J. A. dos Santos6 , M. A. Gonçalves6 , and R. S. Torres1 Daniel Moreira On behalf of the authors. 1. UNICAMP – 2. UNIFESP – 3. SAMSUNG – 4. UEFS – 5. UNESP – 6. UFMG – BRAZIL –
  • 2. 2015 Participation ● We focused on the Localization subtask; ● Innovations concerning the last year (2014) – Rank agreggation based on Genetic Programming; – Geocoding improvement with Ranked List Density Analysis (RLDA).
  • 3. Submitted Runs ● Run 1 Textual Features with Genetic Programming – Image and Video Descriptors BM25* TF-IDF* IBS* LMD* Ranks Ranked list 1 Ranked list 2 Ranked list 3 Ranked list 4 Rank Aggregation Single ranked list (GP-Agg - based) * All from Lucene package (http://lucene.apache.org/core/)
  • 4. GP-Agg Framework Parameter Value Number of generations 30 Genetics operators Reproduction, Mutation, Crossover Fitness functions FFP1, WAS, MAP, NDCG Rank Agg. methods CombMAX, CombMIN, CombSUM, CombMED, CombANZ, CombMNZ, RLSim, BordaCount, RRF, MRA
  • 5. Submitted Runs ● Run 1 Textual Features with Genetic Programming – Image and Video Descriptors Ranks Ranked list 1 Ranked list 2 Ranked list 3 Ranked list 4 Rank Aggregation Single ranked list (GP-Agg - based) BM25* TF-IDF* IBS* LMD* * All from Lucene package (http://lucene.apache.org/core/)
  • 6. Submitted Runs ● Run 2 Visual Features with RLDA – Image Descriptor BIC Rank BIC Ranked list Geocoding Improvement Improved geocode (top 100 RLDA-based)
  • 7. Ranked List Density Analysis Ranked list 1~N ID1 LAT1 LONG1 ID2 LAT2 LONG2 … … ... IDX LATX LONGX Top X items' lat/long defined as points of a OPF cluster Node v: (lat1,long1) edge (v, u): connect k-nn d(v, u) d(v, u): geo- distance between v and u k=3 u: (lat2,long2)
  • 8. Submitted Runs ● Run 2 Visual Features with RLDA – Image Descriptor BIC Rank BIC Ranked list Improved geocode (top 100 RLDA-based) Geocoding Improvement
  • 9. Submitted Runs ● Run 2 Visual Features with RLDA – Video Descriptors LIRE 1 ... LIRE n HMP Ranks Ranked list 1 ... Ranked list n Ranked list n+1 Rank Aggregation Single ranked list (GP-Agg - based) Improved geocode (top 100 RLDA-based) Geocoding Improvement
  • 10. Submitted Runs ● Run 3 Multimodal Solution with RLDA – Image Descriptors BM25 TF-IDF BS LMD BIC ... SCD Ranks Ranked list 1 Ranked list 2 Ranked list 3 Ranked list 4 Ranked list 5 … Ranked list 8 Rank Aggregation Single ranked list (GP-Agg - based)
  • 11. Submitted Runs ● Run 3 Multimodal Solution with RLDA – Video Descriptors BM25 TF-IDF BS LMD HMP ... MFCC Ranks Ranked list 1 Ranked list 2 Ranked list 3 Ranked list 4 Ranked list 5 … Ranked list n Rank Aggregation Single ranked list (GP-Agg - based)
  • 12. Submitted Runs ● Run 4 Textual with RLDA – Image and Video Descriptors BM25 TF-IDF BS LMD Ranks Ranked list 1 Ranked list 2 Ranked list 3 Ranked list 4 Rank Aggregation and Improvement Single ranked list (top 5 RLDA-based)
  • 13. Results – 2015 Global Run 1 Run 2 Run 3 Run 4 0 5 10 15 20 25 30 35 40 45 50 0,15 0 0,14 0,12 0,54 0,01 0,53 0,62 5,49 0,09 5,35 6,44 19,75 0,44 19,11 21,74 36,6 1,99 35,31 38,38 44,89 3,57 43,26 46,91 58,97 20,38 57,67 63,22 1m 10m 100m 1km 10km 100km 1,000km GP-Agg Combined Textual GP-Agg Combined Non-textual GP-Agg Multimodal Textual RLDL
  • 14. Results – Median Distance With Metadata Runs Run4 Run3 Run1 50 150 250 350 450 550 650 750 309.86 394.89 196.01
  • 15. Conclusions ● GP-Agg automatically combines lists and aggregation functions. ● Top-N RLDA improves even GP-Agg results. ● RLDA was better (see Run 4) than using GP- Agg alone. ● There is room for improving the GP-Agg approach.
  • 16. Future Work ● Develop other fitness fuctions in the GP-Agg approach. ● Use more visual descriptors. ● Evaluate different clustering strategies.
  • 17. Acknowledgments ● MediaEval 2015 ● FAPESP ● CNPq ● CAPES ● Samsung
  • 18. Thank You! {lintzyli,pedro.mendes,luis.pereira,rtorres}@ic.unicamp.br, jurandy.almeida@unifesp.br,rtcalumby@ecomp.uefs.br, o.penatti@samsung.com,jalvarm.acm@gmail.com, icaro.dourado@students.ic.unicamp.br,daniel@rc.unesp.br, {keiller.nogueira,jefersson, mgoncalv}@dcc.ufmg.br L. T. Li, J. A. V. Muñoz, J. Almeida, R. T. Calumby, O. A. B. Penatti, I. C. Dourado, K. Nogueira, P. R. Mendes Júnior, D. C. G. Pedronette, J. A. dos Santos, M. A. Gonçalves, and R. S. Torres
  • 20. Run1 Run3 Run4 Min. : 0.000 Min. : 0.000 Min. : 0.000 1st Qu.: 1.897 1st Qu.: 2.124 1st Qu.: 1.482 Median : 309.865 Median : 394.889 Median : 196.008 Mean : 2913.598 Mean : 2976.632 Mean : 2483.614 3rd Qu.: 5573.930 3rd Qu.: 5766.894 3rd Qu.: 3798.622 Max. :19959.808 Max. :19959.808 Max. :19954.130 Basic analysis of distances (km) in Test set: from Predicted to Expected lat/long Run2 Min. : 0 1st Qu.: 1240 Median : 5883 Mean : 5597 3rd Qu.: 8637 Max. :19972
  • 21. Videos-only Test Results (%) Run 1: GP-Agg Textual Run 2: GP-Agg only visual (HMP+ all Lire) + RLDA (top100) Run 3: GP-Agg multimodal (text, visual, audio) Run 4: BM25_RLDA (top 5) Run 1 Run 2 Run 3 Run 4 1m 0.08 0 0.08 0.06 10m 0.4 0 0.37 0.41 100m 5.46 0.01 5.13 5.79 1km 17.62 0.02 16.74 17.89 10km 32.44 0.11 32.1 32.24 100km 40.27 3.8 39.69 39.92 1,000km 54.13 20.39 53.67 55.68 10,000km 90.57 91.97 91.5 93.16
  • 22. Video-only Summary Run1 Run2 Run3 Min. : 0.000 Min. : 0.05 Min. : 0.000 1st Qu.: 2.914 1st Qu.: 1304.13 1st Qu.: 3.138 Median : 619.747 Median : 6351.71 Median : 660.864 Mean : 3191.163 Mean : 5709.17 Mean : 3158.724 3rd Qu.: 6035.939 3rd Qu.: 8463.49 3rd Qu.: 6154.147 Max. :19656.624 Max. :19596.38 Max. :19524.628 Run4 Min. : 0.000 1st Qu.: 2.992 Median : 547.648 Mean : 2879.899 3rd Qu.: 5548.780 Max. :19656.624
  • 23. GP-Agg Individual example – Run 1 Input: bm25 (description, fusion, tags, title), tf- idf(description, fusion, tags, title), lmd_fusion, ibs_fusion Individual: CombSUM( MRA( CombMNZ( RRF(ibs_fusion,lmd_fusion), CombMNZ(td-idf_fusion, bm25_fusion), RLSim(lmd_fusion, bm25_tags, tf-idf_fusion)), CombMNZ(CombSUM(tf-idf_tags, tf-idf_fusion, tf- idf_tags), CombMIN(tf-idf_description, bm25_description, tf-idf_tags), RLSim(bm25_title, tf-idf_title, tf-idf_title)), RRF(CombMAX(bm25_fusion, bm25_tags), RRF(tf-idf_fusion, ibs_fusion, tf-idf_fusion), BordaCount(tf- idf_fusion, bm25_fusion, tf-idf_tags))), RLSim(CombSUM(CombSUM(tf- idf_tags, tf-idf_tags, lmd_fusion), bm25_fusion), bm25_fusion))
  • 24. Visual Features (HMP): Extracting ● Histograms of Motion Patterns ● Keyframes: Not used ● Applying an algorithm to compare video sequence (1)partial decoding; (2)feature extraction; (3)signature generation. “Comparison of video sequences with histograms of motion patterns”, J. Almeida et al. ICIP, 2011.
  • 25. Visual Features (HMP): overview [Almeida et al., Comparison of video sequences with histograms of motion patterns. ICIP 2011]
  • 27. OPF Density formula ● d(s,t) distânce from s to t (used haversine dist.) ● A(s): list of adjacency of s. ● directed graph