SlideShare a Scribd company logo
1 of 37
Download to read offline
TRECVID 2018 INSTANCE RETRIEVAL
INTRODUCTION AND TASK OVERVIEW
Wessel Kraaij
Leiden University; The Netherlands
Organisation for Applied Scientific Research TNO;
George Awad
Dakota Consulting ; National Institute of Standards and
Technology
Keith Curtis
National Institute of Standards and Technology
Disclaimer
The identification of any commercial product or trade name does not imply endorsement or
recommendation by the National Institute of Standards and Technology.
TRECVID 2018
Table of contents
•Task Definition
•Data
•Topics (Queries)
•Participating teams
•Evaluation & results
•General observation
2
TRECVID 2018
Task
From 2013 – 2015
• The task asked systems to find a specific object, person or location
in any context using a small set of image and video examples.
In 2016 - 2018
• A new query type was used: find a specific person in a specific
location.
System task:
▪ Given a topic with:
▪ 4 example images of the target person
▪ 4 Region of Interest (ROI)-masked images of the target person
▪ 4 shots from which the target person example images came
▪ 6 to 12 image and video examples of a known location
▪ Return a list of up to 1000 shots ranked by likelihood that they contain
the topic target person in the target location
▪ Automatic or interactive runs are accepted
3
TRECVID 2018
Data …
• The British Broadcasting Corporation (BBC) and the Access
to Audiovisual Archives (AXES) project made 464 h of the
BBC soap opera EastEnders available for research
• 244 weekly “omnibus” files (MPEG-4) from 5 years of broadcasts
• 471527 shots
• Average shot length: 3.5 seconds
• Transcripts from BBC
• Per-file metadata
• Represents a “small world” with a slowly changing set of:
• People (several dozen)
• Locales: homes, workplaces, pubs, cafes, open-air market, clubs
• Objects: clothes, cars, household goods, personal possessions,
pets, etc
• Views: various camera positions, times of year, times of day,
• Use of fan community metadata allowed, if documented
5
TRECVID 20186
Majority of episodes filmed at Elstree
studios. Sometimes filmed on ‘location’.
EastEnders’ world
TRECVID 20187
Topic creation procedure @ NIST
• Viewed several test videos to develop a list of recurring
people, locations and their overlapping.
• Chose 10 master locations and identified 6 to 12 image and
video examples to each depending on location type (private:
kitchen, room, etc; public: pub, café, market, etc)
• Created ≈90 topics targeting recurring specific persons in
specific locations.
• Chose representative sample of 30 topics. Each topic
includes images for target persons from test videos, many
from the sample video (ID 0) and a named location.
• Filtered example shots from the submissions if it satisfies the
topic.
TRECVID 20188
Global test condition: type of training
data
Effect of examples – 2 conditions:
• A – one or more provided images – no video
• E - video examples (+ optional image examples)
TRECVID 20189
Topics – segmented “person” example
images
Chelsea Darrin
Garry Heather
TRECVID 201810
Topics – segmented example images
Jack Jane
Max Minty
TRECVID 201811
Topics – segmented example images
Mo
Zainab
TRECVID 201812
Foyer Kitchen1 Kitchen2 LR1
LaundretteCafe2Cafe1LR2
Topics – 10 Master locations
TRECVID 201813
Topics – 2018
Jane Chelsea Minty Garry Mo Darrin Zainab Heather Jack Max
Cafe2 x x x x x x x x x
Market x x x x x x x
Pub x x x x x x x
Launderette x x x x x x x
30 x topics : find {Chelsea, Darrin, Garry, Heather, Jack, Jane, Max,
Minty, Mo, Zainab} in {Cafe2,Market,Pub,Launderette}
TRECVID 201814
Team Organization Run Types
Submitted
F: automatic, I:
Interactive
BUPT_MCPRL Beijing University of Posts and Telecommunications F_E (3), I_E (1)
HSMW_TUC Chemnitz University of Technology, University of Applied Sciences Mittweida F_A (3), I_A (1)
ITI_CERTH Information Technologies Institute, Centre for Research and Technology Hellas I_A (1)
IRIM EURECOM; LABRI ; LIG ; LIMSI; LISTIC F_A (4), F_E (4)
NII_Hitachi_UIT National Institute of Informatics, Japan (NII); Hitachi, Ltd; University of
Information Technology, VNU-HCM, Vietnam (HCM-UIT)
F_A (4) , I_A(1)
WHU_NERCMS National Engineering Research Center for Multimedia Software,
Wuhan University
F_A (4) , I_A (4)
PLUMCOT LIMSI, Karlsruhe Institute of Technology F_A (3)
PKU_ICST Peking University F_A (3), F_E (3),
I_E (1)
INS 2018: 8 Finishers (out of 17)
TRECVID 201815
Evaluation
For each topic the submissions were pooled and judged down to
max rank 520, resulting in 128117 judged shots (≈ 480 person-h).
• 10 NIST assessors played the clips and determined if they
contained the topic target or not.
• 11717 clips (avg. 390 / topic) contained the topic target (9 %)
• True positives per topic: min 30 med 168 max 1340
• The task is treated as a form of ranking and thus the
trec_eval_video tool was used to calculate average precision,
recall, precision, etc.
• To measure efficiency, speed was also measured.
• In total, 31 automatic and 9 interactive runs were submitted.
TRECVID 201816
Results by team (Automatic)
TRECVID 201817
Results by team (Interactive)
TRECVID 201818
# Query
9230 Find Garry in this Laundrette
9236 Find Darrin in this Laundrette
9241 Find Heather in this Laundrette
9233 Find Mo in this Laundrette
9239 Find Zainab in this Mini-Market
9244 Find Jack in this Laundrette
9237 Find Zainab in this Cafe 2
9238 Find Zainab in this Laundrette
9242 Find Heather in this Mini-Market
9248 Find Max in this Mini-Market
9225 Find Minty in this Cafe 2
9219 Find Jane in this Cafe 2
9229 Find Garry in this Pub
9226 Find Minty in this Pub
9245 Find Jack in this Mini-Market
9228 Find Garry in this Cafe 2
9243 Find Jack in this Pub
9227 Find Minty in this Mini-Market
9240 Find Heather in this Cafe 2
9246 Find Max in this Cafe 2
9221 Find Jane in this Mini-Market
9247 Find Max in this Laundrette
9223 Find Chelsea in this Pub
9224 Find Chelsea in this Mini-Market
9235 Find Darrin in this Pub
9232 Find Mo in this Pub
9234 Find Darrin in this Cafe 2
9231 Find Mo in this Cafe 2
9222 Find Chelsea in this Cafe 2
9220 Find Jane in this Pub
Results by topic - automatic
Zainab (0.44)* & Heather (0.558) easy to find.
Chelsea (0.229) & Max (0.235) difficult to find.
Laundrette (0.479) & Mini-Market(0.411) is easy.
Pub(0.259) & Cafe 2(0.211) is hard.
*Mean score of median MAP per character/location
TRECVID 201819
Automatic Run results + Randomization testing
= > > > > > >
= > > > >
= > >
= > >
= > >
= > >
= >
=
=
=
1 2 3 4 5 6 7 8 9 10
> p < 0.05
0.463 F_E_PKU_ICST_1
0.459 F_E_PKU_ICST_4
0.443 F_A_IRIM_2
0.442 F_A_IRIM_1
0.437 F_E_IRIM_2
0.433 F_E_IRIM_1
0.429 F_A_PKU_ICST_3
0.420 F_A_PKU_ICST_6
0.398 F_A_IRIM_3
0.395 F_E_IRIM_3
MAP Top 10 runs across all teams (automatic)
p = probability the row run scored better than the column run due to chance
TRECVID 201820
2017 (s)
2016 (s)
2018 (s)
Mean Average Precision vs. per run clock processing time (automatic)
IRIM
runs
TRECVID 201821
Results by topic - interactive
# Query
9230 Find Garry in this Laundrette
9228 Find Garry in this Cafe 2
9233 Find Mo in this Laundrette
9236 Find Darrin in this Laundrette
9239 Find Zainab in this Mini-Market
9238 Find Zainab in this Laundrette
9225 Find Minty in this Cafe 2
9229 Find Garry in this Pub
9221 Find Jane in this Mini-Market
9226 Find Minty in this Pub
9237 Find Zainab in this Cafe 2
9223 Find Chelsea in this Pub
9224 Find Chelsea in this Mini-Market
9222 Find Chelsea in this Cafe 2
9232 Find Mo in this Pub
9227 Find Minty in this Mini-Market
9219 Find Jane in this Cafe 2
9235 Find Darrin in this Pub
9220 Find Jane in this Pub
9231 Find Mo in this Cafe 2
9234 Find Darrin in this Cafe 2
Minty(0.319)*, Zainab(0.316) & Garry(0.3) are easy to find.
Jane(0.175), Darrin(0.208) & Chelsea(0.228) are difficult.
Laundrette (0.33) & Mini-Market (0.372) are easy.
Cafe 2 (0.117) & Pub (0.293) are hard.
*Mean score of median MAP per character/location
TRECVID 201822
> p < 0.05
MAP
p = probability the row run scored better than the column run due to chance
0.524 I_E_PKU_ICST_2 = > > > > > > > >
0.447 I_E_BUPT_MCPRL_4 = > > > > > > >
0.367 I_A_NII_Hitachi_UIT_1 = > > > > > >
0.261 I_A_WHU_NERCMS_1 = > > > >
0.252 I_A_HSMW_TUC_4 = > > >
0.235 I_A_WHU_NERCMS_3 = > >
0.200 I_A_WHU_NERCMS_4 = > >
0.184 I_A_WHU_NERCMS_2 = >
0.064 I_A_ITI_CERTH_1 =
1 2 3 4 5 6 7 8 9
ALL 9 runs by all teams (interactive)
Interactive Run Results, Randomization testing
TRECVID 201823
Results by example set (A/E) - automatic
TRECVID 201824
Results by Data Source
TRECVID 201825
Some general observations
about the task
• Slight decrease in number of participants but same number
of finishers – higher % finished.
• Less teams are using E condition - training with video
examples – (e.g tracking characters)
• Interactive search task:
• Limited participation
• Third year: Slight decrease in best performances from 2nd
year – Why? Queries more difficult?
• We encourage teams to test their 2016 or 2017 system on
the 2018 topics or vice versa.
TRECVID 201826
Some general observations
about the task – Data Source
•Best results achieved using external data plus
NIST provided data.
•Next best results are achieved using only the NIST
provided images and video.
•Systems using only external data do not perform
as well as systems which include the NIST
provided data.
TRECVID 201828
Observations over last three
years of the task (Automatic):
•High Score down on last year but still up on 2016.
•Low Scores increasing year on year.
•Mean and Median scores increased significantly
last year on 2016 but have since stabilized.
•Standard Deviation between MAP scores decreased
on last year, now similar to 2016.
•Number of participants has been decreasing year
on year.
TRECVID 201829
Observations over last three
years of the task - Locations
•Laundrette consistently shows to be among the
easiest locations to recognize.
•Pub consistently shows to be among the most
difficult locations to recognize.
2016 2017 2018
Pub 0.021 N/A 0.259
Laundrette 0.172 0.338 0.479
Market N/A 0.343 0.411
Average scores (automatic systems) across all topics for
common locations per year
TRECVID 201830
BUPT-MCPRL
•Location Retrieval: Two Independent Methods:
• Hessian-Affine detector with RootSIFT, MSER detector with
RootSIFT, and CNN features.
• Fine-Tuned publicly available VGG-16 model, GoogleNet
model, and ResNet-152 models.
•Person Retrieval: Face retrieval and transcript-
based search:
• Detect face on key frames captures from video by MTCNN,
face representations extracted from bounding-box and
cosine distance is employed to match faces.
• Transcript search - locate character name in transcripts.
•Submitted 4 runs
• Three automatic runs
• One interactive run
TRECVID 201831
ITI CERTH
•Focus on interactive task
•VERGE system includes several modes for
navigation:
• Visual similarity (DCNN)
• Visual Concept Retrieval - 346 visual concepts
• Face detection
• Scene similarity
• Multimodal Fusion
•Late fusion of DCNN face descriptors and scene
descriptors
•Submitted 1 interactive run
TRECVID 201832
TU Chemnitz
• Complete overhaul of INS system architecture
using Docker
• Allows combination of various open face
recognition and scene recognition pipelines
• All indexing is done offline, retrieval is very fast
Submitted 3 automatic and 1 interactive runs
TRECVID 201833
IRIM (LaBRI, LIMSI, LIG)
• Combination of two person recognition methods
• One location recognition method
• Late fusion on person methods, additional late
fusion to mix in the location scores
• 2018: focus on person recognition
• Positive impact: data augmentation, faces
reranking
Submitted 8 automatic runs (A and E)
TRECVID 201834
Peking University (ICST)
• Location search: BOW plus CNN
• Person search:
• Query preprocessing based on super
resolution
• Deep models for face recognition
• Text based refinement
• Fusion based on combination of score and rank
based fusion (boosting)
• Filtering noisy shots (outliers)
Submitted 6 automatic runs (A and E), 1 interactive
TRECVID 201836
Overview of submissions (1)
•8 out of 8 teams described INS runs for the TV
notebook
•3 teams will present their INS experiments
3:40 - 4:10, (HSMW_TUC – University of Applied Sciences Mittweida, Chemnitz
University of Technology)
4:10 - 4:40, (NII_Hitachi_UIT – National Institute of Informatics, Japan Hitachi,
Ltd., Japan University of Information Technology, VNU_HCMC, Vietnam)
4:40 - 5:10, (WHU_NERCMS – National Engineering Research Centre for
Multimedia Software, Wuhan University)
5:10 - 5:25, INS Discussion
TRECVID 201837
INS 2019 plans
•Move on to a new query type
• Action instances (drinking, walking, sleeping, talking,
driving, etc)
• Person + Action (Brad fighting)
• Action + Location (e.g drinking in the cafe)
• Mix of the above ?!
•Keep the newly added additional training data
sources.
•Add manual run type ?
TRECVID 201838
INS 2019 plans – Example Action
Images
Heather Talking on PhoneJack Holding Money Ian Drinking in Cafe
Garry Holding Flowers Minty and Mo Singing Jane Holding Baby
TRECVID 201839
Future INS plans – Common queries
• Plans to now include a set of common queries each year to
measure yearly progress.
• 2019 teams will submit runs for 50 queries in total. 30
unique queries plus 20 common queries to be repeated each
year.
• 2020 and 2021 teams to submit runs for 40 queries in total.
20 unique queries plus 20 common queries each year.
• The common queries each year provide a basis for the
comparison of team performances year on year.
TRECVID 201840
Future INS plans – Evaluations
2019 2020 2021
2019 30 + 10A + 10B
2020 20 + 10A + 10B
2021 20 + 10A + 10B
• 2019 Assessors to evaluate just the 30 unique queries for that
year.
• 2020 Assessors to evaluate the 20 unique queries for that year
plus the first 10 common queries (10A), allows to measure
progress on 2019.
• 2021 Assessors to evaluate the 20 unique queries plus the second
10 common queries (10B), allows to measure progress on 2019
and 2020.

More Related Content

Similar to [TRECVID 2018] Instance Search

Maximizing_Supply_Chain_Transparency_SaudiTranstec_Dec2014_Swift
Maximizing_Supply_Chain_Transparency_SaudiTranstec_Dec2014_SwiftMaximizing_Supply_Chain_Transparency_SaudiTranstec_Dec2014_Swift
Maximizing_Supply_Chain_Transparency_SaudiTranstec_Dec2014_SwiftAndrew Swift
 
TRECVID 2019 Ad-hoc Video Search
TRECVID 2019 Ad-hoc Video SearchTRECVID 2019 Ad-hoc Video Search
TRECVID 2019 Ad-hoc Video SearchGeorge Awad
 
Six Sigma: Process Improvement
Six Sigma: Process ImprovementSix Sigma: Process Improvement
Six Sigma: Process ImprovementJohn_Sullivan
 
[TRECVID 2018] Ad-hoc Video Search
[TRECVID 2018] Ad-hoc Video Search[TRECVID 2018] Ad-hoc Video Search
[TRECVID 2018] Ad-hoc Video SearchGeorge Awad
 
Using Social Media Data for Online Television Recommendation Services at RTÉ ...
Using Social Media Data for Online Television Recommendation Services at RTÉ ...Using Social Media Data for Online Television Recommendation Services at RTÉ ...
Using Social Media Data for Online Television Recommendation Services at RTÉ ...Andrea Barraza-Urbina
 
Eating the enterprise mobile elephant - Digital Workplace Group (DWG)
Eating the enterprise mobile elephant - Digital Workplace Group (DWG)Eating the enterprise mobile elephant - Digital Workplace Group (DWG)
Eating the enterprise mobile elephant - Digital Workplace Group (DWG)Digital Workplace Group
 
Dynamic Search and Beyond
Dynamic Search and BeyondDynamic Search and Beyond
Dynamic Search and BeyondGrace Hui Yang
 
Coffee Machine Case Study
Coffee Machine Case StudyCoffee Machine Case Study
Coffee Machine Case StudyDragon Sourcing
 
Case Study-Sourcing of Automatic Coffee Machine
Case Study-Sourcing of Automatic Coffee MachineCase Study-Sourcing of Automatic Coffee Machine
Case Study-Sourcing of Automatic Coffee MachineJohn William
 
Machine Learning and Industrie 4.0
Machine Learning and Industrie 4.0Machine Learning and Industrie 4.0
Machine Learning and Industrie 4.0Peter Schleinitz
 
Churn Prediction in Practice
Churn Prediction in PracticeChurn Prediction in Practice
Churn Prediction in PracticeBigData Republic
 
EPG content recommendation in large scale: a case study on interactive TV pla...
EPG content recommendation in large scale: a case study on interactive TV pla...EPG content recommendation in large scale: a case study on interactive TV pla...
EPG content recommendation in large scale: a case study on interactive TV pla...David Zibriczky
 
Agile London at Ticketmaster
Agile London at TicketmasterAgile London at Ticketmaster
Agile London at TicketmasterBilly Jenkins
 
Love It or List It: Challenges to Staying in DITA
Love It or List It: Challenges to Staying in DITALove It or List It: Challenges to Staying in DITA
Love It or List It: Challenges to Staying in DITAStan Doherty
 
Public Data and Data Mining Competitions - What are Lessons?
Public Data and Data Mining Competitions - What are Lessons?Public Data and Data Mining Competitions - What are Lessons?
Public Data and Data Mining Competitions - What are Lessons?Gregory Piatetsky-Shapiro
 
Data Mining & Predictive Analytics - Lesson 14 - Concepts Recapitulation and ...
Data Mining & Predictive Analytics - Lesson 14 - Concepts Recapitulation and ...Data Mining & Predictive Analytics - Lesson 14 - Concepts Recapitulation and ...
Data Mining & Predictive Analytics - Lesson 14 - Concepts Recapitulation and ...Michael Lew
 

Similar to [TRECVID 2018] Instance Search (20)

Maximizing_Supply_Chain_Transparency_SaudiTranstec_Dec2014_Swift
Maximizing_Supply_Chain_Transparency_SaudiTranstec_Dec2014_SwiftMaximizing_Supply_Chain_Transparency_SaudiTranstec_Dec2014_Swift
Maximizing_Supply_Chain_Transparency_SaudiTranstec_Dec2014_Swift
 
TRECVID 2019 Ad-hoc Video Search
TRECVID 2019 Ad-hoc Video SearchTRECVID 2019 Ad-hoc Video Search
TRECVID 2019 Ad-hoc Video Search
 
Six Sigma: Process Improvement
Six Sigma: Process ImprovementSix Sigma: Process Improvement
Six Sigma: Process Improvement
 
[TRECVID 2018] Ad-hoc Video Search
[TRECVID 2018] Ad-hoc Video Search[TRECVID 2018] Ad-hoc Video Search
[TRECVID 2018] Ad-hoc Video Search
 
Using Social Media Data for Online Television Recommendation Services at RTÉ ...
Using Social Media Data for Online Television Recommendation Services at RTÉ ...Using Social Media Data for Online Television Recommendation Services at RTÉ ...
Using Social Media Data for Online Television Recommendation Services at RTÉ ...
 
Eating the enterprise mobile elephant - Digital Workplace Group (DWG)
Eating the enterprise mobile elephant - Digital Workplace Group (DWG)Eating the enterprise mobile elephant - Digital Workplace Group (DWG)
Eating the enterprise mobile elephant - Digital Workplace Group (DWG)
 
Using Transactional Data to Determine the Usual Environment of Cardholders
Using Transactional Data to Determine the Usual Environment of CardholdersUsing Transactional Data to Determine the Usual Environment of Cardholders
Using Transactional Data to Determine the Usual Environment of Cardholders
 
Dynamic Search and Beyond
Dynamic Search and BeyondDynamic Search and Beyond
Dynamic Search and Beyond
 
Coffee Machine Case Study
Coffee Machine Case StudyCoffee Machine Case Study
Coffee Machine Case Study
 
Case Study-Sourcing of Automatic Coffee Machine
Case Study-Sourcing of Automatic Coffee MachineCase Study-Sourcing of Automatic Coffee Machine
Case Study-Sourcing of Automatic Coffee Machine
 
INNOVATION IMPACT
INNOVATION IMPACTINNOVATION IMPACT
INNOVATION IMPACT
 
PyData Paris 2015 - Track 2.3 AXA
PyData Paris 2015 - Track 2.3 AXA PyData Paris 2015 - Track 2.3 AXA
PyData Paris 2015 - Track 2.3 AXA
 
Machine Learning and Industrie 4.0
Machine Learning and Industrie 4.0Machine Learning and Industrie 4.0
Machine Learning and Industrie 4.0
 
Churn Prediction in Practice
Churn Prediction in PracticeChurn Prediction in Practice
Churn Prediction in Practice
 
EPG content recommendation in large scale: a case study on interactive TV pla...
EPG content recommendation in large scale: a case study on interactive TV pla...EPG content recommendation in large scale: a case study on interactive TV pla...
EPG content recommendation in large scale: a case study on interactive TV pla...
 
Bob Moesta | JTBD Workshop
Bob Moesta | JTBD WorkshopBob Moesta | JTBD Workshop
Bob Moesta | JTBD Workshop
 
Agile London at Ticketmaster
Agile London at TicketmasterAgile London at Ticketmaster
Agile London at Ticketmaster
 
Love It or List It: Challenges to Staying in DITA
Love It or List It: Challenges to Staying in DITALove It or List It: Challenges to Staying in DITA
Love It or List It: Challenges to Staying in DITA
 
Public Data and Data Mining Competitions - What are Lessons?
Public Data and Data Mining Competitions - What are Lessons?Public Data and Data Mining Competitions - What are Lessons?
Public Data and Data Mining Competitions - What are Lessons?
 
Data Mining & Predictive Analytics - Lesson 14 - Concepts Recapitulation and ...
Data Mining & Predictive Analytics - Lesson 14 - Concepts Recapitulation and ...Data Mining & Predictive Analytics - Lesson 14 - Concepts Recapitulation and ...
Data Mining & Predictive Analytics - Lesson 14 - Concepts Recapitulation and ...
 

More from George Awad

Movie Summarization at TRECVID 2022
Movie Summarization at TRECVID 2022Movie Summarization at TRECVID 2022
Movie Summarization at TRECVID 2022George Awad
 
Disaster Scene Indexing at TRECVID 2022
Disaster Scene Indexing at TRECVID 2022Disaster Scene Indexing at TRECVID 2022
Disaster Scene Indexing at TRECVID 2022George Awad
 
Activity Detection at TRECVID 2022
Activity Detection at TRECVID 2022Activity Detection at TRECVID 2022
Activity Detection at TRECVID 2022George Awad
 
Multimedia Understanding at TRECVID 2022
Multimedia Understanding at TRECVID 2022Multimedia Understanding at TRECVID 2022
Multimedia Understanding at TRECVID 2022George Awad
 
Video Search at TRECVID 2022
Video Search at TRECVID 2022Video Search at TRECVID 2022
Video Search at TRECVID 2022George Awad
 
Video Captioning at TRECVID 2022
Video Captioning at TRECVID 2022Video Captioning at TRECVID 2022
Video Captioning at TRECVID 2022George Awad
 
TRECVID 2019 Video to Text
TRECVID 2019 Video to TextTRECVID 2019 Video to Text
TRECVID 2019 Video to TextGeorge Awad
 
TRECVID 2019 introduction slides
TRECVID 2019 introduction slidesTRECVID 2019 introduction slides
TRECVID 2019 introduction slidesGeorge Awad
 
Instance Search (INS) task at TRECVID
Instance Search (INS) task at TRECVIDInstance Search (INS) task at TRECVID
Instance Search (INS) task at TRECVIDGeorge Awad
 
Activity detection at TRECVID
Activity detection at TRECVIDActivity detection at TRECVID
Activity detection at TRECVIDGeorge Awad
 
The Video-to-Text task evaluation at TRECVID
The Video-to-Text task evaluation at TRECVIDThe Video-to-Text task evaluation at TRECVID
The Video-to-Text task evaluation at TRECVIDGeorge Awad
 
Introduction about TRECVID (The Video Retreival benchmark)
Introduction about TRECVID (The Video Retreival benchmark)Introduction about TRECVID (The Video Retreival benchmark)
Introduction about TRECVID (The Video Retreival benchmark)George Awad
 
TRECVID 2016 : Multimedia event Detection
TRECVID 2016 : Multimedia event DetectionTRECVID 2016 : Multimedia event Detection
TRECVID 2016 : Multimedia event DetectionGeorge Awad
 
TRECVID 2016 : Concept Localization
TRECVID 2016 : Concept LocalizationTRECVID 2016 : Concept Localization
TRECVID 2016 : Concept LocalizationGeorge Awad
 
TRECVID 2016 : Video to Text Description
TRECVID 2016 : Video to Text DescriptionTRECVID 2016 : Video to Text Description
TRECVID 2016 : Video to Text DescriptionGeorge Awad
 
TRECVID 2016 : Ad-hoc Video Search
TRECVID 2016 : Ad-hoc Video Search TRECVID 2016 : Ad-hoc Video Search
TRECVID 2016 : Ad-hoc Video Search George Awad
 
TRECVID 2016 workshop
TRECVID 2016 workshopTRECVID 2016 workshop
TRECVID 2016 workshopGeorge Awad
 

More from George Awad (17)

Movie Summarization at TRECVID 2022
Movie Summarization at TRECVID 2022Movie Summarization at TRECVID 2022
Movie Summarization at TRECVID 2022
 
Disaster Scene Indexing at TRECVID 2022
Disaster Scene Indexing at TRECVID 2022Disaster Scene Indexing at TRECVID 2022
Disaster Scene Indexing at TRECVID 2022
 
Activity Detection at TRECVID 2022
Activity Detection at TRECVID 2022Activity Detection at TRECVID 2022
Activity Detection at TRECVID 2022
 
Multimedia Understanding at TRECVID 2022
Multimedia Understanding at TRECVID 2022Multimedia Understanding at TRECVID 2022
Multimedia Understanding at TRECVID 2022
 
Video Search at TRECVID 2022
Video Search at TRECVID 2022Video Search at TRECVID 2022
Video Search at TRECVID 2022
 
Video Captioning at TRECVID 2022
Video Captioning at TRECVID 2022Video Captioning at TRECVID 2022
Video Captioning at TRECVID 2022
 
TRECVID 2019 Video to Text
TRECVID 2019 Video to TextTRECVID 2019 Video to Text
TRECVID 2019 Video to Text
 
TRECVID 2019 introduction slides
TRECVID 2019 introduction slidesTRECVID 2019 introduction slides
TRECVID 2019 introduction slides
 
Instance Search (INS) task at TRECVID
Instance Search (INS) task at TRECVIDInstance Search (INS) task at TRECVID
Instance Search (INS) task at TRECVID
 
Activity detection at TRECVID
Activity detection at TRECVIDActivity detection at TRECVID
Activity detection at TRECVID
 
The Video-to-Text task evaluation at TRECVID
The Video-to-Text task evaluation at TRECVIDThe Video-to-Text task evaluation at TRECVID
The Video-to-Text task evaluation at TRECVID
 
Introduction about TRECVID (The Video Retreival benchmark)
Introduction about TRECVID (The Video Retreival benchmark)Introduction about TRECVID (The Video Retreival benchmark)
Introduction about TRECVID (The Video Retreival benchmark)
 
TRECVID 2016 : Multimedia event Detection
TRECVID 2016 : Multimedia event DetectionTRECVID 2016 : Multimedia event Detection
TRECVID 2016 : Multimedia event Detection
 
TRECVID 2016 : Concept Localization
TRECVID 2016 : Concept LocalizationTRECVID 2016 : Concept Localization
TRECVID 2016 : Concept Localization
 
TRECVID 2016 : Video to Text Description
TRECVID 2016 : Video to Text DescriptionTRECVID 2016 : Video to Text Description
TRECVID 2016 : Video to Text Description
 
TRECVID 2016 : Ad-hoc Video Search
TRECVID 2016 : Ad-hoc Video Search TRECVID 2016 : Ad-hoc Video Search
TRECVID 2016 : Ad-hoc Video Search
 
TRECVID 2016 workshop
TRECVID 2016 workshopTRECVID 2016 workshop
TRECVID 2016 workshop
 

Recently uploaded

Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?Antenna Manufacturer Coco
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 

Recently uploaded (20)

Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 

[TRECVID 2018] Instance Search

  • 1. TRECVID 2018 INSTANCE RETRIEVAL INTRODUCTION AND TASK OVERVIEW Wessel Kraaij Leiden University; The Netherlands Organisation for Applied Scientific Research TNO; George Awad Dakota Consulting ; National Institute of Standards and Technology Keith Curtis National Institute of Standards and Technology Disclaimer The identification of any commercial product or trade name does not imply endorsement or recommendation by the National Institute of Standards and Technology.
  • 2. TRECVID 2018 Table of contents •Task Definition •Data •Topics (Queries) •Participating teams •Evaluation & results •General observation 2
  • 3. TRECVID 2018 Task From 2013 – 2015 • The task asked systems to find a specific object, person or location in any context using a small set of image and video examples. In 2016 - 2018 • A new query type was used: find a specific person in a specific location. System task: ▪ Given a topic with: ▪ 4 example images of the target person ▪ 4 Region of Interest (ROI)-masked images of the target person ▪ 4 shots from which the target person example images came ▪ 6 to 12 image and video examples of a known location ▪ Return a list of up to 1000 shots ranked by likelihood that they contain the topic target person in the target location ▪ Automatic or interactive runs are accepted 3
  • 4. TRECVID 2018 Data … • The British Broadcasting Corporation (BBC) and the Access to Audiovisual Archives (AXES) project made 464 h of the BBC soap opera EastEnders available for research • 244 weekly “omnibus” files (MPEG-4) from 5 years of broadcasts • 471527 shots • Average shot length: 3.5 seconds • Transcripts from BBC • Per-file metadata • Represents a “small world” with a slowly changing set of: • People (several dozen) • Locales: homes, workplaces, pubs, cafes, open-air market, clubs • Objects: clothes, cars, household goods, personal possessions, pets, etc • Views: various camera positions, times of year, times of day, • Use of fan community metadata allowed, if documented 5
  • 5. TRECVID 20186 Majority of episodes filmed at Elstree studios. Sometimes filmed on ‘location’. EastEnders’ world
  • 6. TRECVID 20187 Topic creation procedure @ NIST • Viewed several test videos to develop a list of recurring people, locations and their overlapping. • Chose 10 master locations and identified 6 to 12 image and video examples to each depending on location type (private: kitchen, room, etc; public: pub, café, market, etc) • Created ≈90 topics targeting recurring specific persons in specific locations. • Chose representative sample of 30 topics. Each topic includes images for target persons from test videos, many from the sample video (ID 0) and a named location. • Filtered example shots from the submissions if it satisfies the topic.
  • 7. TRECVID 20188 Global test condition: type of training data Effect of examples – 2 conditions: • A – one or more provided images – no video • E - video examples (+ optional image examples)
  • 8. TRECVID 20189 Topics – segmented “person” example images Chelsea Darrin Garry Heather
  • 9. TRECVID 201810 Topics – segmented example images Jack Jane Max Minty
  • 10. TRECVID 201811 Topics – segmented example images Mo Zainab
  • 11. TRECVID 201812 Foyer Kitchen1 Kitchen2 LR1 LaundretteCafe2Cafe1LR2 Topics – 10 Master locations
  • 12. TRECVID 201813 Topics – 2018 Jane Chelsea Minty Garry Mo Darrin Zainab Heather Jack Max Cafe2 x x x x x x x x x Market x x x x x x x Pub x x x x x x x Launderette x x x x x x x 30 x topics : find {Chelsea, Darrin, Garry, Heather, Jack, Jane, Max, Minty, Mo, Zainab} in {Cafe2,Market,Pub,Launderette}
  • 13. TRECVID 201814 Team Organization Run Types Submitted F: automatic, I: Interactive BUPT_MCPRL Beijing University of Posts and Telecommunications F_E (3), I_E (1) HSMW_TUC Chemnitz University of Technology, University of Applied Sciences Mittweida F_A (3), I_A (1) ITI_CERTH Information Technologies Institute, Centre for Research and Technology Hellas I_A (1) IRIM EURECOM; LABRI ; LIG ; LIMSI; LISTIC F_A (4), F_E (4) NII_Hitachi_UIT National Institute of Informatics, Japan (NII); Hitachi, Ltd; University of Information Technology, VNU-HCM, Vietnam (HCM-UIT) F_A (4) , I_A(1) WHU_NERCMS National Engineering Research Center for Multimedia Software, Wuhan University F_A (4) , I_A (4) PLUMCOT LIMSI, Karlsruhe Institute of Technology F_A (3) PKU_ICST Peking University F_A (3), F_E (3), I_E (1) INS 2018: 8 Finishers (out of 17)
  • 14. TRECVID 201815 Evaluation For each topic the submissions were pooled and judged down to max rank 520, resulting in 128117 judged shots (≈ 480 person-h). • 10 NIST assessors played the clips and determined if they contained the topic target or not. • 11717 clips (avg. 390 / topic) contained the topic target (9 %) • True positives per topic: min 30 med 168 max 1340 • The task is treated as a form of ranking and thus the trec_eval_video tool was used to calculate average precision, recall, precision, etc. • To measure efficiency, speed was also measured. • In total, 31 automatic and 9 interactive runs were submitted.
  • 15. TRECVID 201816 Results by team (Automatic)
  • 16. TRECVID 201817 Results by team (Interactive)
  • 17. TRECVID 201818 # Query 9230 Find Garry in this Laundrette 9236 Find Darrin in this Laundrette 9241 Find Heather in this Laundrette 9233 Find Mo in this Laundrette 9239 Find Zainab in this Mini-Market 9244 Find Jack in this Laundrette 9237 Find Zainab in this Cafe 2 9238 Find Zainab in this Laundrette 9242 Find Heather in this Mini-Market 9248 Find Max in this Mini-Market 9225 Find Minty in this Cafe 2 9219 Find Jane in this Cafe 2 9229 Find Garry in this Pub 9226 Find Minty in this Pub 9245 Find Jack in this Mini-Market 9228 Find Garry in this Cafe 2 9243 Find Jack in this Pub 9227 Find Minty in this Mini-Market 9240 Find Heather in this Cafe 2 9246 Find Max in this Cafe 2 9221 Find Jane in this Mini-Market 9247 Find Max in this Laundrette 9223 Find Chelsea in this Pub 9224 Find Chelsea in this Mini-Market 9235 Find Darrin in this Pub 9232 Find Mo in this Pub 9234 Find Darrin in this Cafe 2 9231 Find Mo in this Cafe 2 9222 Find Chelsea in this Cafe 2 9220 Find Jane in this Pub Results by topic - automatic Zainab (0.44)* & Heather (0.558) easy to find. Chelsea (0.229) & Max (0.235) difficult to find. Laundrette (0.479) & Mini-Market(0.411) is easy. Pub(0.259) & Cafe 2(0.211) is hard. *Mean score of median MAP per character/location
  • 18. TRECVID 201819 Automatic Run results + Randomization testing = > > > > > > = > > > > = > > = > > = > > = > > = > = = = 1 2 3 4 5 6 7 8 9 10 > p < 0.05 0.463 F_E_PKU_ICST_1 0.459 F_E_PKU_ICST_4 0.443 F_A_IRIM_2 0.442 F_A_IRIM_1 0.437 F_E_IRIM_2 0.433 F_E_IRIM_1 0.429 F_A_PKU_ICST_3 0.420 F_A_PKU_ICST_6 0.398 F_A_IRIM_3 0.395 F_E_IRIM_3 MAP Top 10 runs across all teams (automatic) p = probability the row run scored better than the column run due to chance
  • 19. TRECVID 201820 2017 (s) 2016 (s) 2018 (s) Mean Average Precision vs. per run clock processing time (automatic) IRIM runs
  • 20. TRECVID 201821 Results by topic - interactive # Query 9230 Find Garry in this Laundrette 9228 Find Garry in this Cafe 2 9233 Find Mo in this Laundrette 9236 Find Darrin in this Laundrette 9239 Find Zainab in this Mini-Market 9238 Find Zainab in this Laundrette 9225 Find Minty in this Cafe 2 9229 Find Garry in this Pub 9221 Find Jane in this Mini-Market 9226 Find Minty in this Pub 9237 Find Zainab in this Cafe 2 9223 Find Chelsea in this Pub 9224 Find Chelsea in this Mini-Market 9222 Find Chelsea in this Cafe 2 9232 Find Mo in this Pub 9227 Find Minty in this Mini-Market 9219 Find Jane in this Cafe 2 9235 Find Darrin in this Pub 9220 Find Jane in this Pub 9231 Find Mo in this Cafe 2 9234 Find Darrin in this Cafe 2 Minty(0.319)*, Zainab(0.316) & Garry(0.3) are easy to find. Jane(0.175), Darrin(0.208) & Chelsea(0.228) are difficult. Laundrette (0.33) & Mini-Market (0.372) are easy. Cafe 2 (0.117) & Pub (0.293) are hard. *Mean score of median MAP per character/location
  • 21. TRECVID 201822 > p < 0.05 MAP p = probability the row run scored better than the column run due to chance 0.524 I_E_PKU_ICST_2 = > > > > > > > > 0.447 I_E_BUPT_MCPRL_4 = > > > > > > > 0.367 I_A_NII_Hitachi_UIT_1 = > > > > > > 0.261 I_A_WHU_NERCMS_1 = > > > > 0.252 I_A_HSMW_TUC_4 = > > > 0.235 I_A_WHU_NERCMS_3 = > > 0.200 I_A_WHU_NERCMS_4 = > > 0.184 I_A_WHU_NERCMS_2 = > 0.064 I_A_ITI_CERTH_1 = 1 2 3 4 5 6 7 8 9 ALL 9 runs by all teams (interactive) Interactive Run Results, Randomization testing
  • 22. TRECVID 201823 Results by example set (A/E) - automatic
  • 24. TRECVID 201825 Some general observations about the task • Slight decrease in number of participants but same number of finishers – higher % finished. • Less teams are using E condition - training with video examples – (e.g tracking characters) • Interactive search task: • Limited participation • Third year: Slight decrease in best performances from 2nd year – Why? Queries more difficult? • We encourage teams to test their 2016 or 2017 system on the 2018 topics or vice versa.
  • 25. TRECVID 201826 Some general observations about the task – Data Source •Best results achieved using external data plus NIST provided data. •Next best results are achieved using only the NIST provided images and video. •Systems using only external data do not perform as well as systems which include the NIST provided data.
  • 26. TRECVID 201828 Observations over last three years of the task (Automatic): •High Score down on last year but still up on 2016. •Low Scores increasing year on year. •Mean and Median scores increased significantly last year on 2016 but have since stabilized. •Standard Deviation between MAP scores decreased on last year, now similar to 2016. •Number of participants has been decreasing year on year.
  • 27. TRECVID 201829 Observations over last three years of the task - Locations •Laundrette consistently shows to be among the easiest locations to recognize. •Pub consistently shows to be among the most difficult locations to recognize. 2016 2017 2018 Pub 0.021 N/A 0.259 Laundrette 0.172 0.338 0.479 Market N/A 0.343 0.411 Average scores (automatic systems) across all topics for common locations per year
  • 28. TRECVID 201830 BUPT-MCPRL •Location Retrieval: Two Independent Methods: • Hessian-Affine detector with RootSIFT, MSER detector with RootSIFT, and CNN features. • Fine-Tuned publicly available VGG-16 model, GoogleNet model, and ResNet-152 models. •Person Retrieval: Face retrieval and transcript- based search: • Detect face on key frames captures from video by MTCNN, face representations extracted from bounding-box and cosine distance is employed to match faces. • Transcript search - locate character name in transcripts. •Submitted 4 runs • Three automatic runs • One interactive run
  • 29. TRECVID 201831 ITI CERTH •Focus on interactive task •VERGE system includes several modes for navigation: • Visual similarity (DCNN) • Visual Concept Retrieval - 346 visual concepts • Face detection • Scene similarity • Multimodal Fusion •Late fusion of DCNN face descriptors and scene descriptors •Submitted 1 interactive run
  • 30. TRECVID 201832 TU Chemnitz • Complete overhaul of INS system architecture using Docker • Allows combination of various open face recognition and scene recognition pipelines • All indexing is done offline, retrieval is very fast Submitted 3 automatic and 1 interactive runs
  • 31. TRECVID 201833 IRIM (LaBRI, LIMSI, LIG) • Combination of two person recognition methods • One location recognition method • Late fusion on person methods, additional late fusion to mix in the location scores • 2018: focus on person recognition • Positive impact: data augmentation, faces reranking Submitted 8 automatic runs (A and E)
  • 32. TRECVID 201834 Peking University (ICST) • Location search: BOW plus CNN • Person search: • Query preprocessing based on super resolution • Deep models for face recognition • Text based refinement • Fusion based on combination of score and rank based fusion (boosting) • Filtering noisy shots (outliers) Submitted 6 automatic runs (A and E), 1 interactive
  • 33. TRECVID 201836 Overview of submissions (1) •8 out of 8 teams described INS runs for the TV notebook •3 teams will present their INS experiments 3:40 - 4:10, (HSMW_TUC – University of Applied Sciences Mittweida, Chemnitz University of Technology) 4:10 - 4:40, (NII_Hitachi_UIT – National Institute of Informatics, Japan Hitachi, Ltd., Japan University of Information Technology, VNU_HCMC, Vietnam) 4:40 - 5:10, (WHU_NERCMS – National Engineering Research Centre for Multimedia Software, Wuhan University) 5:10 - 5:25, INS Discussion
  • 34. TRECVID 201837 INS 2019 plans •Move on to a new query type • Action instances (drinking, walking, sleeping, talking, driving, etc) • Person + Action (Brad fighting) • Action + Location (e.g drinking in the cafe) • Mix of the above ?! •Keep the newly added additional training data sources. •Add manual run type ?
  • 35. TRECVID 201838 INS 2019 plans – Example Action Images Heather Talking on PhoneJack Holding Money Ian Drinking in Cafe Garry Holding Flowers Minty and Mo Singing Jane Holding Baby
  • 36. TRECVID 201839 Future INS plans – Common queries • Plans to now include a set of common queries each year to measure yearly progress. • 2019 teams will submit runs for 50 queries in total. 30 unique queries plus 20 common queries to be repeated each year. • 2020 and 2021 teams to submit runs for 40 queries in total. 20 unique queries plus 20 common queries each year. • The common queries each year provide a basis for the comparison of team performances year on year.
  • 37. TRECVID 201840 Future INS plans – Evaluations 2019 2020 2021 2019 30 + 10A + 10B 2020 20 + 10A + 10B 2021 20 + 10A + 10B • 2019 Assessors to evaluate just the 30 unique queries for that year. • 2020 Assessors to evaluate the 20 unique queries for that year plus the first 10 common queries (10A), allows to measure progress on 2019. • 2021 Assessors to evaluate the 20 unique queries plus the second 10 common queries (10B), allows to measure progress on 2019 and 2020.