SlideShare a Scribd company logo
1 of 30
Download to read offline
TRECVID 2022
Ad-hoc Video Search (AVS)
Task Overview
Information Access Division
Information Technology
Laboratory
Georges Quénot
Laboratoire d’Informatique de Grenoble, France
George Awad
Retrieval Group, Information Access Division, Information Technology Laboratory, NIST;
Outline
TRECVID 2022
Task Definition & Dataset
Topics (Queries)
Participating Teams
Evaluation & Results
General Observations
Task Definition
Goal: promote progress in content-based video retrieval based on end user ad-hoc (generic) textual queries
that include searching for persons, objects, locations, actions and their combinations.
Task: Given a test collection, a query, and a master shot boundary reference, return a ranked list of at most
1000 shots (out of 1, 425,454) which best satisfy the query.
Queries:
• Main : New 20 to 30 queries each year
• Progress : A set of fixed 20 queries for 3 years
Testing data: 9760 Vimeo Creative Commons Videos (V3C2), 1300 total hours with mean video durations of 8
min. Reflects a wide variety of content, style and source devices.
Development data:
• ≈2000 hours of previous IACC.1-3 (Internet Archive) data used between 2010-2018 with concept and ad-hoc query
annotations.
• V3C1 (Vimeo Creative Commons ) dataset, 1000 hours, with ad-hoc query annotations (used between 2019 – 2021).
Task Parameters
TRECVID 2022
System Types Description
Fully Automatic
(F)
System uses official
query directly
Manually-Assisted
(M)
Query built
manually
Relevance-Feedback
(R)
Evaluating top-30
results up to 3
iterations
Training data
categories
Description
A Only V3C1 training data
D Other training data sources
E Only training data collected automatically
using the query text
F Only training data collected automatically
using a query built manually from the official
query text
->> Novelty (optional) run type to encourage retrieving non-common relevant shots easily found
across systems.
->> Explainability of result items were allowed as extra optional information with the submitted shots
The Vimeo Creative Commons Collection (V3C)* consists of ‘free’ video material sourced from the web
video platform vimeo.com. It is designed to contain a wide range of content which is representative of what
is found on the platform in general. All videos in the collection have been released by their creators under a
Creative Commons License which allows for unrestricted redistribution.
Vimeo Creative Commons Collection
Partition V3C1 V3C2 V3C3 Total
File Size 2.4TB 3.0TB 3.3TB 8.7TB
Number of Videos 7,475 9,760 11,215 28,450
Combined Video
Duration
1000 hours,
23 minutes,
50 seconds
1300 hours,
52 minutes,
48 seconds
1500 hours,
8 minutes,
57 seconds
3801 hours,
25 minutes,
35 seconds
Mean Video
Duration
8 minutes,
2 seconds
7 minutes,
59 seconds
8 minutes,
1 seconds
8 minutes,
1 seconds
Number of
Segments
1,082,659 1,425,454 1,635,580 4,143,693
* Rossetto, L., Schuldt, H., Awad, G., & Butt, A. (2019). V3C – a Research Video Collection. Proceedings of the 25th International Conference on MultiMedia Modeling.
AVS 2022 (30 main) Queries by complexity
A man with a white beard
A room with blue wall
A construction site
A parked white car
A type of cloth hanging on a rack, hanger, or line
Building with columns during daytime
A person is mixing ingredients in a bowl, cup, or similar type of containers
A female person bending downwards
A person is in the act of swinging
A person wearing a light t-shirt with dark or black writing on it
A woman wearing a head kerchief
A man wearing black shorts
A kneeling man outdoors
Two or more persons in a room with a fireplace
A drone landing or rising from the ground
A large stone building from the outside
A piece of heavy farm equipment or machine seen outdoors
A clock on a wall in a room
A Person, Location, or Object Person + Action
Person + Object
Person + Location
Object + Location
Object + Action
AVS 2022 (30 main) Queries by complexity
A black bird seen on a dry area sitting, walking, or eating
Two persons are seen while at least one of them is speaking in a non-English language outdoors
A woman is eating something outdoors
A person is biking through a path in a forest
An Asian bride and groom celebrating outdoors
A man and a bike in the air after jumping from a ramp
A woman holding or smoking a cigarette
Two teams playing a game where one team have their players wearing white t-shirts.
Two persons wearing white outfits and black belts demonstrate martial arts in a room with floor mats
Two adults are seated in a flying paraglider in the air
A ring shown on the left hand of a person
A man is holding a knife in a non-kitchen location
Person + Action + Location
Person + Action + Object
Person + Object + Location
Person + Action + Object + Location
Object + Action + Location
2022-2024 (20 progress) Queries by complexity
A woman with a ponytail
A person's Hands with a red nail polish
A building with balconies seen from the outside during daytime
A room with a wood floor
A wooden bridge
A round table
A Person, Location, or Object
A person is throwing an object away
A person is washing oneself or another thing
Person + Action
A vehicle driving under a tunnel
A big building that is being camera panned or tilted from the outside
Object + Location
Person + Object
A man wearing a lanyard around his neck
Person + Location
A man is seen at a gas station
Person + Action + Location
A person is lying on the ground outdoors
A person is rubbing part of their face using their hands
Person + Action + Object
A man holding a gun but not shooting
A person is pouring liquid into a type of container
Person + Object + Location
A person wearing a ring in their nose
A man wearing a dark colored hooded jacket outdoors Person + Action + Object + Location
A man holding a fishing rod while being dipped in a body of water
A person holding a long stick which is not a drum stick outdoors
Teams – Main Task (33 runs)
TRECVID 2022
Team Name
(7 Finishers)
Organization
System Type
Manually
assisted
Fully
automatic
Novelty
run
VIREO Singapore Management University; City University of Hong Kong 5 5 1
Kindai_ogu_osaka Kindai University; Osaka Gakuin University; Osaka University 4
ITI_CERTH
Information Technologies Institute, Centre for Research
and Technology Hellas
2
RUCAIM3-Tencent Renmin University of China 4
RUCMM Renmin University of China 4
WasedaMeiseiSoftbank Waseda University; Meisei University; SoftBank Corporation 4
CamiloUchile Uchile 4
Teams – Progress Task (28 runs)
TRECVID 2022
Team Name
(6 Finishers)
Organization
System Type
Manually
assisted
Fully
automatic
Novelty
run
VIREO Singapore Management University; City University of Hong Kong 5 5
Kindai_ogu_osaka Kindai University; Osaka Gakuin University; Osaka University 4
ITI_CERTH
Information Technologies Institute, Centre for Research
and Technology Hellas
2
RUCAIM3-Tencent Renmin University of China 4
RUCMM Renmin University of China 4
WasedaMeiseiSoftbank Waseda University; Meisei University; SoftBank Corporation 4
Evaluation Methodology
Ø NIST judged 100% of top (ranks 1 – 300) pooled results from all submissions and
sampled 25% from the rest of pooled results (ranks 301 – 1000).
Ø Stats of sampled and judged clips (ranks 301 to 1000) across all runs and topics
Ø At minimum, 24.3 % of any run and query results were sampled and judged
Ø At maximum, 76.7 % of any run and query results were sampled and judged
Ø On average, 55 % of any run and query results were sampled and judged
Ø One assessor per query, watched complete shot while listening to the audio.
Ø Each query assumed to be binary: absent or present for each master reference shot.
Evaluation Methodology
Ø Top submitted results were double judged if at least 10 runs submitted them, and
assessor judged them as false positive.
Ø submitted results were double judged if keyframes of close neighbourhood (+/- 5) shots
are visually similar but judged differently.
Ø Extended inferred average precision (xinfAP1) was calculated using the judged and
unjudged pool by sample_eval2 tool.
Ø Compared runs in terms of mean extended inferred average precision across all
evaluated queries.
1
J.A. Aslam, V. Pavlu and E. Yilmaz, Statistical Method for System Evaluation Using Incomplete Judgments Proceedings of the 29th ACM SIGIR Conference, Seattle, 2006.
2 https://www-nlpir.nist.gov/projects/trecvid/trecvid.tools/sample_eval/
Human Judgments
TRECVID 2022
Total Judged Shots 148 234
Total Hits 20 125
Hits at ranks
1 - 100
7762
Hits at ranks
101 - 300 7745
Hits at ranks
301 - 1000 4618
5 human assessors
30 queries
500 hours of labor work
Sorted Overall Scores – Automatic Runs
0
0.05
0.1
0.15
0.2
0.25
0.3
C
_
D
_
W
a
s
e
d
a
M
e
i
s
e
i
S
o
f
t
b
a
n
k
.
2
2
_
2
C
_
D
_
W
a
s
e
d
a
M
e
i
s
e
i
S
o
f
t
b
a
n
k
.
2
2
_
3
C
_
D
_
W
a
s
e
d
a
M
e
i
s
e
i
S
o
f
t
b
a
n
k
.
2
2
_
1
C
_
D
_
W
a
s
e
d
a
M
e
i
s
e
i
S
o
f
t
b
a
n
k
.
2
2
_
4
C
_
D
_
R
U
C
M
M
.
2
2
_
2
C
_
D
_
R
U
C
M
M
.
2
2
_
3
C
_
D
_
R
U
C
M
M
.
2
2
_
4
C
_
D
_
R
U
C
M
M
.
2
2
_
1
C
_
D
_
I
T
I
_
C
E
R
T
H
.
2
2
_
2
C
_
D
_
I
T
I
_
C
E
R
T
H
.
2
2
_
1
C
_
D
_
k
i
n
d
a
i
_
o
g
u
_
o
s
a
k
a
.
2
2
_
1
C
_
D
_
k
i
n
d
a
i
_
o
g
u
_
o
s
a
k
a
.
2
2
_
2
C
_
D
_
k
i
n
d
a
i
_
o
g
u
_
o
s
a
k
a
.
2
2
_
4
C
_
D
_
k
i
n
d
a
i
_
o
g
u
_
o
s
a
k
a
.
2
2
_
3
C
_
D
_
R
U
C
A
I
M
3
-
T
e
n
c
e
n
t
.
2
2
_
2
C
_
D
_
V
I
R
E
O
.
2
2
_
3
C
_
D
_
V
I
R
E
O
.
2
2
_
2
C
_
D
_
V
I
R
E
O
.
2
2
_
4
C
_
D
_
R
U
C
A
I
M
3
-
T
e
n
c
e
n
t
.
2
2
_
1
C
_
D
_
V
I
R
E
O
.
2
2
_
1
C
_
D
_
R
U
C
A
I
M
3
-
T
e
n
c
e
n
t
.
2
2
_
3
C
_
D
_
V
I
R
E
O
.
2
2
_
5
C
_
D
_
R
U
C
A
I
M
3
-
T
e
n
c
e
n
t
.
2
2
_
4
N
_
D
_
V
I
R
E
O
.
2
2
_
6
C
_
D
_
C
a
m
i
l
o
U
c
h
i
l
e
.
2
2
_
3
C
_
D
_
C
a
m
i
l
o
U
c
h
i
l
e
.
2
2
_
4
C
_
D
_
C
a
m
i
l
o
U
c
h
i
l
e
.
2
2
_
2
C
_
D
_
C
a
m
i
l
o
U
c
h
i
l
e
.
2
2
_
1
Mean
xinfAP 28 Automatic runs across 30 main queries
Median = 0.176
Higher
is
better
Sorted Overall Scores – Manually Assisted
0
0.02
0.04
0.06
0.08
0.1
0.12
0.14
0.16
0.18
0.2
C_D_VIREO.22_3 C_D_VIREO.22_4 C_D_VIREO.22_2 C_D_VIREO.22_1 C_D_VIREO.22_5
Mean
xinfAP
5 Manually-Assisted runs across 30 main queries
Higher
is
better
Statistical Significance (top 10 runs)
Top 10 automatic runs - randomization test (p < 0.05)
F_M_C_D_WasedaMeiseiSoftbank.22_1
F_M_C_D_WasedaMeiseiSoftbank.22_3
F_M_C_D_WasedaMeiseiSoftbank.22_2
F_M_C_D_RUCMM.22_4
F_M_C_D_WasedaMeiseiSoftbank.22_4
F_M_C_D_RUCMM.22_2
F_M_C_D_RUCMM.22_3
F_M_C_D_RUCMM.22_1
F_M_C_D_ITI_CERTH.22_2
F_M_C_D_ITI_CERTH.22_1
F_M_C_D_Wase
daMeiseiSoft
bank.22_4
F_M_C_D_ITI_
CERTH.22_2
F_M_C_D_ITI_
CERTH.22_1
No statistical diff. between WasedaMeiseiSoftbank
runs 1, 3, & 2.
No statistical diff. between all RUCMM runs.
ITI_CERTH run 2 is better than run 1
Statistical Significance (top 10 runs)
5 manually-assisted runs - randomization test (p < 0.05)
M_M_C_D_VIREO.22_3
M_M_C_D_VIREO.22_4
M_M_C_D_VIREO.22_1
M_M_C_D_VIREO.22_2
M_M_C_D_VIREO.22_5
VIREO run 3 is better than all other VIREO runs.
VIREO run 4 is better than runs 1, 2, & 5.
VIREO runs 1 & 2 are better than run 5
M_M_C_D_VIREO.22_4
M_M_C_D_VIREO.22_1
M_M_C_D_VIREO.22_2
M_M_C_D_VIREO.22_5
Hits Per Topic
0
200
400
600
800
1000
1200
1400
1600
1800
2000
1726 1718 1709 1723 1712 1721 1708 1706 1704 1705 1702 1729 1703 1713 1701 1715 1724 1710 1714 1711 1720 1728 1730 1707 1717 1725 1719 1722 1716 1727
Number
of
relevant
shots
Query Ids
Unique vs Common (submitted by at least 2 teams) True Positive Shots
Unique Common
Two persons
wearing white
outfits and
black belts
demonstrate
martial arts in a
room with floor
mats
A person is
biking through
a path in a
forest
36 % of all hits are
unique
A drone landing
or rising from
the ground
A woman is
eating
something
outdoors
A construction
site
Two teams
playing a game
where one
team have
their players
wearing white
t-shirts.
A person is
mixing
ingredients in a
bowl, cup, or
similar type of
containers
A room with
blue walls
A large stone
building from
the outside
Sorted Unique Hits by Team
0
500
1000
1500
2000
2500
3000
3500
V
I
R
E
O
R
U
C
A
I
M
3
-
T
e
n
c
e
n
t
W
a
s
e
d
a
M
e
i
s
e
i
S
o
f
t
b
a
n
k
R
U
C
M
M
k
i
n
d
a
i
_
o
g
u
_
o
s
a
k
a
C
a
m
i
l
o
U
c
h
i
l
e
I
T
I
_
C
E
R
T
H
Number
of
unique
relevant
shots
7336 Unique Shots from 7 teams in their automatic & manually-assisted runs
Teams
Top scoring teams
are not necessarily
contributing many
unique true shots
and vice-versa.
Top runs per query (Main Task)
Higher
is
better
0.00
0.10
0.20
0.30
0.40
0.50
0.60
0.70
0.80
0.90
701 702 703 704 705 706 707 708 709 710 711 712 713 714 715 716 717 718 719 720 721 722 723 724 725 726 727 728 729 730
InfAP
Queries
Top 10 automatic runs
1
2
3
4
5
6
7
8
9
10
Median
A construction
site
A person is in
the act of
swinging
A person is
biking through
a path in a
forest
A man is
holding a knife
in a non-
kitchen
location
A kneeling man
outdoors
A woman
wearing a head
kerchief
Top runs per query (Main Task)
Higher
is
better
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
701 702 703 704 705 706 707 708 709 710 711 712 713 714 715 716 717 718 719 720 721 722 723 724 725 726 727 728 729 730
InfAP
Queries
All Manually-assisted runs
1
2
3
4
5
Median
Easy vs Hard Queries
Query
A person is biking through a path in a forest
A construction site
A person is in the act of swinging
A female person bending downwards
A type of cloth hanging on a rack, hanger, or line
Top 5 easiest queries (based on avg infAP of runs scored >= 0.38)
Top 5 hardest queries (based on avg infAP of runs scored < 0.38)
Query
A kneeling man outdoors
Two or more persons in a room with a fireplace
A woman wearing a head kerchief
A room with blue wall
A person wearing a light t-shirt with dark or black writing on it
Informal method of declaring
easy/hard topic:
1- Threshold of 0.38 xinfAP is
calculated as the mid point
between all topics score range.
2- Sorted number of runs
scored above / below 0.38 for
any topic.
Novelty Scores
TRECVID 2022
0
5
10
15
20
25
30
35
40
45
F
_
M
_
N
_
D
_
V
I
R
E
O
.
2
2
_
6
F
_
M
_
C
_
D
_
R
U
C
A
I
M
3
-
…
F
_
M
_
C
_
D
_
W
a
s
e
d
a
M
e
i
s
e
i
S
o
f
…
F
_
M
_
C
_
D
_
k
i
n
d
a
i
_
o
g
u
_
o
s
a
k
a
.
…
F
_
M
_
C
_
D
_
R
U
C
M
M
.
2
2
_
2
F
_
M
_
C
_
D
_
I
T
I
_
C
E
R
T
H
.
2
2
_
2
F
_
M
_
C
_
D
_
C
a
m
i
l
o
U
c
h
i
l
e
.
2
2
_
3
Mean
Unique
Shot
Weights
Novelty runs vs best common run from each team
Novelty runs
Common (non-novel) runs
A weight is given to each
topic and shot pairs in the
ground truth such that
highest weight is given to
unique shots.
Efficiency
TRECVID 2022
1
10
100
1000
10000
100000
1000000
0 0.2 0.4 0.6 0.8 1
Time
(s)
InfAP
Automatic Systems
1
10
0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8
Time
(s)
InfAP
Manually-Assisted Systems
Good
and
fast
VIREO runs
Progress Task Plan
TRECVID 2022
Evaluation year
Submission year
2022 2023 2024
2022
Systems: Submit 20 fixed progress
queries
2023
Systems: Submit 20 fixed
progress queries
NIST: Eval 10 queries (set A)
2024
Systems: Submit 20 fixed
progress queries
NIST: Eval 10 queries (set B)
Goals : Evaluate 10 (set A) common queries submitted in 2 years (2022 - 2023)
Evaluate 10 (set B) common queries submitted in 3 years (2022 - 2024)
A person is mixing ingredients in a
bowl, cup, or similar type of containers
Samples of frequent false positives
A man wearing black shorts
A type of cloth hanging on a
rack, hanger, or line
A clock on a wall in a room
A ring shown on the left hand of a
person
A parked white car
Samples of hard true positives
A type of cloth hanging on a rack,
hanger, or line
Building with columns during
daytime
A female person bending
downwards
A person is in the act of
swinging
A kneeling man outdoors
2022 Main Approaches
ØUse of multiple text-image / text-video common latent embeddings: VSE++, GSMN, CLIP, SLIP, …
ØUse of multiple text-image / text-video annotated collections: MSR-VTT, TGIF, Flickr8k/30k, MS-
COCO, Conceptual Captions, …
ØUse of multiple visual and textual feature extractors
ØTriplet loss with margin for embedding space learning
ØLarge number of combinations and fusion (normalization | averaging)
ØLightweight Attentional Feature Fusion
ØStacked Cross Attention Network
ØBidirectional Negation Learning (for query with negative cues)
ØDual SoftMax with “background queries”
ØNo more concept bank approaches but “dual task” (interpretable embeddings)
ØHard to distinguish between data / features effects and algorithmic effects
2022 Task Observations
ØSubmissions
Ø 7 teams finished the main task including 6 teams submitting to the progress task with 28 runs.
Ø 28 automatic systems and 5 manually-assisted systems submitted runs in the main task.
Ø Run training types are dominated by “D” runs. No “E” or “F” runs.
Ø No teams submitted “optional” explainability results with their runs!
Ø Only 1 Novelty system submitted. Better than common runs on novelty metric.
ØPerformance
Ø Below 2021 & 2020 in general. However, queries are different and meant to search for more fine grained information.
Ø Few automatic systems are good and fast (< 10 sec).
Ø High similarity between automatic and manually-assisted systems in terms of query performance relatively to each
other.
Ø Top scoring teams not necessary contributing a lot of unique true shots and vice-versa.
Ø About 36% of all hits are unique. 64% are common hits across the runs.
Ø 13.5% of all judged shots across all queries are true positives.
Ø Hard queries are the ones asked for unusual combinations of facets (compared to well-known concepts)
Ø For low performance queries, usually all systems are condensed in small range.
Ø For mid to high performance queries, the top 10 runs vary in their range of performance.
Interactive Video Retrieval
At MMM 2023
29th International Conference on Multimedia Modeling,
January 2023, Bergen, Norway
• 10 Ad-Hoc Video Search (AVS) topics : Each AVS topic has several/many target shots (from V3C1 + V3C2
datasets) that should be found.
• 10 Known-Item Search (KIS) tasks, which are selected completely random on site. Each KIS task has only one
single 20 s long target segment.
• Registration for the task is now closed
During the Video Browser Showdown (VBS)

More Related Content

Similar to Video Search at TRECVID 2022

pres_drone_forensics_program.pptx
pres_drone_forensics_program.pptxpres_drone_forensics_program.pptx
pres_drone_forensics_program.pptxVolgaTC
 
UGC NET Paper 1 Major Topics in New Syllabus
UGC NET Paper 1 Major Topics in New SyllabusUGC NET Paper 1 Major Topics in New Syllabus
UGC NET Paper 1 Major Topics in New SyllabusDr Rajnikant Dodiya
 
Linked Data for Digital Humanities research at Media Archives
Linked Data for Digital Humanities research at Media ArchivesLinked Data for Digital Humanities research at Media Archives
Linked Data for Digital Humanities research at Media ArchivesVictor de Boer
 
Structural Biology in the Clouds: A Success Story of 10 years
Structural Biology in the Clouds: A Success Story of 10 yearsStructural Biology in the Clouds: A Success Story of 10 years
Structural Biology in the Clouds: A Success Story of 10 yearsAlexandreBonvin2
 
画像キャプションと動作認識の最前線 〜データセットに注目して〜(第17回ステアラボ人工知能セミナー)
画像キャプションと動作認識の最前線 〜データセットに注目して〜(第17回ステアラボ人工知能セミナー)画像キャプションと動作認識の最前線 〜データセットに注目して〜(第17回ステアラボ人工知能セミナー)
画像キャプションと動作認識の最前線 〜データセットに注目して〜(第17回ステアラボ人工知能セミナー)STAIR Lab, Chiba Institute of Technology
 
R&D Halfyearly Report.pptx
R&D Halfyearly Report.pptxR&D Halfyearly Report.pptx
R&D Halfyearly Report.pptxSaumya Acharya
 
Huawei STW 2018 public
Huawei STW 2018 publicHuawei STW 2018 public
Huawei STW 2018 publicAlan Smeaton
 
20170410 CENTRA2 meeting - AirBox
20170410 CENTRA2 meeting - AirBox20170410 CENTRA2 meeting - AirBox
20170410 CENTRA2 meeting - AirBoxLing-Jyh Chen
 
Intrusion Detection and Prevention System in an Enterprise Network
Intrusion Detection and Prevention System in an Enterprise NetworkIntrusion Detection and Prevention System in an Enterprise Network
Intrusion Detection and Prevention System in an Enterprise NetworkOkehie Collins
 
Short TRIZ Workshop for the University of the Philippines
Short TRIZ Workshop for the University of the PhilippinesShort TRIZ Workshop for the University of the Philippines
Short TRIZ Workshop for the University of the PhilippinesRichard Platt
 
A Large-Scale Analysis of YouTube Videos Depicting Everyday Thermal Camera Use
A Large-Scale Analysis of YouTube Videos Depicting Everyday Thermal Camera UseA Large-Scale Analysis of YouTube Videos Depicting Everyday Thermal Camera Use
A Large-Scale Analysis of YouTube Videos Depicting Everyday Thermal Camera UseMatthew Louis Mauriello
 
Crowdsourcing the Acquisition and Analysis of Mobile Videos for Disaster Resp...
Crowdsourcing the Acquisition and Analysis of Mobile Videos for Disaster Resp...Crowdsourcing the Acquisition and Analysis of Mobile Videos for Disaster Resp...
Crowdsourcing the Acquisition and Analysis of Mobile Videos for Disaster Resp...University of Southern California
 
Data is the new oil: Big data, data mining and bio - inspiring techniques
Data is the new oil: Big data, data mining and bio - inspiring techniquesData is the new oil: Big data, data mining and bio - inspiring techniques
Data is the new oil: Big data, data mining and bio - inspiring techniquesAboul Ella Hassanien
 
Data are the new oil: Big data, data mining and bio - inspiring techniques
Data are the new oil: Big data, data mining and bio - inspiring techniquesData are the new oil: Big data, data mining and bio - inspiring techniques
Data are the new oil: Big data, data mining and bio - inspiring techniquesAboul Ella Hassanien
 
Near-Duplicate Video Retrieval by Aggregating Intermediate CNN Layers
Near-Duplicate Video Retrieval by Aggregating Intermediate CNN LayersNear-Duplicate Video Retrieval by Aggregating Intermediate CNN Layers
Near-Duplicate Video Retrieval by Aggregating Intermediate CNN LayersSymeon Papadopoulos
 
Cloud Services for Education - HNSciCloud applied to the UP2U project
Cloud Services for Education - HNSciCloud applied to the UP2U projectCloud Services for Education - HNSciCloud applied to the UP2U project
Cloud Services for Education - HNSciCloud applied to the UP2U projectHelix Nebula The Science Cloud
 
IEEE SMC TCHS Award Ceremony at IEEE CSR conference 2021
IEEE SMC TCHS Award Ceremony at IEEE CSR conference 2021IEEE SMC TCHS Award Ceremony at IEEE CSR conference 2021
IEEE SMC TCHS Award Ceremony at IEEE CSR conference 2021Francesco Flammini
 
Video analysis techniques
Video analysis techniquesVideo analysis techniques
Video analysis techniquesLiz FitzGerald
 

Similar to Video Search at TRECVID 2022 (20)

Paul Wang SOED 2016
Paul Wang SOED 2016Paul Wang SOED 2016
Paul Wang SOED 2016
 
pres_drone_forensics_program.pptx
pres_drone_forensics_program.pptxpres_drone_forensics_program.pptx
pres_drone_forensics_program.pptx
 
UGC NET Paper 1 Major Topics in New Syllabus
UGC NET Paper 1 Major Topics in New SyllabusUGC NET Paper 1 Major Topics in New Syllabus
UGC NET Paper 1 Major Topics in New Syllabus
 
Linked Data for Digital Humanities research at Media Archives
Linked Data for Digital Humanities research at Media ArchivesLinked Data for Digital Humanities research at Media Archives
Linked Data for Digital Humanities research at Media Archives
 
Structural Biology in the Clouds: A Success Story of 10 years
Structural Biology in the Clouds: A Success Story of 10 yearsStructural Biology in the Clouds: A Success Story of 10 years
Structural Biology in the Clouds: A Success Story of 10 years
 
Progress Reprot.pptx
Progress Reprot.pptxProgress Reprot.pptx
Progress Reprot.pptx
 
画像キャプションと動作認識の最前線 〜データセットに注目して〜(第17回ステアラボ人工知能セミナー)
画像キャプションと動作認識の最前線 〜データセットに注目して〜(第17回ステアラボ人工知能セミナー)画像キャプションと動作認識の最前線 〜データセットに注目して〜(第17回ステアラボ人工知能セミナー)
画像キャプションと動作認識の最前線 〜データセットに注目して〜(第17回ステアラボ人工知能セミナー)
 
R&D Halfyearly Report.pptx
R&D Halfyearly Report.pptxR&D Halfyearly Report.pptx
R&D Halfyearly Report.pptx
 
Huawei STW 2018 public
Huawei STW 2018 publicHuawei STW 2018 public
Huawei STW 2018 public
 
20170410 CENTRA2 meeting - AirBox
20170410 CENTRA2 meeting - AirBox20170410 CENTRA2 meeting - AirBox
20170410 CENTRA2 meeting - AirBox
 
Intrusion Detection and Prevention System in an Enterprise Network
Intrusion Detection and Prevention System in an Enterprise NetworkIntrusion Detection and Prevention System in an Enterprise Network
Intrusion Detection and Prevention System in an Enterprise Network
 
Short TRIZ Workshop for the University of the Philippines
Short TRIZ Workshop for the University of the PhilippinesShort TRIZ Workshop for the University of the Philippines
Short TRIZ Workshop for the University of the Philippines
 
A Large-Scale Analysis of YouTube Videos Depicting Everyday Thermal Camera Use
A Large-Scale Analysis of YouTube Videos Depicting Everyday Thermal Camera UseA Large-Scale Analysis of YouTube Videos Depicting Everyday Thermal Camera Use
A Large-Scale Analysis of YouTube Videos Depicting Everyday Thermal Camera Use
 
Crowdsourcing the Acquisition and Analysis of Mobile Videos for Disaster Resp...
Crowdsourcing the Acquisition and Analysis of Mobile Videos for Disaster Resp...Crowdsourcing the Acquisition and Analysis of Mobile Videos for Disaster Resp...
Crowdsourcing the Acquisition and Analysis of Mobile Videos for Disaster Resp...
 
Data is the new oil: Big data, data mining and bio - inspiring techniques
Data is the new oil: Big data, data mining and bio - inspiring techniquesData is the new oil: Big data, data mining and bio - inspiring techniques
Data is the new oil: Big data, data mining and bio - inspiring techniques
 
Data are the new oil: Big data, data mining and bio - inspiring techniques
Data are the new oil: Big data, data mining and bio - inspiring techniquesData are the new oil: Big data, data mining and bio - inspiring techniques
Data are the new oil: Big data, data mining and bio - inspiring techniques
 
Near-Duplicate Video Retrieval by Aggregating Intermediate CNN Layers
Near-Duplicate Video Retrieval by Aggregating Intermediate CNN LayersNear-Duplicate Video Retrieval by Aggregating Intermediate CNN Layers
Near-Duplicate Video Retrieval by Aggregating Intermediate CNN Layers
 
Cloud Services for Education - HNSciCloud applied to the UP2U project
Cloud Services for Education - HNSciCloud applied to the UP2U projectCloud Services for Education - HNSciCloud applied to the UP2U project
Cloud Services for Education - HNSciCloud applied to the UP2U project
 
IEEE SMC TCHS Award Ceremony at IEEE CSR conference 2021
IEEE SMC TCHS Award Ceremony at IEEE CSR conference 2021IEEE SMC TCHS Award Ceremony at IEEE CSR conference 2021
IEEE SMC TCHS Award Ceremony at IEEE CSR conference 2021
 
Video analysis techniques
Video analysis techniquesVideo analysis techniques
Video analysis techniques
 

More from George Awad

Movie Summarization at TRECVID 2022
Movie Summarization at TRECVID 2022Movie Summarization at TRECVID 2022
Movie Summarization at TRECVID 2022George Awad
 
Disaster Scene Indexing at TRECVID 2022
Disaster Scene Indexing at TRECVID 2022Disaster Scene Indexing at TRECVID 2022
Disaster Scene Indexing at TRECVID 2022George Awad
 
Activity Detection at TRECVID 2022
Activity Detection at TRECVID 2022Activity Detection at TRECVID 2022
Activity Detection at TRECVID 2022George Awad
 
Multimedia Understanding at TRECVID 2022
Multimedia Understanding at TRECVID 2022Multimedia Understanding at TRECVID 2022
Multimedia Understanding at TRECVID 2022George Awad
 
Video Captioning at TRECVID 2022
Video Captioning at TRECVID 2022Video Captioning at TRECVID 2022
Video Captioning at TRECVID 2022George Awad
 
TRECVID 2019 Video to Text
TRECVID 2019 Video to TextTRECVID 2019 Video to Text
TRECVID 2019 Video to TextGeorge Awad
 
TRECVID 2019 Instance Search
TRECVID 2019 Instance SearchTRECVID 2019 Instance Search
TRECVID 2019 Instance SearchGeorge Awad
 
[TRECVID 2018] Video to Text
[TRECVID 2018] Video to Text[TRECVID 2018] Video to Text
[TRECVID 2018] Video to TextGeorge Awad
 
[TRECVID 2018] Instance Search
[TRECVID 2018] Instance Search[TRECVID 2018] Instance Search
[TRECVID 2018] Instance SearchGeorge Awad
 
Instance Search (INS) task at TRECVID
Instance Search (INS) task at TRECVIDInstance Search (INS) task at TRECVID
Instance Search (INS) task at TRECVIDGeorge Awad
 
Activity detection at TRECVID
Activity detection at TRECVIDActivity detection at TRECVID
Activity detection at TRECVIDGeorge Awad
 
The Video-to-Text task evaluation at TRECVID
The Video-to-Text task evaluation at TRECVIDThe Video-to-Text task evaluation at TRECVID
The Video-to-Text task evaluation at TRECVIDGeorge Awad
 
Introduction about TRECVID (The Video Retreival benchmark)
Introduction about TRECVID (The Video Retreival benchmark)Introduction about TRECVID (The Video Retreival benchmark)
Introduction about TRECVID (The Video Retreival benchmark)George Awad
 
TRECVID 2016 : Multimedia event Detection
TRECVID 2016 : Multimedia event DetectionTRECVID 2016 : Multimedia event Detection
TRECVID 2016 : Multimedia event DetectionGeorge Awad
 
TRECVID 2016 : Concept Localization
TRECVID 2016 : Concept LocalizationTRECVID 2016 : Concept Localization
TRECVID 2016 : Concept LocalizationGeorge Awad
 
TRECVID 2016 : Video to Text Description
TRECVID 2016 : Video to Text DescriptionTRECVID 2016 : Video to Text Description
TRECVID 2016 : Video to Text DescriptionGeorge Awad
 
TRECVID 2016 : Instance Search
TRECVID 2016 : Instance SearchTRECVID 2016 : Instance Search
TRECVID 2016 : Instance SearchGeorge Awad
 
TRECVID 2016 workshop
TRECVID 2016 workshopTRECVID 2016 workshop
TRECVID 2016 workshopGeorge Awad
 

More from George Awad (18)

Movie Summarization at TRECVID 2022
Movie Summarization at TRECVID 2022Movie Summarization at TRECVID 2022
Movie Summarization at TRECVID 2022
 
Disaster Scene Indexing at TRECVID 2022
Disaster Scene Indexing at TRECVID 2022Disaster Scene Indexing at TRECVID 2022
Disaster Scene Indexing at TRECVID 2022
 
Activity Detection at TRECVID 2022
Activity Detection at TRECVID 2022Activity Detection at TRECVID 2022
Activity Detection at TRECVID 2022
 
Multimedia Understanding at TRECVID 2022
Multimedia Understanding at TRECVID 2022Multimedia Understanding at TRECVID 2022
Multimedia Understanding at TRECVID 2022
 
Video Captioning at TRECVID 2022
Video Captioning at TRECVID 2022Video Captioning at TRECVID 2022
Video Captioning at TRECVID 2022
 
TRECVID 2019 Video to Text
TRECVID 2019 Video to TextTRECVID 2019 Video to Text
TRECVID 2019 Video to Text
 
TRECVID 2019 Instance Search
TRECVID 2019 Instance SearchTRECVID 2019 Instance Search
TRECVID 2019 Instance Search
 
[TRECVID 2018] Video to Text
[TRECVID 2018] Video to Text[TRECVID 2018] Video to Text
[TRECVID 2018] Video to Text
 
[TRECVID 2018] Instance Search
[TRECVID 2018] Instance Search[TRECVID 2018] Instance Search
[TRECVID 2018] Instance Search
 
Instance Search (INS) task at TRECVID
Instance Search (INS) task at TRECVIDInstance Search (INS) task at TRECVID
Instance Search (INS) task at TRECVID
 
Activity detection at TRECVID
Activity detection at TRECVIDActivity detection at TRECVID
Activity detection at TRECVID
 
The Video-to-Text task evaluation at TRECVID
The Video-to-Text task evaluation at TRECVIDThe Video-to-Text task evaluation at TRECVID
The Video-to-Text task evaluation at TRECVID
 
Introduction about TRECVID (The Video Retreival benchmark)
Introduction about TRECVID (The Video Retreival benchmark)Introduction about TRECVID (The Video Retreival benchmark)
Introduction about TRECVID (The Video Retreival benchmark)
 
TRECVID 2016 : Multimedia event Detection
TRECVID 2016 : Multimedia event DetectionTRECVID 2016 : Multimedia event Detection
TRECVID 2016 : Multimedia event Detection
 
TRECVID 2016 : Concept Localization
TRECVID 2016 : Concept LocalizationTRECVID 2016 : Concept Localization
TRECVID 2016 : Concept Localization
 
TRECVID 2016 : Video to Text Description
TRECVID 2016 : Video to Text DescriptionTRECVID 2016 : Video to Text Description
TRECVID 2016 : Video to Text Description
 
TRECVID 2016 : Instance Search
TRECVID 2016 : Instance SearchTRECVID 2016 : Instance Search
TRECVID 2016 : Instance Search
 
TRECVID 2016 workshop
TRECVID 2016 workshopTRECVID 2016 workshop
TRECVID 2016 workshop
 

Recently uploaded

Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilV3cube
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 

Recently uploaded (20)

Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 

Video Search at TRECVID 2022

  • 1. TRECVID 2022 Ad-hoc Video Search (AVS) Task Overview Information Access Division Information Technology Laboratory Georges Quénot Laboratoire d’Informatique de Grenoble, France George Awad Retrieval Group, Information Access Division, Information Technology Laboratory, NIST;
  • 2. Outline TRECVID 2022 Task Definition & Dataset Topics (Queries) Participating Teams Evaluation & Results General Observations
  • 3. Task Definition Goal: promote progress in content-based video retrieval based on end user ad-hoc (generic) textual queries that include searching for persons, objects, locations, actions and their combinations. Task: Given a test collection, a query, and a master shot boundary reference, return a ranked list of at most 1000 shots (out of 1, 425,454) which best satisfy the query. Queries: • Main : New 20 to 30 queries each year • Progress : A set of fixed 20 queries for 3 years Testing data: 9760 Vimeo Creative Commons Videos (V3C2), 1300 total hours with mean video durations of 8 min. Reflects a wide variety of content, style and source devices. Development data: • ≈2000 hours of previous IACC.1-3 (Internet Archive) data used between 2010-2018 with concept and ad-hoc query annotations. • V3C1 (Vimeo Creative Commons ) dataset, 1000 hours, with ad-hoc query annotations (used between 2019 – 2021).
  • 4. Task Parameters TRECVID 2022 System Types Description Fully Automatic (F) System uses official query directly Manually-Assisted (M) Query built manually Relevance-Feedback (R) Evaluating top-30 results up to 3 iterations Training data categories Description A Only V3C1 training data D Other training data sources E Only training data collected automatically using the query text F Only training data collected automatically using a query built manually from the official query text ->> Novelty (optional) run type to encourage retrieving non-common relevant shots easily found across systems. ->> Explainability of result items were allowed as extra optional information with the submitted shots
  • 5. The Vimeo Creative Commons Collection (V3C)* consists of ‘free’ video material sourced from the web video platform vimeo.com. It is designed to contain a wide range of content which is representative of what is found on the platform in general. All videos in the collection have been released by their creators under a Creative Commons License which allows for unrestricted redistribution. Vimeo Creative Commons Collection Partition V3C1 V3C2 V3C3 Total File Size 2.4TB 3.0TB 3.3TB 8.7TB Number of Videos 7,475 9,760 11,215 28,450 Combined Video Duration 1000 hours, 23 minutes, 50 seconds 1300 hours, 52 minutes, 48 seconds 1500 hours, 8 minutes, 57 seconds 3801 hours, 25 minutes, 35 seconds Mean Video Duration 8 minutes, 2 seconds 7 minutes, 59 seconds 8 minutes, 1 seconds 8 minutes, 1 seconds Number of Segments 1,082,659 1,425,454 1,635,580 4,143,693 * Rossetto, L., Schuldt, H., Awad, G., & Butt, A. (2019). V3C – a Research Video Collection. Proceedings of the 25th International Conference on MultiMedia Modeling.
  • 6. AVS 2022 (30 main) Queries by complexity A man with a white beard A room with blue wall A construction site A parked white car A type of cloth hanging on a rack, hanger, or line Building with columns during daytime A person is mixing ingredients in a bowl, cup, or similar type of containers A female person bending downwards A person is in the act of swinging A person wearing a light t-shirt with dark or black writing on it A woman wearing a head kerchief A man wearing black shorts A kneeling man outdoors Two or more persons in a room with a fireplace A drone landing or rising from the ground A large stone building from the outside A piece of heavy farm equipment or machine seen outdoors A clock on a wall in a room A Person, Location, or Object Person + Action Person + Object Person + Location Object + Location Object + Action
  • 7. AVS 2022 (30 main) Queries by complexity A black bird seen on a dry area sitting, walking, or eating Two persons are seen while at least one of them is speaking in a non-English language outdoors A woman is eating something outdoors A person is biking through a path in a forest An Asian bride and groom celebrating outdoors A man and a bike in the air after jumping from a ramp A woman holding or smoking a cigarette Two teams playing a game where one team have their players wearing white t-shirts. Two persons wearing white outfits and black belts demonstrate martial arts in a room with floor mats Two adults are seated in a flying paraglider in the air A ring shown on the left hand of a person A man is holding a knife in a non-kitchen location Person + Action + Location Person + Action + Object Person + Object + Location Person + Action + Object + Location Object + Action + Location
  • 8. 2022-2024 (20 progress) Queries by complexity A woman with a ponytail A person's Hands with a red nail polish A building with balconies seen from the outside during daytime A room with a wood floor A wooden bridge A round table A Person, Location, or Object A person is throwing an object away A person is washing oneself or another thing Person + Action A vehicle driving under a tunnel A big building that is being camera panned or tilted from the outside Object + Location Person + Object A man wearing a lanyard around his neck Person + Location A man is seen at a gas station Person + Action + Location A person is lying on the ground outdoors A person is rubbing part of their face using their hands Person + Action + Object A man holding a gun but not shooting A person is pouring liquid into a type of container Person + Object + Location A person wearing a ring in their nose A man wearing a dark colored hooded jacket outdoors Person + Action + Object + Location A man holding a fishing rod while being dipped in a body of water A person holding a long stick which is not a drum stick outdoors
  • 9. Teams – Main Task (33 runs) TRECVID 2022 Team Name (7 Finishers) Organization System Type Manually assisted Fully automatic Novelty run VIREO Singapore Management University; City University of Hong Kong 5 5 1 Kindai_ogu_osaka Kindai University; Osaka Gakuin University; Osaka University 4 ITI_CERTH Information Technologies Institute, Centre for Research and Technology Hellas 2 RUCAIM3-Tencent Renmin University of China 4 RUCMM Renmin University of China 4 WasedaMeiseiSoftbank Waseda University; Meisei University; SoftBank Corporation 4 CamiloUchile Uchile 4
  • 10. Teams – Progress Task (28 runs) TRECVID 2022 Team Name (6 Finishers) Organization System Type Manually assisted Fully automatic Novelty run VIREO Singapore Management University; City University of Hong Kong 5 5 Kindai_ogu_osaka Kindai University; Osaka Gakuin University; Osaka University 4 ITI_CERTH Information Technologies Institute, Centre for Research and Technology Hellas 2 RUCAIM3-Tencent Renmin University of China 4 RUCMM Renmin University of China 4 WasedaMeiseiSoftbank Waseda University; Meisei University; SoftBank Corporation 4
  • 11. Evaluation Methodology Ø NIST judged 100% of top (ranks 1 – 300) pooled results from all submissions and sampled 25% from the rest of pooled results (ranks 301 – 1000). Ø Stats of sampled and judged clips (ranks 301 to 1000) across all runs and topics Ø At minimum, 24.3 % of any run and query results were sampled and judged Ø At maximum, 76.7 % of any run and query results were sampled and judged Ø On average, 55 % of any run and query results were sampled and judged Ø One assessor per query, watched complete shot while listening to the audio. Ø Each query assumed to be binary: absent or present for each master reference shot.
  • 12. Evaluation Methodology Ø Top submitted results were double judged if at least 10 runs submitted them, and assessor judged them as false positive. Ø submitted results were double judged if keyframes of close neighbourhood (+/- 5) shots are visually similar but judged differently. Ø Extended inferred average precision (xinfAP1) was calculated using the judged and unjudged pool by sample_eval2 tool. Ø Compared runs in terms of mean extended inferred average precision across all evaluated queries. 1 J.A. Aslam, V. Pavlu and E. Yilmaz, Statistical Method for System Evaluation Using Incomplete Judgments Proceedings of the 29th ACM SIGIR Conference, Seattle, 2006. 2 https://www-nlpir.nist.gov/projects/trecvid/trecvid.tools/sample_eval/
  • 13. Human Judgments TRECVID 2022 Total Judged Shots 148 234 Total Hits 20 125 Hits at ranks 1 - 100 7762 Hits at ranks 101 - 300 7745 Hits at ranks 301 - 1000 4618 5 human assessors 30 queries 500 hours of labor work
  • 14. Sorted Overall Scores – Automatic Runs 0 0.05 0.1 0.15 0.2 0.25 0.3 C _ D _ W a s e d a M e i s e i S o f t b a n k . 2 2 _ 2 C _ D _ W a s e d a M e i s e i S o f t b a n k . 2 2 _ 3 C _ D _ W a s e d a M e i s e i S o f t b a n k . 2 2 _ 1 C _ D _ W a s e d a M e i s e i S o f t b a n k . 2 2 _ 4 C _ D _ R U C M M . 2 2 _ 2 C _ D _ R U C M M . 2 2 _ 3 C _ D _ R U C M M . 2 2 _ 4 C _ D _ R U C M M . 2 2 _ 1 C _ D _ I T I _ C E R T H . 2 2 _ 2 C _ D _ I T I _ C E R T H . 2 2 _ 1 C _ D _ k i n d a i _ o g u _ o s a k a . 2 2 _ 1 C _ D _ k i n d a i _ o g u _ o s a k a . 2 2 _ 2 C _ D _ k i n d a i _ o g u _ o s a k a . 2 2 _ 4 C _ D _ k i n d a i _ o g u _ o s a k a . 2 2 _ 3 C _ D _ R U C A I M 3 - T e n c e n t . 2 2 _ 2 C _ D _ V I R E O . 2 2 _ 3 C _ D _ V I R E O . 2 2 _ 2 C _ D _ V I R E O . 2 2 _ 4 C _ D _ R U C A I M 3 - T e n c e n t . 2 2 _ 1 C _ D _ V I R E O . 2 2 _ 1 C _ D _ R U C A I M 3 - T e n c e n t . 2 2 _ 3 C _ D _ V I R E O . 2 2 _ 5 C _ D _ R U C A I M 3 - T e n c e n t . 2 2 _ 4 N _ D _ V I R E O . 2 2 _ 6 C _ D _ C a m i l o U c h i l e . 2 2 _ 3 C _ D _ C a m i l o U c h i l e . 2 2 _ 4 C _ D _ C a m i l o U c h i l e . 2 2 _ 2 C _ D _ C a m i l o U c h i l e . 2 2 _ 1 Mean xinfAP 28 Automatic runs across 30 main queries Median = 0.176 Higher is better
  • 15. Sorted Overall Scores – Manually Assisted 0 0.02 0.04 0.06 0.08 0.1 0.12 0.14 0.16 0.18 0.2 C_D_VIREO.22_3 C_D_VIREO.22_4 C_D_VIREO.22_2 C_D_VIREO.22_1 C_D_VIREO.22_5 Mean xinfAP 5 Manually-Assisted runs across 30 main queries Higher is better
  • 16. Statistical Significance (top 10 runs) Top 10 automatic runs - randomization test (p < 0.05) F_M_C_D_WasedaMeiseiSoftbank.22_1 F_M_C_D_WasedaMeiseiSoftbank.22_3 F_M_C_D_WasedaMeiseiSoftbank.22_2 F_M_C_D_RUCMM.22_4 F_M_C_D_WasedaMeiseiSoftbank.22_4 F_M_C_D_RUCMM.22_2 F_M_C_D_RUCMM.22_3 F_M_C_D_RUCMM.22_1 F_M_C_D_ITI_CERTH.22_2 F_M_C_D_ITI_CERTH.22_1 F_M_C_D_Wase daMeiseiSoft bank.22_4 F_M_C_D_ITI_ CERTH.22_2 F_M_C_D_ITI_ CERTH.22_1 No statistical diff. between WasedaMeiseiSoftbank runs 1, 3, & 2. No statistical diff. between all RUCMM runs. ITI_CERTH run 2 is better than run 1
  • 17. Statistical Significance (top 10 runs) 5 manually-assisted runs - randomization test (p < 0.05) M_M_C_D_VIREO.22_3 M_M_C_D_VIREO.22_4 M_M_C_D_VIREO.22_1 M_M_C_D_VIREO.22_2 M_M_C_D_VIREO.22_5 VIREO run 3 is better than all other VIREO runs. VIREO run 4 is better than runs 1, 2, & 5. VIREO runs 1 & 2 are better than run 5 M_M_C_D_VIREO.22_4 M_M_C_D_VIREO.22_1 M_M_C_D_VIREO.22_2 M_M_C_D_VIREO.22_5
  • 18. Hits Per Topic 0 200 400 600 800 1000 1200 1400 1600 1800 2000 1726 1718 1709 1723 1712 1721 1708 1706 1704 1705 1702 1729 1703 1713 1701 1715 1724 1710 1714 1711 1720 1728 1730 1707 1717 1725 1719 1722 1716 1727 Number of relevant shots Query Ids Unique vs Common (submitted by at least 2 teams) True Positive Shots Unique Common Two persons wearing white outfits and black belts demonstrate martial arts in a room with floor mats A person is biking through a path in a forest 36 % of all hits are unique A drone landing or rising from the ground A woman is eating something outdoors A construction site Two teams playing a game where one team have their players wearing white t-shirts. A person is mixing ingredients in a bowl, cup, or similar type of containers A room with blue walls A large stone building from the outside
  • 19. Sorted Unique Hits by Team 0 500 1000 1500 2000 2500 3000 3500 V I R E O R U C A I M 3 - T e n c e n t W a s e d a M e i s e i S o f t b a n k R U C M M k i n d a i _ o g u _ o s a k a C a m i l o U c h i l e I T I _ C E R T H Number of unique relevant shots 7336 Unique Shots from 7 teams in their automatic & manually-assisted runs Teams Top scoring teams are not necessarily contributing many unique true shots and vice-versa.
  • 20. Top runs per query (Main Task) Higher is better 0.00 0.10 0.20 0.30 0.40 0.50 0.60 0.70 0.80 0.90 701 702 703 704 705 706 707 708 709 710 711 712 713 714 715 716 717 718 719 720 721 722 723 724 725 726 727 728 729 730 InfAP Queries Top 10 automatic runs 1 2 3 4 5 6 7 8 9 10 Median A construction site A person is in the act of swinging A person is biking through a path in a forest A man is holding a knife in a non- kitchen location A kneeling man outdoors A woman wearing a head kerchief
  • 21. Top runs per query (Main Task) Higher is better 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 701 702 703 704 705 706 707 708 709 710 711 712 713 714 715 716 717 718 719 720 721 722 723 724 725 726 727 728 729 730 InfAP Queries All Manually-assisted runs 1 2 3 4 5 Median
  • 22. Easy vs Hard Queries Query A person is biking through a path in a forest A construction site A person is in the act of swinging A female person bending downwards A type of cloth hanging on a rack, hanger, or line Top 5 easiest queries (based on avg infAP of runs scored >= 0.38) Top 5 hardest queries (based on avg infAP of runs scored < 0.38) Query A kneeling man outdoors Two or more persons in a room with a fireplace A woman wearing a head kerchief A room with blue wall A person wearing a light t-shirt with dark or black writing on it Informal method of declaring easy/hard topic: 1- Threshold of 0.38 xinfAP is calculated as the mid point between all topics score range. 2- Sorted number of runs scored above / below 0.38 for any topic.
  • 24. Efficiency TRECVID 2022 1 10 100 1000 10000 100000 1000000 0 0.2 0.4 0.6 0.8 1 Time (s) InfAP Automatic Systems 1 10 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 Time (s) InfAP Manually-Assisted Systems Good and fast VIREO runs
  • 25. Progress Task Plan TRECVID 2022 Evaluation year Submission year 2022 2023 2024 2022 Systems: Submit 20 fixed progress queries 2023 Systems: Submit 20 fixed progress queries NIST: Eval 10 queries (set A) 2024 Systems: Submit 20 fixed progress queries NIST: Eval 10 queries (set B) Goals : Evaluate 10 (set A) common queries submitted in 2 years (2022 - 2023) Evaluate 10 (set B) common queries submitted in 3 years (2022 - 2024)
  • 26. A person is mixing ingredients in a bowl, cup, or similar type of containers Samples of frequent false positives A man wearing black shorts A type of cloth hanging on a rack, hanger, or line A clock on a wall in a room A ring shown on the left hand of a person
  • 27. A parked white car Samples of hard true positives A type of cloth hanging on a rack, hanger, or line Building with columns during daytime A female person bending downwards A person is in the act of swinging A kneeling man outdoors
  • 28. 2022 Main Approaches ØUse of multiple text-image / text-video common latent embeddings: VSE++, GSMN, CLIP, SLIP, … ØUse of multiple text-image / text-video annotated collections: MSR-VTT, TGIF, Flickr8k/30k, MS- COCO, Conceptual Captions, … ØUse of multiple visual and textual feature extractors ØTriplet loss with margin for embedding space learning ØLarge number of combinations and fusion (normalization | averaging) ØLightweight Attentional Feature Fusion ØStacked Cross Attention Network ØBidirectional Negation Learning (for query with negative cues) ØDual SoftMax with “background queries” ØNo more concept bank approaches but “dual task” (interpretable embeddings) ØHard to distinguish between data / features effects and algorithmic effects
  • 29. 2022 Task Observations ØSubmissions Ø 7 teams finished the main task including 6 teams submitting to the progress task with 28 runs. Ø 28 automatic systems and 5 manually-assisted systems submitted runs in the main task. Ø Run training types are dominated by “D” runs. No “E” or “F” runs. Ø No teams submitted “optional” explainability results with their runs! Ø Only 1 Novelty system submitted. Better than common runs on novelty metric. ØPerformance Ø Below 2021 & 2020 in general. However, queries are different and meant to search for more fine grained information. Ø Few automatic systems are good and fast (< 10 sec). Ø High similarity between automatic and manually-assisted systems in terms of query performance relatively to each other. Ø Top scoring teams not necessary contributing a lot of unique true shots and vice-versa. Ø About 36% of all hits are unique. 64% are common hits across the runs. Ø 13.5% of all judged shots across all queries are true positives. Ø Hard queries are the ones asked for unusual combinations of facets (compared to well-known concepts) Ø For low performance queries, usually all systems are condensed in small range. Ø For mid to high performance queries, the top 10 runs vary in their range of performance.
  • 30. Interactive Video Retrieval At MMM 2023 29th International Conference on Multimedia Modeling, January 2023, Bergen, Norway • 10 Ad-Hoc Video Search (AVS) topics : Each AVS topic has several/many target shots (from V3C1 + V3C2 datasets) that should be found. • 10 Known-Item Search (KIS) tasks, which are selected completely random on site. Each KIS task has only one single 20 s long target segment. • Registration for the task is now closed During the Video Browser Showdown (VBS)