SlideShare a Scribd company logo
1 of 38
Download to read offline
TRECVID 2019
Ad-hoc Video Search Task : Overview
Georges Quénot
Laboratoire d'Informatique de Grenoble
George Awad
Georgetown University;
National Institute of Standards and Technology
Disclaimer
The identification of any commercial product or trade name does not imply endorsement or
recommendation by the National Institute of Standards and Technology.
Outline
• Task Definition
• Video Data
• Topics (Queries)
• Participating teams
• Evaluation & results
• General observation
TRECVID 20192
TRECVID 20193
Task Definition
• Goal: promote progress in content-based retrieval based on end user ad-
hoc (generic) textual queries that include searching for persons, objects,
locations, actions and their combinations.
• Task: Given a test collection, a query, and a master shot boundary
reference, return a ranked list of at most 1000 shots (out of 1,082,657)
which best satisfy the need.
• Testing data: 7475 Vimeo Creative Commons Videos (V3C1), 1000 total
hours with mean video durations of 8 min. Reflects a wide variety of
content, style and source device.
• Development data: ≈2000 hours of previous IACC.1-3 data used between
2010-2018 with concept and ad-hoc query annotations.
TRECVID 20194
Query Development
• Test videos were viewed by 10 human assessors hired by the National
Institute of Standards and Technology (NIST).
• 4 facet descriptions of different scenes were used (if applicable):
– Who : concrete objects and beings (kind of persons, animals, things)
– What : are the objects and/or beings doing ? (generic actions, conditions/state)
– Where : locale, site, place, geographic, architectural
– When : time of day, season
• In total assessors watched random selection of ≈1% (12k videos) of the
V3C1 segmented shots.
• All random shots were selected to cover all original 7475 videos.
• 90 candidate queries chosen from human written descriptions to be
used between 2019-2021 including 20 progress topics (10 shared with
the Video Browser Showdown (VBS)).
TRECVID 20195
TV2019 Queries by complexity
• Person + Action + Object + Location (most complex)
Find shots of a woman riding or holding a bike outdoors
Find shots of a person smoking a cigarette outdoors
Find shots of a woman wearing a red dress outside in the daytime
• Person + Action + Location
Find shots of a man and a woman dancing together indoors
Find shots of a person running in the woods
Find shots of a group of people walking on the beach
• Person + Action/state + Object
Find shots of a person wearing a backpack
Find shots of a race car driver racing a car
Find shots of a person holding a tool and cutting something
• Person + Object + Location
Find shots of a person wearing shorts outdoors
Find shots of a person in front of a curtain indoors
• Person + Object
Find shots of a person with a painted face or mask
Find shots of person in front of a graffiti painted on a wall
Find shots of a person in a tent
• Object + Location
Find shots of one or more picnic tables outdoors
Find shots of coral reef underwater
Find shots of one or more art pieces on a wall
TRECVID 20196
TV2019 Queries by complexity
• Object + Action
Find shots of a drone flying
Find shots of a truck being driven in the daytime
Find shots of a door being opened by someone
Find shots of a small airplane flying from the inside
• Person + Action
Find shots of a man and a woman holding hands
Find shots of a black man singing
Find shots of a man and a woman hugging each other
• Person/being + Location
Find shots of a shirtless man standing up or walking outdoors
Find shots of one or more birds in a tree
TRECVID 20197
TV2019 Queries by complexity
• Object
Find shots of a red hat or cap
• Person
Find shots of a woman and a little boy both visible during daytime
Find shots of a bald man
Find shots of a man and a baby both visible
TRECVID 20198
TV2019 Queries by complexity
TRECVID 20199
Training and run types
• Three run submission types:
ü Fully automatic (F): System uses official query directly(37 runs)
ü Manually-assisted (M): Query built manually (10 runs)
ü Relevance Feedback (R): Allow judging top-5 once (0 runs)
• Four training data types:
ü A – used only IACC training data (7 runs)
ü D – used any other training data (33 runs)
ü E – used only training data collected automatically using
only the query text (7 run)
ü F – used only training data collected automatically using a
query built manually from the given query text (0 runs)
• New novelty run was introduced to encourage retrieving non-common
relevant shots easily found across runs.
TRECVID 201910
Main Task Finishers : 10 out of 19
Team Organization
Runs
M F R N
INF
Carnegie Mellon University(USA); Monash University
(Australia) Renmin University (China) Shandong University
(China)
- 4 -
Kindai_kobe
Department of Informatics, Kindai University; Graduate
School of System Informatics, Kobe University
- 4 - 1
EURECOM EURECOM - 3 -
IMFD_IMPRESEE Millennium Institute Foundational Research on Data
(IMFD) Chile; Impresee Inc ORAND S.A. Chile
- 4 -
ATL Alibaba group; ZheJiang University - 4 -
WasedaMeiseiSoft
bank
Waseda University; Meisei University; SoftBank
Corporation
4 1 -
VIREO City University of Hong Kong 2 4 - 1
FIU_UM Florida International University; University of Miami - 6 - 1
RUCMM
Renmin University of China; Zhejiang Gongshang
University
- 4 -
SIRET Charles University 4 - -
M: Manually-assisted, F: Fully automatic, R: Relevance feedback, N: Novelty run
TRECVID 201911
Progress Task Submitters : 9 out of 10
Team Organization
Runs
M F R N
INF
Carnegie Mellon University(USA); Monash University
(Australia) Renmin University (China) Shandong University
(China)
- 4 -
Kindai_kobe
Department of Informatics, Kindai University; Graduate
School of System Informatics, Kobe University
- 4 - -
EURECOM EURECOM - 3 -
ATL Alibaba group; ZheJiang University - 4 -
WasedaMeiseiSoft
bank
Waseda University; Meisei University; SoftBank
Corporation
4 1 -
VIREO City University of Hong Kong 2 4 - -
FIU_UM Florida International University; University of Miami - 4 - -
RUCMM
Renmin University of China; Zhejiang Gongshang
University
- 4 -
SIRET Charles University 4 - -
M: Manually-assisted, F: Fully automatic, R: Relevance feedback, N: Novelty run
Evaluation
TRECVID 201912
Each query assumed to be binary: absent or present for each
master reference shot.
NIST judged top ranked pooled results from all submissions
100% and sampled the rest of pooled results.
Metrics: Extended inferred average precision per query.
Compared runs in terms of mean extended inferred average
precision across the 30 queries.
TRECVID 201913
Mean Extended Inferred Average Precision (XInfAP)
2 pools were created for each query and sampled as:
ü Top pool (ranks 1 to 250) sampled at 100 %
ü Bottom pool (ranks 251 to 1000) sampled at 11.1 %
ü % of sampled and judged clips from rank 251 to 1000 across all runs
and topics (min= 10.8 %, max = 86.4 %, mean = 47.6 %)
Judgment process: one assessor per query, watched complete
shot while listening to the audio. infAP was calculated using the
judged and unjudged pool by sample_eval tool
30 queries
181649 total judgments
23549 total hits
10910 hits at ranks (1 to100)
8428 hits at ranks (101 to 250)
4211 hits at ranks (251 to 1000)
# Hits >> IACC
data (2016-2018)
0.00
1,000.00
2,000.00
3,000.00
4,000.00
5,000.00
611 613 615 617 619 621 623 625 627 629 631 633 635 637 639
Inf. Hits / query
TRECVID 201914
Inferred frequency of hits varies by query
Queries
Inf.hits
person in front of
a curtain indoors
woman
wearing a
red dress
outside in
the daytime
0.5% of test shots
person holding a tool and
cutting something
truck being driven
in the daytime
shirtless
man
standing up
or walking
outdoors
TRECVID 201915
Total unique relevant shots contributed by team
across all runs
0
200
400
600
800
1000
1200
VIREO
EURECOM
ATL
FIU_UM
W
asedaM
eiseiSoftbank
IM
FD_IM
PRESEEkindai_kobe
RUCM
M
SIRET
Inf
Numberoftrueuniqueshots
Top scoring teams
not necessary
contributing a lot of
unique true shots
Novelty Metric
TRECVID 201916
• Goal
Novelty runs are supposed to retrieve more unique relevant shots as opposed
to more common relevant shots easily found by most runs.
• Metric
1- A weight is given to each topic and shot pairs in the ground truth such that
highest weight is given to unique shots:
TopicX_ShotY_weight = 1 - (N/M)
Where N : Number of times Shot Y was retrieved for topic X by any run submission.
M : Number of total runs submitted by all teams
E.g. A unique relevant shot weight = 0.978 (given 47 runs in 2019), a shot submitted by all runs = 0.
2- For Run R and for all topics, we calculate the summation S of all *unique*
shot weights ONLY.
Final novelty score = S/30 (the mean across all evaluated 30 topics)
TRECVID 201917
Novelty scores
0
5
10
15
20
25
30
35
F_M
_N_E_FIU_UM
.19_7
F_M
_N_D_VIREO.19_5
F_M
_N_D_kindai_kobe.19_5
F_M
_C_A_EURECOM
.19_2
F_M
_C_D_IM
FD_IM
PRESEE.19_3
F_M
_C_A_EURECOM
.19_1
F_M
_C_D_W
asedaM
eiseiSoftbank.19_1
F_M
_C_D_ATL.19_3
F_M
_C_D_IM
FD_IM
PRESEE.19_2
F_M
_C_D_ATL.19_1
F_M
_C_A_EURECOM
.19_3
F_M
_C_D_ATL.19_4
M
_M
_C_D_W
asedaM
eiseiSoftbank.19…
F_M
_C_D_IM
FD_IM
PRESEE.19_4
F_M
_C_D_RUCM
M
.19_2
F_M
_C_D_ATL.19_2
M
_M
_C_D_W
asedaM
eiseiSoftbank.19…
F_M
_C_D_RUCM
M
.19_3
M
_M
_C_A_SIRET.19_1
M
_M
_C_A_SIRET.19_3
F_M
_C_D_IM
FD_IM
PRESEE.19_1
F_M
_C_D_RUCM
M
.19_4
M
_M
_C_A_SIRET.19_4
M
_M
_C_A_SIRET.19_2
M
_M
_C_D_W
asedaM
eiseiSoftbank.19…
F_M
_C_D_RUCM
M
.19_1
F_M
_C_D_Inf.19_4
F_M
_C_D_Inf.19_1
Noveltymetricscore
Runs
Novelty runs
Common runs
TRECVID 201918
Sorted overall scores
(37 Fully automatic runs, 9 teams)
0
0.02
0.04
0.06
0.08
0.1
0.12
0.14
0.16
0.18
C_D_ATL.19_2
C_D_ATL.19_1
C_D_RUCMM.19_1
C_D_ATL.19_4
C_D_RUCMM.19_2
C_D_RUCMM.19_4
C_D_RUCMM.19_3
C_D_WasedaMeiseiSoftbank.…
C_D_Inf.19_3
C_D_Inf.19_2
C_D_Inf.19_4
C_D_Inf.19_1
C_D_ATL.19_3
C_D_kindai_kobe.19_2
C_D_kindai_kobe.19_1
C_E_FIU_UM.19_4
N_D_kindai_kobe.19_5
C_E_FIU_UM.19_1
C_D_kindai_kobe.19_3
C_E_FIU_UM.19_3
C_E_FIU_UM.19_6
N_D_VIREO.19_5
C_D_VIREO.19_2
C_E_FIU_UM.19_2
N_E_FIU_UM.19_7
C_E_FIU_UM.19_5
C_D_VIREO.19_4
C_D_VIREO.19_3
C_D_kindai_kobe.19_4
C_D_IMFD_IMPRESEE.19_2
C_D_IMFD_IMPRESEE.19_1
C_D_IMFD_IMPRESEE.19_3
C_D_IMFD_IMPRESEE.19_4
C_D_VIREO.19_1
C_A_EURECOM.19_3
C_A_EURECOM.19_2
C_A_EURECOM.19_1
MeanInf.AP
Runs
Median = 0.08
TRECVID 201919
Sorted scores
(10 Manually-assisted runs, 3 teams)
0
0.02
0.04
0.06
0.08
0.1
0.12
0.14
0.16
C_D_W
asedaM
eiseiSoftbank.1…
C_D_W
asedaM
eiseiSoftbank.1…
C_D_W
asedaM
eiseiSoftbank.1…
C_D_VIREO.19_2
C_D_W
asedaM
eiseiSoftbank.1…
C_D_VIREO.19_1
C_A_SIRET.19_3
C_A_SIRET.19_2
C_A_SIRET.19_1
C_A_SIRET.19_4
MeanInf.AP
Runs
Median = 0.09
TRECVID 201920
Top 10 runs by query (Fully Automatic)
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
611
613
615
617
619
621
623
625
627
629
631
633
635
637
639
Inf.AP
1
2
3
4
5
6
7
8
9
10
Median
Topics
person wearing a
backpack
inside views of a
small airplane flying
person in front of a
graffiti painted on a wall
Person holding tool
and cutting something
TRECVID 201921
Top 10 runs by query (Manually-Assisted)
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7 611
613
615
617
619
621
623
625
627
629
631
633
635
637
639
Inf.AP
1
2
3
4
5
6
7
8
9
10
Median
Topics
Unique vs Common relevant shots
0
100
200
300
400
500
600
700
800
900
1000
1100
1200
1300
1400
1500
1600
1700
1800
1900
2000
2100
1611
1612
1613
1614
1615
1616
1617
1618
1619
1620
1621
1622
1623
1624
1625
1626
1627
1628
1629
1630
1631
1632
1633
1634
1635
1636
1637
1638
1639
1640
Unique Overlapped
TRECVID 201922
13% of all hits are
unique
Performance in the last 4 years ?
TRECVID 201923
IACC.3 Dataset V3C1 Dataset
Automatic 2016 2017 2018 2019
Teams 9 8 10 9
Runs 30 33 33 37
Min xInfAP 0 0.026 0.003 0.014
Max xInfAP 0.054 0.206 0.121 0.163
Median xInfAP 0.024 0.092 0.058 0.08
Manually-Assisted 2016 2017 2018 2019
Teams 8 5 6 3
Runs 22 19 16 10
Min xInfAP 0.005 0.048 0.012 0.033
Max xInfAP 0.169 0.207 0.106 0.152
Median xInfAP 0.043 0.111 0.072 0.09
TRECVID 201924
Top 10 Easy
sorted by count of runs with InfAP >= 0.3
Top 10 Hard
sorted by count of runs with InfAP < 0.3
person in front of a graffiti painted on a wall one or more picnic tables outdoors
coral reef underwater inside views of a small airplane flying
person in front of a curtain indoors person holding a tool and cutting something
person wearing shorts outdoors door being opened by someone
person wearing a backpack woman wearing a red dress outside in the
daytime
bald man a black man singing
person with a painted face or mask truck being driven in the daytime
shirtless man standing up or walking outdoors man and a woman holding hands
man and a baby both visible man and a woman hugging each other
drone flying woman riding or holding a bike outdoors
Easy vs difficult topics overall (2019)
Easiness
Hardness
TRECVID 201925
Statistical significant differences among top 10 “F” runs
(using randomization test, p < 0.05)
Run Mean Inf. AP score
C_D_ATL.19_2 0.163 #
C_D_ATL.19_1 0.161 #
C_D_RUCMM.19_1 0.160 #
C_D_ATL.19_4 0.157 #
C_D_RUCMM.19_2 0.152 #
C_D_RUCMM.19_4 0.127 *
C_D_RUCMM.19_3 0.124 *
C_D_WasedaMeiseiSoftbank.19_1 0.123 *
C_D_Inf.19_3 0.118 *
C_D_Inf.19_2 0.118 *
#* : no significant
difference among
each set of runs
Ø Runs higher
in the
hierarchy are
significantly
better than
runs more
indented.
C_D_RUCMM.19_2
Ø C_D_RUCMM.19_3
Ø C_D_RUCMM.19_4
Ø C_D_Inf.19_2
Ø C_D_Inf.19_3
C_D_RUCMM.19_1
Ø C_D_RUCMM.19_3
Ø C_D_RUCMM.19_4
Ø C_D_Inf.19_2
Ø C_D_Inf.19_3
C_D_ATL.19_1
Ø C_D_RUCMM.19_3
Ø C_D_RUCMM.19_4
Ø C_D_Inf.19_2
Ø C_D_Inf.19_3
C_D_ATL.19_2
Ø C_D_RUCMM.19_3
Ø C_D_RUCMM.19_4
Ø C_D_Inf.19_2
Ø C_D_Inf.19_3
C_D_ATL.19_4
Ø C_D_RUCMM.19_3
Ø C_D_RUCMM.19_4
Ø C_D_Inf.19_2
TRECVID 201926
Statistical significant differences among top 10 “M”
runs (using randomization test, p < 0.05)
Run Mean Inf. AP score
C_D_WasedaMeiseiSoftbank.19_2 0.152
C_D_WasedaMeiseiSoftbank.19_3 0.136 #
C_D_WasedaMeiseiSoftbank.19_1 0.133 #
C_D_VIREO.19_2 0.118 #
C_D_WasedaMeiseiSoftbank.19_4 0.114
C_D_VIREO.19_1 0.066 *
C_A_SIRET.19_3 0.035 *
C_A_SIRET.19_2 0.035 *
C_A_SIRET.19_1 0.034 !
C_A_SIRET.19_4 0.033 !
!#* : no significant difference
among each set of runs
Ø Runs higher in the hierarchy are
significantly better than runs more
indented.
C_D_WasedaMeiseiSoftbank.19_2
Ø C_D_WasedaMeiseiSoftbank.19_3
Ø C_D_WasedaMeiseiSoftbank.19_4
Ø C_A_SIRET.19_3
Ø C_A_SIRET.19_2
Ø C_D_VIREO.19_1
Ø C_A_SIRET.19_1
Ø C_A_SIRET.19_4
Ø C_D_WasedaMeiseiSoftbank.19_1
Ø C_D_WasedaMeiseiSoftbank.19_4
Ø C_A_SIRET.19_3
Ø C_A_SIRET.19_2
Ø C_D_VIREO.19_1
Ø C_A_SIRET.19_1
Ø C_A_SIRET.19_4
Ø C_D_VIREO.19_2
Ø C_A_SIRET.19_3
Ø C_A_SIRET.19_2
Ø C_D_VIREO.19_1
Ø C_A_SIRET.19_1
Ø C_A_SIRET.19_4
TRECVID 201927
Processing time vs Inf. AP (“F” runs)
Across all topics and runs
1
10
100
1000
10000
100000
0 0.2 0.4 0.6
Time(s)
Inf. AP
Few topics
with fast
response
and high
score
TRECVID 201928
Processing time vs Inf. AP (“M” runs)
Across all topics and runs
1
10
100
1000
10000
0 0.1 0.2 0.3 0.4 0.5
Time(s)
Inf. AP
Consuming longer
time processing
queries didn’t help
many teams/topics
Samples of (tricky/failed) results
TRECVID 201929
Truck driven in the daytime Drone flying Person in a tent
Person wearing shorts
outdoors
Man and a woman holding hands Black man singing
Birds in a tree Red hat or a cap
2019 Main approaches
• Two main competing approaches: “concept banks”
and “(visual-textual) embedding spaces”
• Currently: significant advantage for “embedding
space” approaches, especially for fully automatic
search and even overall
• Training data for semantic spaces: MSR and TRECVid
VTT tasks, TGIF, IACC.3, Flickr8k, Flickr30k, MS
COCO], and Conceptual Captions
TRECVID 201930
2019 Main approaches
• Alibaba Group (presentation to follow):
– Fully automatic (0.163): mapping video embedding and language
embedding into a learned semantic space with graph sequence and
aggregated modeling, and gated CNNs
• Renmin University of China and Zhejiang Gongshang
University (presentation to follow):
– Fully automatic (0.160): Word to Visual Word (W2VV++) similar to
TRECVid 2018 plus “dual encoding network” and BERT as text encoder
• Waseda University; Meisei University; SoftBank Corporation
(presentation to follow):
– Manually assisted (0.152): concept-based retrieval similar to previous
years’ concept bank approach
– Fully automatic (0.123): visual-semantic embedding (VSE++)
TRECVID 201931
2019 Main approaches
• Shandong Normal University; Carnegie Mellon University;
Monash University:
– Fully automatic (0.118): submitted fully automatic runs but notebook
paper currently only about their INS task participation.
• City University of Hong Kong (VIRE0) and Eurecom:
– Manually assisted runs (0.118): concept based approach with manual
query parsing and manual concept filtering
– Fully automatic (0.075): concept based approach
• Kindai University and Kobe University:
– Fully automatic (0.087): embedding that maps visual and textual
information into a common space
• Florida International University; University of Miami
(presentation to follow)
– Fully automatic (0.082): weighted concept fusion and W2VV
TRECVID 201932
2019 Task observations
• New dataset : Vimeo Creative Commons Collection (V3C1) is being used for testing
• Development of 90 queries to be used between 2019-2021 including progress subtask.
• Run training types are dominated by “D” runs. No relevance feedback submissions
received.
• New “novelty” run type (and metric). Novelty runs proved to submit unique true shots
compared to common run types.
• Stable team participation and task completion rate. Manually-assisted runs decreasing.
• High participation in the progress subtask
• Absolute number of hits are higher than previous years.
• We can’t compare performance with IACC.3 (2016-2018) : New dataset + New queries
• Fully automatic and Manually-assisted performance are almost similar.
• Among high scoring topics, there is more room for improvement among systems.
• Among low scoring topics, most systems scores are collapsed in small narrow range.
• Dynamic topics (actions, interactions, multi-facets ..etc) are the hardest topics.
• Most systems are slow. Few topics scored high in fast time.
• Task is still challenging!
TRECVID 201933
RUCMM 2019 system on previous years
TRECVID 201934
TRECVID 201935
At MMM 2020
26th International Conference on Multimedia Modeling,
January 5-8, 2020 Daejeon, Korea
• 10 Ad-Hoc Video Search (AVS) topics : Each AVS topic has several/many target
shots that should be found.
• 10 Known-Item Search (KIS) tasks, which are selected completely random on
site. Each KIS task has only one single 20 s long target segment.
• Registration for the task is now closed
Interactive Video Retrieval subtask will be held
as part of the Video Browser Showdown (VBS)
TRECVID 201936
9:10 – 12:20 : Ad-hoc Video Search
9:10 – 9:40 am Ad-hoc Video Search Task Overview
9:40 – 10:10 am Learn to Represent Queries and Videos for Ad-hoc Video
Search, RUCMM Team - Renmin University of China; Zhejiang Gongshang University
10:10 – 10:40 am Zero-shot Video Retrieval for Ad-hoc Video Search Task
WasedaMeiseiSoftbank Team – Waseda University; Meisei University; SoftBank Corporation
10:40 – 11:00 am Break with refreshments
11:00 – 11:30 am Query-Based Concept Tree for Score Fusion in Ad-hoc Video
Search Task, FIU_UM Team – Florida Intl. University; University of
Miami
11:30 – 12:00 pm Hybrid Sequence Encoder for Text Based Video Retrieval ATL
Team – Alibaba Group
12:00 – 12:20 pm AVS Task discussion
2019 Questions and 2020 plans
TRECVID 201937
• Was the task/queries realistic enough?!
• How teams feel the difference between IACC data vs V3C ?
• Do we need to change/add/remove anything to the task in 2020 ?
• Is there any specific reason for the low submissions in “E” & “F” training type
runs? (training data collected automatically from the given query text)
• Do we need the relevance feedback run type? 0 submissions this year.
• Did any team run their 2019 system on IACC.3 (2016-2018) topics ? (Yes)
• Any feedback about the new novelty metric (runs)?
• Engineering versus research efforts?
• Shared “consolidated” concept banks?
• How to encourage teams to share resources/concept models,… etc.
• Current plan is to continue the task V3C1 for main and progress subtask.
• Please continue participating in the “progress subtask” to measure accurate
performance difference
• What about an explainability subtask (related to embedding approaches)?
AVS Progress subtask
TRECVID 201938
Evaluation year
Submission
year
2019 2020 2021
2019
Submit 50 queries (30
new + 20 common)
Eval 30 new Queries
2020
Submit 40 queries (20
new + 20 common)
Eval 30 (20 new + 10
common)
2021
Submit 40 queries (20
new + 20 common)
Eval 30 (20 New + 10
common)
Goals : Evaluate 10 (set A) common queries submitted in 2 years (2019, 2020)
Evaluate 10 (set B) common queries submitted in 3 years (2019, 2020, 2021)
Evaluate 20 common queries submitted in 3 years (2019 , 2020, 2021)
Ground truth for 20 common queries can be released only in 2021

More Related Content

Similar to TRECVID 2019 Ad-hoc Video Search

How to Deliver Data Insights by Fmr Google Sr Analytical Lead
How to Deliver Data Insights by Fmr Google Sr Analytical LeadHow to Deliver Data Insights by Fmr Google Sr Analytical Lead
How to Deliver Data Insights by Fmr Google Sr Analytical LeadProduct School
 
[TRECVID 2018] Video to Text
[TRECVID 2018] Video to Text[TRECVID 2018] Video to Text
[TRECVID 2018] Video to TextGeorge Awad
 
Transparency and Contracts: Continuous Integration and Delivery in the Automo...
Transparency and Contracts: Continuous Integration and Delivery in the Automo...Transparency and Contracts: Continuous Integration and Delivery in the Automo...
Transparency and Contracts: Continuous Integration and Delivery in the Automo...Eric Knauss
 
Master's Thesis Presentation
Master's Thesis PresentationMaster's Thesis Presentation
Master's Thesis PresentationWajdi Khattel
 
TRECVID 2019 introduction slides
TRECVID 2019 introduction slidesTRECVID 2019 introduction slides
TRECVID 2019 introduction slidesGeorge Awad
 
Explosives Detection Final NSF I-Corps Presentation
Explosives Detection Final NSF I-Corps PresentationExplosives Detection Final NSF I-Corps Presentation
Explosives Detection Final NSF I-Corps PresentationStanford University
 
Week six team_a_presentation
Week six team_a_presentationWeek six team_a_presentation
Week six team_a_presentationCharles James
 
An Evaluation of Diverting Mobile Calls via VoIP While Traveling Abroad: The ...
An Evaluation of Diverting Mobile Calls via VoIP While Traveling Abroad: The ...An Evaluation of Diverting Mobile Calls via VoIP While Traveling Abroad: The ...
An Evaluation of Diverting Mobile Calls via VoIP While Traveling Abroad: The ...Ibrahim Malek
 
CERIC 2015 Survey of Career Service Professionals, Government Sector
CERIC 2015 Survey of Career Service Professionals, Government SectorCERIC 2015 Survey of Career Service Professionals, Government Sector
CERIC 2015 Survey of Career Service Professionals, Government SectorCERIC
 
CERIC 2015 Survey of Career Service Professionals, Post-Secondary Education S...
CERIC 2015 Survey of Career Service Professionals, Post-Secondary Education S...CERIC 2015 Survey of Career Service Professionals, Post-Secondary Education S...
CERIC 2015 Survey of Career Service Professionals, Post-Secondary Education S...CERIC
 
University Mobile App or Emergency App
University Mobile App or Emergency AppUniversity Mobile App or Emergency App
University Mobile App or Emergency AppTarun Pattnaik
 
CERIC 2015 Survey of Career Service Professionals, Private Sector
CERIC 2015 Survey of Career Service Professionals, Private SectorCERIC 2015 Survey of Career Service Professionals, Private Sector
CERIC 2015 Survey of Career Service Professionals, Private SectorCERIC
 
Beyond the gold
Beyond the goldBeyond the gold
Beyond the goldTheILC
 
UGC NET Paper 1 Major Topics in New Syllabus
UGC NET Paper 1 Major Topics in New SyllabusUGC NET Paper 1 Major Topics in New Syllabus
UGC NET Paper 1 Major Topics in New SyllabusDr Rajnikant Dodiya
 
Common Sense Guide to Mitigating Insider Threats
Common Sense Guide to Mitigating Insider ThreatsCommon Sense Guide to Mitigating Insider Threats
Common Sense Guide to Mitigating Insider ThreatsSammie Gillaspie
 
Improving the understanding of safety performance of commercial motorcycles i...
Improving the understanding of safety performance of commercial motorcycles i...Improving the understanding of safety performance of commercial motorcycles i...
Improving the understanding of safety performance of commercial motorcycles i...Institute for Transport Studies (ITS)
 
CERIC 2015 Survey of Career Service Professionals, Charitable & Non-Profit Se...
CERIC 2015 Survey of Career Service Professionals, Charitable & Non-Profit Se...CERIC 2015 Survey of Career Service Professionals, Charitable & Non-Profit Se...
CERIC 2015 Survey of Career Service Professionals, Charitable & Non-Profit Se...CERIC
 

Similar to TRECVID 2019 Ad-hoc Video Search (20)

How to Deliver Data Insights by Fmr Google Sr Analytical Lead
How to Deliver Data Insights by Fmr Google Sr Analytical LeadHow to Deliver Data Insights by Fmr Google Sr Analytical Lead
How to Deliver Data Insights by Fmr Google Sr Analytical Lead
 
TSA's Innovation Task Force
TSA's Innovation Task ForceTSA's Innovation Task Force
TSA's Innovation Task Force
 
[TRECVID 2018] Video to Text
[TRECVID 2018] Video to Text[TRECVID 2018] Video to Text
[TRECVID 2018] Video to Text
 
Transparency and Contracts: Continuous Integration and Delivery in the Automo...
Transparency and Contracts: Continuous Integration and Delivery in the Automo...Transparency and Contracts: Continuous Integration and Delivery in the Automo...
Transparency and Contracts: Continuous Integration and Delivery in the Automo...
 
Master's Thesis Presentation
Master's Thesis PresentationMaster's Thesis Presentation
Master's Thesis Presentation
 
TRECVID 2019 introduction slides
TRECVID 2019 introduction slidesTRECVID 2019 introduction slides
TRECVID 2019 introduction slides
 
Explosives Detection Final NSF I-Corps Presentation
Explosives Detection Final NSF I-Corps PresentationExplosives Detection Final NSF I-Corps Presentation
Explosives Detection Final NSF I-Corps Presentation
 
Week six team_a_presentation
Week six team_a_presentationWeek six team_a_presentation
Week six team_a_presentation
 
An Evaluation of Diverting Mobile Calls via VoIP While Traveling Abroad: The ...
An Evaluation of Diverting Mobile Calls via VoIP While Traveling Abroad: The ...An Evaluation of Diverting Mobile Calls via VoIP While Traveling Abroad: The ...
An Evaluation of Diverting Mobile Calls via VoIP While Traveling Abroad: The ...
 
CERIC 2015 Survey of Career Service Professionals, Government Sector
CERIC 2015 Survey of Career Service Professionals, Government SectorCERIC 2015 Survey of Career Service Professionals, Government Sector
CERIC 2015 Survey of Career Service Professionals, Government Sector
 
CERIC 2015 Survey of Career Service Professionals, Post-Secondary Education S...
CERIC 2015 Survey of Career Service Professionals, Post-Secondary Education S...CERIC 2015 Survey of Career Service Professionals, Post-Secondary Education S...
CERIC 2015 Survey of Career Service Professionals, Post-Secondary Education S...
 
University Mobile App or Emergency App
University Mobile App or Emergency AppUniversity Mobile App or Emergency App
University Mobile App or Emergency App
 
CERIC 2015 Survey of Career Service Professionals, Private Sector
CERIC 2015 Survey of Career Service Professionals, Private SectorCERIC 2015 Survey of Career Service Professionals, Private Sector
CERIC 2015 Survey of Career Service Professionals, Private Sector
 
Beyond the gold
Beyond the goldBeyond the gold
Beyond the gold
 
UGC NET Paper 1 Major Topics in New Syllabus
UGC NET Paper 1 Major Topics in New SyllabusUGC NET Paper 1 Major Topics in New Syllabus
UGC NET Paper 1 Major Topics in New Syllabus
 
Terra preta toilet
Terra preta toiletTerra preta toilet
Terra preta toilet
 
Common Sense Guide to Mitigating Insider Threats
Common Sense Guide to Mitigating Insider ThreatsCommon Sense Guide to Mitigating Insider Threats
Common Sense Guide to Mitigating Insider Threats
 
Improving the understanding of safety performance of commercial motorcycles i...
Improving the understanding of safety performance of commercial motorcycles i...Improving the understanding of safety performance of commercial motorcycles i...
Improving the understanding of safety performance of commercial motorcycles i...
 
Machine learning
Machine learningMachine learning
Machine learning
 
CERIC 2015 Survey of Career Service Professionals, Charitable & Non-Profit Se...
CERIC 2015 Survey of Career Service Professionals, Charitable & Non-Profit Se...CERIC 2015 Survey of Career Service Professionals, Charitable & Non-Profit Se...
CERIC 2015 Survey of Career Service Professionals, Charitable & Non-Profit Se...
 

More from George Awad

Movie Summarization at TRECVID 2022
Movie Summarization at TRECVID 2022Movie Summarization at TRECVID 2022
Movie Summarization at TRECVID 2022George Awad
 
Disaster Scene Indexing at TRECVID 2022
Disaster Scene Indexing at TRECVID 2022Disaster Scene Indexing at TRECVID 2022
Disaster Scene Indexing at TRECVID 2022George Awad
 
Activity Detection at TRECVID 2022
Activity Detection at TRECVID 2022Activity Detection at TRECVID 2022
Activity Detection at TRECVID 2022George Awad
 
Multimedia Understanding at TRECVID 2022
Multimedia Understanding at TRECVID 2022Multimedia Understanding at TRECVID 2022
Multimedia Understanding at TRECVID 2022George Awad
 
Video Captioning at TRECVID 2022
Video Captioning at TRECVID 2022Video Captioning at TRECVID 2022
Video Captioning at TRECVID 2022George Awad
 
TRECVID 2019 Video to Text
TRECVID 2019 Video to TextTRECVID 2019 Video to Text
TRECVID 2019 Video to TextGeorge Awad
 
TRECVID 2019 Instance Search
TRECVID 2019 Instance SearchTRECVID 2019 Instance Search
TRECVID 2019 Instance SearchGeorge Awad
 
Instance Search (INS) task at TRECVID
Instance Search (INS) task at TRECVIDInstance Search (INS) task at TRECVID
Instance Search (INS) task at TRECVIDGeorge Awad
 
Activity detection at TRECVID
Activity detection at TRECVIDActivity detection at TRECVID
Activity detection at TRECVIDGeorge Awad
 
The Video-to-Text task evaluation at TRECVID
The Video-to-Text task evaluation at TRECVIDThe Video-to-Text task evaluation at TRECVID
The Video-to-Text task evaluation at TRECVIDGeorge Awad
 
Introduction about TRECVID (The Video Retreival benchmark)
Introduction about TRECVID (The Video Retreival benchmark)Introduction about TRECVID (The Video Retreival benchmark)
Introduction about TRECVID (The Video Retreival benchmark)George Awad
 
TRECVID ECCV2018 Tutorial (Ad-hoc Video Search)
TRECVID ECCV2018 Tutorial (Ad-hoc Video Search)TRECVID ECCV2018 Tutorial (Ad-hoc Video Search)
TRECVID ECCV2018 Tutorial (Ad-hoc Video Search)George Awad
 
TRECVID 2016 : Multimedia event Detection
TRECVID 2016 : Multimedia event DetectionTRECVID 2016 : Multimedia event Detection
TRECVID 2016 : Multimedia event DetectionGeorge Awad
 
TRECVID 2016 : Concept Localization
TRECVID 2016 : Concept LocalizationTRECVID 2016 : Concept Localization
TRECVID 2016 : Concept LocalizationGeorge Awad
 
TRECVID 2016 : Video to Text Description
TRECVID 2016 : Video to Text DescriptionTRECVID 2016 : Video to Text Description
TRECVID 2016 : Video to Text DescriptionGeorge Awad
 
TRECVID 2016 : Instance Search
TRECVID 2016 : Instance SearchTRECVID 2016 : Instance Search
TRECVID 2016 : Instance SearchGeorge Awad
 
TRECVID 2016 workshop
TRECVID 2016 workshopTRECVID 2016 workshop
TRECVID 2016 workshopGeorge Awad
 

More from George Awad (17)

Movie Summarization at TRECVID 2022
Movie Summarization at TRECVID 2022Movie Summarization at TRECVID 2022
Movie Summarization at TRECVID 2022
 
Disaster Scene Indexing at TRECVID 2022
Disaster Scene Indexing at TRECVID 2022Disaster Scene Indexing at TRECVID 2022
Disaster Scene Indexing at TRECVID 2022
 
Activity Detection at TRECVID 2022
Activity Detection at TRECVID 2022Activity Detection at TRECVID 2022
Activity Detection at TRECVID 2022
 
Multimedia Understanding at TRECVID 2022
Multimedia Understanding at TRECVID 2022Multimedia Understanding at TRECVID 2022
Multimedia Understanding at TRECVID 2022
 
Video Captioning at TRECVID 2022
Video Captioning at TRECVID 2022Video Captioning at TRECVID 2022
Video Captioning at TRECVID 2022
 
TRECVID 2019 Video to Text
TRECVID 2019 Video to TextTRECVID 2019 Video to Text
TRECVID 2019 Video to Text
 
TRECVID 2019 Instance Search
TRECVID 2019 Instance SearchTRECVID 2019 Instance Search
TRECVID 2019 Instance Search
 
Instance Search (INS) task at TRECVID
Instance Search (INS) task at TRECVIDInstance Search (INS) task at TRECVID
Instance Search (INS) task at TRECVID
 
Activity detection at TRECVID
Activity detection at TRECVIDActivity detection at TRECVID
Activity detection at TRECVID
 
The Video-to-Text task evaluation at TRECVID
The Video-to-Text task evaluation at TRECVIDThe Video-to-Text task evaluation at TRECVID
The Video-to-Text task evaluation at TRECVID
 
Introduction about TRECVID (The Video Retreival benchmark)
Introduction about TRECVID (The Video Retreival benchmark)Introduction about TRECVID (The Video Retreival benchmark)
Introduction about TRECVID (The Video Retreival benchmark)
 
TRECVID ECCV2018 Tutorial (Ad-hoc Video Search)
TRECVID ECCV2018 Tutorial (Ad-hoc Video Search)TRECVID ECCV2018 Tutorial (Ad-hoc Video Search)
TRECVID ECCV2018 Tutorial (Ad-hoc Video Search)
 
TRECVID 2016 : Multimedia event Detection
TRECVID 2016 : Multimedia event DetectionTRECVID 2016 : Multimedia event Detection
TRECVID 2016 : Multimedia event Detection
 
TRECVID 2016 : Concept Localization
TRECVID 2016 : Concept LocalizationTRECVID 2016 : Concept Localization
TRECVID 2016 : Concept Localization
 
TRECVID 2016 : Video to Text Description
TRECVID 2016 : Video to Text DescriptionTRECVID 2016 : Video to Text Description
TRECVID 2016 : Video to Text Description
 
TRECVID 2016 : Instance Search
TRECVID 2016 : Instance SearchTRECVID 2016 : Instance Search
TRECVID 2016 : Instance Search
 
TRECVID 2016 workshop
TRECVID 2016 workshopTRECVID 2016 workshop
TRECVID 2016 workshop
 

Recently uploaded

08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphNeo4j
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?XfilesPro
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
Azure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAzure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAndikSusilo4
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksSoftradix Technologies
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 

Recently uploaded (20)

08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Azure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAzure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & Application
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other Frameworks
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 

TRECVID 2019 Ad-hoc Video Search

  • 1. TRECVID 2019 Ad-hoc Video Search Task : Overview Georges Quénot Laboratoire d'Informatique de Grenoble George Awad Georgetown University; National Institute of Standards and Technology Disclaimer The identification of any commercial product or trade name does not imply endorsement or recommendation by the National Institute of Standards and Technology.
  • 2. Outline • Task Definition • Video Data • Topics (Queries) • Participating teams • Evaluation & results • General observation TRECVID 20192
  • 3. TRECVID 20193 Task Definition • Goal: promote progress in content-based retrieval based on end user ad- hoc (generic) textual queries that include searching for persons, objects, locations, actions and their combinations. • Task: Given a test collection, a query, and a master shot boundary reference, return a ranked list of at most 1000 shots (out of 1,082,657) which best satisfy the need. • Testing data: 7475 Vimeo Creative Commons Videos (V3C1), 1000 total hours with mean video durations of 8 min. Reflects a wide variety of content, style and source device. • Development data: ≈2000 hours of previous IACC.1-3 data used between 2010-2018 with concept and ad-hoc query annotations.
  • 4. TRECVID 20194 Query Development • Test videos were viewed by 10 human assessors hired by the National Institute of Standards and Technology (NIST). • 4 facet descriptions of different scenes were used (if applicable): – Who : concrete objects and beings (kind of persons, animals, things) – What : are the objects and/or beings doing ? (generic actions, conditions/state) – Where : locale, site, place, geographic, architectural – When : time of day, season • In total assessors watched random selection of ≈1% (12k videos) of the V3C1 segmented shots. • All random shots were selected to cover all original 7475 videos. • 90 candidate queries chosen from human written descriptions to be used between 2019-2021 including 20 progress topics (10 shared with the Video Browser Showdown (VBS)).
  • 5. TRECVID 20195 TV2019 Queries by complexity • Person + Action + Object + Location (most complex) Find shots of a woman riding or holding a bike outdoors Find shots of a person smoking a cigarette outdoors Find shots of a woman wearing a red dress outside in the daytime • Person + Action + Location Find shots of a man and a woman dancing together indoors Find shots of a person running in the woods Find shots of a group of people walking on the beach • Person + Action/state + Object Find shots of a person wearing a backpack Find shots of a race car driver racing a car Find shots of a person holding a tool and cutting something
  • 6. • Person + Object + Location Find shots of a person wearing shorts outdoors Find shots of a person in front of a curtain indoors • Person + Object Find shots of a person with a painted face or mask Find shots of person in front of a graffiti painted on a wall Find shots of a person in a tent • Object + Location Find shots of one or more picnic tables outdoors Find shots of coral reef underwater Find shots of one or more art pieces on a wall TRECVID 20196 TV2019 Queries by complexity
  • 7. • Object + Action Find shots of a drone flying Find shots of a truck being driven in the daytime Find shots of a door being opened by someone Find shots of a small airplane flying from the inside • Person + Action Find shots of a man and a woman holding hands Find shots of a black man singing Find shots of a man and a woman hugging each other • Person/being + Location Find shots of a shirtless man standing up or walking outdoors Find shots of one or more birds in a tree TRECVID 20197 TV2019 Queries by complexity
  • 8. • Object Find shots of a red hat or cap • Person Find shots of a woman and a little boy both visible during daytime Find shots of a bald man Find shots of a man and a baby both visible TRECVID 20198 TV2019 Queries by complexity
  • 9. TRECVID 20199 Training and run types • Three run submission types: ü Fully automatic (F): System uses official query directly(37 runs) ü Manually-assisted (M): Query built manually (10 runs) ü Relevance Feedback (R): Allow judging top-5 once (0 runs) • Four training data types: ü A – used only IACC training data (7 runs) ü D – used any other training data (33 runs) ü E – used only training data collected automatically using only the query text (7 run) ü F – used only training data collected automatically using a query built manually from the given query text (0 runs) • New novelty run was introduced to encourage retrieving non-common relevant shots easily found across runs.
  • 10. TRECVID 201910 Main Task Finishers : 10 out of 19 Team Organization Runs M F R N INF Carnegie Mellon University(USA); Monash University (Australia) Renmin University (China) Shandong University (China) - 4 - Kindai_kobe Department of Informatics, Kindai University; Graduate School of System Informatics, Kobe University - 4 - 1 EURECOM EURECOM - 3 - IMFD_IMPRESEE Millennium Institute Foundational Research on Data (IMFD) Chile; Impresee Inc ORAND S.A. Chile - 4 - ATL Alibaba group; ZheJiang University - 4 - WasedaMeiseiSoft bank Waseda University; Meisei University; SoftBank Corporation 4 1 - VIREO City University of Hong Kong 2 4 - 1 FIU_UM Florida International University; University of Miami - 6 - 1 RUCMM Renmin University of China; Zhejiang Gongshang University - 4 - SIRET Charles University 4 - - M: Manually-assisted, F: Fully automatic, R: Relevance feedback, N: Novelty run
  • 11. TRECVID 201911 Progress Task Submitters : 9 out of 10 Team Organization Runs M F R N INF Carnegie Mellon University(USA); Monash University (Australia) Renmin University (China) Shandong University (China) - 4 - Kindai_kobe Department of Informatics, Kindai University; Graduate School of System Informatics, Kobe University - 4 - - EURECOM EURECOM - 3 - ATL Alibaba group; ZheJiang University - 4 - WasedaMeiseiSoft bank Waseda University; Meisei University; SoftBank Corporation 4 1 - VIREO City University of Hong Kong 2 4 - - FIU_UM Florida International University; University of Miami - 4 - - RUCMM Renmin University of China; Zhejiang Gongshang University - 4 - SIRET Charles University 4 - - M: Manually-assisted, F: Fully automatic, R: Relevance feedback, N: Novelty run
  • 12. Evaluation TRECVID 201912 Each query assumed to be binary: absent or present for each master reference shot. NIST judged top ranked pooled results from all submissions 100% and sampled the rest of pooled results. Metrics: Extended inferred average precision per query. Compared runs in terms of mean extended inferred average precision across the 30 queries.
  • 13. TRECVID 201913 Mean Extended Inferred Average Precision (XInfAP) 2 pools were created for each query and sampled as: ü Top pool (ranks 1 to 250) sampled at 100 % ü Bottom pool (ranks 251 to 1000) sampled at 11.1 % ü % of sampled and judged clips from rank 251 to 1000 across all runs and topics (min= 10.8 %, max = 86.4 %, mean = 47.6 %) Judgment process: one assessor per query, watched complete shot while listening to the audio. infAP was calculated using the judged and unjudged pool by sample_eval tool 30 queries 181649 total judgments 23549 total hits 10910 hits at ranks (1 to100) 8428 hits at ranks (101 to 250) 4211 hits at ranks (251 to 1000) # Hits >> IACC data (2016-2018)
  • 14. 0.00 1,000.00 2,000.00 3,000.00 4,000.00 5,000.00 611 613 615 617 619 621 623 625 627 629 631 633 635 637 639 Inf. Hits / query TRECVID 201914 Inferred frequency of hits varies by query Queries Inf.hits person in front of a curtain indoors woman wearing a red dress outside in the daytime 0.5% of test shots person holding a tool and cutting something truck being driven in the daytime shirtless man standing up or walking outdoors
  • 15. TRECVID 201915 Total unique relevant shots contributed by team across all runs 0 200 400 600 800 1000 1200 VIREO EURECOM ATL FIU_UM W asedaM eiseiSoftbank IM FD_IM PRESEEkindai_kobe RUCM M SIRET Inf Numberoftrueuniqueshots Top scoring teams not necessary contributing a lot of unique true shots
  • 16. Novelty Metric TRECVID 201916 • Goal Novelty runs are supposed to retrieve more unique relevant shots as opposed to more common relevant shots easily found by most runs. • Metric 1- A weight is given to each topic and shot pairs in the ground truth such that highest weight is given to unique shots: TopicX_ShotY_weight = 1 - (N/M) Where N : Number of times Shot Y was retrieved for topic X by any run submission. M : Number of total runs submitted by all teams E.g. A unique relevant shot weight = 0.978 (given 47 runs in 2019), a shot submitted by all runs = 0. 2- For Run R and for all topics, we calculate the summation S of all *unique* shot weights ONLY. Final novelty score = S/30 (the mean across all evaluated 30 topics)
  • 18. TRECVID 201918 Sorted overall scores (37 Fully automatic runs, 9 teams) 0 0.02 0.04 0.06 0.08 0.1 0.12 0.14 0.16 0.18 C_D_ATL.19_2 C_D_ATL.19_1 C_D_RUCMM.19_1 C_D_ATL.19_4 C_D_RUCMM.19_2 C_D_RUCMM.19_4 C_D_RUCMM.19_3 C_D_WasedaMeiseiSoftbank.… C_D_Inf.19_3 C_D_Inf.19_2 C_D_Inf.19_4 C_D_Inf.19_1 C_D_ATL.19_3 C_D_kindai_kobe.19_2 C_D_kindai_kobe.19_1 C_E_FIU_UM.19_4 N_D_kindai_kobe.19_5 C_E_FIU_UM.19_1 C_D_kindai_kobe.19_3 C_E_FIU_UM.19_3 C_E_FIU_UM.19_6 N_D_VIREO.19_5 C_D_VIREO.19_2 C_E_FIU_UM.19_2 N_E_FIU_UM.19_7 C_E_FIU_UM.19_5 C_D_VIREO.19_4 C_D_VIREO.19_3 C_D_kindai_kobe.19_4 C_D_IMFD_IMPRESEE.19_2 C_D_IMFD_IMPRESEE.19_1 C_D_IMFD_IMPRESEE.19_3 C_D_IMFD_IMPRESEE.19_4 C_D_VIREO.19_1 C_A_EURECOM.19_3 C_A_EURECOM.19_2 C_A_EURECOM.19_1 MeanInf.AP Runs Median = 0.08
  • 19. TRECVID 201919 Sorted scores (10 Manually-assisted runs, 3 teams) 0 0.02 0.04 0.06 0.08 0.1 0.12 0.14 0.16 C_D_W asedaM eiseiSoftbank.1… C_D_W asedaM eiseiSoftbank.1… C_D_W asedaM eiseiSoftbank.1… C_D_VIREO.19_2 C_D_W asedaM eiseiSoftbank.1… C_D_VIREO.19_1 C_A_SIRET.19_3 C_A_SIRET.19_2 C_A_SIRET.19_1 C_A_SIRET.19_4 MeanInf.AP Runs Median = 0.09
  • 20. TRECVID 201920 Top 10 runs by query (Fully Automatic) 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 611 613 615 617 619 621 623 625 627 629 631 633 635 637 639 Inf.AP 1 2 3 4 5 6 7 8 9 10 Median Topics person wearing a backpack inside views of a small airplane flying person in front of a graffiti painted on a wall Person holding tool and cutting something
  • 21. TRECVID 201921 Top 10 runs by query (Manually-Assisted) 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 611 613 615 617 619 621 623 625 627 629 631 633 635 637 639 Inf.AP 1 2 3 4 5 6 7 8 9 10 Median Topics
  • 22. Unique vs Common relevant shots 0 100 200 300 400 500 600 700 800 900 1000 1100 1200 1300 1400 1500 1600 1700 1800 1900 2000 2100 1611 1612 1613 1614 1615 1616 1617 1618 1619 1620 1621 1622 1623 1624 1625 1626 1627 1628 1629 1630 1631 1632 1633 1634 1635 1636 1637 1638 1639 1640 Unique Overlapped TRECVID 201922 13% of all hits are unique
  • 23. Performance in the last 4 years ? TRECVID 201923 IACC.3 Dataset V3C1 Dataset Automatic 2016 2017 2018 2019 Teams 9 8 10 9 Runs 30 33 33 37 Min xInfAP 0 0.026 0.003 0.014 Max xInfAP 0.054 0.206 0.121 0.163 Median xInfAP 0.024 0.092 0.058 0.08 Manually-Assisted 2016 2017 2018 2019 Teams 8 5 6 3 Runs 22 19 16 10 Min xInfAP 0.005 0.048 0.012 0.033 Max xInfAP 0.169 0.207 0.106 0.152 Median xInfAP 0.043 0.111 0.072 0.09
  • 24. TRECVID 201924 Top 10 Easy sorted by count of runs with InfAP >= 0.3 Top 10 Hard sorted by count of runs with InfAP < 0.3 person in front of a graffiti painted on a wall one or more picnic tables outdoors coral reef underwater inside views of a small airplane flying person in front of a curtain indoors person holding a tool and cutting something person wearing shorts outdoors door being opened by someone person wearing a backpack woman wearing a red dress outside in the daytime bald man a black man singing person with a painted face or mask truck being driven in the daytime shirtless man standing up or walking outdoors man and a woman holding hands man and a baby both visible man and a woman hugging each other drone flying woman riding or holding a bike outdoors Easy vs difficult topics overall (2019) Easiness Hardness
  • 25. TRECVID 201925 Statistical significant differences among top 10 “F” runs (using randomization test, p < 0.05) Run Mean Inf. AP score C_D_ATL.19_2 0.163 # C_D_ATL.19_1 0.161 # C_D_RUCMM.19_1 0.160 # C_D_ATL.19_4 0.157 # C_D_RUCMM.19_2 0.152 # C_D_RUCMM.19_4 0.127 * C_D_RUCMM.19_3 0.124 * C_D_WasedaMeiseiSoftbank.19_1 0.123 * C_D_Inf.19_3 0.118 * C_D_Inf.19_2 0.118 * #* : no significant difference among each set of runs Ø Runs higher in the hierarchy are significantly better than runs more indented. C_D_RUCMM.19_2 Ø C_D_RUCMM.19_3 Ø C_D_RUCMM.19_4 Ø C_D_Inf.19_2 Ø C_D_Inf.19_3 C_D_RUCMM.19_1 Ø C_D_RUCMM.19_3 Ø C_D_RUCMM.19_4 Ø C_D_Inf.19_2 Ø C_D_Inf.19_3 C_D_ATL.19_1 Ø C_D_RUCMM.19_3 Ø C_D_RUCMM.19_4 Ø C_D_Inf.19_2 Ø C_D_Inf.19_3 C_D_ATL.19_2 Ø C_D_RUCMM.19_3 Ø C_D_RUCMM.19_4 Ø C_D_Inf.19_2 Ø C_D_Inf.19_3 C_D_ATL.19_4 Ø C_D_RUCMM.19_3 Ø C_D_RUCMM.19_4 Ø C_D_Inf.19_2
  • 26. TRECVID 201926 Statistical significant differences among top 10 “M” runs (using randomization test, p < 0.05) Run Mean Inf. AP score C_D_WasedaMeiseiSoftbank.19_2 0.152 C_D_WasedaMeiseiSoftbank.19_3 0.136 # C_D_WasedaMeiseiSoftbank.19_1 0.133 # C_D_VIREO.19_2 0.118 # C_D_WasedaMeiseiSoftbank.19_4 0.114 C_D_VIREO.19_1 0.066 * C_A_SIRET.19_3 0.035 * C_A_SIRET.19_2 0.035 * C_A_SIRET.19_1 0.034 ! C_A_SIRET.19_4 0.033 ! !#* : no significant difference among each set of runs Ø Runs higher in the hierarchy are significantly better than runs more indented. C_D_WasedaMeiseiSoftbank.19_2 Ø C_D_WasedaMeiseiSoftbank.19_3 Ø C_D_WasedaMeiseiSoftbank.19_4 Ø C_A_SIRET.19_3 Ø C_A_SIRET.19_2 Ø C_D_VIREO.19_1 Ø C_A_SIRET.19_1 Ø C_A_SIRET.19_4 Ø C_D_WasedaMeiseiSoftbank.19_1 Ø C_D_WasedaMeiseiSoftbank.19_4 Ø C_A_SIRET.19_3 Ø C_A_SIRET.19_2 Ø C_D_VIREO.19_1 Ø C_A_SIRET.19_1 Ø C_A_SIRET.19_4 Ø C_D_VIREO.19_2 Ø C_A_SIRET.19_3 Ø C_A_SIRET.19_2 Ø C_D_VIREO.19_1 Ø C_A_SIRET.19_1 Ø C_A_SIRET.19_4
  • 27. TRECVID 201927 Processing time vs Inf. AP (“F” runs) Across all topics and runs 1 10 100 1000 10000 100000 0 0.2 0.4 0.6 Time(s) Inf. AP Few topics with fast response and high score
  • 28. TRECVID 201928 Processing time vs Inf. AP (“M” runs) Across all topics and runs 1 10 100 1000 10000 0 0.1 0.2 0.3 0.4 0.5 Time(s) Inf. AP Consuming longer time processing queries didn’t help many teams/topics
  • 29. Samples of (tricky/failed) results TRECVID 201929 Truck driven in the daytime Drone flying Person in a tent Person wearing shorts outdoors Man and a woman holding hands Black man singing Birds in a tree Red hat or a cap
  • 30. 2019 Main approaches • Two main competing approaches: “concept banks” and “(visual-textual) embedding spaces” • Currently: significant advantage for “embedding space” approaches, especially for fully automatic search and even overall • Training data for semantic spaces: MSR and TRECVid VTT tasks, TGIF, IACC.3, Flickr8k, Flickr30k, MS COCO], and Conceptual Captions TRECVID 201930
  • 31. 2019 Main approaches • Alibaba Group (presentation to follow): – Fully automatic (0.163): mapping video embedding and language embedding into a learned semantic space with graph sequence and aggregated modeling, and gated CNNs • Renmin University of China and Zhejiang Gongshang University (presentation to follow): – Fully automatic (0.160): Word to Visual Word (W2VV++) similar to TRECVid 2018 plus “dual encoding network” and BERT as text encoder • Waseda University; Meisei University; SoftBank Corporation (presentation to follow): – Manually assisted (0.152): concept-based retrieval similar to previous years’ concept bank approach – Fully automatic (0.123): visual-semantic embedding (VSE++) TRECVID 201931
  • 32. 2019 Main approaches • Shandong Normal University; Carnegie Mellon University; Monash University: – Fully automatic (0.118): submitted fully automatic runs but notebook paper currently only about their INS task participation. • City University of Hong Kong (VIRE0) and Eurecom: – Manually assisted runs (0.118): concept based approach with manual query parsing and manual concept filtering – Fully automatic (0.075): concept based approach • Kindai University and Kobe University: – Fully automatic (0.087): embedding that maps visual and textual information into a common space • Florida International University; University of Miami (presentation to follow) – Fully automatic (0.082): weighted concept fusion and W2VV TRECVID 201932
  • 33. 2019 Task observations • New dataset : Vimeo Creative Commons Collection (V3C1) is being used for testing • Development of 90 queries to be used between 2019-2021 including progress subtask. • Run training types are dominated by “D” runs. No relevance feedback submissions received. • New “novelty” run type (and metric). Novelty runs proved to submit unique true shots compared to common run types. • Stable team participation and task completion rate. Manually-assisted runs decreasing. • High participation in the progress subtask • Absolute number of hits are higher than previous years. • We can’t compare performance with IACC.3 (2016-2018) : New dataset + New queries • Fully automatic and Manually-assisted performance are almost similar. • Among high scoring topics, there is more room for improvement among systems. • Among low scoring topics, most systems scores are collapsed in small narrow range. • Dynamic topics (actions, interactions, multi-facets ..etc) are the hardest topics. • Most systems are slow. Few topics scored high in fast time. • Task is still challenging! TRECVID 201933
  • 34. RUCMM 2019 system on previous years TRECVID 201934
  • 35. TRECVID 201935 At MMM 2020 26th International Conference on Multimedia Modeling, January 5-8, 2020 Daejeon, Korea • 10 Ad-Hoc Video Search (AVS) topics : Each AVS topic has several/many target shots that should be found. • 10 Known-Item Search (KIS) tasks, which are selected completely random on site. Each KIS task has only one single 20 s long target segment. • Registration for the task is now closed Interactive Video Retrieval subtask will be held as part of the Video Browser Showdown (VBS)
  • 36. TRECVID 201936 9:10 – 12:20 : Ad-hoc Video Search 9:10 – 9:40 am Ad-hoc Video Search Task Overview 9:40 – 10:10 am Learn to Represent Queries and Videos for Ad-hoc Video Search, RUCMM Team - Renmin University of China; Zhejiang Gongshang University 10:10 – 10:40 am Zero-shot Video Retrieval for Ad-hoc Video Search Task WasedaMeiseiSoftbank Team – Waseda University; Meisei University; SoftBank Corporation 10:40 – 11:00 am Break with refreshments 11:00 – 11:30 am Query-Based Concept Tree for Score Fusion in Ad-hoc Video Search Task, FIU_UM Team – Florida Intl. University; University of Miami 11:30 – 12:00 pm Hybrid Sequence Encoder for Text Based Video Retrieval ATL Team – Alibaba Group 12:00 – 12:20 pm AVS Task discussion
  • 37. 2019 Questions and 2020 plans TRECVID 201937 • Was the task/queries realistic enough?! • How teams feel the difference between IACC data vs V3C ? • Do we need to change/add/remove anything to the task in 2020 ? • Is there any specific reason for the low submissions in “E” & “F” training type runs? (training data collected automatically from the given query text) • Do we need the relevance feedback run type? 0 submissions this year. • Did any team run their 2019 system on IACC.3 (2016-2018) topics ? (Yes) • Any feedback about the new novelty metric (runs)? • Engineering versus research efforts? • Shared “consolidated” concept banks? • How to encourage teams to share resources/concept models,… etc. • Current plan is to continue the task V3C1 for main and progress subtask. • Please continue participating in the “progress subtask” to measure accurate performance difference • What about an explainability subtask (related to embedding approaches)?
  • 38. AVS Progress subtask TRECVID 201938 Evaluation year Submission year 2019 2020 2021 2019 Submit 50 queries (30 new + 20 common) Eval 30 new Queries 2020 Submit 40 queries (20 new + 20 common) Eval 30 (20 new + 10 common) 2021 Submit 40 queries (20 new + 20 common) Eval 30 (20 New + 10 common) Goals : Evaluate 10 (set A) common queries submitted in 2 years (2019, 2020) Evaluate 10 (set B) common queries submitted in 3 years (2019, 2020, 2021) Evaluate 20 common queries submitted in 3 years (2019 , 2020, 2021) Ground truth for 20 common queries can be released only in 2021