Comparing Retrieval Effectiveness of Alternative Content Segmentation Methods for Internet Video Search

Comparing Retrieval Effectiveness of Alternative Content Segmentation Methods for Internet Video Search

Comparing Retrieval Effectiveness of
Alternative Content Segmentation Methods
for Internet Video Search

Maria Eskevich1 , Gareth J.F. Jones1 , Martha Larson2
Christian Wartena2,3
Robin Aly4 , Thijs Verschoor4 , Roeland Ordelman4

1 Centre for Digital Video Processing, Centre for Next Generation Localisation
School of Computing, Dublin City University, Dublin, Ireland
2 Delft University of Technology, Delft, The Netherlands
3 Univ. of Applied Sciences and Arts Hannover
4 University of Twente, The Netherlands


Outline

MediaEval 2011 Rich Speech Retrieval Task
3 participant groups methods
Results and examples
Conclusion
Future Work: Brave New Task at MediaEval 2012


ediaEval 2011
Rich Speech Retrieval (RSR) Task

Task Goal:
Information to be found - combination of required
audio and visual content, and speaker’s intention


ediaEval 2011

Task Goal:

Transcript 1 Transcript 2


ediaEval 2011

Task Goal:

Transcript 1 Transcript 2
Meaning 1 Meaning 2


ediaEval 2011

Task Goal:

Transcript 1 = Transcript 2
Meaning 1 = Meaning 2


ediaEval 2011

Task Goal:

Conventional retrieval


ediaEval 2011

Task Goal:

Speech act 1 = Speech act 2


ediaEval 2011

Task Goal:

Speech act 1 = Speech act 2
Extended speech retrieval (ﬁnd jump-in points)


ediaEval 2011


ediaEval 2011
Data provided to task participants:


ediaEval 2011
Videos from Internet video sharing platform blip.tv
(ME10WWW dataset: testset: 1727 episodes, ca. 300 hs)


ediaEval 2011
Automatic Speech Recognition (ASR) transcript provided
by LIMSI and Vocapia Research


ediaEval 2011
Metadata manually added by the uploader


ediaEval 2011
50 user-generated short web style queries collected via
crowdsourcing, associated with following speech act types:


ediaEval 2011
’expressives’: apology (1), opinion (21)
’assertives’: deﬁnition (17)
’directives’: warning (6)
’commissives’: promise (5)


ediaEval 2011
Data available for results assessment:


ediaEval 2011
Time of the relevant item for the labeled speech act


ediaEval 2011
Time of the relevant item for the labeled speech act
Accurate transcript of the labeled speech act


Evaluation Metrics


Evaluation Metrics

Mean Reciprocal Rank (MRR):

1
RR =
RANK
Mean Generalized Average Precision (mGAP):

1
GAP = . PENALTY
RANK


Approach 1: Sliding Window (SW)

Tag and lemmatize the words



Segmentation with sliding window:




Retrieval using BM25, BM25F - for use of metadata




Retrieval using BM25, BM25F - for use of metadata
Post-processing:


Approach 2: Speech Segments (Sp)

Segmentation based on silence points and changes of
speakers


Approach 2: Speech Segments (Sp)

Segmentation based on silence points and changes of
speakers

Search engine used: PFTijah


Approach 3: Lexical cohesion (LC)

Segmentation:
into lexically coherent segments, using 2 algorithms:
C99 and TextTiling
additional segment boundaries for silences > 0.5 sec


Approach 3: Lexical cohesion (LC)

Segmentation:
into lexically coherent segments, using 2 algorithms:
C99 and TextTiling
additional segment boundaries for silences > 0.5 sec

SMART IR system with language modeling


RSR Results: MRR and mGAP

RunName WindowSize
60 30 10
MRR mGAP MRR mGAP MRR mGAP
SW asr sh 0.37 0.32 0.32 0.27 0.19 0.19
SW asr meta sh 0.35 0.29 0.30 0.25 0.14 0.14
SW meta 0.20 0.15 0.18 0.13 0.06 0.06
Sp asr 0.34 0.27 0.27 0.22 0.16 0.16
Sp asr meta 0.34 0.25 0.26 0.21 0.15 0.15
Sp meta 0.18 0.14 0.14 0.11 0.07 0.07
LC asr tt 0.36 0.25 0.29 0.18 0.09 0.09
LC asr meta tt 0.39 0.28 0.30 0.20 0.14 0.14
LC meta 0.18 0.11 0.09 0.07 0.03 0.03


Relationship Between
Retrieval Effectiveness and Segmentation Methods

Segment:
100 % Recall of the relevant content
High Precision (30, 56 %) of the relevant content
Topic consistency


Example 1


Example 2


Overlap of query words
with ASR transcript or Metadata. Example 3



Segments on the same topic
are retrieved in top of the list;



Segments on the same topic Use of metadata for segments
are retrieved in top of the list; containing relevant content:
decrease the rank, if
segment > 1 topic
does not affect the rank, if
segment = 1 topic



Segment:
High Recall (83, 100 %)
Precision = 23 %
Several topics covered



Segment:
High Recall (83, 100 %)
Precision = 23 %
Several topics covered
− > Use of metadata increase the rank (Rank = 1)


Conclusions and Future Work

Segmentation plays signiﬁcant role in retrieving relevant
content



content
High recall and precision of the relevant content within the
segment lead to good segment ranking.



content
Related metadata is useful to improve ranking of the
segment with high recall and non relevant content.



content

− > Current exploration of segmentation methods to generate
segments that have high recall and precision of the relevant
content for the query



content

− > Current exploration of segmentation methods to generate
segments that have high recall and precision of the relevant
content for the query
Due to small size of the query set no general conclusions
on the difference based on the speech act type


ediaEval 2012 Brave New Task:
Search and Hyperlinking

Use Scenario:
A user is searching for a known segment in a video
collection.
Furthermore, because the information in the segment might
not be sufﬁcient for his information need, s/he wants to
have links to other related video segments, which may help
to satisfy information need related to this video.



Use Scenario:
collection.

Sub-tasks:



Use Scenario:
collection.

Sub-tasks:
Search: ﬁnding suitable video segments based on a short
natural language query,



Use Scenario:
collection.

Sub-tasks:
Search: ﬁnding suitable video segments based on a short
natural language query,
Linking: deﬁning links to other relevant video segments in
the collection.


ediaEval 2012

Thank you for your attention!

Welcome to MediaEval 2012! http://multimediaeval.org

Comparing Retrieval Effectiveness of Alternative Content Segmentation Methods for Internet Video Search

Recommended

Recommended

More Related Content

Similar to Comparing Retrieval Effectiveness of Alternative Content Segmentation Methods for Internet Video Search

Similar to Comparing Retrieval Effectiveness of Alternative Content Segmentation Methods for Internet Video Search (20)

Recently uploaded

Recently uploaded (20)

Comparing Retrieval Effectiveness of Alternative Content Segmentation Methods for Internet Video Search