SlideShare a Scribd company logo
1 of 36
Search and Hyperlinking 
2014 
Overview 
Maria Eskevich, Robin Aly, 
David Nicolás Racca 
Roeland Ordelman, Shu Chen, 
Gareth J.F. Jones
Find what you were (not) 
looking for 
Search & Explore
Jump-in points! 
X
Users 
Main group User Target 
Researchers & 
Educators 
Journalists Research 
Academic 
researchers & 
students 
Investigate 
Academic educators Educate 
Public users Citizens Entertainment, 
Infotainment 
Media Professionals Broadcast 
Professionals 
Reuse 
Media Archivists Annotate
Recommendation (Linking) 
Not what we want
Linking Audio-Visual Content
1998 2002 2008 
2010 2013 2015 
DATA 
BIG DATA? 
not representative 
representative
Search & Hyperlinking task 
• User oriented: aim to explore the needs of real users 
expressed as queries. 
– How: UK citizens and crowd sourcing for retrieval 
assessment 
• Temporal aspect: seek to direct users to the relevant 
parts of retrieved video (“jump-in point”). 
– How: segmentation, segment overlap, transcripts. 
prosodic, visual (low-level, high-level; keyframes) 
• Multimodal: want to investigate technologies for 
addressing variety in user needs and expectations 
– varied visual and audio contributions, intentional gap 
between query and multimodal descriptors in content
ME Search & Hyperlinking task 
in development: 2012 – 2014 
Search Hyperlinking 
2012 2013 2014 2012 2013 2014 
Dataset BlipTv BBC BlipTv BBC 
Features released: 
 Transcripts 2 ASR 3 ASR 2 ASR 3 ASR 
 Prosodic features no yes no yes 
 Visual clues for queries yes no no 
 Concept detection yes yes 
Type of the task Known-item Ad-hoc Ad-hoc 
Query/Anchors creation PC iPad PC iPad 
Number of 
queries/anchors 
30/30 4/50 50/30 30/30 11/ 98/30 
Relevance assessment MTurk users (BBC) MTurk MTurk 
Numbers of assessed cases 30 50 9 900 3 517 9 975 13 141 
Evaluation metrics MRR, MASP, MASDWP MAP(- 
bin/tol), 
MAP MAP(-bin/tol), P@5/10
Dataset: Video collection 
• BBC copyright cleared broadcast material: 
– Videos: 
• Development set: 6 weeks between 01.04.2008 and 11.05.2008 (1335 hours/2323 videos) 
• Test set: 11 weeks between 12.05.2008 and 31.07.2008 (2686 hours, 3528 videos) 
– Manually transcribed subtitles 
– Metadata 
• Additional data: 
– ASR: LIMSI/Vocapia, LIUM, NST-Sheffield 
– Shot boundaries, keyframes 
– Output of visual concept detectors by University of Leuven, and University of 
Oxford
Dataset: Query 
• 28 Users 
- Policemen, Hair dresser, Bouncer, Sales manger, 
Student, Self-employed 
• Two hour session on iPads: 
– Search the archive (document level) 
– Define clips (segment level) 
– Define anchors (anchor level) 
Statement of 
Information Need 
Search 
Refine 
Relevant Clips 
Define 
Anchors
Data cleaning: Usable Information Need 
• Description clearly specifies what is relevant 
• A query with a suitable title exists 
• Sufficient relevant segments exist (try query)
Data cleaning: Process 
• For each information need in batch 
1. check if usable 
2. If in doubt use search to search for relevant data 
3. reword & spellcheck description 
4. select the first suitable query 
5. Save
Data cleaning: Usable Anchor 
• Longer than 5 seconds 
• Destination description clearly identifies the 
material the user wants to see when he would 
activate the anchor described by label 
• It is likely that there are some relevant items 
in the collection
Data cleaning: Process 
• For each information need in assigned batch 
– Go through anchors 
• check if usable 
• reword & spellcheck description 
• Assess whether it is like to find links in the collection 
(possibly using search) 
– Save
Dataset: outcome (1/2) 
• 30 queries 
<top> 
<queryId>query_6</queryId> 
<refId>53b3cf9d42b47e4c32545510</refId> 
<queryText>saturday kitchen cocktails</queryText> 
</top> 
<top> 
<queryId>query_1</queryId> 
<refId>53b3c64b42b47e4a362be4ce</refId> 
<queryText>sightseeing london</queryText> 
</top>
Dataset: outcome (2/2) 
• 30 anchors: 
<anchor> 
<anchorId>anchor_1</anchorId> 
<refId>53b3c46f42b47e459265d06f</refId> 
<startTime>16.38</startTime> 
<endTime>17.35</endTime> 
<fileName>v20080629_184000_bbctwo_killer_wh 
ales_in_the</fileName> 
</anchor>
Ground truth creation 
• Queries/Anchors: user studies at BBC: 
- 28 users with following profile: 
 Age: 18-30 years old 
 Use of search engines and services on iPads on the daily basis 
• Relevance assessment: via crowdsourcing on Amazon MTurk platform: 
– Top 10 results from 58 search and 62 hyperlinking submissions 
– 1 judgment per query or anchor that was accepted/rejected based on 
an automated algorithm, special cases of users typos checked 
manually 
– Number of evaluated HITs: 
9 900 for search, and 13 141 for hyperlinking
• P@5/10/20 
• MAP based: 
Evaluation metrics 
• MAP: taking into account any overlapping segment: 
• MAP-bin: relevant segments are binned for relevance: 
• MAP-tol: only start times of the segments are considered:
RESULTS
Results: Search sub-task: MAP 
18 
16 
14 
12 
10 
8 
6 
4 
2 
0 
LIMSI/Vocapia Manual No ASR NST/Sheffield LIUM
Results: Search sub-task: MAP_bin 
0.45 
0.4 
0.35 
0.3 
0.25 
0.2 
0.15 
0.1 
0.05 
0 
LIMSI/Vocapia Manual No ASR NST/Sheffield LIUM
Results: Search sub-task: MAP_tol 
0.35 
0.3 
0.25 
0.2 
0.15 
0.1 
0.05 
0 
LIMSI/Vocapia Manual No ASR NST/Sheffield LIUM
Results: Hyperlinking sub-task: MAP 
0 
0.5 
1 
1.5 
2 
2.5 
3 
3.5 
4 
4.5 
CUNI_F_M_NoOverlapAu… 
CUNI_F_M_NoOverlapKSI… 
CUNI_F_M_NoOverlapKSI… 
CUNI_F_M_NoOverlapNo… 
CUNI_F_M_OverlapKSIWe… 
CUNI_F_N_NoOverlapAud… 
CUNI_F_N_NoOverlapKSI… 
CUNI_F_N_NoOverlapNo… 
CUNI_O_M_NoOverlapKSI… 
DCLab_Sh_N_Concept2 
DCLab_Sh_N_ConceptEnri… 
IRISAKUL_Ss_N_HTM 
IRISAKUL_Ss_N_NGRAM 
IRISAKUL_Ss_N_TM1 
IRISAKUL_Ss_N_TM2 
IRISAKUL_Ss_O_NGRAMN… 
JRS_F_MV_ATextVisR 
JRS_F_MV_AwConcept 
JRS_F_MV_CTextVisR 
JRS_F_MV_CwConcept 
JRS_F_M_AText 
JRS_F_M_CText 
JRS_F_V_AcOnly 
JRS_F_V_CcOnly 
LINKEDTV2014_O_O_K 
LINKEDTV2014_O_VO_KC7S 
LINKEDTV2014_O_VO_KC… 
LINKEDTV2014_Ss_N_ALL 
LINKEDTV2014_Ss_N_TEXT 
LIMSI/Vocapia Manual No ASR NST/Sheffield LIUM
Results: Hyperlinking sub-task: MAP_bin 
0 
0.05 
0.1 
0.15 
0.2 
0.25 
0.3 
0.35 
CUNI_F_M_NoOverlapA… 
CUNI_F_M_NoOverlapK… 
CUNI_F_M_NoOverlapK… 
CUNI_F_M_NoOverlapN… 
CUNI_F_M_OverlapKSI… 
CUNI_F_N_NoOverlapAu… 
CUNI_F_N_NoOverlapKS… 
CUNI_F_N_NoOverlapNo… 
CUNI_O_M_NoOverlapK… 
DCLab_Sh_N_Concept2 
DCLab_Sh_N_ConceptEn… 
IRISAKUL_Ss_N_HTM 
IRISAKUL_Ss_N_NGRAM 
IRISAKUL_Ss_N_TM1 
IRISAKUL_Ss_N_TM2 
IRISAKUL_Ss_O_NGRAM… 
JRS_F_MV_ATextVisR 
JRS_F_MV_AwConcept 
JRS_F_MV_CTextVisR 
JRS_F_MV_CwConcept 
JRS_F_M_AText 
JRS_F_M_CText 
JRS_F_V_AcOnly 
JRS_F_V_CcOnly 
LINKEDTV2014_O_O_K 
LINKEDTV2014_O_VO_K… 
LINKEDTV2014_O_VO_K… 
LINKEDTV2014_Ss_N_ALL 
LINKEDTV2014_Ss_N_TE… 
LIMSI/Vocapia Manual No ASR NST/Sheffield LIUM
Results: Hyperlinking sub-task: MAP_tol 
0 
0.05 
0.1 
0.15 
0.2 
0.25 
0.3 
CUNI_F_M_NoOverlapA… 
CUNI_F_M_NoOverlapKS… 
CUNI_F_M_NoOverlapKS… 
CUNI_F_M_NoOverlapN… 
CUNI_F_M_OverlapKSIW… 
CUNI_F_N_NoOverlapAu… 
CUNI_F_N_NoOverlapKS… 
CUNI_F_N_NoOverlapNo… 
CUNI_O_M_NoOverlapK… 
DCLab_Sh_N_Concept2 
DCLab_Sh_N_ConceptEn… 
IRISAKUL_Ss_N_HTM 
IRISAKUL_Ss_N_NGRAM 
IRISAKUL_Ss_N_TM1 
IRISAKUL_Ss_N_TM2 
IRISAKUL_Ss_O_NGRAM… 
JRS_F_MV_ATextVisR 
JRS_F_MV_AwConcept 
JRS_F_MV_CTextVisR 
JRS_F_MV_CwConcept 
JRS_F_M_AText 
JRS_F_M_CText 
JRS_F_V_AcOnly 
JRS_F_V_CcOnly 
LINKEDTV2014_O_O_K 
LINKEDTV2014_O_VO_K… 
LINKEDTV2014_O_VO_K… 
LINKEDTV2014_Ss_N_ALL 
LINKEDTV2014_Ss_N_TEXT 
LIMSI/Vocapia Manual No ASR NST/Sheffield LIUM
Lessons learned 
1. iPad vs PC = different user behaviour and 
expectation from the system. 
2. Prosodic features broaden the scope of the 
search sub-task. 
3. Use of shot segmentation based units 
achieves the worst scores for both sub-tasks. 
4. Use of metadata improves results for both 
sub-tasks.
Lessons learned 
1. iPad vs PC = different user behaviour and 
expectation from the system. 
2. Prosodic features broaden the scope of the 
search sub-task. 
3. Use of shot segmentation based units 
achieves the worst scores for both sub-tasks. 
4. Use of metadata improves results for both 
sub-tasks.
The Search and Hyperlinking task was supported by 
We are grateful to 
Jana Eggink and 
Andy O'Dwyer 
from the BBC for preparing the collection and hosting the user trials. 
... and of course Martha for advise & crowdsourcing access.
JRS at Search and Hyperlinking of Television 
Content Task 
Werner Bailer, Harald Stiegler 
MediaEval Workshop, Barcelona, Oct. 2014
Linking sub-task 
• Matching terms from textual resources 
• Reranking based on visual similarity (VLAT) 
• Using visual concepts (only/in addition) 
• Results 
– Differences between different text resources 
– Context helped only in few of the cases 
– Visual reranking provides small improvement 
– Visual concepts did not provide improvements 
35
Zsombor Paróczi, Bálint Fodor, Gábor Szűcs 
Solution with concept enrichment 
• Concept enrichment: the set of words is 
extended with their synonyms or other 
conceptually connected words. 
• Top 10 vs top 50 conceptually connected words 
for each word 
• Conclusion: the results show that concept 
enrichment with less words give better precision 
because at the opposite case the noise is greater.
Television Linked To The Web 
LinkedTV @ MediaEval 2014 
Search and Hyperlinking Task 
H.A. Le1, Q.M. Bui1, B. Huet1, B. Cervenková2, J. Bouchner2, E. Apostolidis3, 
F. Markatopoulou3, A. Pournaras3, V. Mezaris3, D. Stein4, S. Eickeler4, and M. Stadtschnitzer4 
1 - Eurecom, Sophia Antipolis, France. 
2 - University of Economics, Prague, Czech Republic. 
3 - Information Technologies Institute, CERTH, Thessaloniki, Greece. 
4 - Fraunhofer IAIS, Sankt Augustin, Germany. 
16-17 Oct 2014 
www.linkedtv.eu
Reasons to visit the LinkedTV 
poster 
• Different granularities: video level, scene 
level (visual/topic) and sentence level. 
• Different features: text (subtitles / 
transcripts), visual concepts, keywords, etc… 
LinkedTV @ MediaEval 2014 Search and Hyperlinking Task
Reasons to visit the LinkedTV 
poster 
• How to incorporate visual information 
to the search? 
• Visual concept detection in the search 
query: 
Mapping between query keywords and visual 
concepts (151 semantic concepts from 
TRECVID 2012) 
– Semantic word distance based on WordNet 
– Identification of salient visual concepts from 
Google Image search results (query keywords) 
LinkedTV @ MediaEval 2014 Search and Hyperlinking Task
Reasons to visit the LinkedTV 
poster 
• How to incorporate visual information 
to the search? 
• Integration of detected visual concepts 
to the search: 
– Designing an enriched query, based on textual 
(text query) and visual information (range 
query) 
– Fusion of text score (Solr) and visual concepts 
scores 
LinkedTV @ MediaEval 2014 Search and Hyperlinking Task

More Related Content

Viewers also liked

Comparing Retrieval Effectiveness of Alternative Content Segmentation Methods...
Comparing Retrieval Effectiveness of Alternative Content Segmentation Methods...Comparing Retrieval Effectiveness of Alternative Content Segmentation Methods...
Comparing Retrieval Effectiveness of Alternative Content Segmentation Methods...Maria Eskevich
 
Audio/Video Search: Why? What? How?
Audio/Video Search: Why? What? How?Audio/Video Search: Why? What? How?
Audio/Video Search: Why? What? How?Maria Eskevich
 
HVP Critical Assessment of Genome Interpretation
HVP Critical Assessment of Genome InterpretationHVP Critical Assessment of Genome Interpretation
HVP Critical Assessment of Genome InterpretationReece Hart
 
Focus on spoken content in multimedia retrieval
Focus on spoken content in multimedia retrievalFocus on spoken content in multimedia retrieval
Focus on spoken content in multimedia retrievalMaria Eskevich
 
Building a clinical genome interpretation services company
Building a clinical genome interpretation services companyBuilding a clinical genome interpretation services company
Building a clinical genome interpretation services companyReece Hart
 
The Clinical Significance of Transcript Alignment Discrepancies
The Clinical Significance of Transcript Alignment DiscrepanciesThe Clinical Significance of Transcript Alignment Discrepancies
The Clinical Significance of Transcript Alignment DiscrepanciesReece Hart
 
Video Hyperlinking (LNK) Task at TRECVid 2016
Video Hyperlinking (LNK) Task at TRECVid 2016Video Hyperlinking (LNK) Task at TRECVid 2016
Video Hyperlinking (LNK) Task at TRECVid 2016Maria Eskevich
 

Viewers also liked (7)

Comparing Retrieval Effectiveness of Alternative Content Segmentation Methods...
Comparing Retrieval Effectiveness of Alternative Content Segmentation Methods...Comparing Retrieval Effectiveness of Alternative Content Segmentation Methods...
Comparing Retrieval Effectiveness of Alternative Content Segmentation Methods...
 
Audio/Video Search: Why? What? How?
Audio/Video Search: Why? What? How?Audio/Video Search: Why? What? How?
Audio/Video Search: Why? What? How?
 
HVP Critical Assessment of Genome Interpretation
HVP Critical Assessment of Genome InterpretationHVP Critical Assessment of Genome Interpretation
HVP Critical Assessment of Genome Interpretation
 
Focus on spoken content in multimedia retrieval
Focus on spoken content in multimedia retrievalFocus on spoken content in multimedia retrieval
Focus on spoken content in multimedia retrieval
 
Building a clinical genome interpretation services company
Building a clinical genome interpretation services companyBuilding a clinical genome interpretation services company
Building a clinical genome interpretation services company
 
The Clinical Significance of Transcript Alignment Discrepancies
The Clinical Significance of Transcript Alignment DiscrepanciesThe Clinical Significance of Transcript Alignment Discrepancies
The Clinical Significance of Transcript Alignment Discrepancies
 
Video Hyperlinking (LNK) Task at TRECVid 2016
Video Hyperlinking (LNK) Task at TRECVid 2016Video Hyperlinking (LNK) Task at TRECVid 2016
Video Hyperlinking (LNK) Task at TRECVid 2016
 

Similar to Search and Hyperlinking Overview @MediaEval2014

The Search and Hyperlinking Task at MediaEval 2014
The Search and Hyperlinking Task at MediaEval 2014The Search and Hyperlinking Task at MediaEval 2014
The Search and Hyperlinking Task at MediaEval 2014multimediaeval
 
Investigating Crowdsourcing as an Evaluation Method for (TEL) Recommender Sy...
Investigating Crowdsourcing as an Evaluation Method for (TEL) Recommender Sy...Investigating Crowdsourcing as an Evaluation Method for (TEL) Recommender Sy...
Investigating Crowdsourcing as an Evaluation Method for (TEL) Recommender Sy...Christoph Rensing
 
Writing a scientific manuscript
Writing a scientific manuscriptWriting a scientific manuscript
Writing a scientific manuscriptMartin McMorrow
 
Metadata for Research Objects
Metadata for Research ObjectsMetadata for Research Objects
Metadata for Research Objectsseanb
 
Benchmarking Domain-specific Expert Search using Workshop Program Committees
Benchmarking Domain-specific Expert Search using Workshop Program CommitteesBenchmarking Domain-specific Expert Search using Workshop Program Committees
Benchmarking Domain-specific Expert Search using Workshop Program CommitteesToine Bogers
 
Udacity webinar on Recommendation Systems
Udacity webinar on Recommendation SystemsUdacity webinar on Recommendation Systems
Udacity webinar on Recommendation SystemsAxel de Romblay
 
Learning by example: training users through high-quality query suggestions
Learning by example: training users through high-quality query suggestionsLearning by example: training users through high-quality query suggestions
Learning by example: training users through high-quality query suggestionsClaudia Hauff
 
Personalised Access to Linked Data
Personalised Access to Linked DataPersonalised Access to Linked Data
Personalised Access to Linked DataMilan Dojchinovski
 
Cse443 Project Report - LPU (Modern Big Data Analysis with SQL Specialization)
Cse443 Project Report - LPU (Modern Big Data Analysis with SQL Specialization)Cse443 Project Report - LPU (Modern Big Data Analysis with SQL Specialization)
Cse443 Project Report - LPU (Modern Big Data Analysis with SQL Specialization)Qazi Maaz Arshad
 
Improving Semantic Search Using Query Log Analysis
Improving Semantic Search Using Query Log AnalysisImproving Semantic Search Using Query Log Analysis
Improving Semantic Search Using Query Log AnalysisStuart Wrigley
 
WWW'15: A Hybrid Resource Recommender Mimicking Attention-Interpretation Dyna...
WWW'15: A Hybrid Resource Recommender Mimicking Attention-Interpretation Dyna...WWW'15: A Hybrid Resource Recommender Mimicking Attention-Interpretation Dyna...
WWW'15: A Hybrid Resource Recommender Mimicking Attention-Interpretation Dyna...Dominik Kowald
 
[UPDATE] Udacity webinar on Recommendation Systems
[UPDATE] Udacity webinar on Recommendation Systems[UPDATE] Udacity webinar on Recommendation Systems
[UPDATE] Udacity webinar on Recommendation SystemsAxel de Romblay
 
Research on Recommender Systems: Beyond Ratings and Lists
Research on Recommender Systems: Beyond Ratings and ListsResearch on Recommender Systems: Beyond Ratings and Lists
Research on Recommender Systems: Beyond Ratings and ListsDenis Parra Santander
 
Amet University- B.Sc Data Science Syllabus
Amet University-  B.Sc Data Science SyllabusAmet University-  B.Sc Data Science Syllabus
Amet University- B.Sc Data Science Syllabusametinstitute
 
Discovery Hub: on-the-fly linked data exploratory search
Discovery Hub: on-the-fly linked data exploratory searchDiscovery Hub: on-the-fly linked data exploratory search
Discovery Hub: on-the-fly linked data exploratory searchFabien Gandon
 
Research culture presentation Sept 4, 2013
Research culture presentation Sept 4, 2013Research culture presentation Sept 4, 2013
Research culture presentation Sept 4, 2013Shawna Reibling
 
Applications of Machine Learning to Location-based Social Networks
Applications of Machine Learning to Location-based Social NetworksApplications of Machine Learning to Location-based Social Networks
Applications of Machine Learning to Location-based Social NetworksJoan Capdevila Pujol
 
Mobile Visual Search: Object Re-Identification Against Large Repositories
Mobile Visual Search: Object Re-Identification Against Large RepositoriesMobile Visual Search: Object Re-Identification Against Large Repositories
Mobile Visual Search: Object Re-Identification Against Large RepositoriesUnited States Air Force Academy
 

Similar to Search and Hyperlinking Overview @MediaEval2014 (20)

The Search and Hyperlinking Task at MediaEval 2014
The Search and Hyperlinking Task at MediaEval 2014The Search and Hyperlinking Task at MediaEval 2014
The Search and Hyperlinking Task at MediaEval 2014
 
Investigating Crowdsourcing as an Evaluation Method for (TEL) Recommender Sy...
Investigating Crowdsourcing as an Evaluation Method for (TEL) Recommender Sy...Investigating Crowdsourcing as an Evaluation Method for (TEL) Recommender Sy...
Investigating Crowdsourcing as an Evaluation Method for (TEL) Recommender Sy...
 
Writing a scientific manuscript
Writing a scientific manuscriptWriting a scientific manuscript
Writing a scientific manuscript
 
Metadata for Research Objects
Metadata for Research ObjectsMetadata for Research Objects
Metadata for Research Objects
 
Benchmarking Domain-specific Expert Search using Workshop Program Committees
Benchmarking Domain-specific Expert Search using Workshop Program CommitteesBenchmarking Domain-specific Expert Search using Workshop Program Committees
Benchmarking Domain-specific Expert Search using Workshop Program Committees
 
Udacity webinar on Recommendation Systems
Udacity webinar on Recommendation SystemsUdacity webinar on Recommendation Systems
Udacity webinar on Recommendation Systems
 
Learning by example: training users through high-quality query suggestions
Learning by example: training users through high-quality query suggestionsLearning by example: training users through high-quality query suggestions
Learning by example: training users through high-quality query suggestions
 
Personalised Access to Linked Data
Personalised Access to Linked DataPersonalised Access to Linked Data
Personalised Access to Linked Data
 
Cse443 Project Report - LPU (Modern Big Data Analysis with SQL Specialization)
Cse443 Project Report - LPU (Modern Big Data Analysis with SQL Specialization)Cse443 Project Report - LPU (Modern Big Data Analysis with SQL Specialization)
Cse443 Project Report - LPU (Modern Big Data Analysis with SQL Specialization)
 
Improving Semantic Search Using Query Log Analysis
Improving Semantic Search Using Query Log AnalysisImproving Semantic Search Using Query Log Analysis
Improving Semantic Search Using Query Log Analysis
 
WWW'15: A Hybrid Resource Recommender Mimicking Attention-Interpretation Dyna...
WWW'15: A Hybrid Resource Recommender Mimicking Attention-Interpretation Dyna...WWW'15: A Hybrid Resource Recommender Mimicking Attention-Interpretation Dyna...
WWW'15: A Hybrid Resource Recommender Mimicking Attention-Interpretation Dyna...
 
[UPDATE] Udacity webinar on Recommendation Systems
[UPDATE] Udacity webinar on Recommendation Systems[UPDATE] Udacity webinar on Recommendation Systems
[UPDATE] Udacity webinar on Recommendation Systems
 
Research on Recommender Systems: Beyond Ratings and Lists
Research on Recommender Systems: Beyond Ratings and ListsResearch on Recommender Systems: Beyond Ratings and Lists
Research on Recommender Systems: Beyond Ratings and Lists
 
Eval rec algo_crowdsourcing__icalt_2014_ma
Eval rec algo_crowdsourcing__icalt_2014_maEval rec algo_crowdsourcing__icalt_2014_ma
Eval rec algo_crowdsourcing__icalt_2014_ma
 
Amet University- B.Sc Data Science Syllabus
Amet University-  B.Sc Data Science SyllabusAmet University-  B.Sc Data Science Syllabus
Amet University- B.Sc Data Science Syllabus
 
8th sem (1)
8th sem (1)8th sem (1)
8th sem (1)
 
Discovery Hub: on-the-fly linked data exploratory search
Discovery Hub: on-the-fly linked data exploratory searchDiscovery Hub: on-the-fly linked data exploratory search
Discovery Hub: on-the-fly linked data exploratory search
 
Research culture presentation Sept 4, 2013
Research culture presentation Sept 4, 2013Research culture presentation Sept 4, 2013
Research culture presentation Sept 4, 2013
 
Applications of Machine Learning to Location-based Social Networks
Applications of Machine Learning to Location-based Social NetworksApplications of Machine Learning to Location-based Social Networks
Applications of Machine Learning to Location-based Social Networks
 
Mobile Visual Search: Object Re-Identification Against Large Repositories
Mobile Visual Search: Object Re-Identification Against Large RepositoriesMobile Visual Search: Object Re-Identification Against Large Repositories
Mobile Visual Search: Object Re-Identification Against Large Repositories
 

Recently uploaded

final waves properties grade 7 - third quarter
final waves properties grade 7 - third quarterfinal waves properties grade 7 - third quarter
final waves properties grade 7 - third quarterHanHyoKim
 
well logging & petrophysical analysis.pptx
well logging & petrophysical analysis.pptxwell logging & petrophysical analysis.pptx
well logging & petrophysical analysis.pptxzaydmeerab121
 
GENERAL PHYSICS 2 REFRACTION OF LIGHT SENIOR HIGH SCHOOL GENPHYS2.pptx
GENERAL PHYSICS 2 REFRACTION OF LIGHT SENIOR HIGH SCHOOL GENPHYS2.pptxGENERAL PHYSICS 2 REFRACTION OF LIGHT SENIOR HIGH SCHOOL GENPHYS2.pptx
GENERAL PHYSICS 2 REFRACTION OF LIGHT SENIOR HIGH SCHOOL GENPHYS2.pptxRitchAndruAgustin
 
DNA isolation molecular biology practical.pptx
DNA isolation molecular biology practical.pptxDNA isolation molecular biology practical.pptx
DNA isolation molecular biology practical.pptxGiDMOh
 
Combining Asynchronous Task Parallelism and Intel SGX for Secure Deep Learning
Combining Asynchronous Task Parallelism and Intel SGX for Secure Deep LearningCombining Asynchronous Task Parallelism and Intel SGX for Secure Deep Learning
Combining Asynchronous Task Parallelism and Intel SGX for Secure Deep Learningvschiavoni
 
Explainable AI for distinguishing future climate change scenarios
Explainable AI for distinguishing future climate change scenariosExplainable AI for distinguishing future climate change scenarios
Explainable AI for distinguishing future climate change scenariosZachary Labe
 
Observational constraints on mergers creating magnetism in massive stars
Observational constraints on mergers creating magnetism in massive starsObservational constraints on mergers creating magnetism in massive stars
Observational constraints on mergers creating magnetism in massive starsSérgio Sacani
 
KDIGO-2023-CKD-Guideline-Public-Review-Draft_5-July-2023.pdf
KDIGO-2023-CKD-Guideline-Public-Review-Draft_5-July-2023.pdfKDIGO-2023-CKD-Guideline-Public-Review-Draft_5-July-2023.pdf
KDIGO-2023-CKD-Guideline-Public-Review-Draft_5-July-2023.pdfGABYFIORELAMALPARTID1
 
Environmental Acoustics- Speech interference level, acoustics calibrator.pptx
Environmental Acoustics- Speech interference level, acoustics calibrator.pptxEnvironmental Acoustics- Speech interference level, acoustics calibrator.pptx
Environmental Acoustics- Speech interference level, acoustics calibrator.pptxpriyankatabhane
 
Environmental acoustics- noise criteria.pptx
Environmental acoustics- noise criteria.pptxEnvironmental acoustics- noise criteria.pptx
Environmental acoustics- noise criteria.pptxpriyankatabhane
 
6.2 Pests of Sesame_Identification_Binomics_Dr.UPR
6.2 Pests of Sesame_Identification_Binomics_Dr.UPR6.2 Pests of Sesame_Identification_Binomics_Dr.UPR
6.2 Pests of Sesame_Identification_Binomics_Dr.UPRPirithiRaju
 
FBI Profiling - Forensic Psychology.pptx
FBI Profiling - Forensic Psychology.pptxFBI Profiling - Forensic Psychology.pptx
FBI Profiling - Forensic Psychology.pptxPayal Shrivastava
 
Charateristics of the Angara-A5 spacecraft launched from the Vostochny Cosmod...
Charateristics of the Angara-A5 spacecraft launched from the Vostochny Cosmod...Charateristics of the Angara-A5 spacecraft launched from the Vostochny Cosmod...
Charateristics of the Angara-A5 spacecraft launched from the Vostochny Cosmod...Christina Parmionova
 
GLYCOSIDES Classification Of GLYCOSIDES Chemical Tests Glycosides
GLYCOSIDES Classification Of GLYCOSIDES  Chemical Tests GlycosidesGLYCOSIDES Classification Of GLYCOSIDES  Chemical Tests Glycosides
GLYCOSIDES Classification Of GLYCOSIDES Chemical Tests GlycosidesNandakishor Bhaurao Deshmukh
 
Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...
Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...
Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...D. B. S. College Kanpur
 
whole genome sequencing new and its types including shortgun and clone by clone
whole genome sequencing new  and its types including shortgun and clone by clonewhole genome sequencing new  and its types including shortgun and clone by clone
whole genome sequencing new and its types including shortgun and clone by clonechaudhary charan shingh university
 
DECOMPOSITION PATHWAYS of TM-alkyl complexes.pdf
DECOMPOSITION PATHWAYS of TM-alkyl complexes.pdfDECOMPOSITION PATHWAYS of TM-alkyl complexes.pdf
DECOMPOSITION PATHWAYS of TM-alkyl complexes.pdfDivyaK787011
 
Replisome-Cohesin Interfacing A Molecular Perspective.pdf
Replisome-Cohesin Interfacing A Molecular Perspective.pdfReplisome-Cohesin Interfacing A Molecular Perspective.pdf
Replisome-Cohesin Interfacing A Molecular Perspective.pdfAtiaGohar1
 

Recently uploaded (20)

final waves properties grade 7 - third quarter
final waves properties grade 7 - third quarterfinal waves properties grade 7 - third quarter
final waves properties grade 7 - third quarter
 
well logging & petrophysical analysis.pptx
well logging & petrophysical analysis.pptxwell logging & petrophysical analysis.pptx
well logging & petrophysical analysis.pptx
 
GENERAL PHYSICS 2 REFRACTION OF LIGHT SENIOR HIGH SCHOOL GENPHYS2.pptx
GENERAL PHYSICS 2 REFRACTION OF LIGHT SENIOR HIGH SCHOOL GENPHYS2.pptxGENERAL PHYSICS 2 REFRACTION OF LIGHT SENIOR HIGH SCHOOL GENPHYS2.pptx
GENERAL PHYSICS 2 REFRACTION OF LIGHT SENIOR HIGH SCHOOL GENPHYS2.pptx
 
DNA isolation molecular biology practical.pptx
DNA isolation molecular biology practical.pptxDNA isolation molecular biology practical.pptx
DNA isolation molecular biology practical.pptx
 
Combining Asynchronous Task Parallelism and Intel SGX for Secure Deep Learning
Combining Asynchronous Task Parallelism and Intel SGX for Secure Deep LearningCombining Asynchronous Task Parallelism and Intel SGX for Secure Deep Learning
Combining Asynchronous Task Parallelism and Intel SGX for Secure Deep Learning
 
Explainable AI for distinguishing future climate change scenarios
Explainable AI for distinguishing future climate change scenariosExplainable AI for distinguishing future climate change scenarios
Explainable AI for distinguishing future climate change scenarios
 
Observational constraints on mergers creating magnetism in massive stars
Observational constraints on mergers creating magnetism in massive starsObservational constraints on mergers creating magnetism in massive stars
Observational constraints on mergers creating magnetism in massive stars
 
KDIGO-2023-CKD-Guideline-Public-Review-Draft_5-July-2023.pdf
KDIGO-2023-CKD-Guideline-Public-Review-Draft_5-July-2023.pdfKDIGO-2023-CKD-Guideline-Public-Review-Draft_5-July-2023.pdf
KDIGO-2023-CKD-Guideline-Public-Review-Draft_5-July-2023.pdf
 
Environmental Acoustics- Speech interference level, acoustics calibrator.pptx
Environmental Acoustics- Speech interference level, acoustics calibrator.pptxEnvironmental Acoustics- Speech interference level, acoustics calibrator.pptx
Environmental Acoustics- Speech interference level, acoustics calibrator.pptx
 
Environmental acoustics- noise criteria.pptx
Environmental acoustics- noise criteria.pptxEnvironmental acoustics- noise criteria.pptx
Environmental acoustics- noise criteria.pptx
 
6.2 Pests of Sesame_Identification_Binomics_Dr.UPR
6.2 Pests of Sesame_Identification_Binomics_Dr.UPR6.2 Pests of Sesame_Identification_Binomics_Dr.UPR
6.2 Pests of Sesame_Identification_Binomics_Dr.UPR
 
Interferons.pptx.
Interferons.pptx.Interferons.pptx.
Interferons.pptx.
 
AZOTOBACTER AS BIOFERILIZER.PPTX
AZOTOBACTER AS BIOFERILIZER.PPTXAZOTOBACTER AS BIOFERILIZER.PPTX
AZOTOBACTER AS BIOFERILIZER.PPTX
 
FBI Profiling - Forensic Psychology.pptx
FBI Profiling - Forensic Psychology.pptxFBI Profiling - Forensic Psychology.pptx
FBI Profiling - Forensic Psychology.pptx
 
Charateristics of the Angara-A5 spacecraft launched from the Vostochny Cosmod...
Charateristics of the Angara-A5 spacecraft launched from the Vostochny Cosmod...Charateristics of the Angara-A5 spacecraft launched from the Vostochny Cosmod...
Charateristics of the Angara-A5 spacecraft launched from the Vostochny Cosmod...
 
GLYCOSIDES Classification Of GLYCOSIDES Chemical Tests Glycosides
GLYCOSIDES Classification Of GLYCOSIDES  Chemical Tests GlycosidesGLYCOSIDES Classification Of GLYCOSIDES  Chemical Tests Glycosides
GLYCOSIDES Classification Of GLYCOSIDES Chemical Tests Glycosides
 
Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...
Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...
Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...
 
whole genome sequencing new and its types including shortgun and clone by clone
whole genome sequencing new  and its types including shortgun and clone by clonewhole genome sequencing new  and its types including shortgun and clone by clone
whole genome sequencing new and its types including shortgun and clone by clone
 
DECOMPOSITION PATHWAYS of TM-alkyl complexes.pdf
DECOMPOSITION PATHWAYS of TM-alkyl complexes.pdfDECOMPOSITION PATHWAYS of TM-alkyl complexes.pdf
DECOMPOSITION PATHWAYS of TM-alkyl complexes.pdf
 
Replisome-Cohesin Interfacing A Molecular Perspective.pdf
Replisome-Cohesin Interfacing A Molecular Perspective.pdfReplisome-Cohesin Interfacing A Molecular Perspective.pdf
Replisome-Cohesin Interfacing A Molecular Perspective.pdf
 

Search and Hyperlinking Overview @MediaEval2014

  • 1. Search and Hyperlinking 2014 Overview Maria Eskevich, Robin Aly, David Nicolás Racca Roeland Ordelman, Shu Chen, Gareth J.F. Jones
  • 2. Find what you were (not) looking for Search & Explore
  • 4. Users Main group User Target Researchers & Educators Journalists Research Academic researchers & students Investigate Academic educators Educate Public users Citizens Entertainment, Infotainment Media Professionals Broadcast Professionals Reuse Media Archivists Annotate
  • 7. 1998 2002 2008 2010 2013 2015 DATA BIG DATA? not representative representative
  • 8. Search & Hyperlinking task • User oriented: aim to explore the needs of real users expressed as queries. – How: UK citizens and crowd sourcing for retrieval assessment • Temporal aspect: seek to direct users to the relevant parts of retrieved video (“jump-in point”). – How: segmentation, segment overlap, transcripts. prosodic, visual (low-level, high-level; keyframes) • Multimodal: want to investigate technologies for addressing variety in user needs and expectations – varied visual and audio contributions, intentional gap between query and multimodal descriptors in content
  • 9. ME Search & Hyperlinking task in development: 2012 – 2014 Search Hyperlinking 2012 2013 2014 2012 2013 2014 Dataset BlipTv BBC BlipTv BBC Features released:  Transcripts 2 ASR 3 ASR 2 ASR 3 ASR  Prosodic features no yes no yes  Visual clues for queries yes no no  Concept detection yes yes Type of the task Known-item Ad-hoc Ad-hoc Query/Anchors creation PC iPad PC iPad Number of queries/anchors 30/30 4/50 50/30 30/30 11/ 98/30 Relevance assessment MTurk users (BBC) MTurk MTurk Numbers of assessed cases 30 50 9 900 3 517 9 975 13 141 Evaluation metrics MRR, MASP, MASDWP MAP(- bin/tol), MAP MAP(-bin/tol), P@5/10
  • 10. Dataset: Video collection • BBC copyright cleared broadcast material: – Videos: • Development set: 6 weeks between 01.04.2008 and 11.05.2008 (1335 hours/2323 videos) • Test set: 11 weeks between 12.05.2008 and 31.07.2008 (2686 hours, 3528 videos) – Manually transcribed subtitles – Metadata • Additional data: – ASR: LIMSI/Vocapia, LIUM, NST-Sheffield – Shot boundaries, keyframes – Output of visual concept detectors by University of Leuven, and University of Oxford
  • 11. Dataset: Query • 28 Users - Policemen, Hair dresser, Bouncer, Sales manger, Student, Self-employed • Two hour session on iPads: – Search the archive (document level) – Define clips (segment level) – Define anchors (anchor level) Statement of Information Need Search Refine Relevant Clips Define Anchors
  • 12. Data cleaning: Usable Information Need • Description clearly specifies what is relevant • A query with a suitable title exists • Sufficient relevant segments exist (try query)
  • 13. Data cleaning: Process • For each information need in batch 1. check if usable 2. If in doubt use search to search for relevant data 3. reword & spellcheck description 4. select the first suitable query 5. Save
  • 14. Data cleaning: Usable Anchor • Longer than 5 seconds • Destination description clearly identifies the material the user wants to see when he would activate the anchor described by label • It is likely that there are some relevant items in the collection
  • 15. Data cleaning: Process • For each information need in assigned batch – Go through anchors • check if usable • reword & spellcheck description • Assess whether it is like to find links in the collection (possibly using search) – Save
  • 16. Dataset: outcome (1/2) • 30 queries <top> <queryId>query_6</queryId> <refId>53b3cf9d42b47e4c32545510</refId> <queryText>saturday kitchen cocktails</queryText> </top> <top> <queryId>query_1</queryId> <refId>53b3c64b42b47e4a362be4ce</refId> <queryText>sightseeing london</queryText> </top>
  • 17. Dataset: outcome (2/2) • 30 anchors: <anchor> <anchorId>anchor_1</anchorId> <refId>53b3c46f42b47e459265d06f</refId> <startTime>16.38</startTime> <endTime>17.35</endTime> <fileName>v20080629_184000_bbctwo_killer_wh ales_in_the</fileName> </anchor>
  • 18. Ground truth creation • Queries/Anchors: user studies at BBC: - 28 users with following profile:  Age: 18-30 years old  Use of search engines and services on iPads on the daily basis • Relevance assessment: via crowdsourcing on Amazon MTurk platform: – Top 10 results from 58 search and 62 hyperlinking submissions – 1 judgment per query or anchor that was accepted/rejected based on an automated algorithm, special cases of users typos checked manually – Number of evaluated HITs: 9 900 for search, and 13 141 for hyperlinking
  • 19. • P@5/10/20 • MAP based: Evaluation metrics • MAP: taking into account any overlapping segment: • MAP-bin: relevant segments are binned for relevance: • MAP-tol: only start times of the segments are considered:
  • 21. Results: Search sub-task: MAP 18 16 14 12 10 8 6 4 2 0 LIMSI/Vocapia Manual No ASR NST/Sheffield LIUM
  • 22. Results: Search sub-task: MAP_bin 0.45 0.4 0.35 0.3 0.25 0.2 0.15 0.1 0.05 0 LIMSI/Vocapia Manual No ASR NST/Sheffield LIUM
  • 23. Results: Search sub-task: MAP_tol 0.35 0.3 0.25 0.2 0.15 0.1 0.05 0 LIMSI/Vocapia Manual No ASR NST/Sheffield LIUM
  • 24. Results: Hyperlinking sub-task: MAP 0 0.5 1 1.5 2 2.5 3 3.5 4 4.5 CUNI_F_M_NoOverlapAu… CUNI_F_M_NoOverlapKSI… CUNI_F_M_NoOverlapKSI… CUNI_F_M_NoOverlapNo… CUNI_F_M_OverlapKSIWe… CUNI_F_N_NoOverlapAud… CUNI_F_N_NoOverlapKSI… CUNI_F_N_NoOverlapNo… CUNI_O_M_NoOverlapKSI… DCLab_Sh_N_Concept2 DCLab_Sh_N_ConceptEnri… IRISAKUL_Ss_N_HTM IRISAKUL_Ss_N_NGRAM IRISAKUL_Ss_N_TM1 IRISAKUL_Ss_N_TM2 IRISAKUL_Ss_O_NGRAMN… JRS_F_MV_ATextVisR JRS_F_MV_AwConcept JRS_F_MV_CTextVisR JRS_F_MV_CwConcept JRS_F_M_AText JRS_F_M_CText JRS_F_V_AcOnly JRS_F_V_CcOnly LINKEDTV2014_O_O_K LINKEDTV2014_O_VO_KC7S LINKEDTV2014_O_VO_KC… LINKEDTV2014_Ss_N_ALL LINKEDTV2014_Ss_N_TEXT LIMSI/Vocapia Manual No ASR NST/Sheffield LIUM
  • 25. Results: Hyperlinking sub-task: MAP_bin 0 0.05 0.1 0.15 0.2 0.25 0.3 0.35 CUNI_F_M_NoOverlapA… CUNI_F_M_NoOverlapK… CUNI_F_M_NoOverlapK… CUNI_F_M_NoOverlapN… CUNI_F_M_OverlapKSI… CUNI_F_N_NoOverlapAu… CUNI_F_N_NoOverlapKS… CUNI_F_N_NoOverlapNo… CUNI_O_M_NoOverlapK… DCLab_Sh_N_Concept2 DCLab_Sh_N_ConceptEn… IRISAKUL_Ss_N_HTM IRISAKUL_Ss_N_NGRAM IRISAKUL_Ss_N_TM1 IRISAKUL_Ss_N_TM2 IRISAKUL_Ss_O_NGRAM… JRS_F_MV_ATextVisR JRS_F_MV_AwConcept JRS_F_MV_CTextVisR JRS_F_MV_CwConcept JRS_F_M_AText JRS_F_M_CText JRS_F_V_AcOnly JRS_F_V_CcOnly LINKEDTV2014_O_O_K LINKEDTV2014_O_VO_K… LINKEDTV2014_O_VO_K… LINKEDTV2014_Ss_N_ALL LINKEDTV2014_Ss_N_TE… LIMSI/Vocapia Manual No ASR NST/Sheffield LIUM
  • 26. Results: Hyperlinking sub-task: MAP_tol 0 0.05 0.1 0.15 0.2 0.25 0.3 CUNI_F_M_NoOverlapA… CUNI_F_M_NoOverlapKS… CUNI_F_M_NoOverlapKS… CUNI_F_M_NoOverlapN… CUNI_F_M_OverlapKSIW… CUNI_F_N_NoOverlapAu… CUNI_F_N_NoOverlapKS… CUNI_F_N_NoOverlapNo… CUNI_O_M_NoOverlapK… DCLab_Sh_N_Concept2 DCLab_Sh_N_ConceptEn… IRISAKUL_Ss_N_HTM IRISAKUL_Ss_N_NGRAM IRISAKUL_Ss_N_TM1 IRISAKUL_Ss_N_TM2 IRISAKUL_Ss_O_NGRAM… JRS_F_MV_ATextVisR JRS_F_MV_AwConcept JRS_F_MV_CTextVisR JRS_F_MV_CwConcept JRS_F_M_AText JRS_F_M_CText JRS_F_V_AcOnly JRS_F_V_CcOnly LINKEDTV2014_O_O_K LINKEDTV2014_O_VO_K… LINKEDTV2014_O_VO_K… LINKEDTV2014_Ss_N_ALL LINKEDTV2014_Ss_N_TEXT LIMSI/Vocapia Manual No ASR NST/Sheffield LIUM
  • 27. Lessons learned 1. iPad vs PC = different user behaviour and expectation from the system. 2. Prosodic features broaden the scope of the search sub-task. 3. Use of shot segmentation based units achieves the worst scores for both sub-tasks. 4. Use of metadata improves results for both sub-tasks.
  • 28. Lessons learned 1. iPad vs PC = different user behaviour and expectation from the system. 2. Prosodic features broaden the scope of the search sub-task. 3. Use of shot segmentation based units achieves the worst scores for both sub-tasks. 4. Use of metadata improves results for both sub-tasks.
  • 29. The Search and Hyperlinking task was supported by We are grateful to Jana Eggink and Andy O'Dwyer from the BBC for preparing the collection and hosting the user trials. ... and of course Martha for advise & crowdsourcing access.
  • 30. JRS at Search and Hyperlinking of Television Content Task Werner Bailer, Harald Stiegler MediaEval Workshop, Barcelona, Oct. 2014
  • 31. Linking sub-task • Matching terms from textual resources • Reranking based on visual similarity (VLAT) • Using visual concepts (only/in addition) • Results – Differences between different text resources – Context helped only in few of the cases – Visual reranking provides small improvement – Visual concepts did not provide improvements 35
  • 32. Zsombor Paróczi, Bálint Fodor, Gábor Szűcs Solution with concept enrichment • Concept enrichment: the set of words is extended with their synonyms or other conceptually connected words. • Top 10 vs top 50 conceptually connected words for each word • Conclusion: the results show that concept enrichment with less words give better precision because at the opposite case the noise is greater.
  • 33. Television Linked To The Web LinkedTV @ MediaEval 2014 Search and Hyperlinking Task H.A. Le1, Q.M. Bui1, B. Huet1, B. Cervenková2, J. Bouchner2, E. Apostolidis3, F. Markatopoulou3, A. Pournaras3, V. Mezaris3, D. Stein4, S. Eickeler4, and M. Stadtschnitzer4 1 - Eurecom, Sophia Antipolis, France. 2 - University of Economics, Prague, Czech Republic. 3 - Information Technologies Institute, CERTH, Thessaloniki, Greece. 4 - Fraunhofer IAIS, Sankt Augustin, Germany. 16-17 Oct 2014 www.linkedtv.eu
  • 34. Reasons to visit the LinkedTV poster • Different granularities: video level, scene level (visual/topic) and sentence level. • Different features: text (subtitles / transcripts), visual concepts, keywords, etc… LinkedTV @ MediaEval 2014 Search and Hyperlinking Task
  • 35. Reasons to visit the LinkedTV poster • How to incorporate visual information to the search? • Visual concept detection in the search query: Mapping between query keywords and visual concepts (151 semantic concepts from TRECVID 2012) – Semantic word distance based on WordNet – Identification of salient visual concepts from Google Image search results (query keywords) LinkedTV @ MediaEval 2014 Search and Hyperlinking Task
  • 36. Reasons to visit the LinkedTV poster • How to incorporate visual information to the search? • Integration of detected visual concepts to the search: – Designing an enriched query, based on textual (text query) and visual information (range query) – Fusion of text score (Solr) and visual concepts scores LinkedTV @ MediaEval 2014 Search and Hyperlinking Task