SlideShare a Scribd company logo
1 of 22
MM P05 automatic labeling 
term extraction 
Victor de Boer 
Josefien Schuurman 
Roeland Ordelman
Term extraction from TT888 
• Input: 
– TT888 subtitles 
• Output: 
– GTAA terms 
• Onderwerpen 
• Persoonsnamen 
• Namen 
• Geografische namen 
– For entire video 
(corresponds to 
documentalist tasks)
Planning 
• version 0.1 
– `naive baseline’ 
– Test input andoutput 
• version 0.2 
– Multiple GTAA axes 
– Improve statistics 
– Bespreking met metadatabeheer 
• version 0.3 
– More improvements 
– Evaluation 
• version 1.0 
– To be reimplemented 
http://www.recensiekoning.nl/2011/09/48928/ondertiteling
Implementation details 
• Java to make integration easier 
• XML and CSV outputs 
– URI of GTAA term 
– pref-label 
– Confidence value 
– Axis 
• Input comes from Immix OAI API, where segmentation 
should already have taken place 
– Algorithm expects one OAI identifier (Expressie or Selectie) 
• Matching with GTAA using ElasticSearch instance
version 0.1 
For every item 
1. Get TT888 words in a frequency list 
2. Discard stop words (‘de’, ‘het’, ‘op’, ‘naar’..) 
3. Take all words with freq > n 
4. Match with GTAA “Onderwerpen” with ElasticSearch score > m 
– Preflabel + altlabel 
Algorithm 
GTAA 
gtaa:002151 
“theater” 
OAI 
Stop words
version 0.1 
Informal Evaluation: 
Compare to hist labels (“Onderwerpen”) 
Works a bit (< 20% correct). Input for version 0.2 
Algorithm 
GTAA 
gtaa:002151 
“theater” 
OAI 
Stop words
version 0.2 
• Intermediate version, uses Named Entity 
Recognizer. Results discussed with Lisette and 
Vincent -> Version 0.3 
Algorithm 
GTAA 
“theater” 
“Jos Brink” 
“Amsterdam” 
OAI 
Stop words 
Named Entity 
Recognition 
Word freq NL
Named Entity Recognition 
• Webservice CLTL @ VU 
• Input: 
– “Hallo, mijn naam is Victor de Boer en ik woon in de mooie stad Haarlem. Ik werk nu bij het 
Nederlands Instituut voor Beeld en Geluid in Hilversum. Hiervoor was ik werkzaam bij de 
Vrije Universiteit. “ 
• Output: 
[ Victor de Boer | PERSON ], 
[ Haarlem | LOCATION ], 
[ Nederlands | MISC ], 
[ Instituut voor Beeld en Geluid | ORGANIZATION ], 
[ Hilversum | LOCATION ], 
[ Vrije Universiteit | ORGANIZATION ]
version 0.3 
For every item 
1. Track 1 
1. Get TT888 words in a frequency list 
2. Discard stop words (‘de’, ‘het’, ‘op’, ‘naar’..) 
3. Take all N-GRAMS with normalized frequency > n 
4. Match with GTAA “Onderwerpen” with score > m 
2. Track 2 
1. Present TT888 to Named Entity Recognizer (VU-webservice) 
2. Match result (with freq > L) with GTAA “PersoonsNamen”, “Geografische 
Namen”, “Onderwerpen”, “Namen” 
Algorithm 
GTAA 
“theater” 
“Jos Brink” 
“Amsterdam” 
OAI 
Stop words 
Named Entity 
Recognition 
Word freq NL
version 0.3 > Example output
Evaluation 
• Setup 
– 4 evaluators (Vincent, Lisette , Alma, Tim) 
• 3 in one 50 min session 
• 1 in another session 
– ~8 minutes per item 
– Video + extracted terms 
• Open Videos in IE browser 
• GTAA URIS + preflabels 
• Any other info allowed 
– Five point Likert scale 
• Only precision, no recall 
De gebruikte evaluatieschaal. 0 betekent echt 
fout (bv een verkeerd homonym) of echt niet 
relevant (verkeerd persoon). Aangezien hier 
wisselwerking optreedt kan dit niet veel verder 
uitgesplitst worden. 
0: Term is geheel niet relevant 
1: Term is niet relevant 
2: Term is een beetje relevant 
3: Term is relevant 
4: Term is zeer relevant
Evaluation
Results 
• Total of 70 terms for 13 videos (5.4 term per vid) 
– Some videos did not start-> discarded 
– 38 terms with three evaluations 
– 32 with one
Results 
eval_1 eval_2 eval_ 
3 
eval_4 Avg 
gem: F Term 2,59 1,35 2,00 2,37 2,08 
item 1 6 licht 0 0 0 0 
item 1 2 Friesland 0 0 2 0,666667 
item 3 2 soul 0 1 1 0,666667 
item 3 3 Romme, 
Gianni 
3 4 4 3,666667 
item 3 2 Somerville, 
Jimmy 
4 2 2 2,666667 
item 3 3 Harrison, 
George 
4 4 3 3,666667 
item 3 4 Clapton, Eric 4 4 2 3,333333 
item 3 2 Milwaukee 3 1 1 1,666667
Example of disagreement 
• Term “Milwaukee” 
– Top2000 a gogo 
Eval 1-> score=3 
“Term an sich niet heel relevant, maar in combinatie 
met Romme, Gianni toch waardevol. Alweer: NER 
wint aan kracht als user tijdcode meekrijgt en kan 
afspelen ter check of fragment relevant of niet is 
voor zijn zoekactie/hergebruik.” 
Eval 3-> score=1 
“twee keer genoemd, niet relevant” 
Eval 2-> score=1 
“…”
Inter-annotator agreement 
Pearson eval1 eval2 eval3 
eval1 1 
eval2 0,52 1 
eval3 0,67 0,58 1 
eval4 0.78 x 0.92 
Agreement between 3 and 4 is large 
between 1 and 4 is substantive 
between 1 and 2 , 1 and 3, 2 and 3 is lower but ok 
Task is fairly objective, but somewhat subjective 
We look mainly at averages for the rest
Results: average scores 
• Total average of 2.15 (“beetje relevant”+) 
At threshold of 2: Precision = 0.61 
At threshold of 3: Precision = 0.36 
4.5 
4 
3.5 
3 
2.5 
2 
1.5 
1 
0.5 
0 
0 0.2 0.4 0.6 0.8 1 1.2
Results per video 
item average 
item 1 0,3333333 
item 3 2,6111111 
item 5 2,4444444 
item 6 1,75 
item 8 1,4 
item 9 3,6666667 
item 10 2,4545455 
item 13 2.375 
item 14 0(!) 
item 15 1.33 
item 17 2.08 
item 19 4.00 (!) 
item 20 1.67 
• For some videos we 
shouldn’t do this 
– Nederland in Beweging 
– Metadata on Reeks-level 
“Advies: Niveau 1 programma's uitsluiten van 
trefwoordextractie, ws. ook van NER”
Results correlation freq/score 
• Correlation between frequency of term in text 
and average score 
– No correlation (?)
Evaluator remarks 
• For some videos this shouldn’t be done 
– Game shows, drama.. 
– Annotate at Reeks level 
• Some axes seem to work better then others 
– Persoonsnamen, Namen, Geografische namen 
• More abstraction or combination would be helpful 
– Semantic Clustering? 
• Subtitles with * are song lyrics 
• Still a need for time-coded terms
Conclusion and current steps 
• Limited evaluation 
• But it works (prec 0.61) 
– With some tweaks to 0.7-0.8 
• NEs lower threshold, Subjects higher 
• Better Elasticsearch matching 
– With semantic clustering to 0.8-0.9? 
• Currently re-implemented by Arjen as a proper 
service 
• Re-use for annotating program guides
A huge thanks to the annotators for their valuable effort!! 
Questions? 
antwoordnu.nl

More Related Content

Viewers also liked (8)

Adrian
AdrianAdrian
Adrian
 
Deboer ld4 iati_knowescape
Deboer ld4 iati_knowescapeDeboer ld4 iati_knowescape
Deboer ld4 iati_knowescape
 
Planning For Content
Planning For ContentPlanning For Content
Planning For Content
 
Ld4d 2013 part 3
Ld4d 2013 part 3Ld4d 2013 part 3
Ld4d 2013 part 3
 
ESTUDO EM GRUPO NA FORMAÇÃO INICIAL COM ÊNFASE NA PRÁXIS COTIDIANA DA EDUCAÇ...
ESTUDO EM GRUPO NA FORMAÇÃO INICIAL COM ÊNFASE NA PRÁXIS  COTIDIANA DA EDUCAÇ...ESTUDO EM GRUPO NA FORMAÇÃO INICIAL COM ÊNFASE NA PRÁXIS  COTIDIANA DA EDUCAÇ...
ESTUDO EM GRUPO NA FORMAÇÃO INICIAL COM ÊNFASE NA PRÁXIS COTIDIANA DA EDUCAÇ...
 
My unfunded projects WAI talk
My unfunded projects WAI talkMy unfunded projects WAI talk
My unfunded projects WAI talk
 
Prograd calendario academico-2013_v2[1]
Prograd calendario academico-2013_v2[1]Prograd calendario academico-2013_v2[1]
Prograd calendario academico-2013_v2[1]
 
LLTC Agindaasowigamig
LLTC AgindaasowigamigLLTC Agindaasowigamig
LLTC Agindaasowigamig
 

Similar to BenG Update on automatic labelling

Unit Testing Best Practices
Unit Testing Best PracticesUnit Testing Best Practices
Unit Testing Best Practices
Tomaš Maconko
 
Babysitting your orm essenmacher, adam
Babysitting your orm   essenmacher, adamBabysitting your orm   essenmacher, adam
Babysitting your orm essenmacher, adam
Adam Essenmacher
 
ZcCsXvjIRFKcqd2Yzt4d_Shallahamer-MLPoorPerf-3d.pdf
ZcCsXvjIRFKcqd2Yzt4d_Shallahamer-MLPoorPerf-3d.pdfZcCsXvjIRFKcqd2Yzt4d_Shallahamer-MLPoorPerf-3d.pdf
ZcCsXvjIRFKcqd2Yzt4d_Shallahamer-MLPoorPerf-3d.pdf
cookie1969
 

Similar to BenG Update on automatic labelling (20)

The Evolution of Development Testing
The Evolution of Development TestingThe Evolution of Development Testing
The Evolution of Development Testing
 
Test Driven Development
Test Driven DevelopmentTest Driven Development
Test Driven Development
 
KorraAI - a probabilistic virtual agent framework
KorraAI - a probabilistic virtual agent frameworkKorraAI - a probabilistic virtual agent framework
KorraAI - a probabilistic virtual agent framework
 
Unit Testing Best Practices
Unit Testing Best PracticesUnit Testing Best Practices
Unit Testing Best Practices
 
OpenAI GPT in Depth - Questions and Misconceptions
OpenAI GPT in Depth - Questions and MisconceptionsOpenAI GPT in Depth - Questions and Misconceptions
OpenAI GPT in Depth - Questions and Misconceptions
 
Adopting tdd in the workplace
Adopting tdd in the workplaceAdopting tdd in the workplace
Adopting tdd in the workplace
 
Adopting tdd in the workplace
Adopting tdd in the workplaceAdopting tdd in the workplace
Adopting tdd in the workplace
 
Scaling Security Threat Detection with Apache Spark and Databricks
Scaling Security Threat Detection with Apache Spark and DatabricksScaling Security Threat Detection with Apache Spark and Databricks
Scaling Security Threat Detection with Apache Spark and Databricks
 
Talk-Foutse-SrangeLoop.pdf
Talk-Foutse-SrangeLoop.pdfTalk-Foutse-SrangeLoop.pdf
Talk-Foutse-SrangeLoop.pdf
 
Awesome Test Automation Made Simple w/ Dave Haeffner
Awesome Test Automation Made Simple w/ Dave HaeffnerAwesome Test Automation Made Simple w/ Dave Haeffner
Awesome Test Automation Made Simple w/ Dave Haeffner
 
Solving the 3 Biggest Questions in Continuous Testing
Solving the 3 Biggest Questions in Continuous TestingSolving the 3 Biggest Questions in Continuous Testing
Solving the 3 Biggest Questions in Continuous Testing
 
How do OpenAI GPT Models Work - Misconceptions and Tips for Developers
How do OpenAI GPT Models Work - Misconceptions and Tips for DevelopersHow do OpenAI GPT Models Work - Misconceptions and Tips for Developers
How do OpenAI GPT Models Work - Misconceptions and Tips for Developers
 
MRT 2018: reflecting on the past and the present with temporal graph models
MRT 2018: reflecting on the past and the present with temporal graph modelsMRT 2018: reflecting on the past and the present with temporal graph models
MRT 2018: reflecting on the past and the present with temporal graph models
 
Seeding a Tree in a Gherkin
Seeding a Tree in a GherkinSeeding a Tree in a Gherkin
Seeding a Tree in a Gherkin
 
Data Migration Lessons from the Trenches
Data Migration Lessons from the TrenchesData Migration Lessons from the Trenches
Data Migration Lessons from the Trenches
 
Babysitting your orm essenmacher, adam
Babysitting your orm   essenmacher, adamBabysitting your orm   essenmacher, adam
Babysitting your orm essenmacher, adam
 
Can we induce change with what we measure?
Can we induce change with what we measure?Can we induce change with what we measure?
Can we induce change with what we measure?
 
Ost math
Ost mathOst math
Ost math
 
ZcCsXvjIRFKcqd2Yzt4d_Shallahamer-MLPoorPerf-3d.pdf
ZcCsXvjIRFKcqd2Yzt4d_Shallahamer-MLPoorPerf-3d.pdfZcCsXvjIRFKcqd2Yzt4d_Shallahamer-MLPoorPerf-3d.pdf
ZcCsXvjIRFKcqd2Yzt4d_Shallahamer-MLPoorPerf-3d.pdf
 
Deep learning to the rescue - solving long standing problems of recommender ...
Deep learning to the rescue - solving long standing problems of recommender ...Deep learning to the rescue - solving long standing problems of recommender ...
Deep learning to the rescue - solving long standing problems of recommender ...
 

More from Victor de Boer

More from Victor de Boer (20)

One day workshop Linked Data and Semantic Web
One day workshop Linked Data and Semantic WebOne day workshop Linked Data and Semantic Web
One day workshop Linked Data and Semantic Web
 
Linked Data for Digital Humanities research at Media Archives
Linked Data for Digital Humanities research at Media ArchivesLinked Data for Digital Humanities research at Media Archives
Linked Data for Digital Humanities research at Media Archives
 
The Benefits of Linking Metadata for Internal and External users of an Audiov...
The Benefits of Linking Metadata for Internal and External users of an Audiov...The Benefits of Linking Metadata for Internal and External users of an Audiov...
The Benefits of Linking Metadata for Internal and External users of an Audiov...
 
UX Challenges of Information Organisation: Assessment of Language Impairment ...
UX Challenges of Information Organisation: Assessment of Language Impairment ...UX Challenges of Information Organisation: Assessment of Language Impairment ...
UX Challenges of Information Organisation: Assessment of Language Impairment ...
 
Interactive Dance Choreography Assistance presentation for ACE entertainment ...
Interactive Dance Choreography Assistance presentation for ACE entertainment ...Interactive Dance Choreography Assistance presentation for ACE entertainment ...
Interactive Dance Choreography Assistance presentation for ACE entertainment ...
 
Fahad Ali's slides for Machine to-machine communication in rural conditions ...
Fahad Ali's slides for Machine to-machine communication in rural conditions  ...Fahad Ali's slides for Machine to-machine communication in rural conditions  ...
Fahad Ali's slides for Machine to-machine communication in rural conditions ...
 
Linking African Traditional Medicine Knowledge - by Gossa Lo
Linking African Traditional Medicine Knowledge - by Gossa LoLinking African Traditional Medicine Knowledge - by Gossa Lo
Linking African Traditional Medicine Knowledge - by Gossa Lo
 
Enriching Media Collections for Event-based Exploration
Enriching Media Collections for Event-based ExplorationEnriching Media Collections for Event-based Exploration
Enriching Media Collections for Event-based Exploration
 
New Life for Old Media (NEM presentation)
New Life for Old Media  (NEM presentation)New Life for Old Media  (NEM presentation)
New Life for Old Media (NEM presentation)
 
User-centered Data Science for Digital Humanities
User-centered Data Science for Digital HumanitiesUser-centered Data Science for Digital Humanities
User-centered Data Science for Digital Humanities
 
Linked Data for Audiovisual Archives (Guest lecture at NISV)
Linked Data for Audiovisual Archives (Guest lecture at NISV)Linked Data for Audiovisual Archives (Guest lecture at NISV)
Linked Data for Audiovisual Archives (Guest lecture at NISV)
 
Semantic Technology for Development: Semantic Web without the Web?
Semantic Technology for Development: Semantic Web without the Web?Semantic Technology for Development: Semantic Web without the Web?
Semantic Technology for Development: Semantic Web without the Web?
 
DIVE+ and Events at EVENTS2017
DIVE+ and Events at EVENTS2017DIVE+ and Events at EVENTS2017
DIVE+ and Events at EVENTS2017
 
About Cultuurlink
About CultuurlinkAbout Cultuurlink
About Cultuurlink
 
Intro to Linked, Dutch Ships and Sailors and SPARQL handson
Intro to Linked, Dutch Ships and Sailors and SPARQL handson Intro to Linked, Dutch Ships and Sailors and SPARQL handson
Intro to Linked, Dutch Ships and Sailors and SPARQL handson
 
Kasadaka and ICT4D at VU
Kasadaka and ICT4D at VUKasadaka and ICT4D at VU
Kasadaka and ICT4D at VU
 
VU ICT4D symposium 2017 Francis Dittoh Mr. Meteo
VU ICT4D symposium 2017 Francis Dittoh  Mr. MeteoVU ICT4D symposium 2017 Francis Dittoh  Mr. Meteo
VU ICT4D symposium 2017 Francis Dittoh Mr. Meteo
 
VU ICT4D symposium 2017 Chris van Aart
VU ICT4D symposium 2017 Chris van AartVU ICT4D symposium 2017 Chris van Aart
VU ICT4D symposium 2017 Chris van Aart
 
VU ICT4D symposium 2017 Gayo Diallo Towards a Digital African Traditional Hea...
VU ICT4D symposium 2017 Gayo Diallo Towards a Digital African Traditional Hea...VU ICT4D symposium 2017 Gayo Diallo Towards a Digital African Traditional Hea...
VU ICT4D symposium 2017 Gayo Diallo Towards a Digital African Traditional Hea...
 
VU ICT4D symposium 2017 Wendelien Tuyp: Boosting african agriculture
VU ICT4D symposium 2017 Wendelien Tuyp: Boosting african agriculture VU ICT4D symposium 2017 Wendelien Tuyp: Boosting african agriculture
VU ICT4D symposium 2017 Wendelien Tuyp: Boosting african agriculture
 

Recently uploaded

1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
QucHHunhnh
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
ciinovamais
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
kauryashika82
 
Making and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdfMaking and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdf
Chris Hunter
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
heathfieldcps1
 
Seal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxSeal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptx
negromaestrong
 
An Overview of Mutual Funds Bcom Project.pdf
An Overview of Mutual Funds Bcom Project.pdfAn Overview of Mutual Funds Bcom Project.pdf
An Overview of Mutual Funds Bcom Project.pdf
SanaAli374401
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
QucHHunhnh
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
PECB
 

Recently uploaded (20)

Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impact
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 
Advance Mobile Application Development class 07
Advance Mobile Application Development class 07Advance Mobile Application Development class 07
Advance Mobile Application Development class 07
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
 
Making and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdfMaking and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdf
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 
Seal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxSeal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptx
 
An Overview of Mutual Funds Bcom Project.pdf
An Overview of Mutual Funds Bcom Project.pdfAn Overview of Mutual Funds Bcom Project.pdf
An Overview of Mutual Funds Bcom Project.pdf
 
Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptx
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across Sectors
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot Graph
 

BenG Update on automatic labelling

  • 1. MM P05 automatic labeling term extraction Victor de Boer Josefien Schuurman Roeland Ordelman
  • 2. Term extraction from TT888 • Input: – TT888 subtitles • Output: – GTAA terms • Onderwerpen • Persoonsnamen • Namen • Geografische namen – For entire video (corresponds to documentalist tasks)
  • 3. Planning • version 0.1 – `naive baseline’ – Test input andoutput • version 0.2 – Multiple GTAA axes – Improve statistics – Bespreking met metadatabeheer • version 0.3 – More improvements – Evaluation • version 1.0 – To be reimplemented http://www.recensiekoning.nl/2011/09/48928/ondertiteling
  • 4. Implementation details • Java to make integration easier • XML and CSV outputs – URI of GTAA term – pref-label – Confidence value – Axis • Input comes from Immix OAI API, where segmentation should already have taken place – Algorithm expects one OAI identifier (Expressie or Selectie) • Matching with GTAA using ElasticSearch instance
  • 5. version 0.1 For every item 1. Get TT888 words in a frequency list 2. Discard stop words (‘de’, ‘het’, ‘op’, ‘naar’..) 3. Take all words with freq > n 4. Match with GTAA “Onderwerpen” with ElasticSearch score > m – Preflabel + altlabel Algorithm GTAA gtaa:002151 “theater” OAI Stop words
  • 6. version 0.1 Informal Evaluation: Compare to hist labels (“Onderwerpen”) Works a bit (< 20% correct). Input for version 0.2 Algorithm GTAA gtaa:002151 “theater” OAI Stop words
  • 7. version 0.2 • Intermediate version, uses Named Entity Recognizer. Results discussed with Lisette and Vincent -> Version 0.3 Algorithm GTAA “theater” “Jos Brink” “Amsterdam” OAI Stop words Named Entity Recognition Word freq NL
  • 8. Named Entity Recognition • Webservice CLTL @ VU • Input: – “Hallo, mijn naam is Victor de Boer en ik woon in de mooie stad Haarlem. Ik werk nu bij het Nederlands Instituut voor Beeld en Geluid in Hilversum. Hiervoor was ik werkzaam bij de Vrije Universiteit. “ • Output: [ Victor de Boer | PERSON ], [ Haarlem | LOCATION ], [ Nederlands | MISC ], [ Instituut voor Beeld en Geluid | ORGANIZATION ], [ Hilversum | LOCATION ], [ Vrije Universiteit | ORGANIZATION ]
  • 9. version 0.3 For every item 1. Track 1 1. Get TT888 words in a frequency list 2. Discard stop words (‘de’, ‘het’, ‘op’, ‘naar’..) 3. Take all N-GRAMS with normalized frequency > n 4. Match with GTAA “Onderwerpen” with score > m 2. Track 2 1. Present TT888 to Named Entity Recognizer (VU-webservice) 2. Match result (with freq > L) with GTAA “PersoonsNamen”, “Geografische Namen”, “Onderwerpen”, “Namen” Algorithm GTAA “theater” “Jos Brink” “Amsterdam” OAI Stop words Named Entity Recognition Word freq NL
  • 10. version 0.3 > Example output
  • 11. Evaluation • Setup – 4 evaluators (Vincent, Lisette , Alma, Tim) • 3 in one 50 min session • 1 in another session – ~8 minutes per item – Video + extracted terms • Open Videos in IE browser • GTAA URIS + preflabels • Any other info allowed – Five point Likert scale • Only precision, no recall De gebruikte evaluatieschaal. 0 betekent echt fout (bv een verkeerd homonym) of echt niet relevant (verkeerd persoon). Aangezien hier wisselwerking optreedt kan dit niet veel verder uitgesplitst worden. 0: Term is geheel niet relevant 1: Term is niet relevant 2: Term is een beetje relevant 3: Term is relevant 4: Term is zeer relevant
  • 13. Results • Total of 70 terms for 13 videos (5.4 term per vid) – Some videos did not start-> discarded – 38 terms with three evaluations – 32 with one
  • 14. Results eval_1 eval_2 eval_ 3 eval_4 Avg gem: F Term 2,59 1,35 2,00 2,37 2,08 item 1 6 licht 0 0 0 0 item 1 2 Friesland 0 0 2 0,666667 item 3 2 soul 0 1 1 0,666667 item 3 3 Romme, Gianni 3 4 4 3,666667 item 3 2 Somerville, Jimmy 4 2 2 2,666667 item 3 3 Harrison, George 4 4 3 3,666667 item 3 4 Clapton, Eric 4 4 2 3,333333 item 3 2 Milwaukee 3 1 1 1,666667
  • 15. Example of disagreement • Term “Milwaukee” – Top2000 a gogo Eval 1-> score=3 “Term an sich niet heel relevant, maar in combinatie met Romme, Gianni toch waardevol. Alweer: NER wint aan kracht als user tijdcode meekrijgt en kan afspelen ter check of fragment relevant of niet is voor zijn zoekactie/hergebruik.” Eval 3-> score=1 “twee keer genoemd, niet relevant” Eval 2-> score=1 “…”
  • 16. Inter-annotator agreement Pearson eval1 eval2 eval3 eval1 1 eval2 0,52 1 eval3 0,67 0,58 1 eval4 0.78 x 0.92 Agreement between 3 and 4 is large between 1 and 4 is substantive between 1 and 2 , 1 and 3, 2 and 3 is lower but ok Task is fairly objective, but somewhat subjective We look mainly at averages for the rest
  • 17. Results: average scores • Total average of 2.15 (“beetje relevant”+) At threshold of 2: Precision = 0.61 At threshold of 3: Precision = 0.36 4.5 4 3.5 3 2.5 2 1.5 1 0.5 0 0 0.2 0.4 0.6 0.8 1 1.2
  • 18. Results per video item average item 1 0,3333333 item 3 2,6111111 item 5 2,4444444 item 6 1,75 item 8 1,4 item 9 3,6666667 item 10 2,4545455 item 13 2.375 item 14 0(!) item 15 1.33 item 17 2.08 item 19 4.00 (!) item 20 1.67 • For some videos we shouldn’t do this – Nederland in Beweging – Metadata on Reeks-level “Advies: Niveau 1 programma's uitsluiten van trefwoordextractie, ws. ook van NER”
  • 19. Results correlation freq/score • Correlation between frequency of term in text and average score – No correlation (?)
  • 20. Evaluator remarks • For some videos this shouldn’t be done – Game shows, drama.. – Annotate at Reeks level • Some axes seem to work better then others – Persoonsnamen, Namen, Geografische namen • More abstraction or combination would be helpful – Semantic Clustering? • Subtitles with * are song lyrics • Still a need for time-coded terms
  • 21. Conclusion and current steps • Limited evaluation • But it works (prec 0.61) – With some tweaks to 0.7-0.8 • NEs lower threshold, Subjects higher • Better Elasticsearch matching – With semantic clustering to 0.8-0.9? • Currently re-implemented by Arjen as a proper service • Re-use for annotating program guides
  • 22. A huge thanks to the annotators for their valuable effort!! Questions? antwoordnu.nl

Editor's Notes

  1. Computational Lexicology & Terminology Lab