Crowdsourcing metadata for audiovisual collections


Crowdsourcing metadata for audiovisual collections: from free text tags to semantic concepts

7 December 2011 | DISH | Rotterdam

Session: http://www.dish2011.nl/sessions/new-models-of-interaction-glams-linked-open-data-and-user-participation

Published in: Technology, Education

  1. 1. Crowdsourcing metadata for audiovisual collections: from free text tags to semantic concepts. Lotte Belice Baltussen – Sound and Vision, 7 December 2011 | DISH
  2. 2. Waisda? What’s that? Allows people to annotate audiovisual archive material in the form of a game.
  3. 3. Added value
      • Time-related metadata
      • Social tagging (bridging the semantic gap)
      • Interaction between the archive / broadcaster and the public
      • Gathering data for further research
      • Efficiency? Annotating video takes up to 5 x the length of the video
      • New business model?
  4. 4. Project partners (pilot)
      • Netherlands Institute for Sound and Vision (project management, content, research)
      • KRO (concept, content, PR)
      • VU (research within PrestoPRIME)
      • Q42 (developer)
  5. 5. Man bijt hond Woordentikkertje. After evaluation:
      • Improved interface
      • New scoring mechanisms (semantics)
      • New content
      • More feedback
  6. 6. How does it work? Players choose from ‘channels’ with different episodes
  7. 7. How does it work? Scoring: scoring as filter
      • Basic rule: players score points when their tag exactly matches the tag entered by another player within 10 seconds
      • Multiple other scoring mechanisms to create various tag incentives
  8. 8. Evaluation Martorrel
  9. 9. Generating a constant flow of traffic is a challenge! Important: partners, and publicity on external websites with relevant communities and a large number of visitors. Example FWAW, in one week:
      • Number of tags tripled to 160,000
      • Number of registered players doubled to 362
  10. 10. Outcomes
      • Stats
          • 340,551 tags added to 604 items, 42,068 unique tags
          • 39,134 pageviews, 555 registered players, 10,926 visits
          • Average playing time 6 min 45 s, 4,287 sessions
      • Matches in Waisda?
      • Matches GTAA / Cornetto
  11. 11. Evaluation av-documentalist
  12. 12. Evaluation av-documentalist
      • Tags mostly describe short fragments and are often not very specific. They don’t describe a programme as a whole.
      • BUT! This can be solved by filtering and mapping free text tags to existing vocabularies.
      • The WNW tags were the most useful and specific; content influences specificity.
      • Tags can be used in different ways and the relevance varies per user group.
      • Documentalists are excited about further development!
  13. 13. Evaluation
  14. 14. Evaluation
  15. 15. Source: Jakob Nielsen’s Alertbox, 9 October 2006
  16. 16. ‘Fun’ + Competition + Altruism + Content + Reward + … = Motivation
  17. 17. Metric                   Waisda?           Woordentikkertje
          Months                   8                 4.5
          Videos                   648               2,892
          Players                  2,435             689
          Tags – total             428,832           392,860
          Tags – unique            48,242 (11%)      43,407 (11%)
          Matches – players        156,546 (37%)     215,156 (55%)
          Matches – geo. names*    6,089 (1.4%)      23,142 (5.8%)
          Matches – persons*       107 (0.25%)       2,423 (0.6%)
          * For Waisda? we looked at unique tags, for Woordentikkertje at the total number of tags.
  18. 18. Tips and lessons learned so far
      • What are your success criteria?
      • How do you define your target users, and how do you reach them?
      • How do you motivate your target users?
      • Read existing reports and literature!
      • Keep learning and improving!
  19. 19. And beyond…
  20. 20. Future work
      • Open Source version of Waisda?
      • Crowdsourcing Olympics
      • More research into the added value of tags for retrieval (subtitle comparison, tests with various end users, more research on linking semantically rich sources to tags)
  21. 21. ...recommended sources: blogs, feeds, people
      • http://museumtwo.blogspot.com/
      • http://80gb.wordpress.com/
      • http://themuseumofthefuture.com/
      • http://www.delicious.com/RuncocoProject/
      • @ammeveleigh
      • @archivesopen
      • @digitalst
      • @microtask
      • @mia_out
      • @museweb
      • @runcoco
      • @wittylama
      This presentation is partly based on Oomen & Aroyo 2011: http://www.slideshare.net/PaulaUdondek/crowdsourcing-in-het-cultureel-erfgoed-kansen-uitdagingen
  22. 22. Thanks!
      @lottebelice / lbbaltussen@beeldengeluid.nl
      Big thank you to:
      • B&G: @johanoomen / @mbrinkerink
      • VU: @laroyo / @McHildebrand
      http://blog.waisda.nl
      http://woordentikkertje.manbijthond.nl
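
The basic scoring rule on slide 7 (a tag scores when it exactly matches a tag entered by another player within 10 seconds) can be illustrated with a minimal Python sketch. This is not the Waisda? implementation: the `TagEntry` record, the case-insensitive comparison in `normalise`, and the `points_per_match` value are all assumptions made for illustration, and the other scoring mechanisms mentioned on the slide are left out.

```python
from dataclasses import dataclass

MATCH_WINDOW_SECONDS = 10  # from slide 7: a match must occur within 10 seconds


@dataclass(frozen=True)
class TagEntry:
    """One tag typed by one player (hypothetical record layout)."""
    player: str
    tag: str
    time: float  # seconds from the start of the video


def normalise(tag: str) -> str:
    # Assumption: "exact match" ignores case and surrounding whitespace.
    return tag.strip().lower()


def matched_entries(entries, window=MATCH_WINDOW_SECONDS):
    """Return entries whose tag was also entered by a different player
    no more than `window` seconds earlier or later."""
    by_tag = {}
    for entry in entries:
        by_tag.setdefault(normalise(entry.tag), []).append(entry)

    matched = []
    for group in by_tag.values():
        for entry in group:
            if any(
                other.player != entry.player
                and abs(other.time - entry.time) <= window
                for other in group
            ):
                matched.append(entry)
    return matched


def scores(entries, points_per_match=50):
    """Total points per player; `points_per_match` is a made-up value."""
    totals = {}
    for entry in matched_entries(entries):
        totals[entry.player] = totals.get(entry.player, 0) + points_per_match
    return totals


example = [
    TagEntry("anna", "Hond", 12.0),
    TagEntry("bart", "hond", 18.5),   # within 10 s of anna's tag: both score
    TagEntry("carl", "hond", 40.0),   # too late to match anyone
    TagEntry("anna", "fiets", 30.0),  # nobody else entered this tag
]
```

With the example data, only the two "hond" entries at 12.0 and 18.5 seconds count as matches, which is the sense in which scoring acts as a filter: unmatched tags are kept apart from tags confirmed by a second player.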
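
Slide 12 suggests filtering free text tags and mapping them to existing vocabularies (slide 10 names GTAA and Cornetto). The sketch below shows one simple way this could look, assuming a frequency threshold as the filter and a plain label lookup as the mapping. The `min_count` threshold, the dictionary standing in for a vocabulary, and the `example:` identifiers are hypothetical and do not reflect the actual GTAA or Cornetto data or the project's matching method.

```python
def map_tags_to_vocabulary(tag_counts, vocabulary, min_count=2):
    """Filter tags by frequency and look the survivors up in a vocabulary.

    tag_counts: dict mapping a free text tag to how often it was entered.
    vocabulary: dict mapping a lower-cased label to a concept identifier
                (a stand-in for a real thesaurus lookup).
    Returns (mapped, unmatched): a dict tag -> concept id, and a list of
    tags that passed the filter but have no vocabulary entry.
    """
    mapped = {}
    unmatched = []
    for tag, count in tag_counts.items():
        if count < min_count:
            continue  # filtered out: entered too rarely to be trusted
        concept = vocabulary.get(tag.strip().lower())
        if concept is None:
            unmatched.append(tag)
        else:
            mapped[tag] = concept
    return mapped, unmatched


# Hypothetical data for illustration only.
tag_counts = {"Rotterdam": 5, "hond": 3, "zzz": 1, "gezellig": 4}
vocabulary = {
    "rotterdam": "example:place/rotterdam",
    "hond": "example:subject/dog",
}
mapped, unmatched = map_tags_to_vocabulary(tag_counts, vocabulary)
```

Tags that pass the filter but stay unmatched (here "gezellig") are kept in a separate list, since slide 12 notes that the relevance of tags varies per user group and such tags may still be useful for retrieval.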
