DigiBird: on the fly collection integration using crowdsourcing

205 views

Published on

Presentation by Chris Dijkshoorn (VU University Amsterdam) on October 31st 2016 at the workshop Two Birds, One Stone: Bridging cultural heritage collections with crowds and niches.

Published in: Software
0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total views
205
On SlideShare
0
From Embeds
0
Number of Embeds
68
Actions
Shares
0
Downloads
0
Comments
0
Likes
0
Embeds 0
No embeds

No notes for slide

DigiBird: on the fly collection integration using crowdsourcing

  1. 1. Chris Dijkshoorn On the fly collection integration supported by the crowd
  2. 2. ‣ Crowdsourcing tasks are undertaken in isolation ‣ It takes time to collect data ‣ It demands continuous promotional effort ‣ It is challenging for institutions to incorporate the results of crowdsourcing into their existing infrastructure Crowdsourcing Challenges
  3. 3. valorisation project 
 May 2016 to November 2016 
 DigiBird project Chris Dijkshoorn Cristina Bucur Lora Aroyo Maarten Brinkerink Sander Pietersen Saskia Scheltjens
  4. 4. Crowdsourced collections Collections Crowdsourced metadata Sounds Artworks Images Videos
  5. 5. ‣ Every institution has its own system ‣ No visibility similar initiatives
 DigiBird solution ‣ Create a hub ‣ Provide on the fly integration ‣ Use a shared vocabulary Challenge 1: Crowdsourcing tasks are undertaken in isolation
  6. 6. Why use vocabulary terms instead of text?
  7. 7. Why use vocabulary terms instead of text? Grote trap
  8. 8. Thesauri can bridge collection IOC World Bird List ‣ 33,801 terms ‣ Structured using Simple Knowledge Organization System (SKOS) ‣ (Semi) persistent identifiers Importance shared vocabulary
  9. 9. Goals ‣ Make results available on the fly ‣ Provide insights in progress DigiBird pipeline
  10. 10. Data retrieval Request formulation Data integration Response formulation Query filter Merel Request search Merel Request parameter Turdus merula Query concept ioc:Turdus_merula DigiBird pipeline example: retrieve information about a blackbird - === - rec === dc:creator creator === dc:creator creator === dc:creator JSON result list SPARQL result list SPARQL result list Return JSON, JSON-LD, N-Quads or Turtle JSON result list
  11. 11. ‣ Crowdsourcing relies on voluntary contributions ‣ Unpredictable when people will contribute 
 How DigiBird helps ‣ Monitor progress Challenge 2: It takes time to collect data
  12. 12. ‣ Organise events ‣ Market initiatives 
 DigiBird solution ‣ Generate challenging tasks (2.0?) Challenge 3: It demands continuous promotional effort
  13. 13. ‣ Data siloes ‣ Trust in data DigiBird solutions ‣ Provide a way to directly access data ‣ Different output formats ‣ Refine and review contributions (2.0?) Challenge 4: It is challenging for institutions to incorporate the results of crowdsourcing into their existing infrastructure
  14. 14. Monitoring Species view Annotation wall Source code is available ‣ https://github.com/rasvaan/digibird_api ‣ https://github.com/rasvaan/digibird_client DigiBird website
  15. 15. ‣ Use standardised vocabularies ‣ Get persistent identifiers ‣ Document how to access your data ‣ Realise effort is required to create a mature codebase ‣ Some code does not age well How to make the life of a programmer easier

×