
WW2 underground newspapers on Wikipedia using DBpedia, 12-2-2016, The Hague


Presentation about my project (https://en.wikipedia.org/wiki/nl:Wikipedia:Wikiproject/Verzetskranten) describing and interlinking all 1300 Dutch resistance newspapers from WW2 on Wikipedia using DBpedia.

Given during the 6th DBpedia International Community Meeting on 12-2-2016 in The Hague (http://wiki.dbpedia.org/meetings/TheHague2016)




Published in: Government & Nonprofit


  1. 1. Dutch WW2 underground newspapers on Wikipedia 6th International DBpedia Community Meeting, 12-02-2016, The Hague Olaf Janssen, Koninklijke Bibliotheek olaf.janssen@kb.nl - @ookgezellig - slideshare.net/OlafJanssenNL CC-BY-SA
  2. 2. http://www.4en5meiamsterdam.nl/attachment/47454
  3. 3. During WW2, about 1,300 Dutch underground newspapers were issued, in every shape and form… http://www.4en5meiamsterdam.nl/attachment/47454
  4. 4. http://resolver.kb.nl/resolve?urn=ddd:010436323 http://resolver.kb.nl/resolve?urn=ddd:010442948 http://resolver.kb.nl/resolve?urn=ddd:010447825 http://resolver.kb.nl/resolve?urn=ddd:010450508 From well-known big titles (including Parool, Vrij Nederland, Trouw and de Waarheid)
  5. 5. To very small, home-made, pamphlet-like issues
  6. 6. After the war, many titles were 1) physically preserved at the NIOD (the national Institute for War, Holocaust and Genocide Studies in Amsterdam) … https://commons.wikimedia.org/wiki/File:Verzetskrant_in_archiefdozen_bij_het_NIOD.jpg - CC-BY-SA - OlafJanssen
  7. 7. http://opac-gonext.oclc.org:8180/DB=8/XMLPRS=Y/PPN?PPN=107123223 .. were 2) described in formal library catalogues Bibliographic metadata
  8. 8. .. were 3) digitized in Delpher … The Dutch national aggregator for historic full-text newspapers, books and magazines http://resolver.kb.nl/resolve?urn=ddd:010424553:mpeg21:p001 • Scans • Full-text OCR
  9. 9. .. and were 4) contextualized & interlinked 1 by 1 in a book Context
  10. 10. .. and were 4) contextualized & interlinked 1 by 1 in a book Relation Newspaper ↔ Placename: semantics, linked data
  11. 11. .. and were 4) contextualized & interlinked 1 by 1 in a book Relation Newspaper ↔ Persons: semantics, linked data
  12. 12. .. and were 4) contextualized & interlinked 1 by 1 in a book Relation Newspaper ↔ Other newspapers: semantics, linked data
  13. 13. This book has been OCRed into PDF (CC-BY-SA) http://www.niod.nl/nl/de-ondergrondse-pers-1940-1945 (PDF)
  14. 14. This book has been OCRed into PDF (CC-BY-SA) http://www.niod.nl/nl/de-ondergrondse-pers-1940-1945 (PDF) Available online (PDF, flat file) Open license (CC-BY-SA) Converted into structured, linked data Linked to KB-catalogue (metadata) and Delpher (full-text) Linked to other sources (DBpedia, VIAF, Gemeentegeschiedenis.nl, Nationaal Archief)
  15. 15. This book has been OCRed into PDF (CC-BY-SA) http://www.niod.nl/nl/de-ondergrondse-pers-1940-1945 (PDF) Available online (PDF, flat file) Open license (CC-BY-SA) Convert PDF into structured, linked data Link to KB-catalogue (metadata) and Delpher (full-text) Link people and places to external sources (VIAF, Gemeentegeschiedenis.nl, Nationaal Archief, Biografisch Portaal)
  16. 16. This book has been OCRed into PDF (CC-BY-SA) http://www.niod.nl/nl/de-ondergrondse-pers-1940-1945 (PDF) Available online (PDF, flat file) Open license (CC-BY-SA) Convert PDF into structured, linked data Link titles to KB-catalogue (metadata) and Delpher (full-text) Link people and places to external sources (VIAF, Gemeentegeschiedenis.nl, Nationaal Archief, Biografisch Portaal)
  17. 17. This book has been OCRed into PDF (CC-BY-SA) http://www.niod.nl/nl/de-ondergrondse-pers-1940-1945 (PDF) Available online (PDF, flat file) Open license (CC-BY-SA) Convert PDF into structured, linked data Link titles to KB-catalogue (metadata) and Delpher (full-text) Link titles, people and places to external sources (VIAF, Gemeentegeschiedenis.nl, Nationaal Archief, Biografisch Portaal)
  18. 18. So: a lot of information is available about these WW2 underground newspapers (and the related persons & places) … ... but the chunks of data are (largely) unconnected!
  19. 19. http://2.bp.blogspot.com/_BWzuYwiS6-I/TMgeRsFd3mI/AAAAAAAAElw/3cvgbZSPWcs/s1600/doctor+macro+judy+scared.jpg ... making discovery, understanding & research for many people harder than necessary.
  20. 20. ... making discovery, understanding & research for many people harder than necessary.
  21. 21. https://nl.wikipedia.org/wiki/Categorie:Illegale_pers_in_de_Tweede_Wereldoorlog Today, only 14 of these 1,300 newspapers are described on WP:NL
  22. 22. The Wikiproject Verzetskranten will change this! Systematically and uniformly describe & interlink all 1,300 Dutch underground newspapers from WW2 on Wikipedia: tinyurl.com/verzetskranten This automatically makes the data available for open reuse projects: Wikidata -- DBpedia -- Dataviz
  23. 23. From 14 to 1,300 titles
  24. 24. Global approach 1. Make central LOD-database 2. Build article template 3. Generate WP-article stubs -- using 1. and 2. 4. Involve WP-community to expand stubs into full WP-articles 5. Make dataset available for open reuse: Wikidata -- DBpedia -- Dataviz -- et al. This is the first time that data about underground newspapers is systematically collected and linked online!
  25. 25. LOD-database for underground newspapers Convert PDF into structured, linked data → RDF-triplestore (Virtuoso, SPARQL, Bibframe) Link titles to KB-catalogue (metadata) & Delpher (full-text) → Using PPNs (unique IDs for publications in NL) • DBpedia • VIAF • Gemeentegeschiedenis.nl • Nationaal Archief • Biografisch Portaal
  29. 29. LOD-database for underground newspapers Convert PDF into structured, linked data → RDF-triplestore (Virtuoso, SPARQL, Bibframe) Link titles to KB-catalogue (metadata) & Delpher (full-text) → Using PPNs (unique IDs for publications in NL) Link titles, people and places to external sources • DBpedia • VIAF • Gemeentegeschiedenis.nl • Nationaal Archief • Biografisch Portaal
  30. 30. LOD-database for underground newspapers Convert PDF into structured, linked data → RDF-triplestore (Virtuoso, SPARQL, Bibframe) Link titles to KB-catalogue (metadata) & Delpher (full-text) → Using PPNs (unique IDs for publications in NL) Link titles, people and places to external sources • DBpedia • VIAF • Gemeentegeschiedenis.nl • Nationaal Archief • Biografisch Portaal
  31. 31. LOD-database for underground newspapers Convert PDF into structured, linked data → RDF-triplestore (Virtuoso, SPARQL, Bibframe) Link titles to KB-catalogue (metadata) & Delpher (full-text) → Using PPNs (unique IDs for publications in NL) Link titles, people and places to external sources • DBpedia • Wikipedia • VIAF • Nationaal Archief • Biografisch Portaal
  32. 32. http://www.4en5meiamsterdam.nl/attachment/47454
  33. 33. So we have a LOD-database with data about 1,300 underground newspapers. Using an article template, we can generate 1,300 uniform and interlinked WP-stubs https://c1.staticflickr.com/9/8281/7699231918_11a7356c38_b.jpg
  34. 34. LOD-db + article template = article stub
  35. 35. https://nl.wikipedia.org/wiki/De_Geus_onder_studenten_(verzetsblad)
  36. 36. Grey = • From database • Predefined fixed strings
  37. 37. All that WP-writers need to add manually to create a full article
  38. 38. Current status
  39. 39. Global approach 1. Make central LOD-database 2. Build article template 3. Generate WP-article stubs 4. Involve WP-community to expand stubs into full WP-articles Current status
  42. 42. Global approach 1. Make central LOD-database 2. Build article template 3. Generate WP-article stubs 4. Involve WP-community to expand stubs into full WP-articles Current status This month March onwards
  43. 43. http://upload.wikimedia.org/wikipedia/commons/1/12/Planning_tank_operations,_Siege_of_Tobruk_cph.3b18203.jpg Questions? olaf.janssen@kb.nl - @ookgezellig tinyurl.com/verzetskranten
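
As a rough illustration of the LOD-database step described in the slides above (titles identified by a PPN and linked to external sources such as DBpedia), the following minimal Python sketch builds a handful of triples for one title and serializes them as N-Triples. Everything specific here is an assumption made for illustration: the `example.org` base URI, the use of schema.org properties as stand-ins for the Bibframe modelling the slides mention, the placeholder PPN, the example DBpedia URIs, and the helper names `newspaper_triples` and `to_ntriples`. The slides name Virtuoso as the triplestore; this sketch does not talk to it.

```python
from typing import List, NamedTuple, Optional

# Hypothetical namespace for the project's own resources (not from the slides).
BASE = "http://example.org/verzetskranten/"
# schema.org terms used as simple stand-ins; the slides mention Bibframe instead.
SCHEMA = "http://schema.org/"
OWL_SAME_AS = "http://www.w3.org/2002/07/owl#sameAs"


class Triple(NamedTuple):
    subject: str
    predicate: str
    obj: str
    is_literal: bool = False


def newspaper_triples(ppn: str, title: str, place_uri: str,
                      dbpedia_uri: Optional[str] = None) -> List[Triple]:
    """Describe one newspaper title, keyed by its PPN."""
    subject = f"{BASE}title/{ppn}"
    triples = [
        Triple(subject, SCHEMA + "name", title, is_literal=True),
        Triple(subject, SCHEMA + "identifier", ppn, is_literal=True),
        Triple(subject, SCHEMA + "locationCreated", place_uri),
    ]
    if dbpedia_uri:
        # Interlink the local resource with its DBpedia counterpart.
        triples.append(Triple(subject, OWL_SAME_AS, dbpedia_uri))
    return triples


def to_ntriples(triples: List[Triple]) -> str:
    """Serialize triples as N-Triples (escapes backslashes and quotes only)."""
    lines = []
    for t in triples:
        if t.is_literal:
            escaped = t.obj.replace("\\", "\\\\").replace('"', '\\"')
            obj = f'"{escaped}"'
        else:
            obj = f"<{t.obj}>"
        lines.append(f"<{t.subject}> <{t.predicate}> {obj} .")
    return "\n".join(lines)


if __name__ == "__main__":
    # "000000000" is a placeholder, not a real PPN.
    example = newspaper_triples(
        ppn="000000000",
        title="Het Parool",
        place_uri="http://dbpedia.org/resource/Amsterdam",
        dbpedia_uri="http://dbpedia.org/resource/Het_Parool",
    )
    print(to_ntriples(example))
```

Keeping the PPN inside the subject URI means the same identifier can be used to join a record against the KB catalogue and Delpher, which is the role the slides give to PPNs.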

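
The step "LOD-db + article template = article stub" from the slides above can be sketched in a similar way. The Python example below fills a fixed wikitext template with fields from one database record, so that the fixed strings come from the template and the variable parts from the data. The template wording, the record field names, the example record and the placeholder resolver URN are all hypothetical; the real stubs are written in Dutch and follow the project's own template. Only the category name is taken from the slides.

```python
from string import Template

# Fixed strings live in the template; $-fields come from the database record.
# English wording is used here for readability only.
STUB_TEMPLATE = Template(
    "'''$title''' was an underground newspaper published in [[$place]] "
    "during the Second World War. It appeared from $start to $end.\n\n"
    "== External links ==\n"
    "* [$delpher_url Digitized issues in Delpher]\n\n"
    "[[Categorie:$category]]\n"
)

# Hypothetical record, shaped like one row from the LOD-database.
EXAMPLE_RECORD = {
    "title": "Voorbeeldkrant",
    "place": "Amsterdam",
    "start": "1943",
    "end": "1945",
    "delpher_url": "http://resolver.kb.nl/resolve?urn=PLACEHOLDER",
    "category": "Illegale pers in de Tweede Wereldoorlog",
}


def make_stub(record: dict) -> str:
    """Return wikitext for one stub; raises KeyError if a field is missing."""
    return STUB_TEMPLATE.substitute(record)


if __name__ == "__main__":
    print(make_stub(EXAMPLE_RECORD))
```

Using `substitute` rather than `safe_substitute` is a deliberate choice in this sketch: an incomplete record fails loudly instead of producing a stub with an unfilled `$field` in it.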