Text and Data Mining at the Royal Library in the Netherlands
1. Data for Reseachers at the Koninklijke
Bibliotheek
Lotte Wilms (Research department) @lottewilms
2. The data
Mostly machine readable, structured or semi-
structured data.
The result of:
-More than 200 years of collecting
-Over 30 years of digitisation
-Almost 10 years of collecting born-digital
5. •Staten Generaal Digitaal,
KB newspapers & ANP
Radio bulletins
•Developed in CLARIN
project at TU Delft, EUR,
VU, Sound & Vision
www.polimedia.nl
6. • Developed at University of
Amsterdam
• Digitised newspaper
collection 1840 - 1995
• Transferred to KB Research
Lab & freely available
Lab.kbresearch.nl/find/ngrams
KB Newspapers
Ngramviewer
7. PoliticalMashup
• Developed by University of
Amsterdam
• Speeches of statengeneraaldigitaal.nl
and overheid.nl (1814 – now)
• Enriched content
• Visualisation tools (graphs & word
clouds)
www.search.politicalmashup.nl
8. Lessons learned/benefits
• Researchers use our data in more ways than we imagined
• Collaborations provide us with good insights into our data & users
• Opening up our data created opportunities for other users
• Strong connections with the research community
• New funding opportunities in research projects