Successfully reported this slideshow.
We use your LinkedIn profile and activity data to personalize ads and to show you more relevant ads. You can change your ad preferences anytime.
Linked Data for
Digital History
Connecting Data for Research
Victor de Boer
With input from Christophe Guéret, Serge ter B...
Victor de Boer
Web & Media Group, CS, Vrije Universiteit Amsterdam
Netherlands Institute for Sound and Vision
Cultural Her...
Digital History
Sub-discipline of digital humanities
Part of the effort of historian is moved from
the physical archives t...
Tools and visualisations
http://armstrongdigitalhistory.org/, http://www.vcdh.virginia.edu/courses/fall07/hius401-f/,
http...
“That is great. I would love that…
…but my research questions are slightly different.”
Img:Monty Python
Aging
Data Tool
C. Guéret based on http://redmonk.com/jgovernor/2007/04/05/why-applciations-are-like-fish-and-data-is-like...
Even better
Do not bake the data into the tool and treat
data as an end product.
Build tools on top of the data.
Make sure...
Linked Data for Digital History
• Represent heterogeneous datasets with their own
data models in common format: Resource
D...
Some examples
Dutch Ships and Sailors
The Problem:
((Maritime) historical) data is not integrated
KB NEWSPAPERS
Dutch-Asiatic Shipping“VOC Opvarenden”
Jur Leinenga Matthias van Rossum
Elbing voyagesArchangel voyages
DIFFERENT but LINKED DATAMODELS BASED
ON COMPETENCY QUESTIONS
dss:Record
gzmvoc:Telling
gzmvoc:telling-1046-De_Berkel
__bn...
ACCESS IT AT
HTTP://DUTCHSHIPSANDSAILORS.NL/DATA
OR
HTTP://SEMANTICWEB.CS.VU.NL/DSS
SELECT * WHERE {
?record dss:hasOrigin...
Data analysis and visualisation
DIVE
MEDIA HISTORIANS AND RESEARCHERS
MediaresearcherLarsArveRøsslandoftheUniversityofBergen.(Photo:AndreasR.Graven)
EXPLORATIV...
DATA: OPENIMAGES.EU and DELPHER.NL
ENTITY EXTRACTION
CROWDTRUTH.ORG
ENTITY EXTRACTION
EVENTS CROWDSOURCING AND LINKING TO
CONCEPTS THROUGH CROWDTRUTH.ORG
SEG...
DATA CONNECTED IN KNOWLEDGE GRAPH
DIVE:MEDIA OBJECT SEM:EVENT
SEM:PLACE
SEM:TIME
SEM:ACTOR
SKOS:CONCEPT
OA:ANNOTATION
LINK...
“DIGITAL SUBMARINE” INTERFACE
DIVE.BEELDENGELUID.NL
BiographyNet
Starting Point: Biography
Portal of the Netherlands;
www.biografischportaal.nl
125,000 short biographical
des...
Johan Rudolph Thorbecke werd
in 1798 geboren op 14 januari
in Zwolle en komt uit een half-Duit
Johan Rudolph Thorbecke wer...
a
Provenance in Biographynet
Ensure credibility of the demonstrator, to evaluate its
performance and to improve the academ...
Interface for historians
Biographynet.nl
Framework generic solutions with historians
1. Preprocess, Clean, Model, Link, Enrich data in a collaboration with
domain ...
Historical tool criticism
… willingness from historians to invest the time to
learn about computer processes (at least the...
Thank you!
Victor de Boer
http://victordeboer.com
v.de.boer@vu.nl
@victordeboer
Verrijkt Koninkrijk
30
National-
Socialist
29%
Social-
Democrat
21%
Protestant
13%
Liberal
12%
R-Catholic
12%
Communist
8%
Jewish
5%
http://se...
Results are links to paragraphs
re-usability
http://qhp.science.uva.nl/
Upcoming SlideShare
Loading in …5
×

Linked Data for Digital History presentation for VU symposium "Connecting Data for Research"

900 views

Published on

Linked Data for Digital History presentation for VU symposium "Connecting Data for Research". This presentation talks about the need for publishing interconnected research data using linked data and publishing tools and visualisaitions alongside those data. Examples include Dutch Ships and Sailors, DIVE and BiographyNet.

Published in: Education
  • Be the first to comment

Linked Data for Digital History presentation for VU symposium "Connecting Data for Research"

  1. 1. Linked Data for Digital History Connecting Data for Research Victor de Boer With input from Christophe Guéret, Serge ter Braake, Niels Ockeloen, Antske Fokkens, Dirk Roorda, Lora Aroyo, Johan Oomen, Oana Inel, Jan Wielemaker, Jeroen Entjes
  2. 2. Victor de Boer Web & Media Group, CS, Vrije Universiteit Amsterdam Netherlands Institute for Sound and Vision Cultural Heritage Digital History Linked Data for Development
  3. 3. Digital History Sub-discipline of digital humanities Part of the effort of historian is moved from the physical archives to digital ones Cross-domain collaboration Img:www.doaks.org, www.dkrz.de
  4. 4. Tools and visualisations http://armstrongdigitalhistory.org/, http://www.vcdh.virginia.edu/courses/fall07/hius401-f/, http://digitalhistory.unl.edu/essays/thomasessay.php, http://www.philipvickersfithian.com/2013/05/gender-in-stacks-on-managing-small.html
  5. 5. “That is great. I would love that… …but my research questions are slightly different.” Img:Monty Python
  6. 6. Aging Data Tool C. Guéret based on http://redmonk.com/jgovernor/2007/04/05/why-applciations-are-like-fish-and-data-is-like0wine/
  7. 7. Even better Do not bake the data into the tool and treat data as an end product. Build tools on top of the data. Make sure others can do so as well. Fig: C. Guéret
  8. 8. Linked Data for Digital History • Represent heterogeneous datasets with their own data models in common format: Resource Description Format (RDF) – Link what can be linked • re-use and re-usability • Linked Data is the (technically) best way to publish and share your (research) data OBJECT EVENT PLACE TIME PERSON CONCEPT PROVENANCE
  9. 9. Some examples
  10. 10. Dutch Ships and Sailors
  11. 11. The Problem: ((Maritime) historical) data is not integrated
  12. 12. KB NEWSPAPERS Dutch-Asiatic Shipping“VOC Opvarenden” Jur Leinenga Matthias van Rossum Elbing voyagesArchangel voyages
  13. 13. DIFFERENT but LINKED DATAMODELS BASED ON COMPETENCY QUESTIONS dss:Record gzmvoc:Telling gzmvoc:telling-1046-De_Berkel __bnode_1 gzmvoc:aziatischeBemanning dss:Ship gzmvoc:Schip gzmvoc: schip-1046-De_Berkel dss:has_ship gzmvoc:schip "1046" “Schip” “De Berkel” rdfs:label dss:scheepsnaam gzmvoc:scheepsnaam dss:ShipType gzmvoc:Scheepstype gzmvoc: type-Ship dss:has_shiptype gzmvoc:has_shiptype gzmvoc:scheepstype “21” “Moorse mattroosen” dss:azRegistratieKop gzmvoc:azAantalMatrozen gzmvoc:telling gzmvoc:heeft DAS heenreis dss:Record das:Voyage das:voyage-1918_61
  14. 14. ACCESS IT AT HTTP://DUTCHSHIPSANDSAILORS.NL/DATA OR HTTP://SEMANTICWEB.CS.VU.NL/DSS SELECT * WHERE { ?record dss:hasOriginalScan ?scan. ?record dss:has_kb_link ?kblink. ?record mdb:schip ?schip. ?schip mdb:scheepstype ?shiptype. ?shiptype skos:exactMatch ?em. ?em skos:broader* aat:kustvaarders. }
  15. 15. Data analysis and visualisation
  16. 16. DIVE
  17. 17. MEDIA HISTORIANS AND RESEARCHERS MediaresearcherLarsArveRøsslandoftheUniversityofBergen.(Photo:AndreasR.Graven) EXPLORATIVE SEARCH Digital Hermeneutics: The combination of digital (Web) technology and theory of interpretation
  18. 18. DATA: OPENIMAGES.EU and DELPHER.NL
  19. 19. ENTITY EXTRACTION CROWDTRUTH.ORG ENTITY EXTRACTION EVENTS CROWDSOURCING AND LINKING TO CONCEPTS THROUGH CROWDTRUTH.ORG SEGMENTATION & KEYFRAMES LINKING EVENTS AND CONCEPTS TO KEYFRAMES
  20. 20. DATA CONNECTED IN KNOWLEDGE GRAPH DIVE:MEDIA OBJECT SEM:EVENT SEM:PLACE SEM:TIME SEM:ACTOR SKOS:CONCEPT OA:ANNOTATION LINKS TO EUROPEANA LINKS TO DBPEDIA
  21. 21. “DIGITAL SUBMARINE” INTERFACE DIVE.BEELDENGELUID.NL
  22. 22. BiographyNet Starting Point: Biography Portal of the Netherlands; www.biografischportaal.nl 125,000 short biographical descriptions with limited metadata from 23 Dutch biographical dictionaries (~76,000 individuals) What kind of historical questions can be answered with these data with the help of computational methods Biographynet.nl
  23. 23. Johan Rudolph Thorbecke werd in 1798 geboren op 14 januari in Zwolle en komt uit een half-Duit Johan Rudolph Thorbecke werd in 1798 geboren op 14 januari in Zwolle en komt uit een half-Duit Linked Data for BiograpyNet Thorbecke Biographical Description Provenance Meta Data NNBW Person Meta Data “Thorbecke” Biography Parts Birth 1798 Event Biographical Description Enrichment NLP Tool Person Meta Data Event Birth Johan Rudolph Thorbecke werd in 1798 geboren op 14 januari in Zwolle en komt uit een half-Duit Zwolle 1798-01-14 Biographynet.nl
  24. 24. a Provenance in Biographynet Ensure credibility of the demonstrator, to evaluate its performance and to improve the academic status of the tool Information involved Sources, but also: NER input data, etc. Processes involved All steps in enrichment, aggregation… People involved Who was responsible for pipeline, tool, Biographynet.nl*Daniel Garijo, Yolanda Gil; http://www.opmw.org/model/p-plan
  25. 25. Interface for historians Biographynet.nl
  26. 26. Framework generic solutions with historians 1. Preprocess, Clean, Model, Link, Enrich data in a collaboration with domain experts 2. Access heterogeneous datasets in a convenient way to get an intuition of the character and anomalies of the (linked) data; 3. Perform arbitrary queries to retrieve results relevant to their research questions; 4. Verify the veracity of query results, by following provenance links to original material 5. Retrieve and analyze the data with tool of preference. 6. Republish and share results
  27. 27. Historical tool criticism … willingness from historians to invest the time to learn about computer processes (at least the basic principles) Possibilities for education at universities to bridge the gap between computer science and humanities studies and make tool criticism an integral part of student’s curricula “Why do we still teach history student to decipher 17th Century handwriting, but not SQL”
  28. 28. Thank you! Victor de Boer http://victordeboer.com v.de.boer@vu.nl @victordeboer
  29. 29. Verrijkt Koninkrijk
  30. 30. 30 National- Socialist 29% Social- Democrat 21% Protestant 13% Liberal 12% R-Catholic 12% Communist 8% Jewish 5% http://semanticweb.cs.vu.nl/verrijktkoninkrijk/ http://search.loedejongdigitaal.nl/
  31. 31. Results are links to paragraphs
  32. 32. re-usability http://qhp.science.uva.nl/

×