SlideShare a Scribd company logo
Using Semantic Technologies to 
Create Virtual Families from 
Historical Vital Records! 
Christophe Debruyne1,2, Oya Beyan1, Stefan Decker1 and Sandra Collins2! 
! 
1Insight @ NUI Galway! 
2Digital Repository of Ireland! 
! 
2014-09-25 @ EUON 2014!
Irish Record Linkage, 1864-1913! 
Developing a platform applying 
semantic technologies to historical 
birth, death and marriage certi!cates." 
" 
Answering questions such as: “How 
accurate are historic maternal 
mortality rates (MMR) and infant 
mortality rates (IMR) for Dublin?”" 
" 
Team consists of researchers 
(historians), digital archivists, and 
knowledge engineers." 
" 
Knowledge and 
Linked Data 
Engineers! 
Digital Historians! 
Archivists!
General Records O"ce ! 
• Vital registration data! 
– Birth-certi!cates" 
– Death-certi!cates" 
– Marriage records" 
• Digitised TIFF images of 
hardcopy indexes and registers.! 
• 2 TB of data! 
• Database describing the 
digitised records allowing 
searches on some "elds.! 
©General Records O#ce of Ireland 2014!
Challenges! 
• With respect to requirements! 
– Identifying certi!ed causes of death that can be attributed to 
maternal death." 
– Death certi!cates with no corresponding birth certi!cate" 
– Terminology used pre-1900. " 
– Capturing the socio-economical status of the families via, for 
instance, the professions, ranks of fathers." 
– … " 
• With respect to the platform! 
– Data protection" 
– Records vs. Knowledge" 
– Provenance vs. Interpretation"
GRO$Triplestore$ 
Triplestore$2$ Data$Analysis$ 
Transforma)on*from*one*model*to*another* 
• SPIN$–$SPARQL$Inference$ 
• SWRL$/$RuleML$ 
• SPARQL$Construct$ 
• …$ 
SEPARATION $OF $CONCERNS$ 
Obviously,$due$to$ 
the$sensiJve$ 
nature$of$the$ 
data,$data$ 
protecJon$is$key.$
Development of 2 ontologies! 
• 2 ontologies were developed – separation of concerns! 
• First ontology for describing the contents of records! 
– OWL 2 shallow, “#at ontology”" 
– Created by “lifting” the structure of the vital records" 
– (Marriage) Record, (Birth|Death) Certificate, Return! 
• Second ontology for data analysis! 
– OWL 2 + Rules to capture background and domain knowledge" 
– Created by means of Competency Questions (Grüninger and Fox)" 
– Person, Birth, Marriage, Death, withChild, motherOf, …! 
Grüninger, M., Fox, M.S.: The role of competency questions in enterprise engineering. In: Benchmarking 
Theory and Practice, pp. 22-31. Springer (1995)"
Tool for the Digital Archivist! 
• Records are encoded using spreadsheets – a tool the digital archivist 
is familiar with! 
• RDB-to-RDF mapping "les were de"ned to generate RDF from the in-memory 
databases created for each spreadsheet.!
Next steps! 
• Encoding a signi"cant amount of vital records in the excel "les! 
– To create the !rst triplestore; and" 
– To obtain a dataset for validating the transformations; and" 
– By consequence, validating the second ontology." 
• To investigate proper interaction with the data for the historians.! 
• Linking the data with additional context; i.e., Linked Logainm! 
– http://data.logainm.ie/ " 
– Nuno Lopes, Rebecca Grant, Brian Ó Raghallaigh, Eoghan Ó Carragáin, Sandra Collins, 
Stefan Decker: Linked Logainm: Enhancing Library Metadata Using Linked Data of Irish 
Place Names. TPDL Workshops 2013: 65-76"
More information! 
• @IRL_Project! 
• Project website http://irishrecordlinkage.wordpress.com/ ! 
! 
• In partnership with!

More Related Content

Similar to Using Semantic Technologies to Create Virtual Families from Historical Vital Records

Towards Linked Vital Registration Data for Reconstituting Families and Creati...
Towards Linked Vital Registration Data for Reconstituting Families and Creati...Towards Linked Vital Registration Data for Reconstituting Families and Creati...
Towards Linked Vital Registration Data for Reconstituting Families and Creati...
dri_ireland
 
Towards linked vital registration data for reconstituting families and creati...
Towards linked vital registration data for reconstituting families and creati...Towards linked vital registration data for reconstituting families and creati...
Towards linked vital registration data for reconstituting families and creati...
IRL_Project
 
Rebecca Grant - Approaching Archival Authenticity: when 'Records' become 'Data.
Rebecca Grant - Approaching Archival Authenticity: when 'Records' become 'Data.Rebecca Grant - Approaching Archival Authenticity: when 'Records' become 'Data.
Rebecca Grant - Approaching Archival Authenticity: when 'Records' become 'Data.
dri_ireland
 
Big Data in the Arts and Humanities: Stirling presentation
Big Data in the Arts and Humanities: Stirling presentationBig Data in the Arts and Humanities: Stirling presentation
Big Data in the Arts and Humanities: Stirling presentation
Andrew Prescott
 
Visualizing Data in Elasticsearch DevFest DC 2016
Visualizing Data in Elasticsearch DevFest DC 2016Visualizing Data in Elasticsearch DevFest DC 2016
Visualizing Data in Elasticsearch DevFest DC 2016
David Erickson
 
Predictive Analytics - BarCamp Boston 2011
Predictive Analytics - BarCamp Boston 2011Predictive Analytics - BarCamp Boston 2011
Predictive Analytics - BarCamp Boston 2011
Vedant Misra
 
Beyond Preservation: Situating Archaeological Data in Professional Practice
Beyond Preservation: Situating Archaeological Data in Professional PracticeBeyond Preservation: Situating Archaeological Data in Professional Practice
Beyond Preservation: Situating Archaeological Data in Professional Practice
Eric Kansa
 
Towards Semantic Enrichment of Newspapers: a historical ecology use case
Towards Semantic Enrichment of Newspapers: a historical ecology use case Towards Semantic Enrichment of Newspapers: a historical ecology use case
Towards Semantic Enrichment of Newspapers: a historical ecology use case
Marieke van Erp
 
Exploring the "Search" in FamilySearch
Exploring the "Search" in FamilySearchExploring the "Search" in FamilySearch
Exploring the "Search" in FamilySearch
Carol Petranek
 
Integrated Technology for Archaeological Imaging in the Field and in the Lab
Integrated Technology for Archaeological Imaging in the Field and in the LabIntegrated Technology for Archaeological Imaging in the Field and in the Lab
Integrated Technology for Archaeological Imaging in the Field and in the Lab
Ashley M. Richter
 
Session5 01.rutger vankoert
Session5 01.rutger vankoertSession5 01.rutger vankoert
Session5 01.rutger vankoert
IMPACT Centre of Competence
 
Networked history of institutions
Networked history of institutionsNetworked history of institutions
Networked history of institutions
Brian Keegan
 
Rogers digitalmethodsaftersocialmedia nov2013_optimized_
Rogers digitalmethodsaftersocialmedia nov2013_optimized_Rogers digitalmethodsaftersocialmedia nov2013_optimized_
Rogers digitalmethodsaftersocialmedia nov2013_optimized_
Digital Methods Initiative
 
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & Museums
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & MuseumsALIAOnline Practical Linked (Open) Data for Libraries, Archives & Museums
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & MuseumsJon Voss
 
Big Data Past, Present and Future – Where are we Headed? - StampedeCon 2014
Big Data Past, Present and Future – Where are we Headed? - StampedeCon 2014Big Data Past, Present and Future – Where are we Headed? - StampedeCon 2014
Big Data Past, Present and Future – Where are we Headed? - StampedeCon 2014
StampedeCon
 
Open Context and Publishing to the Web of Data: Eric Kansa's LAWDI Presentation
Open Context and Publishing to the Web of Data: Eric Kansa's LAWDI PresentationOpen Context and Publishing to the Web of Data: Eric Kansa's LAWDI Presentation
Open Context and Publishing to the Web of Data: Eric Kansa's LAWDI Presentation
ekansa
 
Big Data in the Arts and Humanities
Big Data in the Arts and HumanitiesBig Data in the Arts and Humanities
Big Data in the Arts and Humanities
Andrew Prescott
 
Content Curation by Daniel Wilksch, PROV
Content Curation by Daniel Wilksch, PROV Content Curation by Daniel Wilksch, PROV
Content Curation by Daniel Wilksch, PROV
Libmark
 
New Metaphors: Data Papers and Data Citations
New Metaphors: Data Papers and Data CitationsNew Metaphors: Data Papers and Data Citations
New Metaphors: Data Papers and Data CitationsJohn Kunze
 
Call for papers, IJORCS, Volume 3 - Issue 3
Call for papers, IJORCS, Volume 3 - Issue 3Call for papers, IJORCS, Volume 3 - Issue 3
Call for papers, IJORCS, Volume 3 - Issue 3IJORCS
 

Similar to Using Semantic Technologies to Create Virtual Families from Historical Vital Records (20)

Towards Linked Vital Registration Data for Reconstituting Families and Creati...
Towards Linked Vital Registration Data for Reconstituting Families and Creati...Towards Linked Vital Registration Data for Reconstituting Families and Creati...
Towards Linked Vital Registration Data for Reconstituting Families and Creati...
 
Towards linked vital registration data for reconstituting families and creati...
Towards linked vital registration data for reconstituting families and creati...Towards linked vital registration data for reconstituting families and creati...
Towards linked vital registration data for reconstituting families and creati...
 
Rebecca Grant - Approaching Archival Authenticity: when 'Records' become 'Data.
Rebecca Grant - Approaching Archival Authenticity: when 'Records' become 'Data.Rebecca Grant - Approaching Archival Authenticity: when 'Records' become 'Data.
Rebecca Grant - Approaching Archival Authenticity: when 'Records' become 'Data.
 
Big Data in the Arts and Humanities: Stirling presentation
Big Data in the Arts and Humanities: Stirling presentationBig Data in the Arts and Humanities: Stirling presentation
Big Data in the Arts and Humanities: Stirling presentation
 
Visualizing Data in Elasticsearch DevFest DC 2016
Visualizing Data in Elasticsearch DevFest DC 2016Visualizing Data in Elasticsearch DevFest DC 2016
Visualizing Data in Elasticsearch DevFest DC 2016
 
Predictive Analytics - BarCamp Boston 2011
Predictive Analytics - BarCamp Boston 2011Predictive Analytics - BarCamp Boston 2011
Predictive Analytics - BarCamp Boston 2011
 
Beyond Preservation: Situating Archaeological Data in Professional Practice
Beyond Preservation: Situating Archaeological Data in Professional PracticeBeyond Preservation: Situating Archaeological Data in Professional Practice
Beyond Preservation: Situating Archaeological Data in Professional Practice
 
Towards Semantic Enrichment of Newspapers: a historical ecology use case
Towards Semantic Enrichment of Newspapers: a historical ecology use case Towards Semantic Enrichment of Newspapers: a historical ecology use case
Towards Semantic Enrichment of Newspapers: a historical ecology use case
 
Exploring the "Search" in FamilySearch
Exploring the "Search" in FamilySearchExploring the "Search" in FamilySearch
Exploring the "Search" in FamilySearch
 
Integrated Technology for Archaeological Imaging in the Field and in the Lab
Integrated Technology for Archaeological Imaging in the Field and in the LabIntegrated Technology for Archaeological Imaging in the Field and in the Lab
Integrated Technology for Archaeological Imaging in the Field and in the Lab
 
Session5 01.rutger vankoert
Session5 01.rutger vankoertSession5 01.rutger vankoert
Session5 01.rutger vankoert
 
Networked history of institutions
Networked history of institutionsNetworked history of institutions
Networked history of institutions
 
Rogers digitalmethodsaftersocialmedia nov2013_optimized_
Rogers digitalmethodsaftersocialmedia nov2013_optimized_Rogers digitalmethodsaftersocialmedia nov2013_optimized_
Rogers digitalmethodsaftersocialmedia nov2013_optimized_
 
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & Museums
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & MuseumsALIAOnline Practical Linked (Open) Data for Libraries, Archives & Museums
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & Museums
 
Big Data Past, Present and Future – Where are we Headed? - StampedeCon 2014
Big Data Past, Present and Future – Where are we Headed? - StampedeCon 2014Big Data Past, Present and Future – Where are we Headed? - StampedeCon 2014
Big Data Past, Present and Future – Where are we Headed? - StampedeCon 2014
 
Open Context and Publishing to the Web of Data: Eric Kansa's LAWDI Presentation
Open Context and Publishing to the Web of Data: Eric Kansa's LAWDI PresentationOpen Context and Publishing to the Web of Data: Eric Kansa's LAWDI Presentation
Open Context and Publishing to the Web of Data: Eric Kansa's LAWDI Presentation
 
Big Data in the Arts and Humanities
Big Data in the Arts and HumanitiesBig Data in the Arts and Humanities
Big Data in the Arts and Humanities
 
Content Curation by Daniel Wilksch, PROV
Content Curation by Daniel Wilksch, PROV Content Curation by Daniel Wilksch, PROV
Content Curation by Daniel Wilksch, PROV
 
New Metaphors: Data Papers and Data Citations
New Metaphors: Data Papers and Data CitationsNew Metaphors: Data Papers and Data Citations
New Metaphors: Data Papers and Data Citations
 
Call for papers, IJORCS, Volume 3 - Issue 3
Call for papers, IJORCS, Volume 3 - Issue 3Call for papers, IJORCS, Volume 3 - Issue 3
Call for papers, IJORCS, Volume 3 - Issue 3
 

Recently uploaded

By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024
Pierluigi Pugliese
 
Elevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object CalisthenicsElevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object Calisthenics
Dorra BARTAGUIZ
 
Removing Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software FuzzingRemoving Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software Fuzzing
Aftab Hussain
 
The Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and SalesThe Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and Sales
Laura Byrne
 
Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™
Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™
Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™
UiPathCommunity
 
Leading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdfLeading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdf
OnBoard
 
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance
 
The Future of Platform Engineering
The Future of Platform EngineeringThe Future of Platform Engineering
The Future of Platform Engineering
Jemma Hussein Allen
 
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance
 
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdfFIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance
 
The Metaverse and AI: how can decision-makers harness the Metaverse for their...
The Metaverse and AI: how can decision-makers harness the Metaverse for their...The Metaverse and AI: how can decision-makers harness the Metaverse for their...
The Metaverse and AI: how can decision-makers harness the Metaverse for their...
Jen Stirrup
 
GraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge GraphGraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge Graph
Guy Korland
 
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to ProductionGenerative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Aggregage
 
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
UiPathCommunity
 
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex ProofszkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
Alex Pruden
 
Monitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR EventsMonitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR Events
Ana-Maria Mihalceanu
 
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdfObservability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Paige Cruz
 
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdfSmart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
91mobiles
 
PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)
Ralf Eggert
 
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdfFIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance
 

Recently uploaded (20)

By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024
 
Elevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object CalisthenicsElevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object Calisthenics
 
Removing Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software FuzzingRemoving Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software Fuzzing
 
The Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and SalesThe Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and Sales
 
Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™
Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™
Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™
 
Leading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdfLeading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdf
 
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
 
The Future of Platform Engineering
The Future of Platform EngineeringThe Future of Platform Engineering
The Future of Platform Engineering
 
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
 
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdfFIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdf
 
The Metaverse and AI: how can decision-makers harness the Metaverse for their...
The Metaverse and AI: how can decision-makers harness the Metaverse for their...The Metaverse and AI: how can decision-makers harness the Metaverse for their...
The Metaverse and AI: how can decision-makers harness the Metaverse for their...
 
GraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge GraphGraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge Graph
 
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to ProductionGenerative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to Production
 
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
 
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex ProofszkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
 
Monitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR EventsMonitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR Events
 
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdfObservability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
 
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdfSmart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
 
PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)
 
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdfFIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
 

Using Semantic Technologies to Create Virtual Families from Historical Vital Records

  • 1. Using Semantic Technologies to Create Virtual Families from Historical Vital Records! Christophe Debruyne1,2, Oya Beyan1, Stefan Decker1 and Sandra Collins2! ! 1Insight @ NUI Galway! 2Digital Repository of Ireland! ! 2014-09-25 @ EUON 2014!
  • 2. Irish Record Linkage, 1864-1913! Developing a platform applying semantic technologies to historical birth, death and marriage certi!cates." " Answering questions such as: “How accurate are historic maternal mortality rates (MMR) and infant mortality rates (IMR) for Dublin?”" " Team consists of researchers (historians), digital archivists, and knowledge engineers." " Knowledge and Linked Data Engineers! Digital Historians! Archivists!
  • 3. General Records O"ce ! • Vital registration data! – Birth-certi!cates" – Death-certi!cates" – Marriage records" • Digitised TIFF images of hardcopy indexes and registers.! • 2 TB of data! • Database describing the digitised records allowing searches on some "elds.! ©General Records O#ce of Ireland 2014!
  • 4. Challenges! • With respect to requirements! – Identifying certi!ed causes of death that can be attributed to maternal death." – Death certi!cates with no corresponding birth certi!cate" – Terminology used pre-1900. " – Capturing the socio-economical status of the families via, for instance, the professions, ranks of fathers." – … " • With respect to the platform! – Data protection" – Records vs. Knowledge" – Provenance vs. Interpretation"
  • 5. GRO$Triplestore$ Triplestore$2$ Data$Analysis$ Transforma)on*from*one*model*to*another* • SPIN$–$SPARQL$Inference$ • SWRL$/$RuleML$ • SPARQL$Construct$ • …$ SEPARATION $OF $CONCERNS$ Obviously,$due$to$ the$sensiJve$ nature$of$the$ data,$data$ protecJon$is$key.$
  • 6. Development of 2 ontologies! • 2 ontologies were developed – separation of concerns! • First ontology for describing the contents of records! – OWL 2 shallow, “#at ontology”" – Created by “lifting” the structure of the vital records" – (Marriage) Record, (Birth|Death) Certificate, Return! • Second ontology for data analysis! – OWL 2 + Rules to capture background and domain knowledge" – Created by means of Competency Questions (Grüninger and Fox)" – Person, Birth, Marriage, Death, withChild, motherOf, …! Grüninger, M., Fox, M.S.: The role of competency questions in enterprise engineering. In: Benchmarking Theory and Practice, pp. 22-31. Springer (1995)"
  • 7. Tool for the Digital Archivist! • Records are encoded using spreadsheets – a tool the digital archivist is familiar with! • RDB-to-RDF mapping "les were de"ned to generate RDF from the in-memory databases created for each spreadsheet.!
  • 8. Next steps! • Encoding a signi"cant amount of vital records in the excel "les! – To create the !rst triplestore; and" – To obtain a dataset for validating the transformations; and" – By consequence, validating the second ontology." • To investigate proper interaction with the data for the historians.! • Linking the data with additional context; i.e., Linked Logainm! – http://data.logainm.ie/ " – Nuno Lopes, Rebecca Grant, Brian Ó Raghallaigh, Eoghan Ó Carragáin, Sandra Collins, Stefan Decker: Linked Logainm: Enhancing Library Metadata Using Linked Data of Irish Place Names. TPDL Workshops 2013: 65-76"
  • 9. More information! • @IRL_Project! • Project website http://irishrecordlinkage.wordpress.com/ ! ! • In partnership with!