Towards Linked Vital Registration Data for
Reconstituting Families and Creating
Longitudinal Health Histories
Oya Beyan, Ciara Breathnach, Sandra Collins,
Christophe Debruyne, Stefan Decker, Dolores Grant,
Rebecca Grant, and Brian Gurrin
21st of July 2014 – KR4HC Workshop – Vienna, Austria
Irish Record Linkage, 1864-1913
• Developing a platform applying semantic
technologies to historical birth, death and
marriage certificates.
• Answering questions such as: “How accurate are
historic maternal mortality rates (MMR) and
infant mortality rates (IMR) for Dublin?”
• Team consists of researchers (historians), digital
archivists, and knowledge engineers.
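
The two rates named in the question above are simple ratios, and a minimal sketch can make the arithmetic explicit. The scaling factors are assumptions based on common modern convention (infant deaths per 1,000 live births, maternal deaths per 100,000 live births); historical studies may use other denominators, and the slides do not say which the project uses. The function names are hypothetical.

```python
def rate_per(events: int, live_births: int, per: int) -> float:
    """Return the number of events per `per` live births."""
    if live_births <= 0:
        raise ValueError("live_births must be positive")
    return events * per / live_births


def infant_mortality_rate(infant_deaths: int, live_births: int) -> float:
    # Assumption: deaths under one year of age per 1,000 live births.
    return rate_per(infant_deaths, live_births, 1_000)


def maternal_mortality_ratio(maternal_deaths: int, live_births: int) -> float:
    # Assumption: maternal deaths per 100,000 live births (modern convention).
    return rate_per(maternal_deaths, live_births, 100_000)
```

Both numerators depend on how deaths are classified and linked to births, which is why the accuracy of the historic figures is in question.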
21/07/2014 2
Data: General Register Office (GRO) Records
• Vital registration data
– Birth certificates
– Death certificates
– Marriage records
• Digitised TIFF images of
hardcopy indexes and
registers.
• 2 TB of data
• Database describing the
digitised records allowing
searches on some fields.
©General Records Office of Ireland 2014
Challenges
• Certified causes of death that can be attributed to maternal
death
– Within 42 days after labour – before (1864) it was 12
– Septicemia (blood poisoning), Fever, …
– “Corresponding” birth certificate?
• Death certificates with no corresponding birth certificate
• “Gaps” in sibship intervals for which no birth or death
certificates can be found.
• The terminology used pre-1900, e.g., “debile” to denote
weak or a failure to thrive.
• Capturing the socio-economic status of the families via,
for instance, the professions and ranks of fathers.
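
The 42-day criterion above can be sketched as a date-window check. This is a minimal illustration only: the record classes and field names are hypothetical, and it assumes the woman named in a death record has already been linked to the mother named in a birth record, which is itself one of the hard problems listed above.

```python
from dataclasses import dataclass
from datetime import date, timedelta

# The 42-day window comes from the slide; everything else here is assumed.
MATERNAL_WINDOW = timedelta(days=42)


@dataclass(frozen=True)
class Birth:
    mother_id: str          # hypothetical identifier of the linked mother
    date_of_birth: date


@dataclass(frozen=True)
class Death:
    person_id: str          # hypothetical identifier of the deceased
    date_of_death: date
    cause: str


def candidate_maternal_deaths(deaths, births, window=MATERNAL_WINDOW):
    """Return deaths that occur within `window` after a birth by the same woman."""
    births_by_mother = {}
    for birth in births:
        births_by_mother.setdefault(birth.mother_id, []).append(birth.date_of_birth)

    flagged = []
    for death in deaths:
        for born in births_by_mother.get(death.person_id, []):
            # Keep only deaths on or after the birth and inside the window.
            if timedelta(0) <= death.date_of_death - born <= window:
                flagged.append(death)
                break
    return flagged
```

A flagged death is only a candidate: the certified cause (for example septicemia or fever) still has to be judged by a historian.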
Conceptual Architecture
[Diagram. Labelled components: Digital Archivist; GRO records as RDF; Repository; Updater; Updates; Linker; Links; Triple-store; SPARQL endpoint / Linked Data Server; Analytics; Researcher.]
[Diagram. Labels: PRESERVATION; DATA ANALYTICS; SEPARATION OF CONCERNS; GRO Triplestore; Triplestore 2 Data Analysis.]
Links to external datasets: e.g., Logainm – a database of Irish historical and
contemporary place names to provide additional context.
Development of 2 ontologies
Due to the sensitive nature of the data, data protection is key.
Transformation from one model to another
• SPIN – SPARQL Inferencing Notation
• SWRL / RuleML
• SPARQL CONSTRUCT
• …
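
To illustrate what transforming one model into another means here, the sketch below applies a single rule over triples held as plain Python tuples, in the spirit of a SPARQL CONSTRUCT query. It is not the project's implementation. Only irl:BirthRecord comes from the slides; the ex: and ana: property names are hypothetical, and each subject is assumed to have at most one value per property.

```python
def construct_family_triples(record_triples):
    """Derive analysis-model triples from record-model triples.

    Rule: for every birth record that names a mother and a child,
    emit a motherOf link and a pointer back to the source record.
    """
    # Index the input by subject. Assumes one value per property.
    by_subject = {}
    for subject, predicate, obj in record_triples:
        by_subject.setdefault(subject, {})[predicate] = obj

    derived = set()
    for record, props in by_subject.items():
        if props.get("rdf:type") != "irl:BirthRecord":
            continue
        mother = props.get("ex:mother")   # hypothetical property
        child = props.get("ex:child")     # hypothetical property
        if mother is None or child is None:
            continue
        derived.add((mother, "ana:motherOf", child))      # hypothetical property
        derived.add((child, "ana:describedBy", record))   # hypothetical property
    return derived
```

Keeping the rule separate from the record data mirrors the separation of concerns above: the preserved records stay untouched, and the analysis model is derived from them.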
Development of 2 ontologies
• 2 ontologies were developed – separation of concerns
• First ontology for describing the contents of records
– OWL 2 shallow, “flat ontology”
• Second ontology for data analysis
– OWL 2 + rules
– Rules to capture background and domain knowledge
– Developed by having the historians formulate competency
questions (Grüninger and Fox)
– Captured graphically using Object Role Modelling
Graphical Representation in ORM
### Prefixes omitted …
irl:Record a owl:Class ;
    rdfs:label "Record" ; .
irl:Certificate a owl:Class ;
    rdfs:label "Certificate" ;
    rdfs:subClassOf irl:Record ; .
irl:BirthRecord a owl:Class ;
    rdfs:label "Birth Record" ;
    rdfs:subClassOf irl:Certificate ; .
irl:DeathRecord a owl:Class ;
    rdfs:label "Death Record" ;
    rdfs:subClassOf irl:Certificate ; .
irl:MarriageRecord a owl:Class ;
    rdfs:label "Marriage Record" ;
    rdfs:subClassOf irl:Record ; .
irl:Return a owl:Class ;
    rdfs:label "Return" ; .
…
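
The class hierarchy in the listing above can be explored with a short sketch that computes all superclasses of a class by following rdfs:subClassOf transitively. The dictionary simply restates the subclass axioms shown on the slide; the function name is hypothetical, and a real deployment would more likely rely on a reasoner or the triple store.

```python
# rdfs:subClassOf axioms copied from the listing above (the elided part is not included).
SUBCLASS_OF = {
    "irl:Certificate": {"irl:Record"},
    "irl:BirthRecord": {"irl:Certificate"},
    "irl:DeathRecord": {"irl:Certificate"},
    "irl:MarriageRecord": {"irl:Record"},
}


def superclasses(cls, subclass_of=SUBCLASS_OF):
    """Return every class reachable from `cls` via rdfs:subClassOf."""
    seen = set()
    stack = [cls]
    while stack:
        current = stack.pop()
        for parent in subclass_of.get(current, ()):
            if parent not in seen:
                seen.add(parent)
                stack.append(parent)
    return seen
```

With these axioms, a birth record is both a certificate and a record, while a marriage record is a record but not a certificate.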
Conclusions
• Presented the problem and highlighted the
challenges
• Developed two ontologies
– Encoding contents of digitised GRO records for
long-term digital preservation (DRI)
– Data analytics to answer the researchers’
questions – in this case, a historian’s
• Data exploration and annotation of the
records started on a subset of the dataset
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering Tips
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 

Towards Linked Vital Registration Data for Reconstituting Families and Creating Longitudinal Health Histories

  • 1. Towards Linked Vital Registration Data for Reconstituting Families and Creating Longitudinal Health Histories. Oya Beyan, Ciara Breathnach, Sandra Collins, Christophe Debruyne, Stefan Decker, Dolores Grant, Rebecca Grant, and Brian Gurrin. 21st of July 2014 – KR4HC Workshop – Vienna, Austria
  • 2. Irish Record Linkage, 1864-1913 • Developing a platform applying semantic technologies to historical birth, death and marriage certificates. • Answering questions such as: “How accurate are historic maternal mortality rates (MMR) and infant mortality rates (IMR) for Dublin?” • Team consists of researchers (historians), digital archivists, and knowledge engineers.
  • 3. Data: General Office Records • Vital registration data – Birth-certificates – Death-certificates – Marriage records • Digitised TIFF images of hardcopy indexes and registers. • 2 TB of data • Database describing the digitised records allowing searches on some fields. ©General Records Office of Ireland 2014
  • 4. Challenges • Certified causes of death that can be attributed to maternal death – Within 42 days after labour – before (1864) it was 12 – Septicemia (blood poisoning), Fever, … – “Corresponding” birth certificate? • Death certificates with no corresponding birth certificate • “Gaps” in sibship interval, even though no birth or death certificates can be found. • The terminology used pre-1900. E.g., “debile” to denote weak or a failure to thrive. • Capturing the socio-economic status of the families via, for instance, the professions and ranks of fathers.
  • 5. Conceptual Architecture [Diagram with two stages, Preservation and Data Analytics. Labelled components: Digital Archivist, Repository, Updater, Linker, Triplestore, Linked Data Server, SPARQL endpoint / Linked Data Server, Analytics, Researcher. Labelled flows: Updates, GRO records as RDF, Links.] Links to external datasets: e.g., Logainm – a database of Irish historical and contemporary place names to provide additional context.
  • 6. Development of 2 ontologies [Diagram: GRO Triplestore and Triplestore 2 (Data Analysis), divided by a “Separation of concerns” boundary.] Transformation from one model to another: • SPIN (SPARQL Inferencing Notation) • SWRL / RuleML • SPARQL Construct • … Obviously, due to the sensitive nature of the data, data protection is key.
  • 7. Development of 2 ontologies • 2 ontologies were developed – separation of concerns • First ontology for describing the contents of records – OWL 2 shallow, “flat ontology” • Second ontology for data analysis – OWL 2 + rules – Rules to capture background and domain knowledge – Developed by having the historians formulate competency questions (Grüninger and Fox) – Captured graphically using Object Role Modelling
  • 8. Graphical Representation in ORM
  • 9. ### Prefixes omitted
      …
      irl:Record a owl:Class ;
        rdfs:label "Record" ;
      .
      irl:Certificate a owl:Class ;
        rdfs:label "Certificate" ;
        rdfs:subClassOf irl:Record ;
      .
      irl:BirthRecord a owl:Class ;
        rdfs:label "Birth Record" ;
        rdfs:subClassOf irl:Certificate ;
      .
      irl:DeathRecord a owl:Class ;
        rdfs:label "Death Record" ;
        rdfs:subClassOf irl:Certificate ;
      .
      irl:MarriageRecord a owl:Class ;
        rdfs:label "Marriage Record" ;
        rdfs:subClassOf irl:Record ;
      .
      irl:Return a owl:Class ;
        rdfs:label "Return" ;
      .
      …
  • 10. Conclusions • Presented the problem and highlighted the challenges • Developed two ontologies – Encoding contents of digitized GRO records for long-term digital preservation (DRI) – Data analytics to answer the researchers’ question – in this case a historian • Data exploration and annotation of the records started on a subset of the dataset
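
The 42-day window named under Challenges (slide 4) could be checked along the following lines. This is only a minimal sketch, not the project's method: the record layout, the field names (`mother`, `deceased`, `date`), the sample people and dates, and the helper `candidate_maternal_deaths` are all made up for illustration, and matching on an exact name string is a deliberate simplification of real record linkage.

```python
from datetime import date, timedelta

# Assumption: 42 days after labour, as stated on the Challenges slide.
MATERNAL_WINDOW = timedelta(days=42)


def candidate_maternal_deaths(births, deaths, window=MATERNAL_WINDOW):
    """Pair each death with births by the same woman at most `window` earlier.

    Returns (death id, birth id, days elapsed) tuples. The records are
    hypothetical dictionaries, not the GRO schema.
    """
    pairs = []
    for death in deaths:
        for birth in births:
            same_woman = birth["mother"] == death["deceased"]
            elapsed = death["date"] - birth["date"]
            if same_woman and timedelta(0) <= elapsed <= window:
                pairs.append((death["id"], birth["id"], elapsed.days))
    return pairs


# Invented sample data for illustration only.
births = [
    {"id": "b1", "mother": "Mary Byrne", "date": date(1880, 3, 1)},
    {"id": "b2", "mother": "Anne Doyle", "date": date(1880, 1, 10)},
]
deaths = [
    {"id": "d1", "deceased": "Mary Byrne", "date": date(1880, 3, 20)},
    {"id": "d2", "deceased": "Anne Doyle", "date": date(1880, 6, 1)},
]

matches = candidate_maternal_deaths(births, deaths)
print(matches)
```

Only the first death falls inside the window here; the second occurs months after the birth and is not flagged. A death flagged this way is a candidate only, since the certified cause of death still has to be assessed.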
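
The class hierarchy on slide 9 can be made concrete with a small sketch that follows the `rdfs:subClassOf` statements upward. The mapping below copies only the statements shown on the slide; the helper names `superclasses` and `is_a` are illustrative assumptions, and a real deployment would presumably leave this inference to the triplestore or an OWL reasoner.

```python
# rdfs:subClassOf statements as they appear on slide 9 (child -> parent).
SUBCLASS_OF = {
    "irl:Certificate": "irl:Record",
    "irl:BirthRecord": "irl:Certificate",
    "irl:DeathRecord": "irl:Certificate",
    "irl:MarriageRecord": "irl:Record",
}


def superclasses(cls):
    """Return the chain of superclasses of `cls`, nearest first."""
    chain = []
    while cls in SUBCLASS_OF:
        cls = SUBCLASS_OF[cls]
        chain.append(cls)
    return chain


def is_a(cls, ancestor):
    """True if `cls` equals `ancestor` or is a (transitive) subclass of it."""
    return cls == ancestor or ancestor in superclasses(cls)


print(superclasses("irl:BirthRecord"))
print(is_a("irl:MarriageRecord", "irl:Certificate"))
```

As the slide is written, birth and death records are certificates, while marriage records sit directly under `irl:Record`, so a query over `irl:Certificate` would not return them.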