SlideShare a Scribd company logo
1 of 49
Download to read offline
A distributed network of digital
heritage information
UNESCO-NDL INDIA INTERNATIONAL WORKSHOP
ON KNOWLEDGE ENGINEERING FOR DIGITAL LIBRARY DESIGN
Enno Meijers / 25 October 2017 / New Delhi - India
https://www.google.nl/search?q=enno+meijers+netherlands
A short introduction….
Contents
1. Introduction to Dutch Digital Heritage Network
2. Evaluating our current digital heritage infrastructure
3. Strategies for improvement
4. Building a distributed network for digital heritage information
1. Introduction to Dutch Digital Heritage Network
The Digital Heritage Network (NDE) aims at
increasing the social value of the heritage
information maintained by libraries, archives,
museums and other cultural heritage institutions.
This strategy offers a perspective on developing
a national, cross-sector infrastructure of digital
heritage facilities.
It focuses on long term cooperation between the
government and the institutions on national,
regional and local level. It is about organizing the
network of people and information!
National Digital Heritage strategic plan (2015)
Available metadata and collections online for general use (Europe)
Source: Enumerate Survey 2017
Thinking from the user’s perspective
Thinking from the user’s perspective also
means seeking out the digital platforms
and work environments where potential
users can already be found.
The attractiveness of information to a
certain user group is not determined only
by the nature of the information, but also
by the method and location through which
that information is offered.
2. Evaluating current infrastructure
General setup of digital heritage portals
Heritage information consisting of GLAM datasets and science collections
Evaluating current approach (1)
Positive results so far:
• many sources available through OAI-PMH protocol
• powerful and smart protocol for metadata synchronization
• opened up data silos
• created the need for aligning data models
• made cross-collection and cross-domain discovery possible (e.g. Europeana)
Evaluating current approach (2)
But there are two main problems areas:
• poor semantic alignment
• inefficient data integration
Problem #1: Poor semantic alignment
Infrastructure:
• no sustainable identifiers (URIs) for objects
• use of strings instead of URIs for terms (who, what, where, when,..)
• no shared terminology sources available
• no provisions for linking to external references
Publishing:
• implementations lack support for multiple data models
• at best data is ‘flattened’ to a common data model (EDM, Dublin Core)
• loss of meaning due to transformation
Problem #2: Inefficient data integration
Aggregation is based on physical data integration
• Dutch cultural heritage sector: 1500 institutions, >>1500 collections
• integration model is based on copying data (OAI-PMH)
• synchronization with the data source needs permanent attention
• ownership, licensing, provenance, control over access are difficult topics
• no feedback loop to the data source (usage, cleaning, enrichments)
• large distance between data source owner and user of the data
• centralized model leads to scalability problems
See also:
Miel Vander Sande et al. , Towards sustainable publishing and querying of distributed Linked Data archives - Journal of Documentation (2017)
Herbert Van de Sompel - Reminiscing About 15 Years of Interoperability Efforts - D-lib Magazine - December (2015)
Networks of aggregators...
In general - discovery of the “deep web”
• Institutional repositories, collection management systems
• Millions of ‘invisible’ datasets: publications, research data, heritage collections
• Poor coverage by regular search engines
• Metadata is key, describing physical materials or (licensed) digital content
• Demand for cross-institutional, cross-domain discovery
• Many specialized portals giving access to different views
3. Strategies for improvement
The Digital Heritage Network
is developing a three-layered
approach for improving the
sustainability, the usability
and the visibility of digital
heritage information.
sustainable
usable
visible
The Digital Heritage Network
is developing a three-layered
approach for improving the
sustainability, the usability
and the visibility of digital
heritage information.
sustainable
usable
visible
• build service portals as views based on a common data layer
• minimize the intermediate layers
• support decentralized discovery
• refer to the source instead of copying
• maximize the usability of data at the source
• develop a sustainable, ‘web-centric’ solution
• use HTTP, RDF and RESTful APIs as building blocks
=> implement the Linked Data principals
Inspired by the work of Ruben Verborgh, Herbert Van de Sompel and colleagues:
See for example: Miel Vander Sande et al. , Towards sustainable publishing and querying of distributed
Linked Data archives - Journal of Documentation (2017)
Design principles for a discovery infrastructure
At the data source level:
• use sustainable URIs to identify the resources
• use formal definitions for persons, places, concepts, events
• use domain data models to describe the data
• add support for cross-domain discovery (Europeana Data Model, Schema.org,...)
• publish the collection information as Linked Data
=> Work with the IT suppliers as strategic partners for the implementation!
Implementing Linked Data principles (1)
At the network level:
• create a ‘network of terms’ for shared terminology
• provide tools for alignment and linking
• create alignments and links between different terminology sources
• provide easy access to shared terminology for collection management systems
(API)
=> Provide open and cross-domain solutions at the network level!
Implementing Linked Data principles (2)
Building on previous work
Ok, but how will our Linked Data be found?
The Linked Open Data cloud…
The Semantic Web is still a dream… #1
 So discovery of Linked Data requires
registering datasets?!
A tiny example...suppose a resource is defined as:
museum_X:object1
a nde:painting ;
dct:subject aat:windmill .
The Semantic Web is still a dream… #2
“Windmill”
A tiny example...suppose a resource is defined as:
museum_X:object1
a nde:painting ;
dct:subject aat:windmill .
For ‘browsable Linked Data’ you should(!) add the inverse relation [1],[2]:
aat:windmill
a skos:Concept ;
skos:prefLabel “Windmill“@en ;
dct:isSubjectOf museum_X:object1 .
The Semantic Web is still a dream… #2
[1]: Tim Berner’s Lee on ‘browsable linked data’ (2006) [2]: Tom Heath and Christian Bizer on ‘Incoming Links’ (2011)
“Windmill”
“Windmill”
A tiny example...suppose a resource is defined as:
museum_X:object1
a nde:painting ;
dct:subject aat:windmill .
For ‘browsable Linked Data’ you should(!) add the inverse relation [1],[2]:
aat:windmill
a skos:Concept ;
skos:prefLabel “Windmill“@en ;
dct:isSubjectOf museum_X:object1 .
=> a Linked Data integration problem, the lack of “backlinks”
The Semantic Web is still a dream… #2
[1]: Tim Berner’s Lee on ‘browsable linked data’ (2006) [2]: Tom Heath and Christian Bizer on ‘Incoming Links’ (2011)
“Windmill”
“Windmill”
Some theory on Linked Data integration
Possible approaches to make Linked Data work:
1. Semantic integration
2. Physical integration
3. Virtual integration – standard approach
4. Virtual integration – using Linked Data Fragments
Actions:
• implement schema.org
• let search engines ‘infer’ the relations
• query the search engines
Outcome:
• is the data interesting enough for Google?
• what about special thematic or regional views?
• can we reuse the results of the integration? (NO!)
1. Semantic integration only
Actions:
• aggregate all the related Linked Data sources
• build large triplestore and infer the relations
• query the aggregated data
Outcome:
• approach still based on copying
• same problems as traditional aggregation!
2. Physical integration of linked data
Actions:
• publish Linked Data through triplestore with
SPARQL endpoint
• build a central query engine to integrate the
results
Outcome:
• implementing a triplestore is hard for small data
providers
• federated querying over multiple triplestores
performs poorly
3a. Virtual integration - standard approach
Actions:
• publish Linked Data using Linked Data Fragments
(LDF) technology
• build a central LDF based query engine to integrate
the results
Outcome:
• easy implementation for small data providers
• federated querying is supported
• more difficult to process the result
• possible support for time-based versions (Memento)
See also: Miel Vander Sande et al. , (2017) Towards sustainable publishing and querying of distributed Linked Data archives -
Journal of Documentation
3b. Virtual integration - using LDF
Problem:
• query many data sources at the same time
is not realistic…
Solution:
• build a Knowledge Base with backlinks to
support the discovery process
• select relevant sources for querying based
on the Knowledge Base
See also: Miel Vander Sande et al. (2016) Hypermedia-Based Discovery for Source Selection Using Low-Cost Linked Data Interfaces
(IJSWIS) 12(3) 79–110
More advanced:
data source profiling or dataset summaries
But federation needs selection of sources…
4. Building the distributed network
of Dutch Digital Heritage information
Strategy for our distributed network
1. build a service for shared entities for Dutch digital heritage
2. improve the usability of the data source:
- align object descriptions with shared terminology
- publish data as Linked Data
semantic
alignment
Strategy for our distributed network
1. build a service for shared entities for Dutch digital heritage
2. improve the usability of the data source:
- align object descriptions with shared terminology
- publish data as Linked Data
3. build a discovery infrastructure:
- register organizations and datasets in a registry
- build knowledge graph to support discovery (“backlinks”)
4. implement virtual data integration technology :
- use registry and knowledge graph for selecting the resources
- support federated querying (or selective aggregation)
semantic
alignment
data
integration
https://github.com/netwerk-digitaal-erfgoed/high-level-design
High-level design of our discovery infrastructure
Roadmap (1)
Phase 1 – functional design / developing partnerships:
• design of supporting cross-domain functionality
• develop partnerships with IT suppliers and specialists
• develop domain and cross domain strategies
Phase 2 – enrich the current (OAI-PMH based) infrastructure:
• build a registry for organizations and datasets
• build a network of terms to provide shared terminology for discovery
• upgrade object descriptions with formal definitions (URIs)
Roadmap (2)
Phase 3: implement Linked Data technology at the network level
• make aggregators Linked Data compliant
• build a knowledge graph with backlinks for discovery
• support federated querying (or selective harvesting)
Phase 4: realize the distributed network of heritage information
• make collection management systems Linked Data compliant
• transform aggregators to service portals for discovery
Thank you for your attention!
please share your thoughts with us...
email: enno.meijers at kb.nl
twitter, slideshare: ennomeijers
https://github.com/netwerk-digitaal-erfgoed

More Related Content

What's hot

Next Steps for IMLS's National Digital Platform
Next Steps for IMLS's National Digital PlatformNext Steps for IMLS's National Digital Platform
Next Steps for IMLS's National Digital Platform
Trevor Owens
 
The cloud and the commons
The cloud and the commonsThe cloud and the commons
The cloud and the commons
Tom Ipri
 
Creating digital libraries in support of learning communities using free t...
Creating digital libraries in support of learning communities using free t...Creating digital libraries in support of learning communities using free t...
Creating digital libraries in support of learning communities using free t...
Maggie Verster
 
Overview AG AKSW
Overview AG AKSWOverview AG AKSW
Overview AG AKSW
Sören Auer
 
Web Science Synergies: Exploring Web Knowledge through the Semantic Web
Web Science Synergies: Exploring Web Knowledge through the Semantic WebWeb Science Synergies: Exploring Web Knowledge through the Semantic Web
Web Science Synergies: Exploring Web Knowledge through the Semantic Web
Stefan Dietze
 
ESWC SS 2013 - Monday Keynote Stefan Decker: From Linked Data to Networked Kn...
ESWC SS 2013 - Monday Keynote Stefan Decker: From Linked Data to Networked Kn...ESWC SS 2013 - Monday Keynote Stefan Decker: From Linked Data to Networked Kn...
ESWC SS 2013 - Monday Keynote Stefan Decker: From Linked Data to Networked Kn...
eswcsummerschool
 

What's hot (20)

Beyond Meta-Data: Nano-Publications Recording Scientific Endeavour
Beyond Meta-Data: Nano-Publications Recording Scientific EndeavourBeyond Meta-Data: Nano-Publications Recording Scientific Endeavour
Beyond Meta-Data: Nano-Publications Recording Scientific Endeavour
 
Next Steps for IMLS's National Digital Platform
Next Steps for IMLS's National Digital PlatformNext Steps for IMLS's National Digital Platform
Next Steps for IMLS's National Digital Platform
 
Access to Digital Back Copy
Access to Digital Back CopyAccess to Digital Back Copy
Access to Digital Back Copy
 
Auclair 3D, Immersive Virtual Reality Augmented Reality
Auclair 3D, Immersive Virtual Reality Augmented RealityAuclair 3D, Immersive Virtual Reality Augmented Reality
Auclair 3D, Immersive Virtual Reality Augmented Reality
 
EDF2014: Vedran Sabol, Head of the Knowledge Visualisation Area, Know-Center,...
EDF2014: Vedran Sabol, Head of the Knowledge Visualisation Area, Know-Center,...EDF2014: Vedran Sabol, Head of the Knowledge Visualisation Area, Know-Center,...
EDF2014: Vedran Sabol, Head of the Knowledge Visualisation Area, Know-Center,...
 
New challenges for digital scholarship and curation in the era of ubiquitous ...
New challenges for digital scholarship and curation in the era of ubiquitous ...New challenges for digital scholarship and curation in the era of ubiquitous ...
New challenges for digital scholarship and curation in the era of ubiquitous ...
 
A structured catalog of open educational datasets
A structured catalog of open educational datasetsA structured catalog of open educational datasets
A structured catalog of open educational datasets
 
How can the cultural heritage community best meet the challenges of email arc...
How can the cultural heritage community best meet the challenges of email arc...How can the cultural heritage community best meet the challenges of email arc...
How can the cultural heritage community best meet the challenges of email arc...
 
Linked Open Data for Digital Humanities
Linked Open Data for Digital HumanitiesLinked Open Data for Digital Humanities
Linked Open Data for Digital Humanities
 
Sirris innovate2011 - Smart Products with smart data - introduction, Dr. Elen...
Sirris innovate2011 - Smart Products with smart data - introduction, Dr. Elen...Sirris innovate2011 - Smart Products with smart data - introduction, Dr. Elen...
Sirris innovate2011 - Smart Products with smart data - introduction, Dr. Elen...
 
DIGITAL LIBRARIES AND THE CHALLENGE OF A "DIGITAL DARK AGES"
DIGITAL LIBRARIES AND THE CHALLENGE OF A "DIGITAL DARK AGES"DIGITAL LIBRARIES AND THE CHALLENGE OF A "DIGITAL DARK AGES"
DIGITAL LIBRARIES AND THE CHALLENGE OF A "DIGITAL DARK AGES"
 
The cloud and the commons
The cloud and the commonsThe cloud and the commons
The cloud and the commons
 
WORLD CAT AS BIG DATA
WORLD CAT AS  BIG DATAWORLD CAT AS  BIG DATA
WORLD CAT AS BIG DATA
 
Semantic web on Cloud Infrastructure
Semantic web on Cloud InfrastructureSemantic web on Cloud Infrastructure
Semantic web on Cloud Infrastructure
 
Self-Organizing Logical-Clustering Topology for Managing Distributed Context ...
Self-Organizing Logical-Clustering Topology for Managing Distributed Context ...Self-Organizing Logical-Clustering Topology for Managing Distributed Context ...
Self-Organizing Logical-Clustering Topology for Managing Distributed Context ...
 
Creating digital libraries in support of learning communities using free t...
Creating digital libraries in support of learning communities using free t...Creating digital libraries in support of learning communities using free t...
Creating digital libraries in support of learning communities using free t...
 
Overview AG AKSW
Overview AG AKSWOverview AG AKSW
Overview AG AKSW
 
Web Science Synergies: Exploring Web Knowledge through the Semantic Web
Web Science Synergies: Exploring Web Knowledge through the Semantic WebWeb Science Synergies: Exploring Web Knowledge through the Semantic Web
Web Science Synergies: Exploring Web Knowledge through the Semantic Web
 
Using a digital library to organise your life
Using a digital library to organise your lifeUsing a digital library to organise your life
Using a digital library to organise your life
 
ESWC SS 2013 - Monday Keynote Stefan Decker: From Linked Data to Networked Kn...
ESWC SS 2013 - Monday Keynote Stefan Decker: From Linked Data to Networked Kn...ESWC SS 2013 - Monday Keynote Stefan Decker: From Linked Data to Networked Kn...
ESWC SS 2013 - Monday Keynote Stefan Decker: From Linked Data to Networked Kn...
 

Similar to A distributed network of digital heritage information - Unesco/NDL India

Connecting the Dots: Linking Digitized Collections Across Metadata Silos
Connecting the Dots: Linking Digitized Collections Across Metadata SilosConnecting the Dots: Linking Digitized Collections Across Metadata Silos
Connecting the Dots: Linking Digitized Collections Across Metadata Silos
OCLC
 
Linked Data (1st Linked Data Meetup Malmö)
Linked Data (1st Linked Data Meetup Malmö)Linked Data (1st Linked Data Meetup Malmö)
Linked Data (1st Linked Data Meetup Malmö)
Anja Jentzsch
 

Similar to A distributed network of digital heritage information - Unesco/NDL India (20)

Session 1.4 a distributed network of heritage information
Session 1.4   a distributed network of heritage informationSession 1.4   a distributed network of heritage information
Session 1.4 a distributed network of heritage information
 
lodlam summit session browsable linked data
lodlam summit session browsable linked datalodlam summit session browsable linked data
lodlam summit session browsable linked data
 
20170501 Distributed Network of Digital Heritage Information
20170501  Distributed Network of Digital Heritage Information20170501  Distributed Network of Digital Heritage Information
20170501 Distributed Network of Digital Heritage Information
 
A new approach to aggregation
A new approach to aggregation A new approach to aggregation
A new approach to aggregation
 
20191210 NDLI KEDL2019 Building the dutch digital heritage network
20191210 NDLI KEDL2019 Building the dutch digital heritage network20191210 NDLI KEDL2019 Building the dutch digital heritage network
20191210 NDLI KEDL2019 Building the dutch digital heritage network
 
Optimising Scientific Knowledge Transfer: How Collective Sensemaking Can Ena...
Optimising Scientific Knowledge Transfer: How Collective Sensemaking Can Ena...Optimising Scientific Knowledge Transfer: How Collective Sensemaking Can Ena...
Optimising Scientific Knowledge Transfer: How Collective Sensemaking Can Ena...
 
Aggregation as Tactic
Aggregation as TacticAggregation as Tactic
Aggregation as Tactic
 
Connecting the Dots: Linking Digitized Collections Across Metadata Silos
Connecting the Dots: Linking Digitized Collections Across Metadata SilosConnecting the Dots: Linking Digitized Collections Across Metadata Silos
Connecting the Dots: Linking Digitized Collections Across Metadata Silos
 
Toward universal information access on the digital object cloud
Toward universal information access on the digital object cloudToward universal information access on the digital object cloud
Toward universal information access on the digital object cloud
 
Linked Data (1st Linked Data Meetup Malmö)
Linked Data (1st Linked Data Meetup Malmö)Linked Data (1st Linked Data Meetup Malmö)
Linked Data (1st Linked Data Meetup Malmö)
 
C N I20080404
C N I20080404C N I20080404
C N I20080404
 
Torsten Reimer
Torsten ReimerTorsten Reimer
Torsten Reimer
 
Towards long-term preservation of linked data - the PRELIDA project
Towards long-term preservation of linked data - the PRELIDA projectTowards long-term preservation of linked data - the PRELIDA project
Towards long-term preservation of linked data - the PRELIDA project
 
HIBERLINK: Reference Rot and Linked Data: Threat and Remedy
HIBERLINK: Reference Rot and Linked Data: Threat and RemedyHIBERLINK: Reference Rot and Linked Data: Threat and Remedy
HIBERLINK: Reference Rot and Linked Data: Threat and Remedy
 
Linked Data Workshop Stanford University
Linked Data Workshop Stanford University Linked Data Workshop Stanford University
Linked Data Workshop Stanford University
 
Linked (Open) Data
Linked (Open) DataLinked (Open) Data
Linked (Open) Data
 
Research into Practice case study 2: Library linked data implementations an...
	Research into Practice case study 2:  Library linked data implementations an...	Research into Practice case study 2:  Library linked data implementations an...
Research into Practice case study 2: Library linked data implementations an...
 
from local/regional OER Silos towards an OER Global Dataspace
from local/regional OER Silos towards an OER Global Dataspacefrom local/regional OER Silos towards an OER Global Dataspace
from local/regional OER Silos towards an OER Global Dataspace
 
2010 EGITF Amsterdam - Gap between GRID and Humanities
2010 EGITF Amsterdam - Gap between GRID and Humanities2010 EGITF Amsterdam - Gap between GRID and Humanities
2010 EGITF Amsterdam - Gap between GRID and Humanities
 
Open Data Masterclass - Europeana and LOD
Open Data Masterclass - Europeana and LODOpen Data Masterclass - Europeana and LOD
Open Data Masterclass - Europeana and LOD
 

More from Enno Meijers

Workshop linked open data en nbc+ 16 juni 2015
Workshop linked open data en nbc+ 16 juni 2015Workshop linked open data en nbc+ 16 juni 2015
Workshop linked open data en nbc+ 16 juni 2015
Enno Meijers
 

More from Enno Meijers (12)

Building library networks with linked data
Building library networks with linked dataBuilding library networks with linked data
Building library networks with linked data
 
Linked Open Netwerken
Linked Open NetwerkenLinked Open Netwerken
Linked Open Netwerken
 
20170407 Bruikbaar Erfgoed - Week van het Digitaal Erfgoed
20170407 Bruikbaar Erfgoed - Week van het Digitaal Erfgoed20170407 Bruikbaar Erfgoed - Week van het Digitaal Erfgoed
20170407 Bruikbaar Erfgoed - Week van het Digitaal Erfgoed
 
Presentatie PCDB overleg Utrecht 28 juni 2016
Presentatie PCDB overleg Utrecht 28 juni 2016Presentatie PCDB overleg Utrecht 28 juni 2016
Presentatie PCDB overleg Utrecht 28 juni 2016
 
Slimmer werken met metadata COPE 25 mei 2016
Slimmer werken met metadata COPE 25 mei 2016Slimmer werken met metadata COPE 25 mei 2016
Slimmer werken met metadata COPE 25 mei 2016
 
Verbonden & Gevonden
Verbonden & Gevonden Verbonden & Gevonden
Verbonden & Gevonden
 
Developing a national digital library stapel - meijers 20160302
Developing a national digital library   stapel - meijers 20160302Developing a national digital library   stapel - meijers 20160302
Developing a national digital library stapel - meijers 20160302
 
Betere toegang tot onze informatie
Betere toegang tot onze informatieBetere toegang tot onze informatie
Betere toegang tot onze informatie
 
Workshop linked open data en nbc+ 16 juni 2015
Workshop linked open data en nbc+ 16 juni 2015Workshop linked open data en nbc+ 16 juni 2015
Workshop linked open data en nbc+ 16 juni 2015
 
Workshop linked open data en nbc+ 16 juni 2015
Workshop linked open data en nbc+ 16 juni 2015Workshop linked open data en nbc+ 16 juni 2015
Workshop linked open data en nbc+ 16 juni 2015
 
Nbc+ deel 2 informatiedag lokale collecties
Nbc+ deel 2   informatiedag lokale collectiesNbc+ deel 2   informatiedag lokale collecties
Nbc+ deel 2 informatiedag lokale collecties
 
Presentatie nl.dbpedia.org Datasalon 8 Gent 24 Februari 2012
Presentatie nl.dbpedia.org Datasalon 8 Gent 24 Februari 2012Presentatie nl.dbpedia.org Datasalon 8 Gent 24 Februari 2012
Presentatie nl.dbpedia.org Datasalon 8 Gent 24 Februari 2012
 

Recently uploaded

Recently uploaded (20)

2024: The FAR, Federal Acquisition Regulations, Part 30
2024: The FAR, Federal Acquisition Regulations, Part 302024: The FAR, Federal Acquisition Regulations, Part 30
2024: The FAR, Federal Acquisition Regulations, Part 30
 
Antisemitism Awareness Act: pénaliser la critique de l'Etat d'Israël
Antisemitism Awareness Act: pénaliser la critique de l'Etat d'IsraëlAntisemitism Awareness Act: pénaliser la critique de l'Etat d'Israël
Antisemitism Awareness Act: pénaliser la critique de l'Etat d'Israël
 
The Most Attractive Pune Call Girls Handewadi Road 8250192130 Will You Miss T...
The Most Attractive Pune Call Girls Handewadi Road 8250192130 Will You Miss T...The Most Attractive Pune Call Girls Handewadi Road 8250192130 Will You Miss T...
The Most Attractive Pune Call Girls Handewadi Road 8250192130 Will You Miss T...
 
WORLD DEVELOPMENT REPORT 2024 - Economic Growth in Middle-Income Countries.
WORLD DEVELOPMENT REPORT 2024 - Economic Growth in Middle-Income Countries.WORLD DEVELOPMENT REPORT 2024 - Economic Growth in Middle-Income Countries.
WORLD DEVELOPMENT REPORT 2024 - Economic Growth in Middle-Income Countries.
 
Financing strategies for adaptation. Presentation for CANCC
Financing strategies for adaptation. Presentation for CANCCFinancing strategies for adaptation. Presentation for CANCC
Financing strategies for adaptation. Presentation for CANCC
 
Call Girls Chakan Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Chakan Call Me 7737669865 Budget Friendly No Advance BookingCall Girls Chakan Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Chakan Call Me 7737669865 Budget Friendly No Advance Booking
 
Election 2024 Presiding Duty Keypoints_01.pdf
Election 2024 Presiding Duty Keypoints_01.pdfElection 2024 Presiding Duty Keypoints_01.pdf
Election 2024 Presiding Duty Keypoints_01.pdf
 
World Press Freedom Day 2024; May 3rd - Poster
World Press Freedom Day 2024; May 3rd - PosterWorld Press Freedom Day 2024; May 3rd - Poster
World Press Freedom Day 2024; May 3rd - Poster
 
(NEHA) Call Girls Nagpur Call Now 8250077686 Nagpur Escorts 24x7
(NEHA) Call Girls Nagpur Call Now 8250077686 Nagpur Escorts 24x7(NEHA) Call Girls Nagpur Call Now 8250077686 Nagpur Escorts 24x7
(NEHA) Call Girls Nagpur Call Now 8250077686 Nagpur Escorts 24x7
 
Top Rated Pune Call Girls Bhosari ⟟ 6297143586 ⟟ Call Me For Genuine Sex Ser...
Top Rated  Pune Call Girls Bhosari ⟟ 6297143586 ⟟ Call Me For Genuine Sex Ser...Top Rated  Pune Call Girls Bhosari ⟟ 6297143586 ⟟ Call Me For Genuine Sex Ser...
Top Rated Pune Call Girls Bhosari ⟟ 6297143586 ⟟ Call Me For Genuine Sex Ser...
 
PPT BIJNOR COUNTING Counting of Votes on ETPBs (FOR SERVICE ELECTORS
PPT BIJNOR COUNTING Counting of Votes on ETPBs (FOR SERVICE ELECTORSPPT BIJNOR COUNTING Counting of Votes on ETPBs (FOR SERVICE ELECTORS
PPT BIJNOR COUNTING Counting of Votes on ETPBs (FOR SERVICE ELECTORS
 
Item # 4 - 231 Encino Ave (Significance Only).pdf
Item # 4 - 231 Encino Ave (Significance Only).pdfItem # 4 - 231 Encino Ave (Significance Only).pdf
Item # 4 - 231 Encino Ave (Significance Only).pdf
 
↑VVIP celebrity ( Pune ) Serampore Call Girls 8250192130 unlimited shot and a...
↑VVIP celebrity ( Pune ) Serampore Call Girls 8250192130 unlimited shot and a...↑VVIP celebrity ( Pune ) Serampore Call Girls 8250192130 unlimited shot and a...
↑VVIP celebrity ( Pune ) Serampore Call Girls 8250192130 unlimited shot and a...
 
Pimpri Chinchwad ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi R...
Pimpri Chinchwad ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi R...Pimpri Chinchwad ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi R...
Pimpri Chinchwad ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi R...
 
celebrity 💋 Agra Escorts Just Dail 8250092165 service available anytime 24 hour
celebrity 💋 Agra Escorts Just Dail 8250092165 service available anytime 24 hourcelebrity 💋 Agra Escorts Just Dail 8250092165 service available anytime 24 hour
celebrity 💋 Agra Escorts Just Dail 8250092165 service available anytime 24 hour
 
Call On 6297143586 Viman Nagar Call Girls In All Pune 24/7 Provide Call With...
Call On 6297143586  Viman Nagar Call Girls In All Pune 24/7 Provide Call With...Call On 6297143586  Viman Nagar Call Girls In All Pune 24/7 Provide Call With...
Call On 6297143586 Viman Nagar Call Girls In All Pune 24/7 Provide Call With...
 
Finance strategies for adaptation. Presentation for CANCC
Finance strategies for adaptation. Presentation for CANCCFinance strategies for adaptation. Presentation for CANCC
Finance strategies for adaptation. Presentation for CANCC
 
Postal Ballots-For home voting step by step process 2024.pptx
Postal Ballots-For home voting step by step process 2024.pptxPostal Ballots-For home voting step by step process 2024.pptx
Postal Ballots-For home voting step by step process 2024.pptx
 
Coastal Protection Measures in Hulhumale'
Coastal Protection Measures in Hulhumale'Coastal Protection Measures in Hulhumale'
Coastal Protection Measures in Hulhumale'
 
2024 Zoom Reinstein Legacy Asbestos Webinar
2024 Zoom Reinstein Legacy Asbestos Webinar2024 Zoom Reinstein Legacy Asbestos Webinar
2024 Zoom Reinstein Legacy Asbestos Webinar
 

A distributed network of digital heritage information - Unesco/NDL India

  • 1. A distributed network of digital heritage information UNESCO-NDL INDIA INTERNATIONAL WORKSHOP ON KNOWLEDGE ENGINEERING FOR DIGITAL LIBRARY DESIGN Enno Meijers / 25 October 2017 / New Delhi - India
  • 3. Contents 1. Introduction to Dutch Digital Heritage Network 2. Evaluating our current digital heritage infrastructure 3. Strategies for improvement 4. Building a distributed network for digital heritage information
  • 4. 1. Introduction to Dutch Digital Heritage Network
  • 5.
  • 6.
  • 7.
  • 8.
  • 9.
  • 10. The Digital Heritage Network (NDE) aims at increasing the social value of the heritage information maintained by libraries, archives, museums and other cultural heritage institutions. This strategy offers a perspective on developing a national, cross-sector infrastructure of digital heritage facilities. It focuses on long term cooperation between the government and the institutions on national, regional and local level. It is about organizing the network of people and information! National Digital Heritage strategic plan (2015)
  • 11. Available metadata and collections online for general use (Europe) Source: Enumerate Survey 2017
  • 12.
  • 13. Thinking from the user’s perspective Thinking from the user’s perspective also means seeking out the digital platforms and work environments where potential users can already be found. The attractiveness of information to a certain user group is not determined only by the nature of the information, but also by the method and location through which that information is offered.
  • 14. 2. Evaluating current infrastructure
  • 15. General setup of digital heritage portals Heritage information consisting of GLAM datasets and science collections
  • 16. Evaluating current approach (1) Positive results so far: • many sources available through OAI-PMH protocol • powerful and smart protocol for metadata synchronization • opened up data silos • created the need for aligning data models • made cross-collection and cross-domain discovery possible (e.g. Europeana)
  • 17. Evaluating current approach (2) But there are two main problems areas: • poor semantic alignment • inefficient data integration
  • 18. Problem #1: Poor semantic alignment Infrastructure: • no sustainable identifiers (URIs) for objects • use of strings instead of URIs for terms (who, what, where, when,..) • no shared terminology sources available • no provisions for linking to external references Publishing: • implementations lack support for multiple data models • at best data is ‘flattened’ to a common data model (EDM, Dublin Core) • loss of meaning due to transformation
  • 19. Problem #2: Inefficient data integration Aggregation is based on physical data integration • Dutch cultural heritage sector: 1500 institutions, >>1500 collections • integration model is based on copying data (OAI-PMH) • synchronization with the data source needs permanent attention • ownership, licensing, provenance, control over access are difficult topics • no feedback loop to the data source (usage, cleaning, enrichments) • large distance between data source owner and user of the data • centralized model leads to scalability problems See also: Miel Vander Sande et al. , Towards sustainable publishing and querying of distributed Linked Data archives - Journal of Documentation (2017) Herbert Van de Sompel - Reminiscing About 15 Years of Interoperability Efforts - D-lib Magazine - December (2015)
  • 21. In general - discovery of the “deep web” • Institutional repositories, collection management systems • Millions of ‘invisible’ datasets: publications, research data, heritage collections • Poor coverage by regular search engines • Metadata is key, describing physical materials or (licensed) digital content • Demand for cross-institutional, cross-domain discovery • Many specialized portals giving access to different views
  • 22. 3. Strategies for improvement
  • 23. The Digital Heritage Network is developing a three-layered approach for improving the sustainability, the usability and the visibility of digital heritage information. sustainable usable visible
  • 24. The Digital Heritage Network is developing a three-layered approach for improving the sustainability, the usability and the visibility of digital heritage information. sustainable usable visible
  • 25. • build service portals as views based on a common data layer • minimize the intermediate layers • support decentralized discovery • refer to the source instead of copying • maximize the usability of data at the source • develop a sustainable, ‘web-centric’ solution • use HTTP, RDF and RESTful APIs as building blocks => implement the Linked Data principals Inspired by the work of Ruben Verborgh, Herbert Van de Sompel and colleagues: See for example: Miel Vander Sande et al. , Towards sustainable publishing and querying of distributed Linked Data archives - Journal of Documentation (2017) Design principles for a discovery infrastructure
  • 26. At the data source level: • use sustainable URIs to identify the resources • use formal definitions for persons, places, concepts, events • use domain data models to describe the data • add support for cross-domain discovery (Europeana Data Model, Schema.org,...) • publish the collection information as Linked Data => Work with the IT suppliers as strategic partners for the implementation! Implementing Linked Data principles (1)
  • 27. At the network level: • create a ‘network of terms’ for shared terminology • provide tools for alignment and linking • create alignments and links between different terminology sources • provide easy access to shared terminology for collection management systems (API) => Provide open and cross-domain solutions at the network level! Implementing Linked Data principles (2)
  • 29.
  • 30. Ok, but how will our Linked Data be found?
  • 31. The Linked Open Data cloud…
  • 32. The Semantic Web is still a dream… #1  So discovery of Linked Data requires registering datasets?!
  • 33. A tiny example...suppose a resource is defined as: museum_X:object1 a nde:painting ; dct:subject aat:windmill . The Semantic Web is still a dream… #2 “Windmill”
  • 34. A tiny example...suppose a resource is defined as: museum_X:object1 a nde:painting ; dct:subject aat:windmill . For ‘browsable Linked Data’ you should(!) add the inverse relation [1],[2]: aat:windmill a skos:Concept ; skos:prefLabel “Windmill“@en ; dct:isSubjectOf museum_X:object1 . The Semantic Web is still a dream… #2 [1]: Tim Berner’s Lee on ‘browsable linked data’ (2006) [2]: Tom Heath and Christian Bizer on ‘Incoming Links’ (2011) “Windmill” “Windmill”
  • 35. A tiny example...suppose a resource is defined as: museum_X:object1 a nde:painting ; dct:subject aat:windmill . For ‘browsable Linked Data’ you should(!) add the inverse relation [1],[2]: aat:windmill a skos:Concept ; skos:prefLabel “Windmill“@en ; dct:isSubjectOf museum_X:object1 . => a Linked Data integration problem, the lack of “backlinks” The Semantic Web is still a dream… #2 [1]: Tim Berner’s Lee on ‘browsable linked data’ (2006) [2]: Tom Heath and Christian Bizer on ‘Incoming Links’ (2011) “Windmill” “Windmill”
  • 36. Some theory on Linked Data integration Possible approaches to make Linked Data work: 1. Semantic integration 2. Physical integration 3. Virtual integration – standard approach 4. Virtual integration – using Linked Data Fragments
  • 37. Actions: • implement schema.org • let search engines ‘infer’ the relations • query the search engines Outcome: • is the data interesting enough for Google? • what about special thematic or regional views? • can we reuse the results of the integration? (NO!) 1. Semantic integration only
  • 38. Actions: • aggregate all the related Linked Data sources • build large triplestore and infer the relations • query the aggregated data Outcome: • approach still based on copying • same problems as traditional aggregation! 2. Physical integration of linked data
  • 39.
  • 40. Actions: • publish Linked Data through triplestore with SPARQL endpoint • build a central query engine to integrate the results Outcome: • implementing a triplestore is hard for small data providers • federated querying over multiple triplestores performs poorly 3a. Virtual integration - standard approach
  • 41. Actions: • publish Linked Data using Linked Data Fragments (LDF) technology • build a central LDF based query engine to integrate the results Outcome: • easy implementation for small data providers • federated querying is supported • more difficult to process the result • possible support for time-based versions (Memento) See also: Miel Vander Sande et al. , (2017) Towards sustainable publishing and querying of distributed Linked Data archives - Journal of Documentation 3b. Virtual integration - using LDF
  • 42. Problem: • query many data sources at the same time is not realistic… Solution: • build a Knowledge Base with backlinks to support the discovery process • select relevant sources for querying based on the Knowledge Base See also: Miel Vander Sande et al. (2016) Hypermedia-Based Discovery for Source Selection Using Low-Cost Linked Data Interfaces (IJSWIS) 12(3) 79–110 More advanced: data source profiling or dataset summaries But federation needs selection of sources…
  • 43. 4. Building the distributed network of Dutch Digital Heritage information
  • 44. Strategy for our distributed network 1. build a service for shared entities for Dutch digital heritage 2. improve the usability of the data source: - align object descriptions with shared terminology - publish data as Linked Data semantic alignment
  • 45. Strategy for our distributed network 1. build a service for shared entities for Dutch digital heritage 2. improve the usability of the data source: - align object descriptions with shared terminology - publish data as Linked Data 3. build a discovery infrastructure: - register organizations and datasets in a registry - build knowledge graph to support discovery (“backlinks”) 4. implement virtual data integration technology : - use registry and knowledge graph for selecting the resources - support federated querying (or selective aggregation) semantic alignment data integration
  • 47. Roadmap (1) Phase 1 – functional design / developing partnerships: • design of supporting cross-domain functionality • develop partnerships with IT suppliers and specialists • develop domain and cross domain strategies Phase 2 – enrich the current (OAI-PMH based) infrastructure: • build a registry for organizations and datasets • build a network of terms to provide shared terminology for discovery • upgrade object descriptions with formal definitions (URIs)
  • 48. Roadmap (2) Phase 3: implement Linked Data technology at the network level • make aggregators Linked Data compliant • build a knowledge graph with backlinks for discovery • support federated querying (or selective harvesting) Phase 4: realize the distributed network of heritage information • make collection management systems Linked Data compliant • transform aggregators to service portals for discovery
  • 49. Thank you for your attention! please share your thoughts with us... email: enno.meijers at kb.nl twitter, slideshare: ennomeijers https://github.com/netwerk-digitaal-erfgoed