Digital Manuscripts Without Borders
A Discovery Platform of Manuscripts and Rare Books
Régis ROBINEAU
@regisrob
2019 IIIF Conference
University of Göttingen (June 26th, 2019)
@biblissima
Biblissima IIIF-Collections platform
➔ cross-collections search and discovery prototype for
manuscripts and rare books (prior to 1800 only)
➔ aggregates metadata from 10 IIIF digital libraries
➔ complementary to the main Biblissima portal at
biblissima.fr
Current scope: 10 datasets (June 2019)
◆ Gallica (BnF)
◆ Digital.Bodleian (Oxford)
◆ BVMM
◆ e-codices
◆ British Library
(BL-BnF Polonsky project)
◆ Europeana Regia
◆ Parker Library On the Web
◆ Bibliothèque Mazarine
◆ Ghent UL
◆ Cambridge UL
◆ Durham University and
Cathedral Library
~65,000 IIIF Manifests
Targeted IIIF datasets
➔ 57 manuscript repositories identified in the IIIF
world (mostly from the UK, France, the USA and Germany)
frama.link/iiif-mss-registry
➔ A few observations on:
◆ open licences
◆ use of the seeAlso property
◆ metadata schemas linked from the seeAlso
◆ presence of a crawlable IIIF endpoint
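To illustrate why the seeAlso property needs care when harvesting, here is a minimal Python sketch that normalises the shapes a seeAlso value can take in a IIIF Presentation 2.x Manifest: a bare URI string, a single object, or a list of either. The function name and the sample Manifest are hypothetical and are not taken from the Biblissima code base.

```python
def normalise_see_also(manifest):
    """Return a list of {"id", "format", "profile"} dicts from a Manifest's seeAlso."""
    raw = manifest.get("seeAlso", [])
    if not isinstance(raw, list):
        raw = [raw]  # a single string or object becomes a one-item list
    links = []
    for entry in raw:
        if isinstance(entry, str):
            # Bare URI: no format or profile is declared
            links.append({"id": entry, "format": None, "profile": None})
        elif isinstance(entry, dict):
            uri = entry.get("@id") or entry.get("id")
            if uri:
                links.append({
                    "id": uri,
                    "format": entry.get("format"),
                    "profile": entry.get("profile"),
                })
    return links


# Hypothetical Manifest fragment, for illustration only
sample = {
    "@id": "https://example.org/iiif/ms-1/manifest",
    "seeAlso": [
        "https://example.org/records/ms-1.xml",
        {"@id": "https://example.org/records/ms-1.rdf",
         "format": "application/rdf+xml"},
    ],
}
links = normalise_see_also(sample)
```

Keeping `format` and `profile` next to each URI makes it possible to count which metadata schemas a repository links to, which is the kind of observation listed above.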
Licences
Use of the seeAlso property
Metadata schemas (from seeAlso property)
+ different flavours of RDF (mix of
vocabularies / various serialization
formats)
Crawlable IIIF endpoint?
some of them have an
OAI-PMH endpoint, as an
alternative...
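What "crawlable" can mean in practice is sketched below, assuming a repository exposes a top-level IIIF Presentation 2.x Collection that references sub-collections and Manifests. The `fetch` callable, the function name and the in-memory documents are hypothetical stand-ins for real HTTP requests.

```python
def collect_manifest_uris(collection_uri, fetch, seen=None):
    """Walk a IIIF Presentation 2.x Collection tree and return Manifest URIs."""
    seen = set() if seen is None else seen
    if collection_uri in seen:  # guard against collections that reference each other
        return []
    seen.add(collection_uri)
    doc = fetch(collection_uri)
    children = (doc.get("manifests", [])
                + doc.get("collections", [])
                + doc.get("members", []))
    uris = []
    for child in children:
        if child.get("@type") == "sc:Manifest":
            uris.append(child["@id"])
        elif child.get("@type") == "sc:Collection":
            uris.extend(collect_manifest_uris(child["@id"], fetch, seen))
    return uris


# In-memory stand-in for the web, so the sketch runs without network access
FAKE_WEB = {
    "https://example.org/iiif/top": {
        "@type": "sc:Collection",
        "collections": [{"@id": "https://example.org/iiif/sub",
                         "@type": "sc:Collection"}],
        "manifests": [{"@id": "https://example.org/iiif/ms-1/manifest",
                       "@type": "sc:Manifest"}],
    },
    "https://example.org/iiif/sub": {
        "@type": "sc:Collection",
        "manifests": [{"@id": "https://example.org/iiif/ms-2/manifest",
                       "@type": "sc:Manifest"}],
    },
}
manifest_uris = collect_manifest_uris("https://example.org/iiif/top",
                                      FAKE_WEB.__getitem__)
```

A repository without such an entry point has to be harvested another way, for instance through OAI-PMH as noted above.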
Workflow
1. Crawl and harvest the repositories’ endpoints
2. Extract and process the data:
◆ filter by date, normalise the date formats
◆ normalise strings of languages and roles
◆ reconcile and cluster named entities (agents, places…)
3. Transform and ingest the data into ElasticSearch
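The date step of the workflow above could look roughly like the following Python sketch. It assumes that dates arrive as free-text strings containing three- or four-digit years, and that a record is in scope when its earliest year falls before 1800. Both the heuristic and the function names are illustrative assumptions, not the platform's actual rules.

```python
import re

# A run of exactly 3 or 4 digits, not embedded in a longer number
YEAR = re.compile(r"(?<!\d)\d{3,4}(?!\d)")
CUTOFF = 1800  # the platform covers material prior to 1800 only


def normalise_date(text):
    """Return (earliest_year, latest_year) found in a date string, or None."""
    years = [int(y) for y in YEAR.findall(text or "")]
    if not years:
        return None
    return (min(years), max(years))


def is_in_scope(text, cutoff=CUTOFF):
    """Keep a record when its earliest year is before the cutoff (an assumption)."""
    span = normalise_date(text)
    return span is not None and span[0] < cutoff
```

Strings without an explicit year, such as "12th century", return `None` here and would need a separate rule in a real pipeline.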
Demo
iiif.biblissima.fr/collections
Advanced search parameters:
- exact search
- wildcards
- fuzziness
- inclusion/exclusion
- boosting
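The parameters listed above correspond to operators of the Lucene query string syntax supported by the Elasticsearch `query_string` query. The Python sketch below shows one example per operator and wraps a query string into a request body. That the platform forwards user input to a `query_string` query is an assumption, and the field name `title` and the search terms are hypothetical.

```python
import json

# One illustrative query string per search feature (terms are made up)
EXAMPLES = {
    "exact search": '"Roman de la Rose"',        # phrase in double quotes
    "wildcards": "psalt*",                        # * matches any characters, ? exactly one
    "fuzziness": "Augustinus~1",                  # ~N allows up to N edits
    "inclusion/exclusion": "+bible -psalter",     # + must match, - must not match
    "boosting": "Cicero^2 Seneca",                # ^N weights a term more heavily
}


def to_query_body(query, default_field="title"):
    """Wrap a query string in an Elasticsearch query_string request body."""
    return {"query": {"query_string": {"query": query,
                                       "default_field": default_field}}}


body = to_query_body(EXAMPLES["inclusion/exclusion"])
body_json = json.dumps(body)
```

The resulting `body_json` is what a client would send to a `_search` endpoint.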
Filter by library
Filter by language
Filter by date
Further info on this author in the
Biblissima portal
Further info on this manuscript
in the Biblissima portal
URI of the entity in the Biblissima
authority file:
https://data.biblissima.fr/entity/Q32764
Biblissima Authority data
➔ hub to manage and share our authority data:
◆ wiki-based technology (Wikibase): natively collaborative + versioning
◆ handles URI identifiers
◆ natively produces RDF
◆ user-friendly forms to edit entries
◆ remote access for machines: Web API + SPARQL endpoint
data.biblissima.fr
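As a hedged illustration of the machine access mentioned above, the Python sketch below extracts the identifier from an entity URI and builds a request URL for `wbgetentities`, a standard action of the Wikibase Web API. The API base URL is a placeholder: the actual path of the API on data.biblissima.fr is not given in the slides, and the function names are hypothetical.

```python
from urllib.parse import urlencode, urlparse

# Placeholder only: the real API path on data.biblissima.fr is not stated here
API_BASE = "https://wikibase.example.org/w/api.php"


def entity_id_from_uri(uri):
    """Return the last path segment of an entity URI, e.g. 'Q32764'."""
    return urlparse(uri).path.rstrip("/").rsplit("/", 1)[-1]


def wbgetentities_url(api_base, entity_id, languages=("en", "fr")):
    """Build a Wikibase Web API URL that requests one entity as JSON."""
    params = {
        "action": "wbgetentities",
        "ids": entity_id,
        "languages": "|".join(languages),
        "format": "json",
    }
    return api_base + "?" + urlencode(params)


qid = entity_id_from_uri("https://data.biblissima.fr/entity/Q32764")
url = wbgetentities_url(API_BASE, qid)
```

The same identifier can be used in a SPARQL query against the endpoint, once its address is known.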
Biblissima Authority data
Publication spread over 2019:
✓ Persons (done, March 2019) ~26,500 entities
✓ Geographical names (done, April 2019) ~5,500 entities
🚧 Organisations (in progress)
🚧 Shelfmarks of manuscripts and early printed books
➔ Textual works
➔ Iconographic descriptors
Feedback to data providers
➔ on the IIIF Manifests, on the metadata encodings…
➔ on the metadata itself (new identifications, corrections…)
➔ provision of data alignments/enrichments (URIs...)
Biblissima IIIF-Collections:
iiif.biblissima.fr/collections
Biblissima authority data:
data.biblissima.fr
Thank you!
Régis ROBINEAU
(Biblissima - Campus Condorcet / EPHE-PSL, Paris)
Biblissima Team:
Kévin BOIS
Eduard FRUNZEANU
biblissima.fr
