Presentation by Régis Robineau at the Campus Condorcet workshop “Référentiels géo-historiques sémantisés pour les humanités” (École nationale des chartes, 14 May 2019)
Da Biblissima a Biblissima+ : per un osservatorio delle culture scritte | Equipex Biblissima
Presentation by Anne-Marie Turcan-Verkerk at the LIBER Sixth Summer School on Trends in Manuscript Studies, University of Cassino (13 September 2021)
Digital Manuscripts Without Borders: A Discovery Platform of Manuscripts and ... | Equipex Biblissima
This document summarizes a presentation about Biblissima IIIF-Collections, a discovery platform for digital manuscripts and rare books. It aggregates metadata from 10 digital libraries containing around 65,000 IIIF manifests. The platform allows for cross-collection searching and filtering by library, language, and date. It also links to additional information on authors, manuscripts, and entities in the Biblissima authority database. The presentation outlines the workflow used to harvest, process, and ingest metadata into Elasticsearch for search. It also discusses feedback provided to data providers and future plans to expand the authority database.
Talk by Stefanie Gehrke at the "Digital Humanities 2016" conference (Kraków, 13 July 2016)
Abstract of the talk: http://dh2016.adho.org/abstracts/352
This document discusses Biblissima, a project that aims to interconnect data about medieval manuscripts from various French libraries and research institutions on the semantic web. It describes Biblissima's data, which includes information on manuscripts, texts, people, places, and more from over 40 databases. The challenges of integrating this heterogeneous data are discussed. Biblissima addresses these challenges through data alignment, cleaning, and publishing the data as RDF linked data using vocabularies like FRBRoo. This allows the data to be interlinked, enriched, and shared to increase visibility and usability for both humans and machines.
Biblissima's prototype on Medieval Manuscripts Illuminations and their Context | Equipex Biblissima
Presentation at the "SW4SH" workshop, held as part of ESWC 2015 (Portorož, Slovenia, 1 June 2015), by Stefanie Gehrke (Biblissima metadata coordinator), Eduard Frunzeanu, Pauline Charbonnier and Marie Muffat.
Widening the limits of cognitive reception with online digital library graph ... | Marton Nemeth
This document discusses using semantic web technologies like linked data and RDF to improve information retrieval from digital library collections. It provides examples of semantic implementations at libraries like Europeana, the French National Library, and the German National Library. Key points covered include linking diverse data sources to facilitate discovery, creating semantic search interfaces, and addressing challenges of referencing vocabularies and evaluating semantic datasets and user experiences. The research plan proposes comparing new semantic OPACs to traditional interfaces and developing a methodology for evaluating the user experience of semantic library systems.
Maurer Presentation - WARCnet Spring Meeting 2021 | WARCnet
This document summarizes a meeting about comparing entire web domains that are archived in national web archives. It discusses the need to analyze national web collections with low common denominator data while still providing rich information. It presents a file format called a CDX summary file that provides aggregated statistics about file types, sizes, and protocol usage for domains over time. Examples are given demonstrating average file sizes, domain name frequencies, and overlap between different archive sources. Limitations of the current approach are also outlined.
Linked Digital Collection "Rainis and Aspazija" | Uldis Bojars
This document describes a Linked Digital Collection project focused on Rainis and Aspazija, two influential Latvian poets. The collection contains their correspondence, literary works, and other materials annotated with links to related entities. Entities mentioned in the annotations are represented as pages in the collection and linked to from the annotated objects. The objects and entities are also expressed as RDF and linked to external data sources, creating a network of interlinked cultural heritage objects and metadata.
VIII Encuentros de Centros de Documentación de Arte Contemporáneo en Artium -... | Artium Vitoria
"Crossing the boundaries of Arts and Sciences: Can Linked Data help Refactoring Natural Sciences?" by Gildas Illien, Chief Librarian, Muséum national d’histoire naturelle (National Natural History Museum Library), Paris.
VIII Encuentros de Centros de Documentación de Arte Contemporáneo en Artium -... | Artium Vitoria
"Publishing and Using Cultural Heritage Linked Data on the Semantic Web" by Eero Hyvönen, Aalto University, Finland.
This document summarizes The National Archives' migration from SQL Server databases to MongoDB. It discusses the organization's datasets and discovery service, how data was integrated and migrated to MongoDB, and the resulting MongoDB implementation and architecture. The migration involved creating information assets in MongoDB using an entity-attribute-value model and implementing a service-oriented architecture with replica sets for high availability. The new system allows users to search, browse, and view details of information assets through an API.
Semantic Tagging for old maps...and other things on the Web | Bernhard Haslhofer
This document discusses semantic tagging of historical maps and other digital objects on the web. It introduces Maphub, a platform for annotating and semantically tagging digitized historical maps. Maphub allows users to add comments, semantic tags linked to controlled vocabularies, and geo-references to maps. The presentation describes Maphub's features, how semantic tagging works in the system by linking visual tags to unique URIs, and plans to implement semantic tagging plugins for other annotation tools and support additional knowledge organization systems. The benefits of semantic tagging are collecting structured metadata and connecting maps and annotations to other web resources through semantic links.
A researcher driven data description for the archived web: Why and how? | WARCnet
This document discusses developing a standardized vocabulary for describing data extracted from archived web collections. It proposes a researcher-driven approach to identify needed descriptors, map existing standards, and discuss how standards match researcher needs and web archive practices. A rough idea is presented where each descriptor name and description could be mapped to existing vocabularies. The goal is to improve interoperability when researchers request extractions from different web archives.
The document summarizes a datathon conducted using various COVID-19 datasets from different European web archives. The goals were to 1) create a sandbox for exploring the data, 2) conduct initial analysis to see what could be achieved, and 3) document the process. Different institutions provided different types of datasets, including seedlists, tweets, and derived datasets. Challenges included restrictions on sharing raw data and representing large collections. Preliminary analysis identified potential research questions and ways to study web archives, collections, and the pandemic response.
Wikidata is a free and open knowledge base that can be edited by anyone to store structured data about things like people, places, events and concepts. It is maintained by the Wikimedia Foundation and powered by Wikibase, an open source MediaWiki extension. Data in Wikidata is stored as statements about entities, with values that can include text, coordinates, quantities and more. This structured data is used and linked by Wikipedia, other Wikimedia projects and external websites and applications.
(Big) bibliographic data @ ScaDS project meeting - 2015-06-12 | Felix Lohmeier
The document discusses big bibliographic data from UB Leipzig and SLUB Dresden libraries. It notes that libraries are becoming data hubs and describes the libraries' metadata including resources like books, journals, and accessibility information. The libraries are working together on projects like finc and d:swarm to process and integrate metadata, link authority files, and discover resources through a unified search interface. Challenges include scaling the graph database d:swarm to handle large metadata volumes for data integration and enrichment.
Tuesday 5 May: The Shapes of Archives and Memory, Helle Strandgaard Jensen | WARCnet
This document outlines a research project that will examine how the online activities of national archives in Denmark and the UK have shaped archives and their role in public memory-making. The project will use global history theory to analyze the websites of the Danish National Archive and the National Archives UK from their earliest versions archived by the Wayback Machine and national web archives. The analysis will focus on aspects like connections to other sites, functions, size, topics, internal networks, audiences, and design. It will also consider the broader technological contexts. The goal is to understand how the archives' online presences have evolved and how web archives can shape both the history written about archives and archives' own histories on the web.
Digitised Manuscripts and the British Library's new IIIF viewer | Mia
The British Library's implementation of the IIIF-based Universal Viewer. Presentation for 'Digitised Hebrew Manuscripts: British Library and Beyond', London, November 2016
The Europeana Strategy and Linked Open Data | David Haskiya
The document discusses Europeana's strategy for 2015-2020 and how linked open data and related technologies will help realize it. Key points:
- Europeana's strategy is to transition from metadata to graphs and from strings to things by making data and APIs more linked and open.
- Linked open data allows data from different sources to be combined and helps make content more findable on search engines and in knowledge panels.
- Europeana labs provides APIs, tools, documentation and data to help partners publish linked open data that can be reused in the Europeana portal and other applications.
A lot of data references some kind of location whether it is a place name, street name, address, postcode or some kind of coordinate. Because of this it is becoming clear that location provides an important data integration hub on the linked data web. This talk gives an introduction to linked data, and will focus on challenges around constructing linked data for geographic and spatial information. Examples will focus on work being done at Ordnance Survey and the wider UK Government.
Talk by Stefanie Gehrke at the workshop "TEI and Neighbouring Standards" during the DiXiT Convention Week 2015 (Huygens ING, The Hague, 15 September 2015).
SWIB14 presentation
Over time, Knowledge Organization Systems such as thesauri and classifications undergo lots of changes, as the knowledge domains evolve. Most SKOS publishers therefore put a version tag on their vocabularies. With the vocabularies interwoven in the open web of data, however, different versions may be the base for references in other datasets. So, updates by "third parties" are required, in indexing data as well as in mappings from or to other vocabularies. Yet answers to simple user questions such as "What's new?" or "What has changed?" are not easily obtainable. Best practices and shared standards for communicating changes precisely and making them (machine-) actionable still have to emerge. STW Thesaurus for Economics currently is subject to a series of major revisions. In a case study we review the amount and the types of changes in this process, and demonstrate how versioning in general and difficult types of changes such as the abandonment of descriptors in particular are handled. Furthermore, a method to get a tight grip on the changes, based on SPARQL queries over named graphs, is presented. And finally, the skos-history activity is introduced, which aims at the development of an ontology/application profile and best practices to describe SKOS versions and changes.
Tuesday 5 May 2020: Contextualizing and engaging with Web domains, Valérie Sc... | WARCnet
The document discusses contextualizing studies of web domains through mixed quantitative and qualitative approaches. It proposes several methods for contextualization, including comparing archived and live web content, investigating the history of domain names in countries studied, and documenting the evolution of web archiving sources and efforts. The document also discusses engaging the public by addressing the usefulness of results for scholars, promoting findings to domain providers, and developing infographics, questions, and an interactive quiz to make the studies more appealing to a general audience.
Biblissima is a data facility that aims to federate digital libraries, structure research data and communities, train researchers, and facilitate access to and reuse of textual and documentary resources. It has over 50 partner projects involving libraries, archives, and universities in France, the UK, Canada, and the US. Biblissima develops tools like Collatinus and Eulexis for analyzing Latin and Greek texts. It also organizes summer courses for cultivating young researchers. The Biblissima portal aggregates data from over 10 sources to visualize manuscripts and books, with features for searching, browsing, and comparing resources using IIIF standards.
Les descripteurs des bases iconographiques Mandragore (BnF) et Initiale (IRHT... | Equipex Biblissima
Presentation by Eduard Frunzeanu and Régis Robineau at the Zoomathia workshop “Zoological and zoology-related Databases” (Muséum national d'histoire naturelle, Paris, 23 November 2018)
The National Library of France provides a public data service that includes metadata from over 13.8 million publications in its collections. Researchers and developers can access this data through various APIs and use it for projects like analyzing gender trends in historical fiction writing or building apps that allow users to search for concert recordings based on the musical works performed. The library aims to enhance these services over time by expanding the available datasets and APIs, as well as providing resources for data and code sharing to support digital humanities and other reuse of the cultural data.
New tasks, new roles: Libraries in the tension between Digital Humanities, Re... | Stefan Schmunk
This document summarizes Dr. Stefan Schmunk's presentation on new roles for libraries in relation to digital humanities, research data, and research infrastructures. The presentation discusses how digital humanities projects involving tasks like digital scholarly editions require new skills from libraries, such as expertise in XML encoding, long-term preservation of digital materials, and creation of virtual research environments. It also explores how libraries must adapt to help researchers with the growing importance of research data in the humanities by taking on roles like hosting data repositories, providing data management support and training, and building research data infrastructures.
Presented at the 2013 Dublin Core International Conference, held in Lisbon from 2 to 6 September, in which the Biblioteca Nacional de España (BNE) took part.
National Library of Finland - open source solutions in the development of nat... | Mindtrek
Track | The Future of Open Source Business
Kristiina Hormia-Poutanen, Service Director, National Library of Finland
Mindtrek Conference
3rd of October 2023.
Tampere, Finland
www.mindtrek.org
Cabriology The Bruges Aquabrowser Experience | Koen Calis
The document discusses the Bruges Public Library's "Cabrio" system which aims to integrate, advise on, and mobilize access to the library's resources in three ways:
1. It integrates the library's catalogue with resources from other libraries and databases to provide a more comprehensive search experience for users.
2. It provides advisory services such as personalized recommendations, subject guides, and enriched metadata to help users discover relevant resources.
3. It mobilizes access to the library's content by optimizing its website and blog, encouraging sharing on social networks, and planning to integrate review and list-making tools to better engage users outside of the library.
Publishing the British National Bibliography as Linked Open Data / Corine Del... | CIGScotland
Presented at Linked Open Data: current practice in libraries and archives (Cataloguing & Indexing Group in Scotland 3rd Linked Open Data Conference), Edinburgh, 18 Nov 2013
SAFETY NETS: RESCUE AND REVIVAL FOR ENDANGERED BORN-DIGITAL RECORDS- Program ... | Micah Altman
The web is now firmly established as the primary communication and publication platform for sharing and accessing social and cultural materials. This networked world has created both opportunities and pitfalls for libraries and archives in their mission to preserve and provide ongoing access to knowledge. How can the affordances of the web be leveraged to drastically extend the plurality of representation in the archive? What challenges are imposed by the intrinsic ephemerality and mutability of online information? What methodological reorientations are demanded by the scale and dynamism of machine-generated cultural artifacts? This talk will explore the interplay of the web, contemporary historical records, and the programs, technologies, and approaches by which libraries and archives are working to extend their mission to preserve and provide access to the evidence of human activity in a world distinguished by the ubiquity of born-digital materials.
Information Science Brown Bag talks, hosted by the Program on Information Science, consist of regular discussions and brainstorming sessions on all aspects of information science and uses of information science and technology to assess and solve institutional, social and research problems. These are informal talks. Discussions are often inspired by real-world problems being faced by the lead discussant.
New Discovery Tools for Digital Humanities and Spatial Data (Summary of the J... | Micah Altman
This document discusses new tools for digital humanities and spatial data. It describes how physical discovery of manuscripts led to new methods of transmission and preservation of information over time. Modern libraries are indexing resources through internal catalogs and digital objects. The text advocates for moving resources onto the semantic web using linked open data with RDF to better integrate geographic data and connect projects. The future of catalogs may involve direct access to digital resources through APIs, linked open data, and graph databases to allow deeper analysis of content and spatial indexing of metadata.
The document discusses the Harvard University Library's efforts to provide library resources and data through application programming interfaces (APIs) to enable access, reuse and innovation. It outlines several APIs that provide access to metadata, digital content, holdings and availability data. The goals are to support openness, interoperability and reuse of library data through standards-based web services APIs. Examples of apps built using these APIs are provided.
Keynote presentation for CSWS 2013 Conference in Shanghai, China.
Some slides borrowed from Jan Wielemaker, Guus Schreiber, Jacco van Ossenbruggen, Niels Ockeloen, Antske Fokkens, Serge ter Braake.
English (updated) version of my presentation about the new library portal of Fontys University of applied science. First time I did this one was for a delegation from Uzbekistan.
Semantic Web special interest group meeting - IFLA WLIC 2012Figoblog
The document discusses the 2nd open session of the IFLA Semantic Web Special Interest Group (SWSIG) being held in Helsinki. It provides an introduction to semantic web concepts including the semantic web, linked data, RDF triples, and ontologies. It also discusses applications of semantic web standards and namespaces from organizations like IFLA, W3C, and FRBR in areas such as library linked open data projects, element sets, value vocabularies, and dataset applications. Presentations will be given on topics like the Bibliographic Framework Update, licensing issues in linked data projects, and practical linked library data applications.
Designing a multilingual knowledge graph - DCMI2018Antoine Isaac
Presentation for the paper "Designing a multilingual knowledge graph as service for cultural heritage" at the DCMI2018 conference https://www.dublincore.org/conferences/2018/abstracts/#559
eScriptorium: An Open Source Platform for Historical Document AnalysisEquipex Biblissima
By Daniel Stoekl Ben Ezra (Director of Studies, EPHE-PSL, UMR 8546 AOrOc).
Rendez-vous IIIF360, an online event on IIIF standards and technologies organised by the IIIF360 consortium (Biblissima, Campus Condorcet, Huma-Num) on 24 March 2021: https://projet.biblissima.fr/fr/evenements/rendez-vous-iiif360-2021
Annotate (E-ReColNat) : annotation rapide d’images et de vidéos en sciences n...Equipex Biblissima
By Gilles Bertin (Research Engineer, CNAM).
Rendez-vous IIIF360, an online event on IIIF standards and technologies organised by the IIIF360 consortium (Biblissima, Campus Condorcet, Huma-Num) on 24 March 2021: https://projet.biblissima.fr/fr/evenements/rendez-vous-iiif360-2021
Appliquer les techniques d'apprentissage profond pour détecter les enluminure...Equipex Biblissima
By Victoria Eyharabide (Associate Professor, STIH Laboratory, Sorbonne Université), Fouad Aouinti (Postdoctoral Researcher, STIH Laboratory, Sorbonne Université), Xavier Fresquet (Deputy Director, Sorbonne Center for Artificial Intelligence - SCAI, Sorbonne Université)
Rendez-vous IIIF360, an online event on IIIF standards and technologies organised by the IIIF360 consortium (Biblissima, Campus Condorcet, Huma-Num) on 24 March 2021: https://projet.biblissima.fr/fr/evenements/rendez-vous-iiif360-2021
Représentations du chant du Moyen Âge dans les images IIIFEquipex Biblissima
By Valérie Le Page (PhD candidate, IReMus Laboratory, Sorbonne Université) and Victoria Eyharabide (Associate Professor, STIH Laboratory, Sorbonne Université).
Rendez-vous IIIF360, an online event on IIIF standards and technologies organised by the IIIF360 consortium (Biblissima, Campus Condorcet, Huma-Num) on 24 March 2021: https://projet.biblissima.fr/fr/evenements/rendez-vous-iiif360-2021
Réflexions et explorations croisées autour de IIIF, Omeka-s et NumaHOP à la B...Equipex Biblissima
By Pauline Rivière (Digitisation Project Manager at the Bibliothèque Sainte-Geneviève).
Rendez-vous IIIF360, an online event on IIIF standards and technologies organised by the IIIF360 consortium (Biblissima, Campus Condorcet, Huma-Num) on 24 March 2021: https://projet.biblissima.fr/fr/evenements/rendez-vous-iiif360-2021
Mise en œuvre de IIIF pour la reconnaissance automatique de documentsEquipex Biblissima
By Christopher Kermorvant (President of TEKLIA).
Rendez-vous IIIF360, an online event on IIIF standards and technologies organised by the IIIF360 consortium (Biblissima, Campus Condorcet, Huma-Num) on 24 March 2021: https://projet.biblissima.fr/fr/evenements/rendez-vous-iiif360-2021
By Adrien Desseigne (ingénieur d'études, web application designer and developer, TGIR Huma-Num).
Rendez-vous IIIF360, an online event on IIIF standards and technologies organised by the IIIF360 consortium (Biblissima, Campus Condorcet, Huma-Num) on 24 March 2021: https://projet.biblissima.fr/fr/evenements/rendez-vous-iiif360-2021
By Régis Robineau (ingénieur d'études, coordinator of the Biblissima team, member of the IIIF Technical Review Committee).
Rendez-vous IIIF360, an online event on IIIF standards and technologies organised by the IIIF360 consortium (Biblissima, Campus Condorcet, Huma-Num) on 24 March 2021: https://projet.biblissima.fr/fr/evenements/rendez-vous-iiif360-2021
Les référentiels Biblissima : épine dorsale du portail Biblissima et de IIIF-...Equipex Biblissima
Presentation at the seminar on provenance studies in French local-authority libraries (CERL, MCC, Bibliothèque municipale de Lyon), 3 April 2019 in Lyon. By Régis Robineau (Biblissima - Campus Condorcet, EPHE-PSL).
Introduction aux protocoles IIIF. Formation Enssib 23.01.2019 (Régis Robineau)Equipex Biblissima
Presentation of the IIIF protocols as part of the training for the Enssib library curator diploma (DCB 27), in Villeurbanne on 23 January 2019. By Régis Robineau (Biblissima - Campus Condorcet, EPHE-PSL).
A la recherche du patrimoine écrit avec le portail BiblissimaEquipex Biblissima
Presentation by Régis Robineau at the Médiadix and URFIST de Paris study day "Revisiter le patrimoine en bibliothèque : valorisation, médiatisation et démocratisation" (Pôle métiers du livre, Université Paris Nanterre, 14 December 2018)
Browse and Visualize Manuscripts Illuminations with IIIFEquipex Biblissima
This document discusses using IIIF to browse and visualize illuminations from medieval manuscripts in the Biblissima Portal. It summarizes the Biblissima Portal's focus on medieval manuscript collections and its aggregation of metadata. It then describes two databases of manuscript illuminations and metadata that have been made available through IIIF, including over 200,000 illuminations from the BnF. Finally, it discusses potential improvements to better integrate supplemental metadata and provide a customized UI for exploring illuminations.
Presentation by Régis Robineau at the HIMANIS study day “Reconnaissance par ordinateur des écritures anciennes : le projet HIMANIS” (Archives nationales, Paris, 29 May 2018)
Éparpillés façon puzzle. Rassembler aujourd'hui les manuscrits d'un auteur ca...Equipex Biblissima
Pierre Chambert-Protat, Éparpillés façon puzzle. Rassembler aujourd'hui les manuscrits d'un auteur carolingien dispersés par les siècles. Biblissima and IIIF day "Innover pour redécouvrir le patrimoine écrit" (15 March 2018, Campus Condorcet)
IIIF au coeur de la recherche: publier, agréger et divulguer les fragments de...Equipex Biblissima
William Duba, IIIF au coeur de la recherche: publier, agréger et divulguer les fragments de manuscrits médiévaux. Biblissima and IIIF day "Innover pour redécouvrir le patrimoine écrit" (15 March 2018, Campus Condorcet)
Pierre-Yves Buard, Biblissima pour l'édition scientifique. Biblissima and IIIF day "Innover pour redécouvrir le patrimoine écrit" (15 March 2018, Campus Condorcet)
Quelques applications pratiques de IIIF pour les bibliothèques numériques e...Equipex Biblissima
Rafael Schwemmer, Quelques applications pratiques de IIIF pour les bibliothèques numériques et au-delà. Biblissima and IIIF day "Innover pour redécouvrir le patrimoine écrit" (15 March 2018, Campus Condorcet)
The Biblissima Authority File of Geographical Names
1. The Biblissima Authority File of Geographical Names
Campus Condorcet workshop
“Référentiels géo-historiques sémantisés pour les humanités”
École nationale des chartes, 14 May 2019
Régis ROBINEAU
Biblissima - Campus Condorcet / EPHE-PSL
2. Biblissima?
➔ Data facility for historians of ancient texts
➔ 10 partners, including the Archives nationales (since 2017)
3. Main Goals of Biblissima
➔ Federate digital libraries
➔ Structure data corpora and research communities
➔ Facilitate access to and reuse of data (both textual and documentary resources)
➔ Train researchers
8. Other digital libraries used in the portal (via IIIF)
➔ Bayerische Staatsbibliothek (BSB)
➔ Biblioteca Apostolica Vaticana
➔ Bodleian Library - Oxford University
➔ e-codices
➔ Harvard University
➔ Internet Archive
➔ Library of Congress
➔ Mazarinum - Bibliothèque Mazarine
➔ Numistral
➔ Universität Heidelberg
➔ Wellcome Library
About IIIF in the Biblissima portal: beta.biblissima.fr/fr/info-iiif
17. Links to the Biblissima portal for different entities (manuscript, agent, place)
iiif.biblissima.fr/collections
18. Types of data
➔ Shelfmarks of manuscripts and early printed books
➔ Persons
➔ Organisations (including holding institutions)
➔ Geographical names
➔ Textual works
➔ Iconographic descriptors (indexing illuminations)
19. Processing of authority data
➔ Reconcile: identify, disambiguate and cluster named entities
➔ Align to libraries’ authority files and other datasets published as Linked Open Data
➔ Mint unique and stable identifiers
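The three processing steps can be illustrated with a deliberately naive sketch. It assumes that name variants are clustered on an accent-insensitive, case-insensitive key, and it mints placeholder identifiers with a hypothetical "X" prefix; the function names and the matching rule are illustrative assumptions, not the Biblissima workflow, which also has to disambiguate homonyms.

```python
import unicodedata
from collections import defaultdict
from itertools import count


def normalise(label: str) -> str:
    """Build a naive matching key: no accents, lower case, single spaces."""
    decomposed = unicodedata.normalize("NFKD", label)
    no_accents = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    return " ".join(no_accents.lower().split())


def reconcile(records):
    """Cluster (source, label) pairs by key and mint one identifier per cluster.

    The "X<n>" identifiers are placeholders invented for this sketch.
    """
    clusters = defaultdict(list)
    for source, label in records:
        clusters[normalise(label)].append((source, label))
    ids = count(1)
    # Dicts keep insertion order, so identifiers follow first appearance.
    return {f"X{next(ids)}": members for members in clusters.values()}
```

A key-based match like this only handles spelling variants; distinguishing two different places that share a name would need extra evidence such as coordinates or alignments to external authority files.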
20. A Platform for Biblissima Authority Files
data.biblissima.fr
➔ First public release in March 2019:
◆ Persons authority file (~26 50 person entities)
➔ “Hub” to manage and share authority data:
◆ wiki-based technology: natively collaborative, with data versioning
◆ handles URI identifiers
◆ natively produces RDF data
◆ user-friendly forms to edit entries
◆ remote access for machines: Web API + SPARQL endpoint
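As an illustration of the machine access mentioned on this slide, the sketch below builds a SPARQL query for the labels of one entity and encodes it as a GET request following the SPARQL 1.1 Protocol (`query` parameter). The entity URI pattern is the one shown on the entity slides of this deck; the endpoint address is a placeholder, and the use of `rdfs:label` for labels is an assumption about the exported RDF, so both should be checked against the platform's documentation.

```python
from urllib.parse import urlencode

# URI pattern as it appears on the entity slides of this presentation.
ENTITY_BASE = "https://data.biblissima.fr/entity/"
# Placeholder only: the real SPARQL endpoint URL is not given in the slides.
SPARQL_ENDPOINT = "https://example.org/sparql"


def entity_uri(qid: str) -> str:
    """Return the entity URI for an identifier such as 'Q27525'."""
    if not (qid.startswith("Q") and qid[1:].isdigit()):
        raise ValueError(f"not an entity identifier: {qid!r}")
    return ENTITY_BASE + qid


def labels_query(qid: str, lang: str = "fr") -> str:
    """Build a query for the labels of an entity (assumes rdfs:label is used)."""
    return (
        "PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#>\n"
        f"SELECT ?label WHERE {{ <{entity_uri(qid)}> rdfs:label ?label . "
        f'FILTER(LANG(?label) = "{lang}") }}'
    )


def sparql_get_url(query: str) -> str:
    """Encode a query as a SPARQL 1.1 Protocol GET request URL."""
    return SPARQL_ENDPOINT + "?" + urlencode({"query": query})
```

The sketch stops at building the URL so that it runs offline; sending the request and parsing the JSON results would be the next step once the actual endpoint is known.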
21. Publication of Biblissima Authority Files
Publication spread over 2019:
✓ Persons (March 2019)
✓ Geographical names (April 2019)
🚧 Organisations
➔ Shelfmarks of manuscripts and early printed books
➔ Textual works
➔ Iconographic descriptors (after Initiale)
22. Persons Authority File
➔ Sources of the data:
◆ catalogues and databases of the Biblissima partners, integrated into the portal since April 2017: beta.biblissima.fr
◆ datasets merged into the platform IIIF Collections of Manuscripts and Rare Books: iiif.biblissima.fr/collections
➔ External alignments:
◆ BnF, Library of Congress, DNB, Wikidata, SUDOC, Biblioteca Nacional de España, CERL Thesaurus
23. Persons Authority File
➔ Preferred forms of labels:
◆ retrieved from the BnF or LoC authority files
◆ created based on choices made by other libraries or dictionaries (e.g. Dizionario biografico degli Italiani, Oxford Dictionary of National Biography)
➔ Alternative forms (aliases):
◆ labels as present in the source databases
➔ Bibliographical notes:
◆ added to give further details about the identity of a person
data.biblissima.fr/w/Référentiel_des_personnes_physiques
24. Page of a Person entity (Cassiodore): data.biblissima.fr/w/Item:Q2785
Preferred form and alternative forms of the name
25. Page of a Person entity (Cassiodore)
“Statements” section
26. Page of a Person entity (Cassiodore)
“Identifiers” section: alignments and links to source databases
27. Geographical Names Authority File
➔ Sources of the data:
◆ catalogues and databases of the Biblissima partners, integrated into the portal since April 2017: beta.biblissima.fr
◆ datasets merged into the platform IIIF Collections of Manuscripts and Rare Books: iiif.biblissima.fr/collections
➔ ~ 5500 geographical names (May 2019)
More info at:
data.biblissima.fr/w/Référentiel_des_noms_géographiques
28. Geographical Names Authority File
➔ Types of places:
◆ Places of holding institutions
◆ Places of organisations mentioned as agents in relation to a document (former owner, archives producer etc.)
◆ Places of origin of manuscripts and places of edition of printed books
◆ Places as iconographic descriptors (Mandragore)
29. Geographical Names Authority File
➔ Provenance of data:
◆ Preferred forms in French from the BnF; in English from GeoNames, Wikidata or other datasets (e.g. Pleiades)
◆ Other alternative forms coming from the source databases
◆ Geo-coordinates taken from the GeoNames API or the BnF (SPARQL endpoint of data.bnf.fr)
➔ External alignments:
◆ BnF (Rameau + Cartes et Plans), Wikidata, GeoNames, Pleiades
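The rule for preferred forms described above can be sketched as a per-language source priority. The priority order below is simply read off the bullet list (BnF for French; GeoNames, then Wikidata, then Pleiades for English) and is an assumption about how ties are resolved; the `Label` class and function name are hypothetical.

```python
from dataclasses import dataclass
from typing import List, Optional

# Assumed priority of sources per language, taken from the order of the slide.
SOURCE_PRIORITY = {
    "fr": ["BnF"],
    "en": ["GeoNames", "Wikidata", "Pleiades"],
}


@dataclass(frozen=True)
class Label:
    text: str
    lang: str
    source: str


def preferred_label(labels: List[Label], lang: str) -> Optional[Label]:
    """Return the label from the highest-priority source for a language.

    Returns None when no listed source provides a label in that language;
    such labels would then remain alternative forms only.
    """
    for source in SOURCE_PRIORITY.get(lang, []):
        for label in labels:
            if label.lang == lang and label.source == source:
                return label
    return None
```

Labels that are not selected as preferred would be kept as alternative forms, in line with the second bullet of the slide.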
30. Geographical Names Authority File
➔ Hierarchy of concepts: each place falls under two classifications:
◆ Thematic classification derived from the Mandragore database, extended to all data (36 categories based on Dewey): e.g. “Lyon” falls under “géographie: france et monaco”
◆ Classification by type of entity taken from the GeoNames ontology (88 classes retained): e.g. “Lyon” is a “seat of a first-order administrative division” (P.PPLA code)
➔ Coming soon...
◆ integration of places identified in the IIIF-Collections datasets and the geographical descriptors of the Initiale database
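The double classification can be pictured as a small record holding one thematic category and one GeoNames feature code. The sketch below uses the “Lyon” example from the slide and splits a code such as "P.PPLA" into its GeoNames feature class ("P") and feature code ("PPLA"); the class and function names are hypothetical and do not reflect how the platform stores these values.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)
class PlaceClassification:
    thematic: str       # category derived from the Mandragore classification
    feature_class: str  # GeoNames feature class, e.g. "P"
    feature_code: str   # GeoNames feature code, e.g. "PPLA"


def parse_geonames_code(code: str) -> Tuple[str, str]:
    """Split 'P.PPLA' into ('P', 'PPLA'); reject anything else."""
    feature_class, sep, feature_code = code.strip().partition(".")
    if not sep or len(feature_class) != 1 or not feature_code:
        raise ValueError(f"expected '<class>.<code>', got {code!r}")
    return feature_class.upper(), feature_code.upper()


def classify(thematic: str, geonames_code: str) -> PlaceClassification:
    """Combine the two classifications of a place into one record."""
    feature_class, feature_code = parse_geonames_code(geonames_code)
    return PlaceClassification(thematic, feature_class, feature_code)


# Example values taken from the slide.
lyon = classify("géographie: france et monaco", "P.PPLA")
```

Keeping the feature class separate from the feature code makes it possible to group places at the coarser level (all populated places) as well as by exact type.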
31. Page of a Place entity (Istanbul)
https://data.biblissima.fr/entity/Q27525
32. Page of a Place entity (Istanbul)
“Statements” section
https://data.biblissima.fr/entity/Q27525
33. Page of a Place entity (Istanbul)
Qualifier of an identifier
Qualifier to specify the nature of the second BnF identifier (= RAMEAU subject heading)
34. Page of a Place entity (Adatha)
Biblissima note