Hutt, Arwen and Jenn Riley. "Semantics and Syntax of Dublin Core Usage in Open Archives Initiative Data Providers of Cultural Heritage Materials." Joint Conference on Digital Libraries, Denver, CO, June 7-11, 2005.
American Art Collaborative Linked Open Data presentation to "The Networked Cu...American Art Collaborative
An August 2017 presentation by Eleanor Fink to "The Networked Curator: Association of Art Museum Curators Foundation Digital Literacy Workshop for Art Curators"
Introduction | Categories for Description of Works of Art | CDWA-LITE Kymberly Keeton
The CDWA-Lite XML schema was created by three organizations to describe cultural works and their related information. It aims to provide core descriptive metadata for works while allowing flexibility in the level of detail provided. The schema includes 22 elements, with 19 for descriptive metadata, 3 for administrative metadata, and 9 required elements. It utilizes controlled vocabularies, XML syntax rules, and semantic elements. Records encoded in CDWA-Lite can be harvested through the Open Archives Initiative protocol.
Handout for Next-generation Subject Access for Music: Infrastructure NeedsJenn Riley
This document discusses next-generation approaches to subject access and terminologies services. It provides examples of projects and implementations that are exploring new ways to utilize controlled vocabularies and subject headings, such as developing a faceted structure for Library of Congress Subject Headings (LCSH) and using subject headings to generate facets for refining search results. Specific projects mentioned include the UK High-Level Thesaurus project, experiments with "subject maps" at the University of Pennsylvania, and implementations at Indiana University, North Carolina State University, and the University of Chicago that automatically expand or suggest subject searches based on LCSH hierarchies.
A Perspective on Wikidata: Ecosystems, Trust, and UsabilityRobert Sanderson
Brief and skeptical presentation about wikidata and its potential for use and abuse in the cultural heritage data ecosystem, presented at the PCC/LDAC forum on wikidata, November 12th, 2021.
Krystal Thomas presented on the challenges of cataloging Theodore Roosevelt's papers for a digital presidential library project spanning multiple institutions. The papers are scattered across various libraries and parks, requiring coordination. Issues include the volume of materials, copyright, and developing a standardized metadata scheme incorporating both library and archival standards. Citizen volunteers are helping to catalog the unprocessed Library of Congress collection. The project uses Dublin Core metadata, FAST and LCNAF controlled vocabularies, and project management software to organize collaboration.
This paper surveys the landscape of linked open data projects in cultural heritage, exam- ining the work of groups from around the world. Traditionally, linked open data has been ranked using the five star method proposed by Tim Berners-Lee. We found this ranking to be lacking when evaluating how cultural heritage groups not merely develop linked open datasets, but find ways to used linked data to augment user experience. Building on the five-star method, we developed a six-stage life cycle describing both dataset development and dataset usage. We use this framework to describe and evaluate fifteen linked open data projects in the realm of cultural heritage.
American Art Collaborative Linked Open Data presentation to "The Networked Cu...American Art Collaborative
An August 2017 presentation by Eleanor Fink to "The Networked Curator: Association of Art Museum Curators Foundation Digital Literacy Workshop for Art Curators"
Introduction | Categories for Description of Works of Art | CDWA-LITE Kymberly Keeton
The CDWA-Lite XML schema was created by three organizations to describe cultural works and their related information. It aims to provide core descriptive metadata for works while allowing flexibility in the level of detail provided. The schema includes 22 elements, with 19 for descriptive metadata, 3 for administrative metadata, and 9 required elements. It utilizes controlled vocabularies, XML syntax rules, and semantic elements. Records encoded in CDWA-Lite can be harvested through the Open Archives Initiative protocol.
Handout for Next-generation Subject Access for Music: Infrastructure NeedsJenn Riley
This document discusses next-generation approaches to subject access and terminologies services. It provides examples of projects and implementations that are exploring new ways to utilize controlled vocabularies and subject headings, such as developing a faceted structure for Library of Congress Subject Headings (LCSH) and using subject headings to generate facets for refining search results. Specific projects mentioned include the UK High-Level Thesaurus project, experiments with "subject maps" at the University of Pennsylvania, and implementations at Indiana University, North Carolina State University, and the University of Chicago that automatically expand or suggest subject searches based on LCSH hierarchies.
A Perspective on Wikidata: Ecosystems, Trust, and UsabilityRobert Sanderson
Brief and skeptical presentation about wikidata and its potential for use and abuse in the cultural heritage data ecosystem, presented at the PCC/LDAC forum on wikidata, November 12th, 2021.
Krystal Thomas presented on the challenges of cataloging Theodore Roosevelt's papers for a digital presidential library project spanning multiple institutions. The papers are scattered across various libraries and parks, requiring coordination. Issues include the volume of materials, copyright, and developing a standardized metadata scheme incorporating both library and archival standards. Citizen volunteers are helping to catalog the unprocessed Library of Congress collection. The project uses Dublin Core metadata, FAST and LCNAF controlled vocabularies, and project management software to organize collaboration.
This paper surveys the landscape of linked open data projects in cultural heritage, exam- ining the work of groups from around the world. Traditionally, linked open data has been ranked using the five star method proposed by Tim Berners-Lee. We found this ranking to be lacking when evaluating how cultural heritage groups not merely develop linked open datasets, but find ways to used linked data to augment user experience. Building on the five-star method, we developed a six-stage life cycle describing both dataset development and dataset usage. We use this framework to describe and evaluate fifteen linked open data projects in the realm of cultural heritage.
Data Citation, The Dataverse Network ®, and Contributor IdentifiersMicah Altman
This document discusses data citation, the Dataverse Network, and contributor identifiers. It provides an overview of the Dataverse Network as an open-source application for sharing, discovering, and preserving data. It also discusses related work on digital preservation through archival collaboration and a proposed standard for the scholarly citation of quantitative data. The document outlines various use cases and benefits of the Dataverse Network for both organizations and scholars.
Digital Infrastructure: Storage and Content ManagementNoreen Whysel
Discusses analogies between the rise of the electric power grid and the Internet. Describes storage capacity issues and requirements for digital repositories. Reviews different repository platforms specific to archival and digital collection management. Has a really cool picture of Burden's Wheel.
Introduction to and overview of digital repository projects at Northwestern University, developed for a guest lecture at the Dominican University Graduate School of Library and Information Science Digital Curation course. Presentation based in part on an earlier presentation developed by Steve DiDomenico and Claire Stewart
An introduction to the Joint Information Systems Committee Resource Discovery iKit. Includes a look at controlled vocabularies declared in the Resource Discovery Framework (RDF)/Simple Knowledge Organisation System (SKOS) and wikipedia entries. Presented by Tony Ross at the CILIPS Centenary Conference Branch and Group Day which took place 5 Jun 2008.
Sherif Metadata Talk - London (June 25th 2018)Getaneh Alemu
This document summarizes the existing challenges and opportunities in the cataloguing and metadata function of Southampton Solent University. It discusses how the university has shifted to primarily electronic resources and moved to enrich metadata through standards like RDA. It also touches on balancing metadata quality with completeness while avoiding duplication through techniques like WEMI and FRBRization. The future of metadata is discussed as being enriched, linked, open and filtered.
One Discovery Layer, Eight Front Doors: Implementing Blacklight @ IUCourtney McDonald
The document summarizes Indiana University's implementation of the Blacklight discovery layer across its eight campuses to provide a shared interface for its online catalog (IUCAT) while allowing for flexibility across campuses. Key points include: IU has a complex data environment with diverse collections across eight campuses previously only served by a one-size-fits-all interface; in 2011 IU selected Blacklight over VuFind as its discovery layer due to flexibility and development community; implementation began in summer 2011 with a public beta in fall 2012 and full transition in May 2013; campus-specific views and call number browsing were customized; and future work includes enhanced customization, transition to Kuali OLE, and improving browse functions.
This document describes a case study where the University of Denver used Getty vocabularies as linked open data in a cataloging tool for an academic teaching collection. The tool was designed with a user-friendly interface, Dublin Core metadata, and integrated authority control drawn from sources like ULAN, AAT, and Library of Congress. Screenshots show how materials could be cataloged and metadata exported to other systems using standards from the semantic web like URIs, RDF, and SPARQL. The tool helped increase efficiency and quality of metadata production for the teaching collection.
The document discusses the Utah State Archives' efforts to digitize government records and make them accessible online. It provides an overview of digitization projects since 2005, partnerships that have contributed collections, and growth in digital holdings. Standards and best practices for digital imaging, metadata, and building online collections are presented, as well as the archives' workflow for digital projects and tools used. The goal is to preserve records and provide public access while maintaining context and usability of the digitized materials.
From the principle of sufficiency and necessity to metadata enrichingGetaneh Alemu
In contrast to the principle of metadata simplicity and sufficiency, the principle of metadata enriching can be considered a departure from traditional cataloguing approaches where the focus was on metadata simplicity. Metadata created and managed following the principle of metadata enriching better responds to users’ needs. Whilst the principle of enriching results in a potential abundance of metadata, the principle of filtering is used to simplify its presentation by enabling a user-centred/focused/led design.
The document discusses guidelines for creating shareable MODS metadata records for the DLF Aquifer initiative. It provides an overview of the Aquifer project and its goal of aggregating digital collections. The guidelines were developed to promote consistent, coherent metadata that is understandable and useful outside local contexts. It outlines required and recommended MODS elements and provides examples to allow varying levels of functionality for end users searching and browsing aggregated collections. Current and future work includes validation tools and converting between MODS and MARC formats.
- The document discusses shareable metadata, the Metadata Object Description Standard (MODS), and the Resource Description and Access (RDA) standard.
- Shareable metadata promotes search interoperability across systems, is understandable outside its local context, is useful outside its local context, and is machine-processable.
- MODS is a simplified version of MARC that uses XML and is a useful middle ground between MARC and Dublin Core for library applications.
- RDA is a content standard that is closely aligned with MARC and MODS, and examples of RDA implementation in MODS would be useful.
Kevin Long - DRI Training Series Day UCC: Organising Your Collectiondri_ireland
Presentation given by Kevin Long, Digital Data Curator on the Inspiring Ireland 1916 project at the Digital Repository of Ireland, in the Digital Humanities Active Learning Space, University College Cork, as part of a day-long DRI Training session on 'Preparing Digital Collections'. This seminar introduces attendees to the basics of arranging collections of heritage material to facilitate cataloguing and discovery. Although the Digital Repository of Ireland’s collection arrangement functionality will be discussed specifically, the themes explored in this seminar are applicable to both digital and non-digital collections.
Visualising Dissertations on Electronic Literature (Visualising E-lit seminar...Jill Walker Rettberg
This document summarizes a presentation about visualizing dissertations on electronic literature. It shows that the number of dissertations on the topic has grown since 1976, with most published since 2008. It also describes how the dissertations and creative works they reference can be visualized as networks, such as a two-mode network connecting dissertations and works or a one-mode network of works connected by multiple references. This allows analyzing the diversity of cited works and connections between ideas in the field of electronic literature.
Presented for managers & researchers at The Global One Health Initiative of the Ohio State University, Africa Regional Branch in Addis Ababa, Ethiopia (April 24th 2019)
The document provides statistics on the FAST (Faceted Application of Subject Terminology) thesaurus as of June 2017, including the number of records and types of headings in FAST. It also lists links from FAST to other datasets like Library of Congress Subject Headings and Wikidata. The FAST team is continuing to synchronize and refine processing rules for FAST and developing an import tool. Information is also provided on the FACETVOC-L discussion list focused on faceted controlled vocabularies.
Metadata enriching and discovery at Solent University Library Getaneh Alemu
This document discusses metadata enriching and discovery at Solent University. It begins with introductions and context about how enriched, linked, open and filtered metadata drives resource usage. It then discusses several principles of metadata including sufficiency, necessity, user convenience, representation and standardization. The document outlines how Solent University has enriched its metadata by importing subject headings and authorities. It discusses metadata linking, openness, filtering and usage. Overall it emphasizes the importance of enriching metadata and keeping interfaces simple while maximizing resource discovery and usage.
DSpace for Cultural Heritage: adding support for images visualization,audio/v...Andrea Bollini
Digital Repositories are continuously evolving into platforms aimed at managing, visualizing, curating and preserving a variety of different cultural digital objects together with their relationships
To support interoperability and to allow a broad dissemination and re-use of cultural heritage and research results, we have built two DSpace add-ons to be released as open source, the IIIF (International Image Interoperability Framework) Image Viewer and the audio/video streaming module. The first one manages the complexity of digital objects such as page sequences, chapters and sections, exposing the metadata and the structure with the IIIF presentation API and use the IIIF API to provide fast visualization and low bandwidth use. The streaming module allows to stream audio / video content loaded in the repository using adaptive streaming and the DASH industry standard. Both modules provide a full open source stack or enable the integration with external Images and Media Server.
Managing the relations between digital objects both in a hierarchical or relational way is a key feature, in order to manage every kind of cultural heritage material. Thus we are enhancing the DSpace Data Model in order to provide not only structural metadata management but also the description of relationships within cultural contexts.
Slides presented at OR2017 - Brisbane, Australia
Towards OpenURL Quality Metrics: Initial Findingsalc28
Presentation on creating a method for benchmarking metadata consistency in OpenURL links. See also: <http: />. Delivered at the July 2009 American Library Association conference in Chicago.
The students in an LS 566 Metadata class were asked to index a collection of six images from the 1992 University of Alabama Crimson Tide football season. The class chose a modified Dublin Core metadata schema containing 15 elements to describe the images. Students worked in groups to define the elements and map them to the images. They used Omeka software to extract image text and connect files to element fields. The indexing process took longer than expected due to a tornado that damaged the university.
Data Citation, The Dataverse Network ®, and Contributor IdentifiersMicah Altman
This document discusses data citation, the Dataverse Network, and contributor identifiers. It provides an overview of the Dataverse Network as an open-source application for sharing, discovering, and preserving data. It also discusses related work on digital preservation through archival collaboration and a proposed standard for the scholarly citation of quantitative data. The document outlines various use cases and benefits of the Dataverse Network for both organizations and scholars.
Digital Infrastructure: Storage and Content ManagementNoreen Whysel
Discusses analogies between the rise of the electric power grid and the Internet. Describes storage capacity issues and requirements for digital repositories. Reviews different repository platforms specific to archival and digital collection management. Has a really cool picture of Burden's Wheel.
Introduction to and overview of digital repository projects at Northwestern University, developed for a guest lecture at the Dominican University Graduate School of Library and Information Science Digital Curation course. Presentation based in part on an earlier presentation developed by Steve DiDomenico and Claire Stewart
An introduction to the Joint Information Systems Committee Resource Discovery iKit. Includes a look at controlled vocabularies declared in the Resource Discovery Framework (RDF)/Simple Knowledge Organisation System (SKOS) and wikipedia entries. Presented by Tony Ross at the CILIPS Centenary Conference Branch and Group Day which took place 5 Jun 2008.
Sherif Metadata Talk - London (June 25th 2018)Getaneh Alemu
This document summarizes the existing challenges and opportunities in the cataloguing and metadata function of Southampton Solent University. It discusses how the university has shifted to primarily electronic resources and moved to enrich metadata through standards like RDA. It also touches on balancing metadata quality with completeness while avoiding duplication through techniques like WEMI and FRBRization. The future of metadata is discussed as being enriched, linked, open and filtered.
One Discovery Layer, Eight Front Doors: Implementing Blacklight @ IUCourtney McDonald
The document summarizes Indiana University's implementation of the Blacklight discovery layer across its eight campuses to provide a shared interface for its online catalog (IUCAT) while allowing for flexibility across campuses. Key points include: IU has a complex data environment with diverse collections across eight campuses previously only served by a one-size-fits-all interface; in 2011 IU selected Blacklight over VuFind as its discovery layer due to flexibility and development community; implementation began in summer 2011 with a public beta in fall 2012 and full transition in May 2013; campus-specific views and call number browsing were customized; and future work includes enhanced customization, transition to Kuali OLE, and improving browse functions.
This document describes a case study where the University of Denver used Getty vocabularies as linked open data in a cataloging tool for an academic teaching collection. The tool was designed with a user-friendly interface, Dublin Core metadata, and integrated authority control drawn from sources like ULAN, AAT, and Library of Congress. Screenshots show how materials could be cataloged and metadata exported to other systems using standards from the semantic web like URIs, RDF, and SPARQL. The tool helped increase efficiency and quality of metadata production for the teaching collection.
The document discusses the Utah State Archives' efforts to digitize government records and make them accessible online. It provides an overview of digitization projects since 2005, partnerships that have contributed collections, and growth in digital holdings. Standards and best practices for digital imaging, metadata, and building online collections are presented, as well as the archives' workflow for digital projects and tools used. The goal is to preserve records and provide public access while maintaining context and usability of the digitized materials.
From the principle of sufficiency and necessity to metadata enrichingGetaneh Alemu
In contrast to the principle of metadata simplicity and sufficiency, the principle of metadata enriching can be considered a departure from traditional cataloguing approaches where the focus was on metadata simplicity. Metadata created and managed following the principle of metadata enriching better responds to users’ needs. Whilst the principle of enriching results in a potential abundance of metadata, the principle of filtering is used to simplify its presentation by enabling a user-centred/focused/led design.
The document discusses guidelines for creating shareable MODS metadata records for the DLF Aquifer initiative. It provides an overview of the Aquifer project and its goal of aggregating digital collections. The guidelines were developed to promote consistent, coherent metadata that is understandable and useful outside local contexts. It outlines required and recommended MODS elements and provides examples to allow varying levels of functionality for end users searching and browsing aggregated collections. Current and future work includes validation tools and converting between MODS and MARC formats.
- The document discusses shareable metadata, the Metadata Object Description Standard (MODS), and the Resource Description and Access (RDA) standard.
- Shareable metadata promotes search interoperability across systems, is understandable outside its local context, is useful outside its local context, and is machine-processable.
- MODS is a simplified version of MARC that uses XML and is a useful middle ground between MARC and Dublin Core for library applications.
- RDA is a content standard that is closely aligned with MARC and MODS, and examples of RDA implementation in MODS would be useful.
Kevin Long - DRI Training Series Day UCC: Organising Your Collectiondri_ireland
Presentation given by Kevin Long, Digital Data Curator on the Inspiring Ireland 1916 project at the Digital Repository of Ireland, in the Digital Humanities Active Learning Space, University College Cork, as part of a day-long DRI Training session on 'Preparing Digital Collections'. This seminar introduces attendees to the basics of arranging collections of heritage material to facilitate cataloguing and discovery. Although the Digital Repository of Ireland’s collection arrangement functionality will be discussed specifically, the themes explored in this seminar are applicable to both digital and non-digital collections.
Visualising Dissertations on Electronic Literature (Visualising E-lit seminar...Jill Walker Rettberg
This document summarizes a presentation about visualizing dissertations on electronic literature. It shows that the number of dissertations on the topic has grown since 1976, with most published since 2008. It also describes how the dissertations and creative works they reference can be visualized as networks, such as a two-mode network connecting dissertations and works or a one-mode network of works connected by multiple references. This allows analyzing the diversity of cited works and connections between ideas in the field of electronic literature.
Presented for managers & researchers at The Global One Health Initiative of the Ohio State University, Africa Regional Branch in Addis Ababa, Ethiopia (April 24th 2019)
The document provides statistics on the FAST (Faceted Application of Subject Terminology) thesaurus as of June 2017, including the number of records and types of headings in FAST. It also lists links from FAST to other datasets like Library of Congress Subject Headings and Wikidata. The FAST team is continuing to synchronize and refine processing rules for FAST and developing an import tool. Information is also provided on the FACETVOC-L discussion list focused on faceted controlled vocabularies.
Metadata enriching and discovery at Solent University Library Getaneh Alemu
This document discusses metadata enriching and discovery at Solent University. It begins with introductions and context about how enriched, linked, open and filtered metadata drives resource usage. It then discusses several principles of metadata including sufficiency, necessity, user convenience, representation and standardization. The document outlines how Solent University has enriched its metadata by importing subject headings and authorities. It discusses metadata linking, openness, filtering and usage. Overall it emphasizes the importance of enriching metadata and keeping interfaces simple while maximizing resource discovery and usage.
DSpace for Cultural Heritage: adding support for images visualization,audio/v...Andrea Bollini
Digital Repositories are continuously evolving into platforms aimed at managing, visualizing, curating and preserving a variety of different cultural digital objects together with their relationships
To support interoperability and to allow a broad dissemination and re-use of cultural heritage and research results, we have built two DSpace add-ons to be released as open source, the IIIF (International Image Interoperability Framework) Image Viewer and the audio/video streaming module. The first one manages the complexity of digital objects such as page sequences, chapters and sections, exposing the metadata and the structure with the IIIF presentation API and use the IIIF API to provide fast visualization and low bandwidth use. The streaming module allows to stream audio / video content loaded in the repository using adaptive streaming and the DASH industry standard. Both modules provide a full open source stack or enable the integration with external Images and Media Server.
Managing the relations between digital objects both in a hierarchical or relational way is a key feature, in order to manage every kind of cultural heritage material. Thus we are enhancing the DSpace Data Model in order to provide not only structural metadata management but also the description of relationships within cultural contexts.
Slides presented at OR2017 - Brisbane, Australia
Towards OpenURL Quality Metrics: Initial Findingsalc28
Presentation on creating a method for benchmarking metadata consistency in OpenURL links. See also: <http: />. Delivered at the July 2009 American Library Association conference in Chicago.
The students in an LS 566 Metadata class were asked to index a collection of six images from the 1992 University of Alabama Crimson Tide football season. The class chose a modified Dublin Core metadata schema containing 15 elements to describe the images. Students worked in groups to define the elements and map them to the images. They used Omeka software to extract image text and connect files to element fields. The indexing process took longer than expected due to a tornado that damaged the university.
A demonstration of transparent and scalable OpenURL quality metrics for use i...alc28
This document summarizes Adam Chandler's presentation on using OpenURL quality metrics to promote metadata consistency across content providers. It discusses literature on OpenURL and metadata quality, analyzes elements in OpenURLs, and presents Chandler's 2008 findings on common and variable elements. The goal is to build a tool to evaluate OpenURL quality from content providers based on Hughes' metadata evaluation approach and analysis of core OpenURL elements.
The document discusses the LOCAH Project which aims to expose data from the Archives Hub and Copac as linked open data. It describes creating URIs and an RDF data model for archival descriptions. It also discusses enhancing the data by linking to external vocabularies and creating a prototype visualization using tools like Timemap and Simile. Key challenges mentioned include the complexity of archival data and ensuring sustainability and scalability of the linked data.
Charleston 2012 - The Future of Serials in a Linked Data WorldProQuest
The educational objective of this session is to review today’s MARC-based environment in which the serial record predominates, and compare that with what might be possible in a future world of linked data. The session will inspire conversation and reflection on a number of questions. What will a world of statement-based rather than record-based metadata look like? What will a new environment mean for library systems, workflows, and information dissemination?
This document summarizes a workshop on metadata and digital libraries. It discusses the objectives of library systems and how they impact metadata. The workshop introduces Dublin Core metadata and examines how functional requirements inform system design and metadata decisions. Participants analyze sample metadata and use cases to understand these concepts. The summary highlights that system objectives guide metadata, and functional requirements defined through use cases specify required system behaviors and metadata.
OCLC Research @ U of Calgary: New directions for metadata workflows across li...OCLC Research
Presentation used as scene setting for 2 days worth of discussion around library, archive & museum convergence, metadata workflows and single search at the University of Calgary.
This document summarizes the Knowledge Engineering efforts for the TELDAP digital library project. It discusses (1) developing metadata models for different types of digital objects, including a union catalog model and models for websites and documents; (2) establishing hyperlinks between objects and keywords to connect related resources; and (3) constructing ontologies and thesauri like Getty AAT and Chinese WordNet to link keywords and establish implicit relationships between objects. The goal is to optimize access, retrieval and understanding of the large and growing collection of digital content.
Linked data demystified:Practical efforts to transform CONTENTDM metadata int...Cory Lampert
This document outlines a presentation about transforming metadata from a CONTENTdm digital collection into linked data. It discusses the concepts of linked data, including defining linked data, linked data principles, technologies and standards. It then explains how these concepts can be applied to digital collection records, including anticipated challenges working with CONTENTdm. The document describes a linked data project at UNLV Libraries to transform collection records into linked data and publish it on the linked data cloud. It provides tips for creating metadata that is more suitable for linked data.
NSF Workshop Data and Software Citation, 6-7 June 2016, Boston USA, Software Panel
FIndable, Accessible, Interoperable, Reusable Software and Data Citation: Europe, Research Objects, and BioSchemas.org
This document discusses a research project that developed tools to extract biographical authority records from archival finding aids. The project built a large test corpus of Encoded Archival Context - Corporate Bodies, Persons, and Families (EAC-CPF) records and created a prototype system to access these records. Future directions discussed include integrating the merged data back into archival access systems, scaling up the sources of archival finding aids, and adding more linked open data.
1. The document discusses the EOSC Dataset Minimum Information (EDMI) approach for exposing research data in the European Open Science Cloud (EOSC).
2. EDMI defines a set of 12 minimum metadata properties to facilitate finding and accessing datasets without being overly descriptive.
3. The approach was developed by engaging EOSC demonstrator data repositories and repositories to propose methods for exposing metadata in a simple and sustainable way.
Alphabet soup: CDM, VRA, CCO, METS, MODS, RDF - Why Metadata MattersNew York University
This presentation given to University of Iowa Libraries on Nov. 17, 2014, discussing 1) the alphabet soup of metadata standards, e.g. CDM, VRA, CCO, METS, MODS, RDF, including sample tagging and their applications for digital libraries, and 2) why metadata matters. It does not address metadata issues and tools for metadata creation, extraction, transformation, quality control, syndication and ingest.
This document provides an overview of metadata standards and sources of information about standards. It describes what metadata standards are and their purposes. Sources discussed provide basic information on metadata in general and information on specific standards. The document also describes the Encoded Archival Description (EAD) standard in detail, including its benefits and sources of information about EAD. Finally, it discusses cross-walking between standards and upcoming changes to the EAD standard.
This document provides an overview of digital libraries, including definitions, benefits, limitations, components, standards, and challenges. It defines a digital library as a collection of information stored and accessed electronically, extending the functions of a traditional library digitally. Benefits include improved access and searchability, easier information sharing and preservation. Emerging technologies discussed include metadata standards, XML, and protocols like OAI-PMH for metadata harvesting. Common digital library software includes DSpace, Greenstone, and EPrints. Challenges involve digitization, description, legal issues, presentation of heterogeneous resources, and economic sustainability.
This document provides an overview of digital libraries, including definitions, benefits, limitations, components, standards, and challenges. It defines a digital library as a collection of information stored and accessed electronically, extending the functions of a traditional library digitally. Benefits include improved access, information sharing, and preservation, while limitations include technological obsolescence and rights management. Key components discussed include digital objects, metadata, and tools like DSpace and Greenstone for developing digital libraries. Emerging standards around identifiers, encoding, and metadata are also summarized.
Presentation slide for this:
Kei Kurakawa, Toward universal information access on the digital object cloud, In book of abstracts of International Workshop on Data Science - Present & Future of Open Data & Open Science -, p.57-59, November 12-15, 2018, Mishima Citizens Cultural Hall & Joint Support-Center for Data Science Research, Mishima, Shizuoka, Japan
RDA is a new cataloging standard designed to replace AACR2 and provide guidelines for describing digital resources. It is based on FRBR and FRAD which are models that organize information by user tasks and relationships between entities like works, expressions, manifestations and items. RDA aims to be more intuitive for users by providing more detailed descriptions of resources and is being tested by various libraries and organizations before its full implementation. However, some questions remain regarding its costs and benefits compared to AACR2.
Similar to Semantics and Syntax of Dublin Core Usage in Open Archives Initiative Data Providers of Cultural Heritage Materials (20)
The document discusses metadata, including how it is used in cultural heritage organizations and the different types of metadata. It talks about how metadata is stored and shared using databases, XML, and RDF. The presentation notes that metadata standards are evolving due to linked data technologies, which are connecting metadata in larger graphs. As a result, metadata is becoming less separated between organizations and more open and intelligent systems are needed to handle the growing scale and connections in metadata. Cultural heritage organizations need to rethink their workflows and business models in light of these changes.
Designing the Garden: Getting Grounded in Linked DataJenn Riley
Riley, Jenn. “Designing the Garden: Getting Grounded in Linked Data.” Beyond the Looking Glass: Real World Linked Data. What Does it Take to Make it Work? ALCTS Preconference, San Francisco, CA, June 26, 2015.
Riley, Jenn. “Launching metaware.buzz.” Panelist, Experimental Scholarly Publishing: Building New Models with Distributed Communities of Practice”, Digital Library Federation Forum, October 28, 2014, Atlanta, GA.
Riley, Jenn. “Getting Comfortable with Metadata Reuse.” O Rare! Performance in Special Collections: The 54th Annual RBMS Preconference, Minneapolis, June 23 – 26, 2013
Handout for Digital Imaging of PhotographsJenn Riley
This document provides guidelines for digitizing sheet music collections at the Lilly Library, including specifications for file formats, resolution, naming conventions, and scanning procedures. Key steps include wearing gloves, handling pages carefully, scanning pages sequentially in color or grayscale as needed, using consistent pixel dimensions within each item, and recording metadata in a scan log spreadsheet. The goal is to digitally capture all relevant content like illustrations, advertisements, and annotations, while preserving the original order and organization of the physical materials.
The Open Archives Initiative and the Sheet Music ConsortiumJenn Riley
Dunn, Jon and Jenn Riley. “The Open Archives Initiative and the Sheet Music Consortium.” Digital Library Program Brown Bag Presentation, October 10, 2003.
Cushman Exposed! Exploiting Controlled Vocabularies to Enhance Browsing and S...Jenn Riley
Dalmau, Michelle and Jenn Riley. "Cushman Exposed! Exploiting Controlled Vocabularies to Enhance Browsing and Searching of an Online Photograph Collection." Digital Library Program Brown Bag Presentation, May 17, 2004.
The document summarizes the Variations2 project, which is building on an earlier Variations project funded by the National Science Foundation. Variations2 aims to create an integrated digital library of musical works, scores, and recordings. It is staffed by several librarians and supported by various Indiana University departments. The project involves developing a data model and software framework to provide search and retrieval of diverse music formats. Usability research is also being conducted to improve the user experience.
Handout for Merging Metadata from Multiple Traditions: IN Harmony Sheet Music...Jenn Riley
Riley, Jenn. "Merging Metadata from Multiple Traditions: IN Harmony Sheet Music from Libraries and Museums." Digital Library Program Brown Bag Presentation, October 19, 2005.
Merging Metadata from Multiple Traditions: IN Harmony Sheet Music from Librar...Jenn Riley
Riley, Jenn. "Merging Metadata from Multiple Traditions: IN Harmony Sheet Music from Libraries and Museums." Digital Library Program Brown Bag Presentation, October 19, 2005.
Challenges in the Nursery: Linking a Finding Aid with Online ContentJenn Riley
Johnson, Elizabeth, and Jenn Riley. "Challenges in the Nursery: Linking a Finding Aid with Online Content." Digital Library Program Brown Bag Presentation, March 8, 2006.
it describes the bony anatomy including the femoral head , acetabulum, labrum . also discusses the capsule , ligaments . muscle that act on the hip joint and the range of motion are outlined. factors affecting hip joint stability and weight transmission through the joint are summarized.
Thinking of getting a dog? Be aware that breeds like Pit Bulls, Rottweilers, and German Shepherds can be loyal and dangerous. Proper training and socialization are crucial to preventing aggressive behaviors. Ensure safety by understanding their needs and always supervising interactions. Stay safe, and enjoy your furry friends!
How to Fix the Import Error in the Odoo 17Celine George
An import error occurs when a program fails to import a module or library, disrupting its execution. In languages like Python, this issue arises when the specified module cannot be found or accessed, hindering the program's functionality. Resolving import errors is crucial for maintaining smooth software operation and uninterrupted development processes.
Introduction to AI for Nonprofits with Tapp NetworkTechSoup
Dive into the world of AI! Experts Jon Hill and Tareq Monaur will guide you through AI's role in enhancing nonprofit websites and basic marketing strategies, making it easy to understand and apply.
Executive Directors Chat Leveraging AI for Diversity, Equity, and InclusionTechSoup
Let’s explore the intersection of technology and equity in the final session of our DEI series. Discover how AI tools, like ChatGPT, can be used to support and enhance your nonprofit's DEI initiatives. Participants will gain insights into practical AI applications and get tips for leveraging technology to advance their DEI goals.
A review of the growth of the Israel Genealogy Research Association Database Collection for the last 12 months. Our collection is now passed the 3 million mark and still growing. See which archives have contributed the most. See the different types of records we have, and which years have had records added. You can also see what we have for the future.
How to Manage Your Lost Opportunities in Odoo 17 CRMCeline George
Odoo 17 CRM allows us to track why we lose sales opportunities with "Lost Reasons." This helps analyze our sales process and identify areas for improvement. Here's how to configure lost reasons in Odoo 17 CRM
বাংলাদেশের অর্থনৈতিক সমীক্ষা ২০২৪ [Bangladesh Economic Review 2024 Bangla.pdf] কম্পিউটার , ট্যাব ও স্মার্ট ফোন ভার্সন সহ সম্পূর্ণ বাংলা ই-বুক বা pdf বই " সুচিপত্র ...বুকমার্ক মেনু 🔖 ও হাইপার লিংক মেনু 📝👆 যুক্ত ..
আমাদের সবার জন্য খুব খুব গুরুত্বপূর্ণ একটি বই ..বিসিএস, ব্যাংক, ইউনিভার্সিটি ভর্তি ও যে কোন প্রতিযোগিতা মূলক পরীক্ষার জন্য এর খুব ইম্পরট্যান্ট একটি বিষয় ...তাছাড়া বাংলাদেশের সাম্প্রতিক যে কোন ডাটা বা তথ্য এই বইতে পাবেন ...
তাই একজন নাগরিক হিসাবে এই তথ্য গুলো আপনার জানা প্রয়োজন ...।
বিসিএস ও ব্যাংক এর লিখিত পরীক্ষা ...+এছাড়া মাধ্যমিক ও উচ্চমাধ্যমিকের স্টুডেন্টদের জন্য অনেক কাজে আসবে ...
বাংলাদেশ অর্থনৈতিক সমীক্ষা (Economic Review) ২০২৪ UJS App.pdf
Semantics and Syntax of Dublin Core Usage in Open Archives Initiative Data Providers of Cultural Heritage Materials
1. Semantics and Syntax of Dublin Core
Usage in Open Archives Initiative Data
Providers of Cultural Heritage Materials
Arwen Hutt, University of Tennessee
Jenn Riley, Indiana University
2. OAI-PMH
Open Archives Initiative Protocol for Metadata Har
Originally developed for sharing metadata
about e-prints
Two players
Data providers
Service providers
Requires unqualified Dublin Core be exposed
for all resources, but supplemental metadata
formats are allowed
3. Dublin Core [Unqualified]
Simple, flexible metadata format
15 elements
All repeatable
None required
“Core” across all knowledge domains
4. “Cultural heritage” defined
The intellectual creative and material output
of society
Libraries, museums and archives generally
considered cultural heritage institutions
Often primary source materials
Tend to be older analog digitized for network
access
5. Significant variability in OAI metadata
Ward: found that only a small number of DC
elements were used in the majority of OAI
records
Liu: Arc service provider studied controlled
vocabulary usage in DC subject, type, format,
language, and date fields
NSDL: found errors missing data, incorrect
data, confusing data, insufficient data
UIUC: date, coverage, format, and type
vocabulary varies significantly
6. Goals of the study
Focus on cultural heritage community
Examined 3 DC fields: date, creator,
contributor
Semantic content
Syntactic form
Results could inform community best
practices
One step towards improving the overall
quality of OAI metadata
7. Harvesting statistics
Successfully harvested metadata from 35
data providers
750,945 total records harvested
5% sample* from each data provider taken
for analysis (37,564 records)
* Minimum of 1 record per provider, values rounded up to the nearest whole number
8. Processing steps
Date, creator, contributor elements extracted
into “silos”
Repeated values grouped, keeping
connections between elements and the
records in which they appeared
Certain characteristics tracked about each
element
Example
9. Characteristics recorded for all
elements
The presence of multiple discrete values in a
single element
<creator>Hutt, Arwen; Riley, Jenn</creator>
The presence of pseudo-qualifiers within the
value that refined the meaning of the element
<creator>Berlin, Irving
[composer]</creator>
Whether the value was appropriate within the
specified element based on DC rules and
usage guidelines
<date>Las Vegas, Nevada</date>
10. Additional characteristics of <date>
The semantic type of the value (creation, copyright or
digitization)
<date>2000</date>
The general specificity of the date (single date, range
or period)
<date>19th Century</date>
Indication that a date is not definitive (that it is
estimated or approximate)
<date>ca. 1930</date>
Whether the value is purely numeric or contains non-
numeric text
<date>March 18, 1902</date>
11. Additional characteristics of <creator>
and <contributor>
The semantic type of the value (personal
name, corporate name or other)
<creator>Newton, Isaac</creator>
Whether the entity is known, unknown or
ambiguous
<creator>Vermeer, Johannes, 1632-1675 ?</creator>
Whether the value is inverted or in direct
order
<creator>Charles Schultz</creator>
12. Strategies for categorization
Automatic
Iteratively developed
Pattern matching
Identification of commonly occurring values
Manual
Where feasible
Not perfect!
13. Findings for <date>
Values largely appropriate for element
Few “pseudo-qualifiers”
Different events represented
Values mostly numeric
Many dates not expressible in W3CDTF
14. Findings for <creator>
Values largely appropriate for element
Most were personal names
Many “pseudo-qualifiers,” in comparison to
other elements
Often included information intended to
disambiguate a name
Some indication of the use of controlled
vocabularies, but many different name forms
present
15. Findings for <contributor>
Used infrequently
Many values inappropriate for element
Majority personal names, but higher
proportion of corporate names than
occurred in <creator>
Few “pseudo-qualifiers”
16. OAI DC record & intellectual object
1:1 principle – each DC record describes only
one version of a resource
BUT
Cultural heritage materials often digitized
from analog originals, resulting in multiple
versions of each intellectual object
17. OAI DC record & intellectual object
Two choices for data providers
Adhere
to 1:1 rule but omit pertinent
information
Violate the 1:1 rule but create more
complete records
Many data providers in practice violate
the 1:1 rule
18. OAI DC record & aggregated search
environment
Extraction of records from original
collection context
Aggregation with records from other
collections
19. Moving towards better metadata –
some possibilities
Remove the OAI requirement for simple
Dublin Core (or “the Nuclear Option”)
Develop best practice documentation for
cultural heritage materials that deviate from
current DC best practice
Combination of data provider education and
service provider normalization
Improved communication between data and
service providers
Encourage use of other metadata formats
supplementing simple DC
20. Some other relevant initiatives
Digital Library Federation and NSDL OAI and
Shareable Metadata Best Practices Working
Group
Development of general OAI best practices
Development of strategies for communication
with vendors
DLF Aquifer Metadata Working Group
Development of profile for DLF institutions
(strong focus on cultural heritage)
Recommendations for specific metadata
elements
21. Plans for extension of this research
Primary analysis of the subject, coverage and
publisher elements
Analyze temporal information across date,
subject and coverage elements
Analyze geographic information across
subject and coverage elements
Analyze name information across creator,
contributor and publisher elements
Our study was performed on metadata shared by OAI-PMH.
OAI is protocol for sharing metadata, not content.
Data providers “expose” metadata for service providers to come get. Service providers make use of that metadata in some way. Currently, by far the most common service provided is cross-repository searching. Our study focused on data providers of cultural heritage materials.
Explain the difference b/n qualified and simple dc
Example of 1:1 principle – mona lisa painting, leonardo painted many years ago– digital image created by jenn riley 2000
All the research in this area talks about the variability problem
Quickly!
What we work with
Better chance of finding patterns within a general community
Semantic content – the meaning of the content of a metadata element
Syntactic form – the structure or format of the value
Processing performed with perl and xslt scripts
Show sample
Grouping
Link back to original record
Attributes – some attributes are used all silos but there are some that are specific to the different elements
[We picked characteristics to record based on our experience as OAI data providers and on reports in the OAI literature]
Date specificity – ming dynasty, 1980’s, 1900-1910, etc.
Perl scripts
Not perfect
Too many records to manually check each one
Certain characteristics require subjective judgements
Digitization and creation most prevalent, but a few copyright also.
3 to 1 numeric to textual
W3CDTF – profile of ISO 8601 recommended by Dublin Core as a best practice for date encoding but 17% of dates cannot be represented by it.
we’re seeing a need for better support for variations on date values
Extraction – on the horse example, Roosevelt
Aggregation
– Administrative metadata is rarely useful in an aggregated environment.
- System hacks like to make all years of a date range searchable typing in every year in the range.