This presentation given to University of Iowa Libraries on Nov. 17, 2014, discussing 1) the alphabet soup of metadata standards, e.g. CDM, VRA, CCO, METS, MODS, RDF, including sample tagging and their applications for digital libraries, and 2) why metadata matters. It does not address metadata issues and tools for metadata creation, extraction, transformation, quality control, syndication and ingest.
1. Alphabet Soup: CDM, VRA, CCO, METS,
MODS, RDF – Why Metadata Matters
Amanda Xu
Candidate for Metadata Analyst at
Iowa University Libraries
2. What is CDM (1)
• CDM = OCLC CONTENTdm Digital Collection Management
Software
• CONTENTdm 6.8 was released on June 16, 2014 with
improvements to Website configuration tool, XML
sitemaps, and expanded range of available image download
sizes
• CDM collection metadata fields compatible to qualified
Dublin Core, VRA, OAI-PMH, etc.
• 125 fields per collection
• 128,000 characters per field
• Initiatives in 1) leveraging linked data in WorldCat via
WorldCat Digital Collection Gateway or 2) transforming
CONTENTdm metadata into linked open data
3. What is CDM (2)
SOURCE: Ingram, G. (2014, August). OCLC CONTENTdm and the WorldCat Digital
Collection Gateway. Available: http://www.contentdm.org/USC/training/CONTENTdm-DCG-Overview-20140812.pdf
4. Sample CDM Metadata Field Properties
SOURCE: Best Practices for CONTENTdm and other OAI-PMH Compliant Repositories;
Creating Shareable Metadata (version 3.1)
5. Sample CDM OAI-PMH Record Using Qualified
Dublin Core from Iowa Digital Library (1)
related collections
6. Sample CDM OAI-PMH Record Using Qualified
Dublin Core from Iowa Digital Library (2)
sound
SOURCE: Univ. of Iowa Libraries. Digital Research and Publishing (2014).
Metadata extracted from OAI feed of records for materials in the Iowa Digital Library
Available: http://www.lib.uiowa.edu/drp/technology/
7. Sample CDM Compound Object Record Display
from Iowa Digital Library (1)
Audio player
SOURCE: Iowa Digital Library. Oral history interview with Dora Martin Berry, August 14, 1997 [excepts]
Available: http://digital.lib.uiowa.edu/cdm/ref/collection/aawiowa/id/2102
8. Sample CDM Compound Object Record Display
from Iowa Digital Library (2)
Sound
SOURCE: Iowa Digital Library. Oral history interview with Dora Martin Berry, August 14, 1997 [excepts]
Available: http://digital.lib.uiowa.edu/cdm/ref/collection/aawiowa/id/2102
9. Sample CDM Compound Object Record Display
from Iowa Digital Library (3)
Page not found
SOURCE: Iowa Digital Library. Oral history interview with Dora Martin Berry, August 14, 1997 [excepts]
Available: http://digital.lib.uiowa.edu/cdm/ref/collection/aawiowa/id/2102
10. Sample CDM Compound Object Record Display
from Libraries’ Smart Search (1)
SOURCE: University of Iowa Libraries Smart Search.
Available: http://smartsearch.uiowa.edu/
Page not found
11. Sample CDM Compound Object Record Display
from Libraries’ Smart Search (2)
Regroup related collections into one field
12. Sample CDM Compound Object Record Display
from Libraries’ Smart Search (3)
Refine search result by genre type “sound”
13. Initiatives in leveraging linked data in WorldCat
via WorldCat Digital Collection Gateway
SOURCE: Sarason, C. (2014, May). CONTENTdm update. CONTENTdm Users Group Meeting at
University of South Carolina.
Available: http://scholarcommons.sc.edu/cgi/viewcontent.cgi?article=1015&context=cdmusers
14. Initiative in Transforming CONTENTdm Metadata
into Linked Open Data
SOURCE: Lampert, C.K. & Southwick, S.B. (2013). Transforming CONTENTdm metadata into linked open data.
CONTENTdm Virtual Conference, December 5, 2013.
Available: http://www.contentdm.org/USC/training/2013/Lampert-Southwick-20131205.pdf
15. What is VRA Core
• “A data standard for the description of works of visual
culture as well as the images that document them”
• Developed and maintained by the VRA Core Oversight
Committee
• Current version VRA 4.0 in compliance to XML schema,
released 4/9/2007
• VRA Core implementation from ARTstor
• In 2013, Linked Data in VRA 4.0 project converting VRA
xml records into RDF/XML was completed by Jeff
Mixter as his master thesis submitted to the College of
Communication and Information of Kent State
16. Changes to Elements Between VRA 3.0 and 4.0
SOURCE: VRA Core 4.0 introduction. Available: http://www.loc.gov/standards/vracore/VRA_Core4_Intro.pdf
17. Outline of VRA Core 4.0 Elements
SOURCE: Eklund, J. “Got metadata?” VRA Core XML SEI 2007. Available:
http://core.vraweb.org/pdfs/core4_xml.pdf
18. VRA Core 4.0 XML Schema
• “A schema is simply an XML file that contains the rules for what can and
cannot reside in an XML data file.”
• “Specifying the valid elements, sub-elements, and attributes as well as
the values they can contain (either by data type or by providing a list of
values)”
• “Specifying which elements are required and/or are repeatable”
• “Controlling the structure by specifying which elements can be nested”
• Having two versions of the VRA Core 4.0 schema: unrestricted imposing no
restrictions on the values entered, and restricted imposing restrictions on
the data values entered into the type attributes
• Enabling the export of information from relational or proprietary database
to XML for the purposes of sharing that data beyond local system
SOURCE: VRA Core 4.0 introduction. Available:
http://www.loc.gov/standards/vracore/VRA_Core4_Intro.pdf
19. Screenshot of VRA Core 4.0 Restricted Schema
Type Values
SOURCE: http://www.loc.gov/standards/vracore/VRA_Core4_Restricted_schema_type_values.pdf
20. VRA Core 4.0 Example from
http://aal.ucsd.edu/vracore4/example001.html (1)
21. VRA Core 4.0 Example from
http://aal.ucsd.edu/vracore4/example001.html (2)
22. VRA Core 4.0 Example from
http://aal.ucsd.edu/vracore4/example001.xml (3)
Work
Image
23. VRA Core 4.0 Example from
http://aal.ucsd.edu/vracore4/example001.xml (4)
24. VRA Core 4.0 Example from
http://aal.ucsd.edu/vracore4/example001.xml (5)
25. VRA Core 4.0 Example from
http://aal.ucsd.edu/vracore4/example001.xml (6)
26. VRA Core Implementation from ARTstor (1)
SOURCE: ARTstor Digital Library. Available: http://www.artstor.org/index.shtml
27. VRA Core Implementation from ARTstor (2)
Create image group in user preferred work folder
SOURCE: ARTstor Digital Library. Available: http://www.artstor.org/index.shtml
28. Screenshot of Linked Data in VRA Core 4.0
Project
SOURCE: Mixter, J. (2013). Linked data in VRA Core 4.0: Converting VRA XML records
into RDF/XML. Available: http://jmixter.s3-website-us-east-1.amazonaws.com/thesis/
29. What is CCO (1)
• CCO = Cataloging Cultural Objects: A Guide to Describing Cultural
Works and Their Images for Cultural Heritage Community, including
Museums, Libraries, Image Collections and Archives
• Provides a set of cataloging rules and syntax conventions
surrounding various elements from CDWA (Categories for
Description of Works of Art) (which contains elements and rules)
and VRA Core (which contains elements); it is more directly
analogous to AACR and DACS (Describing Archives: A Content
Standard)
• Promotes cataloging best practice for creating sharable metadata,
building common practice, complementing diverse data structure,
and improving discovery and access of cultural works
• Sponsored by the Visual Resources Association Foundation
• Published by the American Library Association (ALA) in June 2006
30. What is CCO (2)
SOURCE: CCO Commons. Available from
http://cco.vrafoundation.org/index.php/toolkit/cco_pdf_version/
32. A Sample CCO Record from CCO Commons
SOURCE: CCO Commons. Category examples: 4 paintings.
http://cco.vrafoundation.org/index.php/toolkit/index_of_examples/category_examples_4_paintings/example_1_predella_panel_named_artist
33. Screenshot of Metadata Standards Crosswalk
from Getty Research Institute
SOURCE: Getty Research Institute (2009). Metadata standards crosswalk. Available:
http://www.getty.edu/research/publications/electronic_publications/intrometadata/crosswalks.html
34. What is METS
• METS = Metadata Encoding & Transmission Standard
• Provides a standardized XML format for transmission of complex
digital library objects between systems by representing their
hierarchical structure, the names and locations of the files that
comprise those objects, and the associated metadata in XML
• Current version 1.10, adding International Council of Museums’
LIDO (Lightweight Information Describing Objects) to MDTYPE
enumeration
• In discussion of METS 2.0 data model compatible to Semantic Web
and other technologies
• Maintained by METS Editorial Board, committed to maintaining
editorial control over METS, its XML Schema, METS Profile XML
Schema, and official METS documentation
• Hosted by Library of Congress, and developed as an initiative of the
Digital Library Federation
35. <mets>
<metsHdr>
<dmdSec>
<amdSec>
<fileSec>
<structMap>
<structLink>
<behaviorSec>
mets Header
descriptive metadata Section
administrative metadata Section
file Section
structural Map section
structural Link section
behavior Section
Seven Major Sections of a METS Document
SOURCE: Bredenberg, K. (2011, June). METS. Communicating Archival
Metadata
conference and workshops in Stockholm 28 - 30 June 2011
36. <mets> METS root element
• Establishes the container for the information
being stored and /or transmitted by the
standard
• Has optional attributes: ID, OBJID, Label, Type,
Profile
37. <METS> Example from Hathitrust
Google Book Mets.xml
<METS:mets
xmlns:METS="http://www.loc.gov/METS/"
xmlns:xlink="http://www.w3.org/1999/xlink"
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
xmlns:gbs="http://books.google.com/gbs"
xmlns:PREMIS="info:lc/xmlns/premis-v2"
xmlns:marc="http://www.loc.gov/MARC21/slim"
xsi:schemaLocation="http://www.loc.gov/METS/
http://www.loc.gov/standards/mets/mets.xsd http://www.loc.gov/MARC21/slim
http://www.loc.gov/standards/marcxml/schema/MARC21slim.xsd
info:lc/xmlns/premis-v2 http://www.loc.gov/standards/premis/premis.xsd
OBJID="UOM_39015000024524"
PROFILE="gbs">
SOURCE: HathiTrust Google METS Example
http://www.hathitrust.org/documents/example-source-google-mets.xml
38. <metsHdr>
• Captures metadata about the METS document itself
• <agent>provides for various parties and their roles with respect to
the METS record to be documented
• <name>record s full name of document agent
• <note>records any additional information regarding the agent's
activities with respect to the METS document
• ROLE (string/R) specifies the function of the agent with respect to
the METS document
• TYPE(string/O) specifies the type of AGENT, e.g. individual or
organization
• <altRecordID>uses alternative record identifier values for the digital
object represented by the METS document
• <metsDocumentID>a unique identifier for the METS document
itself
39. <metsHdr> Example from Hathitrust Google
Book Mets.xml
<METS:metsHdr ID="UOM_39015000024524" C
REATEDATE="2010-07-
07T05:46:24" RECORDSTATUS="NEW">
<METS:agent ROLE="CREATOR" TYPE="ORGAN
IZATION">
<METS:name>Google</METS:name>
</METS:agent>
</METS:metsHdr>
40. <dmdSec>
• Records descriptive metadata pertaining to the
METS object as a whole or one of its components
• A descriptive metadata element can either wrap
the metadata (mdWrap) or reference it in an
external location (mdRef) or both
• Descriptive metadata can be expressed according
to many current description standards (i.e.,
MARC, MODS, Dublin Core, TEI Header, EAD, VRA,
FGDC, DDI) or a locally produced XML schema
41. <dmdSec> Example from Hathitrust Google
Book Mets.xml (1)
<METS:dmdSec xmlns="http://www.loc.gov/MARC21/sli
m" ID="DMDMARC">
<METS:mdWrap MIMETYPE="text/xml" MDTYPE="MA
RC" LABEL="Source MARC record">
<METS:xmlData>
<record>…
</record>
</METS:xmlData>
</METS:mdWrap>
</METS:dmdSec>
42. <dmdSec> Example from Hathitrust Google
Book Mets.xml (2)
SOURCE: http://www.hathitrust.org/documents/example-source-google-mets.xml
43. <amdSec>
• Contains the administrative metadata pertaining to the
digital object, its components and any original source
material from which the digital object is derived
• Separated into four sub-sections that accommodate
technical metadata (techMD), intellectual property
rights (rightsMD), analog/digital source metadata
(sourceMD), and digital provenance metadata
(digiprovMD)
• Each of these subsections can either wrap the
metadata (mdWrap) or reference it in an external
location (mdRef) or both
44. <amdSec> Example from Hathitrust Google Book
Mets.xml
<METS:amdSec>
<METS:techMD ID="TMD1">
<METS:mdWrap LABEL="production
notes" MDTYPE="OTHER" OTHERMDTYPE="Google Production Notes">
<METS:xmlData>
<gbs:productionNotes>
<gbs:badPages>3.05</gbs:badPages>
<gbs:missingPages>0.00</gbs:missingPages>
<gbs:tightBoundPages>0.46</gbs:tightBoundPages>
</gbs:productionNotes>
</METS:xmlData>
</METS:mdWrap>
</METS:techMD>
…
45. <techMD> Example from Hathitrust Google
Book Mets.xml
<METS:techMD ID="TABLE_OF_CONTENTS">
<METS:mdWrap LABEL="page
tags" MDTYPE="OTHER" OTHERMDTYPE="Google">
<METS:xmlData>
<gbs:pageTag>TABLE_OF_CONTENTS</gbs:pageTag>
</METS:xmlData>
</METS:mdWrap>
</METS:techMD>
46. <sourceMD> Example from Hathitrust Google
Book Mets.xml
<METS:sourceMD ID="S1">
<METS:mdWrap MDTYPE="OTHER" OTHERMDTYPE="Googl
e" LABEL="DESCMDRecord_Source_Library">
<METS:xmlData>
<gbs:sourceLibrary>UOM</gbs:sourceLibrary>
<gbs:sourceIdentifier>39015000024524</gbs:sourceIdentif
ier>
</METS:xmlData>
</METS:mdWrap>
</METS:sourceMD>
47. <digiprovMD> Example from Hathitrust Google
Book Mets.xml (1)
<METS:digiprovMD xmlns="info:lc/xmlns/premis-v2" ID="PREMIS1">
<METS:mdWrap MDTYPE="PREMIS">
<METS:xmlData>
<PREMIS:premis version="2.0">
<PREMIS:object xsi:type="representation">
<PREMIS:objectIdentifier>
<PREMIS:objectIdentifierType>barcode</PREMIS:objectIdentifierType>
<PREMIS:objectIdentifierValue>UOM_39015000024524</PREMIS:objectIdentifierValue>
</PREMIS:objectIdentifier>
<PREMIS:significantProperties>
<PREMIS:significantPropertiesType>file count</PREMIS:significantPropertiesType>
<PREMIS:significantPropertiesValue>526</PREMIS:significantPropertiesValue>
</PREMIS:significantProperties>
<PREMIS:significantProperties>
<PREMIS:significantPropertiesType>page count</PREMIS:significantPropertiesType>
<PREMIS:significantPropertiesValue>262</PREMIS:significantPropertiesValue>
</PREMIS:significantProperties>
</PREMIS:object>
48. <digiprovMD> Example from Hathitrust Google
Book Mets.xml (2)
…
<PREMIS:agent>
<PREMIS:agentIdentifier>
<PREMIS:agentIdentifierType>LCNAF</PREMIS:agentIdentifierType>
<PREMIS:agentIdentifierValue>Google
(Firm)</PREMIS:agentIdentifierValue>
</PREMIS:agentIdentifier>
<PREMIS:agentType>organization</PREMIS:agentType>
</PREMIS:agent>
</PREMIS:premis>
</METS:xmlData>
</METS:mdWrap>
</METS:digiprovMD>
</METS:amdSec>
49. <fileSec>
• Provides an inventory of and the location for
the content files that comprise the digital
object being described in the METS document
• <fileGrp> grouping the digital files
compromising the content of a METS object
either into a flat file arrangement or into a
nested arrangement
50. <fileSec> Example from Hathitrust
Google Book Mets.xml
<METS:fileSec>
<METS:fileGrp ID="FG1" USE="image">
<METS:file ID="IMG00000001" MIMETYPE="image/jp2" SEQ="0000
0001" CREATED="2008-03-
27T00:00:00" SIZE="251854"CHECKSUM="0da406ad7d1957a4d544
648bf504f518" CHECKSUMTYPE="MD5" OWNERID="135107988881
16402-5">
<METS:FLocat xlink:href="00000001.jp2" LOCTYPE="OTHER" OTHER
LOCTYPE="SYSTEM"/>
</METS:file> …
</METS:fileGrp>
</METS:fileSec>
51. <structMap>
• Provides a means for organizing the digital content
represented by the <file> elements in the <fileSec> of
the METS document into a coherent hierarchical
structure
• Facilitates users’ understanding of structural
relationship of the content files or parts of the content
files
• Encoded as a tree of nested <div>, and child file
pointer <fptr>
• Provides a means for linking content at any hierarchical
level with relative descriptive and administrative
metadata
53. <structLink>
• Allows for the specification of hyperlinks
between the different components of a METS
structure that are delineated in a structural
map
• <smLink> indicating two a hyperlink between
two nodes in the structural map
54. <behaviorSec>
• Provides a means to link digital content with applications or
computer programming code that can be used in
conjunction with other info in the METS doc to render or
display the digital object or transform one or more of its
component content files
• Associates executable behaviors with content in the METS
document by means of a repeatable behavior <behavior>
element
• <interfaceDef> represents an abstract definition of the set
of behaviors represented by a particular behavior section
• <mechanism> points to a module of executable code that
implements and runs the behavior defined by the interface
definition
55. Screenshot of Sample Hathitrust
Google Book Mets.xml
SOURCE: http://catalog.hathitrust.org/Record/001225921
56. Importance of METS
• Provides the means to specify metadata necessary for both
management of digital objects within a repository and the exchange
of such objects between repositories (or between repositories and
their users)
• Allows the sharing of efforts to develop information management
tools/services and facilitates the interoperable exchange of digital
materials among institutions (including vendors)
• Used in the role of Submission Info Package (SIP), Archival Info
Package (AIP) or Dissemination Info Package (DIP) within Open
Archival Info System (OAIS) Reference Model
SOURCE: METS Primer, available:
http://www.loc.gov/standards/mets/mets-schemadocs.html
57. What is MODS
• MODS = Metadata Object Description Schema
• Is an XML schema for a bibliographic element set that may be used for a variety of
purposes, and particularly for library applications
• Includes a subset of MARC fields and uses language-based tags rather than numeric
ones
• Regroups elements from MARC 21 bibliographic format
• Current version MODS 3.5 Schema, released July 8, 2013
• MARC 21 to MODS 3.5 mapping, revised October 2014
• MODS XSLT 1.0 Stylesheets for 1) MARCXML to MODS 3.5; 2) MOD 3.5 to DC updated in
August 2014
• Maintained by the MODS Editorial Committee with support from the Network
Development and MARC Standards Office of the LC
• MODS RDF Ontology in draft stage
SOURCES:
1. MODS: Uses and Features. Available: http://www.loc.gov/standards/mods/mods-overview.html
2. MODS: Conversions. Available: http://www.loc.gov/standards/mods/mods-conversions.html
2. MODS RDF Ontology. Available: http://www.loc.gov/standards/mods/modsrdf/
58. Changes for MODS 3.5
• Add attribute @unit to <physicalDescription><extent>
• Add rfc5646 as value for <language><languageTerm>
• Add @altFormat and @contentType attributes to <titleInfo>, <abstract>,
<tableOfContents> and <accessCondition> (Approved 2012-09-17)
• Add @eventType to <originInfo> to indicate production, publication,
distribution, manufacture
• Add @typeURI to <identifier>, <note>, and <physicalDescripton>/<note>
• Add @generator to <classification>
• Add <etal> element, a subelement of <name>
• Add @otherType to <titleInfo> element
SOURCE: Changes for MODS 3.5. Available: http://www.loc.gov/standards/mods/changes-3-
5.html
59. MODS Elements
SOURCE: Outline of elements and attributes in MODS version 3.5. Available:
http://www.loc.gov/standards/mods/mods-outline-3-5.html
60. MODS TitleInfo Element, SubElements, and
Attributes (1)
SOURCE: Outline of elements and attributes in MODS version 3.5. Available:
http://www.loc.gov/standards/mods/mods-outline-3-5.html
61. MODS Name Element, SubElements, and
Attributes (2)
SOURCE: Outline of elements and attributes in MODS version 3.5. Available:
http://www.loc.gov/standards/mods/mods-outline-3-5.html
62. MODS RelatedItem Element, SubElements,
Attributes (3)
SOURCE: Outline of elements and attributes in MODS version 3.5. Available:
http://www.loc.gov/standards/mods/mods-outline-3-5.html
63. Screenshot of “Library of Congress march” using
MODS and METS (1)
SOURCE: http://lcweb2.loc.gov/diglib/ihas/loc.natlib.ihas.100010802/mets.xml
64. Screenshot of “Library of Congress march” using
MODS and METS (2)
SOURCE: http://lcweb2.loc.gov/diglib/ihas/loc.natlib.ihas.100010802/mets.xml
65. Screenshot of “Library of Congress march” using
MODS and METS (3)
SOURCE:
http://lcweb2.loc.gov/diglib/ihas/loc.natlib.ihas.100010802/mets.xml
66. Screenshot of “Library of Congress march” using
MODS and METS (4)
SOURCE: http://lcweb2.loc.gov/diglib/ihas/loc.natlib.ihas.100010802/mets.xml
67. Screenshot of “Library of Congress march” using
MODS and METS (5)
SOURCE: http://lcweb2.loc.gov/diglib/ihas/loc.natlib.ihas.100010802/mets.xml
68. Screenshot of “Library of Congress march” using
MODS and METS (6)
SOURCE: http://lcweb2.loc.gov/diglib/ihas/loc.natlib.ihas.100010802/mets.xml
69. Screenshot of “Library of Congress march” using
MODS and METS (7)
Master & Service
SOURCE: http://lcweb2.loc.gov/diglib/ihas/loc.natlib.ihas.100010802/mets.xml
70. Screenshot of “Library of Congress march” using
MODS and METS (8)
Master & Service
SOURCE: http://lcweb2.loc.gov/diglib/ihas/loc.natlib.ihas.100010802/mets.xml
71. Screenshot of “Library of Congress march” using
MODS and METS (9)
SOURCE: http://lcweb2.loc.gov/diglib/ihas/loc.natlib.ihas.100010802/default.html
72. Screenshot of “Library of Congress march” using
MODS and METS (10)
SOURCE: http://lcweb2.loc.gov/diglib/ihas/loc.natlib.ihas.100010802/default.html
73. Importance of MODS (1)
• Used as an SRU (Search/Retrieval via URL) specified
format, e.g. MODS 3.3
• Used as an extension schema to METS
• Represents metadata for harvesting
• Provides resource description in XML syntax
• Represents a simplified MARC record in XML
• Bundles metadata in xml with an electronic resources
• Supports discovery and management of resources, and
access to them, as well as exchange and management
of encoded descriptions
74. Importance of MODS (2)
7
6
5
4
3
2
1
0
Rich Compatible User-oriented Simple
MODS
DC
ONIX
MARC
75. What is RDF
• RDF = Resource Description Framework
• A standard model for data interchange on the Web through RDF statement in
subject-predicate-object triples, including international resource identifiers (IRIs),
or literals, blank nodes, multiple graphs
• Provides linking through IRIs by setting RDF links between datasets to enable the
discovery of additional data and to support the integration of data from multiple
sources
• Provides semantic info about resources through RDF vocabularies often defined by
RDF schema, e.g. FOAF, Dublin Core, Schema.org, SKOS
• Serialized in various formats, e.g. Turtle, JSON-LD, RDFa, RDF/XML
• Allows systems to make logical inferences by defining a model-theory semantics
for RDF graphs, the RDF, and RDFS vocabularies, e.g. {ex:allen foaf:knows ex:mary
.}(foaf:knows rdfs:domain foaf:person .} ->{ex:allen rdf:type foaf:person .}
• Allows datasets to be published and interlinked on the Web using RDF, e.g. linked
data, queried and manipulated through SPARQL
• Current version RDF 1.1, published in 2014
• Acceptance by libraries, museums and cultural heritage communities for open
linked data initiatives
76. Importance of RDF
• Adds machine-readable information to web pages using, e.g.
schema.org vocabulary, enabling them to be displayed in an
enhanced format on search engines or to be processed
automatically by third-party applications
• Enriches a dataset by linking it to third-party datasets
• Interlinks API feeds, making sure that clients can easily discover
how to access more information
• Uses the datasets currently published as linked data, and builds
aggregations of data around specific topics
• Builds distributed social networks by interlinking RDF descriptions
of people across multiple web sites
• Provides a standards-compliant way for exchanging data between
databases
• Interlinks various datasets within an organization, enabling cross-dataset
queries to be performed using SPARQL
SOURCE: 1. W3C RDF 1.1 Primer, W3C Working Group note 24 June 2014.
Available: http://www.w3.org/TR/rdf11-primer/
2. Schmachtenberg, M., Bizer, C., & Paulheim, H. (2014). Adoption of the Linked Data best practices in different topic domains. 13th International Semantic Web Conference (ISWC2014) –RDB
Track. Available: http://linkeddatacatalog.dws.informatik.uni-mannheim.de/state/#toc8
85. Screenshot of WorldCat Linked Data Explorer (1)
SOURCE: Gandi, an autobiography story of my experiments with truth.
Available: http://experiment.worldcat.org/entity/work/data/1151002411
86. Screenshot of WorldCat Linked Data Explorer (2)
SOURCE: Gandi, an autobiography story of my experiments with truth.
Available: http://experiment.worldcat.org/entity/work/data/1151002411
87. Screenshot of VIAF (1)
SOURCE: VIAF. Available: http://viaf.org/viaf/14817771/
88. Screenshot of VIAF (2)
SOURCE: VIAF. Available: http://viaf.org/viaf/14817771/
89. Screenshot of VIAF (3)
SOURCE: VIAF. Available: http://viaf.org/viaf/14817771/
91. Screenshot of FAST Linked Data
SOURCE: Available: http://experimental.worldcat.org/fast/1354198/
92. Acceptance by Libraries, Museums and Cultural
Heritage Communities for Open Linked Data Initiatives
SOURCE: Crawlable linked datasets as of April 2014.
Available: http://linkeddatacatalog.dws.informatik.uni-mannheim.de/state/LODCloudDiagram.html
93. Why Metadata Matters
• Markup different types of digital resources by applying content description
rules and administration best practices, controlled vocabularies, metadata
standards, e.g. Qualified Dublin Core, VRA Core 4.0, CCO, METS, MODS,
RDF, etc.
• Supports intended users, including end users who search and discover the
resources, and stakeholders who develop and manage the resources
• Supports intended uses, e.g. searching, browsing, navigating, discovering,
consuming, publishing, accessing, citing, linking, repurposing,
interchanging, sharing, archiving, preserving, managing, right
management, securing, reporting and analysis, accessibility, etc.
• Indicates how resources get used, e.g. usage analysis by metadata fields
• Enables systems capabilities, e.g. out of box metadata generation and
capture
• Enables integration, analysis, and enhancement of digital resources
despite of systems, disciplines, data types, formats, file sizes, granularity,
etc.
SOURCE: Xu, A. (2014, Feb.) Role of Cataloger in the 21st Century Academic Library.
Available: http://www.slideshare.net/elephantsmith/interview-presentation-roleofcataloger21stcenturyacademiclibrary
Editor's Notes
University of Iowa Libraries rock at digitization and institutional repository services, featuring close to a million digitized texts, images, audio and video recordings, as well as Iowa Research Online. In FY 2013, Iowa Digital Library had 1,071,000 page views, accessing over 771,000 unique pages. The impressive usage has a lot to do with the Libraries’ effort in increased use of standards and integrated discovery tools. Metadata is the heart of the game. This presentation will take a look at a few metadata standards and their applications that are currently in use by library and cultural heritage communities. It will not address metadata issues and tools for metadata creation, extraction, transformation, quality control, syndication and ingest.
The Artstor Digital Library is a nonprofit resource that provides over 1.8 million digital images in the arts, architecture, humanities, and sciences with an accessible suite of software tools for teaching and research.
The project began in August of 2012. The objectives were to map the existing VRA restricted schema into RDF and then create a XSLT stylesheet that could be used to convert existing VRA/XML data into RDF. As part of the project, a proposed VRA ontology was produced and published. Also, an XSLT stylesheet has been created and tested using a collection sample provided by a member of the VRA listserv. The results of the study were published through OhioLink and are also availiable from the Downloads section of this webpage.
The first part, "General Guidelines", delineates the principles of CCO, and includes extensive sections on levels of description, related works, database design and relationships, and authorities and controlled vocabularies.
Part Two, "Elements", contains a chapter for each element of description. Cataloging rules and recommendations follow, including dealing with capitalization and abbreviations, syntax, terminology, and applying the rules to various materials, as well as any other special aspects of the element. Recommendations are made regarding use of free-text and controlled vocabulary fields, as well as which fields are repeatable or not, referring back to discussions of principles of description, user access, and database design in Part One.
Part Three, "Authorities", covers controlled vocabulary types familiar to library catalogers--subjects and personal, corporate, and geographic names--while also adding the concept authority, which provides controlled vocabulary as well as hierarchical and cross-reference structure for generic terminology in the field.
1) Element set is richer than Dublin Core; 2) The element set is more compatible with library data than ONIX; 3) the schema is more end user-oriented than full MARCXML schema; 4) the element is simpler than the full MARC format