Influencing policy (training slides from Fast Track Impact)
Adlug annual meeting 2013
1. 32st ADLUG ANNUAL MEETING 2013
ARTIUM and the Fundación Sancho El Sabio – Vitoria-Gasteiz
16th – 18th October 2013
RDA, Bibframe, Linked data: follow the
evolution of cataloging inside the LMS
Tiziana Possemato
Copyright 2009-2010 @CULT. All rights reserved
2. The global environment
Where we are moving…
Resource Description and Access
Functional Requirements for Authority Data
Bibframe
Functional Requirements for Bibliographic Records
International Cataloguing Principles
Copyright 2009-2010 @CULT. All rights reserved
Semantic web/Linked data
2
3. FRBR’s Entity-Relationship Model
• Entities
• Relationships
• Attributes
relationship
One Entity
Another Entity
• National level required elements
Copyright 2009-2010 @CULT. All rights reserved
3
5. FRBR: inherent Group 1 Relationships
Work
Group 1
is realized through
Expression
is embodied in
Manifestation
recursive
one
many
Copyright 2009-2010 @CULT. All rights reserved
is exemplified by
Item
6. FRBR: Relationships between Groups 1 & 2
Work
Expression
Manifestation
Item
is owned by
is produced by
is realized by
is created by
Copyright 2009-2010 @CULT. All rights reserved
Group 2
Person
Family
Corporate Body
7. ICP: Statement of International
Cataloguing Principles
ICP general principles
•
•
•
•
•
•
•
•
•
•
•
Convenience of user
Representation
Common usage
Accuracy
Sufficiency and necessity
Significance
Economy
Consistency and Standardization
Integration
Defensible, not arbitrary
If contradict, take a defensible, practical solution
Copyright 2009-2010 @CULT. All rights reserved
7
8. RDA: Resource Description and Access
Based on IFLA’s international models and principles
• Functional Requirements for Bibliographic Records (FRBR; 1998)
• Functional Requirements for Authority Data (FRAD; 2009)
• Statement of International Cataloguing Principles (ICP; 2009)
Copyright 2009-2010 @CULT. All rights reserved
8
9. RDA: goals*
RDA is:
• A new standard for resource description and access
• Designed for the digital world:
• Optimized for use as online product
• Description and access of all resources (all types of content and media)
• Resulting records usable in the digital environment (Internet, Web
OPACs, etc.)
• Rules should be easy to use and interpret
• Be applicable to an online, networked environment
*Barbara Tillett, Seminar on RDA
Copyright 2009-2010 @CULT. All rights reserved
9
10. RDA: goals*
• Provide effective bibliographic control for all types of media
• Encourage use beyond the library community
• Be compatible with other similar standards
• Have a logical structure based on internationally agreed
principles
• Separate content and carrier data
• Examples – more of them, more appropriate
Basic goals: identify and relate (from FRBR/FRAD user tasks
and ICP)
Copyright 2009-2010 @CULT. All rights reserved
10
11. RDA: Summary of differences*
• Based on principles
• ICP
• Based on conceptual models
• FRBR/FRAD
• Content, not encoding, standard:
• not a display standard,
• not an encoding standard: use whatever schema you prefer, such as Marc21,
Dublin Core etc.
• Strictly related to the Bibliographic Framework Transition Initiative
• More international
• Wider scope and resources
*Barbara Tillett, Seminar on RDA
Copyright 2009-2010 @CULT. All rights reserved
11
12. RDA: Summary of differences*
• Includes authority data
• based on attributes and relationships in FRAD.
• Authorized/variant access point and elements continue to be documented in
authority records
• Has controlled vocabularies
• Only a few closed: content, media and carrier types; mode of issuance; etc.
• Most are open: cataloguer can supply term if needed term not in list
• Vocabularies being registered on the Web -- goal of multiple languages and/or
scripts (http://metadataregistry.org/rdabrowse.htm)
• Changes in instruction vocabulary
• Core elements
*Barbara Tillett, Seminar on RDA
Copyright 2009-2010 @CULT. All rights reserved
12
13. RDA in Marc21
RDA: “use whatever schema you prefer, such as
Marc21, Dublin Core etc.”
I can catalogue in RDA using Marc21… but
some updates are needed!
Copyright 2009-2010 @CULT. All rights reserved
13
14. RDA record in Marc21 format
040 $e rda
264 as publisher
content, media, carrier
Copyright 2009-2010 @CULT. All rights reserved
15. Full update of Marc21 Bibliographic
and Authority: numbers
• Marc21 Bibliographic Format updates: 252
tags/subfields
• Marc21 Authority Format updates: 167
tags/subfields (total of 3249 elements checked
and revised)
(strong effort of LibriCore)
• Integration of Authority manage in OliSuite/WeCat
Copyright 2009-2010 @CULT. All rights reserved
16. Marc must die? From record to
data
http://www.loc.gov/marc/transition/pdf/marcld-report-11-21-2012.pdf
Bibliographic Framework as a Web of Data: Linked Data Model and
Supporting Services
Copyright 2009-2010 @CULT. All rights reserved
17. Bibliographic Framework
Bibliographic Framework as a Web of Data: Linked Data
Model and Supporting Services
(Library of Congress – November 21, 2012)
• The result of a reflection and study process about the functions
of the catalog, the actuality of Marc, of cataloging standards, the
new types of resources.
• It‟s the first step of a road map that will lead to a review and
proposal of a new environment for bibliographic libraries,
a new bibliographic ecosystem
• Convertion of bibliographic data in linked data: this process has
to be an evolution and not a revolution
Copyright 2009-2010 @CULT. All rights reserved
18. Bibliographic Framework
• Reflections on new cataloging rules
• Greater level of identification and analysis of the data
• More attention to the controlled vocabularies
• Increased use of terms with respect to codes
• Emphasis on relationships
• Greater flexibility within checked headings
Copyright 2009-2010 @CULT. All rights reserved
19. Library data and Semantic web
As Karen Coyle suggests us, the question that we must face is:
“How we can best transform our data so that it can become part of
the dominant information environment that is the Web”*
How we can transform our data so that they can be OF THE WEB and
not on the web
*Karen Coyle, Understanding the Semantic web: bibliographic data and metadata
Copyright 2009-2010 @CULT. All rights reserved
19
20. Library data and Semantic web
Web of document (traditional web)
vs
Web of data (Semantic web)
Web of document: identity issues and interoperability
Copyright 2009-2010 @CULT. All rights reserved
20
21. Web: identity issues and interoperability
2006: Tim Berners-Lee, the father of the world wide web
(www), offers the Linked Data as elegant and effective
method to simplify and standardize the solutions to the
problems of identity and interoperability.
The method aims to create a web of data (or entities or
things) and is part of the technology for the Semantic
Web
[http://www.w3.org/DesignIssues/LinkedData.html]
Copyright 2009-2010 @CULT. All rights reserved
22. The Semantic web or Web of data
Il web di dati o web semantico
”A web of things in the world, described by data on the web”
*17th International World Wide Web Conference W3C Track @ WWW2008, Beijing, China 23-24 April 2008 Linked Data: Principles and State of the Art
Copyright 2009-2010 @CULT. All rights reserved
23. The four rules for linked data
“The Semantic Web isn't just about putting data on the web. It is about
making links, so that a person or machine can explore the web of
data. With linked data, when you have some of it, you can find
other, related, data.”
Tim Berners-Lee proposes four rules to create linked data on the web:
1. Use URIs as names for things.
2. Use HTTP URIs so that people can look up those names.
3. When someone looks up a URI, provide useful information, using
the standards (RDF*, SPARQL).
4. Include links to other URIs so that they can discover more things.
Copyright 2009-2010 @CULT. All rights reserved
24. Linked data and RDF (Resource
Description Framework)
The Linked data are usually published on the web in RDF
Each statement in RDF is constituted by a triple formed by
subject -- predicate -- object
Copyright 2009-2010 @CULT. All rights reserved
25. The RDF triples
Illustrated by
Mazzanti
Le avventure di
Pinocchio
Author of
Carlo
Collodi
Dramatized as
Published by
Un burattino di
nome Pinocchio
Poggi
Adapted as
Una struttura estremamente semplice che rende
date
1881
Copyright 2009-2010 @CULT. All rights reserved
Pinocchio
possibile
a
qualsiasi cosa
ciascuno
formulare
asserzioni
su
26. The focus of our attention now is…
Copyright 2009-2010 @CULT. All rights reserved
28. Centrality of the bibliographic record –
Drawbacks
• Redundancy of information
• High costs
• Production of closed silos, and mostly not usable outside
of the library world
Copyright 2009-2010 @CULT. All rights reserved
29. Marc21 and Linked data
In the semantic web MARC record is seen as a
primary source of information, for the
enormous amount of data contained therein and to
the high degree of semantics present in it
Copyright 2009-2010 @CULT. All rights reserved
30. Marc21 and Linked data
But the semantics expressed in the record is only comprehensible
to the human eye. The individual elements, isolated from the record,
lose their meaning
xv, 240 p. : ill. ; 24 cm.
Includes bibliographical references (p. 240).
Cambridge University Press, 2009.
edited by Michael Hattaway.
Updated ed.
Fathers and daughters--Drama.
Copyright 2009-2010 @CULT. All rights reserved
32. … many statements or triples
The resource
has author
has title
has publisher
Has subject
has publication date
Shakespeare,William
As you like it
Cambridge U Press
Padri e figlie
2009
Triple
Copyright 2009-2010 @CULT. All rights reserved
34. Linked data vs record
Michael Hattaway.
Shakespeare, William, 1564-1616
As you like it
Cambridge University Press, 2009.
Fathers and daughters--Drama
Controlled vocabulary and ontology
Copyright 2009-2010 @CULT. All rights reserved
35. Linked data vs bibliographic record
• Linked data => not focused on records but on individual
assertions or statements of atomic structure (triples)
• Each statement (RDF triple) thus produced can be
recorded with URI/URIs, which connects the data to
others on the same dataset or in external datasets
• Moving these statements understandable to machines
on the web transforms the data contained in the original
Marc record in integral part of the Semantic Web
Copyright 2009-2010 @CULT. All rights reserved
36. Try to summarize core concepts
• Resource of the Web identified through URI
• Atomic elements related one another through links:
entity/relations model
• Less interest to description (“Take what you see” and
“Accept what you get”) and more relevance to access
control => authority record, controlled vocabulary
and ontology
• Data understandable and re-usable by machines: the
Resource Description Framework (RDF) as framework for
representing information in the Web.
Copyright 2009-2010 @CULT. All rights reserved
37. Authority module in WeCat
The full authority manage into WeCat
Copyright 2009-2010 @CULT. All rights reserved
41. Manage ontology in OliSuite/WeCat
See here how each tag is “mapped” with URI coming from different ontology
Copyright 2009-2010 @CULT. All rights reserved
42. Manage ontology in OliSuite/WeCat
An authorized person can associate each tag to one or more entity from
vocabulary and ontology
Copyright 2009-2010 @CULT. All rights reserved
43. Manage ontology in OliSuite/WeCat
Tag 110 (Name as Main entry)
Copyright 2009-2010 @CULT. All rights reserved
producersName in RDA vocabulary
44. Checking ontology in OliSuite/WeCat
The cataloguer click on the tag
label to open the screen where
the mapping with ontology
element is reported
Copyright 2009-2010 @CULT. All rights reserved
45. Checking ontology in OliSuite/WeCat
Tag 110 (Name as Main entry)
producersName in RDA vocabulary
and M1001_a in Marc 21 Element Set
Copyright 2009-2010 @CULT. All rights reserved
46. Checking ontology in OliSuite/WeCat
The URI refers directly to the specific element present on the Open Metadata
Registry, where many of „library‟ ontologies and vocabularies are published
Copyright 2009-2010 @CULT. All rights reserved
47. GMD as content, media and carrier in RDA
The GMD – General Material Designator of AACR2 and ISBD
is coded in three fields in RDA, and the content of each field is
taken from controlled vocabularies:
• Tag 336: CONTENT
• Tag 337: MEDIA
• Tag 338: CARRIER
In OliSuite/WeCat the cataloguer can choose the content of each
field from e list, that refers to controlled vocabulary on Open
Metadata Registry
Copyright 2009-2010 @CULT. All rights reserved
48. GMD as content, media and carrier in RDA
Tag 336: CONTENT
Copyright 2009-2010 @CULT. All rights reserved
49. GMD as content, media and carrier in RDA
Tag 336: CONTENT
Copyright 2009-2010 @CULT. All rights reserved
50. GMD as content, media and carrier in RDA
Tag 336: CONTENT
Copyright 2009-2010 @CULT. All rights reserved
52. GMD as content, media and carrier in RDA
The same for tag 337: MEDIA
Copyright 2009-2010 @CULT. All rights reserved
53. GMD as content, media and carrier in RDA
The same for tag 338: CARRIER
Copyright 2009-2010 @CULT. All rights reserved
54. Include ID (URI) to identify heading
URIs reported in tag 1xx/7xx $0: see here
Umberto, Eco associated to VIAF URI, to make a link with VIAF –
Virtual International Authority File
Copyright 2009-2010 @CULT. All rights reserved
55. Include ID (URI) to identify heading
The URI associated to an heading is repeatable and identify
each heading. See here it in the authority record
Copyright 2009-2010 @CULT. All rights reserved
56. Include ID (URI) to identify heading
The URI can be used both by cataloguer – to check the heading on VIAF or
other projects, and to end user, aftern, on the OPAC
Copyright 2009-2010 @CULT. All rights reserved
57. Just to close…
…giving a look to “economy” principle of ICP
(see some example to make easier the cataloguing process: the autocomplete
function in heading creation)
Copyright 2009-2010 @CULT. All rights reserved
Editor's Notes
Mostly what we do today, but using a different technology, a different data model.
Noi sappiamo che tutte le informazioni contenute nel medesimo record parlano del medesimo oggetto (il libro)