Future of Metadata:
A Look at 2025
2016 IDPF DIGICON @ BEA
MAY 11 9:50–10:35
Speakers
Patricia Payton
Senior Manager, Provider Relations
Phil Madans
Executive Director, Digital Publishing Technology
Graham Bell
Executive Director
Chip Nilges
Vice President of Business Development
Future of Metadata:
A Look at 2025
IDPF DigiCon
May 11, 2016
“An identifier, generally a sequence of alpha-numeric characters,
unambiguously differentiates one thing from another in a particular
context.”
–BISG Best Practices for Identifying Digital Products
Identifiers
Madans – Future of Metadata – IDPF 2016 4
Discoverability
Identifiers
Book
Madans – Future of Metadata – IDPF 2016 5
Product Identifier
• ISBN is a product identifier, not a work identifier.
– Created to track a product through the supply chain
• A work may comprise many products, in many formats and editions:
Hardcover, Paperback, Large Print, Audio, eBook, Etc.
• Each of those products should have it’s own ISBN
• Each work may have many associated ISBNs
The ISBN is a unique international identifier for monographic
publications… --ISBN User’s Manual, Sixth Edition
ISBN – International Standard Book Number
Madans – Future of Metadata – IDPF 2016 6
Work Identifier
7
Identifies an item of intellectual property (in the abstract; not
a product). – BISG’s Guide to Identifiers
MOBY DICK
Madans – Future of Metadata – IDPF 2016
FRBR
Madans – Future of Metadata – IDPF 2016 8
Book Industry Work Identifiers
9
Madans – Future of Metadata – IDPF 2016
International Standard Text Code
• ISTC – ISO Standard
• Identifies uniquely a textual work and its relationship to
other works
– Enables linkages to various products of the work
• No implementation traction in the U.S.
– Currently no U.S. Registration Agency
Same text, same ISTC
Hardcover Paperback
Original Work
Spanish
Edition
Abridged
Audio
Different texts, different,
but linked, ISTCs
Derived Work
Madans – Future of Metadata – IDPF 2016 10
Work Identifiers for Other Creative Works
• ISWC (International Standard Musical Work Code)
– A unique, permanent and internationally recognized ISO reference
number for the identification of musical works.
• ISAN – International Standard Audiovisual Number
– A voluntary numbering system and metadata schema for the unique
and persistent identification of any audiovisual works and versions
thereof including films, shorts, documentaries, television programs,
sports events, advertising, etc.
• EIDR – Entertainment Identifier Registry
– A universal unique identifier system for movie and television
assets. From top level titles, edits, and DVDs, to encodings, clips and
mash-ups, EIDR provides global unique identifiers for the entire range
of audiovisual object types that are relevant to entertainment
commerce.
Madans – Future of Metadata – IDPF 2016 11
Madans – Future of Metadata – IDPF 2016 12
Madans – Future of Metadata – IDPF 2016 13
How Data is Linked on the Internet
Madans – Future of Metadata – IDPF 2016 14
Digital Object Identifier
• The DOI, an ISO Standard, is a unique, persistent digital identifier of an object—digital or
physical.
– Used extensively in Scientific and Journal Publishing
– Can be assigned to a work, article, chapter, image, etc.
– Promotes discovery
– CrossRef.org
• The DOI System provides a persistent link (most commonly a URL) to an object and
standard metadata for that object.
• A DOI can be any length and is structured: Prefix/Suffix Example: 10.1000/182
– Prefix 10.1000 identifies the DOI Registry and the Registrant (1000), International DOI Foundation
– Suffix: 182 is the unique ID identifying the object, in this case the DOI Handbook
– Should be displayed as URL: http://dx.doi.org/10.1000/182
• DOI’s should not be used in place of ISBNs for book product.
– However, an ISBN can be made actionable by encoding it in a DOI (ISBN-A)
15
Madans – Future of Metadata – IDPF 2016
International Standard Name Identifier
• ISNI – ISO Standard
• Unambiguously identifies contributors to creative works and people
or entities involved in their creation or distribution
– Writers, artists, musicians, performers, producers, publishers, agents,
etc.
– Currently 9.2 million ISNIs assigned, including 500K organizations
– “…the ISNI will act as a bridge identifier across multiple domains and
become a critical component in Linked Data and Semantic Web
applications.” -- www.isni.org
• Bridge Identifier
– Links identities across domains
– Linked to standard and proprietary ID’s in databases
– Search and discovery
• Works are linked to ID’s
Madans – Future of Metadata – IDPF 2016 16
Madans – Future of Metadata – IDPF 2016 17
International Standard Link Identifier
• ISO standard published May 2015
• ISLI is a universal (decimal-based) identifier for links
between entities…
• Identifier is registered with metadata that specifies the
link:
– Jack marries Jill
• Marries is the link and would have metadata about the link: date, place, location, etc.
• Link ID is one-way. Jill marries Jack would need a different ID.
– http://dx.doi.org/XX/XXX is referenced in ISBN 978XXXXXXXXXX
• The music file located here (DOI) can be found in this book
• Metadata can be registered to where in the book the reference occurs, ID for the piece of music
Link
Madans – Future of Metadata – IDPF 2016 18
Madans – Future of Metadata – IDPF 2016 19
“The purpose of the LCC is to facilitate and expand the legitimate
use of content in the digital network through the effective use of
interoperable identifiers and metadata.”
http://www.linkedcontentcoalition.org/index.php
Principles of Identification
http://doi.org/10.1000/287
Index Standard Book Industry Identifiers
• Product Identifiers
– ISBN – International Standard Book Number
• http://www.isbn.org
– GTIN – Global Trade Item Number
• http://barcodes.gs1us.org/Default.aspx?tabid=81
• EAN – International Article Number
• UPC – Uniform Product Code
– EPC – Electronic Product Code
• http://www.epcglobalus.org/
– ISMN – International Standard Music Number
• http://www.ismn-international.org/
– ISSN – Intl Standard Serial Number
• http://www.issn.org/
– OCLC Control Number
• http://oclc.org/en-US/
• Contributor Identifiers
– ISNI – International Standard Name Identifier
• www.isni.org
– IPI – Interested Party Identifier (Music)
• http://www.cisac.org/
– ORCID – Open Researcher and Contributor ID
• https://orcid.org
• Assembly Identifiers
– ISTC – Intl Standard Text Code
• http://www.istc-international.org/
– LC Number – Library of Congress Number
• http://www.loc.gov/catdir
• Expression Identifiers
– ISRC – Intl Standard Recording Code
• http://www.ifpi.org/isrc
• Hybrid Identifiers
– DOI – Digital Object Identifier
• http://www.doi.org/ ,
http://www.crossref.org/
• Distribution Identifiers
– GLN -- Global Location Number
• http://barcodes.gs1us.org/Default.aspx?tabi
d=81
– SAN – Standard Address Number
• http://www.isbn.org/standards/home/isbn/us
/san/
• Link Identifiers
– ISLI – International Standard Link Identifier
• http://www.isli-international.org/isli/en/index
BISG Guide to Identifiers
https://www.bisg.org/guide-identifiers-0
Madans – Future of Metadata – IDPF 2016 21
Thank You!
Phil Madans
Executive Director of Digital Publishing Technology
Hachette Book Group
Chair, BISG Identification Committee
phil.madans@hbgusa.com
International Digital Publishing Forum | May 11, 2016 | Chicago, IL
Future of Metadata: A Look at 2025
The role of library linked data
Vice President, Business Development
Chip Nilges
Founded in
1967
as the Ohio College
Library Center
16,957
members worldwide
1,200+staff
18offices in
10countries OCLC headquarters in Dublin, OH USA
Partnerships
Content Providers Library Services Providers Consumer Services Providers
Web of documents
• Pages and files
• Human readable
• Independent
• Static
Two views of the web
Web of data
• Statements about
entities (“things”)
• Machine processed
• Integrated
• Actionable
• Any conceptual thing can have a name (URI) on the Web.
• Those URIs return important information back in a
standard format.
• This information includes explicit relationships to other
things with URIs.
URI = Uniform Resource Identifier
Principles of linked data
Where can you see linked data?
Linked data triples
raven
is in the genus
objectsubject
2 31
eating
behavior
relationship
corvus
carnivorous
Linked data for libraries (and publishers)
“The
Raven”
1845
“Edgar Allen Poe”
was written by
was first
published in
Publisher
XYZ
is available from…
Schema.org
• Retail structure
• Very general
• Extensible
• “Front office” uses
• Developed by major
search engines Google,
Yahoo, Yandex
Two “flavors” of linked data
BIBFRAME
• Library structure
• Very specific
• Industry-focused
• “Back office” uses
• Developed by the Library of
Congress and others in the
library community
365million records
2.3billion holdings
46million digital items
17million eBooks
*As of February 26, 2016
• WorldCat.org linked data using Schema.org format
• WorldCat Works: 197 million work descriptions
• FAST: faceted subject heading schema derived from the Library of
Congress Subject Headings with links to LCSH Authorities as will as
other authoritative sources such as VIAF, GeoNames, and Wikipedia.
• ISNI: uniquely identifies identities of contributors to media content
• VIAF: international authority file that “switches” forms of names based
on language or script of the user.
OCLC’s linked data resources
work place
person event
conceptorganization
WorldCat linked data
“Same As”
API using
identifier
Janet A. Smith
Name Authority File 2
Janet B. A. Smith
Name Authority File 1
Janet B. Adam Smith
Name Authority File 3
Janet Adam Smith
Name Authority File 4
WorldCat Person Entity Lookup Pilot
Janet A. Smith
Name Authority File 2
?
“Search”
API using
text string
Janet A. Smith
Name Authority File 2
Janet B. A. Smith
Name Authority File 1
Janet B. Adam Smith
Name Authority File 3
Janet Adam Smith
Name Authority File 4
WorldCat Person Entity Lookup Pilot
?
janet smith
By including metadata and holdings in the
OCLC global data network:
– Content is discoverable in OCLC library applications
as well as through library partner applications
– Content is syndicated through OCLC consumer
service provider partners
Benefits to partners
Library linked data:
making the metadata in
libraries:
• Work harder
• Show up in more places
• Easier to work with
• More transparent
• More portable
Thank you!
Chip Nilges
Vice President, Business Development
nilgesc@oclc.org
For more information about
partnerships with OCLC,
contact busdev@oclc.org

IDPF Digicon Future of Metadata

  • 1.
    Future of Metadata: ALook at 2025 2016 IDPF DIGICON @ BEA MAY 11 9:50–10:35
  • 2.
    Speakers Patricia Payton Senior Manager,Provider Relations Phil Madans Executive Director, Digital Publishing Technology Graham Bell Executive Director Chip Nilges Vice President of Business Development
  • 3.
    Future of Metadata: ALook at 2025 IDPF DigiCon May 11, 2016
  • 4.
    “An identifier, generallya sequence of alpha-numeric characters, unambiguously differentiates one thing from another in a particular context.” –BISG Best Practices for Identifying Digital Products Identifiers Madans – Future of Metadata – IDPF 2016 4
  • 5.
  • 6.
    Product Identifier • ISBNis a product identifier, not a work identifier. – Created to track a product through the supply chain • A work may comprise many products, in many formats and editions: Hardcover, Paperback, Large Print, Audio, eBook, Etc. • Each of those products should have it’s own ISBN • Each work may have many associated ISBNs The ISBN is a unique international identifier for monographic publications… --ISBN User’s Manual, Sixth Edition ISBN – International Standard Book Number Madans – Future of Metadata – IDPF 2016 6
  • 7.
    Work Identifier 7 Identifies anitem of intellectual property (in the abstract; not a product). – BISG’s Guide to Identifiers MOBY DICK Madans – Future of Metadata – IDPF 2016
  • 8.
    FRBR Madans – Futureof Metadata – IDPF 2016 8
  • 9.
    Book Industry WorkIdentifiers 9 Madans – Future of Metadata – IDPF 2016
  • 10.
    International Standard TextCode • ISTC – ISO Standard • Identifies uniquely a textual work and its relationship to other works – Enables linkages to various products of the work • No implementation traction in the U.S. – Currently no U.S. Registration Agency Same text, same ISTC Hardcover Paperback Original Work Spanish Edition Abridged Audio Different texts, different, but linked, ISTCs Derived Work Madans – Future of Metadata – IDPF 2016 10
  • 11.
    Work Identifiers forOther Creative Works • ISWC (International Standard Musical Work Code) – A unique, permanent and internationally recognized ISO reference number for the identification of musical works. • ISAN – International Standard Audiovisual Number – A voluntary numbering system and metadata schema for the unique and persistent identification of any audiovisual works and versions thereof including films, shorts, documentaries, television programs, sports events, advertising, etc. • EIDR – Entertainment Identifier Registry – A universal unique identifier system for movie and television assets. From top level titles, edits, and DVDs, to encodings, clips and mash-ups, EIDR provides global unique identifiers for the entire range of audiovisual object types that are relevant to entertainment commerce. Madans – Future of Metadata – IDPF 2016 11
  • 12.
    Madans – Futureof Metadata – IDPF 2016 12
  • 13.
    Madans – Futureof Metadata – IDPF 2016 13
  • 14.
    How Data isLinked on the Internet Madans – Future of Metadata – IDPF 2016 14
  • 15.
    Digital Object Identifier •The DOI, an ISO Standard, is a unique, persistent digital identifier of an object—digital or physical. – Used extensively in Scientific and Journal Publishing – Can be assigned to a work, article, chapter, image, etc. – Promotes discovery – CrossRef.org • The DOI System provides a persistent link (most commonly a URL) to an object and standard metadata for that object. • A DOI can be any length and is structured: Prefix/Suffix Example: 10.1000/182 – Prefix 10.1000 identifies the DOI Registry and the Registrant (1000), International DOI Foundation – Suffix: 182 is the unique ID identifying the object, in this case the DOI Handbook – Should be displayed as URL: http://dx.doi.org/10.1000/182 • DOI’s should not be used in place of ISBNs for book product. – However, an ISBN can be made actionable by encoding it in a DOI (ISBN-A) 15 Madans – Future of Metadata – IDPF 2016
  • 16.
    International Standard NameIdentifier • ISNI – ISO Standard • Unambiguously identifies contributors to creative works and people or entities involved in their creation or distribution – Writers, artists, musicians, performers, producers, publishers, agents, etc. – Currently 9.2 million ISNIs assigned, including 500K organizations – “…the ISNI will act as a bridge identifier across multiple domains and become a critical component in Linked Data and Semantic Web applications.” -- www.isni.org • Bridge Identifier – Links identities across domains – Linked to standard and proprietary ID’s in databases – Search and discovery • Works are linked to ID’s Madans – Future of Metadata – IDPF 2016 16
  • 17.
    Madans – Futureof Metadata – IDPF 2016 17
  • 18.
    International Standard LinkIdentifier • ISO standard published May 2015 • ISLI is a universal (decimal-based) identifier for links between entities… • Identifier is registered with metadata that specifies the link: – Jack marries Jill • Marries is the link and would have metadata about the link: date, place, location, etc. • Link ID is one-way. Jill marries Jack would need a different ID. – http://dx.doi.org/XX/XXX is referenced in ISBN 978XXXXXXXXXX • The music file located here (DOI) can be found in this book • Metadata can be registered to where in the book the reference occurs, ID for the piece of music Link Madans – Future of Metadata – IDPF 2016 18
  • 19.
    Madans – Futureof Metadata – IDPF 2016 19 “The purpose of the LCC is to facilitate and expand the legitimate use of content in the digital network through the effective use of interoperable identifiers and metadata.” http://www.linkedcontentcoalition.org/index.php Principles of Identification http://doi.org/10.1000/287
  • 20.
    Index Standard BookIndustry Identifiers • Product Identifiers – ISBN – International Standard Book Number • http://www.isbn.org – GTIN – Global Trade Item Number • http://barcodes.gs1us.org/Default.aspx?tabid=81 • EAN – International Article Number • UPC – Uniform Product Code – EPC – Electronic Product Code • http://www.epcglobalus.org/ – ISMN – International Standard Music Number • http://www.ismn-international.org/ – ISSN – Intl Standard Serial Number • http://www.issn.org/ – OCLC Control Number • http://oclc.org/en-US/ • Contributor Identifiers – ISNI – International Standard Name Identifier • www.isni.org – IPI – Interested Party Identifier (Music) • http://www.cisac.org/ – ORCID – Open Researcher and Contributor ID • https://orcid.org • Assembly Identifiers – ISTC – Intl Standard Text Code • http://www.istc-international.org/ – LC Number – Library of Congress Number • http://www.loc.gov/catdir • Expression Identifiers – ISRC – Intl Standard Recording Code • http://www.ifpi.org/isrc • Hybrid Identifiers – DOI – Digital Object Identifier • http://www.doi.org/ , http://www.crossref.org/ • Distribution Identifiers – GLN -- Global Location Number • http://barcodes.gs1us.org/Default.aspx?tabi d=81 – SAN – Standard Address Number • http://www.isbn.org/standards/home/isbn/us /san/ • Link Identifiers – ISLI – International Standard Link Identifier • http://www.isli-international.org/isli/en/index
  • 21.
    BISG Guide toIdentifiers https://www.bisg.org/guide-identifiers-0 Madans – Future of Metadata – IDPF 2016 21
  • 22.
    Thank You! Phil Madans ExecutiveDirector of Digital Publishing Technology Hachette Book Group Chair, BISG Identification Committee phil.madans@hbgusa.com
  • 44.
    International Digital PublishingForum | May 11, 2016 | Chicago, IL Future of Metadata: A Look at 2025 The role of library linked data
  • 45.
    Vice President, BusinessDevelopment Chip Nilges
  • 46.
    Founded in 1967 as theOhio College Library Center 16,957 members worldwide 1,200+staff 18offices in 10countries OCLC headquarters in Dublin, OH USA
  • 47.
    Partnerships Content Providers LibraryServices Providers Consumer Services Providers
  • 49.
    Web of documents •Pages and files • Human readable • Independent • Static Two views of the web Web of data • Statements about entities (“things”) • Machine processed • Integrated • Actionable
  • 50.
    • Any conceptualthing can have a name (URI) on the Web. • Those URIs return important information back in a standard format. • This information includes explicit relationships to other things with URIs. URI = Uniform Resource Identifier Principles of linked data
  • 51.
    Where can yousee linked data?
  • 52.
    Linked data triples raven isin the genus objectsubject 2 31 eating behavior relationship corvus carnivorous
  • 53.
    Linked data forlibraries (and publishers) “The Raven” 1845 “Edgar Allen Poe” was written by was first published in Publisher XYZ is available from…
  • 54.
    Schema.org • Retail structure •Very general • Extensible • “Front office” uses • Developed by major search engines Google, Yahoo, Yandex Two “flavors” of linked data BIBFRAME • Library structure • Very specific • Industry-focused • “Back office” uses • Developed by the Library of Congress and others in the library community
  • 55.
    365million records 2.3billion holdings 46milliondigital items 17million eBooks *As of February 26, 2016
  • 56.
    • WorldCat.org linkeddata using Schema.org format • WorldCat Works: 197 million work descriptions • FAST: faceted subject heading schema derived from the Library of Congress Subject Headings with links to LCSH Authorities as will as other authoritative sources such as VIAF, GeoNames, and Wikipedia. • ISNI: uniquely identifies identities of contributors to media content • VIAF: international authority file that “switches” forms of names based on language or script of the user. OCLC’s linked data resources
  • 57.
  • 58.
    “Same As” API using identifier JanetA. Smith Name Authority File 2 Janet B. A. Smith Name Authority File 1 Janet B. Adam Smith Name Authority File 3 Janet Adam Smith Name Authority File 4 WorldCat Person Entity Lookup Pilot Janet A. Smith Name Authority File 2 ?
  • 59.
    “Search” API using text string JanetA. Smith Name Authority File 2 Janet B. A. Smith Name Authority File 1 Janet B. Adam Smith Name Authority File 3 Janet Adam Smith Name Authority File 4 WorldCat Person Entity Lookup Pilot ? janet smith
  • 63.
    By including metadataand holdings in the OCLC global data network: – Content is discoverable in OCLC library applications as well as through library partner applications – Content is syndicated through OCLC consumer service provider partners Benefits to partners
  • 64.
    Library linked data: makingthe metadata in libraries: • Work harder • Show up in more places • Easier to work with • More transparent • More portable
  • 65.
    Thank you! Chip Nilges VicePresident, Business Development nilgesc@oclc.org For more information about partnerships with OCLC, contact busdev@oclc.org