The document discusses Classical Music Library (CML), a digital classical music service for libraries. CML provides over 30,000 classical music recordings for on-demand listening. After 18 months, CML has signed up over 150 library customers globally. While reviews have been positive and sales are increasing, challenges remain in obtaining more content from major labels, developing additional learning tools, and expanding the service internationally on a tight budget. The presentation emphasizes the need to be relentless and patient in overcoming these challenges to continue growing the library market for CML.
This presentation was provided by Regina Reynolds of The Library of Congress, Françoise Pelle of The ISSN International Center, Karen Coombs of The OCLC Developer Network, and Ed Pentz of CrossRef, during the NISO Webinar "The Case of the Disappearing Journal: Solving the Title Transfer and Online Display Mystery" which was held on November 10, 2010.
EDItEUR’s Chris Saynor and Tim Devenport, the Executive Director of the International ISNI agency, give an introduction to the International Standard Name Identifier. Do you know how to distinguish between Victor Hugo the writer and Victor Hugo the salsa musician? How does Margaret Attwood the Human resources specialist and author distinguish herself from Margaret Atwood?
They explain what the ISNI is, how it works, and look at some of its uses.
techforum.booknetcanada.ca
#TechForum
This document discusses identifying e-books with ISBNs. It notes that the International ISBN Agency and Book Industry Study Group have released guidelines recommending that each distinct file format version of an e-book that a publisher produces and distributes externally be assigned a separate ISBN. This approach to assigning ISBNs to e-books is meant to provide clarity about product forms and versions being identified. The document also provides context on NISO and its work developing technical standards related to publishing and information distribution.
Just about everyone is familiar with the ISBN for books and the ISSN for serials. But new identifiers and new identifier standards have been developed for resources—such as the International Standard Text Code (ISTC)— and for people and organizations—such as the International Standard Name Identifier (ISNI). NISO's January 2012 webinar, Identify This! Identify That! New Identifiers and New Uses—to be held on January 11 from 1:00 to 2:30 p.m. EST—will discuss several new identifiers as well as new uses for older identifiers.
This presentation was provided by Barbara Tillett of the Library of Congress, Diane Hillmann of The Information Institute of Syracuse, and William Moen of The University of North Texas, during the NISO event "Bibliographic Control Alphabet Soup: AACR to RDA and Evolution of MARC," held on October 14, 2009.
Searching family history with OurDigitalWorldOurDigitalWorld
Use the OurDigitalWorld single search portals and community history sites to search for your family history. Learn more about how to search and get relevant results from the OurOntario.ca portal, our two newspaper portals, as well as the specialty sites like the Federated Women's institutes of Ontario, Maritime History, and OGS provincial catalogue site. The presentation was made at the Ontario Genealogical Society Toronto Branch Kanata to Canada 150 workshops in Toronto, November 2017.
AIEMpro 2010: CONTENTUS: Technologies for Next Generation Multimedia Librariesacosta-consult
This document discusses the challenges of creating next generation multimedia libraries and archives. It outlines a processing chain to automatically assess and enhance the quality of digitized print, video, and audio content. Named entities would be extracted from text and disambiguated using context. Media would be semantically linked to external references. A semantic multimedia search engine would allow unified searching across different media types. The project aims to demonstrate these techniques on a collection from a East German music archive containing books, articles, recordings, and photos.
The document discusses Classical Music Library (CML), a digital classical music service for libraries. CML provides over 30,000 classical music recordings for on-demand listening. After 18 months, CML has signed up over 150 library customers globally. While reviews have been positive and sales are increasing, challenges remain in obtaining more content from major labels, developing additional learning tools, and expanding the service internationally on a tight budget. The presentation emphasizes the need to be relentless and patient in overcoming these challenges to continue growing the library market for CML.
This presentation was provided by Regina Reynolds of The Library of Congress, Françoise Pelle of The ISSN International Center, Karen Coombs of The OCLC Developer Network, and Ed Pentz of CrossRef, during the NISO Webinar "The Case of the Disappearing Journal: Solving the Title Transfer and Online Display Mystery" which was held on November 10, 2010.
EDItEUR’s Chris Saynor and Tim Devenport, the Executive Director of the International ISNI agency, give an introduction to the International Standard Name Identifier. Do you know how to distinguish between Victor Hugo the writer and Victor Hugo the salsa musician? How does Margaret Attwood the Human resources specialist and author distinguish herself from Margaret Atwood?
They explain what the ISNI is, how it works, and look at some of its uses.
techforum.booknetcanada.ca
#TechForum
This document discusses identifying e-books with ISBNs. It notes that the International ISBN Agency and Book Industry Study Group have released guidelines recommending that each distinct file format version of an e-book that a publisher produces and distributes externally be assigned a separate ISBN. This approach to assigning ISBNs to e-books is meant to provide clarity about product forms and versions being identified. The document also provides context on NISO and its work developing technical standards related to publishing and information distribution.
Just about everyone is familiar with the ISBN for books and the ISSN for serials. But new identifiers and new identifier standards have been developed for resources—such as the International Standard Text Code (ISTC)— and for people and organizations—such as the International Standard Name Identifier (ISNI). NISO's January 2012 webinar, Identify This! Identify That! New Identifiers and New Uses—to be held on January 11 from 1:00 to 2:30 p.m. EST—will discuss several new identifiers as well as new uses for older identifiers.
This presentation was provided by Barbara Tillett of the Library of Congress, Diane Hillmann of The Information Institute of Syracuse, and William Moen of The University of North Texas, during the NISO event "Bibliographic Control Alphabet Soup: AACR to RDA and Evolution of MARC," held on October 14, 2009.
Searching family history with OurDigitalWorldOurDigitalWorld
Use the OurDigitalWorld single search portals and community history sites to search for your family history. Learn more about how to search and get relevant results from the OurOntario.ca portal, our two newspaper portals, as well as the specialty sites like the Federated Women's institutes of Ontario, Maritime History, and OGS provincial catalogue site. The presentation was made at the Ontario Genealogical Society Toronto Branch Kanata to Canada 150 workshops in Toronto, November 2017.
AIEMpro 2010: CONTENTUS: Technologies for Next Generation Multimedia Librariesacosta-consult
This document discusses the challenges of creating next generation multimedia libraries and archives. It outlines a processing chain to automatically assess and enhance the quality of digitized print, video, and audio content. Named entities would be extracted from text and disambiguated using context. Media would be semantically linked to external references. A semantic multimedia search engine would allow unified searching across different media types. The project aims to demonstrate these techniques on a collection from a East German music archive containing books, articles, recordings, and photos.
Metadata: Standards Basics for the Independent Publishing Community, with Gra...bisg
The document discusses metadata standards for independent publishers. It provides an overview of EDItEUR, an organization that develops and promotes metadata standards. It discusses key metadata standards like ONIX, ISBN, and ISNI identifiers. It emphasizes that using metadata standards can provide business benefits like enabling efficient communication between partners and increasing sales by allowing more product information to be shared.
The International Standard Name Identifier (ISNI): A Close Look, with Laura D...bisg
The International Standard Name Identifier, or ISNI, was created to identify the millions of contributors to creative works and those active in their distribution, including researchers, inventors, writers, artists, visual creators, performers, producers, publishers, aggregators, and more in order to resolve the problem of name ambiguity in search and discovery. Now, Laura Dawson, Product Manager of Identifier Services at Bowker, will show us how ISNI has developed since the standard was first published in 2012. How is it managed? Who receives numbers? What impact has it had on publishing? And how can it be incorporated into current metadata management and distribution?
This document discusses challenges related to ebook identification standards in 2010 from EDItEUR's perspective. It notes a lack of consistency in how publishers identify ebooks in the supply chain. EDItEUR is concerned about this for several reasons: identification is key to interoperability, the ISBN has served the supply chain well but alternatives are lacking, and the new ONIX 3.0 release assumes ISBN identification. The document explores proposed solutions and next steps to address the challenges.
This presentation was provided by Kathy Klemperer, Consultant, EDItEUR; Project Manager, HARRASSOWITZ, at the NISO/BISG 6th Annual Forum: The Changing Standards Landscape, held on June 22, 2012.
The document provides an introduction to information standards organizations, specifically the National Information Standards Organization (NISO). It discusses NISO's history beginning in 1939 and outlines its mission to develop and maintain technical standards to help manage information. Key details are provided on NISO's founding, standards development process, and related organizations like ANSI, ISO, and W3C.
Todd Carpenter gave a presentation on content distribution standards at the Council of Science Editors Conference on May 20, 2012. He discussed how NISO develops standards to reduce chaos in content distribution by bringing together publishers, libraries, and automation vendors. He provided examples of NISO standards like ISSN, DOI, and KBART that help identify and describe content. Carpenter emphasized that identifiers and metadata are key to standards and explained concepts like functional granularity. Overall, he highlighted NISO's role in developing interoperability standards to help distribute scientific content.
Managing an Increasingly Complex and Interconnected World of Content
NISO/BISG 8th Annual Forum on The Changing Standards Landscape
June 27, 2014
ALA Annual Conference Las Vegas, NV
The document provides an overview of ONIX, a metadata standard used by publishers to distribute electronic book information. ONIX allows book data to be communicated between organizations in a standardized, electronic format. It has over 200 data elements that can provide information on a book's title, author, price, availability, reviews and more. ONIX messages are sent as XML files and allow book information to be distributed online to booksellers and retailers more efficiently.
You Can’t Browse The Stacks In A Digital Library: Indexed Discovery, Fair Linking & NISO’s Open Discovery Initiative. A presentation by Todd Carpenter at the 2014 Charleston Library Conference #CHS14 on November 6, 2014.
In this webinar, we'll show you how the data submitted by publishers to BiblioShare can be used by retailers to support online book sales. From Web Services like our WordPress plugin to sites like BNC CataList and 49th Shelf, publishers will discover how to better support discoverability and online bookselling through metadata, and retailers big and small will learn how BiblioShare's integrations with widely used modern retail tools take away some of the pain of promoting and selling books online.
NISO Webinar: 21st Century Resource Sharing: Which Inter-Library Loan Standard Should I Use?
October 15, 2014
1:00 p.m. - 2:30 p.m. (Eastern Time)
Has “Rethinking Resource Sharing” Succeeded? – A Survey of Resource Sharing Protocols Ten Years Later
Ted Koppel, Product Manager, VERSO® ILS – Auto-Graphics, Inc.
Invisible Alphabet Soup: How Libraries Use a Variety of ILL Standards Everyday and Don't Necessarily Know It
Margaret Ellingson, Head of Interlibrary Loan and Course Reserves, Robert W. Woodruff Library, Emory University
Occams Reader and the Interlibrary Loan of E-books
Kenny Ketner, Software Development Manager, Texas Tech University Libraries
Ryan Litsey, Document Delivery/Interlibrary Loan Assistant Librarian, Texas Tech University Library
ONIX is a standard for exchanging metadata about products between parties involved in creating, publishing and distributing books. It allows sharing of detailed product information using XML messages rather than databases. ONIX messages can be created manually but are typically generated from a central metadata repository to ensure accuracy and reduce duplication of efforts. Following metadata best practices helps ensure efficient management and communication of product data.
This document provides an overview of the NISO IOTA (Improving OpenURLs Through Analytics) project. The project aims to improve OpenURL linking by analyzing OpenURL data from various sources to identify problems and recommend solutions. The project has analyzed over 9 million OpenURLs and produced reports on element usage. Upcoming work includes developing a vendor completeness index and element weighting system. The document also briefly describes the NISO/UKSG KBART (Knowledge Bases and Related Tools) Recommended Practice for exchanging holdings data.
The document discusses the challenges of applying ISBN rules to eBooks given the growth of the eBook market. While the ISBN rules require a separate ISBN for each eBook format, many publishers have not followed this due to issues with metadata management and workflow. There is no agreement on a single approach. The document considers options like maintaining the status quo with system support for publishers, changing the rules to a single "eISBN", or developing new identifiers. It notes ongoing research to find a way forward that supports identification needs while addressing publishers' practical challenges.
An Introduction to Providence Information centersreejatunnu
A library is a collection of sources of information and similar resources, made accessible to a defined community for reference or borrowing. It provides physical or digital access to material, and may be a physical building or room, or a virtual space, or both.
The document discusses the establishment of a national knowledge bank in Egypt with the following key points:
1. The knowledge bank will have multiple portals for researchers, students, children, and the general public.
2. Registration will require identifying the user's access level and providing basic personal and institutional details.
3. Registered users can access databases of scientific resources, publications, and datasets through their designated portal.
#standardsgoals for 2019: Standards & Certification Roundup - Tom Richardson...BookNet Canada
Bibliographic standards are always changing to keep up with the ever-evolving realities of the book industry and it’s in the best interest of your business to keep up, too. Should you be using BISAC or Thema subject codes (or both)? Which version of ONIX are your business partners using? What improvements to your metadata should you prioritize for 2019?
In this session, Bibliographic Manager Tom Richardson will answer your pressing questions and show you what’s new, what’s important, and what’s coming up in standards.
March 20, 2019
techforum.booknetcanada.ca
#TechForum
1) The document discusses the ISSN Network and its services for tracking and identifying serial publications, including the ISSN Register containing over 2 million records and the Keepers Registry for monitoring digital archiving arrangements.
2) It provides statistics on ISSN assignments and metrics like a rise in digital publications and assignments from countries like Nigeria, Indonesia, and Brazil.
3) It outlines future plans like the JASPER initiative to preserve open access journals, expanding the types of resources eligible for ISSNs, and promoting interoperability between ISSN data and other identification systems.
Ebook Central Submission Guide for Content Providers -- Revised, July 2020Bowker
This submission guide for ProQuest Ebook Central contains all that a content provider needs to know about submitting metadata and content files (cover images, PDFs, EPUBs) to this platform. This thorough guide explains all elements required for your titles (including prices, sales rights to ONIX, Excel, and accepted content files). It also provides valuable contact information, business model explanations and options, and an appendix about accepted ONIX fields. Newest version -- July 2020
1. The document provides tips on metadata best practices from A to Z. It discusses topics like annotations, audience tags, BISAC codes, contributor bios, cover images, publishing statuses, keywords, series information, subtitles, titles, URLs, and validating files to ensure high quality metadata.
2. It emphasizes keeping all metadata elements up-to-date, like contributor bios and cover images, to increase discoverability and sales. Accurate audience tags, BISAC codes, keywords, and other elements can help books be found by the right readers.
3. The tips recommend using the appropriate metadata fields, valid file formats, and reviewing elements regularly to ensure metadata feeds properly and helps expose books to potential buyers
More Related Content
Similar to BEA 2014--Understanding New Developments in Metadata
Metadata: Standards Basics for the Independent Publishing Community, with Gra...bisg
The document discusses metadata standards for independent publishers. It provides an overview of EDItEUR, an organization that develops and promotes metadata standards. It discusses key metadata standards like ONIX, ISBN, and ISNI identifiers. It emphasizes that using metadata standards can provide business benefits like enabling efficient communication between partners and increasing sales by allowing more product information to be shared.
The International Standard Name Identifier (ISNI): A Close Look, with Laura D...bisg
The International Standard Name Identifier, or ISNI, was created to identify the millions of contributors to creative works and those active in their distribution, including researchers, inventors, writers, artists, visual creators, performers, producers, publishers, aggregators, and more in order to resolve the problem of name ambiguity in search and discovery. Now, Laura Dawson, Product Manager of Identifier Services at Bowker, will show us how ISNI has developed since the standard was first published in 2012. How is it managed? Who receives numbers? What impact has it had on publishing? And how can it be incorporated into current metadata management and distribution?
This document discusses challenges related to ebook identification standards in 2010 from EDItEUR's perspective. It notes a lack of consistency in how publishers identify ebooks in the supply chain. EDItEUR is concerned about this for several reasons: identification is key to interoperability, the ISBN has served the supply chain well but alternatives are lacking, and the new ONIX 3.0 release assumes ISBN identification. The document explores proposed solutions and next steps to address the challenges.
This presentation was provided by Kathy Klemperer, Consultant, EDItEUR; Project Manager, HARRASSOWITZ, at the NISO/BISG 6th Annual Forum: The Changing Standards Landscape, held on June 22, 2012.
The document provides an introduction to information standards organizations, specifically the National Information Standards Organization (NISO). It discusses NISO's history beginning in 1939 and outlines its mission to develop and maintain technical standards to help manage information. Key details are provided on NISO's founding, standards development process, and related organizations like ANSI, ISO, and W3C.
Todd Carpenter gave a presentation on content distribution standards at the Council of Science Editors Conference on May 20, 2012. He discussed how NISO develops standards to reduce chaos in content distribution by bringing together publishers, libraries, and automation vendors. He provided examples of NISO standards like ISSN, DOI, and KBART that help identify and describe content. Carpenter emphasized that identifiers and metadata are key to standards and explained concepts like functional granularity. Overall, he highlighted NISO's role in developing interoperability standards to help distribute scientific content.
Managing an Increasingly Complex and Interconnected World of Content
NISO/BISG 8th Annual Forum on The Changing Standards Landscape
June 27, 2014
ALA Annual Conference Las Vegas, NV
The document provides an overview of ONIX, a metadata standard used by publishers to distribute electronic book information. ONIX allows book data to be communicated between organizations in a standardized, electronic format. It has over 200 data elements that can provide information on a book's title, author, price, availability, reviews and more. ONIX messages are sent as XML files and allow book information to be distributed online to booksellers and retailers more efficiently.
You Can’t Browse The Stacks In A Digital Library: Indexed Discovery, Fair Linking & NISO’s Open Discovery Initiative. A presentation by Todd Carpenter at the 2014 Charleston Library Conference #CHS14 on November 6, 2014.
In this webinar, we'll show you how the data submitted by publishers to BiblioShare can be used by retailers to support online book sales. From Web Services like our WordPress plugin to sites like BNC CataList and 49th Shelf, publishers will discover how to better support discoverability and online bookselling through metadata, and retailers big and small will learn how BiblioShare's integrations with widely used modern retail tools take away some of the pain of promoting and selling books online.
NISO Webinar: 21st Century Resource Sharing: Which Inter-Library Loan Standard Should I Use?
October 15, 2014
1:00 p.m. - 2:30 p.m. (Eastern Time)
Has “Rethinking Resource Sharing” Succeeded? – A Survey of Resource Sharing Protocols Ten Years Later
Ted Koppel, Product Manager, VERSO® ILS – Auto-Graphics, Inc.
Invisible Alphabet Soup: How Libraries Use a Variety of ILL Standards Everyday and Don't Necessarily Know It
Margaret Ellingson, Head of Interlibrary Loan and Course Reserves, Robert W. Woodruff Library, Emory University
Occams Reader and the Interlibrary Loan of E-books
Kenny Ketner, Software Development Manager, Texas Tech University Libraries
Ryan Litsey, Document Delivery/Interlibrary Loan Assistant Librarian, Texas Tech University Library
ONIX is a standard for exchanging metadata about products between parties involved in creating, publishing and distributing books. It allows sharing of detailed product information using XML messages rather than databases. ONIX messages can be created manually but are typically generated from a central metadata repository to ensure accuracy and reduce duplication of efforts. Following metadata best practices helps ensure efficient management and communication of product data.
This document provides an overview of the NISO IOTA (Improving OpenURLs Through Analytics) project. The project aims to improve OpenURL linking by analyzing OpenURL data from various sources to identify problems and recommend solutions. The project has analyzed over 9 million OpenURLs and produced reports on element usage. Upcoming work includes developing a vendor completeness index and element weighting system. The document also briefly describes the NISO/UKSG KBART (Knowledge Bases and Related Tools) Recommended Practice for exchanging holdings data.
The document discusses the challenges of applying ISBN rules to eBooks given the growth of the eBook market. While the ISBN rules require a separate ISBN for each eBook format, many publishers have not followed this due to issues with metadata management and workflow. There is no agreement on a single approach. The document considers options like maintaining the status quo with system support for publishers, changing the rules to a single "eISBN", or developing new identifiers. It notes ongoing research to find a way forward that supports identification needs while addressing publishers' practical challenges.
An Introduction to Providence Information centersreejatunnu
A library is a collection of sources of information and similar resources, made accessible to a defined community for reference or borrowing. It provides physical or digital access to material, and may be a physical building or room, or a virtual space, or both.
The document discusses the establishment of a national knowledge bank in Egypt with the following key points:
1. The knowledge bank will have multiple portals for researchers, students, children, and the general public.
2. Registration will require identifying the user's access level and providing basic personal and institutional details.
3. Registered users can access databases of scientific resources, publications, and datasets through their designated portal.
#standardsgoals for 2019: Standards & Certification Roundup - Tom Richardson...BookNet Canada
Bibliographic standards are always changing to keep up with the ever-evolving realities of the book industry and it’s in the best interest of your business to keep up, too. Should you be using BISAC or Thema subject codes (or both)? Which version of ONIX are your business partners using? What improvements to your metadata should you prioritize for 2019?
In this session, Bibliographic Manager Tom Richardson will answer your pressing questions and show you what’s new, what’s important, and what’s coming up in standards.
March 20, 2019
techforum.booknetcanada.ca
#TechForum
1) The document discusses the ISSN Network and its services for tracking and identifying serial publications, including the ISSN Register containing over 2 million records and the Keepers Registry for monitoring digital archiving arrangements.
2) It provides statistics on ISSN assignments and metrics like a rise in digital publications and assignments from countries like Nigeria, Indonesia, and Brazil.
3) It outlines future plans like the JASPER initiative to preserve open access journals, expanding the types of resources eligible for ISSNs, and promoting interoperability between ISSN data and other identification systems.
Similar to BEA 2014--Understanding New Developments in Metadata (20)
Ebook Central Submission Guide for Content Providers -- Revised, July 2020Bowker
This submission guide for ProQuest Ebook Central contains all that a content provider needs to know about submitting metadata and content files (cover images, PDFs, EPUBs) to this platform. This thorough guide explains all elements required for your titles (including prices, sales rights to ONIX, Excel, and accepted content files). It also provides valuable contact information, business model explanations and options, and an appendix about accepted ONIX fields. Newest version -- July 2020
1. The document provides tips on metadata best practices from A to Z. It discusses topics like annotations, audience tags, BISAC codes, contributor bios, cover images, publishing statuses, keywords, series information, subtitles, titles, URLs, and validating files to ensure high quality metadata.
2. It emphasizes keeping all metadata elements up-to-date, like contributor bios and cover images, to increase discoverability and sales. Accurate audience tags, BISAC codes, keywords, and other elements can help books be found by the right readers.
3. The tips recommend using the appropriate metadata fields, valid file formats, and reviewing elements regularly to ensure metadata feeds properly and helps expose books to potential buyers
Enhanced Metadata for Discovery -- Beyond the BasicsBowker
This slide show contains the motivation, expectations and reality of how using enhanced metadata can have a possible effect on the sales and therefore the discovery of a publisher's content. Studies are references and BISG best practices are examined for the submission of enhanced metadata to providers in the supply chain. Focus on the metadata elements and a metadata timeline.
BEA Content & Digital Conference Leading Readers to Your Children's and YA Co...Bowker
Originally presented at BEA, covers recommendations for metadata and content marketing to reach children and young adults. Includes BISAC, levelling, and discovery.
BEA Content & Digital Conference Maximizing Metadata & Improving the Bottom LineBowker
Originally presented at BEA 2016, this presentation includes metadata best practices. It provides information on the effects of metadata on discovery, online sales, and library sales.
BEA Content & Digital Conference & IDPF 2016Bowker
The document lists four presentations occurring at the 2016 IDPF DigiCon @ BEA Program on metadata and copyright registration:
1) A presentation on the future of metadata in 2025 featuring speakers from EDItEUR, Hachette Book Group, and OCLC.
2) A presentation on leading readers to children's and YA content featuring speakers from Penguin Random House, Capstone Publishers, and Unbound Concepts.
3) A workshop on maximizing metadata and improving the bottom line featuring speakers from Baker & Taylor, Ingram Content Group, and Hachette Book Group.
4) A presentation on simply and effectively protecting copyright through registration featuring speakers from Copyright Clearance Center, First Source
A discussion of the various subject classifications for published books and ebooks. This slide deck provides insight into the type of subject classification to include, and the depth needed for discovery purposes. Includes BISAC, BIC and Thema subject classifications.
The Higher Education Persona: Whether it’s print or digital, metadata for Higher Ed deserves special attention because of its particular components and attributes.
Originally presented as a webinar for BookNet Canada, this presentation discusses the BISG Educational Taxonomy and book metadata to improve discovery by teachers and librarians.
This was originally presented at BEA 2105. This presentation looks at the experiences of two publishers as they conducted machine indexing projects. It also shows the capabilities of machine indexing today.
5 Cool Things You Didn't Know You Could Do With MetadataBowker
This was originally presented at BISG Making Information Pay 2015 as part of IDPF and BEA. This presentation covers 5 metadata themes--recommend, visualize, diagnose, acknowledge, and automate--showing cool uses of metadata on websites geared toward readers and through ONIX.
BEA 2015 Demystifying Subject Codes and KeywordsBowker
The document provides information about subject codes and keywords. It defines subjects and keywords, discusses how they differ and are used for search and classification. It also discusses standards for subject codes like BISAC and Thema and provides examples of applying subject codes from these standards. The goal is to assign the most specific non-redundant subject codes to describe works.
This webinar discussed best practices for metadata used to describe books and other published content. It covered the proper use of fields for titles, series, subjects, cover images, descriptions, author biographies and other key metadata. Attendees learned how following standards helps content be discovered through search and supports the distribution of works.
This presentation by a Bowker subject analyst discussed the need for subject codes versus keywords and provides best practices on applying subject codes. It references materials available from the Book Industry Study Group (BISG) as well.
This document discusses metadata standards and the transition from ONIX 2.1 to ONIX 3.0. It provides information on standards, linking products using identifiers like ISNI, and the status of ONIX 3.0 adoption among data providers and recipients. While 46% of surveyed organizations are not yet working on ONIX 3.0, 34% are working on it and 20% are already producing it. Most expect to transition during 2014-2015.
BEA 2014--Let Common Core Power Your Publishing Accompanying ScriptBowker
Originally presented at BEA, this slide deck provides information on product development, collection creation, and metadata for Common Core State Standard supplemental materials.
BEA 2014--Let Common Core Power Your PublishingBowker
Originally presented at BEA, this slide deck provides information on product development, collection creation, and metadata for Common Core State Standard supplemental materials.
uPublishU 2014--5 Easy Ways to Get DiscoveredBowker
This document provides tips for ensuring books are discoverable through effective metadata practices from several metadata experts. It recommends that metadata should look exactly how searchers want it when they search, have depth to interest potential readers, be present where discovery happens, and respect standards used by trading partners. The document outlines best practices for key metadata fields like title format, cover images, descriptions, keywords and author bios to maximize discoverability.
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdfGetInData
Recently we have observed the rise of open-source Large Language Models (LLMs) that are community-driven or developed by the AI market leaders, such as Meta (Llama3), Databricks (DBRX) and Snowflake (Arctic). On the other hand, there is a growth in interest in specialized, carefully fine-tuned yet relatively small models that can efficiently assist programmers in day-to-day tasks. Finally, Retrieval-Augmented Generation (RAG) architectures have gained a lot of traction as the preferred approach for LLMs context and prompt augmentation for building conversational SQL data copilots, code copilots and chatbots.
In this presentation, we will show how we built upon these three concepts a robust Data Copilot that can help to democratize access to company data assets and boost performance of everyone working with data platforms.
Why do we need yet another (open-source ) Copilot?
How can we build one?
Architecture and evaluation
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...sameer shah
"Join us for STATATHON, a dynamic 2-day event dedicated to exploring statistical knowledge and its real-world applications. From theory to practice, participants engage in intensive learning sessions, workshops, and challenges, fostering a deeper understanding of statistical methodologies and their significance in various fields."
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data LakeWalaa Eldin Moustafa
Dynamic policy enforcement is becoming an increasingly important topic in today’s world where data privacy and compliance is a top priority for companies, individuals, and regulators alike. In these slides, we discuss how LinkedIn implements a powerful dynamic policy enforcement engine, called ViewShift, and integrates it within its data lake. We show the query engine architecture and how catalog implementations can automatically route table resolutions to compliance-enforcing SQL views. Such views have a set of very interesting properties: (1) They are auto-generated from declarative data annotations. (2) They respect user-level consent and preferences (3) They are context-aware, encoding a different set of transformations for different use cases (4) They are portable; while the SQL logic is only implemented in one SQL dialect, it is accessible in all engines.
#SQL #Views #Privacy #Compliance #DataLake
Analysis insight about a Flyball dog competition team's performanceroli9797
Insight of my analysis about a Flyball dog competition team's last year performance. Find more: https://github.com/rolandnagy-ds/flyball_race_analysis/tree/main
Global Situational Awareness of A.I. and where its headedvikram sood
You can see the future first in San Francisco.
Over the past year, the talk of the town has shifted from $10 billion compute clusters to $100 billion clusters to trillion-dollar clusters. Every six months another zero is added to the boardroom plans. Behind the scenes, there’s a fierce scramble to secure every power contract still available for the rest of the decade, every voltage transformer that can possibly be procured. American big business is gearing up to pour trillions of dollars into a long-unseen mobilization of American industrial might. By the end of the decade, American electricity production will have grown tens of percent; from the shale fields of Pennsylvania to the solar farms of Nevada, hundreds of millions of GPUs will hum.
The AGI race has begun. We are building machines that can think and reason. By 2025/26, these machines will outpace college graduates. By the end of the decade, they will be smarter than you or I; we will have superintelligence, in the true sense of the word. Along the way, national security forces not seen in half a century will be un-leashed, and before long, The Project will be on. If we’re lucky, we’ll be in an all-out race with the CCP; if we’re unlucky, an all-out war.
Everyone is now talking about AI, but few have the faintest glimmer of what is about to hit them. Nvidia analysts still think 2024 might be close to the peak. Mainstream pundits are stuck on the wilful blindness of “it’s just predicting the next word”. They see only hype and business-as-usual; at most they entertain another internet-scale technological change.
Before long, the world will wake up. But right now, there are perhaps a few hundred people, most of them in San Francisco and the AI labs, that have situational awareness. Through whatever peculiar forces of fate, I have found myself amongst them. A few years ago, these people were derided as crazy—but they trusted the trendlines, which allowed them to correctly predict the AI advances of the past few years. Whether these people are also right about the next few years remains to be seen. But these are very smart people—the smartest people I have ever met—and they are the ones building this technology. Perhaps they will be an odd footnote in history, or perhaps they will go down in history like Szilard and Oppenheimer and Teller. If they are seeing the future even close to correctly, we are in for a wild ride.
Let me tell you what we see.
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Data and AI
Discussion on Vector Databases, Unstructured Data and AI
https://www.meetup.com/unstructured-data-meetup-new-york/
This meetup is for people working in unstructured data. Speakers will come present about related topics such as vector databases, LLMs, and managing data at scale. The intended audience of this group includes roles like machine learning engineers, data scientists, data engineers, software engineers, and PMs.This meetup was formerly Milvus Meetup, and is sponsored by Zilliz maintainers of Milvus.
The Building Blocks of QuestDB, a Time Series Databasejavier ramirez
Talk Delivered at Valencia Codes Meetup 2024-06.
Traditionally, databases have treated timestamps just as another data type. However, when performing real-time analytics, timestamps should be first class citizens and we need rich time semantics to get the most out of our data. We also need to deal with ever growing datasets while keeping performant, which is as fun as it sounds.
It is no wonder time-series databases are now more popular than ever before. Join me in this session to learn about the internal architecture and building blocks of QuestDB, an open source time-series database designed for speed. We will also review a history of some of the changes we have gone over the past two years to deal with late and unordered data, non-blocking writes, read-replicas, or faster batch ingestion.
2. Richard Stark, Moderator
Director of Product Data
Barnes & Noble
Laura Dawson, Speaker
Product Manager, Identifier Services
Bowker
Chris Saynor, Speaker
Metadata Manager and Project Manage
GiantChair
Kempton Mooney, Speaker
Research and Analytics Director
Nielsen Book
4. What Is ISNI
• ISO Standard, published in 2012
• International Standard Name Identifier
• Numerical representation of a name
– 16 digits
– Assigned to public figures, contributors of content –
researchers, authors, musicians, actors, publishers,
research institutions – and subjects of that content (if
they are people or institutions).
– Example: 0000 0004 1029 5439
5. Who is ISNI
• Founding members
– IFRRO (International Federation of Reproduction
Rights Organizations)
– CISAC (International Confederation of Authors and
Composers Societies)
– SCAPR (Societies’ Council for the Collective
Management of Performers’ Rights)
– OCLC
– CENL (Conference of European National Librarians),
represented by the British Library and the National
Library of France
– ProQuest, represented by Bowker
6. Members
Quality Team
Board of Directors
ISNI Organizational Structure
Registration Agencies
Ongoing
assignments/
general public
7. How Does ISNI Registration Work
• Publisher submits names for assignment through a Registration
Agency
• RA works with the publisher to ensure the data feed is well-
formatted, and sends that feed to the Assignment Agency
• AA assigns as many ISNIs to the names in the feed as it can, using
complex algorithms and business rules that evolve with each feed
• AA returns a file of names with ISNIs attached to them
– This may not be the full file of names
– Ambiguous names are held for review by Quality Team
– QT assignments and other exceptions (assignments as a result
of improvements to the algorithm) are returned to RA quarterly
– Process is not instant. Assignment may be immediate if the
name and other information is unique, but frequently
assignments take a week or two.
8. Stage One
Customer
submits data to
Registration
Agency
Registration
Agency sends
file to
Assignment
Agency
Assignment
Agency assigns
as many ISNIs
to the names as
it can
11. Display
• Only minimal metadata is displayed
• Not meant as a comprehensive profile
• ISNI is a tool for linking data sets, collocation, and
disambiguation
• Enhancements to the record can be made but not
required
16. How many names in the ISNI database?
• Over 8,300,000 assigned
• 10,112,931 provisional (awaiting a match from another
data set for corroboration)
• Your author names may well already have ISNIs.
http://www.isni.org/search.
20. Data Quality
• Based on matching names to existing records in
database (over 18 million names)
• Strict criteria for assigning ISNIs to names
• Quality team oversight (manual edits)
– British Library
– National Library of France
– OCLC
19
21. Assignment Criteria
• If on the common surname list:
– Birth date
– Death date
– ISBN(s)
– Title(s)
– Co-authors or institutional affiliation
• If not on the common surname list
– Title(s)
– Birth date
– Death date
– Any other distinguishing factors (“is not”)
• If unique
– Immediate assignment
20
22. ISNI and ORCID
• ORCID numbers are a subset of ISNI’s database
• Working towards alignment, with ultimate goal of single
assignment
• There is ISNI representation on the ORCID Technical
Steering Group, and ORCID representation on the ISNI
Technical Committee
• A researcher may have both an ORCID and an ISNI
21
30. • ONIX stands for ONline Information eXchange.
• There are over 200 data elements.
31. • ONIX stands for ONline Information eXchange.
• There are over 200 data elements.
• ONIX is an international metadata standard for communicating book
product information.
32. • ONIX stands for ONline Information eXchange.
• There are over 200 data elements.
• ONIX is an international metadata standard for communicating book
product information.
• This electronic information is distributed between publishers,
distributors, wholesalers, bookstores, online retailers, libraries, book
data aggregators and anyone else involved in the supply chain.
33. • ONIX stands for ONline Information eXchange.
• There are over 200 data elements.
• ONIX is an international metadata standard for communicating book
product information.
• This electronic information is distributed between publishers,
distributors, wholesalers, bookstores, online retailers, libraries, book
data aggregators and anyone else involved in the supply chain.
• ONIX allows global communication regardless of language.
34. • ONIX stands for ONline Information eXchange.
• There are over 200 data elements.
• ONIX is an international metadata standard for communicating book
product information.
• This electronic information is distributed between publishers,
distributors, wholesalers, bookstores, online retailers, libraries, book
data aggregators and anyone else involved in the supply chain.
• ONIX allows global communication regardless of language.
• Book information can be communicated between organizations with
different technical infrastructures.
35. • ONIX stands for ONline Information eXchange.
• There are over 200 data elements.
• ONIX is an international metadata standard for communicating book
product information.
• This electronic information is distributed between publishers,
distributors, wholesalers, bookstores, online retailers, libraries, book
data aggregators and anyone else involved in the supply chain.
• ONIX allows global communication regardless of language.
• Book information can be communicated between organizations with
different technical infrastructures.
• ONIX is not a database, but uses XML to organize data storage.
36. ONIX. A history.
With the growth of the internet and e-commerce in the 1990s
there was a compelling need to create a standard digital format
to communicate book information.
The goal was to create a universal, international format with
which publishers large and small could exchange information
about their books.
37. • ONIX was developed jointly in the late 1990s by Editeur with Book
Industry Standards Group (BISG) in the US and Book Industry
Communication in the UK.
38. • ONIX was developed jointly in the late 1990s by Editeur with Book
Industry Standards Group (BISG) in the US and Book Industry
Communication in the UK.
• ONIX for books 1.0 was published in January 2000.
39. • ONIX was developed jointly in the late 1990s by Editeur with Book
Industry Standards Group (BISG) in the US and Book Industry
Communication in the UK.
• ONIX for books 1.0 was published in January 2000.
• ONIX for books 2.1 (revision 02) was published in 2004.
40. • ONIX was developed jointly in the late 1990s by Editeur with Book
Industry Standards Group (BISG) in the US and Book Industry
Communication in the UK.
• ONIX for books 1.0 was published in January 2000.
• ONIX for books 2.1 (revision 02) was published in 2004.
• ONIX for books 3.0 was released in January 2009.
41. • ONIX was developed jointly in the late 1990s by Editeur with Book
Industry Standards Group (BISG) in the US and Book Industry
Communication in the UK.
• ONIX for books 1.0 was published in January 2000.
• ONIX for books 2.1 (revision 02) was published in 2004.
• ONIX for books 3.0 was released in January 2009.
• ONIX is governed by an International Steering Committee with local
committees providing information, support and feedback
internationally.
42. • ONIX was developed jointly in the late 1990s by Editeur with Book
Industry Standards Group (BISG) in the US and Book Industry
Communication
in the UK.
• ONIX for books 1.0 was published in January 2000.
• ONIX for books 2.1 (revision 02) was published in 2004.
• ONIX for books 3.0 was released in January 2009.
• ONIX is governed by an International Steering Committee with local
committees providing information, support and feedback internationally.
• There are national ONIX groups in Australia, Belgium, Canada, China,
Egypt, Finland, France, Germany, Italy, Japan, Korea, The Netherlands,
Norway, Russia, Spain, Sweden, the UK and the USA. It is also used in
many other countries.
48. • ONIX is a message – not a database.
• ONIX is a standard – a common language.
• ONIX is international.
49.
50. • ONIX is a message – not a database.
• ONIX is a standard – a common language.
• ONIX is international.
• ONIX can communicate your title information with everyone.
53. • With the growth of new digital formats ONIX needed revision.
54. • With the growth of new digital formats ONIX needed revision.
• ONIX 2.1 had a lot of depreciated elements left over from earlier
versions of ONIX 2.
56. • ONIX 3.0 reflects the changed global book market.
57. • ONIX 3.0 reflects the changed global book market.
• ONIX 2.1 and 3.0 share many common traits. About 66% of a
typical ONIX 2.1 message does not need significant changes made
to make it valid ONIX 3.0.
58. • ONIX 3.0 reflects the changed global book market.
• ONIX 2.1 and 3.0 share many common traits. About 66% of a
typical ONIX 2.1 message does not need significant changes made
to make it valid ONIX 3.0.
• Outdated and depreciated elements have been removed.
60. • ONIX 3.0 pushes you to express all market data even if it is to say
“Not known for these countries”.
61. • ONIX 3.0 pushes you to express all market data even if it is to say
“Not known for these countries”.
• Can express much more detailed pricing information on a global
scale.
62. • ONIX 3.0 pushes you to express all market data even if it is to say
“Not known for these countries”.
• Can express much more detailed pricing information on a global
scale.
• Can express dates and availability by market.
68. • Formats changed to express method of delivery.
• Information on DRM and usage constraints.
69. • Formats changed to express method of delivery.
• Information on DRM and usage constraints.
• Accessibility information.
70. • Formats changed to express method of delivery.
• Information on DRM and usage constraints.
• Accessibility information.
• Rental information and conditions.
71. “Set” and “Series” replaced by a more
general notion of “Collections”
• It is easier to express a shared identity.
78. • Text content – any text included in your metadata.
• Cited content – any third party content you make reference to that
could improve sales.
79. • Text content – any text included in your metadata.
• Cited content – any third party content you make reference to that
could improve sales.
• Supporting resources – any material a publisher wishes to make
available in their metadata to support the sale of the title.
82. • Can repeat and send textual information in different languages and
different scripts.
83. • Can repeat and send textual information in different languages and
different scripts.
• Add a note about a product in English, French and Spanish.
84. • Not suitable for children under 36 months, due to small parts
• No apto para niños menores de 36 meses, debido a las piezas pequeñas
• Ne convient pas aux enfants de moins de 36 mois, en raison de petites
pièce
• Nicht geeignet für Kinder unter 36 Monaten, wegen verschluckbarer
Kleinteile
• Не подходит для детей в возрасте до 36 месяцев, в связи с мелких
деталей
85. • Can repeat and send textual information in different languages and
different scripts.
• Add a note about a product in English, French, Spanish etcetera...
• Send your author’s biography in English and Spanish.
86. • Miguel de Cervantes Saavedra; 29 September 1547 (assumed) – 22 April 1616) was a
Spanish novelist, poet, and playwright. His magnum opus, Don Quixote, considered to
be the first modern European novel, is a classic of Western literature, and is regarded
amongst the best works of fiction ever written. His influence on the Spanish language
has been so great that the language is often called la lengua de Cervantes ("the
language of Cervantes"). He was dubbed El Príncipe de los Ingenios ("The Prince of
Wits").
• Miguel de Cervantes Saavedra (Alcalá de Henares,29 de septiembre de 1547 – Madrid,
22 de abril de 1616) fue un soldado, novelista, poeta y dramaturgo español.Es
considerado una de las máximas figuras de la literatura española y universalmente
conocido por haber escrito Don Quijote de la Mancha, que muchos críticos han descrito
como la primera novela moderna y una de las mejores obras de la literatura universal,
además de ser el libro más editado y traducido de la historia, sólo superado por la
Biblia. Se le ha dado el sobrenombre de «Príncipe de los Ingenios».
88. • Send updates for part of the product instead of sending the whole
product file.
89. • Send updates for part of the product instead of sending the whole
product file.
• So updates can be sent as smaller files.
90. Even better resources
• Very comprehensive ONIX 3.0 Global Best Practice and
implementation documents available.
• For developers, ONIX 3.0 has XSD and RNG schemas.
91. More about best practices
• BISG – Best Practices for Product Metadata: Guide for North
American Senders and Receivers.
• BISG – Best Practices for Keywords in Metadata: Guide for North
American Senders and Receivers.
• Editeur – Implementation and Best Practice Guide
92. To find out more about ONIX
www.editeur.org
www.bisg.org
http://www.booknetcanada.ca/
93.
94. Thema:
The First Global Subject
Category Codes
May 2014
*Contains information from Howard Willows LBF 2014 Presentation
95. Thema… What is it?
How will it help?
What are its implications?
What does it look like?
94
96. Thema: What is it?
• Thema is a subject category classification system.
• Thema is made for all members of the supply chain to use.
• Thema is meant for use with physical and digital products.
• Thema is an international standard for the global book trade.
95
97. Thema: How will it help?
• Book trade subject schemes tend to be national, not international
• We can now clearly communicate all product data – except subject
classification
• Thema can replace the need for endless mappings & conversions
It is live!
Version 1.0 was released November 2013
Sunrise Date was December 2013
96
98. Thema: How will it help?
• Facilitate international transactions
• Increase understanding in
international markets
• Reduce subject code confusion
• Increase discoverability
97
101. AIE
Amazon
Australian PA
Baker & Taylor
Barnes & Noble
BIC
BISG
Bokrondellen
Booknet Canada
Bowker
BTLF
CB
Danish PA
Dilve
Editis
Electre
Elkotob.com
Elsevier
Giant Chair
Guild of Book Dealers
(Russia)
Hachette
HarperCollins
Informazioni Editoriali
Ingram
Japan Publishers
Organisation
Kobo
Kogan Page
Libri
MVB
Nielsen Book
Norske Bokdatabasen
NTCPDS China
Penguin Random House
Springer
Waterstones
100
Current Participants
* As of London Book Fair 2014
102. Implications for BISAC Subject Heading Users
• Thema will reduce mappings to BIC, BISAC, CLIL, etc.
• Thema and BISAC will operate in parallel.
• No timeline for BISAC being deprecated.
• There is a BISAC-to-Thema mapping.
101
(Can use BISAC to select a Thema code.)
103. What does Thema look like?
F Fiction & related
FJ Speculative fiction
FJB Dystopian fiction Use for any fiction set in
dysfunctional or degraded
society; use with FL or FB
codes if appropriate
Code Heading Notes
HIERARCHY
Because of hierarchy, F is implied in FJB.
Subject Headings
102
104. What does Thema look like?
Code Heading
Subject Headings – More Examples
103
AGA History of art
FRX Erotic romance
XAMC Manga: Kodomo
NHW Military history
QRRF Zoroastrianism
KJMP Project management
LWKF Shariah law: family relations
MKE Dentistry
UGB Web graphics & design
WBB TV / celebrity chef cookbooks
YBC Children's picture books
105. What does Thema look like?
1K The Americas
1KBB United States of America, USA
1KBB-US-NAKC New York City
Geographic Code Heading
Qualifiers
3M c 1500 onwards to present day
3MPQS c 1960 to c 1969
3MPQS-US-P USA: Civil Rights Movement
Time Period Code Heading
104
106. 1K The Americas
1KBB United States of America, USA
1KBB-US-NAKC New York City
3M c 1500 onwards to present day
3MPQS c 1960 to c 1969
3MPQS-US-P USA: Civil Rights Movement
What does Thema look like?
Geographic Code Heading
Qualifiers
Time Period Code Heading
105
107. Geographic 1HFGU Uganda
Language 2ACSC Icelandic
Time Period 3MD 16th century, c 1500 to c 1599
Education 4GH For International GCSE (IGCSE)
Interest 5AG Interest age: from c 6 years
Artistic Style 6BA Baroque
What does Thema look like?
Code Heading
Qualifiers – More Examples
106
Type
(about, not in)
108. Diving Deeper: Technical specs
Summary of Elements
107
Element Code
begins
May contain Length Mandatory /
Optional
Categories A-Y A-Z 1-9 1-8 Mandatory
Geographical Qualifiers 1 1 A-Z - 2-19 Optional
Language Qualifiers 2 2 A-Z - 2-19 Optional
Time Period Qualifiers 3 3 A-Z - 2-19 Optional
Educ Purpose Qualifiers 4 4 A-Z - 2-19 Optional
Interest Qualifiers 5 5 A-Z - 2-19 Optional
Artistic Style Qualifiers 6 6 A-Z - 2-19 Optional
Thema in Onix: Use following values from code lists 26 & 27
93 Thema subject category
94 Thema geographical qualifier
95 Thema language qualifier
96 Thema time period qualifier
97 Thema educational purpose qualifier
98 Thema interest age / special interest qualifier
99 Thema style qualifier
109. Diving Deeper: Technical specs
• Only a Subject Category is mandatory; Qualifiers are optional.
• The first Subject Category entered is the primary subject.
• Thema is recognized in ONIX, and can be sent as part of any ONIX 2.*
and ONIX 3.* messages, using standard ONIX practice for subject
classification metadata.
• In product records and message formats (such as ONIX), only the code
is required to be communicated.
• There is no defined upper limit of the number of Subject Category
values or Qualifier values that may be assigned.
• It is expected that a maximum of 10 of each type would sufficiently
cover all reasonable circumstances.
• Systems designers working with systems which require limits to be
placed on data element lengths and/or number of occurrences are
advised to provide for the full length of codes and recommended
maximum number of occurrences.
108
110. Notes on Implementation
• The schema is now available via the EDItEUR website.
• Documentation on structure definitions is available.
• A document of basic user instructions is available.
• A BISAC-to-Thema mapping is available.
It is live! Version 1.0 was released November 2013
• Mappings from BIC & BISAC schemes completed
• Full translations into French, German and Norwegian
• Workshops & presentations for publishers in Germany
• Other groups working on translations into Italian, Spanish, Swedish etc…
• In the US, various supply chain partners have said they are working
towards transmitting and receiving Thema
109
111. More on Thema
110
Official Thema Documentation
http://www.editeur.org/151/Thema/
The US Thema Working Group
www.bisg.org
BISAC to Thema Translator
http://bisactothema.biblioshare.org/
Kempton MooneyResearch and Analytics Director, Nielsen