In and out: how does that metadata get into a knowledgebase anyhow?
Heather Sherman
Head of Library Programme Management – Dawson Books
Connect Group PLC
Creation process
Sign contract with publisher
Acquire content and basic metadata
Correct metadata errors
Enhance basic metadata
Create ProQuest xml feed
Create TOC data
Sign contract with publisher
The process starts when a publisher agrees to host its titles on dawsonera.
Publishers are asked to send Dawson the ebook content, jacket image and associated metadata.
Some send this in xml. Others complete a spreadsheet.
Publisher sends files of metadata
Publishers supply key pieces of metadata
• eISBN
• Title
• Subtitle
• Author(s)
• Price
• Currency
• PDF file name
• Jacket image
• Publisher
• Imprint
• Publication date
• Edition
• Country of publication
• Usage model
Spreadsheet of metadata
Publisher sends files of metadata
However…
Not all publishers supply the key data, so we have to go and find it.
Some supply incorrect data, so we have to fix that.
Dawson’s automated import process checks that key data is present and correct, and reports any errors.
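A check of this kind could look roughly like the sketch below. It is an illustration only, not Dawson's import code: the field names in REQUIRED_FIELDS and the sample row are assumptions loosely based on the key metadata listed earlier, and the only correctness rule shown is the standard ISBN-13 check digit.

```python
# Hypothetical field names, loosely based on the key metadata list.
REQUIRED_FIELDS = ("eisbn", "title", "author", "price", "currency",
                   "pdf_file_name", "publisher", "publication_date")


def isbn13_is_valid(isbn: str) -> bool:
    """Return True if the 13 digits satisfy the ISBN-13 check-digit rule."""
    digits = isbn.replace("-", "").strip()
    if len(digits) != 13 or not (digits.isascii() and digits.isdigit()):
        return False
    # Weights alternate 1, 3, 1, 3, ... and the weighted sum must end in 0.
    total = sum(int(d) * (1 if i % 2 == 0 else 3) for i, d in enumerate(digits))
    return total % 10 == 0


def check_record(record: dict) -> list:
    """Collect error messages for one publisher-supplied row."""
    errors = [f"missing {name}" for name in REQUIRED_FIELDS
              if not str(record.get(name, "")).strip()]
    eisbn = str(record.get("eisbn", "")).strip()
    if eisbn and not isbn13_is_valid(eisbn):
        errors.append(f"invalid eISBN: {eisbn}")
    return errors


# Illustrative row; the file name and empty price are made up for the example.
row = {"eisbn": "9780191015021", "title": "The Global Revolution",
       "author": "Silvio Pons", "price": "", "currency": "GBP",
       "pdf_file_name": "9780191015021.pdf",
       "publisher": "Oxford University Press", "publication_date": "20140815"}
print(check_record(row))  # the empty price is reported as missing
```

Returning a list of messages rather than stopping at the first problem lets one report cover every issue in a row.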
Metadata errors
Table of contents data created
PDF files are sent to an agency that creates Table of Contents (TOC) data.
For ePub files, the TOC is extracted directly from the file.
TOC data is imported into the Dawson system and matched up with the PDFs and metadata.
TOC xml
Metadata enhanced
Publisher metadata and TOC data are matched to existing print records in the Dawson title database.
A hybrid record is created, incorporating data from the publishers and Dawson.
This produces a record containing as much information as Dawson has about the title.
Dawson ebook MARC record
=LDR 01354nam 2200349 4500
=001 DAW28874972
=007 cr
=008 140327s2014enkfs001|0|eng|d
=020 $a0191015024 (e-book)
=020 $a9780191015021 (e-book)
=040 $aStDuBDS$cStDuBDS$erda$dDAWSON
=041 1$aeng$hita
=082 04$a320.53209$223
=100 1$aPons, Silvio,$eauthor.
=245 14$aThe global revolution$h[electronic resource] : $ba history of international communism, 1917-1991 / $cSilvio Pons ; translated by Allan Cameron.
=264 1$aOxford :$bOxford University Press,$c2014.
=300 $axx, 365 pages
=336 $atext$2rdacontent
=337 $acomputer$2rdamedia
=338 $aonline resource$2rdacarrier
=490 1$aOxford studies in modern European history
=500 $aTranslated from the Italian.
=504 $aIncludes bibliographical references and index.
=530 $aAlso available in printed form.
=533 $aElectronic reproduction.$cDawson Books.$nMode of access: World Wide Web.
=650 0$aCommunism$xHistory.
=650 0$aCommunism.
=655 7$aElectronic books.$2lcsh
=700 1$aCameron, Allan,$d1952-$etranslator.
=776 0$cHardback$z9780199657629
=830 0$aOxford studies in modern European history.
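The record is shown in a mnemonic text layout: =TAG, indicators, then $-prefixed subfields. A minimal sketch of reading such lines follows. The helper names parse_field and ebook_isbns are hypothetical, and the parser assumes that no subfield value contains a literal $ and that blank indicators may have been collapsed, as they appear to be on the slide.

```python
def parse_field(line: str):
    """Split one mnemonic MARC line into (tag, indicators, subfields)."""
    tag, rest = line[1:4], line[4:].strip()
    if "$" not in rest:
        # Leader and control fields (LDR, 001, 007, 008) carry no subfields.
        return tag, "", [("", rest)]
    indicators, _, subfield_text = rest.partition("$")
    subfields = [(chunk[0], chunk[1:].strip())
                 for chunk in subfield_text.split("$") if chunk]
    return tag, indicators.strip(), subfields


def ebook_isbns(lines):
    """Return the ISBNs found in 020 $a, without the '(e-book)' qualifier."""
    found = []
    for line in lines:
        tag, _, subfields = parse_field(line)
        if tag == "020":
            found += [value.split()[0] for code, value in subfields
                      if code == "a" and value]
    return found


sample = ["=001 DAW28874972",
          "=020 $a0191015024 (e-book)",
          "=020 $a9780191015021 (e-book)",
          "=264 1$aOxford :$bOxford University Press,$c2014."]
print(ebook_isbns(sample))
```

For production work a dedicated MARC library would be a safer choice than string splitting. The sketch is only meant to show how the tags and subfields relate to the data elements discussed in the talk.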
ProQuest feed created
The hybrid record is extracted and turned into an xml record.
Dawson sends daily files of new titles and updated data to ProQuest.
A weekly file of data for all titles is also sent.
xml data sent to ProQuest
<document initial-page="4" jacket="9780191015021.jpg" lang="eng">
<eisbn>
<eisbn13>9780191015021</eisbn13>
<eisbn10>0191015024</eisbn10>
</eisbn>
<isbn-group>
<isbn10 type="hb">0199657629</isbn10>
<isbn13 type="hb">9780199657629</isbn13>
</isbn-group>
<title-group>
<title>The Global Revolution: A History of International Communism 1917-1991</title>
<subtitle>A History of International Communism 1917-1991</subtitle>
</title-group>
<author-group>
<author>
<person-name>Silvio Pons ; Translated By Allan Cameron.</person-name>
</author>
</author-group>
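As a rough illustration of how a recipient might read a record like this, the sketch below parses a shortened, closed copy of the excerpt with Python's standard library. The function name summarize and the output keys are hypothetical. The trimming of the subtitle from the title is an assumption based on this one example, where the title element repeats the subtitle.

```python
import xml.etree.ElementTree as ET

# Shortened copy of the slide's excerpt, closed so that it parses on its own.
SAMPLE = """<document initial-page="4" jacket="9780191015021.jpg" lang="eng">
  <eisbn><eisbn13>9780191015021</eisbn13><eisbn10>0191015024</eisbn10></eisbn>
  <isbn-group><isbn13 type="hb">9780199657629</isbn13></isbn-group>
  <title-group>
    <title>The Global Revolution: A History of International Communism 1917-1991</title>
    <subtitle>A History of International Communism 1917-1991</subtitle>
  </title-group>
</document>"""


def summarize(xml_text: str) -> dict:
    """Pull a few identifying fields out of one <document> record."""
    doc = ET.fromstring(xml_text)
    title = doc.findtext("title-group/title", default="")
    subtitle = doc.findtext("title-group/subtitle", default="")
    # In this example <title> ends with ": " plus the subtitle, so trim it.
    suffix = f": {subtitle}"
    main_title = title[:-len(suffix)] if subtitle and title.endswith(suffix) else title
    return {
        "eisbn13": doc.findtext("eisbn/eisbn13", default=""),
        "print_isbn13": {e.get("type"): e.text
                         for e in doc.findall("isbn-group/isbn13")},
        "main_title": main_title,
        "subtitle": subtitle,
        "lang": doc.get("lang"),
    }


print(summarize(SAMPLE))
```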
IN AND OUT: HOW DOES THAT METADATA GET INTO A KNOWLEDGEBASE ANYHOW?
Ben Johnson
Lead Metadata Librarian, KB Provider Data
Benjamin.Johnson@proquest.com
Acquisition and Ingestion of Provider Data into a Knowledgebase (KB)
Introduction
What do I do?
4/15/2015 15
Lots of times it feels more like this:
Introduction
Acquire
• Get the data
• Verify compatibility
• Map the data
Ingest
• Transform the data
• Load
• Review
• Accept/Reject
Correct
• Customer inquiries
• Content integrity
• Product interoperability
… Profit!
Providers we partner with
Publishers
Content Aggregators (PQ, Gale)
University and Library Local Content
Library Consortia (JISC, BIBSAM)
Content Acquisition
• No data
• No contracts
• Provider Relations
KBART
• Joint NISO/UKSG Group
• Librarians, Vendors, Providers
• Transmission of metadata to vendors
• Human and machine readable data
• http://www.niso.org/workrooms/kbart
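To make the "human and machine readable" point concrete, here is a small sketch of reading a KBART-style file. KBART files are tab-delimited text with a header row of field names. The sample uses only five of the recommended columns, the row is built from the book example used in this talk, and the URL is a placeholder. read_kbart is a hypothetical helper name.

```python
import csv
import io

# Illustrative KBART-style file: a header row of field names, then one
# tab-separated row per title. The URL is a placeholder.
KBART_SAMPLE = (
    "publication_title\tprint_identifier\tonline_identifier\ttitle_url\tpublisher_name\n"
    "The Global Revolution\t9780199657629\t9780191015021\t"
    "https://example.org/ebook/9780191015021\tOxford University Press\n"
)


def read_kbart(text: str) -> list:
    """Parse tab-delimited KBART text into one dict per title."""
    reader = csv.DictReader(io.StringIO(text), delimiter="\t",
                            quoting=csv.QUOTE_NONE)
    return [dict(row) for row in reader]


for row in read_kbart(KBART_SAMPLE):
    print(row["publication_title"], row["online_identifier"])
```

Because the columns are named in the header, the same file can be opened in a spreadsheet by a person and loaded by a script without a separate schema document.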
Ingestion – mapping and transformation
Acquire the data
• FTP, HTML
• CSV/Text, Excel, XML, HTML
Create packages
• Data for existing content is mapped to KB packages (new T&F package, JISC/BIBSAM new license)
Transform the content
• Map the content to our schema
• Normalize the data (dates, diacritics)
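Normalizing dates and diacritics might be sketched as follows. This is an assumption-laden illustration rather than the KB's actual rules: the accepted date formats in DATE_FORMATS are guesses at what providers send, and Unicode NFC is just one reasonable target form for accented characters.

```python
import unicodedata
from datetime import datetime
from typing import Optional

# Assumed input formats; a real provider feed may need others.
DATE_FORMATS = ("%Y%m%d", "%Y-%m-%d", "%d/%m/%Y")


def normalize_text(value: str) -> str:
    """Compose accents (e.g. 'e' + combining acute becomes one character)."""
    return unicodedata.normalize("NFC", value).strip()


def normalize_date(value: str) -> Optional[str]:
    """Return an ISO 8601 date string, or None if no known format matches."""
    for fmt in DATE_FORMATS:
        try:
            return datetime.strptime(value.strip(), fmt).date().isoformat()
        except ValueError:
            continue
    return None


print(normalize_date("20140815"))
print(normalize_text("Cafe\u0301 "))
```

Returning None for an unrecognized date, instead of guessing, leaves the decision to a later review step.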
XML Data from Dawsonera
File ready for ingestion
Ingestion – Loading and Review
Currency (Updating)
Acquisition
Ingestion
Review
Corrections
Correcshunz Corrections
Downstream products
Data in KB
Downstream
Products
Product
functionality
Discovery Access
IN AND OUT: HOW DOES THAT METADATA GET INTO A KNOWLEDGEBASE ANYHOW?
Dave Hovenden – Content Operations Manager, Summon
ProQuest
UKSG Conference – 30 March – 1 April, 2015
The Content Ingestion Streams for Summon
The Content Ingestion Process at Summon for Commercial Content
Identify New Content to Add into Summon
Identifying New Commercial Content to Add into Summon
• Product Management, Sales, and our Global Content Alliance work together to identify new content to add into Summon
• New content requests from Summon customers are also considered
• Publishers and content providers may also request to have their content added into Summon
The Content Ingestion Process at Summon for Commercial Content
Identify New Content to Add into Summon
Engage with Publisher/Provider
Pre-Agreement Content Sample Analysis
Pre-Agreement Content Sample Analysis
• The sample analysis is used to help determine the quality and extent of the metadata and the metadata schema
• We also try to determine things such as linking methods, how rights are assigned to the content, and what databases we may need in our knowledgebase (if they don’t already exist)
• Summon often indexes content at the article level or chapter level, as that is usually the level of granularity at which the content is supplied
What Metadata Do We Look For During Sample Analysis?
Title Metadata
• Article titles, Book titles, Publication titles, Subtitles, etc.
Identifier Metadata
• Unique IDs for specific articles, chapters, etc.
• Publication-level unique identifiers such as ISSN or ISBN
• Additional identifiers such as OCLC Number, LCCN, Dewey, DOI, etc.
Publication Information Metadata
• Publisher, Author(s), Corporate Authors, Volume Numbers, Issue Numbers, Start Page, Publication Date, Publication Series, etc.
Additional Metadata
• Subject Headings, Keywords, Language
Dawsonera Book Example – The Global Revolution: A History of International Communism 1917-1991 (ISBN-13 – 9780199657629)
<document initial-page="4" jacket="9780191015021.jpg" lang="eng">
<eisbn>
<eisbn13>9780191015021</eisbn13>
<eisbn10>0191015024</eisbn10>
</eisbn>
<territory-group/>
<parent-isbn/>
<isbn-group>
<isbn10 type="hb">0199657629</isbn10>
<isbn13 type="hb">9780199657629</isbn13>
</isbn-group>
<title-group>
<title>The Global Revolution: A History of International Communism 1917-1991</title>
<subtitle>A History of International Communism 1917-1991</subtitle>
</title-group>
<author-group>
<author>
<person-name>Silvio Pons ; Translated By Allan Cameron.</person-name>
</author>
</author-group>
<endnote-authors>
<endnote-author>Pons, Silvio,</endnote-author>
<endnote-author>Cameron, Allan,</endnote-author>
</endnote-authors>
Dawsonera Book Example (cont.) – The Global Revolution: A History of International Communism 1917-1991 (ISBN-13 – 9780199657629)
<publisher>
<publisher-name>Oxford University Press</publisher-name>
<imprint>Oxford University Press</imprint>
</publisher>
<pub-place>GB</pub-place>
<pub-date>20140815</pub-date>
<date-added>20140911</date-added>
<first-published/>
<edition/>
<copyright>© Oxford University Press 2014</copyright>
<classification type="dewey">320.53209</classification>
<classification type="loc">HX40</classification>
<classification type="bic">HB</classification>
<series issn="" series-name="Oxford studies in modern European history." number-within-series="">Oxford studies in
modern European history.</series>
<abstract-text>The Global Revolution. A History of International Communism 1917-1991 establishes a relationship
between the history of communism and the main processes of globalization in the past century. Drawing on a wealth of
archival sources, Silvio Pons analyses the multifaceted and contradictory relationship between the Soviet Union and the
international communist movement, to show how communism played a major part in the formation of our modern world.
The volume presents the argument that during the age of wars from 1914 to 1945, the establishment of the Soviet state in
Russia and the birth of the communist movement had an enormous impact because of their promise of world revolution
and international civil war. Such perspective appeared even more plausible in the aftermath of the Second World War and
of revolution in China, which paved the way for the expansion of communism in the post-colonial world. Communism
challenged the West in the Cold War - by means of anti-capitalist modernization and anti-imperialist mobilization - showing
itself to be a powerful factor in the politicization of global trends. However, the international legitimacy of communism
declined rapidly in the post-war era. Soviet power exposed its inability to exercise hegemony, as distinct from domination.
The consequences of Sovietization in Europe and the break between the Soviet Union and China were the primary
reasons for the decline of communist influence and appeal. Since communism lost its political credibility and cultural
cohesion, its global project had failed. The ground was prepared for the devastating impact of Western globalization on
communist regimes in Europe and the Soviet Union.</abstract-text>
Summon and the Knowledgebase
• Summon relies upon the knowledgebase to help facilitate rights access to the content
• Rights access is assigned by tracking a particular title by ISSN or ISBN in the knowledgebase, or by Database ID
• The knowledgebase also helps Summon indicate when content has full-text availability
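The rights check described here can be pictured with a toy lookup. Everything in this sketch is hypothetical, including the Holdings class, the record layout and the database ID values. It only illustrates the idea of matching a record either by ISSN/ISBN or by database ID, not how Summon or the knowledgebase implements it.

```python
from dataclasses import dataclass, field


@dataclass
class Holdings:
    """What one library is entitled to, as tracked in a knowledgebase."""
    identifiers: set = field(default_factory=set)    # ISSNs and ISBNs, no hyphens
    database_ids: set = field(default_factory=set)


def has_rights(record: dict, holdings: Holdings) -> bool:
    """True if the record matches by ISSN/ISBN, or by database ID."""
    ids = {i.replace("-", "") for i in record.get("identifiers", [])}
    if ids & holdings.identifiers:
        return True
    return record.get("database_id") in holdings.database_ids


library = Holdings(identifiers={"9780191015021"}, database_ids={"DB1"})
print(has_rights({"identifiers": ["978-0-19-101502-1"]}, library))
```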
The Content Ingestion Process at Summon for Commercial Content
Identify New Content to Add into Summon
Engage with Publisher/Provider
Pre-Agreement Content Sample Analysis
Formalize and Sign Data Sharing Agreement
Data is Delivered in Full from Publisher/Provider
Data Normalization, Mapping, and Enrichment
Data Normalization, Mapping, and Enrichment Work
Data Normalization
• Very basic high-level clean-up of the data to standardize it
• Examples include:
  • Remove leading/trailing white spaces in Title and Subtitle fields
  • Clean up diacritics and other encoding issues
Mapping
• Map the metadata fields in the records to the Summon schema
• This allows the metadata to appear in the UI and/or be made searchable within Summon
Enrichment
• Enriching the content by adding additional metadata when applicable
• Examples:
  • Scholarly/peer-reviewed flags from Ulrich’s
  • Citation counts from Scopus
  • Book cover images from Syndetics
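The three kinds of work can be shown together in a short sketch. The target field names in FIELD_MAP (Title, EISBN and so on) are invented for illustration and are not the Summon schema. The cover_images lookup stands in for an external enrichment source.

```python
# Hypothetical mapping from provider field names to target schema names.
FIELD_MAP = {"title": "Title", "subtitle": "Subtitle",
             "eisbn13": "EISBN", "publisher-name": "Publisher"}


def map_record(provider_record: dict, cover_images: dict) -> dict:
    """Trim, rename, and enrich one provider record."""
    mapped = {}
    for source, target in FIELD_MAP.items():
        value = provider_record.get(source, "").strip()   # normalization
        if value:
            mapped[target] = value                        # mapping
    cover = cover_images.get(mapped.get("EISBN", ""))     # enrichment
    if cover:
        mapped["CoverImage"] = cover
    return mapped


record = {"title": "  The Global Revolution ", "subtitle": "",
          "eisbn13": "9780191015021",
          "publisher-name": "Oxford University Press"}
print(map_record(record, {"9780191015021": "9780191015021.jpg"}))
```

Keeping the field mapping in a plain dictionary means a new provider format needs a new mapping table rather than new code.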
The Content Ingestion Process at Summon for Commercial Content
Identify New Content to Add into Summon
Engage with Publisher/Provider
Pre-Agreement Content Sample Analysis
Formalize and Sign Data Sharing Agreement
Data is Delivered in Full from Publisher/Provider
Data Normalization, Mapping, and Enrichment
Indexing
The Title as it Appears in Summon Once Indexed
The Content Ingestion Process at Summon for Commercial Content
Identify New Content to Add into Summon
Engage with Publisher/Provider
Pre-Agreement Content Sample Analysis
Formalize and Sign Data Sharing Agreement
Data is Delivered in Full from Publisher/Provider
Data Normalization, Mapping, and Enrichment
Indexing
Post-Ingestion Maintenance
Post-Ingestion Maintenance
Currency
• Currency is the process by which the publisher/provider sends Summon new or updated metadata records, or record deletions for content that needs to be removed
• Frequency of providing updates is often at the discretion of the publisher/provider
Metadata Issues
• Address reported issues of metadata quality from Summon customers
• Most issues involve incorrect metadata, or slight variations in the metadata that may impact OpenURL linking or the record deduplication process (Match & Merge)
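To see why slight variations matter, consider a simplified match key for deduplication. This is not the Match & Merge algorithm, which is not described in the talk. It is a hypothetical sketch in which two records are treated as the same item when their hyphen-stripped ISBN and a case- and punctuation-folded title agree.

```python
import re
import unicodedata


def match_key(title: str, isbn: str) -> tuple:
    """Build a comparison key that ignores case, accents, and punctuation."""
    folded = (unicodedata.normalize("NFKD", title)
              .encode("ascii", "ignore").decode("ascii").lower())
    words = re.sub(r"[^a-z0-9]+", " ", folded).split()
    return isbn.replace("-", ""), " ".join(words)


a = match_key("The Global Revolution: A History", "978-0-19-101502-1")
b = match_key("  the global  revolution \u2013 a history ", "9780191015021")
print(a == b)  # the two variants produce the same key
```

A single wrong digit in the ISBN, by contrast, yields a different key, so the two records would not be merged.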
Thank you – Any Questions?
Heather Sherman
Heather.sherman@dawsonbooks.co.uk
Benjamin Johnson
Benjamin.Johnson@proquest.com
Dave Hovenden
Dave.Hovenden@proquest.com

UKSG 2024 - Author Identity Metadata: Why a Small Publisher Can Address a Maj...UKSG 2024 - Author Identity Metadata: Why a Small Publisher Can Address a Maj...
UKSG 2024 - Author Identity Metadata: Why a Small Publisher Can Address a Maj...UKSG: connecting the knowledge community
 
UKSG 2024 - Captivate, Connect, and Convert: Unlocking the art of Collections...
UKSG 2024 - Captivate, Connect, and Convert: Unlocking the art of Collections...UKSG 2024 - Captivate, Connect, and Convert: Unlocking the art of Collections...
UKSG 2024 - Captivate, Connect, and Convert: Unlocking the art of Collections...UKSG: connecting the knowledge community
 
UKSG 2024 - A critical review of transitional agreements in the UK: why, how,...
UKSG 2024 - A critical review of transitional agreements in the UK: why, how,...UKSG 2024 - A critical review of transitional agreements in the UK: why, how,...
UKSG 2024 - A critical review of transitional agreements in the UK: why, how,...UKSG: connecting the knowledge community
 
UKSG 2024 - What next for sustainable open scholarship? The Cambridge Univers...
UKSG 2024 - What next for sustainable open scholarship? The Cambridge Univers...UKSG 2024 - What next for sustainable open scholarship? The Cambridge Univers...
UKSG 2024 - What next for sustainable open scholarship? The Cambridge Univers...UKSG: connecting the knowledge community
 


How Metadata Gets Into a Knowledgebase

  • 1. In and out: how does that metadata get into a knowledgebase anyhow? Heather Sherman Head of Library Programme Management – Dawson Books
  • 2. Connect Group PLC. Creation process: sign contract with publisher; acquire content and basic metadata; correct metadata errors; enhance basic metadata; create ProQuest XML feed; create TOC data.
  • 3. Sign contract with publisher. The process starts with a publisher agreeing to host their titles on dawsonera. Publishers are asked to send Dawson the ebook content, jacket image and associated metadata. Some send this in XML; others complete a spreadsheet.
  • 4. Publisher sends files of metadata. Publishers supply key pieces of metadata: eISBN, Title, Subtitle, Author(s), Price, Currency, PDF file name, Jacket image, Publisher, Imprint, Publication date, Edition, Country of publication, Usage model.
  • 5. Spreadsheet of metadata
  • 6. Publisher sends files of metadata. However, not all publishers supply the key data, so we have to go and find it. Some supply incorrect data, so we have to fix that. Dawson’s automated import process checks that key data is present and correct, and reports on error.
  • 7. Metadata errors
  • 8. Table of contents data created. PDF files are sent to an agency who create Table of Contents (TOC) data. For ePub files, the TOC is extracted directly from the file. TOC data is imported into the Dawson system and matched up with the PDFs and metadata.
  • 9. TOC XML
  • 10. Metadata enhanced. Publisher metadata and TOC data are matched to existing print records in the Dawson title database. A hybrid record is created incorporating data from the publishers and Dawson. This produces a record containing as much information as Dawson have about the title.
  • 11. Dawson ebook MARC record
       =LDR 01354nam 2200349 4500
       =001 DAW28874972
       =007 cr
       =008 140327s2014enkfs001|0|eng|d
       =020 $a0191015024 (e-book)
       =020 $a9780191015021 (e-book)
       =040 $aStDuBDS$cStDuBDS$erda$dDAWSON
       =041 1$aeng$hita
       =082 04$a320.53209$223
       =100 1$aPons, Silvio,$eauthor.
       =245 14$aThe global revolution$h[electronic resource] : $ba history of international communism, 1917-1991 / $cSilvio Pons ; translated by Allan Cameron.
       =264 1$aOxford :$bOxford University Press,$c2014.
       =300 $axx, 365 pages
       =336 $atext$2rdacontent
       =337 $acomputer$2rdamedia
       =338 $aonline resource$2rdacarrier
       =490 1$aOxford studies in modern European history
       =500 $aTranslated from the Italian.
       =504 $aIncludes bibliographical references and index.
       =530 $aAlso available in printed form.
       =533 $aElectronic reproduction.$cDawson Books.$nMode of access: World Wide Web.
       =650 0$aCommunism$xHistory.
       =650 0$aCommunism.
       =655 7$aElectronic books.$2lcsh
       =700 1$aCameron, Allan,$d1952-$etranslator.
       =776 0$cHardback$z9780199657629
       =830 0$aOxford studies in modern European history.
  • 12. ProQuest feed created. The hybrid record is extracted and turned into an XML record. Dawson sends daily files of new titles and updated data to ProQuest. A weekly file of data for all titles is sent.
  • 13. XML data sent to ProQuest
       <document initial-page="4" jacket="9780191015021.jpg" lang="eng">
         <eisbn>
           <eisbn13>9780191015021</eisbn13>
           <eisbn10>0191015024</eisbn10>
         </eisbn>
         <isbn-group>
           <isbn10 type="hb">0199657629</isbn10>
           <isbn13 type="hb">9780199657629</isbn13>
         </isbn-group>
         <title-group>
           <title>The Global Revolution: A History of International Communism 1917-1991</title>
           <subtitle>A History of International Communism 1917-1991</subtitle>
         </title-group>
         <author-group>
           <author>
             <person-name>Silvio Pons ; Translated By Allan Cameron.</person-name>
           </author>
         </author-group>
  • 14. IN AND OUT: HOW DOES THAT METADATA GET INTO A KNOWLEDGEBASE ANYHOW? Ben Johnson Lead Metadata Librarian, KB Provider Data Benjamin.Johnson@proquest.com Acquisition and Ingestion of Provider Data into a Knowledgebase (KB)
  • 16. Lots of times it feels more like this: 4/15/2015 16
  • 17. Introduction. Acquire: get the data; verify compatibility; map the data. Ingest: transform the data; load; review; accept/reject. Correct: customer inquiries; content integrity; product interoperability. … Profit!
  • 18. Providers we partner with: publishers; content aggregators (PQ, Gale); university and library local content; library consortia (JISC, BIBSAM).
  • 19. Content Acquisition: no data; no contracts; Provider Relations.
  • 20. KBART: joint NISO/UKSG group; librarians, vendors, providers; transmission of metadata to vendors; human and machine readable data; http://www.niso.org/workrooms/kbart
  • 21. Ingestion – mapping and transformation. Acquire the data: FTP, HTML; CSV/Text, Excel, XML, HTML. Create packages: data for existing content is mapped to KB packages (new T&F package, JISC/BIBSAM new license). Transform the content: map the content to our schema; normalize the data (dates, diacritics).
  • 22. XML Data from Dawsonera
  • 23. File ready for ingestion
  • 24. Ingestion – Loading and Review
  • 27. Downstream products. Data in KB; Downstream Products; Product functionality; Discovery; Access.
  • 28. IN AND OUT: HOW DOES THAT METADATA GET INTO A KNOWLEDGEBASE ANYHOW? Dave Hovenden – Content Operations Manager, Summon ProQuest UKSG Conference – 30 March – 1 April, 2015
  • 29. The Content Ingestion Streams for Summon
  • 30. The Content Ingestion Process at Summon for Commercial Content: Identify New Content to Add into Summon.
  • 31. Identifying New Commercial Content to Add into Summon. Product Management, Sales, and our Global Content Alliance work together to identify new content to add into Summon. New content requests from Summon customers are also considered. Publishers and content providers may also request to have their content added into Summon.
  • 32. The Content Ingestion Process at Summon for Commercial Content: Identify New Content to Add into Summon; Engage with Publisher/Provider; Pre-Agreement Content Sample Analysis.
  • 33. Pre-Agreement Content Sample Analysis. The sample analysis is used to help determine the quality and extent of the metadata and the metadata schema. We also try to determine things such as linking methods, how rights are assigned to the content, and what databases we may need in our knowledgebase (if they don’t already exist). Summon often indexes content at the article level or chapter level, as that is usually the level of granularity at which the content is supplied.
  • 34. What Metadata Do We Look For During Sample Analysis? Title metadata: article titles, book titles, publication titles, subtitles, etc. Identifier metadata: unique IDs for specific articles, chapters, etc.; publication-level unique identifiers such as ISSN or ISBN; additional identifiers such as OCLC Number, LCCN, Dewey, DOI, etc. Publication information metadata: publisher, author(s), corporate authors, volume numbers, issue numbers, start page, publication date, publication series, etc. Additional metadata: subject headings, keywords, language.
  • 35. Dawsonera Book Example – The Global Revolution: A History of International Communism 1917-1991 (ISBN-13 – 9780199657629)
       <document initial-page="4" jacket="9780191015021.jpg" lang="eng">
         <eisbn>
           <eisbn13>9780191015021</eisbn13>
           <eisbn10>0191015024</eisbn10>
         </eisbn>
         <territory-group/>
         <parent-isbn/>
         <isbn-group>
           <isbn10 type="hb">0199657629</isbn10>
           <isbn13 type="hb">9780199657629</isbn13>
         </isbn-group>
         <title-group>
           <title>The Global Revolution: A History of International Communism 1917-1991</title>
           <subtitle>A History of International Communism 1917-1991</subtitle>
         </title-group>
         <author-group>
           <author>
             <person-name>Silvio Pons ; Translated By Allan Cameron.</person-name>
           </author>
         </author-group>
         <endnote-authors>
           <endnote-author>Pons, Silvio,</endnote-author>
           <endnote-author>Cameron, Allan,</endnote-author>
         </endnote-authors>
  • 36. Dawsonera Book Example (cont.) – The Global Revolution: A History of International Communism 1917-1991 (ISBN-13 – 9780199657629)
         <publisher>
           <publisher-name>Oxford University Press</publisher-name>
           <imprint>Oxford University Press</imprint>
         </publisher>
         <pub-place>GB</pub-place>
         <pub-date>20140815</pub-date>
         <date-added>20140911</date-added>
         <first-published/>
         <edition/>
         <copyright>© Oxford University Press 2014</copyright>
         <classification type="dewey">320.53209</classification>
         <classification type="loc">HX40</classification>
         <classification type="bic">HB</classification>
         <series issn="" series-name="Oxford studies in modern European history." number-within-series="">Oxford studies in modern European history.</series>
         <abstract-text>The Global Revolution. A History of International Communism 1917-1991 establishes a relationship between the history of communism and the main processes of globalization in the past century. Drawing on a wealth of archival sources, Silvio Pons analyses the multifaceted and contradictory relationship between the Soviet Union and the international communist movement, to show how communism played a major part in the formation of our modern world. The volume presents the argument that during the age of wars from 1914 to 1945, the establishment of the Soviet state in Russia and the birth of the communist movement had an enormous impact because of their promise of world revolution and international civil war. Such perspective appeared even more plausible in the aftermath of the Second World War and of revolution in China, which paved the way for the expansion of communism in the post-colonial world. Communism challenged the West in the Cold War - by means of anti-capitalist modernization and anti-imperialist mobilization - showing itself to be a powerful factor in the politicization of global trends. However, the international legitimacy of communism declined rapidly in the post-war era. Soviet power exposed its inability to exercise hegemony, as distinct from domination. The consequences of Sovietization in Europe and the break between the Soviet Union and China were the primary reasons for the decline of communist influence and appeal. Since communism lost its political credibility and cultural cohesion, its global project had failed. The ground was prepared for the devastating impact of Western globalization on communist regimes in Europe and the Soviet Union.</abstract-text>
  • 37. Summon and the Knowledgebase. Summon relies upon the knowledgebase to help facilitate rights access to the content. Rights access is assigned by tracking a particular title by ISSN or ISBN in the knowledgebase, or by Database ID. The knowledgebase also helps Summon indicate when content has full-text availability.
  • 38. The Content Ingestion Process at Summon for Commercial Content: Identify New Content to Add into Summon; Engage with Publisher/Provider; Pre-Agreement Content Sample Analysis; Formalize and Sign Data Sharing Agreement; Data is Delivered in Full from Publisher/Provider; Data Normalization, Mapping, and Enrichment.
  • 39. Data Normalization, Mapping, and Enrichment Work. Data Normalization: very basic high-level clean-up of the data to standardize it; examples include removing leading/trailing white spaces in Title and Subtitle fields, and cleaning up diacritics and other encoding issues. Mapping: map the metadata fields in the records to the Summon schema; this allows the metadata to appear in the UI and/or be made searchable within Summon. Enrichment: enriching the content by adding additional metadata when applicable; examples: scholarly/peer-reviewed flags from Ulrich’s, citation counts from Scopus, book cover images from Syndetics.
  • 40. The Content Ingestion Process at Summon for Commercial Content: Identify New Content to Add into Summon; Engage with Publisher/Provider; Pre-Agreement Content Sample Analysis; Formalize and Sign Data Sharing Agreement; Data is Delivered in Full from Publisher/Provider; Data Normalization, Mapping, and Enrichment; Indexing.
  • 41. The Title as it Appears in Summon Once Indexed
  • 42. The Content Ingestion Process at Summon for Commercial Content: Identify New Content to Add into Summon; Engage with Publisher/Provider; Pre-Agreement Content Sample Analysis; Formalize and Sign Data Sharing Agreement; Data is Delivered in Full from Publisher/Provider; Data Normalization, Mapping, and Enrichment; Indexing; Post-Ingestion Maintenance.
  • 43. Post-Ingestion Maintenance. Currency: currency is the process of the publisher/provider sending Summon new/updated metadata records, or record deletions for content that needs to be removed; the frequency of updates is often at the discretion of the publisher/provider. Metadata issues: address reported issues of metadata quality from Summon customers; most issues involve incorrect metadata, or slight variations in the metadata that may impact OpenURL linking or the record deduplication process (Match & Merge).
  • 44. Thank you – Any Questions? Heather Sherman Heather.sherman@dawsonbooks.co.uk Benjamin Johnson Benjamin.Johnson@proquest.com Dave Hovenden Dave.Hovenden@proquest.com
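
Slides 4 and 6 describe an automated import step that checks whether key metadata is present and correct and reports on errors. The slides do not show how Dawson implements this, so the following is only a minimal sketch of the idea in Python. The choice of which fields count as required, the names `REQUIRED_FIELDS`, `isbn13_is_valid` and `check_record`, and the dictionary layout of a record are all assumptions made for illustration; the ISBN-13 check-digit rule itself is the standard one.

```python
# Hypothetical sketch of an import check; not Dawson's actual process.
# Field names follow slide 4; treating these eight as required is an assumption.
REQUIRED_FIELDS = [
    "eISBN", "Title", "Author(s)", "Price",
    "Currency", "PDF file name", "Publisher", "Publication date",
]


def isbn13_is_valid(isbn):
    """Return True if the string is a 13-digit ISBN with a correct check digit."""
    digits = isbn.replace("-", "").replace(" ", "")
    if len(digits) != 13 or not digits.isdigit():
        return False
    # Standard ISBN-13 rule: weights alternate 1, 3, 1, 3, ...; total must be divisible by 10.
    total = sum(int(d) * (1 if i % 2 == 0 else 3) for i, d in enumerate(digits))
    return total % 10 == 0


def check_record(record):
    """Return a list of error messages for one publisher-supplied record."""
    errors = []
    for field in REQUIRED_FIELDS:
        if not str(record.get(field, "")).strip():
            errors.append(f"missing {field}")
    eisbn = str(record.get("eISBN", "")).strip()
    if eisbn and not isbn13_is_valid(eisbn):
        errors.append("invalid eISBN check digit")
    return errors


# Example record built from values shown on slides 11 and 13; price and file name are made up.
example = {
    "eISBN": "9780191015021",
    "Title": "The Global Revolution",
    "Author(s)": "Silvio Pons",
    "Price": "30.00",
    "Currency": "GBP",
    "PDF file name": "9780191015021.pdf",
    "Publisher": "Oxford University Press",
    "Publication date": "20140815",
}
print(check_record(example))
```

A record that passes returns an empty list; anything else could feed the kind of error report slide 7 illustrates.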

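Slides 21 and 39 mention normalizing dates and diacritics and removing leading/trailing white space from Title and Subtitle fields. As a rough illustration only, the sketch below shows one way such clean-up might look in Python. The function names are hypothetical, the use of Unicode NFC as the target form is an assumption (the slides do not say which form ProQuest uses), and the `YYYYMMDD` input format is taken from the `<pub-date>` value on slide 36.

```python
import unicodedata
from datetime import datetime


def normalize_text(value):
    """Trim surrounding white space and compose diacritics into NFC form (assumed target)."""
    return unicodedata.normalize("NFC", value.strip())


def normalize_date(value):
    """Convert a YYYYMMDD string, as in <pub-date>20140815</pub-date>, to ISO 8601."""
    return datetime.strptime(value.strip(), "%Y%m%d").date().isoformat()


print(normalize_text("  The Global Revolution "))
print(normalize_date("20140815"))
```

Composing to a single Unicode form means that a title typed with a combining accent and the same title typed with a precomposed character compare as equal, which matters for the matching and deduplication steps mentioned on slide 43.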
Editor's Notes

  1. Hello, I am Ben Johnson, the Lead Metadata Librarian for the Provider Data side of our Knowledgebase. I manage the team responsible for getting and maintaining the Provider Data in our knowledgebase (KB). The KB drives all of our ERM and access products such as Intota and the 360 Suite of products (Link, Resource Manager, etc.), and provides rights data to our discovery layer, Summon. It’s what makes it so you don’t always search everything in Summon, only what you have access to. I am also the co-chair for the KBART Standing Committee along with Magaly Bascones from JISC/KB+.
  2. If you’re a World of Warcraft player and can tell me what’s going on here, I might have a job for you.
  3. As you’ll see, our process is, at a high level, very similar to Heather’s at Dawsonera. We acquire the data from the content providers, ingest it into our systems (which is a multi-step process, as we’ll see in a minute), and make corrections to the data so that it is as accurate as we are able to get it and so that it works well with our downstream products. And stuff happens, blah blah blah profit.
  4. We work with many different kinds of content providers: traditional publishers such as Taylor & Francis, Oxford, Cambridge and Springer; content aggregators such as EBSCO, Dawsonera and Gale; universities and libraries that self-publish or host content; and library consortia who provide us with data about their members’ licensed deals, such as Jisc through their KB+ platform (who I’m sure you’re familiar with, as they provide license data for UK institutions) and Bibsam, who provide us with similar packaged content through KB+ for Swedish institutions.
  5. Content acquisition is the trickiest part of our entire process. We can’t serve customers with good data if we don’t have any data to begin with. Unlike Dawsonera, for the KB, we do not pay anyone to give us metadata. Unlike both Dawsonera and Summon, we do not acquire full text data, so there are generally no contracts or other agreements that are signed to use that data. Someone (either someone on my team, someone on our Provider Relations team, or a customer) needs to convince the content provider to provide us with metadata so that our mutual customers can manage, access and discover the provider’s data using our products. This increases the usage of the provider’s products in turn, which is how we are able to get them on board.
  6. Here’s where, as the co-chair, I feel like I need to make a plug for KBART (KnowledgeBases And Related Tools), a joint NISO/UKSG Working Group made up of librarians, vendors and content providers who have come up with a set of recommendations for the transmission of title-level metadata between content providers, vendors, and libraries. The chief use is for populating knowledgebases with provider content, but an overlooked secondary use is as a human-readable format for librarians. Now, even though it’s called a “title list”, the focus of the data is not on cataloging or thorough description of the title. The titles are instead attributes that describe the database or package of content being sold or accessed through the content provider, including what those titles are, where they can be accessed and browsed, and, in the case of serials, the date ranges that are available through that package of content. If you are interested in all of the details of the recommended practices, which also give a good overview of what kinds of data are tracked and used by knowledgebases, methods of transmission of that data, and related issues around title list metadata and the quality (or absence) of that data, I recommend that you take a look at the NISO KBART site. We encourage all of the providers that we work with to develop KBART title lists, as it is the quasi-standard for this kind of data. We’ll take data if it is not a KBART list (or if it doesn’t meet all of the requirements), but KBART really is preferred.
  7. So we get the data from the provider via their website, an FTP site, or some other method, using a content acquisition tool to automate the process. In the KB, we represent the provider’s packages as closely as possible (as recommended by KBART). In the case of consortia such as Jisc and Bibsam, we will provide a mapping so that member libraries can discern which package in the KB they should have access to, provided it’s not consortium-specific content. We also often offer A-Z lists for libraries that buy à la carte (cherry-picked) titles, since that is also a popular method for purchasing content, for example, through Dawson. Now we need to prepare the data for ingestion into our system. We take the content (most often coming in CSV or tab-delimited text files, occasionally XML or HTML content scraped from web sites), and map it to our schema. We transform the data to be compatible with our systems, for example, standardizing date formats and converting HTML characters to Unicode.
  8. We have flattened the structure and greatly reduced the number of fields from the original XML, which avoids ingesting fields that we don’t need. We’ve mapped the XML fields to our schema. We are only using the print (hardback) ISBN here, as that identifier works much better for matching the provider record in the title list to an authoritative MARC record as part of our title reconciliation process (which we also call normalization). For data normalization, we removed initial articles (basically anything that would be considered a nonfiling character in MARC), which also aids that MARC record matching process. We’ve also mapped the content to our internal database code, which is what you see on the right there: 20A, the Dawsonera database in the KB.
  9. After the data is mapped and transformed, it is loaded and compared with the current data set that we have from the provider. The system compiles a list of changes (or a delta) and these changes are reviewed, sometimes automatically, sometimes by a human, to ensure the integrity of the content. Once those changes are vetted and the content looks good, the data load is accepted and written to our database/the KB.
  10. We can’t just load a set of content once and be done, though. The data is very fluid and it needs to be refreshed continually in order to be of any real use. Some content needs to be updated more frequently, such as Dawsonera and other large aggregated ebook providers, and other content less frequently. Our typical update (or currency) cycle is monthly, as this is how often most providers give us new data to load. Providers such as Jisc and Bibsam go so far as to notify us when there are changes; however, we usually just pull automatically at the appropriate update interval.
  11. Once the data is in the KB, we keep our eyes on it and make sure it behaves. If customers let us know that something is wrong with that set of content, then we will take steps to make those corrections to ensure that the data is as accurate as possible and works well with our downstream products.
  12. Once the data is in the KB, the downstream products (Intota, the 360 suite, Summon) use it in various ways: to link directly to the content, to provide a base of information for a library to manage their content, or to drive discovery of and access to that content based on a library’s subscriptions in ERM (populated by the KB). With that I’ll turn the mic over to my colleague, Dave Hovenden.
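
Notes 7 to 9 describe mapping provider data to a schema, standardizing dates, converting HTML characters to Unicode, removing initial articles, and compiling a delta against the data already loaded. The following is a minimal Python sketch of those steps, not the actual ProQuest tooling. The field names (`Title`, `PrintISBN`, `PublicationDate`), the list of accepted date layouts, the English-only article list, and every function name are assumptions made for illustration.

```python
import html
from datetime import datetime

# Hypothetical English-only article list; real nonfiling rules are language-aware.
INITIAL_ARTICLES = ("the ", "a ", "an ")

# Hypothetical date layouts a provider might send (day-first is assumed).
DATE_FORMATS = ("%Y-%m-%d", "%d/%m/%Y", "%d %B %Y")


def standardize_date(raw):
    """Return the date as YYYY-MM-DD, or None if no known layout matches."""
    for fmt in DATE_FORMATS:
        try:
            return datetime.strptime(raw.strip(), fmt).date().isoformat()
        except ValueError:
            continue
    return None


def normalize_title(title):
    """Decode HTML entities to Unicode and drop one leading article."""
    text = html.unescape(title).strip()
    lowered = text.lower()
    for article in INITIAL_ARTICLES:
        if lowered.startswith(article):
            return text[len(article):].lstrip()
    return text


def map_record(provider_row):
    """Flatten a provider row into a small, hypothetical KB schema."""
    return {
        "title": normalize_title(provider_row["Title"]),
        "isbn": provider_row["PrintISBN"].replace("-", ""),
        "pub_date": standardize_date(provider_row["PublicationDate"]),
    }


def compute_delta(current, incoming):
    """Compare two {isbn: record} dicts and list what a reviewer should vet."""
    shared = current.keys() & incoming.keys()
    return {
        "added": sorted(incoming.keys() - current.keys()),
        "removed": sorted(current.keys() - incoming.keys()),
        "changed": sorted(k for k in shared if current[k] != incoming[k]),
    }
```

In use, each row of a provider file would pass through `map_record`, the results would be keyed by ISBN, and `compute_delta` would be called with the previously loaded set and the new one. Keeping the delta as three plain lists makes it easy to route small changes to automatic acceptance and larger ones to a human reviewer, which is the split note 9 describes.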