The Differences Problem
Or why consistency in metadata is critical in the discovery process
Shana L. McDanold
First A few caveats…
In the not so distant past…
There were two main options when searching for ebooks:
1. Search each individual vendor’s website/database
2. Load MARC records (one record for each title) into the catalog for each vendor
In the not so distant past…
Problems with this approach:
• Loading records is a LOT of work and requires regular maintenance
• Massaging/editing/enhancing metadata; loading; updates; replacements; deletes
• Number of records/titles to load
• Lack of records available for loading
• Records come from numerous places and each vendor requires a different procedure to download files
• Tracking titles in multiple places (duplicate work)
Now: more options…
1. Search each individual vendor’s website/database
2. Load MARC records (one record for each title) into the catalog for each vendor
3. Integration of various vendors’ metadata into discovery layers via APIs and linked data rather than importing records into the catalog
4. Federated search tools that index multiple databases (e.g. unified index search tools)
…but are more options better?
The good and the bad
GOOD:
• fewer places to search (possibly even only one)
• most public libraries, while they have other ebook databases, will have a single integrated discovery layer
BAD:
• MORE places to search
BUT discovery is still a challenge no matter which search option you choose, and those challenges are centered around:
METADATA
Print book
Ebook
Differences?
• ISBN
• Subjects
• Title
• Author
• Date
Print book
Ebook
Differences?
• ISBN
• Subjects
• Title
• Author
• Date
Print book
Ebook
Differences?
• ISBN
• Subjects
• Title
• Author
• Date
Differences defined
• Differences in description
• Current vs past rules and guidelines
• RDA provider neutral vs individual vendor records
• Differences between vendors for same title
• Differences in how data is entered/presented
• Record proliferation
• Related to metadata differences: records cannot be “collapsed” because the discovery layer doesn’t recognize them as the same
• Different vocabularies and identity databases
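The point about records that cannot be "collapsed" can be illustrated with a small sketch. It assumes a discovery layer groups records by a match key built from title, author, and date; the key design, the field names, and the three sample records are invented for illustration and are not taken from the slides or from any real product.

```python
import re
import unicodedata
from collections import defaultdict


def normalize(text: str) -> str:
    """Lowercase, strip accents and punctuation, collapse whitespace."""
    decomposed = unicodedata.normalize("NFKD", text)
    no_accents = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    cleaned = re.sub(r"[^a-z0-9 ]+", " ", no_accents.casefold())
    return " ".join(cleaned.split())


def match_key(record: dict) -> tuple:
    """Hypothetical match key: normalized title, author surname, 4-digit year."""
    title = normalize(record["title"])
    author = record["author"]
    # Accept either "Surname, Forename" or "Forename Surname".
    surname = author.split(",")[0] if "," in author else author.split()[-1]
    year = re.search(r"\d{4}", record["date"])
    return (title, normalize(surname), year.group() if year else "")


def collapse(records: list) -> dict:
    """Group record sources that share the same match key."""
    groups = defaultdict(list)
    for rec in records:
        groups[match_key(rec)].append(rec["source"])
    return dict(groups)


# Invented sample records: same title, entered three different ways.
records = [
    {"source": "print", "title": "Becoming", "author": "Obama, Michelle", "date": "2018"},
    {"source": "vendor A", "title": "Becoming.", "author": "Michelle Obama", "date": "c2018"},
    {"source": "vendor B", "title": "BECOMING", "author": "Obama, Michelle, 1964-", "date": "[2018]"},
]
groups = collapse(records)
print(groups)
```

Without the normalization step, the raw title, author, and date strings differ across all three records, so a naive exact comparison would treat them as three separate titles. This is one possible reading of why record proliferation follows from metadata differences.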
More differences
• Missing metadata/missing records
• Data changes/updates
• Branding or custom text/collections
Why do these differences matter?
• How people search
• Keyword - forces dependency on keyword indexes
• Follow links - if you click on the subject search for Obama, Michelle, search results include only print books (no ebooks)
• Limits/facets - dependent on metadata, both visible and invisible (coded)
• Missing metadata
• Discovery layer exposes ALL the metadata (good, bad, missing)
All of this means items get “hidden” because they’re not findable.
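The "follow links" case can be sketched as follows, assuming a subject link runs an exact-match lookup on the heading string. The index structure, record IDs, and headings are hypothetical and only illustrate how an ebook with a variant or missing heading drops out of the results.

```python
from collections import defaultdict


def build_subject_index(records: list) -> dict:
    """Map each subject heading string, exactly as entered, to record IDs."""
    index = defaultdict(list)
    for rec in records:
        for heading in rec["subjects"]:
            index[heading].append(rec["id"])
    return index


# Invented records: the print record uses one heading form, the ebooks do not.
records = [
    {"id": "print-1", "subjects": ["Obama, Michelle, 1964-"]},
    {"id": "ebook-1", "subjects": ["Michelle Obama"]},  # variant form
    {"id": "ebook-2", "subjects": []},                  # subject missing
]

index = build_subject_index(records)

# Clicking the subject link on the print record searches this exact string.
hits = index.get("Obama, Michelle, 1964-", [])
print(hits)
```

Under these assumptions only the print record is returned; both ebooks exist in the index but are effectively hidden from anyone who follows the link.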
How do we fix it?
• CONSISTENCY
• use of controlled vocabularies and existing authority databases (name matching, subjects, etc.)
• Use existing metadata sources
• Follow standards and recommended/best practices
• Communication
• Data points
• complete
• consistency across vendors
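A minimal sketch of two of the fixes above, completeness of data points and name matching against an authority source, might look like this. The required-field list and the small lookup table are assumptions made for illustration; a real workflow would consult an actual authority database rather than a hard-coded dictionary.

```python
# Hypothetical list of data points every vendor record should supply.
REQUIRED_FIELDS = ("isbn", "title", "author", "date", "subjects")

# Hypothetical local table mapping name variants to one authorized form.
AUTHORIZED_NAMES = {
    "michelle obama": "Obama, Michelle, 1964-",
    "obama, michelle": "Obama, Michelle, 1964-",
}


def missing_fields(record: dict) -> list:
    """Return the required fields that are absent or empty in a record."""
    return [field for field in REQUIRED_FIELDS if not record.get(field)]


def authorized_author(record: dict):
    """Return the authorized name form, or None if the variant is unknown."""
    key = " ".join(record.get("author", "").casefold().split())
    return AUTHORIZED_NAMES.get(key)


# Invented vendor record with gaps and a variant author form.
record = {
    "isbn": "",
    "title": "Becoming",
    "author": "Michelle  Obama",
    "date": "2018",
    "subjects": [],
}

missing = missing_fields(record)
name = authorized_author(record)
print(missing, name)
```

A check like this could run before records reach the discovery layer, so that gaps and variant forms are reported back to the vendor instead of surfacing as unfindable titles.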
Questions?
B. Ed Syllabus for babasaheb ambedkar education university.pdf
 
Présentationvvvvvvvvvvvvvvvvvvvvvvvvvvvv2.pptx
Présentationvvvvvvvvvvvvvvvvvvvvvvvvvvvv2.pptxPrésentationvvvvvvvvvvvvvvvvvvvvvvvvvvvv2.pptx
Présentationvvvvvvvvvvvvvvvvvvvvvvvvvvvv2.pptx
 
Film vocab for eal 3 students: Australia the movie
Film vocab for eal 3 students: Australia the movieFilm vocab for eal 3 students: Australia the movie
Film vocab for eal 3 students: Australia the movie
 
BÀI TẬP BỔ TRỢ TIẾNG ANH LỚP 9 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2024-2025 - ...
BÀI TẬP BỔ TRỢ TIẾNG ANH LỚP 9 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2024-2025 - ...BÀI TẬP BỔ TRỢ TIẾNG ANH LỚP 9 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2024-2025 - ...
BÀI TẬP BỔ TRỢ TIẾNG ANH LỚP 9 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2024-2025 - ...
 
Leveraging Generative AI to Drive Nonprofit Innovation
Leveraging Generative AI to Drive Nonprofit InnovationLeveraging Generative AI to Drive Nonprofit Innovation
Leveraging Generative AI to Drive Nonprofit Innovation
 
Mule event processing models | MuleSoft Mysore Meetup #47
Mule event processing models | MuleSoft Mysore Meetup #47Mule event processing models | MuleSoft Mysore Meetup #47
Mule event processing models | MuleSoft Mysore Meetup #47
 
writing about opinions about Australia the movie
writing about opinions about Australia the moviewriting about opinions about Australia the movie
writing about opinions about Australia the movie
 
HYPERTENSION - SLIDE SHARE PRESENTATION.
HYPERTENSION - SLIDE SHARE PRESENTATION.HYPERTENSION - SLIDE SHARE PRESENTATION.
HYPERTENSION - SLIDE SHARE PRESENTATION.
 
What is Digital Literacy? A guest blog from Andy McLaughlin, University of Ab...
What is Digital Literacy? A guest blog from Andy McLaughlin, University of Ab...What is Digital Literacy? A guest blog from Andy McLaughlin, University of Ab...
What is Digital Literacy? A guest blog from Andy McLaughlin, University of Ab...
 
How to Make a Field Mandatory in Odoo 17
How to Make a Field Mandatory in Odoo 17How to Make a Field Mandatory in Odoo 17
How to Make a Field Mandatory in Odoo 17
 
مصحف القراءات العشر أعد أحرف الخلاف سمير بسيوني.pdf
مصحف القراءات العشر   أعد أحرف الخلاف سمير بسيوني.pdfمصحف القراءات العشر   أعد أحرف الخلاف سمير بسيوني.pdf
مصحف القراءات العشر أعد أحرف الخلاف سمير بسيوني.pdf
 
spot a liar (Haiqa 146).pptx Technical writhing and presentation skills
spot a liar (Haiqa 146).pptx Technical writhing and presentation skillsspot a liar (Haiqa 146).pptx Technical writhing and presentation skills
spot a liar (Haiqa 146).pptx Technical writhing and presentation skills
 
Temple of Asclepius in Thrace. Excavation results
Temple of Asclepius in Thrace. Excavation resultsTemple of Asclepius in Thrace. Excavation results
Temple of Asclepius in Thrace. Excavation results
 
A Independência da América Espanhola LAPBOOK.pdf
A Independência da América Espanhola LAPBOOK.pdfA Independência da América Espanhola LAPBOOK.pdf
A Independência da América Espanhola LAPBOOK.pdf
 
Pharmaceutics Pharmaceuticals best of brub
Pharmaceutics Pharmaceuticals best of brubPharmaceutics Pharmaceuticals best of brub
Pharmaceutics Pharmaceuticals best of brub
 
NEWSPAPERS - QUESTION 1 - REVISION POWERPOINT.pptx
NEWSPAPERS - QUESTION 1 - REVISION POWERPOINT.pptxNEWSPAPERS - QUESTION 1 - REVISION POWERPOINT.pptx
NEWSPAPERS - QUESTION 1 - REVISION POWERPOINT.pptx
 

McDanold, "The Differences Problem: Or why Consistency in Metadata is Critical in the Discovery Process"

  • 1. The Differences Problem, or why consistency in metadata is critical in the discovery process. Shana L. McDanold
  • 2. First, a few caveats…
  • 3. In the not so distant past… There were two main options when searching for ebooks:
      1. Search each individual vendor’s website/database
      2. Load MARC records (one record for each title) into the catalog for each vendor
  • 4. In the not so distant past… Problems with this approach:
      - Loading records is a LOT of work and requires regular maintenance
          - Massaging/editing/enhancing metadata; loading; updates; replacements; deletes
          - Number of records/titles to load
      - Lack of records available for loading
      - Records come from numerous places and each vendor requires a different procedure to download files
      - Tracking titles in multiple places (duplicate work)
  • 5. Now: more options…
      1. Search each individual vendor’s website/database
      2. Load MARC records (one record for each title) into the catalog for each vendor
      3. Integration of various vendors’ metadata into discovery layers via APIs and linked data rather than importing records into the catalog
      4. Federated search tools that index multiple databases (e.g. unified index search tools)
    …but are more options better?
  • 6. The good and the bad
      GOOD:
      - Fewer places to search (possibly even only one)
      - Most public libraries, while they have other ebook databases, will have a single integrated discovery layer
      BAD:
      - MORE places to search
      BUT discovery is still a challenge no matter which search option you choose, and those challenges are centered around: METADATA
  • 9. Differences? ISBN; Subjects; Title; Author; Date
  • 12. Differences? ISBN; Subjects; Title; Author; Date
  • 15. Differences? ISBN; Subjects; Title; Author; Date
  • 16. Differences defined
      - Differences in description
          - Current vs past rules and guidelines
          - RDA provider neutral vs individual vendor records
          - Differences between vendors for the same title
          - Differences in how data is entered/presented
      - Record proliferation
          - Related to metadata differences: records cannot be “collapsed” because the discovery layer doesn’t recognize them as the same
      - Different vocabularies and identity databases
  • 17. More differences
      - Missing metadata/missing records
      - Data changes/updates
      - Branding or custom text/collections
  • 18. Why do these differences matter?
      - How people search
          - Keyword: forces dependency on keyword indexes
          - Follow links: if you click on the subject search for Obama, Michelle, search results include only print books (no ebooks)
          - Limits/facets: dependent on metadata, both visible and invisible (coded)
      - Missing metadata
      - Discovery layer exposes ALL the metadata (good, bad, missing)
      All of this means items get “hidden” because they’re not findable.
  • 19. How do we fix it?
      - CONSISTENCY
          - Use controlled vocabularies and existing authority databases (name matching, subjects, etc.)
          - Use existing metadata sources
          - Follow standards and recommended/best practices
      - Communication
      - Data points
          - Complete
          - Consistent across vendors
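
As an illustration of the "collapse" problem on slide 16, here is a minimal Python sketch of one way a discovery layer could group records that carry the same ISBN in different forms (hyphenated, ISBN-10 vs ISBN-13). It is not taken from the presentation: the record layout (`vendor` and `isbn` keys) and the function names are hypothetical, and check digits on incoming values are not validated. Note also that print and ebook editions normally carry different ISBNs, so this only removes formatting differences; it does not by itself link a print record to its ebook.

```python
from collections import defaultdict
from typing import Optional


def isbn13_check_digit(first12: str) -> str:
    """Compute the ISBN-13 check digit for the first 12 digits."""
    total = sum(int(d) * (1 if i % 2 == 0 else 3) for i, d in enumerate(first12))
    return str((10 - total % 10) % 10)


def normalize_isbn(raw: str) -> Optional[str]:
    """Return a hyphen-free ISBN-13, or None if the value is unusable.

    Assumption: ISBN-10 values are converted with the 978 prefix.
    """
    s = "".join(c for c in raw.upper() if c.isdigit() or c == "X")
    if len(s) == 13 and s.isdigit():
        return s
    if len(s) == 10 and s[:9].isdigit():
        core = "978" + s[:9]
        return core + isbn13_check_digit(core)
    return None


def collapse_by_isbn(records):
    """Group records (hypothetical dicts with an 'isbn' key) by normalized ISBN.

    Records without a usable ISBN are collected under the key None,
    which makes the missing-metadata cases easy to review.
    """
    groups = defaultdict(list)
    for rec in records:
        groups[normalize_isbn(rec.get("isbn", ""))].append(rec)
    return dict(groups)


records = [
    {"vendor": "A", "isbn": "0-306-40615-2"},
    {"vendor": "B", "isbn": "9780306406157"},
    {"vendor": "C", "isbn": ""},
]
groups = collapse_by_isbn(records)
```

In this sample, the records from vendors A and B fall into the same group, while vendor C's record, which has no ISBN, has lost its match point and lands in the `None` group.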

Editor's Notes

  1. Usually differences are a GOOD thing, providing diversity, but not in this case. Caveat: I am speaking mainly from a public library perspective, although most of the issues public libraries have are also present in academic environments; the differences are in resource types and in the focus on currency/popularity of materials (the collection is more ephemeral than permanent). BUT my background is serials and nonprint format cataloging; I have been managing metadata/cataloging for ejournals and ebooks for almost 2 decades now. My philosophy: the job of cataloging/metadata is to make stuff findable, which includes unique identification of resources. I don’t believe in the “perfect” record. If it’s not wrong, leave it alone (don’t delete data, just exclude it from indexes; you may want it in the future). When editing: fix errors or delete if wrong; add access points; enhance content/description (add value); make it pretty.
  2. Number of vendors increased: more complexity means more time. Each vendor has a different procedure for downloading and needs different edits (some need a proxy added, some don’t); files may be in various formats and require conversion to MARC. Tools help streamline the work (MarcEdit: TASK LISTS saving the edits for each vendor are a savior) BUT it is still very time consuming. Multiple places: the ERM, the catalog, and possibly the vendor website all have to be kept in sync.
  3. Looking at a single search option for ebooks and print books, where an API is used to search both the ebook vendor and the catalog in one search. So let’s look at examples; the examples are current popular titles or authors.
  4. Who’s watching the show on Netflix?
  5. ISBN: this is often a key match point for OpenURL resolvers or other API/linked data tools. Title: the ebook version is incomplete. Author: the translator is missing, an issue when looking for a specific translation or when searching by translator name. Date: the format is an indexing issue; how does your system handle dates?
  6. ISBN: this is often a key match point for OpenURL resolvers or other API/linked data tools. Title: the ebook version is incomplete. Author: indexing issues; identity management/authority control issues. Date: the format is an indexing issue; how does your system handle dates?
  7. ISBN: this is often a key match point for OpenURL resolvers or other API/linked data tools. Subject: where’s DC?? Title: the ebook version is different. Author: indexing issues; identity management/authority control issues. Date: the format is an indexing issue; how does your system handle dates? Do you see a trend yet?
  8. Description: AACR2 vs RDA is a fundamental change in how you approach describing a resource. Provider neutral: one record for ALL online versions of a title (formats, platforms, etc.), with multiple links/URLs to the various options; this is hard to do with APIs/linked data tools. Date format, author format (last, first or first last?). Proliferation: more vendors = more records. We get patron complaints about ebook display all the time. Different vocabularies and identity databases (name formats, subjects, locations, etc.) create indexing and filing issues and split indexes.
  9. Missing: sometimes records just don’t appear (API/linked data tool errors, delays). Data changes: records get “out of sync”; the print book record may be complete while the ebook record is still minimal/prepublication. Branding: you can’t add custom text to create collections, or other data, to ebook records; control over display and over what data is included is limited, so you are stuck with what the vendor sends/makes available.
  10. Forcing dependency on keyword indexing, or indexing of the WHOLE record: specific author indexes, etc. become not useful. How people search: Subjects/identities: FORM matters; “see also”; collections. Links: find something they want/like, then follow links to “similar” or “like” items using subjects, authors, etc. (internet rabbit hole…). Limits/facets: such as format, publication date, location, etc. Missing metadata (subjects, ISBN, names, locations, etc.): you lose match points, which may result in records not appearing; search by ISBN and the ebooks don’t show up. Discovery layers: good at exposing EVERYTHING (a great way to identify database cleanup projects…).
  11. Communication: between libraries and vendors. Data points: more is better, even if they don’t display.
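
The notes above repeatedly flag date format and author format ("last, first or first last?") as indexing issues. The following Python sketch shows, under stated assumptions, how an index could reduce such variants to a common key before comparison. The function names are hypothetical and this is not the presenter's method; the name handling is deliberately naive (it does not handle authority-style headings with dates or titles) and is no substitute for real authority control.

```python
import re
import unicodedata
from typing import Optional


def name_key(name: str) -> str:
    """Build a comparison key so 'Last, First' and 'First Last' match.

    Assumption: at most one inversion comma; anything after the first
    comma is treated as the forename part.
    """
    name = name.strip()
    if "," in name:
        last, _, rest = name.partition(",")
        name = f"{rest.strip()} {last.strip()}"
    # Drop diacritics so accented and unaccented forms compare equal.
    decomposed = unicodedata.normalize("NFKD", name)
    plain = "".join(c for c in decomposed if not unicodedata.combining(c))
    # Replace punctuation with spaces, lowercase, collapse whitespace.
    return " ".join(re.sub(r"[^\w\s]", " ", plain).lower().split())


def publication_year(date_text: str) -> Optional[int]:
    """Extract the first standalone four-digit year (1500-2099), if any."""
    match = re.search(r"(?<!\d)(1[5-9]\d{2}|20\d{2})(?!\d)", date_text)
    return int(match.group(1)) if match else None
```

With this sketch, `name_key("Obama, Michelle")` and `name_key("Michelle Obama")` produce the same key, and date strings such as `c2018`, `[2018]`, and `11/13/2018` all reduce to the year 2018, so a date facet or limit treats them alike.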