Bibliographic Database Integrity
A presentation by Elaine Hardy and Bin Lin of Georgia PINES for Evergreen International Conference 2009.


    Presentation Transcript

    • A Unit of the University System of Georgia
    • Bibliographic database integrity in a consortial environment. Evergreen International Conference, May 21, 2009. Elaine Hardy, PINES Bibliographic Projects and Metadata Manager.
    • Twentieth Century Literary Criticism: illustration of single record for each serial volume
    • GPLS intern's statistics: record counts before and after de-duplication

          Author/Series              Before   After
          Alexander McCall Smith        245     172
          Grace Livingston Hill        1119     549
          Mary Higgins Clark            771     386
          Magic School Bus (print)      554     218
          Danielle Steel               1235     718
    • Duplicate records cause "user information overload," "reduced system efficiency," "low cataloging productivity," and "increased cost for database maintenance" (Sitas and Kapidakis, 2008). "There is no question that merging such records is vital to effective user services in a cooperative environment" (Tennant, 2002).
    • What patrons think:
      – "Wish that you would list the most current book first and have only one entry for each book instead of showing multiple entries. Sometimes I have to look through 50-100 entries to see 20 books and the newest book by the author is entry 80. There should be a way to streamline this procedure."
      – "Consolidate entries for the same title. There are numerous entries on some titles beyond the breakdown of hard cover, PB, large print, audio, etc."
      – "Why so many listings for the same books? That's confusing."
      – "When I look up a book, many times I get two pages all of the same title with the same cover. It confuses me because I see that my library system doesn't have it, but if I scroll down... Whoops! We do have it. What is that all about? It sucks."
      – "Creating a standard for the way an item's information is entered. Some books only have half the title entered, and this can create problems when searching for specific materials."
    • Why?
    • Big library does not equal good data:
      – A large library does not always follow rules and adhere to standards
      – Because of their size, they may cut corners for "efficiency"
      – Local notes don't belong in subject fields (see the audit sketch below)
      – Make the time to check your data
      – Publishers are not catalogers' friends
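
    The point about local notes in subject fields lends itself to an automated audit. As a minimal sketch, assuming the pymarc library, a placeholder file name, and an illustrative list of tell-tale phrases (none of these come from the presentation), one could scan a file of MARC records and flag 6xx subject fields whose text looks like a local note:

      # Audit sketch: flag 6xx subject fields that look like misplaced local notes.
      # Assumes pymarc; the phrase list and file name are illustrative only.
      import re
      from pymarc import MARCReader

      LOCAL_NOTE_PATTERNS = re.compile(
          r"(donated by|gift of|copy \d|missing|damaged|local)", re.IGNORECASE
      )

      with open("bibs.mrc", "rb") as fh:              # placeholder path
          for record in MARCReader(fh):
              if record is None:                      # skip unreadable records
                  continue
              for field in record.get_fields("650", "651", "690"):
                  text = " ".join(field.get_subfields("a", "x", "z"))
                  if LOCAL_NOTE_PATTERNS.search(text):
                      ctrl = record["001"].value() if record["001"] else "?"
                      print(f"{ctrl}: suspicious subject field -> {text}")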
    • Examples of problem reference library records
    • [Image: IBM mainframe] http://www-03.ibm.com/ibm/history/exhibits/mainframe/mainframe_2423PH3090.html
    • Legacy system characteristics:
      – All were IBM-based systems
      – No tags, thus no definition of fields
      – All fields fixed length: so many characters allotted for each field
      – No standards: not required to enter pagination or publisher
      – Extraction of data was a problem: you had to count in to find the beginning of the next field (see the sketch below)
      – In many cases a publication date had to be supplied; one library has 1901 as the publication date on most of their extracted records
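
    To make the "count in to find the next field" problem concrete, here is a hedged sketch of parsing such fixed-length rows. The column layout, field names, and sample row are hypothetical, since the slide gives no actual record layout; the sketch also shows the supplied-date fallback that left one library with 1901 everywhere:

      # Fixed-width extraction sketch; the layout below is hypothetical.
      FIELD_LAYOUT = [             # (name, start offset, length) per row
          ("title",     0, 60),
          ("author",   60, 30),
          ("pub_date", 90,  4),
      ]
      DEFAULT_PUB_DATE = "1901"    # the fallback the slide describes

      def parse_row(row: str) -> dict:
          rec = {name: row[start:start + length].strip()
                 for name, start, length in FIELD_LAYOUT}
          if not rec["pub_date"]:  # no standards: date was often missing
              rec["pub_date"] = DEFAULT_PUB_DATE
          return rec

      sample = ("Gone with the wind".ljust(60)
                + "Mitchell, Margaret".ljust(30) + "    ")
      print(parse_row(sample))
      # {'title': 'Gone with the wind', 'author': 'Mitchell, Margaret', 'pub_date': '1901'}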
    • Records from a non-MARC system
    • Phase II [image: Wikimedia Commons] http://commons.wikimedia.org/wiki/Template:Potd/2007-01
    • Records with corrupted headings
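
    One hedged way to spot corrupted headings in bulk is to look for the characters a botched character-set conversion leaves behind; the heuristic and the sample headings below are illustrative assumptions, not the PINES procedure:

      # Corruption-detection sketch: flag headings containing U+FFFD or stray
      # control characters, both common residue of bad charset conversions.
      def looks_corrupted(heading: str) -> bool:
          if "\ufffd" in heading:                     # Unicode replacement char
              return True
          return any(ord(c) < 0x20 and c != "\t" for c in heading)

      headings = [
          "Garc\ufffda M\ufffdrquez, Gabriel",   # mangled diacritics
          "García Márquez, Gabriel",             # clean heading
      ]
      for h in headings:
          if looks_corrupted(h):
              print("corrupted heading:", h)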
    • Lessons learned:
      – Big library does not equal good data
      – Make the time to check your data
      – Publishers are not catalogers' friends
      – Be careful about CIP records with no description and records with multiple ISBNs
      – Come up with a realistic match for when records are the same but the information differs (see the match-key sketch below)
      – One library will not have the same good records across all its collections; it may have good print but bad AV
      – Expect lots of programming if there are multiple sources of records
      – Nothing (budget, personnel, time) is as important as concentrating on clean-up prior to migration
      – Be as specific as possible with vendors; test, and have a penalty clause
      – Have the right people in place from day one
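
    As a sketch of what a "realistic match" might look like (the normalization rules here are assumptions, not the PINES matching algorithm), duplicate candidates can be grouped on a key built from normalized title, author, and date, so punctuation and case differences do not block a match:

      # Match-key sketch: group records whose normalized title/author/date agree.
      import re
      from collections import defaultdict

      def _norm(s: str) -> str:
          # Lowercase and drop punctuation so trivial variation can't block a match.
          return re.sub(r"[^a-z0-9 ]", "", s.lower()).strip()

      def match_key(title: str, author: str, pub_date: str) -> tuple:
          return (_norm(title), _norm(author), pub_date[:4])

      # Two records for the same book, differing only in punctuation (illustrative).
      records = [
          ("The No. 1 Ladies' Detective Agency", "McCall Smith, Alexander.", "1998"),
          ("The no 1 ladies detective agency /", "McCall Smith, Alexander",  "1998"),
      ]
      clusters = defaultdict(list)
      for rec in records:
          clusters[match_key(*rec)].append(rec)
      for key, dupes in clusters.items():
          if len(dupes) > 1:
              print(f"{len(dupes)} candidate duplicates share key {key}")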
    • Enable discovery
    • Goodbye