Next Generation Technical Services May 2009 Calhoun


Published on

This is a long version of the talk I gave for the Spring Assembly of the Librarians Association of the University of California, May 13, 2009, UC Riverside Palm Desert Campus

Published in: Education
1 Like
  • Be the first to comment

No Downloads
Total views
On SlideShare
From Embeds
Number of Embeds
Embeds 0
No embeds

No notes for slide
  • Next Generation Technical Services May 2009 Calhoun

    1. 1. Karen Calhoun Vice President WorldCat and Metadata Services OCLC Next Generation Library Collections Management LAUC May 2009
    2. 2. “ Next Generation” – During Whose Generation? By Joe Penniston
    3. 3. What Does “Next Generation Cataloging” Mean to Recent Survey Respondents? Source: Robert Young Survey, OCLC Western, spring 2009
    4. 4. What is NOT Next Gen? Redundancy Library 2 Library 3 Library 4 And on and on … Library 1
    5. 5. Redundancy Today <ul><li>In collection building </li></ul><ul><li>In acquisitions </li></ul><ul><li>In cataloging and metadata management </li></ul><ul><li>In off-site storage </li></ul><ul><li>In digital projects </li></ul><ul><li>… </li></ul>
    6. 6. From Jordan, Bill and Pozenel, Mindy. 2008. Presented as a CNI Project Briefing. Slide 35. <ul><li>In the ILS, all data is local </li></ul><ul><li>Local control once downloaded, they are “my” records </li></ul><ul><li>“ My” maintenance and improvements </li></ul><ul><ul><li>Post-cataloging authority control, cleanup </li></ul></ul><ul><li>Master record doesn’t get our improvements </li></ul><ul><li>We don’t get improvements to OCLC master records </li></ul><ul><li>Duplication of effort: all maintaining same bib records in our local systems </li></ul>
    7. 7. What Does ‘Next Gen’ Mean to Me? <ul><li>Disruptive </li></ul><ul><li>Simultaneously global, collective, and local </li></ul><ul><li>User-driven </li></ul><ul><li>Social </li></ul><ul><ul><li>Expert community </li></ul></ul><ul><ul><li>User community </li></ul></ul><ul><li>Multithreaded, mashed-up </li></ul><ul><li>Efficient </li></ul><ul><li>Economies of scale </li></ul><ul><li>Network effects </li></ul><ul><li>Moving into “the cloud” </li></ul>
    8. 8. GLOBAL GROUP LOCAL Outward Integration, Exposure, And Linking Of Collections (e.g., Google Books, WorldCat, Other aggregators, national libraries, consortia) Local/Group Authentication, Discovery And Delivery Services Data Sharing, Syndication, Synchronization, Linking 1. Coordinated Global, Collective, and Local Metadata Management
    9. 9. Connecting the Dots: “Metadata Switch” <ul><li>Discovery and delivery are mediated by large information hubs </li></ul>
    10. 10. “ Metadata Switch”: Click 3 Times and You’re Back in Kansas (er, UC Riverside) Google Books UC Riverside
    11. 11. Destination or Metadata Switch? It’s a switch. Referrals to, Feb. 1 – Apr. 30, 2009 12.3% Typed/Bookmarked URLs 46.9% Other Web Sites 40.7% Search Engines
    12. 12. The “3 S’s” of Metadata Exchange <ul><ul><li>Synchronization </li></ul></ul><ul><ul><li>Syndication </li></ul></ul><ul><ul><li>Sharing </li></ul></ul>
    13. 13. Synchronization : UC Libraries <ul><li>U.C. Santa Cruz - September 2008. UCSC is sending files for additional processing as they clean up unresolved records. </li></ul><ul><li>  </li></ul><ul><li>U.C. Davis – October 2008 - Included a special cancels project. Also completed onetime cleanup projects for On-order and In-Process records. </li></ul><ul><li>  </li></ul><ul><li>U.C. Santa Barbara - March 2008. </li></ul><ul><li>  </li></ul><ul><li>U.C. Berkeley - May 2008. Now have an ongoing project. These projects also involved loading Institution Records. </li></ul><ul><li>  </li></ul><ul><li>U.C. Davis, Health Sci Libr - October 2008. Also completed onetime cleanup projects for On-order and In process records. </li></ul><ul><li>  </li></ul><ul><li>U.C. Riverside - April 2009. </li></ul><ul><li>  </li></ul><ul><li>UCLA - Involves four OCLC symbols (CLU, ZAS, CLETH, UCFTA). Processing for CLU completed last week. </li></ul>
    14. 14. Synchronization: UC Libraries (2) <ul><li>In process or analysis: </li></ul><ul><li>UCLA </li></ul><ul><ul><li>Univ of California Southern Regional Lib Facility </li></ul></ul><ul><ul><li>UCLA Ethnomusicology Archive </li></ul></ul><ul><li>U.C. Irvine </li></ul><ul><li>U.C. San Diego </li></ul><ul><ul><li>Main library </li></ul></ul><ul><ul><li>University of California Shared Cataloging Program </li></ul></ul><ul><li>U.C. San Francisco </li></ul>
    15. 15. What is Syndication? Low resolution image of copyrighted work used for commentary on the topic of syndication. For news features like comics, syndication publishes the feature in multiple newspapers simultaneously. Web syndication makes website material available to multiple other sites.
    16. 16. Syndication: WorldCat Partners <ul><li>Abebooks ( ) </li></ul><ul><li>Alibris ( ) </li></ul><ul><li> ( ) </li></ul><ul><li>Antiquarian Booksellers' Association of America ( ) </li></ul><ul><li> ( ) </li></ul><ul><li>Better World Books ( ) </li></ul><ul><li>Biblio ( ) </li></ul><ul><li>BookMooch ( ) </li></ul><ul><li>BookPage ( ) </li></ul><ul><li>Boopsie ( ) </li></ul><ul><li>CompareEverywhere ( ) </li></ul><ul><li>DirectTextbook ( ) </li></ul><ul><li>ERIC ( ) </li></ul><ul><li>Goodreads ( ) </li></ul><ul><li>Google, Google Scholar and Google Books ( , , ) </li></ul><ul><li>Greenwood Publishing Group ( ) </li></ul><ul><li>HCI Bibliography ( ) </li></ul><ul><li>Internet Archive ( ) </li></ul><ul><li>Project MUSE ( ) </li></ul><ul><li>Questia ( ) </li></ul><ul><li>TRIS Database (NTB/NTIS) ( ) </li></ul><ul><li>Windows Live Search and Windows Live Academic ( , ) </li></ul><ul><li>Yahoo! Search ( ) </li></ul>These partner Web sites provide links to and/or other access to WorldCat-based information:
    17. 17. Data Sharing: WorldCat API <ul><li>Integrate WorldCat data into your web site </li></ul><ul><li>FRBR work clusters </li></ul><ul><li>Citations </li></ul><ul><li>Holdings (sorted geographically) </li></ul><ul><li>Citation links </li></ul><ul><li>Accepts: SRU & OpenSearch </li></ul><ul><li>Outputs: MARCXML, Dublin Core, RSS, Atom </li></ul><ul><li> </li></ul>
    18. 18. Data Sharing, Then and Now: A Time of Transition <ul><li>THEN: </li></ul><ul><li>“ A model based upon the value in the creation and control of data” </li></ul><ul><li>NOW: </li></ul><ul><li>A model based upon the value in the exchange and linking of data </li></ul>Janus, guardian of doors and gates
    19. 19. OCLC Review Board on Principles of Shared Data Creation and Stewardship <ul><li>Review Board appointed by OCLC Board of Trustees and Members Council </li></ul><ul><li>URL of Review Board page: </li></ul><ul><li>Online feedback forum (blog): http:// /   </li></ul><ul><li>Email:  [email_address] </li></ul><ul><li>Preliminary report at OCLC Members Council May 18-19 </li></ul>
    20. 20. 2. User Driven <ul><li>“ Quality” in the user workflow from discovery to delivery </li></ul><ul><ul><li>How does what end users say they want relate to what catalogers do? </li></ul></ul>The origin of the tire swing cartoon is discussed at
    21. 21. <ul><li>Key findings: </li></ul><ul><li>End users bring their expectations from popular Web sites to online catalogs </li></ul><ul><li>Different groups of end users have different priorities, but there are some commonalities across groups: </li></ul><ul><ul><li>The end user’s delivery experience is as important, if not more important than the discovery experience </li></ul></ul><ul><ul><li>Most important for analog materials: summaries, tables of contents, etc. </li></ul></ul><ul><ul><li>Most important for e- content: linking to the content itself </li></ul></ul>
    22. 22. How to Get the Report <ul><li>Publication of report May 2009 </li></ul><ul><li>Download free PDF or purchase printed version </li></ul><ul><li> </li></ul><ul><li>Many thanks to our reviewers! </li></ul>
    23. 23. 3. Social <ul><li>Blurring of the lines between metadata or content consumers and creators (e.g., SoundUnwound, Wikipedia, more) </li></ul><ul><li>Expert community </li></ul><ul><li>End user community </li></ul>
    24. 24. OCLC’s Expert Community Experiment <ul><li>Experiment to test “social cataloging” with OCLC’s expert community – began Feb 15 </li></ul><ul><li>Interest and motivation from WorldCat Local pilot sites that want to use WorldCat Local as their “database of record” </li></ul><ul><li>Allows member libraries with full-level Cataloging authos to make additions and changes to almost all fields in almost all records </li></ul>
    25. 25. Expert Community Experiment <ul><li>Allows member libraries with full-level Cataloging authos to make additions and changes to almost all fields in almost all records </li></ul><ul><ul><li>PCC records (both BIBCO and CONSER) are excluded </li></ul></ul><ul><ul><li>LC records not coded as PCC are included </li></ul></ul><ul><ul><li>More information: </li></ul></ul><ul><li> </li></ul>
    26. 26. End User Engagement: With Thanks to John MacColl’s “Are Archives the New Libraries?” <ul><li>Do not describe everything in painstaking detail </li></ul><ul><li>Start with basic description, then… </li></ul><ul><li>… allow serious researchers to contact you for more detail, and… </li></ul><ul><li>… engage your user community with adding to the descriptions </li></ul>
    27. 27. January 16 th 2008: LC photographs on Flickr John MacColl
    28. 28. 24 hours later Exposure John MacColl
    29. 29. Impact: exposure Flickr: Top 50 LC: Top 6000 John MacColl
    30. 30. Contributions How to lose control John MacColl
    31. 31. Go with it
    32. 32. 4. Multithreaded, Mashed-Up, Assembled from Various Sources, Dynamic/Changing, Social Remixes data from IMDb, Amazon’s music catalog, MusicBrainz, Mechanical Turk, more. “Views” by artist, album, genre, more. Links out to excerpts, content, other sites. Entries are editable by the community of users.
    33. 33. What factors influence an end user’s discovery experience today? <ul><li>“ Dewey arranged books by subject, but Amazon tries to find every way we might want to get from the A of a book we know to the B, C, and Z of books we don’t know, including the fact that lots of other people bought Z.” </li></ul><ul><li>--David Weinberger, Everything is Miscellaneous, p. 62. </li></ul>
    34. 34. What Will a ‘Record’ Be? <ul><li>Library metadata </li></ul><ul><li>practices: </li></ul><ul><li>Title </li></ul><ul><li>Author(s) </li></ul><ul><li>Publication details </li></ul><ul><li>Pagination, etc. </li></ul><ul><li>Series </li></ul><ul><li>Knowledge </li></ul><ul><li>‘ categories’: </li></ul><ul><ul><li>-Classification </li></ul></ul><ul><ul><li>-Subject </li></ul></ul><ul><li>ISBN </li></ul><ul><li>Etc. </li></ul><ul><li>Other attributes: </li></ul><ul><li>How/where to get it </li></ul><ul><li>Rights </li></ul><ul><li>How many sold </li></ul><ul><li>How many circulations </li></ul><ul><li>Banned anywhere? </li></ul><ul><li>Awards won? </li></ul><ul><li>Works cited by this one? </li></ul><ul><li>Reviews? </li></ul><ul><ul><li>-Publisher </li></ul></ul><ul><ul><li>-Reader </li></ul></ul><ul><li>Author information </li></ul><ul><li>Other editions </li></ul><ul><li>Technical metadata </li></ul><ul><li>Identifiers </li></ul><ul><li>Related works </li></ul><ul><li>Blog conversations </li></ul><ul><li>User tags </li></ul><ul><li>Derivative works </li></ul><ul><li>And on and on … </li></ul>
    35. 35. WorldCat Detail Record Redesign – Assembles Data from and Links to Various Sources …
    36. 36. … As Does WorldCat Identities
    37. 37. OCLC FRBR Work-set Algorithm <ul><li>Provides a FRBR-based view of the data </li></ul><ul><li>Records clustered into works using author and title fields from bibliographic and authority records </li></ul><ul><li>Author names and titles normalized to construct a work key </li></ul><ul><li>All records with the same key are grouped together in a work set or cluster </li></ul>Thanks to Diane Vizine-Goetz for the next sequence of slides
    38. 38. 4.87 records/work set for multi-record sets FRBR Work Set Count Cover Art 2,365,961 Summary/Abstract 6,278,014 Table of Contents 4,714,103 Total works sets 12,169,454 Single record sets 9,535,415 Multi-record sets 2,634,039
    39. 39. Share data elements across a FRBR Work Set
    40. 40. Work pages beta Provides a rich context from cataloging data
    41. 41. Project team <ul><ul><li>JD Shipengrover </li></ul></ul><ul><ul><li>Roger Thompson </li></ul></ul><ul><ul><li>Diane Vizine-Goetz </li></ul></ul><ul><ul><li>Harry Wagner </li></ul></ul>Abstracts Authority Data Form/Genre Terms FRBR Clusters Holdings Data Class Numbers Subject Headings Summaries Tables of Contents
    42. 46. Opening Metadata Silos <ul><li>Metadata </li></ul><ul><li>Communities </li></ul><ul><li>for: </li></ul><ul><li>Text </li></ul><ul><ul><li>Print </li></ul></ul><ul><ul><li>Licensed </li></ul></ul><ul><li>Digital </li></ul><ul><ul><li>Archival </li></ul></ul><ul><li>Data </li></ul><ul><li>Images </li></ul><ul><li>Sound </li></ul><ul><li>Video </li></ul><ul><li>Multimedia </li></ul><ul><li>Objects </li></ul><ul><li>More </li></ul>Tree silo By Northern Miniatures
    43. 47. Publisher Supply Chain Data Flow + MARC Records and Library Data Flow By Renee Register
    44. 48. New Models for Creating and Sharing Metadata By Renee Register
    45. 49. Metadata Reuse and Exchange: OCLC Next Generation Cataloging <ul><li>Establish partnerships with publishers </li></ul><ul><li>Ingest publisher and vendor metadata in ONIX </li></ul><ul><li>Crosswalk to MARC (poss. to other formats) </li></ul><ul><li>Enhance publisher metadata </li></ul><ul><li>Output MARC records (available early in the publishing life-cycle) </li></ul><ul><li>Output enhanced ONIX data to publishers/other partners </li></ul><ul><li> </li></ul>FY09 objective: Launch Next Generation Cataloging
    46. 50. 5. Moving into the cloud The (invisible) cloud of complexity on the global network <ul><li>Text </li></ul><ul><ul><li>Print </li></ul></ul><ul><ul><li>Licensed </li></ul></ul><ul><ul><li>Digital </li></ul></ul><ul><ul><li>Archival </li></ul></ul><ul><li>Data </li></ul><ul><li>Images </li></ul><ul><li>Sound </li></ul><ul><li>Video </li></ul><ul><li>Multimedia </li></ul><ul><li>Objects </li></ul><ul><li>Other people </li></ul><ul><li>More </li></ul>Expectation: Easily Find AND Easily Get
    47. 51. 5. Moving “into the cloud” <ul><li>The cloud – like serving up electricity </li></ul><ul><li>“ I don’t care what’s up there as long as it works” </li></ul><ul><li>“ All you need is an Internet connection” </li></ul><ul><li>Don’t need software, don’t need hardware (except your laptop) </li></ul><ul><li>Web 2.0 </li></ul><ul><li>“ The network is the platform” </li></ul><ul><li>Connected devices </li></ul><ul><li>Network effects </li></ul>YouTube video: What is cloud computing? By WhatKnot From home generators to … THE GRID
    48. 52. How Might OCLC Help? <ul><li>Harness the cooperative by enabling libraries to share hardware, services and data ‘in the cloud’ </li></ul><ul><li>WorldCat Local, as a network-level end-user interface, is the first step </li></ul><ul><li>Continue by building cloud services on top of WorldCat to support library resource management </li></ul><ul><li>Generate significant savings for libraries to deploy to new initiatives </li></ul>
    49. 53. OCLC Cooperative Web-scale Library Management Services <ul><li>Web-Scale … </li></ul><ul><ul><li>Circulation and Delivery </li></ul></ul><ul><ul><li>Print and Electronic Acquisitions </li></ul></ul><ul><ul><li>License Management </li></ul></ul><ul><ul><li>Self-Configuration </li></ul></ul><ul><li>More at </li></ul>
    50. 54. Moving Tech Services to the Cloud: What Would We Need to Do Differently? <ul><li>Incrementally move technical services to the network </li></ul><ul><li>Think of selection-ordering-receiving-payment-cataloging-catalog maintenance as ONE stream </li></ul><ul><li>Normalize or step away from many local practices </li></ul><ul><li>Exploit all sources of selection, acquisitions, cataloging and classification data </li></ul><ul><li>Mine identifier, catalog, authority, and classification data for new uses (e.g., data services, collocation, browsing, visualization) </li></ul><ul><li>Metadata production – manual (special collections) and automated (core collections of published materials) </li></ul><ul><li>Move beyond MARC (new metadata management platform) </li></ul>
    51. 55. In Conclusion: Opportunities for ‘Next Generation’ Library Metadata Practice <ul><li>Take the collections to a wider audience: be where their eyes are; expose your metadata! </li></ul><ul><li>Open up metadata silos ; support metadata exchange, reuse, interoperation </li></ul><ul><li>Develop user-centered definitions of metadata ‘quality’; engage with users as metadata contributors </li></ul><ul><li>Move metadata management to the cloud </li></ul>
    52. 56. What Will It Mean “to Catalog”? <ul><li>Point and click ‘in the cloud’ </li></ul><ul><li>Fuzzier boundaries between consumers and creators of metadata </li></ul><ul><li>Many types of information objects (things and people) </li></ul><ul><li>Metadata production – manual (special collections) and automated (core collections of published materials) </li></ul><ul><li>Metadata reuse, conversion, ‘mash-ups’ </li></ul><ul><li>Data mining; metadata consulting, design and development </li></ul>By Angela Ben de Cosanostra
    53. 57. Thank You! [email_address] http://