• Share
  • Email
  • Embed
  • Like
  • Save
  • Private Content
PEPRS and the Keepers Registry
 

PEPRS and the Keepers Registry

on

  • 996 views

Presentation given on PEPRS and the Keepers Registry at the British Library UKSG 2011 event held in Boston Spa on Thursday 13th October 2011

Presentation given on PEPRS and the Keepers Registry at the British Library UKSG 2011 event held in Boston Spa on Thursday 13th October 2011

Statistics

Views

Total Views
996
Views on SlideShare
996
Embed Views
0

Actions

Likes
0
Downloads
3
Comments
0

0 Embeds 0

No embeds

Accessibility

Categories

Upload Details

Uploaded via as Microsoft PowerPoint

Usage Rights

© All Rights Reserved

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Processing…
Post Comment
Edit your comment
  • Going to talk about a metadata project which provides a key role in providing information for librarians and collection managers about the archiving arrangements for e-journals
  • This is the old scenario where users were faced with row upon row of printed journals.
  • Now users are inclined to access this information via computers although not quite like these ones!
  • All the statistics point to increased e-journal publication, expenditure and usage. RIN has done a lot of work to quantify the situation. Chart one shows availability by discipline and clearly it is no great surprise to learn that the sciences have a very high %. The second chart shows that the bigger publishers have moved into online in a bigger way than the smaller publishers but it is increasing no matter the size and more recent statistics would no doubt show significant increases. The third chart shows the increase in usage. There was a 23% increase in downloading between 2005/6 – 2006/07 and a 19% increase 2006-7. The increase is greater for the Scottish Higher Education Digital Libraries (SHEDL). 19.58% 2007-8 and 41.2% 2009-9.
  • Print aspects. Essentially under library control.
  • Essential aspect is that it is not under the control of the library as is the case with print.
  • Schemes have emerged to provide solutions for libraries but there is a key issue in trying to obtain a coherent overall view.
  • This is the key report on which PEPRS/theKeepers Registry is based.
  • There is a lot of background literature but the key report is that prepared by Rigtscm and Loughborough University. Essentially PEPRS has evolved from the findings in the report.
  • There are about 16 member libraries in the UKLOCKSS Alliance
  • Just under 50 participating publishers
  • 12k titles preserved
  • How do we find what is in the vaults but more critically how can we avoid having to look into each vault separately?
  • This is a snow drop showing that the Keepers Registry is essentially an aggregation of metadata from a number of suppliers or as they are called in the KR context archiving agencies.
  • The key components. Essentially the KR is based upon metadata from the different participating agencies associated with authoritative metadata from the ISSN Register.
  • These are the open source components used in the KR.
  • Demonstration of the beta service.
  • The Home page. Search Box, list of agencies with last update date, project news and tabs for journal title, publisher and FAQ and Help
  • Entering an ISSN. (Note that the p-ISSN or the e-ISSN can be entered. The KR uses the ISSN-L which links the two).
  • The results screen. In this case there is only a single record.
  • Search results when there are a few possibilities.
  • Full record display
  • Full record.
  • Full record display showing first of all that there are four different agencies involved and secondly the status codes. Only using two at present: Preserved and In process.
  • HathiTrust summary information.
  • HathiTrust extent information.
  • Browsing journals.
  • Browsing publishers.
  • Some information about Phase 2.
  • Key stages for Phase 2.
  • ISSN issues which have been discovered.
  • Extent information. A variety of practices
  • Different terms used. We had planned to make progress with standardising the vocabulary used but haven’t really managed to take this too far.
  • PAP has been in contact with us and they are very keen to be able to use any API which we develop.
  • It was always the intention to add additional agencies. We have added the HathiTrust to the original 5 and will be adding the data from NSLC in the relatively near future. We have received the data and are assessing and analysing it at the moment. Should be adding it within the next couple of months. Have drawn up a document on inclusion criteria and have received some feedback on it and will be seeking more and then issuing it to the potential candidate agencies.

PEPRS and the Keepers Registry PEPRS and the Keepers Registry Presentation Transcript

  • Finding out about the preservation of e-journals: an overview of the PEPRS project and the Keepers Registry Beta service Fred Guy, EDINA, University of Edinburgh British Library, Boston Spa 13 th October 2011 on behalf of the Project Team – EDINA/ISSN-IC
  • http://www.flickr.com/photos/sinclairlibrary/769777273/sizes/z/in/photostream/
  • Computer room in London School of Economics 1981 http://www.flickr.com/photos/lselibrary/4401344940/sizes/o/in/photostream/
  • Statistics related to e-journals RIN. E-only scholarly journals: overcoming the barriers . November 2010. 23% 3
  • Print – key aspects
    • Once purchased is owned by the library and can be retained, transferred to remote store or disposed of when library determines this
    • Library can check if other libraries hold the material and it can be consulted on the premises or be available via Inter-Library loan
    • Likely that it will be available in a national library via legal deposit legislation (goes back to 17 th century in UK)
  • E-journals: key aspects
    • Libraries are licensed for usage – do not host the material
    • Control lies with the publisher rather than with the subscriber
    • Publishers are not a constant in the life of a journal– titles are often transferred between publishers
    • Publishers may decide that they do not want to host back material
    • Legislation for legal deposit is not yet in place in UK and many other countries
  • Why a Preservation Registry?
    • Many schemes emerging to meet challenge
    • But who is doing what?
      • How can libraries & policy-makers assess which e-journals are being archived, by what methods, and under what terms of access?
    • JISC commissioned a scoping study for an e-journals preservation registry
      • the idea had been mentioned in the literature
  • Scoping Study for a Registry
  • Scoping Study Report Precedes PEPRS
    • Rightscom / Loughborough University, 2007
      • Confirmed expressed need among libraries and policy makers
      • Warned of potential burden on digital preservation agencies
      • Recommended:
        • an e-journals preservation registry should be built
        • UK Union Catalogue of Serials (SUNCAT) or SHERPA (Open Access) get involved
          • SUNCAT is hosted and managed at EDINA
  • PROJECT DETAILS
    • Phase 1 funded by JISC (Preservation Programme) from August 2008 – July 2010
    • EDINA, University of Edinburgh, grant recipient
    • Project partner – ISSN International Centre, Paris
    • Evaluation carried out by Charles Beagrie Limited for the JISC in February 2010
  • Digital Preservation Agencies in the Pilot
      • * Two 3 rd Party Organisations
        • CLOCKSS ( C ontrolled L ots O f C opies K eeps S tuff S afe)
        • Portico
      • * Two National Libraries (c.f. legal deposit)
        • British Library (BL) British Library e-Journal Digital Archive
        • Koninklijke Bibliotheek (KB e-Depot) KB, National Library of the Netherlands
      • Two library cooperatives
        • General LOCKSS Network ( L ots O f C opies K eeps S tuff S afe)
        • HathiTrust
  • The Agencies - LOCKSS
    • LOCKSS ( L ots O f C opies K eeps S tuff S afe), based at Stanford University Libraries, is an international community initiative that provides libraries with digital preservation tools and support so that they can easily and inexpensively collect and preserve their own copies of authorized e-content.
  • The Agencies - CLOCKSS
    • CLOCKSS ( C ontrolled LOCKSS ) is a not for profit joint venture between the world’s leading scholarly publishers and research libraries whose mission is to build a sustainable, geographically distributed dark archive with which to ensure the long-term survival of Web-based scholarly publications for the benefit of the greater global research community.
  • The Agencies - Portico
    • Portico provides libraries and publishers with a reliable, cost-effective solution to one of the most critical challenges facing the scholarly community today—ensuring that the electronic resources you rely on everyday will be accessible to future researchers, scholars, and students.
  • The Agencies – e-Depot
    • The e-Depot is a digital archiving environment that ensures long-term access to digital objects.
    • e-Depot is based at the Koninklijke Bibliotheek in The Hague
  • The Agencies – British Library
    • The BL preserves digital content that is collected but also material that is created, such as digitised collections. The store is an important component for forthcoming e-Legal Deposit.
  • The Agencies – the HathiTrust HathiTrust is a partnership of major research institutions and libraries working to ensure that the cultural record is preserved and accessible long into the future. There are more than fifty partners in HathiTrust, and membership is open to institutions worldwide. HathiTrust is a partnership of major research institutions and libraries working to ensure that the cultural record is preserved and accessible long into the future. There are more than fifty partners in HathiTrust, and membership is open to institutions worldwide. HathiTrust is a partnership of major research institutions and libraries working to ensure that the cultural record is preserved and accessible long into the future. There are more than fifty partners in HathiTrust, and membership is open to institutions worldwide. HathiTrust is a partnership of major research institutions and libraries working to ensure that the cultural record is preserved and accessible long into the future. There are more than fifty partners in HathiTrust, and membership is open to institutions worldwide. HathiTrust is a partnership of major research institutions and libraries working to ensure that the cultural record is preserved and accessible long into the future. There are more than fifty partners in HathiTrust, and membership is open to institutions worldwide.
  • What is in the vaults? http://www.flickr.com/photos/wka/4283285201 / http:// www.flickr.com/photos/mcfull/421644442/sizes/s/in/photostream /
  • http://www.flickr.com/photos/akeeh/4300472592/sizes/z/in/photostream/ Agency metadata Agency metadata Agency metadata Agency metadata Agency metadata The Keepers Registry Agency metadata
  • Creating the database Agency data ISSN Register ISSNs The Keepers Registry ISSN-L + p-ISSN & e-ISSN Register metadata Agency metadata
  • Open Source components used in the Keepers Registry Abstract Perl API supporting search and retrieval. Based on YAZ toolkit. ZOOM http://zoom.z3950.org/api/ Z39.50 support in Perl Each preservation agency supplies custom data at the moment, so scripts will be created for each data source. ISSN data is in MARC21 format and will be processed using MARC::Record CPAN package Custom Perl and CPAN packages including MARC::Record http://search.cpan.org/~gmcharlt/MARC-Record-2.0.2/ Normalisation Data files will be collected using FTP and HTTP. Custom Perl and CPAN packages Harvester Provides structured text indexing and retrieval. Fast and scales well. Provides powerful and flexible text retrieval capabilities. Zebra http:// www.indexdata.dk /zebra/ Database: metadata hosted by the Keepers Registry Offers fast and easy development and is extremely flexible Apache::ASP http://www.apache-asp.org / User interface Comment Software choice Component
  • Beta service demonstration
    • The Keepers Registry
  • HOME PAGE HOME PAGE
  •  
  • Search results screen – 1 record
  • Search Results screen – multiple records
  • Full record display
  • Variant title Variant title
  • 4 agencies Status Status
  • HathiTrust - summary
  • HathiTrust – full record display
  • Journal browse
  • Browse by publishers
  • PEPRS Phase 2
    • Funding provided from August 2010 – July 2012
    • Beta service – end of April 2011 www.peprs.org/
    • The Keepers Registry – October 2011 http:// thekeepers.org
    • Full service –2012
    • Involve international users in testing
  • Phase 2: key stages       Business Planning         Establishment of Advisory Board     0.4   0.3   0.2   0.1     Software releases                                           Full service operation       The Keepers Registry Beta service start                 PEPRS Beta service start and end                       Dec-12 Aug-12 May-11 Apr-12 Feb-11 Dec-11 Oct-11 Aug-11 Apr-11 Dec-10 Aug-10 ACTIVITY Phase 2 start and end PEPRS Development activity
  • ISSN issues
    • ISSNs missing in some agency records and some not in ISSN Register
    • Some duplicate records
    • Some p-ISSNs used as e-ISSNs
    • Some p-ISSNs linked via a common ISSN-L to a number of e-ISSNs but which one is correct?
    • Some were incorrect
  • Holdings information - variation
    • HathiTrust : Preserved: n.s. v. 3 (1883/85); n.s. v. 12 (1897/98); n.s. v. 16 (1903/04); 1864-1865; n.s. v. 4
    e-Depot : Preserved: v. 1 - 36, 38 - 46. UK LOCKSS Alliance : Preserved: v. 42 - 45. In progress: v. 46, 47. Portico : Preserved: (2002-2009) v.40, v.41, v.42, v.43, v.44, v.45, v.46, v.47 .
  • Terms used by preservation agencies
  • Involvement with international initiatives
    • Print Archives Program of the Center for Research Libraries – “ CRL is working with consortial partners to plan a prototype print archives framework to link existing print archiving efforts. has developed a searchable Print Archives Registry of information about print-archiving initiatives, including:
      • Projects
      • Serial Holdings.
  • Additional agencies
    • National Science Library, Chinese Academy of Sciences (NSLC) (c.20,000 journals with ISSNs) - metadata submitted.
    • Others to be considered: California Digital Library; Ontario Scholars Portal; Archaeology Data Service (York)
    • Drawing up inclusion criteria
  • PEPRS: Further information and Contact details
    • The Keepers Registry Beta service
    • http://thekeepers.org
    • Project website
    • http://edina.ac.uk/projects/peprs/index.html
    • Help Desk edina@ed.ac.uk
    • Fred Guy, EDINA, University of Edinburgh
    • [email_address]