Using a dumb identifier to do smart things
Upcoming SlideShare
Loading in...5
×
 

Using a dumb identifier to do smart things

on

  • 1,302 views

Presented by Peter Burnhill at the 2012 ACNP-NILDE Conference.

Presented by Peter Burnhill at the 2012 ACNP-NILDE Conference.

Statistics

Views

Total Views
1,302
Views on SlideShare
1,302
Embed Views
0

Actions

Likes
0
Downloads
0
Comments
0

0 Embeds 0

No embeds

Accessibility

Upload Details

Uploaded via as Microsoft PowerPoint

Usage Rights

© All Rights Reserved

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Processing…
Post Comment
Edit your comment
  • JISC is the Joint Systems Committee of the UK funding bodies for higher and further education. EDINA and Mimas are designated as National Data Centres, at the Universities of Edinburgh and Manchester, respectively. Like JISC itself and JANET / UKERNA, EDINA and Mimas are HEFCE-related Bodies, governed by Funding Agreements between HEFCE and those two Universities. JISC has a number of sub-committees which help inform policy and also watch over programmes of funding and the operation of services, such as those provided by the two National Data Centres. It has also set up a company, JISC Collections as a legal body to broker licences. UK = England, Scotland, Wales and Northern Ireland EDINA & Mimas are based at Universities of Edinburgh and Manchester
  • The priority task for librarians and academic support, locally and ‘at the network-level’, is to ensure that researchers, students and their teachers have ease and continuing access to online scholarly resources. Presented here is a framework and analysis that might assist understanding of that task. Particular attention is given to article-length work published in e-journals, available in digital format, online under some form of licence - either by subscription or to be observed by the user (as is the case with Creative Commons licensing). Ease of access is regarded here as to do with usability and with licence conditions and management of authorisation. Continuity of access is regarded here as including both long-term preservation and continuing access to back copy, regardless of current subscription status. There is also reminder that we all seek ease of access to back content as well as current content, for what is online in digital format as well as what has long been on-shelf as bound paper: ease and continuity of access to content on the digital shelf. The challenge is to reason what are the cost-effective points of service for that content, and identify the required infrastructure, typically as authoritative network-level registries - above the level of the institution, potentially of international as well as national character. This is about access; not considered here are the additional, although related tasks of ‘search & discovery’ and ‘sharing’, meaning the formal issue (or publication) of such material. Accordingly, only passing reference is made to rhe open access agend: journals registered in the DOAJ, or articles (most often the authors’ final copy) deposited in Institutional Repositories (IRs), and to the network-level role played by the Depot for that purpose.
  • 10 10 Phase 1 completed in 1997 Phase 2 agreed for 1998 & 1999 EDINA will host a CASA workshop in the UK during 1998 to focus on ISSN based identifiers
  • 12 12 13 Verify identifier, e.g. ISSN, SICI is the linchpin of the MODELS chain ISSN key to unique identification of serials provides an economic way to enhance existing records CASA will provide verify function through ISSN-based identifier
  • The priority task for librarians and academic support, locally and ‘at the network-level’, is to ensure that researchers, students and their teachers have ease and continuing access to online scholarly resources. Presented here is a framework and analysis that might assist understanding of that task. Particular attention is given to article-length work published in e-journals, available in digital format, online under some form of licence - either by subscription or to be observed by the user (as is the case with Creative Commons licensing). Ease of access is regarded here as to do with usability and with licence conditions and management of authorisation. Continuity of access is regarded here as including both long-term preservation and continuing access to back copy, regardless of current subscription status. There is also reminder that we all seek ease of access to back content as well as current content, for what is online in digital format as well as what has long been on-shelf as bound paper: ease and continuity of access to content on the digital shelf. The challenge is to reason what are the cost-effective points of service for that content, and identify the required infrastructure, typically as authoritative network-level registries - above the level of the institution, potentially of international as well as national character. This is about access; not considered here are the additional, although related tasks of ‘search & discovery’ and ‘sharing’, meaning the formal issue (or publication) of such material. Accordingly, only passing reference is made to rhe open access agend: journals registered in the DOAJ, or articles (most often the authors’ final copy) deposited in Institutional Repositories (IRs), and to the network-level role played by the Depot for that purpose.
  • The key aspects of this diagram are the data dependencies – seeking simplicity by linking out across the network to the most authoritative source. In this case, as shown in (a) it’s the ISSN Register for authoritative information on known e-journals – and after six years work the number of ISSNs assigned to e-serials has increased from about 13,000 to over 100,000 *probably* accounting for well over 95% of the 30 to 40 thousand e-journals that the academic and scientific world is interested in. And as shown in (b) what we want to have is authoritative statement made by the archiving organisations on their archival actions. Another feature of this approach is that the responsibility for being up-to-date is re-defined. Of course, there is some simplicity here. To name just two complexities. The fields for describing archival action – and the terms of access – are not standard and we have some way to go on that. Second, some of hose agencies – and there will be more being added all the time – want to report archival action on digitised journals, for which use of the the ISSN Register currently has major limitations. More on all that to be said later, along with the general comment that this schematic deals only with ‘titles’ not with the extent of a given title that is archived – it side-steps the matter of the holdings statement. But in Phase two, and with the release of the Public Beta we have begun to tackle ‘holdings’.
  • The priority task for librarians and academic support, locally and ‘at the network-level’, is to ensure that researchers, students and their teachers have ease and continuing access to online scholarly resources. Presented here is a framework and analysis that might assist understanding of that task. Particular attention is given to article-length work published in e-journals, available in digital format, online under some form of licence - either by subscription or to be observed by the user (as is the case with Creative Commons licensing). Ease of access is regarded here as to do with usability and with licence conditions and management of authorisation. Continuity of access is regarded here as including both long-term preservation and continuing access to back copy, regardless of current subscription status. There is also reminder that we all seek ease of access to back content as well as current content, for what is online in digital format as well as what has long been on-shelf as bound paper: ease and continuity of access to content on the digital shelf. The challenge is to reason what are the cost-effective points of service for that content, and identify the required infrastructure, typically as authoritative network-level registries - above the level of the institution, potentially of international as well as national character. This is about access; not considered here are the additional, although related tasks of ‘search & discovery’ and ‘sharing’, meaning the formal issue (or publication) of such material. Accordingly, only passing reference is made to rhe open access agend: journals registered in the DOAJ, or articles (most often the authors’ final copy) deposited in Institutional Repositories (IRs), and to the network-level role played by the Depot for that purpose.

Using a dumb identifier to do smart things Using a dumb identifier to do smart things Presentation Transcript

  • ACNP/NILDE Conferenze, Bari May 22th-23th 2012 Using a dumb identifier to do smart things Peter Burnhill Director, EDINA, University of Edinburgh, UK
  • Overview for talk0. Introduction – EDINA, JISC national datacentre at University of Edinburgh• Opportunity to share a digital library journey – and say hello to old friends• SUNCAT: union catalogue of serials in UK (& OpenURL Router)• Keepers Registry: e-journal preservation (& ISSN-L) – Digitised journals & print archiving• Entitlement Registry & KnowledgeBase (KB+)• Forward Look
  • research, learning & teaching in UK universities & colleges Formed acting as two platforms for network-level services NDCs in 1995/96, based upon Edinburghof JISC Integrated Information Environment as part University Data Library (1983/4) National Data CentresDigital Content Tools & & Metadata Infrastructure JISC Collections JISC Sub-Committees UK funding councils for HE & FE UK Research Councils
  • Overview1. Internet and emergence of the Web 1994 - – Digital access to metadata about Serial Content * Content was mostly PRINT ; becoming digital• SUNCAT & OpenURL Router, 2003 - – for Serial Content that is available as Print & Digital• Keepers Registry: e-journal preservation, 2008 – – focus on DIGITAL, Digitised (and maybe print)• Entitlement Registry & KnowledgeBasePlus, 2011/12 – for Serial Content that is DIGITAL (and subscribed)• Forward Look, with 2020 Vision!! – General scheme for DIGITAL & PRINT(?) Content
  • Spotlight upon essential role for the ISSNand upon useful changes made by ISSN Network: – Invention and use of the ISSN-L linking field – ISSNs for digitised journal content (d-journals) – Growth in ISSNs assigned to e-serials * circa 100,000 ISSNs for electronic ‘continuing resources’
  • ISSNs assignedto online‘continuingresources’by country
  • 1. Some Consequences of The Web/Internet • Essentials of supply chain have changed • Libraries do not take physical custody of key content * online remotely, not on-shelf locally • Role of libraries as trusted keepers of information and culture has been disrupted – Need assurance of continuity of access • Can re-state the role of libraries graphically …
  • Our Central Task: To ensure researchers, students and their teachers have ease and continuing access to online scholarly resources “ease” access “continuing” to content & services access to long-termusability Licence restricted back content preservation openClassic use case: libraries secure provision of journal content P.Burnhill, Edinburgh 2009
  • EDINAbegan in 1996 as a JISC National Datacentre at the University of Edinburgh – Art Abstracts, BIOSIS Previews, Compendex, EconLit, INSPEC, MLA, PAIS, etcIn 1994, we had launched SALSER, the first (one of) national union catalogues on the Web: – union catalogue of serials:60 libraries in Scotland * 14 universities, National Library of Scotland, special libraries * Emphasis on assisting the searcher and visits to other libraries – early use of WWW and also use of Z39.50 for federated searchingIn late 1990s, we gained knowledge of distributed architecture and Z39.50 in the JOIN-UP Programme: Table of Contents & British Library Document Supply, with federated searching using Z39.50 and then the use of OpenURL
  • EDINA – Art Abstracts, BIOSIS Previews, Compendex, EconLit, INSPEC,• We had SALSER, a national union catalogue for Scotland on the Web• We had knowledge of distributed architecture and Z39.50• We were learning the importance of the ISSNAnd then …… in 1995/6, CIB (Bologna) made contact to propose an EU project: the CASA Project …
  • CASA: ‘Co-operative Archive on Serials & Articles’• EU 4th Framework: Telematics for Libraries Programme• Prof. J. Di Cocco, CIB (Bologna, It); [A. Bollini, A. Citti, V. Verniti]• Partners: CIB, ISSN-IC, ICCU (It), Ariadne (It), EDINA (UK) + many others (It) – F.Pelle• 3-year, 2-phase project that began in 1997 – 2nd phase: January 1998 - December 2000• By 2nd phase, focus of project was switching from centralised ‘Archive’ to distributed ‘Activity’, hence new ‘descriptive title’:• CASA: ‘Co-operative Activity on Serials & Articles’ • (although ‘title proper’ remained unchanged) [From a PPT presented to EU Commission in Luxembourg, November 2000]
  • The information chainneed to ‘verify’ the object: ISSN as ‘identifier’to provide the linchpin not all serial lists have ISSN present  Verify identification of article/serial Discover Locate Request Access article of article/serial access/delivery article of interest service of article/serial interest Based on the MODELS verbs (UKOLN, UK) [From a PPT presented to EU Commission in Luxembourg, November 2000]
  • UK decides to create union catalogue of serials• www.SUNCAT.ac.uk , 2005 - – v. many titles, mostly print; CONSER + ISSN + DOAJ * British Library, Oxford, Cambridge etc; 85 libraries – ISSN used to enhance OPAC records & assist matching, but missing/unassigned ISSNs Now funded as a JISC Core Service
  • a Locate facility for researchers & students on the Web www.SUNCAT.ac.uk
  • and for researchers& students ‘on the go …’• Service enhancement: mobile App/geo-locate SUNCAT - ‘who holds what where?’
  • SUNCAT – extra functions1. Source of catalogue records for Contributing Libraries, including ISSN records2. Support for researchers to access to text, via: – Tables of Content (ToC) – the UK OpenURL Router * a register of OpenURL Routers used by Institutions – the UK Access Management Federation (Shibboleth)• Support for decision-making by libraries who want to remove little-used journals from their shelves – UKResearchReserve (UKRR) * de-shelving; print archiving ‘if it is worth archiving it should have an identifier’
  • 3. E-journal Preservation: Who looks after what? • Switch of focus: –Digital/online/electronic only –Digital preservation
  • Our Central Task: To ensure researchers, students and their teachers have ease and continuing access to online scholarly resources “ease” access “continuing” to content & services access to long-termusability Licence restricted back content preservation openClassic use case: libraries secure provision of journal content P.Burnhill, Edinburgh 2009
  • E-journal Preservation: Who looks after what?• EDINA had gained some experience through – CLOCKSS (EDINA represents U. of Edinburgh) and – UK LOCKSS Alliance (Digital Curation Centre & EDINA)• In 2007, JISC had commissioned scoping study for a e-journal preservation registry which suggested basing this on SUNCAT• EDINA & ISSN-IC combined as partners in the JISC-funded PEPRS Project in 2008.• The design of the E-journal Preservation Registry had the ISSN the centre from the start …
  • Abstract Data Model: Figure 1 in reference paper in Serials, March 2009 SERVICES: user requirements m2m/API E-J Preservation Registry Service Piloting an E-journals E-Journal METADATA Preservation Preservation on preservation action (b) Registry Registry Service (a) METADATA Digital Preservation Agencies e.g. CLOCKSS, Portico; BL, KB; on extant e-journals UK LOCKSS Alliance etc. Data dependency ISSN Register
  • E-journal Preservation: Who looks after what?• The outcome of the PEPRS project is the Keepers Registry, http://thekeepers.org – Launched as a full Beta service in October 2011, at the ISSN Directors’ Meeting in Sarajevo• Aiming to be a global facility, operating at web- scale, with international governance …
  • http://thekeepers.orgThe Keepers Registry allows you to search ontitle or ISSN* and with the ISSN-L as the key matchingfield, ISSNs for print can be tolerated *Also acts as showcasefor archivingorganisations3 CLOCKSS Nodes in Europe: Humboldt University, Germany Università Cattolica del Sacro Cuore, Italy University of Edinburgh, Scotland UK
  • ISSN & ISSN-L has been essentialProgress To Date: [ISSN-L as the basic unit] 17,000 unique titlesnow ‘preserved’ by the 7 archiving organisations registered asKeepers; ‘progress’ for a further 10,000.Some overlap, titles being preserved by more than oneagency: good But1. The coverage of volume/issues held is incomplete 2. Still many more titles to be preserved: • The 17,000 ‘preserved’ includes both the ‘born-digital’ e-
  • http://thekeepers.org Can also search on publisher** as bonus for EDINA & ISSN-IC,the Keepers report the currentpublisher in their metadata ** + 3 others waiting:This helps the searcher but, as noted later, from Canada, UK and USAthis could help build a PublisherInformation Database linked to the ISSN.
  • Busy building international support…2008: JISC Journals WG, London; ISSN National Directors Meeting2009: NASIG Annual Conference, Ashville NC, USA; Lib. of Academy of Science, Beijing; ISSN Directors, Beijing; PARSE.Insight, Germany; Knowledge Exchange, Edinburgh2010: E-journals are Forever Workshop, JISC/DPC, London; IFLA 2010, Gothenburg; RLUK Conference, Edinburgh; Columbia Univ., NYC2011: UKSG; ISSN Governing Body; ARL, Montreal; ALA; JISC Archiving Implementation Group (JARVIG) …2012: Digital Preservation Coalition; ACNP/NILDE; poster at LIBER (June); UNESCO Conference in Vancouver (September) P.Burnhill, F.Pelle, P.Godefroy, F.Guy, M.Macgregor, A.Rusbridge & C.Rees Piloting an e-journals preservation registry service. Serials 22(1) March 2009. [UK Serials Group]
  • 4. E-Licensing: Who has entitlement to what? • In days of print, what was paid for was put on the shelf, so the back-copy was there for perpetual access and use. Reminder Ensuring Continuity of Access means 2 things: 1. Preservation 2. Access to Back-Copy: the digital shelf
  • Our Central Task: To ensure researchers, students and their teachers have ease and continuing access to online scholarly resources “ease” access “continuing” to content & services long-termusability Licence restricted access to preservation back content open P.Burnhill, Edinburgh 2009
  • Reminder: Consequences of Web/Internet• Essentials of supply chain have changed * licensed to access, not sale for delivery of goods (content) * But is sale of good = access service + sale of entitlement??• Libraries do not take physical custody of key content * online remotely, not on-shelf locally• Role of libraries as trusted keepers of information and culture has been disrupted – Need assurance of continuity of access * of all content for future generations * of the back copies, especially if post-cancellation of the licence
  • E-Licensing: Who has entitlement to what? • In 2010, EDINA & JISC Collections began to investigate how best to support libraries (and their patrons) through access to e- journal content post-cancellation. • We have been looking to see how we can learn from the design for the Keepers Registry
  • licence registry entitlement history OpenURL Router Thoughts about a UK Private LOCKSS Network as dark store
  • E-Licensing: Who has entitlement to what? • About the same time, 2010/11, the UK Standing Committee of University & Research Libraries (SCURL) was investigating the cost-benefits of an infrastructure for ‘shared services’. • This is all coming together for a shared service for Electronic Resource Management (ERM) in what is called KnowledgeBasePLus (KB+) All work-in-progress, but some (more) diagrams that may suggest what is going on … … and the importance and potential role of ISSN
  • Library Provides OPAC & User Has Online Access Serial DOI ISSN Article delivers OPAC bibliographi c record Platform uses(“Two Systems” Dempsey) Library belongs to Reader Institution
  • Issuing has Authors’ Body Serial Vol. # Date Final Copy is DOI ISSN issued Is/authorises as Issue ArticleOrg/ID Publisher(s) Table of Contents ‘as RDF Triples’ Basic bibliographic elements Library belongs to Patron Org/ID Institution UKAMF
  • has Serial DOI ISSN Is/authorises Article Publisher(s) delivers Licence/ Holdings Package knows about PlatformSubscription Taken may differ in Licence Subscription Agent(Terms & Conditions) and Content: Taken Date(a) needed for digital library Terms(b) and by librarian ‘ buys uses“what serials are in Package?” Library belongs to Patron Institution UKAMF
  • eJournal Publications Database eISSN / ISSN-L has Serial which Serials in which Packages? Is/authorises -> Need very good metadataOrg/ID that relates Serials (ISSN) with current Publishers Publisher(s) Licence/ Is/authorises Holdings Package Licence to buy/access Serials Subscription Is often offered in Packages, Agent Subscription Offered and negotiated by Consortia Consortium NESLi Negotiator Deal Library’s own Deal Library PB/EDINA 2012 Institution
  • 5. Forward Look, with 2020 Vision!! – General scheme for DIGITAL & PRINT(?) Content – Semantic Web, RDF Triples, URIs and all that jazz – Print will be on shelves for a long while yet, and the visit to a library elsewhere may be as (or more) important as (than) a visit to your own library – Hope to be sitting on a beach in Puglia in 2020
  • In the UK I recall the JOIN-UP Programme in which 4 Projects tried to become an integrated set of National Services but failed  DiscoverPortals: Desk Local Subject Other top Inst. RDNProjects that ZETOC GetRef GetCopy DOCUSENDwanted to beServices:Rights: Subscriber Non-SubscriberMedium: Print Electronic Print ElectronicGeography: Local Remote Local Remote Union Aggregator/ Document DeliverySources: OPAC List Publisher Service
  • CASA WP4: Serial Services Directory Of fer Locat ion ( URI) Another interestingPart y Service It em Terms and Condit ions graphic. CASA WP4 made reference to XML/RDF + Z39.50 Serial Volume Issue Art icle But did not go beyond Loan Subscript ion Full-Text Delivery the prototype  ... dPublisher Supplier Union Cat alogLibrary ... Bollini, Burnhill & Di Cocco, 1998
  • To return to the essential role for the ISSN& useful changes being made by ISSN Network1. Invention and use of the ISSN-L linking field * Makes a ‘family’ of ISSN (print & for electronic) for easier linking– ISSNs for digitised journal content (d-journals) * (in my opinion) Makes the electronic/digital the general case – as print becomes the special case – ‘fixity’ of content * and do not need (or want) a different ISSN for iPads etc– ISSNs for e-serials will keep on growing * as digitised versions of (unassigned) print journals emerge (libraries)– Wish to have ISSN integrate within Semantic Web– Need to recognise other identifiers (eg DOI, those created in union catalogues, and a muddle of URIs) * They will not go away * Need for ‘cross-walk’ between different identifiers for same Serial, as ‘same as’ in RDF Triples
  • Grazie Mille  Time for Q&A ???Information about all projects and services, including ‘The Community Report: What EDINA Does’ http://edina.ac.uk A splendid Helpdesk: edina@ed.ac.uk Peter Burnhill: p.burnhill@ed.ac.uk
  • eJournal Publications Database eISSN / ISSN-LOrg/ID Issuing owns Authors’ Body Serial Vol. # Date Final Copy Imprint is DOI issued Is/authorises Date as Issue ArticleOrg/ID has lists Publisher(s) Table of Contents can deliver Licence/ Is/authorises Holdings Date Subscription Package Package knows about Platform Agent Subscription Subscription Agent Offered Taken DateOrg/ID Org/ID negotiates Terms uses Consortium buys NESLi Deal Is/authorises Library’s own Deal Library belongs to Patron PB/EDINA 2012 Institution UKAMF
  • eJournal Publications Database eISSN / ISSN-L Date RepNet Issuing has has Authors’ Body Serial Volume Final Copy DOI Keepers Is/authorises Date Issue ArticleTransfer Publisher(s) Imprint Table of Suncat Contents delivers Licence/ Is/authorises Holdings KB+ JouTOCs Subscription Package Package knows about Platform Agent Subscription Subscription Agent Offered Taken PeCAN JISC Coll. JUSP Date Consortium buys uses NESLi Negotiator Deal Library’s own belongs to Deal Library Patron PB/EDINA 2012 UKAMF