“Who does forever?” : A Registry of Keepers
Who is looking after e-journals with archival intent?
2. Dr Who and the Scholarly Record
Time Travel for Scholarly Web
Evidence from the Keepers Registry
Statistics on who is looking after what, & what is at risk
Delivered by Peter Burnhill at CNI Fall 2014 Membership Meeting, December 8-9, 2014
Washington, DC. This is about ensuring that online serial content, whether issued in parts or changes over time via a website, continues to be available for scholarship. The central take home message is that we all have a lot still to do.
Overview of the problems of Reference Rot and what actions to take to ensure the persistence of the digital scholarly record. Presented by Peter Burnhill with Adam Rusbridge & Muriel Mewissen, EDINA, University of Edinburgh, UK; Herbert Van De Sompel, Los Alamos National Laboratory Research Library, USA; Gaelle Bequet, ISSN International Centre, France; at Towards Open Science, LIBER, London, June 2015.
Delivered by Peter Burnhill, Director of EDINA, at the PRELIDA Consolidation and Dissemination workshop on 17/18 October 2014 (http://prelida.eu/consolidation-workshop).
Summary: The web changes over time, and significant reference rot inevitably occurs. Web archiving delivers only a 50% chance of success. So in addition to the original URI, the link should be augmented with temporal context to increase robustness.
Presented in Glasgow at UKSG, 31 March - 1 April, by Peter Burnhill and Richard Wincewicz.
This presentation looks at reference rot, link rot, and the work of Hiberlink to ensure web citations persist through time.
Presentation given by Peter Burnhill, director of EDINA, at #ReCon_15 : Beyond the paper: publishing data, software and more. Edinburgh, 19 June 2015
Peter Burnhill
http://reconevent.com/
Presented by Peter Burnhill at the ost ALA Annual Holdings Update Forum, Universal and repurposed holdings information -- Emerging initiatives and projects, Morial Convention Center, New Orleans, Louisiana, USA, 25 June 2011
Delivered by Peter Burnhill at CNI Fall 2014 Membership Meeting, December 8-9, 2014
Washington, DC. This is about ensuring that online serial content, whether issued in parts or changes over time via a website, continues to be available for scholarship. The central take home message is that we all have a lot still to do.
Overview of the problems of Reference Rot and what actions to take to ensure the persistence of the digital scholarly record. Presented by Peter Burnhill with Adam Rusbridge & Muriel Mewissen, EDINA, University of Edinburgh, UK; Herbert Van De Sompel, Los Alamos National Laboratory Research Library, USA; Gaelle Bequet, ISSN International Centre, France; at Towards Open Science, LIBER, London, June 2015.
Delivered by Peter Burnhill, Director of EDINA, at the PRELIDA Consolidation and Dissemination workshop on 17/18 October 2014 (http://prelida.eu/consolidation-workshop).
Summary: The web changes over time, and significant reference rot inevitably occurs. Web archiving delivers only a 50% chance of success. So in addition to the original URI, the link should be augmented with temporal context to increase robustness.
Presented in Glasgow at UKSG, 31 March - 1 April, by Peter Burnhill and Richard Wincewicz.
This presentation looks at reference rot, link rot, and the work of Hiberlink to ensure web citations persist through time.
Presentation given by Peter Burnhill, director of EDINA, at #ReCon_15 : Beyond the paper: publishing data, software and more. Edinburgh, 19 June 2015
Peter Burnhill
http://reconevent.com/
Presented by Peter Burnhill at the ost ALA Annual Holdings Update Forum, Universal and repurposed holdings information -- Emerging initiatives and projects, Morial Convention Center, New Orleans, Louisiana, USA, 25 June 2011
Delivered by Peter Burnhill at Academic Publishing in Europe 9, 29 January 2014. Our shared task is to ensure ease and continuity of access to the scholarly & cultural record.
An overview of how the Hiberlink project relates to the persistence on the web of digital versions of theses. Given by Peter Burnhill, Director of EDINA, at the 17th International Symposium on Electronic Theses & Dissertations - which took place from 23 July to 25 July 2014 at the University of Leicester in the UK.
In order to be reused, research data must be discoverable.
The EPSRC Research Data Expectations* requires research organisations to maintain a data catalogue to record metadata about research data generated by EPSRC-funded research projects.
Universities are increasingly making research data assets available through repositories or other data portals.
The requirement for a UK research data discovery service has grown as universities become more involved in RDM and capacity develops.
Presenter: Peter Burnhill, Director, EDINA national academic data centre, University of Edinburgh, Scotland UK
Presentation given at Beyond Books: What STM & Social Science publishing should learn from each other Marriott Hotel/Kensington, London, 22 April 2010
Delivered by Peter Burnhill at Academic Publishing in Europe 9, 29 January 2014. Our shared task is to ensure ease and continuity of access to the scholarly & cultural record.
An overview of how the Hiberlink project relates to the persistence on the web of digital versions of theses. Given by Peter Burnhill, Director of EDINA, at the 17th International Symposium on Electronic Theses & Dissertations - which took place from 23 July to 25 July 2014 at the University of Leicester in the UK.
In order to be reused, research data must be discoverable.
The EPSRC Research Data Expectations* requires research organisations to maintain a data catalogue to record metadata about research data generated by EPSRC-funded research projects.
Universities are increasingly making research data assets available through repositories or other data portals.
The requirement for a UK research data discovery service has grown as universities become more involved in RDM and capacity develops.
Presenter: Peter Burnhill, Director, EDINA national academic data centre, University of Edinburgh, Scotland UK
Presentation given at Beyond Books: What STM & Social Science publishing should learn from each other Marriott Hotel/Kensington, London, 22 April 2010
An overview of what EDINA has to offer to researchers in UK HE and FE. Presented by Nicola Osborne and Lisa Otty at Supporting Digital Scholarship in CHSS on 2 December 2015
Slides accompanying the University of Edinburgh Digital Day of Ideas 2016 (#DigScholEd) workshop on Tweeting and Blogging for Academics run by Nicola Osborne (EDINA) and Lorna Campbell (EDINA/LTW). The workshop took place on 18th May 2016. Read more about the event here: http://www.digital.hss.ed.ac.uk/ddi/ddi-2016/
Presented by Adam Rusbridge at e-Journals are forever? Preservation and Continuing Access to e-journal Content. A DPC, EDINA and JISC joint initiative, British Library, London, 26 April 2010.
Presentation given by Stuart Macdonald on Addressing History at RunCoCo workshop: Community Collection Online - sustainability and business models, University of Leeds, 3 November 2010
Stronger together: community initiatives in journal managementJisc
There has been a recent growth of initiatives to address common problems regarding current and long-term access to e-journal content. Jisc is at the forefront of many of these with the close participation and active input of educational institutions.
This session aims to summarise the current state of key themes with pointers to future directions of areas such as sustainability, the move towards e-only environments, and shared consortia approaches. It will provide an overview and panel discussion on developing the supporting infrastructure to meet the needs of users. The discussion will focus on how institutions, community bodies and service providers can best work together to ensure sustainable, long-term initiatives by seeking to introduce uniformity, standardisation and collaboration to an even greater extent.
The session will introduce two new Jisc-supported projects in this area, the Keepers Registry Extra and SafeNet initiatives, and discuss how these fit alongside existing Jisc services such as Knowledge Base+, UK LOCKSS Alliance, Journal Archives and JUSP (Journal Usage Statistics Portal). The panel will address how this catalogue of services contributes towards a coherent strategy in the management of e-journal content.
The state of play currently with the preservation of all things webby and concrete actions to take. Delivered by Peter Burnhill at the ALSP event "Standing on the Digits of Giants: Research data, preservation and innovation" on 8 March 2015 in London.
Stewardship of the Digital Scholarly Record and Digital Published HeritageNASIG
This presentation discusses how The Keepers Registry and the network of Keepers is attempting to tackle the issue of digital preservation for electronic serials specifically. First identifying the scope of the problem being addressed, it moves on to the successes, in preservation and in measuring that preservation, before moving on to the challenges still to be surmounted. It touches upon some of the specific cases on which this preservation is focussing, including legal deposit and regional library consortia, as well as engagement with OA journals. It finishes with the broader plan of action to help allow the Keepers to accomplish their digital preservation goals, laid out in the statement they issued last August, calling upon all stakeholders in the world of scholarly communication, notably both publishers and research libraries, and setting actions they can take to help in this mission.
Speaker: Ted Westervelt, Library of Congress
Web Today, Good Tomorrow? Transactional archiving of web contentPeter Burnhill
Report from Hiberlink Project into threat of and remedy for Reference Rot. Archiving what is cited on the web. Need for action by scholarly publishers.
As delivered at Innovators Session, Professional/Scholarly Publishing (PSP) Division, Association of American Publishers (AAP), Washington DC, 1-4 February 2017.
HIBERLINK: Reference Rot and Linked Data: Threat and RemedyPRELIDA Project
Peter Burnhill (EDINA, University of Edinburgh), presented at the 3rd PRELIDA Consolidation and Dissemination Workshop, Riva, Italy, October, 17, 2014. More information about the workshop at: prelida.eu
For many libraries, an institutional repository is an online archive to collect, preserve, and make accessible the intellectual output of an institution. For a growing bloc, the goal is to go further, beyond knowledge preservation to knowledge creation. These libraries are using their repositories to provide faculty with a proven publishing option by facilitating the production and distribution of original content often too niche for traditional publishers.
How do metadata librarians sift the incoming metadata with these different goals in mind? How do they optimize content for discovery in a wide range of resources such as online catalogs, external research databases, and major search engines? For a library that is also providing publishing services, what additional steps are necessary?
As the provider of Digital Commons, a repository and publishing platform for over 350 institutions, bepress has first-hand experience with these topics, and our consultants advise regularly on best practices for collecting, publishing, distributing, and archiving content. This presentation is intended for library professionals, whether their goal is to collect previously published works or to go further into library-led publishing. After an overview of common sources and destinations for metadata, attendees will come away with a set of considerations for streamlining workflows and optimizing content for discovery and distribution in major venues.
Eli Windchy is the VP, Consulting Services at bepress which provides software and services to the scholarly community. She received a Master's in Archaeology from University of Virginia, taught organic gardening, and for the last ten years has also been getting dirty with the metadata of Digital Commons repositories. She co-directs courses in institutional repository management and publishing, and she enjoys addressing the challenges of interoperability and scholarly communication.
Presented by Peter Burnhill and Lisa Otty at 36th Annual IATUL Conference in Hannover, Germany, 5 - 9 July 2015 “Strategic Partnerships for Access and Discovery”
Web Today, Good Tomorrow? Transactional archiving of web content [Long Version]Peter Burnhill
Report from Hiberlink Project into threat of and remedy for Reference Rot. Archiving what is cited on the web. Need for action by scholarly publishers, the software they use and the software used by authors in note taking.
As delivered at Innovators Session, Professional/Scholarly Publishing (PSP) Division, Association of American Publishers (AAP), Washington DC, 1-4 February 2017.
Long Version, containing extra slides hidden in presentation.
Presentation given by Peter Burnhill of EDINA, at the Digital Preservation Coalition's "Trust and E-journals" event on 31 January 2012 at the Wellcome Collection Conference Centre, Euston Road, London, UK.
Presented by Peter Burnhill at the "Taking the Long View" conference in Edinburgh, 7 September 2015.
Strategies to ensure long-term access to digital collections.
Come Together: Interdepartmental Collaboration to Connect the IR and Library ...NASIG
Presenter: Amanda Makula, University Of San Diego
While institutional repositories (IRs) often include a built-in searching mechanism and/or are indexed by web search engines, what about our patrons who go straight to the library catalog with their information need? Rather than hope that users will stumble upon the IR from the library website or assume that they will start their research with a Google search, librarians can facilitate greater IR discoverability and usage by integrating its content into the library catalog. With strong teamwork, good communication, and a shared vision, this endeavor helps transform the IR and library catalog from separate, siloed platforms into a more cohesive collections package.
At the University of San Diego, librarians and administrators across three departments -- Technical Services, Systems, and Archives / Special Collections / Digital Initiatives --recognized this opportunity and came together to share information and work in concert to explore and enact the benefits of auto-harvesting IR content into the library catalog. Driven by a vision of providing enhanced discoverability and access, as well as promoting the IR as a whole and enriching the catalog, the team members worked cooperatively to identify specific IR collections appropriate for harvest, investigate technical logistics, consult outside vendors (including Innovative and bepress), and experiment with implementation.
Similar to Tales from the Keepers Registry: Dr Who and the Scholarly Record (20)
A look at the research being carried out by Dr Stuart Dunn at Kings College London. This includes his work on rediscovering Corpse Paths in Great Britain.
A presentation by Clare Rowland from the Centre for Ecology and Hydrology given at EDINA's GeoForum 2017 about the new Landcover 2015 data now available in Environment Digimap.
A presentation by John Murray from Fusion Data Science given at EDINA's GeoForum 2017 about the use of Lidar Data and the technology and techniques that can be used on it to create useful datasets.
Slides accompanying the presentation:"Reference Rot in Theses: A HiberActive Pilot", a 10x10 session (10 slides over 10 minutes) presented by Nicola Osborne (EDINA, University of Edinburgh). This presentation was part of Repository Fringe 2017 (#rfringe17) held on 3rd August 2017 in Edinburgh. The slides describe a project to develop Site2Cite, a new (pilot) tool for researchers to archive their web citations and ensure their readers can access that archive copy should the website change over time (including "Reference Rot" and "Content Drift").
Slides accompanying the "If I Googled You, What Would I Find? Managing your digital footprint" session at the CILIPS Conference 2017: Strategies for Success, presented at the Apex Hotel, Dundee, on Tuesday 6th June 2017 by Nicola Osborne, EDINA Digital Education Manager.
"Managing your Digital Footprint : Taking control of the metadata and tracks and traces that define us online" invited presentation for CIG Scotland's 7th Metadata & Web 2.0 Seminar: "Somewhere over the Rainbow: our metadata online, past, present & future", which took place at the National Library of Scotland, 5th April 2017.
Slides accompanying Nicola Osborne's(EDINA Digital Education Manager) session on "Social media and blogging to develop and communicate research in the arts and humanities" at the "Academic Publishing: Routes to Success" event held at the University of Stirling on 23rd January 2017.
"Enhancing your research impact through social media" - presentation given by Nicola Osborne, EDINA Digital Education Manager, at the Edinburgh Postgraduate Law Conference 2017 (19th January 2017).
Social Media in Marketing in Support of Your Personal Brand - Nicola Osborne, EDINA Digital Education Manager, for Abertay University (Dundee) 4th Year Marketing Students.
Best Practice for Social Media in Teaching & Learning Contexts, slides accompanying a presentation by Nicola Osborne, EDINA Digital Education Manager, for Abertay University (Dundee). The hashtag for this event was #AbTLEJan2017.
Safalta Digital marketing institute in Noida, provide complete applications that encompass a huge range of virtual advertising and marketing additives, which includes search engine optimization, virtual communication advertising, pay-per-click on marketing, content material advertising, internet analytics, and greater. These university courses are designed for students who possess a comprehensive understanding of virtual marketing strategies and attributes.Safalta Digital Marketing Institute in Noida is a first choice for young individuals or students who are looking to start their careers in the field of digital advertising. The institute gives specialized courses designed and certification.
for beginners, providing thorough training in areas such as SEO, digital communication marketing, and PPC training in Noida. After finishing the program, students receive the certifications recognised by top different universitie, setting a strong foundation for a successful career in digital marketing.
Normal Labour/ Stages of Labour/ Mechanism of LabourWasim Ak
Normal labor is also termed spontaneous labor, defined as the natural physiological process through which the fetus, placenta, and membranes are expelled from the uterus through the birth canal at term (37 to 42 weeks
Biological screening of herbal drugs: Introduction and Need for
Phyto-Pharmacological Screening, New Strategies for evaluating
Natural Products, In vitro evaluation techniques for Antioxidants, Antimicrobial and Anticancer drugs. In vivo evaluation techniques
for Anti-inflammatory, Antiulcer, Anticancer, Wound healing, Antidiabetic, Hepatoprotective, Cardio protective, Diuretics and
Antifertility, Toxicity studies as per OECD guidelines
Introduction to AI for Nonprofits with Tapp NetworkTechSoup
Dive into the world of AI! Experts Jon Hill and Tareq Monaur will guide you through AI's role in enhancing nonprofit websites and basic marketing strategies, making it easy to understand and apply.
2024.06.01 Introducing a competency framework for languag learning materials ...Sandy Millin
http://sandymillin.wordpress.com/iateflwebinar2024
Published classroom materials form the basis of syllabuses, drive teacher professional development, and have a potentially huge influence on learners, teachers and education systems. All teachers also create their own materials, whether a few sentences on a blackboard, a highly-structured fully-realised online course, or anything in between. Despite this, the knowledge and skills needed to create effective language learning materials are rarely part of teacher training, and are mostly learnt by trial and error.
Knowledge and skills frameworks, generally called competency frameworks, for ELT teachers, trainers and managers have existed for a few years now. However, until I created one for my MA dissertation, there wasn’t one drawing together what we need to know and do to be able to effectively produce language learning materials.
This webinar will introduce you to my framework, highlighting the key competencies I identified from my research. It will also show how anybody involved in language teaching (any language, not just English!), teacher training, managing schools or developing language learning materials can benefit from using the framework.
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdfTechSoup
In this webinar you will learn how your organization can access TechSoup's wide variety of product discount and donation programs. From hardware to software, we'll give you a tour of the tools available to help your nonprofit with productivity, collaboration, financial management, donor tracking, security, and more.
Macroeconomics- Movie Location
This will be used as part of your Personal Professional Portfolio once graded.
Objective:
Prepare a presentation or a paper using research, basic comparative analysis, data organization and application of economic information. You will make an informed assessment of an economic climate outside of the United States to accomplish an entertainment industry objective.
Operation “Blue Star” is the only event in the history of Independent India where the state went into war with its own people. Even after about 40 years it is not clear if it was culmination of states anger over people of the region, a political game of power or start of dictatorial chapter in the democratic setup.
The people of Punjab felt alienated from main stream due to denial of their just demands during a long democratic struggle since independence. As it happen all over the word, it led to militant struggle with great loss of lives of military, police and civilian personnel. Killing of Indira Gandhi and massacre of innocent Sikhs in Delhi and other India cities was also associated with this movement.
Honest Reviews of Tim Han LMA Course Program.pptxtimhan337
Personal development courses are widely available today, with each one promising life-changing outcomes. Tim Han’s Life Mastery Achievers (LMA) Course has drawn a lot of interest. In addition to offering my frank assessment of Success Insider’s LMA Course, this piece examines the course’s effects via a variety of Tim Han LMA course reviews and Success Insider comments.
Tales from the Keepers Registry: Dr Who and the Scholarly Record
1. Association of Subscription Agents & Intermediaries
ASA ANNUAL CONFERENCE 2014
24-25 February 2014
Tales from the Keepers Registry
Dr Who and the Scholarly Record
Peter Burnhill
EDINA, University of Edinburgh, UK
http://creativecommons.org/licenses/by/3.0/
2. Overview
1. “Who does forever?” : A Registry of Keepers
Who is looking after e-journals with archival intent?
2. Dr Who and the Scholarly Record
Time Travel for Scholarly Web
3. Evidence from the Keepers Registry
Statistics on who is looking after what, & what is at risk
3. Some Consequences of Web
• Essentials of supply chain have changed
• licensed to access, not sale of content
• Libraries no longer take physical custody of much
“The Library [Committee], which
key content made up of librarians and
is
• online academics,
remotely, not on-shelf locally
• Role of libraries as reassurance about
… wants trusted keepers of information
long-term disrupted
and culture has beenpreservation before
confirming a University policy
– Need assurance of continuity of access
of goinge-only.”
• of all content for future generations
from email sent by a the licence
• of the back copies, post-cancellation of big UK Library
• Does this mean that the Scholarly Record is at risk?
4. 1. “Who does forever?”
Many reports over past 10 years highlighted risks
•
„digital decay‟: format obsolescence & bit rot
and warned against single points of failure:
•
•
natural disasters (earthquake, fire and flood)
human folly (criminal and political action): hacking
+ risks with commercial events in the publisher/supply chain
Some early archiving initiatives emerged …
•eDepotat KoninklijkeBibliotheek
• international significance(Elsevier &Kluwer)
•the LOCKSS project at Stanford University
• from which came CLOCKSS [as library/publisher „dark archive‟]
•the electronic-archiving initiative at JSTOR
• from which came Portico[as service provider]
5. A „global challenge‟: trans-national action
UK.BL 10%
Netherlands
& Germany:
c. 4.5% each
„hidden‟ e-journals:
low % ISSN
US.LoC 20%
Brazil 4%
%age of the 113,000 ISSN issued for e-serials
Researchers (and therefore libraries) in any one country
are dependent upon content written and published
in countries other than their own
6. A Variety of „Archiving Organisations‟
① web-scale
not-for-profit
archiving agencies
e.g. CLOCKSS Archive & Portico
② national libraries (with legal deposit in mind)
e.g. e-Depot (Netherlands); British Library;DnB etc
③
research libraries: consortia & specialist centres
e.g. Global LOCKSS Network, HathiTrust,
Scholars Portal, Archaeology Data Service
Disclaimer:
University of Edinburgh is a CLOCKSS Node & Board Member:
Jisc supports UK LOCKSS Alliance
7. How can we know who is looking after what & how?
(and uncover what is still at risk)
SERVICES: user
requirements
E-J Preservation Registry Service
Data
dependency
E-Journal
Preservation
Registry
(b)
The Keepers Registry,
product of Jisc-funded
PEPRS Project
(EDINA & the ISSN IC)
METADATA
on preservation action
(a)
METADATA
on extant e-journals
ISSN Register at heart
of the Data Model;
ISSN-L as kernel field
ISSN
Register
(Taken from Figure 1 in reference paper in Serials, March 2009)
Digital Preservation
Agencies
e.g. CLOCKSS, Portico; BL, KB;
UK LOCKSS Alliance etc.
8. How can we know who is looking after what & how?
(and uncover what is still at risk)
SERVICES: user
requirements
E-J Preservation Registry Service
Data
dependency
E-Journal
Preservation
Registry
(b)
The Keepers Registry,
product of Jisc-funded
PEPRS Project
(EDINA & the ISSN IC)
METADATA
on preservation action
(a)
Look forward to
ISNI for publisher
as kernel field
ISSN Register at heart
of the Data Model;
ISSN-L as kernel field
METADATA
on extant e-journals
ISSN
Register
(Taken from Figure 1 in reference paper in Serials, March 2009)
Digital Preservation
Agencies
e.g. CLOCKSS, Portico; BL, KB;
UK LOCKSS Alliance etc.
9. Many archiving organisations is a Good Thing
“Digital information is best preserved by replicating it at
multiple archives run by autonomous organizations”
B. Cooper and H. Garcia-Molina (2002)
10. Now have a global Registry of e-journal archiving
… to discover who is looking after what
Enter title
or ISSN
to search across metadata
reported by leading
archiving organisations
*news*
Library of Congress has now joined the Keepers Registry
[& have high hopes for some others …]
11. … and discover details of its „archival status‟
This e-journal is being archived
by 5 archiving agencies …
… but coverage
of volumes is
partial & patchy
Example search: „Origins of Life’
11
12. Overview: Time for Part 2
2. Dr Who and the Scholarly Record
(Time Travel for Scholarly Web)
• ‘Reference Rot’: When what was referenced and cited
ceases to say the same thing, or ‘has ceased to be’
http://www.snorgtees.com/this-parrot-has-ceased-to-be
13. The „reference rot‟ problem definition
Investigating Reference Rot in Web-Based Scholarly Communication
1. http:// link to a resource no longer works
•
Link rot
2. The citation is inadequate
•
Not robust over time
3. The content referenced at the end of the link
a) has evolved,
b) changed dramatically,
c) disappeared completely.
http://hiberlink.org #hiberlink
14. Hiberlink Project: Andrew W. Mellon Foundation
Investigating Reference Rot in Web-Based Scholarly Communication
Partners
• Los Alamos National Laboratory: Research Library
•
Martin Klein, Robert Sanderson, Herbert Van de Sompel
• University of Edinburgh: EDINA&Language Technology Group
•
Peter Burnhill, Neil Mayo, Muriel Mewissen, Christine Rees, Tim Stickland,
Richard Wincewicz&Beatrice Alex, Claire Grover, Richard Tobin, Ke „Adam‟ Zhou
Acknowledgments
• Primary datasets: arXiv, Chesapeake Project, Elsevier, PubMed Central,
PLoS, … Planning on large-scale investigation (looking for more …)
• Secondary datasets: Ex Libris, MS Academic, SerialsSolutions
• Technology support: CrossRef Labs, CrossRef Prospect, Elsevier
• Liaisons: archive.is, CrossRef, Internet Archive, Old Dominion University
Web Science & Digital Library Research Group, perma.cc
http://hiberlink.org #hiberlink
15. Hiberlink Project: Four work packages
Investigating Reference Rot in Web-Based Scholarly Communication
1. Problem Quantification. text mining of vast corpus of
scholarly literature to uncover references to web resource (URIs);
using Memento; determine availability on live web and in archives.
2. Archival Solution Infrastructure. Prototyping proactive, web-centric archiving approaches mechanisms for archiving
cited web resources at the point of use or publication.
3. Temporal Reference Solutions. Prototyping new methods of
citation to enable creation of precise & actionable time-specific
references.
4. Dissemination and Outreach. Raising awareness of the
challenges at the heart of digital scholarly communication.
http://hiberlink.org #hiberlink
16. Investigating Reference Rot in Web-Based Scholarly Communication
References in Web-Based Scholarly
Communication
References to other online
scholarly works
Link Rot
DOI, HTTP version of DOI
Content
Decay
References to online
resources on the „wider Web‟
Fixity of content
Archiving: CLoCKSS,
LoCKSS, Portico…
(Keepers Registry)
This is becoming
understood but issues, see
This is unexplored, so
to be Hiberlink focus
David Rosenthal blog post http://blog.dshr.org/2013/11/patio-perspectives-at-anadp-ii.html
17. Articles Increasingly link to online resources
on the „wider Web‟
URIs extracted from PubMed papers – links to Web at Large resources
18. Quantifying the extent of „Reference Rot‟ – Early Results
Using: PubMed Central Corpus 01/1997 - 12/2012
•
•
•
•
Articles processed:
Articles that contain links (URIs) to „Web at Large‟ :
Number of references to „Web at Large‟ URIs:
Unique referenced Web at Large URIs:
494,785
176,527
557,432
327,782
Percentage Exists & Archived Referenced URIs
31.2%
Exists & Archived
!Exists & Archived
Exists & !Archived
!Exists & !Archived
16.8%
11.3%
40.7%
31%
11%
41%
17%
are available & safe
can be retrieved
at risk
are lost
19. Thoughts on How to Address Content Decay
Who is not selling defective goods?
• Remedy: Pro-active approach to trigger web
archiving when web-based content is referenced
in scholarly work:
– By authors
• during note taking, authoring, when submitting
– By publication platforms
• During submission, editing, acceptance, issue
20. + Tool with Temporal Context for Links
• Memento for Chrome is an application that uses Original URI-R and
dates to access Mementos in various web archives
Memento Time Travel for Chrome
http://bit.ly/memento-for-chrome
21. BackTo The Overview - Part 3
E-journals should be easy
– right?
… but is the e-journals
problem is being solved?
3. Evidence from the
Keepers Registry
Statistics on who is looking
after what, & what is at risk
22. 3. Evidence from the Keepers Registry
a)
21,557 e-serial titles are reported as being
ingested by the 10 Keepers
– organisations with archival intent
– with many „missing volumes and issues‟
b) 113,092 ISSN assigned to „online serials‟ in the
ISSN Register
Progress with a key indicator: ratio of a/b = 19%
– was 17% at close of 2011 (16,558 / 97,563)
Progress, but far from „job done‟
23. Do we need to agree a „priority list‟ of titles?
1. Should we only be interested in the c.30,000 „peer-reviewed‟
scholarly journals? [Ulrich‟s]
2. Do we look only at on what individual libraries list?
– In 2012 we checked „archival status‟ for 3 large university libraries
c.75%
„at risk‟
c.11%
held by
3 or more
• Two key indicators: %age (& number) of titles that are „at risk of loss‟
%age (& number) titles that are ‘preserved by 3 or more Keepers’.
1. Should we ask the audience?
•
The researchers and students who read online serials
24. Looking from the user‟s point of view …
… with usage logs for the UK OpenURL Router
• 10.4m full text requests in 2012; ISSN-L to de-duplicate ISSN
• 53,311 online titles requested by researchers & student from 108/160+
Analysis using the Keepers Registry:
• Only 15% (7,862) are being kept by 3+ Keepers
• Over two thirds (68%) held by none
36,326 titles „at risk‟ of loss
So „preservation‟ (or lack of it) is still a real
and present problem!
25. Good News & Main Challenge?
Good news?
• Most of the big publishers engage with archiving initiatives
– typically CLOCKSS, e-Depot and Portico.
• Are those titles, volumes & issues actually being archived?
Main challenge?
• Long tail of smaller publishers - regardless of business model.
• Everyone in the audience should check whether they are
participating in at least one preservation approach?
• Role of Agents, who arrange subscriptions with those
small publishers?
– Or only role of national libraries & research library consortia?
26. Choice of future with 2020 Vision
•
Best Case scenario for ASA 2020
– Libraries, Agents & Publishers have acted to reduce that
alarming 80% figure to near to zero
– They have ensured that all the e-journal content used by
their researchers in 2013 has been preserved and can
be successfully used in 2020, and assuredly beyond.
•
Worst Case scenario for ASA 2020
– Libraries, Agents & Publishers have failed to act
– Important literature has been lost
– Citizens & scholars complain of neglect!
27. The Keepers Registry: Actionable Evidence
Sidebar note on monitoring their progress …
1. To assist publishers „do the right thing‟
–
A showcase for the real heroes: the archiving organisations
–
Means to check what content is being reported as archived
–
Provide libraries, publishers & archiving organisations with lists of
titles that seem to be at risk of loss
2. To keep a close focusBreaking News:
on volumes & issues
Need New release (end of Q12014) Members Area:
for Publishers & Libraries to make sure all issued content is
being kept safe
Upload a list of ISSNs& get back archival status of Titles
3. To assist collaboration between Keepers: „a safe places network
Access to API, to report archival status on 3rd Party websites
–
4. If it is worth preserving, it really should have an identifier
28. Gentle Wake-up Call to Ensure Continuity of Access
‘Go Smell The Coffee’
#hiberlink
http://thekeepers.blogs.edina.ac.uk/
http://thekeepers.orghttp://hiberlink.org/
29. Ask a librarian in 2020: 3 possible answers
1. "Yes, we have it (we've checked recently, both in the
catalogue and in actuality), and you can access it now"
2. "No, but we know some body that does (we trust),
– so we can point you to (or arrange access to) it now/soon-ish"
3. "Sorry, we don't know …
- perhaps nobody has it
- it may be lost forever, altho' perhaps somebody somewhere ...”
- That was true for the print world
- Unfortunately, unless we do something now, the 3rd answer
could become the common one for a lot of e-journal content
30. Sidebar note on National Libraries
Should we wait upon Legal Deposit?
– 94% of libraries have some form of legal deposit for print.
• Only 44% national libraries had legislation in 2011 for e-books or
e-journals; expected to rise to 58% by June 2012.
from presentation, CENL 2011 Survey by Lynne Brindley
to CDNL Annual Meeting Puerto Rico, 15/8/11
• Only 27%[expected to rise to 37% by June 2012] actually ingesting via legal
deposit
Total national libraries collecting = those 14 via legal deposit
+ 9 by other means (Netherlands, UK/BL, Switzerland voluntary deposit)
Only KB e-Depot, BL, NSLC (+ LoC) in The Keepers Registry
Only when the other 19 join will all know about their activity
Key point is not about call for „legal deposit‟ but that on its
own it is taking too much time
E-journals, and online serials more generally, are a big part of the scholarly record – if we use the distribution of assigned ISSN as a guide, then we have some measure of just how international is the problem space:- centering the map on Singapore, Asia and the Pacific for a change yes a lot is published in US, UK, Netherlands and Germany – but over 60% is not – and that is an underestimate because so many online serials in countries in the centre of this map do not have ISSN assigned – they remain hidden to our arithmetic.
by considering 3 answers that might be given to a user asking after a particular e-journal content – typically an article