SlideShare a Scribd company logo
1 of 58
Reference Rot: Threat and Remedy
UKSG15
30 March - 1 April 2015
Funded by the Andrew W. Mellon Foundation
Peter Burnhill & Richard Wincewicz
EDINA, University of Edinburgh
for the Hiberlink Team at University of Edinburgh & LANL Research Library
The Project Team
2013 – 2015, funded by the
Andrew W. Mellon Foundation
• Los Alamos National Laboratory:
Research Library: Herbert Van de Sompel
Harihar Shankar, [Martin Klein, Rob Sanderson],
• University of Edinburgh:
Language Technology Group: Claire Grover,
Beatrice Alex, Colin Matheson, Richard Tobin, [Ke “Adam” Zhou]
EDINA * : Peter Burnhill, Muriel Mewissen (Project Manager),
Neil Mayo, Tim Stickland, Richard Wincewicz,
Centre for Service Delivery & Digital Expertise
Funded by the Andrew W. Mellon Foundation
UKSG15
30 March - 1 April 2015
… acts as part of the Jisc Family
edina.ac.uk
hiberlink.org
Overview
1. Introduction / Threat
2. Analysis
3. Large-scale Evidence
4. Devising Remedy
5. Summary
Tweet to #UKSG15
When what was referenced & cited
ceases to say the same thing, or ‘has ceased to be’
http://www.snorgtees.com/this-parrot-has-ceased-to-be
1. The Threat of Reference Rot
“when links to web resources
no longer point to what they once did”
Reference Rot = Link Rot + Content Drift
Link Rot
‘Link Rot’
+ Content Drift: What is at end of URI has changed, or gone!
http://dl00.org
2000
http://dl00.org
2004
http://dl00.org
2005
http://dl00.org
2008
(a) Dynamic content
as values on webpage
changes over time
(b) Static content
but very different (often
unrelated) web pages
2. Analysis
Tweets on #hiberlink to
#UKSG15
Take a landmark publication, 10+ years ago
Few of those references to the Web now work as intendedA re-direct [from RLG to OCLC] but ‘content drift’
Few of those references to the Web now work as intendedA re-direct [from RLG to OCLC] but ‘content drift’
Fail !!
Reference no longer works: ‘link rot’
Fail !!
Reference no longer works: ‘link rot’
Fail !!
A re-direct but content not found
A re-direct but content not found
Fail !!
Successful link: URI worke as expected  in December 2014
But sadly, now does not 
Fail !!
Successful link: URI works as expected 
Classic link rot: ‘Page Not Found’
Fail !!
reference to the Web is to an e-journal that is still current
Classic link rot: ‘Page Not Found’
Fail !!
URI works but content drift: reference is not as intended
Fail !!
=> Content of Citations Rot over Time!!
… meaning rotten references for the reader
… in what is then a rotten article!
… & sale of rotten goods & undermining the
integrity of the scholarly record
3. Large-scale Evidence
Tweets on #hiberlink to
#UKSG15
Hiberlink Project Methodology
to discover answer to a 2-part question
Do references to web-based content (URIs) work?
• Focus on content on ‘the wild Web’
• not that which is in e-journals etc
i. Impact of Time: Is the URI still on the ‘Live Web’’?
• Allowed up to a maximum of 50 redirects
ii. Is a ‘Memento’ of that content in the ‘Archived Web’?
Memento: a prior version, what the Original Resource was like at some time in the past.
3. Large-scale Empirical Evidence
c. 400,000 articles across the three corpora (Row #5 in Table 2)
contained over a million web at large references (Row #4 in Table 3)
Klein M, Van de Sompel H, Sanderson R, Shankar H, Balakireva L, et al. (2014) Scholarly Context Not Found: One
in Five Articles Suffers from Reference Rot. PLoS ONE 9(12): e115253. doi:10.1371/journal.pone.0115253
http://127.0.0.1:8081/plosone/article?id=info:doi/10.1371/journal.pone.0115253
A Key Aspect of Hiberlink Project Methodology
1. Convert Scholarly Statement from PDF into XML
2. Locate the references & extract each and every URL
• Many technical challenges
• URL broken/newline; underscore as image
• Use up to 15 regular expression for matching; regard as URI
University of Edinburgh Language Technology Group:
Beatrice Alex, Claire Grover, Colin Matheson, Richard Tobin, Ke Zhou
Scholarly Articles [in PMC] increasingly link to
Web Resources, not just back to other Articles
Scholarly Articles [in Elsevier] increasingly link to
Web Resources, not just back to other Articles
Mementos for URIs archived within 14 days of being referenced
PMC corpus
Klein M, Van de Sompel H, Sanderson R, Shankar H, Balakireva L, et al. (2014) Scholarly Context Not Found: One
in Five Articles Suffers from Reference Rot. PLoS ONE 9(12): e115253. doi:10.1371/journal.pone.0115253
http://127.0.0.1:8081/plosone/article?id=info:doi/10.1371/journal.pone.0115253
6 publicly accessible web archives for lookup: Internet Archive, archive.is (archive.today),
Archive-It, BL Web Archive, UK National Archives Web Archive & Icelandic National Archive
Klein M, Van de Sompel H, Sanderson R, Shankar H, Balakireva L, et al. (2014) Scholarly Context Not Found: One
in Five Articles Suffers from Reference Rot. PLoS ONE 9(12): e115253. doi:10.1371/journal.pone.0115253
http://127.0.0.1:8081/plosone/article?id=info:doi/10.1371/journal.pone.0115253
Mementos for URIs archived within 14 days of being referenced
Elsevier corpus
6 publicly accessible web archives for lookup: Internet Archive, archive.is (archive.today),
Archive-It, BL Web Archive, UK National Archives Web Archive & Icelandic National Archive
4. Devising Remedy for Reference Rot
Tweets on #hiberlink to
#UKSG15
The Remedy Is Quick Freeze & Archive
3 workflows in scholarly statement
①Preparation -> Study - > Compose -> Submission
②Publication -> Editing -> (Revision) -> Acceptance -> Issue
③Post-Publication-> Deposit/Ingest -> Reader Access -> Use
To identify the best opportunities for Intervention to make Remedy,
to ‘flash-freeze’, either to avoid reference rot or to ‘stop the rot’
Identify the Actors & how to assist them do the right thing!
Ideally at the earliest moment of capture
… when the Authors are trawling for content
… for what an Author regards as significant
… or needs to provide as evidence
… re-factoring the HTML link that is returned
• http://www.newyorker.com/magazine/2015/01/26/cobweb
• Archive timestamp: 2015-02-19T09:46:36
• http://web.archive.org/web/20150219094636/http://www.n
ewyorker.com/magazine/2015/01/26/cobweb
Hiberlink Remedy: Components in a Robust Link
b) Augment Link with Datetime and Archive URI
a) Take simple URI - to article in New Yorker magazine (say)
Hiberlink.org
What Robust Hiberlinks look like
• Hiberlinks are modified <a> HTML elements
• Include archive URL and timestamp as
additional attributes
<a
href=“http://www.newyorker.com/magazine/2015/01/26/cobweb”
data-
versionurl=“http://web.archive.org/web/20150219094636/http://w
ww.newyorker.com/magazine/2015/01/26/cobweb”
data-versiondate=“2015-02-19T09:46:36”>Cobweb Article</a>
Help authors do the right thing:
① Triggering archiving of referenced web content
when it is noted, using a reference manager
eg EndNote, Reference Manager, Zotero
– Hiberlink Plug-in developed for Zotero
② Returns Datetime URI for archived content that
can be used in the citation
Remedy To Avoid Reference Rot
https://www.zotero.org/
Zotero workflow
Create
reference
Add URL
Update
URL
Duplicate
reference
Pass URL to
archive service
Receive
archive URL
Store data
in database
Add data to
reference
Using the Plugin in Zotero
Opportunity / Time for a Demo ?
So what should we expect of the Publisher?
Beyond the assurance that
the fish / references / articles
sold are not rotten
Help Publishers do the right thing
The next best opportunity for Quick Freeze
• to avoid reference rot & to ‘stop the rot’
① Study: Preparation -> (Review) -> Submission
② Publication: Editorial -> (Revision) -> Issue
③ Post-Publication: Deposit/Ingest -> Provide/Access -> Use
Actors:
①The Author
② The Editor / Publisher
③The Access Platform / Librarian /Archival Organisations
OJS plugin
1. Parses the document
• Converts .pdf to .html
• Extracts URIs
2. Archives the content for each reference
• The Author and Editor can choose which version is
used as the archival copy
3. Creates an HTML version of the document
• including a link to the archived version of each of the
references
Well Published References & Augmented Links
Post-Publication (& other bulk processing)
The last ‘best’ opportunity for Quick Freeze
• not to avoid reference rot but to ‘stop the rot’
① Study: Preparation -> (Review) -> Submission
• Should note & act for each URI, one by one
② Publication: Editorial -> (Revision) -> Issue
• (Probably) should examine each one by one
③Post-Publication: Deposit/Ingest
• Cannot hope to process one by one
Post-Publication (& other bulk processing)
The last ‘best’ opportunity for Quick Freeze
• not to avoid reference rot but to ‘stop the rot’
① Study: Preparation -> (Review) -> Submission
• Should note & act for each URI, one by one
② Publication: Editorial -> (Revision) -> Issue
• (Probably) should examine each one by one
③Post-Publication: Deposit/Ingest
• Cannot hope to process one by one
Actors:
①The Author
② The Editor / Publisher
③Access Platforms / Archival Organisations
/ Librarians
& each article contains many references
Recall Key Aspect of Hiberlink Methodology
1. Convert Scholarly Statement from PDF into XML
2. Locate the references & extract each and every URL
• Many technical challenges
• URL broken/newline; underscore as image
• Use up to 15 regular expression for matching; regard as URI
=> Edinburgh Parser [github.com/hiberlink]
University of Edinburgh Language Technology Group:
Beatrice Alex, Claire Grover, Colin Matheson, Richard Tobin, Ke Zhou
Time to Build Infrastructure:
HiberActive
Publishing
platform HiberActive
External archival
service
(e.g. Internet Archive)
• Asynchronous (returns Robust Link)
• Distributed (archived with different organisations)
• Lightweight (leveraging HTTP & what already exists)
5. Summary
Tweets on #hiberlink to
#UKSG15
Hiberlink Outcomes
1. Defined the Threat of Reference Rot
2. Quantified the extent and way in which it
exists & undermines the Scholarly Record
3. Pointed to potential & practical Remedy
Hiberlink Outcomes
1. Defined the Threat of Reference Rot
2. Quantified the extent and way in which it exists & undermines the
Scholarly Record
3. Pointed to potential & practical Remedy
As project comes to an end (June 2015) so we wish to:
• Tell the world about these achievements
• Engage with others
– to build infrastructure
– To prompt adoption (copying) of prototypes by 3rd
parties
• such as reference managers, editorial systems, publication
systems, archival systems
Thank you,
Questions welcome
http://hiberlink.org #hiberlink
Email: edina@ed.ac.uk
Still Time to Tweet to #UKSG15
Funded by the Andrew W. Mellon Foundation

More Related Content

What's hot

Is It Too Late to Ensure Continuity of Access to the Scholarly Record?
Is It Too Late to Ensure Continuity of Access to the Scholarly Record?Is It Too Late to Ensure Continuity of Access to the Scholarly Record?
Is It Too Late to Ensure Continuity of Access to the Scholarly Record?EDINA, University of Edinburgh
 
CLOCKSS: Time and Places for Community-Based Archiving
CLOCKSS: Time and Places for Community-Based ArchivingCLOCKSS: Time and Places for Community-Based Archiving
CLOCKSS: Time and Places for Community-Based ArchivingEDINA, University of Edinburgh
 
SUNCAT: the next steps for the UK’s national serials catalogue
SUNCAT: the next steps for the UK’s national serials catalogueSUNCAT: the next steps for the UK’s national serials catalogue
SUNCAT: the next steps for the UK’s national serials catalogueEDINA, University of Edinburgh
 
Research Data Mantra (Management Training) Online Course Launch
Research Data Mantra (Management Training) Online Course LaunchResearch Data Mantra (Management Training) Online Course Launch
Research Data Mantra (Management Training) Online Course LaunchEDINA, University of Edinburgh
 
CLOCKSS archive - Preserving our digital heritage - The community taking control
CLOCKSS archive - Preserving our digital heritage - The community taking controlCLOCKSS archive - Preserving our digital heritage - The community taking control
CLOCKSS archive - Preserving our digital heritage - The community taking controlEDINA, University of Edinburgh
 
Piloting an E-Journals Preservation Registry Service (PEPRS)
Piloting an E-Journals Preservation Registry Service (PEPRS)Piloting an E-Journals Preservation Registry Service (PEPRS)
Piloting an E-Journals Preservation Registry Service (PEPRS)EDINA, University of Edinburgh
 
UKSG Conference 2016 Breakout Session - Of Libraries and Labs: effecting user...
UKSG Conference 2016 Breakout Session - Of Libraries and Labs: effecting user...UKSG Conference 2016 Breakout Session - Of Libraries and Labs: effecting user...
UKSG Conference 2016 Breakout Session - Of Libraries and Labs: effecting user...UKSG: connecting the knowledge community
 
Stop Press: Libraries' Role in the Future of Publishing
Stop Press: Libraries' Role in the Future of PublishingStop Press: Libraries' Role in the Future of Publishing
Stop Press: Libraries' Role in the Future of PublishingDanny Kingsley
 
Doing data in the social sciences and humanities: links to and from published...
Doing data in the social sciences and humanities: links to and from published...Doing data in the social sciences and humanities: links to and from published...
Doing data in the social sciences and humanities: links to and from published...EDINA, University of Edinburgh
 
Relationship Building and Advocacy Across the Campus
Relationship Building and Advocacy Across the CampusRelationship Building and Advocacy Across the Campus
Relationship Building and Advocacy Across the CampusUCD Library
 
Altmetrics and Social Media: Publicising, Discovering, Engaging
Altmetrics and Social Media: Publicising, Discovering, EngagingAltmetrics and Social Media: Publicising, Discovering, Engaging
Altmetrics and Social Media: Publicising, Discovering, EngagingUCD Library
 
Making ‘Everything Available’ – Transforming the (online) services and experi...
Making ‘Everything Available’ – Transforming the (online) services and experi...Making ‘Everything Available’ – Transforming the (online) services and experi...
Making ‘Everything Available’ – Transforming the (online) services and experi...Torsten Reimer
 
Too Many Copies! The confusion between duplication and versioning
Too Many Copies! The confusion between duplication and versioningToo Many Copies! The confusion between duplication and versioning
Too Many Copies! The confusion between duplication and versioningEDINA, University of Edinburgh
 
Research Data Management at Edinburgh: Effecting Culture Change
Research Data Management at Edinburgh: Effecting Culture ChangeResearch Data Management at Edinburgh: Effecting Culture Change
Research Data Management at Edinburgh: Effecting Culture ChangeEDINA, University of Edinburgh
 
Exploring a world of networked information built from free-text metadata
Exploring a world of networked information built from free-text metadataExploring a world of networked information built from free-text metadata
Exploring a world of networked information built from free-text metadataShenghui Wang
 
UKSG Conference 2017 Breakout - Advancing the Research Paper of the Future: c...
UKSG Conference 2017 Breakout - Advancing the Research Paper of the Future: c...UKSG Conference 2017 Breakout - Advancing the Research Paper of the Future: c...
UKSG Conference 2017 Breakout - Advancing the Research Paper of the Future: c...UKSG: connecting the knowledge community
 

What's hot (20)

PEPRS: Recording The Extent Preserved
PEPRS: Recording The Extent PreservedPEPRS: Recording The Extent Preserved
PEPRS: Recording The Extent Preserved
 
UKLA Content Development
UKLA Content DevelopmentUKLA Content Development
UKLA Content Development
 
Is It Too Late to Ensure Continuity of Access to the Scholarly Record?
Is It Too Late to Ensure Continuity of Access to the Scholarly Record?Is It Too Late to Ensure Continuity of Access to the Scholarly Record?
Is It Too Late to Ensure Continuity of Access to the Scholarly Record?
 
Roles & Skills for RDM
Roles & Skills for RDMRoles & Skills for RDM
Roles & Skills for RDM
 
CLOCKSS: Time and Places for Community-Based Archiving
CLOCKSS: Time and Places for Community-Based ArchivingCLOCKSS: Time and Places for Community-Based Archiving
CLOCKSS: Time and Places for Community-Based Archiving
 
SUNCAT: the next steps for the UK’s national serials catalogue
SUNCAT: the next steps for the UK’s national serials catalogueSUNCAT: the next steps for the UK’s national serials catalogue
SUNCAT: the next steps for the UK’s national serials catalogue
 
Research Data Mantra (Management Training) Online Course Launch
Research Data Mantra (Management Training) Online Course LaunchResearch Data Mantra (Management Training) Online Course Launch
Research Data Mantra (Management Training) Online Course Launch
 
CLOCKSS archive - Preserving our digital heritage - The community taking control
CLOCKSS archive - Preserving our digital heritage - The community taking controlCLOCKSS archive - Preserving our digital heritage - The community taking control
CLOCKSS archive - Preserving our digital heritage - The community taking control
 
Piloting an E-Journals Preservation Registry Service (PEPRS)
Piloting an E-Journals Preservation Registry Service (PEPRS)Piloting an E-Journals Preservation Registry Service (PEPRS)
Piloting an E-Journals Preservation Registry Service (PEPRS)
 
UKSG Conference 2016 Breakout Session - Of Libraries and Labs: effecting user...
UKSG Conference 2016 Breakout Session - Of Libraries and Labs: effecting user...UKSG Conference 2016 Breakout Session - Of Libraries and Labs: effecting user...
UKSG Conference 2016 Breakout Session - Of Libraries and Labs: effecting user...
 
Stop Press: Libraries' Role in the Future of Publishing
Stop Press: Libraries' Role in the Future of PublishingStop Press: Libraries' Role in the Future of Publishing
Stop Press: Libraries' Role in the Future of Publishing
 
Doing data in the social sciences and humanities: links to and from published...
Doing data in the social sciences and humanities: links to and from published...Doing data in the social sciences and humanities: links to and from published...
Doing data in the social sciences and humanities: links to and from published...
 
Relationship Building and Advocacy Across the Campus
Relationship Building and Advocacy Across the CampusRelationship Building and Advocacy Across the Campus
Relationship Building and Advocacy Across the Campus
 
Altmetrics and Social Media: Publicising, Discovering, Engaging
Altmetrics and Social Media: Publicising, Discovering, EngagingAltmetrics and Social Media: Publicising, Discovering, Engaging
Altmetrics and Social Media: Publicising, Discovering, Engaging
 
Making ‘Everything Available’ – Transforming the (online) services and experi...
Making ‘Everything Available’ – Transforming the (online) services and experi...Making ‘Everything Available’ – Transforming the (online) services and experi...
Making ‘Everything Available’ – Transforming the (online) services and experi...
 
Too Many Copies! The confusion between duplication and versioning
Too Many Copies! The confusion between duplication and versioningToo Many Copies! The confusion between duplication and versioning
Too Many Copies! The confusion between duplication and versioning
 
Research Data Management at Edinburgh: Effecting Culture Change
Research Data Management at Edinburgh: Effecting Culture ChangeResearch Data Management at Edinburgh: Effecting Culture Change
Research Data Management at Edinburgh: Effecting Culture Change
 
Exploring a world of networked information built from free-text metadata
Exploring a world of networked information built from free-text metadataExploring a world of networked information built from free-text metadata
Exploring a world of networked information built from free-text metadata
 
UKSG Conference 2017 Breakout - Advancing the Research Paper of the Future: c...
UKSG Conference 2017 Breakout - Advancing the Research Paper of the Future: c...UKSG Conference 2017 Breakout - Advancing the Research Paper of the Future: c...
UKSG Conference 2017 Breakout - Advancing the Research Paper of the Future: c...
 
2015 NISO Forum: The Future of Library Resource Discovery
2015 NISO Forum: The Future of Library Resource Discovery2015 NISO Forum: The Future of Library Resource Discovery
2015 NISO Forum: The Future of Library Resource Discovery
 

Viewers also liked

Licence to Share: Research and Collaboration through Go-Geo! and ShareGeo
Licence to Share: Research and Collaboration through Go-Geo! and ShareGeoLicence to Share: Research and Collaboration through Go-Geo! and ShareGeo
Licence to Share: Research and Collaboration through Go-Geo! and ShareGeoEDINA, University of Edinburgh
 
Exploiting the value of Dublin Core through pragmatic development
Exploiting the value of Dublin Core through pragmatic developmentExploiting the value of Dublin Core through pragmatic development
Exploiting the value of Dublin Core through pragmatic developmentPaul Walk
 
Scottish Open Education Declaration
Scottish Open Education Declaration Scottish Open Education Declaration
Scottish Open Education Declaration Lorna Campbell
 
EPSRC Policy Compliance: What researchers need to know
EPSRC Policy Compliance: What researchers need to knowEPSRC Policy Compliance: What researchers need to know
EPSRC Policy Compliance: What researchers need to knowEDINA, University of Edinburgh
 
Shibboleth Access Management Federations as an Organisational Model for SDI
Shibboleth Access Management Federations as an Organisational Model for SDIShibboleth Access Management Federations as an Organisational Model for SDI
Shibboleth Access Management Federations as an Organisational Model for SDIEDINA, University of Edinburgh
 
Leeds University Geospatial Metadata Workshop 20110617
Leeds University Geospatial Metadata Workshop 20110617Leeds University Geospatial Metadata Workshop 20110617
Leeds University Geospatial Metadata Workshop 20110617EDINA, University of Edinburgh
 
SUNCAT: Transforming the service to create an open bridge from resource disco...
SUNCAT: Transforming the service to create an open bridge from resource disco...SUNCAT: Transforming the service to create an open bridge from resource disco...
SUNCAT: Transforming the service to create an open bridge from resource disco...EDINA, University of Edinburgh
 

Viewers also liked (20)

Licence to Share: Research and Collaboration through Go-Geo! and ShareGeo
Licence to Share: Research and Collaboration through Go-Geo! and ShareGeoLicence to Share: Research and Collaboration through Go-Geo! and ShareGeo
Licence to Share: Research and Collaboration through Go-Geo! and ShareGeo
 
Exploiting the value of Dublin Core through pragmatic development
Exploiting the value of Dublin Core through pragmatic developmentExploiting the value of Dublin Core through pragmatic development
Exploiting the value of Dublin Core through pragmatic development
 
IGIBS - BDB Research Forum, May 2011
IGIBS - BDB Research Forum, May 2011IGIBS - BDB Research Forum, May 2011
IGIBS - BDB Research Forum, May 2011
 
RDM Programme at University of Edinburgh
RDM Programme at University of EdinburghRDM Programme at University of Edinburgh
RDM Programme at University of Edinburgh
 
Scottish Open Education Declaration
Scottish Open Education Declaration Scottish Open Education Declaration
Scottish Open Education Declaration
 
EPSRC Policy Compliance: What researchers need to know
EPSRC Policy Compliance: What researchers need to knowEPSRC Policy Compliance: What researchers need to know
EPSRC Policy Compliance: What researchers need to know
 
SCURL and SUNCAT serials holdings comparison service
SCURL and SUNCAT serials holdings comparison serviceSCURL and SUNCAT serials holdings comparison service
SCURL and SUNCAT serials holdings comparison service
 
Access to Digital Back Copy
Access to Digital Back CopyAccess to Digital Back Copy
Access to Digital Back Copy
 
Delivering Postgraduate Training - MANTRA
Delivering Postgraduate Training - MANTRADelivering Postgraduate Training - MANTRA
Delivering Postgraduate Training - MANTRA
 
AddressingHistory - Tracing the Past
AddressingHistory - Tracing the PastAddressingHistory - Tracing the Past
AddressingHistory - Tracing the Past
 
Shibboleth Access Management Federations as an Organisational Model for SDI
Shibboleth Access Management Federations as an Organisational Model for SDIShibboleth Access Management Federations as an Organisational Model for SDI
Shibboleth Access Management Federations as an Organisational Model for SDI
 
Access Control in ESDIN: Shibboleth
Access Control in ESDIN: ShibbolethAccess Control in ESDIN: Shibboleth
Access Control in ESDIN: Shibboleth
 
Using a dumb identifier to do smart things
Using a dumb identifier to do smart thingsUsing a dumb identifier to do smart things
Using a dumb identifier to do smart things
 
Agile Data Access Initiative
Agile Data Access InitiativeAgile Data Access Initiative
Agile Data Access Initiative
 
Leeds University Geospatial Metadata Workshop 20110617
Leeds University Geospatial Metadata Workshop 20110617Leeds University Geospatial Metadata Workshop 20110617
Leeds University Geospatial Metadata Workshop 20110617
 
SUNCAT: Transforming the service to create an open bridge from resource disco...
SUNCAT: Transforming the service to create an open bridge from resource disco...SUNCAT: Transforming the service to create an open bridge from resource disco...
SUNCAT: Transforming the service to create an open bridge from resource disco...
 
An Introduction to 2011 Census Geography
An Introduction to 2011 Census GeographyAn Introduction to 2011 Census Geography
An Introduction to 2011 Census Geography
 
SafeNet: Progress and Data Gathering
SafeNet: Progress and Data GatheringSafeNet: Progress and Data Gathering
SafeNet: Progress and Data Gathering
 
Deep Impact: Metadata and SUNCAT
Deep Impact: Metadata and SUNCATDeep Impact: Metadata and SUNCAT
Deep Impact: Metadata and SUNCAT
 
Digimap Roam
Digimap RoamDigimap Roam
Digimap Roam
 

Similar to Reference Rot: Threat and Remedy

Web Today, Good Tomorrow? Transactional archiving of web content
Web Today, Good Tomorrow? Transactional archiving of web contentWeb Today, Good Tomorrow? Transactional archiving of web content
Web Today, Good Tomorrow? Transactional archiving of web contentPeter Burnhill
 
Prototypes of pro-active approaches to support the archiving of web reference...
Prototypes of pro-active approaches to support the archiving of web reference...Prototypes of pro-active approaches to support the archiving of web reference...
Prototypes of pro-active approaches to support the archiving of web reference...EDINA, University of Edinburgh
 
Hiberlink: Prototypes of pro-active approaches to support the archiving of we...
Hiberlink: Prototypes of pro-active approaches to support the archiving of we...Hiberlink: Prototypes of pro-active approaches to support the archiving of we...
Hiberlink: Prototypes of pro-active approaches to support the archiving of we...EDINA, University of Edinburgh
 
Copthall School Feb 2022
Copthall School Feb 2022Copthall School Feb 2022
Copthall School Feb 2022EISLibrarian
 
Reference Rot and Link Decoration
Reference Rot and Link DecorationReference Rot and Link Decoration
Reference Rot and Link DecorationMartin Klein
 
Where data and journal content collide: what does it mean to ‘publish your da...
Where data and journal content collide: what does it mean to ‘publish your da...Where data and journal content collide: what does it mean to ‘publish your da...
Where data and journal content collide: what does it mean to ‘publish your da...EDINA, University of Edinburgh
 
Data Citation: A Critical Role for Publishers
Data Citation: A Critical Role for PublishersData Citation: A Critical Role for Publishers
Data Citation: A Critical Role for PublishersBrian Hole
 
Stronger together: community initiatives in journal management
Stronger together: community initiatives in journal managementStronger together: community initiatives in journal management
Stronger together: community initiatives in journal managementJisc
 
HIBERLINK: Reference Rot and Linked Data: Threat and Remedy
HIBERLINK: Reference Rot and Linked Data: Threat and RemedyHIBERLINK: Reference Rot and Linked Data: Threat and Remedy
HIBERLINK: Reference Rot and Linked Data: Threat and RemedyPRELIDA Project
 
Workshop 3: Becoming a critical searcher
Workshop 3: Becoming a critical searcher Workshop 3: Becoming a critical searcher
Workshop 3: Becoming a critical searcher EISLibrarian
 
Research dissemination presentation
Research dissemination presentationResearch dissemination presentation
Research dissemination presentationJohn Turner
 
Actions to Ensure the Integrity and Continuity of the Scholarly Record
Actions to Ensure the Integrity and Continuity of the Scholarly Record Actions to Ensure the Integrity and Continuity of the Scholarly Record
Actions to Ensure the Integrity and Continuity of the Scholarly Record EDINA, University of Edinburgh
 
Advocating Open Access: Before, during and after HEFCE
Advocating Open Access: Before, during and after HEFCEAdvocating Open Access: Before, during and after HEFCE
Advocating Open Access: Before, during and after HEFCENick Sheppard
 
Open access for academics
Open access for academicsOpen access for academics
Open access for academicsDeborah Lupton
 
Hiberlink: Investigating Reference Rot, December 2013
Hiberlink: Investigating Reference Rot, December 2013Hiberlink: Investigating Reference Rot, December 2013
Hiberlink: Investigating Reference Rot, December 2013Herbert Van de Sompel
 

Similar to Reference Rot: Threat and Remedy (20)

Web Today, Good Tomorrow? Transactional archiving of web content
Web Today, Good Tomorrow? Transactional archiving of web contentWeb Today, Good Tomorrow? Transactional archiving of web content
Web Today, Good Tomorrow? Transactional archiving of web content
 
Prototypes of pro-active approaches to support the archiving of web reference...
Prototypes of pro-active approaches to support the archiving of web reference...Prototypes of pro-active approaches to support the archiving of web reference...
Prototypes of pro-active approaches to support the archiving of web reference...
 
Hiberlink: Prototypes of pro-active approaches to support the archiving of we...
Hiberlink: Prototypes of pro-active approaches to support the archiving of we...Hiberlink: Prototypes of pro-active approaches to support the archiving of we...
Hiberlink: Prototypes of pro-active approaches to support the archiving of we...
 
Reference Rot and E-Theses: Threat and Remedy
Reference Rot and E-Theses: Threat and RemedyReference Rot and E-Theses: Threat and Remedy
Reference Rot and E-Theses: Threat and Remedy
 
Copthall School Feb 2022
Copthall School Feb 2022Copthall School Feb 2022
Copthall School Feb 2022
 
Reference Rot and Link Decoration
Reference Rot and Link DecorationReference Rot and Link Decoration
Reference Rot and Link Decoration
 
Where data and journal content collide: what does it mean to ‘publish your da...
Where data and journal content collide: what does it mean to ‘publish your da...Where data and journal content collide: what does it mean to ‘publish your da...
Where data and journal content collide: what does it mean to ‘publish your da...
 
Data Citation: A Critical Role for Publishers
Data Citation: A Critical Role for PublishersData Citation: A Critical Role for Publishers
Data Citation: A Critical Role for Publishers
 
Stronger together: community initiatives in journal management
Stronger together: community initiatives in journal managementStronger together: community initiatives in journal management
Stronger together: community initiatives in journal management
 
HIBERLINK: Reference Rot and Linked Data: Threat and Remedy
HIBERLINK: Reference Rot and Linked Data: Threat and RemedyHIBERLINK: Reference Rot and Linked Data: Threat and Remedy
HIBERLINK: Reference Rot and Linked Data: Threat and Remedy
 
Workshop 3: Becoming a critical searcher
Workshop 3: Becoming a critical searcher Workshop 3: Becoming a critical searcher
Workshop 3: Becoming a critical searcher
 
Research dissemination presentation
Research dissemination presentationResearch dissemination presentation
Research dissemination presentation
 
Actions to Ensure the Integrity and Continuity of the Scholarly Record
Actions to Ensure the Integrity and Continuity of the Scholarly Record Actions to Ensure the Integrity and Continuity of the Scholarly Record
Actions to Ensure the Integrity and Continuity of the Scholarly Record
 
Advocating Open Access: Before, during and after HEFCE
Advocating Open Access: Before, during and after HEFCEAdvocating Open Access: Before, during and after HEFCE
Advocating Open Access: Before, during and after HEFCE
 
SES1501 Oct 2021
SES1501 Oct 2021SES1501 Oct 2021
SES1501 Oct 2021
 
CST4599 Nov 2021
CST4599 Nov 2021CST4599 Nov 2021
CST4599 Nov 2021
 
Envs100
Envs100Envs100
Envs100
 
Envs100
Envs100Envs100
Envs100
 
Open access for academics
Open access for academicsOpen access for academics
Open access for academics
 
Hiberlink: Investigating Reference Rot, December 2013
Hiberlink: Investigating Reference Rot, December 2013Hiberlink: Investigating Reference Rot, December 2013
Hiberlink: Investigating Reference Rot, December 2013
 

More from EDINA, University of Edinburgh

We have the technology... We have the data... What next?
We have the technology... We have the data... What next?We have the technology... We have the data... What next?
We have the technology... We have the data... What next?EDINA, University of Edinburgh
 
Reference Rot in Theses: A HiberActive Pilot - 10x10 session for Repository F...
Reference Rot in Theses: A HiberActive Pilot - 10x10 session for Repository F...Reference Rot in Theses: A HiberActive Pilot - 10x10 session for Repository F...
Reference Rot in Theses: A HiberActive Pilot - 10x10 session for Repository F...EDINA, University of Edinburgh
 
If I Googled You, What Would I Find? Managing your digital footprint - Nicola...
If I Googled You, What Would I Find? Managing your digital footprint - Nicola...If I Googled You, What Would I Find? Managing your digital footprint - Nicola...
If I Googled You, What Would I Find? Managing your digital footprint - Nicola...EDINA, University of Edinburgh
 
Managing your Digital Footprint : Taking control of the metadata and tracks a...
Managing your Digital Footprint : Taking control of the metadata and tracks a...Managing your Digital Footprint : Taking control of the metadata and tracks a...
Managing your Digital Footprint : Taking control of the metadata and tracks a...EDINA, University of Edinburgh
 
Social media and blogging to develop and communicate research in the arts and...
Social media and blogging to develop and communicate research in the arts and...Social media and blogging to develop and communicate research in the arts and...
Social media and blogging to develop and communicate research in the arts and...EDINA, University of Edinburgh
 
Enhancing your research impact through social media - Nicola Osborne
Enhancing your research impact through social media - Nicola OsborneEnhancing your research impact through social media - Nicola Osborne
Enhancing your research impact through social media - Nicola OsborneEDINA, University of Edinburgh
 
Social Media in Marketing in Support of Your Personal Brand - Nicola Osborne
Social Media in Marketing in Support of Your Personal Brand - Nicola OsborneSocial Media in Marketing in Support of Your Personal Brand - Nicola Osborne
Social Media in Marketing in Support of Your Personal Brand - Nicola OsborneEDINA, University of Edinburgh
 
Best Practice for Social Media in Teaching & Learning Contexts - Nicola Osborne
Best Practice for Social Media in Teaching & Learning Contexts - Nicola OsborneBest Practice for Social Media in Teaching & Learning Contexts - Nicola Osborne
Best Practice for Social Media in Teaching & Learning Contexts - Nicola OsborneEDINA, University of Edinburgh
 
Introduction to Edinburgh University Data Library and national data services
Introduction to Edinburgh University Data Library and national data servicesIntroduction to Edinburgh University Data Library and national data services
Introduction to Edinburgh University Data Library and national data servicesEDINA, University of Edinburgh
 
Digimap for Schools: Introduction to an ICT based cross curricular resource f...
Digimap for Schools: Introduction to an ICT based cross curricular resource f...Digimap for Schools: Introduction to an ICT based cross curricular resource f...
Digimap for Schools: Introduction to an ICT based cross curricular resource f...EDINA, University of Edinburgh
 

More from EDINA, University of Edinburgh (20)

The Making of the English Landscape:
The Making of the English Landscape: The Making of the English Landscape:
The Making of the English Landscape:
 
Spatial Data, Spatial Humanities
Spatial Data, Spatial HumanitiesSpatial Data, Spatial Humanities
Spatial Data, Spatial Humanities
 
Land Cover Map 2015
Land Cover Map 2015Land Cover Map 2015
Land Cover Map 2015
 
We have the technology... We have the data... What next?
We have the technology... We have the data... What next?We have the technology... We have the data... What next?
We have the technology... We have the data... What next?
 
Reference Rot in Theses: A HiberActive Pilot - 10x10 session for Repository F...
Reference Rot in Theses: A HiberActive Pilot - 10x10 session for Repository F...Reference Rot in Theses: A HiberActive Pilot - 10x10 session for Repository F...
Reference Rot in Theses: A HiberActive Pilot - 10x10 session for Repository F...
 
GeoForum EDINA report 2017
GeoForum EDINA report 2017GeoForum EDINA report 2017
GeoForum EDINA report 2017
 
If I Googled You, What Would I Find? Managing your digital footprint - Nicola...
If I Googled You, What Would I Find? Managing your digital footprint - Nicola...If I Googled You, What Would I Find? Managing your digital footprint - Nicola...
If I Googled You, What Would I Find? Managing your digital footprint - Nicola...
 
Moray housemarch2017
Moray housemarch2017Moray housemarch2017
Moray housemarch2017
 
Uniof stirlingmarch2017secondary
Uniof stirlingmarch2017secondaryUniof stirlingmarch2017secondary
Uniof stirlingmarch2017secondary
 
Uniof glasgow jan2017_secondary
Uniof glasgow jan2017_secondaryUniof glasgow jan2017_secondary
Uniof glasgow jan2017_secondary
 
Managing your Digital Footprint : Taking control of the metadata and tracks a...
Managing your Digital Footprint : Taking control of the metadata and tracks a...Managing your Digital Footprint : Taking control of the metadata and tracks a...
Managing your Digital Footprint : Taking control of the metadata and tracks a...
 
Social media and blogging to develop and communicate research in the arts and...
Social media and blogging to develop and communicate research in the arts and...Social media and blogging to develop and communicate research in the arts and...
Social media and blogging to develop and communicate research in the arts and...
 
Enhancing your research impact through social media - Nicola Osborne
Enhancing your research impact through social media - Nicola OsborneEnhancing your research impact through social media - Nicola Osborne
Enhancing your research impact through social media - Nicola Osborne
 
Social Media in Marketing in Support of Your Personal Brand - Nicola Osborne
Social Media in Marketing in Support of Your Personal Brand - Nicola OsborneSocial Media in Marketing in Support of Your Personal Brand - Nicola Osborne
Social Media in Marketing in Support of Your Personal Brand - Nicola Osborne
 
Best Practice for Social Media in Teaching & Learning Contexts - Nicola Osborne
Best Practice for Social Media in Teaching & Learning Contexts - Nicola OsborneBest Practice for Social Media in Teaching & Learning Contexts - Nicola Osborne
Best Practice for Social Media in Teaching & Learning Contexts - Nicola Osborne
 
Big data in Digimap
Big data in DigimapBig data in Digimap
Big data in Digimap
 
Introduction to Edinburgh University Data Library and national data services
Introduction to Edinburgh University Data Library and national data servicesIntroduction to Edinburgh University Data Library and national data services
Introduction to Edinburgh University Data Library and national data services
 
Digimap for Schools: Introduction to an ICT based cross curricular resource f...
Digimap for Schools: Introduction to an ICT based cross curricular resource f...Digimap for Schools: Introduction to an ICT based cross curricular resource f...
Digimap for Schools: Introduction to an ICT based cross curricular resource f...
 
Digimap Update - Geoforum 2016 - Guy McGarva
Digimap Update - Geoforum 2016 - Guy McGarvaDigimap Update - Geoforum 2016 - Guy McGarva
Digimap Update - Geoforum 2016 - Guy McGarva
 
Data, Data, Data - Geoforum 2016 - Phil Bartie
Data, Data, Data - Geoforum 2016 - Phil BartieData, Data, Data - Geoforum 2016 - Phil Bartie
Data, Data, Data - Geoforum 2016 - Phil Bartie
 

Recently uploaded

Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Educationpboyjonauth
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)eniolaolutunde
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfsanyamsingh5019
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdfSoniaTolstoy
 
How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17Celine George
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformChameera Dedduwage
 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxOH TEIK BIN
 
Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxthorishapillay1
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxiammrhaywood
 
Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Celine George
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxpboyjonauth
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxGaneshChakor2
 
internship ppt on smartinternz platform as salesforce developer
internship ppt on smartinternz platform as salesforce developerinternship ppt on smartinternz platform as salesforce developer
internship ppt on smartinternz platform as salesforce developerunnathinaik
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Sapana Sha
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Krashi Coaching
 
History Class XII Ch. 3 Kinship, Caste and Class (1).pptx
History Class XII Ch. 3 Kinship, Caste and Class (1).pptxHistory Class XII Ch. 3 Kinship, Caste and Class (1).pptx
History Class XII Ch. 3 Kinship, Caste and Class (1).pptxsocialsciencegdgrohi
 
Presiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsPresiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsanshu789521
 
Painted Grey Ware.pptx, PGW Culture of India
Painted Grey Ware.pptx, PGW Culture of IndiaPainted Grey Ware.pptx, PGW Culture of India
Painted Grey Ware.pptx, PGW Culture of IndiaVirag Sontakke
 

Recently uploaded (20)

Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Education
 
Staff of Color (SOC) Retention Efforts DDSD
Staff of Color (SOC) Retention Efforts DDSDStaff of Color (SOC) Retention Efforts DDSD
Staff of Color (SOC) Retention Efforts DDSD
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
Model Call Girl in Bikash Puri Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Bikash Puri  Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Bikash Puri  Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Bikash Puri Delhi reach out to us at 🔝9953056974🔝
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdf
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
 
How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy Reform
 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptx
 
Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptx
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptx
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptx
 
internship ppt on smartinternz platform as salesforce developer
internship ppt on smartinternz platform as salesforce developerinternship ppt on smartinternz platform as salesforce developer
internship ppt on smartinternz platform as salesforce developer
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
 
History Class XII Ch. 3 Kinship, Caste and Class (1).pptx
History Class XII Ch. 3 Kinship, Caste and Class (1).pptxHistory Class XII Ch. 3 Kinship, Caste and Class (1).pptx
History Class XII Ch. 3 Kinship, Caste and Class (1).pptx
 
Presiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsPresiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha elections
 
Painted Grey Ware.pptx, PGW Culture of India
Painted Grey Ware.pptx, PGW Culture of IndiaPainted Grey Ware.pptx, PGW Culture of India
Painted Grey Ware.pptx, PGW Culture of India
 

Reference Rot: Threat and Remedy

  • 1. Reference Rot: Threat and Remedy UKSG15 30 March - 1 April 2015 Funded by the Andrew W. Mellon Foundation Peter Burnhill & Richard Wincewicz EDINA, University of Edinburgh for the Hiberlink Team at University of Edinburgh & LANL Research Library
  • 2. The Project Team 2013 – 2015, funded by the Andrew W. Mellon Foundation • Los Alamos National Laboratory: Research Library: Herbert Van de Sompel Harihar Shankar, [Martin Klein, Rob Sanderson], • University of Edinburgh: Language Technology Group: Claire Grover, Beatrice Alex, Colin Matheson, Richard Tobin, [Ke “Adam” Zhou] EDINA * : Peter Burnhill, Muriel Mewissen (Project Manager), Neil Mayo, Tim Stickland, Richard Wincewicz, Centre for Service Delivery & Digital Expertise Funded by the Andrew W. Mellon Foundation UKSG15 30 March - 1 April 2015
  • 3. … acts as part of the Jisc Family edina.ac.uk
  • 4. hiberlink.org Overview 1. Introduction / Threat 2. Analysis 3. Large-scale Evidence 4. Devising Remedy 5. Summary Tweet to #UKSG15
  • 5. When what was referenced & cited ceases to say the same thing, or ‘has ceased to be’ http://www.snorgtees.com/this-parrot-has-ceased-to-be 1. The Threat of Reference Rot “when links to web resources no longer point to what they once did” Reference Rot = Link Rot + Content Drift
  • 7. + Content Drift: What is at end of URI has changed, or gone! http://dl00.org 2000 http://dl00.org 2004 http://dl00.org 2005 http://dl00.org 2008 (a) Dynamic content as values on webpage changes over time (b) Static content but very different (often unrelated) web pages
  • 8. 2. Analysis Tweets on #hiberlink to #UKSG15
  • 9. Take a landmark publication, 10+ years ago
  • 10. Few of those references to the Web now work as intendedA re-direct [from RLG to OCLC] but ‘content drift’
  • 11. Few of those references to the Web now work as intendedA re-direct [from RLG to OCLC] but ‘content drift’ Fail !!
  • 12. Reference no longer works: ‘link rot’ Fail !!
  • 13. Reference no longer works: ‘link rot’ Fail !!
  • 14. A re-direct but content not found
  • 15. A re-direct but content not found Fail !!
  • 16. Successful link: URI worke as expected  in December 2014
  • 17. But sadly, now does not  Fail !!
  • 18. Successful link: URI works as expected 
  • 19. Classic link rot: ‘Page Not Found’ Fail !!
  • 20. reference to the Web is to an e-journal that is still current
  • 21. Classic link rot: ‘Page Not Found’ Fail !!
  • 22. URI works but content drift: reference is not as intended Fail !!
  • 23. => Content of Citations Rot over Time!!
  • 24. … meaning rotten references for the reader
  • 25. … in what is then a rotten article! … & sale of rotten goods & undermining the integrity of the scholarly record
  • 26. 3. Large-scale Evidence Tweets on #hiberlink to #UKSG15
  • 27. Hiberlink Project Methodology to discover answer to a 2-part question Do references to web-based content (URIs) work? • Focus on content on ‘the wild Web’ • not that which is in e-journals etc i. Impact of Time: Is the URI still on the ‘Live Web’’? • Allowed up to a maximum of 50 redirects ii. Is a ‘Memento’ of that content in the ‘Archived Web’? Memento: a prior version, what the Original Resource was like at some time in the past.
  • 28. 3. Large-scale Empirical Evidence c. 400,000 articles across the three corpora (Row #5 in Table 2) contained over a million web at large references (Row #4 in Table 3) Klein M, Van de Sompel H, Sanderson R, Shankar H, Balakireva L, et al. (2014) Scholarly Context Not Found: One in Five Articles Suffers from Reference Rot. PLoS ONE 9(12): e115253. doi:10.1371/journal.pone.0115253 http://127.0.0.1:8081/plosone/article?id=info:doi/10.1371/journal.pone.0115253
  • 29. A Key Aspect of Hiberlink Project Methodology 1. Convert Scholarly Statement from PDF into XML 2. Locate the references & extract each and every URL • Many technical challenges • URL broken/newline; underscore as image • Use up to 15 regular expression for matching; regard as URI University of Edinburgh Language Technology Group: Beatrice Alex, Claire Grover, Colin Matheson, Richard Tobin, Ke Zhou
  • 30. Scholarly Articles [in PMC] increasingly link to Web Resources, not just back to other Articles
  • 31. Scholarly Articles [in Elsevier] increasingly link to Web Resources, not just back to other Articles
  • 32. Mementos for URIs archived within 14 days of being referenced PMC corpus Klein M, Van de Sompel H, Sanderson R, Shankar H, Balakireva L, et al. (2014) Scholarly Context Not Found: One in Five Articles Suffers from Reference Rot. PLoS ONE 9(12): e115253. doi:10.1371/journal.pone.0115253 http://127.0.0.1:8081/plosone/article?id=info:doi/10.1371/journal.pone.0115253 6 publicly accessible web archives for lookup: Internet Archive, archive.is (archive.today), Archive-It, BL Web Archive, UK National Archives Web Archive & Icelandic National Archive
  • 33. Klein M, Van de Sompel H, Sanderson R, Shankar H, Balakireva L, et al. (2014) Scholarly Context Not Found: One in Five Articles Suffers from Reference Rot. PLoS ONE 9(12): e115253. doi:10.1371/journal.pone.0115253 http://127.0.0.1:8081/plosone/article?id=info:doi/10.1371/journal.pone.0115253 Mementos for URIs archived within 14 days of being referenced Elsevier corpus 6 publicly accessible web archives for lookup: Internet Archive, archive.is (archive.today), Archive-It, BL Web Archive, UK National Archives Web Archive & Icelandic National Archive
  • 34. 4. Devising Remedy for Reference Rot Tweets on #hiberlink to #UKSG15
  • 35. The Remedy Is Quick Freeze & Archive
  • 36. 3 workflows in scholarly statement ①Preparation -> Study - > Compose -> Submission ②Publication -> Editing -> (Revision) -> Acceptance -> Issue ③Post-Publication-> Deposit/Ingest -> Reader Access -> Use To identify the best opportunities for Intervention to make Remedy, to ‘flash-freeze’, either to avoid reference rot or to ‘stop the rot’ Identify the Actors & how to assist them do the right thing!
  • 37. Ideally at the earliest moment of capture
  • 38. … when the Authors are trawling for content
  • 39. … for what an Author regards as significant
  • 40. … or needs to provide as evidence
  • 41. … re-factoring the HTML link that is returned • http://www.newyorker.com/magazine/2015/01/26/cobweb • Archive timestamp: 2015-02-19T09:46:36 • http://web.archive.org/web/20150219094636/http://www.n ewyorker.com/magazine/2015/01/26/cobweb Hiberlink Remedy: Components in a Robust Link b) Augment Link with Datetime and Archive URI a) Take simple URI - to article in New Yorker magazine (say) Hiberlink.org
  • 42. What Robust Hiberlinks look like • Hiberlinks are modified <a> HTML elements • Include archive URL and timestamp as additional attributes <a href=“http://www.newyorker.com/magazine/2015/01/26/cobweb” data- versionurl=“http://web.archive.org/web/20150219094636/http://w ww.newyorker.com/magazine/2015/01/26/cobweb” data-versiondate=“2015-02-19T09:46:36”>Cobweb Article</a>
  • 43. Help authors do the right thing: ① Triggering archiving of referenced web content when it is noted, using a reference manager eg EndNote, Reference Manager, Zotero – Hiberlink Plug-in developed for Zotero ② Returns Datetime URI for archived content that can be used in the citation Remedy To Avoid Reference Rot https://www.zotero.org/
  • 44. Zotero workflow Create reference Add URL Update URL Duplicate reference Pass URL to archive service Receive archive URL Store data in database Add data to reference
  • 45. Using the Plugin in Zotero Opportunity / Time for a Demo ?
  • 46. So what should we expect of the Publisher? Beyond the assurance that the fish / references / articles sold are not rotten
  • 47. Help Publishers do the right thing The next best opportunity for Quick Freeze • to avoid reference rot & to ‘stop the rot’ ① Study: Preparation -> (Review) -> Submission ② Publication: Editorial -> (Revision) -> Issue ③ Post-Publication: Deposit/Ingest -> Provide/Access -> Use Actors: ①The Author ② The Editor / Publisher ③The Access Platform / Librarian /Archival Organisations
  • 48. OJS plugin 1. Parses the document • Converts .pdf to .html • Extracts URIs 2. Archives the content for each reference • The Author and Editor can choose which version is used as the archival copy 3. Creates an HTML version of the document • including a link to the archived version of each of the references
  • 49. Well Published References & Augmented Links
  • 50. Post-Publication (& other bulk processing) The last ‘best’ opportunity for Quick Freeze • not to avoid reference rot but to ‘stop the rot’ ① Study: Preparation -> (Review) -> Submission • Should note & act for each URI, one by one ② Publication: Editorial -> (Revision) -> Issue • (Probably) should examine each one by one ③Post-Publication: Deposit/Ingest • Cannot hope to process one by one
  • 51. Post-Publication (& other bulk processing) The last ‘best’ opportunity for Quick Freeze • not to avoid reference rot but to ‘stop the rot’ ① Study: Preparation -> (Review) -> Submission • Should note & act for each URI, one by one ② Publication: Editorial -> (Revision) -> Issue • (Probably) should examine each one by one ③Post-Publication: Deposit/Ingest • Cannot hope to process one by one Actors: ①The Author ② The Editor / Publisher ③Access Platforms / Archival Organisations / Librarians
  • 52. & each article contains many references
  • 53. Recall Key Aspect of Hiberlink Methodology 1. Convert Scholarly Statement from PDF into XML 2. Locate the references & extract each and every URL • Many technical challenges • URL broken/newline; underscore as image • Use up to 15 regular expression for matching; regard as URI => Edinburgh Parser [github.com/hiberlink] University of Edinburgh Language Technology Group: Beatrice Alex, Claire Grover, Colin Matheson, Richard Tobin, Ke Zhou
  • 54. Time to Build Infrastructure: HiberActive Publishing platform HiberActive External archival service (e.g. Internet Archive) • Asynchronous (returns Robust Link) • Distributed (archived with different organisations) • Lightweight (leveraging HTTP & what already exists)
  • 55. 5. Summary Tweets on #hiberlink to #UKSG15
  • 56. Hiberlink Outcomes 1. Defined the Threat of Reference Rot 2. Quantified the extent and way in which it exists & undermines the Scholarly Record 3. Pointed to potential & practical Remedy
  • 57. Hiberlink Outcomes 1. Defined the Threat of Reference Rot 2. Quantified the extent and way in which it exists & undermines the Scholarly Record 3. Pointed to potential & practical Remedy As project comes to an end (June 2015) so we wish to: • Tell the world about these achievements • Engage with others – to build infrastructure – To prompt adoption (copying) of prototypes by 3rd parties • such as reference managers, editorial systems, publication systems, archival systems
  • 58. Thank you, Questions welcome http://hiberlink.org #hiberlink Email: edina@ed.ac.uk Still Time to Tweet to #UKSG15 Funded by the Andrew W. Mellon Foundation