SlideShare a Scribd company logo
1 of 40
Reference Rot and Linked Data: Threat and Remedy 
Peter Burnhill 
EDINA, University of Edinburgh 
for the Hiberlink Team at University of Edinburgh & LANL Research Library 
PRELIDA 
18/19thOctober 2014 
Funded by the Andrew W. Mellon Foundation
The Project Team 
2013 – 2015, funded by the 
Andrew W. Mellon Foundation 
• Los Alamos National Laboratory: 
Research Library: Martin Klein, [Rob Sanderson], 
Harihar Shankar, Herbert Van de Sompel 
• University of Edinburgh: 
Language Technology Group: Beatrice Alex, Claire Grover, 
Colin Matheson, Richard Tobin, [Ke “Adam” Zhou] 
EDINA * : Neil Mayo, Muriel Mewissen (Project Manager), 
Tim Stickland, Richard Wincewicz, Peter Burnhill 
Centre for Service Delivery & Digital Expertise 
Funded by the Andrew W. Mellon Foundation 
PRELIDA 
18/19thOctober 2014
1. Social Science Research Council [now ESRC, UK] 
– ‘Scientific Officer’ 
funding Data 
& use of Data 
2. Scottish Education Data Archive, 1979 – 1984/1987 
– Survey statistician: school leavers, YTS, 16-19 cohort surveys; demand for HE 
3. Edinburgh University Data Library, 1984 to present 
– President of IASSIST, 1997 – 2001: social science data professionals 
4. ESRC Regional Research Laboratory for Scotland, 1986 -1990 
– Co-director, early days of Geographical Information Systems (GIS) 
– member of Data Task Force, UK Inter-Agency Global Env. Change 
5. Graduate School, Faculty of Social Science, UofEd 1987 – 1997 
– Senior Lecturer (p/t), teaching quantitative/survey methods 
– Director of RAPID: ESRC Research Activity & Publications Information Database 
6. EDINA national data centre, 1995/6 to present 
– Director: set-up and continuous development; Jisc-funded UK national services 
7. UK Digital Curation Centre (DCC), 2003/04 - 2004/05 
– Director for set-up & definition of ‘data curation + digital preservation’ 
8. CLOCKSS Founder & Board Member / LOCKSS deployment 
3 
Data Manufacturing 
Data Brokering 
Spatial Data 
& MetaData
licence 
to use 
Ensuring 
researchers, students and their teachers have 
ease and continuing access 
to online resources used for scholarship 
P.Burnhill, Edinburgh 2009 
access 
to content & services
Buckland: thinking about Digital Libraries 
mix of the document tradition (signifying objects & their use) 
& the computation tradition (applying algorithmic, logical, 
mathematical, and mechanical techniques to information management) 
“Both traditions are needed. Information Science is rooted in part in 
humanities and qualitative social sciences. The landscape of Information 
Science is complex. An ecumenical view is needed.” 
– M.Buckland, Journal of American Society for Information Science, 50, 1999 
2 (non-convergent) mentalities,Document-ness & Computation 
+ a third dimension, the domain of application: 
• Academic discipline – if we do this for ourselves 
• Business area – if we do this for use beyond …
Related Activity 
by Partners 
• Los Alamos National Laboratory Research Library: 
• Memento 
• ResourceSync 
• http://www.niso.org/workrooms/resourcesync/ 
• University of Edinburgh / Informatics / Language Technology Group: 
• Text mining / Edinburgh Parser 
• University of Edinburgh/ Jisc / EDINA : 
• CLOCKSS / LOCKSS 
• Keepers Registry 
• https://www.era.lib.ed.ac.uk/handle/1842/6682
Top level Problem: We would like to assume that our libraries 
are ensuring that online e-journal content is being kept safe 
But online articles in the Scholarly Record are not in 
the custody of Libraries, nor on their digital shelves. 
Picture credit: http://somanybooksblog.com/2009/03/27/library-tour/
Evidence from <thekeepers.org> is worrying! 
The Keepers Registry aggregates what is being kept by the (10) leading 
archiving agencies (CLOCKSS, Portico, national libraries etc) with all 
issued with ISSN 
① ‘Ingest Ratio’ = titles being ingested by one or more Keeper 
/ ‘online serials’ in ISSN Register 
= 23,268 / 136,965 [in March 2014] => 17% 
* We do not know about 83% of e-serials having ISSN * 
‘KeepSafe Ratio’ = ingest by 3+ Keepers = 9,652 / 136,965 => 7% 
② Title Lists of 3 US research libraries (Columbia, Cornell & Duke), 
checked i2011/12 ‘Ingest Ratio’ = 22% to 28%; c.75% unknown fate 
③ User-centric Evidence, UK usage in 2012, UK OpenURL Router logs 
=> over two thirds 68% (36,326 titles) held by none!
Memento 
The Memento "Time Travel for the Web" protocol 
http://mementoweb.org/ 
• an interoperable approach to access web archives 
(IETF RFC 7089) 
• adopted by all major public archives worldwide, including the 
Internet Archive. 
• Memento for Chrome http://bit.ly/memento-for-chrome 
• This protocol underpins the work being done in Hiberlink
Now, about Reference Rot & Linked Data … 
1. Some definitions 
• What is Reference Rot? 
• What may be special about Linked Data? 
2. Evoking metaphor 
• The moment / snapshot / memento 
• Flash-freezing to avoid or to stop the rot (of fruit on vine) 
3. Evidence of Threat of Reference 
4. Devising Remedy for Reference Rot 
• Proposals for intervention: plug-ins & infrastructural solutions 
5. Next Steps: how to take this work forward?
Investigating Reference Rot in Web-Based Scholarly Communication 
Reference Rot = Link Rot + Content Drift 
“when links to web resources 
no longer point to what they once did”
Link Rot 
‘Link Rot’
+ Content Drift: What is at end of URI has changed, or gone! 
http://dl00.org 
2000 
http://dl00.org 
2004 
(a) Dynamic content 
as values on webpage 
changes over time 
http://dl00.org 
2005 
http://dl00.org 
2008 
(b) Static content 
but very different (often 
unrelated) web pages
What of Linked Data? 
One or more sets of 3 linked URIs: conversation or 
statements for the long term? As time passes, so the 
content at the end of each of those URIs will suffer: 
Reference Rot = Link Rot + Content Drift 
“when links to web resources 
no longer point to what they once did” 
“Adding eScience Assets to the Data Web”, Herbert Van de Sompel, 
Carl Lagoze, Michael L. Nelson, Simeon Warner, Robert Sanderson, Pete Johnston. 
Proceedings of Linked Data on the Web (LDOW2009) Workshop, [v1] Thu, 11 Jun 2009 
15:33:37 GMT http://arxiv.org/abs/0906.2135v1
Example: ‘mark up’ archaeological site record (metadata)
RDF graph: Article & Supplementary Data 
http://www.emeraldinsight.com/fig/0350570303002.png 
1. Build and publish as metadata in XML format to be found on the web 
2. Publishing text and data/multimedia content in XML will delight researchers 
• Researchers want to access ‘article as data’, via computational algorithm
What we are doing in Hiberlink 
1. Creating evidence on extent of ‘Reference Rot’ 
– Main focus has been on references (& URIs) made in Journal Articles 
• Inc. reference rot in Supreme Court judgments with Harvard Law Library & permaCC 
– ETD2014 was opportunity to look at Reference Rot & the e-Thesis 
– PRELIDA is opportunity to look at impact on Linked Data 
2. Understanding the preparation/publication/ingest workflow(s) 
– Identifying opportunity for productive intervention 
1. Prototypes for pro-active archiving to enable remedy 
– Embedding such ‘solutions’ in existing tools & infrastructure 
2. Raising awareness & seeking collaborative actions 
…. through events like this
Empirical evidence on the Threat of 
Reference Rot 
Large-scale analyses: Journal Articles & E-Theses
Methodology: to discover answer to 2 questions 
i. Do those links (URIs) still work? Is the URI on the ‘Live Web’’? 
• Allowing up to a maximum of 50 redirects, recording the HTTP transaction chain and 
regarding an 2XX status code as ‘live’
Methodology: to discover answer to 2 questions 
i. Do those links (URIs) still work? Is the URI on the ‘Live Web’’? 
• Allowing up to a maximum of 50 redirects, recording the HTTP transaction chain and regarding an 2XX status 
code as ‘live’ 
ii. Is there a ‘Memento’ of that reference in the ‘Archived Web’? 
Memento: a prior version, what the Original Resource was like at some time in the past.
A Measure of Reference Rot: Are those references available? 
[in 6,400 e-Theses defended in 2003-2010 at 5 US universities] 
After up to 50 redirects 
Live on Web Not Found on ‘Live Web’ All 
Count 29,122 16,860 45,982 
% 63.3 36.7 100% 
Less than two-thirds 
of those links lead 
to live content 
1st Order Indicator of 
‘Reference Rot’ more than one 
third of references 
to the Web subject to ‘rot’
References in Citations Rot over Time: 
URIs cease to exist on the live Web 
[excluding 0s&1s: a few theses are unaffected; a few are ruined] 
We can’t stop that process of rot: 
Web content changes over time, 
Reference Rot is inevitable function of time 
Number of months elapsed from Date Thesis Defended until date archives checked (June 2014)
Searching for ‘Datetime’ Mementos of content in ‘Archived Web’ 
[in 6,400 e-Theses defended in 2003-2010 at 5 US universities] 
% Live on Web Not found on ‘Live Web’ All 
Found to be 
Archived 
47.6 
Not Found 52.4 
All 100% 
There seems a 50:50 chance that 
referenced content is in the ‘Archived Web’. 
=> half of those references are at ‘risk of loss’ 
Some content is being ‘co-incidentally harvested’ by routine web archiving.
‘Incidental Archiving’ is constant over time 
(This is an ‘upper bound estimate’, independent of age of e-thesis) 
We can improve upon this ‘50:50 chance’ 
by pro-actively archiving what we cite
We already have ‘Lost Content’ for References to Web 
[in 6,400 e-Theses defended in 2003-2010 at 5 US universities] 
% Live on Web Not found on ‘Live Web’ All 
Found to be 
Archived 
29.3 18.3 47.6 
Not Found 34.0 18.4 52.4 
All 63.3 36.7 100% 
18.4% 
‘not live & not found in archive’ 
judged to be lost forever 
34% 
‘live’ & ‘not in archive’ 
at is risk of loss 
NB: The 34% ‘at risk’ could be saved by pro-active archiving
Hiberlink Next Phase: in-depth study of Content Drift 
But demonstrated that problem exists & is severe 
• The Web changes over time: significant reference 
rot occurs 
• Routine Web Archiving delivers no better than 
50:50 chance of success of having co-Incidentally 
archived what you referenced 
- and probably much less chance when we check 
extent of content drift 
- Not (yet) studied impact on Linked Data but 
expect similar
“Researchers need to know when information on a viewed page has changed. 
“Authors of long-shelf-life material want to be sure that their links will still work 
far into the future. 
Jonathan Zittrain, Larry Lessig and Kendra Albert report that 
• Harvard Law Review 
75% of links are dead 
• top 1% Impact Factor Journals 
10% of links dead just 15 months after publication 
• US Supreme Court decisions 
29% of links dead 
49% of links do not point to the original target 
http://papers.ssrn.com/sol3/papers.cfm?abstract_id=2329161
Devising Remedy for Reference Rot 
for Linked Data?
Strategy for Making Remedy 
Seek pro-active ‘transactional archiving’ solutions 
– focus on what is regarded by authors as important 
a) Understand the preparation/publication workflow 
– identifying where there can be productive intervention 
a) Devise prototypes for pro-active archiving 
– writing & implementing code! 
b) Propose/test infrastructure for temporal referencing 
– supporting & using the Memento protocol 
Where possible, we wish to embed ‘solutions’ in existing 
tools & infrastructure
3 workflows in scholarly statement 
① Preparation-> Study - > Compose -> (Review) -> Submission 
② Publication -> (Editorial)Examination -> (Revision) -> Acceptance -> Issue 
③ Post-Publication-> Deposit/Ingest -> Provide/Access -> Use 
Extended length of stages in workflows magnify reference rot & affect, 
as referenced content on the web rots over time 
Identify the best opportunities for Intervention to make Remedy, 
to ‘flash-freeze’, either to avoid reference rot or to ‘stop the rot’ 
What are the key workflows for the manufacture, 
release and use of Linked Data?
3 workflows in Linked Data 
What are the key workflows for the manufacture, 
release and use of Linked Data? 
① Manufacture-> Create- > (Review) -> Prepare to publish/release/commit 
② Authority: Release-> (Editorial)Examination -> (Revision) -> Acceptance 
③ Use: Curate -> Deposit/Ingest -> Provide/Access -> Use 
What is it that changes over time: concepts, assigned attributes; 
why and on what timescale? 
Identify the best opportunities for Intervention to make Remedy, 
to ‘flash-freeze’, either to avoid reference rot or to ‘stop the rot’
‘Work in progress’ to effect Remedy 
1. Hiberlink Plug-in - for pro-active ‘transactional’ archiving 
– At the time of authoring (ie manufacture) 
2. Missing Link - re-factoring the HTML link 
– By which one annotates with {DateTime; location of archived copy/ies} 
3. HiberActive - a system for actively archiving references 
– Designed to ‘stop the rot’, a lossy 2nd Best to transactional archiving’ 
LANL: Martin Klein, Harihar Shankar, Herbert Van de Sompel 
UoEd EDINA: Neil Mayo, Tim Stickland, Richard Wincewicz 
Hiberlink 
ETD2014, Leicester UK July 25th 2014 
Funded by the Andrew W. Mellon Foundation
Hiberlink Plug-in [for Zotero] 
① Triggers archiving of referenced web content 
② Returns DateTime URI for archived content 
For use during authoring [manufacture] of information object & 
before final issue 
but also 
before ingest by ‘library’ (& maybe for repair by ‘library’ …)
‘Work in progress’ to effect Remedy (2) 
1. Hiberlink Plug-in - to enable pro-active archiving 
2. Missing Link - re-factor the HTML link that is returned 
a) Take simple URI - to French National Library (say) 
b) Augment Link with a set of Datetime & location pairs 
Prepared by: 
Herbert Van de Sompel, Martin Klein, Robert Sanderson - Los Alamos National Laboratory 
Michael Nelson - Old Dominion University 
http://mementoweb.org/missing-link/
‘Work in progress’ to effect Remedy (3) 
1. Hiberlink Plug-in - to enable pro-active archiving 
2. Missing Link - re-factoring the HTML link 
First two approaches support ‘perfect scenario’: 
• All authors archive all their cited URIs 
• e.g. (but not exclusively) with Hiberlink / Zotero 
3. HiberActive 
– Enables repositories to ‘stop the rot’ 
by actively archiving those references in e-theses 
– A notification hub, a component for the infrastructure 
• testing workflow with ResourceSync, CORE 
& external archive programme
Summary 
• The Web changes over time: significant reference rot 
inevitably occurs (as a function of time) 
• Web Archiving delivers only c.50:50 chance of 
success of co-incidentally archiving what you 
referenced 
• Link by means of the original URI, at time of manufacture 
• But then …. Augment the link with temporal context, to 
increase robustness of link to referenced content 
o Date of linking 
o URI of archived snapshot(s) 
• Then again, maybe this is all about archiving to support 
citation and not really about ‘preservation’, but it does 
assist continuity of access
Multi-level Problem: Digital Shelving for The Research Object; 
First Order References; Second Order References; …. 
Simple Statements [with URIs] 
1st Order References [with URIs] 
Complex Research Objects {URIs} 
Picture credit: http://somanybooksblog.com/2009/03/27/library-tour/ 
1st Order References {URI} 
2nd Order References [with URIs] 2nd Order References {URI} 
“Digital information is best preserved by replicating it [on digital 
shelving] at multiple archives run by autonomous organizations” 
B. Cooper and H. Garcia-Molina (2002)
Next Steps: how to take this work forward? 
to ensure URI/references don’t rot 
• Need to move from the ‘incidental Web archiving’ of cited URIs 
to pro-active archiving, by makers of Linked Data & by repositories? 
• Engage with these Hiberlink remedies 
• The Hiberlink Plug-in for Zotero / HiberActive 
Email: edina@ed.ac.uk 
Subject: Hiberlink ETD
http://hiberlink.org #hiberlink 
Thank you, 
Questions welcome 
& check: 
http://hiberlink.org/news.html 
Email: edina@ed.ac.uk 
Funded by the Andrew W. Mellon Foundation

More Related Content

What's hot

Tales from the Keepers Registry: Dr Who and the Scholarly Record
Tales from the Keepers Registry: Dr Who and the Scholarly RecordTales from the Keepers Registry: Dr Who and the Scholarly Record
Tales from the Keepers Registry: Dr Who and the Scholarly RecordEDINA, University of Edinburgh
 
Pushing Open The Jorum: A national repository for learning materials
Pushing Open The Jorum: A national repository for learning materialsPushing Open The Jorum: A national repository for learning materials
Pushing Open The Jorum: A national repository for learning materialsEDINA, University of Edinburgh
 
Hiberlink: Prototypes of pro-active approaches to support the archiving of we...
Hiberlink: Prototypes of pro-active approaches to support the archiving of we...Hiberlink: Prototypes of pro-active approaches to support the archiving of we...
Hiberlink: Prototypes of pro-active approaches to support the archiving of we...EDINA, University of Edinburgh
 
A national repository (library?) service for learning materials
A national repository (library?) service for learning materialsA national repository (library?) service for learning materials
A national repository (library?) service for learning materialsEDINA, University of Edinburgh
 
Credo reference promoting resources workshop edina slides
Credo reference promoting resources workshop   edina slidesCredo reference promoting resources workshop   edina slides
Credo reference promoting resources workshop edina slidesAndrew Bevan
 
Actions to Ensure the Integrity and Continuity of the Scholarly Record
Actions to Ensure the Integrity and Continuity of the Scholarly Record Actions to Ensure the Integrity and Continuity of the Scholarly Record
Actions to Ensure the Integrity and Continuity of the Scholarly Record EDINA, University of Edinburgh
 
Recommendation to the EU Hearing on Access to and Preservation of Scientific ...
Recommendation to the EU Hearing on Access to and Preservation of Scientific ...Recommendation to the EU Hearing on Access to and Preservation of Scientific ...
Recommendation to the EU Hearing on Access to and Preservation of Scientific ...EDINA, University of Edinburgh
 
Archiving The Worlds E-Journals:The Keepers Registry As Global Monitor
Archiving The Worlds E-Journals:The Keepers Registry As Global MonitorArchiving The Worlds E-Journals:The Keepers Registry As Global Monitor
Archiving The Worlds E-Journals:The Keepers Registry As Global MonitorEDINA, University of Edinburgh
 

What's hot (20)

Who is looking after your e-journals?
Who is looking after your e-journals?Who is looking after your e-journals?
Who is looking after your e-journals?
 
Tales from the Keepers Registry: Dr Who and the Scholarly Record
Tales from the Keepers Registry: Dr Who and the Scholarly RecordTales from the Keepers Registry: Dr Who and the Scholarly Record
Tales from the Keepers Registry: Dr Who and the Scholarly Record
 
Pushing Open The Jorum: A national repository for learning materials
Pushing Open The Jorum: A national repository for learning materialsPushing Open The Jorum: A national repository for learning materials
Pushing Open The Jorum: A national repository for learning materials
 
Edina cigs-21-september-2012
Edina cigs-21-september-2012Edina cigs-21-september-2012
Edina cigs-21-september-2012
 
Tales from the Keepers Registry
Tales from the Keepers RegistryTales from the Keepers Registry
Tales from the Keepers Registry
 
UKLA Update On Activities
UKLA Update On ActivitiesUKLA Update On Activities
UKLA Update On Activities
 
UKLA Content Development
UKLA Content DevelopmentUKLA Content Development
UKLA Content Development
 
Hiberlink: Prototypes of pro-active approaches to support the archiving of we...
Hiberlink: Prototypes of pro-active approaches to support the archiving of we...Hiberlink: Prototypes of pro-active approaches to support the archiving of we...
Hiberlink: Prototypes of pro-active approaches to support the archiving of we...
 
Access to Digital Back Copy
Access to Digital Back CopyAccess to Digital Back Copy
Access to Digital Back Copy
 
Research Data MANTRA Project at Edinburgh
Research Data MANTRA Project at EdinburghResearch Data MANTRA Project at Edinburgh
Research Data MANTRA Project at Edinburgh
 
A national repository (library?) service for learning materials
A national repository (library?) service for learning materialsA national repository (library?) service for learning materials
A national repository (library?) service for learning materials
 
Credo reference promoting resources workshop edina slides
Credo reference promoting resources workshop   edina slidesCredo reference promoting resources workshop   edina slides
Credo reference promoting resources workshop edina slides
 
Using a dumb identifier to do smart things
Using a dumb identifier to do smart thingsUsing a dumb identifier to do smart things
Using a dumb identifier to do smart things
 
Actions to Ensure the Integrity and Continuity of the Scholarly Record
Actions to Ensure the Integrity and Continuity of the Scholarly Record Actions to Ensure the Integrity and Continuity of the Scholarly Record
Actions to Ensure the Integrity and Continuity of the Scholarly Record
 
Ukla uksg 2013_final
Ukla uksg 2013_finalUkla uksg 2013_final
Ukla uksg 2013_final
 
Preserving Streams of Issued Content
Preserving Streams of Issued ContentPreserving Streams of Issued Content
Preserving Streams of Issued Content
 
RDM Programme @ Edinburgh - Service Interoperation
RDM Programme @ Edinburgh - Service InteroperationRDM Programme @ Edinburgh - Service Interoperation
RDM Programme @ Edinburgh - Service Interoperation
 
Research Data Management Training and Support
Research Data Management Training and SupportResearch Data Management Training and Support
Research Data Management Training and Support
 
Recommendation to the EU Hearing on Access to and Preservation of Scientific ...
Recommendation to the EU Hearing on Access to and Preservation of Scientific ...Recommendation to the EU Hearing on Access to and Preservation of Scientific ...
Recommendation to the EU Hearing on Access to and Preservation of Scientific ...
 
Archiving The Worlds E-Journals:The Keepers Registry As Global Monitor
Archiving The Worlds E-Journals:The Keepers Registry As Global MonitorArchiving The Worlds E-Journals:The Keepers Registry As Global Monitor
Archiving The Worlds E-Journals:The Keepers Registry As Global Monitor
 

Viewers also liked

ESDIN - OGC Web Services Shibboleth Interoperability Experiment (OSI)
ESDIN - OGC Web Services Shibboleth Interoperability Experiment (OSI)ESDIN - OGC Web Services Shibboleth Interoperability Experiment (OSI)
ESDIN - OGC Web Services Shibboleth Interoperability Experiment (OSI)EDINA, University of Edinburgh
 
ShareGeo: Discovering and Sharing Geospatial Data - 12 months on and going open!
ShareGeo: Discovering and Sharing Geospatial Data - 12 months on and going open!ShareGeo: Discovering and Sharing Geospatial Data - 12 months on and going open!
ShareGeo: Discovering and Sharing Geospatial Data - 12 months on and going open!EDINA, University of Edinburgh
 
Jisc Publications Router: Delivering Open Access Content to Institutions
Jisc Publications Router: Delivering Open Access Content to InstitutionsJisc Publications Router: Delivering Open Access Content to Institutions
Jisc Publications Router: Delivering Open Access Content to InstitutionsEDINA, University of Edinburgh
 
SAML protected resources: the theory and practice of granularity and manageme...
SAML protected resources: the theory and practice of granularity and manageme...SAML protected resources: the theory and practice of granularity and manageme...
SAML protected resources: the theory and practice of granularity and manageme...EDINA, University of Edinburgh
 
Research Data Management: Approaches to Institutional Policy
Research Data Management: Approaches to Institutional PolicyResearch Data Management: Approaches to Institutional Policy
Research Data Management: Approaches to Institutional PolicyRobin Rice
 
Piloting an E-journals Preservation Registry Service: overview of PEPRS
Piloting an E-journals Preservation Registry Service: overview of PEPRSPiloting an E-journals Preservation Registry Service: overview of PEPRS
Piloting an E-journals Preservation Registry Service: overview of PEPRSSUNCAT
 
Research Data Management at the University of Edinburgh
Research Data Management at the University of EdinburghResearch Data Management at the University of Edinburgh
Research Data Management at the University of EdinburghEDINA, University of Edinburgh
 
University of Edinburgh's first QGIS Training Course
University of Edinburgh's first QGIS Training CourseUniversity of Edinburgh's first QGIS Training Course
University of Edinburgh's first QGIS Training CourseTom Armitage
 
Introduction to data and support services for Political Data Analysis
Introduction to data and support services for Political Data AnalysisIntroduction to data and support services for Political Data Analysis
Introduction to data and support services for Political Data AnalysisEDINA, University of Edinburgh
 
Edin casestudy-ou-rr-2011
Edin casestudy-ou-rr-2011Edin casestudy-ou-rr-2011
Edin casestudy-ou-rr-2011Robin Rice
 

Viewers also liked (20)

ESDIN - OGC Web Services Shibboleth Interoperability Experiment (OSI)
ESDIN - OGC Web Services Shibboleth Interoperability Experiment (OSI)ESDIN - OGC Web Services Shibboleth Interoperability Experiment (OSI)
ESDIN - OGC Web Services Shibboleth Interoperability Experiment (OSI)
 
ShareGeo: Discovering and Sharing Geospatial Data - 12 months on and going open!
ShareGeo: Discovering and Sharing Geospatial Data - 12 months on and going open!ShareGeo: Discovering and Sharing Geospatial Data - 12 months on and going open!
ShareGeo: Discovering and Sharing Geospatial Data - 12 months on and going open!
 
Jisc Publications Router: Delivering Open Access Content to Institutions
Jisc Publications Router: Delivering Open Access Content to InstitutionsJisc Publications Router: Delivering Open Access Content to Institutions
Jisc Publications Router: Delivering Open Access Content to Institutions
 
Palimpsest Social Media
Palimpsest Social MediaPalimpsest Social Media
Palimpsest Social Media
 
Deep Impact: Metadata and SUNCAT
Deep Impact: Metadata and SUNCATDeep Impact: Metadata and SUNCAT
Deep Impact: Metadata and SUNCAT
 
SAML protected resources: the theory and practice of granularity and manageme...
SAML protected resources: the theory and practice of granularity and manageme...SAML protected resources: the theory and practice of granularity and manageme...
SAML protected resources: the theory and practice of granularity and manageme...
 
Using OpenURL Activity Data Project 03 Aug 2011
Using OpenURL Activity Data Project 03 Aug 2011Using OpenURL Activity Data Project 03 Aug 2011
Using OpenURL Activity Data Project 03 Aug 2011
 
Research Data Management: Approaches to Institutional Policy
Research Data Management: Approaches to Institutional PolicyResearch Data Management: Approaches to Institutional Policy
Research Data Management: Approaches to Institutional Policy
 
Piloting an E-journals Preservation Registry Service: overview of PEPRS
Piloting an E-journals Preservation Registry Service: overview of PEPRSPiloting an E-journals Preservation Registry Service: overview of PEPRS
Piloting an E-journals Preservation Registry Service: overview of PEPRS
 
Research Data Management: Policy Development
Research Data Management: Policy DevelopmentResearch Data Management: Policy Development
Research Data Management: Policy Development
 
Research Data Management at the University of Edinburgh
Research Data Management at the University of EdinburghResearch Data Management at the University of Edinburgh
Research Data Management at the University of Edinburgh
 
University of Edinburgh's first QGIS Training Course
University of Edinburgh's first QGIS Training CourseUniversity of Edinburgh's first QGIS Training Course
University of Edinburgh's first QGIS Training Course
 
Digimap for Schools: Mapping the Nation
Digimap for Schools: Mapping the NationDigimap for Schools: Mapping the Nation
Digimap for Schools: Mapping the Nation
 
Nicola osborne socialmedia-leeds
Nicola osborne socialmedia-leedsNicola osborne socialmedia-leeds
Nicola osborne socialmedia-leeds
 
What does it mean to build a Citizen Science Project?
What does it mean to build a Citizen Science Project?What does it mean to build a Citizen Science Project?
What does it mean to build a Citizen Science Project?
 
IGIBS - BDB Research Forum, May 2011
IGIBS - BDB Research Forum, May 2011IGIBS - BDB Research Forum, May 2011
IGIBS - BDB Research Forum, May 2011
 
JISC MediaHub, Overview January 2013
JISC MediaHub, Overview January 2013JISC MediaHub, Overview January 2013
JISC MediaHub, Overview January 2013
 
Introduction to data and support services for Political Data Analysis
Introduction to data and support services for Political Data AnalysisIntroduction to data and support services for Political Data Analysis
Introduction to data and support services for Political Data Analysis
 
Accessing Treasure on lands and peoples
Accessing Treasure on lands and peoplesAccessing Treasure on lands and peoples
Accessing Treasure on lands and peoples
 
Edin casestudy-ou-rr-2011
Edin casestudy-ou-rr-2011Edin casestudy-ou-rr-2011
Edin casestudy-ou-rr-2011
 

Similar to Reference Rot and Linked Data: Threat and Remedy

HIBERLINK: Reference Rot and Linked Data: Threat and Remedy
HIBERLINK: Reference Rot and Linked Data: Threat and RemedyHIBERLINK: Reference Rot and Linked Data: Threat and Remedy
HIBERLINK: Reference Rot and Linked Data: Threat and RemedyPRELIDA Project
 
Stronger together: community initiatives in journal management
Stronger together: community initiatives in journal managementStronger together: community initiatives in journal management
Stronger together: community initiatives in journal managementJisc
 
Where data and journal content collide: what does it mean to ‘publish your da...
Where data and journal content collide: what does it mean to ‘publish your da...Where data and journal content collide: what does it mean to ‘publish your da...
Where data and journal content collide: what does it mean to ‘publish your da...EDINA, University of Edinburgh
 
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & Museums
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & MuseumsALIAOnline Practical Linked (Open) Data for Libraries, Archives & Museums
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & MuseumsJon Voss
 
Web Today, Good Tomorrow? Transactional archiving of web content
Web Today, Good Tomorrow? Transactional archiving of web contentWeb Today, Good Tomorrow? Transactional archiving of web content
Web Today, Good Tomorrow? Transactional archiving of web contentPeter Burnhill
 
Prototypes of pro-active approaches to support the archiving of web reference...
Prototypes of pro-active approaches to support the archiving of web reference...Prototypes of pro-active approaches to support the archiving of web reference...
Prototypes of pro-active approaches to support the archiving of web reference...EDINA, University of Edinburgh
 
Ensuring Continuity of Access To Our Published Heritage
Ensuring Continuity of Access To Our Published HeritageEnsuring Continuity of Access To Our Published Heritage
Ensuring Continuity of Access To Our Published HeritageEDINA, University of Edinburgh
 
Linked Data - the Future for Open Repositories?
Linked Data - the Future for Open Repositories?Linked Data - the Future for Open Repositories?
Linked Data - the Future for Open Repositories?Adrian Stevenson
 
Web Today, Good Tomorrow? Transactional archiving of web content [Long Version]
Web Today, Good Tomorrow? Transactional archiving of web content [Long Version]Web Today, Good Tomorrow? Transactional archiving of web content [Long Version]
Web Today, Good Tomorrow? Transactional archiving of web content [Long Version]Peter Burnhill
 
Locah Project Show and Tell
Locah Project Show and TellLocah Project Show and Tell
Locah Project Show and TellAdrian Stevenson
 
High and Lows of Library Linked Data
High and Lows of Library Linked DataHigh and Lows of Library Linked Data
High and Lows of Library Linked DataAdrian Stevenson
 
Towards OpenURL Quality Metrics: Initial Findings
Towards OpenURL Quality Metrics: Initial FindingsTowards OpenURL Quality Metrics: Initial Findings
Towards OpenURL Quality Metrics: Initial Findingsalc28
 
'Your Scholarship. Our World. Preserving the Long Tail' by Vicky Reich
'Your Scholarship. Our World. Preserving the Long Tail' by Vicky Reich'Your Scholarship. Our World. Preserving the Long Tail' by Vicky Reich
'Your Scholarship. Our World. Preserving the Long Tail' by Vicky ReichEDINA, University of Edinburgh
 
Institutional Repositories (NLA 2011)
Institutional Repositories (NLA 2011)Institutional Repositories (NLA 2011)
Institutional Repositories (NLA 2011)Paul Royster
 
Intro to Linked Open Data in Libraries Archives & Museums.
Intro to Linked Open Data in Libraries Archives & Museums.Intro to Linked Open Data in Libraries Archives & Museums.
Intro to Linked Open Data in Libraries Archives & Museums.Jon Voss
 
Linked data presentation for libraries (COMO)
Linked data presentation for libraries (COMO)Linked data presentation for libraries (COMO)
Linked data presentation for libraries (COMO)robin fay
 

Similar to Reference Rot and Linked Data: Threat and Remedy (20)

HIBERLINK: Reference Rot and Linked Data: Threat and Remedy
HIBERLINK: Reference Rot and Linked Data: Threat and RemedyHIBERLINK: Reference Rot and Linked Data: Threat and Remedy
HIBERLINK: Reference Rot and Linked Data: Threat and Remedy
 
Reference Rot and E-Theses: Threat and Remedy
Reference Rot and E-Theses: Threat and RemedyReference Rot and E-Theses: Threat and Remedy
Reference Rot and E-Theses: Threat and Remedy
 
Stronger together: community initiatives in journal management
Stronger together: community initiatives in journal managementStronger together: community initiatives in journal management
Stronger together: community initiatives in journal management
 
Where data and journal content collide: what does it mean to ‘publish your da...
Where data and journal content collide: what does it mean to ‘publish your da...Where data and journal content collide: what does it mean to ‘publish your da...
Where data and journal content collide: what does it mean to ‘publish your da...
 
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & Museums
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & MuseumsALIAOnline Practical Linked (Open) Data for Libraries, Archives & Museums
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & Museums
 
Web Today, Good Tomorrow? Transactional archiving of web content
Web Today, Good Tomorrow? Transactional archiving of web contentWeb Today, Good Tomorrow? Transactional archiving of web content
Web Today, Good Tomorrow? Transactional archiving of web content
 
Prototypes of pro-active approaches to support the archiving of web reference...
Prototypes of pro-active approaches to support the archiving of web reference...Prototypes of pro-active approaches to support the archiving of web reference...
Prototypes of pro-active approaches to support the archiving of web reference...
 
Ensuring Continuity of Access To Our Published Heritage
Ensuring Continuity of Access To Our Published HeritageEnsuring Continuity of Access To Our Published Heritage
Ensuring Continuity of Access To Our Published Heritage
 
Linked Data - the Future for Open Repositories?
Linked Data - the Future for Open Repositories?Linked Data - the Future for Open Repositories?
Linked Data - the Future for Open Repositories?
 
"In the Early Days of a Better Nation": Enhancing the power of metadata today...
"In the Early Days of a Better Nation": Enhancing the power of metadata today..."In the Early Days of a Better Nation": Enhancing the power of metadata today...
"In the Early Days of a Better Nation": Enhancing the power of metadata today...
 
Web Today, Good Tomorrow? Transactional archiving of web content [Long Version]
Web Today, Good Tomorrow? Transactional archiving of web content [Long Version]Web Today, Good Tomorrow? Transactional archiving of web content [Long Version]
Web Today, Good Tomorrow? Transactional archiving of web content [Long Version]
 
Locah Project Show and Tell
Locah Project Show and TellLocah Project Show and Tell
Locah Project Show and Tell
 
High and Lows of Library Linked Data
High and Lows of Library Linked DataHigh and Lows of Library Linked Data
High and Lows of Library Linked Data
 
Kohacon2016
Kohacon2016Kohacon2016
Kohacon2016
 
Towards OpenURL Quality Metrics: Initial Findings
Towards OpenURL Quality Metrics: Initial FindingsTowards OpenURL Quality Metrics: Initial Findings
Towards OpenURL Quality Metrics: Initial Findings
 
NISO Webinar: Library Linked Data: From Vision to Reality
NISO Webinar: Library Linked Data: From Vision to RealityNISO Webinar: Library Linked Data: From Vision to Reality
NISO Webinar: Library Linked Data: From Vision to Reality
 
'Your Scholarship. Our World. Preserving the Long Tail' by Vicky Reich
'Your Scholarship. Our World. Preserving the Long Tail' by Vicky Reich'Your Scholarship. Our World. Preserving the Long Tail' by Vicky Reich
'Your Scholarship. Our World. Preserving the Long Tail' by Vicky Reich
 
Institutional Repositories (NLA 2011)
Institutional Repositories (NLA 2011)Institutional Repositories (NLA 2011)
Institutional Repositories (NLA 2011)
 
Intro to Linked Open Data in Libraries Archives & Museums.
Intro to Linked Open Data in Libraries Archives & Museums.Intro to Linked Open Data in Libraries Archives & Museums.
Intro to Linked Open Data in Libraries Archives & Museums.
 
Linked data presentation for libraries (COMO)
Linked data presentation for libraries (COMO)Linked data presentation for libraries (COMO)
Linked data presentation for libraries (COMO)
 

More from EDINA, University of Edinburgh

We have the technology... We have the data... What next?
We have the technology... We have the data... What next?We have the technology... We have the data... What next?
We have the technology... We have the data... What next?EDINA, University of Edinburgh
 
Reference Rot in Theses: A HiberActive Pilot - 10x10 session for Repository F...
Reference Rot in Theses: A HiberActive Pilot - 10x10 session for Repository F...Reference Rot in Theses: A HiberActive Pilot - 10x10 session for Repository F...
Reference Rot in Theses: A HiberActive Pilot - 10x10 session for Repository F...EDINA, University of Edinburgh
 
If I Googled You, What Would I Find? Managing your digital footprint - Nicola...
If I Googled You, What Would I Find? Managing your digital footprint - Nicola...If I Googled You, What Would I Find? Managing your digital footprint - Nicola...
If I Googled You, What Would I Find? Managing your digital footprint - Nicola...EDINA, University of Edinburgh
 
Managing your Digital Footprint : Taking control of the metadata and tracks a...
Managing your Digital Footprint : Taking control of the metadata and tracks a...Managing your Digital Footprint : Taking control of the metadata and tracks a...
Managing your Digital Footprint : Taking control of the metadata and tracks a...EDINA, University of Edinburgh
 
Social media and blogging to develop and communicate research in the arts and...
Social media and blogging to develop and communicate research in the arts and...Social media and blogging to develop and communicate research in the arts and...
Social media and blogging to develop and communicate research in the arts and...EDINA, University of Edinburgh
 
Enhancing your research impact through social media - Nicola Osborne
Enhancing your research impact through social media - Nicola OsborneEnhancing your research impact through social media - Nicola Osborne
Enhancing your research impact through social media - Nicola OsborneEDINA, University of Edinburgh
 
Social Media in Marketing in Support of Your Personal Brand - Nicola Osborne
Social Media in Marketing in Support of Your Personal Brand - Nicola OsborneSocial Media in Marketing in Support of Your Personal Brand - Nicola Osborne
Social Media in Marketing in Support of Your Personal Brand - Nicola OsborneEDINA, University of Edinburgh
 
Best Practice for Social Media in Teaching & Learning Contexts - Nicola Osborne
Best Practice for Social Media in Teaching & Learning Contexts - Nicola OsborneBest Practice for Social Media in Teaching & Learning Contexts - Nicola Osborne
Best Practice for Social Media in Teaching & Learning Contexts - Nicola OsborneEDINA, University of Edinburgh
 
Introduction to Edinburgh University Data Library and national data services
Introduction to Edinburgh University Data Library and national data servicesIntroduction to Edinburgh University Data Library and national data services
Introduction to Edinburgh University Data Library and national data servicesEDINA, University of Edinburgh
 
Digimap for Schools: Introduction to an ICT based cross curricular resource f...
Digimap for Schools: Introduction to an ICT based cross curricular resource f...Digimap for Schools: Introduction to an ICT based cross curricular resource f...
Digimap for Schools: Introduction to an ICT based cross curricular resource f...EDINA, University of Edinburgh
 

More from EDINA, University of Edinburgh (20)

The Making of the English Landscape:
The Making of the English Landscape: The Making of the English Landscape:
The Making of the English Landscape:
 
Spatial Data, Spatial Humanities
Spatial Data, Spatial HumanitiesSpatial Data, Spatial Humanities
Spatial Data, Spatial Humanities
 
Land Cover Map 2015
Land Cover Map 2015Land Cover Map 2015
Land Cover Map 2015
 
We have the technology... We have the data... What next?
We have the technology... We have the data... What next?We have the technology... We have the data... What next?
We have the technology... We have the data... What next?
 
Reference Rot in Theses: A HiberActive Pilot - 10x10 session for Repository F...
Reference Rot in Theses: A HiberActive Pilot - 10x10 session for Repository F...Reference Rot in Theses: A HiberActive Pilot - 10x10 session for Repository F...
Reference Rot in Theses: A HiberActive Pilot - 10x10 session for Repository F...
 
GeoForum EDINA report 2017
GeoForum EDINA report 2017GeoForum EDINA report 2017
GeoForum EDINA report 2017
 
If I Googled You, What Would I Find? Managing your digital footprint - Nicola...
If I Googled You, What Would I Find? Managing your digital footprint - Nicola...If I Googled You, What Would I Find? Managing your digital footprint - Nicola...
If I Googled You, What Would I Find? Managing your digital footprint - Nicola...
 
Moray housemarch2017
Moray housemarch2017Moray housemarch2017
Moray housemarch2017
 
Uniof stirlingmarch2017secondary
Uniof stirlingmarch2017secondaryUniof stirlingmarch2017secondary
Uniof stirlingmarch2017secondary
 
Uniof glasgow jan2017_secondary
Uniof glasgow jan2017_secondaryUniof glasgow jan2017_secondary
Uniof glasgow jan2017_secondary
 
Managing your Digital Footprint : Taking control of the metadata and tracks a...
Managing your Digital Footprint : Taking control of the metadata and tracks a...Managing your Digital Footprint : Taking control of the metadata and tracks a...
Managing your Digital Footprint : Taking control of the metadata and tracks a...
 
Social media and blogging to develop and communicate research in the arts and...
Social media and blogging to develop and communicate research in the arts and...Social media and blogging to develop and communicate research in the arts and...
Social media and blogging to develop and communicate research in the arts and...
 
Enhancing your research impact through social media - Nicola Osborne
Enhancing your research impact through social media - Nicola OsborneEnhancing your research impact through social media - Nicola Osborne
Enhancing your research impact through social media - Nicola Osborne
 
Social Media in Marketing in Support of Your Personal Brand - Nicola Osborne
Social Media in Marketing in Support of Your Personal Brand - Nicola OsborneSocial Media in Marketing in Support of Your Personal Brand - Nicola Osborne
Social Media in Marketing in Support of Your Personal Brand - Nicola Osborne
 
Best Practice for Social Media in Teaching & Learning Contexts - Nicola Osborne
Best Practice for Social Media in Teaching & Learning Contexts - Nicola OsborneBest Practice for Social Media in Teaching & Learning Contexts - Nicola Osborne
Best Practice for Social Media in Teaching & Learning Contexts - Nicola Osborne
 
SCURL and SUNCAT serials holdings comparison service
SCURL and SUNCAT serials holdings comparison serviceSCURL and SUNCAT serials holdings comparison service
SCURL and SUNCAT serials holdings comparison service
 
Big data in Digimap
Big data in DigimapBig data in Digimap
Big data in Digimap
 
Introduction to Edinburgh University Data Library and national data services
Introduction to Edinburgh University Data Library and national data servicesIntroduction to Edinburgh University Data Library and national data services
Introduction to Edinburgh University Data Library and national data services
 
Digimap for Schools: Introduction to an ICT based cross curricular resource f...
Digimap for Schools: Introduction to an ICT based cross curricular resource f...Digimap for Schools: Introduction to an ICT based cross curricular resource f...
Digimap for Schools: Introduction to an ICT based cross curricular resource f...
 
Digimap Update - Geoforum 2016 - Guy McGarva
Digimap Update - Geoforum 2016 - Guy McGarvaDigimap Update - Geoforum 2016 - Guy McGarva
Digimap Update - Geoforum 2016 - Guy McGarva
 

Recently uploaded

DATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginnersDATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginnersSabitha Banu
 
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17Celine George
 
Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxthorishapillay1
 
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...Postal Advocate Inc.
 
Full Stack Web Development Course for Beginners
Full Stack Web Development Course  for BeginnersFull Stack Web Development Course  for Beginners
Full Stack Web Development Course for BeginnersSabitha Banu
 
ACC 2024 Chronicles. Cardiology. Exam.pdf
ACC 2024 Chronicles. Cardiology. Exam.pdfACC 2024 Chronicles. Cardiology. Exam.pdf
ACC 2024 Chronicles. Cardiology. Exam.pdfSpandanaRallapalli
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17Celine George
 
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)lakshayb543
 
Barangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptxBarangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptxCarlos105
 
How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17Celine George
 
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxMULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxAnupkumar Sharma
 
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITYISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITYKayeClaireEstoconing
 
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdfAMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdfphamnguyenenglishnb
 
ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4MiaBumagat1
 
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxiammrhaywood
 

Recently uploaded (20)

DATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginnersDATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginners
 
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
 
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptxYOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
 
Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptx
 
FINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptx
FINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptxFINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptx
FINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptx
 
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
 
Full Stack Web Development Course for Beginners
Full Stack Web Development Course  for BeginnersFull Stack Web Development Course  for Beginners
Full Stack Web Development Course for Beginners
 
ACC 2024 Chronicles. Cardiology. Exam.pdf
ACC 2024 Chronicles. Cardiology. Exam.pdfACC 2024 Chronicles. Cardiology. Exam.pdf
ACC 2024 Chronicles. Cardiology. Exam.pdf
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
 
Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17
 
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
 
Barangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptxBarangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptx
 
How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17
 
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxMULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
 
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITYISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
 
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptxLEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
 
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdfAMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
 
Raw materials used in Herbal Cosmetics.pptx
Raw materials used in Herbal Cosmetics.pptxRaw materials used in Herbal Cosmetics.pptx
Raw materials used in Herbal Cosmetics.pptx
 
ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4
 
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
 

Reference Rot and Linked Data: Threat and Remedy

  • 1. Reference Rot and Linked Data: Threat and Remedy Peter Burnhill EDINA, University of Edinburgh for the Hiberlink Team at University of Edinburgh & LANL Research Library PRELIDA 18/19thOctober 2014 Funded by the Andrew W. Mellon Foundation
  • 2. The Project Team 2013 – 2015, funded by the Andrew W. Mellon Foundation • Los Alamos National Laboratory: Research Library: Martin Klein, [Rob Sanderson], Harihar Shankar, Herbert Van de Sompel • University of Edinburgh: Language Technology Group: Beatrice Alex, Claire Grover, Colin Matheson, Richard Tobin, [Ke “Adam” Zhou] EDINA * : Neil Mayo, Muriel Mewissen (Project Manager), Tim Stickland, Richard Wincewicz, Peter Burnhill Centre for Service Delivery & Digital Expertise Funded by the Andrew W. Mellon Foundation PRELIDA 18/19thOctober 2014
  • 3. 1. Social Science Research Council [now ESRC, UK] – ‘Scientific Officer’ funding Data & use of Data 2. Scottish Education Data Archive, 1979 – 1984/1987 – Survey statistician: school leavers, YTS, 16-19 cohort surveys; demand for HE 3. Edinburgh University Data Library, 1984 to present – President of IASSIST, 1997 – 2001: social science data professionals 4. ESRC Regional Research Laboratory for Scotland, 1986 -1990 – Co-director, early days of Geographical Information Systems (GIS) – member of Data Task Force, UK Inter-Agency Global Env. Change 5. Graduate School, Faculty of Social Science, UofEd 1987 – 1997 – Senior Lecturer (p/t), teaching quantitative/survey methods – Director of RAPID: ESRC Research Activity & Publications Information Database 6. EDINA national data centre, 1995/6 to present – Director: set-up and continuous development; Jisc-funded UK national services 7. UK Digital Curation Centre (DCC), 2003/04 - 2004/05 – Director for set-up & definition of ‘data curation + digital preservation’ 8. CLOCKSS Founder & Board Member / LOCKSS deployment 3 Data Manufacturing Data Brokering Spatial Data & MetaData
  • 4. licence to use Ensuring researchers, students and their teachers have ease and continuing access to online resources used for scholarship P.Burnhill, Edinburgh 2009 access to content & services
  • 5. Buckland: thinking about Digital Libraries mix of the document tradition (signifying objects & their use) & the computation tradition (applying algorithmic, logical, mathematical, and mechanical techniques to information management) “Both traditions are needed. Information Science is rooted in part in humanities and qualitative social sciences. The landscape of Information Science is complex. An ecumenical view is needed.” – M.Buckland, Journal of American Society for Information Science, 50, 1999 2 (non-convergent) mentalities,Document-ness & Computation + a third dimension, the domain of application: • Academic discipline – if we do this for ourselves • Business area – if we do this for use beyond …
  • 6. Related Activity by Partners • Los Alamos National Laboratory Research Library: • Memento • ResourceSync • http://www.niso.org/workrooms/resourcesync/ • University of Edinburgh / Informatics / Language Technology Group: • Text mining / Edinburgh Parser • University of Edinburgh/ Jisc / EDINA : • CLOCKSS / LOCKSS • Keepers Registry • https://www.era.lib.ed.ac.uk/handle/1842/6682
  • 7. Top level Problem: We would like to assume that our libraries are ensuring that online e-journal content is being kept safe But online articles in the Scholarly Record are not in the custody of Libraries, nor on their digital shelves. Picture credit: http://somanybooksblog.com/2009/03/27/library-tour/
  • 8. Evidence from <thekeepers.org> is worrying! The Keepers Registry aggregates what is being kept by the (10) leading archiving agencies (CLOCKSS, Portico, national libraries etc) with all issued with ISSN ① ‘Ingest Ratio’ = titles being ingested by one or more Keeper / ‘online serials’ in ISSN Register = 23,268 / 136,965 [in March 2014] => 17% * We do not know about 83% of e-serials having ISSN * ‘KeepSafe Ratio’ = ingest by 3+ Keepers = 9,652 / 136,965 => 7% ② Title Lists of 3 US research libraries (Columbia, Cornell & Duke), checked i2011/12 ‘Ingest Ratio’ = 22% to 28%; c.75% unknown fate ③ User-centric Evidence, UK usage in 2012, UK OpenURL Router logs => over two thirds 68% (36,326 titles) held by none!
  • 9. Memento The Memento "Time Travel for the Web" protocol http://mementoweb.org/ • an interoperable approach to access web archives (IETF RFC 7089) • adopted by all major public archives worldwide, including the Internet Archive. • Memento for Chrome http://bit.ly/memento-for-chrome • This protocol underpins the work being done in Hiberlink
  • 10. Now, about Reference Rot & Linked Data … 1. Some definitions • What is Reference Rot? • What may be special about Linked Data? 2. Evoking metaphor • The moment / snapshot / memento • Flash-freezing to avoid or to stop the rot (of fruit on vine) 3. Evidence of Threat of Reference 4. Devising Remedy for Reference Rot • Proposals for intervention: plug-ins & infrastructural solutions 5. Next Steps: how to take this work forward?
  • 11. Investigating Reference Rot in Web-Based Scholarly Communication Reference Rot = Link Rot + Content Drift “when links to web resources no longer point to what they once did”
  • 13. + Content Drift: What is at end of URI has changed, or gone! http://dl00.org 2000 http://dl00.org 2004 (a) Dynamic content as values on webpage changes over time http://dl00.org 2005 http://dl00.org 2008 (b) Static content but very different (often unrelated) web pages
  • 14. What of Linked Data? One or more sets of 3 linked URIs: conversation or statements for the long term? As time passes, so the content at the end of each of those URIs will suffer: Reference Rot = Link Rot + Content Drift “when links to web resources no longer point to what they once did” “Adding eScience Assets to the Data Web”, Herbert Van de Sompel, Carl Lagoze, Michael L. Nelson, Simeon Warner, Robert Sanderson, Pete Johnston. Proceedings of Linked Data on the Web (LDOW2009) Workshop, [v1] Thu, 11 Jun 2009 15:33:37 GMT http://arxiv.org/abs/0906.2135v1
  • 15. Example: ‘mark up’ archaeological site record (metadata)
  • 16.
  • 17. RDF graph: Article & Supplementary Data http://www.emeraldinsight.com/fig/0350570303002.png 1. Build and publish as metadata in XML format to be found on the web 2. Publishing text and data/multimedia content in XML will delight researchers • Researchers want to access ‘article as data’, via computational algorithm
  • 18. What we are doing in Hiberlink 1. Creating evidence on extent of ‘Reference Rot’ – Main focus has been on references (& URIs) made in Journal Articles • Inc. reference rot in Supreme Court judgments with Harvard Law Library & permaCC – ETD2014 was opportunity to look at Reference Rot & the e-Thesis – PRELIDA is opportunity to look at impact on Linked Data 2. Understanding the preparation/publication/ingest workflow(s) – Identifying opportunity for productive intervention 1. Prototypes for pro-active archiving to enable remedy – Embedding such ‘solutions’ in existing tools & infrastructure 2. Raising awareness & seeking collaborative actions …. through events like this
  • 19. Empirical evidence on the Threat of Reference Rot Large-scale analyses: Journal Articles & E-Theses
  • 20. Methodology: to discover answer to 2 questions i. Do those links (URIs) still work? Is the URI on the ‘Live Web’’? • Allowing up to a maximum of 50 redirects, recording the HTTP transaction chain and regarding an 2XX status code as ‘live’
  • 21. Methodology: to discover answer to 2 questions i. Do those links (URIs) still work? Is the URI on the ‘Live Web’’? • Allowing up to a maximum of 50 redirects, recording the HTTP transaction chain and regarding an 2XX status code as ‘live’ ii. Is there a ‘Memento’ of that reference in the ‘Archived Web’? Memento: a prior version, what the Original Resource was like at some time in the past.
  • 22. A Measure of Reference Rot: Are those references available? [in 6,400 e-Theses defended in 2003-2010 at 5 US universities] After up to 50 redirects Live on Web Not Found on ‘Live Web’ All Count 29,122 16,860 45,982 % 63.3 36.7 100% Less than two-thirds of those links lead to live content 1st Order Indicator of ‘Reference Rot’ more than one third of references to the Web subject to ‘rot’
  • 23. References in Citations Rot over Time: URIs cease to exist on the live Web [excluding 0s&1s: a few theses are unaffected; a few are ruined] We can’t stop that process of rot: Web content changes over time, Reference Rot is inevitable function of time Number of months elapsed from Date Thesis Defended until date archives checked (June 2014)
  • 24. Searching for ‘Datetime’ Mementos of content in ‘Archived Web’ [in 6,400 e-Theses defended in 2003-2010 at 5 US universities] % Live on Web Not found on ‘Live Web’ All Found to be Archived 47.6 Not Found 52.4 All 100% There seems a 50:50 chance that referenced content is in the ‘Archived Web’. => half of those references are at ‘risk of loss’ Some content is being ‘co-incidentally harvested’ by routine web archiving.
  • 25. ‘Incidental Archiving’ is constant over time (This is an ‘upper bound estimate’, independent of age of e-thesis) We can improve upon this ‘50:50 chance’ by pro-actively archiving what we cite
  • 26. We already have ‘Lost Content’ for References to Web [in 6,400 e-Theses defended in 2003-2010 at 5 US universities] % Live on Web Not found on ‘Live Web’ All Found to be Archived 29.3 18.3 47.6 Not Found 34.0 18.4 52.4 All 63.3 36.7 100% 18.4% ‘not live & not found in archive’ judged to be lost forever 34% ‘live’ & ‘not in archive’ at is risk of loss NB: The 34% ‘at risk’ could be saved by pro-active archiving
  • 27. Hiberlink Next Phase: in-depth study of Content Drift But demonstrated that problem exists & is severe • The Web changes over time: significant reference rot occurs • Routine Web Archiving delivers no better than 50:50 chance of success of having co-Incidentally archived what you referenced - and probably much less chance when we check extent of content drift - Not (yet) studied impact on Linked Data but expect similar
  • 28. “Researchers need to know when information on a viewed page has changed. “Authors of long-shelf-life material want to be sure that their links will still work far into the future. Jonathan Zittrain, Larry Lessig and Kendra Albert report that • Harvard Law Review 75% of links are dead • top 1% Impact Factor Journals 10% of links dead just 15 months after publication • US Supreme Court decisions 29% of links dead 49% of links do not point to the original target http://papers.ssrn.com/sol3/papers.cfm?abstract_id=2329161
  • 29. Devising Remedy for Reference Rot for Linked Data?
  • 30. Strategy for Making Remedy Seek pro-active ‘transactional archiving’ solutions – focus on what is regarded by authors as important a) Understand the preparation/publication workflow – identifying where there can be productive intervention a) Devise prototypes for pro-active archiving – writing & implementing code! b) Propose/test infrastructure for temporal referencing – supporting & using the Memento protocol Where possible, we wish to embed ‘solutions’ in existing tools & infrastructure
  • 31. 3 workflows in scholarly statement ① Preparation-> Study - > Compose -> (Review) -> Submission ② Publication -> (Editorial)Examination -> (Revision) -> Acceptance -> Issue ③ Post-Publication-> Deposit/Ingest -> Provide/Access -> Use Extended length of stages in workflows magnify reference rot & affect, as referenced content on the web rots over time Identify the best opportunities for Intervention to make Remedy, to ‘flash-freeze’, either to avoid reference rot or to ‘stop the rot’ What are the key workflows for the manufacture, release and use of Linked Data?
  • 32. 3 workflows in Linked Data What are the key workflows for the manufacture, release and use of Linked Data? ① Manufacture-> Create- > (Review) -> Prepare to publish/release/commit ② Authority: Release-> (Editorial)Examination -> (Revision) -> Acceptance ③ Use: Curate -> Deposit/Ingest -> Provide/Access -> Use What is it that changes over time: concepts, assigned attributes; why and on what timescale? Identify the best opportunities for Intervention to make Remedy, to ‘flash-freeze’, either to avoid reference rot or to ‘stop the rot’
  • 33. ‘Work in progress’ to effect Remedy 1. Hiberlink Plug-in - for pro-active ‘transactional’ archiving – At the time of authoring (ie manufacture) 2. Missing Link - re-factoring the HTML link – By which one annotates with {DateTime; location of archived copy/ies} 3. HiberActive - a system for actively archiving references – Designed to ‘stop the rot’, a lossy 2nd Best to transactional archiving’ LANL: Martin Klein, Harihar Shankar, Herbert Van de Sompel UoEd EDINA: Neil Mayo, Tim Stickland, Richard Wincewicz Hiberlink ETD2014, Leicester UK July 25th 2014 Funded by the Andrew W. Mellon Foundation
  • 34. Hiberlink Plug-in [for Zotero] ① Triggers archiving of referenced web content ② Returns DateTime URI for archived content For use during authoring [manufacture] of information object & before final issue but also before ingest by ‘library’ (& maybe for repair by ‘library’ …)
  • 35. ‘Work in progress’ to effect Remedy (2) 1. Hiberlink Plug-in - to enable pro-active archiving 2. Missing Link - re-factor the HTML link that is returned a) Take simple URI - to French National Library (say) b) Augment Link with a set of Datetime & location pairs Prepared by: Herbert Van de Sompel, Martin Klein, Robert Sanderson - Los Alamos National Laboratory Michael Nelson - Old Dominion University http://mementoweb.org/missing-link/
  • 36. ‘Work in progress’ to effect Remedy (3) 1. Hiberlink Plug-in - to enable pro-active archiving 2. Missing Link - re-factoring the HTML link First two approaches support ‘perfect scenario’: • All authors archive all their cited URIs • e.g. (but not exclusively) with Hiberlink / Zotero 3. HiberActive – Enables repositories to ‘stop the rot’ by actively archiving those references in e-theses – A notification hub, a component for the infrastructure • testing workflow with ResourceSync, CORE & external archive programme
  • 37. Summary • The Web changes over time: significant reference rot inevitably occurs (as a function of time) • Web Archiving delivers only c.50:50 chance of success of co-incidentally archiving what you referenced • Link by means of the original URI, at time of manufacture • But then …. Augment the link with temporal context, to increase robustness of link to referenced content o Date of linking o URI of archived snapshot(s) • Then again, maybe this is all about archiving to support citation and not really about ‘preservation’, but it does assist continuity of access
  • 38. Multi-level Problem: Digital Shelving for The Research Object; First Order References; Second Order References; …. Simple Statements [with URIs] 1st Order References [with URIs] Complex Research Objects {URIs} Picture credit: http://somanybooksblog.com/2009/03/27/library-tour/ 1st Order References {URI} 2nd Order References [with URIs] 2nd Order References {URI} “Digital information is best preserved by replicating it [on digital shelving] at multiple archives run by autonomous organizations” B. Cooper and H. Garcia-Molina (2002)
  • 39. Next Steps: how to take this work forward? to ensure URI/references don’t rot • Need to move from the ‘incidental Web archiving’ of cited URIs to pro-active archiving, by makers of Linked Data & by repositories? • Engage with these Hiberlink remedies • The Hiberlink Plug-in for Zotero / HiberActive Email: edina@ed.ac.uk Subject: Hiberlink ETD
  • 40. http://hiberlink.org #hiberlink Thank you, Questions welcome & check: http://hiberlink.org/news.html Email: edina@ed.ac.uk Funded by the Andrew W. Mellon Foundation