SlideShare a Scribd company logo
Creating Pockets of Persistence 
Herbert Van de Sompel 
@hvdsomp 
http://public.lanl.gov/herbertv/ 
Los Alamos National Laboratory 
Acknowledgements: 
Michael L. Nelson 
@phonedude_mln 
Old Dominion University 
Herbert Van de Sompel 
404/File Not Found, Washington, DC, October 24 2014
Addressing the Link/Reference Rot Challenge 
• Pockets of Persistence 
• Capture – Archive Pro-Actively, Selectively 
• Reference – Annotate Links 
• Access – Travel in Time 
Herbert Van de Sompel 
404/File Not Found, Washington, DC, October 24 2014
Pockets of Persistence 
Herbert Van de Sompel 
How to achieve the ability to: 
404/File Not Found, Washington, DC, October 24 2014 
• Persistently 
• Precisely 
• Seamlessly 
revisit the Web of the Past 
and the Web of the Now at 
some point in the Future
Pockets of Persistence 
Herbert Van de Sompel 
How to achieve the ability to: 
404/File Not Found, Washington, DC, October 24 2014 
• Persistently 
• Precisely 
• Seamlessly 
revisit the Web of the Past 
and the Web of the Now at 
some point in the Future 
Two components to the link/reference rot 
challenge: 
• Link rot: Links stop working aka 404 
Not Found 
• Content drift: Referenced content 
changes over time
Illustration 
Herbert Van de Sompel 
Current version of http://en.wikipedia.org/wiki/Coil_(band) on October 22 2014 
404/File Not Found, Washington, DC, October 24 2014
Illustration – Link Rot 
Herbert Van de Sompel 
Current version of http://en.wikipedia.org/wiki/Coil_(band) on October 22 2014 
404/File Not Found, Washington, DC, October 24 2014
Illustration – Link Rot 
Herbert Van de Sompel 
Current version of http://liarsociety.tripod.com/blog/index.blog?from=20041130 on October 22 2014 
404/File Not Found, Washington, DC, October 24 2014
Illustration – Content Drift 
Version of http://en.wikipedia.Herbert org/Van wiki/de Coil_(Sompel 
band) dated October 2 2014 
http://en.wikipedia.org/w/index.php?title=Coil_(band)&oldid=388321480 
404/File Not Found, Washington, DC, October 24 2014
Illustration – Content Drift 
Herbert Van de Sompel 
Current version of http://en.wikipedia.org/wiki/Peter_Christopherson on October 22 2014 
404/File Not Found, Washington, DC, October 24 2014
Illustration – Content Drift 
Version of http://en.wikipedia.org/wiki/Peter_Christopherson that was current on October 2 2010 
Herbert Van de Sompel 
http://en.wikipedia.org/w/index.php?title=Peter_Christopherson&oldid=387987414 
404/File Not Found, Washington, DC, October 24 2014
Pockets of Persistence 
Herbert Van de Sompel 
How to achieve the ability to: 
404/File Not Found, Washington, DC, October 24 2014 
• Persistently 
• Precisely 
• Seamlessly 
revisit the Web of the Past 
and the Web of the Now at 
some point in the Future 
This challenge exists for the entire web, 
but some communities actually care 
about addressing it: 
• scholarly communication, 
• legal publications, 
• journalism, 
• Wikipedia, 
• … 
Mobilize the communities that care about 
this problem to work towards joint, 
interoperable solutions, approaches
Addressing the Link/Reference Rot Challenge 
• Pockets of Persistence 
• Capture – Archive Pro-Actively, Selectively 
• Reference – Annotate Links 
• Access – Travel in Time 
Herbert Van de Sompel 
404/File Not Found, Washington, DC, October 24 2014
Pro-Active Capture for a Seed Collection 
• Seed Collection - Starting point for capture is a seed collection of 
interest to communities that care, e.g. 
o On-Line journalism 
• Lifecycle Events – Intervene at critical moments in the lifecycle of 
items in these collections to pro-actively capture 
o Collection items – some solutions in place 
o Web resources referenced in collection items 
Herbert Van de Sompel 
o Scholarly literature 
o Legal documents 
o Wikipedia articles 
404/File Not Found, Washington, DC, October 24 2014
Pro-Active Capture for Seed Collection 
• What those crucial lifecycle events are may depend on the 
• Creation of new article 
• Creation of new version of 
article 
• Creation of substantially 
new version of article 
• Addition of external 
reference to article 
• References to article 
exceed a certain threshold 
Scholarly Literature 
Herbert Van de Sompel 
collection type 
Wikipedia 
404/File Not Found, Washington, DC, October 24 2014
Authoring Legal Documents – perma.cc 
Herbert Van de Sompel 
http://perma.cc 
404/File Not Found, Washington, DC, October 24 2014
Authoring Scholarly Literature: Experimental Zotero Extension 
Richard Wincewicz (2014) Prototype Hiberlink plugin for Zotero for pro-active archiving and temporal references 
Herbert Van de Sompel 
https://www.youtube.com/v/ZYmi_Ydr65M%26vq 
404/File Not Found, Washington, DC, October 24 2014
Submitting Scholarly Literature: Experimental HiberActive Service 
Martin Klein et al. (2014) HiberActive: Pro-Active Archiving of web references from scholarly articles 
Herbert Van de Sompel 
Open Repositories 2014 http://www.slideshare.net/martinklein0815/hiberactive 
404/File Not Found, Washington, DC, October 24 2014
Pro-Active Capture for Seed Collection 
• Interoperability for on-demand capture: 
o Need basic interoperability for machine-driven on-demand 
capture: 
- Discovery of capture interface 
- Interface IN - [ Original URI ] 
- Interface OUT - [ URI of Capture ; Capture Datetime ] 
Herbert Van de Sompel 
404/File Not Found, Washington, DC, October 24 2014
Addressing the Link/Reference Rot Challenge 
• Pockets of Persistence 
• Capture – Archive Pro-Actively, Selectively 
• Reference – Annotate Links 
• Access – Travel in Time 
Herbert Van de Sompel 
404/File Not Found, Washington, DC, October 24 2014
Reference Captures and Annotate Links 
• Existing practice for linking to captures: 
o Link to URI of Capture 
o Lose Capture Datetime 
• Problems with existing practice: 
o Impossible to visit the original URI, if desired 
o Requires the permanent existence/uptime of the archive that 
holds the capture 
- One link rot problem replaced by another 
Van de Sompel, H. et al. (2013) Thoughts on referencing, linking, reference rot 
Herbert Van de Sompel 
o Lose Original URI 
http://mementoweb.org/missing-link/ 
404/File Not Found, Washington, DC, October 24 2014
Permanent Existence/Uptime of Archives? 
Capture of http://webcitation.org dated July 17 2013 
Herbert Van de Sompel 
https://archive.today/eAETp 
404/File Not Found, Washington, DC, October 24 2014
Permanent Existence/Uptime of Archives? 
Herbert Van de Sompel 
http://webcitation.org/ on August 6 2014 
404/File Not Found, Washington, DC, October 24 2014
Permanent Existence/Uptime of Archives? 
Remnant of discontinued web archive http://mummify.it captured on February 14 2014 
Herbert Van de Sompel 
https://web.archive.org/web/20140214233752/https://www.mummify.it/ 
404/File Not Found, Washington, DC, October 24 2014
Permanent Existence/Uptime of Archives? 
http://www.themoscowtimes.com/news/article/russia-bans-wayback-machine-internet-archive-over-islamic-state-video/ 
Herbert Van de Sompel 
510074.html 
404/File Not Found, Washington, DC, October 24 2014
Hacking Original URI, Capture Datetime from Capture URI? 
URI of Capture Original URI Datetime T 
https://web.archive.org/web/20140214233752/https:// 
www.mummify.it 
https://archive.today/eAETp no no 
http://perma.cc/4RH7-999Q?type=source no no 
http://en.wikipedia.org/w/index.php?title=Coil_(band) 
&oldid=388321480 
Herbert Van de Sompel 
yes yes 
no no 
404/File Not Found, Washington, DC, October 24 2014
Using Capture URI to find Captures in Other Web Archives? 
Herbert Van de Sompel 
404/File Not Found, Washington, DC, October 24 2014
Using Capture URI to find Captures in Other Web Archives? 
Herbert Van de Sompel 
404/File Not Found, Washington, DC, October 24 2014
Reference Captures and Annotate Links 
• Desired practice for linking to captures is to annotate the link so it 
Herbert Van de Sompel 
404/File Not Found, Washington, DC, October 24 2014 
conveys: 
- URI of Capture 
- Original URI 
- Capture Datetime 
• Link annotation supports fallback to other archives: 
o Original URI allows finding captures in all web archives 
o Capture Datetime allows finding an appropriate capture in all 
web archives 
o Original URI and Capture Datetime allows automatic access 
to an appropriate capture in all web archives (see Access) 
Van de Sompel, H. et al. (2013) Thoughts on referencing, linking, reference rot 
http://mementoweb.org/missing-link/
Reference Captures and Annotate Links 
• Desired practice for linking to captures is to annotate the link so it 
Herbert Van de Sompel 
404/File Not Found, Washington, DC, October 24 2014 
conveys: 
URI of Capture 
Original URI Capture Datetime
Reference Captures and Annotate Links 
• Interoperability for link annotation: 
o Need an approach to convey, in a uniform, machine-actionable 
- URI of Capture 
- Original URI 
- Capture Datetime 
o Missing Link Proposal 
- http://mementoweb.org/missing-link/ 
o W3C Robustness and Archiving Community Group 
- http://www.w3.org/community/irobar/ 
Herbert Van de Sompel 
way: 
• Ongoing efforts: 
404/File Not Found, Washington, DC, October 24 2014
Missing Link Proposal 
<a href=“http://liarsociety.tripod.com/blog/index.blog?from=20041130” 
data-versionurl=“https://archive.today/ElCHn” 
data-versiondate=“2008-02-06T00:00:00Z”> 
Herbert Van de Sompel 
URI of Capture 
Capture Datetime 
404/File Not Found, Washington, DC, October 24 2014 
Original URI 
Van de Sompel, H. et al. (2013) Thoughts on referencing, linking, reference rot 
http://mementoweb.org/missing-link/
Addressing the Link/Reference Rot Challenge 
• Pockets of Persistence 
• Capture – Archive Pro-Actively, Selectively 
• Reference – Annotate Links 
• Access – Travel in Time 
Herbert Van de Sompel 
404/File Not Found, Washington, DC, October 24 2014
Memento Web Time Travel 
Use the Original URI 
Herbert Van de Sompel 
Current version of http://law.georgetown.edu/library/404/ on October 22 2014 
404/File Not Found, Washington, DC, October 24 2014
Memento Web Time Travel 
And a Datetime 
Herbert Van de Sompel 
404/File Not Found, Washington, DC, October 24 2014
Memento Web Time Travel 
To automatically retrieve the temporally nearest available capture 
Capture of http://law.georgetown.edu/library/404/ dated May 3 2014 
Herbert Van de Sompel 
http://wayback.archive-it.org/all/20140503094327/http://www.law.georgetown.edu/library/404/ 
404/File Not Found, Washington, DC, October 24 2014
Memento Web Time Travel 
http://bit.ly/memento-for-chrome 
Herbert Van de Sompel 
http://mementoweb.org 
404/File Not Found, Washington, DC, October 24 2014
Travel in Time - Persistently, Precisely, Seamlessly 
On-Demand Capture URI of Capture Original URI Datetime T 
Herbert Van de Sompel 
404/File Not Found, Washington, DC, October 24 2014 
Available 
Accessible 
+ - - 
• Time Travel is: 
• Persistent – See next slide 
• Precise – Following link to URI of Capture retrieves exact 
capture 
• Seamless – Requires clicking a link as usual
Travel in Time - Persistently, Precisely, Seamlessly 
On-Demand Capture URI of Capture Original URI Datetime T 
Herbert Van de Sompel 
404/File Not Found, Washington, DC, October 24 2014 
Available 
Not Accessible 
+ - - 
• Time Travel is: 
• Persistent – Following link to URI of Capture leads nowhere 
• Precise – Following link to URI of Capture leads nowhere 
• Seamless – Following link to URI of Capture leads nowhere
Travel in Time - Persistently, Precisely, Seamlessly 
On-Demand Capture URI of Capture Original URI Datetime T 
Herbert Van de Sompel 
404/File Not Found, Washington, DC, October 24 2014 
Available 
Not Accessible 
+ + + 
• Time Travel is: 
• Persistent – Using Memento with [ Original URI ; Datetime ] 
works across web archives, versioning systems 
• Precise – Using Memento with [ Original URI ; Datetime ] 
retrieves nearest capture from other archive 
• Seamless – Requires browser plugin
Travel in Time - Persistently, Precisely, Seamlessly 
On-Demand Capture URI of Capture Original URI Datetime T 
Herbert Van de Sompel 
404/File Not Found, Washington, DC, October 24 2014 
Available 
Accessible 
- + + 
• Time Travel is: 
• Persistent – Using Memento with [ Original URI ; Datetime ] 
works across web archives, versioning systems 
• Precise – Using Memento with [ Original URI ; Datetime ] 
retrieves exact capture from other archive 
• Seamless – Requires browser plugin
Travel in Time - Persistently, Precisely, Seamlessly 
On-Demand Capture URI of Capture Original URI Datetime T 
Not Available - + + 
• Persistent – Using Memento with [ Original URI ; Datetime ] 
works across web archives, versioning systems 
• Precise – Using Memento with [ Original URI ; Datetime ] 
retrieves nearest capture from other archive 
• Seamless – Requires browser plugin 
Herbert Van de Sompel 
• Time Travel is: 
404/File Not Found, Washington, DC, October 24 2014
Reference Captures and Annotate Links 
• Interoperability for time travel: 
o Memento protocol specifies interoperability across web 
archives, version management systems 
o Memento protocol is supported by major web archives 
o Need to work towards Memento support by version 
management systems 
o Need to work towards making Memento experience 
seamless through native browser support 
o Need to work towards robustness and sustainability of 
Memento infrastructure 
Herbert Van de Sompel 
404/File Not Found, Washington, DC, October 24 2014
Conclusion 
• Significant technical solutions, infrastructure, ideas exist to 
address the link rot/reference rot challenge 
• Mobilize the communities that care about this challenge to work 
towards joint, interoperable approaches 
Herbert Van de Sompel 
404/File Not Found, Washington, DC, October 24 2014
Creating Pockets of Persistence 
http://mementoweb.org 
http://hiberlink.org 
Herbert Van de Sompel 
404/File Not Found, Washington, DC, October 24 2014

More Related Content

What's hot

Signposting Overview
Signposting OverviewSignposting Overview
Signposting Overview
Herbert Van de Sompel
 
The web is rotting and what to do about it
The web is rotting and what to do about itThe web is rotting and what to do about it
The web is rotting and what to do about it
Herbert Van de Sompel
 
Signposting Overview (Version November 2017)
Signposting Overview (Version November 2017)Signposting Overview (Version November 2017)
Signposting Overview (Version November 2017)
Herbert Van de Sompel
 
Discovering Scholarly Orphans Using ORCID
Discovering Scholarly Orphans Using ORCIDDiscovering Scholarly Orphans Using ORCID
Discovering Scholarly Orphans Using ORCID
Martin Klein
 
OAC Presentation at CNI 09 Fall Forum
OAC Presentation at CNI 09 Fall ForumOAC Presentation at CNI 09 Fall Forum
OAC Presentation at CNI 09 Fall Forum
Robert Sanderson
 
Quantifying Orphaned Annotations in Hypothes.is
Quantifying Orphaned Annotations in Hypothes.isQuantifying Orphaned Annotations in Hypothes.is
Quantifying Orphaned Annotations in Hypothes.is
maturban
 
Signposting for Repositories
Signposting for RepositoriesSignposting for Repositories
Signposting for Repositories
Martin Klein
 
Achieving Link Integrity for Managed Collections
Achieving Link Integrity for Managed CollectionsAchieving Link Integrity for Managed Collections
Achieving Link Integrity for Managed Collections
Herbert Van de Sompel
 
How much does $1.7 billion buy?
How much does $1.7 billion buy?How much does $1.7 billion buy?
How much does $1.7 billion buy?
Martin Klein
 
Paul Evan Peters Lecture
Paul Evan Peters LecturePaul Evan Peters Lecture
Paul Evan Peters Lecture
Herbert Van de Sompel
 
Memento 101
Memento 101Memento 101
Creating Topical Collections: Web Archives vs. Live Web
Creating Topical Collections:Web Archives vs. Live WebCreating Topical Collections:Web Archives vs. Live Web
Creating Topical Collections: Web Archives vs. Live Web
Martin Klein
 
Linked Open Data for Libraries
Linked Open Data for LibrariesLinked Open Data for Libraries
Linked Open Data for Libraries
Lukas Koster
 
Reference Rot in Scholarly Communication: A Reliable Quantification and a P...
Reference Rot in Scholarly Communication: A Reliable Quantification and a P...Reference Rot in Scholarly Communication: A Reliable Quantification and a P...
Reference Rot in Scholarly Communication: A Reliable Quantification and a P...
Martin Klein
 
The Web We Want
The Web We WantThe Web We Want
The Web We Want
Steffen Staab
 
Registration / Certification Interoperability Architecture (overlay peer-review)
Registration / Certification Interoperability Architecture (overlay peer-review)Registration / Certification Interoperability Architecture (overlay peer-review)
Registration / Certification Interoperability Architecture (overlay peer-review)
Herbert Van de Sompel
 
Scripts in a Frame: A Two-Tiered Approach for Archiving Deferred Representations
Scripts in a Frame: A Two-Tiered Approach for Archiving Deferred RepresentationsScripts in a Frame: A Two-Tiered Approach for Archiving Deferred Representations
Scripts in a Frame: A Two-Tiered Approach for Archiving Deferred Representations
Justin Brunelle
 
Robust Linking to Web Resources
Robust Linking to Web ResourcesRobust Linking to Web Resources
Robust Linking to Web Resources
Martin Klein
 
TPDL2013 tutorial linked data for digital libraries 2013-10-22
TPDL2013 tutorial linked data for digital libraries 2013-10-22TPDL2013 tutorial linked data for digital libraries 2013-10-22
TPDL2013 tutorial linked data for digital libraries 2013-10-22
jodischneider
 
Publishing and Using Linked Open Data - Day 1
Publishing and Using Linked Open Data - Day 1 Publishing and Using Linked Open Data - Day 1
Publishing and Using Linked Open Data - Day 1
Richard Urban
 

What's hot (20)

Signposting Overview
Signposting OverviewSignposting Overview
Signposting Overview
 
The web is rotting and what to do about it
The web is rotting and what to do about itThe web is rotting and what to do about it
The web is rotting and what to do about it
 
Signposting Overview (Version November 2017)
Signposting Overview (Version November 2017)Signposting Overview (Version November 2017)
Signposting Overview (Version November 2017)
 
Discovering Scholarly Orphans Using ORCID
Discovering Scholarly Orphans Using ORCIDDiscovering Scholarly Orphans Using ORCID
Discovering Scholarly Orphans Using ORCID
 
OAC Presentation at CNI 09 Fall Forum
OAC Presentation at CNI 09 Fall ForumOAC Presentation at CNI 09 Fall Forum
OAC Presentation at CNI 09 Fall Forum
 
Quantifying Orphaned Annotations in Hypothes.is
Quantifying Orphaned Annotations in Hypothes.isQuantifying Orphaned Annotations in Hypothes.is
Quantifying Orphaned Annotations in Hypothes.is
 
Signposting for Repositories
Signposting for RepositoriesSignposting for Repositories
Signposting for Repositories
 
Achieving Link Integrity for Managed Collections
Achieving Link Integrity for Managed CollectionsAchieving Link Integrity for Managed Collections
Achieving Link Integrity for Managed Collections
 
How much does $1.7 billion buy?
How much does $1.7 billion buy?How much does $1.7 billion buy?
How much does $1.7 billion buy?
 
Paul Evan Peters Lecture
Paul Evan Peters LecturePaul Evan Peters Lecture
Paul Evan Peters Lecture
 
Memento 101
Memento 101Memento 101
Memento 101
 
Creating Topical Collections: Web Archives vs. Live Web
Creating Topical Collections:Web Archives vs. Live WebCreating Topical Collections:Web Archives vs. Live Web
Creating Topical Collections: Web Archives vs. Live Web
 
Linked Open Data for Libraries
Linked Open Data for LibrariesLinked Open Data for Libraries
Linked Open Data for Libraries
 
Reference Rot in Scholarly Communication: A Reliable Quantification and a P...
Reference Rot in Scholarly Communication: A Reliable Quantification and a P...Reference Rot in Scholarly Communication: A Reliable Quantification and a P...
Reference Rot in Scholarly Communication: A Reliable Quantification and a P...
 
The Web We Want
The Web We WantThe Web We Want
The Web We Want
 
Registration / Certification Interoperability Architecture (overlay peer-review)
Registration / Certification Interoperability Architecture (overlay peer-review)Registration / Certification Interoperability Architecture (overlay peer-review)
Registration / Certification Interoperability Architecture (overlay peer-review)
 
Scripts in a Frame: A Two-Tiered Approach for Archiving Deferred Representations
Scripts in a Frame: A Two-Tiered Approach for Archiving Deferred RepresentationsScripts in a Frame: A Two-Tiered Approach for Archiving Deferred Representations
Scripts in a Frame: A Two-Tiered Approach for Archiving Deferred Representations
 
Robust Linking to Web Resources
Robust Linking to Web ResourcesRobust Linking to Web Resources
Robust Linking to Web Resources
 
TPDL2013 tutorial linked data for digital libraries 2013-10-22
TPDL2013 tutorial linked data for digital libraries 2013-10-22TPDL2013 tutorial linked data for digital libraries 2013-10-22
TPDL2013 tutorial linked data for digital libraries 2013-10-22
 
Publishing and Using Linked Open Data - Day 1
Publishing and Using Linked Open Data - Day 1 Publishing and Using Linked Open Data - Day 1
Publishing and Using Linked Open Data - Day 1
 

More from Herbert Van de Sompel

Researcher Pod: Scholarly Communication Using the Decentralized Web
Researcher Pod: Scholarly Communication Using the Decentralized WebResearcher Pod: Scholarly Communication Using the Decentralized Web
Researcher Pod: Scholarly Communication Using the Decentralized Web
Herbert Van de Sompel
 
Collecting the organizational scholarly record
Collecting the organizational scholarly recordCollecting the organizational scholarly record
Collecting the organizational scholarly record
Herbert Van de Sompel
 
To the Rescue of Scholarly Orphans
To the Rescue of Scholarly OrphansTo the Rescue of Scholarly Orphans
To the Rescue of Scholarly Orphans
Herbert Van de Sompel
 
Almost two decades at LANL
Almost two decades at LANLAlmost two decades at LANL
Almost two decades at LANL
Herbert Van de Sompel
 
Perseverance on Persistence
Perseverance on PersistencePerseverance on Persistence
Perseverance on Persistence
Herbert Van de Sompel
 
ResourceSync Quick Overview
ResourceSync Quick OverviewResourceSync Quick Overview
ResourceSync Quick Overview
Herbert Van de Sompel
 
A Perspective on Archiving the Scholarly Record
A Perspective on Archiving the Scholarly RecordA Perspective on Archiving the Scholarly Record
A Perspective on Archiving the Scholarly Record
Herbert Van de Sompel
 
ResourceSync Overview
ResourceSync OverviewResourceSync Overview
ResourceSync Overview
Herbert Van de Sompel
 
Persistent Identifiers and the Web: The Need for an Unambiguous Mapping
Persistent Identifiers and the Web: The Need for an Unambiguous MappingPersistent Identifiers and the Web: The Need for an Unambiguous Mapping
Persistent Identifiers and the Web: The Need for an Unambiguous Mapping
Herbert Van de Sompel
 
ResourceSync tutorial OAI8
ResourceSync tutorial OAI8ResourceSync tutorial OAI8
ResourceSync tutorial OAI8
Herbert Van de Sompel
 
A Clean Slate?
A Clean Slate?A Clean Slate?
A Clean Slate?
Herbert Van de Sompel
 
The Web as infrastructure for scholarly research and communication
The Web as infrastructure for scholarly research and communicationThe Web as infrastructure for scholarly research and communication
The Web as infrastructure for scholarly research and communication
Herbert Van de Sompel
 
Paint-Yourself-In-The-Corner Infrastructure
Paint-Yourself-In-The-Corner InfrastructurePaint-Yourself-In-The-Corner Infrastructure
Paint-Yourself-In-The-Corner Infrastructure
Herbert Van de Sompel
 
ResourceSync: Web-Based Resource Synchronization
ResourceSync: Web-Based Resource SynchronizationResourceSync: Web-Based Resource Synchronization
ResourceSync: Web-Based Resource Synchronization
Herbert Van de Sompel
 
ResourceSync: Conceptual and Technical Problem Perspective
ResourceSync: Conceptual and Technical Problem PerspectiveResourceSync: Conceptual and Technical Problem Perspective
ResourceSync: Conceptual and Technical Problem Perspective
Herbert Van de Sompel
 
Towards a Machine-Actionable Scholarly Communication System
Towards a Machine-Actionable Scholarly Communication SystemTowards a Machine-Actionable Scholarly Communication System
Towards a Machine-Actionable Scholarly Communication System
Herbert Van de Sompel
 

More from Herbert Van de Sompel (16)

Researcher Pod: Scholarly Communication Using the Decentralized Web
Researcher Pod: Scholarly Communication Using the Decentralized WebResearcher Pod: Scholarly Communication Using the Decentralized Web
Researcher Pod: Scholarly Communication Using the Decentralized Web
 
Collecting the organizational scholarly record
Collecting the organizational scholarly recordCollecting the organizational scholarly record
Collecting the organizational scholarly record
 
To the Rescue of Scholarly Orphans
To the Rescue of Scholarly OrphansTo the Rescue of Scholarly Orphans
To the Rescue of Scholarly Orphans
 
Almost two decades at LANL
Almost two decades at LANLAlmost two decades at LANL
Almost two decades at LANL
 
Perseverance on Persistence
Perseverance on PersistencePerseverance on Persistence
Perseverance on Persistence
 
ResourceSync Quick Overview
ResourceSync Quick OverviewResourceSync Quick Overview
ResourceSync Quick Overview
 
A Perspective on Archiving the Scholarly Record
A Perspective on Archiving the Scholarly RecordA Perspective on Archiving the Scholarly Record
A Perspective on Archiving the Scholarly Record
 
ResourceSync Overview
ResourceSync OverviewResourceSync Overview
ResourceSync Overview
 
Persistent Identifiers and the Web: The Need for an Unambiguous Mapping
Persistent Identifiers and the Web: The Need for an Unambiguous MappingPersistent Identifiers and the Web: The Need for an Unambiguous Mapping
Persistent Identifiers and the Web: The Need for an Unambiguous Mapping
 
ResourceSync tutorial OAI8
ResourceSync tutorial OAI8ResourceSync tutorial OAI8
ResourceSync tutorial OAI8
 
A Clean Slate?
A Clean Slate?A Clean Slate?
A Clean Slate?
 
The Web as infrastructure for scholarly research and communication
The Web as infrastructure for scholarly research and communicationThe Web as infrastructure for scholarly research and communication
The Web as infrastructure for scholarly research and communication
 
Paint-Yourself-In-The-Corner Infrastructure
Paint-Yourself-In-The-Corner InfrastructurePaint-Yourself-In-The-Corner Infrastructure
Paint-Yourself-In-The-Corner Infrastructure
 
ResourceSync: Web-Based Resource Synchronization
ResourceSync: Web-Based Resource SynchronizationResourceSync: Web-Based Resource Synchronization
ResourceSync: Web-Based Resource Synchronization
 
ResourceSync: Conceptual and Technical Problem Perspective
ResourceSync: Conceptual and Technical Problem PerspectiveResourceSync: Conceptual and Technical Problem Perspective
ResourceSync: Conceptual and Technical Problem Perspective
 
Towards a Machine-Actionable Scholarly Communication System
Towards a Machine-Actionable Scholarly Communication SystemTowards a Machine-Actionable Scholarly Communication System
Towards a Machine-Actionable Scholarly Communication System
 

Recently uploaded

Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*
Frank van Harmelen
 
UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4
DianaGray10
 
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
Product School
 
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Ramesh Iyer
 
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
BookNet Canada
 
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
Product School
 
Accelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish CachingAccelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish Caching
Thijs Feryn
 
Essentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with ParametersEssentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with Parameters
Safe Software
 
Key Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdfKey Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdf
Cheryl Hung
 
DevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA ConnectDevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA Connect
Kari Kakkonen
 
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance
 
IOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptx
IOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptxIOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptx
IOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptx
Abida Shariff
 
Search and Society: Reimagining Information Access for Radical Futures
Search and Society: Reimagining Information Access for Radical FuturesSearch and Society: Reimagining Information Access for Radical Futures
Search and Society: Reimagining Information Access for Radical Futures
Bhaskar Mitra
 
The Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and SalesThe Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and Sales
Laura Byrne
 
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Product School
 
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Tobias Schneck
 
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
Jeffrey Haguewood
 
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
Product School
 
ODC, Data Fabric and Architecture User Group
ODC, Data Fabric and Architecture User GroupODC, Data Fabric and Architecture User Group
ODC, Data Fabric and Architecture User Group
CatarinaPereira64715
 
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdfFIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance
 

Recently uploaded (20)

Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*
 
UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4
 
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
 
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
 
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
 
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
 
Accelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish CachingAccelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish Caching
 
Essentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with ParametersEssentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with Parameters
 
Key Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdfKey Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdf
 
DevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA ConnectDevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA Connect
 
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
 
IOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptx
IOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptxIOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptx
IOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptx
 
Search and Society: Reimagining Information Access for Radical Futures
Search and Society: Reimagining Information Access for Radical FuturesSearch and Society: Reimagining Information Access for Radical Futures
Search and Society: Reimagining Information Access for Radical Futures
 
The Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and SalesThe Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and Sales
 
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...
 
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
 
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
 
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
 
ODC, Data Fabric and Architecture User Group
ODC, Data Fabric and Architecture User GroupODC, Data Fabric and Architecture User Group
ODC, Data Fabric and Architecture User Group
 
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdfFIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
 

Creating Pockets of Persistence

  • 1. Creating Pockets of Persistence Herbert Van de Sompel @hvdsomp http://public.lanl.gov/herbertv/ Los Alamos National Laboratory Acknowledgements: Michael L. Nelson @phonedude_mln Old Dominion University Herbert Van de Sompel 404/File Not Found, Washington, DC, October 24 2014
  • 2. Addressing the Link/Reference Rot Challenge • Pockets of Persistence • Capture – Archive Pro-Actively, Selectively • Reference – Annotate Links • Access – Travel in Time Herbert Van de Sompel 404/File Not Found, Washington, DC, October 24 2014
  • 3. Pockets of Persistence Herbert Van de Sompel How to achieve the ability to: 404/File Not Found, Washington, DC, October 24 2014 • Persistently • Precisely • Seamlessly revisit the Web of the Past and the Web of the Now at some point in the Future
  • 4. Pockets of Persistence Herbert Van de Sompel How to achieve the ability to: 404/File Not Found, Washington, DC, October 24 2014 • Persistently • Precisely • Seamlessly revisit the Web of the Past and the Web of the Now at some point in the Future Two components to the link/reference rot challenge: • Link rot: Links stop working aka 404 Not Found • Content drift: Referenced content changes over time
  • 5. Illustration Herbert Van de Sompel Current version of http://en.wikipedia.org/wiki/Coil_(band) on October 22 2014 404/File Not Found, Washington, DC, October 24 2014
  • 6. Illustration – Link Rot Herbert Van de Sompel Current version of http://en.wikipedia.org/wiki/Coil_(band) on October 22 2014 404/File Not Found, Washington, DC, October 24 2014
  • 7. Illustration – Link Rot Herbert Van de Sompel Current version of http://liarsociety.tripod.com/blog/index.blog?from=20041130 on October 22 2014 404/File Not Found, Washington, DC, October 24 2014
  • 8. Illustration – Content Drift Version of http://en.wikipedia.Herbert org/Van wiki/de Coil_(Sompel band) dated October 2 2014 http://en.wikipedia.org/w/index.php?title=Coil_(band)&oldid=388321480 404/File Not Found, Washington, DC, October 24 2014
  • 9. Illustration – Content Drift Herbert Van de Sompel Current version of http://en.wikipedia.org/wiki/Peter_Christopherson on October 22 2014 404/File Not Found, Washington, DC, October 24 2014
  • 10. Illustration – Content Drift Version of http://en.wikipedia.org/wiki/Peter_Christopherson that was current on October 2 2010 Herbert Van de Sompel http://en.wikipedia.org/w/index.php?title=Peter_Christopherson&oldid=387987414 404/File Not Found, Washington, DC, October 24 2014
  • 11. Pockets of Persistence Herbert Van de Sompel How to achieve the ability to: 404/File Not Found, Washington, DC, October 24 2014 • Persistently • Precisely • Seamlessly revisit the Web of the Past and the Web of the Now at some point in the Future This challenge exists for the entire web, but some communities actually care about addressing it: • scholarly communication, • legal publications, • journalism, • Wikipedia, • … Mobilize the communities that care about this problem to work towards joint, interoperable solutions, approaches
  • 12. Addressing the Link/Reference Rot Challenge • Pockets of Persistence • Capture – Archive Pro-Actively, Selectively • Reference – Annotate Links • Access – Travel in Time Herbert Van de Sompel 404/File Not Found, Washington, DC, October 24 2014
  • 13. Pro-Active Capture for a Seed Collection • Seed Collection - Starting point for capture is a seed collection of interest to communities that care, e.g. o On-Line journalism • Lifecycle Events – Intervene at critical moments in the lifecycle of items in these collections to pro-actively capture o Collection items – some solutions in place o Web resources referenced in collection items Herbert Van de Sompel o Scholarly literature o Legal documents o Wikipedia articles 404/File Not Found, Washington, DC, October 24 2014
  • 14. Pro-Active Capture for Seed Collection • What those crucial lifecycle events are may depend on the • Creation of new article • Creation of new version of article • Creation of substantially new version of article • Addition of external reference to article • References to article exceed a certain threshold Scholarly Literature Herbert Van de Sompel collection type Wikipedia 404/File Not Found, Washington, DC, October 24 2014
  • 15. Authoring Legal Documents – perma.cc Herbert Van de Sompel http://perma.cc 404/File Not Found, Washington, DC, October 24 2014
  • 16. Authoring Scholarly Literature: Experimental Zotero Extension Richard Wincewicz (2014) Prototype Hiberlink plugin for Zotero for pro-active archiving and temporal references Herbert Van de Sompel https://www.youtube.com/v/ZYmi_Ydr65M%26vq 404/File Not Found, Washington, DC, October 24 2014
  • 17. Submitting Scholarly Literature: Experimental HiberActive Service Martin Klein et al. (2014) HiberActive: Pro-Active Archiving of web references from scholarly articles Herbert Van de Sompel Open Repositories 2014 http://www.slideshare.net/martinklein0815/hiberactive 404/File Not Found, Washington, DC, October 24 2014
  • 18. Pro-Active Capture for Seed Collection • Interoperability for on-demand capture: o Need basic interoperability for machine-driven on-demand capture: - Discovery of capture interface - Interface IN - [ Original URI ] - Interface OUT - [ URI of Capture ; Capture Datetime ] Herbert Van de Sompel 404/File Not Found, Washington, DC, October 24 2014
  • 19. Addressing the Link/Reference Rot Challenge • Pockets of Persistence • Capture – Archive Pro-Actively, Selectively • Reference – Annotate Links • Access – Travel in Time Herbert Van de Sompel 404/File Not Found, Washington, DC, October 24 2014
  • 20. Reference Captures and Annotate Links • Existing practice for linking to captures: o Link to URI of Capture o Lose Capture Datetime • Problems with existing practice: o Impossible to visit the original URI, if desired o Requires the permanent existence/uptime of the archive that holds the capture - One link rot problem replaced by another Van de Sompel, H. et al. (2013) Thoughts on referencing, linking, reference rot Herbert Van de Sompel o Lose Original URI http://mementoweb.org/missing-link/ 404/File Not Found, Washington, DC, October 24 2014
  • 21. Permanent Existence/Uptime of Archives? Capture of http://webcitation.org dated July 17 2013 Herbert Van de Sompel https://archive.today/eAETp 404/File Not Found, Washington, DC, October 24 2014
  • 22. Permanent Existence/Uptime of Archives? Herbert Van de Sompel http://webcitation.org/ on August 6 2014 404/File Not Found, Washington, DC, October 24 2014
  • 23. Permanent Existence/Uptime of Archives? Remnant of discontinued web archive http://mummify.it captured on February 14 2014 Herbert Van de Sompel https://web.archive.org/web/20140214233752/https://www.mummify.it/ 404/File Not Found, Washington, DC, October 24 2014
  • 24. Permanent Existence/Uptime of Archives? http://www.themoscowtimes.com/news/article/russia-bans-wayback-machine-internet-archive-over-islamic-state-video/ Herbert Van de Sompel 510074.html 404/File Not Found, Washington, DC, October 24 2014
  • 25. Hacking Original URI, Capture Datetime from Capture URI? URI of Capture Original URI Datetime T https://web.archive.org/web/20140214233752/https:// www.mummify.it https://archive.today/eAETp no no http://perma.cc/4RH7-999Q?type=source no no http://en.wikipedia.org/w/index.php?title=Coil_(band) &oldid=388321480 Herbert Van de Sompel yes yes no no 404/File Not Found, Washington, DC, October 24 2014
  • 26. Using Capture URI to find Captures in Other Web Archives? Herbert Van de Sompel 404/File Not Found, Washington, DC, October 24 2014
  • 27. Using Capture URI to find Captures in Other Web Archives? Herbert Van de Sompel 404/File Not Found, Washington, DC, October 24 2014
  • 28. Reference Captures and Annotate Links • Desired practice for linking to captures is to annotate the link so it Herbert Van de Sompel 404/File Not Found, Washington, DC, October 24 2014 conveys: - URI of Capture - Original URI - Capture Datetime • Link annotation supports fallback to other archives: o Original URI allows finding captures in all web archives o Capture Datetime allows finding an appropriate capture in all web archives o Original URI and Capture Datetime allows automatic access to an appropriate capture in all web archives (see Access) Van de Sompel, H. et al. (2013) Thoughts on referencing, linking, reference rot http://mementoweb.org/missing-link/
  • 29. Reference Captures and Annotate Links • Desired practice for linking to captures is to annotate the link so it Herbert Van de Sompel 404/File Not Found, Washington, DC, October 24 2014 conveys: URI of Capture Original URI Capture Datetime
  • 30. Reference Captures and Annotate Links • Interoperability for link annotation: o Need an approach to convey, in a uniform, machine-actionable - URI of Capture - Original URI - Capture Datetime o Missing Link Proposal - http://mementoweb.org/missing-link/ o W3C Robustness and Archiving Community Group - http://www.w3.org/community/irobar/ Herbert Van de Sompel way: • Ongoing efforts: 404/File Not Found, Washington, DC, October 24 2014
  • 31. Missing Link Proposal <a href=“http://liarsociety.tripod.com/blog/index.blog?from=20041130” data-versionurl=“https://archive.today/ElCHn” data-versiondate=“2008-02-06T00:00:00Z”> Herbert Van de Sompel URI of Capture Capture Datetime 404/File Not Found, Washington, DC, October 24 2014 Original URI Van de Sompel, H. et al. (2013) Thoughts on referencing, linking, reference rot http://mementoweb.org/missing-link/
  • 32. Addressing the Link/Reference Rot Challenge • Pockets of Persistence • Capture – Archive Pro-Actively, Selectively • Reference – Annotate Links • Access – Travel in Time Herbert Van de Sompel 404/File Not Found, Washington, DC, October 24 2014
  • 33. Memento Web Time Travel Use the Original URI Herbert Van de Sompel Current version of http://law.georgetown.edu/library/404/ on October 22 2014 404/File Not Found, Washington, DC, October 24 2014
  • 34. Memento Web Time Travel And a Datetime Herbert Van de Sompel 404/File Not Found, Washington, DC, October 24 2014
  • 35. Memento Web Time Travel To automatically retrieve the temporally nearest available capture Capture of http://law.georgetown.edu/library/404/ dated May 3 2014 Herbert Van de Sompel http://wayback.archive-it.org/all/20140503094327/http://www.law.georgetown.edu/library/404/ 404/File Not Found, Washington, DC, October 24 2014
  • 36. Memento Web Time Travel http://bit.ly/memento-for-chrome Herbert Van de Sompel http://mementoweb.org 404/File Not Found, Washington, DC, October 24 2014
  • 37. Travel in Time - Persistently, Precisely, Seamlessly On-Demand Capture URI of Capture Original URI Datetime T Herbert Van de Sompel 404/File Not Found, Washington, DC, October 24 2014 Available Accessible + - - • Time Travel is: • Persistent – See next slide • Precise – Following link to URI of Capture retrieves exact capture • Seamless – Requires clicking a link as usual
  • 38. Travel in Time - Persistently, Precisely, Seamlessly On-Demand Capture URI of Capture Original URI Datetime T Herbert Van de Sompel 404/File Not Found, Washington, DC, October 24 2014 Available Not Accessible + - - • Time Travel is: • Persistent – Following link to URI of Capture leads nowhere • Precise – Following link to URI of Capture leads nowhere • Seamless – Following link to URI of Capture leads nowhere
  • 39. Travel in Time - Persistently, Precisely, Seamlessly On-Demand Capture URI of Capture Original URI Datetime T Herbert Van de Sompel 404/File Not Found, Washington, DC, October 24 2014 Available Not Accessible + + + • Time Travel is: • Persistent – Using Memento with [ Original URI ; Datetime ] works across web archives, versioning systems • Precise – Using Memento with [ Original URI ; Datetime ] retrieves nearest capture from other archive • Seamless – Requires browser plugin
  • 40. Travel in Time - Persistently, Precisely, Seamlessly On-Demand Capture URI of Capture Original URI Datetime T Herbert Van de Sompel 404/File Not Found, Washington, DC, October 24 2014 Available Accessible - + + • Time Travel is: • Persistent – Using Memento with [ Original URI ; Datetime ] works across web archives, versioning systems • Precise – Using Memento with [ Original URI ; Datetime ] retrieves exact capture from other archive • Seamless – Requires browser plugin
  • 41. Travel in Time - Persistently, Precisely, Seamlessly On-Demand Capture URI of Capture Original URI Datetime T Not Available - + + • Persistent – Using Memento with [ Original URI ; Datetime ] works across web archives, versioning systems • Precise – Using Memento with [ Original URI ; Datetime ] retrieves nearest capture from other archive • Seamless – Requires browser plugin Herbert Van de Sompel • Time Travel is: 404/File Not Found, Washington, DC, October 24 2014
  • 42. Reference Captures and Annotate Links • Interoperability for time travel: o Memento protocol specifies interoperability across web archives, version management systems o Memento protocol is supported by major web archives o Need to work towards Memento support by version management systems o Need to work towards making Memento experience seamless through native browser support o Need to work towards robustness and sustainability of Memento infrastructure Herbert Van de Sompel 404/File Not Found, Washington, DC, October 24 2014
  • 43. Conclusion • Significant technical solutions, infrastructure, ideas exist to address the link rot/reference rot challenge • Mobilize the communities that care about this challenge to work towards joint, interoperable approaches Herbert Van de Sompel 404/File Not Found, Washington, DC, October 24 2014
  • 44. Creating Pockets of Persistence http://mementoweb.org http://hiberlink.org Herbert Van de Sompel 404/File Not Found, Washington, DC, October 24 2014