Memento: Time Travel for the Web


                                           The Memento Team

                          ...
Memento wants to make navigating the Web’s Past Easy




               http://www.mementoweb.org
     http://groups.googl...
Recap of the Basics …




         Memento: Time Travel for the Web
Microsoft Research Faculty Summit, July 12-13, 2010   3
W3C Web Architecture: Resource – URI - Representation


                               dereference

           URI

      ...
W3C Web Architecture: Resource – URI - Representation


                 dereference          content negotiation

       ...
Problem Statement …




         Memento: Time Travel for the Web
Microsoft Research Faculty Summit, July 12-13, 2010   6
Resources




         Memento: Time Travel for the Web
Microsoft Research Faculty Summit, July 12-13, 2010   7
Resources have Representations




           Memento: Time Travel for the Web
  Microsoft Research Faculty Summit, July 1...
Resources have Representations that Change over Time




                      Memento: Time Travel for the Web
          ...
Only the Current Representation is Available from a Resource




                         Memento: Time Travel for the Web...
Old Representations are Lost Forever




             Memento: Time Travel for the Web
    Microsoft Research Faculty Summ...
Archived Resources Exist




         Memento: Time Travel for the Web
Microsoft Research Faculty Summit, July 12-13, 2010...
Sep 11 2001, 20:36:10 UTC                                                         Dec 20 2001, 4:51:00 UTC

              ...
Finding Archived Resources




Go to http://www.archive.org/ and search              On http://web.archive.org/web/*/http:...
Finding Archived Resources




                        Go to
http://en.wikipedia.org/wiki/September_11_attacks            ...
Dec 20 2001, 4:51:00 UTC                                                                        current

                 ...
Sep 11 2001, 20:36:10 UTC                                                            Sep 11 2001, 21:38:55 UTC

          ...
Current and Past Web are Not Integrated

                                                     • Current and Past Web
     ...
The Memento Approach …




         Memento: Time Travel for the Web
Microsoft Research Faculty Summit, July 12-13, 2010  ...
Navigate the Web of the Past

http://en.wikipedia.org/wiki/
Web_Archiving




                                         Mem...
Navigate the Web of the Past

http://en.wikipedia.org/wiki/
Web_Archiving
Oct 11 2009, 05:30:33 UTC




 Set browser time ...
Navigate the Web of the Past

http://en.wikipedia.org/wiki/                                                          From ...
Navigate the Web of the Past

http://en.wikipedia.org/wiki/                                                          From ...
Navigate the Web of the Past

http://en.wikipedia.org/wiki/
Robots_exclusion_protocol
Oct 11 2009, 05:30:33 UTC




 Brows...
Navigate the Web of the Past

http://en.wikipedia.org/wiki/                                                          From ...
Navigate the Web of the Past

http://en.wikipedia.org/wiki/                                                          From ...
Navigate the Web of the Past

http://www.robotstxt.org/


Oct 11 2009, 05:30:33 UTC




Browser time dial still at …




 ...
Navigate the Web of the Past

http://www.robotstxt.org/                                                             From I...
How does Memento achieve this?


There are two components to the Memento Solution:

• Component 1: Navigation towards an a...
How does Memento achieve this?




• Component 1: Navigation towards an archived
  resource via its original resource, by ...
The Web without a Time Dimension




            Memento: Time Travel for the Web
   Microsoft Research Faculty Summit, Ju...
The Web without a Time Dimension




            Memento: Time Travel for the Web
   Microsoft Research Faculty Summit, Ju...
The Web without a Time Dimension




Need to use a different URI to access archived versions of a resource and its current...
The Web with Time Dimension added by Memento




In Memento: use URI of the current version to access archived versions, b...
The Web with Time Dimension added by Memento




        … and magically arrive at an archived version

                  ...
How does Memento achieve this?


In order to fully understand how Memento introduces
   a time dimension to the Web, we pr...
HTTP GET on URI A




         Memento: Time Travel for the Web
Microsoft Research Faculty Summit, July 12-13, 2010   37
GET with conneg on URI T – Server Choice – 302 Found – Step 1



                                                         ...
GET with conneg on URI T – Server Choice – 302 Found – Step 2




                          Memento: Time Travel for the W...
GET with conneg on URI T – Server List – 406 Not Acceptable




                         Memento: Time Travel for the Web
...
How does Memento do This?




• Component 1: Navigation towards an archived
  resource via its original resource, by lever...
Terminology Intermission

We introduce the term Memento to refer to an
 archived version of a resource.



A Memento for a...
DT-conneg: Content Negotiation in the datetime dimension

 •   RFC 2295 introduces conneg in the following dimensions: med...
DT-conneg: Content Negotiation in the datetime dimension

 •   This means that somewhere, we will need transparently negot...
Servers Without Internal Archival Capabilities

•   This type includes:
     o Servers that are crawled by a web archive, ...
Oct 04 2009, 12:00:01 UTC




          current

                                                                         ...
original
  resource                                                                   Mementos




original server        ...
DT-conneg with URI-G to get URI-M


   original
  resource                                                                ...
HTTP
                                              DT-conneg with URI-G to get URI-M
                   Link

   original
...
Terminology Intermission

We introduce the term TimeGate to refer to a
 transparently negotiable resource that supports th...
HTTP
                   Link                       DT-conneg with URI-G to get URI-M


   original
  resource             ...
Servers With Internal Archival Capabilities

•   This type includes:
     o Content Management Systems

     o Version Con...
Dec 20 2001, 4:51:00 UTC


                                                                                       Dec 31 2...
original
                                                                       Mementos
resource




                    ...
HTTP
             Link                       DT-conneg with URI-G to get URI-M


 original             TimeGate           ...
A Memento HTTP Navigation involving an Aggregator




                     Memento: Time Travel for the Web
            Mi...
A Memento HTTP Navigation involving an Aggregator




                                Scenario

• www.digitalpreservation....
Memento HTTP Flow

     HEAD R, Accept-Datetime


            LinkG


     GET G, Accept-Datetime


 302M, Vary, TCN, Li...
Memento HTTP Flow: URI-R

                         HEAD R, Accept-Datetime


HEAD / HTTP/1.1
Host: www.digitalpreservation...
Memento HTTP Flow

     HEAD R, Accept-Datetime


            LinkG


     GET G, Accept-Datetime


 302M, Vary, TCN, Li...
Memento HTTP Flow: Success – URI-R

                                          LinkG


HTTP/1.1 200 OK
Date: Thu, 21 Jan 2...
Memento HTTP Flow

     HEAD R, Accept-Datetime


            LinkG


     GET G, Accept-Datetime


 302M, Vary, TCN, Li...
Memento HTTP Flow: URI-G

                          GET G, Accept-Datetime


GET /aggr/timegate/http://www.digitalpreserva...
Memento HTTP Flow

     HEAD R, Accept-Datetime


            LinkG


     GET G, Accept-Datetime


 302M, Vary, TCN, Li...
Memento HTTP Flow: Success – URI-G

                        302M, Vary, LinkR,B,M

HTTP/1.1 302 Found
Date: Thu, 21 Jan ...
Memento HTTP Flow

     HEAD R, Accept-Datetime


            LinkG


     GET G, Accept-Datetime


 302M, Vary, TCN, Li...
Memento HTTP Flow: URI-M

                          GET M, Accept-Datetime

GET /1610/20090928171405/http://www.digitalpre...
Memento HTTP Flow

     HEAD R, Accept-Datetime


            LinkG


     GET G, Accept-Datetime


 302M, Vary, TCN, Li...
Memento HTTP Flow: Success – URI-M

                   200, Content-Datetime, LinkR,B,M

HTTP/1.1 200 OK
Server: Apache-C...
The Web with Time Dimension added by Memento




                   Memento: Time Travel for the Web
          Microsoft R...
Why Care About The Past?



From an anonymous reviewer (emphasis mine):

"Is there any statistics to show that many or a g...
Replaying the Experience…

…can be more compelling than a summary




                         Memento: Time Travel for th...
Memento: Time Travel for the Web
Microsoft Research Faculty Summit, July 12-13, 2010
vs.




(thanks to Michele Weigle for the following Memento selection)




            Memento: Time Travel for the Web
  ...
Memento: Time Travel for the Web
Microsoft Research Faculty Summit, July 12-13, 2010
Memento: Time Travel for the Web
Microsoft Research Faculty Summit, July 12-13, 2010
Memento: Time Travel for the Web
Microsoft Research Faculty Summit, July 12-13, 2010
Memento: Time Travel for the Web
Microsoft Research Faculty Summit, July 12-13, 2010
Memento: Time Travel for the Web
Microsoft Research Faculty Summit, July 12-13, 2010
Memento: Time Travel for the Web
Microsoft Research Faculty Summit, July 12-13, 2010
Memento: Time Travel for the Web
Microsoft Research Faculty Summit, July 12-13, 2010
Memento: Time Travel for the Web
Microsoft Research Faculty Summit, July 12-13, 2010
Memento: Time Travel for the Web
Microsoft Research Faculty Summit, July 12-13, 2010
Memento: Time Travel for the Web
Microsoft Research Faculty Summit, July 12-13, 2010
Memento: Time Travel for the Web
Microsoft Research Faculty Summit, July 12-13, 2010
Memento: Time Travel for the Web
Microsoft Research Faculty Summit, July 12-13, 2010
Memento wants to make navigating the Web’s Past Easy




               http://www.mementoweb.org
     http://groups.googl...
Upcoming SlideShare
Loading in …5
×

Memento: Time Travel for the Web

1,144 views

Published on

Presented at the Microsoft Research Faculty Summit 2010

http://research.microsoft.com/en-us/events/fs2010/

Published in: Technology
0 Comments
1 Like
Statistics
Notes
  • Be the first to comment

No Downloads
Views
Total views
1,144
On SlideShare
0
From Embeds
0
Number of Embeds
55
Actions
Shares
0
Downloads
4
Comments
0
Likes
1
Embeds 0
No embeds

No notes for slide

Memento: Time Travel for the Web

  1. 1. Memento: Time Travel for the Web The Memento Team Herbert Van de Sompel Michael L. Nelson Robert Sanderson Lyudmila Balakireva Scott Ainsworth Harihar Shankar Memento is partially funded by the Library of Congress Memento: Time Travel for the Web Microsoft Research Faculty Summit, July 12-13, 2010
  2. 2. Memento wants to make navigating the Web’s Past Easy http://www.mementoweb.org http://groups.google.com/group/memento-dev Memento: Time Travel for the Web Microsoft Research Faculty Summit, July 12-13, 2010 2
  3. 3. Recap of the Basics … Memento: Time Travel for the Web Microsoft Research Faculty Summit, July 12-13, 2010 3
  4. 4. W3C Web Architecture: Resource – URI - Representation dereference URI Identifies Resource Represents Representation Memento: Time Travel for the Web Microsoft Research Faculty Summit, July 12-13, 2010 4
  5. 5. W3C Web Architecture: Resource – URI - Representation dereference content negotiation URI Identifies Resource Represents Representation 1 Represents Representation 2 Memento: Time Travel for the Web Microsoft Research Faculty Summit, July 12-13, 2010 5
  6. 6. Problem Statement … Memento: Time Travel for the Web Microsoft Research Faculty Summit, July 12-13, 2010 6
  7. 7. Resources Memento: Time Travel for the Web Microsoft Research Faculty Summit, July 12-13, 2010 7
  8. 8. Resources have Representations Memento: Time Travel for the Web Microsoft Research Faculty Summit, July 12-13, 2010 8
  9. 9. Resources have Representations that Change over Time Memento: Time Travel for the Web Microsoft Research Faculty Summit, July 12-13, 2010 9
  10. 10. Only the Current Representation is Available from a Resource Memento: Time Travel for the Web Microsoft Research Faculty Summit, July 12-13, 2010 10
  11. 11. Old Representations are Lost Forever Memento: Time Travel for the Web Microsoft Research Faculty Summit, July 12-13, 2010 11
  12. 12. Archived Resources Exist Memento: Time Travel for the Web Microsoft Research Faculty Summit, July 12-13, 2010 12
  13. 13. Sep 11 2001, 20:36:10 UTC Dec 20 2001, 4:51:00 UTC Archived Resources http://en.wikipedia.org/w/index.php?title=September_1 http://web.archive.org/web/20010911203610/http://ww 1_attacks&oldid=282333 archived resource for w.cnn.com/ archived resource for http://cnn.com http://en.wikipedia.org/wiki/September_11_attacks Memento: Time Travel for the Web Microsoft Research Faculty Summit, July 12-13, 2010 13
  14. 14. Finding Archived Resources Go to http://www.archive.org/ and search On http://web.archive.org/web/*/http://cnn.com, select http://cnn.com desired datetime Memento: Time Travel for the Web Microsoft Research Faculty Summit, July 12-13, 2010 14
  15. 15. Finding Archived Resources Go to http://en.wikipedia.org/wiki/September_11_attacks Browse History and click History Memento: Time Travel for the Web Microsoft Research Faculty Summit, July 12-13, 2010 15
  16. 16. Dec 20 2001, 4:51:00 UTC current Navigating Archived Resources Pentagon http://en.wikipedia.org/w/index.php?title=September_1 1_attacks&oldid=282333 archived resource for http://en.wikipedia.org/wiki/The_Pentagon http://en.wikipedia.org/wiki/September_11_attacks3 Memento: Time Travel for the Web Microsoft Research Faculty Summit, July 12-13, 2010 16
  17. 17. Sep 11 2001, 20:36:10 UTC Sep 11 2001, 21:38:55 UTC Navigating Archived Resources SPACE http://web.archive.org/web/20010911203610/http://ww http://web.archive.org/web/20010911213855/www.cnn w.cnn.com/ archived resource for http://cnn.com .com/TECH/space/ Memento: Time Travel for the Web Microsoft Research Faculty Summit, July 12-13, 2010 17
  18. 18. Current and Past Web are Not Integrated • Current and Past Web based on same technology. • But, going from Current to Past Web is a matter of (manual) discovery. • Memento wants to make going from Current to Past Web a (HTTP) protocol matter. • Memento wants to integrate Current And Past Web. Memento: Time Travel for the Web Microsoft Research Faculty Summit, July 12-13, 2010 18
  19. 19. The Memento Approach … Memento: Time Travel for the Web Microsoft Research Faculty Summit, July 12-13, 2010 19
  20. 20. Navigate the Web of the Past http://en.wikipedia.org/wiki/ Web_Archiving Memento: Time Travel for the Web Microsoft Research Faculty Summit, July 12-13, 2010 20
  21. 21. Navigate the Web of the Past http://en.wikipedia.org/wiki/ Web_Archiving Oct 11 2009, 05:30:33 UTC Set browser time dial to … Memento: Time Travel for the Web Microsoft Research Faculty Summit, July 12-13, 2010 21
  22. 22. Navigate the Web of the Past http://en.wikipedia.org/wiki/ From Wikipedia History Web_Archiving Oct 11 2009, 05:30:33 UTC Oct 01 2009, 16:30:00 UTC Set browser time dial to … Memento: Time Travel for the Web Microsoft Research Faculty Summit, July 12-13, 2010 22
  23. 23. Navigate the Web of the Past http://en.wikipedia.org/wiki/ From Wikipedia History Web_Archiving Oct 11 2009, 05:30:33 UTC Oct 01 2009, 16:30:00 UTC Set browser time dial to … Robots Exclusion Protocol Oct 11 2009, 05:30:33 UTC Memento: Time Travel for the Web Microsoft Research Faculty Summit, July 12-13, 2010 23
  24. 24. Navigate the Web of the Past http://en.wikipedia.org/wiki/ Robots_exclusion_protocol Oct 11 2009, 05:30:33 UTC Browser time dial still at … Memento: Time Travel for the Web Microsoft Research Faculty Summit, July 12-13, 2010 24
  25. 25. Navigate the Web of the Past http://en.wikipedia.org/wiki/ From Wikipedia History Robots_exclusion_protocol Oct 11 2009, 05:30:33 UTC Sep 15 2009, 20:49:00 UTC Browser time dial still at … Memento: Time Travel for the Web Microsoft Research Faculty Summit, July 12-13, 2010 25
  26. 26. Navigate the Web of the Past http://en.wikipedia.org/wiki/ From Wikipedia History Robots_exclusion_protocol Oct 11 2009, 05:30:33 UTC Sep 15 2009, 20:49:00 UTC Browser time dial still at … Robots Exclusion Oct 11 2009, 05:30:33 UTC Memento: Time Travel for the Web Microsoft Research Faculty Summit, July 12-13, 2010 26
  27. 27. Navigate the Web of the Past http://www.robotstxt.org/ Oct 11 2009, 05:30:33 UTC Browser time dial still at … Memento: Time Travel for the Web Microsoft Research Faculty Summit, July 12-13, 2010 27
  28. 28. Navigate the Web of the Past http://www.robotstxt.org/ From Internet Archive Oct 11 2009, 05:30:33 UTC Nov 09 2007, 06:21:04 UTC Browser time dial still at … Memento: Time Travel for the Web Microsoft Research Faculty Summit, July 12-13, 2010 28
  29. 29. How does Memento achieve this? There are two components to the Memento Solution: • Component 1: Navigation towards an archived resource via its original resource, by leveraging content negotiation. • Component 2: A discovery API for archives that allows requesting a list of all archived versions it holds for a resource with a given URI. Memento: Time Travel for the Web Microsoft Research Faculty Summit, July 12-13, 2010 29
  30. 30. How does Memento achieve this? • Component 1: Navigation towards an archived resource via its original resource, by leveraging content negotiation. Memento: Time Travel for the Web Microsoft Research Faculty Summit, July 12-13, 2010 30
  31. 31. The Web without a Time Dimension Memento: Time Travel for the Web Microsoft Research Faculty Summit, July 12-13, 2010 31
  32. 32. The Web without a Time Dimension Memento: Time Travel for the Web Microsoft Research Faculty Summit, July 12-13, 2010 32
  33. 33. The Web without a Time Dimension Need to use a different URI to access archived versions of a resource and its current version Memento: Time Travel for the Web Microsoft Research Faculty Summit, July 12-13, 2010 33
  34. 34. The Web with Time Dimension added by Memento In Memento: use URI of the current version to access archived versions, but qualify it with datetime Memento: Time Travel for the Web Microsoft Research Faculty Summit, July 12-13, 2010 34
  35. 35. The Web with Time Dimension added by Memento … and magically arrive at an archived version Memento: Time Travel for the Web Microsoft Research Faculty Summit, July 12-13, 2010 35
  36. 36. How does Memento achieve this? In order to fully understand how Memento introduces a time dimension to the Web, we present a brief recap of Transparent Content Negotiation (conneg) in HTTP. RFC 2295. Transparent Content Negotiation in HTTP, http://www.ietf.org/rfc/rfc2295.txt Memento: Time Travel for the Web Microsoft Research Faculty Summit, July 12-13, 2010 36
  37. 37. HTTP GET on URI A Memento: Time Travel for the Web Microsoft Research Faculty Summit, July 12-13, 2010 37
  38. 38. GET with conneg on URI T – Server Choice – 302 Found – Step 1 transparently negotiable resource Memento: Time Travel for the Web Microsoft Research Faculty Summit, July 12-13, 2010 38
  39. 39. GET with conneg on URI T – Server Choice – 302 Found – Step 2 Memento: Time Travel for the Web Microsoft Research Faculty Summit, July 12-13, 2010 39
  40. 40. GET with conneg on URI T – Server List – 406 Not Acceptable Memento: Time Travel for the Web Microsoft Research Faculty Summit, July 12-13, 2010 40
  41. 41. How does Memento do This? • Component 1: Navigation towards an archived resource via its original resource, by leveraging content negotiation. Memento: Time Travel for the Web Microsoft Research Faculty Summit, July 12-13, 2010 41
  42. 42. Terminology Intermission We introduce the term Memento to refer to an archived version of a resource. A Memento for a resource URI-R (as it existed) at time ti is a resource URI-Mi [URI-R@ti] for which the representation at any moment past its creation time tc is the same as the representation that was available from URI- R at time ti, with tc >= ti. Implicit in this definition is the notion that, once created, a Memento always keeps the same representation. Memento: Time Travel for the Web Microsoft Research Faculty Summit, July 12-13, 2010 42
  43. 43. DT-conneg: Content Negotiation in the datetime dimension • RFC 2295 introduces conneg in the following dimensions: media type, language, compression, character set, e.g.: - HTTP Request: o Accept-Language: en-US o HTTP Response: o Content-Language: en-US • Inspired by RFC 2295, Memento introduces datetime conneg: - HTTP Request: o Accept-Datetime: Mon, 12 Oct 2009 14:20:33 GMT o HTTP Response: o Content-Datetime: Sun, 11 Oct 2009 11:18:05 GMT Memento: Time Travel for the Web Microsoft Research Faculty Summit, July 12-13, 2010 43
  44. 44. DT-conneg: Content Negotiation in the datetime dimension • This means that somewhere, we will need transparently negotiable resources (cf. slides 38-40) that supports the datetime dimension to get to appropriate Mementos. • This will be discussed for 2 classes of servers: o Web servers without internal archival capabilities; o Web servers with internal archival capabilities. Memento: Time Travel for the Web Microsoft Research Faculty Summit, July 12-13, 2010 44
  45. 45. Servers Without Internal Archival Capabilities • This type includes: o Servers that are crawled by a web archive, such as the Internet Archive o Servers with an associated transactional archive • These servers are not aware of the details of Mementos of their resources held by external archives. • These servers do not have the essential information (URI-Ms, and associated datetimes) to respond to a DT-conneg request. • But they can be constructive by pointing (HTTP Link) a client to an archive that can respond to the DT-conneg request. o Unconditionally do this for resources for which Mementos are conceivably available in the archive. Memento: Time Travel for the Web Microsoft Research Faculty Summit, July 12-13, 2010 45
  46. 46. Oct 04 2009, 12:00:01 UTC current Oct 10 2009, 12:00:03 UTC http://lanlsource.lanl.gov/ hello Oct 21 2009, 12:00:01 UTC http://mementoarchive.lanl.gov/store/ta/20091021 120001/http://lanlsource.lanl.gov/hello Memento: Time Travel for the Web Microsoft Research Faculty Summit, July 12-13, 2010 46
  47. 47. original resource Mementos original server archival server Memento: Time Travel for the Web Microsoft Research Faculty Summit, July 12-13, 2010 47
  48. 48. DT-conneg with URI-G to get URI-M original resource Mementos transparently variant negotiable resources resource original server archival server Memento: Time Travel for the Web Microsoft Research Faculty Summit, July 12-13, 2010 48
  49. 49. HTTP DT-conneg with URI-G to get URI-M Link original resource Mementos transparently variant negotiable resources resource original server archival server Memento: Time Travel for the Web Microsoft Research Faculty Summit, July 12-13, 2010 49
  50. 50. Terminology Intermission We introduce the term TimeGate to refer to a transparently negotiable resource that supports the datetime dimension. A TimeGate for an original resource URI-R is a transparently negotiable resource URI- G[URI-R] for which all variant resources are Mementos URI-Mi[URI-R@ti] of the resource URI-R. Since multiple archives may host versions of URI-R, multiple TimeGates may exist for any given resource, i.e. one per archive. Memento: Time Travel for the Web Microsoft Research Faculty Summit, July 12-13, 2010 50
  51. 51. HTTP Link DT-conneg with URI-G to get URI-M original resource TimeGate Mementos transparently variant negotiable resources resource original server archival server Memento: Time Travel for the Web Microsoft Research Faculty Summit, July 12-13, 2010 51
  52. 52. Servers With Internal Archival Capabilities • This type includes: o Content Management Systems o Version Control Systems o Servers that archive resource representations in the cloud and keep track of the URIs and datetimes of remotely archived resources. • These servers have all the essential information (URI-Ms, and associated datetimes) to respond to a DT-conneg request. • The previous architectural solution is maintained to enforce strict distinction between handling requests for current and past representations. Memento: Time Travel for the Web Microsoft Research Faculty Summit, July 12-13, 2010 52
  53. 53. Dec 20 2001, 4:51:00 UTC Dec 31 2004, 20:46:00 UTC current http://en.wikipedia.org/wiki/ September_11_attacks Dec 20 2008, 22:21:00 UTC http://en.wikipedia.org/w/index.php? title=September_11_attacks&oldid=259237305 Memento: Time Travel for the Web Microsoft Research Faculty Summit, July 12-13, 2010 53
  54. 54. original Mementos resource original server Memento: Time Travel for the Web Microsoft Research Faculty Summit, July 12-13, 2010 54
  55. 55. HTTP Link DT-conneg with URI-G to get URI-M original TimeGate Mementos resource transparently negotiable resource variant resources original server Memento: Time Travel for the Web Microsoft Research Faculty Summit, July 12-13, 2010 55
  56. 56. A Memento HTTP Navigation involving an Aggregator Memento: Time Travel for the Web Microsoft Research Faculty Summit, July 12-13, 2010 56
  57. 57. A Memento HTTP Navigation involving an Aggregator Scenario • www.digitalpreservation.gov points at TimeGate provided by an Aggregator • URI-R, URI-G, URI-M on different servers Memento: Time Travel for the Web Microsoft Research Faculty Summit, July 12-13, 2010 57
  58. 58. Memento HTTP Flow HEAD R, Accept-Datetime LinkG GET G, Accept-Datetime 302M, Vary, TCN, LinkR,B,M GET M, Accept-Datetime 200, Content-Datetime, LinkR,B,M
  59. 59. Memento HTTP Flow: URI-R HEAD R, Accept-Datetime HEAD / HTTP/1.1 Host: www.digitalpreservation.gov Accept-Datetime: Sat, 10 Oct 2009 00:00:00 GMT Connection: close Memento: Time Travel for the Web Microsoft Research Faculty Summit, July 12-13, 2010 59
  60. 60. Memento HTTP Flow HEAD R, Accept-Datetime LinkG GET G, Accept-Datetime 302M, Vary, TCN, LinkR,B,M GET M, Accept-Datetime 200, Content-Datetime, LinkR,B,M
  61. 61. Memento HTTP Flow: Success – URI-R LinkG HTTP/1.1 200 OK Date: Thu, 21 Jan 2010 00:02:12 GMT Server: Apache Link: <http://mementoproxy.lanl.gov/aggr/timegate/http://www.digitalpreservation.gov/> ; rel=“timegate” Content-Length: 255 Connection: close Content-Type: text/html; charset=iso-8859-1 Memento: Time Travel for the Web Microsoft Research Faculty Summit, July 12-13, 2010 61
  62. 62. Memento HTTP Flow HEAD R, Accept-Datetime LinkG GET G, Accept-Datetime 302M, Vary, TCN, LinkR,B,M GET M, Accept-Datetime 200, Content-Datetime, LinkR,B,M
  63. 63. Memento HTTP Flow: URI-G GET G, Accept-Datetime GET /aggr/timegate/http://www.digitalpreservation.gov/ HTTP/1.1 Host: mementoproxy.lanl.gov Accept-Datetime: Sat, 10 Oct 2009 00:00:00 GMT Connection: close Memento: Time Travel for the Web Microsoft Research Faculty Summit, July 12-13, 2010 63
  64. 64. Memento HTTP Flow HEAD R, Accept-Datetime LinkG GET G, Accept-Datetime 302M, Vary, TCN, LinkR,B,M GET M, Accept-Datetime 200, Content-Datetime, LinkR,B,M
  65. 65. Memento HTTP Flow: Success – URI-G 302M, Vary, LinkR,B,M HTTP/1.1 302 Found Date: Thu, 21 Jan 2010 00:06:50 GMT Server: Apache TCN: choice Vary: negotiate, accept-datetime Location: http://wayback.archive-it.org/1610/20090928171405/http:// www.digitalpreservation.gov/ Link: <http://www.digitalpreservation.gov/>; rel="original", <http://mementoproxy.lanl.gov/aggr/timebundle/http://www.digitalpreservation.gov/>; rel="timebundle”, <http://wayback.archive -it.org/256/20051108162921/http://www.digitalpreservation.gov/>; rel=“first-memento”; datetime=“Tue, 08 Nov 2005 00:00:00 GMT”, <http://webcitation.org/query?id=1257028234035091>; rel=“next-memento”; datetime=”Sat, 31 Oct 2009 18:30:35 GMT”, <http://webcitation.org/query?id=1213058061345794>; rel=“prev-memento”; datetime="Mon, 09 Jun 2008 20:34:23 GMT”, <http://wayback.archive -it.org/256/20100120102000/http://www.digitalpreservation.gov/>; rel=“last-memento”; datetime=”Wed, 20 Jan 2010 10:20:00 GMT” Content-Length: 0 Connection: close
  66. 66. Memento HTTP Flow HEAD R, Accept-Datetime LinkG GET G, Accept-Datetime 302M, Vary, TCN, LinkR,B,M GET M, Accept-Datetime 200, Content-Datetime, LinkR,B,M
  67. 67. Memento HTTP Flow: URI-M GET M, Accept-Datetime GET /1610/20090928171405/http://www.digitalpreservation.gov/ HTTP/1.1 Host: wayback.archive-it.org Accept-Datetime: Sat, 10 Oct 2009 00:00:00 GMT Connection: close Memento: Time Travel for the Web Microsoft Research Faculty Summit, July 12-13, 2010 67
  68. 68. Memento HTTP Flow HEAD R, Accept-Datetime LinkG GET G, Accept-Datetime 302M, Vary, TCN, LinkR,B,M GET M, Accept-Datetime 200, Content-Datetime, LinkR,B,M
  69. 69. Memento HTTP Flow: Success – URI-M 200, Content-Datetime, LinkR,B,M HTTP/1.1 200 OK Server: Apache-Coyote/1.1 Link header values are local X-Archive-Orig-Accept-Ranges: bytes … to wayback.archive-it.org Content-Type: text/html;charset=utf-8 and different than those Content-Length: 23364 provided by URI-G Date: Thu, 21 Jan 2010 00:09:40 GMT Content-Datetime: Mon, 28 Sep 2009 17:14:05 GMT Link: <http://www.digitalpreservation.gov/>; rel="original", <http://wayback.archive- it.org/web/timebundle/http://www.digitalpreservation.gov/>; rel="timebundle”, <http://wayback.archive -it.org/256/20051108162921/http://www.digitalpreservation.gov/>; rel=“first-memento”; datetime=“Tue, 08 Nov 2005 00:00:00 GMT”, <http://wayback.archive -it.org/256/20100120102000/http://www.digitalpreservation.gov/>; rel=“last-memento”; datetime=”Wed, 20 Jan 2010 10:20:00 GMT” Connection: close Memento: Time Travel for the Web Microsoft Research Faculty Summit, July 12-13, 2010 69
  70. 70. The Web with Time Dimension added by Memento Memento: Time Travel for the Web Microsoft Research Faculty Summit, July 12-13, 2010 70
  71. 71. Why Care About The Past? From an anonymous reviewer (emphasis mine): "Is there any statistics to show that many or a good number of Web users would like to get obsolete data or resources? " Memento: Time Travel for the Web Microsoft Research Faculty Summit, July 12-13, 2010
  72. 72. Replaying the Experience… …can be more compelling than a summary Memento: Time Travel for the Web Microsoft Research Faculty Summit, July 12-13, 2010
  73. 73. Memento: Time Travel for the Web Microsoft Research Faculty Summit, July 12-13, 2010
  74. 74. vs. (thanks to Michele Weigle for the following Memento selection) Memento: Time Travel for the Web Microsoft Research Faculty Summit, July 12-13, 2010
  75. 75. Memento: Time Travel for the Web Microsoft Research Faculty Summit, July 12-13, 2010
  76. 76. Memento: Time Travel for the Web Microsoft Research Faculty Summit, July 12-13, 2010
  77. 77. Memento: Time Travel for the Web Microsoft Research Faculty Summit, July 12-13, 2010
  78. 78. Memento: Time Travel for the Web Microsoft Research Faculty Summit, July 12-13, 2010
  79. 79. Memento: Time Travel for the Web Microsoft Research Faculty Summit, July 12-13, 2010
  80. 80. Memento: Time Travel for the Web Microsoft Research Faculty Summit, July 12-13, 2010
  81. 81. Memento: Time Travel for the Web Microsoft Research Faculty Summit, July 12-13, 2010
  82. 82. Memento: Time Travel for the Web Microsoft Research Faculty Summit, July 12-13, 2010
  83. 83. Memento: Time Travel for the Web Microsoft Research Faculty Summit, July 12-13, 2010
  84. 84. Memento: Time Travel for the Web Microsoft Research Faculty Summit, July 12-13, 2010
  85. 85. Memento: Time Travel for the Web Microsoft Research Faculty Summit, July 12-13, 2010
  86. 86. Memento: Time Travel for the Web Microsoft Research Faculty Summit, July 12-13, 2010
  87. 87. Memento wants to make navigating the Web’s Past Easy http://www.mementoweb.org http://groups.google.com/group/memento-dev Memento: Time Travel for the Web Microsoft Research Faculty Summit, July 12-13, 2010 87

×