Robert
Sanderson
                           Herbert
Van
de
Sompel





rsanderson@lanl.gov
                    



herbert...
Overview


•  
Background
and
Research
Ques@on


•  
Open
Annota@on
Collabora@on


•  
Memento


•  
Experiments


•  
Con...
10+
Years
of
Annota@on
Research





             Persistent Web Annotations                  

Slide:
3

        Rob Sand...
Our
2020
Vision





         web
         Persistent Web Annotations                  

Slide:
4

    Rob Sanderson, Herb...
Scholarship
is
Moving
to
the
Web

•  From
silos
of
highly
curated
content
to
open
world
models


•  Publishing
of
scholarl...
Web
Resources
Change
…





Google Sidewiki Annotation on http://news.bbc.co.uk/ as of 2010-06-14

                       ...
Archived
Copy,
But
No
Annota@ons
                                        





Archived page from:
  http://www.dracos.co....
Can
We
Fix
This?
Automa@cally?





The desired outcome:
  Display the correct representation of the Web Resource with the...
Previous
Annota@on
Persistence
Methods
                                            


•  Migrate
annota@ons
from
one
versi...
Persistent
and
Web‐Centric?



•  OAC:


Describe
Annota@ons
in
a
Web‐centric
Model


   
 
 
 
 
 
 
 
+

•  Memento:


M...
Open
Annota@on
Collabora@on


•  Focus:
 
   

   
Interoperability
between
systems
to
enable
sharing


•  Founda@on:

 
A...
OAC
Data
Model:
Basics
                                        

•  An
oac:Annota@on
is
an
ore:Aggrega@on
of
two
or
more
r...
OAC
Data
Model:
Basics
                     





         Persistent Web Annotations                  

Slide:
13

    Ro...
OAC
Data
Model:
Segments
                                       

•  Most
Annota@ons
are
about
part
of

   a
resource


• ...
OAC
Data
Model:
Segments
                       





          Persistent Web Annotations                  

Slide:
15

 ...
OAC
Data
Model:
Time


•  As
regular
web
resources,
Resource
Map,
Body
and
Target
have

   representa@ons
that
can
change
...
Timeless
Annota@ons
                                       

•  The
Annota@on
is
always
applicable,
regardless
of
the

   ...
Uniform
Time
Annota@ons
                                       

•  The
Annota@on
is
not
always
applicable,
but
pertains
t...
Varied
Time
Annota@ons
                                       

•  The
Annota@on
is
not
always
applicable,
but
pertains
to...
Memento

•  We
know:
the
URI
of
the
Target
and
the
@me
it
was
annotated

•  We
don't
know:

the
URI
where
it
is
archived

...
Issues
with
the
Past
Web

•  New
names
for
archived
resources

      •  What
was
cnn.com,
becomes
archive.org/web/20010120...
The
Web
without
Time
Dimension





eg: http://www.cnn.com/                     eg: http://web.archive.org/web/
          ...
The
Web
with
Time
Dimension
from
Memento





eg: http://www.cnn.com/                     eg: http://web.archive.org/web/
...
Apr 10 2001, 21:39:30 UTC




current



                                                                    Aug 15 2004, ...
Original
Resource
                              ?                                                    Mementos




        ...
Original
Resource
                              ?
                           TimeGate                                     ...
Original
Resource
              Link Headers
                                 ?
                              TimeGate
   ...
Experiments
                                        

1.  Given
an
annota@on,
find
the
appropriate
representa@ons

       •...
Experiment
1





Retrieve




                Persistent Web Annotations                  

Slide:
29

           Rob San...
Experiment
1





Reconstruct




                  Persistent Web Annotations                  

Slide:
30

             ...
Experiment
1:
Create
Annota@on





            Persistent Web Annotations                  

Slide:
31

       Rob Sander...
Experiment
1:
Test
without
Memento





              Persistent Web Annotations                  

Slide:
32

         Ro...
Experiment
1:
Test
with
Memento





             Persistent Web Annotations                  

Slide:
33

        Rob San...
Experiment
2

•  Need to find Original URI, start, end time of representation
•  Need searchable collection of annotations...
Experiment
2:
Create
Annota@ons
                              





             Persistent Web Annotations               ...
Experiment
2:
Create
Annota@ons
                              





             Persistent Web Annotations               ...
Experiment
2:
Create
Annota@ons
                              





             Persistent Web Annotations               ...
Experiment
2:
Test
without
Memento





              Persistent Web Annotations                  

Slide:
38

         Ro...
Experiment
2:
Test
with
Memento





             Persistent Web Annotations                  

Slide:
39

        Rob San...
Conclusions
                                        

•  Annota@on,
as
a
core
scholarly
prac@ce,
is
increasingly
web‐based...
Thank
You



•  Authors:

       •  azaroth42@gmail.com
/


rsanderson@lanl.gov

       •  hvdsomp@gmail.com


/


herber...
Memento
HTTP
Flow

    HEAD R, (Accept-Datetime)


                     LinkG


       GET G, Accept-Datetime


 302M, V...
Upcoming SlideShare
Loading in...5
×

Making Web Annotations Persistent over Time

2,538

Published on

JCDL 2010 presentation about using Memento to reconstruct the state for web resources involved in annotation.

Published in: Technology, Design
0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total Views
2,538
On Slideshare
0
From Embeds
0
Number of Embeds
1
Actions
Shares
0
Downloads
21
Comments
0
Likes
0
Embeds 0
No embeds

No notes for slide

Making Web Annotations Persistent over Time

  1. 1. Robert
Sanderson
 Herbert
Van
de
Sompel
 



rsanderson@lanl.gov
 



herbertv@lanl.gov
 



azaroth42@gmail.com
 



hvdsomp@gmail.com
 Digital
Library
Research

 and
Prototyping
Team
 Los
Alamos
Na@onal
Laboratory,
 
USA
 Persistent Web Annotations Rob Sanderson, Herbert Van de Sompel JCDL 2010, June 21-25, Surfers Paradise, Australia
  2. 2. Overview
 •  
Background
and
Research
Ques@on
 •  
Open
Annota@on
Collabora@on
 •  
Memento
 •  
Experiments
 •  
Conclusions
 Persistent Web Annotations 

Slide:
2
 Rob Sanderson, Herbert Van de Sompel JCDL 2010, June 21-25, Surfers Paradise, Australia
  3. 3. 10+
Years
of
Annota@on
Research
 Persistent Web Annotations 

Slide:
3
 Rob Sanderson, Herbert Van de Sompel JCDL 2010, June 21-25, Surfers Paradise, Australia
  4. 4. Our
2020
Vision
 web Persistent Web Annotations 

Slide:
4
 Rob Sanderson, Herbert Van de Sompel JCDL 2010, June 21-25, Surfers Paradise, Australia
  5. 5. Scholarship
is
Moving
to
the
Web
 •  From
silos
of
highly
curated
content
to
open
world
models
 •  Publishing
of
scholarly
communica@on
online:
 •  First
Journals
(DLib,
PLoS,
…)
 •  Then
Users
(Vivo,
ORCID,
…)
 •  Now
Data
(NVO,
OREChem,
…)
 •  Next
the
Scien@fic
Process?
(MyExperiment,
Annota@on,…)
 •  With
the
web
comes
new
challenges…
 •  Persistence
 Persistent Web Annotations 

Slide:
5
 Rob Sanderson, Herbert Van de Sompel JCDL 2010, June 21-25, Surfers Paradise, Australia
  6. 6. Web
Resources
Change
…
 Google Sidewiki Annotation on http://news.bbc.co.uk/ as of 2010-06-14 Persistent Web Annotations 

Slide:
6
 Rob Sanderson, Herbert Van de Sompel JCDL 2010, June 21-25, Surfers Paradise, Australia
  7. 7. Archived
Copy,
But
No
Annota@ons 
 Archived page from: http://www.dracos.co.uk/work/bbc-news-archive/2010/03/08/07.05.html Persistent Web Annotations 

Slide:
7
 Rob Sanderson, Herbert Van de Sompel JCDL 2010, June 21-25, Surfers Paradise, Australia
  8. 8. Can
We
Fix
This?
Automa@cally?
 The desired outcome: Display the correct representation of the Web Resource with the Annotation. Persistent Web Annotations 

Slide:
8
 Rob Sanderson, Herbert Van de Sompel JCDL 2010, June 21-25, Surfers Paradise, Australia
  9. 9. Previous
Annota@on
Persistence
Methods 
 •  Migrate
annota@ons
from
one
version
to
the
next:
 •  Seek
to
discover
new
loca@on
of
old
target
segment
 •  Otherwise
discard
the
annota@on
as
no
longer
relevant
 •  Treats
the
Annota@on
as
of
secondary
importance
 •  Focused
on
heuris@cs:
 •  Cross
format,
cross
loca@on
 •  Edited
text
in
same
document
 •  Dynamically
scaling
target
areas,
marks
of
annota@on
 •  
…
 Persistent Web Annotations 

Slide:
9
 Rob Sanderson, Herbert Van de Sompel JCDL 2010, June 21-25, Surfers Paradise, Australia
  10. 10. Persistent
and
Web‐Centric?
 •  OAC:


Describe
Annota@ons
in
a
Web‐centric
Model
 
 
 
 
 
 
 
 
+
 •  Memento:


Make
Naviga@ng
the
Past
Web
Easy
 
 
 
 
 
 
 
 
=
 •  Given
an
Annota@on,
display
appropriate
archived
Web
Resource?
 •  Given
an
archived
Web
Resource,
display
appropriate
Annota@ons?
 Persistent Web Annotations 

Slide:
10
 Rob Sanderson, Herbert Van de Sompel JCDL 2010, June 21-25, Surfers Paradise, Australia
  11. 11. Open
Annota@on
Collabora@on
 •  Focus:
 
 

 
Interoperability
between
systems
to
enable
sharing
 •  Founda@on:

 
Architecture
of
the
World
Wide
Web
 •  Framework:

 
Linked
Data
Guidelines
 •  Funding:

 
 
Mellon
Founda@on
for
18
months
 Persistent Web Annotations 

Slide:
11
 Rob Sanderson, Herbert Van de Sompel JCDL 2010, June 21-25, Surfers Paradise, Australia
  12. 12. OAC
Data
Model:
Basics 
 •  An
oac:Annota@on
is
an
ore:Aggrega@on
of
two
or
more
resources,
 such
that
one
(oac:Body)
annotates
at
least
one
other
(oac:Target)
 •  We
get
OAI‐ORE
en@@es
for
free
(ore:ResourceMap,
ore:Proxy)
 •  All
resources
are
regular
web
resources
 Persistent Web Annotations 

Slide:
12
 Rob Sanderson, Herbert Van de Sompel JCDL 2010, June 21-25, Surfers Paradise, Australia
  13. 13. OAC
Data
Model:
Basics 
 Persistent Web Annotations 

Slide:
13
 Rob Sanderson, Herbert Van de Sompel JCDL 2010, June 21-25, Surfers Paradise, Australia
  14. 14. OAC
Data
Model:
Segments 
 •  Most
Annota@ons
are
about
part
of
 a
resource
 •  Resources
are
atomic,
in
terms
of
 iden@fica@on
(by
a
URI)
 •  Segments
of
the
resource
apply
in
 the
context
of
the
Annota@on
 •  Solu@on:
aaach
a
Descrip@on
of
the
 Segment
of
interest
to
an
ORE
 Proxy
for
the
resource
 Persistent Web Annotations 

Slide:
14
 Rob Sanderson, Herbert Van de Sompel JCDL 2010, June 21-25, Surfers Paradise, Australia
  15. 15. OAC
Data
Model:
Segments 
 Persistent Web Annotations 

Slide:
15
 Rob Sanderson, Herbert Van de Sompel JCDL 2010, June 21-25, Surfers Paradise, Australia
  16. 16. OAC
Data
Model:
Time
 •  As
regular
web
resources,
Resource
Map,
Body
and
Target
have
 representa@ons
that
can
change
over
@me
 •  The
Resource
Map,
Body
and
Target
can
change
independently
of
 each
other
 •  If
an
Annota@on
involves
resources
as
they
existed
at
a
par@cular
 point
in
@me,
this
needs
to
be
recorded
 •  Three
different
Time
models
are
possible…
 Persistent Web Annotations 

Slide:
16
 Rob Sanderson, Herbert Van de Sompel JCDL 2010, June 21-25, Surfers Paradise, Australia
  17. 17. Timeless
Annota@ons 
 •  The
Annota@on
is
always
applicable,
regardless
of
the
 representa@on
served
from
the
URIs
of
the
Body
and
Targets.
 •  Example:

"This
is
the
home
page
of
CNN"
 •  Timeless
Annota@ons
do
not
need
a
special
@mestamp.
 Persistent Web Annotations 

Slide:
17
 Rob Sanderson, Herbert Van de Sompel JCDL 2010, June 21-25, Surfers Paradise, Australia
  18. 18. Uniform
Time
Annota@ons 
 •  The
Annota@on
is
not
always
applicable,
but
pertains
to
the
state
 of
the
Body
and
Target
at
a
single
moment
in
@me.
 •  Example:
Tweet
is
about
contemporary
state
of
a
web
page.
 •  Add
mem:when
property
to
Annota@on
 Persistent Web Annotations 

Slide:
18
 Rob Sanderson, Herbert Van de Sompel JCDL 2010, June 21-25, Surfers Paradise, Australia
  19. 19. Varied
Time
Annota@ons 
 •  The
Annota@on
is
not
always
applicable,
but
pertains
to
the
state
 of
the
Body
and
Target
at
different
moments
in
@me.
 •  Example:
Blog
post
is
about
previous
day's
state
of
a
web
page
 •  Add
mem:when
property
to
Proxies
for
resources
 Persistent Web Annotations 

Slide:
19
 Rob Sanderson, Herbert Van de Sompel JCDL 2010, June 21-25, Surfers Paradise, Australia
  20. 20. Memento
 •  We
know:
the
URI
of
the
Target
and
the
@me
it
was
annotated
 •  We
don't
know:

the
URI
where
it
is
archived
 •  We
need:

Memento
 •  Memento
wants
to
make
it
Easy
to
Navigate
the
Web’s
Past

 •  Basic
idea:

Use
HTTP
to
get
from
the
current
resource
to
the
 archived
copy
of
that
resource
at
a
given
@me,
rather
than
 searching
 Persistent Web Annotations 

Slide:
20
 Rob Sanderson, Herbert Van de Sompel JCDL 2010, June 21-25, Surfers Paradise, Australia
  21. 21. Issues
with
the
Past
Web
 •  New
names
for
archived
resources
 •  What
was
cnn.com,
becomes
archive.org/web/20010120…
 •  …
And
lots
of
other
names
 •  With
no
way
to
discover
them
without
searching
by
hand
 •  People
do
not
like
to
search
 •  Especially
when
a
computer
could
do
it.
 •  Naviga@on
is
inconsistent
 •  Stuck
in
web
archive
content
silo
(URIs
rewriaen)
 •  Or
end
up
back
in
present
(URIs
not
rewriaen)
 Persistent Web Annotations 

Slide:
21
 Rob Sanderson, Herbert Van de Sompel JCDL 2010, June 21-25, Surfers Paradise, Australia
  22. 22. The
Web
without
Time
Dimension
 eg: http://www.cnn.com/ eg: http://web.archive.org/web/ 20020209001709rn_1/www.cnn.com/? Persistent Web Annotations 

Slide:
22
 Rob Sanderson, Herbert Van de Sompel JCDL 2010, June 21-25, Surfers Paradise, Australia
  23. 23. The
Web
with
Time
Dimension
from
Memento
 eg: http://www.cnn.com/ eg: http://web.archive.org/web/ 20020209001709rn_1/www.cnn.com/? Persistent Web Annotations 

Slide:
23
 Rob Sanderson, Herbert Van de Sompel JCDL 2010, June 21-25, Surfers Paradise, Australia
  24. 24. Apr 10 2001, 21:39:30 UTC current Aug 15 2004, 08:45:27 UTC Aug 15 2007, 19:21:58 UTC www.cnn.com web.archive.org Persistent Web Annotations 

Slide:
24
 Rob Sanderson, Herbert Van de Sompel JCDL 2010, June 21-25, Surfers Paradise, Australia
  25. 25. Original Resource ? Mementos Apr 10 2001, 21:39:30 UTC current Aug 15 2004, 08:45:27 UTC Aug 15 2007, 19:21:58 UTC www.cnn.com web.archive.org Persistent Web Annotations 

Slide:
25
 Rob Sanderson, Herbert Van de Sompel JCDL 2010, June 21-25, Surfers Paradise, Australia
  26. 26. Original Resource ? TimeGate Mementos Apr 10 2001, 21:39:30 UTC current Aug 15 2004, 08:45:27 UTC Aug 15 2007, 19:21:58 UTC www.cnn.com web.archive.org Persistent Web Annotations 

Slide:
26
 Rob Sanderson, Herbert Van de Sompel JCDL 2010, June 21-25, Surfers Paradise, Australia
  27. 27. Original Resource Link Headers ? TimeGate Conneg with TimeGate to Mementos Mementos Apr 10 2001, 21:39:30 UTC current Aug 15 2004, 08:45:27 UTC Aug 15 2007, 19:21:58 UTC www.cnn.com web.archive.org Persistent Web Annotations 

Slide:
27
 Rob Sanderson, Herbert Van de Sompel JCDL 2010, June 21-25, Surfers Paradise, Australia
  28. 28. Experiments 
 1.  Given
an
annota@on,
find
the
appropriate
representa@ons
 •  Create
annota@on
on
resource
known
to
change
 •  Can
we
use
the
informa@on
from
the
annota@on
to
faithfully
 recreate
the
environment
through
Memento?
 2.  Given
an
archived
resource,
find
the
appropriate
annota@ons
 •  Create
annota@ons
at
different
@mes
on
resource
known
to
 be
archived
 •  Can
we
use
the
informa@on
from
Memento
to
find
the
 appropriate
annota@ons?
 Persistent Web Annotations 

Slide:
28
 Rob Sanderson, Herbert Van de Sompel JCDL 2010, June 21-25, Surfers Paradise, Australia
  29. 29. Experiment
1
 Retrieve Persistent Web Annotations 

Slide:
29
 Rob Sanderson, Herbert Van de Sompel JCDL 2010, June 21-25, Surfers Paradise, Australia
  30. 30. Experiment
1
 Reconstruct Persistent Web Annotations 

Slide:
30
 Rob Sanderson, Herbert Van de Sompel JCDL 2010, June 21-25, Surfers Paradise, Australia
  31. 31. Experiment
1:
Create
Annota@on
 Persistent Web Annotations 

Slide:
31
 Rob Sanderson, Herbert Van de Sompel JCDL 2010, June 21-25, Surfers Paradise, Australia
  32. 32. Experiment
1:
Test
without
Memento
 Persistent Web Annotations 

Slide:
32
 Rob Sanderson, Herbert Van de Sompel JCDL 2010, June 21-25, Surfers Paradise, Australia
  33. 33. Experiment
1:
Test
with
Memento
 Persistent Web Annotations 

Slide:
33
 Rob Sanderson, Herbert Van de Sompel JCDL 2010, June 21-25, Surfers Paradise, Australia
  34. 34. Experiment
2
 •  Need to find Original URI, start, end time of representation •  Need searchable collection of annotations Persistent Web Annotations 

Slide:
34
 Rob Sanderson, Herbert Van de Sompel JCDL 2010, June 21-25, Surfers Paradise, Australia
  35. 35. Experiment
2:
Create
Annota@ons 
 Persistent Web Annotations 

Slide:
35
 Rob Sanderson, Herbert Van de Sompel JCDL 2010, June 21-25, Surfers Paradise, Australia
  36. 36. Experiment
2:
Create
Annota@ons 
 Persistent Web Annotations 

Slide:
36
 Rob Sanderson, Herbert Van de Sompel JCDL 2010, June 21-25, Surfers Paradise, Australia
  37. 37. Experiment
2:
Create
Annota@ons 
 Persistent Web Annotations 

Slide:
37
 Rob Sanderson, Herbert Van de Sompel JCDL 2010, June 21-25, Surfers Paradise, Australia
  38. 38. Experiment
2:
Test
without
Memento
 Persistent Web Annotations 

Slide:
38
 Rob Sanderson, Herbert Van de Sompel JCDL 2010, June 21-25, Surfers Paradise, Australia
  39. 39. Experiment
2:
Test
with
Memento
 Persistent Web Annotations 

Slide:
39
 Rob Sanderson, Herbert Van de Sompel JCDL 2010, June 21-25, Surfers Paradise, Australia
  40. 40. Conclusions 
 •  Annota@on,
as
a
core
scholarly
prac@ce,
is
increasingly
web‐based
 •  We
propose
using
OAC
and
Memento
to
provide
a
solu@on
for
 persistence
of
annota@ons,
by
displaying
annota@ons
in
their
 original
context
and
displaying
relevant
annota@ons
for
archived
 resources
 •  Archiving
of
annotated
and
annota@ng
resources
important
 •  Interes@ng
research
ques@on
of
whether
annota@on
spans
 mul@ple
archived
resources
 Persistent Web Annotations 

Slide:
40
 Rob Sanderson, Herbert Van de Sompel JCDL 2010, June 21-25, Surfers Paradise, Australia
  41. 41. Thank
You


 •  Authors:
 •  azaroth42@gmail.com
/


rsanderson@lanl.gov
 •  hvdsomp@gmail.com


/


herbertv@lanl.gov
 •  OAC:


 •  hap://www.openannota@on.org/
 •  hap://groups.google.com/group/oac‐discuss
 •  Memento:

 •  hap://www.mementoweb.org/
 •  hap://groups.google.com/group/memento‐dev
 •  Thanks
To:
 •  Scoa
Ainsworth,
Luda
Balakireva,
Tim
Cole,
Anna
Gerber,
Bernhard
 Haslhofer,
Eric
Hetzner,
Jane
Hunter,
Cliff
Lynch,
Michael
Nelson,
Doug
 Reside,
Harihar
Shankar
 Persistent Web Annotations 

Slide:
41
 Rob Sanderson, Herbert Van de Sompel JCDL 2010, June 21-25, Surfers Paradise, Australia
  42. 42. Memento
HTTP
Flow
 HEAD R, (Accept-Datetime) LinkG GET G, Accept-Datetime 302M, Vary, TCN, LinkR,B,M GET M, (Accept-Datetime) 200, Content-Datetime, LinkR,B,M Persistent Web Annotations 

Slide:
42
 Rob Sanderson, Herbert Van de Sompel JCDL 2010, June 21-25, Surfers Paradise, Australia
  1. A particular slide catching your eye?

    Clipping is a handy way to collect important slides you want to go back to later.

×