Successfully reported this slideshow.
We use your LinkedIn profile and activity data to personalize ads and to show you more relevant ads. You can change your ad preferences anytime.
Modéliser la ressource web,
contextualiser la référence :
des enjeux pour le patrimoine numérique
Philoweb’10
nicolas.dela...
Motivating scenario
Bookmark overload
- Every day usage of bookmarks is boring
- The Bookmark model hasn’t changed since N...
Motivating scenario
What are people doing online ??
- Looking for information
 Tutorials, Wikipedia…
 RSS, Google alerts...
Is that all we can do?
Motivating scenario
Why is this technology so poor ?
Why is it so difficult to design orientation tools ?
Web User Point o...
Possible part of the answer
Once again…the technique is in advance over theory
- The Web has evolved really fast
- This ev...
Web resources are :
- growing
- heterogenous
Observation #1
IPv6 : Adresses multiplication
667 x 1015 IP adresses per mm2
Internet of Things
RFID
QR Codes
Semantic Web
The Web of Data
RDF : Identity Crisis
HTTP code 303
Source : H. Halpin, V. Presutti
Information Resource
vs.
Non Information Resource
URI : W3C Best Practices
Source : W3C, http://www.w3.org/TR/cooluris/
Content Negociation
(Conneg)
Linking Open Data (LOD)
May 2007
April 2008
September 2008
March 2009
Linking
Open Data
Linking
Open Data
September 2010
How to avoid
disorientation ?
http://www.dbpedia.org/id/EiffelTower
http://www.dbpedia.org/doc/EiffelTower
dbpedia://resou...
Web reference is unreliable
Observation #2
HTTP 404
Page not found
No Web without
the 404 error code.
Dynamic
Web sites
First instability cause :
A (potential) content
generation at every request
Dynamic
Web sites
Second instability cause :
client scripts execution
during the reading.
Resident Evil
« Pour être sûr que je demeure fidèle à ma résolution de ne pas
accepter comme vrai rien qui ne soit pas abs...
Proof by Example
Capturing a web site homepage
« Le Monde »
11th October 12th October
Documentation initiatives & Tools
Observation #3
Private Libraries
Web Archiving French Legal Deposit
IIPC
Wayback Machine…
Petabox – Wayback Machine
Social Bookmarking
Social Tagging
Content syndication
Scrapping
Wozaik Zotero
Cartography
The Web native
documentation model
is insufficient.
What do we refer ?
Observation #4
Page based model
How to refer a web site
as a whole?
Homepage
Looks like a document but…
Is it?
Excel
Google Docs
URL
HTML Source Code
HTML5 + Javascript new API
Web pages are becoming Web applications
- Future of the web :
 an open repository of Web appli...
Hypothesis
Building a Conceptual Framework
Assertions
- Modeling the Web objects and their references will
help to design orientation...
Web Page
What it is :
A Web re-presentation (of a resource state)
An Information Medium
+
An Iteraction Device
What it is ...
Web spaces
The Web is made of several layers
P : Pages available through HTTP.
S : Web services available through HTTP
D :...
Web spaces
Intersection Kind of Resources
P* Web 1.0
S* Web services for composition
D* Open Databases (RDF ou autres, sit...
Webmark : Enhanced Bookmark
Webmark, aims
- To redesign the management of the references
- To analyse the intentionnality ...
Intentionnality of the Marking
Identified kinds of marking
- Content mark
 interest for the content of the resource
- Loc...
Let’s play
What kind of
reference is it ?
+
+
+
+
+
+
+
Thank you for
your attention
Questions ?
Nicolas Delaforge: Modeling the Web resource, extracting the context: stakes for digital memory.
Upcoming SlideShare
Loading in …5
×

Nicolas Delaforge: Modeling the Web resource, extracting the context: stakes for digital memory.

1,850 views

Published on

Published in: Technology
  • Be the first to comment

Nicolas Delaforge: Modeling the Web resource, extracting the context: stakes for digital memory.

  1. 1. Modéliser la ressource web, contextualiser la référence : des enjeux pour le patrimoine numérique Philoweb’10 nicolas.delaforge@inria.fr Equipe Edelweiss – INRIA Sophia Antipolis Modeling the Web Resource, extracting the context : Stakes for the digital memory
  2. 2. Motivating scenario Bookmark overload - Every day usage of bookmarks is boring - The Bookmark model hasn’t changed since Netscape - The Bookmark reference is inaccurate - BM is very difficult to reuse out of the browser Yet, Bookmarking system is one of the main way to access WWW (after Google of course...)
  3. 3. Motivating scenario What are people doing online ?? - Looking for information  Tutorials, Wikipedia…  RSS, Google alerts - Producing information  Communicating about themselves - Doing some social activity  Twitter, Facebook, Blogs, Forums,… - Checking news about web sites or topics of interest  Business Intelligence - Using online applications or services  Webmail, Google docs  e-commerce, e-banking, e-administration  Intranet Different activities, Different objects, but a single tool to organize all.
  4. 4. Is that all we can do?
  5. 5. Motivating scenario Why is this technology so poor ? Why is it so difficult to design orientation tools ? Web User Point of View
  6. 6. Possible part of the answer Once again…the technique is in advance over theory - The Web has evolved really fast - This evolution was mainly technology-driven - Lack of definitions  Mainly technical definitions are available Questions to be answered : - What is a Web site ? - What is a Web page ? - What is behind a Web reference ? - Is the Web made of digital documents or is it a big soup of web resources?
  7. 7. Web resources are : - growing - heterogenous Observation #1
  8. 8. IPv6 : Adresses multiplication 667 x 1015 IP adresses per mm2
  9. 9. Internet of Things
  10. 10. RFID
  11. 11. QR Codes
  12. 12. Semantic Web The Web of Data
  13. 13. RDF : Identity Crisis HTTP code 303 Source : H. Halpin, V. Presutti Information Resource vs. Non Information Resource
  14. 14. URI : W3C Best Practices Source : W3C, http://www.w3.org/TR/cooluris/ Content Negociation (Conneg) Linking Open Data (LOD)
  15. 15. May 2007 April 2008 September 2008 March 2009 Linking Open Data
  16. 16. Linking Open Data September 2010
  17. 17. How to avoid disorientation ? http://www.dbpedia.org/id/EiffelTower http://www.dbpedia.org/doc/EiffelTower dbpedia://resource/EiffelTower
  18. 18. Web reference is unreliable Observation #2
  19. 19. HTTP 404 Page not found No Web without the 404 error code.
  20. 20. Dynamic Web sites First instability cause : A (potential) content generation at every request
  21. 21. Dynamic Web sites Second instability cause : client scripts execution during the reading.
  22. 22. Resident Evil « Pour être sûr que je demeure fidèle à ma résolution de ne pas accepter comme vrai rien qui ne soit pas absolument certain, j’assumerai délibérément qu’un démon tout-puissant est continuellement en train de me tromper au sujet de l’existence du monde physique, incluant même mon propre corps. » Méditations Métaphysiques, Descartes
  23. 23. Proof by Example Capturing a web site homepage « Le Monde »
  24. 24. 11th October 12th October
  25. 25. Documentation initiatives & Tools Observation #3
  26. 26. Private Libraries
  27. 27. Web Archiving French Legal Deposit IIPC Wayback Machine… Petabox – Wayback Machine
  28. 28. Social Bookmarking
  29. 29. Social Tagging
  30. 30. Content syndication
  31. 31. Scrapping Wozaik Zotero
  32. 32. Cartography
  33. 33. The Web native documentation model is insufficient. What do we refer ? Observation #4
  34. 34. Page based model How to refer a web site as a whole? Homepage
  35. 35. Looks like a document but… Is it? Excel Google Docs URL HTML Source Code
  36. 36. HTML5 + Javascript new API Web pages are becoming Web applications - Future of the web :  an open repository of Web applications ? Yes this is a Web page !
  37. 37. Hypothesis
  38. 38. Building a Conceptual Framework Assertions - Modeling the Web objects and their references will help to design orientation solutions - Reference types can only be defined from a user point of view
  39. 39. Web Page What it is : A Web re-presentation (of a resource state) An Information Medium + An Iteraction Device What it is not : A Memory extension (Tertiary Retention) * A Document *this property confers a great communication reactivity (data stream ?)
  40. 40. Web spaces The Web is made of several layers P : Pages available through HTTP. S : Web services available through HTTP D : Data available through HTTP
  41. 41. Web spaces Intersection Kind of Resources P* Web 1.0 S* Web services for composition D* Open Databases (RDF ou autres, sitemaps, LDAP…) SP Web 2.0, RIA, collaborative sites, e-commerce, e-banking… DS Connectors, Data convertors, SPARQL End points, OKKAM… DP RDFa annotated pages (ex : OGP), Microformats, Microdata DPS Pages « conneg ready », DBPedia * exclusive
  42. 42. Webmark : Enhanced Bookmark Webmark, aims - To redesign the management of the references - To analyse the intentionnality of the marking - To exploit the context of the marking - To propose dedicated services according to the kind of the marking.
  43. 43. Intentionnality of the Marking Identified kinds of marking - Content mark  interest for the content of the resource - Location mark  Interest for a place, a community - Application mark  Alias to favorite online applications - Interest Mark  The famous “I like it” or FOAF interest - Composition Mark  A service to be used later in a process
  44. 44. Let’s play What kind of reference is it ?
  45. 45. +
  46. 46. +
  47. 47. +
  48. 48. +
  49. 49. +
  50. 50. +
  51. 51. +
  52. 52. Thank you for your attention
  53. 53. Questions ?

×