Data Archiving and Networked Services

Riding the Wave and the Scholarly
Archive of the Future
Thinking in Progress by:
An...
Structure presentation
• Where we are today
• Pointers to the future
• Characterising that future
– Fundamental concepts
–...
Let’s go on a journey
• Republic of Letters
• System of Journals
• Web of Objects

January 20, 2014

CC-BY-SA, @atreloar a...
Functions of Research Communication
Rosendaal and Geurts (1997)
• Registration: Allows claims of precedence for a
scholarl...
System of Journals
• Registration
– submission of manuscript

• Certification
– peer-review (pre-publication)
– commentary...
Pointers to the future
“the future is already here – it’s
just not very evenly distributed”
William Gibson, NPR interview
...
Registration: BioRxiv

January 20, 2014

CC-BY-SA, @atreloar and @hvdsomp
Registration: ideacite

January 20, 2014

CC-BY-SA, @atreloar and @hvdsomp
Registration: Github

January 20, 2014

CC-BY-SA, @atreloar and @hvdsomp
Registration: WikiPathways

January 20, 2014

CC-BY-SA, @atreloar and @hvdsomp
Registration: NeuroLex

January 20, 2014

CC-BY-SA, @atreloar and @hvdsomp
Registration: Nanopublications

January 20, 2014

CC-BY-SA, @atreloar and @hvdsomp
Registration: Observations
•
•
•
•

Decoupling registration from certification
Timestamping, versioning
Registration of va...
Certification: PubMed Commons

January 20, 2014

CC-BY-SA, @atreloar and @hvdsomp
Certification: PubPeer

January 20, 2014

CC-BY-SA, @atreloar and @hvdsomp
Certification: ZooUniverse

January 20, 2014

CC-BY-SA, @atreloar and @hvdsomp
Certification: Slideshare

January 20, 2014

CC-BY-SA, @atreloar and @hvdsomp
Certification: Project FeederWatch

January 20, 2014

CC-BY-SA, @atreloar and @hvdsomp
Certification: Observations
•
•
•
•

Peer-review decoupled from publication process
Certification of various types of obje...
Awareness: NARCIS

January 20, 2014

CC-BY-SA, @atreloar and @hvdsomp
Awareness: myExperiment

January 20, 2014

CC-BY-SA, @atreloar and @hvdsomp
Awareness: eLabNotebook RSS

January 20, 2014

CC-BY-SA, @atreloar and @hvdsomp
Awareness: Twitter

January 20, 2014

CC-BY-SA, @atreloar and @hvdsomp
Awareness: CrossRef Prospect

January 20, 2014

CC-BY-SA, @atreloar and @hvdsomp
Awareness: Observations
•
•
•
•

Awareness for various types of objects
Real time awareness
Awareness support targeted at ...
Archiving: CLOCKSS

January 20, 2014

CC-BY-SA, @atreloar and @hvdsomp
Archiving: DANS Easy

January 20, 2014

CC-BY-SA, @atreloar and @hvdsomp
Archiving: Australian Antarctic Data Centre

January 20, 2014

CC-BY-SA, @atreloar and @hvdsomp
Archiving: perma.cc

January 20, 2014

CC-BY-SA, @atreloar and @hvdsomp
Archiving: EU Trusted Digital Repositories

January 20, 2014

CC-BY-SA, @atreloar and @hvdsomp
Archiving: Observations
•
•
•
•

Archiving for various types of objects
Distributed archives
Archival consortia
Audit for ...
Characterising the future
Hidden

Research Process

Visible

Fixed

Nature of object

Varying

Atomic

Atomicity of object...
Fundamental changes
• The research process (objects, social dimension)
is becoming more exposed

• Articles, books are no ...
Web of Objects
• Registration
– Recording of a wide variety of objects, versions of objects

• Certification
– Content/For...
Archiving: Observation 1

January 20, 2014

CC-BY-SA, @atreloar and @hvdsomp
System of Journals: Publication

January 20, 2014

CC-BY-SA, @atreloar and @hvdsomp
System of Journals: Archiving

January 20, 2014

CC-BY-SA, @atreloar and @hvdsomp
Web of Objects: Registration

January 20, 2014

CC-BY-SA, @atreloar and @hvdsomp
Web of Objects: Archiving?

January 20, 2014

CC-BY-SA, @atreloar and @hvdsomp
Need to do better than this

January 20, 2014

CC-BY-SA, @atreloar and @hvdsomp
Not just citation relationships

January 20, 2014

CC-BY-SA, @atreloar and @hvdsomp
Archiving: Observation 2

January 20, 2014

CC-BY-SA, @atreloar and @hvdsomp
Web platforms for scholarship
• Common web platforms are increasingly used for
scholarship
– Wikis, GitHub, Twitter, Wordp...
Recording not Archiving
“GitHub reserves the right at any time and from time to
time to modify or discontinue, temporarily...
Recording isn’t Archiving
Recording

Archiving

Short-term

Longer-term

No guarantees

Read/Write

Try to provide
guarant...
Infrastructure implications
• This infrastructure needs to include
– use of common platforms to support recording
– availa...
January 20, 2014

CC-BY-SA, @atreloar and @hvdsomp
Implications
• Need organizational, technical,
curational interfaces between recording
and archiving platforms
• Need orga...
Upcoming SlideShare
Loading in …5
×

Scholarly archive-of-the-future

3,005 views

Published on

Talk given by @atreloar and @hvdsomp at workshop sponsored by http://dans.knaw.nl/ with title "Riding the Wave and the Scholarly Archive of the Future". NOTE: This reflects thinking in progress which may well change in the future.

Published in: Technology, Business

Scholarly archive-of-the-future

  1. 1. Data Archiving and Networked Services Riding the Wave and the Scholarly Archive of the Future Thinking in Progress by: Andrew Treloar DANS Visiting Fellow ANDS Director of Technology Herbert van de Sompel DANS Visiting Fellow LANL Scientist #rtwsaf DANS is an institute of KNAW and NWO
  2. 2. Structure presentation • Where we are today • Pointers to the future • Characterising that future – Fundamental concepts – Observations about archiving – Diagramming the infrastructure January 20, 2014 CC-BY-SA, @atreloar and @hvdsomp
  3. 3. Let’s go on a journey • Republic of Letters • System of Journals • Web of Objects January 20, 2014 CC-BY-SA, @atreloar and @hvdsomp
  4. 4. Functions of Research Communication Rosendaal and Geurts (1997) • Registration: Allows claims of precedence for a scholarly finding • Certification: Establishes validity of claim • Awareness: Allows actors in the system to remain aware of new claims • Archiving: Preserves the scholarly record January 20, 2014 CC-BY-SA, @atreloar and @hvdsomp
  5. 5. System of Journals • Registration – submission of manuscript • Certification – peer-review (pre-publication) – commentary (post-publication) • Awareness – discovery services • Archiving – libraries (print) – publishers (electronic) – special purpose organisations (e.g. Portico) January 20, 2014 CC-BY-SA, @atreloar and @hvdsomp
  6. 6. Pointers to the future “the future is already here – it’s just not very evenly distributed” William Gibson, NPR interview January 20, 2014 CC-BY-SA, @atreloar and @hvdsomp
  7. 7. Registration: BioRxiv January 20, 2014 CC-BY-SA, @atreloar and @hvdsomp
  8. 8. Registration: ideacite January 20, 2014 CC-BY-SA, @atreloar and @hvdsomp
  9. 9. Registration: Github January 20, 2014 CC-BY-SA, @atreloar and @hvdsomp
  10. 10. Registration: WikiPathways January 20, 2014 CC-BY-SA, @atreloar and @hvdsomp
  11. 11. Registration: NeuroLex January 20, 2014 CC-BY-SA, @atreloar and @hvdsomp
  12. 12. Registration: Nanopublications January 20, 2014 CC-BY-SA, @atreloar and @hvdsomp
  13. 13. Registration: Observations • • • • Decoupling registration from certification Timestamping, versioning Registration of various types of objects Machines as creators and contributors January 20, 2014 CC-BY-SA, @atreloar and @hvdsomp
  14. 14. Certification: PubMed Commons January 20, 2014 CC-BY-SA, @atreloar and @hvdsomp
  15. 15. Certification: PubPeer January 20, 2014 CC-BY-SA, @atreloar and @hvdsomp
  16. 16. Certification: ZooUniverse January 20, 2014 CC-BY-SA, @atreloar and @hvdsomp
  17. 17. Certification: Slideshare January 20, 2014 CC-BY-SA, @atreloar and @hvdsomp
  18. 18. Certification: Project FeederWatch January 20, 2014 CC-BY-SA, @atreloar and @hvdsomp
  19. 19. Certification: Observations • • • • Peer-review decoupled from publication process Certification of various types of objects Machines validating Social endorsement January 20, 2014 CC-BY-SA, @atreloar and @hvdsomp
  20. 20. Awareness: NARCIS January 20, 2014 CC-BY-SA, @atreloar and @hvdsomp
  21. 21. Awareness: myExperiment January 20, 2014 CC-BY-SA, @atreloar and @hvdsomp
  22. 22. Awareness: eLabNotebook RSS January 20, 2014 CC-BY-SA, @atreloar and @hvdsomp
  23. 23. Awareness: Twitter January 20, 2014 CC-BY-SA, @atreloar and @hvdsomp
  24. 24. Awareness: CrossRef Prospect January 20, 2014 CC-BY-SA, @atreloar and @hvdsomp
  25. 25. Awareness: Observations • • • • Awareness for various types of objects Real time awareness Awareness support targeted at machines Awareness through social media January 20, 2014 CC-BY-SA, @atreloar and @hvdsomp
  26. 26. Archiving: CLOCKSS January 20, 2014 CC-BY-SA, @atreloar and @hvdsomp
  27. 27. Archiving: DANS Easy January 20, 2014 CC-BY-SA, @atreloar and @hvdsomp
  28. 28. Archiving: Australian Antarctic Data Centre January 20, 2014 CC-BY-SA, @atreloar and @hvdsomp
  29. 29. Archiving: perma.cc January 20, 2014 CC-BY-SA, @atreloar and @hvdsomp
  30. 30. Archiving: EU Trusted Digital Repositories January 20, 2014 CC-BY-SA, @atreloar and @hvdsomp
  31. 31. Archiving: Observations • • • • Archiving for various types of objects Distributed archives Archival consortia Audit for trustworthiness January 20, 2014 CC-BY-SA, @atreloar and @hvdsomp
  32. 32. Characterising the future Hidden Research Process Visible Fixed Nature of object Varying Atomic Atomicity of object Compound Discrete Process of making public Continuous Delayed S peed of communication Publication +data proxies Communicated object Formal Nature of process Instant Publication + linked data + linked models Informal January 20, 2014 CC-BY-SA, @atreloar and @hvdsomp
  33. 33. Fundamental changes • The research process (objects, social dimension) is becoming more exposed • Articles, books are no longer the only relevant objects for research communication • Objects are no longer static • Machines are joining humans as (co-)creators and consumers of research objects January 20, 2014 CC-BY-SA, @atreloar and @hvdsomp
  34. 34. Web of Objects • Registration – Recording of a wide variety of objects, versions of objects • Certification – Content/Form – Human/Machine • Awareness – Real-time – Social – Variety of objects • Archiving – Archiving a wide variety of objects – Trusted archives January 20, 2014 CC-BY-SA, @atreloar and @hvdsomp
  35. 35. Archiving: Observation 1 January 20, 2014 CC-BY-SA, @atreloar and @hvdsomp
  36. 36. System of Journals: Publication January 20, 2014 CC-BY-SA, @atreloar and @hvdsomp
  37. 37. System of Journals: Archiving January 20, 2014 CC-BY-SA, @atreloar and @hvdsomp
  38. 38. Web of Objects: Registration January 20, 2014 CC-BY-SA, @atreloar and @hvdsomp
  39. 39. Web of Objects: Archiving? January 20, 2014 CC-BY-SA, @atreloar and @hvdsomp
  40. 40. Need to do better than this January 20, 2014 CC-BY-SA, @atreloar and @hvdsomp
  41. 41. Not just citation relationships January 20, 2014 CC-BY-SA, @atreloar and @hvdsomp
  42. 42. Archiving: Observation 2 January 20, 2014 CC-BY-SA, @atreloar and @hvdsomp
  43. 43. Web platforms for scholarship • Common web platforms are increasingly used for scholarship – Wikis, GitHub, Twitter, Wordpress, etc. • Many of these have desirable characteristics: – Versioning – Timestamping – Social embedding • Still, they record rather than archive January 20, 2014 CC-BY-SA, @atreloar and @hvdsomp
  44. 44. Recording not Archiving “GitHub reserves the right at any time and from time to time to modify or discontinue, temporarily or permanently, the Service (or any part thereof) with or without notice.” “GitHub does not warrant that (i) the service will meet your specific requirements, (ii) the service will be uninterrupted, timely, secure, or error-free, (iii) the results that may be obtained from the use of the service will be accurate or reliable, (iv) the quality of any products, services, information, or other material purchased or obtained by you through the service will meet your expectations, and (v) any errors in the Service will be corrected.” January 20, 2014 CC-BY-SA, @atreloar and @hvdsomp
  45. 45. Recording isn’t Archiving Recording Archiving Short-term Longer-term No guarantees Read/Write Try to provide guarantees Read Scholarly Process Scholarly Record January 20, 2014 CC-BY-SA, @atreloar and @hvdsomp
  46. 46. Infrastructure implications • This infrastructure needs to include – use of common platforms to support recording – availability of specialist platforms to support archiving • We need an archiving infrastructure that underpins research activity that is – – – – – trusted sustainable distributed interoperable standards-based January 20, 2014 CC-BY-SA, @atreloar and @hvdsomp
  47. 47. January 20, 2014 CC-BY-SA, @atreloar and @hvdsomp
  48. 48. Implications • Need organizational, technical, curational interfaces between recording and archiving platforms • Need organizational, technical interfaces across archiving platforms January 20, 2014 CC-BY-SA, @atreloar and @hvdsomp

×