A Clean Slate?

@hvdsomp
http://public.lanl.gov/herbertv/
herbert van de sompel
Includes slides by Sean Bechhofer, Carole Goble, Robert Sanderson
paper-based scholarly communication system
scanned version of paper-based scholarly communication system
natively digital, web-based, scholarly communication system
Context of My Work, My Talk
painful transition
In Silico (Computational) Science
Datasets
Data collections
Algorithms
Configurations
Tools and Apps
Codes
Code Libraries
Services,
Infrastructure,
Compilers
Hardware
Simulations, data exploration, data processing, analytics, database based, text
mining, auto recommendation, visual analytics…
Actually Digital Science is just Science
Carole Goble, JCDL 2012 Keynote
https://dl.dropbox.com/u/617206/JCDL2012keynoteGoble.ppt
Scientific Workflows, Services, Data, Workflow Engines
Carole Goble, JCDL 2012 Keynote
https://dl.dropbox.com/u/617206/JCDL2012keynoteGoble.ppt
All components continuously in flux.
How to reproduce results in such an environment?
A Lot of Rs for Reproducibility
•  Rerun: Re-execute the original experiment using a revised setting.
•  Review: Validate and justify the results empirically. Trust.
Understand. Train. Convince and comfort.
•  Replicate / Repeat: Exactly replicate the original experiment.
Eliminate change.
•  Reproduce: Run the experiment with differences in elements (materials,
methods, platform or setting) and compare to test for the same result.
•  Replay: Run through what happened using logs, without the original
platform or the need to execute.
Carole Goble, JCDL 2012 Keynote
https://dl.dropbox.com/u/617206/JCDL2012keynoteGoble.ppt
A Lot of Rs for Reuse
•  Refresh: Execute an upgraded original experiment.
•  Reconstruct: Rebuild using new elements or a different platform when
the originals are lost/unavailable/inaccessible.
•  Reuse: Use as part of new experiments.
•  Repurpose / Reassemble: Reuse elements in a new experiment.
Carole Goble, JCDL 2012 Keynote
https://dl.dropbox.com/u/617206/JCDL2012keynoteGoble.ppt
The Article is the Knowledge Bottleneck
“An article about computational science in a scientific
publication is not the scholarship itself, it is merely
advertising of the scholarship. The actual scholarship is the
complete software development environment, [the complete
data] and the complete set of instructions which generated
the figures.”
Buckheit, J. and Donoho, D. (1995) Wavelab and reproducible research
http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.3.2982
The Article is the Knowledge Bottleneck
“Changes are occurring in the ways in which scientific
research is conducted. Within e-laboratories, methods such
as scientific workflows, research protocols, standard
operating procedures and algorithms for analysis and
simulation are used to manipulate and produce data.
Experimental or observational data and scientific models are
typically born digital with no physical counterpart. This move
to digital content is driving a sea-change in scientific
publication, and challenging traditional scholarly
publication.”
Bechhofer S. et al (2010) Research Objects: Towards Exchange and Reuse of Digital
Knowledge http://dx.doi.org/10.1038/npre.2010.4626.1
•  Involved in each such experiment is a complex set of resources
with complex relationships
•  There is a need to share these resources in order to support
forms of reuse, reproducibility
•  This entails the augmentation of the scholarly record with
an explicit account of the research process
•  Digital exchange of each resource individually is trivial,
exchange of the combined knowledge is not
•  Traditional electronic publications cannot handle this job:
•  Targeted at humans, not machines
•  Communicate findings, not all the scientific knowledge behind
the findings
•  Content not decomposable into actionable units
•  Outputs, results, methods not reusable
If not the Article, then What?
Bechhofer S. et al (2010) Research Objects: Towards Exchange and Reuse of Digital
Knowledge http://dx.doi.org/10.1038/npre.2010.4626.1
The Clean Slate Challenge
Add features to support these needs to the existing scholarly communication system?
Start with a clean slate?
Research Objects
http://www.researchobject.org/ http://www.wf4ever-project.org/
Research Objects: Aggregated Content
•  Data used or results produced in
an experimental study
•  Methods employed to produce and
analyze that data
•  Provenance and setting
information about the experiments
•  People involved in the
investigation
•  Annotations about these
resources, that are essential to the
understanding and interpretation of
the scientific outcomes captured
by a research object.
http://www.researchobject.org/
http://www.w3.org/community/rosc/
Research Objects
http://www.researchobject.org/
Research Objects: Aggregation
“Research Objects are aggregations of content. Thus a
Research Object framework needs to provide a mechanism
for this aggregation. Aggregations are likely to include
references to resources but there may also, however, be
situations, where, for reasons of efficiency or in order to
support persistence, Research Objects should also be able
to aggregate literal data as well as references to data.”
Bechhofer S. et al (2010) Research Objects: Towards Exchange and Reuse of Digital
Knowledge http://dx.doi.org/10.1038/npre.2010.4626.1
•  OAI-ORE observation: Scholarly assets are
rapidly becoming compound, consisting of
multiple resources
•  e.g. datasets, software, ontologies,
workflows, online debate, slides, blogs,
videos, etc.
with various:
•  Relationships
•  Interdependencies
•  How to convey this compound-ness in an
interoperable manner so that applications
can access, consume such assets?
2007
Funded by the Mellon Foundation & Microsoft Research
http://www.openarchives.org/ore/
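One way to picture this compound-ness is as a handful of RDF-style triples in which a Resource Map describes an Aggregation, and the Aggregation lists its parts. The sketch below uses plain Python tuples rather than an RDF library. The example.org URIs and both helper names are hypothetical; only the `describes` and `aggregates` terms are taken from the ORE vocabulary.

```python
ORE = "http://www.openarchives.org/ore/terms/"

def build_resource_map(rem_uri, aggregation_uri, parts):
    """Return (subject, predicate, object) triples for one aggregation."""
    triples = [(rem_uri, ORE + "describes", aggregation_uri)]
    for part in parts:
        triples.append((aggregation_uri, ORE + "aggregates", part))
    return triples

def aggregated_resources(triples, aggregation_uri):
    """List what an application finds when it consumes the aggregation."""
    return [o for s, p, o in triples
            if s == aggregation_uri and p == ORE + "aggregates"]

# Hypothetical URIs, for illustration only.
triples = build_resource_map(
    "http://example.org/rem/1",
    "http://example.org/aggregation/1",
    ["http://example.org/dataset.csv", "http://example.org/workflow.t2flow"],
)
```

An application that understands these two terms can enumerate the parts of any aggregation without knowing anything about the system that published it.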
Foundations of the ORE Solution
•  Web Architecture - Resource, URI, Representation
•  Semantic Web:
•  URIs for documents (information resources),
•  URIs for physical entities, concepts, abstractions (non-information
resources)
•  RDF – to express properties, relationships pertaining to resources
•  Linked Data:
•  HTTP URIs for both information and non-information resources
•  HTTP 303 redirect:
•  From: The HTTP URI of non-information resource
•  To: The HTTP URI of an information resource that describes
the non-information resource
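The 303 pattern above can be sketched without any network access. The following is a minimal simulation, assuming a hypothetical server whose URIs and lookup tables are invented for illustration: asking for the non-information resource yields a 303 with a Location header, and following it yields the describing document.

```python
# Hypothetical mapping: non-information resource URI -> describing document URI.
DESCRIBED_BY = {
    "http://example.org/id/experiment-42": "http://example.org/doc/experiment-42",
}
# Hypothetical information resources and their representations.
DOCUMENTS = {
    "http://example.org/doc/experiment-42": "RDF description of experiment 42",
}

def respond(uri):
    """Return (status, headers, body) the way a Linked Data server might."""
    if uri in DESCRIBED_BY:
        return 303, {"Location": DESCRIBED_BY[uri]}, ""
    if uri in DOCUMENTS:
        return 200, {"Content-Type": "text/plain"}, DOCUMENTS[uri]
    return 404, {}, ""

def dereference(uri, max_hops=5):
    """Follow 303 redirects until a representation (or an error) is reached."""
    for _ in range(max_hops):
        status, headers, body = respond(uri)
        if status != 303:
            return status, uri, body
        uri = headers["Location"]
    raise RuntimeError("too many redirects")
```

The client ends up with two distinct URIs: one that names the thing itself and one that names the document describing it.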
Adding Account of Research Life Cycles to Scholarly Record
Pepe, A., Mayernik, M., Borgman, C., Van de Sompel, H. (2009) Technology to
Represent Scientific Practice: Data, Life Cycles, and Value Chains.
http://dx.doi.org/10.1002/asi.21263
ORE & Research Objects
“…, Research Objects should also be able to aggregate literal data as
well as references to data.”
•  Aggregated Resources in ORE have HTTP URIs; probably needs to
be relaxed.
•  Embedding content in RDF, irrespective of ORE, is … interesting
•  See: Representing Content in RDF 1.0 http://www.w3.org/TR/Content-in-RDF10/
•  Allows embedding base64, text, XML
•  Resource Map as manifest in e.g. ZIP file?
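The manifest-in-a-ZIP idea can be sketched as follows. This is only an illustration under stated assumptions: the manifest is a made-up JSON layout (not an ORE Resource Map serialization), the function names and example.org URI are hypothetical, and literal data is embedded as base64 next to plain references.

```python
import base64
import io
import json
import zipfile

def package_research_object(references, literals):
    """Bundle referenced URIs and embedded bytes into an in-memory ZIP.

    The manifest layout is an invented JSON structure for illustration.
    """
    manifest = {"aggregates": []}
    for uri in references:
        # Aggregated by reference: only the URI is recorded.
        manifest["aggregates"].append({"uri": uri})
    for name, data in literals.items():
        # Aggregated as literal data: the bytes travel inside the manifest.
        manifest["aggregates"].append(
            {"name": name, "base64": base64.b64encode(data).decode("ascii")}
        )
    buffer = io.BytesIO()
    with zipfile.ZipFile(buffer, "w") as archive:
        archive.writestr("manifest.json", json.dumps(manifest, indent=2))
    return buffer.getvalue()

def read_manifest(zip_bytes):
    """Read the manifest back out of the packaged bytes."""
    with zipfile.ZipFile(io.BytesIO(zip_bytes)) as archive:
        return json.loads(archive.read("manifest.json"))

bundle = package_research_object(
    ["http://example.org/dataset.csv"], {"params.txt": b"alpha=0.05\n"}
)
manifest = read_manifest(bundle)
```

Embedding keeps small, critical items with the object for persistence, while references keep the package small when the data is large.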
Research Objects
http://www.researchobject.org/
Research Objects: Annotation
“Annotations about these resources, that are essential to the
understanding and interpretation of the scientific outcomes
captured by a research object.”
http://www.researchobject.org/
•  Annotation is a pervasive scholarly activity,
conducted by people and machines
•  Many annotation efforts and tools
•  But annotations stuck in silos:
•  Only consumable by the client that created
them
•  Annotations not shareable beyond
original environment
•  Open Annotation focuses on interoperability
for annotations in order to allow sharing of
annotations across:
•  Annotation clients
•  Content collections
•  Services that leverage annotations
2009
Funded by the Mellon Foundation
http://www.openannotation.org/spec/core/
•  Established to reconcile Open Annotation Collaboration and
Annotation Ontology models
•  67 participants from around the world: 7th of 119 groups
Many universities, also commercial and not-for-profit
•  Mission:
Interoperability between Annotation systems and platforms, by
…following the Architecture of the Web
…reusing existing web standards
…providing a single, coherent model to implement
…without requiring adoption of specific platforms
…while maintaining low implementation costs
W3C Open Annotation Community Group
http://www.w3.org/community/openannotation/
“An Annotation is considered to be a set of connected
resources, typically including a body and target, where
the body is related to the target.”
Highlighting, Bookmarking
Commenting, Describing
Tagging, Linking
Classifying, Identifying
Questioning, Replying
Editing, Moderating
…Provide an Aide-Memoire
…Share and Inform
…Improve Discovery
…Organize Resources
…Interact with Others
…Create as well as Consume
What is an Annotation?
http://www.w3.org/community/openannotation/
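The body/target definition can be made concrete with a small sketch. It represents an annotation as a Python dict keyed by property URI, using the Open Annotation namespace for the Annotation class and the hasBody and hasTarget properties. The helper names and example.org URIs are hypothetical.

```python
OA = "http://www.w3.org/ns/oa#"

def make_annotation(annotation_uri, body_uri, target_uri):
    """Describe an annotation as a dict of property URI -> value."""
    return {
        "@id": annotation_uri,
        "@type": OA + "Annotation",
        OA + "hasBody": body_uri,      # what is being said
        OA + "hasTarget": target_uri,  # what it is being said about
    }

def annotations_about(annotations, target_uri):
    """Find every annotation whose target is the given resource."""
    return [a for a in annotations if a[OA + "hasTarget"] == target_uri]

# Hypothetical URIs, for illustration only.
comment = make_annotation(
    "http://example.org/anno/1",
    "http://example.org/comments/looks-wrong",
    "http://example.org/dataset.csv",
)
```

Because the annotation, its body, and its target are all identified by URIs, any client that understands the model can find and consume annotations created elsewhere.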
Annotates / Annotations
Annotates? / Annotations?
Basic Open Annotation Data Model
Use Case: Bookmarking
Use Case: Commenting
Use Case: Tagging
Specific Body and Specific Target resources identify the region of
interest, and/or the state of the resource.
Need to be able to describe the state of the resource, the segment
of interest, and potentially styling hints for how to render it.
Open Annotation introduces:
State: Describes how to retrieve representation
Selector: Describes how to select segment
Style: Describes how to render/process segment
Scope: Describes context of the resource
Further Specification of Resources
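A Selector can be illustrated with character offsets into a text representation. The sketch below is an assumption-laden example: the dict layout, the `apply_selector` helper, and the example.org URI are hypothetical, and the selector is modeled loosely on a text position selector with `start` and `end` offsets.

```python
def apply_selector(text, selector):
    """Return the segment of `text` picked out by a character-offset selector."""
    start, end = selector["start"], selector["end"]
    if not 0 <= start <= end <= len(text):
        raise ValueError("selector falls outside the representation")
    return text[start:end]

# A specific target: the source resource plus the segment of interest.
specific_target = {
    "source": "http://example.org/article.txt",  # hypothetical URI
    "selector": {"type": "TextPositionSelector", "start": 4, "end": 9},
}

representation = "The quick brown fox"
segment = apply_selector(representation, specific_target["selector"])
```

Offsets only make sense against one particular representation, which is why State (which representation to retrieve) matters alongside the Selector when content changes at the same URI.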
Use Case: Changing Content at the Same URI
Use Case: Segment of Interest
W3C Open Annotation & Research Objects
•  Early renderings of Research Objects emerging from the Wf4Ever
project use Annotation Ontology as the annotation framework
•  But since the Annotation Ontology and Open Annotation Collaboration
models now merge into the W3C Open Annotation model, it is safe to
assume W3C Open Annotation will be used for Research Objects
Research Objects
http://www.researchobject.org/
Research Objects: Versioning and Evolution
“Research Objects are dynamic in that their contents can
change and be changed – additional contents may be
added to aggregations, or additional metadata can be
asserted about the contents or relationships between
content. The resources that are aggregated may change.
Thus there is a need for versioning, allowing the recording
of changes to objects, potentially along with facilities for
retrieving objects or aggregated elements at particular
historical points in their lifecycle.”
Bechhofer S. et al (2010) Research Objects: Towards Exchange and Reuse of Digital
Knowledge http://dx.doi.org/10.1038/npre.2010.4626.1
ORE Experiment: Versioning and Evolution of Compound Objects
Van de Sompel, H. et al. (2007) Appendix to Interoperability for the Discovery, Use, and
Re-Use of Units of Scholarly Communication
http://www.ctwatch.org/quarterly/articles/2007/08/interoperability-for-the-discovery-use-and-re-use-of-units-of-scholarly-communication/
•  Memento is about the Web and time:
•  Resources evolve over time
•  Only the current representation is
available from a resource’s URI
•  How to seamlessly access prior
representations, if they exist?
•  Memento looks at this problem for the Web,
in general
Digital Preservation Award 2010
2009
Funded by the Library of Congress
http://www.mementoweb.org/
URI for Original, URI for Version
URI-M - http://web.archive.org/web/20010911203610/http://www.cnn.com/
Web Archive
URI-R - http://www.cnn.com/
URI for Original, URI for Version
URI-M - http://en.wikipedia.org/w/index.php?title=September_11_attacks&oldid=282333
CMS
URI-R - http://en.wikipedia.org/wiki/September_11_attacks
Time Travel for the Web: Demo
http://www.mementoweb.org/demo/Memento_Time_Travel.mov
Memento & Research Objects
•  The combination of:
•  Pro-active archiving of Research Objects and their constituent
resources, using
•  Web archiving techniques, e.g. crawling, transactional
archiving
•  Platforms with strong versioning capabilities, e.g. datawikis,
github
•  Assigning URIs to Research Objects and their constituent
resources according to the well-established time-generic (URI-R)
and time-specific (URI-M) resource pattern
•  The Memento protocol to access time-specific versions of
Research Objects and their constituent resources via their
time-generic URI and timestamp
makes a good candidate for addressing the versioning and evolution
need.
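Access via a time-generic URI and a timestamp can be sketched as a lookup that picks the version closest to the requested datetime. This is a simplified simulation, not the protocol itself: it collapses the TimeGate redirect and the version's own response into one step, the holdings table and example.org URIs are hypothetical, and only the Accept-Datetime and Memento-Datetime header names come from Memento.

```python
from datetime import datetime, timezone
from email.utils import format_datetime, parsedate_to_datetime

# Hypothetical archive holdings: URI-R -> list of (version datetime, URI-M).
HOLDINGS = {
    "http://example.org/ro/1": [
        (datetime(2012, 3, 1, tzinfo=timezone.utc), "http://example.org/ro/1/v1"),
        (datetime(2012, 9, 15, tzinfo=timezone.utc), "http://example.org/ro/1/v2"),
        (datetime(2013, 2, 10, tzinfo=timezone.utc), "http://example.org/ro/1/v3"),
    ]
}

def timegate(uri_r, accept_datetime):
    """Pick the version closest in time to the Accept-Datetime value."""
    wanted = parsedate_to_datetime(accept_datetime)
    when, uri_m = min(HOLDINGS[uri_r], key=lambda m: abs(m[0] - wanted))
    return {
        "Location": uri_m,
        "Memento-Datetime": format_datetime(when, usegmt=True),
    }

response = timegate("http://example.org/ro/1", "Mon, 01 Oct 2012 00:00:00 GMT")
```

The client never needs to know how a platform names its versions. It only needs the time-generic URI and the datetime it is interested in.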
Research Objects
http://www.researchobject.org/
Research Objects: Provenance
“The issue of provenance, and being able to audit
experiments and investigations is key to the scientific
method. Third parties must be able to audit the steps
performed in an experiment in order to be convinced of the
validity of results. Audit is required not just for regulatory
purposes, but allows for the results of experiments to be
interpreted and reused, thus a Research Object should
provide sufficient information to support audit of the
aggregation as a whole, its constituent parts, and any
process that it may encapsulate.”
Bechhofer S. et al (2010) Research Objects: Towards Exchange and Reuse of Digital
Knowledge http://dx.doi.org/10.1038/npre.2010.4626.1
Van de Sompel, H. (2003) Roadblocks http://www.sis.pitt.edu/~dlwkshop/paper_sompel.html
Provenance
Moreau, L. et al. (2010) The Open Provenance Model: Abstract Model
http://eprints.ecs.soton.ac.uk/21449/
Open Provenance Model
W3C Provenance
http://www.w3.org/TR/prov-primer/
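The audit requirement can be illustrated with a few provenance statements and a function that walks them backwards. This is a minimal sketch: the relation names follow PROV terms (wasGeneratedBy, used, wasAssociatedWith), while the file names, activity names, agent, and the `lineage` helper are hypothetical.

```python
# Each statement is (subject, relation, object); relation names follow PROV terms.
STATEMENTS = [
    ("figure.png", "wasGeneratedBy", "plot-run"),
    ("plot-run", "used", "results.csv"),
    ("results.csv", "wasGeneratedBy", "analysis-run"),
    ("analysis-run", "used", "raw-data.csv"),
    ("analysis-run", "wasAssociatedWith", "alice"),
]

def lineage(entity, statements):
    """Return every entity that `entity` ultimately depends on, in discovery order."""
    inputs, seen, queue = [], {entity}, [entity]
    while queue:
        current = queue.pop(0)
        for subject, relation, obj in statements:
            if subject != current or relation not in ("wasGeneratedBy", "used"):
                continue
            if relation == "used" and obj not in inputs:
                inputs.append(obj)  # an entity consumed along the way
            if obj not in seen:
                seen.add(obj)
                queue.append(obj)
    return inputs
```

Calling `lineage("figure.png", STATEMENTS)` walks from the figure through the plotting and analysis activities back to the raw data, which is the kind of step-by-step audit a third party would need.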
Research Objects
http://www.researchobject.org/
W3C PROV
The Clean Slate Challenge
•  ResourceSync is about synchronization of
web resources, things with a URI that can
be dereferenced
•  Small websites/repositories (a few
resources) to large repositories/datasets/
linked data collections (many millions of
resources)
•  Low change frequency (weeks/months) to
high change frequency (seconds)
•  Synchronization latency and accuracy
needs may vary
•  Modular framework based on Sitemaps and
extensions
2012
Funded by the Sloan Foundation
http://www.openarchives.org/rs/
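The Sitemap basis of the framework can be sketched with the standard library. The example below uses only plain Sitemap elements (urlset, url, loc, lastmod) and leaves out the ResourceSync extensions. The function names, example.org URIs, and the idea of comparing lastmod strings against a local mirror are illustrative assumptions.

```python
import xml.etree.ElementTree as ET

SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"

def build_resource_list(resources):
    """Serialize {uri: lastmod} as a plain Sitemap <urlset> document."""
    urlset = ET.Element("{%s}urlset" % SITEMAP_NS)
    for uri, lastmod in sorted(resources.items()):
        url = ET.SubElement(urlset, "{%s}url" % SITEMAP_NS)
        ET.SubElement(url, "{%s}loc" % SITEMAP_NS).text = uri
        ET.SubElement(url, "{%s}lastmod" % SITEMAP_NS).text = lastmod
    return ET.tostring(urlset, encoding="unicode")

def resources_to_fetch(sitemap_xml, local_copy):
    """Compare the source's list with a local {uri: lastmod} mirror."""
    root = ET.fromstring(sitemap_xml)
    stale = []
    for url in root.findall("{%s}url" % SITEMAP_NS):
        uri = url.find("{%s}loc" % SITEMAP_NS).text
        lastmod = url.find("{%s}lastmod" % SITEMAP_NS).text
        if local_copy.get(uri) != lastmod:
            stale.append(uri)  # new or changed since the last sync
    return stale

# Hypothetical source holdings.
source = {
    "http://example.org/a": "2013-01-02T00:00:00Z",
    "http://example.org/b": "2013-01-03T00:00:00Z",
}
listing = build_resource_list(source)
```

A destination that polls such a list can keep a small site in sync. Higher change frequencies and larger collections are where the framework's other modules come in.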
•  Investigates reference rot at massive scale:
•  Citation rot - Do HTTP references in
scholarly articles still resolve?
•  Content rot - If so, is the content at the
end of the HTTP reference still
representative of the content that was
originally referenced?
•  Investigates pro-active ways to archive
HTTP referenced resources that occur in
scholarly articles
2013
hiberlink
Funded by the Mellon Foundation
Soon at http://www.hiberlink.org
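The two kinds of rot can be told apart with a simple check. This sketch is not the project's method: the similarity measure (difflib's ratio), the 0.8 threshold, and the function name are assumptions chosen only to illustrate the distinction between a reference that no longer resolves and one that resolves to different content.

```python
from difflib import SequenceMatcher

def classify_reference(resolves, archived_text="", current_text="", threshold=0.8):
    """Label a reference as 'citation rot', 'content rot', or 'ok'.

    `threshold` is an arbitrary illustrative cutoff for "still representative".
    """
    if not resolves:
        return "citation rot"
    # Compare what was originally referenced with what is served today.
    similarity = SequenceMatcher(None, archived_text, current_text).ratio()
    return "content rot" if similarity < threshold else "ok"
```

Content rot can only be detected if a snapshot of the originally referenced content exists, which is why pro-active archiving matters.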
Research Objects
http://www.researchobject.org/ http://www.wf4ever-project.org/
http://www.w3.org/community/rosc/
Collecting the organizational scholarly recordHerbert Van de Sompel
672 views69 slides

More from Herbert Van de Sompel(20)

Researcher Pod: Scholarly Communication Using the Decentralized Web by Herbert Van de Sompel
Researcher Pod: Scholarly Communication Using the Decentralized WebResearcher Pod: Scholarly Communication Using the Decentralized Web
Researcher Pod: Scholarly Communication Using the Decentralized Web
Registration / Certification Interoperability Architecture (overlay peer-review) by Herbert Van de Sompel
Registration / Certification Interoperability Architecture (overlay peer-review)Registration / Certification Interoperability Architecture (overlay peer-review)
Registration / Certification Interoperability Architecture (overlay peer-review)
Persistent Identifiers and the Web: The Need for an Unambiguous Mapping by Herbert Van de Sompel
Persistent Identifiers and the Web: The Need for an Unambiguous MappingPersistent Identifiers and the Web: The Need for an Unambiguous Mapping
Persistent Identifiers and the Web: The Need for an Unambiguous Mapping

Recently uploaded

KVM Security Groups Under the Hood - Wido den Hollander - Your.Online by
KVM Security Groups Under the Hood - Wido den Hollander - Your.OnlineKVM Security Groups Under the Hood - Wido den Hollander - Your.Online
KVM Security Groups Under the Hood - Wido den Hollander - Your.OnlineShapeBlue
181 views19 slides
Updates on the LINSTOR Driver for CloudStack - Rene Peinthor - LINBIT by
Updates on the LINSTOR Driver for CloudStack - Rene Peinthor - LINBITUpdates on the LINSTOR Driver for CloudStack - Rene Peinthor - LINBIT
Updates on the LINSTOR Driver for CloudStack - Rene Peinthor - LINBITShapeBlue
166 views8 slides
Setting Up Your First CloudStack Environment with Beginners Challenges - MD R... by
Setting Up Your First CloudStack Environment with Beginners Challenges - MD R...Setting Up Your First CloudStack Environment with Beginners Challenges - MD R...
Setting Up Your First CloudStack Environment with Beginners Challenges - MD R...ShapeBlue
132 views15 slides
Migrating VMware Infra to KVM Using CloudStack - Nicolas Vazquez - ShapeBlue by
Migrating VMware Infra to KVM Using CloudStack - Nicolas Vazquez - ShapeBlueMigrating VMware Infra to KVM Using CloudStack - Nicolas Vazquez - ShapeBlue
Migrating VMware Infra to KVM Using CloudStack - Nicolas Vazquez - ShapeBlueShapeBlue
176 views20 slides
Uni Systems for Power Platform.pptx by
Uni Systems for Power Platform.pptxUni Systems for Power Platform.pptx
Uni Systems for Power Platform.pptxUni Systems S.M.S.A.
61 views21 slides
The Role of Patterns in the Era of Large Language Models by
The Role of Patterns in the Era of Large Language ModelsThe Role of Patterns in the Era of Large Language Models
The Role of Patterns in the Era of Large Language ModelsYunyao Li
80 views65 slides

Recently uploaded(20)

KVM Security Groups Under the Hood - Wido den Hollander - Your.Online by ShapeBlue
KVM Security Groups Under the Hood - Wido den Hollander - Your.OnlineKVM Security Groups Under the Hood - Wido den Hollander - Your.Online
KVM Security Groups Under the Hood - Wido den Hollander - Your.Online
ShapeBlue181 views
Updates on the LINSTOR Driver for CloudStack - Rene Peinthor - LINBIT by ShapeBlue
Updates on the LINSTOR Driver for CloudStack - Rene Peinthor - LINBITUpdates on the LINSTOR Driver for CloudStack - Rene Peinthor - LINBIT
Updates on the LINSTOR Driver for CloudStack - Rene Peinthor - LINBIT
ShapeBlue166 views
Setting Up Your First CloudStack Environment with Beginners Challenges - MD R... by ShapeBlue
Setting Up Your First CloudStack Environment with Beginners Challenges - MD R...Setting Up Your First CloudStack Environment with Beginners Challenges - MD R...
Setting Up Your First CloudStack Environment with Beginners Challenges - MD R...
ShapeBlue132 views
Migrating VMware Infra to KVM Using CloudStack - Nicolas Vazquez - ShapeBlue by ShapeBlue
Migrating VMware Infra to KVM Using CloudStack - Nicolas Vazquez - ShapeBlueMigrating VMware Infra to KVM Using CloudStack - Nicolas Vazquez - ShapeBlue
Migrating VMware Infra to KVM Using CloudStack - Nicolas Vazquez - ShapeBlue
ShapeBlue176 views
The Role of Patterns in the Era of Large Language Models by Yunyao Li
The Role of Patterns in the Era of Large Language ModelsThe Role of Patterns in the Era of Large Language Models
The Role of Patterns in the Era of Large Language Models
Yunyao Li80 views
VNF Integration and Support in CloudStack - Wei Zhou - ShapeBlue by ShapeBlue
VNF Integration and Support in CloudStack - Wei Zhou - ShapeBlueVNF Integration and Support in CloudStack - Wei Zhou - ShapeBlue
VNF Integration and Support in CloudStack - Wei Zhou - ShapeBlue
ShapeBlue163 views
Confidence in CloudStack - Aron Wagner, Nathan Gleason - Americ by ShapeBlue
Confidence in CloudStack - Aron Wagner, Nathan Gleason - AmericConfidence in CloudStack - Aron Wagner, Nathan Gleason - Americ
Confidence in CloudStack - Aron Wagner, Nathan Gleason - Americ
ShapeBlue88 views
Keynote Talk: Open Source is Not Dead - Charles Schulz - Vates by ShapeBlue
Keynote Talk: Open Source is Not Dead - Charles Schulz - VatesKeynote Talk: Open Source is Not Dead - Charles Schulz - Vates
Keynote Talk: Open Source is Not Dead - Charles Schulz - Vates
ShapeBlue210 views
Digital Personal Data Protection (DPDP) Practical Approach For CISOs by Priyanka Aash
Digital Personal Data Protection (DPDP) Practical Approach For CISOsDigital Personal Data Protection (DPDP) Practical Approach For CISOs
Digital Personal Data Protection (DPDP) Practical Approach For CISOs
Priyanka Aash153 views
CloudStack and GitOps at Enterprise Scale - Alex Dometrius, Rene Glover - AT&T by ShapeBlue
CloudStack and GitOps at Enterprise Scale - Alex Dometrius, Rene Glover - AT&TCloudStack and GitOps at Enterprise Scale - Alex Dometrius, Rene Glover - AT&T
CloudStack and GitOps at Enterprise Scale - Alex Dometrius, Rene Glover - AT&T
ShapeBlue112 views
Backup and Disaster Recovery with CloudStack and StorPool - Workshop - Venko ... by ShapeBlue
Backup and Disaster Recovery with CloudStack and StorPool - Workshop - Venko ...Backup and Disaster Recovery with CloudStack and StorPool - Workshop - Venko ...
Backup and Disaster Recovery with CloudStack and StorPool - Workshop - Venko ...
ShapeBlue144 views
How to Re-use Old Hardware with CloudStack. Saving Money and the Environment ... by ShapeBlue
How to Re-use Old Hardware with CloudStack. Saving Money and the Environment ...How to Re-use Old Hardware with CloudStack. Saving Money and the Environment ...
How to Re-use Old Hardware with CloudStack. Saving Money and the Environment ...
ShapeBlue123 views
Data Integrity for Banking and Financial Services by Precisely
Data Integrity for Banking and Financial ServicesData Integrity for Banking and Financial Services
Data Integrity for Banking and Financial Services
Precisely78 views
Import Export Virtual Machine for KVM Hypervisor - Ayush Pandey - University ... by ShapeBlue
Import Export Virtual Machine for KVM Hypervisor - Ayush Pandey - University ...Import Export Virtual Machine for KVM Hypervisor - Ayush Pandey - University ...
Import Export Virtual Machine for KVM Hypervisor - Ayush Pandey - University ...
ShapeBlue79 views
Future of AR - Facebook Presentation by Rob McCarty
Future of AR - Facebook PresentationFuture of AR - Facebook Presentation
Future of AR - Facebook Presentation
Rob McCarty62 views
DRaaS using Snapshot copy and destination selection (DRaaS) - Alexandre Matti... by ShapeBlue
DRaaS using Snapshot copy and destination selection (DRaaS) - Alexandre Matti...DRaaS using Snapshot copy and destination selection (DRaaS) - Alexandre Matti...
DRaaS using Snapshot copy and destination selection (DRaaS) - Alexandre Matti...
ShapeBlue98 views
Automating a World-Class Technology Conference; Behind the Scenes of CiscoLive by Network Automation Forum
Automating a World-Class Technology Conference; Behind the Scenes of CiscoLiveAutomating a World-Class Technology Conference; Behind the Scenes of CiscoLive
Automating a World-Class Technology Conference; Behind the Scenes of CiscoLive
Extending KVM Host HA for Non-NFS Storage - Alex Ivanov - StorPool by ShapeBlue
Extending KVM Host HA for Non-NFS Storage -  Alex Ivanov - StorPoolExtending KVM Host HA for Non-NFS Storage -  Alex Ivanov - StorPool
Extending KVM Host HA for Non-NFS Storage - Alex Ivanov - StorPool
ShapeBlue84 views

A Clean Slate?

  • 1. A Clean Slate? @hvdsomp http://public.lanl.gov/herbertv/ herbert van de sompel Includes slides by Sean Bechhofer, Carole Goble, Robert Sanderson
  • 2. paper-based scholarly communication system scanned version of paper-based scholarly communication system natively digital, web-based, scholarly communication system Context of My Work, My Talk painful transition
  • 3. In Silico (Computational) Science Datasets Data collections Algorithms Configurations Tools and Apps Codes Code Libraries Services, Infrastructure, Compilers Hardware Simulations, data exploration, data processing, analytics, database based, text mining, auto recommendation, visual analytics…Actually Digital Science is just Science Carole Goble, JCDL 2012 Keynote https://dl.dropbox.com/u/617206/JCDL2012keynoteGoble.ppt
  • 4. Scientific Workflows, Services, Data, Workflow Engines   Carole Goble, JCDL 2012 Keynote https://dl.dropbox.com/u/617206/JCDL2012keynoteGoble.ppt All components continuously in flux. How to reproduce results in such an environment?
  • 5. A Lot of Rs for Reproducibility •  Rerun re-execute original experiment using revised setting. •  Review Validate and justify the results empirically. Trust. Understand. Train. Convincing and comfort •  Replicate / Repeat Exactly replicate the original experiment. Eliminate change. •  Reproduce Run experiment with differences in elements (materials, methods, platform or setting) and compare to test for same result. •  Replay Run through what happened using logs without original platform or need to execute. Carole Goble, JCDL 2012 Keynote https://dl.dropbox.com/u/617206/JCDL2012keynoteGoble.ppt
  • 6. A Lot of Rs for Reuse •  Refresh execute an upgraded original experiment. •  Reconstruct rebuild using new elements or different platform when they are lost/unavailable/inaccessible •  Reuse use as part of new experiments. •  Repurpose/Reassemble reuse elements in a new experiment Carole Goble, JCDL 2012 Keynote https://dl.dropbox.com/u/617206/JCDL2012keynoteGoble.ppt
  • 7. The Article is the Knowledge Bottleneck “An article about computational science in a scientific publication is not the scholarship itself, it is merely advertising of the scholarship. The actual scholarship is the complete software development environment, [the complete data] and the complete set of instructions which generated the figures.” Buckheit, J. and Donoho, D. (1995) Wavelab and reproducible research http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.3.2982
  • 8. The Article is the Knowledge Bottleneck “Changes are occurring in the ways in which scientific research is conducted. Within e-laboratories, methods such as scientific workflows, research protocols, standard operating procedures and algorithms for analysis and simulation are used to manipulate and produce data. Experimental or observational data and scientific models are typically born digital with no physical counterpart. This move to digital content is driving a sea-change in scientific publication, and challenging traditional scholarly publication.” Bechhofer S. et al (2010) Research Objects: Towards Exchange and Reuse of Digital Knowledge http://dx.doi.org/10.1038/npre.2010.4626.1
  • 9. •  Involved in each such experiment is a complex set of resources with complex relationships •  There is a need to share these resources in order to support forms of reuse, reproducibility •  This entails the augmentation of the scholarly record with an explicit account of the research process •  Digital exchange of each resource individually is trivial, exchange of the combined knowledge is not •  Traditional electronic publications cannot handle this job •  Targeted at humans, not machines •  Communicates findings, not all scientific knowledge behind the findings •  Content not decomposable into actionable units •  Outputs, results, methods not reusable If not the Article, then What? Bechhofer S. et al (2010) Research Objects: Towards Exchange and Reuse of Digital Knowledge http://dx.doi.org/10.1038/npre.2010.4626.1
  • 10. The Clean Slate Challenge
  • 11. The Clean Slate Challenge Add features to support these needs to the existing scholarly communication system?
  • 12. The Clean Slate Challenge Start with a clean slate?
  • 14. Research Objects: Aggregated Content •  Data used or results produced in an experiment study •  Methods employed to produce and analyze that data •  Provenance and setting information about the experiments •  People involved in the investigation •  Annotations about these resources, that are essential to the understanding and interpretation of the scientific outcomes captured by a research object. http://www.researchobject.org/
  • 17. Research Objects: Aggregation “Research Objects are aggregations of content. Thus a Research Object framework needs to provide a mechanism for this aggregation. Aggregations are likely to include references to resources but there may also, however, be situations, where, for reasons of efficiency or in order to support persistence, Research Objects should also be able to aggregate literal data as well as references to data.” Bechhofer S. et al (2010) Research Objects: Towards Exchange and Reuse of Digital Knowledge http://dx.doi.org/10.1038/npre.2010.4626.1
  • 18. •  OAI-ORE observation: Scholarly assets are rapidly becoming compound, consisting of multiple resources •  e.g. datasets, software, ontologies, workflows, online debate, slides, blogs, videos, etc. with various: •  Relationships •  Interdependencies •  How to convey this compound-ness in an interoperable manner so that applications can access, consume such assets? 2007   Funded by the Mellon Foundation & Microsoft Research http://www.openarchives.org/ore/
  • 21. Foundations of the ORE Solution •  Web Architecture - Resource, URI, Representation •  Semantic Web: •  URIs for documents (information resources), •  URIs for physical entities, concepts, abstractions (non-information resources) •  RDF – to express properties, relationships pertaining to resources •  Linked Data: •  HTTP URIs for both information and non-information resources •  HTTP 303 redirect: •  From: The HTTP URI of non-information resource •  To: The HTTP URI of an information resource that describes the non-information resource
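The HTTP 303 pattern listed above can be illustrated with a small in-memory sketch. It makes no network calls; the `example.org` URIs, the `DESCRIBED_BY` table, and the `dereference` and `resolve` helpers are hypothetical names introduced here for illustration and are not taken from the ORE specifications.

```python
# Hypothetical mapping from a non-information resource URI (e.g. an
# aggregation) to the URI of the document that describes it.
DESCRIBED_BY = {
    "http://example.org/aggregation/1": "http://example.org/rem/1",
}


def dereference(uri):
    """Answer a GET the way a Linked Data server might: (status, headers)."""
    if uri in DESCRIBED_BY:
        # Non-information resource: it has no representation of its own,
        # so the server redirects to the describing information resource.
        return 303, {"Location": DESCRIBED_BY[uri]}
    if uri in DESCRIBED_BY.values():
        # Information resource: a representation can be served directly.
        return 200, {"Content-Type": "application/rdf+xml"}
    return 404, {}


def resolve(uri, max_hops=5):
    """Follow 303 redirects until something other than a 303 comes back."""
    for _ in range(max_hops):
        status, headers = dereference(uri)
        if status != 303:
            return status, uri
        uri = headers["Location"]
    raise RuntimeError("too many redirects")
```

Calling `resolve("http://example.org/aggregation/1")` ends at the describing document, while the aggregation URI itself never returns a representation.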
  • 30. Adding Account of Research Life Cycles to Scholarly Record Pepe, A., Mayernik, M., Borgman, C., Van de Sompel, H. (2009) Technology to Represent Scientific Practice: Data, Life Cycles, and Value Chains. http://dx.doi.org/10.1002/asi.21263
  • 31. ORE & Research Objects “…, Research Objects should also be able to aggregate literal data as well as references to data.” •  Aggregated Resources in ORE have HTTP URIs; probably needs to be relaxed. •  Embedding content in RDF, irrespective of ORE, is … interesting •  See: Representing Content in RDF 1.0 http://www.w3.org/TR/Content-in-RDF10/ •  Allows embedding base64, text, XML •  Resource Map as manifest in e.g. ZIP file?
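The closing question on this slide, a Resource Map acting as a manifest inside a ZIP file, might look roughly like the sketch below. This is an assumption-laden illustration: the `manifest.json` name and its JSON layout are invented here and are not an ORE serialization, and `build_bundle` and `read_manifest` are hypothetical helpers. It shows one package carrying both literal data (files inside the ZIP) and references (URIs listed only in the manifest).

```python
import io
import json
import zipfile


def build_bundle(files, external_refs):
    """Pack literal files plus a manifest listing everything aggregated.

    files: dict of {path inside the ZIP: text content}
    external_refs: list of URIs that are referenced but not embedded
    """
    manifest = {
        "aggregates": (
            [{"path": name} for name in sorted(files)]
            + [{"uri": uri} for uri in external_refs]
        )
    }
    buf = io.BytesIO()
    with zipfile.ZipFile(buf, "w") as zf:
        # Hypothetical manifest name and format, for illustration only.
        zf.writestr("manifest.json", json.dumps(manifest, indent=2))
        for name, data in files.items():
            zf.writestr(name, data)
    return buf.getvalue()


def read_manifest(bundle_bytes):
    """Read the manifest back out of a bundle produced by build_bundle."""
    with zipfile.ZipFile(io.BytesIO(bundle_bytes)) as zf:
        return json.loads(zf.read("manifest.json"))
```

A consumer would open the manifest first and then decide which aggregated items to read from the archive and which to fetch by URI.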
  • 33. Research Objects: Annotation “Annotations about these resources, that are essential to the understanding and interpretation of the scientific outcomes captured by a research object.” http://www.researchobject.org/
  • 34. •  Annotation is a pervasive scholarly activity, conducted by people and machines •  Many annotation efforts and tools •  But annotations stuck in silos: •  Only consumable by client that created it •  Annotations not shareable beyond original environment •  Open Annotation focuses on interoperability for annotations in order to allow sharing of annotations across: •  Annotation clients •  Content collections •  Services that leverage annotations 2009 Funded by the Mellon Foundation http://www.openannotation.org/spec/core/
  • 35. •  Established to reconcile Open Annotation Collaboration and Annotation Ontology models •  67 participants from around the world: 7th of 119 groups Many universities, also commercial and not-for-profit •  Mission: Interoperability between Annotation systems and platforms, by …following the Architecture of the Web …reusing existing web standards …providing a single, coherent model to implement …without requiring adoption of specific platforms …while maintaining low implementation costs W3C Open Annotation Community Group http://www.w3.org/community/openannotation/
  • 36. “An Annotation is considered to be a set of connected resources, typically including a body and target, where the body is related to the target.” Highlighting, Bookmarking Commenting, Describing Tagging, Linking Classifying, Identifying Questioning, Replying Editing, Moderating …Provide an Aide-Memoire …Share and Inform …Improve Discovery …Organize Resources …Interact with Others …Create as well as Consume What is an Annotation? http://www.w3.org/community/openannotation/
  • 39. Basic Open Annotation Data Model
  • 44. Specific Body and Specific Target resources identify the region of interest, and/or the state of the resource. Need to be able to describe the state of the resource, the segment of interest, and potentially styling hints for how to render it. Open Annotation introduces: State Describes how to retrieve representation Selector Describes how to select segment Style Describes how to render/process segment Scope Describes context of the resource Further Specification of Resources
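The body, target and selector roles described above can be sketched with plain dictionaries. The `oa:` property names follow one reading of the Open Annotation vocabulary and should be treated as illustrative rather than normative; `make_annotation` and `select_segment` are hypothetical helpers, and the `example.org` URIs are placeholders.

```python
def make_annotation(body_uri, source_uri, exact, prefix="", suffix=""):
    """Build an annotation whose target is a segment of a source resource."""
    return {
        "@type": "oa:Annotation",
        "oa:hasBody": body_uri,
        "oa:hasTarget": {
            # Specific Target: the source plus a selector for the segment.
            "@type": "oa:SpecificResource",
            "oa:hasSource": source_uri,
            "oa:hasSelector": {
                "@type": "oa:TextQuoteSelector",
                "oa:exact": exact,
                "oa:prefix": prefix,
                "oa:suffix": suffix,
            },
        },
    }


def select_segment(text, selector):
    """Return (start, end) offsets of the selected text, or None if absent."""
    needle = selector["oa:prefix"] + selector["oa:exact"] + selector["oa:suffix"]
    start = text.find(needle)
    if start < 0:
        return None
    begin = start + len(selector["oa:prefix"])
    return begin, begin + len(selector["oa:exact"])
```

Because the selector quotes the text and its surroundings instead of storing offsets, a client can still locate the segment after content elsewhere in the resource has shifted.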
  • 45. Use Case: Changing Content at the Same URI
  • 46. Use Case: Segment of Interest
  • 47. W3C Open Annotation & Research Objects •  Early renderings of Research Objects emerging from the Wf4Ever project use Annotation Ontology as the annotation framework •  But since the Annotation Ontology and Open Annotation Collaboration models now merge into the W3C Open Annotation model, it is safe to assume W3C Open Annotation will be used for Research Objects
  • 49. Research Objects: Versioning and Evolution “Research Objects are dynamic in that their contents can change and be changed – additional contents may be added to aggregations, or additional metadata can be asserted about the contents or relationships between content. The resources that are aggregated may change. Thus there is a need for versioning, allowing the recording of changes to objects, potentially along with facilities for retrieving objects or aggregated elements at particular historical points in their lifecycle.” Bechhofer S. et al (2010) Research Objects: Towards Exchange and Reuse of Digital Knowledge http://dx.doi.org/10.1038/npre.2010.4626.1
  • 50. ORE Experiment: Versioning and Evolution of Compound Objects Van de Sompel, H. et al. (2007) Appendix to Interoperability for the Discovery, Use, and Re-Use of Units of Scholarly Communication http://www.ctwatch.org/quarterly/articles/2007/08/interoperability-for-the-discovery-use-and-re-use-of-units-of-scholarly-communication/
  • 51. •  Memento is about the Web and time: •  Resources evolve over time •  Only the current representation is available from a resource’s URI •  How to seamlessly access prior representations, if they exist? •  Memento looks at this problem for the Web, in general Digital Preservation Award 2010 2009 Funded by the Library of Congress http://www.mementoweb.org/
  • 52. URI for Original, URI for Version URI-M - http://web.archive.org/web/20010911203610/http://www.cnn.com/ Web Archive URI-R - http://www.cnn.com/
  • 53. URI for Original, URI for Version URI-M - http://en.wikipedia.org/w/index.php?title=September_11_attacks&oldid=282333 CMS URI-R - http://en.wikipedia.org/wiki/September_11_attacks
  • 60. Time Travel for the Web: Demo   http://www.mementoweb.org/demo/Memento_Time_Travel.mov
  • 65. Memento & Research Objects •  The combination of: •  Pro-active archiving of Research Objects and their constituent resources, using •  Web archiving techniques, e.g. crawling, transactional archiving •  Platforms with strong versioning capabilities, e.g. datawikis, github •  Assigning URIs to Research Objects and their constituent resources according to the well-established time-generic (URI-R) and time-specific (URI-M) resource pattern •  The Memento protocol to access time-specific versions of Research Objects and their constituent resources via their time-generic URI and timestamp makes a good candidate for addressing the versioning and evolution need.
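Access to a time-specific version via a time-generic URI and a timestamp can be sketched as follows. The `Accept-Datetime` header name comes from the Memento protocol; everything else is an assumption made for illustration: `pick_memento` is a hypothetical helper, it chooses the version nearest in time (one plausible selection policy among several), and the list of known versions would in practice come from an archive or versioning platform.

```python
from datetime import datetime, timezone
from email.utils import format_datetime


def accept_datetime_header(when):
    """Header a client might send to ask for the version at a given time.

    `when` must be a timezone-aware datetime in UTC.
    """
    return {"Accept-Datetime": format_datetime(when, usegmt=True)}


def pick_memento(mementos, when):
    """Choose the URI-M whose datetime is closest to `when`.

    mementos: list of (datetime, uri_m) pairs known for one URI-R.
    Returns None when no versions are known.
    """
    if not mementos:
        return None
    return min(mementos, key=lambda m: abs(m[0] - when))[1]
```

For a Research Object, the same request could be repeated for each constituent resource with one shared timestamp, so that the object is reassembled as it stood at that moment.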
  • 67. Research Objects: Provenance “The issue of provenance, and being able to audit experiments and investigations is key to the scientific method. Third parties must be able to audit the steps performed in an experiment in order to be convinced of the validity of results. Audit is required not just for regulatory purposes, but allows for the results of experiments to be interpreted and reused, thus a Research Object should provide sufficient information to support audit of the aggregation as a whole, its constituent parts, and any process that it may encapsulate.” Bechhofer S. et al (2010) Research Objects: Towards Exchange and Reuse of Digital Knowledge http://dx.doi.org/10.1038/npre.2010.4626.1
  • 68. Van de Sompel, H. (2003) Roadblocks http://www.sis.pitt.edu/~dlwkshop/paper_sompel.html Provenance
  • 69. Moreau, L. et al. (2010) The Open Provenance Model: Abstract Model http://eprints.ecs.soton.ac.uk/21449/ Open Provenance Model
  • 72. The Clean Slate Challenge
  • 73. •  ResourceSync is about synchronization of web resources, things with a URI that can be dereferenced •  Small websites/repositories (a few resources) to large repositories/datasets/ linked data collections (many millions of resources) •  Low change frequency (weeks/months) to high change frequency (seconds) •  Synchronization latency and accuracy needs may vary •  Modular framework based on Sitemaps and extensions 2012   Funded by the Sloan Foundation http://www.openarchives.org/rs/
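A minimal sketch of the Sitemap-based idea: a source publishes a list of its resources with modification times, and a destination compares two such snapshots to decide what to fetch or drop. It emits only plain Sitemap elements (`urlset`, `url`, `loc`, `lastmod`) and none of the ResourceSync extensions; `resource_list` and `changes` are hypothetical helper names, and the comparison logic is an assumption about how a destination might use the lists.

```python
import xml.etree.ElementTree as ET

SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"


def resource_list(resources):
    """Serialize {uri: lastmod} as a plain Sitemap document (a string)."""
    urlset = ET.Element("urlset", xmlns=SITEMAP_NS)
    for uri, lastmod in sorted(resources.items()):
        url = ET.SubElement(urlset, "url")
        ET.SubElement(url, "loc").text = uri
        ET.SubElement(url, "lastmod").text = lastmod
    return ET.tostring(urlset, encoding="unicode")


def changes(old, new):
    """Compare two {uri: lastmod} snapshots of the same source."""
    created = sorted(set(new) - set(old))
    deleted = sorted(set(old) - set(new))
    updated = sorted(u for u in set(old) & set(new) if old[u] != new[u])
    return {"created": created, "updated": updated, "deleted": deleted}
```

Comparing full snapshots suits sources with a low change frequency; at high change frequency a source would more plausibly publish only the changes since the last list.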
  • 74. •  Investigates reference rot at massive scale: •  Citation rot - Do HTTP references in scholarly articles still resolve? •  Content rot - If so, is the content at the end of the HTTP reference still representative of the content that was originally referenced? •  Investigates pro-active ways to archive HTTP referenced resources that occur in scholarly articles 2013   hiberlink Funded by the Mellon Foundation Soon at http://www.hiberlink.org
  • 77. A Clean Slate? @hvdsomp http://public.lanl.gov/herbertv/ herbert van de sompel Includes slides by Sean Bechhofer, Carole Goble, Robert Sanderson