SlideShare a Scribd company logo
1 of 48
Download to read offline
Lessons in Cross-Repository Interoperability
                   learned from
                the aDORe effort


           Herbert Van de Sompel & Jeroen Bekaert
                      Research Library
            Los Alamos National Laboratory, USA




                      Herbert Van de Sompel & Jeroen Bekaert
                 Research Library, Los Alamos National Laboratory
RESEARCH    CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona
LIBRARY
The repository model




   quot;Pattern Recognition: The 2003 OCLC Environmental Scanquot;
        http://www.oclc.org/membership/escan/toc.htm
                     Herbert Van de Sompel & Jeroen Bekaert
                Research Library, Los Alamos National Laboratory
RESEARCH   CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona
LIBRARY
Value chains starting in repositories

• New knowledge is really being created when allowing for non-
  anticipated use of stuff.

• These repositories are not about creating services for local
  users (only)
• These repositories are not about creating a service (user
  interface) for all users

• These repositories are about facilitating the use of materials
  in many contexts
• These repositories are the starting point of value chains




                         Herbert Van de Sompel & Jeroen Bekaert
                    Research Library, Los Alamos National Laboratory
  RESEARCH     CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona
  LIBRARY
• Value
                                                                         chains
                                                                         emerging
                                                                         from RSS
                                                                         feeds




                                        http://www.technorati.com

                     Herbert Van de Sompel & Jeroen Bekaert
                Research Library, Los Alamos National Laboratory
RESEARCH   CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona
LIBRARY
Value chains starting in repositories




                             recombine
                              add value




                       Herbert Van de Sompel & Jeroen Bekaert
                  Research Library, Los Alamos National Laboratory
RESEARCH     CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona
LIBRARY
• Scholarly
                                                                         Communication
                                                                         value chains




 http://dx.doi.org/10.1045/september2004-vandesompel

                     Herbert Van de Sompel & Jeroen Bekaert
                Research Library, Los Alamos National Laboratory
RESEARCH   CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona
LIBRARY
Value chains starting in repositories

To allow value chains to emerge on the basis of materials in
   repositories, those repositories need clear/clean machine
   interfaces that allow downstream applications to consume
   materials, aggregate them, build services, …

⇒ Disconnection of repository content and service: allows for
  creation of both local and remote services
⇒ On-Web: Protocol-oriented interfaces
⇒ These value chains are about the real stuff not (only) about
  metadata




                         Herbert Van de Sompel & Jeroen Bekaert
                    Research Library, Los Alamos National Laboratory
  RESEARCH     CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona
  LIBRARY
Credits

The reported material is based on the following work:
   o  The LANL aDORe repository effort
   o  The upcoming PhD thesis by Jeroen Bekaert (Advisor
      Herbert Van de Sompel) regarding protocol-based
      interfaces for Open Archival Information Systems
      (OAIS)
   o  The NSF-funded Pathways project in collaboration with
      the Information Science group at Cornell University (Carl
      Lagoze, Sandy Payette, Simeon Warner)




                       Herbert Van de Sompel & Jeroen Bekaert
                  Research Library, Los Alamos National Laboratory
  RESEARCH   CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona
  LIBRARY
Outline

                                           aDORe
               A few words about the aDORe architecture


                   A Federation of Repositories
           A new level of cross-repository interoperability


                   Pathways InterDisseminator
A context-sensitive service overlay for a federation of repositories




                            Herbert Van de Sompel & Jeroen Bekaert
                       Research Library, Los Alamos National Laboratory
    RESEARCH      CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona
    LIBRARY
aDORe effort

aDORe is 2 things:
   o Standards-based, modular repository architecture
       - Distributed architecture
       - Protocol-based interactions between modules
       - Applicable to create interoperable federations of
         heterogeneous repositories
   o Actual implementation of the architecture at LANL for
     local storage of digital assets (currently in its 2nd version)

aDORe is not a product
   o Components of aDORe software, usable in other
     environments, will be released


                        Herbert Van de Sompel & Jeroen Bekaert
                   Research Library, Los Alamos National Laboratory
   RESEARCH   CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona
   LIBRARY
aDORe effort



           Standards


     Distributed
                                                                        Insights
     architecture
                                                                           in
                                                                    Cross-Repository
 Protocol-based                                                     Interoperability
 communication




                        Herbert Van de Sompel & Jeroen Bekaert
                   Research Library, Los Alamos National Laboratory
RESEARCH      CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona
LIBRARY
aDORe effort

• Standards used in aDORe include:
   o XML,
   o XML Schema,
   o MPEG-21 Digital Item Declaration,
   o MPEG-21 Digital Item Identification,
   o W3C XML Signatures,
   o OAI-PMH,
   o NISO OpenURL Framework for Context-Sensitive Services,
   o Internet Archive ARC file format,
   o OAIS concepts




                       Herbert Van de Sompel & Jeroen Bekaert
                  Research Library, Los Alamos National Laboratory
  RESEARCH   CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona
  LIBRARY
Compound
              objects

Identifier
 Locator




             Repository
              Registry
OAI-PMH
Federator




               Dynamic
            Dissemination
                Engine




OpenURL
Resolver
OAI-PMH Federator & OpenURL Resolver




                                                                            OAIS     # items
 aDORe      Interface
                                         identifier                         Access      in
front-end   standard
                                                                             Type    response
OAI-PMH
            OAI-PMH                Package Identifier                   OAIS DIP     1 or more
Federator
                              Content Identifier, Package
OpenURL      NISO                                                      OAIS DIP &
                                 Identifier (with XML ID                                 1
Resolver    OpenURL                                                     Result Set
                                        fragment)




                        Herbert Van de Sompel & Jeroen Bekaert
                   Research Library, Los Alamos National Laboratory
 RESEARCH     CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona
 LIBRARY
http://cordra.net

                     Herbert Van de Sompel & Jeroen Bekaert
                Research Library, Los Alamos National Laboratory
RESEARCH   CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona
LIBRARY
Outline

                                           aDORe
               A few words about the aDORe architecture


                   A Federation of Repositories
           A new level of cross-repository interoperability


                   Pathways InterDisseminator
A context-sensitive service overlay for a federation of repositories




                            Herbert Van de Sompel & Jeroen Bekaert
                       Research Library, Los Alamos National Laboratory
    RESEARCH      CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona
    LIBRARY
The interoperable repository model


I will try to show that:
   • a significantly higher level of cross-repository
   interoperability can be achieved with relatively modest means
   • those means are largely available and agreed upon in our
   community

I will introduce:
    • Repository-level requirements
    • Infrastructure-level requirements




                          Herbert Van de Sompel & Jeroen Bekaert
                     Research Library, Los Alamos National Laboratory
   RESEARCH     CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona
   LIBRARY
Part 1 : Requirements for a repository in a federation




                       Herbert Van de Sompel & Jeroen Bekaert
                  Research Library, Los Alamos National Laboratory
  RESEARCH   CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona
  LIBRARY
Repositories & Units of Communication

• Data-oriented research
   => not only textual materials, but also datasets, software,
      simulations, dynamic knowledge presentations, …
• Research results represented by variety of digital media
   ⇒   these media must receive status similar to that of text
      in current system
• Materials in various stages of certification:
   ⇒   units of communication not only ‘papers’ but also
      preprints, raw datasets, prototype simulations, …
• Facilitate collaboration
   ⇒   re-use of units of communications




                          Herbert Van de Sompel & Jeroen Bekaert
                     Research Library, Los Alamos National Laboratory
  RESEARCH      CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona
  LIBRARY
Repositories & Units of Communication

• Handling this requires:
   o a compound object view of a unit of communication
   o stop thinking in terms of metadata versus content

• Compound object:
   o Has a persistent identifier
   o Contain materials and metadata about those materials
   o Can contain other compound objects




                          Herbert Van de Sompel & Jeroen Bekaert
                     Research Library, Los Alamos National Laboratory
  RESEARCH      CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona
  LIBRARY
Compound objects

                         URI_7

                                                            URIs:

                               URI_3                       • minted by different
                                                           repositories
                                                           • from different namespaces
                                                           • not (necessarily) locators
                               URI_9




                        compound object


                     Herbert Van de Sompel & Jeroen Bekaert
                Research Library, Los Alamos National Laboratory
RESEARCH   CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona
LIBRARY
XML-based representation of compound objects

           URI_7                                                             URI_7


                                         MPEG-21 DIDL
             URI_3                          METS                               URI_3
                                           IMS/CP
                                             RDF

             URI_9                                                             URI_9




       compound object                                                         XML-based
                                                                             representation
                         Herbert Van de Sompel & Jeroen Bekaert
                    Research Library, Los Alamos National Laboratory
RESEARCH       CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona
LIBRARY
Repository Interop Interface 1: OAI-PMH & CO




                                                          URI_7



                                                             URI_3



                                                             URI_9




 repository_a           OAI-PMH                                               OAI-PMH
                       baseURL_m                                              harvester

• machine consumption
• batches of compound objects
• OAI-PMH datestamp ~ new version of object
                          Herbert Van de Sompel & Jeroen Bekaert
                     Research Library, Los Alamos National Laboratory
   RESEARCH     CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona
   LIBRARY
OAI-PMH interface to OAIS (Jeroen Bekaert)

agent


           baseURL(OAIPMH_CIID)?
           verb=ListMetadataFormats

           list of DIP formats ( ListMetadataFormats response)



           baseURL(OAIPMH_CIID)?
           verb=ListRecords&
           metadataPrefix= info:pathways/svc/dip.rdf

                         list of DIPs (derived from most recent AIPs)




                        Herbert Van de Sompel & Jeroen Bekaert
                   Research Library, Los Alamos National Laboratory
RESEARCH      CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona
LIBRARY
Repository Interop Interface 1: OAI-PMH & CO

                                        URI_12
                      add value             URI_7

                      recombine
            URI_7


                                              URI_3
              URI_3


                                              URI_9
              URI_9




OAI-PMH                                                                    repository_b   OAI-PMH
harvester                                                                                 baseURL_n



   • include provenance ~ version of compound object

                                 Herbert Van de Sompel & Jeroen Bekaert
                            Research Library, Los Alamos National Laboratory
       RESEARCH        CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona
       LIBRARY
Repository Interop Interface 2: OpenURL & CO

                                                                              URI_7



                                                                                URI_3


                       OpenURL                                                  URI_9



                                         baseURL_x?
                                          url_ver=Z39.88-2004 &
                                          rft_id=URI_7 &
                                          svc_id=info:pathways/svc/dip.*

 repository_n         OpenURL
                     baseURL_o


• machine (& human) consumption
• single object dissemination ~ identifier of compound object

                          Herbert Van de Sompel & Jeroen Bekaert
                     Research Library, Los Alamos National Laboratory
    RESEARCH    CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona
    LIBRARY
Repository Interop Interface 2: OpenURL & CO

• ServiceType = Request a representation of the DO
  expressed using a compound object format
   o Example:
      - svc_id = info:pathways/svc/dip.didl (request MPEG-21
        DIDL representation)
      - svc_id = info:pathways/svc/dip.mets (request METS
        representation)
      - svc_id = info:pathways/svc/dip.rdf (request RDF
        representation – see later)

• Other Entities could be added to Interface #2 (think
  Requester)



                       Herbert Van de Sompel & Jeroen Bekaert
                  Research Library, Los Alamos National Laboratory
  RESEARCH   CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona
  LIBRARY
Repository Interop Interface 2: OpenURL & CO


                                            • independent of nature of identifiers
                                            • ‘resolution’ independent of scheme-
                      OpenURL               specific mechanisms
                                            • conceptual interface is persistent
                                            over time
                                                 • KEV & HTTP (REST)
                                                 • XML & SOAP
                                                 •…

repository_n         OpenURL
                    baseURL_o




                         Herbert Van de Sompel & Jeroen Bekaert
                    Research Library, Los Alamos National Laboratory
  RESEARCH     CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona
  LIBRARY
OpenURL interface to OAIS (Jeroen Bekaert)

agent
        BaseURL(OpenURL_CIID)?                               OpenURL
        url_ver=Z39.88-2004&
        rft_id=ContentInfoIdentifier&
        svc_id= info:pathways/svc/dip


                                    list of ContextObjects

        BaseURL(OpenURL_CIID)?                                         for each AIP (version)
        url_ver=Z39.88-2004&                                           BaseURL(OpenURL_CIID)?
        rft_id=ContentInfoIdentifier&                                  url_ver=Z39.88-2004&
        rft_val_fmt= info:ofi/fmt:kev:mtx:pathways &                   rft_id=ContentInfoIdentifier&
        rft.aip= AIPIdentifier &                                       rft_val_fmt= info:ofi/fmt:kev:mtx:pathways &
        svc_id=info:pathways/svc/dip                                   rft.aip= AIPIdentifier &
                                                                       svc_id=info:pathways/svc/dip
                                    list of ContextObjects

        BaseURL(OpenURL_CIID)?                                         for each DIP format
        url_ver=Z39.88-2004&                                           BaseURL(OpenURL_CIID)?
        rft_id=ContentInfoIdentifier&                                  url_ver=Z39.88-2004&
        rft_val_fmt=info:ofi/fmt:kev:mtx:pathways&                     rft_id=ContentInfoIdentifier&
        rft.aip=AIPIdentifier&                                         rft_val_fmt=info:ofi/fmt:kev:mtx:pathways&
        svc_id= info:pathways/svc/dip.rdf                              rft.aip=AIPIdentifier&
                                                                       svc_id= info:pathways/svc/dip.*
                                               DIP (RDF)




                                    Herbert Van de Sompel & Jeroen Bekaert
                               Research Library, Los Alamos National Laboratory
        RESEARCH          CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona
        LIBRARY
Part 2 :Requirements for an infrastructure
       supporting a federation of repositories




                     Herbert Van de Sompel & Jeroen Bekaert
                Research Library, Los Alamos National Laboratory
RESEARCH   CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona
LIBRARY
Repository Registry: Who is part of the Federation?

            register

                                    Repository
                                     Registry


                                    Per Repository:

                                   • Repository identifier

                                   • baseURL of OAI-PMH interface
                                   • baseURL of OpenURL interface

                                   • whichever kind of information that helps
                                   downstream applications understand about
                                   the nature of the repository
                         Herbert Van de Sompel & Jeroen Bekaert
                    Research Library, Los Alamos National Laboratory
 RESEARCH      CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona
 LIBRARY
Object Registry: What is part of the Federation?
                                 Per compound object:

                                • Object identifier
                                • Object datetime ~ OAI-PMH datestamp
                                • OAI-PMH identifier
                                • Repository identifier

                                of the object itself, and of its contained
                                objects

                                                           SRU
                                                           SRW
                                   Object
                                   Registry               handle
       harvest
    (identifiers)

                      Herbert Van de Sompel & Jeroen Bekaert
                 Research Library, Los Alamos National Laboratory
RESEARCH    CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona
LIBRARY
OAI-PMH & OpenURL access to objects in federation


                 Repository                                                   URI_7

                  Registry
                                                                                URI_3



                                                                                URI_9




                                                       • List of existing copies
                                                       • Per copy:
                                           SRU              • OAI-PMH access info
                                           SRW
                   Object                                   • OpenURL access info
                   Registry               handle
                                                                           URI_7



                       Herbert Van de Sompel & Jeroen Bekaert
                  Research Library, Los Alamos National Laboratory
  RESEARCH   CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona
  LIBRARY
Part 3 : Summary of requirements




                      Herbert Van de Sompel & Jeroen Bekaert
                 Research Library, Los Alamos National Laboratory
RESEARCH    CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona
LIBRARY
Summary of requirements

     Requirement                       Repository                           Infrastructure

Compound Object                               X
model support
XML-based                                     X                                   ?
representations
support
OAI-PMH CO support                            X

OpenURL CO support                            X

Repository Registry                                                               X

Object Registry                                                                   X


                        Herbert Van de Sompel & Jeroen Bekaert
                   Research Library, Los Alamos National Laboratory
   RESEARCH   CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona
   LIBRARY
Summary of requirements

Many variations on the design possible, yet most of this can be
  achieved with:
• Off-the-shelf tools
   o  OAI-PMH tools
   o  Handle system, SRU/W tools
   o  OpenURL tools
   o  Tools to generate XML-based representations of objects
• Surprisingly little effort
• A feasible amount of coordination/specification
• Some shared infrastructure




                         Herbert Van de Sompel & Jeroen Bekaert
                    Research Library, Los Alamos National Laboratory
   RESEARCH    CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona
   LIBRARY
The Good News ™

• Microsoft & Mellon Foundation interested in taking
  interoperability across repositories to a new level
• Meeting is being planned to consult with the major
  stakeholders on identifying, specifying, and implementing
  concrete ways forward
• Tony Hey has (t)asked me to call that meeting




                       Herbert Van de Sompel & Jeroen Bekaert
                  Research Library, Los Alamos National Laboratory
  RESEARCH   CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona
  LIBRARY
Outline

                                           aDORe
               A few words about the aDORe architecture


                   A Federation of Repositories
           A new level of cross-repository interoperability


                   Pathways InterDisseminator
A context-sensitive service overlay for a federation of repositories




                            Herbert Van de Sompel & Jeroen Bekaert
                       Research Library, Los Alamos National Laboratory
    RESEARCH      CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona
    LIBRARY
Pathways InterDisseminator Service Overlay

• Pathways InterDisseminator: Dynamic Service-Oriented
  Overlay upon the federated architecture
• Assumes the existence of:
   •  OpenURL Interface to all repositories in the federation
   •  Object Registry (given an identifier, at which OpenURL
      interface is the object available?)
   •  Availability of an RDF-based representation of DO
      compliant with a Pathways OWL core ontology
• Is itself exposed as a different OpenURL Resolver




                       Herbert Van de Sompel & Jeroen Bekaert
                  Research Library, Los Alamos National Laboratory
  RESEARCH   CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona
  LIBRARY
Pathways InterDisseminator : core ontology




hasEntity                                                                                            Location
                                                                                               ion
                                                                                          cat
                                                                                   ha sLo



            Entity        hasRepresentation
                                                         Represen-
                                                           tation

                                                                                 has
                                                                                    For
                                                                                          ma
                                                                                            t
                                                                                                     Format




                                     Herbert Van de Sompel & Jeroen Bekaert
                                Research Library, Los Alamos National Laboratory
              RESEARCH     CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona
              LIBRARY
baseURL_y?
                                                                                      url_ver=Z39.88-2004 &
                       baseURL_y?                                                     rft_id=URI_7 &
                        url_ver=Z39.88-2004 &                                         svc_id=info:pathways/boostrap
                        rft_id=URI_7 &
                        svc_id=info:pathways/dip.rdf
                                                                      magic
DSpace                                                                engine

                                       URI_7



                                          URI_3



                                          URI_9

Fedora
                                                                                                  OpenURL
                                          RDF                                                  ContextObject
                                                                                                 Container



aDORe
                  Interop                                                               Service Overlay
                Interface 2                                                                OpenURL
                 OpenURL                                                                  Application

                                 Herbert Van de Sompel & Jeroen Bekaert
                            Research Library, Los Alamos National Laboratory
         RESEARCH      CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona
         LIBRARY
Pathways InterDisseminator Service Overlay
• Part of the dissemination OpenURL Application is an engine
  that dynamically decides upon services for a given object
  from a repository (in a federation).
   o It grabs the (RDF) representation of the DO from its
     origin repository
   o It introspects on the properties expressed in that (RDF)
     representation
   o It compares these properties with its knowledge database
   o It returns a list of possible services/disseminations

• There can be many of these engines in a federation. The
  result is the ability to provide context-sensitive
  disseminations of DOs in (a federation of) repositories.


                       Herbert Van de Sompel & Jeroen Bekaert
                  Research Library, Los Alamos National Laboratory
  RESEARCH   CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona
  LIBRARY
Pathways InterDisseminator Service Overlay


• There can be many of these engines in a federation. The
  result is the ability to provide context-sensitive
  disseminations of DOs in (a federation of) repositories.




                       Herbert Van de Sompel & Jeroen Bekaert
                  Research Library, Los Alamos National Laboratory
  RESEARCH   CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona
  LIBRARY
baseURL_y?
                                                                                   url_ver=Z39.88-2004 &
                    baseURL_y?                                                     rft_id=URI_7 &
                     url_ver=Z39.88-2004 &                                         svc_id=info:magic/justdoit
                     rft_id=URI_7 &
                     svc_id=info:pathways/dip.rdf

DSpace

                                    URI_7



                                       URI_3



                                       URI_9



Fedora                                                             service
                                       RDF                        execution
                                                                   engine




                                                                 web service
aDORe


                              Herbert Van de Sompel & Jeroen Bekaert
                         Research Library, Los Alamos National Laboratory
         RESEARCH   CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona
         LIBRARY
Pathways InterDisseminator Demo

                      aDORe Digital Object in Demo
                                            Type                          MIME        identifier
     Digital Object                 scholarly paper                           N/A        DOI
                                                                                        aDORe
                                                                   application/xml    datastream
Constituent Datastream 1           metadata record
                                                                    (MARCXML)           id (info
                                                                                         URI)
                                                                                        aDORe
                                                                  application/xml     datastream
Constituent Datastream 2           metadata record
                                                                (original metadata)     id (info
                                                                                         URI)
                                                                                        aDORe
                                                                                      datastream
Constituent Datastream 3               fulltext file               application/pdf
                                                                                        id (info
                                                                                         URI)



                          Herbert Van de Sompel & Jeroen Bekaert
                     Research Library, Los Alamos National Laboratory
    RESEARCH    CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona
    LIBRARY
Demo

•   Install TSCC coded (http://www.techsmith.com)
•   Launch movie Pathways_InterDisseminator.avi in same path as this
    presentation




                         Herbert Van de Sompel & Jeroen Bekaert
                    Research Library, Los Alamos National Laboratory
    RESEARCH   CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona
    LIBRARY
Comments, Flames, Questions




                     Herbert Van de Sompel & Jeroen Bekaert
                Research Library, Los Alamos National Laboratory
RESEARCH   CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona
LIBRARY

More Related Content

Similar to Lessons in Cross-Repository Interoperability learned from the aDORe effort

Resource and Metadata Management with a Linked Data perspective
Resource and Metadata Management with a Linked Data perspectiveResource and Metadata Management with a Linked Data perspective
Resource and Metadata Management with a Linked Data perspectiveHannes Ebner
 
Motivation, inspiration and innovation from frustration
Motivation, inspiration and innovation from frustrationMotivation, inspiration and innovation from frustration
Motivation, inspiration and innovation from frustrationHerbert Van de Sompel
 
OAI-ORE: The Open Archives Initiative Object Reuse and Exchange Project
OAI-ORE:  The Open Archives Initiative  Object Reuse and Exchange ProjectOAI-ORE:  The Open Archives Initiative  Object Reuse and Exchange Project
OAI-ORE: The Open Archives Initiative Object Reuse and Exchange ProjectMichael Nelson
 
Open Science Days 2014 - Becker - Repositories and Linked Data
Open Science Days 2014 - Becker - Repositories and Linked DataOpen Science Days 2014 - Becker - Repositories and Linked Data
Open Science Days 2014 - Becker - Repositories and Linked DataPascal-Nicolas Becker
 
British Library Seminar: Shared Canvas (September 2011)
British Library Seminar: Shared Canvas (September 2011)British Library Seminar: Shared Canvas (September 2011)
British Library Seminar: Shared Canvas (September 2011)Robert Sanderson
 
Information Extraction on Noisy Texts for Historical Research
Information Extraction on Noisy Texts for Historical ResearchInformation Extraction on Noisy Texts for Historical Research
Information Extraction on Noisy Texts for Historical ResearchKepa J. Rodriguez
 
Reasoning on the Semantic Web
Reasoning on the Semantic WebReasoning on the Semantic Web
Reasoning on the Semantic WebYannis Kalfoglou
 
Facilitating semantic alignment.-biohackathon-jupp
Facilitating semantic alignment.-biohackathon-juppFacilitating semantic alignment.-biohackathon-jupp
Facilitating semantic alignment.-biohackathon-juppSimon Jupp
 
Parker Keio 2011: Interoperable Manuscript Framework
Parker Keio 2011: Interoperable Manuscript FrameworkParker Keio 2011: Interoperable Manuscript Framework
Parker Keio 2011: Interoperable Manuscript FrameworkRobert Sanderson
 
Building OBO Foundry ontology using semantic web tools
Building OBO Foundry ontology using semantic web toolsBuilding OBO Foundry ontology using semantic web tools
Building OBO Foundry ontology using semantic web toolsMelanie Courtot
 
Digital library initiatives in Turkey: A brief overview
Digital library initiatives in Turkey: A brief overviewDigital library initiatives in Turkey: A brief overview
Digital library initiatives in Turkey: A brief overviewYasar Tonta
 
LoCloud Vocabulary Services: Thesaurus management introduction, Walter Koch a...
LoCloud Vocabulary Services: Thesaurus management introduction, Walter Koch a...LoCloud Vocabulary Services: Thesaurus management introduction, Walter Koch a...
LoCloud Vocabulary Services: Thesaurus management introduction, Walter Koch a...locloud
 
Libraries and Linked Data: Looking to the Future (3)
Libraries and Linked Data: Looking to the Future (3)Libraries and Linked Data: Looking to the Future (3)
Libraries and Linked Data: Looking to the Future (3)ALATechSource
 
Analyzing the Persistence of Referenced Web Resources with Memento
Analyzing the Persistence of Referenced Web Resources with MementoAnalyzing the Persistence of Referenced Web Resources with Memento
Analyzing the Persistence of Referenced Web Resources with MementoRobert Sanderson
 
Libraries, OA research and OER: towards symbiosis?
Libraries, OA research and OER: towards symbiosis?Libraries, OA research and OER: towards symbiosis?
Libraries, OA research and OER: towards symbiosis?Nick Sheppard
 

Similar to Lessons in Cross-Repository Interoperability learned from the aDORe effort (20)

OpenAIRE schirrwagen
OpenAIRE schirrwagenOpenAIRE schirrwagen
OpenAIRE schirrwagen
 
Resource and Metadata Management with a Linked Data perspective
Resource and Metadata Management with a Linked Data perspectiveResource and Metadata Management with a Linked Data perspective
Resource and Metadata Management with a Linked Data perspective
 
Motivation, inspiration and innovation from frustration
Motivation, inspiration and innovation from frustrationMotivation, inspiration and innovation from frustration
Motivation, inspiration and innovation from frustration
 
OAI-ORE: The Open Archives Initiative Object Reuse and Exchange Project
OAI-ORE:  The Open Archives Initiative  Object Reuse and Exchange ProjectOAI-ORE:  The Open Archives Initiative  Object Reuse and Exchange Project
OAI-ORE: The Open Archives Initiative Object Reuse and Exchange Project
 
Open Science Days 2014 - Becker - Repositories and Linked Data
Open Science Days 2014 - Becker - Repositories and Linked DataOpen Science Days 2014 - Becker - Repositories and Linked Data
Open Science Days 2014 - Becker - Repositories and Linked Data
 
British Library Seminar: Shared Canvas (September 2011)
British Library Seminar: Shared Canvas (September 2011)British Library Seminar: Shared Canvas (September 2011)
British Library Seminar: Shared Canvas (September 2011)
 
Information Extraction on Noisy Texts for Historical Research
Information Extraction on Noisy Texts for Historical ResearchInformation Extraction on Noisy Texts for Historical Research
Information Extraction on Noisy Texts for Historical Research
 
Reasoning on the Semantic Web
Reasoning on the Semantic WebReasoning on the Semantic Web
Reasoning on the Semantic Web
 
Facilitating semantic alignment.-biohackathon-jupp
Facilitating semantic alignment.-biohackathon-juppFacilitating semantic alignment.-biohackathon-jupp
Facilitating semantic alignment.-biohackathon-jupp
 
Parker Keio 2011: Interoperable Manuscript Framework
Parker Keio 2011: Interoperable Manuscript FrameworkParker Keio 2011: Interoperable Manuscript Framework
Parker Keio 2011: Interoperable Manuscript Framework
 
20110728 datalift-rpi-troy
20110728 datalift-rpi-troy20110728 datalift-rpi-troy
20110728 datalift-rpi-troy
 
Building OBO Foundry ontology using semantic web tools
Building OBO Foundry ontology using semantic web toolsBuilding OBO Foundry ontology using semantic web tools
Building OBO Foundry ontology using semantic web tools
 
Digital library initiatives in Turkey: A brief overview
Digital library initiatives in Turkey: A brief overviewDigital library initiatives in Turkey: A brief overview
Digital library initiatives in Turkey: A brief overview
 
LoCloud Vocabulary Services: Thesaurus management introduction, Walter Koch a...
LoCloud Vocabulary Services: Thesaurus management introduction, Walter Koch a...LoCloud Vocabulary Services: Thesaurus management introduction, Walter Koch a...
LoCloud Vocabulary Services: Thesaurus management introduction, Walter Koch a...
 
Libraries and Linked Data: Looking to the Future (3)
Libraries and Linked Data: Looking to the Future (3)Libraries and Linked Data: Looking to the Future (3)
Libraries and Linked Data: Looking to the Future (3)
 
Analyzing the Persistence of Referenced Web Resources with Memento
Analyzing the Persistence of Referenced Web Resources with MementoAnalyzing the Persistence of Referenced Web Resources with Memento
Analyzing the Persistence of Referenced Web Resources with Memento
 
Research Objects in Wf4Ever
Research Objects in Wf4EverResearch Objects in Wf4Ever
Research Objects in Wf4Ever
 
Libraries, OA research and OER: towards symbiosis?
Libraries, OA research and OER: towards symbiosis?Libraries, OA research and OER: towards symbiosis?
Libraries, OA research and OER: towards symbiosis?
 
2013-01-17 Research Object
2013-01-17 Research Object2013-01-17 Research Object
2013-01-17 Research Object
 
Burke siobhan link_resolver
Burke siobhan link_resolverBurke siobhan link_resolver
Burke siobhan link_resolver
 

More from Herbert Van de Sompel

The web is rotting and what to do about it
The web is rotting and what to do about itThe web is rotting and what to do about it
The web is rotting and what to do about itHerbert Van de Sompel
 
Researcher Pod: Scholarly Communication Using the Decentralized Web
Researcher Pod: Scholarly Communication Using the Decentralized WebResearcher Pod: Scholarly Communication Using the Decentralized Web
Researcher Pod: Scholarly Communication Using the Decentralized WebHerbert Van de Sompel
 
Persistent Identification: Easier Said than Done
Persistent Identification: Easier Said than DonePersistent Identification: Easier Said than Done
Persistent Identification: Easier Said than DoneHerbert Van de Sompel
 
FAIR Signposting: A KISS Approach to a Burning Issue
FAIR Signposting: A KISS Approach to a Burning IssueFAIR Signposting: A KISS Approach to a Burning Issue
FAIR Signposting: A KISS Approach to a Burning IssueHerbert Van de Sompel
 
Registration / Certification Interoperability Architecture (overlay peer-review)
Registration / Certification Interoperability Architecture (overlay peer-review)Registration / Certification Interoperability Architecture (overlay peer-review)
Registration / Certification Interoperability Architecture (overlay peer-review)Herbert Van de Sompel
 
Collecting the organizational scholarly record
Collecting the organizational scholarly recordCollecting the organizational scholarly record
Collecting the organizational scholarly recordHerbert Van de Sompel
 
Achieving Link Integrity for Managed Collections
Achieving Link Integrity for Managed CollectionsAchieving Link Integrity for Managed Collections
Achieving Link Integrity for Managed CollectionsHerbert Van de Sompel
 
Signposting Overview (Version November 2017)
Signposting Overview (Version November 2017)Signposting Overview (Version November 2017)
Signposting Overview (Version November 2017)Herbert Van de Sompel
 
DBpedia Archive using Memento, Triple Pattern Fragments, and HDT
DBpedia Archive using Memento, Triple Pattern Fragments, and HDTDBpedia Archive using Memento, Triple Pattern Fragments, and HDT
DBpedia Archive using Memento, Triple Pattern Fragments, and HDTHerbert Van de Sompel
 
Interoperability for web based scholarship
Interoperability for web based scholarshipInteroperability for web based scholarship
Interoperability for web based scholarshipHerbert Van de Sompel
 

More from Herbert Van de Sompel (20)

The web is rotting and what to do about it
The web is rotting and what to do about itThe web is rotting and what to do about it
The web is rotting and what to do about it
 
Researcher Pod: Scholarly Communication Using the Decentralized Web
Researcher Pod: Scholarly Communication Using the Decentralized WebResearcher Pod: Scholarly Communication Using the Decentralized Web
Researcher Pod: Scholarly Communication Using the Decentralized Web
 
Persistent Identification: Easier Said than Done
Persistent Identification: Easier Said than DonePersistent Identification: Easier Said than Done
Persistent Identification: Easier Said than Done
 
FAIR Signposting: A KISS Approach to a Burning Issue
FAIR Signposting: A KISS Approach to a Burning IssueFAIR Signposting: A KISS Approach to a Burning Issue
FAIR Signposting: A KISS Approach to a Burning Issue
 
Registration / Certification Interoperability Architecture (overlay peer-review)
Registration / Certification Interoperability Architecture (overlay peer-review)Registration / Certification Interoperability Architecture (overlay peer-review)
Registration / Certification Interoperability Architecture (overlay peer-review)
 
Collecting the organizational scholarly record
Collecting the organizational scholarly recordCollecting the organizational scholarly record
Collecting the organizational scholarly record
 
To the Rescue of Scholarly Orphans
To the Rescue of Scholarly OrphansTo the Rescue of Scholarly Orphans
To the Rescue of Scholarly Orphans
 
Almost two decades at LANL
Almost two decades at LANLAlmost two decades at LANL
Almost two decades at LANL
 
Perseverance on Persistence
Perseverance on PersistencePerseverance on Persistence
Perseverance on Persistence
 
Paul Evan Peters Lecture
Paul Evan Peters LecturePaul Evan Peters Lecture
Paul Evan Peters Lecture
 
Achieving Link Integrity for Managed Collections
Achieving Link Integrity for Managed CollectionsAchieving Link Integrity for Managed Collections
Achieving Link Integrity for Managed Collections
 
Signposting Overview (Version November 2017)
Signposting Overview (Version November 2017)Signposting Overview (Version November 2017)
Signposting Overview (Version November 2017)
 
Signposting Overview
Signposting OverviewSignposting Overview
Signposting Overview
 
PID Signposting Pattern
PID Signposting PatternPID Signposting Pattern
PID Signposting Pattern
 
DBpedia Archive using Memento, Triple Pattern Fragments, and HDT
DBpedia Archive using Memento, Triple Pattern Fragments, and HDTDBpedia Archive using Memento, Triple Pattern Fragments, and HDT
DBpedia Archive using Memento, Triple Pattern Fragments, and HDT
 
Interoperability for web based scholarship
Interoperability for web based scholarshipInteroperability for web based scholarship
Interoperability for web based scholarship
 
Reminiscing about interoperability
Reminiscing about interoperabilityReminiscing about interoperability
Reminiscing about interoperability
 
Creating Pockets of Persistence
Creating Pockets of PersistenceCreating Pockets of Persistence
Creating Pockets of Persistence
 
ResourceSync Quick Overview
ResourceSync Quick OverviewResourceSync Quick Overview
ResourceSync Quick Overview
 
Memento 101
Memento 101Memento 101
Memento 101
 

Recently uploaded

Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions
 
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxMaking_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxnull - The Open Security Community
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?XfilesPro
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhisoniya singh
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent
 

Recently uploaded (20)

Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food Manufacturing
 
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxMaking_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
The transition to renewables in India.pdf
The transition to renewables in India.pdfThe transition to renewables in India.pdf
The transition to renewables in India.pdf
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
 

Lessons in Cross-Repository Interoperability learned from the aDORe effort

  • 1. Lessons in Cross-Repository Interoperability learned from the aDORe effort Herbert Van de Sompel & Jeroen Bekaert Research Library Los Alamos National Laboratory, USA Herbert Van de Sompel & Jeroen Bekaert Research Library, Los Alamos National Laboratory RESEARCH CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona LIBRARY
  • 2. The repository model quot;Pattern Recognition: The 2003 OCLC Environmental Scanquot; http://www.oclc.org/membership/escan/toc.htm Herbert Van de Sompel & Jeroen Bekaert Research Library, Los Alamos National Laboratory RESEARCH CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona LIBRARY
  • 3. Value chains starting in repositories • New knowledge is really being created when allowing for non- anticipated use of stuff. • These repositories are not about creating services for local users (only) • These repositories are not about creating a service (user interface) for all users • These repositories are about facilitating the use of materials in many contexts • These repositories are the starting point of value chains Herbert Van de Sompel & Jeroen Bekaert Research Library, Los Alamos National Laboratory RESEARCH CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona LIBRARY
  • 4. • Value chains emerging from RSS feeds http://www.technorati.com Herbert Van de Sompel & Jeroen Bekaert Research Library, Los Alamos National Laboratory RESEARCH CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona LIBRARY
  • 5. Value chains starting in repositories recombine add value Herbert Van de Sompel & Jeroen Bekaert Research Library, Los Alamos National Laboratory RESEARCH CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona LIBRARY
  • 6. • Scholarly Communication value chains http://dx.doi.org/10.1045/september2004-vandesompel Herbert Van de Sompel & Jeroen Bekaert Research Library, Los Alamos National Laboratory RESEARCH CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona LIBRARY
  • 7. Value chains starting in repositories To allow value chains to emerge on the basis of materials in repositories, those repositories need clear/clean machine interfaces that allow downstream applications to consume materials, aggregate them, build services, … ⇒ Disconnection of repository content and service: allows for creation of both local and remote services ⇒ On-Web: Protocol-oriented interfaces ⇒ These value chains are about the real stuff not (only) about metadata Herbert Van de Sompel & Jeroen Bekaert Research Library, Los Alamos National Laboratory RESEARCH CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona LIBRARY
  • 8. Credits The reported material is based on the following work: o The LANL aDORe repository effort o The upcoming PhD thesis by Jeroen Bekaert (Advisor Herbert Van de Sompel) regarding protocol-based interfaces for Open Archival Information Systems (OAIS) o The NSF-funded Pathways project in collaboration with the Information Science group at Cornell University (Carl Lagoze, Sandy Payette, Simeon Warner) Herbert Van de Sompel & Jeroen Bekaert Research Library, Los Alamos National Laboratory RESEARCH CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona LIBRARY
  • 9. Outline aDORe A few words about the aDORe architecture A Federation of Repositories A new level of cross-repository interoperability Pathways InterDisseminator A context-sensitive service overlay for a federation of repositories Herbert Van de Sompel & Jeroen Bekaert Research Library, Los Alamos National Laboratory RESEARCH CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona LIBRARY
  • 10. aDORe effort aDORe is 2 things: o Standards-based, modular repository architecture - Distributed architecture - Protocol-based interactions between modules - Applicable to create interoperable federations of heterogeneous repositories o Actual implementation of the architecture at LANL for local storage of digital assets (currently in its 2nd version) aDORe is not a product o Components of aDORe software, usable in other environments, will be released Herbert Van de Sompel & Jeroen Bekaert Research Library, Los Alamos National Laboratory RESEARCH CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona LIBRARY
  • 11. aDORe effort Standards Distributed Insights architecture in Cross-Repository Protocol-based Interoperability communication Herbert Van de Sompel & Jeroen Bekaert Research Library, Los Alamos National Laboratory RESEARCH CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona LIBRARY
  • 12. aDORe effort • Standards used in aDORe include: o XML, o XML Schema, o MPEG-21 Digital Item Declaration, o MPEG-21 Digital Item Identification, o W3C XML Signatures, o OAI-PMH, o NISO OpenURL Framework for Context-Sensitive Services, o Internet Archive ARC file format, o OAIS concepts Herbert Van de Sompel & Jeroen Bekaert Research Library, Los Alamos National Laboratory RESEARCH CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona LIBRARY
  • 13. Compound objects Identifier Locator Repository Registry
  • 14. OAI-PMH Federator Dynamic Dissemination Engine OpenURL Resolver
  • 15. OAI-PMH Federator & OpenURL Resolver OAIS # items aDORe Interface identifier Access in front-end standard Type response OAI-PMH OAI-PMH Package Identifier OAIS DIP 1 or more Federator Content Identifier, Package OpenURL NISO OAIS DIP & Identifier (with XML ID 1 Resolver OpenURL Result Set fragment) Herbert Van de Sompel & Jeroen Bekaert Research Library, Los Alamos National Laboratory RESEARCH CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona LIBRARY
  • 16. http://cordra.net Herbert Van de Sompel & Jeroen Bekaert Research Library, Los Alamos National Laboratory RESEARCH CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona LIBRARY
  • 17. Outline aDORe A few words about the aDORe architecture A Federation of Repositories A new level of cross-repository interoperability Pathways InterDisseminator A context-sensitive service overlay for a federation of repositories Herbert Van de Sompel & Jeroen Bekaert Research Library, Los Alamos National Laboratory RESEARCH CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona LIBRARY
  • 18. The interoperable repository model I will try to show that: • a significantly higher level of cross-repository interoperability can be achieved with relatively modest means • those means are largely available and agreed upon in our community I will introduce: • Repository-level requirements • Infrastructure-level requirements Herbert Van de Sompel & Jeroen Bekaert Research Library, Los Alamos National Laboratory RESEARCH CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona LIBRARY
  • 19. Part 1 : Requirements for a repository in a federation Herbert Van de Sompel & Jeroen Bekaert Research Library, Los Alamos National Laboratory RESEARCH CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona LIBRARY
  • 20. Repositories & Units of Communication • Data-oriented research => not only textual materials, but also datasets, software, simulations, dynamic knowledge presentations, … • Research results represented by variety of digital media ⇒ these media must receive status similar to that of text in current system • Materials in various stages of certification: ⇒ units of communication not only ‘papers’ but also preprints, raw datasets, prototype simulations, … • Facilitate collaboration ⇒ re-use of units of communications Herbert Van de Sompel & Jeroen Bekaert Research Library, Los Alamos National Laboratory RESEARCH CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona LIBRARY
  • 21. Repositories & Units of Communication • Handling this requires: o a compound object view of a unit of communication o stop thinking in terms of metadata versus content • Compound object: o Has a persistent identifier o Contain materials and metadata about those materials o Can contain other compound objects Herbert Van de Sompel & Jeroen Bekaert Research Library, Los Alamos National Laboratory RESEARCH CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona LIBRARY
  • 22. Compound objects URI_7 URIs: URI_3 • minted by different repositories • from different namespaces • not (necessarily) locators URI_9 compound object Herbert Van de Sompel & Jeroen Bekaert Research Library, Los Alamos National Laboratory RESEARCH CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona LIBRARY
  • 23. XML-based representation of compound objects URI_7 URI_7 MPEG-21 DIDL URI_3 METS URI_3 IMS/CP RDF URI_9 URI_9 compound object XML-based representation Herbert Van de Sompel & Jeroen Bekaert Research Library, Los Alamos National Laboratory RESEARCH CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona LIBRARY
  • 24. Repository Interop Interface 1: OAI-PMH & CO URI_7 URI_3 URI_9 repository_a OAI-PMH OAI-PMH baseURL_m harvester • machine consumption • batches of compound objects • OAI-PMH datestamp ~ new version of object Herbert Van de Sompel & Jeroen Bekaert Research Library, Los Alamos National Laboratory RESEARCH CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona LIBRARY
  • 25. OAI-PMH interface to OAIS (Jeroen Bekaert) agent baseURL(OAIPMH_CIID)? verb=ListMetadataFormats list of DIP formats ( ListMetadataFormats response) baseURL(OAIPMH_CIID)? verb=ListRecords& metadataPrefix= info:pathways/svc/dip.rdf list of DIPs (derived from most recent AIPs) Herbert Van de Sompel & Jeroen Bekaert Research Library, Los Alamos National Laboratory RESEARCH CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona LIBRARY
  • 26. Repository Interop Interface 1: OAI-PMH & CO URI_12 add value URI_7 recombine URI_7 URI_3 URI_3 URI_9 URI_9 OAI-PMH repository_b OAI-PMH harvester baseURL_n • include provenance ~ version of compound object Herbert Van de Sompel & Jeroen Bekaert Research Library, Los Alamos National Laboratory RESEARCH CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona LIBRARY
  • 27. Repository Interop Interface 2: OpenURL & CO URI_7 URI_3 OpenURL URI_9 baseURL_x? url_ver=Z39.88-2004 & rft_id=URI_7 & svc_id=info:pathways/svc/dip.* repository_n OpenURL baseURL_o • machine (& human) consumption • single object dissemination ~ identifier of compound object Herbert Van de Sompel & Jeroen Bekaert Research Library, Los Alamos National Laboratory RESEARCH CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona LIBRARY
  • 28. Repository Interop Interface 2: OpenURL & CO • ServiceType = Request a representation of the DO expressed using a compound object format o Example: - svc_id = info:pathways/svc/dip.didl (request MPEG-21 DIDL representation) - svc_id = info:pathways/svc/dip.mets (request METS representation) - svc_id = info:pathways/svc/dip.rdf (request RDF representation – see later) • Other Entities could be added to Interface #2 (think Requester) Herbert Van de Sompel & Jeroen Bekaert Research Library, Los Alamos National Laboratory RESEARCH CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona LIBRARY
  • 29. Repository Interop Interface 2: OpenURL & CO • independent of nature of identifiers • ‘resolution’ independent of scheme- OpenURL specific mechanisms • conceptual interface is persistent over time • KEV & HTTP (REST) • XML & SOAP •… repository_n OpenURL baseURL_o Herbert Van de Sompel & Jeroen Bekaert Research Library, Los Alamos National Laboratory RESEARCH CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona LIBRARY
  • 30. OpenURL interface to OAIS (Jeroen Bekaert) agent BaseURL(OpenURL_CIID)? OpenURL url_ver=Z39.88-2004& rft_id=ContentInfoIdentifier& svc_id= info:pathways/svc/dip list of ContextObjects BaseURL(OpenURL_CIID)? for each AIP (version) url_ver=Z39.88-2004& BaseURL(OpenURL_CIID)? rft_id=ContentInfoIdentifier& url_ver=Z39.88-2004& rft_val_fmt= info:ofi/fmt:kev:mtx:pathways & rft_id=ContentInfoIdentifier& rft.aip= AIPIdentifier & rft_val_fmt= info:ofi/fmt:kev:mtx:pathways & svc_id=info:pathways/svc/dip rft.aip= AIPIdentifier & svc_id=info:pathways/svc/dip list of ContextObjects BaseURL(OpenURL_CIID)? for each DIP format url_ver=Z39.88-2004& BaseURL(OpenURL_CIID)? rft_id=ContentInfoIdentifier& url_ver=Z39.88-2004& rft_val_fmt=info:ofi/fmt:kev:mtx:pathways& rft_id=ContentInfoIdentifier& rft.aip=AIPIdentifier& rft_val_fmt=info:ofi/fmt:kev:mtx:pathways& svc_id= info:pathways/svc/dip.rdf rft.aip=AIPIdentifier& svc_id= info:pathways/svc/dip.* DIP (RDF) Herbert Van de Sompel & Jeroen Bekaert Research Library, Los Alamos National Laboratory RESEARCH CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona LIBRARY
  • 31. Part 2 :Requirements for an infrastructure supporting a federation of repositories Herbert Van de Sompel & Jeroen Bekaert Research Library, Los Alamos National Laboratory RESEARCH CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona LIBRARY
  • 32. Repository Registry: Who is part of the Federation? register Repository Registry Per Repository: • Repository identifier • baseURL of OAI-PMH interface • baseURL of OpenURL interface • whichever kind of information that helps downstream applications understand about the nature of the repository Herbert Van de Sompel & Jeroen Bekaert Research Library, Los Alamos National Laboratory RESEARCH CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona LIBRARY
  • 33. Object Registry: What is part of the Federation? Per compound object: • Object identifier • Object datetime ~ OAI-PMH datestamp • OAI-PMH identifier • Repository identifier of the object itself, and of its contained objects SRU SRW Object Registry handle harvest (identifiers) Herbert Van de Sompel & Jeroen Bekaert Research Library, Los Alamos National Laboratory RESEARCH CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona LIBRARY
  • 34. OAI-PMH & OpenURL access to objects in federation Repository URI_7 Registry URI_3 URI_9 • List of existing copies • Per copy: SRU • OAI-PMH access info SRW Object • OpenURL access info Registry handle URI_7 Herbert Van de Sompel & Jeroen Bekaert Research Library, Los Alamos National Laboratory RESEARCH CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona LIBRARY
  • 35. Part 3 : Summary of requirements Herbert Van de Sompel & Jeroen Bekaert Research Library, Los Alamos National Laboratory RESEARCH CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona LIBRARY
  • 36. Summary of requirements Requirement Repository Infrastructure Compound Object X model support XML-based X ? representations support OAI-PMH CO support X OpenURL CO support X Repository Registry X Object Registry X Herbert Van de Sompel & Jeroen Bekaert Research Library, Los Alamos National Laboratory RESEARCH CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona LIBRARY
  • 37. Summary of requirements Many variations on the design possible, yet most of this can be achieved with: • Off-the-shelf tools o OAI-PMH tools o Handle system, SRU/W tools o OpenURL tools o Tools to generate XML-based representations of objects • Surprisingly little effort • A feasible amount of coordination/specification • Some shared infrastructure Herbert Van de Sompel & Jeroen Bekaert Research Library, Los Alamos National Laboratory RESEARCH CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona LIBRARY
  • 38. The Good News ™ • Microsoft & Mellon Foundation interested in taking interoperability across repositories to a new level • Meeting is being planned to consult with the major stakeholders on identifying, specifying, and implementing concrete ways forward • Tony Hey has (t)asked me to call that meeting Herbert Van de Sompel & Jeroen Bekaert Research Library, Los Alamos National Laboratory RESEARCH CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona LIBRARY
  • 39. Outline aDORe A few words about the aDORe architecture A Federation of Repositories A new level of cross-repository interoperability Pathways InterDisseminator A context-sensitive service overlay for a federation of repositories Herbert Van de Sompel & Jeroen Bekaert Research Library, Los Alamos National Laboratory RESEARCH CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona LIBRARY
  • 40. Pathways InterDisseminator Service Overlay • Pathways InterDisseminator: Dynamic Service-Oriented Overlay upon the federated architecture • Assumes the existence of: • OpenURL Interface to all repositories in the federation • Object Registry (given an identifier, at which OpenURL interface is the object available?) • Availability of an RDF-based representation of DO compliant with a Pathways OWL core ontology • Is itself exposed as a different OpenURL Resolver Herbert Van de Sompel & Jeroen Bekaert Research Library, Los Alamos National Laboratory RESEARCH CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona LIBRARY
  • 41. Pathways InterDisseminator : core ontology hasEntity Location ion cat ha sLo Entity hasRepresentation Represen- tation has For ma t Format Herbert Van de Sompel & Jeroen Bekaert Research Library, Los Alamos National Laboratory RESEARCH CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona LIBRARY
  • 42. baseURL_y? url_ver=Z39.88-2004 & baseURL_y? rft_id=URI_7 & url_ver=Z39.88-2004 & svc_id=info:pathways/boostrap rft_id=URI_7 & svc_id=info:pathways/dip.rdf magic DSpace engine URI_7 URI_3 URI_9 Fedora OpenURL RDF ContextObject Container aDORe Interop Service Overlay Interface 2 OpenURL OpenURL Application Herbert Van de Sompel & Jeroen Bekaert Research Library, Los Alamos National Laboratory RESEARCH CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona LIBRARY
  • 43. Pathways InterDisseminator Service Overlay • Part of the dissemination OpenURL Application is an engine that dynamically decides upon services for a given object from a repository (in a federation). o It grabs the (RDF) representation of the DO from its origin repository o It introspects on the properties expressed in that (RDF) representation o It compares these properties with its knowledge database o It returns a list of possible services/disseminations • There can be many of these engines in a federation. The result is the ability to provide context-sensitive disseminations of DOs in (a federation of) repositories. Herbert Van de Sompel & Jeroen Bekaert Research Library, Los Alamos National Laboratory RESEARCH CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona LIBRARY
  • 44. Pathways InterDisseminator Service Overlay • There can be many of these engines in a federation. The result is the ability to provide context-sensitive disseminations of DOs in (a federation of) repositories. Herbert Van de Sompel & Jeroen Bekaert Research Library, Los Alamos National Laboratory RESEARCH CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona LIBRARY
  • 45. baseURL_y? url_ver=Z39.88-2004 & baseURL_y? rft_id=URI_7 & url_ver=Z39.88-2004 & svc_id=info:magic/justdoit rft_id=URI_7 & svc_id=info:pathways/dip.rdf DSpace URI_7 URI_3 URI_9 Fedora service RDF execution engine web service aDORe Herbert Van de Sompel & Jeroen Bekaert Research Library, Los Alamos National Laboratory RESEARCH CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona LIBRARY
  • 46. Pathways InterDisseminator Demo aDORe Digital Object in Demo Type MIME identifier Digital Object scholarly paper N/A DOI aDORe application/xml datastream Constituent Datastream 1 metadata record (MARCXML) id (info URI) aDORe application/xml datastream Constituent Datastream 2 metadata record (original metadata) id (info URI) aDORe datastream Constituent Datastream 3 fulltext file application/pdf id (info URI) Herbert Van de Sompel & Jeroen Bekaert Research Library, Los Alamos National Laboratory RESEARCH CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona LIBRARY
  • 47. Demo • Install TSCC coded (http://www.techsmith.com) • Launch movie Pathways_InterDisseminator.avi in same path as this presentation Herbert Van de Sompel & Jeroen Bekaert Research Library, Los Alamos National Laboratory RESEARCH CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona LIBRARY
  • 48. Comments, Flames, Questions Herbert Van de Sompel & Jeroen Bekaert Research Library, Los Alamos National Laboratory RESEARCH CNI Task Force Meeting, December 5-6 2005, Phoenix, Arizona LIBRARY