CNI Fall 2000 – San Antonio, Texas – December 8th, 2000. Closing Keynote Address. Herbert Van de Sompel, Cornell University Computer Science – Digital Library Group.
Concrete part: preview of the Open Archives Metadata Harvesting (OAMH) protocol. Speculative part: back to the preprint roots of the OAI. What does the OAMH protocol mean? What lies beyond the OAMH protocol?
Part 0: the concrete part
The OAMH protocol is a low-barrier interoperability specification for the recurrent exchange of metadata between systems.
The OAMH protocol: a service provider (harvester) sends requests to a data provider (repository), which sends back replies. The protocol defines 6 requests.
Federated services: A&I, image, FTXT, OPAC, e-print.
Metadata harvesting via OAMH: a harvester collects metadata from FTXT, A&I, image, OPAC, and e-print sources.
Federated services via OAMH: services built on harvested metadata (Author, Title, Abstract, Identifier) from A&I, image, FTXT, e-print, and OPAC sources.
Core concepts in OAMH: low-barrier interoperability; data-provider & service-provider model; metadata harvesting model; OAMH protocol; Dublin Core; HTTP based; community specific; shared metadata format and parallel, community-specific metadata formats; acceptable use; reply XML Schema; self-contained.
OAI harvesting tools: the data provider (repository) exposes Records to the service provider (harvester), with Datestamp, Identifier, and Set.
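The record attributes named on this slide (identifier, datestamp, set) are what make selective harvesting possible. As a minimal sketch, assuming a hypothetical in-memory repository, the filtering a data provider might apply could look like this in Python. The `Record` class, the `select_records` function, and the sample identifiers are illustrative names chosen for this example, not part of the protocol.

```python
from dataclasses import dataclass
from datetime import date
from typing import FrozenSet, Iterable, List, Optional


@dataclass(frozen=True)
class Record:
    """Hypothetical record carrying the three attributes named on the slide."""
    identifier: str
    datestamp: date
    sets: FrozenSet[str] = frozenset()


def select_records(records: Iterable[Record],
                   from_: Optional[date] = None,
                   until: Optional[date] = None,
                   set_: Optional[str] = None) -> List[Record]:
    """Keep records dated within [from_, until] that belong to set_ (if given)."""
    selected = []
    for record in records:
        if from_ is not None and record.datestamp < from_:
            continue
        if until is not None and record.datestamp > until:
            continue
        if set_ is not None and set_ not in record.sets:
            continue
        selected.append(record)
    return selected


# Assumed sample data; identifiers and set names are made up.
repository = [
    Record("example:1", date(2000, 1, 15), frozenset({"klm"})),
    Record("example:2", date(2000, 6, 30), frozenset({"xyz"})),
    Record("example:3", date(2000, 11, 2), frozenset({"klm"})),
]

recent_klm = select_records(repository, from_=date(2000, 6, 1), set_="klm")
print([r.identifier for r in recent_klm])  # → ['example:3']
```

Because every argument is optional, a harvester that passes none of them receives the whole repository, while one that remembers the date of its last harvest can ask only for what changed since then.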
OAI harvesting tools. Supporting protocol requests: Identify, ListMetadataFormats, ListSets. Harvesting protocol requests: ListRecords, ListIdentifiers, GetRecord. Requests go from the service provider (harvester) to the data provider (repository).
Supporting protocol requests: ListMetadataFormats. The service provider (harvester) sends ListMetadataFormats to the data provider (repository). The reply contains: ListMetadataFormats / Time / Request, then, repeated for each format: Format prefix, Format XML schema.
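To illustrate the reply structure on this slide (a time, an echo of the request, then a repeated pair of format prefix and format XML schema), here is a minimal Python sketch that parses such a reply. The element names (`responseDate`, `requestURL`, `metadataFormat`, `metadataPrefix`, `schema`), the sample reply, and all URLs are assumptions made for the example; the slide names only the fields, not the XML vocabulary.

```python
import xml.etree.ElementTree as ET
from typing import Dict

# Assumed sample reply; element names and URLs are hypothetical.
SAMPLE_REPLY = """<ListMetadataFormats>
  <responseDate>2000-12-08T10:00:00</responseDate>
  <requestURL>http://repository.example.org/oai?verb=ListMetadataFormats</requestURL>
  <metadataFormat>
    <metadataPrefix>dc</metadataPrefix>
    <schema>http://repository.example.org/schemas/dc.xsd</schema>
  </metadataFormat>
  <metadataFormat>
    <metadataPrefix>community</metadataPrefix>
    <schema>http://repository.example.org/schemas/community.xsd</schema>
  </metadataFormat>
</ListMetadataFormats>"""


def parse_metadata_formats(xml_text: str) -> Dict[str, str]:
    """Map each format prefix to the location of its XML schema."""
    root = ET.fromstring(xml_text)
    formats = {}
    # One <metadataFormat> element per repeated block in the reply.
    for fmt in root.findall("metadataFormat"):
        prefix = fmt.findtext("metadataPrefix")
        schema = fmt.findtext("schema")
        if prefix is not None and schema is not None:
            formats[prefix] = schema
    return formats


formats = parse_metadata_formats(SAMPLE_REPLY)
print(sorted(formats))  # → ['community', 'dc']
```

A harvester could use the resulting mapping to check that a repository offers the shared format before issuing any harvesting request.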
Harvesting requests: ListRecords. Arguments: from=a, until=b, set=klm, metadataPrefix=dc. The reply contains: ListRecords / Time / Request, then, repeated for each record: Identifier, Datestamp, Metadata.
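Since the protocol is HTTP based, a ListRecords request with the arguments shown on this slide can be sketched as a URL with query parameters. The following Python example is a sketch under stated assumptions: the base URL is hypothetical, the `verb` parameter name and the date format are assumptions not shown on the slide, and `list_records_url` is a helper name invented for this example.

```python
from typing import Optional
from urllib.parse import urlencode


def list_records_url(base_url: str,
                     metadata_prefix: str,
                     from_: Optional[str] = None,
                     until: Optional[str] = None,
                     set_: Optional[str] = None) -> str:
    """Build a ListRecords request URL; optional arguments are omitted when None."""
    # "verb" as the name of the request parameter is an assumption.
    params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
    if from_ is not None:
        params["from"] = from_
    if until is not None:
        params["until"] = until
    if set_ is not None:
        params["set"] = set_
    return base_url + "?" + urlencode(params)


# Hypothetical repository address and date values.
url = list_records_url("http://repository.example.org/oai", "dc",
                       from_="2000-01-01", until="2000-12-08", set_="klm")
print(url)
```

Recurrent harvesting then amounts to repeating this request with `from_` set to the date of the previous harvest, so that only new or changed records are transferred.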
Applications of the OAMH protocol? Federated services [S&R, SDI, alerting, linking, ...]; database synchronization; harvesting the deep Web; ...
Part 1: the speculative part
OAI roots: advance interoperability of preprints as a means to promote their global acceptance. What does the OAMH protocol mean in the preprint context? Are there any steps to be taken beyond the OAMH protocol in the preprint context?
What follows: scholarly communication systems; the paper and PDF implementation; the attractiveness of preprints; an interoperable, decomposed scholarly communication system built around preprints?
Market of scholarly communication {Roosendaal & Geurts}: registration (claiming a new finding); certification (certifying the claim); awareness (ensuring information throughput); archiving (preserving the heritage).
Market of scholarly communication {Roosendaal & Geurts}, extended: registration (claiming a new finding); certification (certifying the claim); awareness (ensuring information throughput); archiving (preserving the heritage); rewarding (evaluating & rewarding performance); accessibility (availability and searchability).
The paper version {the journal system}: registration: publisher; certification: publisher; awareness: library [selection, service, support]; archiving: library. Actors in the diagram: A, R, PUB, DIS, LIB.
The PDF version {the journal system}: registration: publisher; certification: publisher; awareness: publisher, distributor, library; archiving: (no party listed). Actors in the diagram: A, R, PUB, DIS, LIB.
{The journal system}, assessed function by function: registration: publisher; certification: publisher; awareness: publisher, distributor, library; archiving: ???; rewarding: citation databases; accessibility: low availability [serials crisis].
It is, at least, legitimate to reflect on the possibility of a digital system for scholarly communication that is not merely a scanned copy of the paper system. In order to free our minds: let’s forget about who has been doing what and how in the existing system.
A preprint in a digital scholarly communication system: registration: yes; certification: no; awareness: yes; archiving: no; rewarding: no; accessibility: high availability.
Registration via preprints suggests the feasibility of a decomposed system for scholarly communication.
[Diagram: the functions registration, awareness, archiving, certification, and rewarding, with actors A and R]
Communication via preprints suggests the possibility of preprints being the starting point of a new value chain in which the raw material, the non-certified preprint, is in open access.
Sustaining versus disruptive technologies {The Innovator’s Dilemma, Christensen}. Disruptive technologies: somehow perform worse than established ones; are not accepted by the core customer base; but are convenient, cheap, … Disruptive technologies can create competition in an existing value network by creating a new one first. => Preprints as a disruptive technology.
[Diagram: registration, awareness, archiving, certification, and rewarding, with actors A and R, forming a new value chain]
This urges us to think about: how the functions of a scholarly communication system can be implemented in a digital environment; how the functions of a scholarly communication system can interoperate regardless of their implementation.
Technology, law, economy, sociology: establishing a technological basis that allows addressing the other issues.
[Diagram: registration, awareness, archiving, certification, rewarding; actors A and R; OAI carries discovery metadata pointing at a preprint]
[Diagram: registration, awareness, archiving, certification, rewarding; actors A and R; OAI]
[Diagram: registration, awareness, archiving, certification, rewarding; exchanged: discovery, certification; actors A and R; OAI]
[Diagram: registration, awareness, archiving, certification, rewarding; exchanged: discovery, certification, usage logs; actors A and R]
[Diagram: registration, awareness, archiving, certification, rewarding; exchanged: discovery, certification, usage logs, preservation; actors A and R]
[Diagram: registration, awareness, archiving, certification, rewarding; actors A and R; interoperable grid]
[Diagram: registration, awareness, archiving, certification, rewarding; actors A and R; new value chain]
So what about the library? [back to the who-does-what question]
Within the existing system, Libraries are trying hard to optimize the output of a system with far from optimal input.
It has become increasingly difficult for Libraries to fulfill their fundamental role of safeguarding equity of access.
In the PDF version of the information chain, Libraries are aggregating the aggregators. That is a lot of aggregating for a digital world.
At the core of the problems that Libraries are facing is the total dependency on information held upstream in the information chain.
As such, there are numerous incentives for Libraries: to rethink themselves; to be pro-active in exploring alternative mechanisms for scholarly communication.
Libraries: the good news. Libraries are close to authors: a great position to fulfill the registration function, i.e. obtain institutional material; a great position to archive institutional material. Libraries are fast at embracing new technologies. Libraries have very knowledgeable people. Libraries provide a level of redundancy in services that is no longer required in a digital environment. The Library as an institution that safeguards equity of access has global representation.
Libraries: the bad news. As organizations, Libraries are slow movers, hosted by slowly moving institutions. Libraries are slow to recognize the fact that a new technology may allow [or beg] for a new mode of operation. The (information) world runs on Internet time.

The Roof is on Fire
