Driver Guidelines and Repository Interoperability
Upcoming SlideShare
Loading in...5
×
 

Driver Guidelines and Repository Interoperability

on

  • 2,156 views

On 2008-11-15 Maurice Vanderfeesten gave a presentation in Baltimore at the SPARC OpenAccess confenrence. ...

On 2008-11-15 Maurice Vanderfeesten gave a presentation in Baltimore at the SPARC OpenAccess confenrence.
This presentation explains about the needs for interoperability amoung repository systems. DRIVER provides guidelines how to expose metadata via OAI-PMH is a way that has international compliance.

Statistics

Views

Total Views
2,156
Slideshare-icon Views on SlideShare
1,814
Embed Views
342

Actions

Likes
0
Downloads
15
Comments
0

6 Embeds 342

http://wiki.surffoundation.nl 178
http://wiki.surf.nl 136
http://www.surffoundation.nl 18
http://translate.googleusercontent.com 5
http://www.slideshare.net 4
http://www.linkedin.com 1

Accessibility

Categories

Upload Details

Uploaded via as Microsoft PowerPoint

Usage Rights

© All Rights Reserved

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Processing…
Post Comment
Edit your comment

Driver Guidelines and Repository Interoperability Driver Guidelines and Repository Interoperability Presentation Transcript

  • Fasten … Seatbelt Maurice Vanderfeesten - SURFfoundation (NL) 15 November 2008 – Baltimore – DRIVER meeting
  • Fasten Excel in Scholarly communication
  • Seatbelt Get to the finish line safely
  • Innovation towards the intelligent web The Intelligent Web Productivity of Search Web 4.0 2020 - 2030 Reasoning The Semantic Web Web 3.0 Semantic Search 2010 - 2020 The Social Web Natural language search The World Wide Web Web 2.0 2000 - 2010 Tagging Web 1.0 1990 - 2000 Keyword search The Desktop Directories PC Era 1980 - 1990 Files & Folders Databases By: Radar Networks / TWINE Amount of data 4
  • Work together: Respect some rules
  • One goal: “Reliable Content Provision” Global Digital Repository Infrastructure
  • Reality: Efforts to interpret and normalize data Wide spread metadata standards: Unqualified Dublin Core & OAI-PMH Problem: interpreting semantics; standard specifications not enough Example: Electronic theses need context specific descriptions for date, type, roles & language TICER 2008, Tilburg 7
  • Effort interpreting dates - Trouble automatically interpreting semantics ex. [date] (Cranfield) Recommendation: in Unqualified <dc:contributor>Partington, Dublin Core use one date field that David(supervisor)</dc:contributor> <dc:creator>Lupson, Jonathan</dc:creator> represents the Publication date! <dc:date>2007-06-06T18:17:13Z</dc:date>(Publication?) <dc:date>2007-06-06T18:17:13Z</dc:date>(Graduation?) <dc:date>2007-02</dc:date> (Start ?) <dc:identifier>http://hdl.handle.net/1826/1729</dc:ident Humboldt: ifier> <dc:date>2007-06-07</dc:date> (Graduation) <dc:description> <dc:date>2007-03-06</dc:date> (Publication) <dc:date>2003-02</dc:date> Tilburg TICER 2008, (Start) 8
  • Effort interpreting types - Trouble automatically interpreting semantics ex. [type] Recommendation: use the following qualifications: Cranfield: <dc:type>Thesis or dissertation</dc:type> “ bachelorThesis”, <dc:type>Doctoral</dc:type> info:eu-repo/semantics/ <dc:type>PhD</dc:type> “ masterThesis”, info:eu-repo/semantics/ DIVA: “ doctoralThesis” info:eu-repo/semantics/ <dc:type>text.thesis.doctoral</dc:type> (Bologna Convention) Humboldt: <dc:type>Text</dc:type> <dc:type>dissertation</dc:type> TICER 2008, Tilburg 9
  • Effort interpreting roles 1. Electronic theses need context specific descriptions Recommendation: use the contributor field in Dublin Core only <dc:contributor>Partington, David(supervisor)</dc:contributor> for the person who supervised the <dc:creator>Lupson, Jonathan</dc:creator> <dc:date>2007-06-06T18:17:13Z</dc:date> Doctoral thesis project. <dc:date>2007-06-06T18:17:13Z</dc:date> <dc:date>2007-02</dc:date> <dc:identifier>http://hdl.handle.net/1826/1729</dc:ident ifier> <dc:description> TICER 2008, Tilburg 10
  • Effort interpreting languages Personal notation flavour of a language Recommendation: <dc:language>Nederlands</dc:language> use ISO639-3 <dc:language>ned</dc:language> As a standard way of writing down a language in a repository <dc:language>nl</dc:language> <dc:language>nld/dut</dc:language> <dc:language>en_UK</dc:language> <dc:language>mn</dc:language> TICER 2008, Tilburg 11
  • Number of repositories increase DRIVER: Collection of Quality Metadata for OpenAccess Material
  • All services providers must build adaptors for every single repository
  • Interoperability shares workload
  • One goal: “Reliable Content Provision” Global Digital Repository Infrastructure
  • Reliability: Broken Links Issue Repository URL
  • Reliability: Link resolvers OAI-PMH ID Global Repository Resolver URL ID + URL Updates • Use ID’s for citation reference • Obligation to update • Technology independent (future proof)
  • Standards, Agreements, Rules: Interoperability guidelines
  • Towards web-reasoning: data efficiency & interoperability levels By: Andreas Tolk et al., quot;Composable M&S Web Services for Net-centric Applications,quot; Journal for Defense Modeling & Simulation (JDMS), Volume 3 Number 1, pp. 27-44, January 2006
  • Interoperability leads towards improved retrieval and recall The Intelligent Web Productivity of Search Web 4.0 2020 - 2030 Reasoning The Semantic Web Web 3.0 Semantic Search 2010 - 2020 The Social Web Natural language search The World Wide Web Web 2.0 2000 - 2010 Tagging Web 1.0 1990 - 2000 Keyword search The Desktop Directories PC Era 1980 - 1990 Files & Folders Databases By: Radar Networks / TWINE Amount of data
  • We have: Tools for Syntactic & Semantic Interoperability - Guidelines for content providers, exposing textual resources with OAI-PMH - Validator, checking the rate of compliance to the “Guidelines for content providers” 21
  • Guidelines 2.0 - Build on knowledge from past & current IR projects (EU) - 26 actively involved contributors (experts and repository managers) from 8 countries. - Practical answers for IR’s on how to: - Improve full-text access - Standardize metadata quality - Create a reliable infrastructure for permanent identification, resolution, traceability and storage - Resolve semantic and classification issues
  • Guidelines 2.0 - Chapters 1. Use of OAI-PMH 2. Use of Metadata OAI_DC 3. Use of Best Practices for OAI_DC 4. Use of Compound Object Wrapping 5. Use of Vocabularies and Semantics 6. Use of Quality labels (Long Term Preservation) 7. Use of Persistent Identifiers 8. Use of Usage Statistics Exchange 9. Use of Intellectual Property Rights (IPR)
  • Validator
  • Validator - Deep validation - Points to exact location of the - Experimental tool issue for easy debugging - Self-test for Repository - Offers recommendations on Managers how to correctly modify your repository to interoperable - Embedded in DRIVER standards registration process - Creates a report for future - Detects interoperability issues reference - Provides explanation per - Provides a weighted score for interoperability issue. balanced effort - Score influences the result list.
  • Looking back on what we have: - Guidelines for content providers, exposing textual resources with OAI-PMH - Validator, checking the rate of compliance to the “Guidelines for content providers” 27
  • What is missing? Guidelines 28
  • Trias Politica Model Legislative 29
  • We DON’T have - A structure for acceptance of Intelligent Web Productivity of Search The Repository Interoperability Web 4.0 2020 - 2030 Reasoning Guidelines World Wide Semantic Web The Web 3.0 Semantic Search 2010 - 2020 The Social Web Natural language search The World Wide Web Web 2.0 2000 - 2010 Tagging Web 1.0 - The Desktop 1990 - 2000 Executive enforcement Keyword search enabling action on adopting PC Era 1980 - 1990 Directories Interoperability Guidelines for Files & Folders Repositories, World Wide, on Databases a National and local level By: Radar Networks / TWINE Amount of data 30
  • Questions • What strategies can be used to create a global “Trias Politica” for repositories in order to enforce “reliable content provision” by using interoperability guidelines? • What strategies are there to maintain repository guidelines? Who is responsible? • What strategies are known to create an acceptance mechanism for global agreement to repository guidelines? • What strategies can be used to enforce repository guidelines? • Who is responsible for the (metadata) quality of the repository output?
  • The end Thank you Maurice Vanderfeesten www.SURFfoundation.nl vanderfeesten@surf.nl