Metadata Harvesting And Validation
Upcoming SlideShare
Loading in...5
×
 

Metadata Harvesting And Validation

on

  • 2,434 views

 

Statistics

Views

Total Views
2,434
Views on SlideShare
2,430
Embed Views
4

Actions

Likes
0
Downloads
27
Comments
0

2 Embeds 4

http://www.slideshare.net 3
http://www.slideee.com 1

Accessibility

Categories

Upload Details

Uploaded via as Adobe PDF

Usage Rights

© All Rights Reserved

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Processing…
Post Comment
Edit your comment

Metadata Harvesting And Validation Metadata Harvesting And Validation Presentation Transcript

  • Metadata Harvesting and Validation Bram Vandeputte K.U.Leuven 1
  • slideshare • http://www.slideshare.net/bramvandeputte
  • Overview • Validation Service • Online Validation Service • OAI-PMH • Harvesting Infrastructure 3
  • Validation Service • Interoperability : Application Profile (AP) • Manual check : very time consuming • Need a tool for enforcing an AP => validation scheme Best practices derived from previous projects • A set of validation rules such as MELT and MACE Reusable : modular + • Reusable & extendable inheritance possible 4
  • Validation Service • Components : • XML schema : structure • schematron : • mandatory/conditional elements • empty fields • vocabularies (auto generated) • ... • Vcard component 5
  • Validation Service component : atomic block which does specific validation checking scheme : collection of components that • Terminology : ensures validity against a whole AP • Validation Component URI : unique identifier of a scheme • Validation Scheme • Validation Scheme URI : • http://aspect-project.org/validation/ASPECTv1.0/core 6
  • Validation Service 7
  • Validation Service ASPECTv1.0/ ASPECTv1.0/ LOM loose recommended core recommended lomloose.xsd vocabulary bank schematron rules core schematron vcard validator Legend rules uses empty attribute fields extends IMS ILOX ASPECT validationScheme vcard validator validation component 8
  • Validation Service 9
  • Online Validation Service demo 10
  • validation to lre AP refer to lre ap document
  • OAI-PMH • Client - Server model • Pull mechanism • options : • selective harvesting (date and set) • incremental harvesting • Metadata-agnostic 13
  • OAI-PMH • Verbs : Identify, ListRecords, GetRecord • Parameters : • baseUrl • from & until date • metadataPrefix • sets 14
  • Harvest Component • Multiple targets • Each target separate properties (sets, date granularity, metadataPrefix, ...) • Storing metadata (SPI, Filesystem, APP, ...) • Extra features : • Incremental harvesting • harvesting scheduling • Metadata validation + reporting • OAI-PMH Target validation • (User Friendly) GUI 15
  • invalid : discarded or identifier recorded for next harvesting 16
  • The Harvest component invalid : discarded or identifier recorded for next harvesting 16
  • ARIADNE Harvester invalid : discarded or identifier recorded for harvester log next harvesting 16
  • ARIADNE Harvester ASPECT Repository SPI SQI invalid : discarded or identifier recorded for harvester log validation service next harvesting 16
  • ARIADNE Harvester ASPECT Repository SPI SQI External Repository OAI OAI-PMH LOM LOM LOM invalid : discarded or identifier recorded for harvester log validation service next harvesting 16
  • ARIADNE Harvester ASPECT Repository OAI-PMH SPI SQI External Repository LOM LOM LOM OAI OAI-PMH LOM LOM LOM invalid : discarded or identifier recorded for harvester log validation service next harvesting 16
  • ARIADNE Harvester ASPECT Repository OAI-PMH SPI SQI External Repository LOM LOM OAI OAI-PMH LOM LOM LOM LOM invalid : discarded or identifier recorded for harvester log validation service next harvesting 16
  • ARIADNE Harvester ASPECT Repository OAI-PMH SPI SQI External Repository LOM LOM OAI OAI-PMH LOM LOM LOM LOM LOM invalid : discarded or identifier recorded for harvester log validation service next harvesting 16
  • ARIADNE Harvester ASPECT Repository OAI-PMH SPI SQI External Repository LOM LOM OAI OAI-PMH LOM LOM LOM LOM Validation Msg invalid : discarded or identifier recorded for harvester log validation service next harvesting 16
  • ARIADNE Harvester ASPECT Repository OAI-PMH SPI SQI External Repository LOM LOM OAI OAI-PMH Validation LOM Msg LOM LOM LOM invalid : discarded or identifier recorded for harvester log validation service next harvesting 16
  • ARIADNE Harvester ASPECT Repository OAI-PMH SPI SQI External Repository LOM LOM OAI OAI-PMH LOM LOM LOM LOM Validation Msg invalid : discarded or identifier recorded for harvester log validation service next harvesting 16
  • ARIADNE Harvester ASPECT Repository OAI-PMH SPI SQI External Repository LOM LOM OAI LOM OAI-PMH LOM LOM LOM Validation Msg invalid : discarded or identifier recorded for harvester log validation service next harvesting 16
  • ARIADNE Harvester ASPECT Repository OAI-PMH SPI SQI External Repository LOM LOM LOM OAI 2 6 LOM OAI-PMH 1 Validation LOM Msg LOM LOM LOM 4 5 3 Validation Validation LOM Msg Msg harvester log validation service 17
  • Harvester Screenshot or live demo 18
  • Validation Reports • After harvesting -> report generated and put online • report has 4 “levels” : • full log (incl. metadata) • reporting log • Grouped Errors • Error Summary
  • • Questions ? 23
  • References • SPI : http://ariadne.cs.kuleuven.be/lomi/index.php/ SimplePublishingInterface • IEEE LOM : http://ltsc.ieee.org/wg12/ • OAI-PMH : http://www.openarchives.org/