SlideShare a Scribd company logo
Metadata Harvesting and
      Validation

       Bram Vandeputte
         K.U.Leuven




              1
slideshare



• http://www.slideshare.net/bramvandeputte




                    2
Overview

• Validation Service
• Integration
• Online Validation Service
• Harvesting Component


                       3
Validation Service

• Interoperability : Application Profile (AP)
• Manual check : very time consuming
• Need a tool for enforcing an AP =>
  validation scheme
                                       Best practices derived
                                       from previous projects
• A set of validation rules            such as MELT and MACE

                                       Reusable : modular +
• Reusable & extendable                inheritance possible




                     4
Validation Service

• Components :
 • XML schema : structure
 • schematron :
   •   mandatory/conditional elements
   •   empty fields
   •   vocabularies (auto generated)
   •   ...

 • Vcard component

                               5
Validation Service
                                                      component : atomic
                                                      block which does specific
                                                      validation checking

                                                      scheme : collection of
                                                      components that
• Terminology :                                       ensures validity against
                                                      a whole AP


 • Validation Component                               URI : unique identifier of
                                                      a scheme


 • Validation Scheme
 • Validation Scheme URI :
   •   http://aspect-project.org/validation/ASPECTv1.0/core




                          6
Validation Service




        7
Validation Service
         ASPECTv1.0/                       ASPECTv1.0/
                                                              LOM loose
         recommended                          core




       recommended                                          lomloose.xsd
                                          vocabulary bank
      schematron rules

                                          core schematron   vcard validator
                  Legend
                                                rules
                            uses                            empty attribute
                                                                fields
                           extends           IMS ILOX
   ASPECT            validationScheme

vcard validator    validation component




                                             8
Validation Service




        9
Integration

• When do we validate ?
• Possible integration points :
 • Mapping
 • Harvesting
 • Enriching


                    10
Online Validation
  Service demo


        11
12
validation to lre AP
     refer to lre ap document




13
invalid : discarded or
identifier recorded for
next harvesting




                          14
The Harvest component


invalid : discarded or
identifier recorded for
next harvesting




                          14
ARIADNE Harvester




invalid : discarded or
identifier recorded for
                             harvester log
next harvesting




                                         14
ARIADNE Harvester
                                                    ASPECT Repository

                                              SPI                         SQI




invalid : discarded or
identifier recorded for
                             harvester log
                                                     validation service
next harvesting




                                         14
ARIADNE Harvester
                                                       ASPECT Repository

                                                 SPI                         SQI
External Repository

                      OAI


     OAI-PMH



     LOM
      LOM
       LOM




   invalid : discarded or
   identifier recorded for
                                harvester log
                                                        validation service
   next harvesting




                                            14
ARIADNE Harvester
                                                            ASPECT Repository
                             OAI-PMH
                                                      SPI                         SQI
External Repository
                             LOM
                              LOM
                               LOM
                      OAI


     OAI-PMH



     LOM
      LOM
       LOM




   invalid : discarded or
   identifier recorded for
                                     harvester log
                                                             validation service
   next harvesting




                                                 14
ARIADNE Harvester
                                                            ASPECT Repository
                             OAI-PMH
                                                      SPI                         SQI
External Repository
                             LOM
                              LOM

                      OAI


     OAI-PMH

                               LOM
     LOM
      LOM
       LOM




   invalid : discarded or
   identifier recorded for
                                     harvester log
                                                             validation service
   next harvesting




                                                 14
ARIADNE Harvester
                                                            ASPECT Repository
                             OAI-PMH
                                                      SPI                         SQI
External Repository
                             LOM
                              LOM

                      OAI


     OAI-PMH

                               LOM
     LOM
      LOM
       LOM




                                                              LOM




   invalid : discarded or
   identifier recorded for
                                     harvester log
                                                             validation service
   next harvesting




                                                 14
ARIADNE Harvester
                                                            ASPECT Repository
                             OAI-PMH
                                                      SPI                            SQI
External Repository
                             LOM
                              LOM

                      OAI


     OAI-PMH

                               LOM
     LOM
      LOM
       LOM




                                                                        Validation
                                                                          Msg



   invalid : discarded or
   identifier recorded for
                                     harvester log
                                                             validation service
   next harvesting




                                                 14
ARIADNE Harvester
                                                                   ASPECT Repository
                             OAI-PMH
                                                             SPI                         SQI
External Repository
                             LOM
                              LOM

                      OAI


     OAI-PMH

                                                Validation
                               LOM
                                                  Msg
     LOM
      LOM
       LOM




   invalid : discarded or
   identifier recorded for
                                     harvester log
                                                                    validation service
   next harvesting




                                                 14
ARIADNE Harvester
                                                                ASPECT Repository
                             OAI-PMH
                                                          SPI                         SQI
External Repository
                             LOM
                              LOM

                      OAI


     OAI-PMH

                               LOM
     LOM
      LOM
       LOM




                                        Validation
                                          Msg



   invalid : discarded or
   identifier recorded for
                                     harvester log
                                                                 validation service
   next harvesting




                                                     14
ARIADNE Harvester
                                                               ASPECT Repository
                             OAI-PMH
                                                         SPI                         SQI
External Repository
                             LOM
                              LOM

                      OAI
                                                                       LOM

     OAI-PMH



     LOM
      LOM
       LOM




                                       Validation
                                         Msg



   invalid : discarded or
   identifier recorded for
                                    harvester log
                                                                validation service
   next harvesting




                                                    14
ARIADNE Harvester
                                                                           ASPECT Repository

                            OAI-PMH
                                                                     SPI                              SQI
External Repository
                            LOM
                             LOM
                              LOM
                  OAI
                             2                                   6                 LOM


     OAI-PMH            1
                                                    Validation
                             LOM
                                                      Msg
     LOM
      LOM
       LOM

                                                                           4
                                                   5             3
                                      Validation                                         Validation
                                                                             LOM
                                        Msg                                                Msg




                                   harvester log
                                                                            validation service




                                                   15
Validation Reports
• After harvesting -> report generated and put
  online
• report has 4 “levels” :
  • full log (incl. metadata)
  • reporting log
  • Grouped Errors
  • Error Summary

                        16
• Questions ?




                17
References

• SPI : http://ariadne.cs.kuleuven.be/lomi/index.php/
   SimplePublishingInterface

• IEEE LOM : http://ltsc.ieee.org/wg12/
• OAI-PMH : http://www.openarchives.org/
• Online Validation :
   http://ariadne.cs.kuleuven.be/validationService/




                                  18

More Related Content

Similar to Metadata Harvesting And Validationv2

ARIADNE overview
ARIADNE overviewARIADNE overview
ARIADNE overview
Joris Klerkx
 
OSGi Service Platform 4.2
OSGi Service Platform 4.2OSGi Service Platform 4.2
OSGi Service Platform 4.2
Ilya Katsov
 
Self-Service in EPAM Private Cloud
Self-Service in EPAM Private CloudSelf-Service in EPAM Private Cloud
Self-Service in EPAM Private Cloud
Alex Tregubov
 
44rd CEN WS/LT meeting PT interoperability of registries
44rd CEN WS/LT meeting PT interoperability of registries44rd CEN WS/LT meeting PT interoperability of registries
44rd CEN WS/LT meeting PT interoperability of registries
Joris Klerkx
 
How to Share and Reuse Learning Resources: the ARIADNE Experience
How to Share and Reuse Learning Resources: the ARIADNE ExperienceHow to Share and Reuse Learning Resources: the ARIADNE Experience
How to Share and Reuse Learning Resources: the ARIADNE Experience
Joris Klerkx
 
IdM in Smart Applications on Virtual Infrastructure
IdM in Smart Applications on Virtual InfrastructureIdM in Smart Applications on Virtual Infrastructure
IdM in Smart Applications on Virtual Infrastructure
Mohammad Faraji
 
Harvesting metadata - ASPECT webinar
Harvesting metadata - ASPECT webinarHarvesting metadata - ASPECT webinar
Harvesting metadata - ASPECT webinar
Joris Klerkx
 
WF and WCF with AppFabric – Application Infrastructure for OnPremise Services
WF and WCF with AppFabric – Application Infrastructure for OnPremise ServicesWF and WCF with AppFabric – Application Infrastructure for OnPremise Services
WF and WCF with AppFabric – Application Infrastructure for OnPremise Services
Saltmarch Media
 

Similar to Metadata Harvesting And Validationv2 (8)

ARIADNE overview
ARIADNE overviewARIADNE overview
ARIADNE overview
 
OSGi Service Platform 4.2
OSGi Service Platform 4.2OSGi Service Platform 4.2
OSGi Service Platform 4.2
 
Self-Service in EPAM Private Cloud
Self-Service in EPAM Private CloudSelf-Service in EPAM Private Cloud
Self-Service in EPAM Private Cloud
 
44rd CEN WS/LT meeting PT interoperability of registries
44rd CEN WS/LT meeting PT interoperability of registries44rd CEN WS/LT meeting PT interoperability of registries
44rd CEN WS/LT meeting PT interoperability of registries
 
How to Share and Reuse Learning Resources: the ARIADNE Experience
How to Share and Reuse Learning Resources: the ARIADNE ExperienceHow to Share and Reuse Learning Resources: the ARIADNE Experience
How to Share and Reuse Learning Resources: the ARIADNE Experience
 
IdM in Smart Applications on Virtual Infrastructure
IdM in Smart Applications on Virtual InfrastructureIdM in Smart Applications on Virtual Infrastructure
IdM in Smart Applications on Virtual Infrastructure
 
Harvesting metadata - ASPECT webinar
Harvesting metadata - ASPECT webinarHarvesting metadata - ASPECT webinar
Harvesting metadata - ASPECT webinar
 
WF and WCF with AppFabric – Application Infrastructure for OnPremise Services
WF and WCF with AppFabric – Application Infrastructure for OnPremise ServicesWF and WCF with AppFabric – Application Infrastructure for OnPremise Services
WF and WCF with AppFabric – Application Infrastructure for OnPremise Services
 

More from Bram Vandeputte

Exploring the research world
Exploring the research worldExploring the research world
Exploring the research world
Bram Vandeputte
 
Research20
Research20Research20
Research20
Bram Vandeputte
 
Ariadne Overview
Ariadne OverviewAriadne Overview
Ariadne Overview
Bram Vandeputte
 
Multi Touch Introduction
Multi Touch IntroductionMulti Touch Introduction
Multi Touch Introduction
Bram Vandeputte
 
Multi Touch Hands On
Multi Touch Hands OnMulti Touch Hands On
Multi Touch Hands On
Bram Vandeputte
 
Oai Workshop Extended
Oai Workshop ExtendedOai Workshop Extended
Oai Workshop Extended
Bram Vandeputte
 
IEEE LOM
IEEE LOMIEEE LOM
IEEE LOM
Bram Vandeputte
 
Android Presentation
Android PresentationAndroid Presentation
Android Presentation
Bram Vandeputte
 

More from Bram Vandeputte (8)

Exploring the research world
Exploring the research worldExploring the research world
Exploring the research world
 
Research20
Research20Research20
Research20
 
Ariadne Overview
Ariadne OverviewAriadne Overview
Ariadne Overview
 
Multi Touch Introduction
Multi Touch IntroductionMulti Touch Introduction
Multi Touch Introduction
 
Multi Touch Hands On
Multi Touch Hands OnMulti Touch Hands On
Multi Touch Hands On
 
Oai Workshop Extended
Oai Workshop ExtendedOai Workshop Extended
Oai Workshop Extended
 
IEEE LOM
IEEE LOMIEEE LOM
IEEE LOM
 
Android Presentation
Android PresentationAndroid Presentation
Android Presentation
 

Recently uploaded

“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
Edge AI and Vision Alliance
 
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAUHCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
panagenda
 
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
SOFTTECHHUB
 
Building Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and MilvusBuilding Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and Milvus
Zilliz
 
UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6
DianaGray10
 
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdfObservability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Paige Cruz
 
How to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptxHow to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptx
danishmna97
 
Introduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - CybersecurityIntroduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - Cybersecurity
mikeeftimakis1
 
Programming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup SlidesProgramming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup Slides
Zilliz
 
“I’m still / I’m still / Chaining from the Block”
“I’m still / I’m still / Chaining from the Block”“I’m still / I’m still / Chaining from the Block”
“I’m still / I’m still / Chaining from the Block”
Claudio Di Ciccio
 
GenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizationsGenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizations
kumardaparthi1024
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
Safe Software
 
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
名前 です男
 
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
Neo4j
 
20240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 202420240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 2024
Matthew Sinclair
 
UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5
DianaGray10
 
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Speck&Tech
 
GraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracyGraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracy
Tomaz Bratanic
 
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdfUnlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Malak Abu Hammad
 
Presentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of GermanyPresentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of Germany
innovationoecd
 

Recently uploaded (20)

“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
 
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAUHCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
 
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
 
Building Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and MilvusBuilding Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and Milvus
 
UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6
 
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdfObservability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
 
How to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptxHow to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptx
 
Introduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - CybersecurityIntroduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - Cybersecurity
 
Programming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup SlidesProgramming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup Slides
 
“I’m still / I’m still / Chaining from the Block”
“I’m still / I’m still / Chaining from the Block”“I’m still / I’m still / Chaining from the Block”
“I’m still / I’m still / Chaining from the Block”
 
GenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizationsGenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizations
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
 
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
 
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
 
20240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 202420240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 2024
 
UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5
 
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
 
GraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracyGraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracy
 
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdfUnlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
 
Presentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of GermanyPresentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of Germany
 

Metadata Harvesting And Validationv2

  • 1. Metadata Harvesting and Validation Bram Vandeputte K.U.Leuven 1
  • 3. Overview • Validation Service • Integration • Online Validation Service • Harvesting Component 3
  • 4. Validation Service • Interoperability : Application Profile (AP) • Manual check : very time consuming • Need a tool for enforcing an AP => validation scheme Best practices derived from previous projects • A set of validation rules such as MELT and MACE Reusable : modular + • Reusable & extendable inheritance possible 4
  • 5. Validation Service • Components : • XML schema : structure • schematron : • mandatory/conditional elements • empty fields • vocabularies (auto generated) • ... • Vcard component 5
  • 6. Validation Service component : atomic block which does specific validation checking scheme : collection of components that • Terminology : ensures validity against a whole AP • Validation Component URI : unique identifier of a scheme • Validation Scheme • Validation Scheme URI : • http://aspect-project.org/validation/ASPECTv1.0/core 6
  • 8. Validation Service ASPECTv1.0/ ASPECTv1.0/ LOM loose recommended core recommended lomloose.xsd vocabulary bank schematron rules core schematron vcard validator Legend rules uses empty attribute fields extends IMS ILOX ASPECT validationScheme vcard validator validation component 8
  • 10. Integration • When do we validate ? • Possible integration points : • Mapping • Harvesting • Enriching 10
  • 11. Online Validation Service demo 11
  • 12. 12
  • 13. validation to lre AP refer to lre ap document 13
  • 14. invalid : discarded or identifier recorded for next harvesting 14
  • 15. The Harvest component invalid : discarded or identifier recorded for next harvesting 14
  • 16. ARIADNE Harvester invalid : discarded or identifier recorded for harvester log next harvesting 14
  • 17. ARIADNE Harvester ASPECT Repository SPI SQI invalid : discarded or identifier recorded for harvester log validation service next harvesting 14
  • 18. ARIADNE Harvester ASPECT Repository SPI SQI External Repository OAI OAI-PMH LOM LOM LOM invalid : discarded or identifier recorded for harvester log validation service next harvesting 14
  • 19. ARIADNE Harvester ASPECT Repository OAI-PMH SPI SQI External Repository LOM LOM LOM OAI OAI-PMH LOM LOM LOM invalid : discarded or identifier recorded for harvester log validation service next harvesting 14
  • 20. ARIADNE Harvester ASPECT Repository OAI-PMH SPI SQI External Repository LOM LOM OAI OAI-PMH LOM LOM LOM LOM invalid : discarded or identifier recorded for harvester log validation service next harvesting 14
  • 21. ARIADNE Harvester ASPECT Repository OAI-PMH SPI SQI External Repository LOM LOM OAI OAI-PMH LOM LOM LOM LOM LOM invalid : discarded or identifier recorded for harvester log validation service next harvesting 14
  • 22. ARIADNE Harvester ASPECT Repository OAI-PMH SPI SQI External Repository LOM LOM OAI OAI-PMH LOM LOM LOM LOM Validation Msg invalid : discarded or identifier recorded for harvester log validation service next harvesting 14
  • 23. ARIADNE Harvester ASPECT Repository OAI-PMH SPI SQI External Repository LOM LOM OAI OAI-PMH Validation LOM Msg LOM LOM LOM invalid : discarded or identifier recorded for harvester log validation service next harvesting 14
  • 24. ARIADNE Harvester ASPECT Repository OAI-PMH SPI SQI External Repository LOM LOM OAI OAI-PMH LOM LOM LOM LOM Validation Msg invalid : discarded or identifier recorded for harvester log validation service next harvesting 14
  • 25. ARIADNE Harvester ASPECT Repository OAI-PMH SPI SQI External Repository LOM LOM OAI LOM OAI-PMH LOM LOM LOM Validation Msg invalid : discarded or identifier recorded for harvester log validation service next harvesting 14
  • 26. ARIADNE Harvester ASPECT Repository OAI-PMH SPI SQI External Repository LOM LOM LOM OAI 2 6 LOM OAI-PMH 1 Validation LOM Msg LOM LOM LOM 4 5 3 Validation Validation LOM Msg Msg harvester log validation service 15
  • 27. Validation Reports • After harvesting -> report generated and put online • report has 4 “levels” : • full log (incl. metadata) • reporting log • Grouped Errors • Error Summary 16
  • 29. References • SPI : http://ariadne.cs.kuleuven.be/lomi/index.php/ SimplePublishingInterface • IEEE LOM : http://ltsc.ieee.org/wg12/ • OAI-PMH : http://www.openarchives.org/ • Online Validation : http://ariadne.cs.kuleuven.be/validationService/ 18