Presentation titled "Introducing a content integration process for a federation of agricultural institutional repositories". MTSR 2011, Izmir, Turkey, 12/10/2011
1. Introducing a content integration
process for a federation of
agricultural institutional repositories
V. Protonotarios (1), L. Gavrilut (1), I. Athanasiadis (1),
Hatzakis (1), M-A. Sicilia (2)
(1) Greek Research & Technology Network (GRNET)
(2) University of Alcala, Computer Science Department
4th International Workshop on Metadata and Semantics for
Agriculture, Food and Environment
MTSR 2011, October 12th, 2011, Izmir
3. About VOA3R Project
What is VOA3R?
◦ Virtual Open Access Agriculture &
Aquaculture Repository
◦ 36-month CIP-ICT-PSP EU project
VOA3R aims to:
Improve access to EU agriculture &
aquaculture open access research
results
4. About VOA3R Project
What is VOA3R about?
Sharing Scientific and Scholarly
Research related to Agriculture, Food
& Environment, using (among others):
◦ A federated repository fed with scholarly content …
◦ A social platform which makes use of ….
◦ A set of domain ontologies
◦ and other integrated components…
5. About VOA3R Project
What is VOA3R going to develop?
Among others, the VOA3R federated
repository, which will harvest scholarly
content from institutional repositories.
How is this going to happen?
VOA3R will develop a metadata application profile (AP)
based on the requirements of the project’s content
providers
6. Where to find VOA3R?
1. Website: http://www.voa3r.eu
2. Social Platform: currently in beta
3. VOA3R Repository Tool (Confolio)
9. What about the content?
Scholarly content from institutional
repositories on agriculture and
aquaculture will be aggregated to
VOA3R repository
(content = metadata descriptions)
VOA3R Content Providers currently
use a wide variety of metadata
standards
(e.g. AGRIS, Dublin Core)
10. What about the content?
The issue:
How to align all these different metadata
APs?
The solution:
To work on a common AP (VOA3R AP),
based on the requirements of the
VOA3R content providers
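One way to picture the alignment is as a crosswalk from each provider's native schema to the common AP. The sketch below is a minimal illustration only: the source element names are examples, and the target names (`title`, `creator`, `subject`) are assumptions, not the actual VOA3R AP elements.

```python
# Hypothetical crosswalks: source element name -> common AP element name.
# The target names are illustrative and are NOT taken from the VOA3R AP.
CROSSWALKS = {
    "dublin_core": {
        "dc:title": "title",
        "dc:creator": "creator",
        "dc:subject": "subject",
    },
    "agris": {
        "dc:title": "title",
        "ags:creatorPersonal": "creator",
        "ags:subjectClassification": "subject",
    },
}


def to_common_ap(record, source_schema):
    """Map a flat metadata record onto the common element names.

    Elements with no entry in the crosswalk are returned separately so that
    nothing is silently dropped during integration.
    """
    crosswalk = CROSSWALKS[source_schema]
    mapped, unmapped = {}, {}
    for element, value in record.items():
        target = crosswalk.get(element)
        if target is None:
            unmapped[element] = value
        else:
            mapped[target] = value
    return mapped, unmapped


# Example record with made-up values.
agris_record = {
    "dc:title": "Soil salinity in irrigated fields",
    "ags:creatorPersonal": "Doe, J.",
    "ags:citationTitle": "Example Journal",
}
mapped, unmapped = to_common_ap(agris_record, "agris")
```

Returning the unmapped elements alongside the mapped ones makes it easy to see which provider requirements the common AP does not yet cover.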
15. Content Population Methodology
Controlled Testing phase (7-9/2011)
Enrichment of test metadata records using
Confolio
Phase 1 (10-12/2011)
Integration of repositories using OAI-PMH
Phase 2 (1-8/2012)
Integration of repositories with no OAI-PMH
support
Phase 3 (9/2012 – 5/2013)
Content population with content from external
collaborators
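For the OAI-PMH phase, harvesting comes down to issuing `ListRecords` requests and following the `resumptionToken` until the provider returns none. The sketch below is not the VOA3R harvester: it only builds the request URL and parses a response, and the base URL, identifier and sample response are made up for illustration.

```python
import xml.etree.ElementTree as ET
from urllib.parse import urlencode

OAI_NS = {
    "oai": "http://www.openarchives.org/OAI/2.0/",
    "dc": "http://purl.org/dc/elements/1.1/",
}


def list_records_url(base_url, metadata_prefix="oai_dc", resumption_token=None):
    """Build an OAI-PMH ListRecords request URL."""
    if resumption_token:
        # resumptionToken is an exclusive argument: it replaces metadataPrefix.
        params = {"verb": "ListRecords", "resumptionToken": resumption_token}
    else:
        params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
    return base_url + "?" + urlencode(params)


def parse_list_records(xml_text):
    """Return (records, resumption_token) from a ListRecords response."""
    root = ET.fromstring(xml_text)
    records = []
    for rec in root.iterfind(".//oai:record", OAI_NS):
        identifier = rec.findtext("oai:header/oai:identifier", namespaces=OAI_NS)
        title = rec.findtext(".//dc:title", namespaces=OAI_NS)
        records.append({"identifier": identifier, "title": title})
    token = root.findtext(".//oai:resumptionToken", namespaces=OAI_NS)
    # An empty resumptionToken element marks the last page of the list.
    return records, (token.strip() if token and token.strip() else None)


# Made-up response used only to exercise the parser.
SAMPLE_RESPONSE = """<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <ListRecords>
    <record>
      <header><identifier>oai:example.org:1</identifier></header>
      <metadata>
        <oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"
                   xmlns:dc="http://purl.org/dc/elements/1.1/">
          <dc:title>Sample record</dc:title>
        </oai_dc:dc>
      </metadata>
    </record>
    <resumptionToken>token-2</resumptionToken>
  </ListRecords>
</OAI-PMH>"""

records, next_token = parse_list_records(SAMPLE_RESPONSE)
```

Repositories without OAI-PMH support (Phase 2) cannot be reached this way and would need a different import route, such as a file export.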
17. Overview of the Process
1. Uploading/Integration
Pre-Check against Core Criteria (each criterion is answered yes / no)
1. Accessibility under the specified technical criteria.
The provider confirms that the resource can be opened or accessed through the provided URL (link).
2. Appropriateness against violence, pornography, racism, etc.
The provider confirms that the resource does not contain any violent, pornographic or racist content/information.
3. Relation of the metadata/content to Agriculture & Aquaculture.
The provider confirms that the resource is relevant to agriculture or aquaculture.
4. The IPR (intellectual property rights) rules do not prohibit promotion of the resource through the VOA3R network.
The provider confirms that the resource is free of any IPR restrictions that would prevent its promotion/description within the VOA3R network.
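The pre-check can be read as four yes/no confirmations recorded per resource. The sketch below is illustrative only: the field names are hypothetical, and the rule that a single "no" holds a resource back is an assumption, since the slides do not state how the answers are combined.

```python
from dataclasses import dataclass


@dataclass
class PreCheck:
    """A provider's yes/no answers to the four core criteria."""
    accessible: bool    # 1. opens through the provided URL
    appropriate: bool   # 2. no violent, pornographic or racist content
    relevant: bool      # 3. relevant to agriculture or aquaculture
    ipr_cleared: bool   # 4. no IPR restriction on promotion in the network

    def failed_criteria(self):
        """Names of the criteria answered "no", in grid order."""
        return [name for name, ok in vars(self).items() if not ok]

    def passes(self):
        # Assumption: one "no" is enough to hold the resource back.
        return not self.failed_criteria()


check = PreCheck(accessible=True, appropriate=True, relevant=False, ipr_cleared=True)
```

Listing the failed criteria, rather than returning a bare pass/fail, gives the provider something concrete to fix before resubmitting.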
21. Scenario of Use: Testing
Phase
Confolio was used by the VOA3R
content providers as a controlled
environment for creating the metadata
records of their resources:
24. Scenario of Use: Testing
Phase
Validation:
Pre-Check against Core Criteria (each criterion is answered yes / no)
1. Accessibility under the specified technical criteria.
The provider confirms that the resource can be opened or accessed through the provided URL (link).
2. Appropriateness against violence, pornography, racism, etc.
The provider confirms that the resource does not contain any violent, pornographic or racist content/information.
3. Relation of the metadata/content to Agriculture & Aquaculture.
The provider confirms that the resource is relevant to agriculture or aquaculture.
4. The IPR (intellectual property rights) rules do not prohibit promotion of the resource through the VOA3R network.
The provider confirms that the resource is free of any IPR restrictions that would prevent its promotion/description within the VOA3R network.
25. Scenario of Use: Testing
Phase
Quality Review/Assessment:
Grid for VOA3R Subject Experts (each criterion is rated from 1 to 5)
1. Clarity & Relevance: Is the content clear and relevant to the agricultural environment?
1 (not clear & relevant) to 5 (absolutely clear & relevant)
2. Quality: Is the content of high quality in terms of balanced presentation of ideas and appropriate level of detail?
1 (no) to 5 (yes)
3. Appropriateness: Does the resource use appropriate vocabulary, language and concepts for the target age group it is addressing?
1 (no, the resource uses inappropriate vocabulary) to 5 (yes, the resource uses appropriate vocabulary)
4. Motivation: Does the content motivate its target group to read more about the subject it presents?
1 (the content is not motivating) to 5 (the content is motivating)
5. Veracity & accuracy: Is the content true and accurate regarding the agricultural environment?
1 (no) to 5 (yes)
6. Updated: Is the content up to date, or are the data and information presented outdated?
1 (information is outdated) to 5 (information is up to date)
7. Accessibility: How accessible is the content to the target group of people?
1 (poorly accessible) to 5 (fully accessible)
8. Reusability: Can the content be used again in another environment and be understood by people with different backgrounds?
1 (the content cannot be reused) to 5 (the content can be reused)
Final recommendation (accept without modification / accept with modification / reject): Please give your final mark and a short comment to justify it.
Comment to the submitter:
Comment to the VOA3R federation:
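A completed grid can be captured as eight scores plus the reviewer's recommendation. The sketch below only validates a review and reports the mean score. The criterion keys are hypothetical names, and the mean is an illustrative summary: the slides leave the final recommendation to the expert and define no formula for it.

```python
from statistics import mean

# Hypothetical keys for the eight criteria of the grid, in grid order.
CRITERIA = (
    "clarity_relevance", "quality", "appropriateness", "motivation",
    "veracity_accuracy", "updated", "accessibility", "reusability",
)
RECOMMENDATIONS = (
    "accept without modification",
    "accept with modification",
    "reject",
)


def summarize_review(scores, recommendation):
    """Validate one expert review and return its mean score and recommendation."""
    missing = [c for c in CRITERIA if c not in scores]
    if missing:
        raise ValueError(f"missing scores for: {missing}")
    out_of_range = [c for c in CRITERIA if scores[c] not in range(1, 6)]
    if out_of_range:
        raise ValueError(f"scores must be between 1 and 5: {out_of_range}")
    if recommendation not in RECOMMENDATIONS:
        raise ValueError(f"unknown recommendation: {recommendation!r}")
    # The recommendation is the reviewer's own judgment; it is not derived
    # from the mean score.
    return {
        "mean_score": mean(scores[c] for c in CRITERIA),
        "recommendation": recommendation,
    }


# Made-up scores for illustration.
review = dict(zip(CRITERIA, (5, 4, 4, 3, 5, 4, 4, 3)))
summary = summarize_review(review, "accept with modification")
```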
26. Conclusions
Despite the wealth of scholarly content found
in institutional repositories, the use of different
metadata APs makes that content hard to aggregate
Agreeing on a common metadata format is a
challenge, but the VOA3R AP aims to achieve
this goal
The design and implementation of a well-
defined content population/integration
process is a crucial component in populating
a repository