Open Archives Initiative Protocol for Metadata Harvesting - Presentation Transcript
The Open Archives Initiative Protocol for Metadata Harvesting Muriel Foulonneau Tudor Research Centre [email_address] 10/2009 Dublin Core conference 2009, Seoul
The protocol was born
To create a minimal layer of interoperability between distributed repositories of scientific publications
An alternative to federated search
Networking of digital repositories
Oct 2009 [email_address]
“ OAI divides the world between data providers and service providers” Oct 2009 [email_address]
Sharing metadata : Data aggregation
The portal gathers metadata and implements its own retrieval system
Oct 2009 [email_address] Mill? Eg. Search engines, union catalogs, OAI <title>My resource</title> <date>04
The OAI framework Oct 2009 [email_address] Service provider Harvester Data provider Data provider Data provider Agregator
Mechanisms to transfer large datasets
Resumption tokens
Incremental harvesting
Portal interface Repository Data provider Repository Repository
Incremental harvest Harvester Data providers What’s new since the last time I came?
New or modified records
Deleted records
[email_address] Oct 2009 <title>My resource</title> <date>04
OAI is based on standards
HTTP protocol
XML and XML Schemas
Dublin Core
Oct 2009 [email_address]
Dublin Core MARC21 MODS Multiple representations of an object School of arts for girls Kiz Sanayi Mektebi] oai:lcoa1.loc.gov:loc.pnp/cph.3b23005 [email_address] Oct 2009
OAI repositories can be organized in sets Oct 2009 [email_address]
“ Using OAI has the advantage that only new and changed records need to be harvested, while for web crawl harvesting all records have to be re-harvested each time a harvest is run.” http://www.pictureaustralia.org/schemas/pa/index.html [email_address] Oct 2009
DRIVER – aggregation as an infrastructure [email_address] Oct 2009
Europeana [email_address] Oct 2009
IVOA – synchronization of service repositories [email_address] Oct 2009
Vth-VIth century AD (McNamee); IVth century AD (Cribiore); 1982
XVIII Dynasty Winter 2003 era of redevelopment various 2002-00 1980, refurbished 1997 China: Neolithic Period (5000 BCE-ca 1600 BCE)? 19691968 21. Nouemb. Anno. 1564 . And finisshed on the euen of thanunciacion of our said bilissid Lady falling on the wednesday the xxiiij daye of Marche. in the xix yeer of Kyng Edwarde the fourthe [1479]] 19193 xxxx Oct xx Various 1938-05-38 1963 to 1953 [not after 1579] 163[5?] [email_address] Oct 2009
Who is a metadata made for?
machine
Dc:type “Text.Correspondence.Letter”
Dc:language “wln”
human
Dc:type Correspondence
Dc:language “wallon”
Who knows ?
Dc:date “197- “
Dc:description “First ed. Cf. BM. “
[email_address] Oct 2009
Improving quality
Quality certificates for open access repositories
DINI - Deutsche Initiative für Netzwerkinformation
Best practices for OAI and shareable metadata by the Digital Library Federation and the National Science Digital Library
http://www.diglib.org/pubs/dlf108.pdf
Meeting with software providers
Test environment (eg Europeana)
Community guidelines
Oct 2009 [email_address]
Conclusion
The protocol « crossed the chasm »?
The objective is to create a network of repositories rather than networking individual resources
Lack of specific mechanism to relate resources to each other
Approach to linked data and OAI-ORE
Oct 2009 [email_address]
OAI-PMH
http://www.openarchives.org/pmh/
Best practices for OAI and shareable metadata
http://www.diglib.org/pubs/dlf108.pdf
Tim Cole and Muriel Foulonneau, Using the Open Archives Initiative Protocol for Metadata Harvesting , Libraries Unlimited, 2007
Muriel Foulonneau and Jenn Riley Metadata for Digital resources, Chandos Publishing, 2008
0 comments
Post a comment