Open for Business - Open Archives, OpenURL, RSS and the Dublin Core - Presentation Transcript
UKOLN is supported by: Open for Business Open Archives, OpenURL, RSS and the Dublin Core Andy Powell, UKOLN, University of Bath [email_address] UKSG 2004, Manchester www.bath.ac.uk a centre of expertise in digital information management www.ukoln.ac.uk
Contents
context – metasearching and open ‘context sensitive’ linking
bluffer’s guides to…
Dublin Core
OAI Protocol for Metadata Harvesting
RSS
OpenURL
discussion about the benefits, problems and issues of using these standards in the publishing ‘business’ environment…
Things to note…
this is a briefing session about technologies…
…but it is not intended to be overly technical
you should leave with an understanding of what the key technologies are – but not necessarily be expert in them!
Important
this is a briefing session…
… please feel free to ask questions as we go through!
Context: metasearching and context sensitive linking
The ‘problem’…
end-user often has access to large number of heterogeneous collections - full-text, A&I, images, video, data, etc. (e.g. thru JISC licening agreements)
however, experience of these collections is less than optimal:
end-users not aware of available content
end-user has to interact with (search or browse) multiple different Web sites to work across range of content
content ‘discovery’ services not joined-up with delivery services
Or, to put it another way…
from perspective of ‘data consumer’
need to interact with multiple collections of stuff - bibliographic, full-text, data, image, video, etc.
delivered thru multiple Web sites
few cross-collection discovery services (with exception of big search engines like Google, but still some issues with use of Google – e.g. the ‘invisible Web’, the lack of metadata, keywords with multiple meanings, etc.)
from perspective of ‘data provider’
few agreed mechanisms for disclosing availability of content
A solution…
an ‘information environment’
framework of machine-oriented services allowing the end-user to
discover , access , use , publish resources across a range of content providers
move away from lots of stand-alone Web sites...
content providers expose metadata for
searching, harvesting , alerting
develop end-user services and tools that bring stuff together…
…based on open ‘standards’
End-user services and tools
tend to focus on library portal (metasearch) tools (e.g. Encompass, MetaLib or ZPortal)
but, there will be lots of user-focused services and tools…
subject portals developed within academia
reading list and other tools in VLE (e.g. externally hosted by Sentient Discover)
commercial ‘portals’ (ISI Web of Knowledge, ingenta, Bb Resource Center, etc.)
SFX service component (or other OpenURL resolver)
personal desktop reference manager (e.g. Endnote)
Link resolvers
‘ discovery’ is only part of the problem…
in the case of books, journals, journal articles, end-user wants access to the most appropriate copy
need to join up discovery services with access/delivery services (local library OPAC, ingentaJournals, Amazon, etc.)
need localised view of available services
linking services that provide access to the most appropriate copy
user and institutional preferences, cost, access rights, location, etc.
A shared problem space
the problems outlined here are shared across sectors and communities
student or researcher looking for information from variety of bibliographic sources
lecturer searching for e-learning resources from multiple learning object repositories
researcher working across multiple data-sets and compute servers on the Grid
a GP searching the National electronic Library for Health
school child searching BBC, museum and library Web sites for homework project
someone searching across multiple e-government Sites
even someone looking to buy or sell a second-hand car…
move from focus on eprints to more generic protocol
move from OAI-specific metadata schema to mandatory support for DC
Bluffer’s guide to OAI
OAI-PMH short for Open Archives Initiative Protocol for Metadata Harvesting
a low-cost mechanism for harvesting metadata records
from ‘data providers’ to ‘service providers’
allows ‘service provider’ to say ‘give me some or all of your metadata records’
where ‘some’ is based on date-stamps, sets, metadata formats
eprint heritage but widely deployed
images, museum artefacts, learning objects, …
http://www.openarchives.org/
Bluffer’s guide to OAI
based on HTTP and XML
simple, Web-friendly, fast deployment
OAI-PMH is not a search protocol
but use can underpin search-based services based on Z39.50 or SRW or SOAP or…
OAI-PMH carries only metadata
content (e.g. full-text or image) made available separately – typically at URL in metadata
mandates simple DC as record format
but extensible to any XML format – IMS metadata, IEEE LOM, ONIX, MARC, METS, MPEG-21, etc.
Bluffer’s guide to OAI
metadata and ‘content’ often made freely available – but not a requirement
OAI-PMH can be used between closed groups
or, can make metadata available but restrict access to content in some way
underlying HTTP protocol provides
access control – e.g. HTTP BASIC
compression mechanisms (for improving performance of harvesters)
could, in theory, also provide encryption if required
Bluffer’s Guide to… RSS
Bluffer’s guide to RSS
simple XML application for sharing (syndicating) ‘news’ feeds on the Web
RDF Site Summary or Rich Site Summary (depending on who you ask)
‘news’ can be interpreted quite loosely, e.g. new items added to database
uses ‘channel’ and ‘item’ terminology
a ‘channel’ is an XML document that is made available on a Web-site – to update the channel, simply update the XML
http://www.eevl.ac.uk/rss_primer/
Bluffer’s guide to RSS
each ‘item’ has simple metadata (title, description) and URL link to resource (news story or whatever)
RSS also provides channel branding (logo, etc.)
three versions currently 0.9, 1.0 and 2.0 - 1.0 is based on RDF and is more flexible (but slightly more complex) (Also worth noting Atom – an attempt to resolve some of the tensions in RSS)
no single registry of all channels yet
Bluffer’s guide to RSS
fairly widespread usage, e.g. channels available from the BBC, Microsoft, Apple, … as well as from several academic sites and services (RDN, LTSN, …)
easy to use within ‘portals’ (e.g. uPortal)
lots of software and toolkits available – open source and commercial
Bluffer’s Guide to… OpenURLs
OpenURL roots
the context
distributed information environment (e.g. the JISC IE)
multiple A&I and other discovery services
rapidly growing e-journal collection
need to interlink available resources
the problem
links controlled by external info services
links not sensitive to user’s context (appropriate copy problem)
links dependent on vendor agreements
links don’t cover complete collection
a library perspective?
The problem
the context
distributed information environment (e.g. the JISC IE)
multiple A&I and other discovery services
rapidly growing e-journal collection
need to interlink available resources
the REAL problem
libraries have no say in linking
libraries losing core part of ‘organising information’ task
expensive collection not used optimally
users not well served
a library perspective?
The solution…
do NOT hardwire a link to a single service on the referenced item (e.g. a link from an A&I service to the corresponding full-text)
BUT rather
provide a link that transports metadata about the referenced item
to another service that is better placed to provide service links
OpenURL OpenURL resolver (link server)
Non-OpenURL linking resolution of metadata into a link (typically a URL) A&I service document delivery service link to referenced work . reference link destination link source
OpenURL linking . user-specific resolution of metadata & identifiers into services reference provision of OpenURL transportation of metadata & identifiers context-sensitive A&I service document delivery service link source OpenURL OpenURL resolver link link destination link link destination link link destination link link destination
Example 1
journal article
from Web of Science to ingenta Journals
button indicating OpenURL ‘link’ is available
OpenURL resolver offering context-sensitive links, including link to ingenta
also links to other services such as Google search for related information
Example 2
book
from University of Bath OPAC to Amazon
button indicating OpenURL ‘link’ is available
OpenURL resolver offering context-sensitive links, including link to Amazon
also links to other services such as Google search for related information
Summary… ISI Web of Science University of Bath OPAC OpenURL resolver ingenta Google Amazon OpenURL Source OpenURL Resolver OpenURL Target
Summary (2)
OpenURL source
a service that embeds OpenURLs into its user-interface in order to enable linking to most appropriate copy
OpenURL resolver
a service that links to appropriate copy(ies) and other value added services based on metadata in OpenURL
OpenURL target
a service that can be linked to from an OpenURL resolver using metadata in OpenURL
Bluffer’s guide to OpenURLs
standard for linking ‘discovery’ services to ‘delivery’ services
supports linking from OpenURL ‘source’ to OpenURL ‘target’ via OpenURL ‘resolver’
End-user source resolver target e.g. Web of Science e.g. ingenta
0 comments
Post a comment