Integrated Access and
Shareable Metadata
Jenn Riley
Metadata Librarian
IU Digital Library Program
6/25/07 SEI 2
Building “Good digital
collections”*
 Interoperable – with the important goal
of cross-collection searching...
6/25/07 SEI 3
Metadata is a view of the resource
 There is no monolithic, one-size-fits-all
metadata record
 Metadata fo...
6/25/07 SEI 4
Choice of vocabularies as a
view
 Names
 LCNAF: Michelangelo Buonarroti, 1475-
1564
 ULAN: Buonarroti, Mi...
6/25/07 SEI 5
Sharing metadata: Federated search
The distributed databases are searched directly.
Mill?
<title>My
resource...
6/25/07 SEI 6
Sharing metadata: Data aggregation
The user searches a pre-aggregated database of metadata from
diverse sour...
6/25/07 SEI 7
OAI-PMH Structure
 Intentionally designed to be simple
 Data providers
 Have metadata they want to share
...
6/25/07 SEI 8
Data Providers
 Set up a server that responds to harvesting
requests
 Required to expose metadata in simpl...
6/25/07 SEI 9
Service Providers
 Harvest and store metadata
 Generally provide search/browse access to
this metadata
 C...
6/25/07 SEI 10
Finding the right balance
 Metadata providers know the materials
 Document encoding schemes and controlle...
6/25/07 SEI 11
Why share metadata?
 Benefits to users
 One-stop searching
 Aggregation of subject-specific resources
 ...
6/25/07 SEI 12
Why share metadata with OAI?
 “Low barrier” protocol
 Shares metadata only, not content,
simplifying righ...
6/25/07 SEI 13
Three possible architectures
OAIHarvester
Digital asset management system
Metadata
creation
module
OAI data...
6/25/07 SEI 14
What does this record describe?
identifier: http://name.university.edu/IC-FISH3IC-X0802]1004_112
publisher:...
6/25/07 SEI 15
6/25/07 SEI 16
Shareable metadata defined
 Metadata for aggregation with records from other
institutions
 Promotes searc...
6/25/07 SEI 17
6 Cs and lots of Ss of shareable
metadata
Content
Consistency
Coherence
Context
Communication
Conformance
M...
6/25/07 SEI 18
Content
 Choose appropriate vocabularies
 Choose appropriate granularity
 Make it obvious what to displa...
6/25/07 SEI 19
Consistency
 Records in a set should all reflect the same
practice
 Fields used
 Vocabularies
 Syntax e...
6/25/07 SEI 20
Coherence
 Record should be self-explanatory
 Values must appear in appropriate elements
 Repeat fields ...
6/25/07 SEI 21
Context
 Include information not used locally
 Exclude information only used locally
 Current safe assum...
6/25/07 SEI 22
Communication
 Method for creating shared records
 Vocabularies and content standards used in
shared reco...
6/25/07 SEI 23
Conformance to Standards
 Metadata standards (and not just DC)
 Vocabulary and encoding standards
 Descr...
6/25/07 SEI 24
Before you share…
 Check your metadata
 Appropriate view?
 Consistent?
 Context provided?
 Does the ag...
6/25/07 SEI 25
The reality of sharing metadata
 We can no longer afford to only think about our local
users
 Creating sh...
Upcoming SlideShare
Loading in...5
×

Integrated Access and Shareable Metadata

157

Published on

Riley, Jenn. "Integrated Access and Shareable Metadata." Summer Educational Institute for Visual Resources and Image Management, Indiana University June 25, 2007.

Published in: Education
0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total Views
157
On Slideshare
0
From Embeds
0
Number of Embeds
1
Actions
Shares
0
Downloads
1
Comments
0
Likes
0
Embeds 0
No embeds

No notes for slide

Integrated Access and Shareable Metadata

  1. 1. Integrated Access and Shareable Metadata Jenn Riley Metadata Librarian IU Digital Library Program
  2. 2. 6/25/07 SEI 2 Building “Good digital collections”*  Interoperable – with the important goal of cross-collection searching  Persistent – reliably accessible  Re-usable – repositories of digital objects that can be used for multiple purposes *Institute for Museum and Library Services. A Framework of Guidance for Building Good Digital Collections. Washington, D.C.: Institute for Museum and Library Services, November 2001. http://www.niso.org/framework/Framework2.html
  3. 3. 6/25/07 SEI 3 Metadata is a view of the resource  There is no monolithic, one-size-fits-all metadata record  Metadata for the same thing is different depending on use and audience
  4. 4. 6/25/07 SEI 4 Choice of vocabularies as a view  Names  LCNAF: Michelangelo Buonarroti, 1475- 1564  ULAN: Buonarroti, Michelangelo  Places  LCSH: Jakarta (Indonesia)  TGN: Jakarta  Subjects  LCSH: Neo-impressionism (Art)  AAT: Pointillism
  5. 5. 6/25/07 SEI 5 Sharing metadata: Federated search The distributed databases are searched directly. Mill? <title>My resource</ title> <date>04 <title>My resource</ title> <date>04 <title>My resource</ title> <date>04 For Example: Z39.50, SRU
  6. 6. 6/25/07 SEI 6 Sharing metadata: Data aggregation The user searches a pre-aggregated database of metadata from diverse sources. Mill? <title>My resource</ title> <date>04 For Example: Search engines, union catalogs, OAI-PMH
  7. 7. 6/25/07 SEI 7 OAI-PMH Structure  Intentionally designed to be simple  Data providers  Have metadata they want to share  “Expose” their metadata to be harvested  Service providers  Harvest metadata from data providers  Provide searching of harvested metadata from multiple sources  Can also provide other value-added services
  8. 8. 6/25/07 SEI 8 Data Providers  Set up a server that responds to harvesting requests  Required to expose metadata in simple Dublin Core (DC) format  Can also expose metadata in any other format expressible with an XML schema
  9. 9. 6/25/07 SEI 9 Service Providers  Harvest and store metadata  Generally provide search/browse access to this metadata  Can be general or domain-specific  Can choose to collect metadata in formats other than DC  Generally link out to holding institutions for access to digital content  OAIster is a good example
  10. 10. 6/25/07 SEI 10 Finding the right balance  Metadata providers know the materials  Document encoding schemes and controlled vocabularies  Document practices  Ensure record validity  Aggregators have the processing power  Format conversion  Reconcile known vocabularies  Normalize data  Batch metadata enhancement
  11. 11. 6/25/07 SEI 11 Why share metadata?  Benefits to users  One-stop searching  Aggregation of subject-specific resources  Benefits to institutions  Increased exposure for collections  Broader user base  Bringing together of distributed collections Don’t expect users will know about your collection and remember to visit it.
  12. 12. 6/25/07 SEI 12 Why share metadata with OAI?  “Low barrier” protocol  Shares metadata only, not content, simplifying rights issues  Same effort on your part to share with one or a hundred service providers (basically)  Wide adoption in the cultural heritage sector  Quickly eclipsed older methods such as Z39.50
  13. 13. 6/25/07 SEI 13 Three possible architectures OAIHarvester Digital asset management system Metadata creation module OAI data provider module Transformation Metadata creation system Stand-alone OAI data provider Transformation DC QDC MODS MARCXML DC MARCXML QDC MODS Metadata creation module Static Repository Gateway Transformation
  14. 14. 6/25/07 SEI 14 What does this record describe? identifier: http://name.university.edu/IC-FISH3IC-X0802]1004_112 publisher: Museum of Zoology, Fish Field Notes format: jpeg rights: These pages may be freely searched and displayed. Permission must be received for subsequent distribution in print or electronically. type: image subject: 1926-05-18; 1926; 0812; 18; Trib. to Sixteen Cr. Trib. Pine River, Manistee R.; JAM26-460; 05; 1926/05/18; R10W; S26; S27; T21N language: UND source: Michigan 1926 Metzelaar, 1926--1926; description: Flora and Fauna of the Great Lakes Region Example courtesy of Sarah Shreeves, University of Illinois at Urbana-Champaign
  15. 15. 6/25/07 SEI 15
  16. 16. 6/25/07 SEI 16 Shareable metadata defined  Metadata for aggregation with records from other institutions  Promotes search interoperability - “the ability to perform a search over diverse sets of metadata records and obtain meaningful results” (Priscilla Caplan)  Is human understandable outside of its local context  Is useful outside of its local context  Preferably is machine processable
  17. 17. 6/25/07 SEI 17 6 Cs and lots of Ss of shareable metadata Content Consistency Coherence Context Communication Conformance Metadata standards Vocabulary and encoding standards Descriptive content standards Technical standards
  18. 18. 6/25/07 SEI 18 Content  Choose appropriate vocabularies  Choose appropriate granularity  Make it obvious what to display  Make it obvious what to index  Exclude unnecessary “filler”  Make it clear what links point to
  19. 19. 6/25/07 SEI 19 Consistency  Records in a set should all reflect the same practice  Fields used  Vocabularies  Syntax encoding schemes  Allows aggregators to apply same enhancement logic to an entire group of records
  20. 20. 6/25/07 SEI 20 Coherence  Record should be self-explanatory  Values must appear in appropriate elements  Repeat fields instead of “packing” to explicitly indicate where one value ends and another begins
  21. 21. 6/25/07 SEI 21 Context  Include information not used locally  Exclude information only used locally  Current safe assumptions  Users discover material through shared record  User then delivered to your environment for full context  Context driven by intended use
  22. 22. 6/25/07 SEI 22 Communication  Method for creating shared records  Vocabularies and content standards used in shared records  Record updating practices and schedules  Accrual practices and schedules  Existence of analytical or supplementary materials  Provenance of materials
  23. 23. 6/25/07 SEI 23 Conformance to Standards  Metadata standards (and not just DC)  Vocabulary and encoding standards  Descriptive content standards (AACR2, CCO, DACS)  Technical standards (XML, Character encoding, etc)
  24. 24. 6/25/07 SEI 24 Before you share…  Check your metadata  Appropriate view?  Consistent?  Context provided?  Does the aggregator have what they need?  Documented? Can a stranger tell you what the record describes?
  25. 25. 6/25/07 SEI 25 The reality of sharing metadata  We can no longer afford to only think about our local users  Creating shareable metadata will require more work on your part  Creating shareable metadata will require our vendors to support (more) standards  Creating shareable metadata is no longer an option, it’s a requirement  Indiana is moving toward a portal of Indiana-related digital content – you should be planning for this now
  1. A particular slide catching your eye?

    Clipping is a handy way to collect important slides you want to go back to later.

×