Metadata

Loading...

Flash Player 9 (or above) is needed to view presentations.
We have detected that you do not have it on your computer. To install it, go here.

0 comments

Post a comment

    Post a comment
    Embed Video
    Edit your comment Cancel

    Notes on slide 1

    What about “standards”? Best practice says “use standards”, but it’s not always clear what a standard is Is it something: Approved by a standards body? – few of the commonly used metadata schemas and vocabularies have gone right the way through this process. Dublin Core was only finally approved as an ISO standard earlier this year (2003) – de jure standards Community endorsement? Widespread practice? – de facto standards We will try to avoid using “standards”, if we slip into it, understand that we mean “standards” in a broad sense, with quotation marks around the term

    What about “standards”? Best practice says “use standards”, but it’s not always clear what a standard is Is it something: Approved by a standards body? – few of the commonly used metadata schemas and vocabularies have gone right the way through this process. Dublin Core was only finally approved as an ISO standard earlier this year (2003) – de jure standards Community endorsement? Widespread practice? – de facto standards We will try to avoid using “standards”, if we slip into it, understand that we mean “standards” in a broad sense, with quotation marks around the term

    For some projects the choice is clear

    Metadata is derived from 2 sources…

    New European schema

    New European schema

    North American, has influenced a number of other schema

    Ask the audience to suggest reasons for controlling vocabs?

    Ask the audience to suggest ways to control vocabs

    Finally moving away from controlled vocabularies to uncontrolled “keywords” – words supplied by cataloguer or even the user It’s NOT an either/or situation - Metadata frameworks can accommodate both Opinion is divided on whether formal controlled vocab or keywords are better. The research suggests that controlled vocabularies are better than uncontrolled keywords, but a mix of both is even better

    Ask the audience to suggest ways to control vocabs

    Image file header information - BMP, PCX, JPEG, FLI/FLC, and AVI files include headers that define the image size, number of colors, and other information needed to display the image. GPS - time and location could be used for a lot of things DRM data Audio annotation - though has obvious implications for file size and hasn’t been widely accepted is a option it may useful in some circumstances DOI Are unique numbers assigned to each unique published object These numbers are stored in a server that knows where on the net to find the specific document. very useful for DRM Histories – particularly medical/legal images User generate metadata Cookies can help personalise the online experience recall specific information on subsequent visits simplifying the process of recording information- provide a convenience feature to save you time Most Web browsers automatically accept cookies Log and track usage of the system - data could be used to analyse amount and reasons for failed searches - indicating that your chosen schema or vocabulary is ineffective Also be used to enable customisation & notification - MORE HERE User contributed - talk about in more detail > next slide

    Closure and evaluation checklist: review the topics covered on the timetable. At the end of your presentation you might ask, “How will that work in your institution?” share what they feel has been the most valuable thing they have learnt, what they will do with their new learning in the coming week what they want to understand better

    Favorites, Groups & Events

    Metadata - Presentation Transcript

    1. METADATA John Hargreaves Technical Support officer JISC Digital Media
    2. Before we begin
      • Why are we going to all this trouble?
      • Staffordshire Past Track
      • http://www.staffspasttrack.org.uk
    3. Metadata
      • Definition
      • Types
      • Location
      • Schemas
      • Vocabularies
      • Examples
    4. Definition
      • Common definition:
      • Information about information
      • For my purposes:
      • Structured information about information
    5. Structure
      • Organised information is created using
      • Schemas (element sets)
      • Vocabularies (values)
    6. Purposes
        • Finding, identifying and understanding a resource Descriptive/Discovery metadata e.g. “Title”, “Subject”
        • Creating, managing and preserving a resource Administrative, Technical, and Preservation metadata e.g. “Format”, “Filesize”
    7. Purposes
        • Organising and relating resources Structural and Packaging metadata e.g. “Is part of”, “Master image location”
        • Using a resource Usage and User-contributed metadata e.g. “Published in”, “License requirements”, “User rating”
    8. Purposes
      • Funder requirements…
      • Images for Education - A JISC funded project.
      • http:// imagesforeducation.org.uk /
    9. Metadata - Attributes
        • Different ‘levels’ of a resource (e.g. item, component, collection)
        • Different ‘layers’ within a resource (e.g. physical resources, intermediaries, digital resources)
        • Things outside the resource (e.g. rights ownership)
    10. Metadata can have different origins…
        • “ Implicit” – derived from the image itself (typically technical data)
        • “ Explicit” – brought to the image (typically descriptive metadata; might be ‘legacy’ data, or newly created)
        • New metadata might be:
          • Provided by an image contributor
          • Added by a cataloguer
          • Added by a user
          • All of above
    11. … and can exist in different locations
        • Embedded within the digital resource itself
        • Held in a traditional database
        • Within an XML encoding
      <image> <ID> Jga-0019a </ID> <Title> Sanctuary of Apollo </Title> </image>
    12. “ Standards”
        • Commonly, consistently applied formats or processes; measurable; well documented; endorsed by somebody
        • JISC Digital Media recommends:
          • Where there are clear standards, use them
          • Where standards are unclear/competing, follow models of good practice within your ‘community’
          • Where there are no standards/models, create your own (and document them carefully!)
          • Watch this space…
    13. Be very aware of differences
        • How do they deal with different “layers” within a resource? (e.g. images of images of images…)
        • What purposes are they serving? (description, administration, presevervation…)
    14. Choosing, adapting, and mapping schemas
        • Ideally we ’ d pull a schema off the shelf and begin cataloguing
        • Choice is clear for some collections but difficult for others (esp. where collection spans resource types or communities)
        • Adaptation is common and generally necessary (but needs to be done carefully!)
        • You might be combining several standard schemas or developing your own and mapping to standards for particular purposes
    15. Dublin Core
        • International (ISO 15836-2003) cross-community standard for describing digital resources http://dublincore.org/
        • Concentrates on descriptive/ discovery metadata
        • “ 1:1 rule” (1 record for 1 thing)
        • Frequently adapted, mapped-to, used to achieve interoperability
      Title Creator Subject Description Publisher Contributor Date Type Format Identifier Source Language Relation Coverage Rights
    16. VRA Core
        • Visual Resources Association
        • Version 4.0 is now also available
        • Concentrates on descriptive/discovery metadata
        • For art and cultural images
        • Influenced by Dublin Core
        • 1:1 rule (Work/Image)
        • Frequently adapted
        • http:// www.vraweb.org /
      Record Type Type Title Measurements Material Technique Creator Date Location ID Number Style/Period Culture Subject Relation Description Source Rights
    17. SEPIADES
      • Safeguarding European Photographic Images for Access
        • For photographic collections
        • Very extensive, with many sub-categories
        • Covers description and administration, physical works and their digital reproductions
        • Multi-level description which can describe a whole collection at many levels at once (based on archival metadata )
        • http://www.knaw.nl/ecpa/sepia/workinggroups/wp5/sepiadestool/sepiadesdef.pdf
    18. CDWA
      • Categories for the Description of Works of Art
      • Describes art works or cultural objects
        • Museum/gallery community
        • Extensive with many sub-categories
        • Covers description and administration, original works and their reproductions
        • Can describe complex objects with multiple parts
        • Note that there is a ‘lite’ version
        • http://www.getty.edu/research/conducting_research/standards/cdwa/index.html
    19. Some Established Mappings
        • Mapping metadata schemas:
          • Getty crosswalks: http://www.getty.edu/research/conducting_research/standards/intrometadata/crosswalks.html
          • UKOLN resources: http:// www.ukoln.ac.uk /metadata/
    20. Vocabularies Image courtesy of stock.xchng
    21. Why Use Controlled Vocabularies?
        • Better retrieval
        • Improved cataloguing efficiency and consistency
        • ‘ Disambiguate’ the language (e.g. ‘bank’)
        • Put things in their place (e.g. classify, identify relationships)
        • Support interoperability (improved cross-searching and metadata sharing)
    22. Ways to Control Vocabularies
        • Data entry rules or guidelines
        • Formal subject headings
        • Thesauri
        • Classifications
        • Authority lists (people, places, events…)
        • In-house keyword lists
        • Uncontrolled cataloguer-added keywords?
        • Combination of approaches
    23. What about ‘Uncontrolled’ Keywords?
        • Made up by a cataloguer at the point of cataloguing
        • Not an either/or situation – metadata can accommodate both
        • A mix of both can assist with retrieval
    24. Alternative Vocabularies
      • Consider some more creative approaches :
        • Ask some of your users to ‘catalogue’ a representative sample of your collection
        • Get your users to do the cataloguing!
        • Get the technology to do the cataloguing! (e.g. CBIR)
        • Draw on vocabularies from other communities, traditions and disciplines
        • Use an alternative vocabulary source (e.g. a children’s encyclopaedia, book index)
    25. CBIR & Community Involvement
        • Exploring Flickr by colour http:// labs.systemone.at/retrievr /
        • Using Flickr to catalogue a collection http:// www.flickr.com/photos/Library_of_Congress /
        • Galaxy Zoo - http:// www.galaxyzoo.org /
    26. Further Support and Guidance
      • Web site: http://www.jiscdigitalmedia.ac.uk/
      • helpdesk: http://www.jiscdigitalmedia.ac.uk/helpdesk/
      • JISC Mail:
      • https://www.jiscmail.ac.uk/cgi-bin/webadmin?A1=ind0907&L=JISCDIGITALMEDIA
    SlideShare Zeitgeist 2009

    + JISC Digital MediaJISC Digital Media Nominate

    custom

    134 views, 0 favs, 2 embeds more stats

    John Hargreaves introduces the topic of metadata

    More info about this document

    © All Rights Reserved

    Go to text version

    • Total Views 134
      • 102 on SlideShare
      • 32 from embeds
    • Comments 0
    • Favorites 0
    • Downloads 7
    Most viewed embeds
    • 31 views on http://www.jiscdigitalmedia.ac.uk
    • 1 views on http://jdm-dev.ilrt.bris.ac.uk

    more

    All embeds
    • 31 views on http://www.jiscdigitalmedia.ac.uk
    • 1 views on http://jdm-dev.ilrt.bris.ac.uk

    less

    Flagged as inappropriate Flag as inappropriate
    Flag as inappropriate

    Select your reason for flagging this presentation as inappropriate. If needed, use the feedback form to let us know more details.

    Cancel
    File a copyright complaint
    Having problems? Go to our helpdesk?

    Categories