Successfully reported this slideshow.
We use your LinkedIn profile and activity data to personalize ads and to show you more relevant ads. You can change your ad preferences anytime.
Extending Models for Controlled Vocabularies to Classification Systems: Modelling DDC with FRSAD Joan S. Mitchell  OCLC, I...
The big question Can the FRSAD conceptual model be extended beyond subject authority data (its original focus) to model cl...
Outline <ul><li>From Knowledge Organisation Systems (KOS) to data and conceptual models </li></ul><ul><li>FRSAD conceptual...
DDC UDC LCSH FRSAD FRAD FRBR TEST* * Thesaurus of engineering and scientific terms ISO 2788 (1974)  Guidelines for the Est...
From Knowledge Organisation Systems  to Data and Conceptual Models: Modelling efforts Classifi-cation Subject headings FRS...
The  “FRBR family”  <ul><li>FRBR: the original framework </li></ul><ul><ul><li>All entities, focusing on Group 1 entities:...
The FRBR family models: main entities and relationships FRBR FRAD FRSAD
2. FRSAD Conceptual Model 2.1 The core of the FRSAD conceptual model
FRSAD  –  generalisation of FRBR
The core of the FRSAD conceptual model  FRSAD Part 1:   WORK  has as subject  THEMA /  THEMA  is subject of  WORK FRSAD Pa...
Note: in a given controlled vocabulary and within a domain, a  nomen  should be an appellation of only one  thema . The  ‘...
<ul><ul><li>NOMEN  =  any sign or sequence of signs (alphanumeric characters, symbols,  </li></ul></ul><ul><ul><li>sound, ...
terms  ( preferred  & non-preferred) notations terms of pre-coordinated strings category labels  (w or w/t notations) term...
2.2  Relationships (1) Thema-to-thema  relationships <ul><li>Hierarchical </li></ul><ul><ul><li>The generic relationship <...
<ul><li>Equivalence </li></ul><ul><ul><li>Two  nomens  are considered equivalent only if they are appellations of the same...
2.3 Attributes <ul><li>Some general attributes of  thema  and  nomen  are proposed  </li></ul><ul><ul><li>(1) thema  attri...
Nomen  attributes  <ul><ul><li>Type of nomen (identifier, controlled name, …) </li></ul></ul><ul><ul><li>Scheme (LCSH, DDC...
2.4 The importance of the  THEMA-NOMEN  model to the subject authority data <ul><li>Separating what are usually called  co...
3.  FRSAD model for classification systems <ul><li>Each class corresponds to a  thema </li></ul><ul><li>Notation associate...
4. DDC case study
Thema: Class 025.04
Nomens: DDC number, Full caption, URI 025.04 Computer science, information & general works/Library & information sciences/...
Thema: Any topic co-extensive with the full meaning of the class topics that are functionally equivalent to the class
Scope note: Text describing or defining thema or specifying scope within particular system Scope note (≠ thema/class) Scop...
Thema-to-thema relationships associative  relationship  associative  relationship  (poly)hierarchical relationship
Alternative nomens: Relative Index terms with equivalence relationship to class
equivalence relationship ? ? ?  ? ? ? ? ? scope note SN SN SN SN ? unknown relationship ?
Derived alternative nomens 150 ## $a Databanks 260 ## $i see also $a Databases
equivalence relationship ? ? ?  ? ? ? scope note SN SN SN SN ? unknown relationship Derived
5. Findings and limitations <ul><li>FRSAD conceptual model appears to accommodate DDC data at a broad level </li></ul><ul>...
6. Future work <ul><li>Specify all relationships between Relative Index terms and classes (see earlier work by Green, Mitc...
equivalence relationship ? ? ?  ? ? ? scope note SN SN SN SN ? unknown relationship Derived
6. Future work <ul><li>Specify all relationships between Relative Index terms and classes (see earlier work by Green, Mitc...
French DDC 22 German DDC 22 Italian DDC 22 Swedish Mixed  DDC 22 Italian  A14 Vietnamese A14 French  A14 Spanish A14 Hebre...
Mappings and crosswalks DDC LCSH MeSH SWD RAMEAU SAB BISAC SEARS CSH UDC LCC SAO Nuovo Soggettario
Thema-to-thema relationships across languages:  Class 025.04 (22/swe) = Class 025.04 (22)
Thema-to-thema relationships (Complex case): T2—43414  (22) = T2—43414 (22/ger), but . . . <ul><li>T2—43414 Giessen distri...
6. Future work <ul><li>Specify all relationships between Relative Index terms and classes (see earlier work by Green, Mitc...
Upcoming SlideShare
Loading in …5
×

Extending models for controlled vocabularies to classification systems: modelling DDC with FRSAD

1,274 views

Published on

Mitchell, Joan S., Marcia Lei Zeng, and Maja Zumer. Presented at the International UDC Seminar 2011, Classification & Ontology, The Hague, The Netherlands, Sept. 19-20, 2011.

Published in: Education, Technology
  • Be the first to comment

Extending models for controlled vocabularies to classification systems: modelling DDC with FRSAD

  1. 1. Extending Models for Controlled Vocabularies to Classification Systems: Modelling DDC with FRSAD Joan S. Mitchell OCLC, Inc.   Marcia Lei Zeng Kent State University   Maja Žumer University of Ljubljana , Slovenia  
  2. 2. The big question Can the FRSAD conceptual model be extended beyond subject authority data (its original focus) to model classification data?
  3. 3. Outline <ul><li>From Knowledge Organisation Systems (KOS) to data and conceptual models </li></ul><ul><li>FRSAD conceptual model </li></ul><ul><li>FRSAD model for classification systems </li></ul><ul><li>DDC case study </li></ul><ul><li>Findings and limitations </li></ul><ul><li>Future work </li></ul>
  4. 4. DDC UDC LCSH FRSAD FRAD FRBR TEST* * Thesaurus of engineering and scientific terms ISO 2788 (1974) Guidelines for the Establishment and Development of Monolingual Thesauri ISO 5964 (1985) Guidelines for the Establishment and Development of Multilingual Thesauri ISO 2788* ISO5964* SKOS OWL 1. From Knowledge Organisation Systems to Data and Conceptual Models: Timeline 2009 1998 2010 1876 1905 1898 1967 1974 1985 2004-2009
  5. 5. From Knowledge Organisation Systems to Data and Conceptual Models: Modelling efforts Classifi-cation Subject headings FRSAD FRAD FRBR ISO 2788 ISO5964 SKOS OWL Classifi-cation Thesauri Thesauri KOS KOS ontology Thesauri: mostly comply with ISO 2788 and ISO 5964. Subject heading schemes : adopted the basic structure of the thesaurus since 1990s. Classification systems : implemented different practices and are usually constructed according to specific conventions and examples. 2009 1998 2010 1876 1905 1898 1967 1974 1985 2004-2009
  6. 6. The “FRBR family” <ul><li>FRBR: the original framework </li></ul><ul><ul><li>All entities, focusing on Group 1 entities: work, expression, manifestation, item </li></ul></ul><ul><ul><li>Published 1998 </li></ul></ul><ul><li>FRAD: Functional Requirements for Authority Data </li></ul><ul><ul><li>Focusing on Group 2 entities: person, corporate body, family </li></ul></ul><ul><ul><li>Published 2009 </li></ul></ul><ul><li>FRSAD: Functional Requirements for Subject Authority Data </li></ul><ul><ul><li>Focusing on Group3 entities </li></ul></ul><ul><ul><li>FRSAR WG established in 2005 </li></ul></ul><ul><ul><li>Published 2010 </li></ul></ul>
  7. 7. The FRBR family models: main entities and relationships FRBR FRAD FRSAD
  8. 8. 2. FRSAD Conceptual Model 2.1 The core of the FRSAD conceptual model
  9. 9. FRSAD – generalisation of FRBR
  10. 10. The core of the FRSAD conceptual model FRSAD Part 1: WORK has as subject THEMA / THEMA is subject of WORK FRSAD Part 2: THEMA has appellation NOMEN / NOMEN is appellation of THEMA NOMEN = any sign or sequence of signs (alphanumeric characters, symbols, sound, etc.) that a thema is known by, referred to or addressed as
  11. 11. Note: in a given controlled vocabulary and within a domain, a nomen should be an appellation of only one thema . The ‘has appellation’ relationship between thema and nomen in a controlled vocabulary:
  12. 12. <ul><ul><li>NOMEN = any sign or sequence of signs (alphanumeric characters, symbols, </li></ul></ul><ul><ul><li>sound, etc.) that a thema is known by, referred to or addressed as . </li></ul></ul>Source: STN Database Summary Sheet: USAN (The USP Dictionary of U.S. Adopted Names and International Drug Names) An example of nomens in an authority record for a chemical compound Nomen 1-8 Nomen 9
  13. 13. terms ( preferred & non-preferred) notations terms of pre-coordinated strings category labels (w or w/t notations) terms or identifiers … … <ul><li>thesauri: </li></ul><ul><li>classification schemes: </li></ul><ul><li>subject heading systems: </li></ul><ul><li>taxonomies: </li></ul><ul><li>controlled lists: </li></ul><ul><li>… … </li></ul>themas represented by: Nomens in different types of KOS
  14. 14. 2.2 Relationships (1) Thema-to-thema relationships <ul><li>Hierarchical </li></ul><ul><ul><li>The generic relationship </li></ul></ul><ul><ul><li>The hierarchical whole-part relationship </li></ul></ul><ul><ul><li>The instance relationship </li></ul></ul><ul><ul><li>Other hierarchical relationships </li></ul></ul><ul><li>Associative </li></ul><ul><ul><li>[most commonly considered categories are listed in the report] </li></ul></ul><ul><li>Other thema- to -thema relationships are domain- or implementation-dependent </li></ul>
  15. 15. <ul><li>Equivalence </li></ul><ul><ul><li>Two nomens are considered equivalent only if they are appellations of the same thema in a controlled vocabulary. </li></ul></ul><ul><li>Partitive </li></ul><ul><ul><li>An instance of a nomen may have parts. </li></ul></ul><ul><ul><li>A whole-part relationship may exist between a nomen and its components. </li></ul></ul>2.2 Relationships (2) Nomen-to-nomen relationships
  16. 16. 2.3 Attributes <ul><li>Some general attributes of thema and nomen are proposed </li></ul><ul><ul><li>(1) thema attributes: - type of thema </li></ul></ul><ul><ul><ul><ul><li>In an implementation themas can be organized based on category, kind, or type </li></ul></ul></ul></ul><ul><ul><li>- scope note </li></ul></ul><ul><ul><li>- In an implementation additional attributes may be defined/recorded </li></ul></ul><ul><ul><li>(2) nomen attributes: see next slide  </li></ul></ul>
  17. 17. Nomen attributes <ul><ul><li>Type of nomen (identifier, controlled name, …) </li></ul></ul><ul><ul><li>Scheme (LCSH, DDC, UDC, ULAN, ISO 8601…) </li></ul></ul><ul><ul><li>Reference source of nomen (Encyclopaedia Britannica…) </li></ul></ul><ul><ul><li>Representation of nomen (alphanumeric, sound, visual,...) </li></ul></ul><ul><ul><li>Language of nomen (English, Japanese, Slovenian,…) </li></ul></ul><ul><ul><li>Script of nomen (Cyrillic, Thai, Chinese-simplified,…) </li></ul></ul><ul><ul><li>Script conversion (Pinyin, ISO 3601, Romanisation of Japanese…) </li></ul></ul><ul><ul><li>Form of nomen (full name, abbreviation, formula…) </li></ul></ul><ul><ul><li>Time of validity of nomen (until xxxx, after xxxx, from… to …) </li></ul></ul><ul><ul><li>Audience (English-speaking users, scientists, children …) </li></ul></ul><ul><ul><li>Status of nomen (provisional, accepted, official,...) </li></ul></ul><ul><ul><li>Note: examples of attribute values in parenthesis </li></ul></ul><ul><ul><li>- In an implementation additional attributes may be defined </li></ul></ul>include but not limited to:
  18. 18. 2.4 The importance of the THEMA-NOMEN model to the subject authority data <ul><li>Separating what are usually called concepts (or topics , subjects, classes [of concepts] ) from what they are known by, referred to, or addressed as </li></ul><ul><li>A general abstract model, not limited to any particular domain or implementation </li></ul><ul><li>Potential for interoperability within the library field and beyond </li></ul>
  19. 19. 3. FRSAD model for classification systems <ul><li>Each class corresponds to a thema </li></ul><ul><li>Notation associated with the class is the nomen </li></ul><ul><li>Thema is the full category description of the class </li></ul><ul><li>Nomen is the symbol (or surrogate) used to represent the full category description </li></ul>
  20. 20. 4. DDC case study
  21. 21. Thema: Class 025.04
  22. 22. Nomens: DDC number, Full caption, URI 025.04 Computer science, information & general works/Library & information sciences/Operations of libraries, archives, information centers/Information storage and retrieval systems http://dewey.info/class/025.04/
  23. 23. Thema: Any topic co-extensive with the full meaning of the class topics that are functionally equivalent to the class
  24. 24. Scope note: Text describing or defining thema or specifying scope within particular system Scope note (≠ thema/class) Scope note (≠ thema/class)
  25. 25. Thema-to-thema relationships associative relationship associative relationship (poly)hierarchical relationship
  26. 26. Alternative nomens: Relative Index terms with equivalence relationship to class
  27. 27. equivalence relationship ? ? ? ? ? ? ? ? scope note SN SN SN SN ? unknown relationship ?
  28. 28. Derived alternative nomens 150 ## $a Databanks 260 ## $i see also $a Databases
  29. 29. equivalence relationship ? ? ? ? ? ? scope note SN SN SN SN ? unknown relationship Derived
  30. 30. 5. Findings and limitations <ul><li>FRSAD conceptual model appears to accommodate DDC data at a broad level </li></ul><ul><li>Topic-to-topic relationships require further study </li></ul><ul><li>The study did not consider the usefulness of classification data modelled using FRSAD in real-world applications </li></ul>
  31. 31. 6. Future work <ul><li>Specify all relationships between Relative Index terms and classes (see earlier work by Green, Mitchell) </li></ul>
  32. 32. equivalence relationship ? ? ? ? ? ? scope note SN SN SN SN ? unknown relationship Derived
  33. 33. 6. Future work <ul><li>Specify all relationships between Relative Index terms and classes (see earlier work by Green, Mitchell) </li></ul><ul><li>Investigate DDC translations and mappings in context of model </li></ul>
  34. 34. French DDC 22 German DDC 22 Italian DDC 22 Swedish Mixed DDC 22 Italian A14 Vietnamese A14 French A14 Spanish A14 Hebrew A14 200 Religion Class Guide (French) DDC 22 A14 DDC Sach-Gruppen (German) DDC Summaries English French Italian Rhaeto-Romansch Afrikaans Arabic Chinese French German Norwegian Portuguese Russian Scots Gaelic Spanish Swedish
  35. 35. Mappings and crosswalks DDC LCSH MeSH SWD RAMEAU SAB BISAC SEARS CSH UDC LCC SAO Nuovo Soggettario
  36. 36. Thema-to-thema relationships across languages: Class 025.04 (22/swe) = Class 025.04 (22)
  37. 37. Thema-to-thema relationships (Complex case): T2—43414 (22) = T2—43414 (22/ger), but . . . <ul><li>T2—43414 Giessen district (Giessen Regierungsbezirk) </li></ul><ul><li>Including *Lahn River </li></ul><ul><li>T2—43414 Regierungsbezirk Gießen </li></ul><ul><li>T2—434147 Lahn-Dill-Kreis </li></ul><ul><li>Hier auch: der Fluss *Lahn </li></ul>not equivalent to thema/class T2—43414 functionally equivalent to thema/class T2—434147
  38. 38. 6. Future work <ul><li>Specify all relationships between Relative Index terms and classes (see earlier work by Green, Mitchell) </li></ul><ul><li>Investigate DDC translations and mappings in context of model </li></ul><ul><li>Investigate modelling the Relative Index as a separate controlled vocabulary to provide a topic-centered view </li></ul><ul><li>Experiment with modelling other classification schemes </li></ul><ul><li>Investigate usefulness of classification data modelled using FRSAD </li></ul>

×