2011 Taxonomy Standards Update

Published in: Technology, Education

  1. Taxonomy Boot Camp Standards Update
     Marjorie M.K. Hlava, mhlava@accessinn.com
     Data Harmony Software, Access Innovations, Inc., www.accessinn.com
     Leveraging your content semantically
  2. What do we need a standard for? © 2010. Access Innovations, Inc. All Rights Reserved.
  3. Why Standards?
     • “Standards save money.”
     • “Standards accelerate projects.”
     • “Standards make better solutions.”
     • “Standards provide better governance for solutions.”
     • “Standards protect your investment for the future.”
     • “Standards save money for future projects.”
     • http://michaelrowell.com/wp/
  4. Standardization Makes Life Easier
  5. What does it take to make the components of the digital information model work…
  8. Standards Groups
     • ISO
     • W3C – World Wide Web Consortium
     • NISO
     • IFLA
     • US Library of Congress
     • Dublin Core
     • New things as they come up
  9. Standard Definition: Universally or widely accepted, agreed upon, or established means of determining what something should be. Major classifications of this term include:
     • Material or substance whose properties are known with a level of accuracy that is sufficient to allow its use as a physical reference in calibrating or measuring the same properties of another material or substance.
     • Concept, norm, or principle established by agreement, authority, or custom, and used generally as an example or model to compare or measure the quality or performance of a practice or procedure.
  10. Standard Definition
      • Written definition, limit, or rule approved and monitored for compliance by an authoritative agency (or professional or recognized body) as a minimum acceptable benchmark. This is the usual meaning of the plural term (standards). See also specification.
  11. Traditional Standards
      • ISO: TC 46; SC 9; ISO/CD 25964-1
      • ANSI: NISO; Z39.19
      • BSI: BS 8723
  12. More Standards Groups
      • W3C: OWL; SKOS
      • US Government: Office of Management and Budget
      • European Union
      • Individual Organization Attempts
  13. The Process
      • ISO: 137 nations, one vote each
      • ANSI: US organization
      • NISO: US information and library standards
      • TAG 46 – ISO information standards; NISO is the secretariat
  14. Impatient others
      • W3C: World Wide Web Consortium
      • Ad hoc groups: author IDs (ORCID, VIVO)
      • Industry and National Groups: Library of Congress; associations for their industry; SMPTE
      • Book Industry Study Group: ONIX
  15. Proprietary “Standards”
      • PDF – Portable Document Format: Adobe
      • CSV – Comma Separated Values: Microsoft
  16. Soliciting Input – Formal Process
      • Standard need
      • Group with a frustration – need
      • Standard proposal: NISO topic groups, CCM and others
      • Call for participation
      • Write the draft standard
      • Open for comments
      • NO must be resolved
      • Standard for vote
      • Approved for five years
  17. Soliciting Input – Informal Process
      • Standard need
      • Group with a frustration – need
      • Create a Working Group
      • Write the draft standard
      • Publish to the web
      • Allow comments
      • Resolve comments
      • When no comments remain, considered OK
      • Start implementing / Early implementers
  18. Taxonomies don’t exist in a vacuum
      • They are part of metadata
      • They are used to tag information objects
      • They are used on Web sites, in search, to profile people, and to link resources
      • So we have to know a little about those standards as well
  19. Groups of standards
      • Identifiers
      • Records management
      • Country Codes
      • Transliteration
      • Formats
      • Taxonomy related
  20. Identifiers
      • ISO/IEC CD 27037, Information technology - Security techniques - Guidelines for identification, collection, acquisition, and preservation of digital evidence
      • Information and documentation – International standard document link
      • ISO/FDIS 24619, Language resource management - Persistent identification and sustainable access (PISA)
  21. Records management
      • ISO/FDIS 16175-2, Information and documentation - Principles and functional requirements for records in electronic office environments - Part 2: Guidelines and functional requirements for digital records management systems
      • ISO/DTR 17068, Information and documentation - Records management - Third party repository for electronic records
      • ISO/DIS 13008, Information and documentation - Digital records conversion and migration process
  22. Country Codes
      • Systematic review of ISO 3166-2:2007, Codes for the representation of names of countries and their subdivisions - Part 2: Country subdivision code
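To make the country subdivision codes on slide 22 concrete: an ISO 3166-2 code is an ISO 3166-1 alpha-2 country code, a hyphen, and a subdivision part of up to three alphanumeric characters, for example `US-NM`. The Python sketch below only checks that shape. The helper name `split_subdivision_code` is hypothetical, and nothing here consults the official code list, so a well-formed but unassigned code would still pass.

```python
import re

# Shape of an ISO 3166-2 code: alpha-2 country code, hyphen,
# then one to three alphanumeric characters for the subdivision.
_CODE_SHAPE = re.compile(r"([A-Z]{2})-([A-Z0-9]{1,3})")


def split_subdivision_code(code: str) -> tuple[str, str]:
    """Split a code such as 'US-NM' into (country, subdivision).

    Hypothetical helper: it validates the shape only, not whether
    the code is actually assigned in the standard.
    """
    match = _CODE_SHAPE.fullmatch(code.strip().upper())
    if match is None:
        raise ValueError(f"not shaped like an ISO 3166-2 code: {code!r}")
    return match.group(1), match.group(2)


if __name__ == "__main__":
    print(split_subdivision_code("US-NM"))  # ('US', 'NM')
```

A taxonomy that tags content by place could store the two parts separately, so that a search on the country part also finds every subdivision under it.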
  23. Transliteration
      • Systematic review of ISO 3602:1989, Documentation – Romanization of Japanese (kana script)
      • Systematic review of ISO 11940-2:2007, Information and documentation - Transliteration of Thai characters into Latin characters, Part 2: Simplified transcription of Thai language
  24. Formats
      • ISO/DIS 14289-1, Document management applications - Electronic document file format enhancement for accessibility - Part 1: Use of ISO 32000-1 (PDF/UA-1)
  25. RFID
      • ISO/FDIS 28560, Information and documentation - RFID in libraries
      • Part 1: Data elements and general guidelines for implementation
      • Part 2: Encoding of RFID data elements based on rules from ISO/IEC 15962
      • Part 3: Fixed length encoding
  26. Internet Engineering Task Force (IETF)
      • Brought you EMAIL
      • Brought you the Internet
      • Now XSF (XMPP Standards Foundation): defines open protocols for presence, instant messaging, and real-time communication and collaboration on top of the IETF’s Extensible Messaging and Presence Protocol (XMPP)
      • Now in Google 2011
  27. W3C
      • HTML 5
      • Linked Data
      • Ontologies (OWL) and SKOS: Simple Knowledge Organization System
      • Cascading Style Sheets (CSS): adding style to Web content
      • Widgets: Widget Packaging and XML Configuration; Widget Interface; API to metadata and persistently storing data; XML Digital Signatures for Widgets
  28. Big Library Followings
      • DCMI – Dublin Core Metadata Initiative: Functional requirements
      • Library of Congress
  29. Library of Congress
      • MARC 21 formats and MARCXML
      • VRA Core -- them
      • METS (Metadata Encoding & Transmission Standard)
      • MIX (NISO Metadata for Images in XML)
      • PREMIS (Preservation Metadata)
      • TextMD (Technical Metadata for Text)
      • ALTO - Technical Metadata for Optical Character Recognition
      • Extended Date/Time Format (EDTF)
  30. Taxonomy related standards
      • Vocabularies
      • Taxonomies
      • Thesaurus
      • Knowledge Organization Systems
      • Terminologies
  31. Vocabularies
      • ISO/TC 46: SC 6 and SC 9; Information and documentation
      • ISO/TC 37: Terminology and other language and content resources
  32. Thesaurus related
      • NISO Z39.19 2010, www.niso.org
      • ISO 2788 - Monolingual (1986)
      • ISO 5964 - Multilingual (1985)
      • ISO 5127, Information and documentation: Vocabulary
      • BSI (BS 8723), the next revised ISO
      • ISO 25964 / part 1 – Controlled Vocabularies
      • ISO 25964 / part 2 – Taxonomy Interoperability
      • OWL from W3C
      • SKOS, the W3C thesaurus standard
      • Dublin Core DCMI Functional requirements
  33. Thesaurus and Indexing Standards – ANSI/NISO
      • NISO Z39.19-2005 (2010), Guidelines for the Construction, Format, and Management of Monolingual Controlled Vocabularies
      • NISO TR02-1997, Guidelines for Indexes and Related Information Retrieval Devices, by James D. Anderson
  34. ISO TC 46/SC 9 Information and Documentation - Identification and Description
      • TC 46 is ISO's Technical Committee (TC) for information and documentation standards.
      • SC 9 is the TC 46 Subcommittee (SC) that develops and maintains ISO standards on the identification and description of information resources.
  35. Thesaurus and Indexing Standards – ISO
      • ISO 2788:1986 Documentation - Guidelines for the establishment and development of monolingual thesauri
      • ISO 5964:1985 Documentation - Guidelines for the establishment and development of multilingual thesauri
      • ISO 5963:1985 Documentation - Methods for examining documents, determining their subjects, and selecting indexing terms
      • ISO 999:1996 Information and documentation - Guidelines for the content, organization and presentation of indexes
  36. New ISO Taxonomy Standard
      • ISO 25964, Thesauri and interoperability with other vocabularies
      • Part 1: Thesauri for information retrieval (approved)
      • Part 2: Interoperability with other vocabularies (released for comment)
      • Stella Dextre Clarke, principal author
  37. ISO TC 37
      • Scope of ISO TC 37: Standardization of principles, methods and applications relating to terminology and other language resources.
      • TC 37/SC 1 - Principles and methods
      • TC 37/SC 2 - Terminography and lexicography
      • TC 37/SC 3 - Computer applications for terminology
      • TC 37/SC 4 - Language resource management
  38. OntoIOp
      • Terminographical and lexicographical working methods, for a new project proposal on Ontology Integration and Interoperability (OntoIOp)
      • A reference ontology for the interchange of cultural heritage information: ICOM (International Council of Museums), CIDOC (International Committee for Documentation)
  39. ISO 1087 Terminology Work Vocabulary
      • ISO 1087-1:2000 Terminology work - Vocabulary - Part 1: Theory and application
      • ISO 1087-2:2000 Terminology work - Vocabulary - Part 2: Computer applications
  40. ISO 1087
  41. Terminology work - Vocabulary
  43. Other ISO standards: Concept-oriented terminology
      • ISO 704:2000 Terminology work - Principles and methods
      • ISO 860:1996 Terminology work - Harmonization of concepts and terms
      • ISO 10241:1992 Preparation and layout of international terminology standards
  44. ISO - Data Categories
      • ISO 12200:1999 Computer applications in terminology - Machine-readable terminology interchange format (MARTIF) - Negotiated interchange
      • ISO 12616:2002 Translation-oriented terminography
      • ISO/TR 12618:1994 Computer aids in terminology - Creation and use of terminological databases and text corpora
      • ISO 12620:1999 Computer applications in terminology - Data categories used to create glossaries
      • Others in the work codes
  45. W3C
      • OWL – Web Ontology Language
      • RDF – Resource Description Framework
      • Topic Maps
      • SKOS - Simple Knowledge Organization System
      • SKOS 2
      • DCMI
      • TURTLE
      • Which community to serve?
  47. A data wrapper
  48. RDF Triples
      • Subject
      • Object
      • Predicate
      • Linked data source standard
      • Action path of ontologies
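The subject, predicate, and object parts that slide 48 names can be sketched as plain data. The Python below is a minimal illustration, not an RDF library: the `Triple` type, the `find` helper, and the `http://example.org/taxonomy/` identifiers are hypothetical, while the SKOS namespace and the `prefLabel` and `broader` properties come from the W3C SKOS vocabulary.

```python
from typing import NamedTuple

SKOS = "http://www.w3.org/2004/02/skos/core#"  # W3C SKOS namespace
EX = "http://example.org/taxonomy/"            # hypothetical namespace


class Triple(NamedTuple):
    """One statement: a subject, a predicate, and an object."""
    subject: str
    predicate: str
    obj: str


# A tiny graph: two labelled concepts and one hierarchical link.
GRAPH = [
    Triple(EX + "thesauri", SKOS + "prefLabel", "Thesauri"),
    Triple(EX + "thesauri", SKOS + "broader", EX + "controlled-vocabularies"),
    Triple(EX + "controlled-vocabularies", SKOS + "prefLabel",
           "Controlled vocabularies"),
]


def find(graph, subject=None, predicate=None, obj=None):
    """Return the triples matching the given parts; None is a wildcard."""
    return [
        t for t in graph
        if (subject is None or t.subject == subject)
        and (predicate is None or t.predicate == predicate)
        and (obj is None or t.obj == obj)
    ]


if __name__ == "__main__":
    for triple in find(GRAPH, predicate=SKOS + "broader"):
        print(triple.subject, "has broader concept", triple.obj)
```

Because every statement has the same three-part shape, graphs from different sources can be concatenated and queried with the same pattern, which is one reason triples suit linked data.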
  52. Other Relevant ISO & W3C Standards
      • Metadata standards overview: http://www.slis.kent.edu/~mzeng/metadatabasics/completelist.htm
      • Review of SKOS / DCMI / Taxonomy Standards: http://nkos.slis.kent.edu/
  54. 54. SKOS SKOS 1 –  no synonyms,  no polyhierarchies SKOS 2 –  Added the above  Allow other fields (elements) on request OWL Crosswalk NISO Z39.19, BSI 8723, and ISO 25964 54 © 2010. Access Innovations, Inc. All Rights Reserved.
  55. Who supports SKOS - Everyone
      • Data Harmony Thesaurus Master
      • Synaptica
      • SmartLogic
      • WordMap
      • Poolparty
      • Top Quadrant
      • Protégé
      • Etc.
  56. Places to watch
      • Other W3C and ISO areas
      • Support groups: blogs; Communities of Practice
      • WSDL – Web Services Digital Library
      • DCMI
      • NKOS
      • ISKO
      • Linked Data
  63. Lists of Taxonomy Resources
      • Registry?
      • NKOS
      • KOS of KOS
      • SKOS participants – W3C
      • KOS typology – Tudhope
      • TaxoBank.org
      • Tesauro.com – Spanish - Salama
      • Kent.edu site – Marcia Zeng
      • Taxonomy Warehouse – Synaptica
      • UMLS - Unified Medical Language System - NIH
  65. What about?
      • Authority Files
      • Other pick lists
      • Roget's and other synonym rings
      • Dictionaries
      • Gazetteers
      • Glossaries
      • Etc.
  66. Standards and pragmatism
      • Use standards: they lead to a richer, more informative product; promote interoperability, allowing you to adopt or adapt other controlled vocabularies; promote predictability; allow repurposing within your organization and by other organizations
      • Follow thesaurus standards for taxonomy: incorporate authority files / final nodes as needed
      • Your taxonomy or thesaurus must meet your needs
  67. Author Systems & Disambiguation
      • Vivo
      • UniPHY AIP
      • Collexis: now part of ESP; SciVal Experts
      • Author Net: Cambridge University Press
      • ORCID
  69. The Problem – KEEPING UP
      • Many players we know and don’t know
      • Between controlled vocabulary standards: ISO 2788 and 5964; BSI 8723
      • Groups developing guidelines and standards: W3C with SKOS and OWL; governments worldwide developing and mandating taxonomies
      • Communities: increase reuse; mapping interoperability between controlled vocabularies
  70. How to be active - Standards Role
      • Voting member of NISO, CEBEMA, etc.
      • Member of the NISO Content Board
      • Crosswalks for NISO, ISO, W3C SKOS
      • Standards committees: SLA, NFAIS, ASIST
      • DCMI Functional requirements for KOS
      • Active in NKOS
      • Active in ISKO
      • Read the blogs, COP
      • ORCID Prototype (one of…)
  71. The Games
  72. Standards are our guiding light
      Thank you
      Marjorie M.K. Hlava, President, Access Innovations, Inc., Data Harmony
      mhlava@accessinn.com, 505-998-0800
      www.taxodiary.com - the taxonomy news blog
      mmkhlava = twitter; mhlava = facebook, linkedin, eacademy, plaxo
  73. Questions?? Okay Enough – go have a drink!
      Marjorie M.K. Hlava, President, Data Harmony Software, Access Innovations, Inc., mhlava@accessinn.com
