SlideShare a Scribd company logo
1 of 40
Taxonomy:
Do I need one?


                     Leigh White
            ElementalSource, LLC
Yes
What I’ll talk about

•   What happens without a taxonomy
•   What a taxonomy is and does
•   Why a taxonomy is important
•   A few first development steps
What I won’t talk about

• All the different kinds of taxonomies
• Details about development
• Tools for development
  – except DITA subjectScheme (briefly!)
A little history
What the he** IS that???
Oh, let’s call it a…

• Use the native name
• Name it after something familiar
  that it’s kind of “like”
• “Like” is murky; you have to define
  “like”
  – How it looks? Shape? Color? Size?
  – How it tastes?
  – How it acts?
Earth apples, anyone?

• aardappel (Dutch)
• pomme de terre (French)
*not apples
We know this because

• We have a taxonomy (Linnean
  classification) that specifies degrees
  of relationship between living things
Distant cousins, at best

          apple          potato

Kingdom   Plantae        Plantae

Phylum    Anthophyta     Anthophyta

Class     Eudicots       Eudicots

Order     Rosales        Solanales

Family    Rosaceae       Solanaceae

Genus     Malus          Solanum

Species   M. domestica   S. tuberosum
So, a taxonomy is

• A way of defining “like”
• A way of expressing relationships
  between things
  – We might already be instinctively
    aware of these relationships but need
    to formalize them
• A way of discovering relationships
  between things
• An information model
Taxonomies are

• typically organized by parent-child
  relationships
• typically indicated by the phrase 'is
  a kind of' or 'is a subtype of'
• the subtype has the same
  properties, behaviors, and
  constraints as the supertype plus
  one or more additional properties,
  behaviors, or constraints
Uhh…what?

• For example: car is a kind of
  vehicle, so any car is also a vehicle,
  but not every vehicle is a car
• The level “car” is more constrained
  than the level “vehicle”
• A car has all the properties of a
  vehicle plus some other properties
  specific to a car
Taxonomies are all around us

• It’s our nature to classify
• Many of these taxonomies are
  internal, arbitrary and personal
• A true taxonomy must be uniform
  and unambiguous
Other familiar taxonomies

• Dewey Decimal System
• Library of Congress System
• ICD-9/10 codes
• computer folder system
  – probably most common
    taxonomy in tech comm
And one I especially dig

• A taxonomy of wrongness!
  – www.fallacyfiles.org/taxonomy.html
We have metadata…why do we need
a taxonomy too?

• Where did that metadata come
  from?
  – You must have had some idea of how
    your content should be classified
  – If so, then you already have the
    beginnings of a taxonomy, at least in
    your head
  – So take it a step further
Metadata compliments taxonomy
and vice-versa
• Metadata describes an individual piece of
  content but doesn’t capture relationships
  very well.
• Metadata is part of content so updates
  can be unwieldy; better to maintain the
  model outside the content
• A taxonomy serves as a roadmap…it both
  describes current content and predicts
  future content
• A taxonomy highlights similarities (and
  differences) across products
• Metadata can pick up where taxonomy
  leaves off
What else are taxonomies good for?

• Controlled vocabularies
  – indexing
  – keywords
  – glossaries

• Searching/browsing/filtering
  – Faceted search
  – Filtering for custom doc publishing

• Content reuse
Amazon.com
So far…

• we’ve looked at hierarchical
  taxonomies
When hierarchy isn’t enough

 A Cockapoo is a kind of dog. It’s the
  product of a poodle and a Cocker
  Spaniel. A hierarchy cannot capture
  all these relationships.
There’s an alternative (polyarchical)
Purists might say…

• that you need different notations to
  express different kinds of
  relationships
• or that you must express the
  relationships uniformly
Maybe, maybe not

• You need what you need to capture
  the relationships you need to
  express
• No more, no less - KISS
• The relationships already exist; you
  are just using the taxonomy to
  express them
Decisions to make

• What kind of taxonomy:
  – hierarchical, polyarchical, something
    else?
• If hierarchical, how many levels?
• If polyarchical, what kinds of
  relationships and how designated?
• Tool to use? (meh)
• How to associate content with
  taxonomy?
Questions to ask
• What will the taxonomy be used for?
  – indexing, search, etc.
• Who are the users?
  – content creators, clients, SMEs, support, etc.
• What content will the taxonomy cover?
  – topics, images, demos, videos, etc.
• What are the scope and limits?
  – handling off-topic content—what to
    include/exclude
• What are the resources and constraints?
  – skills/expertise, timing, technology, funding,
    stakeholder roles, etc.
More questions to ask

• Who is responsible for development?
• What are secondary/contributor
  roles?
• How does taxonomy fit in with other
  metadata?
• How to handle ongoing support and
  maintenance?
Some first steps
• Start small—maybe just one small product
• Do content audit of everything the
  taxonomy will categorize
• Compare TOCs of existing deliverables
  – Find commonalities, differences
• Compare indexes of existing deliverables
  – Discover terms already in use
• Use folder structure
More first steps
• Assemble starting list of categories
  that cover existing content based on
  TOC, index and content audit
• Place existing content within
  taxonomy (on paper)
• Create taxonomy task force to
  review and refine
  – Avoid too many cooks
DITA Classification and Subject
Scheme
• Subject scheme
  – Defines controlled values (“buckets”)
    for classifying content
  – Defines relationships between those
    buckets
• Classification
  – Groups content into appropriate
    buckets
Subject classification scheme
subjectScheme map
<subjectScheme>
   <hasInstance>
      <subjectdef keys="product">
         <subjectdef keys="Widget"/>
            <subjectdef keys="module">
               <subjectdef keys="Meds"/>
               <subjectdef keys="AdminW"/>
            </subjectdef>
         </subjectdef>
         <subjectdef keys="Gadget"/>
            <subjectdef keys="module">
               <subjectdef keys="AdminG"/>
               <subjectdef keys="Labs"/>
            </subjectdef>
         </subjectdef>
      </subjectdef>
   </hasInstance>
</subjectScheme>
Associate topics with subjects
<map>
   <topicref href="t_configure_med.xml">
      <topicsubject>
         <subjectref keys="Meds"/>
         <subjectref keys="AdminW"/>
         <subjectref keys="AdminG"/>
      </topicsubject>
   </topicref>
</map>
Recommended reading/viewing

• The Accidental Taxonomist, Heather
  Hedden
• Organising Knowledge: Taxonomies,
  Knowledge, and Organisational
  Effectiveness, Patrick Lambe
• Joe Gelb’s presentation on
  subjectScheme:
  http://svdig.ditamap.com/videos/sv
  dig-2011-05-11.htm
Contact me



               Leigh White
      ElementalSource, LLC

elementalsource@gmail.com
              678.467.7706

More Related Content

What's hot

Introduction to Dublin Core Metadata
Introduction to Dublin Core MetadataIntroduction to Dublin Core Metadata
Introduction to Dublin Core Metadata
Hannes Ebner
 

What's hot (20)

Taxonomies for Users
Taxonomies for UsersTaxonomies for Users
Taxonomies for Users
 
DITA Interoperability
DITA InteroperabilityDITA Interoperability
DITA Interoperability
 
Contours of DITA 2.0
Contours of DITA 2.0Contours of DITA 2.0
Contours of DITA 2.0
 
Successful Content Management Through Taxonomy And Metadata Design
Successful Content Management Through Taxonomy And Metadata DesignSuccessful Content Management Through Taxonomy And Metadata Design
Successful Content Management Through Taxonomy And Metadata Design
 
Taxonomies and Metadata
Taxonomies and MetadataTaxonomies and Metadata
Taxonomies and Metadata
 
Building and using ontologies
Building and using ontologies Building and using ontologies
Building and using ontologies
 
ESWC 2017 Tutorial Knowledge Graphs
ESWC 2017 Tutorial Knowledge GraphsESWC 2017 Tutorial Knowledge Graphs
ESWC 2017 Tutorial Knowledge Graphs
 
Essential Metadata Strategies
Essential Metadata StrategiesEssential Metadata Strategies
Essential Metadata Strategies
 
Selecting Software for Taxonomy, Thesaurus and Ontology Management
Selecting Software for Taxonomy, Thesaurus and Ontology ManagementSelecting Software for Taxonomy, Thesaurus and Ontology Management
Selecting Software for Taxonomy, Thesaurus and Ontology Management
 
Dublin Core Intro
Dublin Core IntroDublin Core Intro
Dublin Core Intro
 
Introducation to metadata
Introducation to metadataIntroducation to metadata
Introducation to metadata
 
Organizing Knowledge: A Knowledge Manager’s Primer to Taxonomy Development
Organizing Knowledge: A Knowledge Manager’s Primer to Taxonomy DevelopmentOrganizing Knowledge: A Knowledge Manager’s Primer to Taxonomy Development
Organizing Knowledge: A Knowledge Manager’s Primer to Taxonomy Development
 
Hybrid Approaches to Taxonomy & Folksonmy
Hybrid Approaches to Taxonomy & FolksonmyHybrid Approaches to Taxonomy & Folksonmy
Hybrid Approaches to Taxonomy & Folksonmy
 
SharePoint Taxonomy Introduction
SharePoint Taxonomy IntroductionSharePoint Taxonomy Introduction
SharePoint Taxonomy Introduction
 
Enterprise Data Management
Enterprise Data ManagementEnterprise Data Management
Enterprise Data Management
 
Taxonomy 101
Taxonomy 101Taxonomy 101
Taxonomy 101
 
Introduction to Dublin Core Metadata
Introduction to Dublin Core MetadataIntroduction to Dublin Core Metadata
Introduction to Dublin Core Metadata
 
Chapter 3: Data Governance
Chapter 3: Data Governance Chapter 3: Data Governance
Chapter 3: Data Governance
 
Collibra Data Citizen '19 - Bridging Data Privacy with Data Governance
Collibra Data Citizen '19 - Bridging Data Privacy with Data Governance Collibra Data Citizen '19 - Bridging Data Privacy with Data Governance
Collibra Data Citizen '19 - Bridging Data Privacy with Data Governance
 
Metadata
MetadataMetadata
Metadata
 

Viewers also liked

Using DITA's Subject Scheme Support for Educational Assessment Content
Using DITA's Subject Scheme Support for Educational Assessment ContentUsing DITA's Subject Scheme Support for Educational Assessment Content
Using DITA's Subject Scheme Support for Educational Assessment Content
Edwina Lui
 
Understanding Information Architecture
Understanding Information ArchitectureUnderstanding Information Architecture
Understanding Information Architecture
Scott Abel
 
Multiplying the Power of Taxonomy with Granular, Structured Content
Multiplying the Power of Taxonomy with Granular, Structured ContentMultiplying the Power of Taxonomy with Granular, Structured Content
Multiplying the Power of Taxonomy with Granular, Structured Content
Joe Pairman
 

Viewers also liked (19)

Taxonomy
TaxonomyTaxonomy
Taxonomy
 
Using DITA's Subject Scheme Support for Educational Assessment Content
Using DITA's Subject Scheme Support for Educational Assessment ContentUsing DITA's Subject Scheme Support for Educational Assessment Content
Using DITA's Subject Scheme Support for Educational Assessment Content
 
Localization and DITA: What you Need to Know - LocWorld32
Localization and DITA: What you Need to Know - LocWorld32Localization and DITA: What you Need to Know - LocWorld32
Localization and DITA: What you Need to Know - LocWorld32
 
Pat Farrell, Migrating Legacy Documentation to XML and DITA
Pat Farrell, Migrating Legacy Documentation to XML and DITAPat Farrell, Migrating Legacy Documentation to XML and DITA
Pat Farrell, Migrating Legacy Documentation to XML and DITA
 
Surviving the Transition to DITA: Trusted Partners can Ease the Pain
Surviving the Transition to DITA: Trusted Partners can Ease the PainSurviving the Transition to DITA: Trusted Partners can Ease the Pain
Surviving the Transition to DITA: Trusted Partners can Ease the Pain
 
Converting Unstructured Docs to XML/DITA/ePub
Converting Unstructured Docs to XML/DITA/ePubConverting Unstructured Docs to XML/DITA/ePub
Converting Unstructured Docs to XML/DITA/ePub
 
Metadata: Queen to King Content?
Metadata: Queen to King Content?Metadata: Queen to King Content?
Metadata: Queen to King Content?
 
Joe Gelb: Taxonomy and Delivery
Joe Gelb: Taxonomy and DeliveryJoe Gelb: Taxonomy and Delivery
Joe Gelb: Taxonomy and Delivery
 
The Elusive Promise of Reuse
The Elusive Promise of ReuseThe Elusive Promise of Reuse
The Elusive Promise of Reuse
 
Easy steps to convert your content to structured (frame maker and xml)
Easy steps to convert your content to structured (frame maker and xml)Easy steps to convert your content to structured (frame maker and xml)
Easy steps to convert your content to structured (frame maker and xml)
 
How to Optimize Your Metadata and Taxonomy
How to Optimize Your Metadata and TaxonomyHow to Optimize Your Metadata and Taxonomy
How to Optimize Your Metadata and Taxonomy
 
Reports and DITA Metrics IXIASOFT User Conference 2016
Reports and DITA Metrics IXIASOFT User Conference 2016Reports and DITA Metrics IXIASOFT User Conference 2016
Reports and DITA Metrics IXIASOFT User Conference 2016
 
Optimizing Content Reuse with DITA
Optimizing Content Reuse with DITAOptimizing Content Reuse with DITA
Optimizing Content Reuse with DITA
 
Developing training websites in multiple languages with (mostly) open-source ...
Developing training websites in multiple languages with (mostly) open-source ...Developing training websites in multiple languages with (mostly) open-source ...
Developing training websites in multiple languages with (mostly) open-source ...
 
Blurring the Lines between ECM and CCMS
Blurring the Lines between ECM and CCMSBlurring the Lines between ECM and CCMS
Blurring the Lines between ECM and CCMS
 
Understanding Information Architecture
Understanding Information ArchitectureUnderstanding Information Architecture
Understanding Information Architecture
 
Increasing Findability with Subject Schemes (Advanced DITA Webinar)
Increasing Findability with Subject Schemes (Advanced DITA Webinar)Increasing Findability with Subject Schemes (Advanced DITA Webinar)
Increasing Findability with Subject Schemes (Advanced DITA Webinar)
 
Multiplying the Power of Taxonomy with Granular, Structured Content
Multiplying the Power of Taxonomy with Granular, Structured ContentMultiplying the Power of Taxonomy with Granular, Structured Content
Multiplying the Power of Taxonomy with Granular, Structured Content
 
Wireframing, Mockups, and Prototyping Made Easy
Wireframing, Mockups, and Prototyping Made EasyWireframing, Mockups, and Prototyping Made Easy
Wireframing, Mockups, and Prototyping Made Easy
 

Similar to Taxonomy: Do I Need One

Realizing the Full Potential of Taxonomies by Branka Kosovac
Realizing the Full Potential of Taxonomies by Branka KosovacRealizing the Full Potential of Taxonomies by Branka Kosovac
Realizing the Full Potential of Taxonomies by Branka Kosovac
Content Strategy Workshops
 
The Elusive Promise of Reuse
The Elusive Promise of ReuseThe Elusive Promise of Reuse
The Elusive Promise of Reuse
IXIASOFT
 
Designing an effective information architecture (
Designing an effective information architecture (Designing an effective information architecture (
Designing an effective information architecture (
Vickey Bird
 
Library Research for Human Rights Guide
Library Research for Human Rights GuideLibrary Research for Human Rights Guide
Library Research for Human Rights Guide
Annelise Sklar
 

Similar to Taxonomy: Do I Need One (20)

Realizing the Full Potential of Taxonomies by Branka Kosovac
Realizing the Full Potential of Taxonomies by Branka KosovacRealizing the Full Potential of Taxonomies by Branka Kosovac
Realizing the Full Potential of Taxonomies by Branka Kosovac
 
Metadata
MetadataMetadata
Metadata
 
Folksonomies & social tagging
Folksonomies & social taggingFolksonomies & social tagging
Folksonomies & social tagging
 
Taxonomy 101: What do rockets and arugula have in common?
Taxonomy 101: What do rockets and arugula have in common?Taxonomy 101: What do rockets and arugula have in common?
Taxonomy 101: What do rockets and arugula have in common?
 
What Is Taxonomy and Why Is It Useful?
What Is Taxonomy and Why Is It Useful?What Is Taxonomy and Why Is It Useful?
What Is Taxonomy and Why Is It Useful?
 
Some thoughts on social tagging
Some thoughts on social taggingSome thoughts on social tagging
Some thoughts on social tagging
 
[AIIM17] Data Categorization You Can Live With - Monica Crocker
[AIIM17]  Data Categorization You Can Live With - Monica Crocker [AIIM17]  Data Categorization You Can Live With - Monica Crocker
[AIIM17] Data Categorization You Can Live With - Monica Crocker
 
System Concepts for Object Modelling.pptx
System Concepts for Object Modelling.pptxSystem Concepts for Object Modelling.pptx
System Concepts for Object Modelling.pptx
 
The Elusive Promise of Reuse
The Elusive Promise of ReuseThe Elusive Promise of Reuse
The Elusive Promise of Reuse
 
How To Go About Researching
How To Go About ResearchingHow To Go About Researching
How To Go About Researching
 
INFORMATION SKILLS: NAVIGATING RESEARCH IN LIBRARY
INFORMATION SKILLS: NAVIGATING RESEARCH IN LIBRARYINFORMATION SKILLS: NAVIGATING RESEARCH IN LIBRARY
INFORMATION SKILLS: NAVIGATING RESEARCH IN LIBRARY
 
Card Sorting Your Way to Meaningful Metadata
Card Sorting Your Way to Meaningful MetadataCard Sorting Your Way to Meaningful Metadata
Card Sorting Your Way to Meaningful Metadata
 
Designing an effective information architecture (
Designing an effective information architecture (Designing an effective information architecture (
Designing an effective information architecture (
 
Library Research for Human Rights Guide
Library Research for Human Rights GuideLibrary Research for Human Rights Guide
Library Research for Human Rights Guide
 
Analysing Qualitative Data
Analysing Qualitative DataAnalysing Qualitative Data
Analysing Qualitative Data
 
Taxonomy Fundamentals Workshop
Taxonomy Fundamentals WorkshopTaxonomy Fundamentals Workshop
Taxonomy Fundamentals Workshop
 
Michael Bolton - Heuristics: Solving Problems Rapidly
Michael Bolton - Heuristics: Solving Problems RapidlyMichael Bolton - Heuristics: Solving Problems Rapidly
Michael Bolton - Heuristics: Solving Problems Rapidly
 
Library research for Environmental Studies at UCSD
Library research for Environmental Studies at UCSDLibrary research for Environmental Studies at UCSD
Library research for Environmental Studies at UCSD
 
xAPI Vocabulary Stone Soup: LAK 2016 JISC Learning Analytics Hackathon
xAPI Vocabulary Stone Soup: LAK 2016 JISC Learning Analytics HackathonxAPI Vocabulary Stone Soup: LAK 2016 JISC Learning Analytics Hackathon
xAPI Vocabulary Stone Soup: LAK 2016 JISC Learning Analytics Hackathon
 
Thematic content analysis in psychology
Thematic content analysis in psychologyThematic content analysis in psychology
Thematic content analysis in psychology
 

Recently uploaded

Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
Earley Information Science
 

Recently uploaded (20)

Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 

Taxonomy: Do I Need One

  • 1. Taxonomy: Do I need one? Leigh White ElementalSource, LLC
  • 2. Yes
  • 3. What I’ll talk about • What happens without a taxonomy • What a taxonomy is and does • Why a taxonomy is important • A few first development steps
  • 4. What I won’t talk about • All the different kinds of taxonomies • Details about development • Tools for development – except DITA subjectScheme (briefly!)
  • 6. What the he** IS that???
  • 7. Oh, let’s call it a… • Use the native name • Name it after something familiar that it’s kind of “like” • “Like” is murky; you have to define “like” – How it looks? Shape? Color? Size? – How it tastes? – How it acts?
  • 8. Earth apples, anyone? • aardappel (Dutch) • pomme de terre (French)
  • 10. We know this because • We have a taxonomy (Linnean classification) that specifies degrees of relationship between living things
  • 11. Distant cousins, at best apple potato Kingdom Plantae Plantae Phylum Anthophyta Anthophyta Class Eudicots Eudicots Order Rosales Solanales Family Rosaceae Solanaceae Genus Malus Solanum Species M. domestica S. tuberosum
  • 12. So, a taxonomy is • A way of defining “like” • A way of expressing relationships between things – We might already be instinctively aware of these relationships but need to formalize them • A way of discovering relationships between things • An information model
  • 13. Taxonomies are • typically organized by parent-child relationships • typically indicated by the phrase 'is a kind of' or 'is a subtype of' • the subtype has the same properties, behaviors, and constraints as the supertype plus one or more additional properties, behaviors, or constraints
  • 14. Uhh…what? • For example: car is a kind of vehicle, so any car is also a vehicle, but not every vehicle is a car • The level “car” is more constrained than the level “vehicle” • A car has all the properties of a vehicle plus some other properties specific to a car
  • 15. Taxonomies are all around us • It’s our nature to classify • Many of these taxonomies are internal, arbitrary and personal • A true taxonomy must be uniform and unambiguous
  • 16. Other familiar taxonomies • Dewey Decimal System • Library of Congress System • ICD-9/10 codes • computer folder system – probably most common taxonomy in tech comm
  • 17. And one I especially dig • A taxonomy of wrongness! – www.fallacyfiles.org/taxonomy.html
  • 18. We have metadata…why do we need a taxonomy too? • Where did that metadata come from? – You must have had some idea of how your content should be classified – If so, then you already have the beginnings of a taxonomy, at least in your head – So take it a step further
  • 19. Metadata compliments taxonomy and vice-versa • Metadata describes an individual piece of content but doesn’t capture relationships very well. • Metadata is part of content so updates can be unwieldy; better to maintain the model outside the content • A taxonomy serves as a roadmap…it both describes current content and predicts future content • A taxonomy highlights similarities (and differences) across products • Metadata can pick up where taxonomy leaves off
  • 20. What else are taxonomies good for? • Controlled vocabularies – indexing – keywords – glossaries • Searching/browsing/filtering – Faceted search – Filtering for custom doc publishing • Content reuse
  • 22.
  • 23.
  • 24.
  • 25. So far… • we’ve looked at hierarchical taxonomies
  • 26. When hierarchy isn’t enough  A Cockapoo is a kind of dog. It’s the product of a poodle and a Cocker Spaniel. A hierarchy cannot capture all these relationships.
  • 27. There’s an alternative (polyarchical)
  • 28. Purists might say… • that you need different notations to express different kinds of relationships • or that you must express the relationships uniformly
  • 29. Maybe, maybe not • You need what you need to capture the relationships you need to express • No more, no less - KISS • The relationships already exist; you are just using the taxonomy to express them
  • 30. Decisions to make • What kind of taxonomy: – hierarchical, polyarchical, something else? • If hierarchical, how many levels? • If polyarchical, what kinds of relationships and how designated? • Tool to use? (meh) • How to associate content with taxonomy?
  • 31. Questions to ask • What will the taxonomy be used for? – indexing, search, etc. • Who are the users? – content creators, clients, SMEs, support, etc. • What content will the taxonomy cover? – topics, images, demos, videos, etc. • What are the scope and limits? – handling off-topic content—what to include/exclude • What are the resources and constraints? – skills/expertise, timing, technology, funding, stakeholder roles, etc.
  • 32. More questions to ask • Who is responsible for development? • What are secondary/contributor roles? • How does taxonomy fit in with other metadata? • How to handle ongoing support and maintenance?
  • 33. Some first steps • Start small—maybe just one small product • Do content audit of everything the taxonomy will categorize • Compare TOCs of existing deliverables – Find commonalities, differences • Compare indexes of existing deliverables – Discover terms already in use • Use folder structure
  • 34. More first steps • Assemble starting list of categories that cover existing content based on TOC, index and content audit • Place existing content within taxonomy (on paper) • Create taxonomy task force to review and refine – Avoid too many cooks
  • 35. DITA Classification and Subject Scheme • Subject scheme – Defines controlled values (“buckets”) for classifying content – Defines relationships between those buckets • Classification – Groups content into appropriate buckets
  • 37. subjectScheme map <subjectScheme> <hasInstance> <subjectdef keys="product"> <subjectdef keys="Widget"/> <subjectdef keys="module"> <subjectdef keys="Meds"/> <subjectdef keys="AdminW"/> </subjectdef> </subjectdef> <subjectdef keys="Gadget"/> <subjectdef keys="module"> <subjectdef keys="AdminG"/> <subjectdef keys="Labs"/> </subjectdef> </subjectdef> </subjectdef> </hasInstance> </subjectScheme>
  • 38. Associate topics with subjects <map> <topicref href="t_configure_med.xml"> <topicsubject> <subjectref keys="Meds"/> <subjectref keys="AdminW"/> <subjectref keys="AdminG"/> </topicsubject> </topicref> </map>
  • 39. Recommended reading/viewing • The Accidental Taxonomist, Heather Hedden • Organising Knowledge: Taxonomies, Knowledge, and Organisational Effectiveness, Patrick Lambe • Joe Gelb’s presentation on subjectScheme: http://svdig.ditamap.com/videos/sv dig-2011-05-11.htm
  • 40. Contact me Leigh White ElementalSource, LLC elementalsource@gmail.com 678.467.7706