Oh, let’s call it a…• Use the native name• Name it after something familiar that it’s kind of “like”• “Like” is murky; you have to define “like” – How it looks? Shape? Color? Size? – How it tastes? – How it acts?
Earth apples, anyone?• aardappel (Dutch)• pomme de terre (French)
We know this because• We have a taxonomy (Linnean classification) that specifies degrees of relationship between living things
Distant cousins, at best apple potatoKingdom Plantae PlantaePhylum Anthophyta AnthophytaClass Eudicots EudicotsOrder Rosales SolanalesFamily Rosaceae SolanaceaeGenus Malus SolanumSpecies M. domestica S. tuberosum
So, a taxonomy is• A way of defining “like”• A way of expressing relationships between things – We might already be instinctively aware of these relationships but need to formalize them• A way of discovering relationships between things• An information model
Taxonomies are• typically organized by parent-child relationships• typically indicated by the phrase is a kind of or is a subtype of• the subtype has the same properties, behaviors, and constraints as the supertype plus one or more additional properties, behaviors, or constraints
Uhh…what?• For example: car is a kind of vehicle, so any car is also a vehicle, but not every vehicle is a car• The level “car” is more constrained than the level “vehicle”• A car has all the properties of a vehicle plus some other properties specific to a car
Taxonomies are all around us• It’s our nature to classify• Many of these taxonomies are internal, arbitrary and personal• A true taxonomy must be uniform and unambiguous
Other familiar taxonomies• Dewey Decimal System• Library of Congress System• ICD-9/10 codes• computer folder system – probably most common taxonomy in tech comm
And one I especially dig• A taxonomy of wrongness! – www.fallacyfiles.org/taxonomy.html
We have metadata…why do we needa taxonomy too?• Where did that metadata come from? – You must have had some idea of how your content should be classified – If so, then you already have the beginnings of a taxonomy, at least in your head – So take it a step further
Metadata compliments taxonomyand vice-versa• Metadata describes an individual piece of content but doesn’t capture relationships very well.• Metadata is part of content so updates can be unwieldy; better to maintain the model outside the content• A taxonomy serves as a roadmap…it both describes current content and predicts future content• A taxonomy highlights similarities (and differences) across products• Metadata can pick up where taxonomy leaves off
What else are taxonomies good for?• Controlled vocabularies – indexing – keywords – glossaries• Searching/browsing/filtering – Faceted search – Filtering for custom doc publishing• Content reuse
Purists might say…• that you need different notations to express different kinds of relationships• or that you must express the relationships uniformly
Maybe, maybe not• You need what you need to capture the relationships you need to express• No more, no less - KISS• The relationships already exist; you are just using the taxonomy to express them
Decisions to make• What kind of taxonomy: – hierarchical, polyarchical, something else?• If hierarchical, how many levels?• If polyarchical, what kinds of relationships and how designated?• Tool to use? (meh)• How to associate content with taxonomy?
Questions to ask• What will the taxonomy be used for? – indexing, search, etc.• Who are the users? – content creators, clients, SMEs, support, etc.• What content will the taxonomy cover? – topics, images, demos, videos, etc.• What are the scope and limits? – handling off-topic content—what to include/exclude• What are the resources and constraints? – skills/expertise, timing, technology, funding, stakeholder roles, etc.
More questions to ask• Who is responsible for development?• What are secondary/contributor roles?• How does taxonomy fit in with other metadata?• How to handle ongoing support and maintenance?
Some first steps• Start small—maybe just one small product• Do content audit of everything the taxonomy will categorize• Compare TOCs of existing deliverables – Find commonalities, differences• Compare indexes of existing deliverables – Discover terms already in use• Use folder structure
More first steps• Assemble starting list of categories that cover existing content based on TOC, index and content audit• Place existing content within taxonomy (on paper)• Create taxonomy task force to review and refine – Avoid too many cooks
DITA Classification and SubjectScheme• Subject scheme – Defines controlled values (“buckets”) for classifying content – Defines relationships between those buckets• Classification – Groups content into appropriate buckets
Recommended reading/viewing• The Accidental Taxonomist, Heather Hedden• Organising Knowledge: Taxonomies, Knowledge, and Organisational Effectiveness, Patrick Lambe• Joe Gelb’s presentation on subjectScheme: http://svdig.ditamap.com/videos/sv dig-2011-05-11.htm
Contact me Leigh White ElementalSource, LLCelementalsource@gmail.com 678.467.7706