Indexing & Tagging:
Taxonomies & Folksonomies
National Federation of Advanced Information Services (NFAIS)
Improving the User Search Experience
October 13, 2010, Philadelphia, Pennsylvania

Heather Hedden, Taxonomy Consultant
Heather Hedden’s Background
 Periodical database indexer for Information Access
 Company (IAC): Trade & Industry and PROMT
 Controlled vocabulary editor, IAC/Gale
 Freelance indexer and taxonomy consultant
 Taxonomy manager, Viziant and First Wind
 Continuing education instructor, Simmons GSLIS
 Author, The Accidental Taxonomist




                     © 2010 Heather Hedden            2
What’s the difference?

 Cataloging   with a CV
 Indexing     with a CV
 Keywording
 Tagging




Definitions
 Controlled Vocabulary – A controlled list of terms for
 concepts, usually with nonpreferred terms (“synonyms”).
 May or may not have structure and relationships
 between terms.
    A broader concept that includes taxonomies.
 Taxonomy – A hierarchical structure of terms, which
 may or may not include nonpreferred terms.
    Has popularly replaced “controlled vocabulary” as a
    broader concept, which may or may not be
    hierarchical.
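
As a minimal sketch of these two definitions, a controlled vocabulary can be modeled as preferred terms with nonpreferred variants pointing to them, and a taxonomy as the same terms plus broader-term links. The sample terms and the helper names `resolve` and `broader_chain` are assumptions made for illustration; they are not from the presentation.

```python
# Hypothetical toy vocabulary; all terms are illustrative only.
# Nonpreferred terms ("synonyms") each point to one preferred term.
NONPREFERRED = {
    "cars": "Automobiles",
    "autos": "Automobiles",
    "lorries": "Trucks",
}

# Taxonomy layer: each preferred term maps to its broader term (None = top level).
BROADER = {
    "Vehicles": None,
    "Automobiles": "Vehicles",
    "Trucks": "Vehicles",
}


def resolve(entry):
    """Return the preferred term for an entry, or None if it is not in the vocabulary."""
    key = entry.strip().lower()
    if key in NONPREFERRED:
        return NONPREFERRED[key]
    for term in BROADER:
        if term.lower() == key:
            return term
    return None


def broader_chain(term):
    """List the term followed by its ancestors, narrowest first."""
    chain = []
    while term is not None:
        chain.append(term)
        term = BROADER[term]
    return chain


print(resolve("autos"))                   # Automobiles
print(broader_chain(resolve("lorries")))  # ['Trucks', 'Vehicles']
```

Dropping the `BROADER` mapping leaves a flat controlled vocabulary; dropping `NONPREFERRED` leaves a taxonomy without synonyms, which matches the "may or may not" wording of both definitions.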

Indexing

 Closed indexing does not use a CV; open indexing does
 Trained, dedicated indexers
 Indexing guidelines and policies used
 Controlled vocabularies designed to support indexing:
   Indexer-focused scope notes
   Nonpreferred terms, displayed
   Extensive hierarchical & associative relationships
   Browsable, alphabetical display, type-ahead, etc.

Indexers and Index Terms

 Indexers are closest to the content
 They see new concepts and terminology
 They need methods of:
   suggesting additional nonpreferred terms and
   relationships between terms
   suggesting new terms to add to the controlled
   vocabulary
   adding terms that supplement the controlled
   vocabulary

Levels of Vocabulary Management

1.   Only controlled vocabulary used
      Indexer suggests terms for approval prior to use.
2.   Controlled vocabulary plus unapproved terms
      Indexer may use terms prior to approval.
3.   Controlled vocabulary plus keywords
      Indexer supplements with terms never reviewed.
4.   Controlled vocabulary plus shared keywords
      Indexer supplements with terms never reviewed, but
      re-usable.
Levels of Vocabulary Management

1.   Only controlled vocabulary used
       Simple to manage and implement
       Indexers email taxonomist with suggestions
       For small controlled vocabularies
       For limited scope, relatively static content
       For small indexing operations




Levels of Vocabulary Management

2.   Controlled vocabulary plus unapproved terms
       Indexer may create “candidate,” “unapproved,” or
       “override” terms for immediate use; these are also entered into
       the system for taxonomist review.
       Unapproved indexed terms could convert to nonpreferred
       terms.
       Can be restricted to only certain types of terms (usually
       named entities) or all terms
       For large indexing operations, large controlled
       vocabularies, extensive content
       More technologically complex to implement
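
The level 2 workflow can be sketched roughly as follows. This is a minimal illustration, assuming a hypothetical `Vocabulary` class and sample terms that are not part of the presentation: an unknown term is usable at once, is queued for review, and can later be approved or converted to a nonpreferred term.

```python
from dataclasses import dataclass, field


@dataclass
class Vocabulary:
    """Hypothetical model of level 2: approved terms plus candidate terms."""
    approved: set = field(default_factory=set)
    nonpreferred: dict = field(default_factory=dict)  # variant -> preferred term
    candidates: set = field(default_factory=set)      # awaiting taxonomist review

    def index_with(self, term):
        """Indexer applies a term; unknown terms are usable at once as candidates."""
        if term in self.nonpreferred:
            return self.nonpreferred[term]
        if term not in self.approved:
            self.candidates.add(term)
        return term

    def approve(self, term):
        """Taxonomist accepts a candidate as a new preferred term."""
        self.candidates.discard(term)
        self.approved.add(term)

    def convert_to_nonpreferred(self, term, preferred):
        """Taxonomist decides the candidate is a variant of an existing term."""
        if preferred not in self.approved:
            raise ValueError(f"{preferred!r} is not an approved term")
        self.candidates.discard(term)
        self.nonpreferred[term] = preferred


vocab = Vocabulary(approved={"Wind power"})
vocab.index_with("Wind energy")          # used immediately, queued for review
vocab.convert_to_nonpreferred("Wind energy", "Wind power")
print(vocab.index_with("Wind energy"))   # Wind power
```

Restricting candidates to certain term types, such as named entities, would be an extra check inside `index_with`; it is left out here to keep the sketch short.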
Levels of Vocabulary Management

3.   Controlled vocabulary plus keywords
       Indexer supplements controlled vocabulary indexing with
       any keywords entered into a separate field
       Keywords may be for new concepts, but are often for
       more specific concepts and names
       Keywords do not become part of the controlled
       vocabulary. Taxonomist may or may not look at them.
       Varies: (a) more indexing is with controlled vocabulary or
       (b) more with keywords and CV is only broad categories
       Less indexing technology, but more complex end-user
       retrieval options (3 types: taxonomy, keywords, freetext)
       Variants/synonyms are not controlled, so keywords may be redundant
Levels of Vocabulary Management

4.   Controlled vocabulary plus shared keywords
       Indexer supplements controlled vocabulary indexing with
       any keywords entered into a separate field
       Keywords do not become part of the controlled
       vocabulary, but are stored in another database.
       All keywords become immediately available for all
       indexers to use and reuse
       Indexers can browse the list of previously used
       keywords; redundancies are reduced, not eliminated
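
A shared keyword list of this kind might look like the sketch below. The `SharedKeywords` class, its matching rule (ignoring case and extra spaces), and the sample keywords are assumptions for illustration only. The example shows why redundancies are reduced but not eliminated.

```python
class SharedKeywords:
    """Hypothetical shared keyword list kept apart from the controlled vocabulary."""

    def __init__(self):
        self._by_key = {}  # case-folded key -> keyword as first entered

    def browse(self, prefix=""):
        """Return previously used keywords that start with the prefix, sorted."""
        p = prefix.casefold()
        return sorted(kw for key, kw in self._by_key.items() if key.startswith(p))

    def use(self, keyword):
        """Record a keyword, reusing an entry that differs only by case or spacing."""
        cleaned = " ".join(keyword.split())
        return self._by_key.setdefault(cleaned.casefold(), cleaned)


shared = SharedKeywords()
shared.use("Offshore turbines")
shared.use("offshore  turbines")   # reuses the existing entry
shared.use("Offshore turbine")     # singular form still slips in as a new keyword
print(shared.browse("off"))        # ['Offshore turbine', 'Offshore turbines']
```

Simple normalization catches only trivial variants; true synonyms and singular/plural forms still need an indexer who browses the list before adding a keyword.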



Folksonomy
 Shared keywords, created and shared by those “indexing”
 Indexing is not by indexers, but by end-users, consumers
 of the content.
     Common people, the “folk.”
     “Tagging”
 Users might also contribute structure, creating
 (hierarchical) relationships
 Originally defined as:
 “user-created bottom-up categorical structure development
 with an emergent thesaurus”
 --Thomas Vander Wal, July 2004
Folksonomy: Who Contributes to it?

 Indexers
 Editors
 Employees
 End-user subscribers
 Any end-users



Folksonomy: Where used?
 Public web sites of high volume content and users:
    Delicious (http://delicious.com)
    Connotea (www.connotea.org)
    Diigo (www.diigo.com)
    Flickr (www.flickr.com)
    LibraryThing (www.librarything.com)
 Large enterprises that want to foster collaboration and
 innovation
 Subscription content providers?



Social Tagging
 Also called collaborative tagging, social classification,
 social indexing
 Folksonomies plus Web 2.0/social networking features
 Social communities can be built around shared sets of
 popular content or popular tags
 Tags for popularity ratings
 Now moving into enterprises




Folksonomies/Social tagging

 Advantages
   Reflects trends, up-to-date, can monitor
   change and popularity. Dynamic.
   Cheaper and quicker than building and
   maintaining a taxonomy
   Facilitates workplace democracy and the
   distribution of management tasks
   Responsive to user needs

Folksonomies/Social tagging

 Disadvantages
   Inconsistent – precision & recall deficiencies
   Biased
   Requires critical mass of involvement to be
   useful
   Does not scale well to a large volume of
   content
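
To make the precision and recall point concrete, here is a small worked example. The documents, tags, and the `precision_recall` helper are hypothetical; the formulas are the standard definitions (precision is relevant hits over items retrieved, recall is relevant hits over all relevant items).

```python
def precision_recall(retrieved, relevant):
    """Precision = relevant hits / retrieved; recall = relevant hits / all relevant."""
    hits = len(retrieved & relevant)
    precision = hits / len(retrieved) if retrieved else 0.0
    recall = hits / len(relevant) if relevant else 0.0
    return precision, recall


# Hypothetical documents tagged inconsistently by different users.
tags = {
    "doc1": {"nyc"},
    "doc2": {"new york"},
    "doc3": {"newyork"},
    "doc4": {"nyc"},      # about something else that shares the tag
}
relevant = {"doc1", "doc2", "doc3"}   # documents truly about New York City

# A search on the tag "nyc" misses the variant tags and picks up the unrelated item.
retrieved = {doc for doc, doc_tags in tags.items() if "nyc" in doc_tags}
print(precision_recall(retrieved, relevant))  # (0.5, 0.3333333333333333)
```

Variant tags lower recall, and ambiguous tags lower precision; nonpreferred terms in a controlled vocabulary address the first problem, and distinct, disambiguated preferred terms address the second.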



Folksonomies/Social tagging
 Solutions/trends:
   Some degree of vocabulary control
   Applicable to certain areas of content, not all

 Partial vocabulary control:
   Dual taxonomy & folksonomy system
       Requires policy of taxonomy first
   Single vocabulary system with some terms
   managed/edited, and some not (yet)
       like a wiki with editors
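
One way to picture a dual system with a "taxonomy first" policy is the sketch below. The `TAXONOMY` mapping, the `apply_tag` function, and the sample tags are assumptions for illustration, not a description of any particular product.

```python
from collections import Counter

# Hypothetical managed terms and their variants; illustrative only.
TAXONOMY = {"photography": "Photography", "photos": "Photography", "travel": "Travel"}


def apply_tag(user_tag, folksonomy):
    """Taxonomy first: map the tag to a managed term if possible, else keep a free tag."""
    key = user_tag.strip().casefold()
    if key in TAXONOMY:
        return ("taxonomy", TAXONOMY[key])
    folksonomy[key] += 1  # unmanaged tag, counted so popular ones can be reviewed
    return ("folksonomy", key)


folksonomy = Counter()
for tag in ["Photos", "sunset", "Sunset", "travel", "golden hour"]:
    print(apply_tag(tag, folksonomy))

# Frequently used free tags are natural candidates for editorial review.
print(folksonomy.most_common(1))  # [('sunset', 2)]
```

The same counter could support the single-vocabulary variant: editors promote popular free tags into the managed set over time, much as wiki editors clean up contributed pages.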

Conclusions
 Different people often apply taxonomies or
 folksonomies.
 Taxonomies and folksonomies may supplement
 each other.
 Technology facilitates the application of both.
 Users understand the distinction and can use
 both.



Questions/Contact
Heather Hedden
978-467-5195
heather@hedden.net
www.hedden-information.com
www.accidental-taxonomist.com





 
When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...
 
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdfFIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
 
Accelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish CachingAccelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish Caching
 
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
 
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
 
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
 

Taxonomies and Folksonomies

  • 1. Indexing & Tagging: Taxonomies & Folksonomies
      National Federation of Advanced Information Services (NFAIS)
      Improving the User Search Experience
      October 13, 2010, Philadelphia, Pennsylvania
      Heather Hedden, Taxonomy Consultant
  • 2. Heather Hedden’s Background
      • Periodical database indexer for Information Access Company (IAC): Trade & Industry and PROMT
      • Controlled vocabulary editor, IAC/Gale
      • Freelance indexer and taxonomy consultant
      • Taxonomy manager, Viziant and First Wind
      • Continuing education instructor, Simmons GSLIS
      • Author, The Accidental Taxonomist
      © 2010 Heather Hedden
  • 3. What’s the difference?
      • Cataloging with a CV
      • Indexing with a CV
      • Keywording
      • Tagging
  • 4. Definitions
      • Controlled Vocabulary – A controlled list of terms for concepts, usually with nonpreferred terms (“synonyms”). May or may not have structure and relationships between terms.
          • A broader concept that includes taxonomies.
      • Taxonomy – A hierarchical structure of terms, which may or may not include nonpreferred terms.
          • Has popularly replaced “controlled vocabulary” as a broader concept, which may or may not be hierarchical.
  • 5. Indexing
      • Closed indexing does not use a CV; open indexing does
      • Trained, dedicated indexers
      • Indexing guidelines and policies used
      • Controlled vocabularies designed to support indexing:
          • Indexer-focused scope notes
          • Nonpreferred terms, displayed
          • Extensive hierarchical & associative relationships
          • Browsable, alphabetical display, type-ahead, etc.
  • 6. Indexers and Index Terms
      • Indexers are closest to the content
      • They see new concepts and terminology
      • They need methods of:
          • suggesting additional nonpreferred terms and relationships between terms
          • suggesting new terms to add to the controlled vocabulary
          • adding terms that supplement the controlled vocabulary
  • 7. Levels of Vocabulary Management
      1. Only controlled vocabulary used: indexer suggests terms for approval prior to use.
      2. Controlled vocabulary plus unapproved terms: indexer may use terms prior to approval.
      3. Controlled vocabulary plus keywords: indexer supplements with terms never reviewed.
      4. Controlled vocabulary plus shared keywords: indexer supplements with terms never reviewed, but re-usable.
  • 8. Levels of Vocabulary Management: 1. Only controlled vocabulary used
      • Simple to manage and implement
      • Indexers email taxonomist with suggestions
      • For small controlled vocabularies
      • For limited scope, relatively static content
      • For small indexing operations
  • 9. Levels of Vocabulary Management: 2. Controlled vocabulary plus unapproved terms
      • Indexer may create “candidate,” “unapproved,” or “override” terms for immediate use, which are also entered into the system for taxonomist review
      • Unapproved indexed terms could convert to nonpreferred terms.
      • Can be restricted to only certain types of terms (usually named entities) or allowed for all terms
      • For large indexing operations, large controlled vocabularies, extensive content
      • More technologically complex to implement
  • 10. Levels of Vocabulary Management: 3. Controlled vocabulary plus keywords
      • Indexer supplements controlled vocabulary indexing with any keywords entered into a separate field
      • Keywords may be for new concepts, but are often for more specific concepts and names
      • Keywords do not become part of the controlled vocabulary. Taxonomist may or may not look at them.
      • Varies: (a) more indexing is with the controlled vocabulary, or (b) more is with keywords and the CV covers only broad categories
      • Less indexing technology, but more complex end-user retrieval options (3 types: taxonomy, keywords, freetext)
      • Variants/synonyms are not controlled, so keywords can be redundant
  • 11. Levels of Vocabulary Management: 4. Controlled vocabulary and shared keywords
      • Indexer supplements controlled vocabulary indexing with any keywords entered into a separate field
      • Keywords do not become part of the controlled vocabulary, but are stored in another database.
      • All keywords become immediately available for all indexers to use and reuse
      • Indexers can browse the list of previously used keywords; redundancies are reduced, not eliminated
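As a rough illustration of levels 2 and 4 from slides 9 and 11, the Python sketch below models a controlled vocabulary with nonpreferred terms, a queue of candidate terms awaiting taxonomist review, and a separate pool of shared keywords. The class name `Vocabulary`, its methods, and the sample terms are all hypothetical; they are not taken from the presentation or from any particular indexing system.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class Vocabulary:
    """Toy controlled vocabulary (hypothetical design, for illustration only)."""

    # Preferred term -> its nonpreferred terms ("synonyms").
    preferred: dict[str, set[str]] = field(default_factory=dict)
    # Level 2: terms already used in indexing but awaiting taxonomist review.
    candidates: set[str] = field(default_factory=set)
    # Level 4: uncontrolled keywords that every indexer can browse and reuse.
    shared_keywords: set[str] = field(default_factory=set)

    def resolve(self, term: str) -> Optional[str]:
        """Map a preferred or nonpreferred entry to its preferred term."""
        key = term.lower()
        for pref, variants in self.preferred.items():
            if key == pref.lower() or key in {v.lower() for v in variants}:
                return pref
        return None

    def index_with_candidate(self, term: str) -> str:
        """Level 2: use the controlled term if present, else queue a candidate."""
        resolved = self.resolve(term)
        if resolved is not None:
            return resolved
        self.candidates.add(term)
        return term

    def merge_as_nonpreferred(self, term: str, preferred: str) -> None:
        """Taxonomist decision: convert a candidate into a nonpreferred term.

        Assumes `preferred` already exists in the vocabulary.
        """
        self.candidates.discard(term)
        self.preferred[preferred].add(term)

    def add_keyword(self, keyword: str) -> str:
        """Level 4: store a keyword outside the vocabulary, reusing any
        existing spelling that differs only by letter case."""
        for existing in self.shared_keywords:
            if existing.lower() == keyword.lower():
                return existing
        self.shared_keywords.add(keyword)
        return keyword


vocab = Vocabulary(preferred={"Automobiles": {"cars", "autos"}})
print(vocab.index_with_candidate("Cars"))        # resolves via a nonpreferred term
print(vocab.index_with_candidate("Motor cars"))  # unknown, so queued as a candidate
vocab.merge_as_nonpreferred("Motor cars", "Automobiles")
print(vocab.add_keyword("Roadster"), vocab.add_keyword("roadster"))
```

The case-insensitive match in `add_keyword` only catches the simplest kind of variant, which mirrors the slide's point that shared keywords reduce redundancy without eliminating it.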
  • 12. Folksonomy
      • Shared keywords, created and shared by those “indexing”
      • Indexing is not by indexers, but by end-users, consumers of the content. Common people, the “folk.”
      • “Tagging”
      • Users might also contribute structure, creating (hierarchical) relationships
      • Originally defined as: “user-created bottom-up categorical structure development with an emergent thesaurus” --Thomas Vander Wal, July 2004
  • 13. Folksonomy: Who Contributes to it?
      • Indexers
      • Editors
      • Employees
      • End-user subscribers
      • Any end-users
  • 14. Folksonomy: Where used?
      • Public web sites of high volume content and users:
          • Delicious (http://delicious.com)
          • Connotea (www.connotea.org)
          • Diigo (www.diigo.com)
          • Flickr (www.flickr.com)
          • LibraryThing (www.librarything.com)
      • Large enterprises that want to foster collaboration and innovation
      • Subscription content providers?
  • 15. Social Tagging
      • Also called collaborative tagging, social classification, social indexing
      • Folksonomies plus Web 2.0/social networking features
      • Social communities can be built around shared sets of popular content or popular tags
      • Tags for popularity ratings
      • Now moving into enterprises
  • 16. Folksonomies/Social tagging: Advantages
      • Reflects trends, up-to-date, can monitor change and popularity. Dynamic.
      • Cheaper and quicker than building and maintaining a taxonomy
      • Facilitates workplace democracy and the distribution of management tasks
      • Responsive to user needs
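The point on slide 16 that tags can be used to monitor change and popularity can be sketched with simple counting. The example below is only an illustration: the `events` data, the month labels, and the function names are invented for this sketch and do not come from any real tagging service.

```python
from collections import Counter

# Hypothetical tagging events: (month, tag) pairs as end users might apply them.
events = [
    ("2010-08", "taxonomy"),
    ("2010-08", "search"),
    ("2010-09", "taxonomy"),
    ("2010-09", "folksonomy"),
    ("2010-09", "folksonomy"),
    ("2010-10", "folksonomy"),
    ("2010-10", "folksonomy"),
    ("2010-10", "search"),
]


def tag_counts_by_month(events):
    """Group tag usage by month so shifts in popularity become visible."""
    counts = {}
    for month, tag in events:
        counts.setdefault(month, Counter())[tag] += 1
    return counts


def top_tags(events, n=1):
    """Return the n most frequently applied tags overall."""
    return Counter(tag for _, tag in events).most_common(n)


for month, counter in sorted(tag_counts_by_month(events).items()):
    print(month, dict(counter))
print(top_tags(events))
```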
  • 17. Folksonomies/Social tagging: Disadvantages
      • Inconsistent – precision & recall deficiencies
      • Biased
      • Requires critical mass of involvement to be useful
      • Does not scale well to a large volume of content
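The precision and recall deficiencies named on slide 17 follow from the standard definitions: precision is the share of retrieved items that are relevant, and recall is the share of relevant items that are retrieved. The sketch below applies those definitions to a small, made-up set of tagged documents; the document IDs, tags, and relevance judgments are assumptions chosen only to show how uncontrolled variants hurt recall and ambiguous tags hurt precision.

```python
def precision_recall(retrieved: set[str], relevant: set[str]) -> tuple[float, float]:
    """Standard set-based precision and recall."""
    hits = retrieved & relevant
    precision = len(hits) / len(retrieved) if retrieved else 0.0
    recall = len(hits) / len(relevant) if relevant else 0.0
    return precision, recall


# Hypothetical user-applied tags, with no vocabulary control.
tags = {
    "doc1": {"nyc"},
    "doc2": {"new york"},
    "doc3": {"newyork", "travel"},
    "doc4": {"apple", "recipes"},
    "doc5": {"apple", "iphone"},
}


def search(tag: str) -> set[str]:
    """Exact-match tag search, as in a simple folksonomy."""
    return {doc for doc, doc_tags in tags.items() if tag in doc_tags}


# Variant spellings split the New York documents across three tags,
# so a search on one variant misses the others (low recall).
print(precision_recall(search("nyc"), {"doc1", "doc2", "doc3"}))

# The ambiguous tag "apple" mixes fruit and company,
# so a search for the company retrieves irrelevant items (low precision).
print(precision_recall(search("apple"), {"doc5"}))
```

In a controlled vocabulary, the three New York variants would be nonpreferred terms pointing to one preferred term, and the two senses of "apple" would be separate terms.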
  • 18. Folksonomies/Social tagging: Solutions/trends
      • Some degree of vocabulary control
      • Applicable to certain areas of content, not all
      • Partial vocabulary control:
          • Dual taxonomy & folksonomy system (requires policy of taxonomy first)
          • Single vocabulary system with some terms managed/edited, and some not (yet), like a wiki with editors
  • 19. Conclusions
      • Different people often apply taxonomies or folksonomies.
      • Taxonomies and folksonomies may supplement each other.
      • Technology facilitates the application of both.
      • Users understand the distinction and can use both.