Machine Processing of Taxonomy

John looks beyond taxonomy as classification and discusses ways of giving systems more data about the information they are processing.

Published in: Technology, Economy & Finance

Machine Processing of Taxonomy

  1. Machine Processing of Taxonomies
       John Ferrara
  2. Introduction
       • Taxonomy as metadata (not necessarily as navigation)
       • Machine processing of information
       • All about the meanings of word forms
       • Smart systems
           • Today, that usually means search
           • Tomorrow, it’ll mean intelligent agents
  3. A Riddle
       • 2 Vanguard systems
           • Same search engine
           • Similar content
           • Both use metadata in similar ways
           • Same queries
       • One system returns higher quality search results much more reliably than the other. What makes the difference?
       • A Thesaurus!
  4. Controlled Vocabularies & Thesauri
       • Equivalence: These things are the same
           • IRA = individual retirement account
           • redemption = sale
           • 401(k) = 401k = 401 k
           • AKA a “synonym ring”
       • Preference: This is the standard term, these are variants
           • ETF over VIPER
           • Electronic Bank Transfer over wire
           • Beneficiary over Beneficary
           • AKA an “authority file”
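
The equivalence and preference relationships on this slide can be sketched as a small query-expansion step. This is only an illustration, not the system described in the talk: the data is copied from the slide's examples, and the names `SYNONYM_RINGS`, `AUTHORITY_FILE`, `preferred_term`, and `expand_query` are hypothetical.

```python
# Hypothetical miniature vocabulary, copied from the slide's examples.
# A synonym ring treats every member as equivalent.
SYNONYM_RINGS = [
    {"ira", "individual retirement account"},
    {"redemption", "sale"},
    {"401(k)", "401k", "401 k"},
]

# An authority file maps each variant (including misspellings)
# to the single preferred term.
AUTHORITY_FILE = {
    "viper": "etf",
    "wire": "electronic bank transfer",
    "beneficary": "beneficiary",
}


def preferred_term(term: str) -> str:
    """Return the standard term for a variant, or the lowercased term itself."""
    key = term.strip().lower()
    return AUTHORITY_FILE.get(key, key)


def expand_query(term: str) -> set[str]:
    """Return the term plus every equivalent from any ring that contains it."""
    key = preferred_term(term)
    expanded = {key}
    for ring in SYNONYM_RINGS:
        if key in ring:
            expanded |= ring
    return expanded


if __name__ == "__main__":
    print(preferred_term("VIPER"))
    print(sorted(expand_query("401k")))
```

A search front end could run each query term through `expand_query` before matching, so that a search for "401k" also finds documents that say "401(k)".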
  5. Controlled Vocabularies & Thesauri
       • Classification: This is the parent (or child) of that
           • investment > mutual fund > stock fund > S&P 500 Index
           • “broader terms” and “narrower terms”
           • Similar to (but not the same as) a navigational taxonomy
       • Related: This is associated with that
           • distributions & capital gains
           • download & Quicken & tax forms
           • May be used as a “See also” or “Best Bets” function
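
The classification and related-term relationships can be sketched in the same spirit. Again this is an assumption-laden illustration: the `BROADER` mapping and `RELATED_GROUPS` list hold only the slide's examples, and the function names are hypothetical.

```python
# Hypothetical data, copied from the slide's examples.
# Each term points to its immediate broader (parent) term.
BROADER = {
    "s&p 500 index": "stock fund",
    "stock fund": "mutual fund",
    "mutual fund": "investment",
}

# Terms in the same group are associated with one another.
RELATED_GROUPS = [
    {"distributions", "capital gains"},
    {"download", "quicken", "tax forms"},
]


def broader_terms(term: str) -> list[str]:
    """Walk up the hierarchy, nearest parent first."""
    key = term.strip().lower()
    chain = []
    while key in BROADER:
        key = BROADER[key]
        chain.append(key)
    return chain


def narrower_terms(term: str) -> list[str]:
    """Return the immediate children of a term."""
    key = term.strip().lower()
    return sorted(child for child, parent in BROADER.items() if parent == key)


def see_also(term: str) -> list[str]:
    """Return associated terms, as a 'See also' feature might."""
    key = term.strip().lower()
    related = set()
    for group in RELATED_GROUPS:
        if key in group:
            related |= group - {key}
    return sorted(related)


if __name__ == "__main__":
    print(broader_terms("S&P 500 Index"))
    print(see_also("Quicken"))
```

Keeping only the immediate parent for each term is a design choice made for brevity; a real thesaurus may allow a term to have several broader terms.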
  6. Ontologies
       • “A specification of a conceptualization”
       • It explains the relationships between concepts
       • Languages include RDF, DAML+OIL, and OWL
       • [Diagram: a statement made up of Subject, Predicate, and Object]
  7. Example of an Ontology
       • [Diagram] Concepts: Planet, Star, Satellite (Artificial, Natural), Mercury, Venus, Earth, Mars, Hubble, the moon, the sun, Atmosphere, Crust, Mantle, Core
       • Relationships shown: “goes around”, “is a”, “type of”, “part of”
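
An ontology like this one can be sketched as a set of subject-predicate-object statements that a program can query. The sketch below uses plain Python tuples rather than RDF or OWL tooling. The slide's diagram is flattened in this transcript, so the specific edges in `TRIPLES` are an assumed reading of it, and the helper names are hypothetical.

```python
# Assumed reading of the diagram: each tuple is (subject, predicate, object).
TRIPLES = {
    ("Planet", "goes around", "Star"),
    ("Satellite", "goes around", "Planet"),
    ("Mercury", "is a", "Planet"),
    ("Venus", "is a", "Planet"),
    ("Earth", "is a", "Planet"),
    ("Mars", "is a", "Planet"),
    ("The sun", "is a", "Star"),
    ("Artificial", "type of", "Satellite"),
    ("Natural", "type of", "Satellite"),
    ("Hubble", "is a", "Artificial"),
    ("The moon", "is a", "Natural"),
}


def objects(subject: str, predicate: str) -> set[str]:
    """Return every object asserted for this subject and predicate."""
    return {o for s, p, o in TRIPLES if s == subject and p == predicate}


def classes_of(entity: str) -> set[str]:
    """Follow 'is a' and 'type of' links upward to collect all classes."""
    found = set()
    frontier = [entity]
    while frontier:
        node = frontier.pop()
        for predicate in ("is a", "type of"):
            for parent in objects(node, predicate):
                if parent not in found:
                    found.add(parent)
                    frontier.append(parent)
    return found


def goes_around(entity: str) -> set[str]:
    """Infer what kind of thing an entity orbits from its classes."""
    result = objects(entity, "goes around")
    for cls in classes_of(entity):
        result |= objects(cls, "goes around")
    return result


if __name__ == "__main__":
    print(sorted(classes_of("Hubble")))
    print(goes_around("Hubble"))
```

Nothing in the data states directly that Hubble goes around a planet; the program derives it by chaining "is a" and "type of" links, which is the kind of inference explicit relationships between concepts make possible.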
