Leveraging Your Taxonomy With Navtree and MAIQuery
Upcoming SlideShare
Loading in...5
×
 

Leveraging Your Taxonomy With Navtree and MAIQuery

on

  • 218 views

How to make your content users more productive using Access Innovations, Inc.'s Navtree and Machine Aided Indexer (M.A.I.™), parts of the Data Harmony® software suite.

How to make your content users more productive using Access Innovations, Inc.'s Navtree and Machine Aided Indexer (M.A.I.™), parts of the Data Harmony® software suite.

Statistics

Views

Total Views
218
Slideshare-icon Views on SlideShare
208
Embed Views
10

Actions

Likes
1
Downloads
2
Comments
0

2 Embeds 10

http://www.dataharmony.com 9
http://www.accessinn.com 1

Accessibility

Categories

Upload Details

Uploaded via as Microsoft PowerPoint

Usage Rights

© All Rights Reserved

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Processing…
Post Comment
Edit your comment
  • There are other forms of organization – alpha, chronological, geographical, audience, etc. <br /> Taxonomy organizes by topic, by subject, by aboutness. <br />
  • We already know that it helps your authors and editors assign effective subject terms <br /> And that it automates indexing items in legacy collections <br />
  • Recognizing term equivalents – important point, we’ll see more on this later. <br />
  • We already know that it helps your authors and editors assign effective subject terms <br /> And that it automates indexing items in legacy collections <br />
  • We already know that it helps your authors and editors assign effective subject terms <br /> And that it automates indexing items in legacy collections <br />
  • Your user interface can offer a rule-base assisted search or a full text search <br />
  • Any language – jsp, asp, Perl – can be used <br />
  • When connected, all the advantages of using a controlled vocabulary for indexing are made available to the user <br />
  • Level 2 <br />
  • Level 3 - success for this category <br />
  • Your user interface can offer a rule-base assisted search or a full text search <br />
  • Our demo page includes an MAIQuery search box also <br /> We see 4 levels here – Business, Business Enterprises, Corporations, Corporate structure <br /> Each indicates how many titles are indexed with it and how many are indexed with either it or its child, narrower terms <br />
  • See options at bottom <br />
  • When connected, all the advantages of using a controlled vocabulary for indexing are made available to the user <br />
  • When connected, all the advantages of using a controlled vocabulary for indexing are made available to the user <br />

Leveraging Your Taxonomy With Navtree and MAIQuery Leveraging Your Taxonomy With Navtree and MAIQuery Presentation Transcript

  • Leveraging Your Taxonomy to Increase User Productivity MAIQuery and TM Navtree
  • 2 Taxonomies aid site organization Taxonomy provides:  Framework for content organization  Hierarchical outline of your content by subject categories  Basis for effective browsing
  • 3 Integrated taxonomy enhances findability  Browsable categories of a directory  Smart search for term equivalents  Taxonomy terms (original or modified) as labels  Navigation aids incorporate taxonomy terms and relationships
  • 4 Example Search: body growth Complete database (60,000 + titles)  Free text search  8 hits — some irrelevant  Free text search on titles  6 hits — limited recall  Search by taxonomy descriptor (AKA subject term or category)  470 hits  100% relevant  100% recall
  • 5 Increasing User Productivity  Items in an information collection can be retrieved with better precision (relevance) and better recall by using a controlled vocabulary to assign subject terms (key words) to them How do you connect your users to the controlled vocabulary?
  • 6 Connecting Users 1. Use the rulebase you’ve developed for machine aided indexing (MAIQuery) 2. Use the controlled vocabulary itself (TM Navtree)
  • 7 MAI’s talents  MAI (Machine Aided Indexer) helps authors and editors assign effective subject terms automates the assignment of subject terms to items in legacy collections
  • 8  M.A.I. suggests the correct terms from the taxonomy as descriptors  M.A.I. rulebase recognizes term equivalents  germs  Microorganisms  vaccin*  Pharmaceutical drugs Recognizing term equivalents enables enhanced search Taxonomy terms on documents help sort and organize the content
  • 9 MAI’s “hidden talents”  MAI can also: Provide for the appropriate preferred term when given a word or phrase Return preferred terms for uses of the word in different contexts
  • 10 More “hidden talents”  MAIQuery can: Show related terms from the thesaurus to broaden a search Show the rules and preferred term’s scope notes to clarify how the preferred term relates to others in the thesaurus
  • 11 Presenting: MAIQuery™  Web page presents a search box that will use the MAI rulebase  Can be in addition to full text search and advanced search  User enters a word or phrase in the search box  MAI searches the rulebase for any occurrences of the word(s)
  • 12 MAIQuery
  • 13 the MAIQuery demo  Uses web pages and php coding:  Passes the search words to “dosearch.php”  dosearch.php passes the term to MAI’s concept extractor  MAI returns a list of suggested terms from the controlled vocabulary
  • 14 Suggested terms The term Music is suggested by the rule for music*(1) Click on the first (the preferred term) to see the term record; click on the second to see the MAI rule The term Instrumental Music is suggested by the rule for music*(1) Click on the first (the preferred term) to see the term record; click on the second to see the MAI rule
  • 15 Options  Thesaurus Master can be queried to show the term record  Broader term  Narrower terms  Use For terms (“synonyms”)  Related terms  Scope notes
  • 16
  • 17  MAI can be queried to return the rule that includes the search word(s) Options, continued
  • 18 Show the rule
  • 19 Options, continued  Your database/index of items is then queried to bring back the records in your collection that are indexed with the preferred term  For our demo, we wrote an xquery request into the gettitles.php file  Our 1100-title demo records are maintained by a MarkLogic server
  • 20 A list of items
  • 21 Choose the item  Your user clicks on the item(s) appropriate to their query  The document details (or the item itself) is returned
  • 22 The right stuff
  • 23 How’s it working? What words and phrases do your users search for?  a search log can record “misses”  a user focus group can suggest additions  subject matter experts can help in their area of expertise
  • 24 Fine tuning Modify your taxonomy to respond to more words  add common misspellings to rules  add alternate words as Use For terms (synonyms) in the thesaurus (or as additions to the rules)  consider terms for addition to the thesaurus (candidates)
  • 25 The advantages  MAIQuery connects your user with the controlled vocabulary  Your user can review term records and rulebase rules to learn more about your taxonomy  Your user becomes more productive
  • 26 Another way to connect users  Category search used more than half the time for research  Also known as directory search, your user “drills down” from general to specific
  • 27 Value of Category search  Searchers find info 50% faster using browsable categories than using list returned from free text search  Results even stronger when results not in top 20 returns  Searchers prefer browsable category search Chen, H., and Dumais, S.
  • 28 Search – the Directory Approach
  • 29 Category: Business and Economy
  • 30 Results: Business Libraries
  • 31 Your Thesaurus as Directory  Present your controlled vocabulary as a guide to your collection
  • 32
  • 33 Thesauri OnLine  Australian Governments' Interactive Functions Thesaurus – AGIFT http://www.naa.gov.au/recordkeeping/thesauru  Transportation Research Thesaurus – TRT http://ntl.bts.gov/trt/trt_topterms.jsp  NBII (National Biological Information Infrastructure) http ://thesaurus.nbii.gov/SearchNBIIThesaurus/ab
  • 34 Presenting: TM Navtree  Your thesaurus presented as a navigation aid  User “drill down” with all the neighboring terms visible  Each term indicates the number of documents indexed with it  Terms are hyperlinks to a list of items
  • 35 A hierarchical tree
  • 36 See full topic coverage by revealing Narrower Terms
  • 37 Choose a term  Click on a term, get the titles indexed with it
  • 38 Choose a title  Click on a title, get its details (or bring up the item)
  • 39 How it’s done  We used PHP Levels, an open source application from SourceForge to create the tree  An exported XML version of the thesaurus is parsed to produce the required text file to populate the tree  The content manager is queried for the document totals
  • 40 How it’s done, continued  When a term is selected, it is passed to a gettitles.php  A bit of php code connects to the content manager and returns a string of data about each title  The web page displays the data in the format desired
  • 41 The advantages  TM Navtree Top Terms describe the organization of your collection(s)  Narrower terms help your user hone in on the most appropriate term  Adjacent terms impart connotation
  • 42 The advantages  ALL the records indexed with the chosen term are returned  Your user finds what’s needed more quickly and is more productive
  • 43 Questions? Comments? Try out the demo at www.mediasleuth.com See more details: Data Harmony Programmer Interface for Web Applications Thank you. Mary Garcia
  • 44 MAI Query and NavTree from Data Harmony Making Users More Productive