Leveraging Your Taxonomy to Increase User Productivity
MAIQuery and TM Navtree

Taxonomies aid site organization
Taxonomy provides:
• Framework for content organization
• Hierarchical outline of your content by subject categories
• Basis for effective browsing

Integrated taxonomy enhances findability
• Browsable categories of a directory
• Smart search for term equivalents
• Taxonomy terms (original or modified) as labels
• Navigation aids incorporate taxonomy terms and relationships

Example Search: body growth
Complete database (60,000+ titles)
• Free text search
  - 8 hits (some irrelevant)
• Free text search on titles
  - 6 hits (limited recall)
• Search by taxonomy descriptor (AKA subject term or category)
  - 470 hits
  - 100% relevant
  - 100% recall

Increasing User Productivity
• Items in an information collection can be retrieved with better precision (relevance) and better recall by using a controlled vocabulary to assign subject terms (keywords) to them
How do you connect your users to the controlled vocabulary?

Connecting Users
1. Use the rulebase you’ve developed for machine aided indexing (MAIQuery)
2. Use the controlled vocabulary itself (TM Navtree)

MAI’s talents
• MAI (Machine Aided Indexer):
  - helps authors and editors assign effective subject terms
  - automates the assignment of subject terms to items in legacy collections

• MAI suggests the correct terms from the taxonomy as descriptors
• The MAI rulebase recognizes term equivalents
  - germs → Microorganisms
  - vaccin* → Pharmaceutical drugs
Recognizing term equivalents enables enhanced search
Taxonomy terms on documents help sort and organize the content
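
The idea of a rulebase mapping words to preferred terms can be sketched in a few lines. This is an illustrative stand-in, not MAI's rule engine: the `RULES` dictionary and `suggest_terms` function are hypothetical names, only the two rules shown on the slide are included, and real MAI rules may carry conditions that simple wildcard matching does not model.

```python
from fnmatch import fnmatchcase

# Hypothetical rulebase: text pattern -> preferred taxonomy term.
# The two entries are the examples from the slide.
RULES = {
    "germs": "Microorganisms",
    "vaccin*": "Pharmaceutical drugs",
}


def suggest_terms(text, rules=RULES):
    """Return preferred terms whose rule pattern matches a word in the text."""
    # Lowercase and strip surrounding punctuation so "Vaccines," matches "vaccin*".
    words = [w.strip(".,;:!?()\"'").lower() for w in text.split()]
    suggestions = []
    for word in words:
        for pattern, term in rules.items():
            if fnmatchcase(word, pattern) and term not in suggestions:
                suggestions.append(term)
    return suggestions


example = suggest_terms("New vaccines target germs")
```

Because the wildcard rule fires on "vaccines", "vaccination", and similar forms, a searcher does not need to know the exact preferred term in advance.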

MAI’s “hidden talents”
• MAI can also:
  - Provide the appropriate preferred term when given a word or phrase
  - Return preferred terms for uses of the word in different contexts

More “hidden talents”
• MAIQuery can:
  - Show related terms from the thesaurus to broaden a search
  - Show the rules and the preferred term’s scope notes to clarify how the preferred term relates to others in the thesaurus

Presenting: MAIQuery™
• Web page presents a search box that uses the MAI rulebase
• Can be offered in addition to full text search and advanced search
• User enters a word or phrase in the search box
• MAI searches the rulebase for any occurrences of the word(s)

MAIQuery

The MAIQuery demo
• Uses web pages and PHP code:
  - Passes the search words to “dosearch.php”
  - dosearch.php passes the term to MAI’s concept extractor
  - MAI returns a list of suggested terms from the controlled vocabulary

Suggested terms
The term Music is suggested by the rule for music*(1)
The term Instrumental Music is suggested by the rule for music*(1)
In each result, click on the first link (the preferred term) to see the term record; click on the second to see the MAI rule

Options
• Thesaurus Master can be queried to show the term record
  - Broader term
  - Narrower terms
  - Use For terms (“synonyms”)
  - Related terms
  - Scope notes
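
The fields of a term record listed above map naturally onto a small data structure. The sketch below is an assumption about shape only, not Thesaurus Master's actual record format: the `TermRecord` class, its field names, and the sample values (including the placement of Instrumental Music under Music and the "tunes" synonym) are hypothetical. The BT/NT/UF/RT/SN labels are the conventional thesaurus abbreviations.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class TermRecord:
    """One thesaurus term with its relationships (hypothetical layout)."""
    preferred_term: str
    broader_term: Optional[str] = None
    narrower_terms: List[str] = field(default_factory=list)
    use_for: List[str] = field(default_factory=list)        # "synonyms"
    related_terms: List[str] = field(default_factory=list)
    scope_note: str = ""

    def display(self) -> str:
        """Render the record with conventional thesaurus abbreviations."""
        lines = [self.preferred_term]
        if self.broader_term:
            lines.append(f"  BT {self.broader_term}")
        lines.extend(f"  NT {t}" for t in self.narrower_terms)
        lines.extend(f"  UF {t}" for t in self.use_for)
        lines.extend(f"  RT {t}" for t in self.related_terms)
        if self.scope_note:
            lines.append(f"  SN {self.scope_note}")
        return "\n".join(lines)


# Hypothetical sample record, for illustration only.
music = TermRecord(
    preferred_term="Music",
    narrower_terms=["Instrumental Music"],
    use_for=["tunes"],
)
```

Calling `music.display()` yields a compact text view that a web page could show when the user clicks a preferred term.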

Options, continued
• MAI can be queried to return the rule that includes the search word(s)

Show the rule

Options, continued
• Your database/index of items is then queried to bring back the records in your collection that are indexed with the preferred term
• For our demo, we wrote an XQuery request into the gettitles.php file
• Our 1100-title demo records are maintained by a MarkLogic server
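
Retrieving the records indexed with a preferred term amounts to a lookup in a term-to-record index. The demo did this with an XQuery request against MarkLogic; the sketch below swaps in a plain in-memory index purely to show the logic. The records, titles, and function names are all hypothetical.

```python
from collections import defaultdict

# Hypothetical demo records; each carries the preferred terms assigned to it.
RECORDS = [
    {"id": 1, "title": "Concert Hall Acoustics", "terms": ["Music", "Acoustics"]},
    {"id": 2, "title": "String Quartets", "terms": ["Instrumental Music"]},
    {"id": 3, "title": "A History of Song", "terms": ["Music"]},
]


def build_index(records):
    """Map each preferred term to the ids of the records indexed with it."""
    index = defaultdict(list)
    for record in records:
        for term in record["terms"]:
            index[term].append(record["id"])
    return index


def titles_for_term(term, records, index):
    """Return the titles of all records indexed with the given term."""
    titles_by_id = {r["id"]: r["title"] for r in records}
    return [titles_by_id[i] for i in index.get(term, [])]


index = build_index(RECORDS)
music_titles = titles_for_term("Music", RECORDS, index)
```

Because the lookup is by assigned term rather than by words in the text, every record indexed with the term comes back, whatever wording its title uses.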

A list of items

Choose the item
• Your user clicks on the item(s) appropriate to their query
• The document details (or the item itself) are returned

The right stuff

How’s it working?
What words and phrases do your users search for?
• A search log can record “misses”
• A user focus group can suggest additions
• Subject matter experts can help in their area of expertise
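
A search log of "misses" can be summarized with a simple tally. This is a minimal sketch under stated assumptions: the log format (a query string paired with the number of suggested terms returned) and the sample entries are hypothetical, not taken from the demo.

```python
from collections import Counter


def count_misses(log_entries):
    """Tally queries that returned no suggested terms, most frequent first.

    log_entries: iterable of (query, suggestion_count) pairs (assumed format).
    """
    misses = Counter()
    for query, suggestion_count in log_entries:
        if suggestion_count == 0:
            # Normalize case and whitespace so variants are counted together.
            misses[query.strip().lower()] += 1
    return misses.most_common()


# Hypothetical log entries, for illustration only.
sample_log = [
    ("vacine", 0),
    ("music", 2),
    ("Vacine ", 0),
    ("germ theory", 0),
]

miss_report = count_misses(sample_log)
```

The most frequent misses are natural candidates for new rules, Use For terms, or misspelling entries.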

Fine tuning
Modify your taxonomy to respond to more words
• Add common misspellings to rules
• Add alternate words as Use For terms (synonyms) in the thesaurus (or as additions to the rules)
• Consider terms for addition to the thesaurus (candidates)

The advantages
• MAIQuery connects your user with the controlled vocabulary
• Your user can review term records and rulebase rules to learn more about your taxonomy
• Your user becomes more productive

Another way to connect users
• Category search is used more than half the time for research
• Also known as directory search: your user “drills down” from general to specific

Value of Category search
• Searchers find info 50% faster using browsable categories than using the list returned from a free text search
• Results are even stronger when the item sought is not in the top 20 returns
• Searchers prefer browsable category search
Chen, H., and Dumais, S.

Search – the Directory Approach

Category: Business and Economy

Results: Business Libraries

Your Thesaurus as Directory
• Present your controlled vocabulary as a guide to your collection

Thesauri OnLine
• Australian Governments' Interactive Functions Thesaurus – AGIFT
  http://www.naa.gov.au/recordkeeping/thesauru
• Transportation Research Thesaurus – TRT
  http://ntl.bts.gov/trt/trt_topterms.jsp
• NBII (National Biological Information Infrastructure)
  http://thesaurus.nbii.gov/SearchNBIIThesaurus/ab

Presenting: TM Navtree
• Your thesaurus presented as a navigation aid
• Users “drill down” with all the neighboring terms visible
• Each term indicates the number of documents indexed with it
• Terms are hyperlinks to a list of items

A hierarchical tree

See full topic coverage by revealing Narrower Terms

Choose a term
• Click on a term, get the titles indexed with it

Choose a title
• Click on a title, get its details (or bring up the item)

How it’s done
• We used PHP Levels, an open source application from SourceForge, to create the tree
• An exported XML version of the thesaurus is parsed to produce the required text file to populate the tree
• The content manager is queried for the document totals
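
The parse-and-count step above can be sketched as follows. Everything specific here is an assumption: the XML layout (nested `term` elements with a `name` attribute), the indented output format, and the sample index of title ids are hypothetical, and the file format PHP Levels actually expects is not shown in this deck. Each line reports two numbers: titles indexed with the term itself, and titles indexed with the term or any of its narrower terms.

```python
import xml.etree.ElementTree as ET

# Hypothetical thesaurus export; the real export format may differ.
SAMPLE_XML = """
<thesaurus>
  <term name="Business">
    <term name="Business Enterprises">
      <term name="Corporations"/>
    </term>
  </term>
</thesaurus>
"""

# Hypothetical content-manager data: term -> ids of titles indexed with it.
INDEXED = {
    "Business": {1, 2},
    "Business Enterprises": {2, 3},
    "Corporations": {4},
}


def tree_lines(element, indexed, depth=0):
    """Return (lines, ids) for a term and everything beneath it."""
    name = element.get("name")
    own_ids = indexed.get(name, set())
    rolled_up = set(own_ids)  # sets avoid double-counting shared titles
    child_lines = []
    for child in element.findall("term"):
        lines, ids = tree_lines(child, indexed, depth + 1)
        child_lines.extend(lines)
        rolled_up |= ids
    line = f"{'  ' * depth}{name} ({len(own_ids)}/{len(rolled_up)})"
    return [line] + child_lines, rolled_up


def build_tree_text(xml_text, indexed):
    """Produce one indented line per term, top terms first."""
    root = ET.fromstring(xml_text)
    lines = []
    for top_term in root.findall("term"):
        top_lines, _ = tree_lines(top_term, indexed)
        lines.extend(top_lines)
    return "\n".join(lines)


tree_text = build_tree_text(SAMPLE_XML, INDEXED)
```

Rolling counts up with sets rather than sums matters when one title is indexed with both a term and its narrower term; a sum would count that title twice.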

How it’s done, continued
• When a term is selected, it is passed to gettitles.php
• A bit of PHP code connects to the content manager and returns a string of data about each title
• The web page displays the data in the desired format

The advantages
• TM Navtree Top Terms describe the organization of your collection(s)
• Narrower terms help your user home in on the most appropriate term
• Adjacent terms impart connotation

The advantages
• ALL the records indexed with the chosen term are returned
• Your user finds what’s needed more quickly and is more productive

Questions? Comments?
Try out the demo at www.mediasleuth.com
See more details: Data Harmony Programmer Interface for Web Applications
Thank you.
Mary Garcia

MAI Query and NavTree from Data Harmony
Making Users More Productive


Editor's Notes

  1. There are other forms of organization – alpha, chronological, geographical, audience, etc. Taxonomy organizes by topic, by subject, by aboutness.
  2. We already know that it helps your authors and editors assign effective subject terms, and that it automates indexing items in legacy collections.
  3. Recognizing term equivalents is an important point; we’ll see more on this later.
  4. We already know that it helps your authors and editors assign effective subject terms, and that it automates indexing items in legacy collections.
  5. We already know that it helps your authors and editors assign effective subject terms, and that it automates indexing items in legacy collections.
  6. Your user interface can offer a rule-base assisted search or a full text search
  7. Any language (JSP, ASP, Perl) can be used
  8. When connected, all the advantages of using a controlled vocabulary for indexing are made available to the user
  9. Level 2
  10. Level 3 - success for this category
  11. Your user interface can offer a rule-base assisted search or a full text search
  12. Our demo page also includes an MAIQuery search box. We see 4 levels here: Business, Business Enterprises, Corporations, Corporate structure. Each indicates how many titles are indexed with it and how many are indexed with either it or its child (narrower) terms.
  13. See options at bottom
  14. When connected, all the advantages of using a controlled vocabulary for indexing are made available to the user
  15. When connected, all the advantages of using a controlled vocabulary for indexing are made available to the user