SlideShare a Scribd company logo
1 of 21
Developing the AIP Thesaurus:
 The Platform for an Ontology



        Mark Cassar
 American Institute of Physics

          Jack Bruce
 Marjorie (Margie) M.K. Hlava
     Access Innovations:
        505-998-0800
Background

• Physics and Astronomy Classification Scheme (PACS)
• Six digit code schema used for indexing scholarly
  content
• 10 digit based
   – domain headings with subcategories nested under
     each domain.
• Precoordinated system
   – Combine terms (concepts) at the time of indexing
Why Change?
• Improve searchability
• Move to Post coordinated system
   – Combine terms at time of search
• Semantic enrichment
• Flexible metadata for many applications
• Naturalize the vocabulary
   – Represent concepts succinctly and concisely
   – Easily add new concepts based on new and emerging
     technologies and applications
   – Allow unlimited hierarchy levels and polyhierarchy
Better ROI

• Rules-assisted indexing
   – Provide end users with a swift indexing solution
     based on the Machine-Aided Indexer (M.A.I.)
     engine.
   – Batch index large corpus of scholarly content, as
     well as future content.
• Improve costs
   – Automate a large portion of electronic indexing
   – Less overhead for indexing
Roadmap of the AIP Thesaurus
• Data Collection
   – Load PACS codes and terms
   – Incorporate Search logs; add top searched concepts into the
     vocabulary
• Analysis of Content
   – Test comparison of indexing to humanly indexed articles
• Thesaurus Construction
   – Separate, disambiguate, and migrate concepts; Break up top
     domains
   – Apply thesaurus and taxonomy standardization to each term
   – Multiple reviews for each top section
• Evaluation and Feedback
   – Send back working draft to AIP for review
   – Gather feedback from subject matter experts and incorporate the
     changes into the thesaurus
• Finalization and Product Delivery
Source Data

• PACS 2009 ed.
• 1999 ed. Of AIP Thesaurus (out of date)
• Terms added to INSPEC since 2000
• Internal and external search logs
• Cumulative journal indexes
   – Digital
   – (2006 through 2009)
• List of AIP divisions and their internal classifications
Analysis of Content


• Organizational warrant
   – PACS 2009 (2010)
   – www.aip.org
   – UniPHY
• Literary warrant
   – Where we found the term used
• Most frequent search terms loaded into thesaurus
Thesaurus Creation Process
• Load data (vocabulary) into Data Harmony MAIstro™
• PACS
   – Restructure top domains
   – Separate into discrete
   – Disambiguate terms
   – Remove parenthetical qualifiers
   – Create post coordinated terms
   – Migrate separated terms into new/relevant categories
• Sort flat lists (search logs) into main categories determined
• Use multiple reviewers for each physics domain
• About 8181 preferred terms and 5217 synonyms
PACS TERM:
– Low-energy electron diffraction (LEED) and reflection
high-energy electron diffraction (RHEED) (condensed
matter structure determination)
– Becomes
– BT Condensed matter structure determination
 • NT Low energy electron diffraction
    –Synonym LEED
 • NT Reflection high energy electron diffraction
    –Synonym RHEED
Evaluation and Feedback


• Weekly scheduled live demos of the thesaurus
• Free web-hosted version of the thesaurus and
  periodic spreadsheet exports
• Collect feedback based on SME suggestions and AIP
  PACS experts
   – Correspondence via email
• Incorporate changes into thesaurus
Available versions


• Electronic copy of AIP thesaurus supplied in
   – XML
   – Excel
   – Web-based, read-only versions (Thesviewer)
   – MARC, SKOS, OWL, CSV etc
Taxonomy
  view
            Thesaurus
           Term Record
               view
To make an ontology


• Define additional Associative relationships
• Define additional Hierarchical relationships
   – IsA, IsPartOf, HasA
• Define additional Equivalence relationship
       • Multilingual options
       • Weights and measures
Clearer disambiguation?

                              Temperature
Planets
                IsA
                                         TypeOf

      IsA                                         BrandOf
                  Mercury
Roman god                        IsA                 Automobile




                      Metallic element
Knowledge Organization Systems
•   Uncontrolled list                  Not complex

•   Name authority file
•   Synonym set/ring
•   Controlled vocabulary
•   Taxonomy
•   Thesaurus AIP Thesaurus is here
•   Ontology
•   Semantic network                  Highly complex
Lessons Learned
• Learning the style for indexing
• Tendency to reversion to PACS style of language and
  classification
• SME feedback turnaround
   – Sit with them 2 hours
   – Incorporate suggestions 8 hours
   – 2117 Terms Added
     1354 Terms changed or updated
     1333 Terms deleted
     11259 Other actions
Where are we now?
• Platform is established
• OWL and other formats available
• One kind of Associative relationship
   – (Related terms)
• One kind of Hierarchical relationship
   – Broader Narrower / Parent Child
   – Multiple broader terms for interdisciplinary options
• One kind of Equivalence relationship
      • Synonym non preferred terms
• Built using the Z39.19 standard - interoperable
To Review AIP Thes
• Use a web browser
• http://thesview.accessinn.com/aipThes/
• username/password twice - in all cases both are
  'aip'.
• Begins a java app in your browser that shows the
  thesaurus starting from the top level of the hierarchy.
• Use the collaboration module to comment and
  discuss
Thank you


          Marjorie Hlava
    mhlava@accessin.com
          505-998-0800

More Related Content

Similar to Developing the AIP Thesaurus: The Platform for an Ontology

Hackathon report catalogue-ontology-vocabulary-characteristcs-relevant-to-e...
Hackathon report   catalogue-ontology-vocabulary-characteristcs-relevant-to-e...Hackathon report   catalogue-ontology-vocabulary-characteristcs-relevant-to-e...
Hackathon report catalogue-ontology-vocabulary-characteristcs-relevant-to-e...Amanda Vizedom
 
The state of KOS in the Linked Data movement
The state of KOS in the Linked Data movementThe state of KOS in the Linked Data movement
The state of KOS in the Linked Data movementMarcia Zeng
 
LoCloud Vocabulary Services: Thesaurus management introduction, Walter Koch a...
LoCloud Vocabulary Services: Thesaurus management introduction, Walter Koch a...LoCloud Vocabulary Services: Thesaurus management introduction, Walter Koch a...
LoCloud Vocabulary Services: Thesaurus management introduction, Walter Koch a...locloud
 
NISO access related projects (presented at the Charleston conference 2016)
NISO access related projects (presented at the Charleston conference 2016)NISO access related projects (presented at the Charleston conference 2016)
NISO access related projects (presented at the Charleston conference 2016)Christine Stohn
 
Linking a Thesaurus To SharePoint for Content Management
Linking a Thesaurus To SharePoint for Content ManagementLinking a Thesaurus To SharePoint for Content Management
Linking a Thesaurus To SharePoint for Content ManagementAccess Innovations, Inc.
 
Knowledge engineering and the Web
Knowledge engineering and the WebKnowledge engineering and the Web
Knowledge engineering and the WebGuus Schreiber
 
Ontology Web services for Semantic Applications
Ontology Web services for Semantic ApplicationsOntology Web services for Semantic Applications
Ontology Web services for Semantic ApplicationsTrish Whetzel
 
The Semantic Web meets the Code of Federal Regulations
The Semantic Web meets the Code of Federal RegulationsThe Semantic Web meets the Code of Federal Regulations
The Semantic Web meets the Code of Federal Regulationstbruce
 
Knowledge Representation, Semantic Web
Knowledge Representation, Semantic WebKnowledge Representation, Semantic Web
Knowledge Representation, Semantic WebSerendipity Seraph
 
Leeds Met Open Search - towards an integrated solution for research and OER
Leeds Met Open Search - towards an integrated solution for research and OERLeeds Met Open Search - towards an integrated solution for research and OER
Leeds Met Open Search - towards an integrated solution for research and OERNick Sheppard
 
CapitalCamp DC 2012: Taxonomy
CapitalCamp DC 2012: TaxonomyCapitalCamp DC 2012: Taxonomy
CapitalCamp DC 2012: TaxonomyNatalya Minkovsky
 
Experience with MarkLogic at Elsevier
Experience with MarkLogic at ElsevierExperience with MarkLogic at Elsevier
Experience with MarkLogic at ElsevierDATAVERSITY
 
Application of Ontology in Semantic Information Retrieval by Prof Shahrul Azm...
Application of Ontology in Semantic Information Retrieval by Prof Shahrul Azm...Application of Ontology in Semantic Information Retrieval by Prof Shahrul Azm...
Application of Ontology in Semantic Information Retrieval by Prof Shahrul Azm...Khirulnizam Abd Rahman
 
Taxonomy, ontology, folksonomies & SKOS.
Taxonomy, ontology, folksonomies & SKOS.Taxonomy, ontology, folksonomies & SKOS.
Taxonomy, ontology, folksonomies & SKOS.Janet Leu
 

Similar to Developing the AIP Thesaurus: The Platform for an Ontology (20)

DHUG 2017 - Thesaurus Construction Training
DHUG 2017 - Thesaurus Construction TrainingDHUG 2017 - Thesaurus Construction Training
DHUG 2017 - Thesaurus Construction Training
 
Hackathon report catalogue-ontology-vocabulary-characteristcs-relevant-to-e...
Hackathon report   catalogue-ontology-vocabulary-characteristcs-relevant-to-e...Hackathon report   catalogue-ontology-vocabulary-characteristcs-relevant-to-e...
Hackathon report catalogue-ontology-vocabulary-characteristcs-relevant-to-e...
 
The state of KOS in the Linked Data movement
The state of KOS in the Linked Data movementThe state of KOS in the Linked Data movement
The state of KOS in the Linked Data movement
 
Taxonomy Fundamentals Workshop 2013
Taxonomy Fundamentals Workshop 2013Taxonomy Fundamentals Workshop 2013
Taxonomy Fundamentals Workshop 2013
 
Taxonomy 101
Taxonomy 101Taxonomy 101
Taxonomy 101
 
Taxonomies and Metadata
Taxonomies and MetadataTaxonomies and Metadata
Taxonomies and Metadata
 
LoCloud Vocabulary Services: Thesaurus management introduction, Walter Koch a...
LoCloud Vocabulary Services: Thesaurus management introduction, Walter Koch a...LoCloud Vocabulary Services: Thesaurus management introduction, Walter Koch a...
LoCloud Vocabulary Services: Thesaurus management introduction, Walter Koch a...
 
NISO access related projects (presented at the Charleston conference 2016)
NISO access related projects (presented at the Charleston conference 2016)NISO access related projects (presented at the Charleston conference 2016)
NISO access related projects (presented at the Charleston conference 2016)
 
Linking a Thesaurus To SharePoint for Content Management
Linking a Thesaurus To SharePoint for Content ManagementLinking a Thesaurus To SharePoint for Content Management
Linking a Thesaurus To SharePoint for Content Management
 
Knowledge engineering and the Web
Knowledge engineering and the WebKnowledge engineering and the Web
Knowledge engineering and the Web
 
Ontology Web services for Semantic Applications
Ontology Web services for Semantic ApplicationsOntology Web services for Semantic Applications
Ontology Web services for Semantic Applications
 
The Semantic Web meets the Code of Federal Regulations
The Semantic Web meets the Code of Federal RegulationsThe Semantic Web meets the Code of Federal Regulations
The Semantic Web meets the Code of Federal Regulations
 
Globe seminar
Globe seminarGlobe seminar
Globe seminar
 
Knowledge Representation, Semantic Web
Knowledge Representation, Semantic WebKnowledge Representation, Semantic Web
Knowledge Representation, Semantic Web
 
Leeds Met Open Search - towards an integrated solution for research and OER
Leeds Met Open Search - towards an integrated solution for research and OERLeeds Met Open Search - towards an integrated solution for research and OER
Leeds Met Open Search - towards an integrated solution for research and OER
 
Knowledge mangement
Knowledge mangementKnowledge mangement
Knowledge mangement
 
CapitalCamp DC 2012: Taxonomy
CapitalCamp DC 2012: TaxonomyCapitalCamp DC 2012: Taxonomy
CapitalCamp DC 2012: Taxonomy
 
Experience with MarkLogic at Elsevier
Experience with MarkLogic at ElsevierExperience with MarkLogic at Elsevier
Experience with MarkLogic at Elsevier
 
Application of Ontology in Semantic Information Retrieval by Prof Shahrul Azm...
Application of Ontology in Semantic Information Retrieval by Prof Shahrul Azm...Application of Ontology in Semantic Information Retrieval by Prof Shahrul Azm...
Application of Ontology in Semantic Information Retrieval by Prof Shahrul Azm...
 
Taxonomy, ontology, folksonomies & SKOS.
Taxonomy, ontology, folksonomies & SKOS.Taxonomy, ontology, folksonomies & SKOS.
Taxonomy, ontology, folksonomies & SKOS.
 

More from Access Innovations, Inc.

Making AI Behave: Using Knowledge Domains to Produce Useful, Trustworthy Results
Making AI Behave: Using Knowledge Domains to Produce Useful, Trustworthy ResultsMaking AI Behave: Using Knowledge Domains to Produce Useful, Trustworthy Results
Making AI Behave: Using Knowledge Domains to Produce Useful, Trustworthy ResultsAccess Innovations, Inc.
 
ISO 25964-1Working Group ISO/TC 46/SC 9/WG 8
ISO 25964-1Working Group ISO/TC 46/SC 9/WG 8ISO 25964-1Working Group ISO/TC 46/SC 9/WG 8
ISO 25964-1Working Group ISO/TC 46/SC 9/WG 8Access Innovations, Inc.
 
Hindawi taxonomy and personalization 27.10 (1)
Hindawi taxonomy and personalization 27.10 (1)Hindawi taxonomy and personalization 27.10 (1)
Hindawi taxonomy and personalization 27.10 (1)Access Innovations, Inc.
 
Asco using ai-taxos-for meta-titles-february-2021
Asco using ai-taxos-for meta-titles-february-2021Asco using ai-taxos-for meta-titles-february-2021
Asco using ai-taxos-for meta-titles-february-2021Access Innovations, Inc.
 
Ai webinar 2 -what's in a name (consolidated pdf)
Ai webinar 2 -what's in a name (consolidated pdf)Ai webinar 2 -what's in a name (consolidated pdf)
Ai webinar 2 -what's in a name (consolidated pdf)Access Innovations, Inc.
 
Tagging overview - Why Keywords Don't Cut It
Tagging overview  - Why Keywords Don't Cut ItTagging overview  - Why Keywords Don't Cut It
Tagging overview - Why Keywords Don't Cut ItAccess Innovations, Inc.
 
DHUG 2018: Towards Web-Centric Repository Interoperability
DHUG 2018: Towards Web-Centric Repository InteroperabilityDHUG 2018: Towards Web-Centric Repository Interoperability
DHUG 2018: Towards Web-Centric Repository InteroperabilityAccess Innovations, Inc.
 
DHUG 2017 - Understanding ROI Just Enough to Get Your Project Funded
DHUG 2017 - Understanding ROI Just Enough to Get Your Project FundedDHUG 2017 - Understanding ROI Just Enough to Get Your Project Funded
DHUG 2017 - Understanding ROI Just Enough to Get Your Project FundedAccess Innovations, Inc.
 

More from Access Innovations, Inc. (20)

Making AI Behave: Using Knowledge Domains to Produce Useful, Trustworthy Results
Making AI Behave: Using Knowledge Domains to Produce Useful, Trustworthy ResultsMaking AI Behave: Using Knowledge Domains to Produce Useful, Trustworthy Results
Making AI Behave: Using Knowledge Domains to Produce Useful, Trustworthy Results
 
ISO 25964-1Working Group ISO/TC 46/SC 9/WG 8
ISO 25964-1Working Group ISO/TC 46/SC 9/WG 8ISO 25964-1Working Group ISO/TC 46/SC 9/WG 8
ISO 25964-1Working Group ISO/TC 46/SC 9/WG 8
 
Smart submit
Smart submitSmart submit
Smart submit
 
Plos taxonomy beyond search dhug 2021
Plos taxonomy beyond search   dhug 2021Plos taxonomy beyond search   dhug 2021
Plos taxonomy beyond search dhug 2021
 
Hindawi taxonomy and personalization 27.10 (1)
Hindawi taxonomy and personalization 27.10 (1)Hindawi taxonomy and personalization 27.10 (1)
Hindawi taxonomy and personalization 27.10 (1)
 
Data harmonycloudpowerpointclientfacing
Data harmonycloudpowerpointclientfacingData harmonycloudpowerpointclientfacing
Data harmonycloudpowerpointclientfacing
 
Data harmony update 2021
Data harmony update 2021 Data harmony update 2021
Data harmony update 2021
 
Atypon dhug2021
Atypon dhug2021Atypon dhug2021
Atypon dhug2021
 
Asco using ai-taxos-for meta-titles-february-2021
Asco using ai-taxos-for meta-titles-february-2021Asco using ai-taxos-for meta-titles-february-2021
Asco using ai-taxos-for meta-titles-february-2021
 
Asce more than just topic taxonomies
Asce more than just topic taxonomiesAsce more than just topic taxonomies
Asce more than just topic taxonomies
 
Acs discoverability-dhug2021
Acs discoverability-dhug2021Acs discoverability-dhug2021
Acs discoverability-dhug2021
 
Ai webinar 2 -what's in a name (consolidated pdf)
Ai webinar 2 -what's in a name (consolidated pdf)Ai webinar 2 -what's in a name (consolidated pdf)
Ai webinar 2 -what's in a name (consolidated pdf)
 
Tagging overview - Why Keywords Don't Cut It
Tagging overview  - Why Keywords Don't Cut ItTagging overview  - Why Keywords Don't Cut It
Tagging overview - Why Keywords Don't Cut It
 
Health Affairs - Why Keywords Don't Cut It
Health Affairs - Why Keywords Don't Cut ItHealth Affairs - Why Keywords Don't Cut It
Health Affairs - Why Keywords Don't Cut It
 
Why Keywords Don't Cut It
Why Keywords Don't Cut ItWhy Keywords Don't Cut It
Why Keywords Don't Cut It
 
Data Harmony update 2020 final
Data Harmony update 2020 finalData Harmony update 2020 final
Data Harmony update 2020 final
 
Data Harmony Update 2020 final
Data Harmony Update 2020 finalData Harmony Update 2020 final
Data Harmony Update 2020 final
 
DHUG 2018: Towards Web-Centric Repository Interoperability
DHUG 2018: Towards Web-Centric Repository InteroperabilityDHUG 2018: Towards Web-Centric Repository Interoperability
DHUG 2018: Towards Web-Centric Repository Interoperability
 
DHUG 2018 - Florida Thesis OCR
DHUG 2018 - Florida Thesis OCRDHUG 2018 - Florida Thesis OCR
DHUG 2018 - Florida Thesis OCR
 
DHUG 2017 - Understanding ROI Just Enough to Get Your Project Funded
DHUG 2017 - Understanding ROI Just Enough to Get Your Project FundedDHUG 2017 - Understanding ROI Just Enough to Get Your Project Funded
DHUG 2017 - Understanding ROI Just Enough to Get Your Project Funded
 

Recently uploaded

Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfsanyamsingh5019
 
Crayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon ACrayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon AUnboundStockton
 
Presiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsPresiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsanshu789521
 
ENGLISH5 QUARTER4 MODULE1 WEEK1-3 How Visual and Multimedia Elements.pptx
ENGLISH5 QUARTER4 MODULE1 WEEK1-3 How Visual and Multimedia Elements.pptxENGLISH5 QUARTER4 MODULE1 WEEK1-3 How Visual and Multimedia Elements.pptx
ENGLISH5 QUARTER4 MODULE1 WEEK1-3 How Visual and Multimedia Elements.pptxAnaBeatriceAblay2
 
Science 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsScience 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsKarinaGenton
 
Biting mechanism of poisonous snakes.pdf
Biting mechanism of poisonous snakes.pdfBiting mechanism of poisonous snakes.pdf
Biting mechanism of poisonous snakes.pdfadityarao40181
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)eniolaolutunde
 
How to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxHow to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxmanuelaromero2013
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13Steve Thomason
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxiammrhaywood
 
Blooming Together_ Growing a Community Garden Worksheet.docx
Blooming Together_ Growing a Community Garden Worksheet.docxBlooming Together_ Growing a Community Garden Worksheet.docx
Blooming Together_ Growing a Community Garden Worksheet.docxUnboundStockton
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Educationpboyjonauth
 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxOH TEIK BIN
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdfssuser54595a
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdfSoniaTolstoy
 
Alper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentAlper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentInMediaRes1
 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxNirmalaLoungPoorunde1
 

Recently uploaded (20)

Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdf
 
Crayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon ACrayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon A
 
Presiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsPresiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha elections
 
ENGLISH5 QUARTER4 MODULE1 WEEK1-3 How Visual and Multimedia Elements.pptx
ENGLISH5 QUARTER4 MODULE1 WEEK1-3 How Visual and Multimedia Elements.pptxENGLISH5 QUARTER4 MODULE1 WEEK1-3 How Visual and Multimedia Elements.pptx
ENGLISH5 QUARTER4 MODULE1 WEEK1-3 How Visual and Multimedia Elements.pptx
 
Science 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsScience 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its Characteristics
 
Biting mechanism of poisonous snakes.pdf
Biting mechanism of poisonous snakes.pdfBiting mechanism of poisonous snakes.pdf
Biting mechanism of poisonous snakes.pdf
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
 
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdfTataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
Staff of Color (SOC) Retention Efforts DDSD
Staff of Color (SOC) Retention Efforts DDSDStaff of Color (SOC) Retention Efforts DDSD
Staff of Color (SOC) Retention Efforts DDSD
 
How to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxHow to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptx
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
Blooming Together_ Growing a Community Garden Worksheet.docx
Blooming Together_ Growing a Community Garden Worksheet.docxBlooming Together_ Growing a Community Garden Worksheet.docx
Blooming Together_ Growing a Community Garden Worksheet.docx
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Education
 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptx
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
 
Alper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentAlper Gobel In Media Res Media Component
Alper Gobel In Media Res Media Component
 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptx
 

Developing the AIP Thesaurus: The Platform for an Ontology

  • 1. Developing the AIP Thesaurus: The Platform for an Ontology Mark Cassar American Institute of Physics Jack Bruce Marjorie (Margie) M.K. Hlava Access Innovations: 505-998-0800
  • 2. Background • Physics and Astronomy Classification Scheme (PACS) • Six digit code schema used for indexing scholarly content • 10 digit based – domain headings with subcategories nested under each domain. • Precoordinated system – Combine terms (concepts) at the time of indexing
  • 3.
  • 4. Why Change? • Improve searchability • Move to Post coordinated system – Combine terms at time of search • Semantic enrichment • Flexible metadata for many applications • Naturalize the vocabulary – Represent concepts succinctly and concisely – Easily add new concepts based on new and emerging technologies and applications – Allow unlimited hierarchy levels and polyhierarchy
  • 5. Better ROI • Rules-assisted indexing – Provide end users with a swift indexing solution based on the Machine-Aided Indexer (M.A.I.) engine. – Batch index large corpus of scholarly content, as well as future content. • Improve costs – Automate a large portion of electronic indexing – Less overhead for indexing
  • 6. Roadmap of the AIP Thesaurus • Data Collection – Load PACS codes and terms – Incorporate Search logs; add top searched concepts into the vocabulary • Analysis of Content – Test comparison of indexing to humanly indexed articles • Thesaurus Construction – Separate, disambiguate, and migrate concepts; Break up top domains – Apply thesaurus and taxonomy standardization to each term – Multiple reviews for each top section • Evaluation and Feedback – Send back working draft to AIP for review – Gather feedback from subject matter experts and incorporate the changes into the thesaurus • Finalization and Product Delivery
  • 7. Source Data • PACS 2009 ed. • 1999 ed. Of AIP Thesaurus (out of date) • Terms added to INSPEC since 2000 • Internal and external search logs • Cumulative journal indexes – Digital – (2006 through 2009) • List of AIP divisions and their internal classifications
  • 8. Analysis of Content • Organizational warrant – PACS 2009 (2010) – www.aip.org – UniPHY • Literary warrant – Where we found the term used • Most frequent search terms loaded into thesaurus
  • 9. Thesaurus Creation Process • Load data (vocabulary) into Data Harmony MAIstro™ • PACS – Restructure top domains – Separate into discrete – Disambiguate terms – Remove parenthetical qualifiers – Create post coordinated terms – Migrate separated terms into new/relevant categories • Sort flat lists (search logs) into main categories determined • Use multiple reviewers for each physics domain • About 8181 preferred terms and 5217 synonyms
  • 10.
  • 11. PACS TERM: – Low-energy electron diffraction (LEED) and reflection high-energy electron diffraction (RHEED) (condensed matter structure determination) – Becomes – BT Condensed matter structure determination • NT Low energy electron diffraction –Synonym LEED • NT Reflection high energy electron diffraction –Synonym RHEED
  • 12. Evaluation and Feedback • Weekly scheduled live demos of the thesaurus • Free web-hosted version of the thesaurus and periodic spreadsheet exports • Collect feedback based on SME suggestions and AIP PACS experts – Correspondence via email • Incorporate changes into thesaurus
  • 13. Available versions • Electronic copy of AIP thesaurus supplied in – XML – Excel – Web-based, read-only versions (Thesviewer) – MARC, SKOS, OWL, CSV etc
  • 14. Taxonomy view Thesaurus Term Record view
  • 15. To make an ontology • Define additional Associative relationships • Define additional Hierarchical relationships – IsA, IsPartOf, HasA • Define additional Equivalence relationship • Multilingual options • Weights and measures
  • 16. Clearer disambiguation? Temperature Planets IsA TypeOf IsA BrandOf Mercury Roman god IsA Automobile Metallic element
  • 17. Knowledge Organization Systems • Uncontrolled list Not complex • Name authority file • Synonym set/ring • Controlled vocabulary • Taxonomy • Thesaurus AIP Thesaurus is here • Ontology • Semantic network Highly complex
  • 18. Lessons Learned • Learning the style for indexing • Tendency to reversion to PACS style of language and classification • SME feedback turnaround – Sit with them 2 hours – Incorporate suggestions 8 hours – 2117 Terms Added 1354 Terms changed or updated 1333 Terms deleted 11259 Other actions
  • 19. Where are we now? • Platform is established • OWL and other formats available • One kind of Associative relationship – (Related terms) • One kind of Hierarchical relationship – Broader Narrower / Parent Child – Multiple broader terms for interdisciplinary options • One kind of Equivalence relationship • Synonym non preferred terms • Built using the Z39.19 standard - interoperable
  • 20. To Review AIP Thes • Use a web browser • http://thesview.accessinn.com/aipThes/ • username/password twice - in all cases both are 'aip'. • Begins a java app in your browser that shows the thesaurus starting from the top level of the hierarchy. • Use the collaboration module to comment and discuss
  • 21. Thank you Marjorie Hlava mhlava@accessin.com 505-998-0800