Developing the AIP Thesaurus:
The Platform for an Ontology

Mark Cassar
American Institute of Physics

Jack Bruce
Marjorie (Margie) M.K. Hlava
Access Innovations: 505-998-0800

Background

• Physics and Astronomy Classification Scheme (PACS)
• Six digit code schema used for indexing scholarly
  content
• 10 digit based
   – domain headings with subcategories nested under
     each domain.
• Precoordinated system
   – Combine terms (concepts) at the time of indexing
Why Change?
• Improve searchability
• Move to a postcoordinated system
   – Combine terms at time of search
• Semantic enrichment
• Flexible metadata for many applications
• Naturalize the vocabulary
   – Represent concepts succinctly
   – Easily add new concepts based on new and emerging
     technologies and applications
   – Allow unlimited hierarchy levels and polyhierarchy
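
The difference between combining terms at indexing time and at search time can be sketched in a few lines. This is a minimal illustration, assuming a toy inverted index; the article IDs and the choice of terms are invented and do not come from AIP data.

```python
# Hypothetical postings: each single-concept term maps to the set of
# article IDs indexed with it. Terms and IDs are invented for illustration.
index = {
    "Electron diffraction": {"a1", "a2", "a3"},
    "Condensed matter structure determination": {"a2", "a3", "a4"},
    "Low energy": {"a3", "a5"},
}


def search_all(index, terms):
    """Return the IDs of articles indexed with every term in `terms`."""
    postings = [index.get(term, set()) for term in terms]
    if not postings:
        return set()
    # Postcoordination: the combination happens here, at search time.
    return set.intersection(*postings)


result = search_all(index, ["Electron diffraction", "Low energy"])
print(sorted(result))
```

In a precoordinated scheme the combined heading would have to exist in advance; here any combination of single-concept terms can be requested by the searcher.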
Better ROI

• Rules-assisted indexing
   – Provide end users with a swift indexing solution
     based on the Machine-Aided Indexer (M.A.I.)
     engine.
   – Batch index a large corpus of existing scholarly content, as
     well as future content.
• Reduce costs
   – Automate a large portion of electronic indexing
   – Less overhead for indexing
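
The idea of rules-assisted indexing can be sketched as follows. This is not the M.A.I. engine or its rule syntax; it is a toy matcher, assuming a hypothetical table that maps trigger phrases to preferred terms.

```python
import re

# Hypothetical rules: trigger phrase -> preferred term.
# This table is an assumption for illustration, not M.A.I. rule syntax.
RULES = {
    "leed": "Low energy electron diffraction",
    "low energy electron diffraction": "Low energy electron diffraction",
    "rheed": "Reflection high energy electron diffraction",
}


def suggest_terms(text, rules=RULES):
    """Return the sorted preferred terms whose trigger phrase occurs in text."""
    # Lowercase and treat hyphens and runs of whitespace as single spaces.
    normalized = re.sub(r"[-\s]+", " ", text.lower())
    found = set()
    for trigger, preferred in rules.items():
        # \b keeps "leed" from matching inside a longer word such as "bleed".
        if re.search(r"\b" + re.escape(trigger) + r"\b", normalized):
            found.add(preferred)
    return sorted(found)


abstract = "Surface order was checked with LEED and RHEED measurements."
print(suggest_terms(abstract))
```

A batch run would apply the same function to every article in the corpus, leaving human indexers to review the suggestions rather than assign every term by hand.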
Roadmap of the AIP Thesaurus
• Data Collection
   – Load PACS codes and terms
   – Incorporate search logs; add the most-searched concepts to the
     vocabulary
• Analysis of Content
   – Compare test indexing against human-indexed articles
• Thesaurus Construction
   – Separate, disambiguate, and migrate concepts; break up top
     domains
   – Apply thesaurus and taxonomy standardization to each term
   – Multiple reviews for each top section
• Evaluation and Feedback
   – Send working draft back to AIP for review
   – Gather feedback from subject matter experts and incorporate the
     changes into the thesaurus
• Finalization and Product Delivery
Source Data

• PACS 2009 ed.
• 1999 ed. of AIP Thesaurus (out of date)
• Terms added to INSPEC since 2000
• Internal and external search logs
• Cumulative journal indexes
   – Digital (2006 through 2009)
• List of AIP divisions and their internal classifications
Analysis of Content


• Organizational warrant
   – PACS 2009 (2010)
   – www.aip.org
   – UniPHY
• Literary warrant
   – Where we found the term used
• Most frequent search terms loaded into thesaurus
Thesaurus Creation Process
• Load data (vocabulary) into Data Harmony MAIstro™
• PACS
   – Restructure top domains
   – Separate into discrete concepts
   – Disambiguate terms
   – Remove parenthetical qualifiers
   – Create postcoordinated terms
   – Migrate separated terms into new/relevant categories
• Sort flat lists (search logs) into the established main categories
• Use multiple reviewers for each physics domain
• About 8181 preferred terms and 5217 synonyms
PACS TERM:
– Low-energy electron diffraction (LEED) and reflection
  high-energy electron diffraction (RHEED) (condensed
  matter structure determination)
– Becomes
– BT Condensed matter structure determination
   • NT Low energy electron diffraction
      – Synonym LEED
   • NT Reflection high energy electron diffraction
      – Synonym RHEED
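
The restructuring of this one heading can be sketched in code. This is only an illustration: the function name and the parsing heuristic are assumptions that fit this heading's exact shape, whereas the actual work was editorial and used multiple reviewers.

```python
import re


def split_heading(heading):
    """Split one precoordinated heading into a BT and its NT records.

    Assumes the shape "X (ABBR) and Y (ABBR) (qualifier)"; other shapes
    are not handled and would need editorial judgment.
    """
    # The trailing parenthetical qualifier becomes the broader term (BT).
    outer = re.match(r"^(.*)\s+\(([^()]*)\)$", heading)
    body, qualifier = outer.group(1), outer.group(2)
    narrower = []
    for part in body.split(" and "):
        # Each part is a label followed by its acronym in parentheses.
        inner = re.match(r"^(.*?)\s*\(([^()]*)\)$", part)
        label = inner.group(1).replace("-", " ")
        narrower.append({
            "term": label[0].upper() + label[1:],
            "synonyms": [inner.group(2)],
        })
    return {"BT": qualifier[0].upper() + qualifier[1:], "NT": narrower}


pacs_heading = (
    "Low-energy electron diffraction (LEED) and reflection "
    "high-energy electron diffraction (RHEED) "
    "(condensed matter structure determination)"
)
record = split_heading(pacs_heading)
print(record["BT"])
for nt in record["NT"]:
    print("  NT", nt["term"], "| Synonym", nt["synonyms"][0])
```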
Evaluation and Feedback


• Weekly scheduled live demos of the thesaurus
• Free web-hosted version of the thesaurus and
  periodic spreadsheet exports
• Collect feedback from SMEs and AIP
  PACS experts
   – Correspondence via email
• Incorporate changes into thesaurus
Available versions


• Electronic copy of AIP thesaurus supplied in
   – XML
   – Excel
   – Web-based, read-only versions (Thesviewer)
   – MARC, SKOS, OWL, CSV, etc.
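
As an illustration of what a SKOS rendering of one term record might look like, the sketch below writes Turtle by hand. The base URI and the helper names are hypothetical, and this is not the export produced by the thesaurus software; only the SKOS property names (`skos:prefLabel`, `skos:altLabel`, `skos:broader`) come from the SKOS vocabulary itself.

```python
SKOS_PREFIX = "@prefix skos: <http://www.w3.org/2004/02/skos/core#> ."
BASE = "http://example.org/thesaurus/"  # hypothetical namespace, not AIP's


def slug(label):
    """Turn a term label into a simple URI-safe local name."""
    return label.lower().replace(" ", "-")


def to_skos_turtle(term, synonyms=(), broader=()):
    """Serialize one term record as a SKOS concept in Turtle syntax.

    Labels are assumed to contain no double quotes or backslashes.
    """
    properties = [f'skos:prefLabel "{term}"@en']
    properties += [f'skos:altLabel "{s}"@en' for s in synonyms]
    properties += [f"skos:broader <{BASE}{slug(b)}>" for b in broader]
    subject = f"<{BASE}{slug(term)}> a skos:Concept"
    body = " ;\n".join([subject] + ["    " + p for p in properties])
    return body + " .\n"


skos_record = to_skos_turtle(
    "Low energy electron diffraction",
    synonyms=["LEED"],
    broader=["Condensed matter structure determination"],
)
print(SKOS_PREFIX)
print(skos_record)
```

The preferred term maps to `skos:prefLabel`, each synonym to `skos:altLabel`, and each BT to `skos:broader`.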
[Screenshots: Taxonomy view; Thesaurus Term Record view]
To make an ontology


• Define additional Associative relationships
• Define additional Hierarchical relationships
   – IsA, IsPartOf, HasA
• Define additional Equivalence relationships
   – Multilingual options
   – Weights and measures
Clearer disambiguation?

[Diagram: "Mercury" at the center, linked by IsA, TypeOf, and BrandOf
relationships to Planets, Temperature, Roman god, Automobile, and
Metallic element]
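
The Mercury example can be sketched as typed triples. The pairing of each relationship with its target is read off the slide's layout and is an assumption, as are the function and variable names.

```python
from collections import defaultdict

# (subject, relationship, object) triples. The pairing of labels to
# targets is an assumption inferred from the diagram's layout.
TRIPLES = [
    ("Mercury", "IsA", "Planets"),
    ("Mercury", "IsA", "Roman god"),
    ("Mercury", "IsA", "Metallic element"),
    ("Mercury", "TypeOf", "Temperature"),
    ("Mercury", "BrandOf", "Automobile"),
]


def group_by_relation(triples, subject):
    """Group the objects linked to `subject` by relationship type."""
    grouped = defaultdict(list)
    for source, relation, target in triples:
        if source == subject:
            grouped[relation].append(target)
    return dict(grouped)


senses = group_by_relation(TRIPLES, "Mercury")
for relation, targets in senses.items():
    print(relation, "->", ", ".join(targets))
```

With a single untyped "related term" link, all five targets would look alike; typing the links is what lets a system tell the senses apart.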
Knowledge Organization Systems
(ordered from least to most complex)
•   Uncontrolled list (not complex)
•   Name authority file
•   Synonym set/ring
•   Controlled vocabulary
•   Taxonomy
•   Thesaurus (AIP Thesaurus is here)
•   Ontology
•   Semantic network (highly complex)
Lessons Learned
• Learning the style for indexing
• Tendency to revert to PACS style of language and
  classification
• SME feedback turnaround
   – Sit with them: 2 hours
   – Incorporate suggestions: 8 hours
   – 2117 terms added
   – 1354 terms changed or updated
   – 1333 terms deleted
   – 11259 other actions
Where are we now?
• Platform is established
• OWL and other formats available
• One kind of Associative relationship
   – (Related terms)
• One kind of Hierarchical relationship
   – Broader/Narrower (Parent/Child)
   – Multiple broader terms for interdisciplinary options
• One kind of Equivalence relationship
   – Synonyms (nonpreferred terms)
• Built to the ANSI/NISO Z39.19 standard, so it is interoperable
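
Allowing multiple broader terms turns the hierarchy from a tree into a graph with several paths to the top. A minimal sketch of walking such a polyhierarchy follows; the term labels are invented for illustration and are not taken from the AIP Thesaurus.

```python
# Hypothetical BT links; the labels are invented to show polyhierarchy.
BROADER = {
    "Astrophysical plasmas": ["Plasma physics", "Astrophysics"],
    "Plasma physics": ["Physics"],
    "Astrophysics": ["Astronomy"],
}


def all_broader(term, broader=BROADER):
    """Collect every term reachable by following BT links upward."""
    seen = set()
    stack = list(broader.get(term, []))
    while stack:
        current = stack.pop()
        if current not in seen:
            seen.add(current)
            stack.extend(broader.get(current, []))
    return seen


print(sorted(all_broader("Astrophysical plasmas")))
# → ['Astronomy', 'Astrophysics', 'Physics', 'Plasma physics']
```

A search on either parent branch can then retrieve content indexed with the interdisciplinary term.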
To Review the AIP Thesaurus
• Use a web browser
• http://thesview.accessinn.com/aipThes/
• Enter the username and password twice; in all cases both are
  'aip'.
• This starts a Java app in your browser that shows the
  thesaurus starting from the top level of the hierarchy.
• Use the collaboration module to comment and
  discuss
Thank you


          Marjorie Hlava
    mhlava@accessinn.com
          505-998-0800
