Case study of the American Institute of Physics thesaurus. Presented by Mark Cassar of the American Institute of Physics and Jack Bruce of Access Innovations, Inc. at the 2012 Data Harmony User Group meeting on February 8, 2012 at the Access Innovations, Inc. offices.
Marjorie M.K. Hlava, President and founder of Access Innovations, Inc. and the Data Harmony suite of indexing software, gives the Miles Conrad Memorial Lecture at the 2014 Annual Conference for the National Federation of Advanced Information Services (NFAIS).
The Miles Conrad Award and accompanying lecture was established in 1965 in commemoration of NFAIS founder, G. Miles Conrad. Hlava earned the Miles Conrad Award this January for her past and continuing services to NFAIS and the Information and Knowledge Management industries.
Case Study: Integrating Data Harmony Terms and the eJournalPress Peer Review...Access Innovations, Inc.
Anna Jester from eJournalPress presents the integration project between eJournalPress and Access Innovations, Inc. Anna's team used Data Harmony software to index terms in order to correctly identify manuscripts for review.
How to make your content users more productive using Access Innovations, Inc.'s Navtree and Machine Aided Indexer (M.A.I.™), parts of the Data Harmony® software suite.
Presented by Bob Kasenchak of Access Innovations, Inc. at the 2014 Special Libraries Association (SLA) annual meeting in Vancouver, British Columbia on June 7, 2014.
A presentation by Dr. Jay Ven Eman, CEO of Access Innovations, Inc., on measuring the financial benefits of taxonomies. First presented at the 2009 Data Harmony Users Group meeting.
The JTHES as Part of the Intelligence Layer for the Sustainability Collection...Access Innovations, Inc.
Presented at the 2015 Data Harmony User Group Meeting in Albuquerque, New Mexico on February 17, 2015 by Ron Snyder and Sharon Garewal of ITHAKA Labs.
The JSTOR Sustainability Collection, which will launch in 2015, is composed of journals, reports, and working papers selected in consultation with scholars, policy researchers, and subject librarians. The collection features journal titles from academic publishers, scholarly societies, and industry groups, as well as a substantial library of indexed reports and working papers from leading research institutes and university centers. It addresses the emerging interdisciplinary discussion about how the environment and human activities and economic gains can be made durable over the long term. Along with this broad set of content, the collection will feature specialized functionality to support research in this emerging field, including a semantic indexing feature that helps researchers locate related terms and concepts that may have varying names across disciplines.
Ron Snyder, ITHAKA Labs Director of Research and Development, and Sharon Garewal, Senior Metadata Librarian, will discuss how the JSTOR Thesaurus (JTHES) was applied as part of the intelligence layer for the Sustainability collection prototype. This includes adding a facet for sustainability within the JTHES to tag terms as part of the collection, working with SME's across disciplines, and applying the curated terms into a live data portal.
On the uses and implementation of taxonomy on the Web, with a particular focus on the taxonomy as part of an enterprise information environment. Presented by Marjorie M.K. Hlava during Content Week 2005 in Miami, Florida.
Marjorie M.K. Hlava, President and founder of Access Innovations, Inc. and the Data Harmony suite of indexing software, gives the Miles Conrad Memorial Lecture at the 2014 Annual Conference for the National Federation of Advanced Information Services (NFAIS).
The Miles Conrad Award and accompanying lecture was established in 1965 in commemoration of NFAIS founder, G. Miles Conrad. Hlava earned the Miles Conrad Award this January for her past and continuing services to NFAIS and the Information and Knowledge Management industries.
Case Study: Integrating Data Harmony Terms and the eJournalPress Peer Review...Access Innovations, Inc.
Anna Jester from eJournalPress presents the integration project between eJournalPress and Access Innovations, Inc. Anna's team used Data Harmony software to index terms in order to correctly identify manuscripts for review.
How to make your content users more productive using Access Innovations, Inc.'s Navtree and Machine Aided Indexer (M.A.I.™), parts of the Data Harmony® software suite.
Presented by Bob Kasenchak of Access Innovations, Inc. at the 2014 Special Libraries Association (SLA) annual meeting in Vancouver, British Columbia on June 7, 2014.
A presentation by Dr. Jay Ven Eman, CEO of Access Innovations, Inc., on measuring the financial benefits of taxonomies. First presented at the 2009 Data Harmony Users Group meeting.
The JTHES as Part of the Intelligence Layer for the Sustainability Collection...Access Innovations, Inc.
Presented at the 2015 Data Harmony User Group Meeting in Albuquerque, New Mexico on February 17, 2015 by Ron Snyder and Sharon Garewal of ITHAKA Labs.
The JSTOR Sustainability Collection, which will launch in 2015, is composed of journals, reports, and working papers selected in consultation with scholars, policy researchers, and subject librarians. The collection features journal titles from academic publishers, scholarly societies, and industry groups, as well as a substantial library of indexed reports and working papers from leading research institutes and university centers. It addresses the emerging interdisciplinary discussion about how the environment and human activities and economic gains can be made durable over the long term. Along with this broad set of content, the collection will feature specialized functionality to support research in this emerging field, including a semantic indexing feature that helps researchers locate related terms and concepts that may have varying names across disciplines.
Ron Snyder, ITHAKA Labs Director of Research and Development, and Sharon Garewal, Senior Metadata Librarian, will discuss how the JSTOR Thesaurus (JTHES) was applied as part of the intelligence layer for the Sustainability collection prototype. This includes adding a facet for sustainability within the JTHES to tag terms as part of the collection, working with SME's across disciplines, and applying the curated terms into a live data portal.
On the uses and implementation of taxonomy on the Web, with a particular focus on the taxonomy as part of an enterprise information environment. Presented by Marjorie M.K. Hlava during Content Week 2005 in Miami, Florida.
The state of KOS in the Linked Data movementMarcia Zeng
- The publishing, management, and interoperating of KOS for the Semantic Web.
Content: 1. Value vocabularies in the Linked Data Hub – CKAN The Data Hub
2. Semantic assets registries
2a. Asset Description Metadata Schema (ADMS)
2b. DCMI Application Profile for KOS Resource
3. Thesaurus data model (ISO 25964) and alignment with SKOS
(presented at ASIS&T 2012 Annual Conference)
Presentation given on March 12, 2013 by Marjorie M.K. Hlava of Access Innovations, Inc. as a webinar for the San Francisco chapter of the Special Libraries Association.
LoCloud Vocabulary Services: Thesaurus management introduction, Walter Koch a...locloud
This presentation provides an introduction to thesaurus management in the LoCloud Vocabulary Services given during the LoCloud training workshops. It provides an introduction to controlled vocabularies, thesaurus for information retrieval and interoperability, to SKOS, multilingual vocabulary issues and to the federated model adopted for thesaurus management within the LoCloud service, which is based in TemaTres. The presentation includes a list of the vocabularies that have been integrated within the LoCloud service. There is also a walk-through of MediaThread and how this was used in the vocabulary management training offered in the workshop.
Have you ever wondered how search works while visiting an e-commerce site, internal website, or searching through other types of online resources? Look no further than this informative session on the ways that taxonomies help end-users navigate the internet! Hear from taxonomists and other information professionals who have first-hand experience creating and working with taxonomies that aid in navigation, search, and discovery across a range of disciplines.
NISO access related projects (presented at the Charleston conference 2016)Christine Stohn
Presentation by Pascal Calarco (University of Windsor), Christine Stohn (Ex Libris/ProQuest), John G. Dove (Paloma Associates), covering NISO D2D work, ResourceSync, KBART and KBART automation, ODI (Open Discovery Initiative), Link origin tracking, ALI (Access and License Indicators), and a discussion around improvements and challenges for open access discovery
The Semantic Web meets the Code of Federal Regulationstbruce
Semantic Web and natural-language-processing techniques meet the Code of Federal Regulations. Presentation from CALICON12 by the Legal Information Institute. Work on definition extraction, linked data publishing, search enhancement, vocabulary discovery.
Joint presentation with Nuria Casellas.
Elsevier is the world's largest publisher of scientific, medical and technical (STM) content. An early adopter of XML as a standard representation for content, Elsevier has used MarkLogic in the development of a range of information access and discovery solutions for its customers. This presentation will cover Elsevier's experience with XML-centric content management systems in general and MarkLogic's technology in specific, describing Elsevier's initial adoption and uptake of the technology, current use within the Elsevier suite of online products and solutions, and opportunities for future use. Design patterns for content repositories within a publishing context that have emerged during our use of the technology will be described, and we will touch on a number of issues that have emerged, including XQuery and its adoption within the developer community, the challenges facing XML from new representations for documents and metadata such as JSON and RDF, and the delivery of search applications based on XML infrastructure.
Application of Ontology in Semantic Information Retrieval by Prof Shahrul Azm...Khirulnizam Abd Rahman
Application of Ontology in Semantic Information Retrieval
by Prof Shahrul Azman from FSTM, UKM
Presentation for MyREN Seminar 2014
Berjaya Hotel, Kuala Lumpur
27 November 2014
Making AI Behave: Using Knowledge Domains to Produce Useful, Trustworthy ResultsAccess Innovations, Inc.
In today's highly charged atmosphere of anxiety and anticipation about AI, and especially LLMs,
one of the biggest concerns is how to ensure that it returns accurate results (meaning both true
and pertinent to its audience). This is particularly important to scholarly, scientific, and other
technical organizations, whose constituents are often in very specific domains, such as
medicine, engineering, history, biology, chemistry, etc. One extremely useful tool to incorporate in an AI-based process in such cases is a comprehensive and well-structured knowledge domain which is based on a controlled vocabulary.
More Related Content
Similar to Developing the AIP Thesaurus: The Platform for an Ontology
The state of KOS in the Linked Data movementMarcia Zeng
- The publishing, management, and interoperating of KOS for the Semantic Web.
Content: 1. Value vocabularies in the Linked Data Hub – CKAN The Data Hub
2. Semantic assets registries
2a. Asset Description Metadata Schema (ADMS)
2b. DCMI Application Profile for KOS Resource
3. Thesaurus data model (ISO 25964) and alignment with SKOS
(presented at ASIS&T 2012 Annual Conference)
Presentation given on March 12, 2013 by Marjorie M.K. Hlava of Access Innovations, Inc. as a webinar for the San Francisco chapter of the Special Libraries Association.
LoCloud Vocabulary Services: Thesaurus management introduction, Walter Koch a...locloud
This presentation provides an introduction to thesaurus management in the LoCloud Vocabulary Services given during the LoCloud training workshops. It provides an introduction to controlled vocabularies, thesaurus for information retrieval and interoperability, to SKOS, multilingual vocabulary issues and to the federated model adopted for thesaurus management within the LoCloud service, which is based in TemaTres. The presentation includes a list of the vocabularies that have been integrated within the LoCloud service. There is also a walk-through of MediaThread and how this was used in the vocabulary management training offered in the workshop.
Have you ever wondered how search works while visiting an e-commerce site, internal website, or searching through other types of online resources? Look no further than this informative session on the ways that taxonomies help end-users navigate the internet! Hear from taxonomists and other information professionals who have first-hand experience creating and working with taxonomies that aid in navigation, search, and discovery across a range of disciplines.
NISO access related projects (presented at the Charleston conference 2016)Christine Stohn
Presentation by Pascal Calarco (University of Windsor), Christine Stohn (Ex Libris/ProQuest), John G. Dove (Paloma Associates), covering NISO D2D work, ResourceSync, KBART and KBART automation, ODI (Open Discovery Initiative), Link origin tracking, ALI (Access and License Indicators), and a discussion around improvements and challenges for open access discovery
The Semantic Web meets the Code of Federal Regulationstbruce
Semantic Web and natural-language-processing techniques meet the Code of Federal Regulations. Presentation from CALICON12 by the Legal Information Institute. Work on definition extraction, linked data publishing, search enhancement, vocabulary discovery.
Joint presentation with Nuria Casellas.
Elsevier is the world's largest publisher of scientific, medical and technical (STM) content. An early adopter of XML as a standard representation for content, Elsevier has used MarkLogic in the development of a range of information access and discovery solutions for its customers. This presentation will cover Elsevier's experience with XML-centric content management systems in general and MarkLogic's technology in specific, describing Elsevier's initial adoption and uptake of the technology, current use within the Elsevier suite of online products and solutions, and opportunities for future use. Design patterns for content repositories within a publishing context that have emerged during our use of the technology will be described, and we will touch on a number of issues that have emerged, including XQuery and its adoption within the developer community, the challenges facing XML from new representations for documents and metadata such as JSON and RDF, and the delivery of search applications based on XML infrastructure.
Application of Ontology in Semantic Information Retrieval by Prof Shahrul Azm...Khirulnizam Abd Rahman
Application of Ontology in Semantic Information Retrieval
by Prof Shahrul Azman from FSTM, UKM
Presentation for MyREN Seminar 2014
Berjaya Hotel, Kuala Lumpur
27 November 2014
Similar to Developing the AIP Thesaurus: The Platform for an Ontology (20)
Making AI Behave: Using Knowledge Domains to Produce Useful, Trustworthy ResultsAccess Innovations, Inc.
In today's highly charged atmosphere of anxiety and anticipation about AI, and especially LLMs,
one of the biggest concerns is how to ensure that it returns accurate results (meaning both true
and pertinent to its audience). This is particularly important to scholarly, scientific, and other
technical organizations, whose constituents are often in very specific domains, such as
medicine, engineering, history, biology, chemistry, etc. One extremely useful tool to incorporate in an AI-based process in such cases is a comprehensive and well-structured knowledge domain which is based on a controlled vocabulary.
Smart Submit and Client Support
Michael Millar, Junior Software Developer, and Frank Coates, Client Support Manager
Get a peek at the new and improved Smart Submit and learn about new, easier ways to contact the support team at Access Innovations.
How a Good Taxonomy Can Provide Valuable Business Insights
Kristen Monahan, Public Library of Science (PLOS)
Kristen is a business analyst and she won’t be talking about the PLOS taxonomy but rather how she uses that taxonomy to drill down into the massive amount of content, metadata, and usage and process data that is PLOS for deep, detailed analysis and to drive business decisions. Much of this work involves trend analysis. For example, trend analysis of submissions can look at the time it takes from submission to decision by subject (narrow subjects like Covid, broad subjects like biotechnology), or by institution, or by country, etc. to see not just the overall big picture but where in their submission and peer review workflows the bottlenecks might be. A trend analysis of topics over time can prompt them to issue a call for papers for a topic they think needs to be better covered–and then look at both short-term and long-term trends resulting from that call to papers. Their taxonomy doesn’t just make their content smarter–it makes how they publish that content smarter too.
Editor and Peer Reviewer Assignments Using Data Harmony
Andrew Smeall, Hindawi Publishing
Andrew will show how Hindawi, an open access publisher, applies their taxonomy to make editor and reviewer assignments for incoming submissions to their journals.
Cloud Deployment of Data Harmony
Jeffrey Gordon, Lead Developer, Access Innovations, Inc.
Jeffrey will describe the cloud deployment of the Data Harmony software.
Marjorie M. K. Hlava, President, Chair of the Board, and Chief Scientist, Access Innovations, Inc.
During this annual highlight of the DHUG meetings, Margie will discuss the exciting new changes and additions to the Data Harmony software. She will be joined by some members of our software development team to talk about specific initiatives we have worked on over the past year.
Access Innovations and Atypon: Beyond Content Tagging
Hong Zhou and Gerasimos Razis, Atypon
Gerasimos and Hong will discuss the changes to the Atypon platform since DHUG 2020.
Getting to the Point: Using AI and Taxonomies to Craft Meta -Titles
Travis Hicks, American Society of Clinical Oncology (ASCO)
Looking to better leverage SEO and include key terms in the url construct for research abstracts, ASCO is working with Access Innovations to evaluate how to programmatically create short titles for abstracts. The idea is to index titles against existing taxonomies as a way of producing a short title that succinctly identified what an abstract is about for purposes of constructing a new url configuration. Travis will discuss the need, challenges, and early results of the project.
Expanding the Use of MAIstro at ASCE
Xi Van Fleet, American Society for Civil Engineers
Using MAIstro, ASCE created the subject/topic taxonomies for their publications to enhance content discovery and business insight. After achieving their primary goal, they have been expanding its use for other applications.
Lessons Learned From Building a Taxonomy and Indexing 140 Years of Content
Michael Darr, Project Manager, D33 – American Chemical Society Pubs IT
Michael will talk about the things they would do differently if they were to build a new taxonomy and index a legacy file, and the things they did right the first time.
Bill’s talk is entitled “WHAT’S IN A NAME? How Kew helps drug regulators disambiguate the messy welter of medicinal plant names to shore up regulation and save lives”. It’s really eye-opening to realize how complicated and imprecise names can get, with multiple scientific, pharmaceutical and popular names for the same thing or with one name used for completely different things.
This has real-world consequences. For example, the EU mistakenly banned a useful plant we use every day when intending to ban a poisonous one because of a naming problem. How Kew is using semantic and taxonomic tools and technologies to bring order to this complexity (I almost said chaos) is really fascinating. They’re also helping to disambiguate nomenclature and provide links to authoritative information for botanical terms for use in journal articles, among other things.
Daniel Vasicek discuss the processes undertaken to OCR various kinds of content from the University of Florida Special Collections to make them machine readable for indexing.
Ethnobotany and Ethnopharmacology:
Ethnobotany in herbal drug evaluation,
Impact of Ethnobotany in traditional medicine,
New development in herbals,
Bio-prospecting tools for drug discovery,
Role of Ethnopharmacology in drug evaluation,
Reverse Pharmacology.
Synthetic Fiber Construction in lab .pptxPavel ( NSTU)
Synthetic fiber production is a fascinating and complex field that blends chemistry, engineering, and environmental science. By understanding these aspects, students can gain a comprehensive view of synthetic fiber production, its impact on society and the environment, and the potential for future innovations. Synthetic fibers play a crucial role in modern society, impacting various aspects of daily life, industry, and the environment. ynthetic fibers are integral to modern life, offering a range of benefits from cost-effectiveness and versatility to innovative applications and performance characteristics. While they pose environmental challenges, ongoing research and development aim to create more sustainable and eco-friendly alternatives. Understanding the importance of synthetic fibers helps in appreciating their role in the economy, industry, and daily life, while also emphasizing the need for sustainable practices and innovation.
How to Make a Field invisible in Odoo 17Celine George
It is possible to hide or invisible some fields in odoo. Commonly using “invisible” attribute in the field definition to invisible the fields. This slide will show how to make a field invisible in odoo 17.
Students, digital devices and success - Andreas Schleicher - 27 May 2024..pptxEduSkills OECD
Andreas Schleicher presents at the OECD webinar ‘Digital devices in schools: detrimental distraction or secret to success?’ on 27 May 2024. The presentation was based on findings from PISA 2022 results and the webinar helped launch the PISA in Focus ‘Managing screen time: How to protect and equip students against distraction’ https://www.oecd-ilibrary.org/education/managing-screen-time_7c225af4-en and the OECD Education Policy Perspective ‘Students, digital devices and success’ can be found here - https://oe.cd/il/5yV
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
Developing the AIP Thesaurus: The Platform for an Ontology
1. Developing the AIP Thesaurus:
The Platform for an Ontology
Mark Cassar
American Institute of Physics
Jack Bruce
Marjorie (Margie) M.K. Hlava
Access Innovations:
505-998-0800
2. Background
• Physics and Astronomy Classification Scheme (PACS)
• Six digit code schema used for indexing scholarly
content
• 10 digit based
– domain headings with subcategories nested under
each domain.
• Precoordinated system
– Combine terms (concepts) at the time of indexing
3.
4. Why Change?
• Improve searchability
• Move to Post coordinated system
– Combine terms at time of search
• Semantic enrichment
• Flexible metadata for many applications
• Naturalize the vocabulary
– Represent concepts succinctly and concisely
– Easily add new concepts based on new and emerging
technologies and applications
– Allow unlimited hierarchy levels and polyhierarchy
5. Better ROI
• Rules-assisted indexing
– Provide end users with a swift indexing solution
based on the Machine-Aided Indexer (M.A.I.)
engine.
– Batch index large corpus of scholarly content, as
well as future content.
• Improve costs
– Automate a large portion of electronic indexing
– Less overhead for indexing
6. Roadmap of the AIP Thesaurus
• Data Collection
– Load PACS codes and terms
– Incorporate Search logs; add top searched concepts into the
vocabulary
• Analysis of Content
– Test comparison of indexing to humanly indexed articles
• Thesaurus Construction
– Separate, disambiguate, and migrate concepts; Break up top
domains
– Apply thesaurus and taxonomy standardization to each term
– Multiple reviews for each top section
• Evaluation and Feedback
– Send back working draft to AIP for review
– Gather feedback from subject matter experts and incorporate the
changes into the thesaurus
• Finalization and Product Delivery
7. Source Data
• PACS 2009 ed.
• 1999 ed. Of AIP Thesaurus (out of date)
• Terms added to INSPEC since 2000
• Internal and external search logs
• Cumulative journal indexes
– Digital
– (2006 through 2009)
• List of AIP divisions and their internal classifications
8. Analysis of Content
• Organizational warrant
– PACS 2009 (2010)
– www.aip.org
– UniPHY
• Literary warrant
– Where we found the term used
• Most frequent search terms loaded into thesaurus
9. Thesaurus Creation Process
• Load data (vocabulary) into Data Harmony MAIstro™
• PACS
– Restructure top domains
– Separate into discrete
– Disambiguate terms
– Remove parenthetical qualifiers
– Create post coordinated terms
– Migrate separated terms into new/relevant categories
• Sort flat lists (search logs) into main categories determined
• Use multiple reviewers for each physics domain
• About 8181 preferred terms and 5217 synonyms
10.
11. PACS TERM:
– Low-energy electron diffraction (LEED) and reflection
high-energy electron diffraction (RHEED) (condensed
matter structure determination)
– Becomes
– BT Condensed matter structure determination
• NT Low energy electron diffraction
–Synonym LEED
• NT Reflection high energy electron diffraction
–Synonym RHEED
12. Evaluation and Feedback
• Weekly scheduled live demos of the thesaurus
• Free web-hosted version of the thesaurus and
periodic spreadsheet exports
• Collect feedback based on SME suggestions and AIP
PACS experts
– Correspondence via email
• Incorporate changes into thesaurus
13. Available versions
• Electronic copy of AIP thesaurus supplied in
– XML
– Excel
– Web-based, read-only versions (Thesviewer)
– MARC, SKOS, OWL, CSV etc
15. To make an ontology
• Define additional Associative relationships
• Define additional Hierarchical relationships
– IsA, IsPartOf, HasA
• Define additional Equivalence relationship
• Multilingual options
• Weights and measures
16. Clearer disambiguation?
Temperature
Planets
IsA
TypeOf
IsA BrandOf
Mercury
Roman god IsA Automobile
Metallic element
17. Knowledge Organization Systems
• Uncontrolled list Not complex
• Name authority file
• Synonym set/ring
• Controlled vocabulary
• Taxonomy
• Thesaurus AIP Thesaurus is here
• Ontology
• Semantic network Highly complex
18. Lessons Learned
• Learning the style for indexing
• Tendency to reversion to PACS style of language and
classification
• SME feedback turnaround
– Sit with them 2 hours
– Incorporate suggestions 8 hours
– 2117 Terms Added
1354 Terms changed or updated
1333 Terms deleted
11259 Other actions
19. Where are we now?
• Platform is established
• OWL and other formats available
• One kind of Associative relationship
– (Related terms)
• One kind of Hierarchical relationship
– Broader Narrower / Parent Child
– Multiple broader terms for interdisciplinary options
• One kind of Equivalence relationship
• Synonym non preferred terms
• Built using the Z39.19 standard - interoperable
20. To Review AIP Thes
• Use a web browser
• http://thesview.accessinn.com/aipThes/
• username/password twice - in all cases both are
'aip'.
• Begins a java app in your browser that shows the
thesaurus starting from the top level of the hierarchy.
• Use the collaboration module to comment and
discuss
21. Thank you
Marjorie Hlava
mhlava@accessin.com
505-998-0800