The document discusses using the IEEE thesaurus as a basis for text analytics and trend forecasting. Access Innovations proposes to expand and map the IEEE thesaurus terms and use term-based analytics to investigate IEEE publication strengths, emerging topics, and future directions. Findings from mapping terms across different data sources found it effective for visualizing distributions, trends, and gaps to answer questions about coverage and emerging areas.
A 125 year professional society, with over 148 journals, conference transactions and magazinesSponsor approx 800 conferences annuallyTotal Membership over 400,000 as of Dec 31, 2009Span the globe, with participation in 160 countries
Key features include personalization w/ up to 15 saved search profiles, improved search, including: facets for faster resolution, type ahead, breadcrumbs to easily navigate your search and refine, and Institutional branding, not to mentioned improved reliability and stability
We knew there was “gold in them thare hills!” but how to unlock it?As a leading source of research materials, could we extract new directions.Are the societies living up to their charters and covering the topical areas they think they are?Are there trends that were just momentary? Are they still vigorously being investigated or were they just a flash in the pan?What other things might we learn?Introducing Dick Klavens
Access Innovations and its software brand Data Harmony are known for the high caliber of data. It is clean, well formed and very accurately semantically enriched. They updated the IEEE thesaurus in 2005, building a rule base for use in indexing at the same time. The application of the terms to the IEEE content was 90% accurate – that is 90% of the terms suggested are what well trained indexers would use from a controlled vocabulary, and 80% accurate from the more difficult proceedings data at launch of the project. Since that time the rule base has improved over time and the IEEE production team only needs to spot check about 10% of the documents to insure a high standard of indexing is maintained. It has allowed IEEE to process a lot more documents with the same team and made the process more fun at the same time. The indexers are allowed time to think about the content, the thesaurus terms, what should be added and what other information can be collected to continue to enrich the files because the Data harmony software removes many of the clerical aspects of the indexing process, leveraging the mental processing of the staff. The accuracy is high enough that we simply indexed the entire contents of the eXplore database back to the earliest records in a single overnight process. Then to explore the edges of science we also indexed the 1.2 million records using Medical Subject headings and the defense Technical Information Center thesauri with similar accuracy results.
What do I mean by accuracy? Here’s an example of an accurate disciplinary map.
What do I mean by visual accuracy? Here’s an example of an accurate disciplinary map.
What do I mean by visual accuracy? Here’s an example of an accurate disciplinary map.