Presented by Marjorie Hlava, president of Access Innovations, Inc., at the American Society for Information Science and Technology's 23rd Annual SIG/CR Classification Research Workshop on October 26, 2012.
Transcript of "Tales From the Field: Implementing Information Technology"
Tales from the Field: Implementing Information Theory SIG CR - 2012Marjorie Hlava, PresidentAccess Innovations, Inc. www.accessinn.com
Implementing Information Theory The case of the missing abstracts Russian information US PTO Getty adventures Vatican bibles Past basics Thoughts on directions
The Bleeding Edge Figure Out the client needs Figure out the specifications Get approval on the specifications Figure out how to deliver the data following the specs Quality control the data delivery …. But then life happens
The Case of Missing Abstracts Tests showed that just searching the indexing did not provide the full answers users wanted. Searching the titles and abstracts as well would improve search Enough space could be found on servers if the data was moved to in-house from Dialog and Orbit. New platform going into production New format – Messenger Specifications written, test file approved
Specifications Need 99.998% accuracy for user acceptance Left tagged ASCII Office in Mexico City – Access de Mexico Triple key - double proof Two sets of volumes 792,000 abstract tapes destroyed 1970 – 1982 data
Access de Mexico7:17 Am Shift changeSeptember 19,19858.7 earthquake
CAS to Philippines Limo from the airport with the remaining volumesTyphoon DotOctober 12, 1985Clark Air Force base evacuatedPower out for weeks
Jamaica Hurricane Kate November 1985 4 inches of water in the computer room No power on the island
Beijing China November 1985 NOTHING HAPPENED Finished On time Under budget At promised accuracy level Client said “ when I read your contract I thought you had an unusual level of detail on the Acts of God clauses…. But I didn’t expect you to use every one of them!”
Success - Failure - Future Successes • Chemical Abstracts • USPTO • Getty AATA • British Map Collection Failures • Access Russia • Ipsoa Video Disk • MAI Mail
All projects use classification To organize the job To organize the information To allow the finding of the items once digital Apply term tags • thesaurus and controlled Apply notation • Not necessarily classification • Just reflects the content The classification is NEVER done • Needs to reflect the ever-changing data
Theoretical Underpinnings Outlines of Knowledge • Thomas Aquinas • John Knox (Bacon) • Morton Taube - Encyclopaedia Britannica Organization of Knowledge • Cutter – 1896 • COSATI – 1964 • Alvin Weinberg • Cranfield Institute papers • Cleverton, Aitcheson, Vickery
Indexes Pre-coordinate • Back of the book • Subject headings Post-coordinate Bayesian Co-occurrence Neural nets Machine learning Rules systems
Now Changing the way we learn Changing the way we find things Easier to manipulate what we know • http://www.youtube.com/watch?v=B8ofWFx5 25s Comprehensive information / invasive • http://www.youtube.com/watch?v=RNJl9EEc soE People now know what search is.
Future Information any place, any time A great big mess - Unless we corral it. • Tag it, • Clean it, • Weed it • Curate it Everyone is creating content