The document discusses challenges in information retrieval and introduces a new product called illumin8. illumin8 uses natural language processing to analyze unstructured data across different sources and presents organized summaries of solutions to problems. It aims to help researchers cut through large amounts of information by rapidly summarizing and providing overviews integrated across domains. The system applies natural language processing throughout the indexing, querying, and results presentation process.
SLA Summer 2008
1. Mining Solutions: A New Approach to Making the Most of Your Research Time. SLA, Strategic Technology Alliance, Seattle, 2008. Joe Buzzanga, Product Manager, Elsevier Science and Technology. June 17, 2008
2.
3. Digital Universe: 10x bigger in 5 years. “Searching for meaning in the content of unstructured data like images, video clips, documents, and the numbers and characters in databases is the rocket science of the digital universe.” (IDC) Source: IDC Whitepaper, The Diverse and Exploding Digital Universe, March 2008
18. Research and its Discontents. 5.5 hours / week: searching and gathering information. 4.7 hours / week: organizing, analyzing, and applying information. Source: 2007 survey of 6,300 knowledge workers, Outsell, Inc.
19.
20. Typical Search. Current general search (example query: compostable film): get millions of documents to sift through (Page 1, Page 2, … Page 180,000). There is just no way any researcher can read through all this information. It just takes too long!
21.
22.
23. How does illumin8 work? illumin8 searches on solutions. The solutions are extracted from full text sources, abstracts, web, and patents.
Internet: 5 billion web pages, blogs and forums.
Full Text: 3 million full-text scientific and technical articles from 1,800 Elsevier journals.
Abstracts: 33 million scientific records from 15,000 peer-reviewed journals and more than 4,000 publishers.
Patents: 21 million patents from 5 world-wide patent offices.
Extract and Summarize Solutions, then store them in the illumin8 Solution Database (1.1 billion), then Search.
24.
25.
26. Natural Language Parsing
illumin8 Rules: Help_patterns, Succeed2, Correct_problem, treatPerson_SAVS, positively_influence, have_positive_influence, protect_sb_against_sth, Product_would_do_good, provide_sb_with_sth, Product_is_shown_to, talented_at, use_sth_to_do_sth, approve_sth, rely_on_product_to, application_is, Product_allows_sb_toVG2, ensure_protagonist, A_makes_B_good, benefit_of, ... Thousands of rules, plus statistical models.
Example sentence: “Capacitive deionization with carbon aerogel electrodes provides an economical and efficient method for removing salt and impurities from water.”
Grammatical Role: Subject = “Capacitive deionization”; Verb = “provides”; Object = “an economical and efficient method for removing salt and impurities from water”.
Role Test (the rule continues only when the answer to each test is “no”):
Modal? Check that Verb polarity is positive; this rule would not match if the Verb were modal (i.e. only in certain cases), for example if it said “should provide … but”.
Negated? Check that Subject is not negated; this rule would not match if Subject were not positive, for example if it said “no process provides an economical and efficient …”.
Antagonistic? Check that Object is not antagonistic; this rule would not match if Object were, for example, “provides a costly and complicated method”.
Role Assignment: Subject is assigned the role Solution; Object is assigned the role Benefit.
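The three role tests on this slide can be sketched as a small rule check. This is only an illustration of the idea, not illumin8's implementation: the `Clause` type, the function `match_provides_rule`, and the tiny word lists are all hypothetical, and the sketch assumes the sentence has already been split into subject, verb, and object by a parser.

```python
from dataclasses import dataclass
from typing import Optional

# Hypothetical, deliberately tiny word lists; a real system would rely on
# thousands of rules plus statistical models, as the slide notes.
MODALS = {"should", "could", "would", "might", "may", "can"}
NEGATORS = {"no", "not", "never", "none"}
ANTAGONISTIC = {"costly", "complicated", "expensive", "inefficient"}


@dataclass
class Clause:
    """A clause already split into grammatical roles (assumed parser output)."""
    subject: str
    verb: str
    obj: str


def _words(text: str) -> set:
    return set(text.lower().split())


def match_provides_rule(clause: Clause) -> Optional[dict]:
    """Return role assignments if the toy 'provides' rule matches, else None."""
    verb_words = _words(clause.verb)
    if not ({"provide", "provides"} & verb_words):
        return None  # rule is about the verb "provide"
    if MODALS & verb_words:
        return None  # Modal? yes -> no match
    if NEGATORS & _words(clause.subject):
        return None  # Negated? yes -> no match
    if ANTAGONISTIC & _words(clause.obj):
        return None  # Antagonistic? yes -> no match
    # All tests answered "no": assign Subject -> Solution, Object -> Benefit.
    return {"solution": clause.subject, "benefit": clause.obj}


example = Clause(
    subject="Capacitive deionization",
    verb="provides",
    obj="an economical and efficient method for removing salt and impurities from water",
)
result = match_provides_rule(example)
```

Keeping each test as a separate early return mirrors the slide's flowchart: a "yes" at any test stops the rule, and only a clause that passes all three reaches role assignment.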
27. Analyzing A Sentence
Sentence: “Carrier’s Infinity™ Air Purifier uses ultraviolet light to eliminate germs such as viruses, molds, bacteria, mildew and mold spores from the indoor air of homes and offices, ensuring a higher indoor air quality.”
Extracted concepts: Carrier [Organization]; Infinity Air Purifier [Product]; Ultraviolet light [Technology]; Germ [Problem]; Indoor air quality [Benefit].
Kinds of Germ: Virus, Mold, Bacteria, Mildew, Mold spore.
Relation labels: Makes, Uses, Solves, Provides, Kind of.
Concepts, ideas and entities extracted from a single sentence.
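One way to hold what this slide extracts is a set of typed entities plus (subject, relation, object) triples. The sketch below is a guess at a convenient representation, not illumin8's data model. The entity types and relation labels come from the slide, but the pairing of each relation with specific entities is an assumption read off the sentence, and the helper names `objects_of` and `subjects_of` are hypothetical.

```python
# Entity -> type, as labeled on the slide.
entities = {
    "Carrier": "Organization",
    "Infinity Air Purifier": "Product",
    "Ultraviolet light": "Technology",
    "Germ": "Problem",
    "Indoor air quality": "Benefit",
}

# (subject, relation, object) triples. ASSUMPTION: which entities each
# relation connects is inferred from the sentence, not stated in the transcript.
triples = [
    ("Carrier", "Makes", "Infinity Air Purifier"),
    ("Infinity Air Purifier", "Uses", "Ultraviolet light"),
    ("Infinity Air Purifier", "Solves", "Germ"),
    ("Infinity Air Purifier", "Provides", "Indoor air quality"),
    ("Virus", "Kind of", "Germ"),
    ("Mold", "Kind of", "Germ"),
    ("Bacteria", "Kind of", "Germ"),
    ("Mildew", "Kind of", "Germ"),
    ("Mold spore", "Kind of", "Germ"),
]


def objects_of(subject: str, relation: str) -> list:
    """Everything the given subject points to through the given relation."""
    return [o for s, r, o in triples if s == subject and r == relation]


def subjects_of(relation: str, obj: str) -> list:
    """Everything that points to the given object through the given relation."""
    return [s for s, r, o in triples if r == relation and o == obj]


solved_problems = objects_of("Infinity Air Purifier", "Solves")
germ_kinds = subjects_of("Kind of", "Germ")
```

With the sentence stored this way, a search for solutions to a problem such as "Mold" can follow "Kind of" up to "Germ" and then find the products that "Solve" it.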