10. Questions for business (& government): What are people saying? What’s hot/trending? What are they saying about {topic|person|product} X? ... about X versus {topic|person|product} Y? How has opinion about X and Y evolved? How has opinion correlated with {our|competitors’|general} {news|marketing|sales|events}? What’s behind opinion, the root causes? Who are opinion leaders? How does sentiment propagate across multiple channels?
12. “In this example, you can quickly see that the Drooling Dog Bar B Q has gotten lots of positive reviews, and if you want to see what other people have said about the restaurant, clicking this result is a good choice.” -- http://googleblog.blogspot.com/2009/05/more-search-options-and-other-updates.html “In the recap of [Searchology] from Google’s Matt Cutts, he tells us that: ‘If you sort by reviews, Google will perform sentiment analysis and highlight interesting comments.’ -- Bill Slawski, “Google's New Review Search Option and Sentiment Analysis,” http://www.seobythesea.com/?p=1488
14. Analytics is a collection of tools and techniques that extract insights from data. Apply or embed analytics within business contexts – collect data and information about customers, markets, suppliers, and business processes – use results to inform, drive, and optimize business decision making – and you harness analytics as a core BI asset.
15. Analytics seeks structure in “unstructured” sources. x(t) = t y(t) = ½ a (et/a + e-t/a) = acosh(t/a) http://www.tropicalisland.de/NYC_New_York_Brooklyn_Bridge_from_World_Trade_Center_b.jpg http://en.wikipedia.org/wiki/Seven_Bridges_of_K%C3%B6nigsberg
16. Text analytics models text. “Statistical information derived from word frequency and distribution is used by the machine to compute a relative measure of significance, first for individual words and then for sentences.” -- H.P. Luhn, The Automatic Creation of Literature Abstracts, IBM Journal, 1958. http://wordle.net
17. Document input and processing Knowledge handling is key Desk Set (1957): Computer engineer Richard Sumner (Spencer Tracy) and television network librarian Bunny Watson (Katherine Hepburn) and the "electronic brain" EMERAC. Hans Peter Luhn “A Business Intelligence System” IBM Journal, October 1958
18. “This rather unsophisticated argument on ‘significance’ avoids such linguistic implications as grammar and syntax... No attention is paid to the logical and semantic relationships the author has established.” -- H.P. Luhn
19. My 2009 text-analytics market survey asked, [What information] do you need (or expect to need) to extract or analyze: Text Analytics 2009: User Perspectives on Solutions and Providers
20.
21. From document to DB; an IBM example: “The standard features are stored in the STANDARD_KW table, keywords with their occurrences in the KEYWORD_KW_OCC table, and the text list features in the TEXTLIST_TEXT table. Every feature table contains the DOC_ID as a reference to the DOCUMENT table.”
22. Welcome to the New World. The Far Side by Gary Larson Ken Jennings, IBM Watson, and Brad Rutter play Jeopardy! https://secure.wikimedia.org/wikipedia/en/wiki/File:Watson_Jeopardy.jpg
23. In a sense, text analytics, by generating semantics, bridges search and BI to turn Information Retrieval into Information Access. Information Access Search BI Text Analytics Integrated analytics Semantic search
29. Semantics enables better content production, management & use. Semantics captures – Meaning Relationships Context Understanding –the sense of “unstructured” online, social, and enterprise information, for content consumers and publishers. Semantics unites data of all types.
33. From connections to influence: What’s wrong with these pictures? (Radian6, Sysomos, Klout)
34.
35. Connections.Bring BI to social analyses. 3rd & 4th senses of social analytics: Adopt agile, collaborative methods. Share your data. A challenge: Enterprise-social-online data integration.
36. Text, Content, and Social Analytics: BI for the New World Seth Grimes Alta Plana Corporation @sethgrimes TDWI – Washington DC July 15, 2011