Welcome - 2011 Text Analytics Summit


Published on

Welcome address presented by Seth Grimes at the 2011 Text Analytics Summit

Published in: Technology
  • Be the first to comment

No Downloads
Total Views
On Slideshare
From Embeds
Number of Embeds
Embeds 0
No embeds

No notes for slide

Welcome - 2011 Text Analytics Summit

  1. 1. Text Analytics: The Industry At A Glance<br />Where We Are, Where We’re Going, and Your Text Mining Investment<br />Seth Grimes<br />@sethgrimes<br />#TAS11<br />
  2. 2. Where we are<br />Ken Jennings, IBM Watson, and Brad Rutter play Jeopardy!<br />https://secure.wikimedia.org/wikipedia/en/wiki/File:Watson_Jeopardy.jpg<br />
  3. 3. Miles to go<br />http://www.businessweek.com/magazine/content/04_19/b3882029_mz072.htm<br />
  4. 4. Milestones [and goal(s)?]<br />Language+ understanding.<br />Text, speech, images, and video.<br />Narrative, discourse, and argument.<br />Information extraction.<br />Knowledge structuring and integration.<br />Inference; synthesis.<br />Language generation.<br />Conversation; interaction; autonomy.<br />≈> Convergence, a.k.a. Singularity<br />
  5. 5. Singularity?<br />Before we reach that point…<br />
  6. 6. Text+ technologies today<br />Text analytics, by generating semantics, bridges search and BI to turn Information Retrieval into Information Accessfor online, social & enterprise content.<br />Information management<br />Search<br />BI<br />Text Analytics<br />Information access<br />Integrated analytics<br />Semantic search<br />
  7. 7. Applications today<br />Broadly grouped --<br />Intelligence and counter-terrorism.<br />Life sciences.<br />Content management, publishing & search.<br />Customer & market intelligence.<br />E-discovery.<br />Enterprise feedback.<br />Law enforcement.<br />Risk, fraud, compliance, and investigation.<br />
  8. 8. Resegmenting the market<br />Information Acquisition<br />NLP (natural language processing) (including aaS)<br />Information management & semantics <br />Databases, repositories, content management systems • Information integration • Semantic Web <br />Search-based/oriented applications <br />E-discovery and compliance • Semantic search • Media & publishing • Advertising <br />Enterprise applications <br />Customer experience/relationship management and marketing including social • Market research and competitive Intelligence • BI and research • Online commerce • Life sciences • Intelligence<br />
  9. 9. Market size<br />I estimate a global, 2010 text-analytics market of –<br />$15 million ≈ Information acquisition (TA part)<br />E.g., 80legs, Informatica, ISYS Search, Kapow Software, Oracle.<br />$455 million ≈ NLP, semantics & text analytics<br />Installed & as a service, including vendor professional services.<br />$35 million ≈ Information management applications of TA<br />Companies such as EMC, IBM, MarkLogic, Open Text, and Oracle.<br />$30 million ≈ Enterprise applications of text analytics<br />Typically OEM TA licensees, e.g., Radian6, SatMetrix, Vovici.<br />$300 million ≈ Search-based applications (TA part)<br />Companies such as Autonomy, Cataphora, Dow Jones/Factiva, Elsevier, Endeca, FirstRain, Google, IBM, Lixto, Thomson Reuters.<br />= $835 million.<br />
  10. 10. Last year’s estimate<br />I estimated a $425 million global TA market in 2009.<br /><ul><li>Up about 25% from $350 million in 2008, up in turn 40% from $250 million in 2007.</li></ul>Covers software licenses, vendor provided support and professional services.<br />$(hundreds) million more value created by:<br />Universities and research centers, especially in the life sciences.<br />Government, particularly for intelligence & counter-terrorism.<br />OEM licensees, for listening platforms, e-discovery, etc.<br />Systems integrators and consultants.<br />
  11. 11. Text technology initiatives<br />Now and near future.<br />Semantic search. <br />Sentiment analysis.<br />Listening platforms.<br />Question answering.<br />Text visualization.<br />Web 3.0 & the Semantic Web.<br />
  12. 12. Text technology initiatives, revisited<br />But I used that list last year! Revising very slightly:<br />Semantic search. <br />Sentiment analysis.<br />Lots of market confusion, including from some folks at TAS11.<br />Listening platforms.<br />Question answering.<br />Text visualization.<br />Web 3.0 & the Semantic Web.<br />Ronen Feldman, Bar-Ilan University and Hebrew University: “Text analytics [is] driving the Semantic Web” (2006).<br />Copious European government research funding, coupled with wishful thinking, is driving the Semantic Web.<br />
  13. 13. Text technology initiatives++<br />Now and near future.<br />Beyond-polarity sentiment analysis.<br />Emotions, intent signals. etc.<br />Entity/identity resolution & profile extraction.<br />Online-social-enterprise data integration.<br />Semantic data integration, Complex Data. <br />Speech analytics.<br />Discourse analysis.<br />Because isolated messages are not conversations.<br /><ul><li>Rich-media content analytics.
  14. 14. Augmented reality; new human-computer interfaces.</li></li></ul><li>Where to?<br />
  15. 15. And Your Investment?<br />Robust growth across applications.<br />Technical innovation.<br />New frontiers.<br />Consolidation and emergence.<br />Opportunity<br />You have two days to learn more!<br />
  16. 16. Text Analytics: The Industry At A Glance<br />Where We Are, Where We’re Going, and Your Text Mining Investment<br />Seth Grimes<br />@sethgrimes<br />#TAS11<br />
  1. A particular slide catching your eye?

    Clipping is a handy way to collect important slides you want to go back to later.