Text Analytics Summit 2010<br />#TAS10<br />The Big Questions Facing the Text Analytics Industry<br />Seth Grimes<br />@se...
>>Past, Present & Future<br />He who controls the present, controls the past. He who controls the past, controls the futur...
>> The (Near) Past: Lacking Use Cases<br />In 1999 –<br />“The nascent field of text data mining (TDM) has the peculiar di...
>>So “Big Questions”…<br />Whatever you call it – “text analytics” ≈ “text mining” ≈ “text data mining” – a lot has happen...
Customers & prospects.
Technology & solutions.</li></li></ul><li>>>What’sPastis Prologue<br />“Don't look back. Something might be gaining on you...
>> The Present: Today’s Market<br />I estimate a $425 million global market in 2009.<br /><ul><li>Up about 25% from $350 m...
>> Applications Today<br />Broadly grouped --<br />Intelligence and counter-terrorism.<br />Life sciences.<br />Content ma...
>> Today’s Text Analytics Players<br />BI, data mining, and analytics.<br />Enterprise- and specialized-application focus....
>>Market Trends<br />“The Diverse and Exploding Digital Universe,” (IDC, 2008)<br />Stronger than ever:<br />Life sciences...
For users, semantic annotations ease navigation and boost findability.</li></ul>Customer experience.<br /><ul><li>Key to q...
>>Technology Initiatives 2<br />Now and near future.<br />Customer experience.<br />Bruce Temkin, ex-Forrester Research: “...
>> Search, from Keywords to Intelligence<br />Text analytics enables smarter search that better responds to user goals.<br />
>> Question Answering<br />Text analytics (information extraction) feeds curated knowledge bases.  Search is transformed f...
>>Sentiment Analysis<br />Two assertions:<br /><ul><li>Human communications are inherently subjective.
Opinion often masquerades as Fact.</li></li></ul><li>>>Sentiment Analysis… & Social Media<br />“Sentiment analysis is the ...
>> Finding Business Value<br />In customer-experience initiatives, “more unsolicited, unstructured data [implies] increasi...
>>Text Visualization<br />
>> Looking Ahead<br />
The Semantic Web Vision<br />“The Semantic Web is a web of data, in some ways like a global database.”-- Tim Berners-Lee, ...
>>Web 3.0<br />Web 3.0 = Web 2.0 + the Semantic Web + semantic tools.  Recurring themes:<br />Semantically enriched -- con...
>>In Sum<br />Robust growth.<br />Consolidation and emergence.<br />Technical challenges.<br />New frontiers.<br />… and t...
Upcoming SlideShare
Loading in …5
×

Welcome - Text Analytics Summit 2010

2,855 views

Published on

Welcome address slides for the Text Analytics Summit 2010, presented by conference chair Seth Grimes, Alta Plana Corporation.

Published in: Education, Technology
0 Comments
5 Likes
Statistics
Notes
  • Be the first to comment

No Downloads
Views
Total views
2,855
On SlideShare
0
From Embeds
0
Number of Embeds
35
Actions
Shares
0
Downloads
123
Comments
0
Likes
5
Embeds 0
No embeds

No notes for slide

Welcome - Text Analytics Summit 2010

  1. 1. Text Analytics Summit 2010<br />#TAS10<br />The Big Questions Facing the Text Analytics Industry<br />Seth Grimes<br />@sethgrimes<br />
  2. 2. >>Past, Present & Future<br />He who controls the present, controls the past. He who controls the past, controls the future.<br />-- derived from George Orwell’s 1984<br />
  3. 3. >> The (Near) Past: Lacking Use Cases<br />In 1999 –<br />“The nascent field of text data mining (TDM) has the peculiar distinction of having a name and a fair amount of hype but as yet almost no practitioners.”<br />-- Prof. Marti A. Hearst,<br />“Untangling Text Data Mining”<br />
  4. 4. >>So “Big Questions”…<br />Whatever you call it – “text analytics” ≈ “text mining” ≈ “text data mining” – a lot has happened since.<br />How is the industry developing?<br /><ul><li>Solution providers.
  5. 5. Customers & prospects.
  6. 6. Technology & solutions.</li></li></ul><li>>>What’sPastis Prologue<br />“Don't look back. Something might be gaining on you.”<br />-- Satchel Paige<br />
  7. 7. >> The Present: Today’s Market<br />I estimate a $425 million global market in 2009.<br /><ul><li>Up about 25% from $350 million in 2008, up in turn 40% from $250 million in 2007.</li></ul>Covers software licenses, vendor provided support and professional services.<br />$(hundreds) million more value created by:<br />Universities and research centers, especially in the life sciences.<br />Government, particularly for intelligence & counter-terrorism.<br />OEM licensees, for listening platforms, e-discovery, etc.<br />Systems integrators and consultants.<br />
  8. 8. >> Applications Today<br />Broadly grouped --<br />Intelligence and counter-terrorism.<br />Life sciences.<br />Content management, publishing & search.<br />Customer & market intelligence.<br />E-discovery.<br />Enterprise feedback.<br />Law enforcement.<br />Risk, fraud, compliance, and investigation.<br />
  9. 9. >> Today’s Text Analytics Players<br />BI, data mining, and analytics.<br />Enterprise- and specialized-application focus.<br />Search tools and services.<br />Software-tool, OEM suppliers.<br />Text analytics pure-plays, diverse applications.<br />Web services (APIs).<br />
  10. 10. >>Market Trends<br />“The Diverse and Exploding Digital Universe,” (IDC, 2008)<br />Stronger than ever:<br />Life sciences.<br />Intelligence & counter-terrorism.<br />Continued steep growth:<br />Media & publishing.<br /><ul><li>Seek to mine and to classify/process.
  11. 11. For users, semantic annotations ease navigation and boost findability.</li></ul>Customer experience.<br /><ul><li>Key to quality, satisfaction.</li></ul>Market intelligence including competitive intelligence.<br /><ul><li>Aggregates and details are both important.</li></ul>New on the scene – or at least newly visible:<br /><ul><li>Social-media monitoring, measurement, analysis.</li></li></ul><li>>>Technology Initiatives<br />Now and near future.<br />Semantic search.<br />Guha (IBM), McCool (Stanford), Miller (W3C): “The addition of explicit semantics can improve [navigational and research] search” (2003).<br />Question answering.<br />Matthew Glotzbach, Google: “Question answering is the future of enterprise search” (2006).<br />Sentiment analysis & social-media analytics.<br />Bing Liu, Univ of Illinois: “The Web has dramatically changed the way that people express their views and opinions.”<br />
  12. 12. >>Technology Initiatives 2<br />Now and near future.<br />Customer experience.<br />Bruce Temkin, ex-Forrester Research: “The future is clearly about analyzing feedback in any form that your customers give it. That’s a trend that won’t go away.” <br />Text visualization.<br />We’re still coming to terms with the idea of actually extracting and exploiting the information content of rich media.<br />Web 3.0 & the Semantic Web.<br />Ronen Feldman, Bar-Ilan University and Hebrew University: “Text analytics [is] driving the Semantic Web” (2006).<br />
  13. 13. >> Search, from Keywords to Intelligence<br />Text analytics enables smarter search that better responds to user goals.<br />
  14. 14. >> Question Answering<br />Text analytics (information extraction) feeds curated knowledge bases. Search is transformed from information retrieval to information access.<br />
  15. 15. >>Sentiment Analysis<br />Two assertions:<br /><ul><li>Human communications are inherently subjective.
  16. 16. Opinion often masquerades as Fact.</li></li></ul><li>>>Sentiment Analysis… & Social Media<br />“Sentiment analysis is the task of identifying positive and negative opinions, emotions, and evaluations.”<br />-- Wilson, Wiebe & Hoffman, 2005, “Recognizing Contextual Polarity in Phrase-Level Sentiment Analysis”<br />
  17. 17. >> Finding Business Value<br />In customer-experience initiatives, “more unsolicited, unstructured data [implies] increasing use of text analytics.”<br />-- Bruce Temkin, ex-Forrester Research<br />
  18. 18. >>Text Visualization<br />
  19. 19. >> Looking Ahead<br />
  20. 20. The Semantic Web Vision<br />“The Semantic Web is a web of data, in some ways like a global database.”-- Tim Berners-Lee, 1998<br />"<br />An open-architure, coordinated by the W3C standards (World Wide Web Consortium)<br />Linked Data: “exposing, sharing, and connecting pieces of data, information, and knowledge on the Semantic Web.”<br />
  21. 21. >>Web 3.0<br />Web 3.0 = Web 2.0 + the Semantic Web + semantic tools. Recurring themes:<br />Semantically enriched -- context sensitive -- localized.<br />Text analytics enables Web 3.0 and the Semantic Web.<br />Automated content categorization and classification.<br />Text augmentation: metadata generation, content tagging.<br />Information extraction to databases.<br />Exploratory analysis and visualization.<br />
  22. 22. >>In Sum<br />Robust growth.<br />Consolidation and emergence.<br />Technical challenges.<br />New frontiers.<br />… and two days to learn more.<br />
  23. 23. Text Analytics Summit 2010<br />#TAS10<br />The Big Questions Facing the Text Analytics Industry<br />Seth Grimes<br />@sethgrimes<br />

×