Welcome - 2011 Text Analytics Summit
Welcome address presented by Seth Grimes at the 2011 Text Analytics Summit

Transcript

  • 1. Text Analytics: The Industry At A Glance
    Where We Are, Where We’re Going, and Your Text Mining Investment
    Seth Grimes
    @sethgrimes
    #TAS11
  • 2. Where we are
    Ken Jennings, IBM Watson, and Brad Rutter play Jeopardy!
    https://secure.wikimedia.org/wikipedia/en/wiki/File:Watson_Jeopardy.jpg
  • 3. Miles to go
    http://www.businessweek.com/magazine/content/04_19/b3882029_mz072.htm
  • 4. Milestones [and goal(s)?]
    Language+ understanding.
    Text, speech, images, and video.
    Narrative, discourse, and argument.
    Information extraction.
    Knowledge structuring and integration.
    Inference; synthesis.
    Language generation.
    Conversation; interaction; autonomy.
    ≈> Convergence, a.k.a. Singularity
  • 5. Singularity?
    Before we reach that point…
  • 6. Text+ technologies today
    Text analytics, by generating semantics, bridges search and BI to turn Information Retrieval into Information Access for online, social & enterprise content.
    Information management
    Search
    BI
    Text Analytics
    Information access
    Integrated analytics
    Semantic search
  • 7. Applications today
    Broadly grouped --
    Intelligence and counter-terrorism.
    Life sciences.
    Content management, publishing & search.
    Customer & market intelligence.
    E-discovery.
    Enterprise feedback.
    Law enforcement.
    Risk, fraud, compliance, and investigation.
  • 8. Resegmenting the market
    Information Acquisition
    NLP (natural language processing), including as a service
    Information management & semantics
    Databases, repositories, content management systems • Information integration • Semantic Web
    Search-based/oriented applications
    E-discovery and compliance • Semantic search • Media & publishing • Advertising
    Enterprise applications
    Customer experience/relationship management and marketing, including social • Market research and competitive intelligence • BI and research • Online commerce • Life sciences • Intelligence
  • 9. Market size
    I estimate a global 2010 text-analytics market of:
    $15 million ≈ Information acquisition (TA part)
    E.g., 80legs, Informatica, ISYS Search, Kapow Software, Oracle.
    $455 million ≈ NLP, semantics & text analytics
    Installed & as a service, including vendor professional services.
    $35 million ≈ Information management applications of TA
    Companies such as EMC, IBM, MarkLogic, Open Text, and Oracle.
    $30 million ≈ Enterprise applications of text analytics
    Typically OEM TA licensees, e.g., Radian6, SatMetrix, Vovici.
    $300 million ≈ Search-based applications (TA part)
    Companies such as Autonomy, Cataphora, Dow Jones/Factiva, Elsevier, Endeca, FirstRain, Google, IBM, Lixto, Thomson Reuters.
    = $835 million.
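
As a quick check on the arithmetic in this slide, the segment figures can be summed and expressed as shares of the total. This is only a sketch: the dollar amounts are copied from the slide, while the dictionary layout and the `share` helper are illustrative assumptions, not part of the original deck.

```python
# Segment estimates in USD millions, copied from the slide.
segments = {
    "Information acquisition (TA part)": 15,
    "NLP, semantics & text analytics": 455,
    "Information management applications of TA": 35,
    "Enterprise applications of text analytics": 30,
    "Search-based applications (TA part)": 300,
}

# Sum of all segment estimates.
total = sum(segments.values())


def share(value, whole):
    """Return value as a percentage of whole (hypothetical helper)."""
    return 100 * value / whole


# Percentage of the total contributed by each segment.
shares = {name: share(value, total) for name, value in segments.items()}

for name, value in segments.items():
    print(f"${value:>4} million  {shares[name]:5.1f}%  {name}")
print(f"Total: ${total} million")
```

The sum reproduces the slide's $835 million, with the NLP, semantics and text analytics segment accounting for a little over half of it.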
  • 10. Last year’s estimate
    I estimated a $425 million global TA market in 2009.
    • Up about 25% from $350 million in 2008, up in turn 40% from $250 million in 2007.
    Covers software licenses, vendor provided support and professional services.
    $(hundreds) million more value created by:
    Universities and research centers, especially in the life sciences.
    Government, particularly for intelligence & counter-terrorism.
    OEM licensees, for listening platforms, e-discovery, etc.
    Systems integrators and consultants.
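
The year-over-year growth figures quoted in this slide can be recomputed from the three estimates. A minimal sketch, assuming only the dollar figures stated on the slide; the `growth_pct` helper and variable names are illustrative.

```python
# Global text-analytics market estimates in USD millions, from the slide.
estimates = {2007: 250, 2008: 350, 2009: 425}


def growth_pct(previous, current):
    """Percentage change from previous to current (hypothetical helper)."""
    return 100 * (current - previous) / previous


years = sorted(estimates)

# Growth for each year relative to the year before it.
growth = {
    year: growth_pct(estimates[prev], estimates[year])
    for prev, year in zip(years, years[1:])
}

for year, pct in growth.items():
    print(f"{year}: {pct:.1f}% over {year - 1}")
```

The 2007 to 2008 step matches the slide's 40%. The 2008 to 2009 step works out to roughly 21%, somewhat under the slide's rounded "about 25%".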
  • 11. Text technology initiatives
    Now and near future.
    Semantic search.
    Sentiment analysis.
    Listening platforms.
    Question answering.
    Text visualization.
    Web 3.0 & the Semantic Web.
  • 12. Text technology initiatives, revisited
    But I used that list last year! Revising very slightly:
    Semantic search.
    Sentiment analysis.
    Lots of market confusion, including from some folks at TAS11.
    Listening platforms.
    Question answering.
    Text visualization.
    Web 3.0 & the Semantic Web.
    Ronen Feldman, Bar-Ilan University and Hebrew University: “Text analytics [is] driving the Semantic Web” (2006).
    Copious European government research funding, coupled with wishful thinking, is driving the Semantic Web.
  • 13. Text technology initiatives++
    Now and near future.
    Beyond-polarity sentiment analysis.
    Emotions, intent signals, etc.
    Entity/identity resolution & profile extraction.
    Online-social-enterprise data integration.
    Semantic data integration, Complex Data.
    Speech analytics.
    Discourse analysis.
    Because isolated messages are not conversations.
    Rich-media content analytics.
    Augmented reality; new human-computer interfaces.
  • 14. Where to?
  • 15. And Your Investment?
    Robust growth across applications.
    Technical innovation.
    New frontiers.
    Consolidation and emergence.
    Opportunity
    You have two days to learn more!
  • 16. Text Analytics: The Industry At A Glance
    Where We Are, Where We’re Going, and Your Text Mining Investment
    Seth Grimes
    @sethgrimes
    #TAS11