Milestones [and goal(s)?] (circa 2011)Language+ understanding. • Text, speech, and video. • Narrative, discourse, and argument.Information extraction.Knowledge structuring and integration.Inference; synthesis.Language generation.Conversation; interaction; autonomy.≈> Convergence, a.k.a. Singularity
Text stories of the last 12 months…Big Data: the 3 Vs.APIs, platforms, and cloud services.Acquisitions: Information access. • Autonomy HP. • Endeca Oracle. • ISYS Lexmark. • Vivisimo IBM.Social media magic (?), e.g., • Oracle Social Network (+ Collective Intellect). • SAP Social Media Analytics.Knowledge, enrichment & integration.
Velocity & Volume. (Where’s Variety?) Filtering MoreDown with IT!Up with users!
A Big Data analytics architecture (HPCC’s)http://hpccsystems.com/ http://www.geeklawblog.com/2011/12/lexis-advance-platform-launch-two.html
You can’t have it all?! Where are the flexibility, the (data/content) sophistication, and real- timedness?
Text stories of the last 12 months…Big Data: the 3 Vs.APIs, platforms, and cloud services. We’reAcquisitions: Information access. here • Autonomy HP. • Endeca Oracle. • ISYS Lexmark. • Vivisimo IBM.Social media magic (?), e.g., • Oracle Social Network (+ Collective Intellect). • SAP Social Media Analytics.Knowledge, enrichment & integration.
Social media magic (?) (2 examples) “By NetBase”?! No analytics?
Knowledge, enrichment & integrationSemantics enables join across types and/or sources and/or structures, using meaningful identifiers, to create an ensemble that is greater than the sum of the parts.Interrelate information to represent knowledge.Enrichment and integration involve: • Mappings and transformations. • Aggregation and collection. • All the typical data concerns: cleansing, profiling, consistency, security,…
The Semantic Web? A knowledge representation built on an assemblage of standards, protocols, and functions.http://www.cambridgesemantics.com/semantic-university/semantic-search-and-the-semantic-web http://img.freebase.com/api/trans/raw/m/02dtnzv
Text tech initiatives (2011 2012)Now and near future. • Beyond-polarity sentiment analysis. Emotions, intent signals. etc. • Identity resolution & profile extraction. Online-social-enterprise data integration. • Semantic data integration, Complex Data. • Speech analytics. • Discourse analysis. Because isolated messages are not conversations. • Rich-media content analytics. • Augmented reality; new human-computer interfaces.
A focus on information & applicationsNow and near future. • Signal detection. Sentiment, emotion, identity, intent. • Semanticized applications. Experience/satisfaction sentiment polarity Linkable, mashable, enrichable. Positive • Rich information. Overall experience / Neutral Context sensitive, situational. satisfaction 80% NegativeΣ = Sense-making... 60% 40% Availability of professional Ability to solve business services / support 20% problems… but there’s work to do: 0% Solution / technology Solution / technology ease of performance use
Next year’s talk? -- Text Analytics From Sources to Signals to Sense Seth Grimes @sethgrimes