Technology Frontiers: Text, Sentiment, and Sense
Upcoming SlideShare
Loading in...5

Technology Frontiers: Text, Sentiment, and Sense



Presentation by Seth Grimes at the Insight Innovation Exchange (IIEX) conference, June 17, 2013 in Philadelphis.

Presentation by Seth Grimes at the Insight Innovation Exchange (IIEX) conference, June 17, 2013 in Philadelphis.



Total Views
Views on SlideShare
Embed Views



2 Embeds 75 61 14



Upload Details

Uploaded via as Microsoft PowerPoint

Usage Rights

CC Attribution-ShareAlike LicenseCC Attribution-ShareAlike License

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
Post Comment
Edit your comment

Technology Frontiers: Text, Sentiment, and Sense Technology Frontiers: Text, Sentiment, and Sense Presentation Transcript

  • Technology Frontiers: Text,Sentiment, and SenseSeth Grimes@sethgrimes
  • A Sensemaking StoryNew York Times,September 30, 2012
  • New York Times,September 8, 1957Valium: A Chain of Connections View slide
  • Natural Language ProcessingBy H.P. Luhn, inIBM Journal,April, 1958 View slide
  • Modelling Text“Statistical information derived from word frequency and distribution isused by the machine to compute a relative measure of significance, firstfor individual words and then for sentences. Sentences scoring highest insignificance are extracted and printed out to become the auto-abstract.”-- H.P. Luhn, The Automatic Creation of Literature Abstracts, IBM Journal, 1958.Luhn’s analysis ofMessengers of the NervousSystem, a Scientific Americanarticle, appliedto the NY Times article
  • New York Times,September 8, 1957Luhn’s Example
  • Close Reading
  • Can Software Make the Connection?Mark Lombardi, George W. Bush, Harken Energyand Jackson Stephens, c. 1979-90, Detail
  • Insight from Connections… via graphs, clusters, categories, and counts.… by mining the full set of available data.
  • & Social Change Everything
  • (Accessible) Data Everywhere
  • Lexical, syntactic, and semantic analysis discernfeatures including relationships in source materials.Features = entities, measure-value pairs, concepts,topics, events, sentiment, and more.Text analytics may draw on:• Lexicons & taxonomies.• Statistics.• Patterns.• Linguistics.• Machine learning.Text Analytics
  • How?
  • From POS to RelationshipsUnderstand parts ofspeech (POS), e.g. –<subject> <verb><object> –todiscern facts andrelationships.Semantic networkssuch as WordNetare adisambiguationasset.
  • Clustered ClarityCarrot2.(open source)
  • Platforms and ecosystems.APIs and services.Text and content analytics --Discerns and extracts features including relationships fromsource materials.Features = entities, key-value pairs, concepts, topics,events, sentiment, etc.Provide (for) BI on content-sourced data.Data integration, record linkage, data fusion.The Back End
  • Content, Composites, Connections
  • Content, Composites, Connections, 2
  • Social Sources
  • Sentiment Analysis“Sentiment analysis is the task of identifying positiveand negative opinions, emotions, and evaluations.”-- Wilson, Wiebe & Hoffman, 2005, “Recognizing Contextual Polarity inPhrase-Level Sentiment Analysis”“Sentiment analysis or opinion mining is thecomputational study of opinions, sentiments andemotions expressed in text… An opinion on a feature f isa positive or negative view, attitude, emotion orappraisal on f from an opinion holder.”-- Bing Liu, 2010, “Sentiment Analysis and Subjectivity,” in Handbook ofNatural Language Processing
  • Detection, Classification
  • Beyond Polarity
  • Intent Analysis
  • ComplicationsSentiment may be of interest at multiple levels.Corpus / data space, i.e., across multiple sources.Document.Statement / sentence.Entity / topic / concept.Human language is noisy and chaotic!Jargon, slang, irony, ambiguity, anaphora, polysemy,synonymy, etc.Context is key. Discourse analysis comes into play.Must distinguish the sentiment holder from the object:“Geithner said the recession may worsen.”
  • Audio including speech.Images.Video. Text
  • Sensemaking“It is convenient to divide the entireinformation access process into twomain components: information retrievalthrough searching and browsing, andanalysis and synthesis of results. Thisbroader process is often referred to inthe literature as sensemaking.Sensemaking refers to an iterativeprocess of formulating a conceptualrepresentation from of a large volumeof information. Search plays only onepart in this process.”-- Marti Hearst, 2009
  • Apply new tech to old needs, e.g., automated coding.Select from and use all available data.Marry social to profiles and surveys.Factor in behaviors.Interpret according to context and needs.Understand intent to create situational predictivemodels.Explore; experiment.Suggestions
  • Racing On
  • Technology Frontiers: Text,Sentiment, and SenseSeth Grimes@sethgrimes