Sifting Social Data: Word Sense Disambiguation Using Machine Learning

Dr. Stuart Shulman
Founder & CEO, Texifter
“…a wealth of information creates a poverty of attention.”
- Herbert Simon, 1971
Texifter is pronounced “tech-sifter”; the metaphor is of a sifter
Text Classification
A 2,500-year-old problem
Plato argued it would be frustrating and it still is…
Grimmer & Stewart, “Text as Data,” Political Analysis (2013)
Volume is a problem for scholars
Coders are expensive
Groups struggle to accurately label text at scale
Validation of both humans and machines is “essential”
Some models are easier to validate than others
All models are wrong
Automated models enhance/amplify, but don’t replace humans
There is no one right way to do this
“Validate, validate, validate”
“What should be avoided then, is the blind use of any method without a validation step.”
Our free, open-source, web-based text analytics toolkit
The original software kernel: tools for measurement
A mission to avoid tennis elbow
Items load to the screen and the coder hits the keystroke
Keystroke human coding: alone or in groups
Codes
Metadata
Data
Human coding can be distributed to individuals, groups & crowds
Computer science & NSF influences: measure everything
How fast?
How reliable?
How accurate?
Stuart Shulman – Texifter
Inter-rater reliability is one critical measurement
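One common way to measure inter-rater reliability for two coders is Cohen's kappa, which corrects raw agreement for the agreement expected by chance. The sketch below is illustrative only: the function name, the labels, and the sample codings are assumptions, and it is not presented as the measure the toolkit itself reports.

```python
from collections import Counter

def cohens_kappa(coder_a, coder_b):
    """Cohen's kappa for two coders who labeled the same items in the same order."""
    if len(coder_a) != len(coder_b) or not coder_a:
        raise ValueError("both coders must label the same non-empty list of items")
    n = len(coder_a)
    # Observed agreement: share of items where both coders chose the same label.
    observed = sum(a == b for a, b in zip(coder_a, coder_b)) / n
    # Chance agreement: product of each coder's marginal label frequencies.
    freq_a, freq_b = Counter(coder_a), Counter(coder_b)
    expected = sum(freq_a[label] * freq_b[label] for label in freq_a) / (n * n)
    if expected == 1.0:
        return 1.0  # both coders used a single identical label throughout
    return (observed - expected) / (1 - expected)

# Hypothetical codings of six items by two coders.
coder_a = ["relevant", "relevant", "irrelevant", "relevant", "irrelevant", "irrelevant"]
coder_b = ["relevant", "irrelevant", "irrelevant", "relevant", "irrelevant", "relevant"]
kappa = cohens_kappa(coder_a, coder_b)
print(round(kappa, 3))
```

Here the coders agree on 4 of 6 items (0.667), chance agreement is 0.5, and kappa is therefore about 0.333.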
Plugged in to APIs & Government
Import data directly via APIs or from your desktop
Full historical Twitter access
PowerTrack operators for more precise queries
Store social data with survey responses and other data
Private, 3rd party & free (rate limited) social data sources
Unlimited “fire hose” premium data sources
The Five Pillars of Text Analytics
Search
Filtering
De-duplication and Clustering
Human Coding
Machine-Learning
Pillar #1: Search
Pillar #1: Defined multi-term search
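A defined multi-term search can be thought of as running a saved list of terms against every item and recording which items each term hits. The following is a minimal sketch under that assumption; `multi_term_search`, the sample items, and the term list are hypothetical and do not describe the product's own search implementation.

```python
import re

def multi_term_search(items, terms):
    """Return {term: [indices of matching items]} using whole-word, case-insensitive matching."""
    hits = {}
    for term in terms:
        pattern = re.compile(r"\b" + re.escape(term) + r"\b", re.IGNORECASE)
        hits[term] = [i for i, text in enumerate(items) if pattern.search(text)]
    return hits

# Hypothetical social data items and a saved term list.
items = [
    "Apple unveils a new phone today",
    "Baked an apple pie for dessert",
    "The orchard harvest starts in September",
    "New iPhone battery life is great",
]
hits = multi_term_search(items, ["apple", "iphone", "orchard"])
print(hits)
```

Note that "apple" matches both the company item and the pie item, which is exactly the ambiguity that later coding and classification steps have to resolve.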
Pillar #2: Filters
Pillar #3: Deduplication & clustering
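As a rough illustration of de-duplication and clustering, near-duplicate items can be grouped by comparing overlapping word sequences (shingles) with Jaccard similarity. This is a sketch of one generic technique, not the toolkit's own algorithm; the function names, the 0.5 threshold, and the sample items are assumptions.

```python
import re

def shingles(text, k=3):
    """Set of k-word sequences from the lowercased text."""
    words = re.findall(r"[a-z0-9#@']+", text.lower())
    if len(words) < k:
        return {tuple(words)}
    return {tuple(words[i:i + k]) for i in range(len(words) - k + 1)}

def jaccard(a, b):
    """Size of the intersection divided by size of the union."""
    union = a | b
    return len(a & b) / len(union) if union else 1.0

def cluster_near_duplicates(items, threshold=0.5, k=3):
    """Greedy clustering: each item joins the first cluster whose seed it resembles."""
    signatures = [shingles(text, k) for text in items]
    clusters = []  # lists of item indices; the first index is the cluster seed
    for i, signature in enumerate(signatures):
        for cluster in clusters:
            if jaccard(signature, signatures[cluster[0]]) >= threshold:
                cluster.append(i)
                break
        else:
            clusters.append([i])
    return clusters

# Hypothetical items: a retweet, the original, an unrelated post, and an exact repeat.
items = [
    "RT @news: Apple announces record quarterly earnings today",
    "Apple announces record quarterly earnings today",
    "Picked a fresh apple from the orchard this morning",
    "RT @news: Apple announces record quarterly earnings today",
]
clusters = cluster_near_duplicates(items)
print(clusters)
```

Coding one representative per cluster instead of every copy is one way such grouping can reduce the human coding load.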
Pillar #4: Human coding (a.k.a. labeling or tagging)
Pillar #4: Human coding (adjudication)
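Adjudication generally means a reviewer resolves the items on which coders disagreed. A minimal sketch of the first step, separating unanimous items from disputed ones, might look like the following; the data layout, coder names, and function name are hypothetical.

```python
def split_for_adjudication(codings):
    """codings: {item_id: {coder: label}} -> (agreed labels, item ids needing adjudication)."""
    agreed, disputed = {}, []
    for item_id, by_coder in codings.items():
        labels = set(by_coder.values())
        if len(labels) == 1:
            agreed[item_id] = labels.pop()  # every coder chose the same label
        else:
            disputed.append(item_id)        # send to an adjudicator
    return agreed, disputed

# Hypothetical codings from three coders.
codings = {
    "tweet-1": {"ana": "relevant", "ben": "relevant", "cho": "relevant"},
    "tweet-2": {"ana": "relevant", "ben": "irrelevant", "cho": "relevant"},
    "tweet-3": {"ana": "irrelevant", "ben": "irrelevant", "cho": "irrelevant"},
}
agreed, disputed = split_for_adjudication(codings)
print(agreed, disputed)
```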
Pillar #5: Machine-learning
Our ActiveLearning engine and coding tools combine…
what humans do best… with what computers do best
Humans and machines learning together
Keep humans “in-the-loop” for more accurate results and better insights
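One widely used way to keep humans in the loop is uncertainty sampling: the machine scores every uncoded item, and the items it is least sure about go to human coders first. The sketch below shows only that selection step, assuming the scores are probabilities of the "relevant" class; it is a generic illustration and not a description of how the ActiveLearning engine works internally. The item ids and scores are made up.

```python
def most_uncertain(probabilities, k=2):
    """probabilities: {item_id: P(relevant)}; return the k item ids closest to 0.5."""
    return sorted(probabilities, key=lambda item: abs(probabilities[item] - 0.5))[:k]

# Hypothetical classifier scores for five uncoded items.
scores = {"t1": 0.97, "t2": 0.52, "t3": 0.08, "t4": 0.41, "t5": 0.75}
queue = most_uncertain(scores, k=2)
print(queue)
```

After coders label the queued items, the classifier would be retrained and the remaining items rescored, repeating until accuracy is acceptable.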
Word sense disambiguation (relevance)
Human coding can be converted into machine classifiers
Accumulated human coding becomes training data via machine-learning
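To make the idea concrete, here is a small sketch in which human-coded items train a multinomial Naive Bayes classifier that separates "apple" the company (relevant) from "apple" the fruit (irrelevant). Naive Bayes is chosen here only because it is simple to show; the source does not say which learning algorithm the toolkit uses, and the class, training texts, and labels are all hypothetical.

```python
import math
import re
from collections import Counter, defaultdict

def tokenize(text):
    return re.findall(r"[a-z0-9#@']+", text.lower())

class NaiveBayes:
    """Multinomial Naive Bayes with add-one (Laplace) smoothing."""

    def fit(self, texts, labels):
        self.class_counts = Counter(labels)
        self.word_counts = defaultdict(Counter)
        for text, label in zip(texts, labels):
            self.word_counts[label].update(tokenize(text))
        self.vocab = {w for counts in self.word_counts.values() for w in counts}
        return self

    def log_scores(self, text):
        total_docs = sum(self.class_counts.values())
        scores = {}
        for label, n_docs in self.class_counts.items():
            counts = self.word_counts[label]
            denom = sum(counts.values()) + len(self.vocab)
            score = math.log(n_docs / total_docs)  # class prior
            for word in tokenize(text):
                if word in self.vocab:  # words never seen in training are skipped
                    score += math.log((counts[word] + 1) / denom)
            scores[label] = score
        return scores

    def predict(self, text):
        scores = self.log_scores(text)
        return max(scores, key=scores.get)

# Hypothetical human-coded training data for the ambiguous term "apple".
texts = [
    "Apple stock jumps after iPhone launch",
    "New Apple iPhone review and battery test",
    "Apple earnings beat expectations this quarter",
    "Baked an apple pie with cinnamon",
    "Fresh apple cider at the orchard",
    "Apple picking at the orchard this weekend",
]
labels = ["relevant"] * 3 + ["irrelevant"] * 3
model = NaiveBayes().fit(texts, labels)
print(model.predict("Apple iPhone battery drains fast"))
print(model.predict("Apple pie at the orchard"))
```

In keeping with the "validate, validate, validate" advice earlier in the deck, a classifier like this would need to be checked against held-out human codes before its labels are trusted.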
Users can drill into interactive reporting displays
Use metadata to examine sub-sets of responses and create reports.
Slicing big piles of text into smaller, more focused sets is key
Ultimately all text analytics are filtering techniques
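Slicing by metadata can be sketched as a filter that keeps only the items whose metadata fields match the requested values. The field names (`lang`, `source`) and the item layout below are assumptions made for illustration.

```python
def filter_items(items, **criteria):
    """Keep items whose metadata matches every key=value pair in criteria."""
    return [
        item for item in items
        if all(item["meta"].get(key) == value for key, value in criteria.items())
    ]

# Hypothetical items with attached metadata.
items = [
    {"text": "Apple earnings beat expectations", "meta": {"lang": "en", "source": "twitter"}},
    {"text": "Tarte aux pommes ce soir", "meta": {"lang": "fr", "source": "twitter"}},
    {"text": "Love the new phone", "meta": {"lang": "en", "source": "survey"}},
]
english_tweets = filter_items(items, lang="en", source="twitter")
print([item["text"] for item in english_tweets])
```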
Crowdsourcing accelerates the insight generation process through machine-learning
Distributed for synchronous & asynchronous collaboration
CoderRank (patent pending) for enhanced machine-learning is our key innovation
For more information visit the Texifter table or
discovertext.com
@discovertext
Thank you for listening!