SlideShare a Scribd company logo
NLP for Small Data
Concepción Polo
María José García
NLP is out there
NLP is complicated
He comprado libros y muebles de segunda mano
El perro de Juan
Aceite de oliva, aceite de palma… aceite de bebé
En casa nos gusta comer cocido
Phonetics and Phonology
Natural Language
Morphology and Lexicon
Phonetics and Phonology
Natural Language
Syntax
Morphology and Lexicon
Phonetics and Phonology
Natural Language
“Me gusta mucho el cuadro de Juan”
¿Juan posee el cuadro?
¿Juan ha pintado el cuadro?
¿Juan está pintado en el cuadro?
Semantics
Syntax
Morphology and Lexicon
Phonetics and Phonology
Natural Language
Juan es / está gordo
Christian Bale #es / está gordo
Paquita Salas es / #está gorda
Pragmatics
Semantics
Syntax
Morphology and Lexicon
Phonetics and Phonology
Natural Language
time
big bang
Deep Learning
we are here
Alan Turing
people drawing strange
pictures of eyes and birds
in Egypt
EMACS doctor
ML
?
time
big bang
Deep Learning
we are here
Alan Turing
people drawing strange
pictures of eyes and birds
in Egypt
EMACS doctor
ML
?
Rationalism
(part of NLTK)
{
(([{ner:PERSON}]) /was/ /born/ /on/ ([{ner:DATE}]))
=> "DATE_OF_BIRTH"
}
(https://nlp.stanford.edu/software/tokensregex.html)
Languages evolve!
time
big bang
Deep Learning
we are here
Alan Turing
people drawing strange
pictures of eyes and birds
in Egypt
EMACS doctor
ML
?
Empiricism
Features
hello
mom
dad
bye
Hello, mom!
Hello, dad!
Bye!
1
2
3
4
[1, 1, 0, 0]
[1, 0, 1, 0]
[0, 0, 0, 1]
A good starting point but...
time
big bang
Deep Learning
we are here
Alan Turing
people drawing strange
pictures of eyes and birds
in Egypt
EMACS doctor
ML
?
Deep Learning
Embeddings
Word2Vec
fastText
GloVe
Good
Bad
Bright
Dark
White
Black
http://jalammar.github.io/illustrated-bert/
Needs a lot of data
Case of study: big telco
15k summaries
16 words average
typos, abbreviations
250+ root causes
10 days
Señor,
llévame
pronto.
Deep
Learning
Deep
Learning
Linguistics
Text
DL Model
Linguistic model
“We are observing the death of a star”
Astronomy
OK astronomy
“Planets move in circles inside other circles”
Astronomy
More like ancient astronomy
“I am building a Death Star”
Astronomy
LOL
no
Accuracy
Time
Deep
Learning
Linguistics
time
big bang
Deep Learning
we are here
Alan Turing
people drawing strange
pictures of eyes and birds
in Egypt
EMACS doctor
ML
?
Transfer
Learning
Linguistics
Transfer
Learning
17h
Track 4
Thank you!
cpolo@meaningcloud.com
mgarcia@meaningcloud.com
New York
3537 36th St
+1 (646) 403-3104
Madrid
Labastida, 1
+34 910 754 276
www.meaningcloud.com
No engineer was hurt in the making of this presentation

More Related Content

More from MeaningCloud

How to extract health market intelligence from the voice of the patient - Mea...
How to extract health market intelligence from the voice of the patient - Mea...How to extract health market intelligence from the voice of the patient - Mea...
How to extract health market intelligence from the voice of the patient - Mea...MeaningCloud
 
Why you need Deep Semantic Analytics MeaningCloud webinar
Why you need Deep Semantic Analytics  MeaningCloud webinarWhy you need Deep Semantic Analytics  MeaningCloud webinar
Why you need Deep Semantic Analytics MeaningCloud webinarMeaningCloud
 
Por qué necesitas Deep Semantic Analytics - MeaningCloud webinar
Por qué necesitas Deep Semantic Analytics - MeaningCloud webinarPor qué necesitas Deep Semantic Analytics - MeaningCloud webinar
Por qué necesitas Deep Semantic Analytics - MeaningCloud webinarMeaningCloud
 
Integrate the most advanced text analytics into your predictive models - Mean...
Integrate the most advanced text analytics into your predictive models - Mean...Integrate the most advanced text analytics into your predictive models - Mean...
Integrate the most advanced text analytics into your predictive models - Mean...MeaningCloud
 
Incorpora la analitica de texto mas avanzada a tus modelos predictivos - Mean...
Incorpora la analitica de texto mas avanzada a tus modelos predictivos - Mean...Incorpora la analitica de texto mas avanzada a tus modelos predictivos - Mean...
Incorpora la analitica de texto mas avanzada a tus modelos predictivos - Mean...MeaningCloud
 
When to use the different text analytics tools - Meaning Cloud
When to use the different text analytics tools - Meaning CloudWhen to use the different text analytics tools - Meaning Cloud
When to use the different text analytics tools - Meaning CloudMeaningCloud
 
Cuándo usar las diferentes herramientas de analítica de texto - Meaningcloud
Cuándo usar las diferentes herramientas de analítica de texto - MeaningcloudCuándo usar las diferentes herramientas de analítica de texto - Meaningcloud
Cuándo usar las diferentes herramientas de analítica de texto - MeaningcloudMeaningCloud
 
Aprende a desarrollar clasificadores de texto a medida con MeaningCloud
Aprende a desarrollar clasificadores de texto a medida con MeaningCloudAprende a desarrollar clasificadores de texto a medida con MeaningCloud
Aprende a desarrollar clasificadores de texto a medida con MeaningCloudMeaningCloud
 
Entirely tailored sentiment analysis - MeaningCloud webinar
Entirely tailored sentiment analysis - MeaningCloud webinarEntirely tailored sentiment analysis - MeaningCloud webinar
Entirely tailored sentiment analysis - MeaningCloud webinarMeaningCloud
 
10 formas de aumentar los beneficios de los medios utilizando metadatos - pre...
10 formas de aumentar los beneficios de los medios utilizando metadatos - pre...10 formas de aumentar los beneficios de los medios utilizando metadatos - pre...
10 formas de aumentar los beneficios de los medios utilizando metadatos - pre...MeaningCloud
 
10 formas de aumentar los beneficios de los medios utilizando metadatos - pre...
10 formas de aumentar los beneficios de los medios utilizando metadatos - pre...10 formas de aumentar los beneficios de los medios utilizando metadatos - pre...
10 formas de aumentar los beneficios de los medios utilizando metadatos - pre...MeaningCloud
 
Intelligent Content for Media & Publishers
Intelligent Content for Media & PublishersIntelligent Content for Media & Publishers
Intelligent Content for Media & PublishersMeaningCloud
 
Voz del Cliente para el Sector de Seguros
Voz del Cliente para el Sector de SegurosVoz del Cliente para el Sector de Seguros
Voz del Cliente para el Sector de SegurosMeaningCloud
 
MeaningCloud - Multidimensional Customer Profiling - Sentiment Analysis Sympo...
MeaningCloud - Multidimensional Customer Profiling - Sentiment Analysis Sympo...MeaningCloud - Multidimensional Customer Profiling - Sentiment Analysis Sympo...
MeaningCloud - Multidimensional Customer Profiling - Sentiment Analysis Sympo...MeaningCloud
 
Improve Customer Experience Management with Text Analytics - MeaningCloud web...
Improve Customer Experience Management with Text Analytics - MeaningCloud web...Improve Customer Experience Management with Text Analytics - MeaningCloud web...
Improve Customer Experience Management with Text Analytics - MeaningCloud web...MeaningCloud
 
Boost Your Text Analytics Accuracy - MeaningCloud Webinar
Boost Your Text Analytics Accuracy - MeaningCloud WebinarBoost Your Text Analytics Accuracy - MeaningCloud Webinar
Boost Your Text Analytics Accuracy - MeaningCloud WebinarMeaningCloud
 

More from MeaningCloud (16)

How to extract health market intelligence from the voice of the patient - Mea...
How to extract health market intelligence from the voice of the patient - Mea...How to extract health market intelligence from the voice of the patient - Mea...
How to extract health market intelligence from the voice of the patient - Mea...
 
Why you need Deep Semantic Analytics MeaningCloud webinar
Why you need Deep Semantic Analytics  MeaningCloud webinarWhy you need Deep Semantic Analytics  MeaningCloud webinar
Why you need Deep Semantic Analytics MeaningCloud webinar
 
Por qué necesitas Deep Semantic Analytics - MeaningCloud webinar
Por qué necesitas Deep Semantic Analytics - MeaningCloud webinarPor qué necesitas Deep Semantic Analytics - MeaningCloud webinar
Por qué necesitas Deep Semantic Analytics - MeaningCloud webinar
 
Integrate the most advanced text analytics into your predictive models - Mean...
Integrate the most advanced text analytics into your predictive models - Mean...Integrate the most advanced text analytics into your predictive models - Mean...
Integrate the most advanced text analytics into your predictive models - Mean...
 
Incorpora la analitica de texto mas avanzada a tus modelos predictivos - Mean...
Incorpora la analitica de texto mas avanzada a tus modelos predictivos - Mean...Incorpora la analitica de texto mas avanzada a tus modelos predictivos - Mean...
Incorpora la analitica de texto mas avanzada a tus modelos predictivos - Mean...
 
When to use the different text analytics tools - Meaning Cloud
When to use the different text analytics tools - Meaning CloudWhen to use the different text analytics tools - Meaning Cloud
When to use the different text analytics tools - Meaning Cloud
 
Cuándo usar las diferentes herramientas de analítica de texto - Meaningcloud
Cuándo usar las diferentes herramientas de analítica de texto - MeaningcloudCuándo usar las diferentes herramientas de analítica de texto - Meaningcloud
Cuándo usar las diferentes herramientas de analítica de texto - Meaningcloud
 
Aprende a desarrollar clasificadores de texto a medida con MeaningCloud
Aprende a desarrollar clasificadores de texto a medida con MeaningCloudAprende a desarrollar clasificadores de texto a medida con MeaningCloud
Aprende a desarrollar clasificadores de texto a medida con MeaningCloud
 
Entirely tailored sentiment analysis - MeaningCloud webinar
Entirely tailored sentiment analysis - MeaningCloud webinarEntirely tailored sentiment analysis - MeaningCloud webinar
Entirely tailored sentiment analysis - MeaningCloud webinar
 
10 formas de aumentar los beneficios de los medios utilizando metadatos - pre...
10 formas de aumentar los beneficios de los medios utilizando metadatos - pre...10 formas de aumentar los beneficios de los medios utilizando metadatos - pre...
10 formas de aumentar los beneficios de los medios utilizando metadatos - pre...
 
10 formas de aumentar los beneficios de los medios utilizando metadatos - pre...
10 formas de aumentar los beneficios de los medios utilizando metadatos - pre...10 formas de aumentar los beneficios de los medios utilizando metadatos - pre...
10 formas de aumentar los beneficios de los medios utilizando metadatos - pre...
 
Intelligent Content for Media & Publishers
Intelligent Content for Media & PublishersIntelligent Content for Media & Publishers
Intelligent Content for Media & Publishers
 
Voz del Cliente para el Sector de Seguros
Voz del Cliente para el Sector de SegurosVoz del Cliente para el Sector de Seguros
Voz del Cliente para el Sector de Seguros
 
MeaningCloud - Multidimensional Customer Profiling - Sentiment Analysis Sympo...
MeaningCloud - Multidimensional Customer Profiling - Sentiment Analysis Sympo...MeaningCloud - Multidimensional Customer Profiling - Sentiment Analysis Sympo...
MeaningCloud - Multidimensional Customer Profiling - Sentiment Analysis Sympo...
 
Improve Customer Experience Management with Text Analytics - MeaningCloud web...
Improve Customer Experience Management with Text Analytics - MeaningCloud web...Improve Customer Experience Management with Text Analytics - MeaningCloud web...
Improve Customer Experience Management with Text Analytics - MeaningCloud web...
 
Boost Your Text Analytics Accuracy - MeaningCloud Webinar
Boost Your Text Analytics Accuracy - MeaningCloud WebinarBoost Your Text Analytics Accuracy - MeaningCloud Webinar
Boost Your Text Analytics Accuracy - MeaningCloud Webinar
 

Recently uploaded

Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...Product School
 
Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*Frank van Harmelen
 
Quantum Computing: Current Landscape and the Future Role of APIs
Quantum Computing: Current Landscape and the Future Role of APIsQuantum Computing: Current Landscape and the Future Role of APIs
Quantum Computing: Current Landscape and the Future Role of APIsVlad Stirbu
 
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...Jeffrey Haguewood
 
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...Product School
 
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualitySoftware Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualityInflectra
 
Exploring UiPath Orchestrator API: updates and limits in 2024 🚀
Exploring UiPath Orchestrator API: updates and limits in 2024 🚀Exploring UiPath Orchestrator API: updates and limits in 2024 🚀
Exploring UiPath Orchestrator API: updates and limits in 2024 🚀DianaGray10
 
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...UiPathCommunity
 
JMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and GrafanaJMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and GrafanaRTTS
 
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...Thierry Lestable
 
IoT Analytics Company Presentation May 2024
IoT Analytics Company Presentation May 2024IoT Analytics Company Presentation May 2024
IoT Analytics Company Presentation May 2024IoTAnalytics
 
UiPath Test Automation using UiPath Test Suite series, part 2
UiPath Test Automation using UiPath Test Suite series, part 2UiPath Test Automation using UiPath Test Suite series, part 2
UiPath Test Automation using UiPath Test Suite series, part 2DianaGray10
 
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdfSmart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf91mobiles
 
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo DiehlFuture Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo DiehlPeter Udo Diehl
 
How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...Product School
 
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMsTo Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMsPaul Groth
 
When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...Elena Simperl
 
PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)Ralf Eggert
 
НАДІЯ ФЕДЮШКО БАЦ «Професійне зростання QA спеціаліста»
НАДІЯ ФЕДЮШКО БАЦ  «Професійне зростання QA спеціаліста»НАДІЯ ФЕДЮШКО БАЦ  «Професійне зростання QA спеціаліста»
НАДІЯ ФЕДЮШКО БАЦ «Професійне зростання QA спеціаліста»QADay
 
Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........Alison B. Lowndes
 

Recently uploaded (20)

Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
 
Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*
 
Quantum Computing: Current Landscape and the Future Role of APIs
Quantum Computing: Current Landscape and the Future Role of APIsQuantum Computing: Current Landscape and the Future Role of APIs
Quantum Computing: Current Landscape and the Future Role of APIs
 
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
 
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
 
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualitySoftware Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
 
Exploring UiPath Orchestrator API: updates and limits in 2024 🚀
Exploring UiPath Orchestrator API: updates and limits in 2024 🚀Exploring UiPath Orchestrator API: updates and limits in 2024 🚀
Exploring UiPath Orchestrator API: updates and limits in 2024 🚀
 
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
 
JMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and GrafanaJMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and Grafana
 
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
 
IoT Analytics Company Presentation May 2024
IoT Analytics Company Presentation May 2024IoT Analytics Company Presentation May 2024
IoT Analytics Company Presentation May 2024
 
UiPath Test Automation using UiPath Test Suite series, part 2
UiPath Test Automation using UiPath Test Suite series, part 2UiPath Test Automation using UiPath Test Suite series, part 2
UiPath Test Automation using UiPath Test Suite series, part 2
 
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdfSmart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
 
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo DiehlFuture Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
 
How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...
 
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMsTo Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
 
When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...
 
PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)
 
НАДІЯ ФЕДЮШКО БАЦ «Професійне зростання QA спеціаліста»
НАДІЯ ФЕДЮШКО БАЦ  «Професійне зростання QA спеціаліста»НАДІЯ ФЕДЮШКО БАЦ  «Професійне зростання QA спеціаліста»
НАДІЯ ФЕДЮШКО БАЦ «Професійне зростання QA спеціаліста»
 
Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........
 

NLP for Small Data - MeaningCloud at T3chFest 2019