DeepCare Chatbot - Generating answers to customers using a hybrid approach of Deep Learning and NLP

DeepCare Chatbot
Generating answers to customers using a hybrid approach
of
Deep Learning and NLP
Pascal van Kooten
2
Introduction
● First conference; first talk
● MSc Methods & Statistics, Utrecht University
● Innovating @ Jibes Data Analytics for ~3 years
● Working for companies to investigate:
– Blockchain
– Computer Vision
– Machine Learning / NLP
● Human Machine Interaction
3
Gartner’s Hype Cycle 2016
4
And this is where you can find me...
5
Fun projects
● whereami
Uses WiFi signals and machine learning to predict where you are
● deep_eye2mouse
Move the mouse by your webcam + gaze
● sky
Using machine intelligence for web crawling
● Neural Complete (presentation in 2 weeks in Moscow)
Neural network to auto complete neural network code
● And…. DeepCare
github.com/kootenpv
6
Chatbots
7
Chatbots
8
Chatbots
● These chatbots take the initiative
● AT&T experimented with chatbots, but cannot
see one online
● Fun experiment to have a reactive chatbot
9
Data
● Find “big QA data” in the English language
● @ATTCares, 1.3M tweets as of May 2017
● 100k Q&A collected through twitter api
● ~15k related to “wanting/needing iphone”
10
Problem
● They receive ~3000 tweets per day
● Corporations want an automated solution
● Very versatile problems:
– Information requests
– Complaints
– Ask for change in system
● Bot needs to detect when it should be quiet
● First task: “social selling”
11
Goal
● Without business constraints…
● Try to see if a bot can reliably reply like a human
● Combine NLP and Deep Learning
– NLP for transparency, answer yes/no
– Deep learning for variation in response
12
http://deepcare.online
(available during PyData)
13
Examples (1/2)
Q: I want an iphone for xmas
A: santa heard you, pvkooten order an iphone at att.com/search?
q=iphone & get #rolloverdata asap. ^fakeAT&Tbot t.co/J4c8G3KR1S
Q: can't decide if I want an iphone 7 or samsung note 4
A: we have great deals on both! get both this your lifestyle! check them
out here att.com/search?q=iphone 7 ^fakeAT&Tbot
Q: I want to see a movie
A: I'm not sure if an iphone is what you really want. I can only help you if
you want an iphone.
14
Examples (2/2)
Q: I want to sell an iphone
A: I'm not sure if an iphone is what you really want. I can only help you if you want an
iphone.
Q: I want to buy an iphone
A: we have got what you need, pvkooten grab an iphone w/ #rolloverdata @ att.com/search?
q=iphone today! t.co/J4c8G3KR1S
Q: I need a charger for my iphone
A: we have one for you, pvkooten get a new charger & many other code tweet-@
att.com/search?q=iphone charger for 20%!^fakeAT&Tbot
Q: i want a case for my iphone 6
A: we have got you covered, pvkooten get a fab case @ att.com/search?q=iphone 6 case
%20cover%20 g & dm me for $$$ off! ^fakeAT&Tbot t.co/J4c8G3KR1S
15
It’s not only deep learning
● Logical mistake:
Q: Stop my samsung contract
A: Ok, I stopped your iphone contract
16
Overview
● Pre-processing
– Generic / Twitter related
– Domain specific
● Phase 1: NLP Model
● Phase 2: Deep Learning answer generation
● Post-processing
Preprocessing
Pascal van Kooten
18
Case study
Q: i want a case for my iphone 6
A: we have got you covered, pvkooten get a fab case @
att.com/search?q=iphone 6 case%20cover%20 g & dm me for $
$$ off! ^fakeAT&Tbot t.co/J4c8G3KR1S
is seen after the preprocessing step as:
Q: i want a case for my PRODUCT
A: we have got you covered, NAME get a fab case @
OFFER_LINK & dm me for MONEY_AMOUNT off! INITIALS
WEBCARE_LINK
19
Generic preprocessing
● Name (API provides actual name) NAME→
● Employee name+time AVAILABILITY→
● Tags (@something) TAG→
● Links LINK (experimented)→
● Emoticons
20
Specific preprocessing
● Brand name
● Products
● Competitors
21
Preprocessing
● Preprocessing to help generalize
● Only preprocess things that can be reused in the
domain
● Plus a crucial component to a company: PRODUCT
● ….
NLP Model
Restrict answering
Pascal van Kooten
23
Traditional NLP
● Depending on language model
– Lemmatization (cats → cat)
– Part of Speech (cat → NOUN)
– Dependency tree (I → subject)
✓Allows generalization!
✗Depends on language model getting it right
✗Feature explosion makes it difficult to generalize
✗Creating rules can be time consuming
✓ It’s very transparant
✓ Rules can give guarantees deep learning can’t
24
Model explanation
● spaCy (https://spacy.io/)
● Rule uses:
– Part of Speech
– Lemma
– Negation
For a sentence,
- need/want... is the lemmatized verb
- which has PRODUCT as a nearby child
- but no negation dependency on the same level or above as PRODUCT
- the subject is “I”
- ???
25
Case study
Q: i want a case for my iphone 6
A: we have got you covered, pvkooten get a fab case @ att.com/search?q=iphone
6 case%20cover%20 g & dm me for $$$ off! ^fakeAT&Tbot t.co/J4c8G3KR1S
want (VERB, ROOT) ✓
---- I (PRON, nsubj) ✓
---- case (NOUN, dobj)
-------- a (DET, det)
-------- for (ADP, prep)
------------ iphone (NOUN, pobj) ✓
---------------- my (ADJ, poss)
---------------- 6 (NUM, nummod)
26
Examples
want galaxy not iphone
not iphone ✓ want iphone!
Generative model
Generating answers
Pascal van Kooten
28
Generative neural network model
● Popularized by Google’s seq2seq:
A general-purpose encoder-decoder framework for Tensorflow that can be
used for Machine Translation, Text Summarization, Conversational Modeling,
Image Captioning, and more.
● Map sequence of words to another sequence of words
● Chinese sentence to English sentence
● Given training examples, minimize loss
● Can also be used for creating a chatbot
● Most likely a variation on this model is being used by Google
Translate
29
Generative neural network model
30
Generative neural network model
https://indico.io/blog/sequence-modeling-neuralnets-part1/
31
Code
● tensorflow
● Started generative model code from
Conchylicultor/DeepQA
32
Neural network parameters
● batch_size: 128
● glob_step: 3186
● max_length: 25
● learning_rate: 0.001
● embedding_size: 128
● hidden_size: 512
● num_layers: 2
---------------------
loss: 0.06
time: 24 hours
33
Conclusion
●
It makes small grammar mistakes
●
It can still make mistakes in logic
– Mostly with unseen data
● Not optimal because 2 models, but fun experiment
● Like OpenAI….
●
Shows signs of sentiment:
Q: I want an iphone but my mom says iphones are the devil
A: do not get upset! order an iphone at att.com/search?q=iphone iphones
with…
34
Questions?
1 of 34

Recommended

What is "Domain Driven Design" and what can you expect from it? by
What is "Domain Driven Design" and what can you expect from it?What is "Domain Driven Design" and what can you expect from it?
What is "Domain Driven Design" and what can you expect from it?Tom Janssens
1.2K views43 slides
Google Assistant Overview by
Google Assistant Overview  Google Assistant Overview
Google Assistant Overview AI.academy
486 views93 slides
Google Cloud: Next'19 Extended Hanoi by
Google Cloud: Next'19 Extended HanoiGoogle Cloud: Next'19 Extended Hanoi
Google Cloud: Next'19 Extended HanoiGCPUserGroupVietnam
471 views75 slides
Easy path to machine learning (Spring 2020) by
Easy path to machine learning (Spring 2020)Easy path to machine learning (Spring 2020)
Easy path to machine learning (Spring 2020)wesley chun
505 views72 slides
Performance #5 cpu and battery by
Performance #5  cpu and batteryPerformance #5  cpu and battery
Performance #5 cpu and batteryVitali Pekelis
198 views134 slides
Scaling Up Machine Learning: How to Benchmark GraphLab Create on Huge Datasets by
Scaling Up Machine Learning: How to Benchmark GraphLab Create on Huge DatasetsScaling Up Machine Learning: How to Benchmark GraphLab Create on Huge Datasets
Scaling Up Machine Learning: How to Benchmark GraphLab Create on Huge DatasetsTuri, Inc.
825 views15 slides

More Related Content

Similar to DeepCare Chatbot - Generating answers to customers using a hybrid approach of Deep Learning and NLP

Everything You Were Taught About Java Is Wrong by
Everything You Were Taught About Java Is WrongEverything You Were Taught About Java Is Wrong
Everything You Were Taught About Java Is WrongTim Boudreau
803 views59 slides
Scaffolding a legacy app with BDD scenarios using SpecFlow/Cucumber (HUSTEF 2... by
Scaffolding a legacy app with BDD scenarios using SpecFlow/Cucumber (HUSTEF 2...Scaffolding a legacy app with BDD scenarios using SpecFlow/Cucumber (HUSTEF 2...
Scaffolding a legacy app with BDD scenarios using SpecFlow/Cucumber (HUSTEF 2...Gáspár Nagy
566 views46 slides
Limits of Machine Learning by
Limits of Machine LearningLimits of Machine Learning
Limits of Machine LearningAlexey Grigorev
1K views76 slides
Mobile App Feature Configuration and A/B Experiments by
Mobile App Feature Configuration and A/B ExperimentsMobile App Feature Configuration and A/B Experiments
Mobile App Feature Configuration and A/B Experimentslacyrhoades
3.1K views74 slides
Pretotyping g motta agile brazil 2013 by
Pretotyping g motta agile brazil 2013Pretotyping g motta agile brazil 2013
Pretotyping g motta agile brazil 2013Guilherme Motta
1.5K views90 slides
Cloud Apps - Running Fully Distributed on Mobile Devices - Dominik Rüttimann by
Cloud Apps - Running Fully Distributed on Mobile Devices - Dominik RüttimannCloud Apps - Running Fully Distributed on Mobile Devices - Dominik Rüttimann
Cloud Apps - Running Fully Distributed on Mobile Devices - Dominik Rüttimanndistributed matters
1.3K views20 slides

Similar to DeepCare Chatbot - Generating answers to customers using a hybrid approach of Deep Learning and NLP(20)

Everything You Were Taught About Java Is Wrong by Tim Boudreau
Everything You Were Taught About Java Is WrongEverything You Were Taught About Java Is Wrong
Everything You Were Taught About Java Is Wrong
Tim Boudreau803 views
Scaffolding a legacy app with BDD scenarios using SpecFlow/Cucumber (HUSTEF 2... by Gáspár Nagy
Scaffolding a legacy app with BDD scenarios using SpecFlow/Cucumber (HUSTEF 2...Scaffolding a legacy app with BDD scenarios using SpecFlow/Cucumber (HUSTEF 2...
Scaffolding a legacy app with BDD scenarios using SpecFlow/Cucumber (HUSTEF 2...
Gáspár Nagy566 views
Mobile App Feature Configuration and A/B Experiments by lacyrhoades
Mobile App Feature Configuration and A/B ExperimentsMobile App Feature Configuration and A/B Experiments
Mobile App Feature Configuration and A/B Experiments
lacyrhoades3.1K views
Pretotyping g motta agile brazil 2013 by Guilherme Motta
Pretotyping g motta agile brazil 2013Pretotyping g motta agile brazil 2013
Pretotyping g motta agile brazil 2013
Guilherme Motta1.5K views
Cloud Apps - Running Fully Distributed on Mobile Devices - Dominik Rüttimann by distributed matters
Cloud Apps - Running Fully Distributed on Mobile Devices - Dominik RüttimannCloud Apps - Running Fully Distributed on Mobile Devices - Dominik Rüttimann
Cloud Apps - Running Fully Distributed on Mobile Devices - Dominik Rüttimann
distributed matters1.3K views
The case for Web components - Drupal4Gov webinar by btopro
The case for Web components - Drupal4Gov webinarThe case for Web components - Drupal4Gov webinar
The case for Web components - Drupal4Gov webinar
btopro148 views
Inteligencia artificial para android como empezar by Isabel Palomar
Inteligencia artificial para android como empezarInteligencia artificial para android como empezar
Inteligencia artificial para android como empezar
Isabel Palomar67 views
Intro To Django by Udi Bauman
Intro To DjangoIntro To Django
Intro To Django
Udi Bauman2.6K views
Welcome to a new reality - DeepCrawl Webinar 2018 by Bastian Grimm
Welcome to a new reality - DeepCrawl Webinar 2018Welcome to a new reality - DeepCrawl Webinar 2018
Welcome to a new reality - DeepCrawl Webinar 2018
Bastian Grimm23.4K views
You shouldneverdo by daniil3
You shouldneverdoYou shouldneverdo
You shouldneverdo
daniil3157 views
Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014 by Austin Ogilvie
Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014
Applied Data Science: Building a Beer Recommender | Data Science MD - Oct 2014
Austin Ogilvie1.7K views
200,000 Lines Later: Our Journey to Manageable Puppet Code by David Danzilio
200,000 Lines Later: Our Journey to Manageable Puppet Code200,000 Lines Later: Our Journey to Manageable Puppet Code
200,000 Lines Later: Our Journey to Manageable Puppet Code
David Danzilio524 views
De-Google Your Life by Lorin Olsen
De-Google Your LifeDe-Google Your Life
De-Google Your Life
Lorin Olsen69 views
Prepare for the Mobilacalypse by Jeff Eaton
Prepare for the MobilacalypsePrepare for the Mobilacalypse
Prepare for the Mobilacalypse
Jeff Eaton1.6K views
Upwork time log and difficulty 20160523 by Sharon Liu
Upwork time log and difficulty 20160523Upwork time log and difficulty 20160523
Upwork time log and difficulty 20160523
Sharon Liu432 views

Recently uploaded

Amy slides.pdf by
Amy slides.pdfAmy slides.pdf
Amy slides.pdfStatsCommunications
5 views13 slides
How Leaders See Data? (Level 1) by
How Leaders See Data? (Level 1)How Leaders See Data? (Level 1)
How Leaders See Data? (Level 1)Narendra Narendra
15 views76 slides
UNEP FI CRS Climate Risk Results.pptx by
UNEP FI CRS Climate Risk Results.pptxUNEP FI CRS Climate Risk Results.pptx
UNEP FI CRS Climate Risk Results.pptxpekka28
11 views51 slides
[DSC Europe 23] Milos Grubjesic Empowering Business with Pepsico s Advanced M... by
[DSC Europe 23] Milos Grubjesic Empowering Business with Pepsico s Advanced M...[DSC Europe 23] Milos Grubjesic Empowering Business with Pepsico s Advanced M...
[DSC Europe 23] Milos Grubjesic Empowering Business with Pepsico s Advanced M...DataScienceConferenc1
6 views11 slides
VoxelNet by
VoxelNetVoxelNet
VoxelNettaeseon ryu
9 views21 slides
Data Journeys Hard Talk workshop final.pptx by
Data Journeys Hard Talk workshop final.pptxData Journeys Hard Talk workshop final.pptx
Data Journeys Hard Talk workshop final.pptxinfo828217
10 views18 slides

Recently uploaded(20)

UNEP FI CRS Climate Risk Results.pptx by pekka28
UNEP FI CRS Climate Risk Results.pptxUNEP FI CRS Climate Risk Results.pptx
UNEP FI CRS Climate Risk Results.pptx
pekka2811 views
[DSC Europe 23] Milos Grubjesic Empowering Business with Pepsico s Advanced M... by DataScienceConferenc1
[DSC Europe 23] Milos Grubjesic Empowering Business with Pepsico s Advanced M...[DSC Europe 23] Milos Grubjesic Empowering Business with Pepsico s Advanced M...
[DSC Europe 23] Milos Grubjesic Empowering Business with Pepsico s Advanced M...
Data Journeys Hard Talk workshop final.pptx by info828217
Data Journeys Hard Talk workshop final.pptxData Journeys Hard Talk workshop final.pptx
Data Journeys Hard Talk workshop final.pptx
info82821710 views
Cross-network in Google Analytics 4.pdf by GA4 Tutorials
Cross-network in Google Analytics 4.pdfCross-network in Google Analytics 4.pdf
Cross-network in Google Analytics 4.pdf
GA4 Tutorials6 views
PRIVACY AWRE PERSONAL DATA STORAGE by antony420421
PRIVACY AWRE PERSONAL DATA STORAGEPRIVACY AWRE PERSONAL DATA STORAGE
PRIVACY AWRE PERSONAL DATA STORAGE
antony4204215 views
Survey on Factuality in LLM's.pptx by NeethaSherra1
Survey on Factuality in LLM's.pptxSurvey on Factuality in LLM's.pptx
Survey on Factuality in LLM's.pptx
NeethaSherra17 views
[DSC Europe 23][AI:CSI] Dragan Pleskonjic - AI Impact on Cybersecurity and P... by DataScienceConferenc1
[DSC Europe 23][AI:CSI]  Dragan Pleskonjic - AI Impact on Cybersecurity and P...[DSC Europe 23][AI:CSI]  Dragan Pleskonjic - AI Impact on Cybersecurity and P...
[DSC Europe 23][AI:CSI] Dragan Pleskonjic - AI Impact on Cybersecurity and P...
SUPER STORE SQL PROJECT.pptx by khan888620
SUPER STORE SQL PROJECT.pptxSUPER STORE SQL PROJECT.pptx
SUPER STORE SQL PROJECT.pptx
khan88862013 views
[DSC Europe 23][Cryptica] Martin_Summer_Digital_central_bank_money_Ideas_init... by DataScienceConferenc1
[DSC Europe 23][Cryptica] Martin_Summer_Digital_central_bank_money_Ideas_init...[DSC Europe 23][Cryptica] Martin_Summer_Digital_central_bank_money_Ideas_init...
[DSC Europe 23][Cryptica] Martin_Summer_Digital_central_bank_money_Ideas_init...
Organic Shopping in Google Analytics 4.pdf by GA4 Tutorials
Organic Shopping in Google Analytics 4.pdfOrganic Shopping in Google Analytics 4.pdf
Organic Shopping in Google Analytics 4.pdf
GA4 Tutorials16 views
Chapter 3b- Process Communication (1) (1)(1) (1).pptx by ayeshabaig2004
Chapter 3b- Process Communication (1) (1)(1) (1).pptxChapter 3b- Process Communication (1) (1)(1) (1).pptx
Chapter 3b- Process Communication (1) (1)(1) (1).pptx
ayeshabaig20047 views
CRM stick or twist.pptx by info828217
CRM stick or twist.pptxCRM stick or twist.pptx
CRM stick or twist.pptx
info82821711 views
[DSC Europe 23] Stefan Mrsic_Goran Savic - Evolving Technology Excellence.pptx by DataScienceConferenc1
[DSC Europe 23] Stefan Mrsic_Goran Savic - Evolving Technology Excellence.pptx[DSC Europe 23] Stefan Mrsic_Goran Savic - Evolving Technology Excellence.pptx
[DSC Europe 23] Stefan Mrsic_Goran Savic - Evolving Technology Excellence.pptx
CRIJ4385_Death Penalty_F23.pptx by yvettemm100
CRIJ4385_Death Penalty_F23.pptxCRIJ4385_Death Penalty_F23.pptx
CRIJ4385_Death Penalty_F23.pptx
yvettemm1006 views
Advanced_Recommendation_Systems_Presentation.pptx by neeharikasingh29
Advanced_Recommendation_Systems_Presentation.pptxAdvanced_Recommendation_Systems_Presentation.pptx
Advanced_Recommendation_Systems_Presentation.pptx

DeepCare Chatbot - Generating answers to customers using a hybrid approach of Deep Learning and NLP

  • 1. DeepCare Chatbot Generating answers to customers using a hybrid approach of Deep Learning and NLP Pascal van Kooten
  • 2. 2 Introduction ● First conference; first talk ● MSc Methods & Statistics, Utrecht University ● Innovating @ Jibes Data Analytics for ~3 years ● Working for companies to investigate: – Blockchain – Computer Vision – Machine Learning / NLP ● Human Machine Interaction
  • 4. 4 And this is where you can find me...
  • 5. 5 Fun projects ● whereami Uses WiFi signals and machine learning to predict where you are ● deep_eye2mouse Move the mouse by your webcam + gaze ● sky Using machine intelligence for web crawling ● Neural Complete (presentation in 2 weeks in Moscow) Neural network to auto complete neural network code ● And…. DeepCare github.com/kootenpv
  • 8. 8 Chatbots ● These chatbots take the initiative ● AT&T experimented with chatbots, but cannot see one online ● Fun experiment to have a reactive chatbot
  • 9. 9 Data ● Find “big QA data” in the English language ● @ATTCares, 1.3M tweets as of May 2017 ● 100k Q&A collected through twitter api ● ~15k related to “wanting/needing iphone”
  • 10. 10 Problem ● They receive ~3000 tweets per day ● Corporations want an automated solution ● Very versatile problems: – Information requests – Complaints – Ask for change in system ● Bot needs to detect when it should be quiet ● First task: “social selling”
  • 11. 11 Goal ● Without business constraints… ● Try to see if a bot can reliably reply like a human ● Combine NLP and Deep Learning – NLP for transparency, answer yes/no – Deep learning for variation in response
  • 13. 13 Examples (1/2) Q: I want an iphone for xmas A: santa heard you, pvkooten order an iphone at att.com/search? q=iphone & get #rolloverdata asap. ^fakeAT&Tbot t.co/J4c8G3KR1S Q: can't decide if I want an iphone 7 or samsung note 4 A: we have great deals on both! get both this your lifestyle! check them out here att.com/search?q=iphone 7 ^fakeAT&Tbot Q: I want to see a movie A: I'm not sure if an iphone is what you really want. I can only help you if you want an iphone.
  • 14. 14 Examples (2/2) Q: I want to sell an iphone A: I'm not sure if an iphone is what you really want. I can only help you if you want an iphone. Q: I want to buy an iphone A: we have got what you need, pvkooten grab an iphone w/ #rolloverdata @ att.com/search? q=iphone today! t.co/J4c8G3KR1S Q: I need a charger for my iphone A: we have one for you, pvkooten get a new charger & many other code tweet-@ att.com/search?q=iphone charger for 20%!^fakeAT&Tbot Q: i want a case for my iphone 6 A: we have got you covered, pvkooten get a fab case @ att.com/search?q=iphone 6 case %20cover%20 g & dm me for $$$ off! ^fakeAT&Tbot t.co/J4c8G3KR1S
  • 15. 15 It’s not only deep learning ● Logical mistake: Q: Stop my samsung contract A: Ok, I stopped your iphone contract
  • 16. 16 Overview ● Pre-processing – Generic / Twitter related – Domain specific ● Phase 1: NLP Model ● Phase 2: Deep Learning answer generation ● Post-processing
  • 18. 18 Case study Q: i want a case for my iphone 6 A: we have got you covered, pvkooten get a fab case @ att.com/search?q=iphone 6 case%20cover%20 g & dm me for $ $$ off! ^fakeAT&Tbot t.co/J4c8G3KR1S is seen after the preprocessing step as: Q: i want a case for my PRODUCT A: we have got you covered, NAME get a fab case @ OFFER_LINK & dm me for MONEY_AMOUNT off! INITIALS WEBCARE_LINK
  • 19. 19 Generic preprocessing ● Name (API provides actual name) NAME→ ● Employee name+time AVAILABILITY→ ● Tags (@something) TAG→ ● Links LINK (experimented)→ ● Emoticons
  • 20. 20 Specific preprocessing ● Brand name ● Products ● Competitors
  • 21. 21 Preprocessing ● Preprocessing to help generalize ● Only preprocess things that can be reused in the domain ● Plus a crucial component to a company: PRODUCT ● ….
  • 23. 23 Traditional NLP ● Depending on language model – Lemmatization (cats → cat) – Part of Speech (cat → NOUN) – Dependency tree (I → subject) ✓Allows generalization! ✗Depends on language model getting it right ✗Feature explosion makes it difficult to generalize ✗Creating rules can be time consuming ✓ It’s very transparant ✓ Rules can give guarantees deep learning can’t
  • 24. 24 Model explanation ● spaCy (https://spacy.io/) ● Rule uses: – Part of Speech – Lemma – Negation For a sentence, - need/want... is the lemmatized verb - which has PRODUCT as a nearby child - but no negation dependency on the same level or above as PRODUCT - the subject is “I” - ???
  • 25. 25 Case study Q: i want a case for my iphone 6 A: we have got you covered, pvkooten get a fab case @ att.com/search?q=iphone 6 case%20cover%20 g & dm me for $$$ off! ^fakeAT&Tbot t.co/J4c8G3KR1S want (VERB, ROOT) ✓ ---- I (PRON, nsubj) ✓ ---- case (NOUN, dobj) -------- a (DET, det) -------- for (ADP, prep) ------------ iphone (NOUN, pobj) ✓ ---------------- my (ADJ, poss) ---------------- 6 (NUM, nummod)
  • 26. 26 Examples want galaxy not iphone not iphone ✓ want iphone!
  • 28. 28 Generative neural network model ● Popularized by Google’s seq2seq: A general-purpose encoder-decoder framework for Tensorflow that can be used for Machine Translation, Text Summarization, Conversational Modeling, Image Captioning, and more. ● Map sequence of words to another sequence of words ● Chinese sentence to English sentence ● Given training examples, minimize loss ● Can also be used for creating a chatbot ● Most likely a variation on this model is being used by Google Translate
  • 30. 30 Generative neural network model https://indico.io/blog/sequence-modeling-neuralnets-part1/
  • 31. 31 Code ● tensorflow ● Started generative model code from Conchylicultor/DeepQA
  • 32. 32 Neural network parameters ● batch_size: 128 ● glob_step: 3186 ● max_length: 25 ● learning_rate: 0.001 ● embedding_size: 128 ● hidden_size: 512 ● num_layers: 2 --------------------- loss: 0.06 time: 24 hours
  • 33. 33 Conclusion ● It makes small grammar mistakes ● It can still make mistakes in logic – Mostly with unseen data ● Not optimal because 2 models, but fun experiment ● Like OpenAI…. ● Shows signs of sentiment: Q: I want an iphone but my mom says iphones are the devil A: do not get upset! order an iphone at att.com/search?q=iphone iphones with…

Editor's Notes

  1. - Share a project of mine, I’m enthousiastic about - Thank you for coming - Did you come for deep learning? - NLP? - Chatbot?
  2. I really like it at Jibes, if you’re living in the Netherlands and you’re passionate about python and data…. Contact us
  3. First web Then phone apps Now chatbots, in e.g. facebook messenger Or on website Powerful:
  4. First web Then phone apps Now chatbots, in e.g. facebook messenger Or on website
  5. Tweets through API API
  6. Tweets through API API
  7. “Full stack” chatbot Capable of replying to tweets…. → talking you into buying an iphone I want an iphone for xmas Backend in Python Frontend in Angular 2
  8. - Markov chain - I would like to eat - I would like to drink
  9. - Find out what globstep is - twitter limited to 140 characters - this data set ~25 tokens -