8. CHATBOT CONTEXT
• The chatbot framework needs a structure in which conversational intents are
defined (this can be a JSON file)
• A conversational intent contains:
• tag (a unique name)
• patterns (sentence patterns for the neural network text classifier)
• responses (one of which will be used as the reply)
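A minimal sketch of such an intents structure; the tags, patterns, and responses below are illustrative placeholders, not taken from the slides:

```python
import json

# Hypothetical intents definition mirroring the structure above:
# each intent has a unique tag, training patterns, and candidate responses.
intents = {
    "intents": [
        {
            "tag": "greeting",
            "patterns": ["Hi", "How are you", "Good day"],
            "responses": ["Hello!", "Good to see you again."],
        },
        {
            "tag": "hours",
            "patterns": ["What hours are you open?", "When do you open?"],
            "responses": ["We are open 9am-5pm every day."],
        },
    ]
}

# Serialised form that would live in the JSON file.
intents_json = json.dumps(intents, indent=2)
tags = [intent["tag"] for intent in intents["intents"]]
```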
13. KEY PARAMETERS
• Cost function - a score for each set of candidate parameters; it sums the
errors in prediction. The higher the cost, the worse the model
parameters are
• Epoch - one full pass through all the training data to update the model
parameters
• Learning rate - the size of each learning step
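These three terms can be seen in a toy gradient-descent loop (a generic illustration, not code from the slides), fitting a single parameter w so that w * x predicts y:

```python
# Toy gradient descent fitting y = w * x, illustrating the three terms:
# cost (sum of squared prediction errors), epochs, and learning rate.
xs = [1.0, 2.0, 3.0]
ys = [2.0, 4.0, 6.0]          # generated with true w = 2

def cost(w):
    # Sum of squared prediction errors for candidate parameter w.
    return sum((w * x - y) ** 2 for x, y in zip(xs, ys))

w = 0.0
learning_rate = 0.01          # the size of each learning step
for epoch in range(200):      # one epoch = one pass over all the data
    grad = sum(2 * (w * x - y) * x for x, y in zip(xs, ys))
    w -= learning_rate * grad # step against the gradient of the cost
```

After enough epochs w converges to 2, where the cost is (near) zero.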
18. WHY TENSORFLOW?
• TensorFlow has become the tool of choice to implement machine
learning solutions
• Developed by Google and supported by its flourishing community
• Gives a way to easily implement industry-standard code
20. STEP 1: PREPARING DATA
• Tokenise patterns into arrays of words
• Lower-case and stem all words (example: Pharmacy => pharm) so that
related word forms map to the same token
• Create a list of classes - the intents
• Create a list of documents - pairs combining each pattern with its
intent
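The step above can be sketched as follows; the `stem` function here is a deliberately naive suffix-stripper standing in for a real stemmer (tutorials of this kind often use NLTK's LancasterStemmer), and the patterns are illustrative:

```python
# Sketch of Step 1: tokenise, stem, and build the classes/documents lists.
def tokenise(sentence):
    return sentence.split()

def stem(word):
    # Naive suffix stripping, just to show the idea: Pharmacy -> pharm.
    word = word.lower()
    for suffix in ("acy", "ing", "s"):
        if word.endswith(suffix):
            return word[: -len(suffix)]
    return word

patterns = {  # hypothetical intents and patterns
    "greeting": ["Hi there", "Good day"],
    "pharmacy_search": ["Find a pharmacy", "Pharmacy near me"],
}

words, classes, documents = [], [], []
for intent, sentences in patterns.items():
    classes.append(intent)                      # list of classes = intents
    for sentence in sentences:
        tokens = [stem(t) for t in tokenise(sentence)]
        words.extend(tokens)
        documents.append((tokens, intent))      # pattern/intent pairs

vocabulary = sorted(set(words))
```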
22. STEP 2: PREPARINGTENSORFLOW INPUT
• [X: [0, 0, 0, 1, 0, 1, 0, 1, 0, 0, 0, 0, 0, 1, ...N], Y: [0, 0, 1, 0, 0, 0, ...M]]
• [X: [0, 1, 0, 0, 0, 1, 0, 0, 0, 0, 1, 0, 1, 0, ...N], Y: [0, 0, 0, 1, 0, 0, ...M]]
• Array representing the pattern with 0/1. N = vocabulary size; 1 when the
word at that vocabulary position occurs in the pattern
• Array representing the intent with 0/1. M = number of intents; 1 at the
position of the current intent in the list of intents/classes
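A sketch of this encoding, using a small hypothetical vocabulary and class list:

```python
# Sketch of Step 2: turn each document into a 0/1 pattern vector X
# (length N = vocabulary size) and a 0/1 intent vector Y
# (length M = number of intents).
vocabulary = ["day", "good", "hi", "near", "pharm", "there"]   # N = 6
classes = ["greeting", "pharmacy_search"]                      # M = 2

def encode(tokens, intent):
    x = [1 if word in tokens else 0 for word in vocabulary]
    y = [0] * len(classes)
    y[classes.index(intent)] = 1
    return x, y

x, y = encode(["pharm", "near", "me"], "pharmacy_search")
```

Words not in the vocabulary (like "me" here) simply contribute no 1s.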
24. STEP 3: TRAINING NEURAL NETWORK
• Use tflearn - a deep learning library featuring a higher-level API for TensorFlow
• Define X input shape - equal to word vocabulary size
• Define two layers with 8 hidden neurons each - found optimal for this text
classification task (based on experiments)
• Define Y input shape - equal to number of intents
• Apply regression to find the best equation parameters
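The slides define this network with tflearn; as a dependency-free illustration of just the shapes involved (a vocabulary-sized input, two 8-neuron hidden layers, and a softmax output over the intents), here is a toy forward pass with placeholder zero weights:

```python
import math

# Shape sketch of the Step 3 network (no tflearn dependency): input of
# vocabulary size N, two hidden layers of 8 neurons, softmax output of
# M intents. Weights are zero placeholders; only the shapes matter here.
N, HIDDEN, M = 6, 8, 2

def layer(inputs, n_out):
    # Fully connected layer with placeholder zero weights.
    weights = [[0.0] * len(inputs) for _ in range(n_out)]
    return [sum(w * i for w, i in zip(row, inputs)) for row in weights]

def softmax(zs):
    exps = [math.exp(z) for z in zs]
    total = sum(exps)
    return [e / total for e in exps]

x = [0, 0, 0, 1, 1, 0]            # bag-of-words input, length N
h1 = layer(x, HIDDEN)             # first hidden layer: 8 neurons
h2 = layer(h1, HIDDEN)            # second hidden layer: 8 neurons
probs = softmax(layer(h2, M))     # output: one probability per intent
```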
25. STEP 3: TRAINING NEURAL NETWORK
• Define the Deep Neural Network (DNN) model
• Run model.fit to construct the classification model, providing the X/Y
inputs, the number of epochs, and the batch size
• In each epoch, multiple operations are executed to find the optimal
model parameters for classifying future inputs (converted to 0/1 arrays)
26. STEP 3: TRAINING NEURAL NETWORK
• Batch size:
• A smaller batch size requires less memory - especially important for datasets
with a large vocabulary
• Networks typically train faster with smaller batches, because weights and
network parameters are updated after each batch is propagated
• However, the smaller the batch, the less accurate the estimate of the
gradient (the direction in which the cost decreases) can be
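The interaction between epochs and batch size can be sketched generically (not tflearn's internals, just the bookkeeping): with B samples per batch, parameters are updated once per batch, so smaller batches mean more frequent, noisier updates.

```python
# Sketch: with 10 samples and batch_size 3, each epoch yields 4 batches
# (the last one partial), and the parameters are updated once per batch.
data = list(range(10))            # stand-in for 10 training samples
batch_size = 3
updates = 0
for epoch in range(2):            # 2 epochs over all the data
    for start in range(0, len(data), batch_size):
        batch = data[start:start + batch_size]
        # ... compute gradient estimate from `batch`, update weights ...
        updates += 1
```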
28. STEP 4: INITIAL MODEL TESTING
• Tokenise input sentence - split it into array of words
• Create bag of words (array with 0/1) for the input sentence - array
equal to the size of vocabulary, with 1 for each word found in input
sentence
• Run model.predict with the resulting bag-of-words array; it returns a
probability for each intent
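The input-side preparation can be sketched as follows (the vocabulary is a hypothetical example; a real pipeline would also stem the tokens, as in Step 1):

```python
# Sketch of Step 4: tokenise the input sentence and build its bag-of-words
# vector - the same 0/1 representation the model was trained on.
vocabulary = ["day", "good", "hi", "open", "pharm", "you"]

def bag_of_words(sentence):
    tokens = sentence.lower().split()           # tokenise
    return [1 if word in tokens else 0 for word in vocabulary]

bow = bag_of_words("Hi good day")
# model.predict([bow]) would now return one probability per intent.
```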
30. STEP 5: REUSE TRAINED MODEL
• For better reusability, it is recommended to create a separate
TensorFlow notebook to handle classification requests
• We can reuse the previously created DNN model by reloading it,
restoring its supporting data structures with pickle
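A sketch of the pickle round-trip for the supporting data structures; what exactly gets pickled (here a hypothetical vocabulary and class list) is an assumption, and the trained model itself would be reloaded with the library's own load call:

```python
import os
import pickle
import tempfile

# Sketch of Step 5: persist the data structures the classifier needs
# so a separate notebook/process can reload them.
state = {
    "words": ["day", "good", "hi", "pharm"],           # hypothetical
    "classes": ["greeting", "pharmacy_search"],        # hypothetical
}

path = os.path.join(tempfile.mkdtemp(), "training_data")
with open(path, "wb") as f:       # in the training notebook
    pickle.dump(state, f)

with open(path, "rb") as f:       # in the classification notebook
    restored = pickle.load(f)
```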
32. STEP 6: TEXT CLASSIFICATION
• Define a REST interface so that the classification function is accessible
outside the TensorFlow notebook
• Convert the incoming sentence into a bag-of-words array and run
model.predict
• Keep only results with probability higher than 0.25, to filter out noise
• Return all identified intents (if any), together with their assigned probabilities
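The filtering step can be sketched like this (the class names and probabilities are illustrative; a REST layer such as a Flask route would simply wrap this function):

```python
# Sketch of Step 6's filtering: keep only intents whose predicted
# probability clears the 0.25 noise threshold, sorted best-first.
ERROR_THRESHOLD = 0.25
classes = ["greeting", "goodbye", "hours", "pharmacy_search"]

def classify(probabilities):
    results = [
        (classes[i], p)
        for i, p in enumerate(probabilities)
        if p > ERROR_THRESHOLD
    ]
    return sorted(results, key=lambda r: r[1], reverse=True)

# e.g. one probability per intent, as model.predict would return:
matches = classify([0.05, 0.10, 0.30, 0.55])
```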