Long Short Term Memory

LSTM
(Long short-term Memory)
Cheng Zhan
Data Scientist

• We are the sum total of our experiences. None of us are the same as
we were yesterday, nor will be tomorrow. (B.J. Neblett)
• memorization: every time we gain new information, we store it for
future reference.
• combination: not all tasks are the same, so we couple our analytical
skills with a combination of our memorized, previous experiences to
reason about the world.
Human Learning

Outline
• Review of RNN
• SimpleRNN
• LSTM (forget, input and output gates)
• GRU (reset and update gates)
• LSTM
• Motivation
• Introduction
• Code example

Review of RNN
• Jack Ma is a Chinese business magnate, and his native language is ____

Summary
• Hidden state can be viewed as “memory”, which tries to capture
previous information.
• Output is determined by current input and all the “memory”
• Hidden stage cannot capture all the information
• Unlike CNN, RNN shares the same parameters W

Solutions
• Adding Skip Connections through Time
• Removing Connections
• Changing Activation Functions
• LSTMs

• Sample may refer to individual training examples. A “batch_size” variable is
hence the count of samples you sent to the neural network. That is, how
many different examples you feed at once to the neural network.
• Time Steps are ticks of time. It is how long in time each of your samples
are. For example, a sample can contain 128 time steps, where each time
steps could be a 30th of a second for signal processing. In Natural Language
Processing (NLP), a time step may be associated with a character, a word,
or a sentence, depending on the setup.
• Features are simply the number of dimensions we feed at each time steps.
For example in NLP, a word could be represented by 300 features
using word2vec. In the case of signal processing, let’s pretend that your
signal is 3D. That is, you have a X, a Y and a Z signal, such as an
accelerometer’s measurements on each axis. This means you would have
3 features sent at each time step for each sample.

• http://harinisuresh.com/2016/10/09/lstms/
• http://colah.github.io/posts/2015-08-Understanding-LSTMs/
• http://cs231n.stanford.edu/slides/2018/cs231n_2018_lecture10.pdf
• https://machinelearningmastery.com/time-series-prediction-lstm-
recurrent-neural-networks-python-keras/
• https://www.youtube.com/watch?v=xCGidAeyS4M
• https://www.youtube.com/watch?v=rTqmWlnwz_0

Long Short Term Memory

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Similar to Long Short Term Memory

Similar to Long Short Term Memory (20)

More from Yan Xu

More from Yan Xu (20)

Recently uploaded

Recently uploaded (20)

Long Short Term Memory