3. Recap: feedforward and convnets
Main take-aways:
• Composition. Units/layers of a NN are modular and can be
composed to form complex architectures.
• Weight-sharing. Enforcing that weights be equal across a
set of units can dramatically decrease the number of parameters.
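As a toy illustration of the savings from weight-sharing (all sizes here are made up for the sketch, not from the slides): a dense layer needs one weight per input–output pair, while a 1-D convolution reuses one small filter at every position.

```python
import numpy as np

x = np.random.randn(1000)            # hypothetical 1000-long input signal
w = np.random.randn(5)               # one shared width-5 filter
y = np.convolve(x, w, mode="valid")  # the same 5 weights applied at every position

conv_params = w.size                 # convolution: just 5 shared weights
dense_params = x.size * y.size       # a dense layer mapping x to y: full matrix
```

Here `dense_params` is 996,000 versus 5 for the convolution, which is the "dramatic decrease" the bullet refers to.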
5. What are limitations of convnets?
• Fixed input length.
• Unclear how to adapt to time-series data.
• Convolution corresponds to strong prior—not appropriate
for many biological settings.
• Could require many labeled training examples (high sample
complexity).
22. LSTM summary
• LSTM is a variant of RNN that makes it easier to retain long-range interactions.
• Parameters of LSTM — one weight matrix/bias pair per gate:
forget: f_t = σ(W_f · [h_{t−1}, x_t] + b_f)
new memory: c̃_t = tanh(W_c · [h_{t−1}, x_t] + b_c)
weight of new memory (input): i_t = σ(W_i · [h_{t−1}, x_t] + b_i)
output: o_t = σ(W_o · [h_{t−1}, x_t] + b_o)
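The four gates can be sketched as a single plain-NumPy step function. The sizes, initialization, and toy sequence below are illustrative assumptions, not from the slides; the cell update c_t = f_t ⊙ c_{t−1} + i_t ⊙ c̃_t and hidden state h_t = o_t ⊙ tanh(c_t) are the standard LSTM recurrences.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def lstm_step(x, h_prev, c_prev, params):
    """One LSTM step; params holds one (W, b) pair per gate."""
    z = np.concatenate([h_prev, x])                      # shared input to every gate
    f = sigmoid(params["Wf"] @ z + params["bf"])         # forget gate
    c_tilde = np.tanh(params["Wc"] @ z + params["bc"])   # new memory (candidate)
    i = sigmoid(params["Wi"] @ z + params["bi"])         # input gate
    o = sigmoid(params["Wo"] @ z + params["bo"])         # output gate
    c = f * c_prev + i * c_tilde                         # blend old and new memory
    h = o * np.tanh(c)                                   # exposed hidden state
    return h, c

# Hypothetical sizes: 4-dim input, 3-dim hidden state.
rng = np.random.default_rng(0)
d_in, d_h = 4, 3
params = {}
for g in ["f", "c", "i", "o"]:
    params["W" + g] = rng.standard_normal((d_h, d_h + d_in)) * 0.1
    params["b" + g] = np.zeros(d_h)

h, c = np.zeros(d_h), np.zeros(d_h)
for t in range(5):                                       # unroll over a toy sequence
    h, c = lstm_step(rng.standard_normal(d_in), h, c, params)
```

Because the forget gate multiplies the previous cell state rather than overwriting it, gradients along c can flow across many steps, which is what makes long-range interactions easier to retain.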
23. LSTM application: enhancer/TF prediction
Input: 200 bp sequence
Convolutional architecture similar to before
Bi-directional LSTM
Output: 919-dimensional binary vector for the presence of TF binding/chromatin marks
Quang and Xie. DanQ. 2016
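Models like this consume the sequence as a one-hot matrix before the convolutional layers. A minimal sketch of that encoding, assuming the common A/C/G/T column order (not necessarily DanQ's exact convention):

```python
import numpy as np

def one_hot_dna(seq):
    """One-hot encode a DNA string into a (len, 4) array, columns = A, C, G, T."""
    lookup = {"A": 0, "C": 1, "G": 2, "T": 3}
    out = np.zeros((len(seq), 4))
    for i, base in enumerate(seq.upper()):
        if base in lookup:               # ambiguous bases (e.g. N) stay all-zero
            out[i, lookup[base]] = 1.0
    return out

x = one_hot_dna("ACGTN" * 40)            # a toy 200 bp input sequence
```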
24. Deep supervised learning
• Feedforward
• Convnets
• RNN, LSTM
Learning a nonlinear
mapping from inputs to
outputs.
Predicting:
TF binding,
gene expression,
disease status from images,
risk from SNPs,
protein structure
…
25. Deep unsupervised learning
• Nonlinear dimensionality reduction and pattern mining.
• In many settings, we have more unlabeled examples than labeled ones.
• Learn useful representations from unlabeled data.
• Better representation may improve prediction accuracy.
28. Autoencoder
h = f(W·x + b)
x̂ = g(W′·h + b′)
W, b = arg min_{W,b} Σ_x ||x − x̂||²
Train with backprop as before.
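A minimal single-hidden-layer autoencoder trained with plain gradient descent, with the gradients written out by hand. The toy data, layer sizes, learning rate, and tanh encoder are illustrative assumptions; the objective is the reconstruction loss ||x − x̂||² from the slide.

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.standard_normal((200, 8))          # toy data: 200 samples, 8 features
d_in, d_hid = 8, 3                         # hypothetical sizes; 3-dim bottleneck
W1 = rng.standard_normal((d_in, d_hid)) * 0.1; b1 = np.zeros(d_hid)
W2 = rng.standard_normal((d_hid, d_in)) * 0.1; b2 = np.zeros(d_in)
lr = 0.01

def forward(X):
    H = np.tanh(X @ W1 + b1)               # encoder: h = f(W·x + b)
    X_hat = H @ W2 + b2                    # decoder: x_hat = g(W'·h + b')
    return H, X_hat

def loss(X, X_hat):                        # mean over samples of ||x - x_hat||^2
    return np.mean(np.sum((X - X_hat) ** 2, axis=1))

_, X_hat0 = forward(X)
loss0 = loss(X, X_hat0)
for _ in range(500):                       # gradient descent (backprop by hand)
    H, X_hat = forward(X)
    G = 2 * (X_hat - X) / X.shape[0]       # d loss / d X_hat
    dW2 = H.T @ G; db2 = G.sum(0)
    GH = (G @ W2.T) * (1 - H ** 2)         # backprop through tanh
    dW1 = X.T @ GH; db1 = GH.sum(0)
    W1 -= lr * dW1; b1 -= lr * db1
    W2 -= lr * dW2; b2 -= lr * db2
_, X_hat1 = forward(X)
loss1 = loss(X, X_hat1)
```

No labels appear anywhere: the input serves as its own target, which is why this counts as unsupervised learning.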
29. Autoencoder
x̂ = g(f(x))
W, b = arg min_{W,b} Σ_x ||x − x̂||²
If encoding and decoding are linear, then…
What does this remind you of?
30. Autoencoder
x̂ = g(f(x))
W, b = arg min_{W,b} Σ_x ||x − x̂||²
If encoding and decoding are linear (x̂ = W′·W·x), then:
Linear autoencoder is basically just
PCA!
General f and g correspond to
nonlinear dimensionality reduction.
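A short sketch of the PCA connection: the best rank-k linear reconstruction of centered data comes from projecting onto the top-k principal directions, which the SVD provides directly. The toy data and choice of k are arbitrary assumptions for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.standard_normal((100, 6)) @ rng.standard_normal((6, 6))
X = X - X.mean(axis=0)                   # center, as PCA assumes
k = 2                                    # bottleneck dimension

# The optimal linear autoencoder of width k reconstructs X by projecting
# onto the top-k principal subspace.
U, S, Vt = np.linalg.svd(X, full_matrices=False)
P = Vt[:k]                               # top-k principal directions (encoder)
X_hat = (X @ P.T) @ P                    # encode then decode, both linear

err = np.sum((X - X_hat) ** 2)
# Eckart-Young: this error equals the sum of the discarded squared singular values.
residual = np.sum(S[k:] ** 2)
```

Here the encoder W = P and decoder W′ = Pᵀ achieve the minimum reconstruction error over all linear maps of rank k, which is the sense in which a linear autoencoder "is basically just PCA".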
40. Application: deep patient
Each patient = vector of 41k clinical descriptors
Stack of 3 denoising autoencoders
500 dim representation of each patient
Miotto et al. DeepPatient. 2016
41. Application: deep patient
500 dim representation of each patient
Random forest to predict future disease
Miotto et al. DeepPatient. 2016