[PR12] PR-063: Peephole predicting network performance before training

•

0 likes•617 views

Paper review for "Peephole: Predicting Network Performance Before Training (2017)" https://www.youtube.com/watch?v=ZO4bXgdcCQA

Paper reviewed by Taegyun Jeon
Peephole: Predicting Network
Performance Before Training
Boyang Deng, Junjie Yan, Dahua Lin,
“Peephole: Predicting Network Performance Before Training” (2017)
https://arxiv.org/abs/1712.03351
[TensorFlow-KR] PR12

배경 | 높은 성능을 얻으려면?
▪ 결론: 좋은 네트워크를 써야한다.
[PR12] Peephole: Predicting Network Performance Before Training (2017) Page 2

배경 | 좋은 네트워크를 얻으려면?
▪ 2가지 고려요소
▫ Large design space
• For Convolutional Neural Networks (CNN)
◦ the number of layers
◦ the number of channels within these layers
◦ whether to insert a pooling layer at certain points
▫ Costly training process
• Z. Zhong, J. Yan, and C. L. Liu. “Practical network blocks design with q-learning”. arXiv preprint
arXiv:1708.05552, 2017.
• B. Zoph and Q. V. Le. “Neural architecture search with reinforcement learning”. arXiv preprint
arXiv:1611.01578, 2016.
• B. Zoph, V. Vasudevan, J. Shlens, and Q. V. Le. “Learning transferable architectures for scalable
image recognition.” arXiv preprint arXiv:1707.07012, 2017.
[PR12] Peephole: Predicting Network Performance Before Training (2017) Page 3

문제정의 | 모델 성능 예측
[PR12] Peephole: Predicting Network Performance Before Training (2017) Page 4

아이디어 | “네트워크 구조에 대한 성능”을 학습
[PR12] Peephole: Predicting Network Performance Before Training (2017) Page 5
𝑦 = 𝑓(𝑥, 𝑡)

제안 | “네트워크 구조” 표현
▪ Unified Layer Code and Layer Embedding
▫ Integer code: TY, KW, KH, CH
• index of 8-bins: CH = [0.25, 0.5, 0.75, 1.0, 1.5, 2.0, 2.5, 3.0]
▫ Layer embedding
• Hidden state of LSTM cell: structural features
• Epoch index: embedded into real-vector
[PR12] Peephole: Predicting Network Performance Before Training (2017) Page 6

제안 | “네트워크 구조”와 “성능” 의 데이터
▪ 막연한 생각
▫ Random sampling sequences of layers
• The design space grows exponentially as the number of layers increases.
• Many combinations of layers are not reasonable options from a practical point of view.
▪ Block-based generation
▫ Skeleton + generated blocks
▫ One block contains less than 10 layers
• First layer is convolution layer w/ random
kernel size.
▫ Markov chain
• For predefined transition prob.
from practical networks
▫ Restrict the number of convolution layers
within a block to less than 4
▫ 1x1 convolution for dimension matching
[PR12] Peephole: Predicting Network Performance Before Training (2017) Page 7

제안 | 기존 네트워크 구조 = Markov Chain
[PR12] Peephole: Predicting Network Performance Before Training (2017) Page 8

제안 | “X:네트워크, Y:성능” 데이터셋
▪ 데이터셋 구성
▫ N 개의 네트워크: {𝑥𝑖}1:𝑁
▫ Performance curves 𝑦𝑖(𝑡)
• Training data로 학습시키면서 epoch 𝑡에서 validation data에 대한 validation accuracy
▫ 𝒟 = {𝑥𝑖, 𝑦𝑖}1:𝑁
▪ Objective function with smooth L1 loss
▫ ℒ(𝒟; 𝜃) =
1
𝑁
σ𝑖=1
𝑛
𝑙(𝑓 𝑥𝑖, 𝑇 , 𝑦𝑖(𝑇))
[PR12] Peephole: Predicting Network Performance Before Training (2017) Page 9

실험 | 무엇을 학습할 것인가?
[PR12] Peephole: Predicting Network Performance Before Training (2017) Page 10

실험 | 무엇을 학습할 것인가?
[PR12] Peephole: Predicting Network Performance Before Training (2017) Page 11

실험 | 무엇을 학습할 것인가?
▪ Comparison
▫ Bayesian Neural Networks and 𝜐-SVR (Support Vector Regression)
▪ Evaluation metrics
▫ Mean Square Error (MSE)
▫ Kendall’s Tau (Tau)
▫ Coefficient of Determination (R2)
[PR12] Peephole: Predicting Network Performance Before Training (2017) Page 12

실험 | Transfer to ImageNet
▪ a
[PR12] Peephole: Predicting Network Performance Before Training (2017) Page 13

결론 |
▪ Block-based generation
▫ Skeleton + generated blocks
▪ 다른 요소들에 대한 실험은..?
▫ Residual block, Dense connection 등
▪ 결국 평가를 위해선 모든 세팅에 대한 학습 필요
▪ Transfer learning을 위한 최적의 방법인가?
[PR12] Peephole: Predicting Network Performance Before Training (2017) Page 14

https://telecombcn-dl.github.io/2017-dlcv/ Deep learning technologies are at the core of the current revolution in artificial intelligence for multimedia data analysis. The convergence of large-scale annotated datasets and affordable GPU hardware has allowed the training of neural networks for data analysis tasks which were previously addressed with hand-crafted features. Architectures such as convolutional neural networks, recurrent neural networks and Q-nets for reinforcement learning have shaped a brand new scenario in signal processing. This course will cover the basic principles and applications of deep learning to computer vision problems, such as image classification, object detection or image captioning.

Intepretability / Explainable AI for Deep Neural Networks

Universitat Politècnica de Catalunya

Unsupervised Learning (D2L6 2017 UPC Deep Learning for Computer Vision)

Universitat Politècnica de Catalunya

Deep Learning for Computer Vision: Visualization (UPC 2016)

Universitat Politècnica de Catalunya

http://imatge-upc.github.io/telecombcn-2016-dlcv/ Deep learning technologies are at the core of the current revolution in artificial intelligence for multimedia data analysis. The convergence of big annotated data and affordable GPU hardware has allowed the training of neural networks for data analysis tasks which had been addressed until now with hand-crafted features. Architectures such as convolutional neural networks, recurrent neural networks and Q-nets for reinforcement learning have shaped a brand new scenario in signal processing. This course will cover the basic principles and applications of deep learning to computer vision problems, such as image classification, object detection or text captioning.

Deep Learning for Computer Vision: Transfer Learning and Domain Adaptation (U...

Universitat Politècnica de Catalunya

DeepFix: a fully convolutional neural network for predicting human fixations...

Universitat Politècnica de Catalunya

Convolutional neural networks for image classification — evidence from Kaggle...

Dmytro Mishkin

https://github.com/telecombcn-dl/dlmm-2017-dcu Deep learning technologies are at the core of the current revolution in artificial intelligence for multimedia data analysis. The convergence of big annotated data and affordable GPU hardware has allowed the training of neural networks for data analysis tasks which had been addressed until now with hand-crafted features. Architectures such as convolutional neural networks, recurrent neural networks and Q-nets for reinforcement learning have shaped a brand new scenario in signal processing. This course will cover the basic principles and applications of deep learning to computer vision problems, such as image classification, object detection or text captioning.

Deep 3D Visual Analysis - Javier Ruiz-Hidalgo - UPC Barcelona 2017

Universitat Politècnica de Catalunya

Transfer Learning and Domain Adaptation (D2L3 2017 UPC Deep Learning for Comp...

Universitat Politècnica de Catalunya

Convolutional Neural Networks (D1L3 2017 UPC Deep Learning for Computer Vision)

Universitat Politècnica de Catalunya

Object Detection (D2L5 Insight@DCU Machine Learning Workshop 2017)

Universitat Politècnica de Catalunya

https://telecombcn-dl.github.io/dlmm-2017-dcu/ Deep learning technologies are at the core of the current revolution in artificial intelligence for multimedia data analysis. The convergence of big annotated data and affordable GPU hardware has allowed the training of neural networks for data analysis tasks which had been addressed until now with hand-crafted features. Architectures such as convolutional neural networks, recurrent neural networks and Q-nets for reinforcement learning have shaped a brand new scenario in signal processing. This course will cover the basic principles and applications of deep learning to computer vision problems, such as image classification, object detection or image captioning.

D1L5 Visualization (D1L2 Insight@DCU Machine Learning Workshop 2017)

Universitat Politècnica de Catalunya

Convolutional neural networks 이론과 응용

홍배 김

Methodology (DLAI D6L2 2017 UPC Deep Learning for Artificial Intelligence)

Universitat Politècnica de Catalunya

Deep learning technologies are at the core of the current revolution in artificial intelligence for multimedia data analysis. The convergence of large-scale annotated datasets and affordable GPU hardware has allowed the training of neural networks for data analysis tasks which were previously addressed with hand-crafted features. Architectures such as convolutional neural networks, recurrent neural networks or Q-nets for reinforcement learning have shaped a brand new scenario in signal processing. This course will cover the basic principles of deep learning from both an algorithmic and computational perspectives.

DLD meetup 2017, Efficient Deep Learning

Brodmann17

Deep Neural Networks (D1L2 Insight@DCU Machine Learning Workshop 2017)

Universitat Politècnica de Catalunya

Image Segmentation (D3L1 2017 UPC Deep Learning for Computer Vision)

Universitat Politècnica de Catalunya

Convolutional Neural Network

Junho Cho

Deep Learning for Computer Vision: Unsupervised Learning (UPC 2016)

Universitat Politècnica de Catalunya

Optimizing Deep Networks (D1L6 Insight@DCU Machine Learning Workshop 2017)

Universitat Politècnica de Catalunya

Training Deep Networks with Backprop (D1L4 Insight@DCU Machine Learning Works...

Universitat Politècnica de Catalunya

Attention Models (D3L6 2017 UPC Deep Learning for Computer Vision)

Universitat Politècnica de Catalunya

Learning where to look: focus and attention in deep vision

Universitat Politècnica de Catalunya

Deep LearningフレームワークChainerと最近の技術動向

Shunta Saito

Finding the best solution for Image Processing

Tech Triveni

What is beyond using Tensorflow, GPU or TPU to process images seamlessly? Do we have a silver bullet for image processing? Over the years, image processing has picked up a different level of attraction. Everyone can think about its ease of usability because it has become a reality now. We have started seeing how Residual Neural Network architecture is being used for different cases and not only that, how Residual Neural network is being tweaked to solve different problems. Along with tweaking the ResNet, preprocessing is also being improved to support different architecture for this matter. Everyone has almost become cyborg already with mobile phones in our hands and apparently until human beings bring the AI/ML to the phones completely they are not taking any rest. We are going to see the development of different architecture and algorithms around running AI/ML on low configuration devices. In this session, we are going to talk about different research papers submitted for these matters and some implementations for the same as well.

DLD_WeightSharing_Slide

Kang-Ho Lee

What's hot

Deep Visual Saliency - Kevin McGuinness - UPC Barcelona 2017

Universitat Politècnica de Catalunya

Electricity price forecasting with Recurrent Neural Networks

Taegyun Jeon

Unsupervised Deep Learning (D2L1 Insight@DCU Machine Learning Workshop 2017)

Universitat Politècnica de Catalunya

Deep 3D Visual Analysis - Javier Ruiz-Hidalgo - UPC Barcelona 2017

Universitat Politècnica de Catalunya

Transfer Learning and Domain Adaptation (D2L3 2017 UPC Deep Learning for Comp...

Universitat Politècnica de Catalunya

Convolutional Neural Networks (D1L3 2017 UPC Deep Learning for Computer Vision)

Universitat Politècnica de Catalunya

Object Detection (D2L5 Insight@DCU Machine Learning Workshop 2017)

Universitat Politècnica de Catalunya

D1L5 Visualization (D1L2 Insight@DCU Machine Learning Workshop 2017)

Universitat Politècnica de Catalunya

Convolutional neural networks 이론과 응용

홍배 김

Methodology (DLAI D6L2 2017 UPC Deep Learning for Artificial Intelligence)

Universitat Politècnica de Catalunya

DLD meetup 2017, Efficient Deep Learning

Brodmann17

Deep Neural Networks (D1L2 Insight@DCU Machine Learning Workshop 2017)

Universitat Politècnica de Catalunya

Image Segmentation (D3L1 2017 UPC Deep Learning for Computer Vision)

Universitat Politècnica de Catalunya

Convolutional Neural Network

Junho Cho

Deep Learning for Computer Vision: Unsupervised Learning (UPC 2016)

Universitat Politècnica de Catalunya

Optimizing Deep Networks (D1L6 Insight@DCU Machine Learning Workshop 2017)

Universitat Politècnica de Catalunya

Training Deep Networks with Backprop (D1L4 Insight@DCU Machine Learning Works...

Universitat Politècnica de Catalunya

Attention Models (D3L6 2017 UPC Deep Learning for Computer Vision)

Universitat Politècnica de Catalunya

Learning where to look: focus and attention in deep vision

Universitat Politècnica de Catalunya

Deep LearningフレームワークChainerと最近の技術動向

Shunta Saito

What's hot (20)