PR12-193 NISP: Pruning Networks using Neural Importance Score Propagation

PR-12 presentation
NISP: Pruning Networks using Neuron Importance Score Propagation
CVPR2018

Authors: Ruichi Yu et al

Presented by Taesu Kim

Motivation
• Pruning
• Previous approaches
• Focus on single layer or two layers’ statistics
• Greedy pruning
• Entire network is a whole
• Error propagates, especially when network is deep

Motivation
• Entire CNN is a set of feature extractors
• The final responses are the extracted features
• We measured the importance of the neurons across the entire CNN based on
a unified goal
• Minimizing the reconstruction errors of (important) final responses

Approach
• Feature ranking on the ﬁnal response layer
• NISP: Neuron Importance Score Propagation
• Pruning network using NISP
• Fine-tune the pruned network

NISP: Objective function
• ! , !
• !
• a binary vector ! : neuron prune indicator for the l-th layer
• !
• ! , !
• ! , ! , !

Solution
• The network pruning problem can be formulated as a binary integer program
• Fining the optimal neuron prune indicator s
• It is hard to obtain efﬁcient analytical solutions by directly optimizing the objective
• a sub-optimal solution can be obtained by minimizing the upper bound
• ! !
• Assume the activation function ! is Lipschitz continuous: Identity, ReLU, sigmoid, tanh, etc.
• Lipschitz continuous if ! , ! ,

Solution
• ! , ! , !
• ! , !
• ! ! , ! !
• !
• !

Solution
• Backpropagate the importance values

Experiments
• Comparison with random pruning and training-from-scratch baseline
• randomly pruning the pre-trained CNN and then ﬁne-tuning
• training a small CNN with the same number of neurons/ﬁlters per layer as our pruned model
• !

Experiments
• Feature selection vs. Magnitude of weights
• NISP-FS: using feature selection method in [34]
• NISP-Mag: considering only magnitude of weights
•
[34] Inﬁnite feature selection. G. Roffo et al. ICCV 2015

Experiments
• NISP vs. Layer-by-Layer pruning
•

Experiments
• Comparison with existing methods
[11] Acceleration through elimination of redundant convolutions, M. Figurnov et al, NIPS2016
[20] Compression of deep convolutional neural networks for fast and low power mobile applications,
Y. Kim et al, ICLR 2016
[36] Learning the architecture of deep neural networks, S. Srinivas et al, BMVC 2016
[25] Pruning filters for efficient convnets, H. Li et al, ICLR 2017
[29] Thinnet: A filter level pruning method for deep neural network compression, J.-H. Luo et al ICCV 2017
NISP-A: pruning all conv layers
NISP-B: pruning all conv layers except conv5
NISP-C: pruning all conv layers except conv5, conv4
NISP-D: pruning all conv layers except conv2, conv3, FC6
NISP-x-A: prune 15% filters of each layer
NISP-x-B: prune 25% filters of each layer

Conclusion
• Generic framework for network compression and acceleration based on identifying
the importance levels of neurons
• Neuron importance scores in the layer of interest are obtained by feature ranking
• Formulate the network pruning problem as a binary integer program
• Obtain a closed-form solution to a relaxed version of the formulation
• NISP algorithm propagates the importance to every neuron in the whole network
• It efﬁciently reduces CNN redundancy and achieves full-network acceleration and
compression

PR12-193 NISP: Pruning Networks using Neural Importance Score Propagation

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Similar to PR12-193 NISP: Pruning Networks using Neural Importance Score Propagation

Similar to PR12-193 NISP: Pruning Networks using Neural Importance Score Propagation (20)

More from Taesu Kim

More from Taesu Kim (6)

Recently uploaded

Recently uploaded (20)

PR12-193 NISP: Pruning Networks using Neural Importance Score Propagation