SlideShare a Scribd company logo
1 of 28
Download to read offline
Searching for MobileNetV3
ICCV2019
Wonboem Jang
Introduction
This paper describes the approach we took to develop
MobileNetV3 Large and Small models
1. complementary search techniques
2. new efficient versions of nonlinearities practical for
the mobile setting
3. new efficient network design
4. a new efficient segmentation decoder
MobileNet V3
Recap: MobileNet V1
Depth-wise separable Convolution
Depthwise Convolution Pointwise Convolution
• Standard Convolution cost:
ℎ𝑖 × 𝑤𝑖 × 𝑑𝑗 × 𝑘 × 𝑘 × 𝑑𝑖
• Deptwise seperable convolution
cost
ℎ𝑖 × 𝑤𝑖 × 𝑑𝑗 × (𝑘2 + 𝑑𝑖)
• Reduction in computation
1
𝑑 𝑗
+
1
𝑘2
Recap: MobileNet V2
Linear Bottlenecks
• If the manifold of interest remains non-zero volume after ReLU transformation, it
corresponds to a linear transformation.
• ReLU is capable of preserving complete information about the input manifold, but only if the
input manifold lies in a low-dimensional subspace of the input space.
• we can capture this by inserting linear bottleneck layers into the convolutional blocks.
Recap: MobileNet V2
Inverted Residual
Residual Inverted Residual
wide → narrow → wide narrow → wide → narrow
ℎ × 𝑤 × 𝑡 × 𝑑′ × 𝑑′ + ℎ × 𝑤 × 𝑡 × 𝑑′ × 𝑘 × 𝑘 + ℎ × 𝑤 × 𝑑′′ × 𝑡 × 𝑑′′
= ℎ × 𝑤 × 𝑑′ × 𝑡 × (𝑑′ + 𝑘2 + 𝑑′′)
Block of size ℎ × 𝑤, expansion factor 𝑡, kernel size 𝑘, input channel 𝑑′,
output channel 𝑑′′
Recap: MnasNet
Inverted Residual
Recap: MnasNet
Inverted Residual
Recap: NetAdapt
Recap: NetAdapt
Efficient Mobile Building Block
we use a combination of these layers as building blocks
in order to build the most effective models.
upgraded with modified swish nonlinearities → h-swish
Combination of layers
1. MobileNetV1: depthwise separable convolutions
2. MobileNetV2: linear bottleneck, inverted residual
structure
3. MnasNet: lightweight attention modules based on
squeeze and excitation into the bottleneck structure
MobileNetV3
Network Search
Platform-Aware NAS for Block-wise Search
𝜔 = −0.17 vs the original 𝜔 = −0.07
We observe that accuracy changes much more
dramatically with latency for small models
𝑚𝑎𝑥𝑖𝑚𝑖𝑧𝑒 𝐴𝐶𝐶(𝑚) ×
𝐿𝐴𝑇(𝑚)
𝑇
𝜔
𝑚
Network Search
NetAdapt for Layer-wise Search
1. Starts with a seed network architecture found by platform-aware NAS.
2. For each step:
(a) Generate a set of new proposals. Each proposal represents a modification of an
architecture that generates at least δ reduction in latency compared to the previous step.
(b) For each proposal we use the pre-trained model from the previous step and populate the
new proposed architecture, truncating and randomly initializing missing weights as
appropriate. Finetune each proposal for T steps to get a coarse estimate of the accuracy.
(c) Selected best proposal according to some metric.
3. Iterate previous step until target latency is reached.
Network Search
Proposal
1. Reduce the size of any expansion layer;
2. Reduce bottleneck in all blocks that share the same bottleneck size - to maintain residual
connections
𝑚𝑎𝑥𝑖𝑚𝑖𝑧𝑒
∆Acc
|∆latency|
→ maximize tan
Evaluation
Network Improvements
Redesigning Expensive Layers
We redesign the computionally-expensive layers at the beginning and the end of the network.
These modifications are outside of the scope of the current search space.
1. Current models based on MobileNetV2’s inverted bottleneck structure and variants use 1x1
convolution. This layer is critically important in order to have rich features for prediction.
However, this comes at a cost of extra latency.
→ Move this layer (1) past the final average pooling
2. The previous bottleneck projection layer is no longer needed to reduce computation.
→ Remove the projection and filtering layers in the previous bottleneck layer
Network Improvements
Redesigning Expensive Layers
Network Improvements
Redesigning Expensive Layers
We experimented with reducing the number of filters and using different nonlinearities to try and
reduce redundancy.
We were able to reduce the number of filters to 16 while maintaining the same accuracy as 32
filters using either ReLU or swish.
Network Improvements
Nonlinearities
While this nonlinearity improves accuracy, it comes with non-zero cost in embedded
environments as the sigmoid function is much more expensive to compute on mobile devices.
Network Improvements
Nonlinearities
First, optimized implementations of ReLU6 are available on virtually all software and hardware
frameworks.
Second, in quantized mode, it eliminates potential numerical precision loss caused by different
implementations of the approximate sigmoid.
Finally, in practice, h-swish can be implemented as a piece-wise function to reduce the number of
memory accesses driving the latency cost down substantially
Network Improvements
Nonlinearities
The cost of applying nonlinearity decreases as we go deeper into the network, since each layer
activation memory typically halves every time the resolution drops.
Thus in our architectures we only use h-swish at the second half of the model.
Network Improvements
Large squeeze-and-excite
we replace them all to fixed to be 1/4 of the number of channels in expansion layer.
Experiments
Architecture
Experiments
Results - Classification
Experiments
Results - Classification
Experiments
Results - Classification
Experiments
Results - Classification
Experiments
Results - Detection
SSDLite
Experiments
Results - Semantic Segmentation
R-ASPP
Experiments
Results - Semantic Segmentation
R-ASPP

More Related Content

What's hot

Machine Learning - Convolutional Neural Network
Machine Learning - Convolutional Neural NetworkMachine Learning - Convolutional Neural Network
Machine Learning - Convolutional Neural NetworkRichard Kuo
 
Convolutional Neural Network and Its Applications
Convolutional Neural Network and Its ApplicationsConvolutional Neural Network and Its Applications
Convolutional Neural Network and Its ApplicationsKasun Chinthaka Piyarathna
 
Convolution Neural Network (CNN)
Convolution Neural Network (CNN)Convolution Neural Network (CNN)
Convolution Neural Network (CNN)Basit Rafiq
 
Convolutional Neural Network - CNN | How CNN Works | Deep Learning Course | S...
Convolutional Neural Network - CNN | How CNN Works | Deep Learning Course | S...Convolutional Neural Network - CNN | How CNN Works | Deep Learning Course | S...
Convolutional Neural Network - CNN | How CNN Works | Deep Learning Course | S...Simplilearn
 
Overview of Convolutional Neural Networks
Overview of Convolutional Neural NetworksOverview of Convolutional Neural Networks
Overview of Convolutional Neural Networksananth
 
Recurrent Neural Networks. Part 1: Theory
Recurrent Neural Networks. Part 1: TheoryRecurrent Neural Networks. Part 1: Theory
Recurrent Neural Networks. Part 1: TheoryAndrii Gakhov
 
Convolutional Neural Network (CNN) - image recognition
Convolutional Neural Network (CNN)  - image recognitionConvolutional Neural Network (CNN)  - image recognition
Convolutional Neural Network (CNN) - image recognitionYUNG-KUEI CHEN
 
Convolutional neural network
Convolutional neural networkConvolutional neural network
Convolutional neural networkItachi SK
 
Introduction to Recurrent Neural Network
Introduction to Recurrent Neural NetworkIntroduction to Recurrent Neural Network
Introduction to Recurrent Neural NetworkKnoldus Inc.
 
Convolutional Neural Network (CNN)
Convolutional Neural Network (CNN)Convolutional Neural Network (CNN)
Convolutional Neural Network (CNN)Abdulrazak Zakieh
 
Image classification using cnn
Image classification using cnnImage classification using cnn
Image classification using cnnSumeraHangi
 
Convolutional Neural Network (CNN)
Convolutional Neural Network (CNN)Convolutional Neural Network (CNN)
Convolutional Neural Network (CNN)Muhammad Haroon
 
Image classification using CNN
Image classification using CNNImage classification using CNN
Image classification using CNNNoura Hussein
 

What's hot (20)

Machine Learning - Convolutional Neural Network
Machine Learning - Convolutional Neural NetworkMachine Learning - Convolutional Neural Network
Machine Learning - Convolutional Neural Network
 
Convolutional Neural Network and Its Applications
Convolutional Neural Network and Its ApplicationsConvolutional Neural Network and Its Applications
Convolutional Neural Network and Its Applications
 
Cnn
CnnCnn
Cnn
 
Convolution Neural Network (CNN)
Convolution Neural Network (CNN)Convolution Neural Network (CNN)
Convolution Neural Network (CNN)
 
Convolutional Neural Network - CNN | How CNN Works | Deep Learning Course | S...
Convolutional Neural Network - CNN | How CNN Works | Deep Learning Course | S...Convolutional Neural Network - CNN | How CNN Works | Deep Learning Course | S...
Convolutional Neural Network - CNN | How CNN Works | Deep Learning Course | S...
 
Overview of Convolutional Neural Networks
Overview of Convolutional Neural NetworksOverview of Convolutional Neural Networks
Overview of Convolutional Neural Networks
 
Recurrent Neural Networks. Part 1: Theory
Recurrent Neural Networks. Part 1: TheoryRecurrent Neural Networks. Part 1: Theory
Recurrent Neural Networks. Part 1: Theory
 
Resnet
ResnetResnet
Resnet
 
Cnn
CnnCnn
Cnn
 
Cnn method
Cnn methodCnn method
Cnn method
 
Convolutional Neural Network (CNN) - image recognition
Convolutional Neural Network (CNN)  - image recognitionConvolutional Neural Network (CNN)  - image recognition
Convolutional Neural Network (CNN) - image recognition
 
Convolutional neural network
Convolutional neural networkConvolutional neural network
Convolutional neural network
 
Introduction to Recurrent Neural Network
Introduction to Recurrent Neural NetworkIntroduction to Recurrent Neural Network
Introduction to Recurrent Neural Network
 
Convolutional Neural Network (CNN)
Convolutional Neural Network (CNN)Convolutional Neural Network (CNN)
Convolutional Neural Network (CNN)
 
Image classification using cnn
Image classification using cnnImage classification using cnn
Image classification using cnn
 
EfficientNet
EfficientNetEfficientNet
EfficientNet
 
Densenet CNN
Densenet CNNDensenet CNN
Densenet CNN
 
Convolutional Neural Network (CNN)
Convolutional Neural Network (CNN)Convolutional Neural Network (CNN)
Convolutional Neural Network (CNN)
 
Image classification using CNN
Image classification using CNNImage classification using CNN
Image classification using CNN
 
CNN Tutorial
CNN TutorialCNN Tutorial
CNN Tutorial
 

Similar to MobileNetV3: Efficient Model Scaling and Adaptive Nonlinearity

InternImage: Exploring Large-Scale Vision Foundation Models with Deformable C...
InternImage: Exploring Large-Scale Vision Foundation Models with Deformable C...InternImage: Exploring Large-Scale Vision Foundation Models with Deformable C...
InternImage: Exploring Large-Scale Vision Foundation Models with Deformable C...taeseon ryu
 
PR-144: SqueezeNext: Hardware-Aware Neural Network Design
PR-144: SqueezeNext: Hardware-Aware Neural Network DesignPR-144: SqueezeNext: Hardware-Aware Neural Network Design
PR-144: SqueezeNext: Hardware-Aware Neural Network DesignJinwon Lee
 
MobileNet Review | Mobile Net Research Paper Review | MobileNet v1 Paper Expl...
MobileNet Review | Mobile Net Research Paper Review | MobileNet v1 Paper Expl...MobileNet Review | Mobile Net Research Paper Review | MobileNet v1 Paper Expl...
MobileNet Review | Mobile Net Research Paper Review | MobileNet v1 Paper Expl...Laxmi Kant Tiwari
 
EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks.pptx
EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks.pptxEfficientNet: Rethinking Model Scaling for Convolutional Neural Networks.pptx
EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks.pptxssuser2624f71
 
Introduction to CNN Models: DenseNet & MobileNet
Introduction to CNN Models: DenseNet & MobileNetIntroduction to CNN Models: DenseNet & MobileNet
Introduction to CNN Models: DenseNet & MobileNetKrishnakoumarC
 
Service-aware Networks over Shared Wireless Access Infrastructure
Service-aware Networks over Shared Wireless Access InfrastructureService-aware Networks over Shared Wireless Access Infrastructure
Service-aware Networks over Shared Wireless Access InfrastructureCPqD
 
Convolutional Neural Networks - Xavier Giro - UPC TelecomBCN Barcelona 2020
Convolutional Neural Networks - Xavier Giro - UPC TelecomBCN Barcelona 2020Convolutional Neural Networks - Xavier Giro - UPC TelecomBCN Barcelona 2020
Convolutional Neural Networks - Xavier Giro - UPC TelecomBCN Barcelona 2020Universitat Politècnica de Catalunya
 
convolutional_neural_networks.pptx
convolutional_neural_networks.pptxconvolutional_neural_networks.pptx
convolutional_neural_networks.pptxMsKiranSingh
 
ConvNeXt: A ConvNet for the 2020s explained
ConvNeXt: A ConvNet for the 2020s explainedConvNeXt: A ConvNet for the 2020s explained
ConvNeXt: A ConvNet for the 2020s explainedSushant Gautam
 
Convolutional Neural Networks : Popular Architectures
Convolutional Neural Networks : Popular ArchitecturesConvolutional Neural Networks : Popular Architectures
Convolutional Neural Networks : Popular Architecturesananth
 
Efficient de cvpr_2020_paper
Efficient de cvpr_2020_paperEfficient de cvpr_2020_paper
Efficient de cvpr_2020_papershanullah3
 
PR-366: A ConvNet for 2020s
PR-366: A ConvNet for 2020sPR-366: A ConvNet for 2020s
PR-366: A ConvNet for 2020sJinwon Lee
 
PR243: Designing Network Design Spaces
PR243: Designing Network Design SpacesPR243: Designing Network Design Spaces
PR243: Designing Network Design SpacesJinwon Lee
 
Handwritten Digit Recognition and performance of various modelsation[autosaved]
Handwritten Digit Recognition and performance of various modelsation[autosaved]Handwritten Digit Recognition and performance of various modelsation[autosaved]
Handwritten Digit Recognition and performance of various modelsation[autosaved]SubhradeepMaji
 
A White Paper On Neural Network Quantization
A White Paper On Neural Network QuantizationA White Paper On Neural Network Quantization
A White Paper On Neural Network QuantizationApril Knyff
 
intro-to-cnn-April_2020.pptx
intro-to-cnn-April_2020.pptxintro-to-cnn-April_2020.pptx
intro-to-cnn-April_2020.pptxssuser3aa461
 
Review of the paper: Traffic-aware Frequency Scaling for Balanced On-Chip Net...
Review of the paper: Traffic-aware Frequency Scaling for Balanced On-Chip Net...Review of the paper: Traffic-aware Frequency Scaling for Balanced On-Chip Net...
Review of the paper: Traffic-aware Frequency Scaling for Balanced On-Chip Net...Luca Sinico
 
PR-183: MixNet: Mixed Depthwise Convolutional Kernels
PR-183: MixNet: Mixed Depthwise Convolutional KernelsPR-183: MixNet: Mixed Depthwise Convolutional Kernels
PR-183: MixNet: Mixed Depthwise Convolutional KernelsJinwon Lee
 

Similar to MobileNetV3: Efficient Model Scaling and Adaptive Nonlinearity (20)

InternImage: Exploring Large-Scale Vision Foundation Models with Deformable C...
InternImage: Exploring Large-Scale Vision Foundation Models with Deformable C...InternImage: Exploring Large-Scale Vision Foundation Models with Deformable C...
InternImage: Exploring Large-Scale Vision Foundation Models with Deformable C...
 
PR-144: SqueezeNext: Hardware-Aware Neural Network Design
PR-144: SqueezeNext: Hardware-Aware Neural Network DesignPR-144: SqueezeNext: Hardware-Aware Neural Network Design
PR-144: SqueezeNext: Hardware-Aware Neural Network Design
 
1801.06434
1801.064341801.06434
1801.06434
 
MobileNet Review | Mobile Net Research Paper Review | MobileNet v1 Paper Expl...
MobileNet Review | Mobile Net Research Paper Review | MobileNet v1 Paper Expl...MobileNet Review | Mobile Net Research Paper Review | MobileNet v1 Paper Expl...
MobileNet Review | Mobile Net Research Paper Review | MobileNet v1 Paper Expl...
 
EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks.pptx
EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks.pptxEfficientNet: Rethinking Model Scaling for Convolutional Neural Networks.pptx
EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks.pptx
 
Introduction to CNN Models: DenseNet & MobileNet
Introduction to CNN Models: DenseNet & MobileNetIntroduction to CNN Models: DenseNet & MobileNet
Introduction to CNN Models: DenseNet & MobileNet
 
Service-aware Networks over Shared Wireless Access Infrastructure
Service-aware Networks over Shared Wireless Access InfrastructureService-aware Networks over Shared Wireless Access Infrastructure
Service-aware Networks over Shared Wireless Access Infrastructure
 
Convolutional Neural Networks - Xavier Giro - UPC TelecomBCN Barcelona 2020
Convolutional Neural Networks - Xavier Giro - UPC TelecomBCN Barcelona 2020Convolutional Neural Networks - Xavier Giro - UPC TelecomBCN Barcelona 2020
Convolutional Neural Networks - Xavier Giro - UPC TelecomBCN Barcelona 2020
 
convolutional_neural_networks.pptx
convolutional_neural_networks.pptxconvolutional_neural_networks.pptx
convolutional_neural_networks.pptx
 
ConvNeXt: A ConvNet for the 2020s explained
ConvNeXt: A ConvNet for the 2020s explainedConvNeXt: A ConvNet for the 2020s explained
ConvNeXt: A ConvNet for the 2020s explained
 
Convolutional Neural Networks : Popular Architectures
Convolutional Neural Networks : Popular ArchitecturesConvolutional Neural Networks : Popular Architectures
Convolutional Neural Networks : Popular Architectures
 
Machine Learning @NECST
Machine Learning @NECSTMachine Learning @NECST
Machine Learning @NECST
 
Efficient de cvpr_2020_paper
Efficient de cvpr_2020_paperEfficient de cvpr_2020_paper
Efficient de cvpr_2020_paper
 
PR-366: A ConvNet for 2020s
PR-366: A ConvNet for 2020sPR-366: A ConvNet for 2020s
PR-366: A ConvNet for 2020s
 
PR243: Designing Network Design Spaces
PR243: Designing Network Design SpacesPR243: Designing Network Design Spaces
PR243: Designing Network Design Spaces
 
Handwritten Digit Recognition and performance of various modelsation[autosaved]
Handwritten Digit Recognition and performance of various modelsation[autosaved]Handwritten Digit Recognition and performance of various modelsation[autosaved]
Handwritten Digit Recognition and performance of various modelsation[autosaved]
 
A White Paper On Neural Network Quantization
A White Paper On Neural Network QuantizationA White Paper On Neural Network Quantization
A White Paper On Neural Network Quantization
 
intro-to-cnn-April_2020.pptx
intro-to-cnn-April_2020.pptxintro-to-cnn-April_2020.pptx
intro-to-cnn-April_2020.pptx
 
Review of the paper: Traffic-aware Frequency Scaling for Balanced On-Chip Net...
Review of the paper: Traffic-aware Frequency Scaling for Balanced On-Chip Net...Review of the paper: Traffic-aware Frequency Scaling for Balanced On-Chip Net...
Review of the paper: Traffic-aware Frequency Scaling for Balanced On-Chip Net...
 
PR-183: MixNet: Mixed Depthwise Convolutional Kernels
PR-183: MixNet: Mixed Depthwise Convolutional KernelsPR-183: MixNet: Mixed Depthwise Convolutional Kernels
PR-183: MixNet: Mixed Depthwise Convolutional Kernels
 

Recently uploaded

complete construction, environmental and economics information of biomass com...
complete construction, environmental and economics information of biomass com...complete construction, environmental and economics information of biomass com...
complete construction, environmental and economics information of biomass com...asadnawaz62
 
Current Transformer Drawing and GTP for MSETCL
Current Transformer Drawing and GTP for MSETCLCurrent Transformer Drawing and GTP for MSETCL
Current Transformer Drawing and GTP for MSETCLDeelipZope
 
main PPT.pptx of girls hostel security using rfid
main PPT.pptx of girls hostel security using rfidmain PPT.pptx of girls hostel security using rfid
main PPT.pptx of girls hostel security using rfidNikhilNagaraju
 
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130Suhani Kapoor
 
Sachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
Sachpazis Costas: Geotechnical Engineering: A student's Perspective IntroductionSachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
Sachpazis Costas: Geotechnical Engineering: A student's Perspective IntroductionDr.Costas Sachpazis
 
Biology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptxBiology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptxDeepakSakkari2
 
Microscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxMicroscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxpurnimasatapathy1234
 
Internship report on mechanical engineering
Internship report on mechanical engineeringInternship report on mechanical engineering
Internship report on mechanical engineeringmalavadedarshan25
 
SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )Tsuyoshi Horigome
 
Oxy acetylene welding presentation note.
Oxy acetylene welding presentation note.Oxy acetylene welding presentation note.
Oxy acetylene welding presentation note.eptoze12
 
Introduction to Microprocesso programming and interfacing.pptx
Introduction to Microprocesso programming and interfacing.pptxIntroduction to Microprocesso programming and interfacing.pptx
Introduction to Microprocesso programming and interfacing.pptxvipinkmenon1
 
Artificial-Intelligence-in-Electronics (K).pptx
Artificial-Intelligence-in-Electronics (K).pptxArtificial-Intelligence-in-Electronics (K).pptx
Artificial-Intelligence-in-Electronics (K).pptxbritheesh05
 
power system scada applications and uses
power system scada applications and usespower system scada applications and uses
power system scada applications and usesDevarapalliHaritha
 
Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024hassan khalil
 
chaitra-1.pptx fake news detection using machine learning
chaitra-1.pptx  fake news detection using machine learningchaitra-1.pptx  fake news detection using machine learning
chaitra-1.pptx fake news detection using machine learningmisbanausheenparvam
 
Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...VICTOR MAESTRE RAMIREZ
 

Recently uploaded (20)

complete construction, environmental and economics information of biomass com...
complete construction, environmental and economics information of biomass com...complete construction, environmental and economics information of biomass com...
complete construction, environmental and economics information of biomass com...
 
Design and analysis of solar grass cutter.pdf
Design and analysis of solar grass cutter.pdfDesign and analysis of solar grass cutter.pdf
Design and analysis of solar grass cutter.pdf
 
Current Transformer Drawing and GTP for MSETCL
Current Transformer Drawing and GTP for MSETCLCurrent Transformer Drawing and GTP for MSETCL
Current Transformer Drawing and GTP for MSETCL
 
main PPT.pptx of girls hostel security using rfid
main PPT.pptx of girls hostel security using rfidmain PPT.pptx of girls hostel security using rfid
main PPT.pptx of girls hostel security using rfid
 
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
 
Sachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
Sachpazis Costas: Geotechnical Engineering: A student's Perspective IntroductionSachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
Sachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
 
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCRCall Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
 
Biology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptxBiology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptx
 
POWER SYSTEMS-1 Complete notes examples
POWER SYSTEMS-1 Complete notes  examplesPOWER SYSTEMS-1 Complete notes  examples
POWER SYSTEMS-1 Complete notes examples
 
Microscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxMicroscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptx
 
Internship report on mechanical engineering
Internship report on mechanical engineeringInternship report on mechanical engineering
Internship report on mechanical engineering
 
SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )
 
Oxy acetylene welding presentation note.
Oxy acetylene welding presentation note.Oxy acetylene welding presentation note.
Oxy acetylene welding presentation note.
 
Introduction to Microprocesso programming and interfacing.pptx
Introduction to Microprocesso programming and interfacing.pptxIntroduction to Microprocesso programming and interfacing.pptx
Introduction to Microprocesso programming and interfacing.pptx
 
Artificial-Intelligence-in-Electronics (K).pptx
Artificial-Intelligence-in-Electronics (K).pptxArtificial-Intelligence-in-Electronics (K).pptx
Artificial-Intelligence-in-Electronics (K).pptx
 
power system scada applications and uses
power system scada applications and usespower system scada applications and uses
power system scada applications and uses
 
Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024
 
chaitra-1.pptx fake news detection using machine learning
chaitra-1.pptx  fake news detection using machine learningchaitra-1.pptx  fake news detection using machine learning
chaitra-1.pptx fake news detection using machine learning
 
Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...
 
young call girls in Green Park🔝 9953056974 🔝 escort Service
young call girls in Green Park🔝 9953056974 🔝 escort Serviceyoung call girls in Green Park🔝 9953056974 🔝 escort Service
young call girls in Green Park🔝 9953056974 🔝 escort Service
 

MobileNetV3: Efficient Model Scaling and Adaptive Nonlinearity

  • 2. Introduction This paper describes the approach we took to develop MobileNetV3 Large and Small models 1. complementary search techniques 2. new efficient versions of nonlinearities practical for the mobile setting 3. new efficient network design 4. a new efficient segmentation decoder MobileNet V3
  • 3. Recap: MobileNet V1 Depth-wise separable Convolution Depthwise Convolution Pointwise Convolution • Standard Convolution cost: ℎ𝑖 × 𝑤𝑖 × 𝑑𝑗 × 𝑘 × 𝑘 × 𝑑𝑖 • Deptwise seperable convolution cost ℎ𝑖 × 𝑤𝑖 × 𝑑𝑗 × (𝑘2 + 𝑑𝑖) • Reduction in computation 1 𝑑 𝑗 + 1 𝑘2
  • 4. Recap: MobileNet V2 Linear Bottlenecks • If the manifold of interest remains non-zero volume after ReLU transformation, it corresponds to a linear transformation. • ReLU is capable of preserving complete information about the input manifold, but only if the input manifold lies in a low-dimensional subspace of the input space. • we can capture this by inserting linear bottleneck layers into the convolutional blocks.
  • 5. Recap: MobileNet V2 Inverted Residual Residual Inverted Residual wide → narrow → wide narrow → wide → narrow ℎ × 𝑤 × 𝑡 × 𝑑′ × 𝑑′ + ℎ × 𝑤 × 𝑡 × 𝑑′ × 𝑘 × 𝑘 + ℎ × 𝑤 × 𝑑′′ × 𝑡 × 𝑑′′ = ℎ × 𝑤 × 𝑑′ × 𝑡 × (𝑑′ + 𝑘2 + 𝑑′′) Block of size ℎ × 𝑤, expansion factor 𝑡, kernel size 𝑘, input channel 𝑑′, output channel 𝑑′′
  • 10. Efficient Mobile Building Block we use a combination of these layers as building blocks in order to build the most effective models. upgraded with modified swish nonlinearities → h-swish Combination of layers 1. MobileNetV1: depthwise separable convolutions 2. MobileNetV2: linear bottleneck, inverted residual structure 3. MnasNet: lightweight attention modules based on squeeze and excitation into the bottleneck structure MobileNetV3
  • 11. Network Search Platform-Aware NAS for Block-wise Search 𝜔 = −0.17 vs the original 𝜔 = −0.07 We observe that accuracy changes much more dramatically with latency for small models 𝑚𝑎𝑥𝑖𝑚𝑖𝑧𝑒 𝐴𝐶𝐶(𝑚) × 𝐿𝐴𝑇(𝑚) 𝑇 𝜔 𝑚
  • 12. Network Search NetAdapt for Layer-wise Search 1. Starts with a seed network architecture found by platform-aware NAS. 2. For each step: (a) Generate a set of new proposals. Each proposal represents a modification of an architecture that generates at least δ reduction in latency compared to the previous step. (b) For each proposal we use the pre-trained model from the previous step and populate the new proposed architecture, truncating and randomly initializing missing weights as appropriate. Finetune each proposal for T steps to get a coarse estimate of the accuracy. (c) Selected best proposal according to some metric. 3. Iterate previous step until target latency is reached.
  • 13. Network Search Proposal 1. Reduce the size of any expansion layer; 2. Reduce bottleneck in all blocks that share the same bottleneck size - to maintain residual connections 𝑚𝑎𝑥𝑖𝑚𝑖𝑧𝑒 ∆Acc |∆latency| → maximize tan Evaluation
  • 14. Network Improvements Redesigning Expensive Layers We redesign the computionally-expensive layers at the beginning and the end of the network. These modifications are outside of the scope of the current search space. 1. Current models based on MobileNetV2’s inverted bottleneck structure and variants use 1x1 convolution. This layer is critically important in order to have rich features for prediction. However, this comes at a cost of extra latency. → Move this layer (1) past the final average pooling 2. The previous bottleneck projection layer is no longer needed to reduce computation. → Remove the projection and filtering layers in the previous bottleneck layer
  • 16. Network Improvements Redesigning Expensive Layers We experimented with reducing the number of filters and using different nonlinearities to try and reduce redundancy. We were able to reduce the number of filters to 16 while maintaining the same accuracy as 32 filters using either ReLU or swish.
  • 17. Network Improvements Nonlinearities While this nonlinearity improves accuracy, it comes with non-zero cost in embedded environments as the sigmoid function is much more expensive to compute on mobile devices.
  • 18. Network Improvements Nonlinearities First, optimized implementations of ReLU6 are available on virtually all software and hardware frameworks. Second, in quantized mode, it eliminates potential numerical precision loss caused by different implementations of the approximate sigmoid. Finally, in practice, h-swish can be implemented as a piece-wise function to reduce the number of memory accesses driving the latency cost down substantially
  • 19. Network Improvements Nonlinearities The cost of applying nonlinearity decreases as we go deeper into the network, since each layer activation memory typically halves every time the resolution drops. Thus in our architectures we only use h-swish at the second half of the model.
  • 20. Network Improvements Large squeeze-and-excite we replace them all to fixed to be 1/4 of the number of channels in expansion layer.
  • 27. Experiments Results - Semantic Segmentation R-ASPP
  • 28. Experiments Results - Semantic Segmentation R-ASPP