SlideShare a Scribd company logo
1 of 12
Download to read offline
SQUEEZENET: ALEXNET-LEVEL
ACCURACY WITH 50X FEWER
PARAMETERS AND <0.5MB MODEL
SIZE
By
Anup Joseph
August 2021
PAPER PRESENTAION
A large CNN model has fundamental
limitations in deploylment.
To solve this problem a variety of heavily
engineered models with small sizes have
emerged.
Background
August 2021
PAPER PRESENTATION
Advantages
of smaller
CNNs
MORE EFFICIENT DISTRIBUTED
TRAINING
LESS OVERHEAD WHEN EXPORTING
NEW MODELS TO CLIENTS
FEASIBLE FPGA AND EMBEDDED
DEPLOYMENT
PAPER PRESENTATION August 2021
Architectural
Design
Strategies
PAPER PRESENTATION
August 2021
PAPER PRESENTATIONS August 2021
Replace 3x3 filters with 1x1 filters
1x1 filters have 9x fewer parameter than 3x3 filters
Decrease the number of input channels
to 3x3 filters
To decrease the total number of parameters its important to decrease the number
of input channels to the 3x3 filters
Downsample late in the network so that
convolution layers have large activation
maps.
PAPER PRESENTATIONS
The Fire module
August 2021
squeeze layers
expand layers
no. of filters in the squeeze layer(all 1x1)
no. of 1x1 filters in the expand layer
no. of 3x3 filters in the expand layer
Fire module consists of two sections
The Fire module has three tunable hperparameters
Configurations
August 2021
PAPER PRESENATIONS
Learning Rate - 0.4 and then linearly
decrease throughout the network.
ReLU is applied to activations from squeeze
and expand layers.
The number of filters per fire module are
gradually increased to the end of the network
The arcitecture was implemeted using the
Caffe2 framework but ports foor most
popular frameworks exists
Training Settings
Model Architecture
Squeezenets outperform all model compressions approaches before it with the
smallest model being 0.5MB registering a 510x decrease in model size
compared to AlexNet
Results
August 2021
PAPER PRESENTATION
SR - the ratio between the no. of filters in squeeze to the no of filters in expand.
Hyperparameters
August 2021
PAPER PRESENTATION
The bypass connections are same as skip connections in the Resnet
architecture.
The simple bypass is just a wire i.e. it adds the output of past layer as an input
further in the network.
The complex bypass on the other hand has a 1x1 convolution with the number
of filters set equal to the number of output channels that are needed.
Squeezenet variants
August 2021
PAPER PRESENTATION
Quantiphi   squeezenets

More Related Content

Similar to Quantiphi squeezenets

IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...
IJERD Editor
 
VLSI Design of Low Power High Speed 4 Bit Resolution Pipeline ADC In Submicro...
VLSI Design of Low Power High Speed 4 Bit Resolution Pipeline ADC In Submicro...VLSI Design of Low Power High Speed 4 Bit Resolution Pipeline ADC In Submicro...
VLSI Design of Low Power High Speed 4 Bit Resolution Pipeline ADC In Submicro...
VLSICS Design
 
VLSI Design of Low Power High Speed 4 Bit Resolution Pipeline ADC In Submicro...
VLSI Design of Low Power High Speed 4 Bit Resolution Pipeline ADC In Submicro...VLSI Design of Low Power High Speed 4 Bit Resolution Pipeline ADC In Submicro...
VLSI Design of Low Power High Speed 4 Bit Resolution Pipeline ADC In Submicro...
VLSICS Design
 
Intelligent space buffers power efficient solution for network on chip
Intelligent space buffers   power efficient solution for network on chipIntelligent space buffers   power efficient solution for network on chip
Intelligent space buffers power efficient solution for network on chip
Dhiraj Chaudhary
 
Power efficient solution for network on chip
Power efficient solution for network on chipPower efficient solution for network on chip
Power efficient solution for network on chip
Dhiraj Chaudhary
 
Nexgen tech vlsi 2015 2014
Nexgen  tech vlsi 2015 2014Nexgen  tech vlsi 2015 2014
Nexgen tech vlsi 2015 2014
nexgentech
 
Nath2021_Article_DesignOfLowPowerPreamplifierIC.pdf
Nath2021_Article_DesignOfLowPowerPreamplifierIC.pdfNath2021_Article_DesignOfLowPowerPreamplifierIC.pdf
Nath2021_Article_DesignOfLowPowerPreamplifierIC.pdf
Chikkapriyanka
 
Architectures for Optical Networks SLICE
Architectures for Optical Networks SLICEArchitectures for Optical Networks SLICE
Architectures for Optical Networks SLICE
Wellington Renan Gon
 

Similar to Quantiphi squeezenets (20)

IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...
 
VLSI Design of Low Power High Speed 4 Bit Resolution Pipeline ADC In Submicro...
VLSI Design of Low Power High Speed 4 Bit Resolution Pipeline ADC In Submicro...VLSI Design of Low Power High Speed 4 Bit Resolution Pipeline ADC In Submicro...
VLSI Design of Low Power High Speed 4 Bit Resolution Pipeline ADC In Submicro...
 
VLSI Design of Low Power High Speed 4 Bit Resolution Pipeline ADC In Submicro...
VLSI Design of Low Power High Speed 4 Bit Resolution Pipeline ADC In Submicro...VLSI Design of Low Power High Speed 4 Bit Resolution Pipeline ADC In Submicro...
VLSI Design of Low Power High Speed 4 Bit Resolution Pipeline ADC In Submicro...
 
Intelligent space buffers power efficient solution for network on chip
Intelligent space buffers   power efficient solution for network on chipIntelligent space buffers   power efficient solution for network on chip
Intelligent space buffers power efficient solution for network on chip
 
Power efficient solution for network on chip
Power efficient solution for network on chipPower efficient solution for network on chip
Power efficient solution for network on chip
 
Design & Simulation of Half Adder Circuit using AVL Technique Based on CMOS T...
Design & Simulation of Half Adder Circuit using AVL Technique Based on CMOS T...Design & Simulation of Half Adder Circuit using AVL Technique Based on CMOS T...
Design & Simulation of Half Adder Circuit using AVL Technique Based on CMOS T...
 
Convolutional Neural Networks
Convolutional Neural NetworksConvolutional Neural Networks
Convolutional Neural Networks
 
OpenStack Networks the Web-Scale Way - Scott Laffer, Cumulus Networks
OpenStack Networks the Web-Scale Way - Scott Laffer, Cumulus NetworksOpenStack Networks the Web-Scale Way - Scott Laffer, Cumulus Networks
OpenStack Networks the Web-Scale Way - Scott Laffer, Cumulus Networks
 
Low Power and Area Efficient Multiplier Layout using Transmission Gate
Low Power and Area Efficient Multiplier Layout using Transmission GateLow Power and Area Efficient Multiplier Layout using Transmission Gate
Low Power and Area Efficient Multiplier Layout using Transmission Gate
 
439 442
439 442439 442
439 442
 
BULK IEEE PROJECTS IN VLSI ,BULK IEEE PROJECTS, IEEE 2015-16 VLSI PROJECTS I...
 BULK IEEE PROJECTS IN VLSI ,BULK IEEE PROJECTS, IEEE 2015-16 VLSI PROJECTS I... BULK IEEE PROJECTS IN VLSI ,BULK IEEE PROJECTS, IEEE 2015-16 VLSI PROJECTS I...
BULK IEEE PROJECTS IN VLSI ,BULK IEEE PROJECTS, IEEE 2015-16 VLSI PROJECTS I...
 
Nexgen tech vlsi 2015 2014
Nexgen  tech vlsi 2015 2014Nexgen  tech vlsi 2015 2014
Nexgen tech vlsi 2015 2014
 
Ieee 2015 2014 nexgen tech vlsi abstract
Ieee 2015 2014 nexgen  tech vlsi   abstractIeee 2015 2014 nexgen  tech vlsi   abstract
Ieee 2015 2014 nexgen tech vlsi abstract
 
BULK IEEE PROJECTS IN VLSI ,BULK IEEE PROJECTS, IEEE 2015-16 VLSI PROJECTS IN...
BULK IEEE PROJECTS IN VLSI ,BULK IEEE PROJECTS, IEEE 2015-16 VLSI PROJECTS IN...BULK IEEE PROJECTS IN VLSI ,BULK IEEE PROJECTS, IEEE 2015-16 VLSI PROJECTS IN...
BULK IEEE PROJECTS IN VLSI ,BULK IEEE PROJECTS, IEEE 2015-16 VLSI PROJECTS IN...
 
Design of High pass and Low pass Filter using CMOS Operational Trans-conducta...
Design of High pass and Low pass Filter using CMOS Operational Trans-conducta...Design of High pass and Low pass Filter using CMOS Operational Trans-conducta...
Design of High pass and Low pass Filter using CMOS Operational Trans-conducta...
 
Nath2021_Article_DesignOfLowPowerPreamplifierIC.pdf
Nath2021_Article_DesignOfLowPowerPreamplifierIC.pdfNath2021_Article_DesignOfLowPowerPreamplifierIC.pdf
Nath2021_Article_DesignOfLowPowerPreamplifierIC.pdf
 
Architectures for Optical Networks SLICE
Architectures for Optical Networks SLICEArchitectures for Optical Networks SLICE
Architectures for Optical Networks SLICE
 
Low Leakage Low Ground Bounce Noise Power Gating Techniques for FPGAs
Low Leakage Low Ground Bounce Noise Power Gating Techniques for FPGAsLow Leakage Low Ground Bounce Noise Power Gating Techniques for FPGAs
Low Leakage Low Ground Bounce Noise Power Gating Techniques for FPGAs
 
H0144757
H0144757H0144757
H0144757
 
PR-366: A ConvNet for 2020s
PR-366: A ConvNet for 2020sPR-366: A ConvNet for 2020s
PR-366: A ConvNet for 2020s
 

Recently uploaded

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 

Recently uploaded (20)

WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering Developers
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 

Quantiphi squeezenets

  • 1. SQUEEZENET: ALEXNET-LEVEL ACCURACY WITH 50X FEWER PARAMETERS AND <0.5MB MODEL SIZE By Anup Joseph August 2021 PAPER PRESENTAION
  • 2. A large CNN model has fundamental limitations in deploylment. To solve this problem a variety of heavily engineered models with small sizes have emerged. Background August 2021 PAPER PRESENTATION
  • 3. Advantages of smaller CNNs MORE EFFICIENT DISTRIBUTED TRAINING LESS OVERHEAD WHEN EXPORTING NEW MODELS TO CLIENTS FEASIBLE FPGA AND EMBEDDED DEPLOYMENT PAPER PRESENTATION August 2021
  • 5. PAPER PRESENTATIONS August 2021 Replace 3x3 filters with 1x1 filters 1x1 filters have 9x fewer parameter than 3x3 filters Decrease the number of input channels to 3x3 filters To decrease the total number of parameters its important to decrease the number of input channels to the 3x3 filters Downsample late in the network so that convolution layers have large activation maps.
  • 6. PAPER PRESENTATIONS The Fire module August 2021 squeeze layers expand layers no. of filters in the squeeze layer(all 1x1) no. of 1x1 filters in the expand layer no. of 3x3 filters in the expand layer Fire module consists of two sections The Fire module has three tunable hperparameters
  • 7. Configurations August 2021 PAPER PRESENATIONS Learning Rate - 0.4 and then linearly decrease throughout the network. ReLU is applied to activations from squeeze and expand layers. The number of filters per fire module are gradually increased to the end of the network The arcitecture was implemeted using the Caffe2 framework but ports foor most popular frameworks exists Training Settings
  • 9. Squeezenets outperform all model compressions approaches before it with the smallest model being 0.5MB registering a 510x decrease in model size compared to AlexNet Results August 2021 PAPER PRESENTATION
  • 10. SR - the ratio between the no. of filters in squeeze to the no of filters in expand. Hyperparameters August 2021 PAPER PRESENTATION
  • 11. The bypass connections are same as skip connections in the Resnet architecture. The simple bypass is just a wire i.e. it adds the output of past layer as an input further in the network. The complex bypass on the other hand has a 1x1 convolution with the number of filters set equal to the number of output channels that are needed. Squeezenet variants August 2021 PAPER PRESENTATION