SlideShare a Scribd company logo
1 of 19
Download to read offline
© 2019 Phiar Technologies, Inc.
Separable Convolutions for
Efficient Implementation of CNNs
and Other Vision Algorithms
Chen-Ping Yu, PhD
Phiar Technologies, Inc.
May 2019
© 2019 Phiar Technologies, Inc.
Phiar Technologies, Inc.
• AI-powered AR navigation platform for driving
2
• First product: AR navigation mobile app
• On-device processing with mobile sensors : AI + SLAM + path planning
© 2019 Phiar Technologies, Inc.
Outline
• Spatial convolution in computer vision
• Separable convolution in computer vision
• Application in deep learning CNNs
• Low rank filter expansion (Jaderberg et al., 2014)
• Flattened CNN (Jin et al., 2015)
• MobileNet (Howard et al., 2017)
• Takeaways
• Resources
3
© 2019 Phiar Technologies, Inc.
Spatial convolution
4
• Running a filter through an
input image
• Smoothing (Gaussian)
• Template matching f k
Image source: http://deeplearning.net/software/theano_versions/dev/tutorial/conv_arithmetic.html; http://www.cse.psu.edu/~rtc12/CSE486/lecture07.pdf
© 2019 Phiar Technologies, Inc.
Spatial convolution
5
• Convolve f with a filter k: highest response at the matched locations
Image source: http://www.cse.psu.edu/~rtc12/CSE486/lecture07.pdf
• When both f and k are normalized (zero mean & unit standard deviation)
• Really cross-correlation, but is often called interchangeably
© 2019 Phiar Technologies, Inc.
Spatial convolution – zero padding, stride = 1
6
f k f k⊗
0*-1 + 0*0 + 0*1 +
0*-2 + 6*0 + 3*2 +
0*-1 + 4*0 + 2*1
86 3 3 6
4 2 1 8
2 5 6 4
9 3 2 1
0 0 0 0 0 0
0 0
0 0
0 0
0 0
0 0 0 0 0 0
-1 0 1
-2 0 2
-1 0 1
© 2019 Phiar Technologies, Inc.
Spatial convolution – zero padding, stride = 1
7
f k
8
f k⊗
0*-1 + 0*0 + 0*1 +
6*-2 + 3*0 + 3*2 +
4*-1 + 2*0 + 1*1
-96 3 3 6
4 2 1 8
2 5 6 4
9 3 2 1
0 0 0 0 0 0
0 0
0 0
0 0
0 0
0 0 0 0 0 0
-1 0 1
-2 0 2
-1 0 1
© 2019 Phiar Technologies, Inc.
Spatial convolution – zero padding, stride = 1
8
f k
8 -9
f k⊗
0*-1 + 0*0 + 0*1 +
3*-2 + 3*0 + 6*2 +
2*-1 + 1*0 + 8*1
126 3 3 6
4 2 1 8
2 5 6 4
9 3 2 1
0 0 0 0 0 0
0 0
0 0
0 0
0 0
0 0 0 0 0 0
-1 0 1
-2 0 2
-1 0 1
© 2019 Phiar Technologies, Inc.
Spatial convolution – zero padding, stride = 1
9
f k
8 -9 12
f k⊗
0*-1 + 0*0 + 0*1 +
3*-2 + 6*0 + 0*2 +
1*-1 + 8*0 + 0*1
-76 3 3 6
4 2 1 8
2 5 6 4
9 3 2 1
0 0 0 0 0 0
0 0
0 0
0 0
0 0
0 0 0 0 0 0
-1 0 1
-2 0 2
-1 0 1
© 2019 Phiar Technologies, Inc.
Spatial convolution – zero padding, stride = 1
10
f k
8 -9 12 -7
f k⊗
0*-1 + 6*0 + 3*1 +
0*-2 + 4*0 + 2*2 +
0*-1 + 2*0 + 5*1
12
6 3 3 6
4 2 1 8
2 5 6 4
9 3 2 1
0 0 0 0 0 0
0 0
0 0
0 0
0 0
0 0 0 0 0 0
-1 0 1
-2 0 2
-1 0 1
© 2019 Phiar Technologies, Inc.
Separable convolution
11
Sobel filter
(edge detector)
⊗
-1 0 1
-2 0 2
-1 0 1
=-1 0 1
1
2
1
3x3
1x3
3x1
Also equivalent to outer product, or
matrix multiplication
Turns out, k can be decomposed into a column vector
that is convolved by a row vector in 1D
kc r
© 2019 Phiar Technologies, Inc.
= f c r
= ( f c ) r
f k = f ( c r )
Let k = c r
Separable convolution
12
-1 0 1
-2 0 2
-1 0 1
=>
-1 0 1
1
2
1
6 3 3 6
4 2 1 8
2 5 6 4
9 3 2 1
⊗
Associativity of convolution:
f ( c r ) = ( f c ) r⊗⊗ ⊗ ⊗
⊗
⊗ ⊗ ⊗
⊗ ⊗
⊗ ⊗
8 -9 12 -7
12 -5 14 -11
10 -2 2 -15
11 -10 -5 -10
6 3 3 6
4 2 1 8
2 5 6 4
9 3 2 1
⊗ ⊗ =>
8 -9 12 -7
12 -5 14 -11
10 -2 2 -15
11 -10 -5 -10
f
f
k
c r f c r
f k⊗
⊗ ⊗
© 2019 Phiar Technologies, Inc.
Separable convolution: significantly reduced complexity
13
-1 0 1
-2 0 2
-1 0 1
-1 0 1
1
2
1
6 3 3 6
4 2 1 8
2 5 6 4
9 3 2 1
9 multiplications per pixel
Let n = side size of the filter
2D filter: n*n => O(n2)
3D filter: n*n*n => O(n3)
6 3 3 6
4 2 1 8
2 5 6 4
9 3 2 1
⊗ ⊗
3x3
⊗
3x1 3x1
3 multiplications
+
3 multiplications
2D filter: n+n = 2n => O(n)
3D filter: n+n+n = 3n => O(n)
© 2019 Phiar Technologies, Inc.
Separable convolution example: 3D tumor detection
14
Image source: http://www.cse.psu.edu/~rtc12/CSE486/lecture07.pdf; Yu et al., 2017: http://chenpingyu.org/docs/yu_isbi2014.pdf
• Yu et al., Stony Brook U., ISBI 2014
• Intel i7 dual core @ 2.7 Ghz
• Head MRI: 256 x 256 x 256
• 8 scales of 3D LoG filters
• Regular conv: > 2 hours
• Separable conv: < 2 min
2D Laplacian of Gaussian filter
“Blob” detector
3D Separable LoG
Mult per voxel: from n3 to 9n
© 2019 Phiar Technologies, Inc.
Deep learning example: Low rank filter expansion
• Jaderberg et al., University of Oxford, BMVC 2014
• Reconstruct 3D filters in a pre-trained network with 1D and 2D filters
• Approximation 1: use purely 1D filters
• Approximation 2: use 1D filters followed by 2D filters
• 4-layer CNN; text recognition; 2.5~4.5x speed up; <1% accuracy tradeoff
15
Image source: https://arxiv.org/pdf/1405.3866.pdf
1) 2)
© 2019 Phiar Technologies, Inc.
Deep learning example: Flatten CNN
• Jin et al., Purdue University, ICLR Workshop 2015
• Train a network from scratch with a sequence of 1D filters
• Baseline: 3 conv (5x5) + 2 FC layers; swap each conv with 2 flattened set
• CIFAR-10/100, MNIST; 2~3.5x speedup at same or better accuracy
16
Image source: https://arxiv.org/pdf/1412.5474.pdf
© 2019 Phiar Technologies, Inc.
Deep learning example: MobileNet V1
17
• Howard et al., Google Inc., ArXiv 2017
• Depthwise separable convolutions: 3 x 3 x 1 then 1 x 1 x D
• 28 layers, a number of variants
• ImageNet, Stanford Dogs, Im2GPS, YFCC100M, COCO
• At comparable accuracy, 4.2M parameters vs 138M of VGG-16
Image source: https://arxiv.org/pdf/1704.04861.pdf
© 2019 Phiar Technologies, Inc.
Takeaways - deep learning applicability
• Faster inference and training time (~20% faster training on ImageNet)
• Computational savings get even more with larger filters (per pixel)
• A 3 x 3 x 64 filter: from 576 multiplications down to 70
• A 15 x 15 x 64 filter: from 14,400 multiplications down to 94
• A 35 x 35 x 64 filter: from 78,400 multiplications down to 134
• Allows more contextual information
• Allows deeper & wider network, use residual connections to avoid
vanishing gradient
• Especially good at early layers – reducing large input sizes’ complexity
18
© 2019 Phiar Technologies, Inc.
Resources, and we are hiring!
19
Relevant Papers & Materials
Yu et al., 2014, “3D Blob Based Tumor Detection and
Segmentation in MR Images.”
http://chenpingyu.org/docs/yu_isbi2014.pdf
Jaderberg et al., 2014, “Speeding Up Convolution Neural
Networks with Low Rank Expansions.”
https://arxiv.org/pdf/1405.3866.pdf
Jin et al., 2015, “Flattened Convolutional Neural Networks
For Feedforward Acceleration.”
https://arxiv.org/pdf/1412.5474.pdf
Howard et al., 2017, “MobileNets: Efficient Convolutional
Neural Networks for Mobile Vision Applications.”
https://arxiv.org/pdf/1704.04861.pdf
Computer Vision Lecture Notes, Penn State University
http://www.cse.psu.edu/~rtc12/CSE486/lecture07.pdf
Phiar Technologies, Inc.
Website
https://www.phiar.net
Currently Hiring for:
• SLAM Engineer
• Computer Vision/Deep Learning Engineer
• Full-Stack Software Engineer
• Product Manager
Embedded Vision Summit

More Related Content

What's hot

"Pioneering Analog Compute for Edge AI to Overcome the End of Digital Scaling...
"Pioneering Analog Compute for Edge AI to Overcome the End of Digital Scaling..."Pioneering Analog Compute for Edge AI to Overcome the End of Digital Scaling...
"Pioneering Analog Compute for Edge AI to Overcome the End of Digital Scaling...Edge AI and Vision Alliance
 
“A Mask Detection Smart Camera Using the Nvidia Jetson Nano: System Architect...
“A Mask Detection Smart Camera Using the Nvidia Jetson Nano: System Architect...“A Mask Detection Smart Camera Using the Nvidia Jetson Nano: System Architect...
“A Mask Detection Smart Camera Using the Nvidia Jetson Nano: System Architect...Edge AI and Vision Alliance
 
"Developing Real-time Video Applications with CoaXPress," A Presentation from...
"Developing Real-time Video Applications with CoaXPress," A Presentation from..."Developing Real-time Video Applications with CoaXPress," A Presentation from...
"Developing Real-time Video Applications with CoaXPress," A Presentation from...Edge AI and Vision Alliance
 
Deep learning for smart manufacturing
Deep learning for smart manufacturingDeep learning for smart manufacturing
Deep learning for smart manufacturingSunil Kumar Pradhan
 
“Secure Hardware Architecture for Embedded Vision,” a Presentation from Neuro...
“Secure Hardware Architecture for Embedded Vision,” a Presentation from Neuro...“Secure Hardware Architecture for Embedded Vision,” a Presentation from Neuro...
“Secure Hardware Architecture for Embedded Vision,” a Presentation from Neuro...Edge AI and Vision Alliance
 
"Can Simulation Solve the Training Data Problem?," a Presentation from Mindtech
"Can Simulation Solve the Training Data Problem?," a Presentation from Mindtech"Can Simulation Solve the Training Data Problem?," a Presentation from Mindtech
"Can Simulation Solve the Training Data Problem?," a Presentation from MindtechEdge AI and Vision Alliance
 
“How Containerization Unblocks Barriers to Fast, Easy Deployment of AI-Driven...
“How Containerization Unblocks Barriers to Fast, Easy Deployment of AI-Driven...“How Containerization Unblocks Barriers to Fast, Easy Deployment of AI-Driven...
“How Containerization Unblocks Barriers to Fast, Easy Deployment of AI-Driven...Edge AI and Vision Alliance
 
“Productizing Edge AI Across Applications and Verticals: Case Study and Insig...
“Productizing Edge AI Across Applications and Verticals: Case Study and Insig...“Productizing Edge AI Across Applications and Verticals: Case Study and Insig...
“Productizing Edge AI Across Applications and Verticals: Case Study and Insig...Edge AI and Vision Alliance
 
"Designing a Stereo IP Camera From Scratch," a Presentation from ELVEES
"Designing a Stereo IP Camera From Scratch," a Presentation from ELVEES"Designing a Stereo IP Camera From Scratch," a Presentation from ELVEES
"Designing a Stereo IP Camera From Scratch," a Presentation from ELVEESEdge AI and Vision Alliance
 
"Update on Khronos Standards for Vision and Machine Learning," a Presentation...
"Update on Khronos Standards for Vision and Machine Learning," a Presentation..."Update on Khronos Standards for Vision and Machine Learning," a Presentation...
"Update on Khronos Standards for Vision and Machine Learning," a Presentation...Edge AI and Vision Alliance
 
"High-resolution 3D Reconstruction on a Mobile Processor," a Presentation fro...
"High-resolution 3D Reconstruction on a Mobile Processor," a Presentation fro..."High-resolution 3D Reconstruction on a Mobile Processor," a Presentation fro...
"High-resolution 3D Reconstruction on a Mobile Processor," a Presentation fro...Edge AI and Vision Alliance
 
“Memory Allocation in AI and Computer Vision Applications,” a Presentation fr...
“Memory Allocation in AI and Computer Vision Applications,” a Presentation fr...“Memory Allocation in AI and Computer Vision Applications,” a Presentation fr...
“Memory Allocation in AI and Computer Vision Applications,” a Presentation fr...Edge AI and Vision Alliance
 
Enabling Power-Efficient AI Through Quantization
Enabling Power-Efficient AI Through QuantizationEnabling Power-Efficient AI Through Quantization
Enabling Power-Efficient AI Through QuantizationQualcomm Research
 
“Developing Edge Computer Vision Solutions for Applications with Extreme Limi...
“Developing Edge Computer Vision Solutions for Applications with Extreme Limi...“Developing Edge Computer Vision Solutions for Applications with Extreme Limi...
“Developing Edge Computer Vision Solutions for Applications with Extreme Limi...Edge AI and Vision Alliance
 
Recent trends in DCI optical networking
Recent trends in DCI optical networkingRecent trends in DCI optical networking
Recent trends in DCI optical networkingAPNIC
 
GTC Taiwan 2017 自主駕駛車輛發展平台與技術研發
GTC Taiwan 2017 自主駕駛車輛發展平台與技術研發 GTC Taiwan 2017 自主駕駛車輛發展平台與技術研發
GTC Taiwan 2017 自主駕駛車輛發展平台與技術研發 NVIDIA Taiwan
 
Edge Computing Architecture using GPUs and Kubernetes
Edge Computing Architecture using GPUs and KubernetesEdge Computing Architecture using GPUs and Kubernetes
Edge Computing Architecture using GPUs and KubernetesVirtualTech Japan Inc.
 
Imaging automotive 2015 addfor v002
Imaging automotive 2015   addfor v002Imaging automotive 2015   addfor v002
Imaging automotive 2015 addfor v002Enrico Busto
 
New Media Services from a Mobile Chipset Vendor and Standardization Perspective
New Media Services from a Mobile Chipset Vendor and Standardization PerspectiveNew Media Services from a Mobile Chipset Vendor and Standardization Perspective
New Media Services from a Mobile Chipset Vendor and Standardization PerspectiveFörderverein Technische Fakultät
 
COMIT Community Day at O2 - September 2019, Full Slidedeck
COMIT Community Day at O2 - September 2019, Full SlidedeckCOMIT Community Day at O2 - September 2019, Full Slidedeck
COMIT Community Day at O2 - September 2019, Full SlidedeckComit Projects Ltd
 

What's hot (20)

"Pioneering Analog Compute for Edge AI to Overcome the End of Digital Scaling...
"Pioneering Analog Compute for Edge AI to Overcome the End of Digital Scaling..."Pioneering Analog Compute for Edge AI to Overcome the End of Digital Scaling...
"Pioneering Analog Compute for Edge AI to Overcome the End of Digital Scaling...
 
“A Mask Detection Smart Camera Using the Nvidia Jetson Nano: System Architect...
“A Mask Detection Smart Camera Using the Nvidia Jetson Nano: System Architect...“A Mask Detection Smart Camera Using the Nvidia Jetson Nano: System Architect...
“A Mask Detection Smart Camera Using the Nvidia Jetson Nano: System Architect...
 
"Developing Real-time Video Applications with CoaXPress," A Presentation from...
"Developing Real-time Video Applications with CoaXPress," A Presentation from..."Developing Real-time Video Applications with CoaXPress," A Presentation from...
"Developing Real-time Video Applications with CoaXPress," A Presentation from...
 
Deep learning for smart manufacturing
Deep learning for smart manufacturingDeep learning for smart manufacturing
Deep learning for smart manufacturing
 
“Secure Hardware Architecture for Embedded Vision,” a Presentation from Neuro...
“Secure Hardware Architecture for Embedded Vision,” a Presentation from Neuro...“Secure Hardware Architecture for Embedded Vision,” a Presentation from Neuro...
“Secure Hardware Architecture for Embedded Vision,” a Presentation from Neuro...
 
"Can Simulation Solve the Training Data Problem?," a Presentation from Mindtech
"Can Simulation Solve the Training Data Problem?," a Presentation from Mindtech"Can Simulation Solve the Training Data Problem?," a Presentation from Mindtech
"Can Simulation Solve the Training Data Problem?," a Presentation from Mindtech
 
“How Containerization Unblocks Barriers to Fast, Easy Deployment of AI-Driven...
“How Containerization Unblocks Barriers to Fast, Easy Deployment of AI-Driven...“How Containerization Unblocks Barriers to Fast, Easy Deployment of AI-Driven...
“How Containerization Unblocks Barriers to Fast, Easy Deployment of AI-Driven...
 
“Productizing Edge AI Across Applications and Verticals: Case Study and Insig...
“Productizing Edge AI Across Applications and Verticals: Case Study and Insig...“Productizing Edge AI Across Applications and Verticals: Case Study and Insig...
“Productizing Edge AI Across Applications and Verticals: Case Study and Insig...
 
"Designing a Stereo IP Camera From Scratch," a Presentation from ELVEES
"Designing a Stereo IP Camera From Scratch," a Presentation from ELVEES"Designing a Stereo IP Camera From Scratch," a Presentation from ELVEES
"Designing a Stereo IP Camera From Scratch," a Presentation from ELVEES
 
"Update on Khronos Standards for Vision and Machine Learning," a Presentation...
"Update on Khronos Standards for Vision and Machine Learning," a Presentation..."Update on Khronos Standards for Vision and Machine Learning," a Presentation...
"Update on Khronos Standards for Vision and Machine Learning," a Presentation...
 
"High-resolution 3D Reconstruction on a Mobile Processor," a Presentation fro...
"High-resolution 3D Reconstruction on a Mobile Processor," a Presentation fro..."High-resolution 3D Reconstruction on a Mobile Processor," a Presentation fro...
"High-resolution 3D Reconstruction on a Mobile Processor," a Presentation fro...
 
“Memory Allocation in AI and Computer Vision Applications,” a Presentation fr...
“Memory Allocation in AI and Computer Vision Applications,” a Presentation fr...“Memory Allocation in AI and Computer Vision Applications,” a Presentation fr...
“Memory Allocation in AI and Computer Vision Applications,” a Presentation fr...
 
Enabling Power-Efficient AI Through Quantization
Enabling Power-Efficient AI Through QuantizationEnabling Power-Efficient AI Through Quantization
Enabling Power-Efficient AI Through Quantization
 
“Developing Edge Computer Vision Solutions for Applications with Extreme Limi...
“Developing Edge Computer Vision Solutions for Applications with Extreme Limi...“Developing Edge Computer Vision Solutions for Applications with Extreme Limi...
“Developing Edge Computer Vision Solutions for Applications with Extreme Limi...
 
Recent trends in DCI optical networking
Recent trends in DCI optical networkingRecent trends in DCI optical networking
Recent trends in DCI optical networking
 
GTC Taiwan 2017 自主駕駛車輛發展平台與技術研發
GTC Taiwan 2017 自主駕駛車輛發展平台與技術研發 GTC Taiwan 2017 自主駕駛車輛發展平台與技術研發
GTC Taiwan 2017 自主駕駛車輛發展平台與技術研發
 
Edge Computing Architecture using GPUs and Kubernetes
Edge Computing Architecture using GPUs and KubernetesEdge Computing Architecture using GPUs and Kubernetes
Edge Computing Architecture using GPUs and Kubernetes
 
Imaging automotive 2015 addfor v002
Imaging automotive 2015   addfor v002Imaging automotive 2015   addfor v002
Imaging automotive 2015 addfor v002
 
New Media Services from a Mobile Chipset Vendor and Standardization Perspective
New Media Services from a Mobile Chipset Vendor and Standardization PerspectiveNew Media Services from a Mobile Chipset Vendor and Standardization Perspective
New Media Services from a Mobile Chipset Vendor and Standardization Perspective
 
COMIT Community Day at O2 - September 2019, Full Slidedeck
COMIT Community Day at O2 - September 2019, Full SlidedeckCOMIT Community Day at O2 - September 2019, Full Slidedeck
COMIT Community Day at O2 - September 2019, Full Slidedeck
 

Similar to "Separable Convolutions for Efficient Implementation of CNNs and Other Vision Algorithms," a Presentation from Phiar

Graph Gurus Episode 19: Deep Learning Implemented by GSQL on a Native Paralle...
Graph Gurus Episode 19: Deep Learning Implemented by GSQL on a Native Paralle...Graph Gurus Episode 19: Deep Learning Implemented by GSQL on a Native Paralle...
Graph Gurus Episode 19: Deep Learning Implemented by GSQL on a Native Paralle...TigerGraph
 
Convolution Neural Networks and Ensembles for Visually Impaired Aid.pdf
Convolution Neural Networks and Ensembles for Visually Impaired Aid.pdfConvolution Neural Networks and Ensembles for Visually Impaired Aid.pdf
Convolution Neural Networks and Ensembles for Visually Impaired Aid.pdfloliktry1
 
“COVID-19 Safe Distancing Measures in Public Spaces with Edge AI,” a Presenta...
“COVID-19 Safe Distancing Measures in Public Spaces with Edge AI,” a Presenta...“COVID-19 Safe Distancing Measures in Public Spaces with Edge AI,” a Presenta...
“COVID-19 Safe Distancing Measures in Public Spaces with Edge AI,” a Presenta...Edge AI and Vision Alliance
 
IRJET- Generating 3D Models Using 3D Generative Adversarial Network
IRJET- Generating 3D Models Using 3D Generative Adversarial NetworkIRJET- Generating 3D Models Using 3D Generative Adversarial Network
IRJET- Generating 3D Models Using 3D Generative Adversarial NetworkIRJET Journal
 
Questions On The Equation For Regression
Questions On The Equation For RegressionQuestions On The Equation For Regression
Questions On The Equation For RegressionTiffany Sandoval
 
[AAAI2018] Multispectral Transfer Network: Unsupervised Depth Estimation for ...
[AAAI2018] Multispectral Transfer Network: Unsupervised Depth Estimation for ...[AAAI2018] Multispectral Transfer Network: Unsupervised Depth Estimation for ...
[AAAI2018] Multispectral Transfer Network: Unsupervised Depth Estimation for ...KAIST
 
[AAAI2018] Multispectral Transfer Network: Unsupervised Depth Estimation for ...
[AAAI2018] Multispectral Transfer Network: Unsupervised Depth Estimation for ...[AAAI2018] Multispectral Transfer Network: Unsupervised Depth Estimation for ...
[AAAI2018] Multispectral Transfer Network: Unsupervised Depth Estimation for ...KAIST
 
"Designing CNN Algorithms for Real-time Applications," a Presentation from Al...
"Designing CNN Algorithms for Real-time Applications," a Presentation from Al..."Designing CNN Algorithms for Real-time Applications," a Presentation from Al...
"Designing CNN Algorithms for Real-time Applications," a Presentation from Al...Edge AI and Vision Alliance
 
Latency Performance of Encoding with Random Linear Network Coding
Latency Performance of Encoding with Random Linear Network CodingLatency Performance of Encoding with Random Linear Network Coding
Latency Performance of Encoding with Random Linear Network CodingLars Nielsen
 
Improving AI surveillance using Edge Computing
Improving AI surveillance using Edge ComputingImproving AI surveillance using Edge Computing
Improving AI surveillance using Edge ComputingIRJET Journal
 
Udacity-Didi Challenge Finalists
Udacity-Didi Challenge FinalistsUdacity-Didi Challenge Finalists
Udacity-Didi Challenge FinalistsDavid Silver
 
“Improving Power Efficiency for Edge Inferencing with Memory Management Optim...
“Improving Power Efficiency for Edge Inferencing with Memory Management Optim...“Improving Power Efficiency for Edge Inferencing with Memory Management Optim...
“Improving Power Efficiency for Edge Inferencing with Memory Management Optim...Edge AI and Vision Alliance
 
IRJET- Object Detection and Recognition using Single Shot Multi-Box Detector
IRJET- Object Detection and Recognition using Single Shot Multi-Box DetectorIRJET- Object Detection and Recognition using Single Shot Multi-Box Detector
IRJET- Object Detection and Recognition using Single Shot Multi-Box DetectorIRJET Journal
 
SPAR 2015 - Civil Maps Presentation by Sravan Puttagunta
SPAR 2015 - Civil Maps Presentation by Sravan PuttaguntaSPAR 2015 - Civil Maps Presentation by Sravan Puttagunta
SPAR 2015 - Civil Maps Presentation by Sravan PuttaguntaSravan Puttagunta
 
Mehr und schneller ist nicht automatisch besser - data2day, 06.10.16
Mehr und schneller ist nicht automatisch besser - data2day, 06.10.16Mehr und schneller ist nicht automatisch besser - data2day, 06.10.16
Mehr und schneller ist nicht automatisch besser - data2day, 06.10.16Boris Adryan
 
NCGIC The Geospatial Revolution
NCGIC The Geospatial RevolutionNCGIC The Geospatial Revolution
NCGIC The Geospatial RevolutionPeter Batty
 

Similar to "Separable Convolutions for Efficient Implementation of CNNs and Other Vision Algorithms," a Presentation from Phiar (20)

Graph Gurus Episode 19: Deep Learning Implemented by GSQL on a Native Paralle...
Graph Gurus Episode 19: Deep Learning Implemented by GSQL on a Native Paralle...Graph Gurus Episode 19: Deep Learning Implemented by GSQL on a Native Paralle...
Graph Gurus Episode 19: Deep Learning Implemented by GSQL on a Native Paralle...
 
Convolution Neural Networks and Ensembles for Visually Impaired Aid.pdf
Convolution Neural Networks and Ensembles for Visually Impaired Aid.pdfConvolution Neural Networks and Ensembles for Visually Impaired Aid.pdf
Convolution Neural Networks and Ensembles for Visually Impaired Aid.pdf
 
“COVID-19 Safe Distancing Measures in Public Spaces with Edge AI,” a Presenta...
“COVID-19 Safe Distancing Measures in Public Spaces with Edge AI,” a Presenta...“COVID-19 Safe Distancing Measures in Public Spaces with Edge AI,” a Presenta...
“COVID-19 Safe Distancing Measures in Public Spaces with Edge AI,” a Presenta...
 
IRJET- Generating 3D Models Using 3D Generative Adversarial Network
IRJET- Generating 3D Models Using 3D Generative Adversarial NetworkIRJET- Generating 3D Models Using 3D Generative Adversarial Network
IRJET- Generating 3D Models Using 3D Generative Adversarial Network
 
Questions On The Equation For Regression
Questions On The Equation For RegressionQuestions On The Equation For Regression
Questions On The Equation For Regression
 
How to Make Hand Detector on Native Activity with OpenCV
How to Make Hand Detector on Native Activity with OpenCVHow to Make Hand Detector on Native Activity with OpenCV
How to Make Hand Detector on Native Activity with OpenCV
 
[AAAI2018] Multispectral Transfer Network: Unsupervised Depth Estimation for ...
[AAAI2018] Multispectral Transfer Network: Unsupervised Depth Estimation for ...[AAAI2018] Multispectral Transfer Network: Unsupervised Depth Estimation for ...
[AAAI2018] Multispectral Transfer Network: Unsupervised Depth Estimation for ...
 
[AAAI2018] Multispectral Transfer Network: Unsupervised Depth Estimation for ...
[AAAI2018] Multispectral Transfer Network: Unsupervised Depth Estimation for ...[AAAI2018] Multispectral Transfer Network: Unsupervised Depth Estimation for ...
[AAAI2018] Multispectral Transfer Network: Unsupervised Depth Estimation for ...
 
"Designing CNN Algorithms for Real-time Applications," a Presentation from Al...
"Designing CNN Algorithms for Real-time Applications," a Presentation from Al..."Designing CNN Algorithms for Real-time Applications," a Presentation from Al...
"Designing CNN Algorithms for Real-time Applications," a Presentation from Al...
 
Latency Performance of Encoding with Random Linear Network Coding
Latency Performance of Encoding with Random Linear Network CodingLatency Performance of Encoding with Random Linear Network Coding
Latency Performance of Encoding with Random Linear Network Coding
 
PointNet
PointNetPointNet
PointNet
 
Improving AI surveillance using Edge Computing
Improving AI surveillance using Edge ComputingImproving AI surveillance using Edge Computing
Improving AI surveillance using Edge Computing
 
Udacity-Didi Challenge Finalists
Udacity-Didi Challenge FinalistsUdacity-Didi Challenge Finalists
Udacity-Didi Challenge Finalists
 
“Improving Power Efficiency for Edge Inferencing with Memory Management Optim...
“Improving Power Efficiency for Edge Inferencing with Memory Management Optim...“Improving Power Efficiency for Edge Inferencing with Memory Management Optim...
“Improving Power Efficiency for Edge Inferencing with Memory Management Optim...
 
IRJET- Object Detection and Recognition using Single Shot Multi-Box Detector
IRJET- Object Detection and Recognition using Single Shot Multi-Box DetectorIRJET- Object Detection and Recognition using Single Shot Multi-Box Detector
IRJET- Object Detection and Recognition using Single Shot Multi-Box Detector
 
Kk3517971799
Kk3517971799Kk3517971799
Kk3517971799
 
SPAR 2015 - Civil Maps Presentation by Sravan Puttagunta
SPAR 2015 - Civil Maps Presentation by Sravan PuttaguntaSPAR 2015 - Civil Maps Presentation by Sravan Puttagunta
SPAR 2015 - Civil Maps Presentation by Sravan Puttagunta
 
Mehr und schneller ist nicht automatisch besser - data2day, 06.10.16
Mehr und schneller ist nicht automatisch besser - data2day, 06.10.16Mehr und schneller ist nicht automatisch besser - data2day, 06.10.16
Mehr und schneller ist nicht automatisch besser - data2day, 06.10.16
 
Pixel Perfect
Pixel PerfectPixel Perfect
Pixel Perfect
 
NCGIC The Geospatial Revolution
NCGIC The Geospatial RevolutionNCGIC The Geospatial Revolution
NCGIC The Geospatial Revolution
 

More from Edge AI and Vision Alliance

“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...
“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...
“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...Edge AI and Vision Alliance
 
“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...
“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...
“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...Edge AI and Vision Alliance
 
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...Edge AI and Vision Alliance
 
“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...
“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...
“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...Edge AI and Vision Alliance
 
“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...
“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...
“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...Edge AI and Vision Alliance
 
“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...
“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...
“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...Edge AI and Vision Alliance
 
“Vision-language Representations for Robotics,” a Presentation from the Unive...
“Vision-language Representations for Robotics,” a Presentation from the Unive...“Vision-language Representations for Robotics,” a Presentation from the Unive...
“Vision-language Representations for Robotics,” a Presentation from the Unive...Edge AI and Vision Alliance
 
“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights
“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights
“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsightsEdge AI and Vision Alliance
 
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...Edge AI and Vision Alliance
 
“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...
“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...
“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...Edge AI and Vision Alliance
 
“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...
“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...
“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...Edge AI and Vision Alliance
 
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...Edge AI and Vision Alliance
 
“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...
“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...
“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...Edge AI and Vision Alliance
 
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...Edge AI and Vision Alliance
 
“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...
“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...
“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...Edge AI and Vision Alliance
 
“Updating the Edge ML Development Process,” a Presentation from Samsara
“Updating the Edge ML Development Process,” a Presentation from Samsara“Updating the Edge ML Development Process,” a Presentation from Samsara
“Updating the Edge ML Development Process,” a Presentation from SamsaraEdge AI and Vision Alliance
 
“Combating Bias in Production Computer Vision Systems,” a Presentation from R...
“Combating Bias in Production Computer Vision Systems,” a Presentation from R...“Combating Bias in Production Computer Vision Systems,” a Presentation from R...
“Combating Bias in Production Computer Vision Systems,” a Presentation from R...Edge AI and Vision Alliance
 
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...Edge AI and Vision Alliance
 
“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...
“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...
“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...Edge AI and Vision Alliance
 
“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...
“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...
“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...Edge AI and Vision Alliance
 

More from Edge AI and Vision Alliance (20)

“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...
“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...
“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...
 
“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...
“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...
“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...
 
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...
 
“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...
“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...
“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...
 
“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...
“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...
“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...
 
“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...
“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...
“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...
 
“Vision-language Representations for Robotics,” a Presentation from the Unive...
“Vision-language Representations for Robotics,” a Presentation from the Unive...“Vision-language Representations for Robotics,” a Presentation from the Unive...
“Vision-language Representations for Robotics,” a Presentation from the Unive...
 
“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights
“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights
“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights
 
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...
 
“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...
“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...
“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...
 
“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...
“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...
“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...
 
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...
 
“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...
“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...
“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...
 
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...
 
“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...
“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...
“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...
 
“Updating the Edge ML Development Process,” a Presentation from Samsara
“Updating the Edge ML Development Process,” a Presentation from Samsara“Updating the Edge ML Development Process,” a Presentation from Samsara
“Updating the Edge ML Development Process,” a Presentation from Samsara
 
“Combating Bias in Production Computer Vision Systems,” a Presentation from R...
“Combating Bias in Production Computer Vision Systems,” a Presentation from R...“Combating Bias in Production Computer Vision Systems,” a Presentation from R...
“Combating Bias in Production Computer Vision Systems,” a Presentation from R...
 
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...
 
“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...
“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...
“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...
 
“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...
“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...
“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...
 

Recently uploaded

Navigating Identity and Access Management in the Modern Enterprise
Navigating Identity and Access Management in the Modern EnterpriseNavigating Identity and Access Management in the Modern Enterprise
Navigating Identity and Access Management in the Modern EnterpriseWSO2
 
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelDeepika Singh
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityWSO2
 
TEST BANK For Principles of Anatomy and Physiology, 16th Edition by Gerard J....
TEST BANK For Principles of Anatomy and Physiology, 16th Edition by Gerard J....TEST BANK For Principles of Anatomy and Physiology, 16th Edition by Gerard J....
TEST BANK For Principles of Anatomy and Physiology, 16th Edition by Gerard J....rightmanforbloodline
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Victor Rentea
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Stronger Together: Developing an Organizational Strategy for Accessible Desig...
Stronger Together: Developing an Organizational Strategy for Accessible Desig...Stronger Together: Developing an Organizational Strategy for Accessible Desig...
Stronger Together: Developing an Organizational Strategy for Accessible Desig...caitlingebhard1
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native ApplicationsWSO2
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MIND CTI
 
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)Samir Dash
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodJuan lago vázquez
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontologyjohnbeverley2021
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
Decarbonising Commercial Real Estate: The Role of Operational Performance
Decarbonising Commercial Real Estate: The Role of Operational PerformanceDecarbonising Commercial Real Estate: The Role of Operational Performance
Decarbonising Commercial Real Estate: The Role of Operational PerformanceIES VE
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingEdi Saputra
 

Recently uploaded (20)

Navigating Identity and Access Management in the Modern Enterprise
Navigating Identity and Access Management in the Modern EnterpriseNavigating Identity and Access Management in the Modern Enterprise
Navigating Identity and Access Management in the Modern Enterprise
 
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital Adaptability
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
TEST BANK For Principles of Anatomy and Physiology, 16th Edition by Gerard J....
TEST BANK For Principles of Anatomy and Physiology, 16th Edition by Gerard J....TEST BANK For Principles of Anatomy and Physiology, 16th Edition by Gerard J....
TEST BANK For Principles of Anatomy and Physiology, 16th Edition by Gerard J....
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Stronger Together: Developing an Organizational Strategy for Accessible Desig...
Stronger Together: Developing an Organizational Strategy for Accessible Desig...Stronger Together: Developing an Organizational Strategy for Accessible Desig...
Stronger Together: Developing an Organizational Strategy for Accessible Desig...
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontology
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Decarbonising Commercial Real Estate: The Role of Operational Performance
Decarbonising Commercial Real Estate: The Role of Operational PerformanceDecarbonising Commercial Real Estate: The Role of Operational Performance
Decarbonising Commercial Real Estate: The Role of Operational Performance
 
Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 

"Separable Convolutions for Efficient Implementation of CNNs and Other Vision Algorithms," a Presentation from Phiar

  • 1. © 2019 Phiar Technologies, Inc. Separable Convolutions for Efficient Implementation of CNNs and Other Vision Algorithms Chen-Ping Yu, PhD Phiar Technologies, Inc. May 2019
  • 2. © 2019 Phiar Technologies, Inc. Phiar Technologies, Inc. • AI-powered AR navigation platform for driving 2 • First product: AR navigation mobile app • On-device processing with mobile sensors : AI + SLAM + path planning
  • 3. © 2019 Phiar Technologies, Inc. Outline • Spatial convolution in computer vision • Separable convolution in computer vision • Application in deep learning CNNs • Low rank filter expansion (Jaderberg et al., 2014) • Flattened CNN (Jin et al., 2015) • MobileNet (Howard et al., 2017) • Takeaways • Resources 3
  • 4. © 2019 Phiar Technologies, Inc. Spatial convolution 4 • Running a filter through an input image • Smoothing (Gaussian) • Template matching f k Image source: http://deeplearning.net/software/theano_versions/dev/tutorial/conv_arithmetic.html; http://www.cse.psu.edu/~rtc12/CSE486/lecture07.pdf
  • 5. © 2019 Phiar Technologies, Inc. Spatial convolution 5 • Convolve f with a filter k: highest response at the matched locations Image source: http://www.cse.psu.edu/~rtc12/CSE486/lecture07.pdf • When both f and k are normalized (zero mean & unit standard deviation) • Really cross-correlation, but is often called interchangeably
  • 6. © 2019 Phiar Technologies, Inc. Spatial convolution – zero padding, stride = 1 6 f k f k⊗ 0*-1 + 0*0 + 0*1 + 0*-2 + 6*0 + 3*2 + 0*-1 + 4*0 + 2*1 86 3 3 6 4 2 1 8 2 5 6 4 9 3 2 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 -1 0 1 -2 0 2 -1 0 1
  • 7. © 2019 Phiar Technologies, Inc. Spatial convolution – zero padding, stride = 1 7 f k 8 f k⊗ 0*-1 + 0*0 + 0*1 + 6*-2 + 3*0 + 3*2 + 4*-1 + 2*0 + 1*1 -96 3 3 6 4 2 1 8 2 5 6 4 9 3 2 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 -1 0 1 -2 0 2 -1 0 1
  • 8. © 2019 Phiar Technologies, Inc. Spatial convolution – zero padding, stride = 1 8 f k 8 -9 f k⊗ 0*-1 + 0*0 + 0*1 + 3*-2 + 3*0 + 6*2 + 2*-1 + 1*0 + 8*1 126 3 3 6 4 2 1 8 2 5 6 4 9 3 2 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 -1 0 1 -2 0 2 -1 0 1
  • 9. © 2019 Phiar Technologies, Inc. Spatial convolution – zero padding, stride = 1 9 f k 8 -9 12 f k⊗ 0*-1 + 0*0 + 0*1 + 3*-2 + 6*0 + 0*2 + 1*-1 + 8*0 + 0*1 -76 3 3 6 4 2 1 8 2 5 6 4 9 3 2 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 -1 0 1 -2 0 2 -1 0 1
  • 10. © 2019 Phiar Technologies, Inc. Spatial convolution – zero padding, stride = 1 10 f k 8 -9 12 -7 f k⊗ 0*-1 + 6*0 + 3*1 + 0*-2 + 4*0 + 2*2 + 0*-1 + 2*0 + 5*1 12 6 3 3 6 4 2 1 8 2 5 6 4 9 3 2 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 -1 0 1 -2 0 2 -1 0 1
  • 11. © 2019 Phiar Technologies, Inc. Separable convolution 11 Sobel filter (edge detector) ⊗ -1 0 1 -2 0 2 -1 0 1 =-1 0 1 1 2 1 3x3 1x3 3x1 Also equivalent to outer product, or matrix multiplication Turns out, k can be decomposed into a column vector that is convolved by a row vector in 1D kc r
  • 12. © 2019 Phiar Technologies, Inc. = f c r = ( f c ) r f k = f ( c r ) Let k = c r Separable convolution 12 -1 0 1 -2 0 2 -1 0 1 => -1 0 1 1 2 1 6 3 3 6 4 2 1 8 2 5 6 4 9 3 2 1 ⊗ Associativity of convolution: f ( c r ) = ( f c ) r⊗⊗ ⊗ ⊗ ⊗ ⊗ ⊗ ⊗ ⊗ ⊗ ⊗ ⊗ 8 -9 12 -7 12 -5 14 -11 10 -2 2 -15 11 -10 -5 -10 6 3 3 6 4 2 1 8 2 5 6 4 9 3 2 1 ⊗ ⊗ => 8 -9 12 -7 12 -5 14 -11 10 -2 2 -15 11 -10 -5 -10 f f k c r f c r f k⊗ ⊗ ⊗
  • 13. © 2019 Phiar Technologies, Inc. Separable convolution: significantly reduced complexity 13 -1 0 1 -2 0 2 -1 0 1 -1 0 1 1 2 1 6 3 3 6 4 2 1 8 2 5 6 4 9 3 2 1 9 multiplications per pixel Let n = side size of the filter 2D filter: n*n => O(n2) 3D filter: n*n*n => O(n3) 6 3 3 6 4 2 1 8 2 5 6 4 9 3 2 1 ⊗ ⊗ 3x3 ⊗ 3x1 3x1 3 multiplications + 3 multiplications 2D filter: n+n = 2n => O(n) 3D filter: n+n+n = 3n => O(n)
  • 14. © 2019 Phiar Technologies, Inc. Separable convolution example: 3D tumor detection 14 Image source: http://www.cse.psu.edu/~rtc12/CSE486/lecture07.pdf; Yu et al., 2017: http://chenpingyu.org/docs/yu_isbi2014.pdf • Yu et al., Stony Brook U., ISBI 2014 • Intel i7 dual core @ 2.7 Ghz • Head MRI: 256 x 256 x 256 • 8 scales of 3D LoG filters • Regular conv: > 2 hours • Separable conv: < 2 min 2D Laplacian of Gaussian filter “Blob” detector 3D Separable LoG Mult per voxel: from n3 to 9n
  • 15. © 2019 Phiar Technologies, Inc. Deep learning example: Low rank filter expansion • Jaderberg et al., University of Oxford, BMVC 2014 • Reconstruct 3D filters in a pre-trained network with 1D and 2D filters • Approximation 1: use purely 1D filters • Approximation 2: use 1D filters followed by 2D filters • 4-layer CNN; text recognition; 2.5~4.5x speed up; <1% accuracy tradeoff 15 Image source: https://arxiv.org/pdf/1405.3866.pdf 1) 2)
  • 16. © 2019 Phiar Technologies, Inc. Deep learning example: Flatten CNN • Jin et al., Purdue University, ICLR Workshop 2015 • Train a network from scratch with a sequence of 1D filters • Baseline: 3 conv (5x5) + 2 FC layers; swap each conv with 2 flattened set • CIFAR-10/100, MNIST; 2~3.5x speedup at same or better accuracy 16 Image source: https://arxiv.org/pdf/1412.5474.pdf
  • 17. © 2019 Phiar Technologies, Inc. Deep learning example: MobileNet V1 17 • Howard et al., Google Inc., ArXiv 2017 • Depthwise separable convolutions: 3 x 3 x 1 then 1 x 1 x D • 28 layers, a number of variants • ImageNet, Stanford Dogs, Im2GPS, YFCC100M, COCO • At comparable accuracy, 4.2M parameters vs 138M of VGG-16 Image source: https://arxiv.org/pdf/1704.04861.pdf
  • 18. © 2019 Phiar Technologies, Inc. Takeaways - deep learning applicability • Faster inference and training time (~20% faster training on ImageNet) • Computational savings get even more with larger filters (per pixel) • A 3 x 3 x 64 filter: from 576 multiplications down to 70 • A 15 x 15 x 64 filter: from 14,400 multiplications down to 94 • A 35 x 35 x 64 filter: from 78,400 multiplications down to 134 • Allows more contextual information • Allows deeper & wider network, use residual connections to avoid vanishing gradient • Especially good at early layers – reducing large input sizes’ complexity 18
  • 19. © 2019 Phiar Technologies, Inc. Resources, and we are hiring! 19 Relevant Papers & Materials Yu et al., 2014, “3D Blob Based Tumor Detection and Segmentation in MR Images.” http://chenpingyu.org/docs/yu_isbi2014.pdf Jaderberg et al., 2014, “Speeding Up Convolution Neural Networks with Low Rank Expansions.” https://arxiv.org/pdf/1405.3866.pdf Jin et al., 2015, “Flattened Convolutional Neural Networks For Feedforward Acceleration.” https://arxiv.org/pdf/1412.5474.pdf Howard et al., 2017, “MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications.” https://arxiv.org/pdf/1704.04861.pdf Computer Vision Lecture Notes, Penn State University http://www.cse.psu.edu/~rtc12/CSE486/lecture07.pdf Phiar Technologies, Inc. Website https://www.phiar.net Currently Hiring for: • SLAM Engineer • Computer Vision/Deep Learning Engineer • Full-Stack Software Engineer • Product Manager Embedded Vision Summit