Submit Search
Upload
FPT17: An object detector based on multiscale sliding window search using a fully pipelined binarized CNN on an FPGA
•
2 likes
•
1,491 views
Hiroki Nakahara
Follow
Presentation slide used at the Int'l Conference on Fied Programmable Technology 2017.
Read less
Read more
Engineering
Report
Share
Report
Share
1 of 28
Download now
Download to read offline
Recommended
FCCM2020: High-Throughput Convolutional Neural Network on an FPGA by Customiz...
FCCM2020: High-Throughput Convolutional Neural Network on an FPGA by Customiz...
Hiroki Nakahara
A Random Forest using a Multi-valued Decision Diagram on an FPGa
A Random Forest using a Multi-valued Decision Diagram on an FPGa
Hiroki Nakahara
FPGA2018: A Lightweight YOLOv2: A binarized CNN with a parallel support vecto...
FPGA2018: A Lightweight YOLOv2: A binarized CNN with a parallel support vecto...
Hiroki Nakahara
Batch normalization
Batch normalization
Yuichiro Iio
Deep LearningフレームワークChainerと最近の技術動向
Deep LearningフレームワークChainerと最近の技術動向
Shunta Saito
Naist2015 dec ver1
Naist2015 dec ver1
Hiroki Nakahara
FPL15 talk: Deep Convolutional Neural Network on FPGA
FPL15 talk: Deep Convolutional Neural Network on FPGA
Hiroki Nakahara
ISCAS'18: A Deep Neural Network on the Nested RNS (NRNS) on an FPGA: Applied ...
ISCAS'18: A Deep Neural Network on the Nested RNS (NRNS) on an FPGA: Applied ...
Hiroki Nakahara
Recommended
FCCM2020: High-Throughput Convolutional Neural Network on an FPGA by Customiz...
FCCM2020: High-Throughput Convolutional Neural Network on an FPGA by Customiz...
Hiroki Nakahara
A Random Forest using a Multi-valued Decision Diagram on an FPGa
A Random Forest using a Multi-valued Decision Diagram on an FPGa
Hiroki Nakahara
FPGA2018: A Lightweight YOLOv2: A binarized CNN with a parallel support vecto...
FPGA2018: A Lightweight YOLOv2: A binarized CNN with a parallel support vecto...
Hiroki Nakahara
Batch normalization
Batch normalization
Yuichiro Iio
Deep LearningフレームワークChainerと最近の技術動向
Deep LearningフレームワークChainerと最近の技術動向
Shunta Saito
Naist2015 dec ver1
Naist2015 dec ver1
Hiroki Nakahara
FPL15 talk: Deep Convolutional Neural Network on FPGA
FPL15 talk: Deep Convolutional Neural Network on FPGA
Hiroki Nakahara
ISCAS'18: A Deep Neural Network on the Nested RNS (NRNS) on an FPGA: Applied ...
ISCAS'18: A Deep Neural Network on the Nested RNS (NRNS) on an FPGA: Applied ...
Hiroki Nakahara
A brief introduction to recent segmentation methods
A brief introduction to recent segmentation methods
Shunta Saito
ISMVL2018: A Ternary Weight Binary Input Convolutional Neural Network
ISMVL2018: A Ternary Weight Binary Input Convolutional Neural Network
Hiroki Nakahara
モデル高速化百選
モデル高速化百選
Yusuke Uchida
Convolutional neural networks 이론과 응용
Convolutional neural networks 이론과 응용
홍배 김
Faster R-CNN: Towards real-time object detection with region proposal network...
Faster R-CNN: Towards real-time object detection with region proposal network...
Universitat Politècnica de Catalunya
Towards Machine Comprehension of Spoken Content
Towards Machine Comprehension of Spoken Content
NVIDIA Taiwan
DeepFix: a fully convolutional neural network for predicting human fixations...
DeepFix: a fully convolutional neural network for predicting human fixations...
Universitat Politècnica de Catalunya
モデルアーキテクチャ観点からの高速化2019
モデルアーキテクチャ観点からの高速化2019
Yusuke Uchida
Reducing the dimensionality of data with neural networks
Reducing the dimensionality of data with neural networks
Hakky St
[html5jロボット部 第7回勉強会] Microsoft Cognitive Toolkit (CNTK) Overview
[html5jロボット部 第7回勉強会] Microsoft Cognitive Toolkit (CNTK) Overview
Naoki (Neo) SATO
【DL輪読会】Unpaired Image Super-Resolution Using Pseudo-Supervision
【DL輪読会】Unpaired Image Super-Resolution Using Pseudo-Supervision
Deep Learning JP
Learning where to look: focus and attention in deep vision
Learning where to look: focus and attention in deep vision
Universitat Politècnica de Catalunya
Transformer 動向調査 in 画像認識
Transformer 動向調査 in 画像認識
Kazuki Maeno
Introduction to Chainer Chemistry
Introduction to Chainer Chemistry
Preferred Networks
[251] implementing deep learning using cu dnn
[251] implementing deep learning using cu dnn
NAVER D2
Convolutional Neural Network
Convolutional Neural Network
Junho Cho
Deep Learning Initiative @ NECSTLab
Deep Learning Initiative @ NECSTLab
NECST Lab @ Politecnico di Milano
Deep Learningによる超解像の進歩
Deep Learningによる超解像の進歩
Hiroto Honda
Convolutional neural networks for image classification — evidence from Kaggle...
Convolutional neural networks for image classification — evidence from Kaggle...
Dmytro Mishkin
Synthetic dialogue generation with Deep Learning
Synthetic dialogue generation with Deep Learning
S N
Object extraction from satellite imagery using deep learning
Object extraction from satellite imagery using deep learning
Aly Abdelkareem
Deep Learning AtoC with Image Perspective
Deep Learning AtoC with Image Perspective
Dong Heon Cho
More Related Content
What's hot
A brief introduction to recent segmentation methods
A brief introduction to recent segmentation methods
Shunta Saito
ISMVL2018: A Ternary Weight Binary Input Convolutional Neural Network
ISMVL2018: A Ternary Weight Binary Input Convolutional Neural Network
Hiroki Nakahara
モデル高速化百選
モデル高速化百選
Yusuke Uchida
Convolutional neural networks 이론과 응용
Convolutional neural networks 이론과 응용
홍배 김
Faster R-CNN: Towards real-time object detection with region proposal network...
Faster R-CNN: Towards real-time object detection with region proposal network...
Universitat Politècnica de Catalunya
Towards Machine Comprehension of Spoken Content
Towards Machine Comprehension of Spoken Content
NVIDIA Taiwan
DeepFix: a fully convolutional neural network for predicting human fixations...
DeepFix: a fully convolutional neural network for predicting human fixations...
Universitat Politècnica de Catalunya
モデルアーキテクチャ観点からの高速化2019
モデルアーキテクチャ観点からの高速化2019
Yusuke Uchida
Reducing the dimensionality of data with neural networks
Reducing the dimensionality of data with neural networks
Hakky St
[html5jロボット部 第7回勉強会] Microsoft Cognitive Toolkit (CNTK) Overview
[html5jロボット部 第7回勉強会] Microsoft Cognitive Toolkit (CNTK) Overview
Naoki (Neo) SATO
【DL輪読会】Unpaired Image Super-Resolution Using Pseudo-Supervision
【DL輪読会】Unpaired Image Super-Resolution Using Pseudo-Supervision
Deep Learning JP
Learning where to look: focus and attention in deep vision
Learning where to look: focus and attention in deep vision
Universitat Politècnica de Catalunya
Transformer 動向調査 in 画像認識
Transformer 動向調査 in 画像認識
Kazuki Maeno
Introduction to Chainer Chemistry
Introduction to Chainer Chemistry
Preferred Networks
[251] implementing deep learning using cu dnn
[251] implementing deep learning using cu dnn
NAVER D2
Convolutional Neural Network
Convolutional Neural Network
Junho Cho
Deep Learning Initiative @ NECSTLab
Deep Learning Initiative @ NECSTLab
NECST Lab @ Politecnico di Milano
Deep Learningによる超解像の進歩
Deep Learningによる超解像の進歩
Hiroto Honda
Convolutional neural networks for image classification — evidence from Kaggle...
Convolutional neural networks for image classification — evidence from Kaggle...
Dmytro Mishkin
Synthetic dialogue generation with Deep Learning
Synthetic dialogue generation with Deep Learning
S N
What's hot
(20)
A brief introduction to recent segmentation methods
A brief introduction to recent segmentation methods
ISMVL2018: A Ternary Weight Binary Input Convolutional Neural Network
ISMVL2018: A Ternary Weight Binary Input Convolutional Neural Network
モデル高速化百選
モデル高速化百選
Convolutional neural networks 이론과 응용
Convolutional neural networks 이론과 응용
Faster R-CNN: Towards real-time object detection with region proposal network...
Faster R-CNN: Towards real-time object detection with region proposal network...
Towards Machine Comprehension of Spoken Content
Towards Machine Comprehension of Spoken Content
DeepFix: a fully convolutional neural network for predicting human fixations...
DeepFix: a fully convolutional neural network for predicting human fixations...
モデルアーキテクチャ観点からの高速化2019
モデルアーキテクチャ観点からの高速化2019
Reducing the dimensionality of data with neural networks
Reducing the dimensionality of data with neural networks
[html5jロボット部 第7回勉強会] Microsoft Cognitive Toolkit (CNTK) Overview
[html5jロボット部 第7回勉強会] Microsoft Cognitive Toolkit (CNTK) Overview
【DL輪読会】Unpaired Image Super-Resolution Using Pseudo-Supervision
【DL輪読会】Unpaired Image Super-Resolution Using Pseudo-Supervision
Learning where to look: focus and attention in deep vision
Learning where to look: focus and attention in deep vision
Transformer 動向調査 in 画像認識
Transformer 動向調査 in 画像認識
Introduction to Chainer Chemistry
Introduction to Chainer Chemistry
[251] implementing deep learning using cu dnn
[251] implementing deep learning using cu dnn
Convolutional Neural Network
Convolutional Neural Network
Deep Learning Initiative @ NECSTLab
Deep Learning Initiative @ NECSTLab
Deep Learningによる超解像の進歩
Deep Learningによる超解像の進歩
Convolutional neural networks for image classification — evidence from Kaggle...
Convolutional neural networks for image classification — evidence from Kaggle...
Synthetic dialogue generation with Deep Learning
Synthetic dialogue generation with Deep Learning
Similar to FPT17: An object detector based on multiscale sliding window search using a fully pipelined binarized CNN on an FPGA
Object extraction from satellite imagery using deep learning
Object extraction from satellite imagery using deep learning
Aly Abdelkareem
Deep Learning AtoC with Image Perspective
Deep Learning AtoC with Image Perspective
Dong Heon Cho
Object Detection Beyond Mask R-CNN and RetinaNet I
Object Detection Beyond Mask R-CNN and RetinaNet I
Wanjin Yu
MDEC Data Matters Series: machine learning and Deep Learning, A Primer
MDEC Data Matters Series: machine learning and Deep Learning, A Primer
Poo Kuan Hoong
[CVPR 2018] Utilizing unlabeled or noisy labeled data (classification, detect...
[CVPR 2018] Utilizing unlabeled or noisy labeled data (classification, detect...
NAVER Engineering
Grid is Dead ? Nimrod on the Cloud
Grid is Dead ? Nimrod on the Cloud
Adianto Wibisono
Computer vision for transportation
Computer vision for transportation
Wanjin Yu
Object Detetcion using SSD-MobileNet
Object Detetcion using SSD-MobileNet
IRJET Journal
Data Science, Machine Learning and Neural Networks
Data Science, Machine Learning and Neural Networks
BICA Labs
An Introduction to Deep Learning
An Introduction to Deep Learning
Poo Kuan Hoong
HiPEAC 2019 Workshop - Real-Time Modelling Visual Scenes with Biological Insp...
HiPEAC 2019 Workshop - Real-Time Modelling Visual Scenes with Biological Insp...
Tulipp. Eu
kanimozhi2019.pdf
kanimozhi2019.pdf
AshrafDabbas1
"Enabling Ubiquitous Visual Intelligence Through Deep Learning," a Keynote Pr...
"Enabling Ubiquitous Visual Intelligence Through Deep Learning," a Keynote Pr...
Edge AI and Vision Alliance
Big Data Malaysia - A Primer on Deep Learning
Big Data Malaysia - A Primer on Deep Learning
Poo Kuan Hoong
Deep learning and image analytics using Python by Dr Sanparit
Deep learning and image analytics using Python by Dr Sanparit
BAINIDA
FINAL_Team_4.pptx
FINAL_Team_4.pptx
nitin571047
Object detection with deep learning
Object detection with deep learning
Sushant Shrivastava
Anomaly Detection with Azure and .net
Anomaly Detection with Azure and .net
Marco Parenzan
"A Fast Object Detector for ADAS using Deep Learning," a Presentation from Pa...
"A Fast Object Detector for ADAS using Deep Learning," a Presentation from Pa...
Edge AI and Vision Alliance
Android Malware 2020 (CCCS-CIC-AndMal-2020)
Android Malware 2020 (CCCS-CIC-AndMal-2020)
Indraneel Dabhade
Similar to FPT17: An object detector based on multiscale sliding window search using a fully pipelined binarized CNN on an FPGA
(20)
Object extraction from satellite imagery using deep learning
Object extraction from satellite imagery using deep learning
Deep Learning AtoC with Image Perspective
Deep Learning AtoC with Image Perspective
Object Detection Beyond Mask R-CNN and RetinaNet I
Object Detection Beyond Mask R-CNN and RetinaNet I
MDEC Data Matters Series: machine learning and Deep Learning, A Primer
MDEC Data Matters Series: machine learning and Deep Learning, A Primer
[CVPR 2018] Utilizing unlabeled or noisy labeled data (classification, detect...
[CVPR 2018] Utilizing unlabeled or noisy labeled data (classification, detect...
Grid is Dead ? Nimrod on the Cloud
Grid is Dead ? Nimrod on the Cloud
Computer vision for transportation
Computer vision for transportation
Object Detetcion using SSD-MobileNet
Object Detetcion using SSD-MobileNet
Data Science, Machine Learning and Neural Networks
Data Science, Machine Learning and Neural Networks
An Introduction to Deep Learning
An Introduction to Deep Learning
HiPEAC 2019 Workshop - Real-Time Modelling Visual Scenes with Biological Insp...
HiPEAC 2019 Workshop - Real-Time Modelling Visual Scenes with Biological Insp...
kanimozhi2019.pdf
kanimozhi2019.pdf
"Enabling Ubiquitous Visual Intelligence Through Deep Learning," a Keynote Pr...
"Enabling Ubiquitous Visual Intelligence Through Deep Learning," a Keynote Pr...
Big Data Malaysia - A Primer on Deep Learning
Big Data Malaysia - A Primer on Deep Learning
Deep learning and image analytics using Python by Dr Sanparit
Deep learning and image analytics using Python by Dr Sanparit
FINAL_Team_4.pptx
FINAL_Team_4.pptx
Object detection with deep learning
Object detection with deep learning
Anomaly Detection with Azure and .net
Anomaly Detection with Azure and .net
"A Fast Object Detector for ADAS using Deep Learning," a Presentation from Pa...
"A Fast Object Detector for ADAS using Deep Learning," a Presentation from Pa...
Android Malware 2020 (CCCS-CIC-AndMal-2020)
Android Malware 2020 (CCCS-CIC-AndMal-2020)
More from Hiroki Nakahara
ROS User Group Meeting #28 マルチ深層学習とROS
ROS User Group Meeting #28 マルチ深層学習とROS
Hiroki Nakahara
FPGAX2019
FPGAX2019
Hiroki Nakahara
SBRA2018講演資料
SBRA2018講演資料
Hiroki Nakahara
DSF2018講演スライド
DSF2018講演スライド
Hiroki Nakahara
(公開版)Reconf研2017GUINNESS
(公開版)Reconf研2017GUINNESS
Hiroki Nakahara
(公開版)FPGAエクストリームコンピューティング2017
(公開版)FPGAエクストリームコンピューティング2017
Hiroki Nakahara
2値ディープニューラルネットワークと組込み機器への応用: 開発中のツール紹介
2値ディープニューラルネットワークと組込み機器への応用: 開発中のツール紹介
Hiroki Nakahara
2値化CNN on FPGAでGPUとガチンコバトル(公開版)
2値化CNN on FPGAでGPUとガチンコバトル(公開版)
Hiroki Nakahara
Tensor flow usergroup 2016 (公開版)
Tensor flow usergroup 2016 (公開版)
Hiroki Nakahara
FPGAX2016 ドキュンなFPGA
FPGAX2016 ドキュンなFPGA
Hiroki Nakahara
電波望遠鏡用の分光器をAltera SDK for OpenCL使ってサクッと作ってみた
電波望遠鏡用の分光器をAltera SDK for OpenCL使ってサクッと作ってみた
Hiroki Nakahara
Altera sdk for open cl アンケート集計結果(公開版)
Altera sdk for open cl アンケート集計結果(公開版)
Hiroki Nakahara
Nested RNSを用いたディープニューラルネットワークのFPGA実装
Nested RNSを用いたディープニューラルネットワークのFPGA実装
Hiroki Nakahara
私のファミコンのfpsは530000です。もちろんフルパワーで(以下略
私のファミコンのfpsは530000です。もちろんフルパワーで(以下略
Hiroki Nakahara
Verilog-HDL Tutorial (15) software
Verilog-HDL Tutorial (15) software
Hiroki Nakahara
Verilog-HDL Tutorial (15) hardware
Verilog-HDL Tutorial (15) hardware
Hiroki Nakahara
Verilog-HDL Tutorial (14)
Verilog-HDL Tutorial (14)
Hiroki Nakahara
Verilog-HDL Tutorial (13)
Verilog-HDL Tutorial (13)
Hiroki Nakahara
Verilog-HDL Tutorial (12)
Verilog-HDL Tutorial (12)
Hiroki Nakahara
Verilog-HDL Tutorial (11)
Verilog-HDL Tutorial (11)
Hiroki Nakahara
More from Hiroki Nakahara
(20)
ROS User Group Meeting #28 マルチ深層学習とROS
ROS User Group Meeting #28 マルチ深層学習とROS
FPGAX2019
FPGAX2019
SBRA2018講演資料
SBRA2018講演資料
DSF2018講演スライド
DSF2018講演スライド
(公開版)Reconf研2017GUINNESS
(公開版)Reconf研2017GUINNESS
(公開版)FPGAエクストリームコンピューティング2017
(公開版)FPGAエクストリームコンピューティング2017
2値ディープニューラルネットワークと組込み機器への応用: 開発中のツール紹介
2値ディープニューラルネットワークと組込み機器への応用: 開発中のツール紹介
2値化CNN on FPGAでGPUとガチンコバトル(公開版)
2値化CNN on FPGAでGPUとガチンコバトル(公開版)
Tensor flow usergroup 2016 (公開版)
Tensor flow usergroup 2016 (公開版)
FPGAX2016 ドキュンなFPGA
FPGAX2016 ドキュンなFPGA
電波望遠鏡用の分光器をAltera SDK for OpenCL使ってサクッと作ってみた
電波望遠鏡用の分光器をAltera SDK for OpenCL使ってサクッと作ってみた
Altera sdk for open cl アンケート集計結果(公開版)
Altera sdk for open cl アンケート集計結果(公開版)
Nested RNSを用いたディープニューラルネットワークのFPGA実装
Nested RNSを用いたディープニューラルネットワークのFPGA実装
私のファミコンのfpsは530000です。もちろんフルパワーで(以下略
私のファミコンのfpsは530000です。もちろんフルパワーで(以下略
Verilog-HDL Tutorial (15) software
Verilog-HDL Tutorial (15) software
Verilog-HDL Tutorial (15) hardware
Verilog-HDL Tutorial (15) hardware
Verilog-HDL Tutorial (14)
Verilog-HDL Tutorial (14)
Verilog-HDL Tutorial (13)
Verilog-HDL Tutorial (13)
Verilog-HDL Tutorial (12)
Verilog-HDL Tutorial (12)
Verilog-HDL Tutorial (11)
Verilog-HDL Tutorial (11)
Recently uploaded
Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024
hassan khalil
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
Suhani Kapoor
What are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptx
wendy cai
Microscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptx
purnimasatapathy1234
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
slot gacor bisa pakai pulsa
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
soniya singh
Call Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile service
rehmti665
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
ranjana rawat
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)
Suman Mia
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Christo Ananth
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
9953056974 Low Rate Call Girls In Saket, Delhi NCR
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
Suhani Kapoor
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
Soham Mondal
Introduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptx
upamatechverse
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
ranjana rawat
Extrusion Processes and Their Limitations
Extrusion Processes and Their Limitations
120cr0395
main PPT.pptx of girls hostel security using rfid
main PPT.pptx of girls hostel security using rfid
NikhilNagaraju
Coefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptx
Asutosh Ranjan
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
RajaP95
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
ranjana rawat
Recently uploaded
(20)
Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
What are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptx
Microscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptx
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
Call Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile service
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
Introduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptx
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
Extrusion Processes and Their Limitations
Extrusion Processes and Their Limitations
main PPT.pptx of girls hostel security using rfid
main PPT.pptx of girls hostel security using rfid
Coefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptx
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
FPT17: An object detector based on multiscale sliding window search using a fully pipelined binarized CNN on an FPGA
1.
An Object Detector based on Multiscale Sliding Window Search using a Fully Pipelined Binarized CNN on an FPGA Hiroki Nakahara, Haruyoshi Yonekawa, Shimpei Sato Tokyo Institute of Technology, Japan FPT2017 @Melbourne
2.
Outline • Background • Object detector algorithm •
Fully pipelined Binarized CNN • Experimental results • Conclusion 2
3.
Introduction 3
4.
Convolutional Neural Network (CNN) • Convolutional + fully connected + pooling layers • State‐of‐the‐art
performance in an image recognition task • Widely applicable 4Source: https://www.mathworks.com/discovery/convolutional‐neural‐network.html
5.
Image Recognition Tasks • Classification • answer “category” of the object in an image •
Object Detection • classification + localization • Semantic Segmentation • Object area in pixel level 5 Easy Hard Children
6.
Requirements in Embedded System 6 Cloud Embedded Many classes (1000s) Few classes
(<10) Large workloads Frame rates (15‐30 FPS) High efficiency (Performance/W) Low cost & low power (1W‐5W) Server form factor Custom form factor J. Freeman (Intel), “FPGA Acceleration in the era of high level design”, 2017
7.
Outline • Background • Object detector algorithm •
Fully pipelined Binarized CNN • Experimental results • Conclusion 7
8.
Object Detection Problem • Detecting and classifying multiple objects at the same time • Evaluation criteria (from Pascal VOC): 8 Ground truth annotation Detection results: >50% overlap of bounding box with ground truth One BBox
for each object Confidence value for each object Person (50%) # . # . # . # 1 11 , ∈ ,. ,…, Average Precision (AP):
9.
Proposed Object Detector • Sliding window + Multi‐scaling + Fully pipelined BCNNs 9 ... Multi‐scale images Wrapped Images by Sliding Window Classification by a Fully pipelined Binarized CNN by Non‐maximum Suppression
10.
Sliding Window • It is rectangular region of fixed width and height that “slides” across an image 10
11.
Multi‐Scaling (Pyramid Pooling) • Find objects in images at different scales • Combined with a
sliding window, it can find objects in various locations with the same window size 11
12.
Non‐Maximum Suppression • Given all scored bounding boxes in an image • Rejects a bounding box which overlaps with a higher scoring one considering a threshold 12
13.
Quantification of Iterations 13 q q q q qq... 1st
image 2nd image i-th image ∆ 2 • Trade‐off: Time (Iters), AP, and HW p: Image size (given) q: Window size Δx: Stride → Find good q and Δx
14.
Outline • Background • Object detector algorithm •
Fully pipelined Binarized CNN • Experimental results • Conclusion 14
15.
Binarized CNN 15 x1 w0 (Bias) fsgn(Y) Y z w1 x2 w2 xn wn ... x1 x2
Y ‐1 ‐1 1 ‐1 +1 ‐1 +1 ‐1 ‐1 +1 +1 1 x1 x2 Y 0 0 1 0 1 0 1 0 0 1 1 1
16.
Optimization Techniques • Binary CNN • Multiple fully pipelined architecture 16 Batch normalization free (BNF) [RAW17] Internal FC layer replacement into a binary average pooling [FPL17] [FPL17] H. Nakahara, T. Fujii, S. Sato, ‘’A fully connected layer elimination for a binarized
convolutional neural network on an FPGA,’’ FPL 2017, pp. 1‐4. [RAW17] H. Yonekawa and H. Nakahara, ‘’On‐chip memory based binarized convolutional deep neural network applying batch normalization free technique on an FPGA,’’ IPDPS Workshops 2017, pp. 98‐105.
17.
Dataflow for a 2D Convolutional Operation 17 ... ... ... ... mfeature maps Input maps ... ... Adder Binarized Weights Sign Shift Register Output maps Integer Bias
18.
Dataflow for a 2D Convolutional Operation 18 ... ... ... ... mfeature maps Input maps ... ... Adder Binarized Weights Sign Shift Register Output maps Integer Bias
19.
Dataflow for a 2D Convolutional Operation 19 ... ... ... ... mfeature maps Input maps ... ... Adder Binarized Weights Sign Shift Register Output maps Integer Bias
20.
Pipelined Conv2D Circuit x00 x01 x02
x03 x04 x10 x11 x12 x13 x14 x20 x21 x22 x23 x24 x30 x31 x32 x33 x34 x40 x41 x42 x43 x44 x22 x21 x20 x14 x13 x12 x11 x10 x04 x03 x02 x01 x00 + Binarized Weight Mem. Integer Bias Mem. Write Ctrl. Logic Counter Binarized Feature Maps (L=5, K=3) 9 Binarized MACs (EXNORs + Adder Tree) Sign bit Shift Register (2L+K bits) Read M F.Maps at a time
21.
Used CNN Model 21 Integer Conv2D Binary Conv2D Max Pooling Binary Conv2D Binary Conv2D Binary Conv2D Max Pooling Binary Conv2D Binary Conv2D Binary Conv2D Max Pooling Fully Connect Fully Connect Fully Connect Integer Conv2D Binary Conv2D Max Pooling Binary Conv2D Binary Conv2D Binary Conv2D Max Pooling Binary Conv2D Binary Conv2D Binary Conv2D Average Pooling Fully Connect VGG11 Our VGG • Based on the VGG11 model • 3x3 kernel convolution •
Replacement bottleneck (memory intensive) layers into an average pooling one
22.
Overall Architecture • Weight sharing 22 Pipelined BCNN 1 FIFO Pipelined BCNN 2 FIFO Pipelined BCNN P FIFO AXI4 Bus ... GPIO ARM Processor
DDR Mem Camera Input Image ... Weight Mem
23.
Outline • Background • Object detector algorithm •
Fully pipelined Binarized CNN • Experimental results • Conclusion 23
24.
Implementation Setup • Board: Xilinx Inc. Zynq UltraScale+ MPSoC
zcu102 evaluation board • Zynq UltraScale+ MPSoC FPGA (ZU9EG, 68,250 slices, 269,200 FFs, 1,824 BRAMs, 2,520 DSP48Es) • FPGA design tool: Vivado HLS 2017.2 and Vivado 2017.2 • Timing constraint: 200MHz • Deep learning framework: Chainer 1.24.0 • Dataset: KITTI car detection (moderate) scenario 24
25.
Variation of Fully Pipelined CNNs 25 CNN Parameter Hardware Resource
Accuracy Speed Window Size q Stride ∆X #18Kb BRAMs #FFs #LUTs #DSPs mAP FPS 96x96 24 240 194,930 114,870 0 74.36 11.10 48 71.29 45.30 64x64 16 232 189,820 169,500 0 84.80 8.70 32 82.20 34.95 48x48 12 232 171,850 172,100 0 70.50 7.80 24 64.20 31.65 32x32 8 232 169,930 178,220 0 56.32 8.55 16 52.30 34.20
26.
Comparison with GPU based Detectors 26 0 10 20 30 40 50 60 70 80 90 100 0.01 0.1 1
10 100 mAP(%) Detection Speed (FPS) Ours 29.97 YOLOv2 MV3D(LIDAR) SPD+RPN Deep MANTA RRC FPS Acc (%) RRC 0.27 90.22 Deep MANTA 1.42 90.03 SDP+RPN 2.50 89.90 MV3D (LIDAR) 4.16 79.76 YOLOv2 50.00 28.37 Proposed 34.95 82.20 YOLOv2 (GPU): 250.0 W Proposed(FPGA): 2.5 W
27.
Conclusion • Applied a pipelined binary CNN to an object detector • Multiple pipeline architecture •
Weight sharing • Find good parameters for the KITTI car detection • Better performance and accuracy than GPUs • Future works • Preprocessing to reduce HW and Time • Selective search, BING, etc.. • Post‐processing to adjust bounding boxes • SVR, another CNN, etc.. 27
28.
https://github.com/HirokiNakahara/GUINNESS 28
Download now