Aerial detection part3

•Download as PPTX, PDF•

0 likes•61 views

1. EAST: An Efficient and Accurate Scene Text Detector 2. Towards Multi-class Object Detection in Unconstrained Remote Sensing Imagery

Engineering

Aerial Object Detection
HyeongJun Kwon
2019-2

Contents
2
1. EAST: An Efficient and Accurate Scene Text Detector
2. Towards Multi-class Object Detection in Unconstrained Remote Sensing
Imagery

EAST: An Efficient and Accurate Scene Text Detector
3
Network Overview Pipeline
Input
image
Multi-channel
FCN
Multi-channel
FCN
Multi-oriented
Task-wise boxes

4
EAST: An Efficient and Accurate Scene Text Detector
Main Contributions :
1. propose two stage(step) method : FCN and NMS merging stage
2. pipeline is flexible

5
EAST: An Efficient and Accurate Scene Text Detector
Relate work: PVANet

6
EAST: An Efficient and Accurate Scene Text Detector
Pipeline
input
conv2
conv3
conv1
conv3
merging1
merging2
merging3 For reduce computation cost, using
U-shape not using HyperNet in
PVANet that merge all feature maps
output

EAST: An Efficient and Accurate Scene Text Detector
7
Label Generation

EAST: An Efficient and Accurate Scene Text Detector
8
Label Generation: Score Map Generation
Score map generation eq
𝑟𝑖 = min(D 𝑝𝑖, 𝑝 𝑖 𝑚𝑜𝑑 4 +1 ,
D 𝑝𝑖, 𝑝 𝑖+2 𝑚𝑜𝑑 4 +1
we shrink it by moving its two endpoints inward along the edge
by 0.3𝑟𝑖 and 0.3𝑟 𝑖 𝑚𝑜𝑑 4 +1 espectively.

9
EAST: An Efficient and Accurate Scene Text Detector
𝐿 = 𝐿 𝑠 + 𝜆 𝑔 𝐿 𝑔
Label Generation: Loss
Where Y = 𝐹S is the prediction of the score map, and Y∗ is the
ground truth
𝐿 𝑠 : loss for score map
𝐿 𝑔 : loss for geometry

10
EAST: An Efficient and Accurate Scene Text Detector
𝐿 = 𝐿 𝑠 + 𝜆 𝑔 𝐿 𝑔
Label Generation: Loss
RBOX: 𝐿 𝑔 = 𝐿 𝐴𝐴𝐵𝐵 + 𝜆 𝜃 𝐿 𝜃
𝐿 𝑠 : loss for score map
𝐿 𝑔 : loss for geometry
𝐿 𝑔 = 𝐿QUAD Q, Q∗
= min
Q∈𝑃 𝑄∗
𝑐 𝑖∈CQ
𝑐 𝑖∈CQ
smoothed 𝐿1 𝑐𝑖 − 𝑐𝑖
8 × 𝑁 𝑄∗
QUAD:

11
EAST: An Efficient and Accurate Scene Text Detector
Locality-Aware NMS
Problem: A naïve NMS algorithm runs in 𝑂 𝑛2
where 𝑛 is the number of candidate geometries.
The geometries from nearby pixels tend to be highly correlated.
Solution: locality-aware NMS
𝑎 = WEIGHTEDMERGE 𝑔, 𝑝 , then 𝑎i = V 𝑔 𝑔𝑖 + 𝑉 𝑝 𝑝𝑖 and V 𝑎 = V 𝑔 + V(𝑝)

12
EAST: An Efficient and Accurate Scene Text Detector

Towards Multi-class Object Detection in Unconstrained Remote Sensing
Imagery
13
Network Overview

14
Towards Multi-class Object Detection in Unconstrained Remote Sensing
Imagery
Main Contributions :
1. new joint image cascade and feature pyramid network(ICN and FPN)
2. design a DIN module as a domain adaptation module
3. new loss function to shape rectangles by constraining the angles between the edges to
90 degrees

Towards Multi-class Object Detection in Unconstrained Remote Sensing
Imagery
15
ICN, FPN and Deformable Inception Subnetworks
• Appropriate weights sharing
• Resize image size by bilinear
interpolation
ICN
• The low-level semantic feature
from high resolution
• The high-level semantic feature
from low-level resolution

Towards Multi-class Object Detection in Unconstrained Remote Sensing
Imagery
16
ICN, FPN and Deformable Inception Subnetworks

Towards Multi-class Object Detection in Unconstrained Remote Sensing
Imagery
17
R-RPN
Characteristics:
1. no difference between the front and back of objects
2. initialize anchor by using dimension clustering in YOLO v2
3. use the smooth 𝑙1 loss to regress the four coordinates

Towards Multi-class Object Detection in Unconstrained Remote Sensing
Imagery
18
R-ROI
Characteristics:
1. penalize angles that are not 90 degree
2. initialize anchor by using dimension clustering in YOLO v2
3. use the smooth 𝑙1 loss to regress the four coordinates
penalize angles that are not 90 degree

Towards Multi-class Object Detection in Unconstrained Remote Sensing
Imagery
19
Objective

Towards Multi-class Object Detection in Unconstrained Remote Sensing
Imagery
20
Objective

Towards Multi-class Object Detection in Unconstrained Remote Sensing
Imagery
21
Result

Towards Multi-class Object Detection in Unconstrained Remote Sensing
Imagery
22
Result

Towards Multi-class Object Detection in Unconstrained Remote Sensing
Imagery
23
Result

Reference
24
EAST:
PVANET: Deep but lightweight neural networks for real-time object detection.
Balanced-cross entropy:
Holistically-nested edge detection
Scene text detection via holistic, multi-channel prediction.
U-shape: U-net: Convolu-tional networks for biomedical image segmentation.
Towards Multi-class Object Detection in Unconstrained Remote Sensing Imagery:
Soft-NMS: Improving object detection with one line of code.
IoU distance: Yolo9000: Better, faster, stronger.
DIN: deformable convolutional networks

What's hot

Objects as points (CenterNet) review [CDM]Dongmin Choi

Class Weighted Convolutional Features for Image Retrieval Universitat Politècnica de Catalunya

[unofficial] Pyramid Scene Parsing Network (CVPR 2017)Shunta Saito

Building and road detection from large aerial imageryShunta Saito

fusion of Camera and lidar for autonomous driving IIYu Huang

Camera-based road Lane detection by deep learning IIIYu Huang

VJAI Paper Reading#3-KDD2019-ClusterGCNDat Nguyen

[PaperReview] LightGCN: Simplifying and Powering Graph Convolution Network fo...Zimin Park

Auro tripathy - Localizing with CNNsAuro Tripathy

Matteoli ieee gold_2010_cleangrssieee

Thesis PresentationReuben Feinman

Multi sensor calibration by deep learningYu Huang

Detection focal loss 딥러닝 논문읽기 모임 발표자료taeseon ryu

Urban 3D Semantic Modelling Using Stereo Vision, ICRA 2013Sunando Sengupta

Remote Sensing IEEE 2015 ProjectsVijay Karan

Autonomous deployment for load balancing surface coverage in sensor networksieeepondy

Gnn overviewLouis (Yufeng) Wang

Unsupervised/Self-supervvised visual object trackingYu Huang

Comparative Study of Object Detection AlgorithmsIRJET Journal

Depth Fusion from RGB and Depth Sensors by Deep LearningYu Huang

What's hot (20)

Objects as points (CenterNet) review [CDM]

Class Weighted Convolutional Features for Image Retrieval

[unofficial] Pyramid Scene Parsing Network (CVPR 2017)

Building and road detection from large aerial imagery

fusion of Camera and lidar for autonomous driving II

Camera-based road Lane detection by deep learning III

VJAI Paper Reading#3-KDD2019-ClusterGCN

[PaperReview] LightGCN: Simplifying and Powering Graph Convolution Network fo...

Auro tripathy - Localizing with CNNs

Matteoli ieee gold_2010_clean

Thesis Presentation

Multi sensor calibration by deep learning

Detection focal loss 딥러닝 논문읽기 모임 발표자료

Urban 3D Semantic Modelling Using Stereo Vision, ICRA 2013

Remote Sensing IEEE 2015 Projects

Autonomous deployment for load balancing surface coverage in sensor networks

Gnn overview

Unsupervised/Self-supervvised visual object tracking

Comparative Study of Object Detection Algorithms

Depth Fusion from RGB and Depth Sensors by Deep Learning

Similar to Aerial detection part3

R-FCN : object detection via region-based fully convolutional networksEntrepreneur / Startup

Content-Based Image Retrieval (D2L6 Insight@DCU Machine Learning Workshop 2017)Universitat Politècnica de Catalunya

Convolutional Neural Network for pixel-wise skyline detectionDarian Frajberg

camera-based Lane detection by deep learningYu Huang

object detection paper reviewYoonho Na

Object Detection is a very powerful field.pptxusmanyaseen16

LiDAR-based Autonomous Driving III (by Deep Learning)Yu Huang

Adaptive object detection using adjacency and zoom predictionUniversitat Politècnica de Catalunya

Lidar for Autonomous Driving II (via Deep Learning)Yu Huang

Neural Radiance Fields & Neural Rendering.pdfNavneetPaul2

PCA-SIFT: A More Distinctive Representation for Local Image Descriptorswolf

Conditional Image Generation with PixelCNN Decoderssuga93

D3L4-objects.pdfssusere945ae

IRJET- Remote Sensing Image Retrieval using Convolutional Neural Network with...IRJET Journal

最近の研究情勢についていくために - Deep Learningを中心に - Hiroshi Fukui

Image Retrieval (D4L5 2017 UPC Deep Learning for Computer Vision)Universitat Politècnica de Catalunya

物件偵測與辨識技術CHENHuiMei

Object Pose EstimationArithmer Inc.

Image Segmentation (D3L1 2017 UPC Deep Learning for Computer Vision)Universitat Politècnica de Catalunya

NS - CUK Seminar : V.T.Hoang, Review on "Structure-Aware Transformer for Grap...ssuser4b1f48

Similar to Aerial detection part3 (20)

R-FCN : object detection via region-based fully convolutional networks

Content-Based Image Retrieval (D2L6 Insight@DCU Machine Learning Workshop 2017)

Convolutional Neural Network for pixel-wise skyline detection

camera-based Lane detection by deep learning

object detection paper review

Object Detection is a very powerful field.pptx

LiDAR-based Autonomous Driving III (by Deep Learning)

Adaptive object detection using adjacency and zoom prediction

Lidar for Autonomous Driving II (via Deep Learning)

Neural Radiance Fields & Neural Rendering.pdf

PCA-SIFT: A More Distinctive Representation for Local Image Descriptors

Conditional Image Generation with PixelCNN Decoders

D3L4-objects.pdf

IRJET- Remote Sensing Image Retrieval using Convolutional Neural Network with...

最近の研究情勢についていくために - Deep Learningを中心に -

Image Retrieval (D4L5 2017 UPC Deep Learning for Computer Vision)

物件偵測與辨識技術

Object Pose Estimation

Image Segmentation (D3L1 2017 UPC Deep Learning for Computer Vision)

NS - CUK Seminar : V.T.Hoang, Review on "Structure-Aware Transformer for Grap...

Recently uploaded

Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile

Microscopic Analysis of Ceramic Materials.pptxpurnimasatapathy1234

Roadmap to Membership of RICS - Pathways and RoutesM Maged Hegazy, LLM, MBA, CCP, P3O

Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Dr.Costas Sachpazis

VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130Suhani Kapoor

(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escortsranjana rawat

Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...Christo Ananth

Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile

(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...ranjana rawat

VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130Suhani Kapoor

SPICE PARK APR2024 ( 6,793 SPICE Models )Tsuyoshi Horigome

UNIT - IV - Air Compressors and its Performancesivaprakash250

Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Call Girls in Nagpur High Profile

Coefficient of Thermal Expansion and their Importance.pptxAsutosh Ranjan

APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSKurinjimalarL3

MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLSSIVASHANKAR N

CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete RecordAsst.prof M.Gokilavani

Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service NashikCall Girls in Nagpur High Profile

The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...ranjana rawat

Processing & Properties of Floor and Wall Tiles.pptxpranjaldaimarysona

Recently uploaded (20)

Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts

Microscopic Analysis of Ceramic Materials.pptx

Roadmap to Membership of RICS - Pathways and Routes

Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...

VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130

(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts

Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...

Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts

(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...

VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130

SPICE PARK APR2024 ( 6,793 SPICE Models )

UNIT - IV - Air Compressors and its Performance

Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...

Coefficient of Thermal Expansion and their Importance.pptx

APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS

MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS

CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record

Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik

The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...

Processing & Properties of Floor and Wall Tiles.pptx

Aerial detection part3

1. Aerial Object Detection HyeongJun Kwon 2019-2

2. Contents 2 1. EAST: An Efficient and Accurate Scene Text Detector 2. Towards Multi-class Object Detection in Unconstrained Remote Sensing Imagery

3. EAST: An Efficient and Accurate Scene Text Detector 3 Network Overview Pipeline Input image Multi-channel FCN Multi-channel FCN Multi-oriented Task-wise boxes

4. 4 EAST: An Efficient and Accurate Scene Text Detector Main Contributions : 1. propose two stage(step) method : FCN and NMS merging stage 2. pipeline is flexible

5. 5 EAST: An Efficient and Accurate Scene Text Detector Relate work: PVANet

6. 6 EAST: An Efficient and Accurate Scene Text Detector Pipeline input conv2 conv3 conv1 conv3 merging1 merging2 merging3 For reduce computation cost, using U-shape not using HyperNet in PVANet that merge all feature maps output

7. EAST: An Efficient and Accurate Scene Text Detector 7 Label Generation

8. EAST: An Efficient and Accurate Scene Text Detector 8 Label Generation: Score Map Generation Score map generation eq 𝑟𝑖 = min(D 𝑝𝑖, 𝑝 𝑖 𝑚𝑜𝑑 4 +1 , D 𝑝𝑖, 𝑝 𝑖+2 𝑚𝑜𝑑 4 +1 we shrink it by moving its two endpoints inward along the edge by 0.3𝑟𝑖 and 0.3𝑟 𝑖 𝑚𝑜𝑑 4 +1 espectively.

9. 9 EAST: An Efficient and Accurate Scene Text Detector 𝐿 = 𝐿 𝑠 + 𝜆 𝑔 𝐿 𝑔 Label Generation: Loss Where Y = 𝐹S is the prediction of the score map, and Y∗ is the ground truth 𝐿 𝑠 : loss for score map 𝐿 𝑔 : loss for geometry

10. 10 EAST: An Efficient and Accurate Scene Text Detector 𝐿 = 𝐿 𝑠 + 𝜆 𝑔 𝐿 𝑔 Label Generation: Loss RBOX: 𝐿 𝑔 = 𝐿 𝐴𝐴𝐵𝐵 + 𝜆 𝜃 𝐿 𝜃 𝐿 𝑠 : loss for score map 𝐿 𝑔 : loss for geometry 𝐿 𝑔 = 𝐿QUAD Q, Q∗ = min Q∈𝑃 𝑄∗ 𝑐 𝑖∈CQ 𝑐 𝑖∈CQ smoothed 𝐿1 𝑐𝑖 − 𝑐𝑖 8 × 𝑁 𝑄∗ QUAD:

11. 11 EAST: An Efficient and Accurate Scene Text Detector Locality-Aware NMS Problem: A naïve NMS algorithm runs in 𝑂 𝑛2 where 𝑛 is the number of candidate geometries. The geometries from nearby pixels tend to be highly correlated. Solution: locality-aware NMS 𝑎 = WEIGHTEDMERGE 𝑔, 𝑝 , then 𝑎i = V 𝑔 𝑔𝑖 + 𝑉 𝑝 𝑝𝑖 and V 𝑎 = V 𝑔 + V(𝑝)

12. 12 EAST: An Efficient and Accurate Scene Text Detector

13. Towards Multi-class Object Detection in Unconstrained Remote Sensing Imagery 13 Network Overview

14. 14 Towards Multi-class Object Detection in Unconstrained Remote Sensing Imagery Main Contributions : 1. new joint image cascade and feature pyramid network(ICN and FPN) 2. design a DIN module as a domain adaptation module 3. new loss function to shape rectangles by constraining the angles between the edges to 90 degrees

15. Towards Multi-class Object Detection in Unconstrained Remote Sensing Imagery 15 ICN, FPN and Deformable Inception Subnetworks • Appropriate weights sharing • Resize image size by bilinear interpolation ICN • The low-level semantic feature from high resolution • The high-level semantic feature from low-level resolution

16. Towards Multi-class Object Detection in Unconstrained Remote Sensing Imagery 16 ICN, FPN and Deformable Inception Subnetworks

17. Towards Multi-class Object Detection in Unconstrained Remote Sensing Imagery 17 R-RPN Characteristics: 1. no difference between the front and back of objects 2. initialize anchor by using dimension clustering in YOLO v2 3. use the smooth 𝑙1 loss to regress the four coordinates

18. Towards Multi-class Object Detection in Unconstrained Remote Sensing Imagery 18 R-ROI Characteristics: 1. penalize angles that are not 90 degree 2. initialize anchor by using dimension clustering in YOLO v2 3. use the smooth 𝑙1 loss to regress the four coordinates penalize angles that are not 90 degree

19. Towards Multi-class Object Detection in Unconstrained Remote Sensing Imagery 19 Objective

20. Towards Multi-class Object Detection in Unconstrained Remote Sensing Imagery 20 Objective

21. Towards Multi-class Object Detection in Unconstrained Remote Sensing Imagery 21 Result

22. Towards Multi-class Object Detection in Unconstrained Remote Sensing Imagery 22 Result

23. Towards Multi-class Object Detection in Unconstrained Remote Sensing Imagery 23 Result

24. Reference 24 EAST: PVANET: Deep but lightweight neural networks for real-time object detection. Balanced-cross entropy: Holistically-nested edge detection Scene text detection via holistic, multi-channel prediction. U-shape: U-net: Convolu-tional networks for biomedical image segmentation. Towards Multi-class Object Detection in Unconstrained Remote Sensing Imagery: Soft-NMS: Improving object detection with one line of code. IoU distance: Yolo9000: Better, faster, stronger. DIN: deformable convolutional networks

Editor's Notes

DIN 내부에 deformable convolution을 통해 geometric transformation을 적용하는 것을 도와주고 더욱 offset regression property는 kernel 외부의 object를 localization하는 것을 도움 줍니다.
DIN 내부에 deformable convolution을 통해 geometric transformation을 적용하는 것을 도와주고 더욱 offset regression property는 kernel 외부의 object를 localization하는 것을 도움 줍니다.
DIN 내부에 deformable convolution을 통해 geometric transformation을 적용하는 것을 도와주고 더욱 offset regression property는 kernel 외부의 object를 localization하는 것을 도움 줍니다.
DIN 내부에 deformable convolution을 통해 geometric transformation을 적용하는 것을 도와주고 더욱 offset regression property는 kernel 외부의 object를 localization하는 것을 도움 줍니다.
DIN 내부에 deformable convolution을 통해 geometric transformation을 적용하는 것을 도와주고 더욱 offset regression property는 kernel 외부의 object를 localization하는 것을 도움 줍니다.
DIN 내부에 deformable convolution을 통해 geometric transformation을 적용하는 것을 도와주고 더욱 offset regression property는 kernel 외부의 object를 localization하는 것을 도움 줍니다.
DIN 내부에 deformable convolution을 통해 geometric transformation을 적용하는 것을 도와주고 더욱 offset regression property는 kernel 외부의 object를 localization하는 것을 도움 줍니다.
DIN 내부에 deformable convolution을 통해 geometric transformation을 적용하는 것을 도와주고 더욱 offset regression property는 kernel 외부의 object를 localization하는 것을 도움 줍니다.
DIN 내부에 deformable convolution을 통해 geometric transformation을 적용하는 것을 도와주고 더욱 offset regression property는 kernel 외부의 object를 localization하는 것을 도움 줍니다.

Aerial detection part3

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Similar to Aerial detection part3

Similar to Aerial detection part3 (20)

More from ssuser456ad6

More from ssuser456ad6 (6)

Recently uploaded

Recently uploaded (20)

Aerial detection part3

Editor's Notes