SlideShare a Scribd company logo
1 of 24
Aerial Object Detection
HyeongJun Kwon
2019-2
Contents
2
1. EAST: An Efficient and Accurate Scene Text Detector
2. Towards Multi-class Object Detection in Unconstrained Remote Sensing
Imagery
EAST: An Efficient and Accurate Scene Text Detector
3
Network Overview Pipeline
Input
image
Multi-channel
FCN
Multi-channel
FCN
Multi-oriented
Task-wise boxes
4
EAST: An Efficient and Accurate Scene Text Detector
Main Contributions :
1. propose two stage(step) method : FCN and NMS merging stage
2. pipeline is flexible
5
EAST: An Efficient and Accurate Scene Text Detector
Relate work: PVANet
6
EAST: An Efficient and Accurate Scene Text Detector
Pipeline
input
conv2
conv3
conv1
conv3
merging1
merging2
merging3 For reduce computation cost, using
U-shape not using HyperNet in
PVANet that merge all feature maps
output
EAST: An Efficient and Accurate Scene Text Detector
7
Label Generation
EAST: An Efficient and Accurate Scene Text Detector
8
Label Generation: Score Map Generation
Score map generation eq
𝑟𝑖 = min(D 𝑝𝑖, 𝑝 𝑖 𝑚𝑜𝑑 4 +1 ,
D 𝑝𝑖, 𝑝 𝑖+2 𝑚𝑜𝑑 4 +1
we shrink it by moving its two endpoints inward along the edge
by 0.3𝑟𝑖 and 0.3𝑟 𝑖 𝑚𝑜𝑑 4 +1 espectively.
9
EAST: An Efficient and Accurate Scene Text Detector
𝐿 = 𝐿 𝑠 + 𝜆 𝑔 𝐿 𝑔
Label Generation: Loss
Where Y = 𝐹S is the prediction of the score map, and Y∗ is the
ground truth
𝐿 𝑠 : loss for score map
𝐿 𝑔 : loss for geometry
10
EAST: An Efficient and Accurate Scene Text Detector
𝐿 = 𝐿 𝑠 + 𝜆 𝑔 𝐿 𝑔
Label Generation: Loss
RBOX: 𝐿 𝑔 = 𝐿 𝐴𝐴𝐵𝐵 + 𝜆 𝜃 𝐿 𝜃
𝐿 𝑠 : loss for score map
𝐿 𝑔 : loss for geometry
𝐿 𝑔 = 𝐿QUAD Q, Q∗
= min
Q∈𝑃 𝑄∗
𝑐 𝑖∈CQ
𝑐 𝑖∈CQ
smoothed 𝐿1 𝑐𝑖 − 𝑐𝑖
8 × 𝑁 𝑄∗
QUAD:
11
EAST: An Efficient and Accurate Scene Text Detector
Locality-Aware NMS
Problem: A naïve NMS algorithm runs in 𝑂 𝑛2
where 𝑛 is the number of candidate geometries.
The geometries from nearby pixels tend to be highly correlated.
Solution: locality-aware NMS
𝑎 = WEIGHTEDMERGE 𝑔, 𝑝 , then 𝑎i = V 𝑔 𝑔𝑖 + 𝑉 𝑝 𝑝𝑖 and V 𝑎 = V 𝑔 + V(𝑝)
12
EAST: An Efficient and Accurate Scene Text Detector
Towards Multi-class Object Detection in Unconstrained Remote Sensing
Imagery
13
Network Overview
14
Towards Multi-class Object Detection in Unconstrained Remote Sensing
Imagery
Main Contributions :
1. new joint image cascade and feature pyramid network(ICN and FPN)
2. design a DIN module as a domain adaptation module
3. new loss function to shape rectangles by constraining the angles between the edges to
90 degrees
Towards Multi-class Object Detection in Unconstrained Remote Sensing
Imagery
15
ICN, FPN and Deformable Inception Subnetworks
• Appropriate weights sharing
• Resize image size by bilinear
interpolation
ICN
• The low-level semantic feature
from high resolution
• The high-level semantic feature
from low-level resolution
Towards Multi-class Object Detection in Unconstrained Remote Sensing
Imagery
16
ICN, FPN and Deformable Inception Subnetworks
Towards Multi-class Object Detection in Unconstrained Remote Sensing
Imagery
17
R-RPN
Characteristics:
1. no difference between the front and back of objects
2. initialize anchor by using dimension clustering in YOLO v2
3. use the smooth 𝑙1 loss to regress the four coordinates
Towards Multi-class Object Detection in Unconstrained Remote Sensing
Imagery
18
R-ROI
Characteristics:
1. penalize angles that are not 90 degree
2. initialize anchor by using dimension clustering in YOLO v2
3. use the smooth 𝑙1 loss to regress the four coordinates
penalize angles that are not 90 degree
Towards Multi-class Object Detection in Unconstrained Remote Sensing
Imagery
19
Objective
Towards Multi-class Object Detection in Unconstrained Remote Sensing
Imagery
20
Objective
Towards Multi-class Object Detection in Unconstrained Remote Sensing
Imagery
21
Result
Towards Multi-class Object Detection in Unconstrained Remote Sensing
Imagery
22
Result
Towards Multi-class Object Detection in Unconstrained Remote Sensing
Imagery
23
Result
Reference
24
EAST:
PVANET: Deep but lightweight neural networks for real-time object detection.
Balanced-cross entropy:
Holistically-nested edge detection
Scene text detection via holistic, multi-channel prediction.
U-shape: U-net: Convolu-tional networks for biomedical image segmentation.
Towards Multi-class Object Detection in Unconstrained Remote Sensing Imagery:
Soft-NMS: Improving object detection with one line of code.
IoU distance: Yolo9000: Better, faster, stronger.
DIN: deformable convolutional networks

More Related Content

What's hot

Objects as points (CenterNet) review [CDM]
Objects as points (CenterNet) review [CDM]Objects as points (CenterNet) review [CDM]
Objects as points (CenterNet) review [CDM]Dongmin Choi
 
[unofficial] Pyramid Scene Parsing Network (CVPR 2017)
[unofficial] Pyramid Scene Parsing Network (CVPR 2017)[unofficial] Pyramid Scene Parsing Network (CVPR 2017)
[unofficial] Pyramid Scene Parsing Network (CVPR 2017)Shunta Saito
 
Building and road detection from large aerial imagery
Building and road detection from large aerial imageryBuilding and road detection from large aerial imagery
Building and road detection from large aerial imageryShunta Saito
 
fusion of Camera and lidar for autonomous driving II
fusion of Camera and lidar for autonomous driving IIfusion of Camera and lidar for autonomous driving II
fusion of Camera and lidar for autonomous driving IIYu Huang
 
Camera-based road Lane detection by deep learning III
Camera-based road Lane detection by deep learning IIICamera-based road Lane detection by deep learning III
Camera-based road Lane detection by deep learning IIIYu Huang
 
VJAI Paper Reading#3-KDD2019-ClusterGCN
VJAI Paper Reading#3-KDD2019-ClusterGCNVJAI Paper Reading#3-KDD2019-ClusterGCN
VJAI Paper Reading#3-KDD2019-ClusterGCNDat Nguyen
 
[PaperReview] LightGCN: Simplifying and Powering Graph Convolution Network fo...
[PaperReview] LightGCN: Simplifying and Powering Graph Convolution Network fo...[PaperReview] LightGCN: Simplifying and Powering Graph Convolution Network fo...
[PaperReview] LightGCN: Simplifying and Powering Graph Convolution Network fo...Zimin Park
 
Auro tripathy - Localizing with CNNs
Auro tripathy -  Localizing with CNNsAuro tripathy -  Localizing with CNNs
Auro tripathy - Localizing with CNNsAuro Tripathy
 
Matteoli ieee gold_2010_clean
Matteoli ieee gold_2010_cleanMatteoli ieee gold_2010_clean
Matteoli ieee gold_2010_cleangrssieee
 
Multi sensor calibration by deep learning
Multi sensor calibration by deep learningMulti sensor calibration by deep learning
Multi sensor calibration by deep learningYu Huang
 
Detection focal loss 딥러닝 논문읽기 모임 발표자료
Detection focal loss 딥러닝 논문읽기 모임 발표자료Detection focal loss 딥러닝 논문읽기 모임 발표자료
Detection focal loss 딥러닝 논문읽기 모임 발표자료taeseon ryu
 
Urban 3D Semantic Modelling Using Stereo Vision, ICRA 2013
Urban 3D Semantic Modelling Using Stereo Vision, ICRA 2013Urban 3D Semantic Modelling Using Stereo Vision, ICRA 2013
Urban 3D Semantic Modelling Using Stereo Vision, ICRA 2013Sunando Sengupta
 
Remote Sensing IEEE 2015 Projects
Remote Sensing IEEE 2015 ProjectsRemote Sensing IEEE 2015 Projects
Remote Sensing IEEE 2015 ProjectsVijay Karan
 
Autonomous deployment for load balancing surface coverage in sensor networks
Autonomous deployment for load balancing    surface coverage in sensor networksAutonomous deployment for load balancing    surface coverage in sensor networks
Autonomous deployment for load balancing surface coverage in sensor networksieeepondy
 
Unsupervised/Self-supervvised visual object tracking
Unsupervised/Self-supervvised visual object trackingUnsupervised/Self-supervvised visual object tracking
Unsupervised/Self-supervvised visual object trackingYu Huang
 
Comparative Study of Object Detection Algorithms
Comparative Study of Object Detection AlgorithmsComparative Study of Object Detection Algorithms
Comparative Study of Object Detection AlgorithmsIRJET Journal
 
Depth Fusion from RGB and Depth Sensors by Deep Learning
Depth Fusion from RGB and Depth Sensors by Deep LearningDepth Fusion from RGB and Depth Sensors by Deep Learning
Depth Fusion from RGB and Depth Sensors by Deep LearningYu Huang
 

What's hot (20)

Objects as points (CenterNet) review [CDM]
Objects as points (CenterNet) review [CDM]Objects as points (CenterNet) review [CDM]
Objects as points (CenterNet) review [CDM]
 
Class Weighted Convolutional Features for Image Retrieval
Class Weighted Convolutional Features for Image Retrieval Class Weighted Convolutional Features for Image Retrieval
Class Weighted Convolutional Features for Image Retrieval
 
[unofficial] Pyramid Scene Parsing Network (CVPR 2017)
[unofficial] Pyramid Scene Parsing Network (CVPR 2017)[unofficial] Pyramid Scene Parsing Network (CVPR 2017)
[unofficial] Pyramid Scene Parsing Network (CVPR 2017)
 
Building and road detection from large aerial imagery
Building and road detection from large aerial imageryBuilding and road detection from large aerial imagery
Building and road detection from large aerial imagery
 
fusion of Camera and lidar for autonomous driving II
fusion of Camera and lidar for autonomous driving IIfusion of Camera and lidar for autonomous driving II
fusion of Camera and lidar for autonomous driving II
 
Camera-based road Lane detection by deep learning III
Camera-based road Lane detection by deep learning IIICamera-based road Lane detection by deep learning III
Camera-based road Lane detection by deep learning III
 
VJAI Paper Reading#3-KDD2019-ClusterGCN
VJAI Paper Reading#3-KDD2019-ClusterGCNVJAI Paper Reading#3-KDD2019-ClusterGCN
VJAI Paper Reading#3-KDD2019-ClusterGCN
 
[PaperReview] LightGCN: Simplifying and Powering Graph Convolution Network fo...
[PaperReview] LightGCN: Simplifying and Powering Graph Convolution Network fo...[PaperReview] LightGCN: Simplifying and Powering Graph Convolution Network fo...
[PaperReview] LightGCN: Simplifying and Powering Graph Convolution Network fo...
 
Auro tripathy - Localizing with CNNs
Auro tripathy -  Localizing with CNNsAuro tripathy -  Localizing with CNNs
Auro tripathy - Localizing with CNNs
 
Matteoli ieee gold_2010_clean
Matteoli ieee gold_2010_cleanMatteoli ieee gold_2010_clean
Matteoli ieee gold_2010_clean
 
Thesis Presentation
Thesis PresentationThesis Presentation
Thesis Presentation
 
Multi sensor calibration by deep learning
Multi sensor calibration by deep learningMulti sensor calibration by deep learning
Multi sensor calibration by deep learning
 
Detection focal loss 딥러닝 논문읽기 모임 발표자료
Detection focal loss 딥러닝 논문읽기 모임 발표자료Detection focal loss 딥러닝 논문읽기 모임 발표자료
Detection focal loss 딥러닝 논문읽기 모임 발표자료
 
Urban 3D Semantic Modelling Using Stereo Vision, ICRA 2013
Urban 3D Semantic Modelling Using Stereo Vision, ICRA 2013Urban 3D Semantic Modelling Using Stereo Vision, ICRA 2013
Urban 3D Semantic Modelling Using Stereo Vision, ICRA 2013
 
Remote Sensing IEEE 2015 Projects
Remote Sensing IEEE 2015 ProjectsRemote Sensing IEEE 2015 Projects
Remote Sensing IEEE 2015 Projects
 
Autonomous deployment for load balancing surface coverage in sensor networks
Autonomous deployment for load balancing    surface coverage in sensor networksAutonomous deployment for load balancing    surface coverage in sensor networks
Autonomous deployment for load balancing surface coverage in sensor networks
 
Gnn overview
Gnn overviewGnn overview
Gnn overview
 
Unsupervised/Self-supervvised visual object tracking
Unsupervised/Self-supervvised visual object trackingUnsupervised/Self-supervvised visual object tracking
Unsupervised/Self-supervvised visual object tracking
 
Comparative Study of Object Detection Algorithms
Comparative Study of Object Detection AlgorithmsComparative Study of Object Detection Algorithms
Comparative Study of Object Detection Algorithms
 
Depth Fusion from RGB and Depth Sensors by Deep Learning
Depth Fusion from RGB and Depth Sensors by Deep LearningDepth Fusion from RGB and Depth Sensors by Deep Learning
Depth Fusion from RGB and Depth Sensors by Deep Learning
 

Similar to Aerial detection part3

R-FCN : object detection via region-based fully convolutional networks
R-FCN :  object detection via region-based fully convolutional networksR-FCN :  object detection via region-based fully convolutional networks
R-FCN : object detection via region-based fully convolutional networksEntrepreneur / Startup
 
Content-Based Image Retrieval (D2L6 Insight@DCU Machine Learning Workshop 2017)
Content-Based Image Retrieval (D2L6 Insight@DCU Machine Learning Workshop 2017)Content-Based Image Retrieval (D2L6 Insight@DCU Machine Learning Workshop 2017)
Content-Based Image Retrieval (D2L6 Insight@DCU Machine Learning Workshop 2017)Universitat Politècnica de Catalunya
 
Convolutional Neural Network for pixel-wise skyline detection
Convolutional Neural Network for pixel-wise skyline detectionConvolutional Neural Network for pixel-wise skyline detection
Convolutional Neural Network for pixel-wise skyline detectionDarian Frajberg
 
camera-based Lane detection by deep learning
camera-based Lane detection by deep learningcamera-based Lane detection by deep learning
camera-based Lane detection by deep learningYu Huang
 
object detection paper review
object detection paper reviewobject detection paper review
object detection paper reviewYoonho Na
 
Object Detection is a very powerful field.pptx
Object Detection is a very powerful field.pptxObject Detection is a very powerful field.pptx
Object Detection is a very powerful field.pptxusmanyaseen16
 
LiDAR-based Autonomous Driving III (by Deep Learning)
LiDAR-based Autonomous Driving III (by Deep Learning)LiDAR-based Autonomous Driving III (by Deep Learning)
LiDAR-based Autonomous Driving III (by Deep Learning)Yu Huang
 
Lidar for Autonomous Driving II (via Deep Learning)
Lidar for Autonomous Driving II (via Deep Learning)Lidar for Autonomous Driving II (via Deep Learning)
Lidar for Autonomous Driving II (via Deep Learning)Yu Huang
 
Neural Radiance Fields & Neural Rendering.pdf
Neural Radiance Fields & Neural Rendering.pdfNeural Radiance Fields & Neural Rendering.pdf
Neural Radiance Fields & Neural Rendering.pdfNavneetPaul2
 
PCA-SIFT: A More Distinctive Representation for Local Image Descriptors
PCA-SIFT: A More Distinctive Representation for Local Image DescriptorsPCA-SIFT: A More Distinctive Representation for Local Image Descriptors
PCA-SIFT: A More Distinctive Representation for Local Image Descriptorswolf
 
Conditional Image Generation with PixelCNN Decoders
Conditional Image Generation with PixelCNN DecodersConditional Image Generation with PixelCNN Decoders
Conditional Image Generation with PixelCNN Decoderssuga93
 
IRJET- Remote Sensing Image Retrieval using Convolutional Neural Network with...
IRJET- Remote Sensing Image Retrieval using Convolutional Neural Network with...IRJET- Remote Sensing Image Retrieval using Convolutional Neural Network with...
IRJET- Remote Sensing Image Retrieval using Convolutional Neural Network with...IRJET Journal
 
最近の研究情勢についていくために - Deep Learningを中心に -
最近の研究情勢についていくために - Deep Learningを中心に - 最近の研究情勢についていくために - Deep Learningを中心に -
最近の研究情勢についていくために - Deep Learningを中心に - Hiroshi Fukui
 
物件偵測與辨識技術
物件偵測與辨識技術物件偵測與辨識技術
物件偵測與辨識技術CHENHuiMei
 
Object Pose Estimation
Object Pose EstimationObject Pose Estimation
Object Pose EstimationArithmer Inc.
 
Image Segmentation (D3L1 2017 UPC Deep Learning for Computer Vision)
Image Segmentation (D3L1 2017 UPC Deep Learning for Computer Vision)Image Segmentation (D3L1 2017 UPC Deep Learning for Computer Vision)
Image Segmentation (D3L1 2017 UPC Deep Learning for Computer Vision)Universitat Politècnica de Catalunya
 
NS - CUK Seminar : V.T.Hoang, Review on "Structure-Aware Transformer for Grap...
NS - CUK Seminar : V.T.Hoang, Review on "Structure-Aware Transformer for Grap...NS - CUK Seminar : V.T.Hoang, Review on "Structure-Aware Transformer for Grap...
NS - CUK Seminar : V.T.Hoang, Review on "Structure-Aware Transformer for Grap...ssuser4b1f48
 

Similar to Aerial detection part3 (20)

R-FCN : object detection via region-based fully convolutional networks
R-FCN :  object detection via region-based fully convolutional networksR-FCN :  object detection via region-based fully convolutional networks
R-FCN : object detection via region-based fully convolutional networks
 
Content-Based Image Retrieval (D2L6 Insight@DCU Machine Learning Workshop 2017)
Content-Based Image Retrieval (D2L6 Insight@DCU Machine Learning Workshop 2017)Content-Based Image Retrieval (D2L6 Insight@DCU Machine Learning Workshop 2017)
Content-Based Image Retrieval (D2L6 Insight@DCU Machine Learning Workshop 2017)
 
Convolutional Neural Network for pixel-wise skyline detection
Convolutional Neural Network for pixel-wise skyline detectionConvolutional Neural Network for pixel-wise skyline detection
Convolutional Neural Network for pixel-wise skyline detection
 
camera-based Lane detection by deep learning
camera-based Lane detection by deep learningcamera-based Lane detection by deep learning
camera-based Lane detection by deep learning
 
object detection paper review
object detection paper reviewobject detection paper review
object detection paper review
 
Object Detection is a very powerful field.pptx
Object Detection is a very powerful field.pptxObject Detection is a very powerful field.pptx
Object Detection is a very powerful field.pptx
 
LiDAR-based Autonomous Driving III (by Deep Learning)
LiDAR-based Autonomous Driving III (by Deep Learning)LiDAR-based Autonomous Driving III (by Deep Learning)
LiDAR-based Autonomous Driving III (by Deep Learning)
 
Adaptive object detection using adjacency and zoom prediction
Adaptive object detection using adjacency and zoom predictionAdaptive object detection using adjacency and zoom prediction
Adaptive object detection using adjacency and zoom prediction
 
Lidar for Autonomous Driving II (via Deep Learning)
Lidar for Autonomous Driving II (via Deep Learning)Lidar for Autonomous Driving II (via Deep Learning)
Lidar for Autonomous Driving II (via Deep Learning)
 
Neural Radiance Fields & Neural Rendering.pdf
Neural Radiance Fields & Neural Rendering.pdfNeural Radiance Fields & Neural Rendering.pdf
Neural Radiance Fields & Neural Rendering.pdf
 
PCA-SIFT: A More Distinctive Representation for Local Image Descriptors
PCA-SIFT: A More Distinctive Representation for Local Image DescriptorsPCA-SIFT: A More Distinctive Representation for Local Image Descriptors
PCA-SIFT: A More Distinctive Representation for Local Image Descriptors
 
Conditional Image Generation with PixelCNN Decoders
Conditional Image Generation with PixelCNN DecodersConditional Image Generation with PixelCNN Decoders
Conditional Image Generation with PixelCNN Decoders
 
D3L4-objects.pdf
D3L4-objects.pdfD3L4-objects.pdf
D3L4-objects.pdf
 
IRJET- Remote Sensing Image Retrieval using Convolutional Neural Network with...
IRJET- Remote Sensing Image Retrieval using Convolutional Neural Network with...IRJET- Remote Sensing Image Retrieval using Convolutional Neural Network with...
IRJET- Remote Sensing Image Retrieval using Convolutional Neural Network with...
 
最近の研究情勢についていくために - Deep Learningを中心に -
最近の研究情勢についていくために - Deep Learningを中心に - 最近の研究情勢についていくために - Deep Learningを中心に -
最近の研究情勢についていくために - Deep Learningを中心に -
 
Image Retrieval (D4L5 2017 UPC Deep Learning for Computer Vision)
Image Retrieval (D4L5 2017 UPC Deep Learning for Computer Vision)Image Retrieval (D4L5 2017 UPC Deep Learning for Computer Vision)
Image Retrieval (D4L5 2017 UPC Deep Learning for Computer Vision)
 
物件偵測與辨識技術
物件偵測與辨識技術物件偵測與辨識技術
物件偵測與辨識技術
 
Object Pose Estimation
Object Pose EstimationObject Pose Estimation
Object Pose Estimation
 
Image Segmentation (D3L1 2017 UPC Deep Learning for Computer Vision)
Image Segmentation (D3L1 2017 UPC Deep Learning for Computer Vision)Image Segmentation (D3L1 2017 UPC Deep Learning for Computer Vision)
Image Segmentation (D3L1 2017 UPC Deep Learning for Computer Vision)
 
NS - CUK Seminar : V.T.Hoang, Review on "Structure-Aware Transformer for Grap...
NS - CUK Seminar : V.T.Hoang, Review on "Structure-Aware Transformer for Grap...NS - CUK Seminar : V.T.Hoang, Review on "Structure-Aware Transformer for Grap...
NS - CUK Seminar : V.T.Hoang, Review on "Structure-Aware Transformer for Grap...
 

More from ssuser456ad6

Scale invariant feature transform
Scale invariant feature transformScale invariant feature transform
Scale invariant feature transformssuser456ad6
 
Learning joint 2 d 3d representations for depth completion
Learning joint 2 d 3d representations for depth completion Learning joint 2 d 3d representations for depth completion
Learning joint 2 d 3d representations for depth completion ssuser456ad6
 
Guided image filter
Guided image filterGuided image filter
Guided image filterssuser456ad6
 
Fast cost volume filtering for visual correspondence and beyond
Fast cost volume filtering for visual correspondence and beyondFast cost volume filtering for visual correspondence and beyond
Fast cost volume filtering for visual correspondence and beyondssuser456ad6
 
D2 net a trainable cnn for joint description and detection of local features
D2 net a trainable cnn for joint description and detection of local features D2 net a trainable cnn for joint description and detection of local features
D2 net a trainable cnn for joint description and detection of local features ssuser456ad6
 

More from ssuser456ad6 (6)

Scale invariant feature transform
Scale invariant feature transformScale invariant feature transform
Scale invariant feature transform
 
Learning joint 2 d 3d representations for depth completion
Learning joint 2 d 3d representations for depth completion Learning joint 2 d 3d representations for depth completion
Learning joint 2 d 3d representations for depth completion
 
Guided image filter
Guided image filterGuided image filter
Guided image filter
 
Fast cost volume filtering for visual correspondence and beyond
Fast cost volume filtering for visual correspondence and beyondFast cost volume filtering for visual correspondence and beyond
Fast cost volume filtering for visual correspondence and beyond
 
D2 net a trainable cnn for joint description and detection of local features
D2 net a trainable cnn for joint description and detection of local features D2 net a trainable cnn for joint description and detection of local features
D2 net a trainable cnn for joint description and detection of local features
 
Gan dissection
Gan dissectionGan dissection
Gan dissection
 

Recently uploaded

Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
Microscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxMicroscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxpurnimasatapathy1234
 
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Dr.Costas Sachpazis
 
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130Suhani Kapoor
 
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escortsranjana rawat
 
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...Christo Ananth
 
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur EscortsCall Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...ranjana rawat
 
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130Suhani Kapoor
 
SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )Tsuyoshi Horigome
 
UNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its PerformanceUNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its Performancesivaprakash250
 
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Call Girls in Nagpur High Profile
 
Coefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxCoefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxAsutosh Ranjan
 
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSAPPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSKurinjimalarL3
 
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLSMANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLSSIVASHANKAR N
 
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete RecordCCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete RecordAsst.prof M.Gokilavani
 
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service NashikCall Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service NashikCall Girls in Nagpur High Profile
 
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...ranjana rawat
 
Processing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxProcessing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxpranjaldaimarysona
 

Recently uploaded (20)

Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
 
Microscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxMicroscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptx
 
Roadmap to Membership of RICS - Pathways and Routes
Roadmap to Membership of RICS - Pathways and RoutesRoadmap to Membership of RICS - Pathways and Routes
Roadmap to Membership of RICS - Pathways and Routes
 
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
 
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
 
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
 
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
 
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur EscortsCall Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
 
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
 
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
 
SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )
 
UNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its PerformanceUNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its Performance
 
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
 
Coefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxCoefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptx
 
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSAPPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
 
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLSMANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
 
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete RecordCCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
 
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service NashikCall Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
 
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
 
Processing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxProcessing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptx
 

Aerial detection part3

  • 2. Contents 2 1. EAST: An Efficient and Accurate Scene Text Detector 2. Towards Multi-class Object Detection in Unconstrained Remote Sensing Imagery
  • 3. EAST: An Efficient and Accurate Scene Text Detector 3 Network Overview Pipeline Input image Multi-channel FCN Multi-channel FCN Multi-oriented Task-wise boxes
  • 4. 4 EAST: An Efficient and Accurate Scene Text Detector Main Contributions : 1. propose two stage(step) method : FCN and NMS merging stage 2. pipeline is flexible
  • 5. 5 EAST: An Efficient and Accurate Scene Text Detector Relate work: PVANet
  • 6. 6 EAST: An Efficient and Accurate Scene Text Detector Pipeline input conv2 conv3 conv1 conv3 merging1 merging2 merging3 For reduce computation cost, using U-shape not using HyperNet in PVANet that merge all feature maps output
  • 7. EAST: An Efficient and Accurate Scene Text Detector 7 Label Generation
  • 8. EAST: An Efficient and Accurate Scene Text Detector 8 Label Generation: Score Map Generation Score map generation eq 𝑟𝑖 = min(D 𝑝𝑖, 𝑝 𝑖 𝑚𝑜𝑑 4 +1 , D 𝑝𝑖, 𝑝 𝑖+2 𝑚𝑜𝑑 4 +1 we shrink it by moving its two endpoints inward along the edge by 0.3𝑟𝑖 and 0.3𝑟 𝑖 𝑚𝑜𝑑 4 +1 espectively.
  • 9. 9 EAST: An Efficient and Accurate Scene Text Detector 𝐿 = 𝐿 𝑠 + 𝜆 𝑔 𝐿 𝑔 Label Generation: Loss Where Y = 𝐹S is the prediction of the score map, and Y∗ is the ground truth 𝐿 𝑠 : loss for score map 𝐿 𝑔 : loss for geometry
  • 10. 10 EAST: An Efficient and Accurate Scene Text Detector 𝐿 = 𝐿 𝑠 + 𝜆 𝑔 𝐿 𝑔 Label Generation: Loss RBOX: 𝐿 𝑔 = 𝐿 𝐴𝐴𝐵𝐵 + 𝜆 𝜃 𝐿 𝜃 𝐿 𝑠 : loss for score map 𝐿 𝑔 : loss for geometry 𝐿 𝑔 = 𝐿QUAD Q, Q∗ = min Q∈𝑃 𝑄∗ 𝑐 𝑖∈CQ 𝑐 𝑖∈CQ smoothed 𝐿1 𝑐𝑖 − 𝑐𝑖 8 × 𝑁 𝑄∗ QUAD:
  • 11. 11 EAST: An Efficient and Accurate Scene Text Detector Locality-Aware NMS Problem: A naïve NMS algorithm runs in 𝑂 𝑛2 where 𝑛 is the number of candidate geometries. The geometries from nearby pixels tend to be highly correlated. Solution: locality-aware NMS 𝑎 = WEIGHTEDMERGE 𝑔, 𝑝 , then 𝑎i = V 𝑔 𝑔𝑖 + 𝑉 𝑝 𝑝𝑖 and V 𝑎 = V 𝑔 + V(𝑝)
  • 12. 12 EAST: An Efficient and Accurate Scene Text Detector
  • 13. Towards Multi-class Object Detection in Unconstrained Remote Sensing Imagery 13 Network Overview
  • 14. 14 Towards Multi-class Object Detection in Unconstrained Remote Sensing Imagery Main Contributions : 1. new joint image cascade and feature pyramid network(ICN and FPN) 2. design a DIN module as a domain adaptation module 3. new loss function to shape rectangles by constraining the angles between the edges to 90 degrees
  • 15. Towards Multi-class Object Detection in Unconstrained Remote Sensing Imagery 15 ICN, FPN and Deformable Inception Subnetworks • Appropriate weights sharing • Resize image size by bilinear interpolation ICN • The low-level semantic feature from high resolution • The high-level semantic feature from low-level resolution
  • 16. Towards Multi-class Object Detection in Unconstrained Remote Sensing Imagery 16 ICN, FPN and Deformable Inception Subnetworks
  • 17. Towards Multi-class Object Detection in Unconstrained Remote Sensing Imagery 17 R-RPN Characteristics: 1. no difference between the front and back of objects 2. initialize anchor by using dimension clustering in YOLO v2 3. use the smooth 𝑙1 loss to regress the four coordinates
  • 18. Towards Multi-class Object Detection in Unconstrained Remote Sensing Imagery 18 R-ROI Characteristics: 1. penalize angles that are not 90 degree 2. initialize anchor by using dimension clustering in YOLO v2 3. use the smooth 𝑙1 loss to regress the four coordinates penalize angles that are not 90 degree
  • 19. Towards Multi-class Object Detection in Unconstrained Remote Sensing Imagery 19 Objective
  • 20. Towards Multi-class Object Detection in Unconstrained Remote Sensing Imagery 20 Objective
  • 21. Towards Multi-class Object Detection in Unconstrained Remote Sensing Imagery 21 Result
  • 22. Towards Multi-class Object Detection in Unconstrained Remote Sensing Imagery 22 Result
  • 23. Towards Multi-class Object Detection in Unconstrained Remote Sensing Imagery 23 Result
  • 24. Reference 24 EAST: PVANET: Deep but lightweight neural networks for real-time object detection. Balanced-cross entropy: Holistically-nested edge detection Scene text detection via holistic, multi-channel prediction. U-shape: U-net: Convolu-tional networks for biomedical image segmentation. Towards Multi-class Object Detection in Unconstrained Remote Sensing Imagery: Soft-NMS: Improving object detection with one line of code. IoU distance: Yolo9000: Better, faster, stronger. DIN: deformable convolutional networks

Editor's Notes

  1. DIN 내부에 deformable convolution을 통해 geometric transformation을 적용하는 것을 도와주고 더욱 offset regression property는 kernel 외부의 object를 localization하는 것을 도움 줍니다.
  2. DIN 내부에 deformable convolution을 통해 geometric transformation을 적용하는 것을 도와주고 더욱 offset regression property는 kernel 외부의 object를 localization하는 것을 도움 줍니다.
  3. DIN 내부에 deformable convolution을 통해 geometric transformation을 적용하는 것을 도와주고 더욱 offset regression property는 kernel 외부의 object를 localization하는 것을 도움 줍니다.
  4. DIN 내부에 deformable convolution을 통해 geometric transformation을 적용하는 것을 도와주고 더욱 offset regression property는 kernel 외부의 object를 localization하는 것을 도움 줍니다.
  5. DIN 내부에 deformable convolution을 통해 geometric transformation을 적용하는 것을 도와주고 더욱 offset regression property는 kernel 외부의 object를 localization하는 것을 도움 줍니다.
  6. DIN 내부에 deformable convolution을 통해 geometric transformation을 적용하는 것을 도와주고 더욱 offset regression property는 kernel 외부의 object를 localization하는 것을 도움 줍니다.
  7. DIN 내부에 deformable convolution을 통해 geometric transformation을 적용하는 것을 도와주고 더욱 offset regression property는 kernel 외부의 object를 localization하는 것을 도움 줍니다.
  8. DIN 내부에 deformable convolution을 통해 geometric transformation을 적용하는 것을 도와주고 더욱 offset regression property는 kernel 외부의 object를 localization하는 것을 도움 줍니다.
  9. DIN 내부에 deformable convolution을 통해 geometric transformation을 적용하는 것을 도와주고 더욱 offset regression property는 kernel 외부의 object를 localization하는 것을 도움 줍니다.