SlideShare a Scribd company logo
Towards Light-weight and Real-time Line Segment Detection
Geonmo Gu*, Byungsoo Ko*, SeoungHyun Go
Sung-Hyun Lee, Jingeun Lee, Minchul Shin
Naver Search Vision
* Equal contribution
Introduction
Introduction
1. Line Segment Detection?
Line Segment Detection
Purpose: Detect line segments in the given images
Line Segment Detector
(LSD)
Introduction
1. Line Segment Detection?
Applications of LSD
Simultaneous Localization and Mapping (SLAM)1 Relative Pose Estimation2
1 Monocular-vision based SLAM using Line Segments, ICRA07
2 Line-Based Relative Pose Estimation, CVPR11
Introduction
2. Real-time inference is limited
a) Existing methods exploit heavy backbone networks
Backbone
Input Image
Introduction
2. Real-time inference is limited
a) Existing methods exploit heavy backbone networks
Backbone
Input Image
• Dilated ResNet50-based FPN
• Stacked hourglass network
• Atrous residual U-net
Existing Methods
Introduction
2. Real-time inference is limited
a) Existing methods exploit heavy backbone networks
Backbone
Input Image
• Dilated ResNet50-based FPN
• Stacked hourglass network
• Atrous residual U-net
Existing Methods Ours
• Modified MobileNetV2
Introduction
2. Real-time inference is limited
b) Existing methods include multi-module line prediction process
Backbone
Input Image
Introduction
2. Real-time inference is limited
b) Existing methods include multi-module line prediction process
Backbone
Feature
Maps
Input Image
Line
Segments
Introduction
2. Real-time inference is limited
Multi-module Processing
Attraction Field Maps
Top-Down
(TD) Strategy
Backbone
Feature
Maps
Input Image
Line
Segments
Squeeze Module
Compute
Line Proposal
Search for
Candidates
Verify
Candidates
b) Existing methods include multi-module line prediction process
Introduction
2. Real-time inference is limited
Multi-module Processing
Junction Heatmap
Line
Sampler
Line Proposal
Line
Verification
Bottom-Up
(BU) Strategy
Attraction Field Maps
Top-Down
(TD) Strategy
Backbone
Feature
Maps
Input Image
Line
Segments
LoI
Pooling
Squeeze Module
Compute
Line Proposal
Search for
Candidates
Verify
Candidates
b) Existing methods include multi-module line prediction process
Introduction
2. Real-time inference is limited
Multi-module Processing
Junction Heatmap
Line
Sampler
Line Proposal
Line
Verification
Bottom-Up
(BU) Strategy
Attraction Field Maps
Top-Down
(TD) Strategy
Backbone
Feature
Maps
Input Image
Line
Segments
LoI
Pooling
Line Map Center Map
Mixture
of Conv.
Mixture
of Conv.
Line
Generation
Displacement Map
Tri-Point
(TP) Strategy
Point Filter
Module
Line
Segmentation
Squeeze Module
Compute
Line Proposal
Search for
Candidates
Verify
Candidates
b) Existing methods include multi-module line prediction process
Introduction
2. Real-time inference is limited
Multi-module Processing
Junction Heatmap
Line
Sampler
Line Proposal
Line
Verification
Bottom-Up
(BU) Strategy
Attraction Field Maps
Top-Down
(TD) Strategy
Single Module Processing
Ours Line
Generation
Backbone
Feature
Maps
Input Image
Line
Segments
LoI
Pooling
Line Map Center Map
Mixture
of Conv.
Mixture
of Conv.
Line
Generation
Displacement Map
Tri-Point
(TP) Strategy
Point Filter
Module
Line
Segmentation
Squeeze Module
Compute
Line Proposal
Search for
Candidates
Verify
Candidates
Center / Displacement Map
b) Existing methods include multi-module line prediction process
Introduction
2. Real-time inference is limited
Inference speed comparison
Inference speed of backbone and prediction process on GPU are significantly improved
Introduction
3. Mobile-LSD
Motivation
Existing LSDs are limited in real-time inference, especially on mobile devices.
• Exploit heavy backbone networks
• Include multi-module line prediction process
Introduction
3. Mobile-LSD
Motivation
We design an efficient LSD for resource-constrained environments: Mobile LSD (M-LSD)
• Minimize the backbone network and adopt single module of line prediction process
• Present novel training schemes: Segments of Line segment (SoL) and geometric learning schemes
In this paper
Existing LSDs are limited in real-time inference, especially on mobile devices.
• Exploit heavy backbone networks
• Include multi-module line prediction process
Proposed Method
Proposed Method
1. Light-weight Backbone
M-LSD-tiny
M-LSD-tiny
Feature Extractor
Block: 1 11 12 16
Proposed Method
1. Light-weight Backbone
Feature Extractor
1x1 Conv
1x1 Conv
Upscale
C
Skip
connection
Block: 1 11
Block type A (12, 14)
12 16
M-LSD-tiny
Proposed Method
1. Light-weight Backbone
Feature Extractor
1x1 Conv
1x1 Conv
Upscale
C
Skip
connection
3x3 Conv
3x3 Conv
+
Block: 1 11
Block type A (12, 14) Block type B (13, 15)
12 16
M-LSD-tiny
Proposed Method
1. Light-weight Backbone
Feature Extractor
1x1 Conv
1x1 Conv
Upscale
C
Skip
connection
3x3 Conv
3x3 Conv
+
Block: 1 11
Block type A (12, 14) Block type B (13, 15)
3x3 Conv
Dilated rate=5
Block type C (16)
12 16
1x1 Conv
3x3 Conv
M-LSD-tiny
Proposed Method
1. Light-weight Backbone
Feature Extractor
Upscale
1x1 Conv
1x1 Conv
Upscale
C
Skip
connection
3x3 Conv
3x3 Conv
+
Block: 1 11
Block type A (12, 14) Block type B (13, 15)
3x3 Conv
Dilated rate=5
Block type C (16)
12 16
1x1 Conv
3x3 Conv
M-LSD-tiny
Proposed Method
1. Light-weight Backbone
Feature Extractor
Upscale
Final Feature Maps
(H/2 x W/2 x 16)
1x1 Conv
1x1 Conv
Upscale
C
Skip
connection
3x3 Conv
3x3 Conv
+
Block: 1 11
Block type A (12, 14) Block type B (13, 15)
3x3 Conv
Dilated rate=5
Block type C (16)
12 16
Segmentation Maps
(H/2 x W/2 x 2)
Junction map x1
Line map x1
SoL Maps
(H/2 x W/2 x 7)
TP Maps
(H/2 x W/2 x 7)
Displacement
map x4
Center map x1
Length map x1
Degree map x1
1x1 Conv
3x3 Conv
M-LSD-tiny
Proposed Method
1. Light-weight Backbone
Feature Extractor
Upscale
Final Feature Maps
(H/2 x W/2 x 16)
1x1 Conv
1x1 Conv
Upscale
C
Skip
connection
3x3 Conv
3x3 Conv
+
Block: 1 11
Block type A (12, 14) Block type B (13, 15)
3x3 Conv
Dilated rate=5
Block type C (16)
12 16
Line Segments
Segmentation Maps
(H/2 x W/2 x 2)
Junction map x1
Line map x1
SoL Maps
(H/2 x W/2 x 7)
TP Maps
(H/2 x W/2 x 7)
Displacement
map x4
Center map x1
Length map x1
Degree map x1
Line
Generation
1x1 Conv
3x3 Conv
M-LSD-tiny
Proposed Method
1. Light-weight Backbone
Feature Extractor Final Feature Maps
(H/2 x W/2 x 16)
1x1 Conv
1x1 Conv
Upscale†
C
Skip
connection
3x3 Conv
3x3 Conv
+
Block: 1 14
Block type A (15,17,19,21) Block type B (16,18,20,22)
3x3 Conv
Dilated rate=5
Block type C (23)
15 23
Line Segments
Segmentation Maps
(H/2 x W/2 x 2)
Junction map x1
Line map x1
SoL Maps
(H/2 x W/2 x 7)
TP Maps
(H/2 x W/2 x 7)
Displacement
map x4
Center map x1
Length map x1
Degree map x1
Line
Generation
1x1 Conv
3x3 Conv
† denotes that block 15 skips upscale operation.
M-LSD
Proposed Method
1. Light-weight Backbone
Proposed Method
2. Line Segment Representation
Tri-Point (TP) representation
𝑙𝑙𝑠𝑠
𝑑𝑑𝑠𝑠
𝑑𝑑𝑒𝑒
𝑙𝑙𝑐𝑐
𝑙𝑙𝑒𝑒
TP Maps
(H/2 x W/2 x 7)
Displacement
map x4
Center map x1
Length map x1
Degree map x1
Notation: 𝑥𝑥𝑙𝑙𝑠𝑠
, 𝑦𝑦𝑙𝑙𝑠𝑠
denotes the 𝛼𝛼 point, 𝑑𝑑𝑠𝑠(𝑥𝑥𝑙𝑙𝑐𝑐
, 𝑦𝑦𝑙𝑙𝑐𝑐
) and 𝑑𝑑𝑒𝑒(𝑥𝑥𝑙𝑙𝑐𝑐
, 𝑦𝑦𝑙𝑙𝑐𝑐
)
indicate 2D displacements from the center point 𝑙𝑙𝑐𝑐 to the
corresponding start 𝑙𝑙𝑠𝑠 and end 𝑙𝑙𝑒𝑒 points.
Proposed Method
2. Line Segment Representation
Tri-Point (TP) representation
𝑙𝑙𝑠𝑠
𝑑𝑑𝑠𝑠
𝑑𝑑𝑒𝑒
𝑙𝑙𝑐𝑐
𝑙𝑙𝑒𝑒
TP Maps
(H/2 x W/2 x 7)
Displacement
map x4
Center map x1
Length map x1
Degree map x1
• For the center loss, we use pos / neg separated binary
classification loss.
• For the displacement loss, we use smooth L1 loss for
regression learning.
Proposed Method
2. Line Segment Representation
Tri-Point (TP) representation
𝑙𝑙𝑠𝑠
𝑑𝑑𝑠𝑠
𝑑𝑑𝑒𝑒
𝑙𝑙𝑐𝑐
𝑙𝑙𝑒𝑒
TP Maps
(H/2 x W/2 x 7)
Displacement
map x4
Center map x1
Length map x1
Degree map x1
TP representation can be insufficient in cases where,
• A line segment may be too long to manage within the receptive
field size.
• The center points of two distinct line segments are too close to
each other.
Proposed Method
3. SoL Augmentation
Segments of Line segment (SoL) augmentation
SoL Maps
(H/2 x W/2 x 7)
Displacement
map x4
Center map x1
Length map x1
Degree map x1
𝑙𝑙𝑠𝑠
𝑙𝑙𝑒𝑒
𝑙𝑙1
𝑙𝑙0
𝑙𝑙2
SoL augmentation directly increases the number and the size of
line segments.
Proposed Method
3. SoL Augmentation
Segments of Line segment (SoL) augmentation
SoL Maps
(H/2 x W/2 x 7)
Displacement
map x4
Center map x1
Length map x1
Degree map x1
𝑙𝑙𝑠𝑠
𝑙𝑙𝑒𝑒
𝑙𝑙1
𝑙𝑙0
𝑙𝑙2
1. We compute 𝑘𝑘 internally dividing points and separate the line
segments into subparts with overlapping portions.
2. Each subpart is trained as if it is a typical line segment.
Proposed Method
4. Learning with Geometric Information
Matching loss
SoL Maps
(H/2 x W/2 x 7)
TP Maps
(H/2 x W/2 x 7)
Displacement
map x4
Center map x1
Length map x1
Degree map x1
1. Take the endpoints of each prediction and measure the
Euclidean distance 𝑑𝑑() to the endpoints of the GT.
𝑙𝑙𝑠𝑠
𝑙𝑙𝑒𝑒
̂
𝑙𝑙𝑠𝑠
̂
𝑙𝑙𝑒𝑒
𝐶𝐶(̂
𝑙𝑙)
ℒ𝑚𝑚𝑚𝑚𝑚𝑚𝑚𝑚𝑚
ℒ𝑚𝑚𝑚𝑚𝑚𝑚𝑚𝑚𝑚
ℒ𝑚𝑚𝑚𝑚𝑚𝑚𝑚𝑚𝑚
Proposed Method
4. Learning with Geometric Information
Matching loss
SoL Maps
(H/2 x W/2 x 7)
TP Maps
(H/2 x W/2 x 7)
Displacement
map x4
Center map x1
Length map x1
Degree map x1
2. These distances are used to match predicted line segments
with GT line segments that are under a threshold 𝛾𝛾:
𝑙𝑙𝑠𝑠
𝑙𝑙𝑒𝑒
̂
𝑙𝑙𝑠𝑠
̂
𝑙𝑙𝑒𝑒
𝐶𝐶(̂
𝑙𝑙)
ℒ𝑚𝑚𝑚𝑚𝑚𝑚𝑚𝑚𝑚
ℒ𝑚𝑚𝑚𝑚𝑚𝑚𝑚𝑚𝑚
ℒ𝑚𝑚𝑚𝑚𝑚𝑚𝑚𝑚𝑚
Proposed Method
4. Learning with Geometric Information
Matching loss
SoL Maps
(H/2 x W/2 x 7)
TP Maps
(H/2 x W/2 x 7)
Displacement
map x4
Center map x1
Length map x1
Degree map x1
3. Compute the matching loss, which aims to minimize
geometric distance of the matched line segments:
𝑙𝑙𝑠𝑠
𝑙𝑙𝑒𝑒
̂
𝑙𝑙𝑠𝑠
̂
𝑙𝑙𝑒𝑒
𝐶𝐶(̂
𝑙𝑙)
ℒ𝑚𝑚𝑚𝑚𝑚𝑚𝑚𝑚𝑚
ℒ𝑚𝑚𝑚𝑚𝑚𝑚𝑚𝑚𝑚
ℒ𝑚𝑚𝑚𝑚𝑚𝑚𝑚𝑚𝑚
Junction and Line segmentation
ℒ𝑗𝑗𝑗𝑗𝑗𝑗𝑗𝑗
ℒ𝑑𝑑𝑑𝑑𝑑𝑑𝑑𝑑𝑑𝑑𝑑𝑑
ℒ𝑗𝑗𝑗𝑗𝑗𝑗𝑗𝑗
Segmentation Maps
(H/2 x W/2 x 2)
Junction map x1
Line map x1
Center point and displacement vectors are highly related to pixel-
wise junctions and line segments in the segmentation maps.
Proposed Method
4. Learning with Geometric Information
Junction and Line segmentation
ℒ𝑗𝑗𝑗𝑗𝑗𝑗𝑗𝑗
ℒ𝑑𝑑𝑑𝑑𝑑𝑑𝑑𝑑𝑑𝑑𝑑𝑑
ℒ𝑗𝑗𝑗𝑗𝑗𝑗𝑗𝑗
Segmentation Maps
(H/2 x W/2 x 2)
Junction map x1
Line map x1
For the junction and line losses, we use pos / neg separated binary
classification loss.
Proposed Method
4. Learning with Geometric Information
Length and Degree regression
ℒ𝑗𝑗𝑗𝑗𝑗𝑗𝑗𝑗
ℒ𝑑𝑑𝑑𝑑𝑑𝑑𝑑𝑑𝑑𝑑𝑑𝑑
ℒ𝑗𝑗𝑗𝑗𝑗𝑗𝑗𝑗
As displacement vectors can be derived from the length and
degree of line segments, they can be additional geometric cues.
SoL Maps
(H/2 x W/2 x 7)
TP Maps
(H/2 x W/2 x 7)
Displacement
map x4
Center map x1
Length map x1
Degree map x1
Proposed Method
4. Learning with Geometric Information
Length and Degree regression
ℒ𝑗𝑗𝑗𝑗𝑗𝑗𝑗𝑗
ℒ𝑑𝑑𝑑𝑑𝑑𝑑𝑑𝑑𝑑𝑑𝑑𝑑
ℒ𝑗𝑗𝑗𝑗𝑗𝑗𝑗𝑗
For the length and degree losses, we use smooth L1 loss
for regression loss
SoL Maps
(H/2 x W/2 x 7)
TP Maps
(H/2 x W/2 x 7)
Displacement
map x4
Center map x1
Length map x1
Degree map x1
Proposed Method
4. Learning with Geometric Information
Proposed Method
5. Final Loss Functions
Loss for each map
Final Loss Function
SoL Maps
(H/2 x W/2 x 7)
TP Maps
(H/2 x W/2 x 7)
Displacement
map x4
Center map x1
Length map x1
Degree map x1
Segmentation Maps
(H/2 x W/2 x 2)
Junction map x1
Line map x1
Experiments
Experiments
1. Experimental Setting
Dataset
• Wireframe: 5,000 training images, 462 test images
• YorkUrban: 102 test images
Evaluation Metrics
• Heatmap-based metric FH
• Structural average precision (sAP)
• Line matching average precision (LAP)
Optimization
• Tesla V100 GPU
• Input size: 320 or 512
• Used ImageNet pretrained MobileNetV2
Experiments
2. Ablation Study and Interpretability
Baseline and Augmentation
Performance
Experiments
2. Ablation Study and Interpretability
Baseline and Augmentation
Performance Input augmentations
• Horizontal / vertical flips
• Shearing
• Rotation
• Scaling
Experiments
2. Ablation Study and Interpretability
Matching Loss
Saliency map
Performance
Saliency maps generated from TP center map
Experiments
2. Ablation Study and Interpretability
Line & Junction segmentation
Saliency map
Performance
Saliency maps generated from each feature map
Experiments
2. Ablation Study and Interpretability
Length & Degree regression
Saliency map
Performance
Saliency maps generated from each feature map
Experiments
2. Ablation Study and Interpretability
SoL augmentation
Saliency map
Performance
Saliency maps generated from TP center map
Experiments
3. Comparison with Other Methods
Quantitative comparisons with existing LSD methods
Qualitative evaluation
(a) (b) (c) (d)
Ground
truth
M-LSD-tiny-320
M-LSD-512
Experiments
4. Visualization
Inference speed and memory usage on mobile devices
Experiments
5. Deployment on Mobile Devices
We use iPhone (A14 Bionic chipset) and Android phone (Snapdragon 865 chipset), where FP denotes floating point.
Real-time box detection on a mobile device
(a) Input image (b) Line detection (c) Box candidates (d) Box detection
Experiments
6. Applications
The potential of real-time LSD can further to: book scanners,
wireframe to image translation, SLAM, and pose estimation
Experiments
6. Applications
Real-time box detection on a mobile device
The potential of real-time LSD can further to: book scanners,
wireframe to image translation, SLAM, and pose estimation
Conclusion
Conclusion
We design an efficient LSD for resource-constrained environments: Mobile LSD (M-LSD)
• Minimize the backbone network and adopt single module of line prediction process
• Present novel training schemes: Segments of Line segment (SoL) and geometric learning schemes
In this paper
Conclusion
• Compared to TP-LSD-Lite, M-LSD-tiny achieves competitive performance with 2.5% of model size
and an increase of 130.5% in inference speed on GPU.
• Our model runs at 56.8 and 48.6 FPS on Android and iPhone, which is the first real-time method
available on mobile devices.
We achieve
Thank you!
Supplementary Material
Supplementary
- Metrics
Precision and Recall of Line Heat Maps (FH)
1. Given a vectorized representation (line), it generates a confidence heat map.
2. It compares with the ground truth heat map by bipartite matching that treats each pixel independently
as a graph node is ran to match between two heat maps.
3. Then, precision and recall curve is computed according to the matching and confidence of each pixel.
Supplementary
- Metrics
Structural Average Precision (sAP)
• Defined on vectorized wireframes rather than on a heat map.
• Recall is the proportion of the correctly detected line segments (up to a cutoff score) to all the ground
truth line segments.
• Precision is the proportion of the correctly detected line segments above that cutoff to all the detected
line segments.
Supplementary
- Metrics
Line Matching Average Precision (LAP)
• Line Matching Score (LMS)
• Score_theta: the differences in angle and position
• Score_l: the matching degree in length
• LMS = Score_theta X Score_l
• Using LMS to determine true positive, i.e., a detected line segment is considered to be true positive if
LMS > 0.5, we can calculate the LAP on the entire test set
• LAP is defined as the area under the precision recall curve.
Supplementary
- Details of M-LSD
Architecture
Supplementary
- Details of M-LSD
Final feature maps
Supplementary
- Extended Experiments
Ablation Study of Architecture
Supplementary
- Extended Experiments
Needs of Offset Maps
Supplementary
- Extended Experiments
Impact of SoL Augmentation
Supplementary
- Extended Experiments
Impact of SoL Augmentation
Supplementary
- Extended Experiments
Threshold of Matching Loss
Supplementary
- Extended Experiments
Precision and Recall Curve
The End

More Related Content

What's hot

東京都市大学 データ解析入門 5 スパース性と圧縮センシング 2
東京都市大学 データ解析入門 5 スパース性と圧縮センシング 2東京都市大学 データ解析入門 5 スパース性と圧縮センシング 2
東京都市大学 データ解析入門 5 スパース性と圧縮センシング 2
hirokazutanaka
 
非負値行列分解の確率的生成モデルと 多チャネル音源分離への応用 (Generative model in nonnegative matrix facto...
非負値行列分解の確率的生成モデルと多チャネル音源分離への応用 (Generative model in nonnegative matrix facto...非負値行列分解の確率的生成モデルと多チャネル音源分離への応用 (Generative model in nonnegative matrix facto...
非負値行列分解の確率的生成モデルと 多チャネル音源分離への応用 (Generative model in nonnegative matrix facto...
Daichi Kitamura
 
機械学習におけるオンライン確率的最適化の理論
機械学習におけるオンライン確率的最適化の理論機械学習におけるオンライン確率的最適化の理論
機械学習におけるオンライン確率的最適化の理論
Taiji Suzuki
 
3次元レジストレーションの基礎とOpen3Dを用いた3次元点群処理
3次元レジストレーションの基礎とOpen3Dを用いた3次元点群処理3次元レジストレーションの基礎とOpen3Dを用いた3次元点群処理
3次元レジストレーションの基礎とOpen3Dを用いた3次元点群処理
Toru Tamaki
 
[DL Hacks]Semi-Supervised Classification with Graph Convolutional Networks
[DL Hacks]Semi-Supervised Classification with Graph Convolutional Networks[DL Hacks]Semi-Supervised Classification with Graph Convolutional Networks
[DL Hacks]Semi-Supervised Classification with Graph Convolutional Networks
Deep Learning JP
 
ポアソン画像合成
ポアソン画像合成ポアソン画像合成
ポアソン画像合成
Arumaziro
 
Deeplearning輪読会
Deeplearning輪読会Deeplearning輪読会
Deeplearning輪読会
正志 坪坂
 
SLAM開発における課題と対策の一例の紹介
SLAM開発における課題と対策の一例の紹介SLAM開発における課題と対策の一例の紹介
SLAM開発における課題と対策の一例の紹介
miyanegi
 
LSTM (Long short-term memory) 概要
LSTM (Long short-term memory) 概要LSTM (Long short-term memory) 概要
LSTM (Long short-term memory) 概要
Kenji Urai
 
【DL輪読会】Visual Classification via Description from Large Language Models (ICLR...
【DL輪読会】Visual Classification via Description from Large Language Models (ICLR...【DL輪読会】Visual Classification via Description from Large Language Models (ICLR...
【DL輪読会】Visual Classification via Description from Large Language Models (ICLR...
Deep Learning JP
 
画像認識モデルを作るための鉄板レシピ
画像認識モデルを作るための鉄板レシピ画像認識モデルを作るための鉄板レシピ
画像認識モデルを作るための鉄板レシピ
Takahiro Kubo
 
.NETラボ 勉強会 2021年1月 「C#で機械学習」
.NETラボ 勉強会 2021年1月 「C#で機械学習」.NETラボ 勉強会 2021年1月 「C#で機械学習」
.NETラボ 勉強会 2021年1月 「C#で機械学習」
Fujio Kojima
 
論文紹介:Dueling network architectures for deep reinforcement learning
論文紹介:Dueling network architectures for deep reinforcement learning論文紹介:Dueling network architectures for deep reinforcement learning
論文紹介:Dueling network architectures for deep reinforcement learning
Kazuki Adachi
 
Microsoft Malware Classification Challenge 上位手法の紹介 (in Kaggle Study Meetup)
Microsoft Malware Classification Challenge 上位手法の紹介 (in Kaggle Study Meetup)Microsoft Malware Classification Challenge 上位手法の紹介 (in Kaggle Study Meetup)
Microsoft Malware Classification Challenge 上位手法の紹介 (in Kaggle Study Meetup)
Shotaro Sano
 
Fisher Vectorによる画像認識
Fisher Vectorによる画像認識Fisher Vectorによる画像認識
Fisher Vectorによる画像認識
Takao Yamanaka
 
[DL輪読会]Scalable Training of Inference Networks for Gaussian-Process Models
[DL輪読会]Scalable Training of Inference Networks for Gaussian-Process Models[DL輪読会]Scalable Training of Inference Networks for Gaussian-Process Models
[DL輪読会]Scalable Training of Inference Networks for Gaussian-Process Models
Deep Learning JP
 
Spatial Temporal Graph Convolutional Networks for Skeleton-Based Action Recog...
Spatial Temporal Graph Convolutional Networks for Skeleton-Based Action Recog...Spatial Temporal Graph Convolutional Networks for Skeleton-Based Action Recog...
Spatial Temporal Graph Convolutional Networks for Skeleton-Based Action Recog...
yukihiro domae
 
XGBoostからNGBoostまで
XGBoostからNGBoostまでXGBoostからNGBoostまで
XGBoostからNGBoostまで
Tomoki Yoshida
 
主成分分析
主成分分析主成分分析
主成分分析
貴之 八木
 
強化学習アルゴリズムPPOの解説と実験
強化学習アルゴリズムPPOの解説と実験強化学習アルゴリズムPPOの解説と実験
強化学習アルゴリズムPPOの解説と実験
克海 納谷
 

What's hot (20)

東京都市大学 データ解析入門 5 スパース性と圧縮センシング 2
東京都市大学 データ解析入門 5 スパース性と圧縮センシング 2東京都市大学 データ解析入門 5 スパース性と圧縮センシング 2
東京都市大学 データ解析入門 5 スパース性と圧縮センシング 2
 
非負値行列分解の確率的生成モデルと 多チャネル音源分離への応用 (Generative model in nonnegative matrix facto...
非負値行列分解の確率的生成モデルと多チャネル音源分離への応用 (Generative model in nonnegative matrix facto...非負値行列分解の確率的生成モデルと多チャネル音源分離への応用 (Generative model in nonnegative matrix facto...
非負値行列分解の確率的生成モデルと 多チャネル音源分離への応用 (Generative model in nonnegative matrix facto...
 
機械学習におけるオンライン確率的最適化の理論
機械学習におけるオンライン確率的最適化の理論機械学習におけるオンライン確率的最適化の理論
機械学習におけるオンライン確率的最適化の理論
 
3次元レジストレーションの基礎とOpen3Dを用いた3次元点群処理
3次元レジストレーションの基礎とOpen3Dを用いた3次元点群処理3次元レジストレーションの基礎とOpen3Dを用いた3次元点群処理
3次元レジストレーションの基礎とOpen3Dを用いた3次元点群処理
 
[DL Hacks]Semi-Supervised Classification with Graph Convolutional Networks
[DL Hacks]Semi-Supervised Classification with Graph Convolutional Networks[DL Hacks]Semi-Supervised Classification with Graph Convolutional Networks
[DL Hacks]Semi-Supervised Classification with Graph Convolutional Networks
 
ポアソン画像合成
ポアソン画像合成ポアソン画像合成
ポアソン画像合成
 
Deeplearning輪読会
Deeplearning輪読会Deeplearning輪読会
Deeplearning輪読会
 
SLAM開発における課題と対策の一例の紹介
SLAM開発における課題と対策の一例の紹介SLAM開発における課題と対策の一例の紹介
SLAM開発における課題と対策の一例の紹介
 
LSTM (Long short-term memory) 概要
LSTM (Long short-term memory) 概要LSTM (Long short-term memory) 概要
LSTM (Long short-term memory) 概要
 
【DL輪読会】Visual Classification via Description from Large Language Models (ICLR...
【DL輪読会】Visual Classification via Description from Large Language Models (ICLR...【DL輪読会】Visual Classification via Description from Large Language Models (ICLR...
【DL輪読会】Visual Classification via Description from Large Language Models (ICLR...
 
画像認識モデルを作るための鉄板レシピ
画像認識モデルを作るための鉄板レシピ画像認識モデルを作るための鉄板レシピ
画像認識モデルを作るための鉄板レシピ
 
.NETラボ 勉強会 2021年1月 「C#で機械学習」
.NETラボ 勉強会 2021年1月 「C#で機械学習」.NETラボ 勉強会 2021年1月 「C#で機械学習」
.NETラボ 勉強会 2021年1月 「C#で機械学習」
 
論文紹介:Dueling network architectures for deep reinforcement learning
論文紹介:Dueling network architectures for deep reinforcement learning論文紹介:Dueling network architectures for deep reinforcement learning
論文紹介:Dueling network architectures for deep reinforcement learning
 
Microsoft Malware Classification Challenge 上位手法の紹介 (in Kaggle Study Meetup)
Microsoft Malware Classification Challenge 上位手法の紹介 (in Kaggle Study Meetup)Microsoft Malware Classification Challenge 上位手法の紹介 (in Kaggle Study Meetup)
Microsoft Malware Classification Challenge 上位手法の紹介 (in Kaggle Study Meetup)
 
Fisher Vectorによる画像認識
Fisher Vectorによる画像認識Fisher Vectorによる画像認識
Fisher Vectorによる画像認識
 
[DL輪読会]Scalable Training of Inference Networks for Gaussian-Process Models
[DL輪読会]Scalable Training of Inference Networks for Gaussian-Process Models[DL輪読会]Scalable Training of Inference Networks for Gaussian-Process Models
[DL輪読会]Scalable Training of Inference Networks for Gaussian-Process Models
 
Spatial Temporal Graph Convolutional Networks for Skeleton-Based Action Recog...
Spatial Temporal Graph Convolutional Networks for Skeleton-Based Action Recog...Spatial Temporal Graph Convolutional Networks for Skeleton-Based Action Recog...
Spatial Temporal Graph Convolutional Networks for Skeleton-Based Action Recog...
 
XGBoostからNGBoostまで
XGBoostからNGBoostまでXGBoostからNGBoostまで
XGBoostからNGBoostまで
 
主成分分析
主成分分析主成分分析
主成分分析
 
強化学習アルゴリズムPPOの解説と実験
強化学習アルゴリズムPPOの解説と実験強化学習アルゴリズムPPOの解説と実験
強化学習アルゴリズムPPOの解説と実験
 

Similar to Towards Light-weight and Real-time Line Segment Detection

Area, Delay and Power Comparison of Adder Topologies
Area, Delay and Power Comparison of Adder TopologiesArea, Delay and Power Comparison of Adder Topologies
Area, Delay and Power Comparison of Adder Topologies
VLSICS Design
 
AN EFFICIENT APPROACH FOR FOUR-LAYER CHANNEL ROUTING IN VLSI DESIGN
AN EFFICIENT APPROACH FOR FOUR-LAYER CHANNEL ROUTING IN VLSI DESIGNAN EFFICIENT APPROACH FOR FOUR-LAYER CHANNEL ROUTING IN VLSI DESIGN
AN EFFICIENT APPROACH FOR FOUR-LAYER CHANNEL ROUTING IN VLSI DESIGN
VLSICS Design
 
ENABLING RAY TRACING FOR 5G
ENABLING RAY TRACING FOR 5GENABLING RAY TRACING FOR 5G
ENABLING RAY TRACING FOR 5G
IJCNCJournal
 
Enabling Ray Tracing for 5G
Enabling Ray Tracing for 5GEnabling Ray Tracing for 5G
Enabling Ray Tracing for 5G
IJCNCJournal
 
3D METALLIC PLATE LENS ANTENNA BASED BEAMSPACE CHANNEL ESTIMATION TECHNIQUE F...
3D METALLIC PLATE LENS ANTENNA BASED BEAMSPACE CHANNEL ESTIMATION TECHNIQUE F...3D METALLIC PLATE LENS ANTENNA BASED BEAMSPACE CHANNEL ESTIMATION TECHNIQUE F...
3D METALLIC PLATE LENS ANTENNA BASED BEAMSPACE CHANNEL ESTIMATION TECHNIQUE F...
ijwmn
 
3D METALLIC PLATE LENS ANTENNA BASED BEAMSPACE CHANNEL ESTIMATION TECHNIQUE F...
3D METALLIC PLATE LENS ANTENNA BASED BEAMSPACE CHANNEL ESTIMATION TECHNIQUE F...3D METALLIC PLATE LENS ANTENNA BASED BEAMSPACE CHANNEL ESTIMATION TECHNIQUE F...
3D METALLIC PLATE LENS ANTENNA BASED BEAMSPACE CHANNEL ESTIMATION TECHNIQUE F...
ijwmn
 
COMPUTER NETWORKS CHAPTER 3 NETWORK LAYER NOTES CSE 3RD year sem 1
COMPUTER NETWORKS CHAPTER 3 NETWORK LAYER NOTES CSE 3RD year sem 1COMPUTER NETWORKS CHAPTER 3 NETWORK LAYER NOTES CSE 3RD year sem 1
COMPUTER NETWORKS CHAPTER 3 NETWORK LAYER NOTES CSE 3RD year sem 1
aishwaryaarrao3
 
CHANNEL ESTIMATION AND MULTIUSER DETECTION IN ASYNCHRONOUS SATELLITE COMMUNIC...
CHANNEL ESTIMATION AND MULTIUSER DETECTION IN ASYNCHRONOUS SATELLITE COMMUNIC...CHANNEL ESTIMATION AND MULTIUSER DETECTION IN ASYNCHRONOUS SATELLITE COMMUNIC...
CHANNEL ESTIMATION AND MULTIUSER DETECTION IN ASYNCHRONOUS SATELLITE COMMUNIC...
ijwmn
 
VTC-location based channel estimation for massive full-dimensional MIMO systems
VTC-location based channel estimation for massive full-dimensional MIMO systemsVTC-location based channel estimation for massive full-dimensional MIMO systems
VTC-location based channel estimation for massive full-dimensional MIMO systemsQian Han
 
Methods for comparing scanpaths and saliency maps: strengths and weaknesses
Methods for comparing scanpaths and saliency maps: strengths and weaknessesMethods for comparing scanpaths and saliency maps: strengths and weaknesses
Methods for comparing scanpaths and saliency maps: strengths and weaknesses
Olivier Le Meur
 
Bit Error Rate Performance of MIMO Spatial Multiplexing with MPSK Modulation ...
Bit Error Rate Performance of MIMO Spatial Multiplexing with MPSK Modulation ...Bit Error Rate Performance of MIMO Spatial Multiplexing with MPSK Modulation ...
Bit Error Rate Performance of MIMO Spatial Multiplexing with MPSK Modulation ...
ijsrd.com
 
Cluster based routing protocol
Cluster based routing protocolCluster based routing protocol
Cluster based routing protocolSudhansu Dash
 
UDSLF
UDSLFUDSLF
Enhanced Mobile Node Tracking With Received Signal Strength in Wireless Senso...
Enhanced Mobile Node Tracking With Received Signal Strength in Wireless Senso...Enhanced Mobile Node Tracking With Received Signal Strength in Wireless Senso...
Enhanced Mobile Node Tracking With Received Signal Strength in Wireless Senso...
IOSR Journals
 
Enhanced Mobile Node Tracking With Received Signal Strength in Wireless Senso...
Enhanced Mobile Node Tracking With Received Signal Strength in Wireless Senso...Enhanced Mobile Node Tracking With Received Signal Strength in Wireless Senso...
Enhanced Mobile Node Tracking With Received Signal Strength in Wireless Senso...
IOSR Journals
 
Tensor Spectral Clustering
Tensor Spectral ClusteringTensor Spectral Clustering
Tensor Spectral Clustering
Austin Benson
 
Csit77402
Csit77402Csit77402
Csit77402
csandit
 
Practical Spherical Harmonics Based PRT Methods
Practical Spherical Harmonics Based PRT MethodsPractical Spherical Harmonics Based PRT Methods
Practical Spherical Harmonics Based PRT MethodsNaughty Dog
 

Similar to Towards Light-weight and Real-time Line Segment Detection (20)

Area, Delay and Power Comparison of Adder Topologies
Area, Delay and Power Comparison of Adder TopologiesArea, Delay and Power Comparison of Adder Topologies
Area, Delay and Power Comparison of Adder Topologies
 
AN EFFICIENT APPROACH FOR FOUR-LAYER CHANNEL ROUTING IN VLSI DESIGN
AN EFFICIENT APPROACH FOR FOUR-LAYER CHANNEL ROUTING IN VLSI DESIGNAN EFFICIENT APPROACH FOR FOUR-LAYER CHANNEL ROUTING IN VLSI DESIGN
AN EFFICIENT APPROACH FOR FOUR-LAYER CHANNEL ROUTING IN VLSI DESIGN
 
ENABLING RAY TRACING FOR 5G
ENABLING RAY TRACING FOR 5GENABLING RAY TRACING FOR 5G
ENABLING RAY TRACING FOR 5G
 
Enabling Ray Tracing for 5G
Enabling Ray Tracing for 5GEnabling Ray Tracing for 5G
Enabling Ray Tracing for 5G
 
3D METALLIC PLATE LENS ANTENNA BASED BEAMSPACE CHANNEL ESTIMATION TECHNIQUE F...
3D METALLIC PLATE LENS ANTENNA BASED BEAMSPACE CHANNEL ESTIMATION TECHNIQUE F...3D METALLIC PLATE LENS ANTENNA BASED BEAMSPACE CHANNEL ESTIMATION TECHNIQUE F...
3D METALLIC PLATE LENS ANTENNA BASED BEAMSPACE CHANNEL ESTIMATION TECHNIQUE F...
 
3D METALLIC PLATE LENS ANTENNA BASED BEAMSPACE CHANNEL ESTIMATION TECHNIQUE F...
3D METALLIC PLATE LENS ANTENNA BASED BEAMSPACE CHANNEL ESTIMATION TECHNIQUE F...3D METALLIC PLATE LENS ANTENNA BASED BEAMSPACE CHANNEL ESTIMATION TECHNIQUE F...
3D METALLIC PLATE LENS ANTENNA BASED BEAMSPACE CHANNEL ESTIMATION TECHNIQUE F...
 
COMPUTER NETWORKS CHAPTER 3 NETWORK LAYER NOTES CSE 3RD year sem 1
COMPUTER NETWORKS CHAPTER 3 NETWORK LAYER NOTES CSE 3RD year sem 1COMPUTER NETWORKS CHAPTER 3 NETWORK LAYER NOTES CSE 3RD year sem 1
COMPUTER NETWORKS CHAPTER 3 NETWORK LAYER NOTES CSE 3RD year sem 1
 
CHANNEL ESTIMATION AND MULTIUSER DETECTION IN ASYNCHRONOUS SATELLITE COMMUNIC...
CHANNEL ESTIMATION AND MULTIUSER DETECTION IN ASYNCHRONOUS SATELLITE COMMUNIC...CHANNEL ESTIMATION AND MULTIUSER DETECTION IN ASYNCHRONOUS SATELLITE COMMUNIC...
CHANNEL ESTIMATION AND MULTIUSER DETECTION IN ASYNCHRONOUS SATELLITE COMMUNIC...
 
VTC-location based channel estimation for massive full-dimensional MIMO systems
VTC-location based channel estimation for massive full-dimensional MIMO systemsVTC-location based channel estimation for massive full-dimensional MIMO systems
VTC-location based channel estimation for massive full-dimensional MIMO systems
 
TRS
TRSTRS
TRS
 
Methods for comparing scanpaths and saliency maps: strengths and weaknesses
Methods for comparing scanpaths and saliency maps: strengths and weaknessesMethods for comparing scanpaths and saliency maps: strengths and weaknesses
Methods for comparing scanpaths and saliency maps: strengths and weaknesses
 
Bit Error Rate Performance of MIMO Spatial Multiplexing with MPSK Modulation ...
Bit Error Rate Performance of MIMO Spatial Multiplexing with MPSK Modulation ...Bit Error Rate Performance of MIMO Spatial Multiplexing with MPSK Modulation ...
Bit Error Rate Performance of MIMO Spatial Multiplexing with MPSK Modulation ...
 
Cluster based routing protocol
Cluster based routing protocolCluster based routing protocol
Cluster based routing protocol
 
UDSLF
UDSLFUDSLF
UDSLF
 
Enhanced Mobile Node Tracking With Received Signal Strength in Wireless Senso...
Enhanced Mobile Node Tracking With Received Signal Strength in Wireless Senso...Enhanced Mobile Node Tracking With Received Signal Strength in Wireless Senso...
Enhanced Mobile Node Tracking With Received Signal Strength in Wireless Senso...
 
Enhanced Mobile Node Tracking With Received Signal Strength in Wireless Senso...
Enhanced Mobile Node Tracking With Received Signal Strength in Wireless Senso...Enhanced Mobile Node Tracking With Received Signal Strength in Wireless Senso...
Enhanced Mobile Node Tracking With Received Signal Strength in Wireless Senso...
 
Beamforming antennas (1)
Beamforming antennas (1)Beamforming antennas (1)
Beamforming antennas (1)
 
Tensor Spectral Clustering
Tensor Spectral ClusteringTensor Spectral Clustering
Tensor Spectral Clustering
 
Csit77402
Csit77402Csit77402
Csit77402
 
Practical Spherical Harmonics Based PRT Methods
Practical Spherical Harmonics Based PRT MethodsPractical Spherical Harmonics Based PRT Methods
Practical Spherical Harmonics Based PRT Methods
 

Recently uploaded

Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
Alan Dix
 
Introduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - CybersecurityIntroduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - Cybersecurity
mikeeftimakis1
 
Removing Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software FuzzingRemoving Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software Fuzzing
Aftab Hussain
 
The Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and SalesThe Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and Sales
Laura Byrne
 
20240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 202420240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 2024
Matthew Sinclair
 
UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4
DianaGray10
 
20240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 202420240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 2024
Matthew Sinclair
 
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdfObservability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Paige Cruz
 
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Albert Hoitingh
 
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance
 
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdfSmart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
91mobiles
 
National Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practicesNational Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practices
Quotidiano Piemontese
 
Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
Kari Kakkonen
 
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
Neo4j
 
Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1
DianaGray10
 
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to ProductionGenerative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Aggregage
 
Pushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 daysPushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 days
Adtran
 
Microsoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdfMicrosoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdf
Uni Systems S.M.S.A.
 
Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !
KatiaHIMEUR1
 
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
Neo4j
 

Recently uploaded (20)

Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
 
Introduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - CybersecurityIntroduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - Cybersecurity
 
Removing Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software FuzzingRemoving Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software Fuzzing
 
The Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and SalesThe Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and Sales
 
20240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 202420240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 2024
 
UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4
 
20240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 202420240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 2024
 
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdfObservability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
 
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
 
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
 
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdfSmart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
 
National Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practicesNational Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practices
 
Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
 
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
 
Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1
 
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to ProductionGenerative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to Production
 
Pushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 daysPushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 days
 
Microsoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdfMicrosoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdf
 
Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !
 
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
 

Towards Light-weight and Real-time Line Segment Detection

  • 1. Towards Light-weight and Real-time Line Segment Detection Geonmo Gu*, Byungsoo Ko*, SeoungHyun Go Sung-Hyun Lee, Jingeun Lee, Minchul Shin Naver Search Vision * Equal contribution
  • 3. Introduction 1. Line Segment Detection? Line Segment Detection Purpose: Detect line segments in the given images Line Segment Detector (LSD)
  • 4. Introduction 1. Line Segment Detection? Applications of LSD Simultaneous Localization and Mapping (SLAM)1 Relative Pose Estimation2 1 Monocular-vision based SLAM using Line Segments, ICRA07 2 Line-Based Relative Pose Estimation, CVPR11
  • 5. Introduction 2. Real-time inference is limited a) Existing methods exploit heavy backbone networks Backbone Input Image
  • 6. Introduction 2. Real-time inference is limited a) Existing methods exploit heavy backbone networks Backbone Input Image • Dilated ResNet50-based FPN • Stacked hourglass network • Atrous residual U-net Existing Methods
  • 7. Introduction 2. Real-time inference is limited a) Existing methods exploit heavy backbone networks Backbone Input Image • Dilated ResNet50-based FPN • Stacked hourglass network • Atrous residual U-net Existing Methods Ours • Modified MobileNetV2
  • 8. Introduction 2. Real-time inference is limited b) Existing methods include multi-module line prediction process Backbone Input Image
  • 9. Introduction 2. Real-time inference is limited b) Existing methods include multi-module line prediction process Backbone Feature Maps Input Image Line Segments
  • 10. Introduction 2. Real-time inference is limited Multi-module Processing Attraction Field Maps Top-Down (TD) Strategy Backbone Feature Maps Input Image Line Segments Squeeze Module Compute Line Proposal Search for Candidates Verify Candidates b) Existing methods include multi-module line prediction process
  • 11. Introduction 2. Real-time inference is limited Multi-module Processing Junction Heatmap Line Sampler Line Proposal Line Verification Bottom-Up (BU) Strategy Attraction Field Maps Top-Down (TD) Strategy Backbone Feature Maps Input Image Line Segments LoI Pooling Squeeze Module Compute Line Proposal Search for Candidates Verify Candidates b) Existing methods include multi-module line prediction process
  • 12. Introduction 2. Real-time inference is limited Multi-module Processing Junction Heatmap Line Sampler Line Proposal Line Verification Bottom-Up (BU) Strategy Attraction Field Maps Top-Down (TD) Strategy Backbone Feature Maps Input Image Line Segments LoI Pooling Line Map Center Map Mixture of Conv. Mixture of Conv. Line Generation Displacement Map Tri-Point (TP) Strategy Point Filter Module Line Segmentation Squeeze Module Compute Line Proposal Search for Candidates Verify Candidates b) Existing methods include multi-module line prediction process
  • 13. Introduction 2. Real-time inference is limited Multi-module Processing Junction Heatmap Line Sampler Line Proposal Line Verification Bottom-Up (BU) Strategy Attraction Field Maps Top-Down (TD) Strategy Single Module Processing Ours Line Generation Backbone Feature Maps Input Image Line Segments LoI Pooling Line Map Center Map Mixture of Conv. Mixture of Conv. Line Generation Displacement Map Tri-Point (TP) Strategy Point Filter Module Line Segmentation Squeeze Module Compute Line Proposal Search for Candidates Verify Candidates Center / Displacement Map b) Existing methods include multi-module line prediction process
  • 14. Introduction 2. Real-time inference is limited Inference speed comparison Inference speed of backbone and prediction process on GPU are significantly improved
  • 15. Introduction 3. Mobile-LSD Motivation Existing LSDs are limited in real-time inference, especially on mobile devices. • Exploit heavy backbone networks • Include multi-module line prediction process
  • 16. Introduction 3. Mobile-LSD Motivation We design an efficient LSD for resource-constrained environments: Mobile LSD (M-LSD) • Minimize the backbone network and adopt single module of line prediction process • Present novel training schemes: Segments of Line segment (SoL) and geometric learning schemes In this paper Existing LSDs are limited in real-time inference, especially on mobile devices. • Exploit heavy backbone networks • Include multi-module line prediction process
  • 18. Proposed Method 1. Light-weight Backbone M-LSD-tiny
  • 19. M-LSD-tiny Feature Extractor Block: 1 11 12 16 Proposed Method 1. Light-weight Backbone
  • 20. Feature Extractor 1x1 Conv 1x1 Conv Upscale C Skip connection Block: 1 11 Block type A (12, 14) 12 16 M-LSD-tiny Proposed Method 1. Light-weight Backbone
  • 21. Feature Extractor 1x1 Conv 1x1 Conv Upscale C Skip connection 3x3 Conv 3x3 Conv + Block: 1 11 Block type A (12, 14) Block type B (13, 15) 12 16 M-LSD-tiny Proposed Method 1. Light-weight Backbone
  • 22. Feature Extractor 1x1 Conv 1x1 Conv Upscale C Skip connection 3x3 Conv 3x3 Conv + Block: 1 11 Block type A (12, 14) Block type B (13, 15) 3x3 Conv Dilated rate=5 Block type C (16) 12 16 1x1 Conv 3x3 Conv M-LSD-tiny Proposed Method 1. Light-weight Backbone
  • 23. Feature Extractor Upscale 1x1 Conv 1x1 Conv Upscale C Skip connection 3x3 Conv 3x3 Conv + Block: 1 11 Block type A (12, 14) Block type B (13, 15) 3x3 Conv Dilated rate=5 Block type C (16) 12 16 1x1 Conv 3x3 Conv M-LSD-tiny Proposed Method 1. Light-weight Backbone
  • 24. Feature Extractor Upscale Final Feature Maps (H/2 x W/2 x 16) 1x1 Conv 1x1 Conv Upscale C Skip connection 3x3 Conv 3x3 Conv + Block: 1 11 Block type A (12, 14) Block type B (13, 15) 3x3 Conv Dilated rate=5 Block type C (16) 12 16 Segmentation Maps (H/2 x W/2 x 2) Junction map x1 Line map x1 SoL Maps (H/2 x W/2 x 7) TP Maps (H/2 x W/2 x 7) Displacement map x4 Center map x1 Length map x1 Degree map x1 1x1 Conv 3x3 Conv M-LSD-tiny Proposed Method 1. Light-weight Backbone
  • 25. Feature Extractor Upscale Final Feature Maps (H/2 x W/2 x 16) 1x1 Conv 1x1 Conv Upscale C Skip connection 3x3 Conv 3x3 Conv + Block: 1 11 Block type A (12, 14) Block type B (13, 15) 3x3 Conv Dilated rate=5 Block type C (16) 12 16 Line Segments Segmentation Maps (H/2 x W/2 x 2) Junction map x1 Line map x1 SoL Maps (H/2 x W/2 x 7) TP Maps (H/2 x W/2 x 7) Displacement map x4 Center map x1 Length map x1 Degree map x1 Line Generation 1x1 Conv 3x3 Conv M-LSD-tiny Proposed Method 1. Light-weight Backbone
  • 26. Feature Extractor Final Feature Maps (H/2 x W/2 x 16) 1x1 Conv 1x1 Conv Upscale† C Skip connection 3x3 Conv 3x3 Conv + Block: 1 14 Block type A (15,17,19,21) Block type B (16,18,20,22) 3x3 Conv Dilated rate=5 Block type C (23) 15 23 Line Segments Segmentation Maps (H/2 x W/2 x 2) Junction map x1 Line map x1 SoL Maps (H/2 x W/2 x 7) TP Maps (H/2 x W/2 x 7) Displacement map x4 Center map x1 Length map x1 Degree map x1 Line Generation 1x1 Conv 3x3 Conv † denotes that block 15 skips upscale operation. M-LSD Proposed Method 1. Light-weight Backbone
  • 27. Proposed Method 2. Line Segment Representation Tri-Point (TP) representation 𝑙𝑙𝑠𝑠 𝑑𝑑𝑠𝑠 𝑑𝑑𝑒𝑒 𝑙𝑙𝑐𝑐 𝑙𝑙𝑒𝑒 TP Maps (H/2 x W/2 x 7) Displacement map x4 Center map x1 Length map x1 Degree map x1 Notation: 𝑥𝑥𝑙𝑙𝑠𝑠 , 𝑦𝑦𝑙𝑙𝑠𝑠 denotes the 𝛼𝛼 point, 𝑑𝑑𝑠𝑠(𝑥𝑥𝑙𝑙𝑐𝑐 , 𝑦𝑦𝑙𝑙𝑐𝑐 ) and 𝑑𝑑𝑒𝑒(𝑥𝑥𝑙𝑙𝑐𝑐 , 𝑦𝑦𝑙𝑙𝑐𝑐 ) indicate 2D displacements from the center point 𝑙𝑙𝑐𝑐 to the corresponding start 𝑙𝑙𝑠𝑠 and end 𝑙𝑙𝑒𝑒 points.
  • 28. Proposed Method 2. Line Segment Representation Tri-Point (TP) representation 𝑙𝑙𝑠𝑠 𝑑𝑑𝑠𝑠 𝑑𝑑𝑒𝑒 𝑙𝑙𝑐𝑐 𝑙𝑙𝑒𝑒 TP Maps (H/2 x W/2 x 7) Displacement map x4 Center map x1 Length map x1 Degree map x1 • For the center loss, we use pos / neg separated binary classification loss. • For the displacement loss, we use smooth L1 loss for regression learning.
  • 29. Proposed Method 2. Line Segment Representation Tri-Point (TP) representation 𝑙𝑙𝑠𝑠 𝑑𝑑𝑠𝑠 𝑑𝑑𝑒𝑒 𝑙𝑙𝑐𝑐 𝑙𝑙𝑒𝑒 TP Maps (H/2 x W/2 x 7) Displacement map x4 Center map x1 Length map x1 Degree map x1 TP representation can be insufficient in cases where, • A line segment may be too long to manage within the receptive field size. • The center points of two distinct line segments are too close to each other.
  • 30. Proposed Method 3. SoL Augmentation Segments of Line segment (SoL) augmentation SoL Maps (H/2 x W/2 x 7) Displacement map x4 Center map x1 Length map x1 Degree map x1 𝑙𝑙𝑠𝑠 𝑙𝑙𝑒𝑒 𝑙𝑙1 𝑙𝑙0 𝑙𝑙2 SoL augmentation directly increases the number and the size of line segments.
  • 31. Proposed Method 3. SoL Augmentation Segments of Line segment (SoL) augmentation SoL Maps (H/2 x W/2 x 7) Displacement map x4 Center map x1 Length map x1 Degree map x1 𝑙𝑙𝑠𝑠 𝑙𝑙𝑒𝑒 𝑙𝑙1 𝑙𝑙0 𝑙𝑙2 1. We compute 𝑘𝑘 internally dividing points and separate the line segments into subparts with overlapping portions. 2. Each subpart is trained as if it is a typical line segment.
  • 32. Proposed Method 4. Learning with Geometric Information Matching loss SoL Maps (H/2 x W/2 x 7) TP Maps (H/2 x W/2 x 7) Displacement map x4 Center map x1 Length map x1 Degree map x1 1. Take the endpoints of each prediction and measure the Euclidean distance 𝑑𝑑() to the endpoints of the GT. 𝑙𝑙𝑠𝑠 𝑙𝑙𝑒𝑒 ̂ 𝑙𝑙𝑠𝑠 ̂ 𝑙𝑙𝑒𝑒 𝐶𝐶(̂ 𝑙𝑙) ℒ𝑚𝑚𝑚𝑚𝑚𝑚𝑚𝑚𝑚 ℒ𝑚𝑚𝑚𝑚𝑚𝑚𝑚𝑚𝑚 ℒ𝑚𝑚𝑚𝑚𝑚𝑚𝑚𝑚𝑚
  • 33. Proposed Method 4. Learning with Geometric Information Matching loss SoL Maps (H/2 x W/2 x 7) TP Maps (H/2 x W/2 x 7) Displacement map x4 Center map x1 Length map x1 Degree map x1 2. These distances are used to match predicted line segments with GT line segments that are under a threshold 𝛾𝛾: 𝑙𝑙𝑠𝑠 𝑙𝑙𝑒𝑒 ̂ 𝑙𝑙𝑠𝑠 ̂ 𝑙𝑙𝑒𝑒 𝐶𝐶(̂ 𝑙𝑙) ℒ𝑚𝑚𝑚𝑚𝑚𝑚𝑚𝑚𝑚 ℒ𝑚𝑚𝑚𝑚𝑚𝑚𝑚𝑚𝑚 ℒ𝑚𝑚𝑚𝑚𝑚𝑚𝑚𝑚𝑚
  • 34. Proposed Method 4. Learning with Geometric Information Matching loss SoL Maps (H/2 x W/2 x 7) TP Maps (H/2 x W/2 x 7) Displacement map x4 Center map x1 Length map x1 Degree map x1 3. Compute the matching loss, which aims to minimize geometric distance of the matched line segments: 𝑙𝑙𝑠𝑠 𝑙𝑙𝑒𝑒 ̂ 𝑙𝑙𝑠𝑠 ̂ 𝑙𝑙𝑒𝑒 𝐶𝐶(̂ 𝑙𝑙) ℒ𝑚𝑚𝑚𝑚𝑚𝑚𝑚𝑚𝑚 ℒ𝑚𝑚𝑚𝑚𝑚𝑚𝑚𝑚𝑚 ℒ𝑚𝑚𝑚𝑚𝑚𝑚𝑚𝑚𝑚
  • 35. Junction and Line segmentation ℒ𝑗𝑗𝑗𝑗𝑗𝑗𝑗𝑗 ℒ𝑑𝑑𝑑𝑑𝑑𝑑𝑑𝑑𝑑𝑑𝑑𝑑 ℒ𝑗𝑗𝑗𝑗𝑗𝑗𝑗𝑗 Segmentation Maps (H/2 x W/2 x 2) Junction map x1 Line map x1 Center point and displacement vectors are highly related to pixel- wise junctions and line segments in the segmentation maps. Proposed Method 4. Learning with Geometric Information
  • 36. Junction and Line segmentation ℒ𝑗𝑗𝑗𝑗𝑗𝑗𝑗𝑗 ℒ𝑑𝑑𝑑𝑑𝑑𝑑𝑑𝑑𝑑𝑑𝑑𝑑 ℒ𝑗𝑗𝑗𝑗𝑗𝑗𝑗𝑗 Segmentation Maps (H/2 x W/2 x 2) Junction map x1 Line map x1 For the junction and line losses, we use pos / neg separated binary classification loss. Proposed Method 4. Learning with Geometric Information
  • 37. Length and Degree regression ℒ𝑗𝑗𝑗𝑗𝑗𝑗𝑗𝑗 ℒ𝑑𝑑𝑑𝑑𝑑𝑑𝑑𝑑𝑑𝑑𝑑𝑑 ℒ𝑗𝑗𝑗𝑗𝑗𝑗𝑗𝑗 As displacement vectors can be derived from the length and degree of line segments, they can be additional geometric cues. SoL Maps (H/2 x W/2 x 7) TP Maps (H/2 x W/2 x 7) Displacement map x4 Center map x1 Length map x1 Degree map x1 Proposed Method 4. Learning with Geometric Information
  • 38. Length and Degree regression ℒ𝑗𝑗𝑗𝑗𝑗𝑗𝑗𝑗 ℒ𝑑𝑑𝑑𝑑𝑑𝑑𝑑𝑑𝑑𝑑𝑑𝑑 ℒ𝑗𝑗𝑗𝑗𝑗𝑗𝑗𝑗 For the length and degree losses, we use smooth L1 loss for regression loss SoL Maps (H/2 x W/2 x 7) TP Maps (H/2 x W/2 x 7) Displacement map x4 Center map x1 Length map x1 Degree map x1 Proposed Method 4. Learning with Geometric Information
  • 39. Proposed Method 5. Final Loss Functions Loss for each map Final Loss Function SoL Maps (H/2 x W/2 x 7) TP Maps (H/2 x W/2 x 7) Displacement map x4 Center map x1 Length map x1 Degree map x1 Segmentation Maps (H/2 x W/2 x 2) Junction map x1 Line map x1
  • 41. Experiments 1. Experimental Setting Dataset • Wireframe: 5,000 training images, 462 test images • YorkUrban: 102 test images Evaluation Metrics • Heatmap-based metric FH • Structural average precision (sAP) • Line matching average precision (LAP) Optimization • Tesla V100 GPU • Input size: 320 or 512 • Used ImageNet pretrained MobileNetV2
  • 42. Experiments 2. Ablation Study and Interpretability Baseline and Augmentation Performance
  • 43. Experiments 2. Ablation Study and Interpretability Baseline and Augmentation Performance Input augmentations • Horizontal / vertical flips • Shearing • Rotation • Scaling
  • 44. Experiments 2. Ablation Study and Interpretability Matching Loss Saliency map Performance Saliency maps generated from TP center map
  • 45. Experiments 2. Ablation Study and Interpretability Line & Junction segmentation Saliency map Performance Saliency maps generated from each feature map
  • 46. Experiments 2. Ablation Study and Interpretability Length & Degree regression Saliency map Performance Saliency maps generated from each feature map
  • 47. Experiments 2. Ablation Study and Interpretability SoL augmentation Saliency map Performance Saliency maps generated from TP center map
  • 48. Experiments 3. Comparison with Other Methods Quantitative comparisons with existing LSD methods
  • 49. Qualitative evaluation (a) (b) (c) (d) Ground truth M-LSD-tiny-320 M-LSD-512 Experiments 4. Visualization
  • 50. Inference speed and memory usage on mobile devices Experiments 5. Deployment on Mobile Devices We use iPhone (A14 Bionic chipset) and Android phone (Snapdragon 865 chipset), where FP denotes floating point.
  • 51. Real-time box detection on a mobile device (a) Input image (b) Line detection (c) Box candidates (d) Box detection Experiments 6. Applications The potential of real-time LSD can further to: book scanners, wireframe to image translation, SLAM, and pose estimation
  • 52. Experiments 6. Applications Real-time box detection on a mobile device The potential of real-time LSD can further to: book scanners, wireframe to image translation, SLAM, and pose estimation
  • 54. Conclusion We design an efficient LSD for resource-constrained environments: Mobile LSD (M-LSD) • Minimize the backbone network and adopt single module of line prediction process • Present novel training schemes: Segments of Line segment (SoL) and geometric learning schemes In this paper
  • 55. Conclusion • Compared to TP-LSD-Lite, M-LSD-tiny achieves competitive performance with 2.5% of model size and an increase of 130.5% in inference speed on GPU. • Our model runs at 56.8 and 48.6 FPS on Android and iPhone, which is the first real-time method available on mobile devices. We achieve
  • 58. Supplementary - Metrics Precision and Recall of Line Heat Maps (FH) 1. Given a vectorized representation (line), it generates a confidence heat map. 2. It compares with the ground truth heat map by bipartite matching that treats each pixel independently as a graph node is ran to match between two heat maps. 3. Then, precision and recall curve is computed according to the matching and confidence of each pixel.
  • 59. Supplementary - Metrics Structural Average Precision (sAP) • Defined on vectorized wireframes rather than on a heat map. • Recall is the proportion of the correctly detected line segments (up to a cutoff score) to all the ground truth line segments. • Precision is the proportion of the correctly detected line segments above that cutoff to all the detected line segments.
  • 60. Supplementary - Metrics Line Matching Average Precision (LAP) • Line Matching Score (LMS) • Score_theta: the differences in angle and position • Score_l: the matching degree in length • LMS = Score_theta X Score_l • Using LMS to determine true positive, i.e., a detected line segment is considered to be true positive if LMS > 0.5, we can calculate the LAP on the entire test set • LAP is defined as the area under the precision recall curve.
  • 61. Supplementary - Details of M-LSD Architecture
  • 62. Supplementary - Details of M-LSD Final feature maps