Learning joint 2 d 3d representations for depth completion

•Download as PPTX, PDF•

0 likes•37 views

ssuser456ad6

Engineering

Learning Joint 2D-3D Representations for Depth Completion
2
1. Introduction
2. Relative work
3. Learning Joint 2D-3D Representations

Learning Joint 2D-3D Representations for Depth Completion
3
1. Introduction
What is depth completion?

Learning Joint 2D-3D Representations for Depth Completion
4
2. Relative works
Depth estimation from RGB data
Depth completion form RGBD data

Learning Joint 2D-3D Representations for Depth Completion
5
3. Learning Joint 2D-3D Representations

Learning Joint 2D-3D Representations for Depth Completion
6
3. Learning Joint 2D-3D Representations
3.1. 2D-3D Fuse Block

Learning Joint 2D-3D Representations for Depth Completion
7
3. Learning Joint 2D-3D Representations
3.1. 2D-3D Fuse Block

Learning Joint 2D-3D Representations for Depth Completion
8
3. Learning Joint 2D-3D Representations
3.1. 2D-3D Fuse Block

Learning Joint 2D-3D Representations for Depth Completion
9
3. Learning Joint 2D-3D Representations
3.1. 2D-3D Fuse Block

Learning Joint 2D-3D Representations for Depth Completion
10

Learning Joint 2D-3D Representations for Depth Completion
11
3. Learning Joint 2D-3D Representations
3.2. Stack 2D-3D Fuse Blocks into a Network

Learning Joint 2D-3D Representations for Depth Completion
12
3. Learning Joint 2D-3D Representations
3.3. Learning and Inference

Learning Joint 2D-3D Representations for Depth Completion
13

Recently uploaded

Module-III Varried Flow.pptx GVF Definition, Water Surface Profile Dynamic Eq...Nitin Sonavane

ALCOHOL PRODUCTION- Beer Brewing Process.pdfMadan Karki

Research Methodolgy & Intellectual Property Rights Series 1T.D. Shashikala

UNIT 4 PTRP final Convergence in probability.pptxkalpana413121

5G and 6G refer to generations of mobile network technology, each representin...archanaece3

SLIDESHARE PPT-DECISION MAKING METHODS.pptxCHAIRMAN M

Dynamo Scripts for Task IDs and Space Naming.pptxMustafa Ahmed

Piping and instrumentation diagram p.pdfAshrafRagab14

Augmented Reality (AR) with Augin Software.pptxMustafa Ahmed

Seizure stage detection of epileptic seizure using convolutional neural networksIJECEIAES

Online crime reporting system project.pdfKamal Acharya

Maher Othman Interior Design Portfolio..MaherOthman7

Tembisa Central Terminating Pills +27838792658 PHOMOLONG Top Abortion Pills F...drjose256

Microkernel in Operating System | Operating SystemSampad Kar

Diploma Engineering Drawing Qp-2024 Ece .pdfJNTUA

Autodesk Construction Cloud (Autodesk Build).pptxMustafa Ahmed

Performance enhancement of machine learning algorithm for breast cancer diagn...IJECEIAES

Involute of a circle,Square, pentagon,HexagonInvolute_Engineering Drawing.pdfJNTUA

NO1 Best Powerful Vashikaran Specialist Baba Vashikaran Specialist For Love V...Amil baba

Worksharing and 3D Modeling with Revit.pptxMustafa Ahmed

Recently uploaded (20)

Module-III Varried Flow.pptx GVF Definition, Water Surface Profile Dynamic Eq...

ALCOHOL PRODUCTION- Beer Brewing Process.pdf

Research Methodolgy & Intellectual Property Rights Series 1

UNIT 4 PTRP final Convergence in probability.pptx

5G and 6G refer to generations of mobile network technology, each representin...

SLIDESHARE PPT-DECISION MAKING METHODS.pptx

Dynamo Scripts for Task IDs and Space Naming.pptx

Piping and instrumentation diagram p.pdf

Augmented Reality (AR) with Augin Software.pptx

Seizure stage detection of epileptic seizure using convolutional neural networks

Online crime reporting system project.pdf

Maher Othman Interior Design Portfolio..

Tembisa Central Terminating Pills +27838792658 PHOMOLONG Top Abortion Pills F...

Microkernel in Operating System | Operating System

Diploma Engineering Drawing Qp-2024 Ece .pdf

Autodesk Construction Cloud (Autodesk Build).pptx

Performance enhancement of machine learning algorithm for breast cancer diagn...

Involute of a circle,Square, pentagon,HexagonInvolute_Engineering Drawing.pdf

NO1 Best Powerful Vashikaran Specialist Baba Vashikaran Specialist For Love V...

Worksharing and 3D Modeling with Revit.pptx

Featured

2024 State of Marketing Report – by HubspotMarius Sescu

Everything You Need To Know About ChatGPTExpeed Software

Product Design Trends in 2024 | Teenage EngineeringsPixeldarts

How Race, Age and Gender Shape Attitudes Towards Mental HealthThinkNow

AI Trends in Creative Operations 2024 by Artwork Flow.pdfmarketingartwork

Skeleton Culture CodeSkeleton Technologies

PEPSICO Presentation to CAGNY Conference Feb 2024Neil Kimberley

Content Methodology: A Best Practices Report (Webinar)contently

How to Prepare For a Successful Job Search for 2024Albert Qian

Social Media Marketing Trends 2024 // The Global Indie InsightsKurio // The Social Media Age(ncy)

Trends In Paid Search: Navigating The Digital Landscape In 2024Search Engine Journal

5 Public speaking tips from TED - Visualized summarySpeakerHub

ChatGPT and the Future of Work - Clark Boyd Clark Boyd

Getting into the tech field. what next Tessa Mero

Google's Just Not That Into You: Understanding Core Updates & Search IntentLily Ray

How to have difficult conversations Rajiv Jayarajah, MAppComm, ACC

Introduction to Data ScienceChristy Abraham Joy

Time Management & Productivity - Best PracticesVit Horky

The six step guide to practical project managementMindGenius

Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...RachelPearson36

Featured (20)

2024 State of Marketing Report – by Hubspot

Everything You Need To Know About ChatGPT

Product Design Trends in 2024 | Teenage Engineerings

How Race, Age and Gender Shape Attitudes Towards Mental Health

AI Trends in Creative Operations 2024 by Artwork Flow.pdf

Skeleton Culture Code

PEPSICO Presentation to CAGNY Conference Feb 2024

Content Methodology: A Best Practices Report (Webinar)

How to Prepare For a Successful Job Search for 2024

Social Media Marketing Trends 2024 // The Global Indie Insights

Trends In Paid Search: Navigating The Digital Landscape In 2024

5 Public speaking tips from TED - Visualized summary

ChatGPT and the Future of Work - Clark Boyd

Getting into the tech field. what next

Google's Just Not That Into You: Understanding Core Updates & Search Intent

How to have difficult conversations

Introduction to Data Science

Time Management & Productivity - Best Practices

The six step guide to practical project management

Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...

Learning joint 2 d 3d representations for depth completion

1. Learning Joint 2D-3D Representations for Depth Completion HyeongJun Kwon

2. Learning Joint 2D-3D Representations for Depth Completion 2 1. Introduction 2. Relative work 3. Learning Joint 2D-3D Representations

3. Learning Joint 2D-3D Representations for Depth Completion 3 1. Introduction What is depth completion?

4. Learning Joint 2D-3D Representations for Depth Completion 4 2. Relative works Depth estimation from RGB data Depth completion form RGBD data

5. Learning Joint 2D-3D Representations for Depth Completion 5 3. Learning Joint 2D-3D Representations

6. Learning Joint 2D-3D Representations for Depth Completion 6 3. Learning Joint 2D-3D Representations 3.1. 2D-3D Fuse Block

7. Learning Joint 2D-3D Representations for Depth Completion 7 3. Learning Joint 2D-3D Representations 3.1. 2D-3D Fuse Block

8. Learning Joint 2D-3D Representations for Depth Completion 8 3. Learning Joint 2D-3D Representations 3.1. 2D-3D Fuse Block

9. Learning Joint 2D-3D Representations for Depth Completion 9 3. Learning Joint 2D-3D Representations 3.1. 2D-3D Fuse Block

10. Learning Joint 2D-3D Representations for Depth Completion 10

11. Learning Joint 2D-3D Representations for Depth Completion 11 3. Learning Joint 2D-3D Representations 3.2. Stack 2D-3D Fuse Blocks into a Network

12. Learning Joint 2D-3D Representations for Depth Completion 12 3. Learning Joint 2D-3D Representations 3.3. Learning and Inference

13. Learning Joint 2D-3D Representations for Depth Completion 13

Editor's Notes

Guided filter의 목적성과 전체 구조 그리고 method 마지막으로 computational efficiency에 대해 설명하겠습니다.
해당 논문 소개에 앞서서 depth completion에 대한 이야기를 하겠습니다. Depth completion은 lidar와 같은 active sensor를 이용한 sparse dense observation은 실생활에 사용하기에는 부정확한 점이 많았고 더 dense한 sensor를 쓰는 것은 cost가 큰 일이고 이를 해결하기위해 dense한 information을 가지고 있는 2d image와 sparse dense map을 가지고 dense depth map을 만드는 것이 depth completion의 방법 중 하나이며 해당 논문도 이와 같은 방법을 제시합니다. 기존의 방법들은 3d point cloud를 2d space에 projection하는 방법을 사용했는데 이는 distortion이 심해서 3d geometric clue를 포착하기 어려웠습니다. 저자는 이러한 점들을 2d와 3d를 fuse하는 것으로 간단하지만 효율적인 모델을 제시하였습니다.
첫 번째로 depth estimation은 2d rgb data로 depth를 estimation하는 방식입니다. 하지만, depth information의 부재로 높은 품질의 dense depth를 만들기 어렵다는 단점이 있었습니다. 두 번째로 depth completion from rgbd data는 해당 논문도 포함되는 작업입니다. 기존의 방식들은 더 나은 network architecture나 context, prior information을 이용하는 방식이었는데 저자가 제시하는 방법은 representation을 더욱 잘 배우는 것으로 sota의 성능을 내었씁니다.
해당 network는 2개의 representation의 이득을 취하는 방식입니다. 간단히 network의 소개를 하겠습니다. 첫 번째 figure는 depth completion을 하는 전채 진행도이며 두 번째 figure는 2d-3d fuse block입니다. Shortcut을 제외하면 크게 두가지의 branch가 있는데 첫 번째 branch는 appearance feature를 extracting하는 branch이며 두 번째 branch는 contiunous convolution을 이용한 geometric dependecy를 sparse 한 point에서 뽑아내는 branch입니다. 이러한 fuse block은 joint representation을 배울 수 있게하며 간단하다는 장점이 있습니다. 다음 장 부터 2d – 3d fuse block에 대하여 설명하겠습니다.
첫 번째로 설명할 branch는 multi scale 2d conv net입니다. 이 network는 두 가지의 stride convolution network로 구성이 됩니다. 이 network를 통해 multi scale feature를 뽑아내고 결과적으로 appearance feature를 구성합니다. 아웃풋은 input과 동일하게 CxWxH의 형태입니다.
두 번째 branch인 3d contiunous conv net입니다. 두 개의 continuous convolution을 통해 2d space로 projection 시켜 줍니다. 해당 network를 통해 3d metric space상에서의 geometric feature를 학습합니다.
다 아시겠지만 k nearest neighbors에 대해 간략하게 설명하고 continuous conv를 설명하겠습니다. Knn은 새로운 sample이 주어졌을때 근처의 data들을 이용하여 해당 sample을 예측하는 방법론입니다. Classification을 예로 들면 k =1일 경우 주황색으로 분류가 되고 3일 경우 녹색으로 분류가 된다고 생각하면 됩니다. 기존의 model을 학습시켜 예측을 진행하는 model based learning과 다르게 instance-based learning입니다. 이제 이 knn을 이용한 continuous convolution을 설명하겠습니다.
Continuous convolution network는 일반 grid convolution과 다르게 cnn + knn의 형태라 생각하면 됩니다. 해당 figure를 통해 이해하면 grid convolution에서 pixel의 neighbor은 근접 pixel들이지만 continuous conv에서는 knn을 통해 인접 pixel을 정의합니다. 이 때 사용되는 distance는 euclidian distance를 이용하여 knn을 구합니다. 첫 번째 equation이 parametric continuous cnn에서 소개한 식인데 기존의 컨볼루션식이랑 동일하다는 것을 알 수 있습니다. 두 번째식이 본 논문에서 사용한 pcnn의 식인데 각각의 notation을 살펴보면 다음과 같습니다. W는 weight matrix이며 distance는 mlp를 weighting function으로 사용하여 parameterization을 시켜줍니다. 그리고 kernel과 하다마르 product를 이용하여 아웃풋을 뽑습니다. 이게 ccn입니다.
간단하게 grid conv와 continuous conv의 차이에 대해 보겠습니다. Receptive field를 살펴보면 grid conv는 near car와 distant car가 동시에 포함되지만 cc는 geometric correalation을 이용해 near car의 정보만이 사용되는 것을 알 수 있습니다. 이제 2d conv를 통해 우리는 image feature를 얻고 contiunous conv를 통해 neighbors를 찾아서 이를 fusion하여 결과를 얻으면 됩니다. 이런 방식은 2d conv만을 이용한 방식에서는 non-smooth representation에 대해서 취약한 점을 보였던 반면에 저자의 접근 방식은 3D 공간에서 geometric feature를 활용하여보다 정확한 모양 재구성을 위해 non-smooth representation을 캡처 할 수있는 potential을 가지고 있습니다.
Fuse block을 이용하여 depth completion network를 구성한 모습입니다. 이러한 fuse block을 많이 사용할 수록 network를 large scale context와 local scale clues 그리고 geometric , appearance feature를 잡을 수 있습니다.
Objective function은 모든 pixel에 대하여 l1, l2 loss를 weighted sum하여 구성합니다. 감마는 balance coefficient입니다. Training과 inference 시에 필요한 nn의 index들은 사전에 구해둬서 과정을 진행합니다. Prediction을 구한 다음에는 post-processing을 필요없습니다.
결과 사진입니다. 해당 논문은 기존의 방식들과 다르게 multiple level에서 2d, 3d image의 joint representation을 통해 성능을 올린 방식을 제시하였으며 이를 통해 sota를 갱신하였다는 점이 특징입니다. 이상.!

Learning joint 2 d 3d representations for depth completion

Recommended

Recommended

More Related Content

Recently uploaded

Recently uploaded (20)

Featured

Featured (20)

Learning joint 2 d 3d representations for depth completion

Editor's Notes