SlideShare a Scribd company logo
Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields
NEURAL FIELDS IN COMPUTER VISION
Full-Day Tutorial, June 20th, 2022
neuralfields.cs.brown.edu/cvpr22
Reality Labs Research
Yiheng Xie Towaki Takikawa Shunsuke Saito Or Litany James Tompkin Vincent Sitzmann Srinath Sridhar
Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields
Prior-based Reconstruction of
Neural Fields
2
Vincent Sitzmann
Assistant Professor, Scene Representation Group
www.scenerepresentations.com
www.vincentsitzmann.com
Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields
Motivation: Novel View Synthesis
+
+
Observations
Image + Pose & Intrinsics
{ ,
,
…
{ Model
Novel Views
Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields
Motivation: Novel View Synthesis
4
Fitting /
Optimization
Neural Scene
Representatio
n
Neural
Renderer
Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields
Motivation: Novel View Synthesis
5
Inference
Neural Scene
Representatio
n
Neural
Renderer
Inference maps a set of observations to the parameters of a Neural Scene Representation.
Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields
Overfitting case: Inference = Fitting via Gradient Descent
6
,…
+ }
{
REN D ER 𝜽
SDF + Color MLPs
SR N 𝝓
Fitting
Rendering
Normal map RGB
Sitzmann et al: Scene Representation Networks: Continuous 3D-Structure-Aware Neural Scene Representations, NeurIPS 2020.
min REN D ER 𝜽(SR N 𝝓, 𝜉𝑖) − ℐ𝑖
Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields
DeepVoxels, CVPR 2018. NeRF, ECCV 2021
IDR, ECCV 2021 Plenoxels, CVPR 2022
SIREN, NeurIPS 2020
Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields
What if we have incomplete observations?
8
REN D ER 𝜽
SDF + Color MLPs
SR N 𝝓
Sitzmann et al: Scene Representation Networks: Continuous 3D-Structure-Aware Neural Scene Representations, NeurIPS 2020.
min REN D ER 𝜽(SR N 𝝓, 𝜉𝑖) − ℐ𝑖
+
ℐ, 𝜉
No 3D inform.
Normal map RGB
Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields
Inferring Neural Fields
9
Neural Scene
Representatio
n
Neural
Renderer
If only a single observation is available, or if only part of the scene has been observed,
Inference needs to be prior-based – i.e., we need to learn to reconstruct.
?
Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields
General Framework: Encoder-Decoder
10
Neural Scene
Representatio
n
Neural
Renderer
Decoder
Inference
Latent Variables {𝑧𝑖}𝑖=1
𝑁
Encoder
Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields
What are the latent variables?
11
Neural Scene
Representatio
n
Neural
Renderer
Inference
Encoder Latent Variables {𝑧𝑖}𝑖=1
𝑁
Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields
How to predict latent variables from observations?
12
Neural Scene
Representatio
n
Neural
Renderer
Inference
Encoder Latent Variables {𝑧𝑖}𝑖=1
𝑁
Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields
How do we decode latent variables into the Neural Field?
13
Neural Scene
Representatio
n
Neural
Renderer
Inference
Encoder Latent Variables {𝑧𝑖}𝑖=1
𝑁
Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields
What are the latent variables?
14
Neural Scene
Representatio
n
Neural
Renderer
Inference
Encoder Latent Variables {𝑧𝑖}𝑖=1
𝑁
Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields
Key Consideration: Locality.
15
Neural Fields in Visual Computing and Beyond, Xie et al., EG STAR 2022
Global Conditioning Local Conditioning
Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields
Global Latent Codes
16
Neural Fields in Visual Computing and Beyond, Xie et al., EG STAR 2022
Global Conditioning Local Conditioning
Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields
Global conditioning
17
?
Latent code 𝑧
Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields
Global conditioning
18
1[Schmidhuber et al. 1992, Schmidhuber et al. 1993, Stanley et al. 2009, Ha et al., 2016]
Hypernetwork1
Latent code 𝑧
Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields
Global Latent Codes: Enables reconstruction from partial observations!
19
Scene Representation Networks: Continuous
3D-Structure-Aware Neural Scene Representations, NeurIPS 2019.
Differential Volumetric Rendering,
Niemeyer et al., CVPR 2020
DeepSDF, Occupancy Networks, IM-Net
Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields
Global Latent Codes: Enables reconstruction from partial observations!
20
Scene Representation Networks: Continuous
3D-Structure-Aware Neural Scene Representations, NeurIPS 2019.
Differential Volumetric Rendering,
Niemeyer et al., CVPR 2020
DeepSDF, Occupancy Networks, IM-Net
Key limitation: Simple, non-compositional scenes.
But: Latent Space for full objects (interpolation etc)
Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields
Local Latent Codes
21
Neural Fields in Visual Computing and Beyond, Xie et al., EG STAR 2022
Global Conditioning Local Conditioning
Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields
From point clouds: Conditioning on Feature Voxel grids
22
Convolutional Occupancy Networks [Peng et al. 2020]
Local Implicit Grid Representations for 3D Scenes [Jiang et al. 2020]
Implicit Functions in Feature Space for 3D Shape Reconstruction and Completion [Chabra et al. 2020]
Deep Local Shapes: Learning Local SDF Priors for Detailed 3D Reconstruction [Chibane et al. 2020]
Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields
From point clouds: Conditioning on Feature Voxel grids
23
Convolutional Occupancy Networks [Peng et al. 2020]
Local Implicit Grid Representations for 3D Scenes [Jiang et al. 2020]
Implicit Functions in Feature Space for 3D Shape Reconstruction and Completion [Chabra et al. 2020]
Deep Local Shapes: Learning Local SDF Priors for Detailed 3D Reconstruction [Chibane et al. 2020]
Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields
From point clouds: Conditioning on Feature Voxel grids
24
Generalizes to Compositional Scenes!
But: cubic memory complexity :/
Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields
From Point clouds: Ground-plan and Tri-plane factorizations
25
Convolutional Occupancy Networks [Peng et al. 2020]
Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields
From Point clouds: Ground-plan and Tri-plane factorizations
26
Convolutional Occupancy Networks [Peng et al. 2020]
Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields
From Point clouds: Ground-plan and Tri-plane factorizations
27
Convolutional Occupancy Networks [Peng et al. 2020]
Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields
From point clouds: Conditioning on Reconstructed Voxelgrids
28
5x less memory!
Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields
How to locally condition if sensor
domain different than field
domain?
29
Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields
Local Conditioning: Pixel-Aligned Features.
30
PiFU, Saito et al., ICCV 2019.
PixelNeRF, Yu et al., CVPR 2021
Grf: Learning a general radiance field…, Trevithick et al.
Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields
Local Conditioning: Pixel-Aligned Features.
31
PiFU, Saito et al., ICCV 2019.
PixelNeRF, Yu et al., CVPR 2021
Grf: Learning a general radiance field…, Trevithick et al.
Generalizes much better than global conditioning (like SRNs, DVR).
No persistent 3D representation.
All priors are learned in image space.
Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields
Object-centric representations
32
CoLF: Unsupervised Learning of Compositional Object Light Fields, arXiv 2022.
Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields
Object-centric representations
CoLF: Unsupervised Learning of Compositional
Object Light Fields, arXiv 2022.
uORF, ICLR 2022
Learns to disentangle objects self-supervised.
Inference of object-centric latent codes is hard problem.
Currently limited to relatively simple scenes, but progress is quick!
Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields
Conditional Ground Plans for Single-Image 3D Reconstruction
34
Seeing 3D Objects in a Single Image via Self-Supervised Static-Dynamic Disentanglement, Sharma et al. 2022
Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields
Conditional Ground Plans for Single-Image 3D Reconstruction
35
Seeing 3D Objects in a Single Image via Self-Supervised Static-Dynamic Disentanglement, Sharma et al. 2022
Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields
Conditional Ground Plans for Single-Image 3D Reconstruction
36
Seeing 3D Objects in a Single Image via Self-Supervised Static-Dynamic Disentanglement, Sharma et al. 2022
Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields
How to infer latent codes?
37
Neural Scene
Representatio
n
Neural
Renderer
Inference
Encoder Latent Variables {𝑧𝑖}𝑖=1
𝑁
Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields
Encoding vs. Auto-Decoding
38
Neural Fields in Visual Computing and Beyond, Xie et al., EG STAR 2022
Encoding Auto-Decoding
Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields
Auto-Decoding for inverse graphics
39
REN D ER
Latent code 𝑧0
Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields
Auto-Decoding for inverse graphics
40
REN D ER
Latent code 𝑧0
𝑧 = arg min
𝑧
REN D ER (Φ) − ℐ
3D-structured, resolution-invariant!
Samples need not lie on regular
grids!
Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields
Out-of-distribution generalization
41
3D structure enables generalization
to out-of-distribution camera poses!
𝑧 = arg min
𝑧
REN D ER 𝜽(SR N 𝜙=𝐻𝑁𝜓(𝑧), 𝜉) − ℐ
Reconstruction
CNN encoder
Input
Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields
Other forms of Generalization: Transformer Decoders
42
AIR-Nets, Giebenhain et al. 2022
Scene Representation Transformer
Sajjadi et al. 2022
Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields
Other forms of Generalization: Gradient-based meta-learning
Representation
In-the loop
specialization via gradient
descent
Meta-Representation
43
MetaSDF: Meta-learning Signed Distance Functions, NeurIPS 2020
Backpropagate through gradient-
descent inference at training time.
Learn initialization that explains
held-out observations when fit to
context observation.
Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields
Inferring Neural Scene Representations
44
Inference
Neural Scene
Representatio
n
Neural
Renderer
Generalization enables reconstruction from incomplete observations.
Any other benefits?
Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields
Problem: Forward map might be expensive!
45
Inference
Neural Scene
Representatio
n
Neural
Renderer
Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields
3D-structured Neural Scene Representations
: ℝ3 → ℝn
Hundreds of samples per ray.
Time- and memory-intensive training.
Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields
: ℝ3 → ℝn
[Adelson et al. 1991, Levoy et al. 1996, Gortler et al. 1996]
Light Field
Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields
: ℝ3 → ℝn
Light Field Networks
Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields
: ℝ3 → ℝn
Light Field Networks
Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields
Light Field Networks
Conditioning
Plücker Coords.
An Alternative Scene Representation
Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields
Rendering is learned / representation is “already rendered”
51
Inference
Neural Scene
Representatio
n
Neural
Renderer
Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields
Rendering is learned / representation is “already rendered”
52
Inference
“Rendered” Neural Scene
Representation
More difficult inference problem, but more general renderer.
Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields
Light Field Networks Volumetric Rendering (pixelNeRF)
500 FPS
1 evaluation per ray
0.033 FPS
196 evaluations per ray
Real-time. No post-processing, no discrete data structures (octrees, voxelgrids, …).
>100x reduction in memory: Can be trained on small GPUs!
15,000x speed
1,000x speed
100x speed
10x speed
1x speed
Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields
Light Field Networks
500 FPS
1 evaluation per ray
Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields
Light Fields with Transformers:
Scene Representation Transformer (CVPR 2022)
No 3D Renderer: Directly parameterizes Light
Field!
Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields
56
Things I didn’t talk about
● Generalization in 2D, 1D, etc. neural fields: Images, audio…
see LIIF (Chen et al. 2021), …
● Neural field-to-neural field translation, see Spatially-Adaptive
Pixelwise Networks for Fast Image Translation (Shaham et al.
2020)
● Generalization for robotics applications (see Neural Descriptor
Fields (Simeonov et al.), 3D neural scene … (Li et al., CoRL 2022),
Learning Multi-Object Dynamics... (Driess et al. 2022), …
● Generalization for structured field with known a-priori structure
(humans, hands, faces, etc)
Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields
57
Outlook
● Generalization gaining traction: Single-scene optimization too
limited.
● Opens up completely new ways of thinking about problems:
Can amortize otherwise expensive forward maps (light fields).
● Making progress on the question of compositionality w/ object-
centric and locally conditioned neural fields. More to come.
● Processing & inferring regular grids is easy. Harder for point clouds
/ factorized representations, etc.
● Transformers seem to learn a type of local conditioning, but more
research necessary.
Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields
Prior-based Reconstruction of
Neural Fields
58
Vincent Sitzmann
Assistant Professor, Scene Representation Group
www.scenerepresentations.com
www.vincentsitzmann.com
Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields
Q & A
Thanks!

More Related Content

What's hot

LiDAR-based Autonomous Driving III (by Deep Learning)
LiDAR-based Autonomous Driving III (by Deep Learning)LiDAR-based Autonomous Driving III (by Deep Learning)
LiDAR-based Autonomous Driving III (by Deep Learning)
Yu Huang
 
Understanding neural radiance fields
Understanding neural radiance fieldsUnderstanding neural radiance fields
Understanding neural radiance fields
Varun Bhaseen
 
Super resolution in deep learning era - Jaejun Yoo
Super resolution in deep learning era - Jaejun YooSuper resolution in deep learning era - Jaejun Yoo
Super resolution in deep learning era - Jaejun Yoo
JaeJun Yoo
 
【チュートリアル】コンピュータビジョンによる動画認識 v2
【チュートリアル】コンピュータビジョンによる動画認識 v2【チュートリアル】コンピュータビジョンによる動画認識 v2
【チュートリアル】コンピュータビジョンによる動画認識 v2
Hirokatsu Kataoka
 
Real-Time Global Illumination Techniques
Real-Time Global Illumination TechniquesReal-Time Global Illumination Techniques
Real-Time Global Illumination TechniquesJangho Lee
 
fusion of Camera and lidar for autonomous driving II
fusion of Camera and lidar for autonomous driving IIfusion of Camera and lidar for autonomous driving II
fusion of Camera and lidar for autonomous driving II
Yu Huang
 
Deep Fakes Detection
Deep Fakes DetectionDeep Fakes Detection
Deep Fakes Detection
Yusuke Uchida
 
V2 v posenet
V2 v posenetV2 v posenet
V2 v posenet
NAVER Engineering
 
ViT (Vision Transformer) Review [CDM]
ViT (Vision Transformer) Review [CDM]ViT (Vision Transformer) Review [CDM]
ViT (Vision Transformer) Review [CDM]
Dongmin Choi
 
fusion of Camera and lidar for autonomous driving I
fusion of Camera and lidar for autonomous driving Ifusion of Camera and lidar for autonomous driving I
fusion of Camera and lidar for autonomous driving I
Yu Huang
 
文献紹介:TSM: Temporal Shift Module for Efficient Video Understanding
文献紹介:TSM: Temporal Shift Module for Efficient Video Understanding文献紹介:TSM: Temporal Shift Module for Efficient Video Understanding
文献紹介:TSM: Temporal Shift Module for Efficient Video Understanding
Toru Tamaki
 
Pose estimation from RGB images by deep learning
Pose estimation from RGB images by deep learningPose estimation from RGB images by deep learning
Pose estimation from RGB images by deep learning
Yu Huang
 
Semantic segmentation with Convolutional Neural Network Approaches
Semantic segmentation with Convolutional Neural Network ApproachesSemantic segmentation with Convolutional Neural Network Approaches
Semantic segmentation with Convolutional Neural Network Approaches
Fellowship at Vodafone FutureLab
 
Lidar for Autonomous Driving II (via Deep Learning)
Lidar for Autonomous Driving II (via Deep Learning)Lidar for Autonomous Driving II (via Deep Learning)
Lidar for Autonomous Driving II (via Deep Learning)
Yu Huang
 
Deep-Learning Based Stereo Super-Resolution
Deep-Learning Based Stereo Super-ResolutionDeep-Learning Based Stereo Super-Resolution
Deep-Learning Based Stereo Super-Resolution
NAVER Engineering
 
Deep VO and SLAM
Deep VO and SLAMDeep VO and SLAM
Deep VO and SLAM
Yu Huang
 
Camera-Based Road Lane Detection by Deep Learning II
Camera-Based Road Lane Detection by Deep Learning IICamera-Based Road Lane Detection by Deep Learning II
Camera-Based Road Lane Detection by Deep Learning II
Yu Huang
 
[unofficial] Pyramid Scene Parsing Network (CVPR 2017)
[unofficial] Pyramid Scene Parsing Network (CVPR 2017)[unofficial] Pyramid Scene Parsing Network (CVPR 2017)
[unofficial] Pyramid Scene Parsing Network (CVPR 2017)
Shunta Saito
 
-SSIIの技術マップ- 過去•現在, そして未来 [領域]認識
-SSIIの技術マップ- 過去•現在, そして未来 [領域]認識-SSIIの技術マップ- 過去•現在, そして未来 [領域]認識
-SSIIの技術マップ- 過去•現在, そして未来 [領域]認識
Hironobu Fujiyoshi
 
마칭 큐브 알고리즘 - ZP 2019 데캠
마칭 큐브 알고리즘 - ZP 2019 데캠마칭 큐브 알고리즘 - ZP 2019 데캠
마칭 큐브 알고리즘 - ZP 2019 데캠
동환 김
 

What's hot (20)

LiDAR-based Autonomous Driving III (by Deep Learning)
LiDAR-based Autonomous Driving III (by Deep Learning)LiDAR-based Autonomous Driving III (by Deep Learning)
LiDAR-based Autonomous Driving III (by Deep Learning)
 
Understanding neural radiance fields
Understanding neural radiance fieldsUnderstanding neural radiance fields
Understanding neural radiance fields
 
Super resolution in deep learning era - Jaejun Yoo
Super resolution in deep learning era - Jaejun YooSuper resolution in deep learning era - Jaejun Yoo
Super resolution in deep learning era - Jaejun Yoo
 
【チュートリアル】コンピュータビジョンによる動画認識 v2
【チュートリアル】コンピュータビジョンによる動画認識 v2【チュートリアル】コンピュータビジョンによる動画認識 v2
【チュートリアル】コンピュータビジョンによる動画認識 v2
 
Real-Time Global Illumination Techniques
Real-Time Global Illumination TechniquesReal-Time Global Illumination Techniques
Real-Time Global Illumination Techniques
 
fusion of Camera and lidar for autonomous driving II
fusion of Camera and lidar for autonomous driving IIfusion of Camera and lidar for autonomous driving II
fusion of Camera and lidar for autonomous driving II
 
Deep Fakes Detection
Deep Fakes DetectionDeep Fakes Detection
Deep Fakes Detection
 
V2 v posenet
V2 v posenetV2 v posenet
V2 v posenet
 
ViT (Vision Transformer) Review [CDM]
ViT (Vision Transformer) Review [CDM]ViT (Vision Transformer) Review [CDM]
ViT (Vision Transformer) Review [CDM]
 
fusion of Camera and lidar for autonomous driving I
fusion of Camera and lidar for autonomous driving Ifusion of Camera and lidar for autonomous driving I
fusion of Camera and lidar for autonomous driving I
 
文献紹介:TSM: Temporal Shift Module for Efficient Video Understanding
文献紹介:TSM: Temporal Shift Module for Efficient Video Understanding文献紹介:TSM: Temporal Shift Module for Efficient Video Understanding
文献紹介:TSM: Temporal Shift Module for Efficient Video Understanding
 
Pose estimation from RGB images by deep learning
Pose estimation from RGB images by deep learningPose estimation from RGB images by deep learning
Pose estimation from RGB images by deep learning
 
Semantic segmentation with Convolutional Neural Network Approaches
Semantic segmentation with Convolutional Neural Network ApproachesSemantic segmentation with Convolutional Neural Network Approaches
Semantic segmentation with Convolutional Neural Network Approaches
 
Lidar for Autonomous Driving II (via Deep Learning)
Lidar for Autonomous Driving II (via Deep Learning)Lidar for Autonomous Driving II (via Deep Learning)
Lidar for Autonomous Driving II (via Deep Learning)
 
Deep-Learning Based Stereo Super-Resolution
Deep-Learning Based Stereo Super-ResolutionDeep-Learning Based Stereo Super-Resolution
Deep-Learning Based Stereo Super-Resolution
 
Deep VO and SLAM
Deep VO and SLAMDeep VO and SLAM
Deep VO and SLAM
 
Camera-Based Road Lane Detection by Deep Learning II
Camera-Based Road Lane Detection by Deep Learning IICamera-Based Road Lane Detection by Deep Learning II
Camera-Based Road Lane Detection by Deep Learning II
 
[unofficial] Pyramid Scene Parsing Network (CVPR 2017)
[unofficial] Pyramid Scene Parsing Network (CVPR 2017)[unofficial] Pyramid Scene Parsing Network (CVPR 2017)
[unofficial] Pyramid Scene Parsing Network (CVPR 2017)
 
-SSIIの技術マップ- 過去•現在, そして未来 [領域]認識
-SSIIの技術マップ- 過去•現在, そして未来 [領域]認識-SSIIの技術マップ- 過去•現在, そして未来 [領域]認識
-SSIIの技術マップ- 過去•現在, そして未来 [領域]認識
 
마칭 큐브 알고리즘 - ZP 2019 데캠
마칭 큐브 알고리즘 - ZP 2019 데캠마칭 큐브 알고리즘 - ZP 2019 데캠
마칭 큐브 알고리즘 - ZP 2019 데캠
 

Similar to Tutorial on Generalization in Neural Fields, CVPR 2022 Tutorial on Neural Fields in Computer Vision

Light Field Networks: Neural Scene Representations with Single-Evaluation Ren...
Light Field Networks: Neural Scene Representations with Single-Evaluation Ren...Light Field Networks: Neural Scene Representations with Single-Evaluation Ren...
Light Field Networks: Neural Scene Representations with Single-Evaluation Ren...
Vincent Sitzmann
 
Scene Representation Networks: Continuous 3D-Structure-Aware Neural Scene Rep...
Scene Representation Networks: Continuous 3D-Structure-Aware Neural Scene Rep...Scene Representation Networks: Continuous 3D-Structure-Aware Neural Scene Rep...
Scene Representation Networks: Continuous 3D-Structure-Aware Neural Scene Rep...
Vincent Sitzmann
 
Transformer in Vision
Transformer in VisionTransformer in Vision
Transformer in Vision
Sangmin Woo
 
Large Scale Image Retrieval 2022.pdf
Large Scale Image Retrieval 2022.pdfLarge Scale Image Retrieval 2022.pdf
Large Scale Image Retrieval 2022.pdf
SamuCerezo
 
Point-GNN: Graph Neural Network for 3D Object Detection in a Point Cloud
Point-GNN: Graph Neural Network for 3D Object Detection in a Point CloudPoint-GNN: Graph Neural Network for 3D Object Detection in a Point Cloud
Point-GNN: Graph Neural Network for 3D Object Detection in a Point Cloud
Nuwan Sriyantha Bandara
 
HR3D: Content Adaptive Parallax Barriers
HR3D: Content Adaptive Parallax BarriersHR3D: Content Adaptive Parallax Barriers
HR3D: Content Adaptive Parallax Barriers
Matt Hirsch - MIT Media Lab
 
AR/SLAM for end-users
AR/SLAM for end-usersAR/SLAM for end-users
AR/SLAM for end-users
Rakuten Group, Inc.
 
Loihi many core_neuromorphic_chip
Loihi many core_neuromorphic_chipLoihi many core_neuromorphic_chip
Loihi many core_neuromorphic_chip
Mehmood Saleem
 
Deep 3D Analysis - Javier Ruiz-Hidalgo - UPC Barcelona 2018
Deep 3D Analysis - Javier Ruiz-Hidalgo - UPC Barcelona 2018Deep 3D Analysis - Javier Ruiz-Hidalgo - UPC Barcelona 2018
Deep 3D Analysis - Javier Ruiz-Hidalgo - UPC Barcelona 2018
Universitat Politècnica de Catalunya
 
Deep 3D Visual Analysis - Javier Ruiz-Hidalgo - UPC Barcelona 2017
Deep 3D Visual Analysis - Javier Ruiz-Hidalgo - UPC Barcelona 2017Deep 3D Visual Analysis - Javier Ruiz-Hidalgo - UPC Barcelona 2017
Deep 3D Visual Analysis - Javier Ruiz-Hidalgo - UPC Barcelona 2017
Universitat Politècnica de Catalunya
 
Spectral cnn
Spectral cnnSpectral cnn
Spectral cnn
Brian Kim
 
Visual odometry _report
Visual odometry _reportVisual odometry _report
Visual odometry _report
anutam majumder
 
Development of ML-based Optical Fine Alignment tool
Development of ML-based Optical Fine Alignment toolDevelopment of ML-based Optical Fine Alignment tool
Development of ML-based Optical Fine Alignment tool
Sashank Mishra
 
The CCD detector.ppt
The CCD detector.pptThe CCD detector.ppt
The CCD detector.ppt
Stefan Oprea
 
SIGGRAPH 2014 Course on Computational Cameras and Displays (part 2)
SIGGRAPH 2014 Course on Computational Cameras and Displays (part 2)SIGGRAPH 2014 Course on Computational Cameras and Displays (part 2)
SIGGRAPH 2014 Course on Computational Cameras and Displays (part 2)
Matthew O'Toole
 
Final_draft_Practice_School_II_report
Final_draft_Practice_School_II_reportFinal_draft_Practice_School_II_report
Final_draft_Practice_School_II_reportRishikesh Bagwe
 
Tissue Segmentation Methods Using 2D Histogram Matching in a Sequence of MR B...
Tissue Segmentation Methods Using 2D Histogram Matching in a Sequence of MR B...Tissue Segmentation Methods Using 2D Histogram Matching in a Sequence of MR B...
Tissue Segmentation Methods Using 2D Histogram Matching in a Sequence of MR B...
Vladimir Kanchev
 
240315_Thanh_LabSeminar[G-TAD: Sub-Graph Localization for Temporal Action Det...
240315_Thanh_LabSeminar[G-TAD: Sub-Graph Localization for Temporal Action Det...240315_Thanh_LabSeminar[G-TAD: Sub-Graph Localization for Temporal Action Det...
240315_Thanh_LabSeminar[G-TAD: Sub-Graph Localization for Temporal Action Det...
thanhdowork
 

Similar to Tutorial on Generalization in Neural Fields, CVPR 2022 Tutorial on Neural Fields in Computer Vision (20)

Light Field Networks: Neural Scene Representations with Single-Evaluation Ren...
Light Field Networks: Neural Scene Representations with Single-Evaluation Ren...Light Field Networks: Neural Scene Representations with Single-Evaluation Ren...
Light Field Networks: Neural Scene Representations with Single-Evaluation Ren...
 
Scene Representation Networks: Continuous 3D-Structure-Aware Neural Scene Rep...
Scene Representation Networks: Continuous 3D-Structure-Aware Neural Scene Rep...Scene Representation Networks: Continuous 3D-Structure-Aware Neural Scene Rep...
Scene Representation Networks: Continuous 3D-Structure-Aware Neural Scene Rep...
 
Transformer in Vision
Transformer in VisionTransformer in Vision
Transformer in Vision
 
Large Scale Image Retrieval 2022.pdf
Large Scale Image Retrieval 2022.pdfLarge Scale Image Retrieval 2022.pdf
Large Scale Image Retrieval 2022.pdf
 
Point-GNN: Graph Neural Network for 3D Object Detection in a Point Cloud
Point-GNN: Graph Neural Network for 3D Object Detection in a Point CloudPoint-GNN: Graph Neural Network for 3D Object Detection in a Point Cloud
Point-GNN: Graph Neural Network for 3D Object Detection in a Point Cloud
 
998-isvc16
998-isvc16998-isvc16
998-isvc16
 
final ppt
final pptfinal ppt
final ppt
 
HR3D: Content Adaptive Parallax Barriers
HR3D: Content Adaptive Parallax BarriersHR3D: Content Adaptive Parallax Barriers
HR3D: Content Adaptive Parallax Barriers
 
AR/SLAM for end-users
AR/SLAM for end-usersAR/SLAM for end-users
AR/SLAM for end-users
 
Loihi many core_neuromorphic_chip
Loihi many core_neuromorphic_chipLoihi many core_neuromorphic_chip
Loihi many core_neuromorphic_chip
 
Deep 3D Analysis - Javier Ruiz-Hidalgo - UPC Barcelona 2018
Deep 3D Analysis - Javier Ruiz-Hidalgo - UPC Barcelona 2018Deep 3D Analysis - Javier Ruiz-Hidalgo - UPC Barcelona 2018
Deep 3D Analysis - Javier Ruiz-Hidalgo - UPC Barcelona 2018
 
Deep 3D Visual Analysis - Javier Ruiz-Hidalgo - UPC Barcelona 2017
Deep 3D Visual Analysis - Javier Ruiz-Hidalgo - UPC Barcelona 2017Deep 3D Visual Analysis - Javier Ruiz-Hidalgo - UPC Barcelona 2017
Deep 3D Visual Analysis - Javier Ruiz-Hidalgo - UPC Barcelona 2017
 
Spectral cnn
Spectral cnnSpectral cnn
Spectral cnn
 
Visual odometry _report
Visual odometry _reportVisual odometry _report
Visual odometry _report
 
Development of ML-based Optical Fine Alignment tool
Development of ML-based Optical Fine Alignment toolDevelopment of ML-based Optical Fine Alignment tool
Development of ML-based Optical Fine Alignment tool
 
The CCD detector.ppt
The CCD detector.pptThe CCD detector.ppt
The CCD detector.ppt
 
SIGGRAPH 2014 Course on Computational Cameras and Displays (part 2)
SIGGRAPH 2014 Course on Computational Cameras and Displays (part 2)SIGGRAPH 2014 Course on Computational Cameras and Displays (part 2)
SIGGRAPH 2014 Course on Computational Cameras and Displays (part 2)
 
Final_draft_Practice_School_II_report
Final_draft_Practice_School_II_reportFinal_draft_Practice_School_II_report
Final_draft_Practice_School_II_report
 
Tissue Segmentation Methods Using 2D Histogram Matching in a Sequence of MR B...
Tissue Segmentation Methods Using 2D Histogram Matching in a Sequence of MR B...Tissue Segmentation Methods Using 2D Histogram Matching in a Sequence of MR B...
Tissue Segmentation Methods Using 2D Histogram Matching in a Sequence of MR B...
 
240315_Thanh_LabSeminar[G-TAD: Sub-Graph Localization for Temporal Action Det...
240315_Thanh_LabSeminar[G-TAD: Sub-Graph Localization for Temporal Action Det...240315_Thanh_LabSeminar[G-TAD: Sub-Graph Localization for Temporal Action Det...
240315_Thanh_LabSeminar[G-TAD: Sub-Graph Localization for Temporal Action Det...
 

Recently uploaded

insect morphology and physiology of insect
insect morphology and physiology of insectinsect morphology and physiology of insect
insect morphology and physiology of insect
anitaento25
 
Body fluids_tonicity_dehydration_hypovolemia_hypervolemia.pptx
Body fluids_tonicity_dehydration_hypovolemia_hypervolemia.pptxBody fluids_tonicity_dehydration_hypovolemia_hypervolemia.pptx
Body fluids_tonicity_dehydration_hypovolemia_hypervolemia.pptx
muralinath2
 
Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...
Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...
Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...
NathanBaughman3
 
insect taxonomy importance systematics and classification
insect taxonomy importance systematics and classificationinsect taxonomy importance systematics and classification
insect taxonomy importance systematics and classification
anitaento25
 
Structural Classification Of Protein (SCOP)
Structural Classification Of Protein  (SCOP)Structural Classification Of Protein  (SCOP)
Structural Classification Of Protein (SCOP)
aishnasrivastava
 
Penicillin...........................pptx
Penicillin...........................pptxPenicillin...........................pptx
Penicillin...........................pptx
Cherry
 
Citrus Greening Disease and its Management
Citrus Greening Disease and its ManagementCitrus Greening Disease and its Management
Citrus Greening Disease and its Management
subedisuryaofficial
 
Multi-source connectivity as the driver of solar wind variability in the heli...
Multi-source connectivity as the driver of solar wind variability in the heli...Multi-source connectivity as the driver of solar wind variability in the heli...
Multi-source connectivity as the driver of solar wind variability in the heli...
Sérgio Sacani
 
Unveiling the Energy Potential of Marshmallow Deposits.pdf
Unveiling the Energy Potential of Marshmallow Deposits.pdfUnveiling the Energy Potential of Marshmallow Deposits.pdf
Unveiling the Energy Potential of Marshmallow Deposits.pdf
Erdal Coalmaker
 
Cancer cell metabolism: special Reference to Lactate Pathway
Cancer cell metabolism: special Reference to Lactate PathwayCancer cell metabolism: special Reference to Lactate Pathway
Cancer cell metabolism: special Reference to Lactate Pathway
AADYARAJPANDEY1
 
Hemostasis_importance& clinical significance.pptx
Hemostasis_importance& clinical significance.pptxHemostasis_importance& clinical significance.pptx
Hemostasis_importance& clinical significance.pptx
muralinath2
 
justice-and-fairness-ethics with example
justice-and-fairness-ethics with examplejustice-and-fairness-ethics with example
justice-and-fairness-ethics with example
azzyixes
 
Structures and textures of metamorphic rocks
Structures and textures of metamorphic rocksStructures and textures of metamorphic rocks
Structures and textures of metamorphic rocks
kumarmathi863
 
In silico drugs analogue design: novobiocin analogues.pptx
In silico drugs analogue design: novobiocin analogues.pptxIn silico drugs analogue design: novobiocin analogues.pptx
In silico drugs analogue design: novobiocin analogues.pptx
AlaminAfendy1
 
Richard's aventures in two entangled wonderlands
Richard's aventures in two entangled wonderlandsRichard's aventures in two entangled wonderlands
Richard's aventures in two entangled wonderlands
Richard Gill
 
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
muralinath2
 
Anemia_ different types_causes_ conditions
Anemia_ different types_causes_ conditionsAnemia_ different types_causes_ conditions
Anemia_ different types_causes_ conditions
muralinath2
 
platelets- lifespan -Clot retraction-disorders.pptx
platelets- lifespan -Clot retraction-disorders.pptxplatelets- lifespan -Clot retraction-disorders.pptx
platelets- lifespan -Clot retraction-disorders.pptx
muralinath2
 
Orion Air Quality Monitoring Systems - CWS
Orion Air Quality Monitoring Systems - CWSOrion Air Quality Monitoring Systems - CWS
Orion Air Quality Monitoring Systems - CWS
Columbia Weather Systems
 
Seminar of U.V. Spectroscopy by SAMIR PANDA
 Seminar of U.V. Spectroscopy by SAMIR PANDA Seminar of U.V. Spectroscopy by SAMIR PANDA
Seminar of U.V. Spectroscopy by SAMIR PANDA
SAMIR PANDA
 

Recently uploaded (20)

insect morphology and physiology of insect
insect morphology and physiology of insectinsect morphology and physiology of insect
insect morphology and physiology of insect
 
Body fluids_tonicity_dehydration_hypovolemia_hypervolemia.pptx
Body fluids_tonicity_dehydration_hypovolemia_hypervolemia.pptxBody fluids_tonicity_dehydration_hypovolemia_hypervolemia.pptx
Body fluids_tonicity_dehydration_hypovolemia_hypervolemia.pptx
 
Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...
Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...
Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...
 
insect taxonomy importance systematics and classification
insect taxonomy importance systematics and classificationinsect taxonomy importance systematics and classification
insect taxonomy importance systematics and classification
 
Structural Classification Of Protein (SCOP)
Structural Classification Of Protein  (SCOP)Structural Classification Of Protein  (SCOP)
Structural Classification Of Protein (SCOP)
 
Penicillin...........................pptx
Penicillin...........................pptxPenicillin...........................pptx
Penicillin...........................pptx
 
Citrus Greening Disease and its Management
Citrus Greening Disease and its ManagementCitrus Greening Disease and its Management
Citrus Greening Disease and its Management
 
Multi-source connectivity as the driver of solar wind variability in the heli...
Multi-source connectivity as the driver of solar wind variability in the heli...Multi-source connectivity as the driver of solar wind variability in the heli...
Multi-source connectivity as the driver of solar wind variability in the heli...
 
Unveiling the Energy Potential of Marshmallow Deposits.pdf
Unveiling the Energy Potential of Marshmallow Deposits.pdfUnveiling the Energy Potential of Marshmallow Deposits.pdf
Unveiling the Energy Potential of Marshmallow Deposits.pdf
 
Cancer cell metabolism: special Reference to Lactate Pathway
Cancer cell metabolism: special Reference to Lactate PathwayCancer cell metabolism: special Reference to Lactate Pathway
Cancer cell metabolism: special Reference to Lactate Pathway
 
Hemostasis_importance& clinical significance.pptx
Hemostasis_importance& clinical significance.pptxHemostasis_importance& clinical significance.pptx
Hemostasis_importance& clinical significance.pptx
 
justice-and-fairness-ethics with example
justice-and-fairness-ethics with examplejustice-and-fairness-ethics with example
justice-and-fairness-ethics with example
 
Structures and textures of metamorphic rocks
Structures and textures of metamorphic rocksStructures and textures of metamorphic rocks
Structures and textures of metamorphic rocks
 
In silico drugs analogue design: novobiocin analogues.pptx
In silico drugs analogue design: novobiocin analogues.pptxIn silico drugs analogue design: novobiocin analogues.pptx
In silico drugs analogue design: novobiocin analogues.pptx
 
Richard's aventures in two entangled wonderlands
Richard's aventures in two entangled wonderlandsRichard's aventures in two entangled wonderlands
Richard's aventures in two entangled wonderlands
 
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
 
Anemia_ different types_causes_ conditions
Anemia_ different types_causes_ conditionsAnemia_ different types_causes_ conditions
Anemia_ different types_causes_ conditions
 
platelets- lifespan -Clot retraction-disorders.pptx
platelets- lifespan -Clot retraction-disorders.pptxplatelets- lifespan -Clot retraction-disorders.pptx
platelets- lifespan -Clot retraction-disorders.pptx
 
Orion Air Quality Monitoring Systems - CWS
Orion Air Quality Monitoring Systems - CWSOrion Air Quality Monitoring Systems - CWS
Orion Air Quality Monitoring Systems - CWS
 
Seminar of U.V. Spectroscopy by SAMIR PANDA
 Seminar of U.V. Spectroscopy by SAMIR PANDA Seminar of U.V. Spectroscopy by SAMIR PANDA
Seminar of U.V. Spectroscopy by SAMIR PANDA
 

Tutorial on Generalization in Neural Fields, CVPR 2022 Tutorial on Neural Fields in Computer Vision

  • 1. Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields NEURAL FIELDS IN COMPUTER VISION Full-Day Tutorial, June 20th, 2022 neuralfields.cs.brown.edu/cvpr22 Reality Labs Research Yiheng Xie Towaki Takikawa Shunsuke Saito Or Litany James Tompkin Vincent Sitzmann Srinath Sridhar
  • 2. Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields Prior-based Reconstruction of Neural Fields 2 Vincent Sitzmann Assistant Professor, Scene Representation Group www.scenerepresentations.com www.vincentsitzmann.com
  • 3. Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields Motivation: Novel View Synthesis + + Observations Image + Pose & Intrinsics { , , … { Model Novel Views
  • 4. Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields Motivation: Novel View Synthesis 4 Fitting / Optimization Neural Scene Representatio n Neural Renderer
  • 5. Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields Motivation: Novel View Synthesis 5 Inference Neural Scene Representatio n Neural Renderer Inference maps a set of observations to the parameters of a Neural Scene Representation.
  • 6. Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields Overfitting case: Inference = Fitting via Gradient Descent 6 ,… + } { REN D ER 𝜽 SDF + Color MLPs SR N 𝝓 Fitting Rendering Normal map RGB Sitzmann et al: Scene Representation Networks: Continuous 3D-Structure-Aware Neural Scene Representations, NeurIPS 2020. min REN D ER 𝜽(SR N 𝝓, 𝜉𝑖) − ℐ𝑖
  • 7. Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields DeepVoxels, CVPR 2018. NeRF, ECCV 2021 IDR, ECCV 2021 Plenoxels, CVPR 2022 SIREN, NeurIPS 2020
  • 8. Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields What if we have incomplete observations? 8 REN D ER 𝜽 SDF + Color MLPs SR N 𝝓 Sitzmann et al: Scene Representation Networks: Continuous 3D-Structure-Aware Neural Scene Representations, NeurIPS 2020. min REN D ER 𝜽(SR N 𝝓, 𝜉𝑖) − ℐ𝑖 + ℐ, 𝜉 No 3D inform. Normal map RGB
  • 9. Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields Inferring Neural Fields 9 Neural Scene Representatio n Neural Renderer If only a single observation is available, or if only part of the scene has been observed, Inference needs to be prior-based – i.e., we need to learn to reconstruct. ?
  • 10. Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields General Framework: Encoder-Decoder 10 Neural Scene Representatio n Neural Renderer Decoder Inference Latent Variables {𝑧𝑖}𝑖=1 𝑁 Encoder
  • 11. Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields What are the latent variables? 11 Neural Scene Representatio n Neural Renderer Inference Encoder Latent Variables {𝑧𝑖}𝑖=1 𝑁
  • 12. Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields How to predict latent variables from observations? 12 Neural Scene Representatio n Neural Renderer Inference Encoder Latent Variables {𝑧𝑖}𝑖=1 𝑁
  • 13. Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields How do we decode latent variables into the Neural Field? 13 Neural Scene Representatio n Neural Renderer Inference Encoder Latent Variables {𝑧𝑖}𝑖=1 𝑁
  • 14. Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields What are the latent variables? 14 Neural Scene Representatio n Neural Renderer Inference Encoder Latent Variables {𝑧𝑖}𝑖=1 𝑁
  • 15. Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields Key Consideration: Locality. 15 Neural Fields in Visual Computing and Beyond, Xie et al., EG STAR 2022 Global Conditioning Local Conditioning
  • 16. Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields Global Latent Codes 16 Neural Fields in Visual Computing and Beyond, Xie et al., EG STAR 2022 Global Conditioning Local Conditioning
  • 17. Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields Global conditioning 17 ? Latent code 𝑧
  • 18. Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields Global conditioning 18 1[Schmidhuber et al. 1992, Schmidhuber et al. 1993, Stanley et al. 2009, Ha et al., 2016] Hypernetwork1 Latent code 𝑧
  • 19. Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields Global Latent Codes: Enables reconstruction from partial observations! 19 Scene Representation Networks: Continuous 3D-Structure-Aware Neural Scene Representations, NeurIPS 2019. Differential Volumetric Rendering, Niemeyer et al., CVPR 2020 DeepSDF, Occupancy Networks, IM-Net
  • 20. Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields Global Latent Codes: Enables reconstruction from partial observations! 20 Scene Representation Networks: Continuous 3D-Structure-Aware Neural Scene Representations, NeurIPS 2019. Differential Volumetric Rendering, Niemeyer et al., CVPR 2020 DeepSDF, Occupancy Networks, IM-Net Key limitation: Simple, non-compositional scenes. But: Latent Space for full objects (interpolation etc)
  • 21. Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields Local Latent Codes 21 Neural Fields in Visual Computing and Beyond, Xie et al., EG STAR 2022 Global Conditioning Local Conditioning
  • 22. Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields From point clouds: Conditioning on Feature Voxel grids 22 Convolutional Occupancy Networks [Peng et al. 2020] Local Implicit Grid Representations for 3D Scenes [Jiang et al. 2020] Implicit Functions in Feature Space for 3D Shape Reconstruction and Completion [Chabra et al. 2020] Deep Local Shapes: Learning Local SDF Priors for Detailed 3D Reconstruction [Chibane et al. 2020]
  • 23. Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields From point clouds: Conditioning on Feature Voxel grids 23 Convolutional Occupancy Networks [Peng et al. 2020] Local Implicit Grid Representations for 3D Scenes [Jiang et al. 2020] Implicit Functions in Feature Space for 3D Shape Reconstruction and Completion [Chabra et al. 2020] Deep Local Shapes: Learning Local SDF Priors for Detailed 3D Reconstruction [Chibane et al. 2020]
  • 24. Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields From point clouds: Conditioning on Feature Voxel grids 24 Generalizes to Compositional Scenes! But: cubic memory complexity :/
  • 25. Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields From Point clouds: Ground-plan and Tri-plane factorizations 25 Convolutional Occupancy Networks [Peng et al. 2020]
  • 26. Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields From Point clouds: Ground-plan and Tri-plane factorizations 26 Convolutional Occupancy Networks [Peng et al. 2020]
  • 27. Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields From Point clouds: Ground-plan and Tri-plane factorizations 27 Convolutional Occupancy Networks [Peng et al. 2020]
  • 28. Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields From point clouds: Conditioning on Reconstructed Voxelgrids 28 5x less memory!
  • 29. Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields How to locally condition if sensor domain different than field domain? 29
  • 30. Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields Local Conditioning: Pixel-Aligned Features. 30 PiFU, Saito et al., ICCV 2019. PixelNeRF, Yu et al., CVPR 2021 Grf: Learning a general radiance field…, Trevithick et al.
  • 31. Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields Local Conditioning: Pixel-Aligned Features. 31 PiFU, Saito et al., ICCV 2019. PixelNeRF, Yu et al., CVPR 2021 Grf: Learning a general radiance field…, Trevithick et al. Generalizes much better than global conditioning (like SRNs, DVR). No persistent 3D representation. All priors are learned in image space.
  • 32. Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields Object-centric representations 32 CoLF: Unsupervised Learning of Compositional Object Light Fields, arXiv 2022.
  • 33. Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields Object-centric representations CoLF: Unsupervised Learning of Compositional Object Light Fields, arXiv 2022. uORF, ICLR 2022 Learns to disentangle objects self-supervised. Inference of object-centric latent codes is hard problem. Currently limited to relatively simple scenes, but progress is quick!
  • 34. Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields Conditional Ground Plans for Single-Image 3D Reconstruction 34 Seeing 3D Objects in a Single Image via Self-Supervised Static-Dynamic Disentanglement, Sharma et al. 2022
  • 35. Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields Conditional Ground Plans for Single-Image 3D Reconstruction 35 Seeing 3D Objects in a Single Image via Self-Supervised Static-Dynamic Disentanglement, Sharma et al. 2022
  • 36. Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields Conditional Ground Plans for Single-Image 3D Reconstruction 36 Seeing 3D Objects in a Single Image via Self-Supervised Static-Dynamic Disentanglement, Sharma et al. 2022
  • 37. Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields How to infer latent codes? 37 Neural Scene Representatio n Neural Renderer Inference Encoder Latent Variables {𝑧𝑖}𝑖=1 𝑁
  • 38. Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields Encoding vs. Auto-Decoding 38 Neural Fields in Visual Computing and Beyond, Xie et al., EG STAR 2022 Encoding Auto-Decoding
  • 39. Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields Auto-Decoding for inverse graphics 39 REN D ER Latent code 𝑧0
  • 40. Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields Auto-Decoding for inverse graphics 40 REN D ER Latent code 𝑧0 𝑧 = arg min 𝑧 REN D ER (Φ) − ℐ 3D-structured, resolution-invariant! Samples need not lie on regular grids!
  • 41. Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields Out-of-distribution generalization 41 3D structure enables generalization to out-of-distribution camera poses! 𝑧 = arg min 𝑧 REN D ER 𝜽(SR N 𝜙=𝐻𝑁𝜓(𝑧), 𝜉) − ℐ Reconstruction CNN encoder Input
  • 42. Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields Other forms of Generalization: Transformer Decoders 42 AIR-Nets, Giebenhain et al. 2022 Scene Representation Transformer Sajjadi et al. 2022
  • 43. Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields Other forms of Generalization: Gradient-based meta-learning Representation In-the loop specialization via gradient descent Meta-Representation 43 MetaSDF: Meta-learning Signed Distance Functions, NeurIPS 2020 Backpropagate through gradient- descent inference at training time. Learn initialization that explains held-out observations when fit to context observation.
  • 44. Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields Inferring Neural Scene Representations 44 Inference Neural Scene Representatio n Neural Renderer Generalization enables reconstruction from incomplete observations. Any other benefits?
  • 45. Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields Problem: Forward map might be expensive! 45 Inference Neural Scene Representatio n Neural Renderer
  • 46. Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields 3D-structured Neural Scene Representations : ℝ3 → ℝn Hundreds of samples per ray. Time- and memory-intensive training.
  • 47. Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields : ℝ3 → ℝn [Adelson et al. 1991, Levoy et al. 1996, Gortler et al. 1996] Light Field
  • 48. Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields : ℝ3 → ℝn Light Field Networks
  • 49. Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields : ℝ3 → ℝn Light Field Networks
  • 50. Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields Light Field Networks Conditioning Plücker Coords. An Alternative Scene Representation
  • 51. Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields Rendering is learned / representation is “already rendered” 51 Inference Neural Scene Representatio n Neural Renderer
  • 52. Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields Rendering is learned / representation is “already rendered” 52 Inference “Rendered” Neural Scene Representation More difficult inference problem, but more general renderer.
  • 53. Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields Light Field Networks Volumetric Rendering (pixelNeRF) 500 FPS 1 evaluation per ray 0.033 FPS 196 evaluations per ray Real-time. No post-processing, no discrete data structures (octrees, voxelgrids, …). >100x reduction in memory: Can be trained on small GPUs! 15,000x speed 1,000x speed 100x speed 10x speed 1x speed
  • 54. Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields Light Field Networks 500 FPS 1 evaluation per ray
  • 55. Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields Light Fields with Transformers: Scene Representation Transformer (CVPR 2022) No 3D Renderer: Directly parameterizes Light Field!
  • 56. Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields 56 Things I didn’t talk about ● Generalization in 2D, 1D, etc. neural fields: Images, audio… see LIIF (Chen et al. 2021), … ● Neural field-to-neural field translation, see Spatially-Adaptive Pixelwise Networks for Fast Image Translation (Shaham et al. 2020) ● Generalization for robotics applications (see Neural Descriptor Fields (Simeonov et al.), 3D neural scene … (Li et al., CoRL 2022), Learning Multi-Object Dynamics... (Driess et al. 2022), … ● Generalization for structured field with known a-priori structure (humans, hands, faces, etc)
  • 57. Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields 57 Outlook ● Generalization gaining traction: Single-scene optimization too limited. ● Opens up completely new ways of thinking about problems: Can amortize otherwise expensive forward maps (light fields). ● Making progress on the question of compositionality w/ object- centric and locally conditioned neural fields. More to come. ● Processing & inferring regular grids is easy. Harder for point clouds / factorized representations, etc. ● Transformers seem to learn a type of local conditioning, but more research necessary.
  • 58. Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields Prior-based Reconstruction of Neural Fields 58 Vincent Sitzmann Assistant Professor, Scene Representation Group www.scenerepresentations.com www.vincentsitzmann.com
  • 59. Vincent Sitzmann, CVPR 2022 Tutorial on Neural Fields Q & A Thanks!