SlideShare a Scribd company logo
1
Neural Inverse Rendering for
General Reflectance Photometric Stereo
Short oral presentation
ICML 2018
July 11, 2018
Tatsunori Taniai
RIKEN AIP
Takanori Maehara
RIKEN AIP
ICML 2018 Paper
2
Photometric stereo: shape from varying shading [Woodham, 80]
Scene observations
under varying illuminations
3D surface normals
(surface orientations)
PS is an essential technique
for highly detailed 3D shape
recovery in combination
with multiview stereo (MVS)
MVS only
[Park+ 13]
MVS + PS
3
Photometric stereo: shape from varying shading [Woodham, 80]
Challenges
• Real-world objects have various complex reflectance properties (BRDFs)
→ Use of deep learning to model various BRDFs seems promising
but it is actually very inactive because…
Scene observations
under varying illuminations
3D surface normals
(surface orientations)
• Not much training data. Accurately measuring surface normals is difficult.
4
ML perspective: physics-based unsupervised learning
Observed data Hidden dataEstimator
𝑿 𝒀
Synthesized data
𝑿′
𝒁
Physical generative model
𝑿 = 𝑓(𝒀, 𝒁)
• Not directly observable
or annotatable.
• No ground truth for
training data.
Use physics to bypass the issue of lacking training data.
Disentangled representation
Reconstruction loss
𝑿 − 𝑿′
𝑾
5
Talk Overview
• Introduction
• Basics of photometric stereo
• Our approach
• Experimental results
6
Photometric stereo as inverse imaging process
𝒗𝒏
ℓ
Point light source
Object surface
Camera
𝐼: Image intensity (known)
ℓ: Light direction & intensity (known)
𝒗: View direction (known)
𝒏: Surface normal (unknown)
𝜌: BRDF (unknown)
𝜌
7
Photometric stereo as inverse imaging process
𝒗𝒏
ℓ
Point light source
Object surface
Camera
⊙=
max(0, ℓ 𝑇 𝒏)𝐼 = ⊙ 𝜌( 𝒏, ℓ, 𝒗)
Observed pixel Shading Reflectance (BRDF)
Reflectance (rendering) equation
𝐼: Image intensity (known)
ℓ: Light direction & intensity (known)
𝒗: View direction (known)
𝒏: Surface normal (unknown)
𝜌: BRDF (unknown)
Estimate 𝒏 from intensities when changing illuminations ℓ
𝜌
× × ×
8
Lest squares solution for diffuse surfaces [Woodham, 80]
𝒏
ℓ
Point light source
Object surface
𝜌0
A closed-form solution exists if 𝝆 is constant (uniform distribution)
9
Lest squares solution for diffuse surfaces [Woodham, 80]
𝒏
ℓ
Point light source
Object surface
A closed-form solution exists if 𝝆 is constant (uniform distribution)
𝜌0
Lambertian diffuse model
𝐼 = 𝜌0 max(0, ℓ 𝑇 𝒏)
𝐼1 = 𝜌0ℓ1
𝑇
𝒏
𝐼2 = 𝜌0ℓ2
𝑇
𝒏
𝐼 𝑀 = 𝜌0ℓ 𝑀
𝑇
𝒏⋯
Multiple observations by varying illuminations
𝑰 = 𝑳 𝑇(𝜌0 𝒏)
Linear system for
a set of bright pixels
= 𝜌0ℓ 𝑇 𝒏 (for 𝐼 > 0)
10
Our goal: general reflectance photometric stereo
Can we determine 𝒏 from intensities when
• 𝝆 is unknown and spatially-varying
• no training data with ground truth of 𝒏 and 𝝆
Multiple intensity observations
under known illumination patterns
𝐼1 = max 0, ℓ1
𝑇
𝒏 ⊙ 𝜌( 𝒏, ℓ1, 𝒗)
⋯
𝐼2 = max 0, ℓ2
𝑇
𝒏 ⊙ 𝜌( 𝒏, ℓ2, 𝒗)
𝐼 𝑀 = max 0, ℓ 𝑀
𝑇
𝒏 ⊙ 𝜌( 𝒏, ℓ 𝑀, 𝒗)
ℓ
𝜌
Surfaces with unknown and
spatially-varying BRDFs
11
Talk Overview
• Introduction
• Basics of photometric stereo
• Our approach
– Physics-embedded auto-encoder network
– Reconstruction loss
– Test-time learning algorithm
• Experimental results
12
Our physics-embedded auto-encoder network (simplified)…
𝚽
𝒀𝑖𝑿𝑖
𝑵
…
…
… …
𝑰1
𝑰𝑖
𝑰 𝑀
𝒁𝑖
Photometric stereo network (PSNet)
Image reconstruction network (IRNet)
𝑀𝐶 x 𝐻 x 𝑊
3 x 𝐻 x 𝑊
𝑰𝑖
𝑀 x 𝐶 x 𝐻 x 𝑊
𝑀 x 𝐶 x 𝐻 x 𝑊
𝑀 x 𝐶 x 𝐻 x 𝑊
384 x 𝐻 x 𝑊
𝑀 x 16 x 𝐻 x 𝑊
Surface
normal map
Synthesized
images
Observed
images
𝑰2
Concat
Batch
Rendering equation
𝑵
𝑹𝑖
𝑰
Reflectance
Two-streams network to 1) produce a normal map and 2) re-render images
analyzes all observations to produce a single normal map
processes each observation individually to disentangle and reconstruct an image
13
Physics-embedded auto-encoder network (full)…
𝑺𝑖
𝚽
𝒀𝑖𝑿𝑖
𝑵
𝑓ps1:
3x3 Conv
BatchNorm
ReLU
x 3
𝑓ps2:
3x3 Conv
𝐿2 Norm
𝑓ir1:
3x3 Conv
BatchNorm
ReLU
x 3 𝑓ir2:
1x1 Conv
BatchNorm
ReLU
…
…
… …
𝑰1
𝑰𝑖
𝑰 𝑀
𝒁𝑖
Photometric stereo network (PSNet)
Image reconstruction network (IRNet)
𝑀𝐶 x 𝐻 x 𝑊
3 x 𝐻 x 𝑊
𝑰𝑖
𝑀 x 𝐶 x 𝐻 x 𝑊
Compute
specular component
using 𝑵, ℓ𝑖, 𝒗
𝑀 x 𝐶 x 𝐻 x 𝑊
𝑀 x 𝐶+1 x 𝐻 x 𝑊
384 x 𝐻 x 𝑊
𝑀 x 16 x 𝐻 x 𝑊
Surface
normal map
Synthesized
images
𝑓ir3:
3x3 Conv
BatchNorm
ReLU
+ 3x3 Conv
Observed
images
𝑰2
Concat
Batch
Rendering equation
𝑵
𝑹𝑖
𝑰
14
Loss function with early-stage weak supervision
Image reconstruction loss Least squares (LS) prior
𝐿 =
1
𝑀
𝑖=1
𝑀
𝑰𝑖 − 𝑰𝑖 1
+ 𝜆 𝑡 𝑵 − 𝑵′ 2
2
Minimize intensity differences btw
synthesized 𝑰𝑖 and observed 𝑰𝑖 images.
Constrain the output normals 𝑵
to be close to prior normals 𝑵′
obtained by the LS method.
Early-stage weak supervision
• LS prior 𝑵′ has low accuracy, so it is used only for an early-stage of
learning process (i.e., 𝜆 𝑡 ← 0 after some SGD iterations).
• It can stabilize learning of randomly initialized network parameters.
15
Test-time learning algorithm
Input: Pairs of an image and corresponding lighting (𝑰𝑖, ℓ𝑖) of a test scene.
Output: A surface normal map 𝑵 of a test scene.
• Run PSNet to produce a normal map 𝑵.
• Run IRNet to reconstruct all input images as 𝑰𝑖 .
• Compute the loss and update the network parameters.
• Terminate the prior (𝜆 𝑡 ← 0) if iterations > 50.
Until convergence (1000 iterations)
Without any pre-training, we directly fit the network to a given test scene.
Initialize network parameters randomly.
Compute LS solution 𝑵′.
Repeat Adam’s iterations
16
Talk Overview
• Introduction
• Basics of photometric stereo
• Our approach
• Experimental results
17
Benchmark on real-world scenes [Shi+ 18]
Outperformed deep learning based [Santo+ 17] and other classical methods
• Totally 10 scenes, each provides 96 images. Evaluated by mean angular errors (degrees).
• [Santo+ 17] is a supervised DNN method pre-trained on synthetic data.
Classicalphysics-based
18
Visual comparison
19
Convergence analysis with early-stage supervision
MeanangularerrorsLoss
Early-stage sup. No sup. All-stage sup.
 Stable & accurate  Unstable  Inaccurate
Terminating supervision
20
Convergence analysis with early-stage supervision
MeanangularerrorsLoss
Early-stage sup. No sup. All-stage sup.
 Stable & accurate  Unstable  Inaccurate
Terminating supervision
21
Summary
We demonstrated
• Physics-based unsupervised learning approach
to general BRDF photometric stereo.
• Use of physics can bypass the issue of lacking
annotated training data.
• SOTA results, outperforming a supervised
deep learning method and other classical
unsupervised methods.
Come to our poster for more details about
our network architecture and experiments.

More Related Content

What's hot

Histogram based Enhancement
Histogram based Enhancement Histogram based Enhancement
Histogram based Enhancement
Vivek V
 
05 histogram processing DIP
05 histogram processing DIP05 histogram processing DIP
05 histogram processing DIP
babak danyal
 
Bayesian restoration of high-dimensional photon-starved images
Bayesian restoration of high-dimensional photon-starved imagesBayesian restoration of high-dimensional photon-starved images
Bayesian restoration of high-dimensional photon-starved images
Julián Tachella
 
Non-Local Compressive Sampling Recovery
Non-Local Compressive Sampling RecoveryNon-Local Compressive Sampling Recovery
Non-Local Compressive Sampling Recovery
shuxianbiao
 
Image denoising with unknown Non-Periodic Noises
Image denoising with unknown Non-Periodic NoisesImage denoising with unknown Non-Periodic Noises
Image denoising with unknown Non-Periodic Noises
SakshiAggarwal85
 
Histogram Operation in Image Processing
Histogram Operation in Image ProcessingHistogram Operation in Image Processing
Histogram Operation in Image Processing
VARUN KUMAR
 
Super-Resolution for Imagery Enhancement Using Variational Quantum Eigensolver
Super-Resolution for Imagery Enhancement Using Variational Quantum EigensolverSuper-Resolution for Imagery Enhancement Using Variational Quantum Eigensolver
Super-Resolution for Imagery Enhancement Using Variational Quantum Eigensolver
Ystallonne Alves
 
Histogram Equalization
Histogram EqualizationHistogram Equalization
Histogram Equalization
Kalyan Acharjya
 
Flash Photography and toonification
Flash Photography and toonificationFlash Photography and toonification
Flash Photography and toonification
Satya Sahoo
 
Image Interpolation Techniques with Optical and Digital Zoom Concepts -semina...
Image Interpolation Techniques with Optical and Digital Zoom Concepts -semina...Image Interpolation Techniques with Optical and Digital Zoom Concepts -semina...
Image Interpolation Techniques with Optical and Digital Zoom Concepts -semina...
mmjalbiaty
 
Graph Regularised Hashing
Graph Regularised HashingGraph Regularised Hashing
Graph Regularised Hashing
Sean Moran
 
3 intensity transformations and spatial filtering slides
3 intensity transformations and spatial filtering slides3 intensity transformations and spatial filtering slides
3 intensity transformations and spatial filtering slides
BHAGYAPRASADBUGGE
 
Urban 3D Semantic Modelling Using Stereo Vision, ICRA 2013
Urban 3D Semantic Modelling Using Stereo Vision, ICRA 2013Urban 3D Semantic Modelling Using Stereo Vision, ICRA 2013
Urban 3D Semantic Modelling Using Stereo Vision, ICRA 2013
Sunando Sengupta
 
論文紹介"DynamicFusion: Reconstruction and Tracking of Non-­‐rigid Scenes in Real...
論文紹介"DynamicFusion: Reconstruction and Tracking of Non-­‐rigid Scenes in Real...論文紹介"DynamicFusion: Reconstruction and Tracking of Non-­‐rigid Scenes in Real...
論文紹介"DynamicFusion: Reconstruction and Tracking of Non-­‐rigid Scenes in Real...
Ken Sakurada
 
Remote Sensing Lec 10
Remote Sensing Lec 10Remote Sensing Lec 10
Remote Sensing Lec 10
polylsgiedx
 
Image Interpolation Techniques with Optical and Digital Zoom Concepts
Image Interpolation Techniques with Optical and Digital Zoom ConceptsImage Interpolation Techniques with Optical and Digital Zoom Concepts
Image Interpolation Techniques with Optical and Digital Zoom Concepts
mmjalbiaty
 
Image Restoration And Reconstruction
Image Restoration And ReconstructionImage Restoration And Reconstruction
Image Restoration And Reconstruction
Amnaakhaan
 
30th コンピュータビジョン勉強会@関東 DynamicFusion
30th コンピュータビジョン勉強会@関東 DynamicFusion30th コンピュータビジョン勉強会@関東 DynamicFusion
30th コンピュータビジョン勉強会@関東 DynamicFusion
Hiroki Mizuno
 
Orb feature by nitin
Orb feature by nitinOrb feature by nitin
Orb feature by nitin
NitinMauryaKashipur
 

What's hot (19)

Histogram based Enhancement
Histogram based Enhancement Histogram based Enhancement
Histogram based Enhancement
 
05 histogram processing DIP
05 histogram processing DIP05 histogram processing DIP
05 histogram processing DIP
 
Bayesian restoration of high-dimensional photon-starved images
Bayesian restoration of high-dimensional photon-starved imagesBayesian restoration of high-dimensional photon-starved images
Bayesian restoration of high-dimensional photon-starved images
 
Non-Local Compressive Sampling Recovery
Non-Local Compressive Sampling RecoveryNon-Local Compressive Sampling Recovery
Non-Local Compressive Sampling Recovery
 
Image denoising with unknown Non-Periodic Noises
Image denoising with unknown Non-Periodic NoisesImage denoising with unknown Non-Periodic Noises
Image denoising with unknown Non-Periodic Noises
 
Histogram Operation in Image Processing
Histogram Operation in Image ProcessingHistogram Operation in Image Processing
Histogram Operation in Image Processing
 
Super-Resolution for Imagery Enhancement Using Variational Quantum Eigensolver
Super-Resolution for Imagery Enhancement Using Variational Quantum EigensolverSuper-Resolution for Imagery Enhancement Using Variational Quantum Eigensolver
Super-Resolution for Imagery Enhancement Using Variational Quantum Eigensolver
 
Histogram Equalization
Histogram EqualizationHistogram Equalization
Histogram Equalization
 
Flash Photography and toonification
Flash Photography and toonificationFlash Photography and toonification
Flash Photography and toonification
 
Image Interpolation Techniques with Optical and Digital Zoom Concepts -semina...
Image Interpolation Techniques with Optical and Digital Zoom Concepts -semina...Image Interpolation Techniques with Optical and Digital Zoom Concepts -semina...
Image Interpolation Techniques with Optical and Digital Zoom Concepts -semina...
 
Graph Regularised Hashing
Graph Regularised HashingGraph Regularised Hashing
Graph Regularised Hashing
 
3 intensity transformations and spatial filtering slides
3 intensity transformations and spatial filtering slides3 intensity transformations and spatial filtering slides
3 intensity transformations and spatial filtering slides
 
Urban 3D Semantic Modelling Using Stereo Vision, ICRA 2013
Urban 3D Semantic Modelling Using Stereo Vision, ICRA 2013Urban 3D Semantic Modelling Using Stereo Vision, ICRA 2013
Urban 3D Semantic Modelling Using Stereo Vision, ICRA 2013
 
論文紹介"DynamicFusion: Reconstruction and Tracking of Non-­‐rigid Scenes in Real...
論文紹介"DynamicFusion: Reconstruction and Tracking of Non-­‐rigid Scenes in Real...論文紹介"DynamicFusion: Reconstruction and Tracking of Non-­‐rigid Scenes in Real...
論文紹介"DynamicFusion: Reconstruction and Tracking of Non-­‐rigid Scenes in Real...
 
Remote Sensing Lec 10
Remote Sensing Lec 10Remote Sensing Lec 10
Remote Sensing Lec 10
 
Image Interpolation Techniques with Optical and Digital Zoom Concepts
Image Interpolation Techniques with Optical and Digital Zoom ConceptsImage Interpolation Techniques with Optical and Digital Zoom Concepts
Image Interpolation Techniques with Optical and Digital Zoom Concepts
 
Image Restoration And Reconstruction
Image Restoration And ReconstructionImage Restoration And Reconstruction
Image Restoration And Reconstruction
 
30th コンピュータビジョン勉強会@関東 DynamicFusion
30th コンピュータビジョン勉強会@関東 DynamicFusion30th コンピュータビジョン勉強会@関東 DynamicFusion
30th コンピュータビジョン勉強会@関東 DynamicFusion
 
Orb feature by nitin
Orb feature by nitinOrb feature by nitin
Orb feature by nitin
 

Similar to Neural Inverse Rendering for General Reflectance Photometric Stereo (ICML 2018)

Super resolution in deep learning era - Jaejun Yoo
Super resolution in deep learning era - Jaejun YooSuper resolution in deep learning era - Jaejun Yoo
Super resolution in deep learning era - Jaejun Yoo
JaeJun Yoo
 
20230213_ComputerVision_연구.pptx
20230213_ComputerVision_연구.pptx20230213_ComputerVision_연구.pptx
20230213_ComputerVision_연구.pptx
ssuser7807522
 
Towards Accurate Multi-person Pose Estimation in the Wild (My summery)
Towards Accurate Multi-person Pose Estimation in the Wild (My summery)Towards Accurate Multi-person Pose Estimation in the Wild (My summery)
Towards Accurate Multi-person Pose Estimation in the Wild (My summery)
Abdulrahman Kerim
 
DNN and RBM
DNN and RBMDNN and RBM
DNN and RBM
Masayuki Tanaka
 
Final Poster
Final PosterFinal Poster
Final Poster
Elizabeth Koshelev
 
Study on Data Augmentation Methods for Sonar Image Analysis
Study on Data Augmentation Methods for Sonar Image AnalysisStudy on Data Augmentation Methods for Sonar Image Analysis
Study on Data Augmentation Methods for Sonar Image Analysis
harmonylab
 
Neural Radiance Fields & Neural Rendering.pdf
Neural Radiance Fields & Neural Rendering.pdfNeural Radiance Fields & Neural Rendering.pdf
Neural Radiance Fields & Neural Rendering.pdf
NavneetPaul2
 
Paper review: Measuring the Intrinsic Dimension of Objective Landscapes.
Paper review: Measuring the Intrinsic Dimension of Objective Landscapes.Paper review: Measuring the Intrinsic Dimension of Objective Landscapes.
Paper review: Measuring the Intrinsic Dimension of Objective Landscapes.
Wuhyun Rico Shin
 
Introduction to Image Processing
Introduction to Image ProcessingIntroduction to Image Processing
Introduction to Image Processing
Israel Gbati
 
NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis
NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis
NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis
taeseon ryu
 
SPICE-MATEX @ DAC15
SPICE-MATEX @ DAC15SPICE-MATEX @ DAC15
SPICE-MATEX @ DAC15
Hao Zhuang
 
Nonlinear dimension reduction
Nonlinear dimension reductionNonlinear dimension reduction
Nonlinear dimension reduction
Yan Xu
 
Paper Introduction "Density-aware person detection and tracking in crowds"
Paper Introduction "Density-aware person detection and tracking in crowds"Paper Introduction "Density-aware person detection and tracking in crowds"
Paper Introduction "Density-aware person detection and tracking in crowds"
壮 八幡
 
3D Reconstruction from Multiple uncalibrated 2D Images of an Object
3D Reconstruction from Multiple uncalibrated 2D Images of an Object3D Reconstruction from Multiple uncalibrated 2D Images of an Object
3D Reconstruction from Multiple uncalibrated 2D Images of an Object
Ankur Tyagi
 
Tutorial Equivariance in Imaging ICMS 23.pptx
Tutorial Equivariance in Imaging ICMS 23.pptxTutorial Equivariance in Imaging ICMS 23.pptx
Tutorial Equivariance in Imaging ICMS 23.pptx
Julián Tachella
 
TransNeRF
TransNeRFTransNeRF
TransNeRF
NavneetPaul2
 
Theories and Engineering Technics of 2D-to-3D Back-Projection Problem
Theories and Engineering Technics of 2D-to-3D Back-Projection ProblemTheories and Engineering Technics of 2D-to-3D Back-Projection Problem
Theories and Engineering Technics of 2D-to-3D Back-Projection Problem
Seongcheol Baek
 
SPIE 10059-36(Reheman Baikejiang)
SPIE 10059-36(Reheman Baikejiang)SPIE 10059-36(Reheman Baikejiang)
SPIE 10059-36(Reheman Baikejiang)
Reheman Baikejiang
 
Manifold Blurring Mean Shift algorithms for manifold denoising, presentation,...
Manifold Blurring Mean Shift algorithms for manifold denoising, presentation,...Manifold Blurring Mean Shift algorithms for manifold denoising, presentation,...
Manifold Blurring Mean Shift algorithms for manifold denoising, presentation,...
Florent Renucci
 
All projects
All projectsAll projects
All projects
Karishma Jain
 

Similar to Neural Inverse Rendering for General Reflectance Photometric Stereo (ICML 2018) (20)

Super resolution in deep learning era - Jaejun Yoo
Super resolution in deep learning era - Jaejun YooSuper resolution in deep learning era - Jaejun Yoo
Super resolution in deep learning era - Jaejun Yoo
 
20230213_ComputerVision_연구.pptx
20230213_ComputerVision_연구.pptx20230213_ComputerVision_연구.pptx
20230213_ComputerVision_연구.pptx
 
Towards Accurate Multi-person Pose Estimation in the Wild (My summery)
Towards Accurate Multi-person Pose Estimation in the Wild (My summery)Towards Accurate Multi-person Pose Estimation in the Wild (My summery)
Towards Accurate Multi-person Pose Estimation in the Wild (My summery)
 
DNN and RBM
DNN and RBMDNN and RBM
DNN and RBM
 
Final Poster
Final PosterFinal Poster
Final Poster
 
Study on Data Augmentation Methods for Sonar Image Analysis
Study on Data Augmentation Methods for Sonar Image AnalysisStudy on Data Augmentation Methods for Sonar Image Analysis
Study on Data Augmentation Methods for Sonar Image Analysis
 
Neural Radiance Fields & Neural Rendering.pdf
Neural Radiance Fields & Neural Rendering.pdfNeural Radiance Fields & Neural Rendering.pdf
Neural Radiance Fields & Neural Rendering.pdf
 
Paper review: Measuring the Intrinsic Dimension of Objective Landscapes.
Paper review: Measuring the Intrinsic Dimension of Objective Landscapes.Paper review: Measuring the Intrinsic Dimension of Objective Landscapes.
Paper review: Measuring the Intrinsic Dimension of Objective Landscapes.
 
Introduction to Image Processing
Introduction to Image ProcessingIntroduction to Image Processing
Introduction to Image Processing
 
NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis
NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis
NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis
 
SPICE-MATEX @ DAC15
SPICE-MATEX @ DAC15SPICE-MATEX @ DAC15
SPICE-MATEX @ DAC15
 
Nonlinear dimension reduction
Nonlinear dimension reductionNonlinear dimension reduction
Nonlinear dimension reduction
 
Paper Introduction "Density-aware person detection and tracking in crowds"
Paper Introduction "Density-aware person detection and tracking in crowds"Paper Introduction "Density-aware person detection and tracking in crowds"
Paper Introduction "Density-aware person detection and tracking in crowds"
 
3D Reconstruction from Multiple uncalibrated 2D Images of an Object
3D Reconstruction from Multiple uncalibrated 2D Images of an Object3D Reconstruction from Multiple uncalibrated 2D Images of an Object
3D Reconstruction from Multiple uncalibrated 2D Images of an Object
 
Tutorial Equivariance in Imaging ICMS 23.pptx
Tutorial Equivariance in Imaging ICMS 23.pptxTutorial Equivariance in Imaging ICMS 23.pptx
Tutorial Equivariance in Imaging ICMS 23.pptx
 
TransNeRF
TransNeRFTransNeRF
TransNeRF
 
Theories and Engineering Technics of 2D-to-3D Back-Projection Problem
Theories and Engineering Technics of 2D-to-3D Back-Projection ProblemTheories and Engineering Technics of 2D-to-3D Back-Projection Problem
Theories and Engineering Technics of 2D-to-3D Back-Projection Problem
 
SPIE 10059-36(Reheman Baikejiang)
SPIE 10059-36(Reheman Baikejiang)SPIE 10059-36(Reheman Baikejiang)
SPIE 10059-36(Reheman Baikejiang)
 
Manifold Blurring Mean Shift algorithms for manifold denoising, presentation,...
Manifold Blurring Mean Shift algorithms for manifold denoising, presentation,...Manifold Blurring Mean Shift algorithms for manifold denoising, presentation,...
Manifold Blurring Mean Shift algorithms for manifold denoising, presentation,...
 
All projects
All projectsAll projects
All projects
 

Recently uploaded

Deep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless ReproducibilityDeep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless Reproducibility
University of Rennes, INSA Rennes, Inria/IRISA, CNRS
 
Topic: SICKLE CELL DISEASE IN CHILDREN-3.pdf
Topic: SICKLE CELL DISEASE IN CHILDREN-3.pdfTopic: SICKLE CELL DISEASE IN CHILDREN-3.pdf
Topic: SICKLE CELL DISEASE IN CHILDREN-3.pdf
TinyAnderson
 
waterlessdyeingtechnolgyusing carbon dioxide chemicalspdf
waterlessdyeingtechnolgyusing carbon dioxide chemicalspdfwaterlessdyeingtechnolgyusing carbon dioxide chemicalspdf
waterlessdyeingtechnolgyusing carbon dioxide chemicalspdf
LengamoLAppostilic
 
Unlocking the mysteries of reproduction: Exploring fecundity and gonadosomati...
Unlocking the mysteries of reproduction: Exploring fecundity and gonadosomati...Unlocking the mysteries of reproduction: Exploring fecundity and gonadosomati...
Unlocking the mysteries of reproduction: Exploring fecundity and gonadosomati...
AbdullaAlAsif1
 
Eukaryotic Transcription Presentation.pptx
Eukaryotic Transcription Presentation.pptxEukaryotic Transcription Presentation.pptx
Eukaryotic Transcription Presentation.pptx
RitabrataSarkar3
 
3D Hybrid PIC simulation of the plasma expansion (ISSS-14)
3D Hybrid PIC simulation of the plasma expansion (ISSS-14)3D Hybrid PIC simulation of the plasma expansion (ISSS-14)
3D Hybrid PIC simulation of the plasma expansion (ISSS-14)
David Osipyan
 
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
yqqaatn0
 
Authoring a personal GPT for your research and practice: How we created the Q...
Authoring a personal GPT for your research and practice: How we created the Q...Authoring a personal GPT for your research and practice: How we created the Q...
Authoring a personal GPT for your research and practice: How we created the Q...
Leonel Morgado
 
mô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốt
mô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốtmô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốt
mô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốt
HongcNguyn6
 
THEMATIC APPERCEPTION TEST(TAT) cognitive abilities, creativity, and critic...
THEMATIC  APPERCEPTION  TEST(TAT) cognitive abilities, creativity, and critic...THEMATIC  APPERCEPTION  TEST(TAT) cognitive abilities, creativity, and critic...
THEMATIC APPERCEPTION TEST(TAT) cognitive abilities, creativity, and critic...
Abdul Wali Khan University Mardan,kP,Pakistan
 
Shallowest Oil Discovery of Turkiye.pptx
Shallowest Oil Discovery of Turkiye.pptxShallowest Oil Discovery of Turkiye.pptx
Shallowest Oil Discovery of Turkiye.pptx
Gokturk Mehmet Dilci
 
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
Travis Hills MN
 
20240520 Planning a Circuit Simulator in JavaScript.pptx
20240520 Planning a Circuit Simulator in JavaScript.pptx20240520 Planning a Circuit Simulator in JavaScript.pptx
20240520 Planning a Circuit Simulator in JavaScript.pptx
Sharon Liu
 
在线办理(salfor毕业证书)索尔福德大学毕业证毕业完成信一模一样
在线办理(salfor毕业证书)索尔福德大学毕业证毕业完成信一模一样在线办理(salfor毕业证书)索尔福德大学毕业证毕业完成信一模一样
在线办理(salfor毕业证书)索尔福德大学毕业证毕业完成信一模一样
vluwdy49
 
SAR of Medicinal Chemistry 1st by dk.pdf
SAR of Medicinal Chemistry 1st by dk.pdfSAR of Medicinal Chemistry 1st by dk.pdf
SAR of Medicinal Chemistry 1st by dk.pdf
KrushnaDarade1
 
Randomised Optimisation Algorithms in DAPHNE
Randomised Optimisation Algorithms in DAPHNERandomised Optimisation Algorithms in DAPHNE
Randomised Optimisation Algorithms in DAPHNE
University of Maribor
 
Equivariant neural networks and representation theory
Equivariant neural networks and representation theoryEquivariant neural networks and representation theory
Equivariant neural networks and representation theory
Daniel Tubbenhauer
 
ESR spectroscopy in liquid food and beverages.pptx
ESR spectroscopy in liquid food and beverages.pptxESR spectroscopy in liquid food and beverages.pptx
ESR spectroscopy in liquid food and beverages.pptx
PRIYANKA PATEL
 
Thornton ESPP slides UK WW Network 4_6_24.pdf
Thornton ESPP slides UK WW Network 4_6_24.pdfThornton ESPP slides UK WW Network 4_6_24.pdf
Thornton ESPP slides UK WW Network 4_6_24.pdf
European Sustainable Phosphorus Platform
 
Cytokines and their role in immune regulation.pptx
Cytokines and their role in immune regulation.pptxCytokines and their role in immune regulation.pptx
Cytokines and their role in immune regulation.pptx
Hitesh Sikarwar
 

Recently uploaded (20)

Deep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless ReproducibilityDeep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless Reproducibility
 
Topic: SICKLE CELL DISEASE IN CHILDREN-3.pdf
Topic: SICKLE CELL DISEASE IN CHILDREN-3.pdfTopic: SICKLE CELL DISEASE IN CHILDREN-3.pdf
Topic: SICKLE CELL DISEASE IN CHILDREN-3.pdf
 
waterlessdyeingtechnolgyusing carbon dioxide chemicalspdf
waterlessdyeingtechnolgyusing carbon dioxide chemicalspdfwaterlessdyeingtechnolgyusing carbon dioxide chemicalspdf
waterlessdyeingtechnolgyusing carbon dioxide chemicalspdf
 
Unlocking the mysteries of reproduction: Exploring fecundity and gonadosomati...
Unlocking the mysteries of reproduction: Exploring fecundity and gonadosomati...Unlocking the mysteries of reproduction: Exploring fecundity and gonadosomati...
Unlocking the mysteries of reproduction: Exploring fecundity and gonadosomati...
 
Eukaryotic Transcription Presentation.pptx
Eukaryotic Transcription Presentation.pptxEukaryotic Transcription Presentation.pptx
Eukaryotic Transcription Presentation.pptx
 
3D Hybrid PIC simulation of the plasma expansion (ISSS-14)
3D Hybrid PIC simulation of the plasma expansion (ISSS-14)3D Hybrid PIC simulation of the plasma expansion (ISSS-14)
3D Hybrid PIC simulation of the plasma expansion (ISSS-14)
 
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
 
Authoring a personal GPT for your research and practice: How we created the Q...
Authoring a personal GPT for your research and practice: How we created the Q...Authoring a personal GPT for your research and practice: How we created the Q...
Authoring a personal GPT for your research and practice: How we created the Q...
 
mô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốt
mô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốtmô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốt
mô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốt
 
THEMATIC APPERCEPTION TEST(TAT) cognitive abilities, creativity, and critic...
THEMATIC  APPERCEPTION  TEST(TAT) cognitive abilities, creativity, and critic...THEMATIC  APPERCEPTION  TEST(TAT) cognitive abilities, creativity, and critic...
THEMATIC APPERCEPTION TEST(TAT) cognitive abilities, creativity, and critic...
 
Shallowest Oil Discovery of Turkiye.pptx
Shallowest Oil Discovery of Turkiye.pptxShallowest Oil Discovery of Turkiye.pptx
Shallowest Oil Discovery of Turkiye.pptx
 
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
 
20240520 Planning a Circuit Simulator in JavaScript.pptx
20240520 Planning a Circuit Simulator in JavaScript.pptx20240520 Planning a Circuit Simulator in JavaScript.pptx
20240520 Planning a Circuit Simulator in JavaScript.pptx
 
在线办理(salfor毕业证书)索尔福德大学毕业证毕业完成信一模一样
在线办理(salfor毕业证书)索尔福德大学毕业证毕业完成信一模一样在线办理(salfor毕业证书)索尔福德大学毕业证毕业完成信一模一样
在线办理(salfor毕业证书)索尔福德大学毕业证毕业完成信一模一样
 
SAR of Medicinal Chemistry 1st by dk.pdf
SAR of Medicinal Chemistry 1st by dk.pdfSAR of Medicinal Chemistry 1st by dk.pdf
SAR of Medicinal Chemistry 1st by dk.pdf
 
Randomised Optimisation Algorithms in DAPHNE
Randomised Optimisation Algorithms in DAPHNERandomised Optimisation Algorithms in DAPHNE
Randomised Optimisation Algorithms in DAPHNE
 
Equivariant neural networks and representation theory
Equivariant neural networks and representation theoryEquivariant neural networks and representation theory
Equivariant neural networks and representation theory
 
ESR spectroscopy in liquid food and beverages.pptx
ESR spectroscopy in liquid food and beverages.pptxESR spectroscopy in liquid food and beverages.pptx
ESR spectroscopy in liquid food and beverages.pptx
 
Thornton ESPP slides UK WW Network 4_6_24.pdf
Thornton ESPP slides UK WW Network 4_6_24.pdfThornton ESPP slides UK WW Network 4_6_24.pdf
Thornton ESPP slides UK WW Network 4_6_24.pdf
 
Cytokines and their role in immune regulation.pptx
Cytokines and their role in immune regulation.pptxCytokines and their role in immune regulation.pptx
Cytokines and their role in immune regulation.pptx
 

Neural Inverse Rendering for General Reflectance Photometric Stereo (ICML 2018)

  • 1. 1 Neural Inverse Rendering for General Reflectance Photometric Stereo Short oral presentation ICML 2018 July 11, 2018 Tatsunori Taniai RIKEN AIP Takanori Maehara RIKEN AIP ICML 2018 Paper
  • 2. 2 Photometric stereo: shape from varying shading [Woodham, 80] Scene observations under varying illuminations 3D surface normals (surface orientations) PS is an essential technique for highly detailed 3D shape recovery in combination with multiview stereo (MVS) MVS only [Park+ 13] MVS + PS
  • 3. 3 Photometric stereo: shape from varying shading [Woodham, 80] Challenges • Real-world objects have various complex reflectance properties (BRDFs) → Use of deep learning to model various BRDFs seems promising but it is actually very inactive because… Scene observations under varying illuminations 3D surface normals (surface orientations) • Not much training data. Accurately measuring surface normals is difficult.
  • 4. 4 ML perspective: physics-based unsupervised learning Observed data Hidden dataEstimator 𝑿 𝒀 Synthesized data 𝑿′ 𝒁 Physical generative model 𝑿 = 𝑓(𝒀, 𝒁) • Not directly observable or annotatable. • No ground truth for training data. Use physics to bypass the issue of lacking training data. Disentangled representation Reconstruction loss 𝑿 − 𝑿′ 𝑾
  • 5. 5 Talk Overview • Introduction • Basics of photometric stereo • Our approach • Experimental results
  • 6. 6 Photometric stereo as inverse imaging process 𝒗𝒏 ℓ Point light source Object surface Camera 𝐼: Image intensity (known) ℓ: Light direction & intensity (known) 𝒗: View direction (known) 𝒏: Surface normal (unknown) 𝜌: BRDF (unknown) 𝜌
  • 7. 7 Photometric stereo as inverse imaging process 𝒗𝒏 ℓ Point light source Object surface Camera ⊙= max(0, ℓ 𝑇 𝒏)𝐼 = ⊙ 𝜌( 𝒏, ℓ, 𝒗) Observed pixel Shading Reflectance (BRDF) Reflectance (rendering) equation 𝐼: Image intensity (known) ℓ: Light direction & intensity (known) 𝒗: View direction (known) 𝒏: Surface normal (unknown) 𝜌: BRDF (unknown) Estimate 𝒏 from intensities when changing illuminations ℓ 𝜌 × × ×
  • 8. 8 Lest squares solution for diffuse surfaces [Woodham, 80] 𝒏 ℓ Point light source Object surface 𝜌0 A closed-form solution exists if 𝝆 is constant (uniform distribution)
  • 9. 9 Lest squares solution for diffuse surfaces [Woodham, 80] 𝒏 ℓ Point light source Object surface A closed-form solution exists if 𝝆 is constant (uniform distribution) 𝜌0 Lambertian diffuse model 𝐼 = 𝜌0 max(0, ℓ 𝑇 𝒏) 𝐼1 = 𝜌0ℓ1 𝑇 𝒏 𝐼2 = 𝜌0ℓ2 𝑇 𝒏 𝐼 𝑀 = 𝜌0ℓ 𝑀 𝑇 𝒏⋯ Multiple observations by varying illuminations 𝑰 = 𝑳 𝑇(𝜌0 𝒏) Linear system for a set of bright pixels = 𝜌0ℓ 𝑇 𝒏 (for 𝐼 > 0)
  • 10. 10 Our goal: general reflectance photometric stereo Can we determine 𝒏 from intensities when • 𝝆 is unknown and spatially-varying • no training data with ground truth of 𝒏 and 𝝆 Multiple intensity observations under known illumination patterns 𝐼1 = max 0, ℓ1 𝑇 𝒏 ⊙ 𝜌( 𝒏, ℓ1, 𝒗) ⋯ 𝐼2 = max 0, ℓ2 𝑇 𝒏 ⊙ 𝜌( 𝒏, ℓ2, 𝒗) 𝐼 𝑀 = max 0, ℓ 𝑀 𝑇 𝒏 ⊙ 𝜌( 𝒏, ℓ 𝑀, 𝒗) ℓ 𝜌 Surfaces with unknown and spatially-varying BRDFs
  • 11. 11 Talk Overview • Introduction • Basics of photometric stereo • Our approach – Physics-embedded auto-encoder network – Reconstruction loss – Test-time learning algorithm • Experimental results
  • 12. 12 Our physics-embedded auto-encoder network (simplified)… 𝚽 𝒀𝑖𝑿𝑖 𝑵 … … … … 𝑰1 𝑰𝑖 𝑰 𝑀 𝒁𝑖 Photometric stereo network (PSNet) Image reconstruction network (IRNet) 𝑀𝐶 x 𝐻 x 𝑊 3 x 𝐻 x 𝑊 𝑰𝑖 𝑀 x 𝐶 x 𝐻 x 𝑊 𝑀 x 𝐶 x 𝐻 x 𝑊 𝑀 x 𝐶 x 𝐻 x 𝑊 384 x 𝐻 x 𝑊 𝑀 x 16 x 𝐻 x 𝑊 Surface normal map Synthesized images Observed images 𝑰2 Concat Batch Rendering equation 𝑵 𝑹𝑖 𝑰 Reflectance Two-streams network to 1) produce a normal map and 2) re-render images analyzes all observations to produce a single normal map processes each observation individually to disentangle and reconstruct an image
  • 13. 13 Physics-embedded auto-encoder network (full)… 𝑺𝑖 𝚽 𝒀𝑖𝑿𝑖 𝑵 𝑓ps1: 3x3 Conv BatchNorm ReLU x 3 𝑓ps2: 3x3 Conv 𝐿2 Norm 𝑓ir1: 3x3 Conv BatchNorm ReLU x 3 𝑓ir2: 1x1 Conv BatchNorm ReLU … … … … 𝑰1 𝑰𝑖 𝑰 𝑀 𝒁𝑖 Photometric stereo network (PSNet) Image reconstruction network (IRNet) 𝑀𝐶 x 𝐻 x 𝑊 3 x 𝐻 x 𝑊 𝑰𝑖 𝑀 x 𝐶 x 𝐻 x 𝑊 Compute specular component using 𝑵, ℓ𝑖, 𝒗 𝑀 x 𝐶 x 𝐻 x 𝑊 𝑀 x 𝐶+1 x 𝐻 x 𝑊 384 x 𝐻 x 𝑊 𝑀 x 16 x 𝐻 x 𝑊 Surface normal map Synthesized images 𝑓ir3: 3x3 Conv BatchNorm ReLU + 3x3 Conv Observed images 𝑰2 Concat Batch Rendering equation 𝑵 𝑹𝑖 𝑰
  • 14. 14 Loss function with early-stage weak supervision Image reconstruction loss Least squares (LS) prior 𝐿 = 1 𝑀 𝑖=1 𝑀 𝑰𝑖 − 𝑰𝑖 1 + 𝜆 𝑡 𝑵 − 𝑵′ 2 2 Minimize intensity differences btw synthesized 𝑰𝑖 and observed 𝑰𝑖 images. Constrain the output normals 𝑵 to be close to prior normals 𝑵′ obtained by the LS method. Early-stage weak supervision • LS prior 𝑵′ has low accuracy, so it is used only for an early-stage of learning process (i.e., 𝜆 𝑡 ← 0 after some SGD iterations). • It can stabilize learning of randomly initialized network parameters.
  • 15. 15 Test-time learning algorithm Input: Pairs of an image and corresponding lighting (𝑰𝑖, ℓ𝑖) of a test scene. Output: A surface normal map 𝑵 of a test scene. • Run PSNet to produce a normal map 𝑵. • Run IRNet to reconstruct all input images as 𝑰𝑖 . • Compute the loss and update the network parameters. • Terminate the prior (𝜆 𝑡 ← 0) if iterations > 50. Until convergence (1000 iterations) Without any pre-training, we directly fit the network to a given test scene. Initialize network parameters randomly. Compute LS solution 𝑵′. Repeat Adam’s iterations
  • 16. 16 Talk Overview • Introduction • Basics of photometric stereo • Our approach • Experimental results
  • 17. 17 Benchmark on real-world scenes [Shi+ 18] Outperformed deep learning based [Santo+ 17] and other classical methods • Totally 10 scenes, each provides 96 images. Evaluated by mean angular errors (degrees). • [Santo+ 17] is a supervised DNN method pre-trained on synthetic data. Classicalphysics-based
  • 19. 19 Convergence analysis with early-stage supervision MeanangularerrorsLoss Early-stage sup. No sup. All-stage sup.  Stable & accurate  Unstable  Inaccurate Terminating supervision
  • 20. 20 Convergence analysis with early-stage supervision MeanangularerrorsLoss Early-stage sup. No sup. All-stage sup.  Stable & accurate  Unstable  Inaccurate Terminating supervision
  • 21. 21 Summary We demonstrated • Physics-based unsupervised learning approach to general BRDF photometric stereo. • Use of physics can bypass the issue of lacking annotated training data. • SOTA results, outperforming a supervised deep learning method and other classical unsupervised methods. Come to our poster for more details about our network architecture and experiments.

Editor's Notes

  1. 1, 15, 60, 80
  2. 1, 15, 60, 80