SlideShare a Scribd company logo
Recovering the 3D shape of an
object from 2D images
By Rishitha Bandi(015237521)
Introduction
● Natural images are projection of 3D object on a 2D image plane.
● The knowledge gained from the 2D images is used to recover the 3D shape of the
object. The methodologies of recovering 3D shape varies with the source of 2D
images.
Existing Methods of GANs
● Generative adversarial networks can model the 2D natural image manifold of diverse
object categories with high fidelity.
● The phenomenon of GAN changing viewpoint of object in generated images
motivates the possibility of reconstructing the 3D shape from single 2D image
● However, due to either heavy memory consumption or increased difficulty in
training, there are still gaps between their image qualities and those of 2D GANs.
● In unsupervised learning of 3D shapes, it doesn’t need manual annotations and lacks
multiple viewpoints and lightning instances. To tackle it they adopted the shapes from
the external 3D shapes or 2D images which learns viewpoint and shape via
autoencoders that explicitly model the rendering process.
Methodologies for Recovering 3D Shape
● The methodology of using GAN to learn 3D shapes from images which rely on
explicitly modelling 3D representation and rendering during training.
● Another methodology of learning 3D shapes from the images learns the viewpoint
and shapes of images in ‘analysis by synthesis’ manner.
Proposed model
● On the image generated by GAN an ellipsoid is employed and as its first 3D object
shape and render several unnatural images called pseudo samples.
● Pseudo samples are randomly sampled viewpoints and lighting conditions.
● By reconstructing pseudo samples using the GAN, those could guide the original
image towards the sampled viewpoints and lighting conditions in the GAN manifold,
producing a number of natural-looking images, called projected samples.
Continue..
● We present the first attempt to reconstruct the 3D object shapes using GANs that are
pre-trained on 2D images only. Our work shows that 2D GANs inherently capture
rich 3D knowledge for different object categories and provides a new perspective for
3D shape generation.
● Our work also provides an alternative unsupervised 3D shape learning method and
does not rely on the symmetry assumption of object shapes.
● We achieve highly photo-realistic 3D-aware image manipulations including rotation
and relighting without using any external 3D models.
Step 1: Using a Weak Shape Prior
● With single image instance to explore images with many viewpoints and lighting to
the GAN image manifold could resort to ellipsoid as weak prior to create pseudo
samples.
● The viewpoint v and lighting l are initialized with a canonical setting, i.e., v0 = 0 and
lighting is from the front.
● This allows us to have an initial guess about the four factors d0(depth), a0(albedo),
v0(viewpoint),l0(lighting), with d0 being the weak shape prior.
Step 2: Sampling and Projecting to the GAN Image Manifold.
● Pseudo samples have unnatural distortions and shadows, they provide cues on how
the face rotates and how the light changes.
● In order to leverage such cues to guide novel viewpoint and lighting direction
exploration in the GAN image manifold, we perform GAN inversion to these pseudo
samples, i.e., reconstruct them with the GAN generator
Step 3: Learning the 3D Shape
● The viewpoint network, lighting network , the depth network and albedo network are
all together optimised to learn the 3D shape.
● The reconstruction loss is minimised on when all the predictions are inferred.
● The unnatural distortions and shadows that exist in the pseudo samples are mitigated
in the reconstructions.
Conclusion
We have presented the first method that directly leverages off-the-shelf 2D GANs to
recover 3D object shapes from images. We found that existing 2D GANs inherently
capture sufficient knowledge to recover 3D shapes for many object categories, including
human faces, cats, cars, and buildings. Based on a weak convex shape prior, our method
could explore the viewpoint and lighting variations in the GAN image manifold, and
exploit these variations to refine the underlying object shape in an iterative manner.
References
https://arxiv.org/pdf/2011.00844v4.pdf
THANK YOU!!

More Related Content

What's hot

Content-Based Image Retrieval Features: A Survey
Content-Based Image Retrieval Features: A SurveyContent-Based Image Retrieval Features: A Survey
Content-Based Image Retrieval Features: A Survey
Eswar Publications
 
Image search engine
Image search engineImage search engine
Image search engine
Avanish Singh
 
COMPARISON OF RENDERING PROCESSES ON 3D MODEL
COMPARISON OF RENDERING PROCESSES ON 3D MODELCOMPARISON OF RENDERING PROCESSES ON 3D MODEL
COMPARISON OF RENDERING PROCESSES ON 3D MODEL
ijcsit
 
Pc Seminar Jordi
Pc Seminar JordiPc Seminar Jordi
Pc Seminar Jordi
Universitat de Barcelona
 
Object recognition
Object recognitionObject recognition
Object recognition
saniacorreya
 
Ed34785790
Ed34785790Ed34785790
Ed34785790
IJERA Editor
 
Image forgery and security
Image forgery and securityImage forgery and security
Image forgery and security
أحلام انصارى
 
final_report
final_reportfinal_report
final_report
Chiraz Nafouki
 

What's hot (8)

Content-Based Image Retrieval Features: A Survey
Content-Based Image Retrieval Features: A SurveyContent-Based Image Retrieval Features: A Survey
Content-Based Image Retrieval Features: A Survey
 
Image search engine
Image search engineImage search engine
Image search engine
 
COMPARISON OF RENDERING PROCESSES ON 3D MODEL
COMPARISON OF RENDERING PROCESSES ON 3D MODELCOMPARISON OF RENDERING PROCESSES ON 3D MODEL
COMPARISON OF RENDERING PROCESSES ON 3D MODEL
 
Pc Seminar Jordi
Pc Seminar JordiPc Seminar Jordi
Pc Seminar Jordi
 
Object recognition
Object recognitionObject recognition
Object recognition
 
Ed34785790
Ed34785790Ed34785790
Ed34785790
 
Image forgery and security
Image forgery and securityImage forgery and security
Image forgery and security
 
final_report
final_reportfinal_report
final_report
 

Similar to 297 short story

[DL輪読会]ClearGrasp
[DL輪読会]ClearGrasp[DL輪読会]ClearGrasp
[DL輪読会]ClearGrasp
Deep Learning JP
 
Shadow removal using Image Processing (Case study and code Implementation)
Shadow removal using Image Processing (Case study and code Implementation)Shadow removal using Image Processing (Case study and code Implementation)
Shadow removal using Image Processing (Case study and code Implementation)
Ardra
 
Deep Generative Models - Kevin McGuinness - UPC Barcelona 2018
Deep Generative Models - Kevin McGuinness - UPC Barcelona 2018Deep Generative Models - Kevin McGuinness - UPC Barcelona 2018
Deep Generative Models - Kevin McGuinness - UPC Barcelona 2018
Universitat Politècnica de Catalunya
 
Pratik ibm-open power-ppt
Pratik ibm-open power-pptPratik ibm-open power-ppt
Pratik ibm-open power-ppt
Vaibhav R
 
Lm342080283
Lm342080283Lm342080283
Lm342080283
IJERA Editor
 
Introduction to 3D Computer Vision and Differentiable Rendering
Introduction to 3D Computer Vision and Differentiable RenderingIntroduction to 3D Computer Vision and Differentiable Rendering
Introduction to 3D Computer Vision and Differentiable Rendering
Preferred Networks
 
Project_Final_Review.pdf
Project_Final_Review.pdfProject_Final_Review.pdf
Project_Final_Review.pdf
DivyaGugulothu
 
Seeing what a gan cannot generate: paper review
Seeing what a gan cannot generate: paper reviewSeeing what a gan cannot generate: paper review
Seeing what a gan cannot generate: paper review
QuantUniversity
 
Pose estimation from RGB images by deep learning
Pose estimation from RGB images by deep learningPose estimation from RGB images by deep learning
Pose estimation from RGB images by deep learning
Yu Huang
 
Decomposing image generation into layout priction and conditional synthesis
Decomposing image generation into layout priction and conditional synthesisDecomposing image generation into layout priction and conditional synthesis
Decomposing image generation into layout priction and conditional synthesis
Naeem Shehzad
 
3D Modeling Techniques : Types and Specific Applications
3D Modeling Techniques : Types and Specific Applications3D Modeling Techniques : Types and Specific Applications
3D Modeling Techniques : Types and Specific Applications
ThePro3DStudio
 
IJCER (www.ijceronline.com) International Journal of computational Engineeri...
 IJCER (www.ijceronline.com) International Journal of computational Engineeri... IJCER (www.ijceronline.com) International Journal of computational Engineeri...
IJCER (www.ijceronline.com) International Journal of computational Engineeri...
ijceronline
 
VIBE: Video Inference for Human Body Pose and Shape Estimation
VIBE: Video Inference for Human Body Pose and Shape EstimationVIBE: Video Inference for Human Body Pose and Shape Estimation
VIBE: Video Inference for Human Body Pose and Shape Estimation
Arithmer Inc.
 
3-d interpretation from single 2-d image III
3-d interpretation from single 2-d image III3-d interpretation from single 2-d image III
3-d interpretation from single 2-d image III
Yu Huang
 
Image colorization
Image colorizationImage colorization
Image colorization
Yash Saraf
 
Image colorization
Image colorizationImage colorization
Image colorization
Yash Saraf
 
sduGroupEvent
sduGroupEventsduGroupEvent
sduGroupEvent
Thomas Sølund
 
[IJET-V1I6P16] Authors : Indraja Mali , Saumya Saxena ,Padmaja Desai , Ajay G...
[IJET-V1I6P16] Authors : Indraja Mali , Saumya Saxena ,Padmaja Desai , Ajay G...[IJET-V1I6P16] Authors : Indraja Mali , Saumya Saxena ,Padmaja Desai , Ajay G...
[IJET-V1I6P16] Authors : Indraja Mali , Saumya Saxena ,Padmaja Desai , Ajay G...
IJET - International Journal of Engineering and Techniques
 
On constructing z dimensional Image By DIBR Synthesized Images
On constructing z dimensional Image By DIBR Synthesized ImagesOn constructing z dimensional Image By DIBR Synthesized Images
On constructing z dimensional Image By DIBR Synthesized Images
Jayakrishnan U
 
ei2106-submit-opt-415
ei2106-submit-opt-415ei2106-submit-opt-415
ei2106-submit-opt-415
Joseph Lanzone
 

Similar to 297 short story (20)

[DL輪読会]ClearGrasp
[DL輪読会]ClearGrasp[DL輪読会]ClearGrasp
[DL輪読会]ClearGrasp
 
Shadow removal using Image Processing (Case study and code Implementation)
Shadow removal using Image Processing (Case study and code Implementation)Shadow removal using Image Processing (Case study and code Implementation)
Shadow removal using Image Processing (Case study and code Implementation)
 
Deep Generative Models - Kevin McGuinness - UPC Barcelona 2018
Deep Generative Models - Kevin McGuinness - UPC Barcelona 2018Deep Generative Models - Kevin McGuinness - UPC Barcelona 2018
Deep Generative Models - Kevin McGuinness - UPC Barcelona 2018
 
Pratik ibm-open power-ppt
Pratik ibm-open power-pptPratik ibm-open power-ppt
Pratik ibm-open power-ppt
 
Lm342080283
Lm342080283Lm342080283
Lm342080283
 
Introduction to 3D Computer Vision and Differentiable Rendering
Introduction to 3D Computer Vision and Differentiable RenderingIntroduction to 3D Computer Vision and Differentiable Rendering
Introduction to 3D Computer Vision and Differentiable Rendering
 
Project_Final_Review.pdf
Project_Final_Review.pdfProject_Final_Review.pdf
Project_Final_Review.pdf
 
Seeing what a gan cannot generate: paper review
Seeing what a gan cannot generate: paper reviewSeeing what a gan cannot generate: paper review
Seeing what a gan cannot generate: paper review
 
Pose estimation from RGB images by deep learning
Pose estimation from RGB images by deep learningPose estimation from RGB images by deep learning
Pose estimation from RGB images by deep learning
 
Decomposing image generation into layout priction and conditional synthesis
Decomposing image generation into layout priction and conditional synthesisDecomposing image generation into layout priction and conditional synthesis
Decomposing image generation into layout priction and conditional synthesis
 
3D Modeling Techniques : Types and Specific Applications
3D Modeling Techniques : Types and Specific Applications3D Modeling Techniques : Types and Specific Applications
3D Modeling Techniques : Types and Specific Applications
 
IJCER (www.ijceronline.com) International Journal of computational Engineeri...
 IJCER (www.ijceronline.com) International Journal of computational Engineeri... IJCER (www.ijceronline.com) International Journal of computational Engineeri...
IJCER (www.ijceronline.com) International Journal of computational Engineeri...
 
VIBE: Video Inference for Human Body Pose and Shape Estimation
VIBE: Video Inference for Human Body Pose and Shape EstimationVIBE: Video Inference for Human Body Pose and Shape Estimation
VIBE: Video Inference for Human Body Pose and Shape Estimation
 
3-d interpretation from single 2-d image III
3-d interpretation from single 2-d image III3-d interpretation from single 2-d image III
3-d interpretation from single 2-d image III
 
Image colorization
Image colorizationImage colorization
Image colorization
 
Image colorization
Image colorizationImage colorization
Image colorization
 
sduGroupEvent
sduGroupEventsduGroupEvent
sduGroupEvent
 
[IJET-V1I6P16] Authors : Indraja Mali , Saumya Saxena ,Padmaja Desai , Ajay G...
[IJET-V1I6P16] Authors : Indraja Mali , Saumya Saxena ,Padmaja Desai , Ajay G...[IJET-V1I6P16] Authors : Indraja Mali , Saumya Saxena ,Padmaja Desai , Ajay G...
[IJET-V1I6P16] Authors : Indraja Mali , Saumya Saxena ,Padmaja Desai , Ajay G...
 
On constructing z dimensional Image By DIBR Synthesized Images
On constructing z dimensional Image By DIBR Synthesized ImagesOn constructing z dimensional Image By DIBR Synthesized Images
On constructing z dimensional Image By DIBR Synthesized Images
 
ei2106-submit-opt-415
ei2106-submit-opt-415ei2106-submit-opt-415
ei2106-submit-opt-415
 

Recently uploaded

Assistant Engineer (Chemical) Interview Questions.pdf
Assistant Engineer (Chemical) Interview Questions.pdfAssistant Engineer (Chemical) Interview Questions.pdf
Assistant Engineer (Chemical) Interview Questions.pdf
Seetal Daas
 
A high-Speed Communication System is based on the Design of a Bi-NoC Router, ...
A high-Speed Communication System is based on the Design of a Bi-NoC Router, ...A high-Speed Communication System is based on the Design of a Bi-NoC Router, ...
A high-Speed Communication System is based on the Design of a Bi-NoC Router, ...
DharmaBanothu
 
Generative AI Use cases applications solutions and implementation.pdf
Generative AI Use cases applications solutions and implementation.pdfGenerative AI Use cases applications solutions and implementation.pdf
Generative AI Use cases applications solutions and implementation.pdf
mahaffeycheryld
 
Open Channel Flow: fluid flow with a free surface
Open Channel Flow: fluid flow with a free surfaceOpen Channel Flow: fluid flow with a free surface
Open Channel Flow: fluid flow with a free surface
Indrajeet sahu
 
5G Radio Network Througput Problem Analysis HCIA.pdf
5G Radio Network Througput Problem Analysis HCIA.pdf5G Radio Network Througput Problem Analysis HCIA.pdf
5G Radio Network Througput Problem Analysis HCIA.pdf
AlvianRamadhani5
 
1FIDIC-CONSTRUCTION-CONTRACT-2ND-ED-2017-RED-BOOK.pdf
1FIDIC-CONSTRUCTION-CONTRACT-2ND-ED-2017-RED-BOOK.pdf1FIDIC-CONSTRUCTION-CONTRACT-2ND-ED-2017-RED-BOOK.pdf
1FIDIC-CONSTRUCTION-CONTRACT-2ND-ED-2017-RED-BOOK.pdf
MadhavJungKarki
 
openshift technical overview - Flow of openshift containerisatoin
openshift technical overview - Flow of openshift containerisatoinopenshift technical overview - Flow of openshift containerisatoin
openshift technical overview - Flow of openshift containerisatoin
snaprevwdev
 
SCALING OF MOS CIRCUITS m .pptx
SCALING OF MOS CIRCUITS m                 .pptxSCALING OF MOS CIRCUITS m                 .pptx
SCALING OF MOS CIRCUITS m .pptx
harshapolam10
 
Digital Twins Computer Networking Paper Presentation.pptx
Digital Twins Computer Networking Paper Presentation.pptxDigital Twins Computer Networking Paper Presentation.pptx
Digital Twins Computer Networking Paper Presentation.pptx
aryanpankaj78
 
Transformers design and coooling methods
Transformers design and coooling methodsTransformers design and coooling methods
Transformers design and coooling methods
Roger Rozario
 
一比一原版(爱大毕业证书)爱荷华大学毕业证如何办理
一比一原版(爱大毕业证书)爱荷华大学毕业证如何办理一比一原版(爱大毕业证书)爱荷华大学毕业证如何办理
一比一原版(爱大毕业证书)爱荷华大学毕业证如何办理
nedcocy
 
一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理
一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理
一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理
ecqow
 
AI + Data Community Tour - Build the Next Generation of Apps with the Einstei...
AI + Data Community Tour - Build the Next Generation of Apps with the Einstei...AI + Data Community Tour - Build the Next Generation of Apps with the Einstei...
AI + Data Community Tour - Build the Next Generation of Apps with the Einstei...
Paris Salesforce Developer Group
 
UNIT 4 LINEAR INTEGRATED CIRCUITS-DIGITAL ICS
UNIT 4 LINEAR INTEGRATED CIRCUITS-DIGITAL ICSUNIT 4 LINEAR INTEGRATED CIRCUITS-DIGITAL ICS
UNIT 4 LINEAR INTEGRATED CIRCUITS-DIGITAL ICS
vmspraneeth
 
smart pill dispenser is designed to improve medication adherence and safety f...
smart pill dispenser is designed to improve medication adherence and safety f...smart pill dispenser is designed to improve medication adherence and safety f...
smart pill dispenser is designed to improve medication adherence and safety f...
um7474492
 
Null Bangalore | Pentesters Approach to AWS IAM
Null Bangalore | Pentesters Approach to AWS IAMNull Bangalore | Pentesters Approach to AWS IAM
Null Bangalore | Pentesters Approach to AWS IAM
Divyanshu
 
Asymmetrical Repulsion Magnet Motor Ratio 6-7.pdf
Asymmetrical Repulsion Magnet Motor Ratio 6-7.pdfAsymmetrical Repulsion Magnet Motor Ratio 6-7.pdf
Asymmetrical Repulsion Magnet Motor Ratio 6-7.pdf
felixwold
 
Prediction of Electrical Energy Efficiency Using Information on Consumer's Ac...
Prediction of Electrical Energy Efficiency Using Information on Consumer's Ac...Prediction of Electrical Energy Efficiency Using Information on Consumer's Ac...
Prediction of Electrical Energy Efficiency Using Information on Consumer's Ac...
PriyankaKilaniya
 
Introduction to Computer Networks & OSI MODEL.ppt
Introduction to Computer Networks & OSI MODEL.pptIntroduction to Computer Networks & OSI MODEL.ppt
Introduction to Computer Networks & OSI MODEL.ppt
Dwarkadas J Sanghvi College of Engineering
 
Object Oriented Analysis and Design - OOAD
Object Oriented Analysis and Design - OOADObject Oriented Analysis and Design - OOAD
Object Oriented Analysis and Design - OOAD
PreethaV16
 

Recently uploaded (20)

Assistant Engineer (Chemical) Interview Questions.pdf
Assistant Engineer (Chemical) Interview Questions.pdfAssistant Engineer (Chemical) Interview Questions.pdf
Assistant Engineer (Chemical) Interview Questions.pdf
 
A high-Speed Communication System is based on the Design of a Bi-NoC Router, ...
A high-Speed Communication System is based on the Design of a Bi-NoC Router, ...A high-Speed Communication System is based on the Design of a Bi-NoC Router, ...
A high-Speed Communication System is based on the Design of a Bi-NoC Router, ...
 
Generative AI Use cases applications solutions and implementation.pdf
Generative AI Use cases applications solutions and implementation.pdfGenerative AI Use cases applications solutions and implementation.pdf
Generative AI Use cases applications solutions and implementation.pdf
 
Open Channel Flow: fluid flow with a free surface
Open Channel Flow: fluid flow with a free surfaceOpen Channel Flow: fluid flow with a free surface
Open Channel Flow: fluid flow with a free surface
 
5G Radio Network Througput Problem Analysis HCIA.pdf
5G Radio Network Througput Problem Analysis HCIA.pdf5G Radio Network Througput Problem Analysis HCIA.pdf
5G Radio Network Througput Problem Analysis HCIA.pdf
 
1FIDIC-CONSTRUCTION-CONTRACT-2ND-ED-2017-RED-BOOK.pdf
1FIDIC-CONSTRUCTION-CONTRACT-2ND-ED-2017-RED-BOOK.pdf1FIDIC-CONSTRUCTION-CONTRACT-2ND-ED-2017-RED-BOOK.pdf
1FIDIC-CONSTRUCTION-CONTRACT-2ND-ED-2017-RED-BOOK.pdf
 
openshift technical overview - Flow of openshift containerisatoin
openshift technical overview - Flow of openshift containerisatoinopenshift technical overview - Flow of openshift containerisatoin
openshift technical overview - Flow of openshift containerisatoin
 
SCALING OF MOS CIRCUITS m .pptx
SCALING OF MOS CIRCUITS m                 .pptxSCALING OF MOS CIRCUITS m                 .pptx
SCALING OF MOS CIRCUITS m .pptx
 
Digital Twins Computer Networking Paper Presentation.pptx
Digital Twins Computer Networking Paper Presentation.pptxDigital Twins Computer Networking Paper Presentation.pptx
Digital Twins Computer Networking Paper Presentation.pptx
 
Transformers design and coooling methods
Transformers design and coooling methodsTransformers design and coooling methods
Transformers design and coooling methods
 
一比一原版(爱大毕业证书)爱荷华大学毕业证如何办理
一比一原版(爱大毕业证书)爱荷华大学毕业证如何办理一比一原版(爱大毕业证书)爱荷华大学毕业证如何办理
一比一原版(爱大毕业证书)爱荷华大学毕业证如何办理
 
一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理
一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理
一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理
 
AI + Data Community Tour - Build the Next Generation of Apps with the Einstei...
AI + Data Community Tour - Build the Next Generation of Apps with the Einstei...AI + Data Community Tour - Build the Next Generation of Apps with the Einstei...
AI + Data Community Tour - Build the Next Generation of Apps with the Einstei...
 
UNIT 4 LINEAR INTEGRATED CIRCUITS-DIGITAL ICS
UNIT 4 LINEAR INTEGRATED CIRCUITS-DIGITAL ICSUNIT 4 LINEAR INTEGRATED CIRCUITS-DIGITAL ICS
UNIT 4 LINEAR INTEGRATED CIRCUITS-DIGITAL ICS
 
smart pill dispenser is designed to improve medication adherence and safety f...
smart pill dispenser is designed to improve medication adherence and safety f...smart pill dispenser is designed to improve medication adherence and safety f...
smart pill dispenser is designed to improve medication adherence and safety f...
 
Null Bangalore | Pentesters Approach to AWS IAM
Null Bangalore | Pentesters Approach to AWS IAMNull Bangalore | Pentesters Approach to AWS IAM
Null Bangalore | Pentesters Approach to AWS IAM
 
Asymmetrical Repulsion Magnet Motor Ratio 6-7.pdf
Asymmetrical Repulsion Magnet Motor Ratio 6-7.pdfAsymmetrical Repulsion Magnet Motor Ratio 6-7.pdf
Asymmetrical Repulsion Magnet Motor Ratio 6-7.pdf
 
Prediction of Electrical Energy Efficiency Using Information on Consumer's Ac...
Prediction of Electrical Energy Efficiency Using Information on Consumer's Ac...Prediction of Electrical Energy Efficiency Using Information on Consumer's Ac...
Prediction of Electrical Energy Efficiency Using Information on Consumer's Ac...
 
Introduction to Computer Networks & OSI MODEL.ppt
Introduction to Computer Networks & OSI MODEL.pptIntroduction to Computer Networks & OSI MODEL.ppt
Introduction to Computer Networks & OSI MODEL.ppt
 
Object Oriented Analysis and Design - OOAD
Object Oriented Analysis and Design - OOADObject Oriented Analysis and Design - OOAD
Object Oriented Analysis and Design - OOAD
 

297 short story

  • 1. Recovering the 3D shape of an object from 2D images By Rishitha Bandi(015237521)
  • 2. Introduction ● Natural images are projection of 3D object on a 2D image plane. ● The knowledge gained from the 2D images is used to recover the 3D shape of the object. The methodologies of recovering 3D shape varies with the source of 2D images.
  • 3. Existing Methods of GANs ● Generative adversarial networks can model the 2D natural image manifold of diverse object categories with high fidelity. ● The phenomenon of GAN changing viewpoint of object in generated images motivates the possibility of reconstructing the 3D shape from single 2D image ● However, due to either heavy memory consumption or increased difficulty in training, there are still gaps between their image qualities and those of 2D GANs. ● In unsupervised learning of 3D shapes, it doesn’t need manual annotations and lacks multiple viewpoints and lightning instances. To tackle it they adopted the shapes from the external 3D shapes or 2D images which learns viewpoint and shape via autoencoders that explicitly model the rendering process.
  • 4. Methodologies for Recovering 3D Shape ● The methodology of using GAN to learn 3D shapes from images which rely on explicitly modelling 3D representation and rendering during training. ● Another methodology of learning 3D shapes from the images learns the viewpoint and shapes of images in ‘analysis by synthesis’ manner.
  • 5. Proposed model ● On the image generated by GAN an ellipsoid is employed and as its first 3D object shape and render several unnatural images called pseudo samples. ● Pseudo samples are randomly sampled viewpoints and lighting conditions. ● By reconstructing pseudo samples using the GAN, those could guide the original image towards the sampled viewpoints and lighting conditions in the GAN manifold, producing a number of natural-looking images, called projected samples.
  • 6. Continue.. ● We present the first attempt to reconstruct the 3D object shapes using GANs that are pre-trained on 2D images only. Our work shows that 2D GANs inherently capture rich 3D knowledge for different object categories and provides a new perspective for 3D shape generation. ● Our work also provides an alternative unsupervised 3D shape learning method and does not rely on the symmetry assumption of object shapes. ● We achieve highly photo-realistic 3D-aware image manipulations including rotation and relighting without using any external 3D models.
  • 7.
  • 8. Step 1: Using a Weak Shape Prior ● With single image instance to explore images with many viewpoints and lighting to the GAN image manifold could resort to ellipsoid as weak prior to create pseudo samples. ● The viewpoint v and lighting l are initialized with a canonical setting, i.e., v0 = 0 and lighting is from the front. ● This allows us to have an initial guess about the four factors d0(depth), a0(albedo), v0(viewpoint),l0(lighting), with d0 being the weak shape prior.
  • 9. Step 2: Sampling and Projecting to the GAN Image Manifold. ● Pseudo samples have unnatural distortions and shadows, they provide cues on how the face rotates and how the light changes. ● In order to leverage such cues to guide novel viewpoint and lighting direction exploration in the GAN image manifold, we perform GAN inversion to these pseudo samples, i.e., reconstruct them with the GAN generator
  • 10. Step 3: Learning the 3D Shape ● The viewpoint network, lighting network , the depth network and albedo network are all together optimised to learn the 3D shape. ● The reconstruction loss is minimised on when all the predictions are inferred. ● The unnatural distortions and shadows that exist in the pseudo samples are mitigated in the reconstructions.
  • 11. Conclusion We have presented the first method that directly leverages off-the-shelf 2D GANs to recover 3D object shapes from images. We found that existing 2D GANs inherently capture sufficient knowledge to recover 3D shapes for many object categories, including human faces, cats, cars, and buildings. Based on a weak convex shape prior, our method could explore the viewpoint and lighting variations in the GAN image manifold, and exploit these variations to refine the underlying object shape in an iterative manner.