Millions of people worldwide need glasses or contact lenses to see or read properly. We introduce a computational display technology that predistorts the presented content for an observer, so that the target image is perceived without the need for eyewear. We demonstrate a low-cost prototype that can correct myopia, hyperopia, astigmatism, and even higher-order aberrations that are difficult to correct with glasses.
We propose a flexible light field camera architecture that is at the convergence of optics, sensor electronics, and applied mathematics. Through the co-design of a sensor that comprises tailored, Angle Sensitive Pixels and advanced reconstruction algorithms, we show that—contrary to light field cameras today—our system can use the same measurements captured in a single sensor image to recover either a high-resolution 2D image, a low-resolution 4D light field using fast, linear processing, or a high-resolution light field using sparsity-constrained optimization.
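To make the two reconstruction paths concrete, here is a hedged toy sketch under a simplified linear measurement model y = A x, where the random matrix A merely stands in for the Angle Sensitive Pixel response (the real system would use a calibrated sensor model); it contrasts fast linear recovery with sparsity-constrained recovery.

```python
# Toy sketch only: A is a hypothetical random stand-in for the ASP response.
import numpy as np
from sklearn.linear_model import Lasso

rng = np.random.default_rng(0)
n_rays, n_pixels = 256, 64                    # toy light field and sensor sizes
A = rng.standard_normal((n_pixels, n_rays))   # stand-in angular response matrix
x_true = np.zeros(n_rays)
x_true[rng.choice(n_rays, 10, replace=False)] = rng.random(10)  # sparse light field
y = A @ x_true                                # single coded sensor image

# Path 1: fast linear processing (least squares) -> low-resolution estimate
x_linear = np.linalg.lstsq(A, y, rcond=None)[0]

# Path 2: sparsity-constrained optimization -> higher-fidelity estimate
x_sparse = Lasso(alpha=1e-3, max_iter=10_000).fit(A, y).coef_
```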
Inspired by Wheatstone’s original stereoscope and augmenting it with modern factored light field synthesis, we present a new near-eye display technology that supports focus cues. These cues are critical for mitigating visual discomfort experienced in commercially-available head mounted displays and providing comfortable, long-term immersive experiences.
Tailored Displays to Compensate for Visual Aberrations - SIGGRAPH Presentation, by Vitor Pamplona
Can we create a display that adapts itself to improve one's eyesight? Top figure compares the view of a 2.5-diopter farsighted individual in regular and tailored displays. We use currently available inexpensive technologies to warp light fields to compensate for refractive errors and scattering sites in the eye.
A compressive approach to light field synthesis with projection devices. We propose a novel, passive screen design that is combined with high-speed light field projection and nonnegative light field factorization. We demonstrate that the projector can alternatively achieve super-resolved and high dynamic range 2D image display when used with a conventional screen.
We have built a camera that can look around corners and beyond the line of sight. The camera uses light that travels from the object to the camera indirectly, by reflecting off walls or other obstacles, to reconstruct a 3D shape.
SIGGRAPH 2014 Course on Computational Cameras and Displays (part 2), by Matthew O'Toole
Recent advances in both computational photography and displays have given rise to a new generation of computational devices. Computational cameras and displays provide a visual experience that goes beyond the capabilities of traditional systems by adding computational power to optics, lights, and sensors. These devices are breaking new ground in the consumer market, including lightfield cameras that redefine our understanding of pictures (Lytro), displays for visualizing 3D/4D content without special eyewear (Nintendo 3DS), motion-sensing devices that use light coded in space or time to detect motion and position (Kinect, Leap Motion), and a movement toward ubiquitous computing with wearable cameras and displays (Google Glass).
This short (1.5 hour) course serves as an introduction to the key ideas and an overview of the latest work in computational cameras, displays, and light transport.
Computational Displays in 4D, 6D, 8D
We have explored how light propagates from thin elements into a viewing volume, for both automultiscopic displays and holograms. In particular, devices that are typically connected with geometric optics, like parallax barriers, differ in treatment from those that obey physical optics, like holograms. However, the two concepts are often used to achieve the same effect of capturing or displaying a combination of spatial and angular information. Our work connects the two approaches under a general framework based in ray space, from which insights into applications and limitations of both parallax-based and holography-based systems are observed.
Both parallax barrier systems and practical holographic displays are limited in that they only provide horizontal parallax. Mathematically, this is equivalent to saying that they can always be expressed as a rank-1 matrix (i.e., a matrix in which every column is a scalar multiple of a single column). Knowledge of this mathematical limitation has helped us to explore the space of possibilities and extend the capabilities of current display types. In particular, we have designed a display that uses two LCD panels, and an optimisation algorithm, to produce a content-adaptive automultiscopic display (SIGGRAPH Asia 2010).
(Joint work with R Horstmeyer, Se Baek Oh, George Barbastathis, Doug Lanman, Matt Hirsch and Yunhee Kim) http://cameraculture.media.mit.edu
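To make the rank argument above concrete, here is a minimal sketch of factoring a light field matrix L into two nonnegative layers, L ≈ F G. This is a generic multiplicative-update NMF illustration, not the paper's exact weighted objective: rank 1 corresponds to a fixed parallax barrier, while higher rank models time-multiplexed dual-LCD frame pairs.

```python
# Generic nonnegative factorization sketch (Lee & Seung multiplicative updates).
import numpy as np

def nmf(L, r, iters=200, eps=1e-9):
    """Approximate L ~= F @ G with nonnegative F (m x r) and G (r x n)."""
    m, n = L.shape
    rng = np.random.default_rng(1)
    F, G = rng.random((m, r)), rng.random((r, n))
    for _ in range(iters):
        G *= (F.T @ L) / (F.T @ F @ G + eps)   # update rear-layer patterns
        F *= (L @ G.T) / (F @ G @ G.T + eps)   # update front-layer patterns
    return F, G

L = np.random.default_rng(2).random((32, 32))  # toy spatio-angular matrix
F, G = nmf(L, r=4)                             # r time-multiplexed layer pairs
print(np.linalg.norm(L - F @ G) / np.linalg.norm(L))  # relative approximation error
```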
In other work we have developed a 6D optical system that responds to changes in viewpoint as well as changes in surrounding light. Our lenticular array alignment allows us to achieve such a system as a passive setup, omitting the need for electrical components. Unlike traditional 2D flat displays, our 6D displays discretize the incident light field and modulate 2D patterns in order to produce super-realistic (2D) images. By casting light at variable intensities and angles onto our 6D displays, we can produce multiple images as well as store greater information capacity on a single 2D film (SIGGRAPH 2008).
Ramesh Raskar joined the Media Lab from Mitsubishi Electric Research Laboratories in 2008 as head of the Lab’s Camera Culture research group. His research interests span the fields of computational photography, inverse problems in imaging and human-computer interaction. Recent inventions include transient imaging to look around a corner, next generation CAT-Scan machine, imperceptible markers for motion capture (Prakash), long distance barcodes (Bokode), touch+hover 3D interaction displays (BiDi screen), low-cost eye care devices (Netra) and new theoretical models to augment light fields (ALF) to represent wave phenomena.
In 2004, Raskar received the TR100 Award from Technology Review, which recognizes top young innovators under the age of 35, and in 2003, the Global Indus Technovator Award, instituted at MIT to recognize the top 20 Indian technology innovators worldwide. In 2009, he was awarded a Sloan Research Fellowship. In 2010, he received the DARPA Young Faculty Award. He holds over 40 US patents and has received four Mitsubishi Electric Invention Awards. He is currently co-authoring a book on Computational Photography. http://raskar.info
HR3D: Content Adaptive Parallax Barriers, SIGGRAPH Asia 2010 Technical Paper presentation, presented by Douglas Lanman (http://web.media.mit.edu/~dlanman). Please see the project page for more details: http://web.media.mit.edu/~mhirsch/hr3d
This is a project in the Camera Culture group (http://cameraculture.media.mit.edu) at the MIT Media Lab, led by Professor Ramesh Raskar (http://web.media.mit.edu/~raskar).
This presentation covers stereoscopic imaging in detail, from its history and an introduction through its working technique, 3D viewers, 3D cameras, future scope, and advantages and disadvantages. In all, it is a complete overview of the topic.
Montage4D: Interactive Seamless Fusion of Multiview Video Textures, by Ruofei Du
Project Site: http://montage4d.com
The commoditization of virtual and augmented reality devices and the availability of inexpensive consumer depth cameras have catalyzed a resurgence of interest in spatiotemporal performance capture. Recent systems like Fusion4D and Holoportation address several crucial problems in the real-time fusion of multiview depth maps into volumetric and deformable representations. Nonetheless, stitching multiview video textures onto dynamic meshes remains challenging due to imprecise geometries, occlusion seams, and critical time constraints. In this paper, we present a practical solution towards real-time seamless texture montage for dynamic multiview reconstruction. We build on the ideas of dilated depth discontinuities and majority voting from Holoportation to reduce ghosting effects when blending textures. In contrast to their approach, we determine the appropriate blend of textures per vertex using view-dependent rendering techniques, so as to avert fuzziness caused by the ubiquitous normal-weighted blending. By leveraging geodesics-guided diffusion and temporal texture fields, our algorithm mitigates spatial occlusion seams while preserving temporal consistency. Experiments demonstrate significant enhancement in rendering quality, especially in detailed regions such as faces. We envision a wide range of applications for Montage4D, including immersive telepresence for business, training, and live entertainment.
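As an illustration of the view-dependent blending idea, the hedged sketch below weights each camera's texture by how well its direction agrees with the current rendering viewpoint rather than with the surface normal. The exponent and interfaces are assumptions for illustration, not Montage4D's published scheme.

```python
# Hedged sketch: per-vertex view-dependent texture weights (illustrative only).
import numpy as np

def view_dependent_weights(vertex, cam_positions, eye, k=8):
    to_eye = eye - vertex
    to_eye /= np.linalg.norm(to_eye)
    w = []
    for c in cam_positions:
        to_cam = c - vertex
        to_cam /= np.linalg.norm(to_cam)
        # Favor cameras whose viewing ray agrees with the render viewpoint.
        w.append(max(0.0, float(to_eye @ to_cam)) ** k)
    w = np.array(w)
    return w / w.sum() if w.sum() > 0 else w

vertex = np.array([0.0, 0.0, 0.0])
cams = [np.array([1.0, 0.0, 1.0]), np.array([-1.0, 0.0, 1.0]), np.array([0.0, 1.0, 1.0])]
eye = np.array([0.5, 0.1, 1.0])
print(view_dependent_weights(vertex, cams, eye))
```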
Keywords: Signal processing, Applied optics, Computer graphics and vision, Electronics, Art, and Online photo collections
A computational camera attempts to digitally capture the essence of visual information by exploiting the synergistic combination of task-specific optics, illumination, sensors and processing. We will discuss and play with thermal cameras, multi-spectral cameras, high-speed and 3D range-sensing cameras, and camera arrays. We will learn about opportunities in scientific and medical imaging, mobile-phone based photography, cameras for HCI and sensors mimicking animal eyes.
We will learn about the complete camera pipeline. In several hands-on projects we will build physical imaging prototypes and understand how each stage of the imaging process can be manipulated.
We will learn about modern methods for capturing and sharing visual information. If novel cameras can be designed to sample light in radically new ways, then rich and useful forms of visual information may be recorded -- beyond those present in traditional photographs. Furthermore, if computational processes can be made aware of these novel imaging models, then the scene can be analyzed in higher dimensions and novel aesthetic renderings of the visual information can be synthesized.
In this course we will study this emerging multi-disciplinary field -- one which is at the intersection of signal processing, applied optics, computer graphics and vision, electronics, art, and online sharing through social networks. We will examine whether such innovative camera-like sensors can overcome tough problems in scene understanding and generate insightful awareness. In addition, we will develop new algorithms to exploit unusual optics, programmable wavelength control, and femto-second accurate photon counting to decompose the sensed values into perceptually critical elements.
SIGGRAPH 2018 - Full Rays Ahead! From Raster to Real-Time Raytracing, by Electronic Arts / DICE
In this presentation part of the "Introduction to DirectX Raytracing" course, Colin Barré-Brisebois of SEED discusses some of the challenges the team had to go through when going from raster to real-time raytracing for Project PICA PICA.
Using Panoramic Videos for Multi-Person Localization and Tracking In A 3D Pan..., by Fan Yang
3D panoramic multi-person localization and tracking are prominent in many applications, however, conventional methods using LiDAR equipment could be economically expensive and also computationally inefficient due to the processing of point cloud data. In this work, we propose an effective and efficient approach at a low cost. First, we utilize RGB panoramic videos instead of LiDAR data. Then, we transform human locations from a 2D panoramic image coordinate to a 3D panoramic camera coordinate using camera geometry and human bio-metric property (i.e., height). Finally, we generate 3D tracklets by associating human appearance and 3D trajectory. We verify the effectiveness of our method on three datasets including a new one built by us, in terms of 3D single-view multi-person localization, 3D single-view multi-person tracking, and 3D panoramic multi-person localization and tracking. Our code is available at \url{https://github.com/fandulu/MPLT}.
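A minimal sketch of the described 2D-to-3D lifting, assuming an equirectangular projection and a small-angle range estimate from a known person height. The coordinate conventions and parameter values are illustrative assumptions, not the paper's exact formulation.

```python
# Hedged sketch: lift a 2D panoramic detection to a 3D camera-frame position.
import numpy as np

def panoramic_pixel_to_ray(u, v, width, height):
    """Equirectangular pixel -> (azimuth, elevation) in radians."""
    azimuth = 2 * np.pi * (u / width) - np.pi
    elevation = np.pi / 2 - np.pi * (v / height)
    return azimuth, elevation

def localize_person(u, v_head, v_feet, width, height, person_height_m=1.7):
    az, el_head = panoramic_pixel_to_ray(u, v_head, width, height)
    _, el_feet = panoramic_pixel_to_ray(u, v_feet, width, height)
    # Small-angle range estimate: a person of known height subtends
    # (el_head - el_feet) radians at distance r, so r ~= H / d_el.
    r = person_height_m / (el_head - el_feet)
    el_mid = 0.5 * (el_head + el_feet)
    return np.array([r * np.cos(el_mid) * np.cos(az),
                     r * np.cos(el_mid) * np.sin(az),
                     r * np.sin(el_mid)])

print(localize_person(u=2500, v_head=900, v_feet=1100, width=3840, height=1920))
```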
For the full video of this presentation, please visit: https://www.edge-ai-vision.com/2023/09/next-generation-computer-vision-methods-for-automated-navigation-of-unmanned-aircraft-a-presentation-from-immervision/
Julie Buquet, Applied Researcher for Imaging and AI at Immervision, presents the “Next-generation Computer Vision Methods for Automated Navigation of Unmanned Aircraft” tutorial at the May 2023 Embedded Vision Summit.
Unmanned aircraft systems (UASs) need to perform accurate autonomous navigation using sense-and-avoid algorithms under varying illumination conditions. This requires robust algorithms able to perform consistently, even when image quality is poor.
In this presentation, Buquet shares the results of Immervision’s research on the impact of noise and blur on corner detection algorithms and CNN-based 2D object detectors used for drone navigation. Specifically, she shows how to fine-tune these algorithms to make them effective in extreme low light (0.5 lux) and on images with high levels of noise or blur. She also highlights the main benefits of using such computer vision methods for drone navigation.
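As a hedged illustration of this kind of robustness fine-tuning, one common approach is to augment training images with synthetic noise and blur before retraining the detector. The degradation model and parameters below are illustrative assumptions, not Immervision's actual pipeline.

```python
# Hedged sketch: degradation-aware data augmentation for low-light robustness.
import numpy as np
from scipy.ndimage import gaussian_filter

def degrade(img, blur_sigma=1.5, noise_sigma=0.05, rng=None):
    rng = rng if rng is not None else np.random.default_rng()
    out = gaussian_filter(img.astype(float), sigma=blur_sigma)  # optics/motion blur proxy
    out = out + rng.normal(0.0, noise_sigma, img.shape)         # low-light noise proxy
    return np.clip(out, 0.0, 1.0)

# Usage: fine-tune the detector on degrade(x) for each training image x.
```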
Movement Tracking in Real-time Hand Gesture Recognition, by Pranav Kulkarni
To translate the gesture performed by the user in a video sequence into meaningful symbols/commands, feature extraction is the first and most crucial step in such systems; it measures the detected hand positions and their movement track. We propose an efficient approach based on inter-frame difference (IDF) to handle hand movement tracking, which is shown to be more robust in accuracy than skin-color based approaches. Computational efficiency is another attractive property: our approach greatly improves the processing frame rate to fulfil the demands of a real-time hand gesture recognition system.
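A minimal sketch of the inter-frame-difference idea, under the simplifying assumption that the hand is the dominant moving object, so the centroid of thresholded frame differences tracks it. Thresholds and kernel sizes are illustrative; this is not the authors' exact method.

```python
# Hedged sketch: inter-frame-difference motion tracking with OpenCV.
import cv2
import numpy as np

cap = cv2.VideoCapture(0)                  # webcam; assumes a camera is attached
ok, prev = cap.read()
prev = cv2.cvtColor(prev, cv2.COLOR_BGR2GRAY)
while True:
    ok, frame = cap.read()
    if not ok:
        break
    gray = cv2.cvtColor(frame, cv2.COLOR_BGR2GRAY)
    diff = cv2.absdiff(gray, prev)                       # inter-frame difference
    _, mask = cv2.threshold(diff, 25, 255, cv2.THRESH_BINARY)
    mask = cv2.medianBlur(mask, 5)                       # suppress sensor noise
    ys, xs = np.nonzero(mask)
    if len(xs) > 0:
        cx, cy = int(xs.mean()), int(ys.mean())          # motion centroid ~ hand
        cv2.circle(frame, (cx, cy), 8, (0, 255, 0), -1)
    cv2.imshow("IFD tracking", frame)
    prev = gray
    if cv2.waitKey(1) & 0xFF == ord("q"):
        break
cap.release()
cv2.destroyAllWindows()
```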
For the full video of this presentation, please visit:
https://www.embedded-vision.com/platinum-members/embedded-vision-alliance/embedded-vision-training/videos/pages/may-2018-embedded-vision-summit-kanade
For more information about embedded vision, please visit:
http://www.embedded-vision.com
Dr. Takeo Kanade, U.A. and Helen Whitaker Professor at Carnegie Mellon University, presents the "Think Like an Amateur, Do As an Expert: Lessons from a Career in Computer Vision" tutorial at the May 2018 Embedded Vision Summit.
In this keynote presentation, Dr. Kanade shares his experiences and lessons learned in developing a vast range of pioneering computer vision systems and autonomous robots, including face recognition, autonomously-driven cars, computer-assisted surgical robots, robot helicopters, biological live cell tracking and a system for sports broadcasts. Most researchers, when asked their fondest desire, respond that they want to do good research. If asked what constitutes “good research,” they often find it difficult to give a clear answer. For Dr. Kanade, good research derives from solving real-world problems, delivering useful results to society.
“Think like an amateur, do as an expert” is Dr. Kanade's research motto: When conceptualizing a problem and its possible solution, think simply and openly, as a novice in that field, without preconceived notions. When implementing a solution, on the other hand, do so thoroughly, meticulously and with expert skill. In his research projects, Dr. Kanade has met and worked with people from diverse backgrounds, and has encountered many challenges. While exploring the technical side of some of his most important projects, he also describes experiences that highlight the enjoyable aspects of a researcher’s life—those that have occurred accidentally or inevitably as his “Think like an amateur, do as an expert” approach has guided his interactions with problems and people.
A new gaze-contingent rendering mode for VR/AR that renders perceptually correct ocular parallax, which benefits depth perception and perceptual realism.
Imaging objects obscured by occluders is a significant challenge for many applications. A camera that could “see around corners” could help improve navigation and mapping capabilities of autonomous vehicles or make search and rescue missions more effective. Time-resolved single-photon imaging systems have recently been demonstrated to record optical information of a scene that can lead to an estimation of the shape and reflectance of objects hidden from the line of sight of a camera. However, existing non-line-of-sight (NLOS) reconstruction algorithms have been constrained in the types of light transport effects they model for the hidden scene parts. We introduce a factored NLOS light transport representation that accounts for partial occlusions and surface normals. Based on this model, we develop a factorization approach for inverse time-resolved light transport and demonstrate high-fidelity NLOS reconstructions for challenging scenes both in simulation and with an experimental NLOS imaging system.
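To make the shape of this inverse problem concrete, here is a generic toy sketch, not the paper's algorithm: time-resolved measurements depend linearly on hidden-scene albedo through a transport matrix, and a factored model alternates between an albedo estimate and a visibility (partial-occlusion) estimate.

```python
# Generic alternating-least-squares illustration of a factored transport model.
import numpy as np

rng = np.random.default_rng(0)
n_meas, n_voxels = 200, 50
A = rng.random((n_meas, n_voxels))                       # stand-in transport matrix
albedo_true = rng.random(n_voxels)
vis_true = (rng.random(n_voxels) > 0.3).astype(float)    # partial-occlusion mask
tau = A @ (vis_true * albedo_true)                       # measured transients

# Alternate between nonnegative albedo and visibility estimates.
albedo, vis = np.ones(n_voxels), np.ones(n_voxels)
for _ in range(50):
    albedo = np.clip(np.linalg.lstsq(A * vis, tau, rcond=None)[0], 0, None)
    vis = np.clip(np.linalg.lstsq(A * albedo, tau, rcond=None)[0], 0, 1)
```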
Existing Approaches
Multiplane: Rolland et al., Applied Optics 2000; Akeley et al., SIGGRAPH 2004
Focal Surfaces: Matsuda et al., SIGGRAPH 2017
Light Field: Huang et al., SIGGRAPH 2015; Lanman et al., SIGGRAPH Asia 2013
Holographic: Maimone et al., SIGGRAPH 2017
Adaptive Focus: Sugihara et al., SID 1998; Liu et al., ISMAR 2008; Koulieris et al., SIGGRAPH 2017; Padmanaban et al., PNAS 2017
Slide: chart comparing the Conventional, AI, AI 2-plane, and AI 3-plane display modes on a scale from 3D down through 2.5D, 2D, 1.5D, and 1D to 0D.
Editor's Notes
These displays differ from the other displays presented today in that they don't rely on the retinal blur cue to drive accommodation
If we look at a conventional VR display
It creates virtual images at some fixed distance away, which is what everyone is attempting to get away from, because it causes our eyes to focus only to that distance.
This design is not viable for anyone trying to support accommodation
If we were to look inside one of these displays and focus to 25cm, we’d see quite a blurry image
But as we focus closer and closer to the plane of the virtual image, we see a crisper image appear, until we focus to the plane of the virtual image and see a sharp image
Like autofocus
This is effectively how our auto-focusing mechanism works, by taking advantage of the gradient in the perceived blur
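As a camera-world analogue of this blur-gradient mechanism, here is a hedged sketch of contrast-detection autofocus: try several focus settings, score sharpness, and pick the sharpest. capture_at is a hypothetical stand-in for grabbing a frame at a given focus setting.

```python
# Hedged sketch: contrast-detection autofocus via Laplacian sharpness.
import numpy as np
from scipy.ndimage import laplace

def sharpness(img):
    # High variance of the Laplacian indicates strong edges, i.e. in focus.
    return float(np.var(laplace(img.astype(float))))

def autofocus(capture_at, focus_settings):
    # capture_at(f) is a hypothetical callback returning a frame at focus f.
    scores = [sharpness(capture_at(f)) for f in focus_settings]
    return focus_settings[int(np.argmax(scores))]
```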
What is a point spread function?
However, today, we propose a display type that will have a consistent perceived blur regardless of focus state
Meaning that as we refocus, we end up seeing the exact same image... Which breaks our conventional method of refocusing
Point spread function is the impulse response of a system to a point light source.
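A minimal sketch of this definition in practice: a defocused system maps a point source to a disk-shaped PSF whose radius grows with defocus, and convolving a sharp image with that PSF predicts the perceived blur. The linear radius-versus-diopters constant here is a toy assumption.

```python
# Hedged sketch: simulate defocus blur by convolving with a disk PSF.
import numpy as np
from scipy.signal import fftconvolve

def disk_psf(radius_px, size=31):
    y, x = np.mgrid[-(size // 2):size // 2 + 1, -(size // 2):size // 2 + 1]
    psf = (x**2 + y**2 <= radius_px**2).astype(float)
    return psf / psf.sum()

sharp = np.random.default_rng(0).random((128, 128))   # stand-in scene
for defocus_diopters in (0.5, 1.0, 2.0):
    radius = 4.0 * defocus_diopters   # toy constant: blur grows with defocus
    blurred = fftconvolve(sharp, disk_psf(radius), mode="same")
```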
Let’s take a step back and understand how our visual system works.
When we look around the real world, our eyes perform two actions simultaneously: vergence and accommodation.
Vergence refers to the relative rotation of our eyeballs in their sockets. If I hold my finger up like this and look at it, my eyes rotate inwards. Simple enough.
Cross-coupling -> real world
In the real world, this consistent cross-coupling allows our vergence and accommodation to converge faster
Both systems are always driven to the same distance
But this is not the case in current VR displays
These systems support vergence; the displays present correct stereoscopic images
Virtual image plane is fixed due to glass optics
Mismatch between the binocular disparity and retinal blur cues, and this cross-coupling is now in conflict
This mismatch causes headaches, eye strain, and reduced visual clarity
Therefore the goal in general here, is to support multiple accommodative planes or even better a continuous range of accommodation distances.
Underlying theme: produce realistic retinal blur cues to drive accommodation
Not what we are trying to achieve
Optically disable
And ask a more fundamental question.
Given that the accommodation state of the two eyes are linked, can the accommodation switch between these two planes?
If so, then we could get two planes of accommodation by simply switching out one of the lenses in your favorite headset, and changing the rendering pipeline slightly to account for the change in magnification. That would be great!
And the answer to the question is... no.
But before we dive into our results, let me show you how we came up with this idea of monovision. It is actually a common alternative to bifocal lenses when treating presbyopia.
Think of your eye as a camera
Because of the incredibly wide depth of field, objects at all depths look the same, which is the same as removing the retinal blur cue!
Pinhole camera to pinhole display
We don't need to constrict the pupil; we can just open up a very small exit pupil in the system
Because of the incredibly wide depth of field, objects at all depths look the same, which is the same as removing the retinal blur cue
Need pupil tracking + steering to support this system
Make this slide have an animation where the retinal blur cues are superimposed and added, eventually creating a depth-invariant blur cue
60 Hz
All images from 3D diopters
Here's our take on adaptive focus display hardware
While others try to build smaller and smaller displays, we probably built the world's biggest VR display here
A clever combination of time-modulated backlight intensity and displayed images may be a viable approach to optimizing image resolution
Strobe backlight during sweep
Creates multiple virtual images
Not multifocal
Accommodation driven to plane closer to vergence distance
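A hedged sketch of the strobe timing: if the virtual image distance sweeps sinusoidally, pulsing the backlight at the moments the sweep crosses target depths leaves virtual images only at those depths. All numbers here are illustrative assumptions, not the prototype's actual drive parameters.

```python
# Hedged sketch: strobe times that place virtual images at chosen depths.
import numpy as np

sweep_hz = 60.0
d_mean, d_amp = 2.25, 1.75     # illustrative sweep range in diopters

def sweep_diopters(t):
    return d_mean + d_amp * np.sin(2 * np.pi * sweep_hz * t)

def strobe_times(target_d):
    """Times within one sweep period where the virtual image sits at target_d."""
    phase = np.arcsin((target_d - d_mean) / d_amp)
    period = 1.0 / sweep_hz
    t_rising = (phase / (2 * np.pi * sweep_hz)) % period
    t_falling = ((np.pi - phase) / (2 * np.pi * sweep_hz)) % period
    return [t_rising, t_falling]

print(strobe_times(1.0), strobe_times(3.0))  # e.g. a 2-plane AI mode at 1 D and 3 D
```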
Till now I’ve shown you the conventional display mode, with only 1 virtual image plane, and
The accommodation invariant mode where we perform the focal sweep.
But now with the strobe we can implement a 2-plane AI mode
You can see that the blur
Explain what MTF is
Industry-standard slanted-edge method of capturing the MTF
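A simplified slanted-edge MTF sketch: differentiate the edge spread function (ESF) to get the line spread function (LSF), then take its Fourier magnitude. A real ISO 12233 implementation also estimates the edge angle and supersamples; this toy version assumes a vertical edge.

```python
# Hedged sketch: toy slanted-edge MTF measurement.
import numpy as np

def mtf_from_edge(edge_image):
    esf = edge_image.mean(axis=0)        # average rows across a vertical edge
    lsf = np.gradient(esf)               # LSF = d(ESF)/dx
    mtf = np.abs(np.fft.rfft(lsf))
    return mtf / mtf[0]                  # normalize to DC

x = np.linspace(-3, 3, 128)
edge = np.tile(1 / (1 + np.exp(-x / 0.3)), (32, 1))  # synthetic blurred edge
print(mtf_from_edge(edge)[:5])
```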
For all modes, a 6.2cm Maltese cross oscillated between 0.5 and 4 D (mean 2.25 D, amplitude 1.75 D) at 0.125 Hz
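For reference, that stimulus trajectory as code, assuming a sinusoidal profile (the notes specify the mean, amplitude, and frequency but not the waveform):

```python
# Hedged sketch: target depth in diopters over time, assuming a sinusoid.
import numpy as np

def target_diopters(t_seconds):
    return 2.25 + 1.75 * np.sin(2 * np.pi * 0.125 * t_seconds)  # 0.5 D .. 4 D
```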
This indicates that disparity-driven accommodation via the removal of focus cues in a near-eye display can be achieved, although the resulting accommodative gain is not quite as high as with natural focus cues.
However, there are many depth cues at play here and we are mainly interested in the effect of binocular disparity specifically on accommodation (removing all other cues)
To do so, we performed a second study
Static target at 9 discrete depths
2 second blank period
3 second stimulus
Interesting because we don’t see the step response we’d expect to see for the 2-plane and 3-plane modes. But then again this is averaged data
When we look at some individual data plots, we see that some people show a very strong response to the AI condition, while others show none at all.
It would be interesting to investigate why there is this much discrepancy between users.
I want to start with the stereoscope …
However, the basic optics of these systems have remained largely unchanged since their conception over a century ago.
Stereoscope, 1800s, similar to today
Two views to get 3D perception
Why is this a computational display?
Point spread function engineering