Diffraction-engineered holography: Beyond the depth representation limit of holographic displays

Yang, Daeho; Seo, Wontaek; Yu, Hyeonseung; Kim, Sun Il; Shin, Bongsu; Lee, Chang-Kun; Moon, Seokil; An, Jungkwuen; Hong, Jong-Young; Sung, Geeyoung; Lee, Hong-Seok

doi:10.1038/s41467-022-33728-5

Diffraction-engineered holography: Beyond the depth representation limit of holographic displays

Article
Open access
Published: 12 October 2022

Volume 13, article number 6012, (2022)
Cite this article

Download PDF

You have full access to this open access article

From

View current issue

Diffraction-engineered holography: Beyond the depth representation limit of holographic displays

Download PDF

7167 Accesses
29 Citations
2 Altmetric
Explore all metrics

Abstract

Holography is one of the most prominent approaches to realize true-to-life reconstructions of objects. However, owing to the limited resolution of spatial light modulators compared to static holograms, reconstructed objects exhibit various coherent properties, such as content-dependent defocus blur and interference-induced noise. The coherent properties severely distort depth perception, the core of holographic displays to realize 3D scenes beyond 2D displays. Here, we propose a hologram that imitates defocus blur of incoherent light by engineering diffracted pattern of coherent light with adopting multi-plane holography, thereby offering real world-like defocus blur and photorealistic reconstruction. The proposed hologram is synthesized by optimizing a wave field to reconstruct numerous varifocal images after propagating the corresponding focal distances where the varifocal images are rendered using a physically-based renderer. Moreover, to reduce the computational costs associated with rendering and optimizing, we also demonstrate a network-based synthetic method that requires only an RGB-D image.

Full-color, large area, transmissive holograms enabled by multi-level diffractive optics

Article Open access 19 July 2017

Breaking crosstalk limits to dynamic holography using orthogonality of high-dimensional random vectors

Article 22 March 2019

Ultrathin wide-angle large-area digital 3D holographic display using a non-periodic photon sieve

Article Open access 21 March 2019

Introduction

Holography is a recording and reconstruction process based on the interference of multiple wave fields¹. Holograms duplicate the wave field of the recorded object under an appropriate illumination and provide true-to-life reconstructions of three-dimensional (3D) objects². Beyond the reproduction of a recorded object, the computer-generated hologram (CGH), which is a numerically calculated hologram of a wave field of non-existing objects, enables the display of arbitrary 3D scenes and provides monocular depth cues, unlike traditional displays³.

Although holographic displays are free from vergence-accommodation conflict, which causes visual fatigue⁴ and a significant reduction in the depth constancy⁵, unsolved issues originating from their limited resolution still remain. A real-world object scatters light by reflecting light in various directions from the substructures of its rough surface⁶, and a static hologram can represent such substructures with a large effective number of pixels⁷. In contrast, dynamic holograms, of which the resolution is 3 orders of magnitude smaller than that of static holograms⁸, cannot spread light without the noise because the interference between voxels becomes noticeable as the number of voxels increases^9,10. From this perspective, dynamic holograms can be categorized into two different types, namely diffusive holograms and non-diffusive holograms (Fig. 1a).

**Fig. 1: Schematics of diffraction-engineered holography.**

Diffusive holograms spread light up to the maximum diffraction angle bounded by a pixel pitch by introducing high-frequency patterns^{10,11,12,13,14}. For example, high-frequency patterns can be included in holograms by placing voxels with sufficient separation between them^10,11, applying random phases^12,13, and employing point-based methods with physically correct phases¹⁴. In diffusive holograms, 3D objects can be seen at any position within a viewing angle and out-of-focus objects are blurred as real-world objects. However, the image quality is limited by a small number of points or interference between the points, displaying speckles on the reconstructed scenes^9,10.

In contrast to diffusive holograms, non-diffusive holograms concentrate on enhancing the image quality of reconstructed scenes. In this case, the position-dependent phase offset is imposed in point-based methods to avoid the rapid phase variation of different depth objects^15,16, phase-retrieval algorithms are adopted to reconstruct single-depth images^17,18, and quadratic phases are utilized to suppress the speckles^19,20. Although non-diffusive holograms tend to exhibit an enhanced image quality, the coherent properties of light become conspicuous due to a reduced numerical aperture and content-dependent defocus pattern^19,21. For instance, constructive and destructive interference patterns appear in the intensity distribution of non-diffusive holograms according to Fresnel propagation and those interference patterns are far different from the defocus blur of a real-world object. The inconsistent defocus patterns destroy the relationship between the depth and the blur, which is crucial in the context of depth perception^22,23. Moreover, the presence of a lucid boundary at the interface between objects with different depths due to interference distorts the perception of the relative depth between objects²⁴. Thus, for the high image quality hologram without distortion of depth perception, both advantages of diffusive holograms and non-diffusive holograms are required.

On the other hand, multi-plane hologram attracts great attention recently, especially for their improvement in image quality and computation time^{25,26,27,28,29}. For instance, non-convex optimization is adopted to minimize a custom cost function²⁵, dynamic adjustment of amplitude-constraint is employed to improve image quality²⁶, and a new algorithm based on singular value decomposition of the Fresnel impulse response function is proposed to enhance computational speed²⁷. Since the multi-plane hologram is synthesized by optimizing the wave field to reconstruct one image at one focal plane while optimizing to reconstruct other images at other focal planes, multi-plane holograms are widely adopted in dynamic 3D projections^30,31,32. However, experimental realization of high image quality reconstruction with a single wave field is still challenging^30,31,32,33.

Here, we demonstrate a diffraction-engineered hologram (DEH) that presents photorealistic scenes and real-world-like defocus blur, enhancing depth expressions of holographic displays by utilizing multi-plane holograms. We take advantage of the fact that the phase variation of light does not affect the image seen by the eyes, but steers the propagating direction of light. Contrary to most of the conventional CGH algorithms, which only optimize the intensity at object-existing planes^{15,16,17,18,19,20}, DEH also optimizes diffracted patterns at out-of-focus planes by adaptively changing the phase to enhance defocus blur while leaving the intensity at the object-existing planes nearly the same. To find the phase satisfying such diffracted pattern, the approach of multi-plane hologram^{25,26,27,28,29,30,31,33,34}, which reconstructs different images depending on a propagation distance, is adopted. The wave field of our hologram is optimized to reconstruct sharply focused images of an object at the object plane and reconstruct blurred images at other focal planes. To obtain blurred and focused images employed as optimization targets, varifocal images are rendered by a physically based renderer that properly handles occluded objects and provides an accurate blur circle similar to that of a human eye. As a result, the DEH achieves both superiorities, namely the image quality of non-diffusive holograms and the depth expression of diffusive holograms. Furthermore, to reduce the computational cost associated with the rendering of varifocal images and the optimization of a complex wave field, we design and train a convolutional neural network. The diffraction-engineered hologram network (DEHNet) synthesizes the complex wave field displaying appropriate blurred images depending on the focal distances while requiring only an RGB-D image as an input. Finally, we confirm the properties of the DEH through simulations and experiments to demonstrate an enhanced depth expression compared to conventional CGHs.

Results

Loss function for hologram synthesis

Assuming that a wave field at the z = 0 planes is given by ∣A(x, y)∣e^iϕ(x,y), the propagated wave field at the z = d_n plane calculated by the angular spectrum method (ASM)³⁵ is given as

$${{{{{{{{\rm{Prop}}}}}}}}}_{{d}_{n}}(|A(x,y)|{{{{{{\rm{e}}}}}}}^{{{{{{\rm{i}}}}}}\phi (x,y)})={F}^{-1}\left\{F\left\{|A({x}^{\prime},{y}^{\prime})|{{{{{{\rm{e}}}}}}}^{{{{{{\rm{i}}}}}}\phi ({x}^{\prime},{y}^{\prime})}\right\}{{{{{{\rm{e}}}}}}}^{{{{{{\rm{i}}}}}}{k}_{z}{d}_{n}}\right\},$$

(1)

where F(F⁻¹) is the Fourier (inverse Fourier) transform operator, ${{{{{{\rm{e}}}}}}}^{{{{{{\rm{i}}}}}}{k}_{z}{d}_{n}}$ is a propagation kernel with ${k}_{z}=\sqrt{{k}^{2}-{k}_{x}^{2}-{k}_{y}^{2}}$, and k_x(k_y) is the angular wavenumber along the x(y) direction. Here, a notable point of Eq. (1) is the fact that the propagation kernel ${{{{{{\rm{e}}}}}}}^{{{{{{\rm{i}}}}}}{k}_{z}{d}_{n}}$ does not alter the amplitude distribution in the Fourier domain, and so the amplitude distribution in the Fourier domain is sustained for every propagation distance. Considering that the diffraction angle is proportional to the spatial frequency^19,21, the application of a wide frequency range of phases is the only means to achieve sufficient defocus blur unless the intensity itself is composed of a wide range of frequencies.

However, the majority of high-quality non-diffusive CGH (nCGH) algorithms fix the phase as zero or as a position-dependent formula^15,16 to avoid speckles, thereby leaving the content-dependent defocus pattern unsolved. The DEH starts from this point. DEH is calculated by optimizing a wave field to possess a content-dependent phase so that the propagated wave field forms a clear image at the object-existing plane while forming a blurred image at other planes. As a target image for each propagated distance, we used varifocal images generated by a rendering process by changing the focal distance of a camera to ensure that blur considering occluded surfaces is efficiently reflected (Fig. 1b). After simulating the propagated intensity of the wave field using the ASM, we calculated the mean square error (MSE) between the propagated intensity and the varifocal image of which the focal distance is equal to the propagation distance (Fig. 1c). The wave field is compared with tens of varifocal images and it is updated using a gradient descent method. The optimization is iterated until the change of the wave field is negligible.

Compared to other researches^16,36 employing learning-based methods or optimization methods, occluded surfaces and defocus blur can be reflected on the reconstructed scene by means of explicitly comparing the propagated intensities and defocused images. Furthermore, to reconstruct sharply focused objects, the wave field is also compared with an all-in-focus image when the propagation distance is close to the depth of the objects (see Section 9 of the Supplementary material for further details of all-in-focus loss). Standard phase retrieval algorithms, e.g. the iterative Fourier transform algorithm, can be used in multi-plane holograms^30,33,34, but gradient descent optimization is employed to compare the wave field with the depth-weighted all-in-focus image.

In summary, the total loss function ${{{{{{{\mathcal{L}}}}}}}}$ for optimization is given by

$${{{{{{{\mathcal{L}}}}}}}}= \mathop{\sum }\limits_{n=1}^{N}\Bigg[\left\langle {\left|{\left|{{{{{{{{\rm{Prop}}}}}}}}}_{{d}_{n}}(|A(x,y)|{{{{{{\rm{e}}}}}}}^{{{{{{\rm{i}}}}}}\phi (x,y)})\right|}^{2}-{I}_{{d}_{n}}\right|}^{2}\right\rangle \\ \, +\beta \left\langle \bigg | {\left({\left|{{{{{{{{\rm{Prop}}}}}}}}}_{{d}_{n}}(|A(x,y)|{{{{{{\rm{e}}}}}}}^{{{{{{\rm{i}}}}}}\phi (x,y)})\right|}^{2}-{I}_{{{{{{{{\rm{AIF}}}}}}}}}\right){{{{{{\rm{e}}}}}}}^{-{\left(\gamma \;\frac{{D}_{n}-{d}_{n}}{{d}_{0}-{d}_{N}}\right)}^{2}}\bigg | }^{2}\right\rangle \Bigg],$$

(2)

where N is the number of varifocal images, ${I}_{{d}_{n}}$ is the intensity of the varifocal image at a focal distance d_n, I_AIF is the intensity of the all-in-focus image, D_n is the depth map normalized from d₀ to d_N with a focal distance d_n, β is the user-defined loss weight, and γ is the user-defined depth attention weight. Here, the depth map with defocus blur depending on the focal distance is used instead of an all-in-focus depth map to reflect the occluded surfaces (see the “Methods” section and Section 10 of the Supplementary material for further details). The first term in Eq. (2) represents the MSE of the propagated wave field compared to the varifocal images, while the second term represents the MSE of the propagated wave field compared to the depth-weighted all-in-focus image. In contrast to a DEH, conventional methods¹⁶ construct holograms by propagating each 3D point for a particular distance depending on its depth value and superposing the propagated points (Fig. 1d). The method simulates the propagation of the points by the ASM and also handles occluded surfaces by ignoring the backside wavefront when the backside and frontside wavefronts overlap.

Holograms depicting a scene with different-sized cubes were synthesized and in-focus (out-of-focus) conditions of the holograms were simulated as shown in Fig. 2a. Even in the out-of-focus conditions, the defocus blur of the nCGH cannot be seen, especially for the large cube, due to the content-dependent defocus pattern^19,21. Coherent propagation of the wave field forms a Fresnel diffraction pattern which differs from the defocus blur of incoherent light so the depth perception can be distorted²². In contrast, the out-of-focus image of the DEH displays a clear defocus blur even if the diameter of the blur circle is slightly smaller than that of the rendered image. However, the most significant drawback of the DEH is its computational load, since a number of varifocal images are required in addition to an optimization procedure. Since nCGH can be synthesized using only RGB-D images, DEHs are not practical in the majority of real-time applications.

**Fig. 2: Reconstructed intensities of different algorithms and schematics of DEHNet.**

To overcome such issues, a neural network (DEHNet) is trained to obtain a DEH from RGB-D images (Fig. 2b). The network is composed of 34 convolutions with 12 channels except for the last layer which includes a concatenated shortcut. Non-linearity and a wide receptive field are more important than hidden features so the number of channels is selected as small as possible to increase the number of convolutions and activations under a restricted computation resource (see Section 8 of the Supplementary material for image quality dependency on the number of channels). The training dataset consists of 3000 different scenes and each of these scenes contains 21 varifocal images, an all-in-focus color image, 21 varifocal depth maps, and an all-in-focus depth map (see Section 6 of the Supplementary material for the minimum number of planes required in DEHNet and Section 11 of the Supplementary material for results with a different number of planes). After training, the weights of the network were quantized to 8-bit integers to reduce computational load. The DEHNet can synthesize an optimal wave field that can reconstruct appropriately blurred and sharply focused images while considering occluded surfaces, and this can be achieved using only an all-in-focus color image and an all-in-focus depth map.

Image quality of reconstructed holograms

Figure 3 shows the simulated results for the DEH, the DEHNet, and the nCGH when the focus is adjusted to the frontside or backside of the scene. We only compared non-diffusive holograms in this paper because the purpose of the paper is synthesizing high-image-quality holograms. A comparison between diffusive holograms and DEH can be found in Section 2 of the Supplementary material. One of the differences between the nCGH and the DEH is the vivid boundary at the interface of the objects which are located at different depths as shown in the enlarged image in Fig. 3. An abrupt phase variation at the interface leads to two coherent beams with different phases coinciding at the interface; the constructive and destructive interferences then build a sharp boundary. Since blurred and sharply focused edges at the occluded surface boundary are used to judge the relative depths between objects²⁴, the presence of a distorted blur at an edge can be considered one of the most serious defects. Moreover, when a hole exists in an object, the hole is distorted by the depth difference between the object and the background.

**Fig. 3: Simulation results for the DEH, the DEHNet, and the nCGH.**

The image quality, including defocus blur as well as speckle noise, can be measured quantitatively by evaluating the peak signal-to-noise ratio (PSNR) and the structural similarity (SSIM) compared to the rendered images. While the optimized DEH exhibits the best PSNR (26.1 dB) and SSIM (0.88) values, the DEHNet also gives compatible results. In contrast, the nCGH gives significantly lower PSNR (19.8 dB) and SSIM (0.67) values. Here, the second term in Eq. (2) boosts the image quality of the in-focus objects, which results in a slight reduction in the PSNR. Without the second term, the PSNR increases slightly (0.6 ~ 0.8 dB) although the image quality at the focal plane is reduced. The nCGH algorithm used here only includes ASM propagation and consideration of occluded surfaces, but a comparison with the other algorithm¹⁶ can be found in Fig. S2. The other algorithm shows a similar weak defocus blur as the nCGH algorithm.

Experiments and benchmark

In order to concretely validate the DEHNet, an experimental demonstration is necessary. In an optical reconstruction, an amplitude-only spatial light modulator (SLM) with a 1920 × 1080 (FHD) resolution is used instead of a complex SLM. It is well known that an amplitude SLM can be used as a complex SLM by means of spatial filtering, although the spatial bandwidth of the SLM is lost³⁷. As confirmed by the simulation, the defocus blur is much weaker and a vivid boundary exists near the interface of the different-depth objects in the nCGH. As a consequence, it is difficult to perceive the depth of the 3D scene in the nCGH. This tendency is more apparent in the enlarged images shown in Fig. 4. Numerical and experimental reconstruction of nCGH and DEH with changing focal planes can be found in Supplementary videos. Details regarding the experimental setup and parameters can be found in the “Methods” section.

**Fig. 4: Experimental results of the DEHNet and the nCGH.**

Figure 5 shows the inference times of the various CGH-generation methods, which were evaluated on an NVIDIA V100 GPU using the FHD resolution images. Since an optimization-based DEH requires 500 iterations, the method requires more than 1 min to synthesize a hologram with superior image quality. However, we achieved a frame rate of 62 Hz using DEHNet, while losing only ~0.5 dB of the PSNR compared to the optimization method. Considering encoding time, the total frame rate is 57 Hz since the encoding process of amplitude-only hologram takes 0.89 ms.

The PSNR and SSIM were evaluated for two datasets with different resolutions to quantitatively measure the image quality. One dataset is composed of 512 × 512 resolution images (Fig. 5b) as in the case of the training dataset, while the other dataset is composed of FHD resolution images (Fig. 5c). The indicated metrics represent the mean values of the comparison results between all varifocal images and the corresponding holograms so the smoothness of defocus blur and the sharpness of the focused object are both reflected. In the 512 (FHD) resolution dataset, the DEHNet provides a 6.5 (6.3) dB enhancement in the PSNR and a 0.15 (0.07) enhancement in the SSIM compared to the nCGH. Both of the evaluation datasets are rendered with textures that differ from that of the training dataset to ensure that the performance of the trained network is not restricted to the training dataset. Benchmark results with various image quality metrics including learned perceptual image patch similarity metrics³⁸ can be found in Section 2 of the Supplementary material.

When the holograms are synthesized using real-world images instead of rendered images, it should be pointed out that incorrect values from the captured depth maps can induce severe noise. In the majority of cases, real-world-captured depth maps include depth holes and incorrect depth values³⁹ so the interference pattern distorts the objects when the object boundaries of the depth map are not consistent with those of the RGB image (Figs. 6 and 7). In contrast to the nCGH producing interference-induced black lines at the boundaries of noisy depth, the DEH provides noise-suppressed images at these boundaries. In some applications using measured depth maps, e.g. video see-through displays, the DEH would therefore give a superior image quality to the nCGH.

**Fig. 6: Simulation results obtained using a real-world RGB-D image.**

**Fig. 7: Experimental results obtained using a real-world RGB-D image.**

Discussions

Recent advances in CGH algorithms result in remarkable progress in image quality and computation time. However, those algorithms do not give attention to the weak defocus blur of synthesized holograms, which severely distorts depth perception^22,23. Because of the properties of static holograms, researchers believed that defocus blur would be correct in dynamic holographic displays. In this aspect, we demonstrate the difference between reconstructed images of the conventional holograms and realistic scenes depicting defocus blur depending on accommodation. Furthermore, we propose one solution to overcome the incorrect depth cue problem by adopting a neural network and the increments in the PSNR and SSIM metrics are substantial. We expect the DEHs could be widely used in holographic displays for virtual and augmented realities offering real-world-like 3D displays using currently available display devices.

Recently, several researchers reported enhanced accommodation of CGH by utilizing a number of the wave fields, the so-called time-multiplexing method^40,41. Although the time-multiplexing method presents great image quality and enhanced accommodation effect, real-time reconstruction of such hologram requires more than 10 times of computation resources compared to our method and thus it is still challenging to realize real-time reconstruction. Besides, another approach is recently proposed to remove occlusion artifacts by adopting a layered depth image in learning-based CGH algorithms⁴². However, the approach did not deal with the amount of defocus blur, distinctive from our method.

It should also be noted here that in some works, the multi-plane hologram refers to the hologram reconstructing multiple objects at different depths, as an antonym of the hologram that reconstructs multiple objects at a single depth³⁶. In contrast, we use the term to represent a hologram that can reconstruct numerous full-size images at the same time depending on the focal distance. As the latter hologram, our experiment shows a greatly enhanced image quality in comparison with that reported previously^30,31,32,33 despite the fact that more than 20 images were used as target images. The degraded image quality in previous experiments mainly originated from the high-frequency patterns that almost reached the pixel-pitch-limited frequency, since a phase-only SLM or an amplitude-only SLM was used instead of a complex SLM^30,34. Our experiment confirms that it is possible to reconstruct multiple intensities with great fidelity when the target intensities are gradually varied, suggesting the feasibility of real-time applications of multi-plane holograms, such as holographic optical tweezers³², one-step volumetric printings⁴³, and volumetric displays⁴⁴.

Methods

Determining the diameter of the blur circle

To construct large field of view (FoV) display systems, an SLM is magnified by a lens array. As a consequence, the maximum propagation distance of the hologram that allows the reconstruction of a virtual image with a depth from d to infinity is determined by the effective focal length of the lens array. By approximating the lens array as a thin lens, the maximum propagation distance of the hologram, Δz, can be calculated as⁴⁵,

$$\Delta z \,\approx\, \frac{1}{d}\frac{\Delta {x}^{2}{{{{{{{{\rm{res}}}}}}}}}^{2}}{4\,{\tan }^{2}\,({{{{{{{\rm{FoV}}}}}}}}/2)},$$

(3)

where Δx is the pixel pitch of the SLM, ${{{{{{{\rm{res}}}}}}}}$ is the resolution of the display, FoV is the field of view of the system, d is the virtual image distance of the floating object synthesized by the hologram, and the virtual image distance of the display is set to infinity. If we consider a 55° FoV, a 4K resolution, a 7.2 μm pixel pitch, and d = 0.35 m, then Δz = 2 mm is obtained from Eq. (3).

Under the specific display parameters that were considered herein, it is possible to calculate the diameter of a blur circle of a human eye when the eye is focused on infinity while the object synthesized by the hologram is floating at a distance of d. The diameter of a blur circle of an eye in units of display pixels, CoC_eye, is given by

$${{{{{{{{\rm{CoC}}}}}}}}}_{{{{{{{{\rm{eye}}}}}}}}}=\frac{A\cdot {{{{{{{\rm{res}}}}}}}}}{2d\cdot \tan ({{{{{{{\rm{FoV}}}}}}}}/2)},$$

(4)

where A is the pupil diameter. If the wave field of the hologram is partially blocked by the iris, the image quality degrades by the noise of the blocked wave field. Considering that the diameter of a pupil is larger than 1.5 mm in the majority of cases^46,47, A is set to 1.5 mm to avoid image degradation originating from a partially blocked wave field. From the above parameters, CoC_eye is 15 pixels and the aperture size of the rendering camera is set to satisfy the diameter of a blur circle of the rendered images.

Although the diameter of defocus blur of an nCGH can be enlarged by increasing the propagation distance, achieving a blur circle equivalent to that of a human eye is only possible under a small FoV (~10°). For example, if we increase the propagation distance to enlarge the diameter of the blur circle, the virtual image distance of the object(d) comes closer and the blur circle diameter of the eye (CoC_eye) is also increased. As a result, an increase in the diameter of the defocus blur under a fixed propagation distance is required to attain a human eye-equivalent defocus blur with a holographic display.

Experimental details

In the experiment, IRIS-U62 LCoS (liquid crystal on silicon) from MAY Inc. of which resolution is 3840 × 2160 and pixel pitch is 3.6 μm, is used as 1080p mode by putting the same pixel value in 2 × 2 nearest pixels to minimize pixel crosstalk originated from its small pixel pitch⁴⁸. As a result, the LCoS behaves as an FHD resolution amplitude-only LCoS with a pixel pitch of 7.2 μm. The distance between the minimum and maximum depths was set to 2 mm. The dispersion diameters by the pixel pitch diffraction are 25 (red), 20 (green), and 18 pixels (blue) under 2 mm light propagation. Considering that the maximum diameter of defocus blur of the rendered images is 15 pixels, the propagation distance should be longer than 1.7 mm. Since the modulated intensity non-linearly depends on the assigned values of the pixels, the amplitude was calibrated by measuring output values for each input pixel value. An off-axis hologram was adopted and the grating period was set to 0.25 of its maximum period to avoid unwanted noise. The Burch encoding method⁴⁹ was used to project the complex wave field onto real values. With an adjustable 2D slit, zeroth order and higher order diffractions are blocked (Fig. 8). As a light source, laser diodes with wavelengths of 638, 515, and 460 nm were used and were sequentially illuminated on the LCoS. To remove speckles caused by the coherence of the lasers, the holographic diffuser was rotated at the focused spot of the laser beams.

**Fig. 8: Schematic representation of the experimental setup.**

Phase noise of the amplitude-only SLM

Due to the properties of liquid crystals, it is inevitable that the amplitude-only SLM modulates the phase. The noise from such phase modulation can be avoided if an appropriate grating phase is applied. Assuming that amplitude modulation is given by f(x) and unwanted phase modulation is given by $\exp \{{{{{{\rm{i}}}}}}{p}_{1}\,f(x)+{{{{{\rm{i}}}}}}{p}_{2}\,f{(x)}^{2}\}$, then the wave field at the SLM is given as $f(x){{\rm {e}}}^{{{{{{\rm{i}}}}}}{p}_{1}\,f(x)+{{{{{\rm{i}}}}}}{p}_{2}\,f{(x)}^{2}}$. Here, we approximated the unwanted phase modulation as a second-order polynomial function of the amplitude modulation. To expand the expression, we employed the Jacobi–Angler expansion, ${{\rm {e}}}^{{{{{{\rm{i}}}}}}kz\cos \theta }=\mathop{\sum }\nolimits_{n=-\infty }^{\infty }{i}^{n}{J}_{n}(z){{\rm {e}}}^{{{{{{\rm{i}}}}}}n\theta }$, where J_n(z) is the nth Bessel function of the first kind. Using a Fourier series expansion, $f(x)={\sum }_{k}{F}_{k}\cos (kx+{\phi }_{k})$, the wave field at the SLM can be expressed as

$$f(x){{\rm {e}}}^{{{{{{\rm{i}}}}}}{p}_{1}\,f(x)+{{{\rm{i}}}}{p}_{2}f{(x)}^{2}}= f(x){{\rm {e}}}^{{{{{{\rm{i}}}}}}{p}_{1}({\sum }_{k}{F}_{k}\cos (kx+{\phi }_{k}))+{{{{{\rm{i}}}}}}{p}_{2}{\left({\sum }_{k}{F}_{k}\cos (kx+{\phi }_{k})\right)}^{2}}\\= f(x)\mathop{\prod}\limits_{k}\mathop{\sum}\limits_{n}{i}^{n}{J}_{n}({p}_{1}{F}_{k}){{\rm {e}}}^{{{{{{\rm{i}}}}}}n(kx+{\phi }_{k})}\\ \times \mathop{\prod}\limits_{k,l}\mathop{\sum}\limits_{n}{i}^{n}{J}_{n}({p}_{2}{F}_{k}{F}_{l}/2){{\rm {e}}}^{{{{{{\rm{i}}}}}}n((k+l)\,x+{\phi }_{k}+{\phi }_{l})}\\ \times \mathop{\prod}\limits_{k,l}\mathop{\sum}\limits_{n}{i}^{n}{J}_{n}({p}_{2}{F}_{k}{F}_{l}/2){{\rm {e}}}^{{{{{{\rm{i}}}}}}n((k-l)\,x+{\phi }_{k}-{\phi }_{l})}.$$

(5)

Fortunately, p₁, p₂, and F_k are <1 in our experiment, and so J_n(z) with ∣n∣ ≪ 1 can be neglected for those cases. As a result, Eq. (5) can be approximated as

$$f(x){{\rm {e}}}^{{{{{{\rm{i}}}}}}{p}_{1}f(x)+{{{\rm{i}}}}{p}_{2}f{(x)}^{2}} \approx \, f(x)\mathop{\prod}\limits_{k}{J}_{0}({p}_{1}{F}_{k}){\left(\mathop{\prod}\limits_{k,l}{J}_{0}({p}_{2}{F}_{k}{F}_{l}/2)\right)}^{2}\times \Bigg[\mathop{\sum}\limits_{m}\frac{i{J}_{1}({p}_{1}{F}_{m})}{{J}_{0}({p}_{1}{F}_{m})}{{\rm {e}}}^{{{{{{\rm{i}}}}}}(mx+{\phi }_{m})}\\ +\mathop{\sum}\limits_{m,n}\frac{2i{J}_{1}({p}_{2}{F}_{m}{F}_{n}/2)}{{J}_{0}({p}_{2}{F}_{m}{F}_{n}/2)}\left({{\rm {e}}}^{{{{{{\rm{i}}}}}}((m+n)x+{\phi }_{m}+{\phi }_{n})}+{{\rm {e}}}^{{{{{{\rm{i}}}}}}((m-n)x+{\phi }_{m}-{\phi }_{n})}\right) \\ +{{{{{{{\mathcal{O}}}}}}}}\left({\left({p}_{1}{F}_{k}\right)}^{2}\right)+{{{{{{{\mathcal{O}}}}}}}}\left({\left({p}_{2}{F}_{k}^{2}\right)}^{2}\right)\Bigg].$$

(6)

As we can see from Eq. (6), if a grating phase with a period ${{\rm {e}}}^{{{{{{\rm{i}}}}}}{k}_{{{{{{{{\rm{prism}}}}}}}}}x}$ is applied, then ${{\rm {e}}}^{{{{{{\rm{i}}}}}}{k}_{{{{{{{{\rm{prism}}}}}}}}}x}$, ${{\rm {e}}}^{-{{{{{\rm{i}}}}}}{k}_{{{{{{{{\rm{prism}}}}}}}}}x}$, ${{\rm {e}}}^{2{{{{{\rm{i}}}}}}{k}_{{{{{{{{\rm{prism}}}}}}}}}x}$, and a constant term is generated. Moreover, Burch encoding⁴⁹ generates its conjugate term ${{\rm {e}}}^{-{{{{{\rm{i}}}}}}{k}_{{{{{{{{\rm{prism}}}}}}}}}x}$ and its phase noise-induced terms. As a result, the ${{\rm {e}}}^{{{{{{\rm{i}}}}}}{k}_{{{{{{{{\rm{prism}}}}}}}}}x}$, ${{\rm {e}}}^{-{{{{{\rm{i}}}}}}{k}_{{{{{{{{\rm{prism}}}}}}}}}x}$, ${{\rm {e}}}^{-{{{{{\rm{i}}}}}}{k}_{{{{{{{{\rm{pitch}}}}}}}}}x+2{{{{{\rm{i}}}}}}{k}_{{{{{{{{\rm{prism}}}}}}}}}x}$, ${{\rm {e}}}^{-{{{{{\rm{i}}}}}}{k}_{{{{{{{{\rm{prism}}}}}}}}}x}$, ${{\rm {e}}}^{{{{{{\rm{i}}}}}}{k}_{{{{{{{{\rm{prism}}}}}}}}}x}$, ${{\rm {e}}}^{{{{{{\rm{i}}}}}}{k}_{{{{{{{{\rm{pitch}}}}}}}}}x-2{{{{{\rm{i}}}}}}{k}_{{{{{{{{\rm{prism}}}}}}}}}x}$ terms exist, where k_pitch is the wavenumber of the SLM pixel pitch and the terms such as ${{\rm {e}}}^{{{{{{\rm{i}}}}}}{k}_{{{{{{{{\rm{pitch}}}}}}}}}x-2{{{{{\rm{i}}}}}}{k}_{{{{{{{{\rm{prism}}}}}}}}}x}$ are created by the black matrix of the SLM. When the frequency of the grating phase is one-third of the spatial frequency of the pixel pitch, our signal term ${{\rm {e}}}^{{{{{{\rm{i}}}}}}{k}_{{{{{{{{\rm{prism}}}}}}}}}x}$ overlaps with the noise term ${{\rm {e}}}^{{{{{{\rm{i}}}}}}{k}_{{{{{{{{\rm{pitch}}}}}}}}}x-2{{{{{\rm{i}}}}}}{k}_{{{{{{{{\rm{prism}}}}}}}}}x}$ and the noise cannot be filtered. To avoid such noise, the frequency of the grating phase was set to one-quarter or less of the spatial frequency of the pixel pitch.

Generation of the training dataset

The objects in the 3D scene were randomly sampled from publicly available datasets^50,51,52,53 and each scene was rendered by Blender to have 21 varifocal images⁵⁴. The textures of the objects used in the training stage were randomly sampled from the CC0 texture library⁵⁵ and the textures of the objects used in the evaluation stage were sampled from the "Benchmark for 6D Object Pose Estimation” datasets^50,51,52,53. The colors, orientations, and intensities of the light sources were randomly sampled while the maximum intensity was restricted to prevent overexposure. When a scene is overexposed, intensity sums of each varifocal image could be different because the intensities become clipped. Since the propagation of light conserves its total energy, varifocal images with inconsistent intensity sums cannot be constructed with a single wave field.

The focal planes of each scene were equally spaced while the distances between the camera and the objects were significantly longer than the distances between the different objects to symmetrically blur either side of the focal plane. The symmetric blur in the rendered images is consistent with the asymmetric blur of an eye when a tiny display is magnified and projected to the eye. With the exception of the background, the pixel-wise statistics of the depth distribution were made almost uniform to prevent overfitting to a particular depth during training (see Section 7 of the Supplementary material for further details of the depth distribution).

Parameters of the loss function and the depth map with defocus blur

Since the objects in the scene can have any depth, the number of varifocal images was selected to be 21 pixels larger than the maximum diameter of the blur circle, while γ was fixed to 40 to avoid the simultaneous focusing of an object at two different focal planes. For an arbitrary object, the number of out-of-focus images (20) is significantly larger than the number of in-focus images (1) and so the reconstructed scene of the DEH is more influenced by the blurred images than the focused image. Thus, to apply a similar or higher weight to an in-focus image of objects, β was set to 20.

Although defocus blur is not considered while synthesizing a depth map in the majority of applications, we used a defocus-blur-considered depth map during the optimization and training processes to consider occluded surfaces. Normally, if we include defocusing blur when rendering a depth map, front depth values and rear depth values are blended at an edge of defocus blur of a front object. Instead, we sampled the depth map using only one ray per pixel and collected 10 depth maps for each focal distance. Among 10 depth values of each pixel, only the front-most depth value is used, so the depth values of the blurred pixels are confined to the depth of the front object. If we assume that one object is located at the front of the scene and another object is located at the rear of the scene, a blur circle of the rear object does not invade a focused image of the front object when the front object is focused. In contrast, a blur circle of the front object invades a focused image of the rear object when the rear object is focused (Fig. 9). Assuming that an all-in-focus depth map is used when comparing the depth-weighted all-in-focus image and the intensity of the hologram for the rear plane of focus (second term of Eq. (2)), the pixel weights of the rear object close to the front object are high even if the blur circle degrades the image quality. As a result, the loss function has a lower value when a sharply focused image is reconstructed near the boundary of the front object, ignoring the defocus blur of the front object. Such circumstances can be avoided when the defocus-blur-considered depth map is used for the second term of Eq. (2) since the rear object occupies a smaller area in this depth map than in the all-in-focus depth map for the rear plane of focus.

**Fig. 9: All-in-focus depth map and defocus-blur-considered depth map overlaid on a color image.**

In an aspect of the loss function, we tried to adopt multi-scale structural similarity (MS-SSIM) loss instead of MSE loss while synthesizing DEH. However, the effect of MS-SSIM loss was unclear and the DEH optimized by MS-SSIM loss suffered from defects. Comparison results can be found in Section 12 of the Supplementary material.

Training of the neural network

In the first stage of training, we used batch normalization layers in front of activation layers. When the validation loss stopped decreasing, the batch normalization layers and convolution layers were manually fused using running means and running variances. After fusing the batch normalization layers and convolution layers, the fused layers were trained again with the same dataset until the validation loss stopped decreasing. We used the Adam optimizer with a learning rate of 0.0005. We reset the internal parameters of the optimizer for every 50 epochs at the second stage of training. The batch size was 16 and the weights of the network are updated after 4 batch runs, yielding an effective batch size of 64. The training process took approximately 60 h using an NVIDIA V100 GPU. The trained neural network was symmetrically quantized using the TensorRT library and the same training dataset was fed to calibrate the quantization parameters.

Data availability

All relevant data that support the findings of this work are available from the corresponding author upon reasonable request. The configuration settings of BlenderProc used in synthesizing the training and evaluation datasets will be publicly available along with the paper.

Code availability

All relevant codes that support the findings of this work are available from the corresponding author upon reasonable request.

References

Gabor, D. A new microscopic principle. Nature 161, 777–778 (1948).
Article ADS PubMed CAS Google Scholar
Yaraş, F., Kang, H. & Onural, L. State of the art in holographic displays: a survey. J. Disp. Technol. 6, 443–454 (2010).
Article ADS CAS Google Scholar
Geng, J. Three-dimensional display technologies. Adv. Opt. Photonics 5, 456–535 (2013).
Article ADS PubMed PubMed Central CAS Google Scholar
Hoffman, D. M., Girshick, A. R., Akeley, K. & Banks, M. S. Vergence–accommodation conflicts hinder visual performance and cause visual fatigue. J. Vis. 8, 33–33 (2008).
Article PubMed Google Scholar
Watt, S. J., Akeley, K., Ernst, M. O. & Banks, M. S. Focus cues affect perceived depth. J. Vis. 5, 7–7 (2005).
Article Google Scholar
Warnick, K. F. & Chew, W. C. Numerical simulation methods for rough surface scattering. Waves Random Media 11, R1 (2001).
Article ADS MathSciNet MATH Google Scholar
Colburn, W. & Haines, K. Volume hologram formation in photopolymer materials. Appl. Opt. 10, 1636–1641 (1971).
Article ADS PubMed CAS Google Scholar
Pan, Y., Liu, J., Li, X. & Wang, Y. A review of dynamic holographic three-dimensional display: algorithms, devices, and systems. IEEE Trans. Ind. Inform. 12, 1599–1610 (2015).
Article Google Scholar
Haist, T. & Osten, W. Holography using pixelated spatial light modulators—Part 1: theory and basic considerations. J. Micro/Nanolithogr. MEMS MOEMS 14, 041310 (2015).
Article ADS Google Scholar
Makowski, M. Minimized speckle noise in lens-less holographic projection by pixel separation. Opt. Express 21, 29205–29216 (2013).
Article ADS PubMed Google Scholar
Yu, H., Lee, K., Park, J. & Park, Y. Ultrahigh-definition dynamic 3d holographic display by active control of volume speckle fields. Nat. Photonics 11, 186–192 (2017).
Article ADS CAS Google Scholar
Zhao, Y., Cao, L., Zhang, H., Kong, D. & Jin, G. Accurate calculation of computer-generated holograms using angular-spectrum layer-oriented method. Opt. Express 23, 25440–25449 (2015).
Article ADS PubMed CAS Google Scholar
Zhao, T., Liu, J., Duan, J., Li, X. & Wang, Y. Image quality enhancement via gradient-limited random phase addition in holographic display. Opt. Commun. 442, 84–89 (2019).
Article ADS CAS Google Scholar
Tsang, P., Poon, T.-C. & Wu, Y. Review of fast methods for point-based computer-generated holography. Photonics Res. 6, 837–846 (2018).
Article Google Scholar
Maimone, A., Georgiou, A. & Kollin, J. S. Holographic near-eye displays for virtual and augmented reality. ACM Trans. Graph. (Tog) 36, 1–16 (2017).
Article Google Scholar
Shi, L., Li, B., Kim, C., Kellnhofer, P. & Matusik, W. Towards real-time photorealistic 3d holography with deep neural networks. Nature 591, 234–239 (2021).
Article ADS PubMed CAS Google Scholar
Chang, C. et al. Speckle-suppressed phase-only holographic three-dimensional display based on double-constraint Gerchberg–Saxton algorithm. Appl. Opt. 54, 6994–7001 (2015).
Article ADS PubMed Google Scholar
Chakravarthula, P., Peng, Y., Kollin, J., Fuchs, H. & Heide, F. Wirtinger holography for near-eye displays. ACM Trans. Graph. (TOG) 38, 1–13 (2019).
Article Google Scholar
Shimobaba, T. & Ito, T. Random phase-free computer-generated hologram. Opt. Express 23, 9549–9554 (2015).
Article ADS PubMed Google Scholar
Pang, H., Wang, J., Cao, A. & Deng, Q. High-accuracy method for holographic image projection with suppressed speckle noise. Opt. Express 24, 22766–22776 (2016).
Article ADS PubMed Google Scholar
Ko, S.-B. & Park, J.-H. Speckle reduction using angular spectrum interleaving for triangular mesh based computer generated hologram. Opt. Express 25, 29788–29797 (2017).
Article ADS PubMed Google Scholar
Mather, G. & Smith, D. R. Blur discrimination and its relation to blur-mediated depth perception. Perception 31, 1211–1219 (2002).
Article PubMed Google Scholar
Zannoli, M., Love, G. D., Narain, R. & Banks, M. S. Blur and the perception of depth at occlusions. J. Vis. 16, 17–17 (2016).
Article PubMed Google Scholar
Marshall, J. A., Burbeck, C. A., Ariely, D., Rolland, J. P. & Martin, K. E. Occlusion edge blur: a cue to relative visual depth. JOSA A 13, 681–688 (1996).
Article ADS PubMed CAS Google Scholar
Zhang, J., Pégard, N., Zhong, J., Adesnik, H. & Waller, L. 3d computer-generated holography by non-convex optimization. Optica 4, 1306–1313 (2017).
Article ADS Google Scholar
Zhou, P., Li, Y., Liu, S. & Su, Y. Dynamic compensatory Gerchberg–Saxton algorithm for multiple-plane reconstruction in holographic displays. Opt. Express 27, 8958–8967 (2019).
Article ADS PubMed Google Scholar
Velez-Zea, A., Barrera-Ramírez, J. F. & Torroba, R. Improved phase hologram generation of multiple 3d objects. Appl. Opt. 61, 3230–3239 (2022).
Article ADS PubMed Google Scholar
Chen, C. et al. Multi-depth hologram generation using stochastic gradient descent algorithm with complex loss function. Opt. Express 29, 15089–15103 (2021).
Article ADS PubMed CAS Google Scholar
Ying, C.-f, Pang, H., Fan, C.-j. & Zhou, W.-d New method for the design of a phase-only computer hologram for multiplane reconstruction. Opt. Eng. 50, 055802 (2011).
Article ADS Google Scholar
Makowski, M., Sypek, M., Kolodziejczyk, A., Mikula, G. & Suszek, J. Iterative design of multiplane holograms: experiments and applications. Opt. Eng. 46, 045802 (2007).
Article ADS Google Scholar
Makey, G. et al. Breaking crosstalk limits to dynamic holography using orthogonality of high-dimensional random vectors. Nat. Photonics 13, 251–256 (2019).
Article ADS PubMed PubMed Central CAS Google Scholar
Sinclair, G. et al. Interactive application in holographic optical tweezers of a multi-plane Gerchberg–Saxton algorithm for three-dimensional light shaping. Opt. Express 12, 1665–1670 (2004).
Article ADS PubMed Google Scholar
Dorsch, R. G., Lohmann, A. W. & Sinzinger, S. Fresnel ping-pong algorithm for two-plane computer-generated hologram display. Appl. Opt. 33, 869–875 (1994).
Article ADS PubMed CAS Google Scholar
Makowski, M., Sypek, M., Kolodziejczyk, A. & Mikula, G. Three-plane phase-only computer hologram generated with iterative fresnel algorithm. Opt. Eng. 44, 125805 (2005).
Article ADS Google Scholar
Matsushima, K. & Shimobaba, T. Band-limited angular spectrum method for numerical simulation of free-space propagation in far and near fields. Opt. Express 17, 19662–19673 (2009).
Article ADS PubMed CAS Google Scholar
Peng, Y., Choi, S., Padmanaban, N. & Wetzstein, G. Neural holography with camera-in-the-loop training. ACM Trans. Graph. 39, 1–14 (2020).
Article Google Scholar
Arrizón, V., Méndez, G. & Sánchez-de La-Llave, D. Accurate encoding of arbitrary complex fields with amplitude-only liquid crystal spatial light modulators. Opt. Express 13, 7913–7927 (2005).
Article ADS PubMed Google Scholar
Zhang, R., Isola, P., Efros, A. A., Shechtman, E. & Wang, O. The unreasonable effectiveness of deep features as a perceptual metric. In Proc. IEEE Conference on Computer Vision and Pattern Recognition, 586–595 (IEEE, 2018).
Scharstein, D. et al. High-resolution stereo datasets with subpixel-accurate ground truth. In German Conference on Pattern Recognition, 31–42 (Springer, 2014).
Lee, B., Kim, D., Lee, S., Chen, C. & Lee, B. High-contrast, speckle-free, true 3d holography via binary cgh optimization. Sci. Rep. 12, 1–12 (2022).
ADS Google Scholar
Kim, D., Nam, S.-W., Lee, B., Seo, J.-M. & Lee, B. Accommodative holography: improving accommodation response for perceptually realistic holographic displays. ACM Trans. Graph. 41, 1–15 (2022).
Google Scholar
Shi, L., Li, B. & Matusik, W. End-to-end learning of 3d phase-only holograms for holographic display. Light: Sci. Appl. 11, 1–18 (2022).
Article CAS Google Scholar
Shusteff, M. et al. One-step volumetric additive manufacturing of complex polymer structures. Sci. Adv. 3, eaao5496 (2017).
Article PubMed PubMed Central CAS Google Scholar
Smalley, D. et al. A photophoretic-trap volumetric display. Nature 553, 486–490 (2018).
Article ADS PubMed CAS Google Scholar
Saleh, B. E. & Teich, M. C. Fundamentals of Photonics (John Wiley & Sons, 2019).
Alexandridis, E. Pupil size. In The Pupil, 11–12 (Springer, 1985).
Ren, P. et al. Off-line and on-line stress detection through processing of the pupil diameter signal. Ann. Biomed. Eng. 42, 162–176 (2014).
Article PubMed Google Scholar
Lazarev, G., Hermerschmidt, A., Krüger, S. & Osten, S. Lcos spatial light modulators: trends and applications. Opt. Imaging Metrol.: Adv. Technol. 1, 1–23 (2012).
Burch, J. A computer algorithm for the synthesis of spatial frequency filters. Proc. IEEE 55, 599–601 (1967).
Article Google Scholar
Hodaň, T. et al. Bop: Benchmark for 6d object pose estimation. In Proc. European Conference on Computer Vision (ECCV), 19–34 (Springer, 2018).
Kaskman, R., Zakharov, S., Shugurov, I. & Ilic, S. Homebreweddb: Rgb-d dataset for 6d pose estimation of 3d objects. In Proc. IEEE/CVF International Conference on Computer Vision Workshops, 0–0 (IEEE, 2019).
Hodan T. et al. T-less: An rgb-d dataset for 6d pose estimation of texture-less objects. In 2017 IEEE Winter Conference on Applications of Computer Vision (WACV), 880–888 (IEEE, 2017).
Xiang, Y., Schmidt, T., Narayanan, V. & Fox, D. PoseCNN: A convolutional neural network for 6d object pose estimation in cluttered scenes. Robotics: Science and Systems (RSS) (MIT Press, 2018).
Denninger, M. et al. Blenderproc. arXiv preprint arXiv:1911.01911 (2019).
Demes, L. Cc0 texture library (accessed 17 November 2021); https://cc0textures.com/ (2017).
Kim, C., Zimmer, H., Pritch, Y., Sorkine-Hornung, A. & Gross, M. H. Scene reconstruction from high spatio-angular resolution light fields. ACM Trans. Graph. 32, 73–1 (2013).
Article MATH Google Scholar
Miangoleh, S. M. H., Dille, S., Mai, L., Paris, S. & Aksoy, Y. Boosting monocular depth estimation models to high-resolution via content-adaptive multi-resolution merging. In Proc. IEEE/CVF Conference on Computer Vision and Pattern Recognition, 9685–9694 (IEEE, 2021).

Download references

Author information

Authors and Affiliations

Samsung Advanced Institute of Technology, Samsung Electronics, Suwon, Gyeonggi-do, South Korea
Daeho Yang, Wontaek Seo, Hyeonseung Yu, Sun Il Kim, Bongsu Shin, Chang-Kun Lee, Seokil Moon, Jungkwuen An, Jong-Young Hong, Geeyoung Sung & Hong-Seok Lee
Department of Electrical and Computer Engineering, Seoul National University, Seoul, South Korea
Hong-Seok Lee

Authors

Daeho Yang
View author publications
You can also search for this author in PubMed Google Scholar
Wontaek Seo
View author publications
You can also search for this author in PubMed Google Scholar
Hyeonseung Yu
View author publications
You can also search for this author in PubMed Google Scholar
Sun Il Kim
View author publications
You can also search for this author in PubMed Google Scholar
Bongsu Shin
View author publications
You can also search for this author in PubMed Google Scholar
Chang-Kun Lee
View author publications
You can also search for this author in PubMed Google Scholar
Seokil Moon
View author publications
You can also search for this author in PubMed Google Scholar
Jungkwuen An
View author publications
You can also search for this author in PubMed Google Scholar
Jong-Young Hong
View author publications
You can also search for this author in PubMed Google Scholar
Geeyoung Sung
View author publications
You can also search for this author in PubMed Google Scholar
Hong-Seok Lee
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

D.Y. conceived the idea and wrote the manuscript. W.S. and H.Y. were involved in developing the proposed algorithm. D.Y. performed the experiments with help from W.S., S.I.K., B.S., C.-K. L., and S.M. H.Y., J.K., J.-Y.H., and G.S. contributed to the theoretical investigations. H.-S.L. supervised overall work. All authors participated in discussions and contributed to the manuscript.

Corresponding author

Correspondence to Hong-Seok Lee.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks Alejandro Velez Zea and the other anonymous reviewer(s) for their contribution to the peer review of this work. Peer review reports are available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer review file

Description to Additional Supplementary Information

Supplementary Movie 1

Supplementary Movie 2

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Yang, D., Seo, W., Yu, H. et al. Diffraction-engineered holography: Beyond the depth representation limit of holographic displays. Nat Commun 13, 6012 (2022). https://doi.org/10.1038/s41467-022-33728-5

Download citation

Received: 26 May 2022
Accepted: 28 September 2022
Published: 12 October 2022
DOI: https://doi.org/10.1038/s41467-022-33728-5
Springer Nature Limited

This article is cited by

Non-convex optimization for inverse problem solving in computer-generated holography
- Xiaomeng Sui
- Zehao He
- Liangcai Cao
Light: Science & Applications (2024)
Liquid lens based holographic camera for real 3D scene hologram acquisition using end-to-end physical model-driven network
- Di Wang
- Zhao-Song Li
- Qiong-Hua Wang
Light: Science & Applications (2024)
Neural étendue expander for ultra-wide-angle high-fidelity holographic display
- Ethan Tseng
- Grace Kuo
- Felix Heide
Nature Communications (2024)
Real-time 4K computer-generated hologram based on encoding conventional neural network with learned layered phase
- Chongli Zhong
- Xinzhu Sang
- Shuo Chen
Scientific Reports (2023)

Diffraction-engineered holography: Beyond the depth representation limit of holographic displays

Abstract

Similar content being viewed by others

Introduction

Results

Loss function for hologram synthesis

Image quality of reconstructed holograms

Experiments and benchmark

Discussions

Methods

Determining the diameter of the blur circle

Experimental details

Phase noise of the amplitude-only SLM

Generation of the training dataset

Parameters of the loss function and the depth map with defocus blur

Training of the neural network

Data availability

Code availability

References

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Peer review

Peer review information

Additional information

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Search

Navigation