1 Introduction

In wave optics, an optical field is expressed as a complex amplitude, which describes both the amplitude and phase of a light wave [1]. Conventional cameras observe only the amplitude information. Phase information is important for visualizing transparent objects, e.g., biological cells in biology, and for compensating for aberrations and scattering with adaptive optics, e.g., in deep-tissue imaging in biomedicine and in telescopic imaging through atmospheric turbulence in astronomy.

An established approach to complex-amplitude imaging, in which the transmittance/reflectance and phase delay of an object are observed, is digital holography (DH) [2,3,4,5]. DH generally assumes coherent illumination, such as a laser, although several incoherent DH methods have been proposed [6,7,8,9,10]. One issue with these incoherent DH methods is a tradeoff between the number of shots and the space–bandwidth product, which arises from the use of a spatial or temporal reference carrier.

Another approach to phase imaging with incoherent light is to use a Shack–Hartmann wavefront sensor [11]. Single-shot complex-amplitude/phase imaging methods based on the Shack–Hartmann wavefront sensor utilize a lens array or a holographic optical element to observe a stereo or plenoptic image, which results in a tradeoff between the spatial and angular resolutions [12,13,14,15,16,17]. Thus, these methods compromise between the space–bandwidth product and the simplicity of the optical setup.

In this paper, to solve the above issues, we propose a method for single-shot complex-amplitude imaging with incoherent light and no imaging optics. We verify the effectiveness of a deep convolutional neural network as the reconstruction algorithm and of a coded aperture (CA) for improving the reconstruction fidelity. This approach extends the range of applications of complex-amplitude imaging and demonstrates the importance of machine-learning techniques in optical sensing.

2 Method

Fig. 1 Schematic diagram of single-shot, lensless complex-amplitude imaging with/without a CA

In the proposed method, a complex-amplitude object field illuminated with incoherent light is captured by an image sensor with or without a coded aperture (CA), as shown in Fig. 1. The CA is used to improve the conditioning of the reconstruction because its Fourier spectrum is significantly broader than that of a conventional, non-coded aperture. Using CAs, single-shot amplitude imaging with incoherent light and single-shot reference-free DH with coherent light have been demonstrated [18,19,20,21,22,23].
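To illustrate this point, the following NumPy sketch compares the magnitude spectra of a clear aperture and a random binary CA (an illustrative comparison only; the pattern size and random seed are arbitrary assumptions, not values from the experiments):

```python
import numpy as np

rng = np.random.default_rng(0)
n = 256

open_aperture = np.ones((n, n))                                  # conventional non-coded aperture
coded_aperture = rng.integers(0, 2, size=(n, n)).astype(float)   # random binary CA

# The clear aperture concentrates its energy near zero frequency, while the
# random CA spreads energy across the whole spectrum, which improves the
# conditioning of the inverse problem.
spec_open = np.abs(np.fft.fftshift(np.fft.fft2(open_aperture)))
spec_coded = np.abs(np.fft.fftshift(np.fft.fft2(coded_aperture)))
```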

The imaging process of the proposed method is written in general form as

$$\begin{aligned} \varvec{g}&=\mathcal {H}[\varvec{f}] \end{aligned}$$
(1)
$$\begin{aligned} \varvec{g}&=\mathcal {H}[\varvec{f}_{r}\exp (i\varvec{f}_{\phi })], \end{aligned}$$
(2)

where \(\varvec{g}\in \mathbb {R}^{N^2\times 1}\) is the captured intensity image and \(\mathcal {H}[\bullet ]\) is the forward operator. \(\varvec{f}\in \mathbb {C}^{N^2\times 1}\), \(\varvec{f}_{r}\in \mathbb {R}^{N^2\times 1}\), and \(\varvec{f}_{\phi }\in \mathbb {R}^{N^2\times 1}\) are the complex amplitude, amplitude, and phase of the object field, respectively, with \(\varvec{f}=\varvec{f}_{r}\exp (i\varvec{f}_{\phi })\). Here, N is the number of pixels along one spatial direction, \(\mathbb {R}^{P\times Q}\) denotes the set of \(P\times Q\) real matrices, and \(\mathbb {C}^{P\times Q}\) denotes the set of \(P\times Q\) complex matrices.
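For concreteness, the following minimal NumPy sketch assembles the vectorized object field of Eq. (2) from its amplitude and phase parts (\(N=32\) matches the image size used in Sect. 3; the random values are placeholders for actual object data):

```python
import numpy as np

N = 32  # pixels per side, as in the experiments (Sect. 3)
rng = np.random.default_rng(0)

f_r = rng.random(N * N)              # amplitude f_r, real-valued, length N^2
f_phi = np.pi * rng.random(N * N)    # phase f_phi in [0, pi], as in Fig. 3
f = f_r * np.exp(1j * f_phi)         # complex-amplitude object field, Eq. (2)

# The captured image g = H[f] is real-valued: an intensity measurement whose
# forward operator H depends on the optics (free-space propagation and,
# optionally, the CA).
```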

As indicated by Eq. (2), the inversions for reconstructing the amplitude and the phase are separable. In this paper, for simplicity, we solve the following two inversions independently:

$$\begin{aligned} \hat{\varvec{f}}_{r}&=\mathcal {H}^{-1}_{r}[\varvec{g}], \end{aligned}$$
(3)
$$\begin{aligned} \hat{\varvec{f}}_{\phi }&=\mathcal {H}^{-1}_{\phi }[\varvec{g}], \end{aligned}$$
(4)

where \(\mathcal {H}^{-1}_{r}[\bullet ]\) and \(\mathcal {H}^{-1}_{\phi }[\bullet ]\) are the inverse operators for the amplitude and the phase, respectively.

We use a regression algorithm for the inversions in Eqs. (3) and (4) in the reconstruction process. Regression methods, including deep learning, are standard techniques in machine learning, and they have been applied to optical sensing and control [24,25,26,27,28,29]. In addition, several studies on phase retrieval, ghost imaging, and superresolution imaging with deep learning have been reported [30,31,32]. In this paper, we adapt a deep convolutional residual network (ResNet [33]) for each of the inversions. Residual learning in ResNet prevents stagnation during training and optimizes deep layers efficiently. ResNet has been used for phase retrieval and computer-generated holography, with favorable results [29, 30].

Fig. 2 Network design for the reconstruction. Structures of the (a) whole network, (b) D-block, (c) U-block, (d) R-block, and (e) S-block

The network architecture for the reconstruction is shown in Fig. 2. The network is composed of multi-scale ResNets, as shown in Fig. 2a, where K is the number of filters in each convolutional layer. "D" is a downsampling block (Fig. 2b), "U" is an upsampling block (Fig. 2c), "R" is a residual convolutional block (Fig. 2d), and "S" is a skip convolutional block (Fig. 2e). The layers are defined as follows [34]: "Conv(S, L)" is a two-dimensional convolutional layer with filter size S and stride L; "TConv(S, L)" is a two-dimensional transposed convolutional layer with filter size S and stride L; "BatchNorm" is a batch normalization layer; and "ReLU" is a rectified linear unit layer.
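As a concrete illustration, a minimal PyTorch sketch of the main block types is given below. The exact layer ordering inside each block follows Fig. 2, which we approximate here with a standard residual layout; the kernel sizes, strides, and class names are assumptions, not the paper's exact specification.

```python
import torch
import torch.nn as nn

class RBlock(nn.Module):
    """Residual convolutional block ("R" in Fig. 2d, assumed layout):
    Conv-BatchNorm-ReLU stages with an identity skip, so the block
    learns a residual correction to its input."""
    def __init__(self, k: int):
        super().__init__()
        self.body = nn.Sequential(
            nn.Conv2d(k, k, kernel_size=3, stride=1, padding=1),
            nn.BatchNorm2d(k),
            nn.ReLU(inplace=True),
            nn.Conv2d(k, k, kernel_size=3, stride=1, padding=1),
            nn.BatchNorm2d(k),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return torch.relu(self.body(x) + x)  # residual learning

class DBlock(nn.Module):
    """Downsampling block ("D" in Fig. 2b, assumed layout): a strided
    convolution halves the spatial resolution."""
    def __init__(self, in_ch: int, out_ch: int):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(in_ch, out_ch, kernel_size=4, stride=2, padding=1),
            nn.BatchNorm2d(out_ch),
            nn.ReLU(inplace=True),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.net(x)

class UBlock(nn.Module):
    """Upsampling block ("U" in Fig. 2c, assumed layout): a transposed
    convolution (TConv) doubles the spatial resolution."""
    def __init__(self, in_ch: int, out_ch: int):
        super().__init__()
        self.net = nn.Sequential(
            nn.ConvTranspose2d(in_ch, out_ch, kernel_size=4, stride=2, padding=1),
            nn.BatchNorm2d(out_ch),
            nn.ReLU(inplace=True),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.net(x)
```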

The network architectures for reconstructing the amplitude and phase are the same, as shown in Fig. 2a, but they are trained with different datasets. The amplitude datasets are composed of pairs of amplitude images \(\varvec{f}_{r}\) and their captured intensity images \(\varvec{g}\). On the other hand, the phase datasets are composed of pairs of phase images \(\varvec{f}_{\phi }\) and their captured intensity images \(\varvec{g}\).

3 Experimental demonstration

The proposed method was demonstrated experimentally. The complex-amplitude object was implemented with two spatial light modulators (SLMs: LC 2012, Holoeye; pixel pitch: 36 µm, pixel count: \(768\times 1024\)); one was operated in the phase mode, and the other in the amplitude mode. The phase and amplitude SLMs were located 5 and 7 cm from an image sensor (PL-B953, PixeLink; pixel pitch: 4.65 µm, pixel count: \(768\times 1024\)), respectively. The two SLMs were illuminated with an incoherent green light-emitting diode (M565L3, Thorlabs; nominal wavelength: 565 nm, spectral full width at half maximum: 103 nm) without any spatial or spectral filter. Note that our method does not require the amplitude and phase layers to be at different distances; a single complex-amplitude layer can be trained in the same manner by implementing such an object. In the case with the CA, the CA was printed on an overhead projector film (OHP film: VF-1421N, Kokuyo) by a laser printer (ApeosPort-V C2276, Fuji Xerox; resolution: 1200 dpi) and was located 1 cm from the image sensor. The CA was a random binary pattern at the maximum printable resolution.

The amplitude and phase images simultaneously displayed on the two SLMs were different handwritten digits randomly selected from the EMNIST database, with a pixel count of \(28\times 28\) [35]. The captured intensity images were reduced to \(28\times 28\) pixels and then expanded to \(32\times 32\) pixels (i.e., \(N=32\)) with zero padding to match the network's input size, since the input and output image sizes must be powers of two. These resized captured intensity images, together with the original amplitude and phase images (also zero-padded to \(32\times 32\) pixels), were provided to the network, which reconstructs the zero-padded amplitude and phase images. The output images from the network were cropped to remove the zero-padding area, producing the final amplitude and phase images with a pixel count of \(28\times 28\). The number of convolutional filters was \(K=64\). The Adam algorithm [36] was used to optimize the network, with an initial learning rate of 0.0001, a mini-batch size of 50, and a maximum of 50 epochs. The number of training pairs was 100,000 for both the amplitude and phase reconstructions.
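For reference, a minimal PyTorch sketch of this training setup is given below, using the stated hyperparameters (Adam, learning rate 0.0001, mini-batch size 50, 50 epochs). The placeholder network, the random tensors standing in for the experimental dataset, and the mean-squared-error loss are all assumptions for illustration; the paper does not specify them.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F
from torch.utils.data import DataLoader, TensorDataset

# Placeholder network and data standing in for the multi-scale ResNet of
# Fig. 2 and the experimental (captured image, label image) pairs.
model = nn.Sequential(
    nn.Conv2d(1, 64, kernel_size=3, padding=1), nn.ReLU(),
    nn.Conv2d(64, 1, kernel_size=3, padding=1),
)
data = TensorDataset(torch.rand(200, 1, 32, 32), torch.rand(200, 1, 32, 32))
train_loader = DataLoader(data, batch_size=50, shuffle=True)  # mini-batch size 50

optimizer = torch.optim.Adam(model.parameters(), lr=1e-4)  # initial learning rate 0.0001

for epoch in range(50):  # maximum of 50 epochs
    for g, f_target in train_loader:
        # g: zero-padded 32x32 captured intensity image; f_target: zero-padded
        # 32x32 amplitude (or phase) image, depending on which network is trained.
        optimizer.zero_grad()
        loss = F.mse_loss(model(g), f_target)  # loss choice is an assumption
        loss.backward()
        optimizer.step()
```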

Fig. 3 Ten examples of (a) the untrained original amplitude images and (b) the untrained original phase images, where phases are normalized to the interval \([0,\pi ]\)

Fig. 4 Experimental results without the CA. (a) Intensity images captured from the amplitude and phase images in Fig. 3. (b) Amplitude images and (c) phase images reconstructed from the captured images in (a), where phases are normalized to the interval \([0,\pi ]\)

Fig. 5 Experimental results with the CA. (a) Intensity images captured from the amplitude and phase images in Fig. 3. (b) Amplitude images and (c) phase images reconstructed from the captured images in (a), where phases are normalized to the interval \([0,\pi ]\)

The experimental results without the CA are shown in Figs. 3 and 4. The trained amplitude and phase networks were tested with 1000 amplitude images and 1000 phase images, respectively, that had not been used in the training process. Ten examples of the amplitude and phase test images are shown in Fig. 3a and b, respectively. The intensity images captured from the amplitude and phase images in Fig. 3 are shown in Fig. 4a, and the amplitude and phase images reconstructed from them are shown in Fig. 4b and c. The peak signal-to-noise ratios (PSNRs) of the 1000 reconstructed amplitude images and the 1000 reconstructed phase images were 20.9 and 14.3 dB, respectively. These results demonstrate the effectiveness of the deep network for complex-amplitude imaging with incoherent light.
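For clarity, the PSNR metric reported here can be computed as follows (a minimal NumPy sketch; the peak value of 1.0 assumes images normalized to [0, 1], which the paper does not state explicitly):

```python
import numpy as np

def psnr(reference: np.ndarray, reconstruction: np.ndarray, peak: float = 1.0) -> float:
    """Peak signal-to-noise ratio in decibels."""
    mse = np.mean((reference - reconstruction) ** 2)
    return 10.0 * np.log10(peak ** 2 / mse)
```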

The experimental results with the CA are shown in Fig. 5. The network was trained and tested with the same amplitude and phase images as in the previous experiment, but with intensity images captured through the CA. The intensity images captured with the CA from the amplitude and phase images in Fig. 3 are shown in Fig. 5a, and the reconstructed amplitude and phase images are shown in Fig. 5b and c. The PSNRs of the amplitude and phase images were 23.5 and 15.3 dB, respectively. These results show that the CA improved both the amplitude and phase reconstructions.

In the experiments, the calculation time for reconstructing the test images was 0.5 s on a computer with dual Intel Xeon processors (clock rate: 2.1 GHz) and 128 GB of memory, together with an NVIDIA Quadro P4000 graphics processing unit with 8 GB of memory. The training time was 18 h.

4 Conclusion

We presented a method for single-shot, lensless complex-amplitude imaging with incoherent light and no imaging optics. A deep convolutional neural network was used to reconstruct both the amplitude and the phase from a single intensity image. We demonstrated the proposed method experimentally on handwritten-digit datasets, with a CA implemented on an OHP film. The results showed the effectiveness of the deep network and the CA for complex-amplitude imaging with incoherent light.

Our method reduces the hardware size/cost and the measurement time in incoherent complex-amplitude imaging while maintaining image quality. It is readily extendable to a reflective setup and to multi-dimensional complex-amplitude imaging, such as three-dimensional spatial imaging and spectral imaging. The CA is not limited to amplitude modulation; phase or complex-amplitude modulation can also be applied [22]. Our method is also applicable to non-visible spectral regions, such as X-rays, infrared light, and terahertz radiation. Therefore, the method is promising in various fields, including biology, security, and astronomy.