1 Introduction

Fluorescence microscopy is a standard tool for imaging biological samples [15]. Images acquired with confocal [3] as well as light-sheet microscopes [4], however, are inherently anisotropic, owing to a 3D optical point-spread function (PSF) that is elongated along the axial (z) direction, which typically leads to a 2- to 4-fold lower resolution along this axis. Furthermore, due to the mechanical plane-by-plane acquisition modality of most microscopes, the axial sampling is reduced as well, further lowering the overall resolution by a factor of 4 to 8. These effects render downstream data analysis, e.g. cell segmentation, difficult.

Fig. 1.

(a) 3D images acquired on light microscopes are notoriously anisotropic due to axial undersampling and optical point spread function (PSF) anisotropy. (b) The IsoNet-2 architecture has a U-net [13] like topology and is trained to restore anisotropically blurred/downsampled lateral patches. After training it is applied to the axial views.

To circumvent this problem, multiple techniques are known and used. Classical deconvolution methods [9, 12] are arguably the most common; they can be applied to already acquired data, but their performance is typically inferior to that of more complex techniques. Some confocal systems, e.g. when using two-photon excitation with high-numerical-aperture objectives and isotropic axial sampling, can acquire almost isotropic volumes [3, 10] (cf. Fig. 3), at the cost of low acquisition speed, high phototoxicity/bleaching, and large file sizes. Light-sheet microscopes, instead, can improve axial resolution by imaging the sample from multiple sides (views), which can then be registered and jointly deconvolved [11]; the disadvantages are reduced effective acquisition speed and the need for a complex optical setup. A method that recovers isotropic resolution from a single, anisotropically acquired microscopic 3D volume is therefore highly desirable and would likely impact the life sciences in fundamental ways.

Here we propose a method to restore isotropic image volumes from anisotropic light-optical acquisitions with the help of convolutional networks, without the need for additional ground truth training data. This can be understood as a combination of a super-resolution problem on subsampled data and a deconvolution problem to counteract the microscope-induced optical PSF. Our method takes two things into account: (i) the 3D image formation process in fluorescence microscopes, and (ii) the 3D structure of the optical PSF. We use and compare two convolutional network architectures that are trained end-to-end on the same anisotropic body of data to which the network is later applied. During training, the network effectively learns a sample-specific image prior that it uses to deconvolve the images and restore full isotropic resolution.

Recently, neural networks have been shown to achieve remarkable results for super-resolution and image restoration on 2D natural images where sufficient ground truth data is available [2, 6]. For fluorescence microscopy data there is, unfortunately, no ground truth (GT) data available, because obtaining it would essentially require building an ideal, physically impossible microscope. Currently there is no network-based approach for recovering isotropic resolution from fluorescence microscopy images. Our work uses familiar network architectures [2, 13] and then applies the concept of self super-resolution [5] by learning from the very same dataset for which we restore isotropic resolution.

2 Methods

Given the true fluorophore distribution \(f(x,y,z)\), the acquired volumetric image g of a microscope can be approximated by the following process

$$\begin{aligned} g = \mathcal {P}\big [ \mathcal {S_\sigma } (h \otimes f) \big ] + \eta \end{aligned}$$
(1)

where \(h = h(x,y,z)\) is the point spread function (PSF) of the microscope, \(\otimes \) is the 3D convolution operation, \(\mathcal {S}_\sigma \) is the axial downsampling/slicing operator with factor \(\sigma \), \(\mathcal {P}\) is the signal-dependent noise operator (e.g. Poisson noise), and \(\eta \) is the detector noise. As the PSF is typically elongated along z and \(\sigma >1\), the lateral slices \(g_{xy}\) of the resulting volumetric images show a significantly higher resolution and structural contrast than the axial slices \(g_{xz}\) and \(g_{yz}\) (cf. Fig. 1a).
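As an illustration, the forward model (1) can be simulated for a Gaussian PSF with a few lines of NumPy/SciPy; all function names and parameter values below are our own illustrative choices, not part of the paper:

```python
import numpy as np
from scipy.ndimage import gaussian_filter

def acquire(f, sigma_xy=2.0, sigma_z=8.0, subsample=4, photons=100, readout=1.0, seed=0):
    """Simulate g = P[S_sigma(h (x) f)] + eta for a Gaussian PSF h.

    f is the 3D fluorophore density with axes (z, y, x); the axial PSF
    width sigma_z is larger than sigma_xy, making the PSF anisotropic.
    """
    rng = np.random.default_rng(seed)
    blurred = gaussian_filter(f, sigma=(sigma_z, sigma_xy, sigma_xy))  # h (x) f
    sliced = blurred[::subsample]                                      # S_sigma: axial slicing
    noisy = rng.poisson(photons * np.clip(sliced, 0, None)) / photons  # P: Poisson noise
    return noisy + rng.normal(0.0, readout / photons, sliced.shape)    # + eta: detector noise

f = np.zeros((64, 64, 64))
f[32, 32, 32] = 1.0        # a single point source
g = acquire(f)             # anisotropic acquisition with 4-fold fewer axial slices
```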

2.1 Restoration via Convolutional Neural Networks

The predominant approach to inverting the image formation process (1) is, in cases where it is possible, to acquire multiple viewing angles of the sample, then register and deconvolve these images by iterative methods without any sample-specific image priors [9, 11, 12]. In contrast to these classical methods for image restoration, we here try to directly learn the mapping between blurred and downsampled images and the true underlying signal. As no ground truth for the true signal is available, we make use of the resolution anisotropy between lateral and axial slices and aim to restore lateral resolution along the axial direction. To this end, we apply an adapted version of the image formation model (1) to the lateral slices \(g_{xy}\) of a given volumetric image

$$\begin{aligned} p_{xy} = \mathcal {S_\sigma } (\tilde{h} \otimes g_{xy}) \end{aligned}$$
(2)

with a suitably chosen 3D-rotated PSF \(\tilde{h}\). To learn the inverse mapping \(p_{xy} \rightarrow g_{xy}\) we assemble lateral patches \((g^n_{xy},p^n_{xy})_{n \in \mathbb {N}}\) and train a fully convolutional neural network [8] to minimize the pixel-wise PSNR loss

$$\begin{aligned} \mathcal {L} = \sum _n -[20\,\log _{10} \max {g^n_{xy}} - 10\,\log _{10} |g^n_{xy}-\tilde{g}^n_{xy}|^2] \end{aligned}$$
(3)

where \(\tilde{g}^n_{xy}\) is the output of the network when applied to \(p^n_{xy}\). For choosing the best \(\tilde{h}\) we consider two options: (i) full: \(\tilde{h} = h_{rot}\), where \(h_{rot}\) is a rotated version of the original PSF that is aligned with the lateral planes, and (ii) split: \(\tilde{h} = h_{split}\), which is the solution to the deconvolution problem \( h_{rot} = h_{iso} \otimes h_{split}\), where \(h_{iso}\) is the isotropic average of h. The latter choice is motivated by the observation that convolving lateral slices with \(h_{split}\) leads to images with a resolution comparable to that of the axial ones. After training, we apply the network to the unseen, anisotropically blurred, bicubically upsampled axial slices \(g_{xz}\) of the whole volume to obtain the final restored output.
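One plausible way to assemble the training pairs of Eq. (2) is sketched below. Since the network is later applied to bicubically upsampled axial slices, we upsample the subsampled patches back to the original size so that input and target have the same shape; the function name and interpolation details are our assumptions:

```python
import numpy as np
from scipy.ndimage import convolve, zoom

def make_pairs(volume, h_rot, subsample=4):
    """Yield (target, input) = (g_xy, p_xy) pairs from the lateral slices of a volume.

    h_rot is a 2D section of the rotated PSF h~ aligned with the lateral plane.
    """
    for g_xy in volume:                               # iterate over lateral (xy) slices
        p_xy = convolve(g_xy, h_rot, mode="reflect")  # h~ (x) g_xy
        p_xy = p_xy[::subsample]                      # S_sigma along one lateral axis
        p_xy = zoom(p_xy, (subsample, 1), order=3)    # cubic upsampling back to size
        yield g_xy, p_xy

vol = np.random.default_rng(0).random((4, 32, 32))    # toy anisotropic stack
pairs = list(make_pairs(vol, np.full((5, 5), 1 / 25)))
```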

2.2 Network Architecture and Training

We propose and compare two learning strategies, IsoNet-1 and IsoNet-2, that implement two different established network topologies. The notation for the specific layers is as follows: \(C_{n,w,h}\) for a convolutional layer with n filters of size \((w,h)\), \(M_{p,q}\) for max pooling by a factor of \((p,q)\), and \(U_{p,q}\) for upsampling by a factor of \((p,q)\). In conjunction with the two different methods of training data generation (full, split), the specific topologies are:

IsoNet-1. The network architecture proposed in [1] for super-resolution: \(C_{64,9,9} - C_{32,5,5} - C_{1,5,5}- C_{1,1,1}\). Here the first layer acts as a feature extractor whose output is mapped nonlinearly to the resulting image estimate by the subsequent layers. After each convolutional layer a rectifying activation function (ReLU) is applied.
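Assuming single-channel 2D inputs and same-padding (details the text does not specify), the IsoNet-1 topology could be written in PyTorch as follows; this is a sketch, not the authors' released implementation:

```python
import torch
import torch.nn as nn

# C(64,9,9) - C(32,5,5) - C(1,5,5) - C(1,1,1), ReLU after each convolution.
isonet1 = nn.Sequential(
    nn.Conv2d(1, 64, kernel_size=9, padding=4), nn.ReLU(),
    nn.Conv2d(64, 32, kernel_size=5, padding=2), nn.ReLU(),
    nn.Conv2d(32, 1, kernel_size=5, padding=2), nn.ReLU(),
    nn.Conv2d(1, 1, kernel_size=1), nn.ReLU(),
)

y = isonet1(torch.zeros(1, 1, 64, 64))  # same spatial size as the input
```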

IsoNet-2. Similar to the network architecture proposed in [13] for segmentation, it consists of a contractive part: \(C_{16,7,7} - M_{2,2} - C_{32,7,7}- M_{2,2} - C_{64,7,7} - U_{2,2} - C_{32,7,7} - U_{2,2} -C_{16,7,7}-C_{1,1,1}\) with symmetric skip connections. The contractive part of the network learns sparse representations of the input, whereas the skip connections are sensitive to image details (cf. Fig. 1b). In contrast to [13], however, the network learns the residual to the input.
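A PyTorch sketch of the IsoNet-2 topology is given below. The text does not state whether the skip connections are additive or concatenating; we use additive skips for brevity, and the output is the input plus the learned residual. All layer details beyond the stated topology are our assumptions:

```python
import torch
import torch.nn as nn

class IsoNet2(nn.Module):
    def __init__(self):
        super().__init__()
        self.c1 = nn.Sequential(nn.Conv2d(1, 16, 7, padding=3), nn.ReLU())
        self.c2 = nn.Sequential(nn.Conv2d(16, 32, 7, padding=3), nn.ReLU())
        self.c3 = nn.Sequential(nn.Conv2d(32, 64, 7, padding=3), nn.ReLU())
        self.c4 = nn.Sequential(nn.Conv2d(64, 32, 7, padding=3), nn.ReLU())
        self.c5 = nn.Sequential(nn.Conv2d(32, 16, 7, padding=3), nn.ReLU())
        self.out = nn.Conv2d(16, 1, 1)
        self.pool = nn.MaxPool2d(2)
        self.up = nn.Upsample(scale_factor=2)

    def forward(self, x):
        d1 = self.c1(x)                  # C(16,7,7)
        d2 = self.c2(self.pool(d1))      # M(2,2) - C(32,7,7)
        b = self.c3(self.pool(d2))       # M(2,2) - C(64,7,7)
        u2 = self.c4(self.up(b)) + d2    # U(2,2) - C(32,7,7), skip from d2
        u1 = self.c5(self.up(u2)) + d1   # U(2,2) - C(16,7,7), skip from d1
        return x + self.out(u1)          # C(1,1,1); learns the residual to the input

y = IsoNet2()(torch.zeros(1, 1, 64, 64))
```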

For all datasets, both architectures were trained for 100 epochs with the Adam optimizer [7] and a learning rate of \( 5\cdot 10^{-3}\). We furthermore use a dropout of \(20\%\) throughout and apply data augmentation (flipped and rotated images) where it is compatible with the symmetries of the PSF (i.e. whenever the latter commutes with the augmentation symmetry).
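A minimal training-loop sketch with the negative-PSNR loss of Eq. (3) and the stated Adam settings might look as follows; we take the mean over pixels inside the logarithm, and dropout layers and augmentation are omitted for brevity:

```python
import torch

def neg_psnr(pred, target):
    """Negative PSNR between prediction and target, cf. Eq. (3)."""
    mse = torch.mean((pred - target) ** 2)
    peak = target.max()
    return -(20 * torch.log10(peak) - 10 * torch.log10(mse))

def train(model, pairs, epochs=100, lr=5e-3):
    """Train on (target, input) lateral patch pairs."""
    opt = torch.optim.Adam(model.parameters(), lr=lr)
    for _ in range(epochs):
        for target, inp in pairs:            # tensors of shape (batch, 1, h, w)
            opt.zero_grad()
            loss = neg_psnr(model(inp), target)
            loss.backward()
            opt.step()
    return model

val = neg_psnr(torch.ones(4), 2 * torch.ones(4))  # peak 2, MSE 1: about -6.02 dB
```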

Fig. 2.

Comparison of results on synthetic data. Rows show axial slices of 3D nuclei data, membrane data, and a combined dataset, respectively. The columns are: (i) ground truth phantom fluorophore densities, (ii) the same ground truth convolved with an isotropic PSF, (iii) the anisotropically blurred isotropic GT image (the input to all remaining columns), (iv) deconvolved images using Richardson-Lucy [9, 12], (v) SRCNN [1], (vi) IsoNet-1 with one (full) PSF, (vii) IsoNet-1 making use of the split PSFs, (viii/ix) IsoNet-2 with full and split PSFs, respectively.

3 Results

3.1 Synthetic Data

We use 3 different types of synthetic datasets of size \(512^3\) that resemble typical biological structures, as shown in Fig. 2: The uppermost row shows small axial crops from a volume containing about 1500 simulated nuclei. The middle row shows crops of membrane structures as they are frequently seen in tightly packed cell epithelia. The last row shows both simulated cell nuclei and surrounding labeled membranes. All volumes were created in silico by combining plausible structure distributions, Perlin-noise-based textures, and realistic camera noise. The first column shows the ground truth images, from which we generated the isotropic ground truth (by convolving with an isotropic PSF) and the blurred images (by convolving with realistic PSFs and subsampling) in order to resemble microscopic data. This third column (blurred) then serves as input to our methods and to all other tested methods. The subsequent 6 columns show the results of (i) Richardson-Lucy deconvolution [9], (ii) pure SRCNN [1], i.e. disregarding the PSF, (iii) IsoNet-1 using the full PSF, (iv) IsoNet-1 using the anisotropic component of the PSF \(h_{split}\), (v) IsoNet-2 using the full PSF, and (vi) IsoNet-2 using the split PSF. In addition to the visual results in the figure, Table 1 compares the PSNR of the full volumes against the two ground truth versions, averaged over 10 different randomly created stacks per dataset type. As can be seen, our method performs best in all cases, and significantly so (\(p<0.01\)). Note that failing to incorporate the PSF (as with pure SRCNN) results in an inferior reconstruction.

Table 1. Computed PSNR values against isotropic GT (upper rows), and against GT (lower rows). PSF types are: gaussian (\(\sigma _{xy}/\sigma _z = 2/8\)); confocal with numerical aperture \(\mathrm {NA} = 1.1\); light-sheet with \(\mathrm {NA_{detect}} = 0.8\) and \(\mathrm {NA_{illum}}=0.1\). Bold values indicate best. Standard deviation in brackets (n = 10).

Simple 3D Segmentation. To provide a simple example of how the improved image quality helps downstream processing, we applied a standard 3D segmentation pipeline to the simulated nuclei data (cf. Fig. 2), consisting of 3 simple steps: First, we apply a global threshold using the intermodes method [14]. Then, holes in thresholded image regions are closed. Finally, cells that clump together are separated by applying a 3D watershed algorithm to the 3D Euclidean distance transform. This pipeline is freely available to a large audience in tools like Fiji or KNIME. We applied it to the isotropic ground truth data, the blurred and subsampled input data, and the result produced by IsoNet-2. As evaluation metric we used SEG (ISBI Tracking Challenge 2013), the average intersection over union of matching cells compared to the ground truth labels, which takes values in [0, 1], where 1 corresponds to a perfect voxel-wise matching. The results for the different conditions, \(\text {SEG}_{GT} = 0.923\) (isotropic ground truth), \(\text {SEG}_{blurred} = 0.742\) (blurred input), and \(\text {SEG}_{IsoNet-2} = 0.913\) (network output), demonstrate the effectiveness of our approach.
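The three steps above can be sketched with SciPy and scikit-image; note that scikit-image does not ship an intermodes threshold, so we substitute Otsu's method here, and the marker heuristic is our own simplification:

```python
import numpy as np
from scipy import ndimage as ndi
from skimage.filters import threshold_otsu
from skimage.segmentation import watershed

def segment_nuclei(volume):
    mask = volume > threshold_otsu(volume)           # 1. global threshold (Otsu in lieu of intermodes)
    mask = ndi.binary_fill_holes(mask)               # 2. close holes in thresholded regions
    dist = ndi.distance_transform_edt(mask)          # 3. watershed on the Euclidean distance transform
    markers, _ = ndi.label(dist > 0.7 * dist.max())  # crude seed detection
    return watershed(-dist, markers, mask=mask)

vol = np.zeros((32, 32, 32))
vol[8:16, 8:16, 8:16] = 1.0                          # one synthetic "nucleus"
labels = segment_nuclei(vol)
```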

Fig. 3.

Results on fluorescence microscopy images of liver tissue (data taken from [10]). Nuclei (DAPI) and membrane (Phalloidin) staining of hepatocytes, imaged with a two-photon confocal microscope (excitation wavelength 780 nm, NA = 1.3, oil immersion, n = 1.49). We start from an isotropic acquisition (ground truth), simulate an anisotropic acquisition (by taking every 8th slice), and compare the isotropic image to the IsoNet-2 recovered image. Scalebar is \(50\,\upmu \text {m}\).

3.2 Real Data

Furthermore, we validate our approach on confocal and light-sheet microscopy data and demonstrate the perceptual isotropy of the recovered stacks. First, we show that artificially subsampled two-photon confocal acquisitions can be made isotropic using IsoNet-2. As can be seen in Fig. 3, the original isotropic data is nearly perfectly recovered from the 8-fold subsampled data (obtained by taking every 8th axial slice). Second, we show that single-view light-sheet acquisitions can be made isotropic. Figure 4 shows stacks from two different sample recordings for which we trained IsoNet-2 to restore the raw axial (yz) slices. The final results exhibit perceptual sharpness close to that of the higher-quality raw lateral (xy) slices, even when compared to multiview deconvolution, demonstrating the ability to restore isotropic resolution from a single volume in different experimental settings.

Fig. 4.

IsoNet-2 applied to (a) Drosophila and (b) C. elegans volumes. The image quality of the recovered IsoNet-2 axial (yz) slices is significantly improved and shows similar isotropic resolution when compared to the lateral (xy) slices. In (b) we additionally compare to the result of multiview deconvolution [11]. Scalebar (a) 50 \(\upmu \text {m}\), (b) \(10\,\upmu \text {m}\).

4 Discussion

We presented a method to enhance the axial resolution in volumetric microscopy images by reconstructing isotropic 3D data from non-isotropic acquisitions with convolutional neural networks. Training is performed unsupervised and end-to-end, on the same anisotropic image data for which we recover isotropy. We demonstrated our approach on 3 synthetic and 3 real datasets and compared our results to those of classical deconvolution [9, 12] and state-of-the-art super-resolution methods. We further showed that a standard 3D segmentation pipeline applied to the outputs of IsoNet-2 performs essentially as well as on fully isotropic data. Approaches like the ones we suggest bear great potential to make microscopic data acquisition significantly more efficient. For the liver data, for example, we show (Fig. 3) that only \(12.5\%\) of the data yields isotropic reconstructions that appear on par with the full isotropic volumes. This would potentially reduce memory and time requirements as well as laser-induced fluorophore and sample damage by the same factor. Still, this method cannot, of course, fill in missing information: if the axial sampling rate drops below the Shannon limit (with respect to the smallest structures we are interested in resolving), the proposed networks will not be able to recover the data. Source code will be released at https://github.com/maweigert/isonet.