
1 Introduction

MRI images can provide various kinds of detailed information about physical health. However, external errors, inappropriate spatial encoding, body motion, etc., may jointly introduce undesirable artifacts and harmful noise into MRI images. Clean MRI images can increase the accuracy of computer vision tasks [1, 2] such as semantic segmentation [3] and object detection [4]. In the past, a wide variety of denoising methods have been proposed, such as filtering methods [5, 6] and transform-domain methods [7]. Nevertheless, these methods are restricted by numerous objective factors, such as undesirable texture changes caused by violated assumptions and heavy computational overhead. Recently, deep learning methods have made great progress in image denoising and achieve impressive results on MRI images. Due to the scarcity of medical images, researchers often need to use unpaired data during training. Generative adversarial networks (GANs) [8] have proven competitive in image generation tasks [9, 10]. One possible solution is to directly use unsupervised methods (DualGAN [11], CycleGAN [12]) to find the mappings between the clear and noised image domains. However, these general-purpose methods often encode irrelevant characteristics such as texture features, rather than noise attributes, into the generators, and thus do not produce high-quality denoised images.

Guided by the aforementioned observations, we present an MRI image denoising method called UEGAN, which uses a GAN based on decoupled expression to generate visually realistic denoised images. More specifically, we decouple the content and noise of noised images to accurately encode noise attributes into the denoising model. As shown in Fig. 1, the content encoders encode content information and the noise encoder encodes noise attributes from unpaired clear and noised MRI images. However, this structure alone cannot guarantee that the noise encoder encodes only noise attributes; it may encode content information as well. We therefore employ a noising branch to prevent the noise encoder from encoding the content attributes of n. The denoising generator \(G_{clear}\) and the noising generator \(G_{noised}\) take the corresponding content information, conditioned on the noise attributes, to generate denoised and noised MRI images. Following CycleGAN [12], we apply the adversarial loss and the cycle-consistency loss as regularizers to help the generators produce MRI images close to the originals. To further reduce the undesirable banding artifacts introduced by \(G_{noised}\) and \(G_{clear}\), we add an image quality penalty to this structure. We conduct experiments on the Brainweb MRI dataset and obtain qualitative and quantitative results that are competitive with several conventional methods and a deep learning method.

2 Related Work

Since the proposed model builds on popular denoising networks and the latest techniques of image disentangled representation, in this section we briefly review generative adversarial networks, single image denoising and disentangled representation.

2.1 Generative Adversarial Network

Generative adversarial networks [8] were brought forward to train generative models. Radford et al. [13] propose a CNN version of GANs called DCGANs. Arjovsky et al. [14] introduce a novel Wasserstein loss into GAN training. Zhang et al. [15] propose the Self-Attention GAN, which applies the attention mechanism to image generation.

2.2 Disentangled Representation

Recently, there has been rapid development in learning disentangled representations, namely decoupled expression. Tran et al. [16] unravel pose and identity components for face recognition with a model called DR-GAN. Liu et al. [17] present an identity extraction and elimination autoencoder to disentangle identity from other characteristics. Xu et al. propose FaceShapeGene [18], which correctly disentangles the shape features of different semantic facial parts.

2.3 Single Image Denoising

Image noise seriously damages image quality. Many deep learning methods focus on image denoising tasks. Jain et al. [19] first introduce convolutional neural networks (CNNs) with a small receptive field into image denoising. Chen et al. [20] combine Euclidean and perceptual loss functions to recover more edge information. According to the deep image prior (DIP) presented by Ulyanov et al. [21], abundant prior knowledge for image denoising already exists in the structure of a convolutional neural network itself.

3 Proposed Method

Inspired by GANs, single image denoising and decoupled expression, we propose an unsupervised MRI image denoising method called UEGAN with carefully designed loss functions based on decoupled expression. The structure combines the advantages of the three lines of work above and is made up of four parts: 1) content encoders \(E_N^{cont}\) for the noised image domain and \(E_C^{cont}\) for the clear image domain; 2) a noise encoder \(E^{noise}\); 3) noised and clear image generators \(G_{noised}\) and \(G_{clear}\); 4) noised and clear image discriminators \(D_N\) and \(D_C\). Given a training sample n \(\in\) N in the noised image domain and c \(\in\) C in the clear image domain, the content encoders \(E_N^{cont}\) and \(E_C^{cont}\) acquire content information from the corresponding samples, and \(E^{noise}\) extracts the noise attributes from n. Then \(E^{noise} \left( n \right)\) and \(E_C^{cont} \left( c \right)\) are fed into \(G_{noised}\) to generate a noised image \(c^n\); meanwhile, \(E^{noise} \left( n \right)\) and \(E_N^{cont} \left( n \right)\) are fed into \(G_{clear}\) to generate a clear image \(n^c\). The discriminators \(D_N\) and \(D_C\) differentiate real from generated examples. The final structure is shown in Fig. 1.

3.1 Decoupling Noise and Content

It is not easy to decouple content information from a noised image because the ground-truth image is not available in the unpaired setting. Since the clear image c is not affected by noise, the content encoder \(E_C^{cont}\) encodes content characteristics only. We share the weights of the last layers of \(E_N^{cont}\) and \(E_C^{cont}\) so that \(E_N^{cont}\) encodes as much content information from the noised image domain as possible.

Meanwhile, the noise encoder should encode only noise attributes. So we feed the outputs of \(E^{noise} \left( n \right)\) and \(E_C^{cont} \left( c \right)\) into \(G_{noised}\) to generate \(c^n\). Since \(c^n\) is a noised version of c, \(c^n\) does not contain any content information of n in the whole process. This noising branch further restrains the noise encoder from encoding the content information of n.
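As an illustration, the last-layer weight sharing between \(E_N^{cont}\) and \(E_C^{cont}\) can be sketched with toy encoders in which the final projection is literally the same array for both branches. This is a minimal numpy sketch with hypothetical layer sizes, not the actual UEGAN layers:

```python
import numpy as np

rng = np.random.default_rng(0)

# Each content encoder keeps a private first layer, while the last
# projection W_shared is the *same* array for both branches, mirroring
# the last-layer weight sharing described above. Sizes are hypothetical.
W_noised = rng.standard_normal((8, 4)) * 0.1   # private layer, noised branch
W_clear = rng.standard_normal((8, 4)) * 0.1    # private layer, clear branch
W_shared = rng.standard_normal((4, 3)) * 0.1   # shared last layer

def encode_noised_content(n):
    # E_N^cont: private nonlinearity, then the shared projection
    return np.tanh(n @ W_noised) @ W_shared

def encode_clear_content(c):
    # E_C^cont: different first layer, identical last layer
    return np.tanh(c @ W_clear) @ W_shared
```

Because both functions close over the same `W_shared`, any gradient update to the shared layer would affect both encoders, which is what pushes the two content spaces to align.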

Fig. 1.

The architecture of our network. The denoising branch (bottom noising branch) is represented by solid lines (dotted lines). \(E_N^{cont}\) and \(E_C^{cont}\) are content encoders for noised and clear images. \(E^{noise}\) is a noise encoder. \(G_{noised}\) and \(G_{clear}\) are the noised image and clear image generators. GAN losses are added to differentiate \(c^n\) from noised images, and \(n^c\) from clear images. Cycle-consistency loss is employed on n and \(n^{\prime}\), c and \(c^{\prime}\). IE loss is applied to n and \(n^c\).

3.2 Adversarial Loss

In order to acquire cleaner outputs, we introduce adversarial loss functions for both the clear and the noised image domains. For the clear image domain, we define the adversarial loss \(L_{D_C }\):

$$ L_{D_C } = {\mathbb{E}}_{c \sim p\left( c \right)} [\log \,D_C (c)] + {\mathbb{E}}_{n \sim p\left( n \right)} [\log (1 - D_C (G_{clear} (E_N^{cont} (n),\,z)))]. $$
(1)

where z \(= E^{noise} \left( n \right)\). \(D_C\) strives to maximize the objective function to differentiate denoised images from real clear images. In contrast, \(G_{clear}\) tries to minimize the objective function to make denoised images look similar to real samples in the clear image domain. For the noised image domain, we define the loss \(L_{D_N }\):

$$ L_{D_N } = {\mathbb{E}}_{n \sim p\left( n \right)} [\log \,D_N (n)] + {\mathbb{E}}_{c \sim p\left( c \right)} [\log (1 - D_N (G_{noised} (E_C^{cont} (c),\,z)))]. $$
(2)
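Both Eq. (1) and Eq. (2) have the standard GAN discriminator form. The following is a minimal sketch of how such a loss could be computed from discriminator scores; the network internals are abstracted away and scores are assumed to lie strictly in (0, 1):

```python
import numpy as np

def adversarial_loss(d_real, d_fake):
    """Discriminator objective of Eqs. (1)-(2):
    E[log D(real)] + E[log(1 - D(fake))].

    d_real: discriminator scores on real samples, in (0, 1).
    d_fake: discriminator scores on generated samples, in (0, 1).
    """
    d_real = np.asarray(d_real, dtype=np.float64)
    d_fake = np.asarray(d_fake, dtype=np.float64)
    return float(np.mean(np.log(d_real)) + np.mean(np.log(1.0 - d_fake)))
```

The discriminator updates its weights to maximize this value, while the generator minimizes the second term, as described above.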

3.3 Image Quality Penalty

In our experiments, we observed that the denoised images \(n^c\) usually contain unpleasant banding artifacts. So we introduce image information entropy (IE) [22], which measures the amount of information in an image, to reduce the banding artifacts. The IE loss guides the generator to produce MRI images with less noise. The loss is defined as:

$$ L_{IE} \left( {G_{clear} \left( z \right)} \right) = \sum\nolimits_{i = 0,\;p\left( i \right) \ne 0}^d {p\left( i \right)\log \frac{1}{p\left( i \right)}.} $$
(3)

where d is the range of image intensity and p(i), i = 0, 1, 2, …, d, is the probability distribution of the intensity of the output \(G_{clear} \left( z \right)\).
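Equation (3) is the Shannon entropy of the intensity histogram of the generated image. A possible implementation, assuming 8-bit intensities (d = 255) and natural logarithms (the paper does not specify the base):

```python
import numpy as np

def ie_loss(img, d=255):
    """Image information entropy of Eq. (3):
    sum over intensities i with p(i) != 0 of p(i) * log(1 / p(i))."""
    # p(i): normalized intensity histogram over [0, d]
    hist, _ = np.histogram(img, bins=d + 1, range=(0, d + 1))
    p = hist / hist.sum()
    p = p[p > 0]  # the sum in Eq. (3) skips intensities with p(i) = 0
    return float(np.sum(p * np.log(1.0 / p)))
```

A constant image has entropy 0, while banding artifacts spread probability mass over extra intensity levels and raise the entropy, which is why minimizing this term suppresses them.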

3.4 Cycle-Consistency Loss

\(G_{clear}\) should be able to generate visually realistic clear images after the minimax game. However, without the guidance of pairwise supervision, the denoised image \(n^c\) may fail to retain the content information of the original noised sample n. Therefore, we introduce the cycle-consistency loss to ensure that the denoised image \(n^c\) can be re-noised to reconstruct the original noised image and that \(c^n\) can be translated back to the original clear image domain. The loss preserves more content information of the corresponding original samples. In more detail, we define the forward translation as:

$$ n^c = G_{clear} (E_N^{cont} (n),\,E^{noise} (n)), $$
$$ c^n = G_{noised} (E_C^{cont} (c),\,E^{noise} (n)). $$
(4)

And the backward translation as:

$$ n^{\prime} = G_{noised} (E_C^{cont} (c^n ),E^{noise} (n^c )), $$
$$ c^{\prime} = G_{clear} (E_N^{cont} (n^c ),E^{noise} (n^c )). $$
(5)

We perform the loss on both domains as follows:

$$ L_{cc} = {\mathbb{E}}_{c \sim p\left( c \right)} \left[ {\left\| {c - c^{\prime}} \right\|_1 } \right] + {\mathbb{E}}_{n \sim p\left( n \right)} \left[ {\left\| {n - n^{\prime}} \right\|_1 } \right]. $$
(6)
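A sketch of Eq. (6) on raw image arrays; the expectations are replaced by means over pixels and batching is omitted:

```python
import numpy as np

def cycle_consistency_loss(c, c_rec, n, n_rec):
    """L_cc of Eq. (6): L1 distance between each original image and its
    reconstruction from the backward translation (c vs c', n vs n')."""
    c, c_rec = np.asarray(c, dtype=np.float64), np.asarray(c_rec, dtype=np.float64)
    n, n_rec = np.asarray(n, dtype=np.float64), np.asarray(n_rec, dtype=np.float64)
    return float(np.mean(np.abs(c - c_rec)) + np.mean(np.abs(n - n_rec)))
```

The loss is zero exactly when both reconstructions match their originals pixel for pixel, which is the behavior the cycle constraint rewards.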

Meanwhile, we carefully balance the weights among the aforementioned losses to prevent \(n^c\) from staying too close to n.

The total objective function is a combination of all the losses from (1) to (6) with respective weights:

$$ L = \lambda_{adv} L_{adv} + \lambda_{IE} L_{IE} + \lambda_{cc} L_{cc} . $$
(7)

3.5 Testing

During testing, the noising branch is removed. Given a test image a, \(E_N^{cont}\) and \(E^{noise}\) extract the content information and noise attributes. Then \(G_{clear}\) takes the outputs and generates the denoised image A:

$$ A = G_{clear} (E_N^{cont} (a),\,E^{noise} (a)). $$
(8)

4 Experiments and Analysis

We compare the MRI image denoising performance of our work with non-local means (NLM) [23] and a deep learning method, DIP. To analyze the performance of the denoising methods quantitatively, peak signal-to-noise ratio (PSNR) and structural similarity index (SSIM) are employed. We evaluate the proposed model on the Brainweb MRI dataset. The unpaired training set of 150 MRI images consists of the following two parts:


  1) Samples from the noised image domain consist of seventy-five slices with a slice thickness of 1 mm and additive Gaussian noise with standard deviation sigma of 25.

  2) Samples from the clear image domain (no additional Gaussian noise) consist of seventy-five slices with a slice thickness of 1 mm.
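The noised training samples could be synthesized from clean Brainweb slices roughly as follows; the clipping to [0, 255] is our assumption, since the text only specifies the noise standard deviation:

```python
import numpy as np

def make_noised_slice(clean_slice, sigma=25.0, seed=None):
    """Synthesize a sample for the noised domain by adding zero-mean
    Gaussian noise with standard deviation sigma to a clean MRI slice.
    Clipping to the 8-bit range is an assumption of this sketch."""
    rng = np.random.default_rng(seed)
    noisy = clean_slice.astype(np.float64) + rng.normal(0.0, sigma, clean_slice.shape)
    return np.clip(noisy, 0.0, 255.0)
```

Note that the resulting noised and clear sets are kept unpaired during training: a noised slice and a clear slice drawn in the same batch need not come from the same anatomy.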

4.1 Implementation Details

We train our network UEGAN using the PyTorch 1.4.0 package on a computer with an Intel i9 9300k CPU, an NVIDIA RTX 2080Ti GPU, 32 GB memory and Windows 10, on the Brainweb MRI dataset. UEGAN is optimized with the gradient-based Adam optimizer, whose hyper-parameters are set as β1 = 0.5, β2 = 0.999 and Nepoch = 100000; the learning rate of all generators is 2e−4 and the learning rate of all discriminators is 1e−4. We train at the original size of 208 × 176 with a batch size of 4. We experimentally set the hyper-parameters \(\lambda_{adv}\) = 1, \(\lambda_{cc}\) = 10 and \(\lambda_{IE}\) = 10.

4.2 Experimental Results

In this section, we compare our method with NLM and DIP; the denoising performance is shown in Fig. 2. For NLM, the denoising results are blurry and a great quantity of local details are missing. In contrast, our visual results have sharper textures and more structural details.

DIP produces artifacts and cannot recover meaningful MRI image information. In contrast, our model UEGAN obtains more distinct results and less noise, especially in local regions.

UEGAN achieves the best visual performance in denoising and image information recovery.

Fig. 2.

Visual denoising results on three selected MRI slices. Columns, from left to right: noised image, NLM, DIP, the proposed method UEGAN, and noise-free image.

4.3 Quantitative Analysis

Two quantitative metrics, PSNR and SSIM, are adopted to assess the effects of the traditional image denoising method NLM, the deep learning method DIP and our work UEGAN. As shown in Table 1 and Table 2, the denoising results of our work show superior performance to the other algorithms on both quantitative evaluation indexes.
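For reference, PSNR and a simplified SSIM can be computed as follows. The standard SSIM uses an 11 × 11 sliding Gaussian window; the single-window version below, computed over the whole image, is a simplification for illustration:

```python
import numpy as np

def psnr(ref, test, data_range=255.0):
    """Peak signal-to-noise ratio in dB."""
    ref = np.asarray(ref, dtype=np.float64)
    test = np.asarray(test, dtype=np.float64)
    mse = np.mean((ref - test) ** 2)
    if mse == 0:
        return float("inf")  # identical images
    return float(10.0 * np.log10(data_range ** 2 / mse))

def global_ssim(x, y, data_range=255.0):
    """SSIM computed with a single window spanning the whole image
    (a simplification of the usual sliding-window formulation)."""
    x = np.asarray(x, dtype=np.float64)
    y = np.asarray(y, dtype=np.float64)
    c1, c2 = (0.01 * data_range) ** 2, (0.03 * data_range) ** 2
    mx, my = x.mean(), y.mean()
    cov = ((x - mx) * (y - my)).mean()
    return float(((2 * mx * my + c1) * (2 * cov + c2))
                 / ((mx**2 + my**2 + c1) * (x.var() + y.var() + c2)))
```

PSNR measures pixel-wise fidelity while SSIM also reflects structural similarity, which is why both are reported in Tables 1 and 2.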

Table 1. PSNR comparison
Table 2. SSIM comparison

5 Conclusion

In this paper, we concentrate on generating high-quality denoised MRI images with a deep learning method called UEGAN, based on decoupled expression. We utilize the noise encoder and the content encoders to decouple the content information and noise attributes in a noised MRI image. In order to obtain rich content characteristics from the original image, we add the adversarial loss and the cycle-consistency loss. We add the noising branch to the model so as to restrict the noise encoder to encoding noise attributes as much as possible. The IE loss helps to remove the banding artifacts present in the outputs of the generator. Compared with several popular methods, our work achieves promising visual effects and quantitative results.