
1 Introduction

One of the most important tasks in computer vision is image denoising. The general idea is to regard a noisy image d as being obtained by corrupting a noiseless image m; given a model for the noise corruption, the desired image m is the solution of the corresponding inverse problem

$$\begin{aligned} d = m + n \end{aligned}$$

where n is the noise; in this paper we assume that the noise is additive and normally distributed. Although image denoising is the simplest possible inverse problem, it remains a difficult task due to the complicated structures and textures of natural images. In general, the image degradation process can be described mathematically by a Fredholm operator equation of the first kind

$$\begin{aligned} (\mathcal K m)(x,y) =\int \!\!\int \mathcal K(x-\xi ,y-\eta )\, m(\xi ,\eta )\,d\xi \, d\eta + n(x,y) = d(x,y) \end{aligned}$$

where \(\mathcal K\) is usually a linear kernel function. For the deblurring problem, \(\mathcal K\) is the point spread function (PSF); for denoising, it is simply the identity operator. Since the inverse problem is extremely ill-posed, most denoising procedures have to employ some sort of regularization. Generally, the image denoising problem can be solved within the framework of Tikhonov regularization:

$$\begin{aligned} \min _m \mathcal J(m) =||\mathcal K m -d||^2 + \alpha {\varOmega }[m] \end{aligned}$$

where \({\varOmega }[m]\) is the stabilizing functional for m. Depending on the type of problem, \({\varOmega }[m]\) can take different forms, such as Total Variation [1, 2], the F-norm [3], the \(L_0\)-norm [4], and so on.
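As a concrete instance of the observation model above, the following toy Python snippet (our own illustration; for denoising the operator \(\mathcal K\) is the identity) forms a noisy observation d from a clean image m with additive Gaussian noise:

```python
import numpy as np

rng = np.random.default_rng(0)
m = np.zeros((64, 64))
m[16:48, 16:48] = 1.0                      # toy noiseless image
n = 0.1 * rng.standard_normal(m.shape)     # additive Gaussian noise
d = m + n                                  # observed noisy image: d = m + n
```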

For images with low-rank structures, rank regularization is a good choice. The key difficulty, however, is that the rank of an image matrix is hard to formulate in closed form. Recently, reference [4] showed that, under surprisingly broad conditions, one can exactly recover the low-rank matrix A from D = A + E with gross but sparse errors E by solving the following convex optimization problem:

$$\begin{aligned} \arg \min _A ||D - A||_{F}^{2} + ||A||_* \end{aligned}$$

where \(||\cdot ||_*\) represents the nuclear norm of a matrix (the sum of its singular values). In [5], this optimization is dubbed Robust PCA (RPCA), because it enables one to correctly recover the underlying low-rank structure in the data, even in the presence of gross errors or outlying observations. This optimization can easily be recast as a semidefinite program and solved by an off-the-shelf interior point solver (e.g., [6–8]). However, although interior point methods offer superior convergence rates, the complexity of computing the step direction is \(O(m^6)\), so they do not scale well with the size of the matrix.

In recent years, the search for more scalable algorithms for high-dimensional convex optimization problems has prompted a return to first-order methods. One striking example of this is the current popularity of iterative thresholding algorithms for \( l^1\)-norm minimization problems arising in compressed sensing [9–11]. Similar iterative thresholding techniques [12–15] can be applied to the problem of recovering a low-rank matrix from an incomplete (but clean) subset of its entries. This optimization is closely related to the RPCA problem, and the algorithm and convergence proof extend quite naturally to RPCA. However, the iterative thresholding scheme proposed in [4] exhibits extremely slow convergence: solving one instance requires about \(10^4 \) iterations, each of which has the same cost as one singular value decomposition.
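As an illustration of the basic building block shared by these iterative thresholding schemes, the sketch below implements singular value soft-thresholding, the proximal operator of the nuclear norm; it is a minimal Python example of ours, not code from the cited works:

```python
import numpy as np

def svt(D, tau):
    """Singular value soft-thresholding: minimizes 0.5*||A - D||_F^2 + tau*||A||_*."""
    U, s, Vt = np.linalg.svd(D, full_matrices=False)
    return U @ np.diag(np.maximum(s - tau, 0.0)) @ Vt
```

Each iteration of the schemes mentioned above performs essentially one such step, which is why the per-iteration cost is that of one singular value decomposition.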

In this paper, we attempt to solve the rank regularization problem from a new direction. In contrast to the nuclear-norm approximation introduced in [4], we propose a continuous approximation of the rank regularization and analyze the regularity of the algorithm. Compared with the nuclear-norm approximation, the new method proposed in this paper is a ‘direct’ solver for the rank regularization, and it leads to a smooth optimization problem with a well-developed theoretical and computational foundation. Finally, denoising experiments based on the new regularization are presented for images with periodic textures.

The paper is organized as follows. In Sect. 2 we describe the proposed smooth approximation algorithm in detail and discuss the criterion for automatic parameter selection; numerical experimental results are presented in Sect. 3, and conclusions are drawn in Sect. 4.

2 Proposed Method

In this paper, we consider the rank regularization optimization problem, which can be described by the following model:

$$\begin{aligned} \min _u \mathcal J(u) =||u-u_0||^2 + rank(u) \end{aligned}$$
(1)

As the rank of an image matrix is nonconvex and difficult to formulate in closed form, the optimization problem (1) is hard to solve directly. The nuclear norm is a relaxation of the rank, and it can be proved that under certain conditions this relaxation is a good convex approximation:

$$\begin{aligned} \min _u \mathcal J(u) =||u-u_0||^2 + \alpha \Vert u\Vert _* \end{aligned}$$

Many optimization algorithms for this problem have been studied, such as the Accelerated Proximal Gradient (APG) method [16–18], the Augmented Lagrange Multiplier (ALM) method [19], and Iterative Singular Value Thresholding (ISVT) [12]. One common shortcoming of these methods is that they have to compute a singular value decomposition at each iteration, which increases the computational cost considerably.

In this paper, we attempt to solve the rank regularization problem from a new direction. Instead of using the nuclear norm as a relaxation, we propose a ‘direct’ solver for the rank regularization. The main contributions of this algorithm are as follows: firstly, it is a smooth optimization problem that is easy to solve, and we propose an automatic parameter selection criterion based on the Morozov discrepancy principle; secondly, compared with traditional methods, it only needs to compute the singular value decomposition once, so it is more efficient.

From matrix theory, the rank of a matrix equals the number of its non-zero singular values: for a matrix A, \(rank(A)=\#\{\sigma _i \ne 0\}\), where \(\sigma \) is the vector composed of the singular values of A. From this point of view, computing the rank of a matrix is a process of counting non-zeros; it is a generalization of the \(L_0\) norm to two-dimensional space. However, this ‘counting process’ is also difficult to express in closed form and cannot be optimized directly. In this paper, we propose a continuous function to describe the process of “counting non-zeros”.

Firstly, for a matrix A, the singular value decomposition is computed, \(A = S\cdot V \cdot D\), where S and D are unitary matrices and V is a diagonal matrix composed of the singular values of A. Then, we construct a characteristic function \(p(\sigma _i)\) for each singular value \(\sigma _i\), which should have the following property: \(p(\sigma _i) = \left\{ \begin{array}{ll} 0, &{} \hbox {if } \sigma _i =0; \\ 1, &{} \hbox {else.} \end{array} \right. \) In this paper, we choose a Gaussian-type function as the approximation of the characteristic function, with a single parameter s acting as the variance:

$$\begin{aligned} p(\sigma ) = 1-\exp \Big (-\frac{\sigma ^2}{s^2}\Big ), \end{aligned}$$
(2)

From Fig. 1, we can see that as \(s\rightarrow 0\), the function \(p(\cdot )\) becomes a good approximation of the characteristic function. So the rank can be approximated by the sum of the characteristic functions of the singular values, \(rank(A)\approx \sum _{i} p(\sigma _i)\). With the proposed relaxation of the rank, the denoising model (1) can be formulated as follows

$$\begin{aligned} \min _\sigma \mathcal J(\sigma ) =||S\cdot V \cdot D-u_0||^2 + \lambda \sum _i p(\sigma _i) \end{aligned}$$
(3)

where \(V=diag\{\sigma \}\), and the final denoised image is \(u=S\cdot V(\sigma )\cdot D\).
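As a quick sanity check of this relaxation, the smooth surrogate \(\sum _i p(\sigma _i)\) approaches the true rank as s decreases; the following toy Python example (ours, not from the paper) illustrates this:

```python
import numpy as np

def smooth_rank(A, s):
    """Smooth rank surrogate: sum of p(sigma_i) = 1 - exp(-sigma_i^2 / s^2)."""
    sigma = np.linalg.svd(A, compute_uv=False)
    return np.sum(1.0 - np.exp(-sigma**2 / s**2))

A = np.diag([3.0, 1.5, 0.2, 0.0])          # toy matrix of rank 3
print(np.linalg.matrix_rank(A))            # 3
for s in (1.0, 0.1, 0.01):
    print(s, smooth_rank(A, s))            # approaches 3 as s -> 0
```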

Fig. 1. The curve of the characteristic function \(p(\sigma )=1-\exp (-\frac{\sigma ^2}{s^2})\) for different values of the parameter \(s = 0.1, 0.01, 0.001\).

Compared with the nuclear norm, the model (3) is a new relaxation of the rank regularization. We can see that it is a smooth optimization problem and, in particular, it requires computing the singular value decomposition only once.

Using the unitarity of the matrices S and D, the model (3) can be reformulated as

$$\begin{aligned} \min _{\sigma } ||V - S^T u_0 D^T||^2 + \lambda \sum _i p(\sigma _i) \end{aligned}$$

In the discrete representation, since V is diagonal, only the diagonal entries of \(S^T u_0 D^T\) enter the data term (up to a constant independent of \(\sigma \)), and the problem can be written as

$$\begin{aligned} \min _{\sigma } \sum _{i}\bigl (\sigma _i -(S^T u_0 D^T)_{ii}\bigr )^2 + \lambda \sum _i p(\sigma _i) \end{aligned}$$
(4)

Setting the derivative with respect to \(\sigma _i\) to zero, the final problem is to solve the following smooth system of equations, which can be efficiently solved by Newton's method.

$$\begin{aligned} \sigma _i + \frac{2\lambda }{s^2}\sigma _i \exp \Big (-\frac{\sigma _i^2}{s^2}\Big ) =(S^T u_0 D^T)_{ii} \end{aligned}$$
(5)

The detail process is shown in Algorithm 1.

Algorithm 1.
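A minimal Python sketch of this procedure is given below. It assumes (as the one-time SVD mentioned above suggests) that S and D are taken from the SVD of the noisy image \(u_0\), so that \((S^T u_0 D^T)_{ii}\) are simply the noisy singular values; the function name, iteration count and the non-negativity clipping are our own choices, not specified in the paper.

```python
import numpy as np

def denoise_rank_smooth(u0, lam, s, newton_iters=20):
    """Sketch of Algorithm 1 for fixed parameters lam and s."""
    # One SVD of the noisy image; S and Dt are then kept fixed.
    S, b, Dt = np.linalg.svd(u0, full_matrices=False)   # b = diag(S^T u0 D^T)
    sig = b.copy()                                       # start from the noisy singular values
    for _ in range(newton_iters):
        e = np.exp(-sig**2 / s**2)
        g = sig + (2.0 * lam / s**2) * sig * e - b       # residual of Eq. (5)
        dg = 1.0 + (2.0 * lam / s**2) * e * (1.0 - 2.0 * sig**2 / s**2)
        sig = np.maximum(sig - g / dg, 0.0)              # Newton step, kept non-negative
    return S @ np.diag(sig) @ Dt                         # denoised image u = S V(sigma) D
```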

In Algorithm 1, there are two parameters, \(\lambda \) and s, that need to be initialized. Choosing these two parameters is an important and difficult task. In the following, we discuss selection criteria for \(\lambda \) and s.

The choice of the parameter \(\lambda \) is difficult for regularization problems: if the weighting parameter \(\lambda \) is too small, the algorithm will be unstable, and if \(\lambda \) is too large, the regularization will be too strong and the resulting algorithm will not accurately solve the original problem. In this paper, we propose an adaptive selection criterion for the parameter \(\lambda \) based on the Morozov discrepancy principle.

As the final optimal \(\sigma \) depends on the choice of the parameter \(\lambda \), the problem (3) should be reformulated as

$$\begin{aligned} {} \min _{\sigma } J(\sigma ,\lambda )= ||S\cdot diag(\sigma ^{\lambda }) \cdot D - u_{0,\delta }||^2 + \lambda \sum _i p(\sigma _i^{\lambda }) \end{aligned}$$
(6)

There are two variables, \(\sigma \) and \(\lambda \). For a fixed \(\lambda \), the optimization over \(\sigma \) is easily solved via the Euler equation:

$$\begin{aligned} \sigma _i^{\lambda } + \lambda \, p'(\sigma _i^{\lambda }) = b_i, \end{aligned}$$
(7)

where \(b_i = (S^T u_{0,\delta } D^T)_{ii}\). For fixed \(\sigma \), the parameter \(\lambda \) can then be chosen by the Morozov discrepancy principle.

Proposition 1

The criterion for choosing the parameter \(\lambda \): define the functional \(\phi (\lambda )\),

$$\begin{aligned} \phi (\lambda ) = ||S\cdot diag(\sigma ^{\lambda })\cdot D - u_{0,\delta }||^2 - \delta ^2 \end{aligned}$$
(8)

according to the Morozov discrepancy principle, the parameter \(\lambda \) should satisfy \(\phi (\lambda )=0\), which leads to the Newton iteration

$$\begin{aligned} \lambda _{k+1} = \lambda _k -\frac{\phi (\lambda _k)}{\phi '(\lambda _k)} \end{aligned}$$
(9)

where, according to (7), \(\frac{d\sigma _i^{\lambda }}{d\lambda } = -\bigl (1 + \lambda _k\, p''(\sigma _i^{\lambda })\bigr )^{-1} p'(\sigma _i^{\lambda })\), so that \(\phi '(\lambda _k) = 2\sum _i \bigl (\sigma _i^{\lambda } - b_i\bigr )\,\frac{d\sigma _i^{\lambda }}{d\lambda }\).
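For reference, the derivatives of the characteristic function appearing above, computed directly from (2), are

$$\begin{aligned} p'(\sigma ) = \frac{2\sigma }{s^2}\exp \Big (-\frac{\sigma ^2}{s^2}\Big ), \qquad p''(\sigma ) = \frac{2}{s^2}\Big (1-\frac{2\sigma ^2}{s^2}\Big )\exp \Big (-\frac{\sigma ^2}{s^2}\Big ). \end{aligned}$$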

We can see that the function \(p(\sigma )\) is not convex; in particular, as \(s\rightarrow 0\) it becomes difficult to obtain the globally optimal solution. For larger s, however, this is a smooth optimization problem whose global solution can be found reliably. Therefore, in this paper we choose a decreasing sequence of values \(s_0>s_1>\cdots >s_n\) and solve the problem for each s in turn; the complete procedure is summarized in Algorithm 2.

Algorithm 2.
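A self-contained Python sketch of this combined procedure (decreasing s, with \(\lambda \) updated by the discrepancy principle) is given below; it again assumes that S and D come from the SVD of the noisy image, and the sequence of s values, iteration counts and safeguards are illustrative choices of ours rather than values from the paper.

```python
import numpy as np

def p_prime(sig, s):
    return (2.0 * sig / s**2) * np.exp(-sig**2 / s**2)

def p_second(sig, s):
    return (2.0 / s**2) * np.exp(-sig**2 / s**2) * (1.0 - 2.0 * sig**2 / s**2)

def solve_sigma(b, lam, s, iters=20):
    """Newton solve of the Euler equation (7): sigma + lam * p'(sigma) = b."""
    sig = b.copy()
    for _ in range(iters):
        g = sig + lam * p_prime(sig, s) - b
        dg = 1.0 + lam * p_second(sig, s)
        sig = np.maximum(sig - g / dg, 0.0)
    return sig

def denoise_with_continuation(u0, delta, s_seq=(1.0, 0.5, 0.1, 0.05, 0.01),
                              lam0=1.0, outer_iters=10):
    """Sketch of Algorithm 2: continuation over s with Morozov-based lambda updates."""
    S, b, Dt = np.linalg.svd(u0, full_matrices=False)    # S, D fixed from the noisy image
    lam, sig = lam0, b.copy()
    for s in s_seq:                                      # decreasing sequence s_0 > s_1 > ...
        for _ in range(outer_iters):
            sig = solve_sigma(b, lam, s)
            phi = np.sum((sig - b)**2) - delta**2        # discrepancy functional, Eq. (8)
            dsig = -p_prime(sig, s) / (1.0 + lam * p_second(sig, s))
            dphi = 2.0 * np.sum((sig - b) * dsig)
            if abs(dphi) < 1e-12:
                break
            lam = max(lam - phi / dphi, 0.0)             # Newton step on lambda, Eq. (9)
    return S @ np.diag(sig) @ Dt
```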

3 Numerical Experiments

In this section, we report experimental results for the algorithm proposed in the previous section. We present experiments on both synthetic data and real images. All the experiments are conducted and timed on the same PC with an Intel Core i5 2.50 GHz CPU that has 2 cores and 16 GB of memory, running Windows 7 and Matlab (Version 7.10).

We compare the proposed method with some of the most popular low-rank matrix completion algorithms, including the Accelerated Proximal Gradient algorithm (APG), the Iterative Singular Value Thresholding algorithm (ISVT) [11], and the Augmented Lagrange Multiplier (ALM) method.

3.1 Synthetic Data

We generate a synthetic matrix \(X\in R^{250\times 250}\) with rank equal to 25; as shown in Fig. 2(a), it is a block-diagonal matrix. Gaussian noise with level 0.35 is then added, as shown in Fig. 2(b). For the compared algorithms, we use the default parameters in their released codes. The results are shown in Fig. 2(c–f), and the PSNR values, estimated ranks and running times are listed in Table 1. From the experimental results, we can see that our algorithm achieves an exact estimate of the rank and a higher PSNR value while having the lowest running time.
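For reproducibility, a test matrix with these properties can be generated along the following lines; the exact block structure used in the paper is not specified, so the block sizes below are our own assumption.

```python
import numpy as np

rng = np.random.default_rng(0)
X = np.zeros((250, 250))
for k in range(25):                                   # 25 rank-one blocks -> rank(X) = 25
    X[10*k:10*(k+1), 10*k:10*(k+1)] = 1.0             # block-diagonal structure, cf. Fig. 2(a)
X_noisy = X + 0.35 * rng.standard_normal(X.shape)     # additive Gaussian noise, level 0.35
```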

Fig. 2. Experimental results for synthetic data.

Table 1. PSNR, estimated rank and running time of the different algorithms

3.2 Real Image Denoising

As general natural images may not have the low-rank property, we choose a particular type of natural image with periodic textures, such as images of buildings, fabric, and so on (as shown in Figs. 3 and 4). The PSNR values for two test images are shown in Fig. 5.

Fig. 3. Image denoising results of the different algorithms.

Fig. 4. Image denoising results of the different algorithms.

Fig. 5. PSNR values of the different algorithms for the building and fabric images.

4 Conclusions

In this paper, we propose a novel rank-regularized optimization algorithm. The new method is a ‘direct’ solver for the rank regularization: it is a smooth optimization problem and requires computing the singular value decomposition only once. An automatic parameter selection criterion based on the Morozov discrepancy principle is also proposed. Finally, the experimental results show that the proposed algorithm achieves a better estimate of the rank and a higher PSNR value while having the lowest running time.