
A weighted randomized sparse Kaczmarz method for solving linear systems

Computational and Applied Mathematics

Abstract

The randomized sparse Kaczmarz method, designed to seek sparse solutions of linear systems \(Ax=b\), selects the i-th projection hyperplane with probability proportional to \(\Vert a_{i}\Vert _2^2\), where \(a_{i}^{\mathrm{T}}\) is the i-th row of A. In this work, we propose a weighted randomized sparse Kaczmarz method, which selects the i-th projection hyperplane with probability proportional to \(|\langle a_{i},x_{k}\rangle -b_{i}|^p\), where \(0<p<\infty \), for possible acceleration. It bridges the randomized Kaczmarz method and the greedy Kaczmarz method through the parameter p. Theoretically, we show its linear convergence rate in expectation with respect to the Bregman distance in both the noiseless and noisy cases, and the rate is at least as good as that of the randomized sparse Kaczmarz method. The superiority of the proposed method is demonstrated via a group of numerical experiments.
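For concreteness, the following is a minimal NumPy sketch of the weighted selection rule and the sparse Kaczmarz update described above. It assumes normalized rows (\(\Vert a_i\Vert _2=1\), as in the proofs below) and a unit step size; the names soft_threshold and wrask are illustrative, not from the paper.

```python
import numpy as np

def soft_threshold(v, lam):
    # S_lam(v) = sign(v) * max(|v| - lam, 0), the proximal map of lam * ||.||_1
    return np.sign(v) * np.maximum(np.abs(v) - lam, 0.0)

def wrask(A, b, p=2.0, lam=1.0, iters=5000, seed=0):
    """Weighted randomized sparse Kaczmarz: a sketch, not the authors' code."""
    rng = np.random.default_rng(seed)
    m, n = A.shape
    x_dual = np.zeros(n)                 # dual iterate x_k^*
    x = soft_threshold(x_dual, lam)      # primal iterate x_k = S_lam(x_k^*)
    for _ in range(iters):
        r = A @ x - b                    # residual
        w = np.abs(r) ** p
        s = w.sum()
        if s == 0:                       # exact solution reached
            break
        i = rng.choice(m, p=w / s)       # row picked with prob. proportional to |<a_i,x_k>-b_i|^p
        # sparse Kaczmarz step (rows assumed normalized, ||a_i||_2 = 1)
        x_dual -= r[i] * A[i]
        x = soft_threshold(x_dual, lam)
    return x
```

In this sketch, small p flattens the weights toward the norm-proportional (here uniform) selection of RaSK, while large p concentrates the probability mass on the largest residual entry, approaching the greedy rule.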



Acknowledgements

The authors would like to thank the anonymous referees and the associate editor for valuable suggestions and comments, which allowed us to improve the original presentation. This work was supported by the National Natural Science Foundation of China (Nos. 11971480, 61977065), the Natural Science Fund of Hunan for Excellent Youth (No. 2020JJ3038), and the Fund for NUDT Young Innovator Awards (No. 20190105).

Author information


Contributions

The data used in the manuscript are available in the SuiteSparse Matrix Collection. All authors contributed to the study’s conception and design. The first draft of the manuscript was written by LZ, and all authors commented on previous versions of the manuscript. All authors read and approved the final manuscript and are aware of the current submission to COAM.

Corresponding author

Correspondence to Hui Zhang.

Ethics declarations

Conflict of interest

All authors declare that they have no conflict of interest.

Additional information

Communicated by Yimin Wei.

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendices

Appendix A Proof of Theorem 1

Proof

The proof is divided into two parts: we deduce the convergence rate of WRaSK in the first part and compare the convergence rates of WRaSK and RaSK in the second part.

First, we derive the convergence rate of WRaSK. By Theorem 2.8 in Lorenz et al. (2014b), we know that (11) in Lemma 2 holds for both the exact and inexact step sizes. Note that f is 1-strongly convex and \(\Vert a_{i_k}\Vert _2=1\); it follows that

$$\begin{aligned} D_f^{x_{k+1}^*}(x_{k+1},{\hat{x}}) \le D_f^{x_{k}^*}(x_{k},{\hat{x}})-\frac{1}{2}\left( \langle a_{i_k},x_k\rangle -b_{i_k}\right) ^2. \end{aligned}$$
(A1)

We fix the values of the indices \(i_0,\ldots ,i_{k-1}\) and only consider \(i_k\) as a random variable. Taking the conditional expectation on both sides, we derive that

$$\begin{aligned}&{\mathbb {E}} \left( D_{f}^{x_{k+1}^*}(x_{k+1},{\hat{x}})|i_{0},\ldots ,i_{k-1}\right) \\&\quad \le D_{f}^{x_{k}^*}(x_{k},{\hat{x}})-\frac{1}{2}\sum _{i=1}^{m}(\langle a_{i},x_{k}\rangle -b_{i})^2\cdot \frac{|\langle a_{i},x_{k}\rangle -b_{i}|^p}{\Vert Ax_{k}-b\Vert _{l_p}^p}\\&\quad =D_f^{x_{k}^*}(x_{k},{\hat{x}})-\frac{1}{2}\frac{\Vert Ax_k-b\Vert _{l_{p+2}}^{p+2}}{\Vert Ax_k-b\Vert _{l_{p}}^{p}\Vert Ax_k-b\Vert _2^{2}}\cdot \Vert Ax_k-b\Vert _2^{2} \\&\quad \le D_f^{x_{k}^*}(x_{k},{\hat{x}})-\frac{1}{2}\mathop {\inf }\limits _{x\ne {\hat{x}}}\frac{\Vert Ax-b\Vert _{l_{p+2}}^{p+2}}{\Vert Ax-b\Vert _{l_{p}}^{p}\Vert Ax-b\Vert _2^{2}}\cdot \Vert Ax_k-b\Vert _{2}^{2} \\&\quad \le \left( 1-\frac{1}{2} {\widetilde{\sigma }}^2_{\min }(A)\cdot \frac{|{\hat{x}}|_{\min }}{|{\hat{x}}|_{\min }+2\lambda }\cdot \mathop {\inf }\limits _{x\ne {\hat{x}}}\frac{\Vert Ax-b\Vert _{l_{p+2}}^{p+2}}{\Vert Ax-b\Vert _{l_{p}}^{p}\Vert Ax-b\Vert _2^{2}}\right) D_{f}^{x_{k}^*}(x_{k},{\hat{x}}). \end{aligned}$$

The last inequality follows by invoking Lemma 3. Now considering all indices \(i_0,\ldots ,i_k\) as random variables and taking the full expectation on both sides, we have that

$$\begin{aligned} {\mathbb {E}}\left( D_f^{x_{k+1}^*}(x_{k+1},{\hat{x}})\right) \le \left( 1-\frac{1}{2} {\widetilde{\sigma }}^2_{\min }(A)\cdot \frac{|{\hat{x}}|_{\min }}{|{\hat{x}}|_{\min }+2\lambda }\cdot \mathop {\inf } \limits _{z\ne 0}\frac{\Vert Az\Vert _{l_{p+2}}^{p+2}}{\Vert Az\Vert _{l_{p}}^{p}\Vert Az\Vert _2^{2}}\right) {\mathbb {E}}\left( D_f^{x_{k}^*}(x_{k},{\hat{x}})\right) , \end{aligned}$$

where \(z=x-{\hat{x}}\). According to Lemma 1 and the 1-strong convexity of f, we obtain

$$\begin{aligned} D_f^{x_{k}^*}(x_{k},{\hat{x}})\ge \frac{1}{2}\Vert x_{k}-{\hat{x}}\Vert _2^2. \end{aligned}$$

Thus, we get

$$\begin{aligned} {\mathbb {E}}\Vert x_{k}-{\hat{x}}\Vert _2\le \left( 1-\frac{1}{2} {\widetilde{\sigma }}^2_{\min }(A)\cdot \frac{|{\hat{x}}|_{\min }}{|{\hat{x}}|_{\min }+2\lambda }\cdot \mathop {\inf } \limits _{z\ne 0}\frac{\Vert Az\Vert _{l_{p+2}}^{p+2}}{\Vert Az\Vert _{l_{p}}^{p}\Vert Az\Vert _2^{2}}\right) ^{\frac{k}{2}} \sqrt{2\lambda \Vert {\hat{x}}\Vert _1+\Vert {\hat{x}}\Vert _2^2}. \end{aligned}$$

Next, we compare the convergence rates of RaSK and WRaSK. Hölder’s inequality implies that for any \(0\ne x\in {\mathbb {R}}^m,\)

$$\begin{aligned} \Vert x\Vert _{l_p}^p=\sum _{i=1}^{m}|x_i|^p\le \left( \sum _{i=1}^{m}|x_i|^{p+2}\right) ^{\frac{p}{p+2}}\left( \sum _{i=1}^{m}1\right) ^{\frac{2}{p+2}}=\Vert x\Vert _{l_{p+2}}^pm^{\frac{2}{p+2}}, \end{aligned}$$
(A2)

and

$$\begin{aligned} \Vert x\Vert _{2}^2=\sum _{i=1}^{m}|x_i|^{2}\le \left( \sum _{i=1}^{m}|x_i|^{p+2}\right) ^{\frac{2}{p+2}}\left( \sum _{i=1}^{m}1\right) ^{\frac{p}{p+2}}=\Vert x\Vert _{l_{p+2}}^2m^{\frac{p}{p+2}}. \end{aligned}$$
(A3)

Based on (A2) and (A3), for \(0\ne Az\in {\mathbb {R}}^m\) we deduce that

$$\begin{aligned} \frac{\Vert Az\Vert _{l_{p+2}}^{p+2}}{\Vert Az\Vert _{l_{p}}^{p}} \ge \frac{\Vert Az\Vert _{l_{p+2}}^{2}}{m^{\frac{2}{p+2}}} \ge \frac{1}{m}\Vert Az\Vert _2^2. \end{aligned}$$
(A4)

Hence,

$$\begin{aligned} \frac{\Vert Az\Vert _{l_{p+2}}^{p+2}}{\Vert Az\Vert _{l_{p}}^{p}\Vert Az\Vert _2^{2}}\ge \frac{1}{m}. \end{aligned}$$
(A5)

It follows that

$$\begin{aligned} \mathop {\inf }\limits _{z\ne 0}\frac{\Vert Az\Vert _{l_{p+2}}^{p+2}}{\Vert Az\Vert _{l_{p}}^{p}\Vert Az\Vert _2^{2}}\ge \frac{1}{m}, \end{aligned}$$

with which we further derive that

$$\begin{aligned} \frac{1}{2}\cdot {\widetilde{\sigma }}^2_{\min }(A)\cdot \frac{|{\hat{x}}|_{\min }}{|{\hat{x}}|_{\min }+2\lambda }\cdot \mathop {\inf } \limits _{z\ne 0}\frac{\Vert Az\Vert _{l_{p+2}}^{p+2}}{\Vert Az\Vert _{l_{p}}^{p}\Vert Az\Vert _2^{2}}\ge \frac{1}{2}\cdot \frac{1}{m}\cdot {\widetilde{\sigma }}^2_{\min }(A)\cdot \frac{|{\hat{x}}|_{\min }}{|{\hat{x}}|_{\min }+2\lambda }. \end{aligned}$$

Thereby, we conclude that the convergence rate of WRaSK is at least as fast as that of RaSK.

Recall that Hölder’s inequality holds with equality if and only if one of the two vectors involved is a constant multiple of the other. Since the derivation of (A5) uses Hölder’s inequality twice, each time against the all-ones vector, equality holds in (A5) if and only if the entries of Az all have the same absolute value. The proof is completed. \(\square \)
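As a quick sanity check, the short script below numerically verifies (A2), (A3), and the lower bound (A5) on random vectors, together with the equality case just described. It is an illustration under our own choice of p and m, with all names ours.

```python
import numpy as np

rng = np.random.default_rng(1)
p, m = 1.5, 7

def ratio(r):
    # ||r||_{p+2}^{p+2} / (||r||_p^p * ||r||_2^2), the quantity bounded below in (A5)
    return np.sum(np.abs(r) ** (p + 2)) / (np.sum(np.abs(r) ** p) * np.linalg.norm(r) ** 2)

for _ in range(1000):
    r = rng.standard_normal(m)                              # plays the role of Az
    lp_p  = np.sum(np.abs(r) ** p)                          # ||r||_p^p
    lp2   = np.sum(np.abs(r) ** (p + 2)) ** (1 / (p + 2))   # ||r||_{p+2}
    l2_sq = np.linalg.norm(r) ** 2                          # ||r||_2^2
    assert lp_p  <= lp2 ** p * m ** (2 / (p + 2)) + 1e-12   # (A2)
    assert l2_sq <= lp2 ** 2 * m ** (p / (p + 2)) + 1e-12   # (A3)
    assert ratio(r) >= 1 / m - 1e-12                        # (A5)

# Equality case: all entries of Az share the same absolute value.
print(ratio(np.ones(m)), 1 / m)   # both print 1/m
```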

Appendix B Proof of Theorem 2

Proof

We make use of the observation in Needell (2010) that

$$\begin{aligned} x_k^{\delta }:={\hat{x}}+\frac{b_{i_k}^{\delta }-b_{i_k}}{\Vert a_{i_k}\Vert _2^2}a_{i_k} \in H\left( a_{i_k},b_{i_k}^{\delta }\right) . \end{aligned}$$
(B6)

Note that f is 1-strongly convex and \(\Vert a_{i_k}\Vert _2=1\); hence according to Lemma 2, we deduce that

$$\begin{aligned} D_f^{x_{k+1}^*}(x_{k+1},x_k^{\delta })\le D_f^{x_{k}^*}(x_{k},x_k^{\delta })-\frac{1}{2}\left( \langle a_{i_k},x_k\rangle -b_{i_k}^{\delta }\right) ^2. \end{aligned}$$
(B7)

Reformulating (B7) by (B6), we derive that

$$\begin{aligned} D_f^{x_{k+1}^*}\left( x_{k+1},{\hat{x}}\right) \le D_f^{x_{k}^*}(x_{k},{\hat{x}})-\frac{1}{2}\left( \langle a_{i_k},x_k\rangle -b_{i_k}^{\delta }\right) ^2+\left\langle x_{k+1}^*-x_k^*,x_k^{\delta }-{\hat{x}}\right\rangle . \end{aligned}$$
(B8)

(a) In the WRaSK method, we have

$$\begin{aligned} x_{k+1}^*-x_k^*=-\left( \langle a_{i_k},x_k\rangle -b_{i_k}^{\delta }\right) a_{i_k}. \end{aligned}$$

Recalling that \(x_k^{\delta }-{\hat{x}}=(b_{i_k}^{\delta }-b_{i_k})a_{i_k}\), we get

$$\begin{aligned} \langle x_{k+1}^*-x_k^*,x_k^{\delta }-{\hat{x}}\rangle =\left( b_{i_k}^{\delta }-b_{i_k}\right) ^2 -\left( b_{i_k}^{\delta }-b_{i_k}\right) \cdot \left( \langle a_{i_k},x_k\rangle -b_{i_k}\right) , \end{aligned}$$
(B9)

and

$$\begin{aligned}&-\frac{1}{2}\left( \langle a_{i_k},x_k\rangle -b_{i_k}^{\delta }\right) ^2\nonumber \\&\quad = -\frac{1}{2}\left( \langle a_{i_k},x_k\rangle -b_{i_k}+b_{i_k}-b_{i_k}^{\delta }\right) ^2\nonumber \\&\quad =-\frac{1}{2}\left( \langle a_{i_k},x_k\rangle -b_{i_k}\right) ^2+ \left( b_{i_k}^{\delta }-b_{i_k}\right) \cdot \left( \langle a_{i_k},x_k\rangle -b_{i_k}\right) -\frac{1}{2}\left( b_{i_k}-b_{i_k}^{\delta }\right) ^2. \end{aligned}$$
(B10)

Plugging the reformulations (B9) and (B10) into (B8), we have

$$\begin{aligned} D_f^{x_{k+1}^*}\left( x_{k+1},{\hat{x}}\right) \le D_f^{x_{k}^*}\left( x_{k},{\hat{x}}\right) -\frac{1}{2}\left( \langle a_{i_k},x_k\rangle -b_{i_k}\right) ^2 +\frac{1}{2}\left( b_{i_k}-b_{i_k}^{\delta }\right) ^2. \end{aligned}$$

We fix the values of the indices \(i_0,\ldots ,i_{k-1}\) and only consider \(i_k\) as a random variable. Taking the conditional expectation on both sides, we get

$$\begin{aligned} \begin{aligned}&{\mathbb {E}}\left( D_f^{x_{k+1}^*}\left( x_{k+1},{\hat{x}}\right) |i_0,\ldots ,i_{k-1}\right) \\&\quad \le D_f^{x_{k}^*}\left( x_{k},{\hat{x}}\right) - \frac{1}{2}\sum _{i=1}^{m}\left( \langle a_i,x_k\rangle -b_i\right) ^2\frac{|\langle a_i,x_k\rangle -b_i|^p}{\Vert Ax_k-b\Vert _{l_p}^p}+ \frac{1}{2}\sum _{i=1}^{m}\left( b_i-b_i^{\delta }\right) ^2\frac{|\langle a_i,x_k\rangle -b_i|^p}{\Vert Ax_k-b\Vert _{l_p}^p}\\&\quad \le qD_f^{x_{k}^*}\left( x_{k},{\hat{x}}\right) +\frac{1}{2}\Vert b-b^{\delta }\Vert _{l_{p+2}}^2\cdot \frac{\Vert Ax_k-b\Vert _{l_{p+2}}^p}{\Vert Ax_k-b\Vert _{l_{p}}^p}. \end{aligned} \end{aligned}$$

The last inequality can be deduced by using the conclusion of Theorem 1 and Hölder’s inequality

$$\begin{aligned} \sum _{i=1}^{m}(b_i-b_i^{\delta })^2\cdot |\langle a_i,x_k\rangle -b_i|^p\le \Vert b-b^{\delta }\Vert _{l_{p+2}}^2\cdot \Vert Ax_k-b\Vert _{l_{p+2}}^p. \end{aligned}$$

Now considering all indices \(i_0,\ldots ,i_k\) as random variables and taking the full expectation on both sides, we can derive that

$$\begin{aligned} \begin{aligned} {\mathbb {E}}\left( D_f^{x_{k+1}^*}(x_{k+1},{\hat{x}})\right)&\le q^{k+1}\left( \lambda \Vert {\hat{x}}\Vert _1+\frac{1}{2} \Vert {\hat{x}}\Vert _2^2\right) +\frac{1}{2}\Vert b-b^{\delta }\Vert _{l_{p+2}}^2\sum _{i=0}^{k}q^{k-i} \frac{\Vert Ax_i-b\Vert _{l_{p+2}}^p}{\Vert Ax_i-b\Vert _{l_{p}}^p}. \end{aligned} \end{aligned}$$

According to the equivalence of vector norms in \({\mathbb {R}}^m\), there is a constant \(c>0\) such that for any vector \(z\in {\mathbb {R}}^m\) we have that

$$\begin{aligned} \Vert z\Vert _{l_{p+2}} \le c\Vert z\Vert _{l_{p}}. \end{aligned}$$

Thus,

$$\begin{aligned} \begin{aligned} {\mathbb {E}}(D_f^{x_{k}^*}(x_{k},{\hat{x}}))&\le q^k(\lambda \Vert {\hat{x}}\Vert _1+\frac{1}{2} \Vert {\hat{x}}\Vert _2^2)+\frac{1}{2}\Vert b-b^{\delta }\Vert _{l_{p+2}}^2\cdot \frac{c^pq}{1-q}. \end{aligned} \end{aligned}$$

Using \(\sqrt{u+v}\le \sqrt{u}+\sqrt{v}\) and the 1-strong convexity of f, we further deduce that

$$\begin{aligned} \begin{aligned} {\mathbb {E}}\Vert x_{k}-{\hat{x}}\Vert _2&\le q^{\frac{k}{2}}\sqrt{2\lambda \Vert {\hat{x}}\Vert _1+ \Vert {\hat{x}}\Vert _2^2}+ \delta \sqrt{ \frac{c^pq}{1-q}}. \end{aligned} \end{aligned}$$

(b) In the EWRaSK method, according to Example 1 we have \(x_{k+1}^*=x_{k+1}+\lambda \cdot s_{k+1}\) and \(x_{k}^*=x_{k}+\lambda \cdot s_{k}\), where \(\Vert s_k\Vert _{\infty },\Vert s_{k+1}\Vert _{\infty }\le 1\). The exact linesearch guarantees \(\langle x_{k+1},a_{i_k}\rangle =b_{i_k}^{\delta };\) thus,

$$\begin{aligned} \langle x_{k+1}^{*}-x_{k}^{*},x_k^{\delta }-{\hat{x}}\rangle&=\frac{b_{i_{k}}^{\delta }-b_{i_{k}}}{\Vert a_{i_{k}}\Vert _2^2}\left( \langle x_{k+1}-x_{k},a_{i_{k}}\rangle +\lambda \langle s_{k+1}-s_{k},a_{i_k}\rangle \right) \nonumber \\&\le \frac{(b_{i_{k}}^{\delta }-b_{i_{k}})^{2}}{\Vert a_{i_{k}}\Vert _2^{2}} -\frac{(b_{i_{k}}^{\delta }-b_{i_{k}})(\langle a_{i_{k}},x_{k}\rangle -b_{i_{k}})}{\Vert a_{i_{k}}\Vert _2^{2}}+ \frac{2\lambda |b_{i_{k}}^{\delta }-b_{i_{k}}|\cdot \Vert a_{i_{k}}\Vert _{1}}{\Vert a_{i_{k}}\Vert _{2}^{2}}. \end{aligned}$$
(B11)

Substituting (B10) and (B11) into (B8) and noting that \(\Vert a_{i_k}\Vert _2=1\), we derive

$$\begin{aligned} D_f^{x_{k+1}^*}(x_{k+1},{\hat{x}})&\le D_f^{x_{k}^*}(x_{k},{\hat{x}})-\frac{1}{2}(\langle a_{i_k},x_k\rangle -b_{i_k})^2 +\frac{1}{2}(b_{i_k}-b_{i_k}^{\delta })^2+ 2\lambda |b_{i_k}^{\delta }-b_{i_k}|\cdot \Vert a_{i_k}\Vert _1. \end{aligned}$$
(B12)

Using Hölder’s inequality, we obtain

$$\begin{aligned} 2\lambda \sum _{i=1}^{m}|b_{i}^{\delta }-b_{i}|\cdot \Vert a_{i}\Vert _1\cdot \frac{|\langle a_i,x_k\rangle -b_i|^p}{\Vert Ax_{k}-b\Vert _{l_p}^p} \le 2\lambda \Vert b^{\delta }-b\Vert _{l_{p+2}}\cdot \Vert A\Vert _{1,p+2}\cdot \frac{\Vert Ax_k-b\Vert _{l_{p+2}}^p}{\Vert Ax_k-b\Vert _{l_p}^p}. \end{aligned}$$
(B13)

Proceeding as in part (a), we get

$$\begin{aligned} \begin{aligned} {\mathbb {E}}[\Vert x_k-{\hat{x}}\Vert _2]&\le q^{\frac{k}{2}}\sqrt{2\lambda \Vert {\hat{x}}\Vert _1+\Vert {\hat{x}}\Vert _2^2}+\delta \sqrt{\left( 1+\frac{4\lambda \Vert A\Vert _{1,p+2}}{\delta }\right) \cdot \frac{c^pq}{1-q}}. \end{aligned} \end{aligned}$$

The proof is completed. \(\square \)
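To illustrate the noisy-case bound, here is a small self-contained experiment (with our own illustrative sizes, \(\lambda\), p, and noise level, not the paper’s setup): on a consistent overdetermined system with perturbed right-hand side \(b^{\delta }\), the WRaSK error decays and then stalls at a floor on the order of \(\delta \), as Theorem 2 predicts.

```python
import numpy as np

rng = np.random.default_rng(7)
m, n, lam, p, delta = 200, 50, 0.1, 2.0, 1e-3

A = rng.standard_normal((m, n))
A /= np.linalg.norm(A, axis=1, keepdims=True)        # normalize rows: ||a_i||_2 = 1
x_hat = np.zeros(n); x_hat[:5] = rng.standard_normal(5)   # sparse solution
b = A @ x_hat                    # consistent system; m > n, so x_hat is the unique solution
noise = rng.standard_normal(m)
b_delta = b + delta * noise / np.linalg.norm(noise)  # ||b - b_delta||_2 = delta

soft = lambda v: np.sign(v) * np.maximum(np.abs(v) - lam, 0.0)
x_dual = np.zeros(n)                                 # dual iterate x_k^*
x = soft(x_dual)
for k in range(30001):
    r = A @ x - b_delta                              # noisy residual
    w = np.abs(r) ** p
    i = rng.choice(m, p=w / w.sum())                 # weighted row selection
    x_dual -= r[i] * A[i]                            # inexact-step WRaSK update
    x = soft(x_dual)
    if k % 10000 == 0:
        print(k, np.linalg.norm(x - x_hat))          # decays, then plateaus at O(delta)
```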

Appendix C Proof of Lemma 4

Proof

First, we compute the derivative of the function f(x) as follows:

$$\begin{aligned} \begin{aligned} f'(x)&= \frac{1}{\left( \sum _{i=1}^{n}e^{d_ix}\right) ^2} \left[ \left( \sum _{i=1}^{n}d_ie^{2d_i}e^{d_ix}\right) \left( \sum _{j=1}^{n}e^{d_jx}\right) - \left( \sum _{i=1}^{n}e^{2d_i}e^{d_ix}\right) \left( \sum _{j=1}^{n}d_je^{d_jx}\right) \right] ,\\&= \frac{1}{\left( \sum _{i=1}^{n}e^{d_ix}\right) ^2} \left[ \sum _{i=1}^{n}\sum _{j=1}^{n}d_ie^{2d_i}e^{(d_i+d_j)x}- \sum _{i=1}^{n}\sum _{j=1}^{n}d_je^{2d_i}e^{(d_i+d_j)x}\right] ,\\&=\frac{1}{\left( \sum _{i=1}^{n}e^{d_ix}\right) ^2} \left[ \sum _{i=1}^{n}\sum _{j=1}^{n}(d_i-d_j)e^{2d_i}e^{(d_i+d_j)x}\right] . \end{aligned} \end{aligned}$$

Denote

$$\begin{aligned} h(x):= \sum _{i=1}^{n}\sum _{j=1}^{n}(d_i-d_j)e^{2d_i}e^{(d_i+d_j)x}. \end{aligned}$$

It follows that \(f^{'}(x)=\frac{h(x)}{\left( \sum _{i=1}^{n}e^{d_ix}\right) ^2}\). Let \(t_{ij}=e^{2d_i}e^{(d_i+d_j)x}\); then we have

$$\begin{aligned} h(x)=\sum _{i=1}^{n}\sum _{j=1}^{n}(d_i-d_j)t_{ij}. \end{aligned}$$
(C14)

Exchanging the roles of i and j, we obtain

$$\begin{aligned} h(x)=\sum _{i=1}^{n}\sum _{j=1}^{n}(d_j-d_i)t_{ji}. \end{aligned}$$
(C15)

Based on (C14) and (C15), we deduce that

$$\begin{aligned} \begin{aligned} 2h(x) = \sum _{i=1}^{n}\sum _{j=1}^{n}(d_i-d_j)(t_{ij}-t_{ji}) = \sum _{i=1}^{n}\sum _{j=1}^{n}(d_i-d_j)(e^{2d_i}-e^{2d_j})e^{(d_i+d_j)x}. \end{aligned} \end{aligned}$$

Since \(d_i-d_j\) and \(e^{2d_i}-e^{2d_j}\) always have the same sign, every summand above is nonnegative. It follows that \(h(x)\ge 0\) and hence \(f'(x)\ge 0\) for all \(x\in (0,\infty )\). Therefore, f(x) is a monotonically increasing function. The proof is completed. \(\square \)
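The monotonicity can also be checked numerically. The snippet below assumes, consistent with the derivative computed above, that \(f(x)=\sum _{i=1}^{n}e^{2d_i}e^{d_ix}/\sum _{j=1}^{n}e^{d_jx}\); the check itself and all names are ours.

```python
import numpy as np

rng = np.random.default_rng(3)
d = rng.uniform(-1.0, 1.0, size=6)   # arbitrary exponents d_i

def f(x):
    # f(x) = sum_i e^{2 d_i} e^{d_i x} / sum_j e^{d_j x}, per the derivative above
    return np.sum(np.exp(2 * d + d * x)) / np.sum(np.exp(d * x))

def h(x):
    # symmetrized numerator: sum_{i,j} (d_i - d_j) e^{2 d_i} e^{(d_i + d_j) x}
    return sum((di - dj) * np.exp(2 * di + (di + dj) * x)
               for di in d for dj in d)

xs = np.linspace(0.0, 5.0, 200)
assert np.all(np.diff([f(x) for x in xs]) >= -1e-12)  # f is nondecreasing on (0, inf)
assert all(h(x) >= -1e-12 for x in xs)                # h(x) >= 0, as the proof shows
```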

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.


About this article


Cite this article

Zhang, L., Yuan, Z., Wang, H. et al. A weighted randomized sparse Kaczmarz method for solving linear systems. Comp. Appl. Math. 41, 383 (2022). https://doi.org/10.1007/s40314-022-02105-9


  • DOI: https://doi.org/10.1007/s40314-022-02105-9
