1 Previous & Related Work

The Kaczmarz projection method [1, 2] is a robust and computationally efficient algorithm (an alternative to the least squares approach) for estimation of the frequency content of oscillating signals in a number of signal processing and system identification applications. In the Kaczmarz algorithm \( \displaystyle \theta _k = \theta _{k-1} - \frac{\varphi _k}{\varphi ^T_k ~ \varphi _k} ~ [ \varphi ^T_k~ \theta _{k-1} - y_k ]\) the output of the model \(\varphi ^T_k~ \theta _{k}\), obtained by multiplying the parameter vector \(\theta _k\) by the regressor \(\varphi ^T_k\), matches exactly the measured signal, \(y_k = \varphi ^T_k~ \theta _{k}\), in each discrete step k. The algorithm has the remarkable orthogonality property \(\varphi _k^T ~ \tilde{\theta }_k = 0 \): the vector of the parameter mismatch \(\tilde{\theta }_k = \theta _k - \theta _*\) (the difference between the adjustable and true parameters \(\theta _*\)) is orthogonal to the regressor vector. The square of the length of the harmonic regressor (which consists of trigonometric functions at different frequencies), \(\varphi ^T_k ~ \varphi _k\), is constant and depends on the number of frequencies only. Kaczmarz algorithms were successfully tested in different estimation problems in automotive applications, [3].
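As a minimal numerical sketch (the dimensions and values below are made-up examples, not from the paper), one classical Kaczmarz step exhibits both the exact matching and the orthogonality property:

```python
import numpy as np

# One step of the classical Kaczmarz projection update
# theta_k = theta_{k-1} - phi_k (phi_k^T theta_{k-1} - y_k) / (phi_k^T phi_k)
rng = np.random.default_rng(0)
theta_star = rng.normal(size=4)            # true parameters theta_*
theta = np.zeros(4)                        # adjustable parameters
phi = rng.normal(size=4)                   # regressor phi_k
y = phi @ theta_star                       # measured signal y_k

theta = theta - phi * (phi @ theta - y) / (phi @ phi)

assert abs(phi @ theta - y) < 1e-9             # exact matching in step k
assert abs(phi @ (theta - theta_star)) < 1e-9  # phi_k^T theta_tilde_k = 0
```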

Aiming at improvement of the convergence rate of the Kaczmarz projection method, a gain update algorithm was introduced in [4]. Further development was associated with the introduction of the forgetting factor, which has a significant impact on the estimation performance: it discounts old measurements exponentially in the gain update and creates a virtual moving window, [5].

The choice of the forgetting factor is associated with the trade-off between rapidity and accuracy of estimation. For further improvements, the gain matrix with forgetting factor is associated in this paper with a window which is moving in time. Introduction of the forgetting factor in the sliding window, [6, 7] creates an extended forgetting mechanism that includes both exponential and instantaneous forgetting and provides new opportunities for achieving the trade-off between rapidity and accuracy.

Table 1 Comparison of Kaczmarz Algorithms with Rank Two and Rank One Updates

The movement of the window is associated with data updating and downdating, which results in recursive updates of the information matrix; these can occur sequentially, [3, 8] or simultaneously, [9]. Sequential updating and downdating results in computationally complex algorithms. The gain update in known Kaczmarz algorithms, [4, 5] is associated with a recursive rank one update (similar to recursive least squares algorithms, [10, 11]), whereas simultaneous updating/downdating is associated with a computationally efficient rank two update, [12].

A new gain update, which is explicitly associated with the movement of a sliding window of size w with exponential forgetting, is introduced in Kaczmarz algorithms. The parameter update law extends the orthogonality property: the output of the model matches the measured data at the two end points of the weighted window. The development is performed and systematically associated with well-known Kaczmarz algorithms with rank one update, see Table 1 and [5].

It is shown that the gain matrix \(\Gamma _k\) converges to the inverse of the information matrix and the parameters converge to their true values, which is a new property of Kaczmarz algorithms with rank two update discovered in this paper, see Section 4.

The response time of the estimation algorithms is restricted by the choice of the window size. A short window (which is associated with a fast response) implies ill-conditioning of the information matrix, which results in numerical sensitivity and error accumulation. The new convergence properties discovered in this paper allow initialization with an approximate inverse and subsequent convergence to the inverse of the information matrix, see Section 4. This property opens new opportunities for performance improvement in the ill-conditioned case, where the difficulties are associated with the matrix inversion. In addition, Newton-Schulz and Richardson algorithms can be applied for improvement of the transient performance, [12, 13].

The performance of new algorithms is examined in the problem of estimation of the grid events in the presence of significant harmonic emissions, see Fig. 3 and [12,13,14].

The paper is organized as follows. New Kaczmarz algorithms with rank two update are introduced in Section 2. The new algorithms are systematically associated with algorithms with rank one update in Section 3 and Table 1. Convergence properties of the new algorithms, which are the main contribution of this paper, are presented in Section 4. Simulation results which show the advantages of the proposed algorithms are presented in Section 5, and finally Section 6 outlines brief conclusions and directions for further research.

2 Moving Window with Exponential Forgetting

Suppose that the measured oscillating signal is presented in the following form \({y}_k = \varphi _k^{T} \theta _* \), where \(\varphi _k\) is the harmonic regressor, \(\varphi _k^{T} = [ cos( q_0 k ) ~ sin(q_0 k ) ~...~ cos( q_h k ) ~ sin( q_h k ) ]\), \(q_0,...q_h\) are the frequencies and \(\theta _* \) is the vector of unknown parameters, \(k = 1,2,...\).
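A small sketch of this signal model (the frequencies and true parameters below are made-up examples) also illustrates the property, noted in Section 1, that \(\varphi ^T_k ~ \varphi _k\) equals the number of frequencies:

```python
import numpy as np

# Harmonic regressor phi_k^T = [cos(q_0 k) sin(q_0 k) ... cos(q_h k) sin(q_h k)]
def harmonic_regressor(q, k):
    return np.concatenate([[np.cos(qi * k), np.sin(qi * k)] for qi in q])

q = [0.1, 0.3, 0.7]                 # frequencies q_0, q_1, q_2 (examples)
theta_star = np.arange(1.0, 7.0)    # true parameters theta_* (examples)
phi = harmonic_regressor(q, 5)      # regressor at step k = 5
y = phi @ theta_star                # measurement y_k = phi_k^T theta_*

# Squared length of the harmonic regressor equals the number of
# frequencies, since cos^2 + sin^2 = 1 for each frequency:
assert np.isclose(phi @ phi, len(q))
```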

The window of size w which is moving in time can be presented as a rank two update of the information matrix (defined as the sum of the outer products of the regressor vectors), where a new observation is added (updating) and an old observation is deleted (downdating). The estimation performance is highly influenced by the forgetting factor \( 0 < \lambda \le 1\), which discounts old measurements exponentially and creates a virtual window inside the moving window. In other words, the new data \(\varphi _{k}\) (with the largest forgetting factor, which is equal to one) enter the window and the data with the lowest priority \( \tilde{\varphi }_{k-w} = \sqrt{\lambda ^w}~\varphi _{k-w}\) leave the window in step k as follows, [12]:

$$\begin{aligned} \begin{aligned} A_k&= \sum _{j = k -(w-1)}^{j=k} \lambda ^{k-j} \varphi _{j}~ \varphi _{j}^{T} \\&= \lambda ~A_{k-1} + Q_k~D~Q^{T}_k \end{aligned} \end{aligned}$$
(1)

where \(Q_k = [\varphi _{k} ~ \tilde{\varphi }_{k-w}]\), \( D = diag[ 1, -1] \), \(k \ge w + 1\).
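The identity (1) can be checked numerically. The sketch below uses a generic random regressor sequence with made-up dimensions, since the identity itself does not depend on the harmonic structure of the regressor:

```python
import numpy as np

# Check A_k = lambda A_{k-1} + Q_k D Q_k^T with Q_k = [phi_k  sqrt(lam^w) phi_{k-w}]
rng = np.random.default_rng(1)
n, w, lam, k = 4, 6, 0.9, 10
phis = {j: rng.normal(size=n) for j in range(1, k + 1)}

def info_matrix(k):
    # A_k = sum_{j = k-(w-1)}^{k} lambda^{k-j} phi_j phi_j^T
    return sum(lam ** (k - j) * np.outer(phis[j], phis[j])
               for j in range(k - (w - 1), k + 1))

A_prev, A = info_matrix(k - 1), info_matrix(k)
Q = np.column_stack([phis[k], np.sqrt(lam ** w) * phis[k - w]])
D = np.diag([1.0, -1.0])
assert np.allclose(A, lam * A_prev + Q @ D @ Q.T)   # identity (1)
```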

The gain matrix (the inverse of \(A_k\)), which is derived by application of the matrix inversion lemma, [15] to the identity (1), and the parameter update law can be written in the following form, [12]:

$$\begin{aligned} \Gamma _{k} = \frac{1}{\lambda } ~ [~ \Gamma _{k-1} - \Gamma _{k-1} ~ Q_k ~ S^{-1} ~ Q^T_k ~ \Gamma _{k-1}~] \end{aligned}$$
(2)
$$\begin{aligned} \begin{aligned} \theta _k =&\ \theta _{k-1} \\&- \Gamma _{k-1} ~ Q_k ~ [Q^T_k ~ \Gamma _{k-1} ~ Q_k]^{-1} ~ [ Q^T_k~ \theta _{k-1} - \tilde{y}_k ] \end{aligned} \end{aligned}$$
(3)

where \( S = \lambda ~D + Q^T_k ~ \Gamma _{k-1} ~ Q_k \) and \(\tilde{y}^T_k = [y_k~ \sqrt{\lambda ^w}y_{k-w}]\) is the synthetic output, provided that the matrix \( Q^T_k ~ \Gamma _{k-1} ~ Q_k \) is invertible. The vector of adjustable parameters \(\theta _k\) in (3) is constructed so that the output of the model matches exactly the measured signal, \(Q^T_k~ \theta _{k} = \tilde{y}_k\) (see Sections 1 and 3), and the following extended orthogonality property \(Q^T_k~\tilde{\theta }_k = 0 \) is valid for the parameter mismatch \(\tilde{\theta }_k = \theta _{k} - \theta _*\).
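As a numerical sketch of one step of (2), (3) (synthetic regressors and values, not from the paper): starting from \(\Gamma _{k-1} = A^{-1}_{k-1}\), the update (2) reproduces \(A^{-1}_k\), and the update (3) enforces the matching property \(Q^T_k~\theta _k = \tilde{y}_k\):

```python
import numpy as np

rng = np.random.default_rng(2)
n, w, lam = 4, 6, 0.9
phis = [rng.normal(size=n) for _ in range(12)]
k = 10
A_prev = sum(lam ** (k - 1 - j) * np.outer(phis[j], phis[j])
             for j in range(k - w, k))                 # A_{k-1}
Q = np.column_stack([phis[k], np.sqrt(lam ** w) * phis[k - w]])
D = np.diag([1.0, -1.0])
A = lam * A_prev + Q @ D @ Q.T                         # A_k via (1)

theta_star = rng.normal(size=n)
y_t = Q.T @ theta_star                                 # synthetic output y~_k

Gamma_prev = np.linalg.inv(A_prev)                     # Gamma_{k-1} = A_{k-1}^{-1}
S = lam * D + Q.T @ Gamma_prev @ Q
Gamma = (Gamma_prev
         - Gamma_prev @ Q @ np.linalg.inv(S) @ Q.T @ Gamma_prev) / lam   # (2)

theta_prev = rng.normal(size=n)
M = np.linalg.inv(Q.T @ Gamma_prev @ Q)
theta = theta_prev - Gamma_prev @ Q @ M @ (Q.T @ theta_prev - y_t)       # (3)

assert np.allclose(Gamma, np.linalg.inv(A))   # gain (2) reproduces A_k^{-1}
assert np.allclose(Q.T @ theta, y_t)          # matching at both window ends
```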

3 Kaczmarz Algorithms with Rank One Update

Introduction of the forgetting factor allows a relationship to be established between Kaczmarz algorithms with rank two and rank one updates. Notice that \(Q_k\) gets the form \(Q_k = [\varphi _{k} ~ 0]\) without downdating when \(\lambda ^w \rightarrow 0\). Straightforward substitution of \(Q_k\) defined above in (2), (3) yields the following well-known Kaczmarz algorithms, [4, 5]:

$$\begin{aligned} \Gamma _{k}&= \frac{1}{\lambda } ~ [~ \Gamma _{k-1} - \frac{\Gamma _{k-1} ~ \varphi _k ~ \varphi ^T_k ~ \Gamma _{k-1}~}{ \lambda + \varphi ^T_k ~ \Gamma _{k-1} \varphi _k}~] \end{aligned}$$
(4)
$$\begin{aligned} \theta _k&= \theta _{k-1} - \frac{\Gamma _{k-1} ~ \varphi _k }{ \varphi ^T_k ~ \Gamma _{k-1} \varphi _k}~~[\varphi ^T_k \theta _{k-1} - y_k] \end{aligned}$$
(5)

with the orthogonality property \(\varphi ^T_k (\theta _k - \theta _*) = 0\). Comparison of the algorithms (2), (3) and (4), (5) and the error models is presented in Table 1.
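The rank one recursions (4), (5) can be sketched as follows (a synthetic harmonic regressor with made-up frequencies and parameters); the orthogonality property holds at every step:

```python
import numpy as np

rng = np.random.default_rng(3)
q = [0.1, 0.3]                          # example frequencies
n = 2 * len(q)
theta_star = rng.normal(size=n)         # true parameters theta_*
lam = 0.9
Gamma = np.eye(n)                       # initial gain matrix
theta = np.zeros(n)                     # initial parameter estimate

for k in range(1, 50):
    phi = np.concatenate([[np.cos(qi * k), np.sin(qi * k)] for qi in q])
    y = phi @ theta_star
    # Parameter update (5), using Gamma_{k-1}:
    theta = theta - Gamma @ phi * (phi @ theta - y) / (phi @ Gamma @ phi)
    # Gain update (4):
    Gamma = (Gamma - np.outer(Gamma @ phi, phi @ Gamma)
             / (lam + phi @ Gamma @ phi)) / lam
    # Orthogonality property phi_k^T (theta_k - theta_*) = 0:
    assert abs(phi @ (theta - theta_star)) < 1e-6
```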

The performance of estimation can be further improved using remarkable convergence properties of the algorithms described in the next Section.

Figure 1

The spectral radius of the matrix inversion error \( E_k = I - \Gamma _k A_k\) is plotted in (a) for the Kaczmarz algorithm with rank two update and window size \(w = 30\) with a solid yellow line for \(\lambda = 0.96\) and a red dashed line for \(\lambda =0.9\). The spectral radius for the Kaczmarz algorithm with rank one update is plotted with a dashdot cyan line for \(\lambda = 0.96\) and a dotted blue line for \(\lambda =0.9\). The norms of the parameter errors \( \Vert \theta _k - \theta _* \Vert \) are plotted with the same lines and forgetting factors in (b).

Figure 2

Rank accumulation for information matrices with harmonic regressor (the full rank of the information matrix is equal to 16), where \(\sum \limits _{k=1}^{k=19} Q_k Q_k^T \) and \(\sum \limits _{k=1}^{k=19} \varphi _k \varphi _k^T\) are associated with Kaczmarz algorithms with rank two and rank one updates (red dashed and dotted blue lines, respectively).

Figure 3

The amplitudes estimated with the Kaczmarz algorithms with rank two (2), (3) and rank one (4), (5) updates are plotted with red dashed and dotted blue lines, respectively, for the small window size \(w = 18\) and forgetting factor \(\lambda = 0.55\) in the presence of measurement noise. The actual amplitudes are plotted with the yellow line. The estimated amplitudes at the frequencies 50 Hz and 150 Hz are plotted in (a) and (b), respectively.

4 Convergence of Matrix Inversion and Parameter Errors

Error Models

For system (1)–(3) the following error models are valid:

$$\begin{aligned} E_k = ( I - \Gamma _{k-1} ~ Q_k ~ S^{-1} ~ Q^T_k)~ E_{k-1} \end{aligned}$$
(6)
$$\begin{aligned} \tilde{\theta }_k = ( I - \Gamma _{k-1} ~ Q_k ~ [Q^T_k ~ \Gamma _{k-1} ~ Q_k]^{-1} ~ Q^T_k ) ~ \tilde{\theta }_{k-1} \end{aligned}$$
(7)

where \(E_k = I - \Gamma _{k}~ A_k\) and \(\tilde{\theta }_k = \theta _k - \theta _* \) are the matrix inversion and parameter estimation errors (the same matrix inversion error is used in Newton-Schulz algorithms, [13]) and I is the identity matrix.
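The error model (6) holds for an arbitrary (imperfect) gain \(\Gamma _{k-1}\), which can be verified numerically; the sketch below uses made-up random regressors and deliberately perturbs the initial inverse so that \(E_{k-1} \ne 0\):

```python
import numpy as np

rng = np.random.default_rng(4)
n, w, lam = 4, 6, 0.9
phis = [rng.normal(size=n) for _ in range(12)]
k = 10
A_prev = sum(lam ** (k - 1 - j) * np.outer(phis[j], phis[j])
             for j in range(k - w, k))                 # A_{k-1}
Q = np.column_stack([phis[k], np.sqrt(lam ** w) * phis[k - w]])
D = np.diag([1.0, -1.0])
A = lam * A_prev + Q @ D @ Q.T                         # A_k via (1)

Gamma_prev = np.linalg.inv(A_prev) + 0.05 * np.eye(n)  # imperfect inverse
S = lam * D + Q.T @ Gamma_prev @ Q
Gamma = (Gamma_prev
         - Gamma_prev @ Q @ np.linalg.inv(S) @ Q.T @ Gamma_prev) / lam  # (2)

E_prev = np.eye(n) - Gamma_prev @ A_prev               # E_{k-1}
E = np.eye(n) - Gamma @ A                              # E_k
# Error model (6): E_k = (I - Gamma_{k-1} Q S^{-1} Q^T) E_{k-1}
H = np.eye(n) - Gamma_prev @ Q @ np.linalg.inv(S) @ Q.T
assert np.allclose(E, H @ E_prev)
```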

Convergence of Matrix Inversion

The matrices \(I - \Gamma _{k-1} ~ Q_k S^{-1} ~ Q^T_k\) can be associated with generalized Householder matrices, and the explicit solution for \(E_k\) can be associated with an aggregated Householder transformation (a product of Householder matrices). Therefore the convergence of the matrix inversion error for systems with harmonic regressor can be established using arguments similar to [16,17,18].

Notice that \(Q_k\) gets the form \(Q_k = [\varphi _{k} ~ 0]\) for a sufficiently small forgetting factor and a large window size, i.e. if \(\lambda ^w \rightarrow 0\). The Kaczmarz algorithm with rank two update becomes the Kaczmarz algorithm with rank one update and the inversion error (6) gets the form: \( \displaystyle E_k = ( I - \frac{\Gamma _{k-1} \varphi _k \varphi ^T_k}{\lambda + \varphi ^T_k \Gamma _{k-1} \varphi _k}) E_{k-1} \), see Table 1. Multiplying this identity by \(\varphi _k \varphi ^T_k\) and taking the sum (under the assumption that \(E_k \approx E_{k-1}\)) yields \(\displaystyle \sum _{k=r}^{r+w} \varphi _k \varphi ^T_k ~ E_k = \sum _{k=r}^{r+w} \frac{\lambda }{\lambda + \varphi ^T_k \Gamma _{k-1} \varphi _k}~\varphi _k \varphi ^T_k ~ E_{k-1}\). Thus the convergence of the matrix inversion error can be established for systems with harmonic regressor, [19].

Comparison of the matrix inversion and parameter convergence rates as a function of the forgetting factors for rank one and rank two updates is presented in Fig. 1. Figure 1(a) shows that Kaczmarz algorithm with rank one update provides faster convergence of the inversion error for larger forgetting factors and Fig. 1(b) shows faster convergence of the parameter error for Kaczmarz algorithm with rank two update.

Parameter Convergence

The convergence of the parameter vector can be proved using the Lyapunov function candidate \(V_k = \tilde{\theta }_k^T ~ A_k ~ \tilde{\theta }_k\) for the symmetric and positive definite matrix \(A_k\). Taking into account the orthogonality property \(Q_k^{T}~\tilde{\theta }_k = 0\), evaluation of the first difference \(V_k - V_{k-1}\) yields:

$$\begin{aligned} \begin{aligned} V_k - V_{k-1} =&\ \lambda ~ \tilde{\theta }^T_{k} ~ A_{k-1} ~ \tilde{\theta }_{k-1} - V_{k-1} \\&+ \lambda ~\tilde{\theta }^T_k (I-A_{k-1}\Gamma _{k-1})Q_k [Q^T_k \Gamma _{k-1} Q_k]^{-1} Q^T_k \tilde{\theta }_{k-1} \\ =&- \lambda ~\tilde{\theta }^T_{k-1}~ Q_k ~[Q^T_k ~\Gamma _{k-1} ~Q_k]^{-1} ~Q^T_k ~ \tilde{\theta }_{k-1}\\&- (1 - \lambda ) V_{k-1} + \lambda ~(\tilde{\theta }_k + \tilde{\theta }_{k-1})^T ~(I-A_{k-1}\Gamma _{k-1}) \\&\quad Q_k ~[Q^T_k ~\Gamma _{k-1} ~ Q_k]^{-1} ~ Q^T_k ~ \tilde{\theta }_{k-1} \end{aligned} \end{aligned}$$
(8)

Notice that the matrix inversion error \((I - A_{k-1}\Gamma _{k-1})\) is vanishing, see Fig. 1(a), and there exist positive constants c, \(\lambda _* < 1\) and \(\rho < 1\) such that \(V_k \le \lambda _* V_{k-1} + c ~ \rho ^k\) holds and the bounds

$$\begin{aligned} V_k \le \lambda _*^{k} ~ V_0 + c ~ \rho ~ \frac{\lambda ^k_* - \rho ^{k}}{\lambda _* - \rho } \end{aligned}$$
(9)
$$\begin{aligned} V_k \le \lambda _*^{k} ~ V_0 + c ~ k ~ \lambda ^k_* \end{aligned}$$
(10)

are valid for \(\lambda _* \ne \rho \) and \(\lambda _* = \rho \) respectively. Finally, the following bound is true for the parameter error: \(\displaystyle \Vert \tilde{\theta }_k \Vert \le \sqrt{\frac{V_k}{\lambda _{min}~(A_k) }} \).
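The bound (9) can be obtained by iterating the inequality \(V_k \le \lambda _* V_{k-1} + c ~ \rho ^k\) and summing the resulting geometric series:

$$\begin{aligned} V_k \le \lambda _*^{k} ~ V_0 + c \sum _{j=1}^{k} \lambda _*^{k-j} ~ \rho ^{j} = \lambda _*^{k} ~ V_0 + c ~ \rho ~ \frac{\lambda ^k_* - \rho ^{k}}{\lambda _* - \rho } \end{aligned}$$

For \(\lambda _* = \rho \) every term of the sum equals \(\lambda _*^{k}\), so the sum reduces to \(k ~ \lambda ^k_*\), which gives (10).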

The orthogonality property \(\tilde{\theta }^T_k ~Q_k = 0\) can also be used directly to establish the parameter convergence. Multiplying this identity by \(Q_k^T\) and taking the sum yields \(\sum _{k=r}^{r+w} \tilde{\theta }^T_k ~ Q_k ~ Q_k^T = 0\). Assuming that the vector of adjustable parameters is constant (after some transient), the parameter convergence can be established for a persistently exciting regressor. The summation above is associated with rank accumulation using a rank two increment. Figure 2 shows that the Kaczmarz algorithm with rank one update accumulates the rank more slowly. Using arguments similar to [19] it can also be shown that the estimates are unbiased in the presence of noise.
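The rank accumulation comparison behind Fig. 2 can be sketched as follows; the frequencies, window size and forgetting factor below are made-up examples. Each term \(Q_k Q_k^T\) can add rank two per step, while \(\varphi _k \varphi _k^T\) adds at most rank one, so the rank two sum reaches full rank in fewer steps:

```python
import numpy as np

q = 0.2 * np.arange(1, 9)          # 8 frequencies -> full rank 16
w, lam = 18, 0.9
n = 2 * len(q)

def phi(k):
    return np.concatenate([[np.cos(qi * k), np.sin(qi * k)] for qi in q])

S1, S2 = np.zeros((n, n)), np.zeros((n, n))
steps1 = steps2 = None             # steps to reach full rank
for k in range(1, 40):
    S1 += np.outer(phi(k), phi(k))                               # rank one
    Q = np.column_stack([phi(k), np.sqrt(lam ** w) * phi(k - w)])
    S2 += Q @ Q.T                                                # rank two
    if steps2 is None and np.linalg.matrix_rank(S2) == n:
        steps2 = k
    if steps1 is None and np.linalg.matrix_rank(S1) == n:
        steps1 = k

assert steps1 is not None and steps2 is not None
assert steps2 < steps1             # rank two accumulates the rank faster
```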

5 Simulations

New algorithms were tested for estimation of the swell signature in the electricity network. The swell event is associated with a momentary increase of the voltage that usually occurs when a large load turns off in a power system. A voltage signal which contains the fundamental frequency of 50 Hz and seven higher harmonics was simulated. The swell events are modeled as a momentary increase of the amplitudes of the first and the third harmonics, see the yellow lines in Fig. 3. The transient response of the Kaczmarz algorithm with rank two update (2), (3) is compared to the response of the Kaczmarz algorithm with rank one update (4), (5) in the presence of measurement noise for the same forgetting factor \(\lambda = 0.55\) in a short sliding window. The figure shows that the Kaczmarz algorithm with rank two update provides faster estimates at the fundamental frequency for the same forgetting factor.

6 Conclusion

The basis for the new Kaczmarz algorithms is the gain matrix with rank two update, which allows exponential weighting of the data inside the moving window, prioritizes recent measurements and improves the estimation performance for fast variations of the signal. The new Kaczmarz adjustment law has an extended orthogonality property in the sliding window and showed improvement of the estimation performance in the problem of detection of grid events. The enhancement of the performance was achieved via a proper choice of two adjustable parameters (a small window size and forgetting factor) associated with fast forgetting.

New Kaczmarz algorithms with rank two update were systematically associated in this paper with well-known Kaczmarz algorithms with rank one update, see Table 1. The parameter convergence was proved using the Lyapunov method, and the convergence of the inverse of the information matrix can be used for further improvement of the performance.

The results are valid under the assumption of invertibility of the matrix \(Q^T_k ~ \Gamma _{k-1} ~ Q_k\), which can be seen as a drawback of the proposed algorithm. For the sake of robustness, the Kaczmarz algorithms with rank two update can be modified, similarly to modifications of the well-known Kaczmarz algorithm, [1].