Error estimates for the summation of real numbers with application to floating-point summation

Lange, Marko; Rump, Siegfried M.

doi:10.1007/s10543-017-0658-9

Error estimates for the summation of real numbers with application to floating-point summation

Published: 03 May 2017

Volume 57, pages 927–941, (2017)
Cite this article

BIT Numerical Mathematics Aims and scope Submit manuscript

Marko Lange¹ &
Siegfried M. Rump^1,2

276 Accesses
10 Citations
Explore all metrics

Abstract

Standard Wilkinson-type error estimates of floating-point algorithms involve a factor \(\gamma _k:=k\mathbf {u}/(1-k\mathbf {u})\) for \(\mathbf {u}\) denoting the relative rounding error unit of a floating-point number system. Recently, it was shown that, for many standard algorithms such as matrix multiplication, LU- or Cholesky decomposition, \(\gamma _k\) can be replaced by \(k\mathbf {u}\), and the restriction on k can be removed. However, the arguments make heavy use of specific properties of both the underlying set of floating-point numbers and the corresponding arithmetic. In this paper, we derive error estimates for the summation of real numbers where each sum is afflicted with some perturbation. Recent results on floating-point summation follow as a corollary, in particular error estimates for rounding to nearest and for directed rounding. Our new estimates are sharp and unveil the necessary properties of floating-point schemes to allow for a priori estimates of summation with a factor omitting higher order terms.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

New Fast Methods To Compute The Number Of Primes Smaller Than A Given Value

Article 01 February 2023

Linear fractional transformations and nonlinear leaping convergents of some continued fractions

Article 29 January 2020

Partition-theoretic Frobenius-type limit formulas

Article 12 April 2024

Notes

Note that the relative error \(\varepsilon _1\) with respect to the true result is, in fact, bounded by \(\tfrac{\mathbf {u}}{1+\mathbf {u}}\), see (3.3).

References

ANSI/IEEE 754-1985: IEEE Standard for Binary Floating-Point Arithmetic. New York (1985)
ANSI/IEEE 754-2008: IEEE Standard for Floating-Point Arithmetic. New York (2008)
Higham, N.J.: Accuracy and Stability of Numerical Algorithms, 2nd edn. SIAM, Philadelphia (2002)
Book MATH Google Scholar
Jeannerod, C.-P., Rump, S.M.: Improved error bounds for inner products in floating-point arithmetic. SIAM J. Matrix Anal. Appl. (SIMAX) 34(2), 338–344 (2013)
Article MathSciNet MATH Google Scholar
Jeannerod, C.-P., Rump, S.M.: On relative errors of floating-point operations: optimal bounds and applications. Preprint (2014)
Knuth, D.E.: The Art of Computer Programming: Seminumerical Algorithms, vol. 2, 3rd edn. Addison Wesley, Reading (1998)
MATH Google Scholar
Ozaki, K., Ogita, T., Bünger, F., Oishi, S.: Accelerating interval matrix multiplication by mixed precision arithmetic. Nonlinear Theory Appl. IEICE 6(3), 364–376 (2015)
Article Google Scholar
Rump, S.M.: Error estimation of floating-point summation and dot product. BIT Numer. Math. 52(1), 201–220 (2012)
Article MathSciNet MATH Google Scholar
Rump, S.M., Jeannerod, C.-P.: Improved backward error bounds for LU and Cholesky factorizations. SIAM J. Matrix Anal. Appl. (SIMAX) 35(2), 684–698 (2014)
Article MathSciNet MATH Google Scholar
Rump, S.M., Lange, M.: On the definition of unit roundoff. BIT Numer. Math. 56(1), 309–317 (2015)
Article MathSciNet MATH Google Scholar

Download references

Acknowledgements

The authors would like to thank the anonymous referees for their valuable suggestions which helped us improving this paper.

Author information

Authors and Affiliations

Faculty of Science and Engineering, Waseda University, 3-4-1 Okubo, Shinjuku-ku, Tokyo, 169-8555, Japan
Marko Lange & Siegfried M. Rump
Institute for Reliable Computing, Hamburg University of Technology, Am Schwarzenberg-Campus 1, 21071, Hamburg, Germany
Siegfried M. Rump

Authors

Marko Lange
View author publications
You can also search for this author in PubMed Google Scholar
Siegfried M. Rump
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Marko Lange.

Additional information

Communicated by Lars Eldén.

This research was partially supported by CREST, Japan Science and Technology Agency (JST).

Rights and permissions

Reprints and permissions

About this article

Cite this article

Lange, M., Rump, S.M. Error estimates for the summation of real numbers with application to floating-point summation. Bit Numer Math 57, 927–941 (2017). https://doi.org/10.1007/s10543-017-0658-9

Download citation

Received: 08 October 2016
Accepted: 21 April 2017
Published: 03 May 2017
Issue Date: September 2017
DOI: https://doi.org/10.1007/s10543-017-0658-9

Keywords

Mathematics Subject Classification

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Error estimates for the summation of real numbers with application to floating-point summation

Abstract

Access this article

Similar content being viewed by others

New Fast Methods To Compute The Number Of Primes Smaller Than A Given Value

Linear fractional transformations and nonlinear leaping convergents of some continued fractions

Partition-theoretic Frobenius-type limit formulas

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Keywords

Mathematics Subject Classification

Navigation

Error estimates for the summation of real numbers with application to floating-point summation

Abstract

Access this article

Similar content being viewed by others

New Fast Methods To Compute The Number Of Primes Smaller Than A Given Value

Linear fractional transformations and nonlinear leaping convergents of some continued fractions

Partition-theoretic Frobenius-type limit formulas

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Mathematics Subject Classification

Search

Navigation