Numerical Algorithms

, Volume 37, Issue 1–4, pp 101–112 | Cite as

Fast and Accurate Floating Point Summation with Application to Computational Geometry

  • James Demmel
  • Yozo Hida


We present several simple algorithms for accurately computing the sum of n floating point numbers using a wider accumulator. Let f and F be the number of significant bits in the summands and the accumulator, respectively. Then assuming gradual underflow, no overflow, and round-to-nearest arithmetic, up to ⌊2Ff/(1−2f)⌋+1 numbers can be accurately added by just summing the terms in decreasing order of exponents, yielding a sum correct to within about 1.5 units in the last place. In particular, if the sum is zero, it is computed exactly. We apply this result to the floating point formats in the IEEE floating point standard, and investigate its performance. Our results show that in the absence of massive cancellation (the most common case) the cost of guaranteed accuracy is about 30–40% more than the straightforward summation. If massive cancellation does occur, the cost of computing the accurate sum is about a factor of ten. Finally, we apply our algorithm in computing a robust geometric predicate (used in computational geometry), where our accurate summation algorithm improves the existing algorithm by a factor of two on a nearly coplanar set of points.

floating point summation rounding error analysis computational geometry robust geometric predicate 


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. [1]
    ANSI/IEEE, IEEE standard for binary floating point arithmetic, New York, Std 754–1985 edition (1985).Google Scholar
  2. [2]
    G. Bohlender, Floating point computation of functions with maximum accuracy, IEEE Trans. Comput. 26 (1977) 621–632.Google Scholar
  3. [3]
    T. Dekker, A floating point technique for extending the available precision, Numer. Math. 18 (1971) 224–242.Google Scholar
  4. [4]
    J. Demmel and Y. Hida, Accurate floating point summation, Computer Science Division Technical Report UCB//CSD–02–1180, University of California, Berkeley, submitted to SIAM J. Sci. Comput.Google Scholar
  5. [5]
    N.J. Higham, The accuracy of floating point summation, SIAMJ. Sci. Comput. 14(4) (1993) 783–799.Google Scholar
  6. [6]
    N.J. Higham, Accuracy and Stability of Numerical Algorithms (SIAM, Philadelphia, PA, 1996).Google Scholar
  7. [7]
    Intel Corporation, Intel Itanium architecture software developer's manual, Vol. 1. Intel Corporation (2002); Scholar
  8. [8]
    Intel Corporation, IA-32 Intel architecture software developer's manual, Vol. 1, Intel Corporation (2002); Scholar
  9. [9]
    W. Kahan, Doubled-precision IEEE standard 754 floating point arithmetic, manuscript (1987).Google Scholar
  10. [10]
    D. Knuth, The Art of Computer Programming, Vol. 2 (Addison-Wesley, Reading, MA, 1969).Google Scholar
  11. [11]
    U. Kulisch and G. Bohlender, Formalization and implementation of floating-point matrix operations, Computing 16 (1976) 239–261.Google Scholar
  12. [12]
    U. Kulisch and W.L. Miranker, Computer Arithmetic in Theory and Practice (Academic Press, New York, 1981).Google Scholar
  13. [13]
    [13] H. Leuprecht and W. Oberaigner, Parallel algorithms for the rounding exact summation of floating point numbers, Computing 28 (1982) 89–104.Google Scholar
  14. [14]
    S. Linnainmaa, Software for doubled-precision floating point computations, ACM Trans. Math. Software 7 (1981) 272–283.Google Scholar
  15. [15]
    M. Malcolm, On accurate floating-point summation, Comm. ACM 14(11) (1971) 731–736.Google Scholar
  16. [16]
    O. Møller, Quasi double precision in floating-point arithmetic, BIT 5 (1965) 37–50.Google Scholar
  17. [17]
    M. Pichat, Correction d'une somme en arithmétique à virgule flottante, Numer. Math. 19 (1972) 400–406.Google Scholar
  18. [18]
    D. Priest, Algorithms for arbitrary precision floating point arithmetic, in: Proc. of the 10th Symposium on Computer Arithmetic, eds. P. Kornerup and D. Matula, Grenoble, France, 26–28 June 1991 (IEEE Computer Soc. Press) pp. 132–145.Google Scholar
  19. [19]
    D. Priest, On properties of floating point arithmetics: Numerical stability and the cost of accurate computations, Ph.D. thesis, University of California at Berkeley (1992); available through anonymous FTP at Scholar
  20. [20]
    D.R. Ross, Reducing truncation errors using cascading accumulators, Comm. ACM 8(1) (1965) 32–33.Google Scholar
  21. [21]
    J.R. Shewchuk, Adaptive precision floating-point arithmetic and fast robust geometric predicates, Discrete Comput. Geometry 18(3) (1997) 305–363.Google Scholar
  22. [22]
    J.M. Wolfe, Reducing truncation errors by programming, Comm. ACM 7(6) (1964) 355–356.Google Scholar

Copyright information

© Kluwer Academic Publishers 2004

Authors and Affiliations

  • James Demmel
    • 1
  • Yozo Hida
    • 2
  1. 1.Computer Science Division and Mathematics DepartmentUniversity of CaliforniaBerkeleyUSA
  2. 2.Computer Science DivisionUniversity of CaliforniaBerkeleyUSA

Personalised recommendations