ACM Home Page
Please provide us with feedback. Feedback
On accurate floating-point summation
Full text PdfPdf (478 KB)
Source
Communications of the ACM archive
Volume 14 ,  Issue 11  (November 1971) table of contents
Pages: 731 - 736  
Year of Publication: 1971
ISSN:0001-0782
Author
Michael A. Malcolm  Stanford Univ., CA
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 8,   Downloads (12 Months): 37,   Citation Count: 6
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/362854.362889
What is a DOI?

ABSTRACT

cumulation of floating-point sums is considered on a computer which performs t-digit base &bgr; floating-point addition with exponents in the range —m to M. An algorithm is given for accurately summing n t-digit floating-point numbers. Each of these n numbers is split into q parts, forming q·n t-digit floating-point numbers. Each of these is then added to the appropriate one of &eegr; auxiliary t-digit accumulators. Finally, the accumulators are added together to yield the computed sum. In all, q·n + &eegr; - 1 t-digit floating-point additions are performed. Let &ngr; = ⌈(M + m + 1)/(&eegr; + 1)⌉. If n ≤ (1/q)&bgr;⌈((q-1)/q)t⌈-&ngr;+1 (*), then the relative error in the computed sum is at most ⌈(t + 1)/&ngr;&bgr;1-t. Further, with an additional q + &eegr; - 1 t-digit additions, the computed sum can be corrected to full t-digit accuracy. For example, for the IBM/360 (&bgr; = 16, t = 14, M = 63, m = 64), typical values for q and &eegr; are q = 2 and &eegr; = 32. In this case, (*) becomes n ≤ 1/2 × 164 = 32,768, and we have ⌈(t + 1)/&ngr;&bgr;1-t = 4 × 16-13.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
2
 
3
 
4
Jordan, D.F. ANL F154S-DOTP, extra-precision accumulating inner product. Argonne Nat. Lab., Appl. Math. Div., System/360 Lib. Subroutine, Argonne, Ill., Nov. 1967.
 
5
Moller, Ole. Quasi double-precision in floating point addition, BIT 5 (1965), 37-50.
 
6