ACM Home Page
Please provide us with feedback. Feedback
QR factorization with Morton-ordered quadtree matrices for memory re-use and parallelism
Full text PdfPdf (252 KB)
Source Principles and Practice of Parallel Programming archive
Proceedings of the ninth ACM SIGPLAN symposium on Principles and practice of parallel programming table of contents
San Diego, California, USA
SESSION: Parallel matrix computations table of contents
Pages: 144 - 154  
Year of Publication: 2003
ISBN:1-58113-588-2
Also published in ...
Authors
Jeremy D. Frens  Calvin College, Grand Rapids, MI
David S. Wise  Indiana University, Bloomington, IN
Sponsors
SIGPLAN: ACM Special Interest Group on Programming Languages
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 20,   Downloads (12 Months): 72,   Citation Count: 4
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/781498.781525
What is a DOI?

ABSTRACT

Quadtree matrices using Morton-order storage provide natural blocking on every level of a memory hierarchy. Writing the natural recursive algorithms to take advantage of this blocking results in code that honors the memory hierarchy without the need for transforming the code. Furthermore, the divide-and-conquer algorithm breaks problems down into independent computations. These independent computations can be dispatched in parallel for straightforward parallel processing.Proof-of-concept is given by an algorithm for QR factorization based on Givens rotations for quadtree matrices in Morton-order storage. The algorithms deliver positive results, competing with and even beating the LAPACK equivalent.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
2
 
3
4
 
5
6
 
7
8
 
9
M. Cole. Parallel Software Designs, chapter~1, pages 1--25. In Kronsjö and Shumsheruddin {19}, 1992.
10
 
11
E. Elmroth and F. Gustavson. Applying recursion to serial and parallel QR factorization leads to better performance. IBM J. Res. Develop., 44(4):605--624, July 2000. http://www.research.ibm.com/journal/rd/444/elmroth.html
 
12
13
 
14
 
15
 
16
17
 
18
 
19
 
20
 
21
 
22
Silicon Graphics, Inc. R8000 microprocessor chip set. Technical report, Silicon Graphics, Inc., 1994.
 
23
 
24
 
25
26
 
27


Collaborative Colleagues:
Jeremy D. Frens: colleagues
David S. Wise: colleagues