| MOB forms: a class of multilevel block algorithms for dense linear algebra operations |
| Full text |
Pdf
(1.00 MB)
|
| Source
|
International Conference on Supercomputing
archive
Proceedings of the 8th international conference on Supercomputing
table of contents
Manchester, England
Pages: 354 - 363
Year of Publication: 1994
ISBN:0-89791-665-4
|
|
Authors
|
|
Juan J. Navarro
|
Computer Architecture Department, Universitat Politecnica de Catalunya, Gran Capita s/n, Modul D6, E-08034 Barcelona, Spain
|
|
Toni Juan
|
Computer Architecture Department, Universitat Politecnica de Catalunya, Gran Capita s/n, Modul D6, E-08034 Barcelona, Spain
|
|
Tomás Lang
|
Department of Electrical and Computer Engineering, University of California at Irvine
|
|
| Sponsor |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 2, Downloads (12 Months): 7, Citation Count: 12
|
|
|
ABSTRACT
Multilevel block algorithms exploit the data locality in linear algebra operations when executed in machines with several levels in the memory hierarchy. It is shown that the family we call Multilevel Orthogonal Block (MOB) algorithms is optimal and easy to design and that using the multilevel approach produces significant performance improvements. The effect of interference in the cache, of the TLB misses, and of page faults are also considered. The multilevel block algorithms are evaluated analytically for an ideal memory system with M cache levels without interferences. Moreover, experimental results of the MOB forms in some present high performance workstations are presented.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
Ande92
|
E. Anderson , Z. Bai , C. Bischof , J. Demmel , J. Dongarra , J. Du Croz , A. Greenbaum , S. Hammarling , A. McKenney , S. Ostrouchov , D. Sorensen, LAPACK's user's guide, Society for Industrial and Applied Mathematics, Philadelphia, PA, 1992
|
| |
Aspr93
|
Tom Asprey , Gregory S. Averill , Eric DeLano , Russ Mason , Bill Weiner , Jeff Yetter, Performance Features of the PA7100 Microprocessor, IEEE Micro, v.13 n.3, p.22-35, May 1993
[doi> 10.1109/40.216746]
|
| |
CaKe92
|
|
| |
Chen91
|
|
| |
DoMR91
|
J. J. Dongarra, P. Mayes and G. Radicati, The IBM RISC System/6000 and Linear Algebra Operations. Supercomputer, July 1991, pp. 15-30.
|
| |
DoGK84
|
J. Dongarra, F. Gustavson and A. Karp, Implementing linear algebra algorithms for dense matrices on a vector pipeline machine. SIAM Rev., 26 (1984), pp. 91-112.
|
| |
Dutt92
|
T.A. Dutton et al., The Design of the DEC 3000 AXP Systems, Two High-Perform~.nce Workstations, Digital Technical Journal, Vol 4, Num. 4 1992. pp 66-81
|
| |
GaJM88
|
K. Gallivan, W. Jalby, U. Meier, and A. Sameh, Impact of hierarchical memory systems on linear algebra algorithm design. Intl. J. Supercomputer Appl., 2(1988), pp. 12-48
|
| |
GaPS90
|
|
| |
HePa91
|
|
| |
JaMe86
|
W. Jalby and U. Meier, Optimizing matrix operations on a parallel multiprocessor with a hierarchical memory system, in Proc. Intl. Conf. Par. Processing, IEEE Computer Society Press, New York, 1986, pp. 429-432
|
 |
LaRW91
|
Monica D. Lam , Edward E. Rothberg , Michael E. Wolf, The cache performance and optimizations of blocked algorithms, Proceedings of the fourth international conference on Architectural support for programming languages and operating systems, p.63-74, April 08-11, 1991, Santa Clara, California, United States
|
 |
McCo69
|
|
| |
NaJV93
|
J.J. Navarro, A. Juan, M. Valero~ J.M. Llaberia and T. Lang, Multilevel Orthogonal Blocking for Dense Linear Algebra Computations, IEEE Computer Society TC on Computer Architecture Newsletter, Fall 1993, pp. 10-14
|
| |
OeGr90
|
|
 |
TeGJ93
|
|
| |
Wolf87
|
|
CITED BY 12
|
|
D. Cociorva , J. W. Wilkins , C. Lam , G. Baumgartner , J. Ramanujam , P. Sadayappan, Loop optimization for a class of memory-constrained computations, Proceedings of the 15th international conference on Supercomputing, p.103-113, June 2001, Sorrento, Italy
|
|
|
Kamen Yotov , Xiaoming Li , Gang Ren , Michael Cibulskis , Gerald DeJong , Maria Garzaran , David Padua , Keshav Pingali , Paul Stodghill , Peng Wu, A comparison of empirical and model-driven optimization, ACM SIGPLAN Notices, v.38 n.5, May 2003
|
|
|
Juan J. Navarro , Elena García-Diego , Josep-L. Larriba-Pey , Toni Juan, Block algorithms for sparse matrix computations on high performance workstations, Proceedings of the 10th international conference on Supercomputing, p.301-308, May 25-28, 1996, Philadelphia, Pennsylvania, United States
|
|
|
|
|
|
|
|
|
|
|
|
Juan J. Navarro , Elena García-Diego , José R. Herrero, Data prefetching and multilevel blocking for linear algebra operations, Proceedings of the 10th international conference on Supercomputing, p.109-116, May 25-28, 1996, Philadelphia, Pennsylvania, United States
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Sriram Krishnamoorthy , Umit Catalyurek , Jarek Nieplocha , Atanas Rountev , P. Sadayappan, Data management and query---Hypergraph partitioning for automatic memory hierarchy management, Proceedings of the 2006 ACM/IEEE conference on Supercomputing, November 11-17, 2006, Tampa, Florida
|
|