ACM Home Page
Please provide us with feedback. Feedback
Adaptive reduction parallelization techniques
Full text PdfPdf (1.29 MB)
Source International Conference on Supercomputing archive
Proceedings of the 14th international conference on Supercomputing table of contents
Santa Fe, New Mexico, United States
Pages: 66 - 77  
Year of Publication: 2000
ISBN:1-58113-270-0
Authors
Hao Yu  Dept. of Computer Science, Texas A&M University, College Station, TX
Lawrence Rauchwerger  Dept. of Computer Science, Texas A&M University, College Station, TX
Sponsor
SIGARCH: ACM Special Interest Group on Computer Architecture
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 3,   Downloads (12 Months): 35,   Citation Count: 11
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues   peer to peer  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/335231.335238
What is a DOI?

ABSTRACT

In this paper, we propose to adapt parallelizing transformations, more specifically, reduction parallelizations, to the actual reference pattern executed by a loop, i.e., to the particular input data and dynamic phase of a program. More precisely we will show how, after validating a reduction at run-time (when this is not possible at compile time) we can dynamically characterize its reference pattern and choose the most appropriate method for parallelizing it. For this purpose, we develop a library of parallel reduction algorithms, including both previously known and novel schemes, which includes algorithms specialized for different classes of access behavior. In particular, each algorithm in our library has identified strengths related to specific reference pattern characteristics, which are matched, at run-time, with measured characteristics of the actual reference pattern. The matching of algorithm to reference pattern is performed using a decision-tree based selection scheme. The contribution of this work consists in new optimizations for reduction parallelization and in the introduction of a new approach to the optimization of irregular applications: Characteristic based customization.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Charmm: A program for macromolecular energy, minimization, and dynamics calculations. Journal of Computational Chemistry, ,4(6), 1983.
 
2
 
3
 
4
 
5
C. Kruskal. Efficient parallel algorithms for graph problems. In Proceedings of the 1986 International Conference on Parallel Processing, pages 869--876, August 1986.
 
6
 
7
M.J. Frisch, et. al. Gaussian 94, Revision B.1. Gaussian, Inc., Pittsburgh PA, 1995.
 
8
Laurence NageL SPICE2: A Computer Program to Simulate Semiconductor Circuits. PhD thesis, University of California, May 1975.
 
9
 
10
11
 
12
Robert G. Whirley and Bruce Engelmann. DYNA3D: A Nonlinear, Explicit, Three-Dimensional Finite Element Code For Solid and Structural Mechanics. Lawrence Livermore National Laboratory, Ngvember, 1993.
 
13
J. Wu, J. Saltz, S. Hiranandani, and H. Berryman. Runtime compilation methods for multicomputers. In Dr. H.D. Schwetman, editor, Proceedings of the 1991 International Conference on Parallel Processing, pages 26--30. CRC Press, Inc., 1991. Vol. II - Software,
 
14
Hao Yu and L. Rauchwerger. Run-time parallelization overhead reduction techniques. In Proc. of the 9th International Conference on Compiler Construction (CC2000), Berlin, Germany. Lecture Notes in Computer Selene, Springer-Vedag, 2000.
15
 
16
 
17

CITED BY  11
 
 
 
 
 
 
 

Collaborative Colleagues:
Hao Yu: colleagues
Lawrence Rauchwerger: colleagues

Peer to Peer - Readers of this Article have also read: