| An integrated compilation and performance analysis environment for data parallel programs |
| Full text |
Html
(4 KB),
Pdf
(324 KB)
|
| Source
|
Conference on High Performance Networking and Computing
archive
Proceedings of the 1995 ACM/IEEE conference on Supercomputing (CDROM)
table of contents
San Diego, California, United States
Article No. 50
Year of Publication: 1995
ISBN:0-89791-816-9
|
|
Authors
|
|
Vikram S. Adve
|
Center for Research on Parallel Computation, Rice University, Houston, Texas
|
|
John Mellor-Crummey
|
Center for Research on Parallel Computation, Rice University, Houston, Texas
|
|
Mark Anderson
|
Center for Research on Parallel Computation, Rice University, Houston, Texas
|
|
Jhy-Chun Wang
|
Department of Computer Science, University of Illinois, Urbana, Illinois
|
|
Daniel A. Reed
|
Department of Computer Science, University of Illinois, Urbana, Illinois
|
|
Ken Kennedy
|
Center for Research on Parallel Computation, Rice University, Houston, Texas
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 4, Downloads (12 Months): 20, Citation Count: 29
|
|
|
ABSTRACT
Supporting source-level performance analysis of programs written in data-parallel languages requires a unique degree of integration between compilers and performance analysis tools. Compilers for languages such as High Performance Fortran infer parallelism and communication from data distribution directives, thus, performance tools cannot meaningfully relate measurements about these key aspects of execution performance to source-level constructs without substantial compiler support. This paper describes an integrated system for performance analysis of data-parallel programs based on the Rice Fortran 77D compiler and the Illinois Pablo performance analysis toolkit. During code generation, the Fortran D compiler records mapping information and semantic analysis results describing the relationship between performance instrumentation and the original source program. An integrated performance analysis system based on the Pablo toolkit uses this information to correlate the program's dynamic behavior with the data parallel source code. The integrated system provides detailed source-level performance feedback to programmers via a pair of graphical interfaces. Our strategy serves as a model for integration of data-parallel compilers and performance tools.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
Applied Parallel Research. Forge 90 Distributed Memory Parallelizer: User's Guide, version 8.0 ed. Placerville, CA, 1992.
|
| |
2
|
Aydt, R. A. SDDF: The Pablo Self-Describing Data Format. Tech. rep., Department of Computer Science, University of Illinois, Apr. 1994.
|
 |
3
|
|
| |
4
|
Irvin, R. B., and Miller, B. P. A Performance Tool for High-Level Parallel Programming Languages. In Programming Environments for Massively Parallel Distributed Systems (Basel, Switzerland, 1994), Birkhauser Verlag.
|
| |
5
|
Charles H. Koelbel , David B. Loveman , Robert S. Schreiber , Guy L. Steele, Jr. , Mary E. Zosel, The high performance Fortran handbook, MIT Press, Cambridge, MA, 1994
|
| |
6
|
Mellor-Crummey, J. M., Adve, V. S., and Koelbel, C. The Compiler's Role in Analysis and Tuning of Data-Parallel Programs. In Proceedings of The Second Workshop on Environments and Tools for Parallel Scientific Computing (Townsend, TN, May 1994), pp. 211-220. Also available via anonymous ftp from softlib.cs.rice.edu in pub/CRPC-TRs/reports/CRPC-TR94405.ps.
|
| |
7
|
B. P. Miller , M. Clark , J. Hollingsworth , S. Kierstead , S. S. Lim , T. Torzewski, IPS-2: The Second Generation of a Parallel Program Measurement System, IEEE Transactions on Parallel and Distributed Systems, v.1 n.2, p.206-217, April 1990
[doi> 10.1109/71.80132]
|
| |
8
|
Pase, D. Personal communication, Aug. 1995.
|
| |
9
|
|
| |
10
|
|
| |
11
|
Reed, D. A., Aydt, R. A., Noe, R. J., Roth, P. C., Shields, K. A., Schwartz, B. W., and Tavera, L. F. Scalable Performance Analysis: The Pablo Performance Analysis Environment. In Proceedings of the Scalable Parallel Libraries Conference, A. Skjellum, Ed. IEEE Computer Society, 1993, pp. 104-113.
|
 |
12
|
B. Ries , R. Anderson , W. Auld , D. Breazeal , K. Callaghan , E. Richards , W. Smith, The paragon performance monitoring environment, Proceedings of the 1993 ACM/IEEE conference on Supercomputing, p.850-859, December 1993, Portland, Oregon, United States
[doi> 10.1145/169627.169851]
|
| |
13
|
TMC. Prism User's Guide, V1.2. Thinking Machines Corporation, Cambridge, Massachusetts, Mar. 1993.
|
| |
14
|
Williams, W., Hoel, T., and Pase, D. The MPP Apprentice Performance Tool: Delivering the Performance of the Cray T3D. In Programming Environments for Massively Parallel Distributed Systems (Basel, Switzerland, 1994), Birkhauser Verlag.
|
CITED BY 29
|
|
|
|
|
Parthasarathy Ranganathan , Vijay S. Pai , Sarita V. Adve, Using speculative retirement and larger instruction windows to narrow the performance gap between memory consistency models, Proceedings of the ninth annual ACM symposium on Parallel algorithms and architectures, p.199-210, June 23-25, 1997, Newport, Rhode Island, United States
|
|
|
Shih-Wei Liao , Amer Diwan , Robert P. Bosch, Jr. , Anwar Ghuloum , Monica S. Lam, SUIF Explorer: an interactive and interprocedural parallelizer, ACM SIGPLAN Notices, v.34 n.8, p.37-48, Aug. 1999
|
|
|
|
|
|
|
|
|
Insung Park , Nirav H. Kapadia , Renato J. Figueiredo , Rudolf Eigenmann , José A. B. Fortes, Towards an integrated, web-executable parallel programming tool environment, Proceedings of the 2000 ACM/IEEE conference on Supercomputing (CDROM), p.9-es, November 04-10, 2000, Dallas, Texas, United States
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Jack Dongarra , Ian Foster , Geoffrey Fox , William Gropp , Ken Kennedy , Linda Torczon , Andy White, References, Sourcebook of parallel computing, Morgan Kaufmann Publishers Inc., San Francisco, CA, 2003
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Bradford L. Chamberlain , Sung-Eun Choi , E. Christopher Lewis , Calvin Lin , Lawrence Snyder , W. Derrick Weathersby, ZPL: A Machine Independent Programming Language for Parallel Computers, IEEE Transactions on Software Engineering, v.26 n.3, p.197-211, March 2000
|
|
|
Michael Frumkin , Michelle Hribar , Haoqiang Jin , Abdul Waheed , Jerry Yan, A comparison of automatic parallelization tools/compilers on the SGI origin 2000, Proceedings of the 1998 ACM/IEEE conference on Supercomputing (CDROM), p.1-22, November 07-13, 1998, San Jose, CA
|
|
|
James E. Lumpp , Kuppuswamy Sivakumar , Christopher Diaz , James N. Griffioen, Performance visualization for distributed shared memory systems, Virtual shared memory for distributed architectures, Nova Science Publishers, Inc., Commack, NY, 2001
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|