| Performance debugging shared memory multiprocessor programs with MTOOL |
| Full text |
Pdf
(1.10 MB)
|
| Source
|
Conference on High Performance Networking and Computing
archive
Proceedings of the 1991 ACM/IEEE conference on Supercomputing
table of contents
Albuquerque, New Mexico, United States
Pages: 481 - 490
Year of Publication: 1991
ISBN:0-89791-459-7
|
|
Authors
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 2, Downloads (12 Months): 27, Citation Count: 13
|
|
|
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
|
| |
2
|
D. Bernstein, A. Bolmarchich, and K. So. Performance visualization of parallel programs on a shared memory multiprocessor. In Proceedings of ICPP, vol. iI, pages 1-10, Aug. 1989.
|
| |
3
|
|
| |
4
|
J. J. Dongarra , Orlie Brewer , James Arthur Kohl , Samuel Fineberg, A tool to aid in the design, implementation, and understanding of matrix algorithms for parallel processors, Journal of Parallel and Distributed Computing, v.9 n.2, p.185-202, June 1990
[doi> 10.1016/0743-7315(90)90045-Q]
|
| |
5
|
A. Goldberg and J. Hennessy. MTOOL: A method for detecting memory bottlenecks. Technical Note WRL TN-17/90, DEC Western Research Laboratory, 1990.
|
| |
6
|
A. Goldberg and J. Hennessy. Minimum cost basic block counting. Technical Report In Preparation, Stanford Computer Systems Laboratory, 1991.
|
| |
7
|
A. Goldberg and J. Hennessy. MTOOL: A method for isolating memory bottlenecks in shared memory multiprocessor programs. In Proceedings of ICPP, vol. II, Aug. 1991.
|
| |
8
|
|
| |
9
|
|
 |
10
|
|
| |
11
|
B. P. Miller , M. Clark , J. Hollingsworth , S. Kierstead , S. S. Lim , T. Torzewski, IPS-2: The Second Generation of a Parallel Program Measurement System, IEEE Transactions on Parallel and Distributed Systems, v.1 n.2, p.206-217, April 1990
[doi> 10.1109/71.80132]
|
| |
12
|
|
| |
13
|
|
| |
14
|
J.H. Saltz, R. Mirchandaney, and D. Baxter. Runtime parallelization and scheduling of loops. Technical Report ICASE 88-70, ICASE, NASA Langley Research Center, 1988.
|
 |
15
|
|
CITED BY 14
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Hassan Chafi , Chi Cao Minh , Austen McDonald , Brian D. Carlstrom , JaeWoong Chung , Lance Hammond , Christos Kozyrakis , Kunle Olukotun, TAPE: a transactional application profiling environment, Proceedings of the 19th annual international conference on Supercomputing, June 20-22, 2005, Cambridge, Massachusetts
|
|
|
|
|
|
Marco Zagha , Brond Larson , Steve Turner , Marty Itzkowitz, Performance analysis using the MIPS R10000 performance counters, Proceedings of the 1996 ACM/IEEE conference on Supercomputing (CDROM), p.16-es, January 01-01, 1996, Pittsburgh, Pennsylvania, United States
|
|
|
|
|
|
|
|
|
|
|