|
ABSTRACT
Recent research has offered programmers increased options for programming parallel computers by exposing system policies (e.g., memory coherence protocols) or by providing several programming paradigms (e.g. message passing and shared memory) on the same platform. Increased flexibility can lead to higher performance, but it is also a double-edged sword that demands a programmer understand his or her application and system at a more fundamental level. Our system, Tempest, allows a programmer to select or implement communication and memory coherence policies that fit an application's communication patterns. With it, we have achieved substantial performance gains without making major changes in programs. However, the process of selecting, designing, and implementing coherence protocols is difficult and time consuming, without tools to supply detailed information about an application's behavior and interaction with the memory system. StormWatch is a new visualization tool that aids a programmer through four mechanisms: tightly-coupled bidirectionally linked views, interactive filters, animation, and performance slicing. Multiple views present several aspects of program behavior simultaneously and show the same phenomenon from different perspectives. Real-time linking between views enables a programmer to explore levels of abstraction by changing a view and observing the effect on other views. Interactive filters, along with bidirectional linking, can isolate the effects of statements, loops, procedures, or files. StormWatch can also animate a program's dynamic behavior to show the evolution of program execution and communication. Finally, performance slicing captures causality among events. The examples in the paper illustrate how StormWatch helped us substantially improve the performance of two applications.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
David Bailey, John Barton, Thomas Lasinski, and Horst Simon. The NAS Parallel Benchmarks. Technical Report RNR-91-002 Revision 2, Ames Research Center, August 1991.
|
| |
2
|
|
| |
3
|
Barton P. Miller et al. The Paradyn Performance Measurement Tools. Available via anonymous ftp from grilled.cs.wisc.edu:techinical_papers/overview.ps.
|
| |
4
|
Babak Falsafi , Alvin R. Lebeck , Steven K. Reinhardt , Ioannis Schoinas , Mark D. Hill , James R. Larus , Anne Rogers , David A. Wood, Application-specific protocols for user-level shared memory, Proceedings of the 1994 conference on Supercomputing, p.380-389, December 1994, Washington, D.C., United States
|
| |
5
|
GIST User's Manual. Bolt, Beranek, and Newman, 1988.
|
| |
6
|
|
 |
7
|
James R. Larus , Brad Richards , Guhan Viswanathan, LCM: memory system support for parallel language implementation, Proceedings of the sixth international conference on Architectural support for programming languages and operating systems, p.208-218, October 05-07, 1994, San Jose, California, United States
|
| |
17
|
|
| |
18
|
T. Lehr , Z. Segall , D. F. Vrsalovic , E. Caplan , A. L. Chung , C. E. Fineman, Visualizing Performance Debugging, Computer, v.22 n.10, p.38-51, October 1989
[doi> 10.1109/2.42013]
|
| |
19
|
|
 |
20
|
|
 |
21
|
S. K. Reinhardt , J. R. Larus , D. A. Wood, Tempest and typhoon: user-level shared memory, Proceedings of the 21ST annual international symposium on Computer architecture, p.325-336, April 18-21, 1994, Chicago, Illinois, United States
|
| |
23
|
Joel Saltz , Ravi Ponnusamy , Shamik D. Sharma , Bongki Moon , Yuan-Shin Hwang , Mustafa Uysal , Raja Das, A manual for the CHAOS runtime library, University of Maryland at College Park, College Park, MD, 1995
|
| |
24
|
Ioannis Schoinas, Babak Falsafi, Alvin R. Lebeck, Steven K. Reinhardt, James R. Larus, and David A. Wood.
|
CITED BY 6
|
|
Luiz DeRose , K. Ekanadham , Jeffrey K. Hollingsworth , Simone Sbaraglia, SIGMA: a simulator infrastructure to guide memory analysis, Proceedings of the 2002 ACM/IEEE conference on Supercomputing, p.1-13, November 16, 2002, Baltimore, Maryland
|
|
|
Wagner Meira, Jr. , Thomas J. LeBlanc , Virgílio A. F. Almeida, Using cause-effect analysis to understand the performance of distributed programs, Proceedings of the SIGMETRICS symposium on Parallel and distributed tools, p.101-111, August 03-04, 1998, Welches, Oregon, United States
|
|
|
|
|
|
|
|
|
|
|
|
|
|