| Techniques for efficient inline tracing on a shared-memory multiprocessor |
| Full text |
Pdf
(1.12 MB)
|
| Source
|
Joint International Conference on Measurement and Modeling of Computer Systems
archive
Proceedings of the 1990 ACM SIGMETRICS conference on Measurement and modeling of computer systems
table of contents
Univ. of Colorado, Boulder, Colorado, United States
Pages: 37 - 47
Year of Publication: 1990
ISBN:0-89791-359-0
Also published in ...
|
|
Authors
|
|
S. J. Eggers
|
Department of Computer Science and Engineering, University of Washington, Seattle, WA
|
|
David R. Keppel
|
Department of Computer Science and Engineering, University of Washington, Seattle, WA
|
|
Eric J. Koldinger
|
Department of Computer Science and Engineering, University of Washington, Seattle, WA
|
|
Henry M. Levy
|
Department of Computer Science and Engineering, University of Washington, Seattle, WA
|
|
| Sponsor |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 5, Downloads (12 Months): 17, Citation Count: 39
|
|
|
ABSTRACT
While much current research concerns multiprocessor design, few traces of parallel programs are available for analyzing the effect of design trade-offs. Existing trace collection methods have serious drawbacks: trap-driven methods often slow down program execution by more than 1000 times, significantly perturbing program behavior; microcode modification is faster, but the technique is neither general nor portable.
This paper describes a new tool, called MPTRACE, for collecting traces of multithreaded parallel programs executing on shared-memory multiprocessors. MPTRACE requires no hardware or microcode modification; it collects complete program traces; it is portable; and it reduces execution-time dilation to less than a factor 3. MPTRACE is based on inline tracing, in which a program is automatically modified to produce trace information as it executes. We show how the use of compiler flow analysis techniques can reduce the amount of data collected and therefore the runtime dilation of the traced program. We also discuss problematic issues concerning buffering and writing of trace data on a multiprocessor.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
Agarwal et al. 86
|
|
| |
Bershad et al. 88
|
|
| |
Borg et al. 89
|
A. Borg, R. Kessler, G. Lazana, and D. W. Wall. Long address traces from RISC machines" Generation and analysis. Technical Report 89/14, Digtal Equipment Corporation Western Research Laboratory, P alo Alto, CA, September 1989.
|
| |
Devadas & Newton 87
|
S. Devadas and A. Newton. Topological optimization of multiple level array logic. IEEE Transactions on Computer-Aided Design, November 1987.
|
 |
Eggers & Katz 89
|
|
| |
Hill 87
|
|
| |
Lovett & Thakkar 88
|
R. Lovett and S. Thakkar. The Symmetry multiprocessory system. In Proceedings of the 1988 international Conference on Parallel Processing, pages 303-310, August 1988.
|
 |
Ma et al. 87
|
H.-K. T. Ma , S. Devadas , A. Sangiovanni-Vincentelli , R. Wei, Logic verification algorithms and their parallel implementation, Proceedings of the 24th ACM/IEEE conference on Design automation, p.283-290, June 28-July 01, 1987, Miami Beach, Florida, United States
[doi> 10.1145/37888.37931]
|
 |
Mellor-Crummey & LeBlanc 89
|
|
| |
MIPS 86
|
MIPS. Languages and Programmer's Manual. MIPS Computer Systems, Inc., 1986.
|
 |
Przybylski et al. 88
|
S. Prybylski , M. Horowitz , J. Hennessy, Performance tradeoffs in cache design, Proceedings of the 15th Annual International Symposium on Computer architecture, p.290-298, May 30-June 02, 1988, Honolulu, Hawaii, United States
|
| |
Shustek 78
|
|
 |
Sites & Agarwal 88
|
|
 |
Smith 82
|
|
 |
Stunkel & Fuchs 89
|
|
 |
Wiecek 82
|
|
CITED BY 39
|
|
|
|
|
Lionel Brunie , Laurent Lefèvre , Olivier Reymann, Execution analysis of DSM applications: a distributed and scalable approach, Proceedings of the SIGMETRICS symposium on Parallel and distributed tools, p.51-60, May 22-23, 1996, Philadelphia, Pennsylvania, United States
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Ted Romer , Geoff Voelker , Dennis Lee , Alec Wolman , Wayne Wong , Hank Levy , Brian Bershad , Brad Chen, Instrumentation and optimization of Win32/intel executables using Etch, Proceedings of the USENIX Windows NT Workshop on The USENIX Windows NT Workshop 1997, p.1-1, August 11-13, 1997, Seattle, Washington
|
|
|
|
|
|
|
|
|
|
|