Supporting reverse execution for parallel programs
Source Workshop on Parallel & Distributed Debugging archive
Proceedings of the 1988 ACM SIGPLAN and SIGOPS workshop on Parallel and distributed debugging table of contents
Madison, Wisconsin, United States
Pages: 124 - 129  
Year of Publication: 1988
ISBN:0-89791-296-9
Authors
Douglas Z. Pan  Stanford Univ., Stanford, CA
Mark A. Linton  Stanford Univ., Stanford, CA
Sponsors
SIGPLAN: ACM Special Interest Group on Programming Languages
SIGOPS: ACM Special Interest Group on Operating Systems
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 1,   Downloads (12 Months): 32,   Citation Count: 25
DOI: http://doi.acm.org/10.1145/68210.69227

ABSTRACT

Parallel programs are difficult to debug because they run for a long time and two executions may yield different results. Reverse execution is a simple and powerful concept that solves both of these problems. We are designing a tool for debugging parallel programs, called Recap, that provides the illusion of reverse execution using checkpoints and event recording and playback. During normal execution, Recap logs the results of system calls and shared memory reads, as well as the times at which asynchronous events (signals) occur. Recap periodically checkpoints the state of a process by forking and suspending a new process. To reverse execute to a certain point in time, Recap continues the nearest checkpoint process forward in a self-contained environment, simulating all events using the log. We are implementing Recap as part of a larger environment for parallel program development.
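
The checkpoint-and-replay idea in the abstract can be illustrated with a small sketch. This is not Recap's implementation: it assumes a single-threaded toy program, substitutes an in-memory deep copy of the state for the forked, suspended checkpoint process, and stands in a random number for a system call result. All names (EventLog, step, run_with_checkpoints, reverse_to) are hypothetical and chosen only for illustration.

```python
import copy
import random


class EventLog:
    """Records results of nondeterministic calls, or plays them back (hypothetical helper)."""

    def __init__(self, entries=None, cursor=0, replaying=False):
        self.entries = [] if entries is None else entries
        self.cursor = cursor        # index of the next event
        self.replaying = replaying

    def call(self, fn):
        if self.replaying:
            # Playback: simulate the event from the log instead of running it.
            result = self.entries[self.cursor]
        else:
            # Normal execution: perform the call and log its result.
            result = fn()
            self.entries.append(result)
        self.cursor += 1
        return result


def step(state, log):
    """One unit of program work that depends on an external (nondeterministic) value."""
    value = log.call(lambda: random.randint(1, 100))  # stand-in for a system call
    state["total"] += value
    state["step"] += 1


def run_with_checkpoints(n_steps, interval):
    """Run forward, snapshotting (state, log position) every `interval` steps."""
    log = EventLog()
    state = {"step": 0, "total": 0}
    checkpoints = []
    history = []  # totals after each step, kept only to check the replay
    for _ in range(n_steps):
        if state["step"] % interval == 0:
            # Assumption: a deep copy replaces the forked checkpoint process.
            checkpoints.append((copy.deepcopy(state), log.cursor))
        step(state, log)
        history.append(state["total"])
    return log, checkpoints, history


def reverse_to(target_step, log, checkpoints):
    """Rebuild the state at `target_step` by replaying from the nearest earlier checkpoint."""
    snapshot, cursor = max(
        (c for c in checkpoints if c[0]["step"] <= target_step),
        key=lambda c: c[0]["step"],
    )
    state = copy.deepcopy(snapshot)
    replay = EventLog(entries=log.entries, cursor=cursor, replaying=True)
    while state["step"] < target_step:
        step(state, replay)
    return state
```

Calling run_with_checkpoints(10, 4) followed by reverse_to(6, log, checkpoints) restarts from the snapshot taken at step 4 and replays two logged events, so the reconstructed state matches what the original run held after step 6 even though the original values were random.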

