Effects of the checkpoint interval on time and space in Time Warp
Source: ACM Transactions on Modeling and Computer Simulation (TOMACS)
Volume 4, Issue 3 (July 1994)
Pages: 223-253
Year of Publication: 1994
ISSN: 1049-3301
Authors
Bruno R. Preiss  Univ. of Waterloo, Waterloo, Ont., Canada
Wayne M. Loucks  Univ. of Waterloo, Waterloo, Ont., Canada
Ian D. Macintyre  Univ. of Waterloo, Waterloo, Ont., Canada
Publisher
ACM  New York, NY, USA
DOI: http://doi.acm.org/10.1145/189443.189444

ABSTRACT

Optimistically synchronized parallel discrete-event simulation is based on the use of communicating sequential processes. Optimistic synchronization means that the processes proceed under the assumption that a synchronized execution schedule is fortuitous. Periodic checkpointing of the state of a process allows the process to roll back to an earlier state when synchronization errors are detected. This article examines the effects of varying the checkpoint interval on the execution time and memory space needed to perform a parallel simulation.

The empirical results presented in this article were obtained from the simulation of closed stochastic queuing networks with several different topologies. Various intraprocessor process-scheduling algorithms and both lazy and aggressive cancellation strategies are considered. The empirical results are compared with analytical formulae predicting time-optimal checkpoint intervals. Two modes of operation, throttling and thrashing, have been noted and their effects examined. As the checkpoint interval is increased from one, there is a throttling effect among processes on the same processor, which improves performance. When the checkpoint interval is made too large, there is a thrashing effect caused by interaction between processes on different processors. It is shown that the time-optimal and space-optimal checkpoint intervals are not the same. Furthermore, a checkpoint interval that is too small affects space adversely more than time, whereas a checkpoint interval that is too large affects time adversely more than space.
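The role of the checkpoint interval can be illustrated with a minimal sketch. It is not the authors' simulator: the class `LogicalProcess`, its fields, and the toy state (a running sum of event values) are all assumptions made for illustration. The process saves its state after every `interval` events; on rollback it restores the latest usable checkpoint and re-executes ("coasts forward" over) the still-valid events that followed it.

```python
from dataclasses import dataclass, field


@dataclass
class LogicalProcess:
    """Hypothetical toy process; its state is a running sum of event values."""
    interval: int                                     # save state every `interval` events
    state: int = 0
    processed: list = field(default_factory=list)     # (timestamp, value), in timestamp order
    checkpoints: list = field(default_factory=list)   # (events_done, saved_state)

    def __post_init__(self):
        # Keep the initial state so a rollback can always reach the start.
        self.checkpoints.append((0, self.state))

    def execute(self, timestamp, value):
        """Optimistically process one event and checkpoint periodically."""
        self.state += value
        self.processed.append((timestamp, value))
        if len(self.processed) % self.interval == 0:
            self.checkpoints.append((len(self.processed), self.state))

    def rollback(self, straggler_time):
        """Undo every event with timestamp >= straggler_time.

        Returns (undone_events, coast_forward_count).
        """
        # Events strictly earlier than the straggler remain valid.
        keep = sum(1 for ts, _ in self.processed if ts < straggler_time)
        undone = self.processed[keep:]
        # Discard checkpoints taken after the rollback point.
        while self.checkpoints[-1][0] > keep:
            self.checkpoints.pop()
        done, saved = self.checkpoints[-1]
        # Coast forward: replay valid events that follow the restored checkpoint.
        self.state = saved
        for _, value in self.processed[done:keep]:
            self.state += value
        self.processed = self.processed[:keep]
        return undone, keep - done


lp = LogicalProcess(interval=3)
for t in range(1, 8):
    lp.execute(t, t)
undone, coasted = lp.rollback(straggler_time=5)
```

In this sketch an interval of 1 stores one checkpoint per event and never coasts forward, while a larger interval stores fewer states but replays more events on each rollback, which is the time/space tension the abstract describes.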





REVIEW

Reviewer: Timo Olavi Alanko

Optimistically synchronized parallel discrete-event simulation is based on the use of communicating sequential processes. When synchronization errors are detected, processes are rolled back to an earlier state that has been checkpointed. The p ...
