|
ABSTRACT
Optimistically synchronized parallel discrete-event simulation is based on the use of communicating sequential processes. Optimistic synchronization means that the processes proceed under the assumption that a synchronized execution schedule is fortuitous. Periodic checkpointing of the state of a process allows the process to roll back to an earlier state when synchronization errors are detected. This article examines the effects of varying the checkpoint interval on the execution time and memory space needed to perform a parallel simulation.The empirical results presented in this article were obtained from the simulation of closed stochastic queuing networks with several different topologies. Various intraprocessor process-scheduling algorithms and both lazy and aggressive cancellation strategies are considered. The empirical results are compared with analytical formulae predicting time-optimal checkpoint intervals. Two modes of operation, throttling and thrashing, have been noted and their effect examined. As the checkpoint interval is increased from one, there is a throttling effect among processes on the same processor, which improves performance. When the checkpoint interval is made too large, there is a thrashing effect caused by interaction between processes on different processors. It is shown that the time-optimal and space-optimal checkpoint intervals are not the same. Furthermore, a checkpoint interval that is too small affects space adversely more than time, whereas, a checkpoint interval that is too large affects time adversely more than space.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
BELLENOT, S. 1992. State skipping performance with the Time Warp Operating System. In Proceedings of the 1992 Workshop on Parallel and Distrzbuted Simulation (Newport Beach, Calif., Jan. 1992), M. Abrams and P. F. Reynolds Jr., Eds. IEEE, New York, 53-61.
|
 |
2
|
|
| |
3
|
CHANDY, K. M. AND MISRA, J. 1979. Distributed simulation: A case study in design and verification of distributed programs. IEEE Trans. Softw. Eng. 5, 5 (Sept.), 440 452.
|
| |
4
|
CHANDY, K. M., BROWNE, J. C., DtSSLY, C. W., AND UHR~G, W. R. 1975. Analytic models for rollback and recovery strategies in data base systems. IEEE Trans. Softw. Eng. 1, i (Mar.), 100-110.
|
| |
5
|
|
| |
6
|
FUJIMOTO, R.M. 1988. Lookahead in parallel discrete event simulation. In Proceedings of the 1988 Irternatlona{ Conference on Parallel Processing (St Charles, Ill, Aug. 1988). Vol. III, Pennsylvania State Umversity, University Park, 34-41
|
| |
7
|
GAFNI, A. 1985 Space management and cancellation mechanisms for Time Warp. Tech. Rep. TR-85-341, Univ. of Southern California, Los Angeles, Calif.
|
 |
8
|
|
| |
9
|
|
 |
10
|
D. Jefferson , B. Beckman , F. Wieland , L. Blume , M. Diloreto, Time warp operating system, Proceedings of the eleventh ACM Symposium on Operating systems principles, p.77-93, November 08-11, 1987, Austin, Texas, United States
|
 |
11
|
|
| |
12
|
LIN, Y.-B. AND L^ZOWSKA, E.D. 1990a. Processor scheduling for Time Warp parallel simulation. Tech. Rep. 90-03-03, Dept. of Computer Science and Engineering, University of Washington, Seattle.
|
| |
13
|
L~N, Y.-B. AND LAZOWS~, E D. 1990b. Reducing the state saving overhead for Time Warp parallel simulation. Tech. Rep. 90-02-03, Dept. of Computer Science and Engineering, Umverslty of Washington, Seattle
|
 |
14
|
Yi-Bing Lin , Bruno R. Preiss , Wayne M. Loucks , Edward D. Lazowska, Selecting the checkpoint interval in time warp simulation, Proceedings of the seventh workshop on Parallel and distributed simulation, p.3-10, May 16-19, 1993, San Diego, California, United States
|
| |
15
|
LOUCKS, W. M. AND PREISS, B. R 1990. The role of knowledge m distributed smmlation. In Proceedtngs of SCS Multlconference on Dlstrzbuted Sirnulatwn (San Diego, Calif., Jan. 1990). SCS, San Diego, Calif., 9 16.
|
| |
16
|
|
 |
17
|
|
 |
18
|
|
| |
19
|
|
| |
20
|
PRE~SS, B. R. 1990. Performance of discrete event simulation on a multiprocessor using optlmlstm and conservai;ive synchronization. In Proceedhzgs of the 1990 Internatzonal Conference on Parallel Processing (St. Charles, Ill, Aug 1990). Pennsylvania State University, University Park, 218 222.
|
| |
21
|
PREISS, B.R. 1989a. The Yaddes distributed discrete event slmulatmn specification language and executmn enwronments. CCNG Tech Rep. E-181, Dept. of Electrical Engineering- and Computer Communications Networks Group, Univ. of Waterloo, Waterloo, Ontario
|
| |
22
|
PREISS, B.R. 1989b The Yaddes distributed discrete event mmulatmn specification language and execution environments, In Proceedings of the SCS Mult~conferetwe on Dlstrtbuted Szmulatlon (Tampa, Fla., Mar. 1989) SCS, San Diego, Calif., 139-144.
|
| |
23
|
PR~TSS, B R AND LouCKS, W. M. 1990 The in, pact o~ lookahead on the performance of conservative dmtributed simulation. In Proceedings of the 1990 European Multzconferencc--Stmulat~on Methodologies, Languages and Archztectures (Nuremberg, Germany, June 1990). SCS, San Diego, Cahf., 204-209
|
| |
24
|
PREISS, B. R. AND LOUCKS, W.M. 1989. Predictmn and lookahead m distributed simulation. CCNG Tech Rep E-191, Dept. of Electrical Engineenng and Computer Commumcatlons Networks Group, Univ of Waterloo, Waterloo, Ontario.
|
| |
25
|
P~E~SS, B. R. AND MAClNTY~E, I.D. 1990. YADDES--Yet Another Distributed Discrete Event Simulator: User manual CCNG Tech. Rep. E-197, Dept. of Electrical and Computer Engineering and Computer Communications Networks Group, Univ of Waterloo, Waterloo, Ontario.
|
| |
26
|
PREISS, B. R., LOUCKS, W. M., AND MAClNTYRE, I.D. 1993. Effects of the checkpoint interval on time and space in Time Warp. CCNG Tech. Rep. E-230, Dept. of Electrical and Computer Engineering and Computer Communications Networks Group, Univ. of Waterloo, Waterloo, Ontario.
|
| |
27
|
PREISS, B. R., MACINTYRE, I. D., AND LOUCKS, W.M. 1992. On the trade-off between time and space in optimistic parallel discrete-event simulation. In Proceedings of the 1992 Workshop on Parallel and Distributed S~mulat~on (Newport Beach, Calif., Jan. 1992). IEEE, New York, 33-42.
|
 |
28
|
|
| |
29
|
|
| |
30
|
|
CITED BY 25
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Fabian Gomes , Brian Unger , John Cleary , Steve Franks, Multiplexed state saving for bounded rollback, Proceedings of the 29th conference on Winter simulation, p.460-467, December 07-10, 1997, Atlanta, Georgia, United States
|
|
|
Fabian Gomes , Brian Unger , John Cleary, Language based state saving extensions for optimistic parallel simulation, Proceedings of the 28th conference on Winter simulation, p.794-800, December 08-11, 1996, Coronado, California, United States
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
REVIEW
"Timo Olavi Alanko : Reviewer"
Optimistically synchronized parallel discrete-event simulation is
based on the use of communicating sequential processes. When
synchronization errors are detected, processes are rolled back to an
earlier state that has been checkpointed. The p
more...
|