|
ABSTRACT
In this paper we analyze a model of a parallel processing system. In our model there is a single queue which is K ≥ 1 identical processors. Jobs are assumed to consist of a sequence of barrier synchronizations where, at each step, the number of tasks that must be synchronized is random with a known distribution. An exact analysis of the model is derived. The model leads to a rich set of results characterizing the performance of parallel processing systems. We show that the number of jobs concurrently in execution, as well as the number of synchronization variables, grows linearly with the load of the system and strongly depends on the average number of parallel tasks found in the workload. Properties of expected response time or such systems are extensively analyzed and, in particular, we report on some non-obvious response time behavior that arises as a function of the variance of parallelism found in the workload. Based on exact response time analysis, we propose a simple calculation that can be used as a rule of thumb to predict speedups. This can be viewed as a generalization of Amdahl's law that includes queueing effects. This generalization is reformulated when precise workloads cannot be characterized, but rather when only the fraction or sequential work and the average number of parallel tasks arc assumed to be known.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
G.M. Amdahl, "Validity of the Siilgle Processor Approach to Achieving |,arge Scale Computing Capabilities", A FII'S Conf. Prec., Vol. 30, Thompson, Washington I).C., pp..t83-,185, 1967.
|
| |
3
|
F. Baccelli, A.M. Makowski, A. Shwartz, "The Fork-Join queue related systems with synchronization constraints: Stocllastic ordering at~d computable bounds",to appear in Advances in A pplied Probability.
|
 |
4
|
|
| |
5
|
F. Baccelli and A. Makowskl, "Q,,e,,eing Modcls {or Systcms with Synchronization Constraints", Proceedings of the I EEE, Vol. 77, No. 1, January 1989.
|
| |
6
|
lJ. Brocllard, "Scalability, Granularity and Parallelism of Numerical Algorithms", 1BM Research Report RC 14786, June 89.
|
| |
7
|
F. Darerna-R.ogers, G.F. Pfister and K. So, "Memory Access Patterns of Parallel Scientific Programs", IBM Research Report RC 12086, July 1986.
|
| |
8
|
|
 |
9
|
|
| |
10
|
J. Ilack, "On the Promise of General Purpose Parallel Computing",Parallel Compuling, 10, pp.261-275, 1989.
|
| |
11
|
I,. Jamieson, Purdue University, personal cornnlunication.
|
| |
12
|
|
| |
13
|
{,. Kleinrock. Queueing Systems, Volume {: Theory, John Wiley, 1976.
|
| |
14
|
J.D.C. Little, "A Proof of the Queueing Formula L = A t'V", Operations Research, pp. 383- 387, 1961.
|
| |
15
|
O.M. Lubeck and V. Faber, "Modeling the Performance of l lypercubes: A Case Study using the Particle-ln-Cell Application", Los Alamos National Lab Research Report.
|
| |
16
|
V.K. Naik and M.L. Patrick, "Data Traffic Reduction Schemes for Cholesky Factorization on Asynchronous Multiprocessor Systems", IBM Research Report RC 14500, March 1989.
|
| |
17
|
|
| |
18
|
|
| |
19
|
M. Neuts, "Matrix Geometric Solutions in Stochastic Models", The Johns Hopkins Press, 198i.
|
| |
20
|
G.F. Pfistcr and V.A. Norton, "Ilot Spot Contention and Combining in Multistage Interconnectlon Networks", IEEE Trans. on Computers, Vol. C-34, No 10, pp.943-948, Oct. 1985.
|
 |
21
|
|
| |
22
|
|
| |
23
|
|
| |
24
|
|
CITED BY 12
|
|
|
|
|
|
|
|
|
|
|
Francesco Marinuzzi , Stefano Soliani, LISA: A new symbolic package for the definition, analysis and resolution of Markovian processes: symbolic and inductive techniques, Papers from the international symposium on Symbolic and algebraic computation, p.303-311, July 27-29, 1992, Berkeley, California, United States
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|