ACM Home Page
Please provide us with feedback. Feedback
A performance evaluation of a general parallel processing model
Full text PdfPdf (1.14 MB)
Source Joint International Conference on Measurement and Modeling of Computer Systems archive
Proceedings of the 1990 ACM SIGMETRICS conference on Measurement and modeling of computer systems table of contents
Univ. of Colorado, Boulder, Colorado, United States
Pages: 13 - 26  
Year of Publication: 1990
ISBN:0-89791-359-0
Also published in ...
Author
Randolph Nelson  IBM Research Division, T.J. Watson Research Center, P.O. Box 704, Room H2-A18, Yorktown Heights, NY
Sponsor
SIGMETRICS: ACM Special Interest Group on Measurement and Evaluation
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 5,   Downloads (12 Months): 16,   Citation Count: 12
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/98457.98495
What is a DOI?

ABSTRACT

In this paper we analyze a model of a parallel processing system. In our model there is a single queue which is K ≥ 1 identical processors. Jobs are assumed to consist of a sequence of barrier synchronizations where, at each step, the number of tasks that must be synchronized is random with a known distribution. An exact analysis of the model is derived. The model leads to a rich set of results characterizing the performance of parallel processing systems. We show that the number of jobs concurrently in execution, as well as the number of synchronization variables, grows linearly with the load of the system and strongly depends on the average number of parallel tasks found in the workload. Properties of expected response time or such systems are extensively analyzed and, in particular, we report on some non-obvious response time behavior that arises as a function of the variance of parallelism found in the workload. Based on exact response time analysis, we propose a simple calculation that can be used as a rule of thumb to predict speedups. This can be viewed as a generalization of Amdahl's law that includes queueing effects. This generalization is reformulated when precise workloads cannot be characterized, but rather when only the fraction or sequential work and the average number of parallel tasks arc assumed to be known.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
G.M. Amdahl, "Validity of the Siilgle Processor Approach to Achieving |,arge Scale Computing Capabilities", A FII'S Conf. Prec., Vol. 30, Thompson, Washington I).C., pp..t83-,185, 1967.
 
3
F. Baccelli, A.M. Makowski, A. Shwartz, "The Fork-Join queue related systems with synchronization constraints: Stocllastic ordering at~d computable bounds",to appear in Advances in A pplied Probability.
4
 
5
F. Baccelli and A. Makowskl, "Q,,e,,eing Modcls {or Systcms with Synchronization Constraints", Proceedings of the I EEE, Vol. 77, No. 1, January 1989.
 
6
lJ. Brocllard, "Scalability, Granularity and Parallelism of Numerical Algorithms", 1BM Research Report RC 14786, June 89.
 
7
F. Darerna-R.ogers, G.F. Pfister and K. So, "Memory Access Patterns of Parallel Scientific Programs", IBM Research Report RC 12086, July 1986.
 
8
9
 
10
J. Ilack, "On the Promise of General Purpose Parallel Computing",Parallel Compuling, 10, pp.261-275, 1989.
 
11
I,. Jamieson, Purdue University, personal cornnlunication.
 
12
 
13
{,. Kleinrock. Queueing Systems, Volume {: Theory, John Wiley, 1976.
 
14
J.D.C. Little, "A Proof of the Queueing Formula L = A t'V", Operations Research, pp. 383- 387, 1961.
 
15
O.M. Lubeck and V. Faber, "Modeling the Performance of l lypercubes: A Case Study using the Particle-ln-Cell Application", Los Alamos National Lab Research Report.
 
16
V.K. Naik and M.L. Patrick, "Data Traffic Reduction Schemes for Cholesky Factorization on Asynchronous Multiprocessor Systems", IBM Research Report RC 14500, March 1989.
 
17
 
18
 
19
M. Neuts, "Matrix Geometric Solutions in Stochastic Models", The Johns Hopkins Press, 198i.
 
20
G.F. Pfistcr and V.A. Norton, "Ilot Spot Contention and Combining in Multistage Interconnectlon Networks", IEEE Trans. on Computers, Vol. C-34, No 10, pp.943-948, Oct. 1985.
21
 
22
 
23
 
24

CITED BY  12