ACM Home Page
Please provide us with feedback. Feedback
Parallel simulation of chip-multiprocessor architectures
Full text PdfPdf (519 KB)
Source ACM Transactions on Modeling and Computer Simulation (TOMACS) archive
Volume 12 ,  Issue 3  (July 2002) table of contents
Pages: 176 - 200  
Year of Publication: 2002
ISSN:1049-3301
Authors
Matthew Chidester  Intel Corporation, Hillsboro, OR
Alan George  University of Florida, Gainesville, FL
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 25,   Downloads (12 Months): 113,   Citation Count: 4
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/643114.643116
What is a DOI?

ABSTRACT

Chip-multiprocessor (CMP) architectures present a challenge for efficient simulation, combining the requirements of a detailed microprocessor simulator with that of a tightly-coupled parallel system. In this paper, a distributed simulator for target CMPs is presented based on the Message Passing Interface (MPI) designed to run on a host cluster of workstations. Microbenchmark-based evaluation is used to narrow the parallelization design space concerning the performance impact of distributed vs. centralized target L2 simulation, blocking vs. non-blocking remote cache accesses, null-message vs. barrier techniques for clock synchronization, and network interconnect selection. The best combination is shown to yield speedups of up to 16 on a 9-node cluster of dual-CPU workstations, partially due to cache effects.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
2
3
 
4
 
5
Burger, D. and Austin, T. 1997. The SimpleScalar tool set, version 2.0. Tech. Rep. TR-1342, University of Wisconsin-Madison Computer Sciences Department.
 
6
Chandy, K. and Misra, J. 1979. Distributed simulation: A case study in design and verification of distributed programs. IEEE Trans. Soft. Eng., 5(5) 440--452.
 
7
Chidester, M., George, A., and Radlinski, M. 2001. Multiple-path execution for chip-multiprocessors. Tech. Rep., HCS Research Lab, Department of Electrical and Computer Engineering, University of Florida.
 
8
 
9
 
10
 
11
12
 
13
 
14
 
15
 
16
George, A., Fogarty, R., Markwell, J., and Miars, M. 1999. An Integrated Simulation Environment for parallel and distributed system prototyping. Simulation 75(5), 283--294.
 
17
 
18
 
19
Johnson, D. 2001. HP's Mako processor. Microprocessor Forum 2001.
 
20
Kahle, J. 1999. Power4: A dual-CPU processor chip. Microprocessor Forum 1999.
 
21
22
 
23
MPI Forum 1994. MPI: A Message-Passing Interface Standard. Message-Passing Interface Forum, www.mpi-forum.org.
 
24
25
 
26
Pai, V., Ranganathan, P., and Adve, S. 1997. RSIM Reference Manual version 1.0, Tech. Rep. 9705, Department of Electrical and Computer Engineering, Rice University.
 
27
Price, C. 1995. MIPS IV Instruction Set, Revision 3.1. MIPS Technologies, Inc., Mountain View, CA.
 
28
29
 
30
Scali Computer AS 2000. Scali System Guide version 2.0, white paper. Scali Computer AS, www.scali.com.
 
31
IEEE 1993. Scalable Coherent Interface: ANSI/IEEE Standard 1596-1992. IEEE Service Center, Piscataway, NJ.
 
32
 
33
34


Collaborative Colleagues:
Matthew Chidester: colleagues
Alan George: colleagues