| Compiler and hardware support for cache coherence in large-scale multiprocessors: design considerations and performance study |
| Full text |
Pdf
(1.48 MB)
|
| Source
|
International Symposium on Computer Architecture
archive
Proceedings of the 23rd annual international symposium on Computer architecture
table of contents
Philadelphia, Pennsylvania, United States
Pages: 283 - 294
Year of Publication: 1996
ISBN:0-89791-786-3
Also published in ...
|
|
Authors
|
|
Lynn Choi
|
University of Illinois at Urbana-Champaign, Center for Supercomputing R & D, Urbana, IL
|
|
Pen-Chung Yew
|
University of Minnesota, Department of Computer Science, Minneapolis, MN
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 17, Downloads (12 Months): 37, Citation Count: 3
|
|
|
ABSTRACT
In this paper, we study a hardware-supported, compiler directed (HSCD) cache coherence scheme, which can be implemented on a large-scale multiprocessor using off-the-shelf microprocessors, such as the Cray T3D. It can be adapted to various cache organizations, including multi-word cache lines and byte-addressable architectures. Several system related issues, including critical sections, inter-thread communication, and task migration have also been addressed. The cost of the required hardware support is small and proportional to the cache size. The necessary compiler algorithms, including intra- and interprocedural array data-flow analysis, have been implemented on the Polaris compiler [17].From our simulation study using the Perfect Club benchmarks, we found that, in spite of the conservative analysis made by the compiler, the performance of the proposed HSCD scheme can be comparable to that of a full-map hardware directory scheme. With its comparable performance and reduced hardware cost, the scheme can be a viable alternative for large-scale multiprocessors, such as the Cray T3D, that rely on users to maintain data coherence.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
Sarita V. Adve , Vikram S. Adve , Mark D. Hill , Mary K. Vernon, Comparison of hardware and software cache coherence schemes, Proceedings of the 18th annual international symposium on Computer architecture, p.298-308, May 27-30, 1991, Toronto, Ontario, Canada
|
 |
2
|
Karl J. Ottenstein , Robert A. Ballance , Arthur B. MacCabe, The program dependence web: a representation supporting control-, data-, and demand-driven interpretation of imperative languages, Proceedings of the ACM SIGPLAN 1990 conference on Programming language design and implementation, p.257-271, June 1990, White Plains, New York, United States
|
| |
3
|
L. M. Censier and P. Feautrier. A New Solution to Coherence Problems in Multicache Systems. IEEE Transactions on Computers, C-27(12):1112-1118, December, 1978.
|
| |
4
|
Yung-Chin Chen. Cache Design and Performance in a Large-Scale Shared-Memory Multiprocessor System. Technical report, Univ. of Illinois, Dept. of Elec. Eng., 1993. Ph.D. Thesis.
|
 |
5
|
|
| |
6
|
|
| |
7
|
T. Chiueh. A Generational Approach to Software- Controlled Multiprocessor Cache Coherence. Proceedings of the 1993 International Conference on Parallel Processing, August 1993.
|
| |
8
|
|
| |
9
|
|
| |
10
|
Digital Equipment Corp. Alpha 21164 Microprocessor: Hardware Reference Manual. 1994.
|
 |
11
|
|
 |
12
|
|
| |
13
|
C. P. Kruskal and M. Snir. The Performance of Multistage Interconnection Networks for Multiprocessors. IEEE Transactions on Computers, C-32(12):1091- 1098, Sept., 1987.
|
 |
14
|
|
| |
15
|
|
| |
16
|
J. M. Mulder, N. T. Quach, and M. J. Flynn. An area model for on-chip memories and its application. Journal of Solid State Circuits, 26:98-106, Feb. 1991.
|
| |
17
|
D. A. Padua, R. Eigenmann, J. Hoeflinger, P. Peterson, P. Tu, S. Weatherford, and K. Faign. Polaris: A New-Generation Parallelizing Compiler for MPPs. In CSRD Rept. No. 1306. Univ. of Illinois at Urbana- Champaign., June, 1993.
|
 |
18
|
|
| |
19
|
|
 |
20
|
|
| |
21
|
A. V. Veidenbaum. A Comp~r-Assisted Cache Coherence Solution for Multiprocessors. Proceedings of the 1986 International Conference on Parallel Processing, pages 1029-1035, August 1986.
|
 |
22
|
|
|