|
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
Sarita V. Adve , Vikram S. Adve , Mark D. Hill , Mary K. Vernon, Comparison of hardware and software cache coherence schemes, Proceedings of the 18th annual international symposium on Computer architecture, p.298-308, May 27-30, 1991, Toronto, Ontario, Canada
|
 |
2
|
|
 |
3
|
A. Agarwal , R. Simoni , J. Hennessy , M. Horowitz, An evaluation of directory schemes for cache coherence, Proceedings of the 15th Annual International Symposium on Computer architecture, p.280-298, May 30-June 02, 1988, Honolulu, Hawaii, United States
|
| |
4
|
|
 |
5
|
|
 |
6
|
|
| |
7
|
|
 |
8
|
John K. Bennett , John B. Carter , Willy Zwaenepoel, Adaptive software cache management for distributed shared memory architectures, Proceedings of the 17th annual international symposium on Computer Architecture, p.125-134, May 28-31, 1990, Seattle, Washington, United States
|
| |
9
|
BnUYAN, L. N., LIu, B.-C., ANU AnnEu, I. 1989. Analysis of MIN-based nmltiprocessors with private cache memories. In the International Conference on Parallel Processing. Vol. 1. Archztecture, 51 58.
|
| |
10
|
BRANTLEY, W. C., MCAULIFFE, K. P., AND WEISS, J. 1985. RP3 processor-memory element. In the International Conference on Parallel Processlng, 782 789.
|
| |
11
|
CENS{ER, L. M., AND FEAUTmEn, P. 1978. A new solution to coherence problems m multlcache systems. IEEE Trans. Comput. C-27, (Dec.), 1112-1118.
|
 |
12
|
David Chaiken , John Kubiatowicz , Anant Agarwal, LimitLESS directories: A scalable cache coherence scheme, Proceedings of the fourth international conference on Architectural support for programming languages and operating systems, p.224-234, April 08-11, 1991, Santa Clara, California, United States
|
 |
13
|
|
 |
14
|
|
 |
15
|
|
| |
16
|
CHEONG, H , AND VEIDENBAUM, A V 1988b Stale data detectmn and coherence enforcement using flow analysm. In the International Conference on Parallel Processing. Vo}. I, Archttecture, 138-145
|
| |
17
|
Michel, Dubois , Christoph Scheurich , Fayé A. Briggs, Synchronization, Coherence, and Event Ordering in Multiprocessors, Computer, v.21 n.2, p.9-21, February 1988
[doi> 10.1109/2.15]
|
 |
18
|
Jan Edler , Allan Gottlieb , Clyde P. Kruskal , Kevin P. McAuliffe , Larry Rudolph , Marc Snir , Patricia J. Teller , James Wilson, Issues related to MIMD shared-memory computers: the NYU ultracomputer approach, Proceedings of the 12th annual international symposium on Computer architecture, p.126-135, June 17-19, 1985, Boston, Massachusetts, United States
|
 |
19
|
|
 |
20
|
|
 |
21
|
|
 |
22
|
Kourosh Gharachorloo , Anoop Gupta , John Hennessy, Performance evaluation of memory consistency models for shared-memory multiprocessors, Proceedings of the fourth international conference on Architectural support for programming languages and operating systems, p.245-257, April 08-11, 1991, Santa Clara, California, United States
|
 |
23
|
Kourosh Gharachorloo , Daniel Lenoski , James Laudon , Phillip Gibbons , Anoop Gupta , John Hennessy, Memory consistency and event ordering in scalable shared-memory multiprocessors, Proceedings of the 17th annual international symposium on Computer Architecture, p.15-26, May 28-31, 1990, Seattle, Washington, United States
|
 |
24
|
|
 |
25
|
|
 |
26
|
James R. Goodman , Mary K. Vernon , Philip J. Woest, Efficient synchronization primitives for large-scale cache-coherent multiprocessors, Proceedings of the third international conference on Architectural support for programming languages and operating systems, p.64-75, April 03-06, 1989, Boston, Massachusetts, United States
|
 |
27
|
Allan Gottlieb , Ralph Grishman , Clyde P. Kruskal , Kevin P. McAuliffe , Larry Rudolph , Marc Snir, The NYU Ultracomputer—designing a MIMD, shared-memory parallel machine (Extended Abstract), Proceedings of the 9th annual symposium on Computer Architecture, p.27-42, April 26-29, 1982, Austin, Texas, United States
|
 |
28
|
Anoop Gupta , John Hennessy , Kourosh Gharachorloo , Todd Mowry , Wolf-Dietrich Weber, Comparative evaluation of latency reducing and tolerating techniques, Proceedings of the 18th annual international symposium on Computer architecture, p.254-263, May 27-30, 1991, Toronto, Ontario, Canada
|
| |
29
|
GUPTA, A., WEBER, W.-D., AND MOWRY, T. 1990. Reducing memory and traffic reqmrements for scalable directory-based cache coherence schemes. In the International Conference on Parallel Processing Vol. I, Archzteeture, 312-321.
|
| |
30
|
|
| |
31
|
KARLINE, A. R., MANASS, M S., RUDOLPH, L., AND SENATOR, D. D 1986. Competitive snoopy cachemg. In the Symposmm on Foundatmns of Computer Sczence, 244-254.
|
 |
32
|
|
 |
33
|
Clyde P Kruskal , Larry Rudolph , Marc Snir, Efficient synchronization of multiprocessors with shared memory, Proceedings of the fifth annual ACM symposium on Principles of distributed computing, p.218-228, August 11-13, 1986, Calgary, Alberta, Canada
[doi> 10.1145/10590.10609]
|
| |
34
|
KUCK, D J, DAVIDSON, E S., LAWRIE, D. J., AND SAMEH, A.H. 1986. Parallel supercomputing today and the Cedar approach Science 231 (Feb. 28l, 967-974.
|
| |
35
|
LAMPORT, L. 1979 How to make a multiprocessot computer that correctly executes multiprocess programs. IEEE Trans. Comput. C-28, 9, {Sept ), 690-691
|
 |
36
|
R. L. Lee , P. C. Yew , D. H. Lawrie, Multiprocessor cache design considerations, Proceedings of the 14th annual international symposium on Computer architecture, p.253-262, June 02-05, 1987, Pittsburgh, Pennsylvania, United States
[doi> 10.1145/30350.30379]
|
| |
37
|
Daniel Lenoski , James Laudon , Kourosh Gharachorloo , Wolf-Dietrich Weber , Anoop Gupta , John Hennessy , Mark Horowitz , Monica S. Lam, The Stanford Dash Multiprocessor, Computer, v.25 n.3, p.63-79, March 1992
[doi> 10.1109/2.121510]
|
 |
38
|
Daniel Lenoski , James Laudon , Kourosh Gharachorloo , Anoop Gupta , John Hennessy, The directory-based cache coherence protocol for the DASH multiprocessor, Proceedings of the 17th annual international symposium on Computer Architecture, p.148-159, May 28-31, 1990, Seattle, Washington, United States
|
| |
39
|
|
 |
40
|
|
| |
41
|
LmJA, D.J. 1992. Prefetching and scheduling interactions m shared memory multlprocessors. In the Midwest Electrotechnology Con/erence, 84 87.
|
| |
42
|
IAIAA, D. J. 1991. Processor parallehsm consideratnms and memory latency reductmn in shared memory multiprocessors Rep. No. 1136, Center for Supercomputing Research and Development, Univ of I}linms, Urbana.
|
 |
43
|
|
| |
44
|
LILJA, D. J., MAR('.(>VlTZ. D. M., AND YEW, P.-C. 1989 Memory referencing' behavior and a cache performance metric in a shared memory multiprocessor Rep. No. 836, Center for Supercomputing Research and Development, Univ. of Illlnms, Urbana.
|
 |
45
|
D. E. Marquardt , H. S. Alkhatib, C2MP: a cache-coherent, distributed memory multiprocessor-system, Proceedings of the 1989 ACM/IEEE conference on Supercomputing, p.466-475, November 12-17, 1989, Reno, Nevada, United States
[doi> 10.1145/76263.76315]
|
| |
46
|
McCImICHT, E. M 1984. The Dragon Computer System, an early overview. In the NATO Advanced Study blstitute on Mwroarchztecture VLSI Camp uters, 83-101.
|
| |
47
|
MIN, S. L., AND BAER, J -L. 1990. A performance comparison of directory-based and timestampbased cache coherence schemes. In the International Conference on Parallel Processing. Vol. I, Architecture, 305 311.
|
| |
48
|
MIN. S. L., AND BAER J.-L. 1989. A timestampbased cache coherence scheme. In the International Con/Frencc on Parallel Processing. Vol. I, Architecture, 23 32
|
| |
49
|
MIZRAHI, H. E, BAER, J.-L., LAZOWSKA, E. D., AND ZAHORJAN, J. 1989. Extending- the memory hierarchy into multiprocessor lnterconnectian networks A performance analysis, in the Internattonal Conference on Parallel Processing. Vol. I, Architecture, 41-50.
|
| |
50
|
MOUNES-ToussI, F. 1993 An adaptive cache coherence enforcement strategy with compiler assistance. M S. Thesis, Dept. of Electrical Engineering, Univ of Minnesota, Minneapolis.
|
| |
51
|
Nt:UYEN, T. N., LL Z., AND Lm,IA, D.J. 1993. Efficxent use of dynamically tagged directories through compiler analysis. In the Internatwnal Conference on Parallel Processing.
|
 |
52
|
|
 |
53
|
|
| |
54
|
PERRON, R., AND MUNDIE, C. 1986. The architecture of the Alliant FX/8 Computer In IEEE COMPCON. IEEE, New York, 390 393.
|
| |
55
|
PFISTER, G. F., BRANTLEY, W. C., GEORGE, D. A., HARVEY, S. L, KLEINFELDER, W. J., McAULIFFE, K. P., MELTON, E. A., NORTON, V. A., AND WEISS, J. 1985. The IBM research parallel processor prototype (RP3): Introduction and Architecture. In the International Conference on Parallel Processing, 764 771.
|
| |
56
|
POLYCHRONOPOULOS, C. D. 1988. Toward autoscheduling compfiers. J. Supercomput., 2, 297 330.
|
 |
57
|
S. Prybylski , M. Horowitz , J. Hennessy, Performance tradeoffs in cache design, Proceedings of the 15th Annual International Symposium on Computer architecture, p.290-298, May 30-June 02, 1988, Honolulu, Hawaii, United States
|
| |
58
|
|
 |
59
|
|
| |
60
|
|
| |
61
|
TANIL C. K. 1976. Cache design In the tightly coupled multiprocessor system. In the AFIPS Conference Proceedings. National Computer Con{erence. AFIPS, Arlington, Va., 749-753.
|
| |
62
|
|
| |
63
|
TORRELLAS, J., AND HENNESSY, J 1990 Estimating the perfbrmance advantages of relaxing consistency in a shared-memory multiproeessor. In the Internatzonal Confi, rence on Parallel Processing. Vol. I, Architecture, 26-33.
|
| |
64
|
VEII)ENBAUM, A. V. 1986 A compiler-assisted cache coherence solution for multlprocessors. In the Internatzonal Conference on Parallel Processzng, 1029 1036.
|
 |
65
|
|
 |
66
|
|
| |
67
|
YEN, W. C., YEN, D. W. L, AND FU, K.-S. 1985. Data coherence problem m a multlcache system. IEEE Trans Comput. C-34, 1 (Jan), 56-65.
|
 |
68
|
|
|