| Comparative evaluation of latency reducing and tolerating techniques |
| Full text |
Pdf
(1.36 MB)
|
| Source
|
International Symposium on Computer Architecture
archive
Proceedings of the 18th annual international symposium on Computer architecture
table of contents
Toronto, Ontario, Canada
Pages: 254 - 263
Year of Publication: 1991
ISBN:0-89791-394-9
Also published in ...
|
|
Authors
|
|
Anoop Gupta
|
Computer Systems Laboratory, Stanford University, CA
|
|
John Hennessy
|
Computer Systems Laboratory, Stanford University, CA
|
|
Kourosh Gharachorloo
|
Computer Systems Laboratory, Stanford University, CA
|
|
Todd Mowry
|
Computer Systems Laboratory, Stanford University, CA
|
|
Wolf-Dietrich Weber
|
Computer Systems Laboratory, Stanford University, CA
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 4, Downloads (12 Months): 36, Citation Count: 53
|
|
|
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
|
| |
2
|
|
 |
3
|
Anant Agarwal , Beng-Hong Lim , David Kranz , John Kubiatowicz, APRIL: a processor architecture for multiprocessing, Proceedings of the 17th annual international symposium on Computer Architecture, p.104-114, May 28-31, 1990, Seattle, Washington, United States
|
 |
4
|
|
 |
5
|
|
 |
6
|
|
 |
7
|
Kourosh Gharachorloo , Anoop Gupta , John Hennessy, Performance evaluation of memory consistency models for shared-memory multiprocessors, Proceedings of the fourth international conference on Architectural support for programming languages and operating systems, p.245-257, April 08-11, 1991, Santa Clara, California, United States
|
 |
8
|
Kourosh Gharachorloo , Daniel Lenoski , James Laudon , Phillip Gibbons , Anoop Gupta , John Hennessy, Memory consistency and event ordering in scalable shared-memory multiprocessors, Proceedings of the 17th annual international symposium on Computer Architecture, p.15-26, May 28-31, 1990, Seattle, Washington, United States
|
| |
9
|
|
| |
10
|
J. R. Goodman. Cache consistency and sequential consistency. Technical Report no. 61, SCI Committee, March 1989.
|
 |
11
|
|
 |
12
|
R. H. Halstead, Jr. , T. Fujita, MASA: a multithreaded processor architecture for parallel symbolic computing, Proceedings of the 15th Annual International Symposium on Computer architecture, p.443-451, May 30-June 02, 1988, Honolulu, Hawaii, United States
|
 |
13
|
|
| |
14
|
|
| |
15
|
L. Lamport. How to make a multiprocessor computer that correctly executes multiprocess programs. IEEE Trans. Comput., C-28(9):241-248, September 1979.
|
| |
16
|
|
| |
17
|
R. L. Lee, R-C. Yew, and D. H. Lawrie. Data prefetching in shared memory multiprocessors. In Proc. Int. Conf. Paral. Proc., pages 28--31, August 1987.
|
 |
18
|
Daniel Lenoski , James Laudon , Kourosh Gharachorloo , Anoop Gupta , John Hennessy, The directory-based cache coherence protocol for the DASH multiprocessor, Proceedings of the 17th annual international symposium on Computer Architecture, p.148-159, May 28-31, 1990, Seattle, Washington, United States
|
| |
19
|
Ewing Lusk , James Boyle , Ralph Butler , Terrence Disz , Barnett Glickfeld , Ross Overbeek , James Patterson , Rick Stevens, Portable programs for parallel processors, Holt, Rinehart & Winston, Austin, TX, 1988
|
| |
20
|
J. D. McDonald and D. Baganoff. Vectorization of a particle simulation method for hypersonic ratified flow. In A/AA Thermodynamics, Plasmadynamics and Lasers Conference, June 1988.
|
| |
21
|
|
| |
22
|
G. F. Pfister, W. C. Brantley, D. A. George, S. L. Harvey, W. J. Kleinfelder, K. R McAuliffe, E. A. Melton, V. A. Norton, and J. Weiss. The IBM research parallel processor prototype (RP3): Introduction and architecture. In Proc. Int. Conf. Paral. Proc., pages 764-771, 1985.
|
| |
23
|
|
 |
24
|
|
| |
25
|
G. E. Schmidt. The Butterfly parallel processor. In Proc. int. Conf. Supercomputing, pages 362-365, 1987.
|
| |
26
|
B. J. Smith. Architecture and applications of the HEP multiprocessor computer system. SPIE, 298:241-248, 1981.
|
| |
27
|
|
| |
28
|
J. Torrellas, M. S. Lain, and J. L. Hennessy. Measurement, analysis, and improvement of the cache behavior of shared data in cache coherent multiprocessors. Technical Report CSL- TR-90-412, Stanford University, Feb. 1990.
|
 |
29
|
|
 |
30
|
|
CITED BY 53
|
|
|
|
|
|
|
|
Alain Kägi , Nagi Aboulenein , Douglas C. Burger , James R. Goodman, Techniques for reducing overheads of shared-memory multiprocessing, Proceedings of the 9th international conference on Supercomputing, p.11-20, July 03-07, 1995, Barcelona, Spain
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Toshihiro Ozawa , Yasunori Kimura , Shin'ichiro Nishizaki, Cache miss heuristics and preloading techniques for general-purpose programs, Proceedings of the 28th annual international symposium on Microarchitecture, p.243-248, November 29-December 01, 1995, Ann Arbor, Michigan, United States
|
|
|
|
|
|
Umakishore Ramachandran , Gautam Shah , Anand Sivasubramaniam , Aman Singla , Ivan Yanasak, Architectural mechanisms for explicit communication in shared memory multiprocessors, Proceedings of the 1995 ACM/IEEE conference on Supercomputing (CDROM), p.62-es, December 04-08, 1995, San Diego, California, United States
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Jinwoo Suh , Eun-Gyu Kim , Stephen P. Crago , Lakshmi Srinivasan , Matthew C. French, A performance analysis of PIM, stream processing, and tiled processing on memory-intensive signal processing kernels, ACM SIGARCH Computer Architecture News, v.31 n.2, May 2003
|
|
|
|
|
|
|
|
|
Daniel Lenoski , James Laudon , Kourosh Gharachorloo , Wolf-Dietrich Weber , Anoop Gupta , John Hennessy , Mark Horowitz , Monica S. Lam, The Stanford Dash Multiprocessor, Computer, v.25 n.3, p.63-79, March 1992
|
|
|
|
|
|
D. Lenoski , J. Laudon , T. Joe , D. Nakahira , L. Stevens , A. Gupta , J. Hennessy, The DASH Prototype: Logic Overhead and Performance, IEEE Transactions on Parallel and Distributed Systems, v.4 n.1, p.41-61, January 1993
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Ismail Kadayif , Ayhan Zorlubas , Selcuk Koyuncu , Olcay Kabal , Davut Akcicek , Yucel Sahin , Mahmut Kandemir, Capturing and optimizing the interactions between prefetching and cache line turnoff, Microprocessors & Microsystems, v.32 n.7, p.394-404, October, 2008
|
|