ACM Home Page
Please provide us with feedback. Feedback
WaveScalar
Full text PdfPdf (229 KB)
Source International Symposium on Microarchitecture archive
Proceedings of the 36th annual IEEE/ACM International Symposium on Microarchitecture table of contents
Page: 291  
Year of Publication: 2003
ISBN:0-7695-2043-X
Authors
Steven Swanson  Computer Science and Engineering, University of Washington
Ken Michelson  Computer Science and Engineering, University of Washington
Andrew Schwerin  Computer Science and Engineering, University of Washington
Mark Oskin  Computer Science and Engineering, University of Washington
Sponsor
SIGMICRO: ACM Special Interest Group on Microarchitectural Research and Processing
Publisher
IEEE Computer Society  Washington, DC, USA
Bibliometrics
Downloads (6 Weeks): 2,   Downloads (12 Months): 28,   Citation Count: 34
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  

ABSTRACT

Silicon technology will continue to provide an exponential increasein the availability of raw transistors. Effectively translatingthis resource into application performance, however,is an open challenge. Ever increasing wire-delay relativeto switching speed and the exponential cost of circuit complexitymake simply scaling up existing processor designs futile.In this paper, we present an alternative to superscalardesign, WaveScalar. WaveScalar is a dataflow instructionset architecture and execution model designed for scalable,low-complexity/high-performance processors. WaveScalar isunique among dataflow architectures in efficiently providingtraditional memory semantics. At last, a dataflow machinecan run "real-world" programs, written in any language,without sacrificing parallelism.The WaveScalar ISA is designed to run on an intelligentmemory system. Each instruction in a WaveScalar binary executesin place in the memory system and explicitly communicateswith its dependents in dataflow fashion. WaveScalararchitectures cache instructions and the values they operateon in a WaveCache, a simple grid of "alu-in-cache" nodes.By co-locating computation and data in physical space, theWaveCache minimizes long wire, high-latency communication.This paper introduces the WaveScalar instruction setand evaluates a simulated implementation based on currenttechnology. Results for the SPEC and Mediabench applicationsdemonstrate that the WaveCache out-performs an aggressivelyconfigured superscalar design by 2-7 times, withample opportunities for future optimizations.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
2
 
3
 
4
5
6
 
7
 
8
[8] S. Allan and A. Oldehoeft, "A flow analysis procedure for the translation of high-level languages to a data flow language," IEEE Transactions on Computers, vol. 29, no. 9, 1980.
9
10
 
11
 
12
[12] L. Gwennap, "Digital 21264 sets new standard," Microprocessor Report, vol. 10, October 1996.
 
13
[13] "Map-ca datasheet," June 2001. Equator Technologies.
 
14
[14] H. Sharangpani, "Intel Itanium processor core," in Hot-Chips, 2000.
15
16
17
18
19
 
20
21
22
23
24
25
26
27
28
 
29
 
30
[30] R. Nikhil, "The parallel programming language Id and its compilation for parallel machines," in Workshop on Mazzive Paralleism: Hardware, Programming and Applications, 1990.
 
31
 
32
[32] J. T. Feo, P. J. Miller, and S. K. Skedzielewski, "Sisal90," in High Performance Functional Computing, 1995.
 
33
[33] S. Murer and R. Marti, "The FOOL programming language: Integrating single-assignment and object-oriented paradigms," in European Workshop on Parallel Computing, 1992.
 
34
[34] J. B. Dennis, "First version data flow procedure language," Tech. Rep. MAC TM61, MIT Laboratory for Computer Science, 1991.
35
36
 
37
[37] J. B. Dennis, "Dataflow supercomputers," in IEEE Computer, vol. 13, 1980.
38
39
40
41
42
 
43
44
 
45
 
46
[46] S. Swanson, K. Michelson, and M. Oskin, "Configuration by combustion: Online simulated annealing for dynamic hard-ware configuration," in ASPLOS X Wild and Crazy Idea Session , 2002.
47
 
48
[48] S. McFarling, "Combining Branch Predictors," Tech. Rep. TN- 36, Digital Equipment Corperation, June 1993.
 
49
[49] SPEC, "Spec CPU 2000 benchmark specifications." SPEC2000 Benchmark Release, 2000.
 
50
 
51
 
52
[52] "Personal communication with doug burger and steve keckler," 2002-2003.
53
54

CITED BY  34

Collaborative Colleagues:
Steven Swanson: colleagues
Ken Michelson: colleagues
Andrew Schwerin: colleagues
Mark Oskin: colleagues