ACM Home Page
Please provide us with feedback. Feedback
Compilation for explicitly managed memory hierarchies
Full text PdfPdf (496 KB)
Source
Principles and Practice of Parallel Programming archive
Proceedings of the 12th ACM SIGPLAN symposium on Principles and practice of parallel programming table of contents
San Jose, California, USA
SESSION: Compilation, performance, and energy table of contents
Pages: 226 - 236  
Year of Publication: 2007
ISBN:978-1-59593-602-8
Authors
Timothy J. Knight  Stanford University, Stanford, CA
Ji Young Park  Stanford University, Stanford, CA
Manman Ren  Stanford University, Stanford, CA
Mike Houston  Stanford University, Stanford, CA
Mattan Erez  Stanford University, Stanford, CA
Kayvon Fatahalian  Stanford University, Stanford, CA
Alex Aiken  Stanford University, Stanford, CA
William J. Dally  Stanford University, Stanford, CA
Pat Hanrahan  Stanford University, Stanford, CA
Sponsors
ACM: Association for Computing Machinery
SIGPLAN: ACM Special Interest Group on Programming Languages
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 13,   Downloads (12 Months): 99,   Citation Count: 8
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1229428.1229477
What is a DOI?

ABSTRACT

We present a compiler for machines with an explicitly managed memory hierarchy and suggest that a primary role of any compiler for such architectures is to manipulate and schedule a hierarchy of bulk operations at varying scales of the application and of the machine. We evaluate the performance of our compiler using several benchmarks running on a Cell processor.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
E. Allen, D. Chase, V. Luchangco, J.-W. Maessen, S. Ryu, G. Steele, and S. Tobin-Hochstadt. The Fortress language specification version 0.707. Technical report. Sun Microsystems, 2005.
 
2
B. Alpern, L. Carter, and J. Ferrante. Modeling parallel computers as memory hierarchies. In Programming Models for Massively Parallel Computers, 1993.
3
 
4
D. Callahan, B. L. Chamberlain, and H. P. Zima. The Cascade high productivity language. In Proceedings of the Ninth International Workshop on High Level Parallel Programming Models and Supportive Environments, pages 52--60. IEEE Computer Society, 2004.
5
 
6
W. W. Carlson, J. M. Draper, D. E. Culler, K. Yelick, E. Brooks, and K. Warren. Introduction to UPC and language specification. University of California-Berkeley Technical Report: CCS-TR-99-157, 1999.
 
7
8
 
9
A. Chow, G. Fossum, and D. Brokenshire. A programming example: Large FFT on the Cell Broadband Engine. http://www-306.ibm.com/ chips/techlib/techlib.nsf/techdocs/0AA2394A505EF0FB872570AB005BF0F1, 2005.
 
10
L. Cico, R. Cooper, and J. Greene. Performance and programmability of the IBM/Sony/Toshiba Cell Broadband Engine processor. In Workshop on Edge Computing Using New Commodity Architectures (EDGE), 2006.
 
11
ClearSpeed. CSX600 Processor Datasheet. http://www.clearspeed.com/, 2005.
 
12
ClearSpeed. CSX600 Processor Datasheet. http://www.clearspeed.com/, 2005.
13
 
14
 
15
16
17
 
18
 
19
T. Fukushige, J. Makino, and A. Kawai. GRAPE-6A: A Single-Card GRAPE-6 for Parallel PC-GRAPE Cluster Systems. Publications of the Astronomical Society of Japan, 57:1009--1021, dec 2005.
20
 
21
 
22
IBM. Cell Broadband Engine Architecture Version 1.01. http://www-306.ibm.com/chips/techlib/techlib.nsf/techdocs/1AEEE1270EA2776387257060006E61BA, August 8 2005.
 
23
U. J. Kapasi, P. Mattson, W. J. Dally, J. D. Owens, and B. Towles. Stream scheduling. In Proceedings of the 3rd Workshop on Media and Streaming Processors, pages 101--106, 2001.
 
24
 
25
K. Kennedy, B. Broom, K. Cooper, J. Dongarra, R. Fowler, D. Gannon, L. Johnsson, J. Mellor-Crummey, and L. Torczon. Telescoping Languages: A Strategy for Automatic Generation of Scientific Problem-Solving Systems from Annotated Libraries. Journal of Parallel Distributed Computing, 61:1803--1826, December 2001.
 
26
27
 
28
 
29
S. McPeak and D. Wilderson. Elsa: The Elkhound-based C/C++Parser. http://www.cs.berkeley.edu/~smcpeak/elkhound, 2005.
 
30
Message Passing Interface Forum. MPI: A Message Passing Interface Standard, May 1994.
31
 
32
D. Pham, S. Asano, M. Bolliger, M. N. Day, H. P. Hofstee, C. Johns, J. Kahle, A. Kameyama, J. Keaty, Y. Masubuchi, M. Riley, D. Shippy, D. Stasiak, M. Suzuoki, M. Wang, J. Warnock, S.Weitzel, D.Wendel, T. Yamazaki, and K. Yazawa. The design and implementation of a first-generation CELL processor. In IEEE International Solid-State Circuits Conference, 2005.
 
33
RapidMind. http://rapidmind.net/.
 
34
35
 
36
K. Yelick, L. Semenzato, G. Pike, C. Miyamoto, B. Liblit, A. Krishnamurthy, P. Hilfinger, S. Graham, D. Gay, P. Colella, and A. Aiken. Titanium: A high-performance Java dialect. In ACM 1998 Workshop on Java for High-Performance Network Computing, 1998.

CITED BY  8

Collaborative Colleagues:
Timothy J. Knight: colleagues
Ji Young Park: colleagues
Manman Ren: colleagues
Mike Houston: colleagues
Mattan Erez: colleagues
Kayvon Fatahalian: colleagues
Alex Aiken: colleagues
William J. Dally: colleagues
Pat Hanrahan: colleagues