| CellSs: a programming model for the cell BE architecture |
| Full text |
Html
(2 KB),
Pdf
(608 KB)
|
| Source
|
Conference on High Performance Networking and Computing
archive
Proceedings of the 2006 ACM/IEEE conference on Supercomputing
table of contents
Tampa, Florida
SESSION: Technical papers
table of contents
Article No. 86
Year of Publication: 2006
ISBN:0-7695-2700-0
|
|
Authors
|
|
Pieter Bellens
|
Barcelona Supercomputing Center and UPC, Jordi Girona, Barcelona, (SPAIN)
|
|
Josep M. Perez
|
Barcelona Supercomputing Center and UPC, Jordi Girona, Barcelona, (SPAIN)
|
|
Rosa M. Badia
|
Barcelona Supercomputing Center and UPC, Jordi Girona, Barcelona, (SPAIN)
|
|
Jesus Labarta
|
Barcelona Supercomputing Center and UPC, Jordi Girona, Barcelona, (SPAIN)
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 19, Downloads (12 Months): 106, Citation Count: 25
|
|
|
ABSTRACT
In this work we present Cell superscalar (CellSs) which addresses the automatic exploitation of the functional parallelism of a sequential program through the different processing elements of the Cell BE architecture. The focus in on the simplicity and flexibility of the programming model. Based on a simple annotation of the source code, a source to source compiler generates the necessary code and a runtime library exploits the existing parallelism by building at runtime a task dependency graph. The runtime takes care of the task scheduling and data handling between the different processors of this heterogeneous architecture. Besides, a locality-aware task scheduling has been implemented to reduce the overhead of data transfers. The approach has been implemented and tested with a set of examples and the results obtained since now are promising.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
Badia, R. M., Labarta, J., Sirvent, R., Pérez, J. M., Cela, J. M., and Grima, R. 2003. Programming grid applications with GRID superscalar. Journal of Grid Computing 1, 2, 151--170.
|
| |
2
|
Compunity, 2006. The community of OpenMP users, researchers, tool developers and provider website. http://www.compunity.org/.
|
| |
3
|
A. E. Eichenberger , J. K. O'Brien , K. M. O'Brien , P. Wu , T. Chen , P. H. Oden , D. A. Prener , J. C. Shepherd , B. So , Z. Sura , A. Wang , T. Zhang , P. Zhao , M. K. Gschwind , R. Archambault , Y. Gao , R. Koo, Using advanced compiler technology to exploit the performance of the Cell Broadband EngineTM architecture, IBM Systems Journal, v.45 n.1, p.59-84, January 2006
|
| |
4
|
|
| |
5
|
|
| |
6
|
|
| |
7
|
Gonzalez, M., Balart, J., Duran, A., Martorell, X., and Ayguadé, E. 2004. Nanos mercurium: a research compiler for OpenMP. In Proceedings of the European Workshop on OpenMP,-.
|
| |
8
|
Gs, 2006. GRID superscalar homepage. http://www.bsc.es/grid/grid_superscalar/.
|
| |
9
|
|
| |
10
|
|
| |
11
|
IBM, 2006. PowerPC hosted environment for the cell broadband engine version 1.0.1. http://www.bsc.es/projects/deepcomputing/linuxoncell/cellsimulator/ppc-cellsimulator-sdk1.0.1.html.
|
| |
12
|
|
 |
13
|
|
| |
14
|
Perez, J. M., Badia, R. M., and Labarta, J. 2006. Scalar-aware grid superscalar. DAC technical report UPC-DAC-RR-CAP-2006-12, Universitat Politècnica de Catalunya, Computer Architecture Department, www.ac.upc.edu.
|
| |
15
|
Pham, D., and Al. 2005. The design and implementation of a first-generation cell processor. In Proceedings of the 2005 IEEE International Solid-State Circuits Conference (ISSCC), 184--185.
|
 |
16
|
|
 |
17
|
|
CITED BY 25
|
|
|
|
|
|
|
|
|
|
|
Timothy J. Knight , Ji Young Park , Manman Ren , Mike Houston , Mattan Erez , Kayvon Fatahalian , Alex Aiken , William J. Dally , Pat Hanrahan, Compilation for explicitly managed memory hierarchies, Proceedings of the 12th ACM SIGPLAN symposium on Principles and practice of parallel programming, March 14-17, 2007, San Jose, California, USA
|
|
|
|
|
|
Ernie Chan , Field G. Van Zee , Paolo Bientinesi , Enrique S. Quintana-Orti , Gregorio Quintana-Orti , Robert van de Geijn, SuperMatrix: a multithreaded runtime scheduling system for algorithms-by-blocks, Proceedings of the 13th ACM SIGPLAN Symposium on Principles and practice of parallel programming, February 20-23, 2008, Salt Lake City, UT, USA
|
|
|
|
|
|
|
|
|
Scott Schneider , Jae-Seung Yeom , Benjamin Rose , John C. Linford , Adrian Sandu , Dimitrios S. Nikolopoulos, A comparison of programming models for multiprocessors with explicitly managed memory hierarchies, Proceedings of the 14th ACM SIGPLAN symposium on Principles and practice of parallel programming, February 14-18, 2009, Raleigh, NC, USA
|
|
|
Filip Blagojevic , Dimitrios S. Nikolopoulos , Alexandros Stamatakis , Christos D. Antonopoulos , Matthew Curtis-Maury, Runtime scheduling of dynamic parallelism on accelerator-based multi-core systems, Parallel Computing, v.33 n.10-11, p.700-719, November, 2007
|
|
|
Tarik Saidani , Stéphane Piskorski , Lionel Lacassagne , Samir Bouaziz, Parallelization schemes for memory optimization on the cell processor: a case study of image processing algorithm, Proceedings of the 2007 workshop on MEmory performance: DEaling with Applications, systems and architecture, p.9-16, September 16-16, 2007, Brasov, Romania
|
|
|
|
|
|
Xavier Teruel , Priya Unnikrishnan , Xavier Martorell , Eduard Ayguadé , Raul Silvera , Guansong Zhang , Ettore Tiotto, OpenMP tasks in IBM XL compilers, Proceedings of the 2008 conference of the center for advanced studies on collaborative research: meeting of minds, October 27-30, 2008, Ontario, Canada
|
|
|
|
|
|
Manman Ren , Ji Young Park , Mike Houston , Alex Aiken , William J. Dally, A tuning framework for software-managed memory hierarchies, Proceedings of the 17th international conference on Parallel architectures and compilation techniques, October 25-29, 2008, Toronto, Ontario, Canada
|
|
|
|
|
|
|
|
|
|
|
|
Gregorio Quintana-Ortí , Francisco D. Igual , Enrique S. Quintana-Ortí , Robert A. van de Geijn, Solving dense linear systems on platforms with multiple hardware accelerators, Proceedings of the 14th ACM SIGPLAN symposium on Principles and practice of parallel programming, February 14-18, 2009, Raleigh, NC, USA
|
|
|
|
|
|
|
|
|
Pieter Bellens , Josep M. Perez , Felipe Cabarcas , Alex Ramirez , Rosa M. Badia , Jesus Labarta, CellSs: Scheduling techniques to better exploit memory hierarchy, Scientific Programming, v.17 n.1-2, p.77-95, January 2009
|
|
|
Filip Blagojevic , Costin Iancu , Katherine Yelick , Matthew Curtis-Maury , Dimitrios S. Nikolopoulos , Benjamin Rose, Scheduling dynamic parallelism on accelerators, Proceedings of the 6th ACM conference on Computing frontiers, May 18-20, 2009, Ischia, Italy
|
|
|
|
|
|
|
|