| Vector stream processing for effective application of heterogeneous parallelism |
| Full text |
Pdf
(652 KB)
|
Source
|
Symposium on Applied Computing
archive
Proceedings of the 2009 ACM symposium on Applied Computing
table of contents
Honolulu, Hawaii
SESSION: Computational sciences track
table of contents
Pages 976-980
Year of Publication: 2009
ISBN:978-1-60558-166-8
|
|
Authors
|
|
John C. Linford
|
Virginia Polytechnic Institute and State University, Blacksburg, VA
|
|
Adrian Sandu
|
Virginia Polytechnic Institute and State University, Blacksburg, VA
|
|
| Sponsor |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 11, Downloads (12 Months): 48, Citation Count: 0
|
|
|
ABSTRACT
Heterogeneous multicore chipsets with many levels of parallelism are becoming increasingly common in high-performance computing systems. Effective use of parallelism in these new chipsets is paramount. We present a 3D chemical transport module optimized for the Cell Broadband Engine Architecture (CBEA). By leveraging the heterogeneous parallelism of the Cell with a method we call vector stream processing, our transport module achieves performance comparable to two nodes of an IBM BlueGene/P, or eight Xeon cores, on a single Cell chip. Performance of the module on two CBEA systems, an IBM BlueGene/P, and an eight-core shared-memory Intel Xeon workstation are given.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
 |
2
|
Filip Blagojevic , Dimitris S. Nikolopoulos , Alexandros Stamatakis , Christos D. Antonopoulos, Dynamic multigrain parallelization on the cell broadband engine, Proceedings of the 12th ACM SIGPLAN symposium on Principles and practice of parallel programming, March 14-17, 2007, San Jose, California, USA
[doi> 10.1145/1229428.1229445]
|
| |
3
|
|
 |
4
|
Kayvon Fatahalian , Daniel Reiter Horn , Timothy J. Knight , Larkhoon Leem , Mike Houston , Ji Young Park , Mattan Erez , Manman Ren , Alex Aiken , William J. Dally , Pat Hanrahan, Sequoia: programming the memory hierarchy, Proceedings of the 2006 ACM/IEEE conference on Supercomputing, November 11-17, 2006, Tampa, Florida
[doi> 10.1145/1188455.1188543]
|
| |
5
|
B. Flachs, S. Asano, S. H. Dhong, H. P. Hofstee, G. Gervais, R. Kim, T. Le, and et. al. The microarchitecture of the synergistic processor for a cell processor. IEEE J. Solid State Circuits, 41(1): 63--70, 2006.
|
| |
6
|
N. T. Hieu, K. C. Keong, A. Wirawan, and B. Schmidt. Applications of heterogeneous structure of Cell Broadband Engine Architecture for biological database similarity search. In Proceedings of the The 2nd International Conference on Bioinformatics and Biomedical Engineering (ICBBE '08), pages 5--8, Shanghai, May 2008.
|
| |
7
|
|
| |
8
|
W. Hundsdorfer. Numerical solution of advection-diffusion-reaction equations. Technical report, Centrum voor Wiskunde en Informatica, 1996.
|
 |
9
|
|
| |
10
|
|
 |
11
|
|
| |
12
|
J. Ray, C. Kennedy, S. Lefantzi, and H. Najm. High-order spatial discretizations and extended stability methods for reacting flows on structured adaptively refined meshes. In Proceedings of Third Joint Meeting of the U.S. Sections of the Combustion Institute, Chicago, USA, March 2003.
|
| |
13
|
|
 |
14
|
Samuel Williams , John Shalf , Leonid Oliker , Shoaib Kamil , Parry Husbands , Katherine Yelick, The potential of the cell processor for scientific computing, Proceedings of the 3rd conference on Computing frontiers, May 03-05, 2006, Ischia, Italy
[doi> 10.1145/1128022.1128027]
|
|