ACM Home Page
Please provide us with feedback. Feedback
Digital Library logoTake a look at the new version of this page: [ beta version ]. Tell us what you think.
Evaluating multi-core platforms for HPC data-intensive kernels
Full text PdfPdf (558 KB)
Source
Conference On Computing Frontiers archive
Proceedings of the 6th ACM conference on Computing frontiers table of contents
Ischia, Italy
SESSION: Advanced computing systems management and evaluation table of contents
Pages: 207-216  
Year of Publication: 2009
ISBN:978-1-60558-413-3
Authors
Alexander S. van Amesfoort  Delft University of Technology, Delft, Netherlands
Ana Lucia Varbanescu  Delft University of Technology, Delft, Netherlands
Henk J. Sips  Delft University of Technology, Delft, Netherlands
Rob V. van Nieuwpoort  ASTRON, Dwingeloo, Netherlands
Sponsors
ACM: Association for Computing Machinery
SIGMICRO: ACM Special Interest Group on Microarchitectural Research and Processing
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 28,   Downloads (12 Months): 205,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1531743.1531777
What is a DOI?

ABSTRACT

Multi-core platforms have proven themselves able to accelerate numerous HPC applications. But programming data-intensive applications on such platforms is a hard, and not yet solved, problem. Not only do modern processors favor compute-intensive code, they also have diverse architectures and incompatible programming models. And even after making a difficult platform choice, extensive programming effort must be invested with an uncertain performance outcome. By taking the plunge on an irregular, data-intensive application, we present an evaluation of three platform types, namely the generic multi-core CPU, the STI Cell/B.E., and the GPU. We evaluate these platforms in terms of application performance, programming effort and cost. Although we do not select a clear winner, we do provide a list of guidelines to assist in platform choice and development of similar data-intensive applications.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
C. Benthin, I. Wald, M. Scherbaum, and H. Friedrich. Ray tracing on the Cell processor. In IEEE Symposium of Interactive Ray Tracing, pages 15--23. IEEE Computer Society Press, September 2006.
 
3
F. Blagojevic, A. Stamatakis, C. Antonopoulos, and D.S. Nikolopoulos. RAxML-CELL: Parallel phylogenetic tree construction on the Cell Broadband Engine. In IEEE International Parallel and Distributed Processing Symposium. IEEE Press, March 2007.
 
4
 
5
D.S. Briggs, F.R. Schwab, and R.A. Sramek. Imaging. In Synthesis Imaging in Radio Astronomy II, volume 180 of Astronomical Society of the Pacific Conference Series, pages 127--140, 1999.
 
6
A. Buttari, P. Luszczek, J. Kurzak, J. Dongarra, and G. Bosilca. SCOP3: A rough guide to scientific computing on the PlayStation 3. Technical Report UT-CS-07-595, Innovative Computing Lab., University of Tennessee, Knoxville, April 2007.
7
 
8
T. Cornwell, K. Golap, and S. Bhatnagar. W projection: A new algorithm for wide field imaging with radio synthesis arrays. In Astronomical Data Analysis Software and Systems XIV, volume 347, pages 86--95. ASP Press, October 2004.
 
9
T.J. Cornwell. SKA and EVLA computing costs for wide field imaging. Experimental Astronomy, 17:329--343, June 2004.
 
10
T.J. Cornwell and R.A. Perley. Radio-interferometric imaging of very large fields -- The problem of non-coplanar arrays. Astronomy and Astrophysics, 261:353--364, July 1992.
 
11
 
12
U. Drepper. What Every Programmer Should Know About Memory, November 2007.
13
 
14
Khronos OpenCL Working Group. OpenCL 1.0 Standard, December 2008.
 
15
16
 
17
L.-K. Liu, Q. Liu, A.P. Natsev, K.A. Ross, J.R. Smith, and A.L. Varbanescu. Digital media indexing on the Cell processor. In IEEE International Conference on Multimedia and Expo, pages 1866--1869, July 2007.
 
18
W. Liu, B. Schmidt, G. Voss, and W. Müller-Wittig. Molecular dynamics simulations on commodity GPUs with CUDA. In High Performance Computing, pages 185--196, December 2007.
 
19
M. McCool. Signal processing and general-purpose computing on GPUs. IEEE Signal Processing Magazine, pages 109--114, May 2007.
 
20
S.K. Moore. Multicore is bad news for supercomputers. IEEE Spectrum, November 2008.
 
21
NVIDIA. CUDA Programming Guide, December 2008.
 
22
F. Petrini, J. Fernàndez, M. Kistler, G. Fossum, A.L. Varbanescu, and M. Perrone. Multicore surprises: Lessons learned from optimizing Sweep3D on the Cell Broadband Engine. In IEEE International Parallel and Distributed Processing Symposium. IEEE Press, March 2007.
 
23
 
24
K. van der Schaaf, C. Broekema, G. Diepen, and E. Meijeren. The LOFAR central processing facility architecture. Experimental Astronomy, 17(1-3):43--58, June 2004.
 
25
R.V. van Nieuwpoort and J.W. Romein. Using many-core hardware to correlate radio astronomy signals. In 23rd ACM International Conference on Supercomputing, June 2009, to appear.
 
26
A.L. Varbanescu, A. van Amesfoort, T. Cornwell, B.G. Elmegreen, R. van Nieuwpoort, G. van Diepen, and H. Sips. The performance of gridding/degridding on the Cell/B.E. Technical report, Delft University of Technology, January 2008.
 
27
A.L. Varbanescu, A. van Amesfoort, T. Cornwell, B.G. Elmegreen, R. van Nieuwpoort, G. van Diepen, and H. Sips. Radioastronomy image synthesis on the Cell/B.E. Technical report, Delft University of Technology, August 2008.
 
28
R. Wayth, K. Dale, L. Greenhill, D. Mitchell, S. Ord, and H. Pfister. Real-time calibration and imaging for the MWA (poster). In AstroGPU, November 2007.
 
29
S. Williams, J. Carter, L. Oliker, J. Shalf, and K.A. Yelick. Lattice boltzmann simulation optimization on leading multicore platforms. In Symposium on Parallel and Distributed Processing, pages 1--14, April 2008.
30
31

Collaborative Colleagues:
Alexander S. van Amesfoort: colleagues
Ana Lucia Varbanescu: colleagues
Henk J. Sips: colleagues
Rob V. van Nieuwpoort: colleagues