ACM Home Page
Please provide us with feedback. Feedback
Matrix product on heterogeneous master-worker platforms
Full text PdfPdf (438 KB)
Source
Principles and Practice of Parallel Programming archive
Proceedings of the 13th ACM SIGPLAN Symposium on Principles and practice of parallel programming table of contents
Salt Lake City, UT, USA
SESSION: Matrix product for special platforms table of contents
Pages 53-62  
Year of Publication: 2008
ISBN:978-1-59593-795-7
Authors
Jack Dongarra  University of Tennessee and Oak Ridge National Lab, USA and University of Manchester UK, Knoxville, USA
Jean-François Pineau  Ecole Normale Supérieure de Lyon, Université de Lyon, LIP CNRS - ENS Lyon - INRIA - UCBL, Lyon, France
Yves Robert  Ecole Normale Supérieure de Lyon, Université de Lyon, LIP CNRS - ENS Lyon - INRIA - UCBL, Lyon, France
Frédéric Vivien  INRIA, Université de Lyon, LIP CNRS - ENS Lyon - INRIA - UCBL, Lyon, France
Sponsors
SIGPLAN: ACM Special Interest Group on Programming Languages
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): n/a,   Downloads (12 Months): n/a,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1345206.1345217
What is a DOI?

ABSTRACT

This paper is focused on designing efficient parallel matrix-product algorithms for heterogeneous master-worker platforms. While matrix-product is well-understood for homogeneous 2D-arrays of processors (e.g., Cannon algorithm and ScaLAPACK outer product algorithm), there are three key hypotheses that render our work original and innovative:

- Centralized data. We assume that all matrix files originate from, and must be returned to, the master. The master distributes data and computations to the workers while in ScaLAPACK, input and output matrices are supposed to be equally distributed among participating resources beforehand). Typically, our approach is useful in the context of speeding up MATLAB or SCILAB clients running on a server (which acts as the master and initial repository of files).

- Heterogeneous star-shaped platforms. We target fully heterogeneous platforms, where computational resources have different computing powers. Also, the workers are connected to the master by links of different capacities. This framework is realistic when deploying the application from the server, which is responsible for enrolling authorized resources.

- Limited memory. As we investigate the parallelization of large problems, we cannot assume that full matrix column blocks can be stored in the worker memories and be re-used for subsequent updates (as in ScaLAPACK).

We have devised efficient algorithms for resource selection (deciding which workers to enroll) and communication ordering (both for input and result messages), and we report a set of numerical experiments on a platform at our site. The experiments show that our matrix-product algorithm has smaller execution times than existing ones, while it also uses fewer resources.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
 
3
 
4
 
5
 
6
 
7
R. Clint Whaley, A. Petitet, and J. J. Dongarra. Automated empirical optimizations of software and the atlas project. Parallel Computing, 27(1-2):3--35, Jan. 2001.
 
8
9
 
10
 
11
 
12
 
13
 
14
J.-F. Pineau, Y. Robert, F. Vivien, Z. Shi, and J. Dongarra. Revisiting matrix product on master-worker platforms. Research Report 2006-39, LIP, ENS Lyon, France, Nov. 2006.
 
15
J.-F. Pineau, Y. Robert, F. Vivien, Z. Shi, and J. Dongarra. Revisiting matrix product on master-worker platforms. IEEE Advances in Parallel and Distributed Computational Models, 2007.
 
16
T. Saif and M. Parashar. Understanding the behavior and performance of non-blocking communications in MPI. In Proceedings of Euro-Par 2004: Parallel Processing, LNCS 3149, pages 173--182, 2004.
 
17
 
18

Collaborative Colleagues:
Jack Dongarra: colleagues
Jean-François Pineau: colleagues
Yves Robert: colleagues
Frédéric Vivien: colleagues