ACM Home Page
Please provide us with feedback. Feedback
Positivity, posynomials and tile size selection
Full text PdfPdf (368 KB)
Source Conference on High Performance Networking and Computing archive
Proceedings of the 2008 ACM/IEEE conference on Supercomputing - Volume 00 table of contents
Austin, Texas
SECTION: Papers table of contents
Article No. 55  
Year of Publication: 2008
ISBN:978-1-4244-2835-9
Authors
Lakshminarayanan Renganarayana  IBM T.J. Watson Research Center, Yorktown Heights, New York
Sanjay Rajopadhye  Colorado State University, Fort Collins, Colorado
Publisher
IEEE Press  Piscataway, NJ, USA
Bibliometrics
Downloads (6 Weeks): 3,   Downloads (12 Months): 87,   Citation Count: 1
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  

ABSTRACT

Tiling is a widely used loop transformation for exposing/exploiting parallelism and data locality. Effective use of tiling requires selection and tuning of the tile sizes. This is usually achieved by developing cost models that characterize the performance of the tiled program as a function of tile sizes. All previous approaches to tile size selection (TSS) are cost model specific. Due to this they are neither extensible (e.g., to richer program classes/newer architectures) nor scalable (e.g., to multiple levels of tiling).

This paper identifies positivity as a fundamental property shared by the functions and parameters commonly used in TSS models. We show how this positivity can be used as a basis to derive a TSS framework which is both efficient and scalable. We also show that almost all TSS models proposed in the literature (including those used in production compilers and auto-tuners) can be reduced to our framework.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
2
R. Schreiber and J. Dongarra, "Automatic blocking of nested loops," RIACS, NASA Ames Research Center, Tech. Rep. 90.38, Aug 1990.
3
 
4
 
5
 
6
K. Yotov, X. Li, G. Ren, M. J. S. Garzaran, D. Padua, K. Pingali, and P. Stodghill, "Is search really necessary to generate high-performance BLAS?" Proceedings of the IEEE, vol. 93, pp. 358--386, 2005.
 
7
L. Renganarayanan, M. Harthi-kote, R. Dewri, and S. Rajopadhye, "Towards optimal multi-level tiling for stencil computations," in 21st IEEE International Parallel and Distributed Processing Symposium (IPDPS) (to appear), 2007.
8
 
9
 
10
 
11
 
12
13
 
14
J. Ramanujam and P. Sadayappan, "Tiling multidimensional itertion spaces for multicomputers." J. Parallel Distrib. Comput., vol. 16, no. 2, pp. 108--120, 1992.
 
15
R. Andonov, S. Balev, S. V. Rajopadhye, and N. Yanev, "Optimal semi-oblique tiling," IEEE Trans. Parallel Distrib. Syst., vol. 14, no. 9, pp. 944--960, 2003.
 
16
 
17
 
18
 
19
C. Hsu and U. Kremer, "Tile selection algorithms and their performance models," CS Dept, Rutgers University, Tech. Rep. DCS-TR-401, Oct. 1999.
20
21
 
22
 
23
B. B. Fraguela, M. G. Carmueja, and D. Andrade, "Optimal tile size selection guided by analytical models." in PARCO, 2005, pp. 565--572.
 
24
25
26
 
27
 
28
 
29
30
 
31
 
32
33
 
34
 
35
 
36
 
37
 
38
R. Duffin, E. Peterson, and C. Zener, Geometric Programming - Theory and Applications. John Wiley, 1967.
 
39
S. Boyd, S. J. Kim, L. Vandenberghe, and A. Hassibi, "A tutorial on Geometric Programming," To appear in Optimization and Engineering, 2006.
 
40
 
41
K. Esseghir, "Improving data locality for caches," Master's thesis, Rice University, September 1993.
 
42
S. Moon and R. Saavedra, "Hyperblocking: A data reorganization method to eliminate cache conflicts in tiled loop nests," University of Southern California, Tech. Rep. TR-98-671, February 1998.
 
43
 
44
 
45
 
46
 
47
L. Renganarayana, R. Upadrasta, and S. Rajopadhye, "Optimal ILP and register tiling: Analytical model and optimization framework," in LCPC 2005: 12th International Workshop on Languages and Compilers for Parallel Computing. Springer Verlag, 2005.
 
48
49
 
50
51
 
52
J. Löfberg, "YALMIP: A toolbox for modeling and optimization in MATLAB," in Proceedings of the CACSD Conference, Taipei, Taiwan, 2004, available from http://control.ee.ethz.ch/~joloef/yalmip.php.
 
53
U. Bondhugula, A. Hartono, J. Ramanujam, and P. Sadayappan, "A practical and fully automatic polyhedral program optimization system," in ACM SIGPLAN PLDI, Jun. 2008.
54
55
56
 
57
B. Meister and S. Verdoolaege, "Polynomial approximations in the polytope model: Bringing the power of quasi-polynomials to the masses," in ODES-6: 6th Workshop on Optimizations for DSP and Embedded Systems, Apr 2008, pp. 45--54.


Collaborative Colleagues:
Lakshminarayanan Renganarayana: colleagues
Sanjay Rajopadhye: colleagues