ACM Home Page
Please provide us with feedback. Feedback
Cluster assignment for high-performance embedded VLIW processors
Full text PdfPdf (227 KB)
Source ACM Transactions on Design Automation of Electronic Systems (TODAES) archive
Volume 7 ,  Issue 3  (July 2002) table of contents
Pages: 430 - 454  
Year of Publication: 2002
ISSN:1084-4309
Authors
Viktor S. Lapinskii  The University of Texas at Austin, Austin, TX
Margarida F. Jacome  The University of Texas at Austin, Austin, TX
Gustavo A. De Veciana  The University of Texas at Austin, Austin, TX
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 2,   Downloads (12 Months): 35,   Citation Count: 7
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/567270.567274
What is a DOI?

ABSTRACT

Clustering is an effective method to increase the available parallelism in VLIW datapaths without incurring severe penalties associated with a large number of register file ports. Efficient utilization of a clustered datapath requires careful binding/assignment of operations to clusters. The article proposes a binding algorithm that effectively explores trade-offs between in-cluster operation serialization and delays associated with data transfers between clusters. Extensive experimental evidence is provided showing that the algorithm generates high quality solutions for representative kernels, with up to 33% improvement over a state-of-the-art binding algorithm.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
Akturan, C. and Jacome, M. F. 2002. An effective software pipelining algorithm for clustered embedded VLIW processors. J. Des. Autom. Embed. Sys., Special Issue on Design Methodologies and Tools for Real-Time Embedded Systems (to appear).
 
3
Analog Devices. 2001. ADSP-TS001M TigerSHARC DSP product description. Available online at http://www.analog.com/products/descriptions/ADSP-TS001.html.
 
4
Basoglu, C., Zhao, K., Kojima, K., and Kawaguchi, A. 2000. The MAP-CA VLIW-based media processor. Equator Technologies Inc. and Hitachi Ltd. Available online at http://equator.com.
5
 
6
 
7
 
8
Desoli, G. 1998. Instruction assignment for clustered VLIW DSP compilers: A new approach. Tech. Rep. HPL-98-13, Hewlett-Packard Co., February.
 
9
Dixit, K. 2001. Performance SPECulations---Benchmarks, friend or foe. In Procedings of the Seventh International Symposium on High Performance Computer Architecture (Monterrey, Mexico).
 
10
 
11
12
 
13
Faraboschi, P., Desoli, G., and Fisher, J. A. 1998. Clustered instruction-level parallel processors. Tech. Rep. HPL-98-204, Hewlett-Packard Co., December.
 
14
 
15
 
16
17
 
18
 
19
 
20
 
21
Lapinskii, V., Jacome, M. F., and de Veciana, G. 2002. Application-specific clustered VLIW datapaths: Early exploration on a parameterized design space. IEEE Trans. Comput. Aid. Des. Integ. Circ. Syst. (accepted for publication).
 
22
 
23
24
 
25
 
26
27
 
28
 
29
Rau, B. R., Kathail, V., and Aditya, S. 1998. Machine-description driven compilers for EPIC processors. Tech. Rep. HPL-98-40, Hewlett-Packard Co., September.
 
30
Rixner, S., Dally, W. J., Khailany, B., Mattson, P., Kapasi, U. J., and Owens, J. D. 1999. Register organization for media processing. In Proceedings of the 26th International Symposium on High-Performance Computer Architecture.
 
31
 
32
Texas Instruments. 2000. TMS320C6000 CPU and instruction set reference guide. Literature Number: SPRU226.
 
33


Collaborative Colleagues:
Viktor S. Lapinskii: colleagues
Margarida F. Jacome: colleagues
Gustavo A. De Veciana: colleagues