ACM Home Page
Please provide us with feedback. Feedback
Instruction buffering exploration for low energy VLIWs with instruction clusters
Full text Publisher SitePublisher Site PdfPdf (394 KB)
Source Asia and South Pacific Design Automation Conference archive
Proceedings of the 2004 Asia and South Pacific Design Automation Conference table of contents
Yokohama, Japan
SESSION: Embedded system architectures table of contents
Pages: 824 - 829  
Year of Publication: 2004
ISBN:0-7803-8175-0
Authors
Tom Vander Aa  K.U.Leuven/ESAT, Heverlee, Arenberg, Belgium
Murali Jayapala  K.U.Leuven/ESAT, Heverlee, Arenberg, Belgium
Francisco Barat  K.U.Leuven/ESAT, Heverlee, Arenberg, Belgium
Geert Deconinck  K.U.Leuven/ESAT, Heverlee, Arenberg, Belgium
Rudy Lauwereins  IMEC vzw, Heverlee, Belgium
Francky Catthoor  IMEC vzw, Heverlee, Belgium
Henk Corporaal  TU Eindhoven, AZ Eindhoven, Netherlands
Sponsors
IEICE : Institute of Electronics, Information and Communication Engineers
: IEEE Circuits and Systems Society
IPSJ : Information Processing Society of Japan
SIGDA: ACM Special Interest Group on Design Automation
Publisher
IEEE Press  Piscataway, NJ, USA
Bibliometrics
Downloads (6 Weeks): 2,   Downloads (12 Months): 19,   Citation Count: 5
Additional Information:

abstract   references   cited by   collaborative colleagues  

Tools and Actions: Review this Article  

ABSTRACT

For multimedia applications, loop buffering is an efficient mechanism to reduce the power in the instruction memory of embedded processors. In particular, software controlled clustered loop buffers are energy efficient. However current compilers for VLIW do not fully exploit the potentials offered by such a clustered organization This paper presents an algorithm to explore what is the optimal loop buffer configuration and the optimal way to use this configuration for an application or a set of applications. Results for the MediaBench application suite show an additional 18% reduction (on average) in energy in the instruction memory hierarchy as compared to traditional non-clustered approaches to the loop buffer without compromising performance.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
G. Slavenburg, S. Rathnam, and H. Dijkstra, "The Trimedia TM-1 PCI VLIW media processor," in Proceedings Hot Chips VIII Conference, 1996.
 
3
Texas Instruments Inc., C6000 Platform: DSP Selection Guide, 2003. Selection Guide No. SSDV004L.
 
4
L. Benini, D. Bruni, M. Chinosi, C. Silvano, V. Zaccaria, and R. Zafalon, "A power modeling and estimation framework for vliw-based embedded systems," in in Proc. Int. Workshop on Power And Timing Modeling, Optimization and Simulation PATMOS, September 2001.
 
5
F. Catthoor, K. Danckaert, C. Kulkarni, E. Brockmeyer, P. G. Kjeldsberg, T. Van Achteren, and T. Omnes, Data access and storage management for embedded programmable processors. Kluwer Academic Publishers, March 2002.
6
7
8
 
9
S. Rixner, W. Dally, B. Khialany, p. Mattson, U. Kapnasi, and J. Owens, "Register organization for media processing," in Proc of 26th International Symposium on High-Performance Computer Architecture (HiPC), January 2000.
 
10
M. Jayapala, F. Barat, T. Vander Aa, F. Catthoor, G. Deconinck, and H. Corporaal, "Clustered 10 buffer organization for low energy embedded processors," in Proc of 1st Workshop on Application Specific Processors (WASP), held in conjunction with MICRO-35, November 2002.
 
11
 
12
 
13
L. H. Lee, B. Moyer, J. Arends, and A. Arbor, "Low-cost embedded program loop caching - revisited," tech.rep., EECS, University of Michigan, December 1999.
 
14
A. Gordon-Ross, S. Cotterell, and F. Vahid, "Exploiting fixed programs in embedded systems: A loop cache example," in Proc of IEEE Computer Architecture Letters, Jan 2002.
15
 
16
17
 
18
19
 
20
V. Lapinskii, M. F. Jacome, and G. de Veciana, "Application-specific clustered vliw datapaths: Early exploration on a parameterized design space," IEEE Transactions on Computer Aided Design of Integrated Circuits and Systems, vol. 21, pp. 889--903, August 2002.
21
 
22
Trimaran group, http://www.trimaran.org, Trimaran: An Infrastructure for Research in Instruction-Level Parallelism, 1999.
 
23
 
24
T. Vander Aa, M. Jayapala, F. Barat, F. Catthoor, H. Corporaal, and G. Deconinck, "Instruction buffering exploration for low energy embedded processors," in Proc. of the 13th International Workshop on Power and Timing Modeling, Optimization and Simulation (PATMOS 2003) (E. M. J. J. Chico, ed.), (September 10--12, 2003, Torino, Italy), pp. 409--419, Springer Verlag, Lecture Notes in Computer Science, 09 2003.

Collaborative Colleagues:
Tom Vander Aa: colleagues
Murali Jayapala: colleagues
Francisco Barat: colleagues
Geert Deconinck: colleagues
Rudy Lauwereins: colleagues
Francky Catthoor: colleagues
Henk Corporaal: colleagues