| Instruction buffering exploration for low energy VLIWs with instruction clusters |
| Full text |
Publisher Site
,
Pdf
(394 KB)
|
| Source
|
Asia and South Pacific Design Automation Conference
archive
Proceedings of the 2004 Asia and South Pacific Design Automation Conference
table of contents
Yokohama, Japan
SESSION: Embedded system architectures
table of contents
Pages: 824 - 829
Year of Publication: 2004
ISBN:0-7803-8175-0
|
|
Authors
|
|
Tom Vander Aa
|
K.U.Leuven/ESAT, Heverlee, Arenberg, Belgium
|
|
Murali Jayapala
|
K.U.Leuven/ESAT, Heverlee, Arenberg, Belgium
|
|
Francisco Barat
|
K.U.Leuven/ESAT, Heverlee, Arenberg, Belgium
|
|
Geert Deconinck
|
K.U.Leuven/ESAT, Heverlee, Arenberg, Belgium
|
|
Rudy Lauwereins
|
IMEC vzw, Heverlee, Belgium
|
|
Francky Catthoor
|
IMEC vzw, Heverlee, Belgium
|
|
Henk Corporaal
|
TU Eindhoven, AZ Eindhoven, Netherlands
|
|
| Sponsors |
|
| Publisher |
IEEE Press
Piscataway, NJ, USA
|
| Bibliometrics |
Downloads (6 Weeks): 2, Downloads (12 Months): 19, Citation Count: 5
|
|
|
ABSTRACT
For multimedia applications, loop buffering is an efficient mechanism to reduce the power in the instruction memory of embedded processors. In particular, software controlled clustered loop buffers are energy efficient. However current compilers for VLIW do not fully exploit the potentials offered by such a clustered organization This paper presents an algorithm to explore what is the optimal loop buffer configuration and the optimal way to use this configuration for an application or a set of applications. Results for the MediaBench application suite show an additional 18% reduction (on average) in energy in the instruction memory hierarchy as compared to traditional non-clustered approaches to the loop buffer without compromising performance.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
G. Slavenburg, S. Rathnam, and H. Dijkstra, "The Trimedia TM-1 PCI VLIW media processor," in Proceedings Hot Chips VIII Conference, 1996.
|
| |
3
|
Texas Instruments Inc., C6000 Platform: DSP Selection Guide, 2003. Selection Guide No. SSDV004L.
|
| |
4
|
L. Benini, D. Bruni, M. Chinosi, C. Silvano, V. Zaccaria, and R. Zafalon, "A power modeling and estimation framework for vliw-based embedded systems," in in Proc. Int. Workshop on Power And Timing Modeling, Optimization and Simulation PATMOS, September 2001.
|
| |
5
|
F. Catthoor, K. Danckaert, C. Kulkarni, E. Brockmeyer, P. G. Kjeldsberg, T. Van Achteren, and T. Omnes, Data access and storage management for embedded programmable processors. Kluwer Academic Publishers, March 2002.
|
 |
6
|
|
 |
7
|
|
 |
8
|
Lea Hwang Lee , Bill Moyer , John Arends, Instruction fetch energy reduction using loop caches for embedded applications with small tight loops, Proceedings of the 1999 international symposium on Low power electronics and design, p.267-269, August 16-17, 1999, San Diego, California, United States
[doi> 10.1145/313817.313944]
|
| |
9
|
S. Rixner, W. Dally, B. Khialany, p. Mattson, U. Kapnasi, and J. Owens, "Register organization for media processing," in Proc of 26th International Symposium on High-Performance Computer Architecture (HiPC), January 2000.
|
| |
10
|
M. Jayapala, F. Barat, T. Vander Aa, F. Catthoor, G. Deconinck, and H. Corporaal, "Clustered 10 buffer organization for low energy embedded processors," in Proc of 1st Workshop on Application Specific Processors (WASP), held in conjunction with MICRO-35, November 2002.
|
| |
11
|
|
| |
12
|
Raminder S. Bajwa , Mitsuru Hiraki , Hirotsugu Kojima , Douglas J. Gorny , Kenichi Nitta , Avadhani Shridhar , Koichi Seki , Katsuro Sasaki, Instruction buffering to reduce power in processors for signal processing, IEEE Transactions on Very Large Scale Integration (VLSI) Systems, v.5 n.4, p.417-424, Dec. 1997
[doi> 10.1109/92.645068]
|
| |
13
|
L. H. Lee, B. Moyer, J. Arends, and A. Arbor, "Low-cost embedded program loop caching - revisited," tech.rep., EECS, University of Michigan, December 1999.
|
| |
14
|
A. Gordon-Ross, S. Cotterell, and F. Vahid, "Exploiting fixed programs in embedded systems: A loop cache example," in Proc of IEEE Computer Architecture Letters, Jan 2002.
|
 |
15
|
Nikolaos Bellas Ibrahim Hajj , George Stamoulis , N. Bellas , C. Polychronopoulos, Architectural and compiler support for energy reduction in the memory hierarchy of high performance microprocessors, Proceedings of the 1998 international symposium on Low power electronics and design, p.70-75, August 10-12, 1998, Monterey, California, United States
[doi> 10.1145/280756.280788]
|
| |
16
|
|
 |
17
|
|
| |
18
|
|
 |
19
|
Andrea Capitanio , Nikil Dutt , Alexandru Nicolau, Partitioned register files for VLIWs: a preliminary analysis of tradeoffs, Proceedings of the 25th annual international symposium on Microarchitecture, p.292-300, December 01-04, 1992, Portland, Oregon, United States
|
| |
20
|
V. Lapinskii, M. F. Jacome, and G. de Veciana, "Application-specific clustered vliw datapaths: Early exploration on a parameterized design space," IEEE Transactions on Computer Aided Design of Integrated Circuits and Systems, vol. 21, pp. 889--903, August 2002.
|
 |
21
|
|
| |
22
|
Trimaran group, http://www.trimaran.org, Trimaran: An Infrastructure for Research in Instruction-Level Parallelism, 1999.
|
| |
23
|
Chunho Lee , Miodrag Potkonjak , William H. Mangione-Smith, MediaBench: a tool for evaluating and synthesizing multimedia and communicatons systems, Proceedings of the 30th annual ACM/IEEE international symposium on Microarchitecture, p.330-335, December 01-03, 1997, Research Triangle Park, North Carolina, United States
|
| |
24
|
T. Vander Aa, M. Jayapala, F. Barat, F. Catthoor, H. Corporaal, and G. Deconinck, "Instruction buffering exploration for low energy embedded processors," in Proc. of the 13th International Workshop on Power and Timing Modeling, Optimization and Simulation (PATMOS 2003) (E. M. J. J. Chico, ed.), (September 10--12, 2003, Torino, Italy), pp. 409--419, Springer Verlag, Lecture Notes in Computer Science, 09 2003.
|
CITED BY 5
|
|
|
|
|
|
|
|
Rajiv A. Ravindran , Pracheeti D. Nagarkar , Ganesh S. Dasika , Eric D. Marsman , Robert M. Senger , Scott A. Mahlke , Richard B. Brown, Compiler Managed Dynamic Instruction Placement in a Low-Power Code Cache, Proceedings of the international symposium on Code generation and optimization, p.179-190, March 20-23, 2005
|
|
|
Yuki Kobayashi , Murali Jayapala , Praveen Raghavan , Francky Catthoor , Masaharu Imai, Efficient Method to Generate an Energy Efficient Schedule Using Operation Shuffling, IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences, v.E91-A n.2, p.604-612, February 2008
|
|
|
|
|