|
ABSTRACT
We introduce an approach for power optimization using a set of compilation and architectural techniques. The key technical innovation is a novel divide-and-conquer compilation technique to minimize the number of operations for general computations. Our technique optimizes not only a significantly wider set of computations than the previously published techniques, but also outperforms (or performs at least as well as other techniques) on all examples. Along the architectural dimension, we investigate coordinated impact of compilation techniques on the number of processors which provide optimal trade-off between cost and power. We demonstrate that proper compilation techniques can significantly reduce power with bounded hardware cost. The effectiveness of all techniques and algorithms is documented on numerous real-life designs.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
|
| |
2
|
BANERJEE, U., EIGENMANN, R., NICOLAU, A., AND PADUA, D.A. 1993. Automatic program parallelization. Proc. IEEE 81, 2 (Feb.), 211-243.
|
| |
3
|
BHATTACHARYYA, S. S., BUCK, J. T., HA, S., AND LEE, E.A. 1993. A scheduling framework for minimizing memory requirements of multirate signal processing algorithms expressed as dataflow graphs. In VLSI Signal Processing VI IEEE Press, Piscataway, NJ, 188-196.
|
| |
4
|
CHANDRAKASAN, A. P., SHENG, S., AND BRODERSEN, R. W. 1992. Low-power CMOS digital design. IEEE J. Solid-State Circuits 27, 4 (Apr. 1992), 473-484.
|
| |
5
|
CHANDRAKASAN, A. P., SRIVASTAVA, M., AND BRODERSEN, R.W. 1994. Energy efficient programmable computation. In Proceedings of the 7th International Conference on VLSI Design 261-264.
|
| |
6
|
CHATTERJEE, A. AND ROY, R. K. 1994. Synthesis of low power DSP circuits using activity metrics. In Proceedings of the 7th International Conference on VLSI Design 265-270.
|
| |
7
|
CHANDRAKASAN, A. P., POTKONJAK, M., MEHRA, R., RABAEY, J., AND BRODERSEN, R. W. 1995. Optimizing power using transformations. IEEE Trans. Comput.-Aided Des. 14, 1 (1995), 13-32.
|
 |
8
|
Anantha Chandrakasan , Isabel Yang , Carlin Vieri , Dimitri Antoniadis, Design considerations and tools for low-voltage digital system design, Proceedings of the 33rd annual conference on Design automation, p.113-118, June 03-07, 1996, Las Vegas, Nevada, United States
[doi> 10.1145/240518.240540]
|
| |
9
|
|
| |
10
|
EL-KAREH, B., CHEN, B., AND STANLEY, T. 1995. Silicon on insulator--an emerging highleverage technology. IEEE Trans. Compon., Packaging, Manuf. Technol. 18, 1, 187-194.
|
| |
11
|
|
| |
12
|
|
| |
13
|
|
| |
14
|
GUERRA, L., POTKONJAK, M., AND RABAEY, J. 1994. System-level design guidance using algorithm properties. In Proceedings of the IEEE Workshop on VLSI Signal Processing VII IEEE Press, Piscataway, NJ, 73-82.
|
| |
15
|
HOANG, P. D. AND RABAEY, J. M. 1993. Scheduling of DSP programs onto multiprocessors for maximum throughput. IEEE Trans. Signal Process. 41, 6, 2225-2235.
|
| |
16
|
HUANG, S. H. AND RABAEY, J. M. 1994. Maximizing the throughput of high performance DSP applications using behavioral transformations. In EDAC '94 (EDAC '94, 1994) 25-30.
|
| |
17
|
IPPOSHI, T., IWAMATSU, T., YAMAGUCHI, Y., UEDA, K., MORINAKA, H., MASHIKO, K., INOUE, Y., AND Hmno, T. 1995. An advanced 0.5 mu m CMOS/SOI technology for practical ultrahigh-speed and low-power circuits. In Proceedings of the International Conference on SOI 46-47.
|
 |
18
|
Zia Iqbal , Miodrag Potkonjak , Sujit Dey , Alice Parker, Critical path minimization using retiming and algebraic speed-up, Proceedings of the 30th international conference on Design automation, p.573-577, June 14-18, 1993, Dallas, Texas, United States
[doi> 10.1145/157485.165046]
|
| |
19
|
KAM, J. B. AND ULLMAN, J. D. 1997. Monotone data flow analysis frameworks. Acta Inf. 7, 305-317.
|
| |
20
|
KIRKPATRICK, S., GELATT, C. D., JR., AND VECCHI, M. P. 1983. Optimization by simulated annealing. Science 220, 4598 (May), 671-680.
|
| |
21
|
|
| |
22
|
LANDMAN, P. E. AND RABAEY, g. 1996. Activity-sensitive architectural power analysis. IEEE Trans. Comput.-Aided Des. 15, 6, 571-587.
|
| |
23
|
LEE, E. AND MESSERSCHMITT, D. 1987. Synchronous data flow. Proc. IEEE 75, 9, 1235-1245.
|
| |
24
|
LEE, E. A. AND PARKS, T. M. 1995. Dataflow process networks. Proc. IEEE 83, 5 (May), 773-801.
|
| |
25
|
|
| |
26
|
LEISERSON, C. E. AND SAXE, J. B. 1991. Retiming synchronous circuitry. Algorithmica 6, 1, 5-35.
|
 |
27
|
|
| |
28
|
|
| |
29
|
|
| |
30
|
|
| |
31
|
|
| |
32
|
POTKONJAK, M., SRIVASTAVA, M., AND CHANDRAKASAN, A. P. 1996. A. P. Multiple constant multiplications: efficient and versatile framework and algorithms for exploring common subexpression elimination. IEEE Trans. Comput.-Aided Des. 15, 2, 151-165.
|
| |
33
|
|
| |
34
|
|
| |
35
|
|
| |
36
|
|
| |
37
|
SINGH, D., RABAEY, J., PEDRAM, M., CATTHOOR, F., RAJGOPAL, S., SEHGAL, N., AND MOZDZEN, T. 1995. Power conscious CAD tools and methodologies: A perspective. Proc. IEEE 83, 4 (Apr.), 570-594.
|
| |
38
|
SRIVASTAVA, M. B. AND POTKONJAK, M. 1994. Transforming linear systems for joint latency and throughput optimization. In EDAC '94 (EDAC '94, 1994) 267-271.
|
 |
39
|
|
 |
40
|
|
| |
41
|
TARJAN, R. E. 1972. Depth first search and linear graph algorithms. SIAM J. Comput. 1, 2 (1972), 146-160.
|
| |
42
|
|
 |
43
|
|
| |
44
|
|
| |
45
|
WALKER, R. AND CAMPOSANO, R. 1991. A Survey of High-Level Synthesis Systems. Kluwer Academic Publishers, Hingham, MA.
|
 |
46
|
|
 |
47
|
|
| |
48
|
|
|