|
ABSTRACT
Multiple clock cycles are needed to cross the global interconnects for multi-gigahertz designs in nanometer technologies. For synchronous designs, this requires retiming and pipelining on global interconnects. In this paper, we present a practical solution for simultaneous retiming and multilevel global placement for performance optimization, based on the theory and algorithms of sequential timing analysis (Seq-TA). We extend the Seq-TA to handle gates/clusters with multiple outputs and integrate it into a multilevel optimization framework for simultaneous retiming and placement. We also develop two speed-up techniques which enable the Seq-TA to be efficiently integrated into a simulated annealing-based multilevel coarse placement for large-scale designs. Experimental results show that (i) retiming can improve the performance (delay) by 14% on average when it is applied after placement; (ii) our approach for simultaneous retiming and placement can outperform the two-step approach (placement followed by retiming) by 10% on average in terms of delay minimization.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
|
| |
2
|
P. Chong and R. K. Brayton. Characterization of feasible retimings. In Proc. Int. Workshop on Logic and Synthesis, pages 1--6, 2001.
|
| |
3
|
J. Cong. An interconnect-centric design flow for nanometer technologies. In Proceedings of the IEEE, pages 505--527, April 2001.
|
 |
4
|
Jason Cong , Honching Li , Chang Wu, Simultaneous circuit partitioning/clustering with retiming for performance optimization, Proceedings of the 36th ACM/IEEE conference on Design automation, p.460-465, June 21-25, 1999, New Orleans, Louisiana, United States
[doi> 10.1145/309847.309980]
|
| |
5
|
|
 |
6
|
Jason Cong , Sung Kyu Lim , Chang Wu, Performance driven multi-level and multiway partitioning with retiming, Proceedings of the 37th conference on Design automation, p.274-279, June 05-09, 2000, Los Angeles, California, United States
[doi> 10.1145/337292.337418]
|
| |
7
|
J. Cong and C. Wu. An efficient algorithm for performance-optimal FPGA technology mapping with retiming. IEEE Trans. on Computer-Aided Design of Integrated Circuits and Systems, 17(9):738--748, 1998.
|
| |
8
|
J. Cong and X. Yuan. Multilevel global placement with retiming. Technical Report 03-0023, Computer Science Department, University of California, Los Angeles, 2003.
|
 |
9
|
|
 |
10
|
|
| |
11
|
C. E. Leiserson and J. B. Saxe. Retiming synchronous circuitry. Algorithmica, pages 5--35, 1991.
|
 |
12
|
Alexander Marquardt , Vaughn Betz , Jonathan Rose, Timing-driven placement for FPGAs, Proceedings of the 2000 ACM/SIGDA eighth international symposium on Field programmable gate arrays, p.203-213, February 10-11, 2000, Monterey, California, United States
[doi> 10.1145/329166.329208]
|
| |
13
|
|
| |
14
|
I. Neumann and W. Kunz. Tight coupling of timing-driven placement and retiming. In Proc. IEEE Int. Symp. on Circuits and Systems, pages 351--354, 2001.
|
| |
15
|
|
| |
16
|
P. Pan, A. K. Karandikar, and C. L. Liu. Optimal clock period clustering for sequential circuits with retiming. IEEE Trans. on Computer-Aided Design of Integrated Circuits and Systems, 17(6):489--498, 1998.
|
 |
17
|
|
| |
18
|
Semiconductor Industry Association. International Technology Roadmap for Semiconductors, 2002 update.
|
| |
19
|
M. Senn, U. Seidl, and F. Johannes. High quality deterministic timing driven FPGA placement. In Proc. ACM/SIGDA Int. Symp. on Field Programmable Gate Arrays, 2002.
|
 |
20
|
|
 |
21
|
Tzu-Chieh Tien , Hsiao-Pin Su , Yu-Wen Tsay , Yih-Chih Chou , Youn-Long Lin, Integrating logic retiming and register placement, Proceedings of the 1998 IEEE/ACM international conference on Computer-aided design, p.136-139, November 08-12, 1998, San Jose, California, United States
[doi> 10.1145/288548.288591]
|
|