|
ABSTRACT
This paper describes a register and functional unit (FU) binding algorithm in high level synthesis. Our algorithm targets the reduction of multiplexer inputs. Since multiplexers connect multiple inputs to FUs or registers, the multiplexer count is a good indicator of the interconnect complexity. Reducing the number of multiplexer inputs results in reducing interconnect cost. Specifically, our algorithm constructs a weighted and ordered compatibility graph, and binds operations that form a long path in the graph together. As a result, operations with many flow dependencies and common inputs are bound to same FU, leading to a small number of FU inputs. In addition, the operation variables generated by a single FU are assigned to the same register so that connections between FUs and registers are reduced. We have implemented our algorithm within a MATLAB to Verilog conversion tool, and applied it to a suite of benchmark programs. Our experimental results have shown that the proposed scheme achieves 11.8%, 43.6% and 58.8% multiplexer input count reduction on average over weighted bipartite matching algorithm, k-cofamily algorithm and left edge algorithm, respectively. To assess the impact on interconnect reduction, we have generated layouts of the circuits from our Verilog description. It is shown that our approach delivers a 10.1% reduction in total wire-length of global interconnects with minor area overhead of register and FUs in comparison to the best previously proposed scheme.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
I. Ahmad and C. Y. R. Chen. Post-processor for data path synthesis using multiport memories. In Proceedings of ICCAD, November 1991.
|
| |
2
|
|
| |
3
|
|
| |
4
|
|
 |
5
|
|
| |
6
|
J. Cong, Y. Fan, G. Han, X. Yang, and Z. Zhang. Architecture and synthesis for on-chip multicycle communication. IEEE Transactions on CAD of integrated circuits and systems, 2004.
|
 |
7
|
|
 |
8
|
Jason Cong , Yiping Fan , Xun Yang , Zhiru Zhang, Architecture and synthesis for multi-cycle communication, Proceedings of the 2003 international symposium on Physical design, April 06-09, 2003, Monterey, CA, USA
[doi> 10.1145/640000.640040]
|
| |
9
|
|
| |
10
|
|
 |
11
|
Chu-Yi Huang , Yen-Shen Chen , Youn-Long Lin , Yu-Chin Hsu, Data path allocation based on bipartite weighted matching, Proceedings of the 27th ACM/IEEE conference on Design automation, p.499-504, June 24-27, 1990, Orlando, Florida, United States
[doi> 10.1145/123186.123350]
|
 |
12
|
|
| |
13
|
Ryan Kastner , Wenrui Gong , Xin Hao , Forrest Brewer , Adam Kaplan , Philip Brisk , Majid Sarrafzadeh, Layout driven data communication optimization for high level synthesis, Proceedings of the conference on Design, automation and test in Europe: Proceedings, March 06-10, 2006, Munich, Germany
|
| |
14
|
Daehong Kim , Jinyong Jung , Sunghyun Lee , Jinhwan Jeon , Kiyoung Choi, Behavior-to-placed RTL synthesis with performance-driven placement, Proceedings of the 2001 IEEE/ACM international conference on Computer-aided design, November 04-08, 2001, San Jose, California
|
 |
15
|
|
| |
16
|
T. Kim and C. L. Liu. An integrated data path synthesis algorithm based on network flow method. In Proceedings of CICC, May 1995.
|
 |
17
|
|
| |
18
|
X. Liu and M. C. Papaefthymiou. Design of a 20-mb/s 256-state viterbi decoder. IEEE Transactions on VLSI Systems, 2003.
|
| |
19
|
|
| |
20
|
R. Mehra, L. M. Guerra, and J. M. Rabaey. A partitioning scheme for optimizing interconnect power. IEEE Journal of Solid-State Circuits, 1997.
|
| |
21
|
|
 |
22
|
|
| |
23
|
M. Rim , R. Jain , R. De Leone, Optimal allocation and binding in high-level synthesis, Proceedings of the 29th ACM/IEEE conference on Design automation, p.120-123, June 08-12, 1992, Anaheim, California, United States
|
| |
24
|
|
| |
25
|
|
 |
26
|
|
 |
27
|
|
|