ACM Home Page
Please provide us with feedback. Feedback
Cluster-aware iterative improvement techniques for partitioning large VLSI circuits
Full text PdfPdf (598 KB)
Source ACM Transactions on Design Automation of Electronic Systems (TODAES) archive
Volume 7 ,  Issue 1  (January 2002) table of contents
Pages: 91 - 121  
Year of Publication: 2002
ISSN:1084-4309
Authors
Shantanu Dutt  University of Illinois-Chicago, Chicago, IL
Wenyong Deng  Cadence Design Systems, San Jose, CA
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 2,   Downloads (12 Months): 41,   Citation Count: 4
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/504914.504918
What is a DOI?

ABSTRACT

Move-based iterative improvement partitioning (IIP) methods, such as the Fiduccia-Mattheyses (FM) algorithm [Fidducia and Mattheyses 1982] and Krishnamurthy's Look-Ahead (LA) algorithm [Krishnamurthy 1984], are widely used in VLSI CAD applications, largely due to their time efficiency and ease of implementation. This class of algorithms is of the "local/greedy improvement" type, and they generate relatively high-quality results for small and medium-size circuits. However, as VLSI circuits become larger, these algorithms suffer a rapid deterioration in solution quality. We propose new IIP methods CLIP and CDIP that select cells to move with a view to moving clusters that straddle the two subsets of a partition, into one of the subsets. The new algorithms significantly improve partition quality while preserving the advantage of time efficiency. Experimental results on 25 medium to large-size ACM/SIGDA benchmark circuits show up to 70% improvement over FM in mincut, and average mincut improvements of about 35% over all circuits and 47% over large circuits. They also outperform state-of-the-art non-IIP techniques, the quadratic-programming-based method Paraboli [Reiss et al. 1994] and the spectral partitioner MELO [Alpert and Yao 1995], by about 17% and 23%, respectively, with less CPU time. This demonstrates the potential of sophisticated IIP algorithms in dealing with the increasing complexity of emerging VLSI circuits. We also compare CLIP and CDIP to hMetis [Karypis et al. 1997], one of the best of the recent state-of-the-art partitioners that are based on the multilevel paradigm (others include MLc [Alpert et al. 1997] and LSR/MFFS [Cong et al. 1997]). The results show that one scheme of hMetis is 8% worse than CLIP/CDIP and the other two schemes are only 2--4% better. However, CLIP/CDIP have advantages over hMetis and other multilevel partitioners that outweigh these minimal mincut improvements. The first is much faster times-to-solution (for example, one of our best schemes CLIP-LA2 is 6.4 and 11.75 times faster than the two best hMetis schemes) and much better scalability with circuit size (e.g., for the largest circuit with about 162K nodes, CLIP-LA2 is 10.4 and and 21.5 times faster and obtains better solution qualities than the two best hMetis schemes). Second, CLIP/CDIP are "flat" partitioners, while multilevel techniques perform a sequence of node clustering/coarsening before partitioning the circuit. In complex placement applications such as timing-driven placement in the presence of multiple constraints, such circuit coarsening can hide crucial information needed for good-quality solutions, thus making the partitioning process oblivious to them. This, however, is not a problem with flat partitioners like CLIP/CDIP that can take all important parameters into account while partitioning. All these advantages make CLIP/CDIP suitable for use in complex physical design problems for large, deep-submicron VLSI circuits.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
2
 
3
ALPERT, C. J. AND KAHNG, A. B. 1996. A hybrid multilevel/genetic approach for circuit partitioning. In Physical Design Workshop (1996). 100-105.
4
 
5
 
6
DUTT, S. 1997. A stochastic approach to timing-driven partitioning and placement with accurate net and gain modeling. In IEEE/ACMInternational Workshop on Time. Issues in Digital Systems (TAU '97, Dec. 1997). 246-256.
 
7
DUTT, S. AND DENG, W. 1995. VLSI circuit partitioning by cluster-removal using iterative improvement techniques. Technical report, Department of Electrical Engineering, University of Minnesota, Minneapolis, MN. This report is available online at www.eecs.uic.edu/~ dutt/publ.html.
8
 
9
 
10
 
11
 
12
HAGEN, L. AND KAHNG, A. B. 1991. Fast spectral methods for ratio cut partitioning and clustering. In Proceedings of the IEEE International Conference on Computer-Aided Design (1991). 10-13.
 
13
HAUCK, S. AND BORRIELLO, G. 1997. An evaluation of bipartitioning techniques. IEEE Trans. Comput-Aided Des. Integr. Circ. Syst. 16, 8 (Aug.), 849-866.
14
 
15
KERNIGAN, B. W. AND LIN, S. 1970. An efficient heuristic procedure for partitioning graphs. Bell Syst. Tech. J. 49, Feb., 291-307.
 
16
KRISHNAMURTHY, B. 1984. An improved min-cut algorithm for partitioning VLSI networks. IEEE Trans. Comput. C-33, May, 438-446.
17
 
18
 
19
MAREK-SADOWSKA, M. 1993. Issues in timing driven layout. In Algorithmic Aspects of VLSI Layout, M. Sarrafzadeh and D. T. Lee, Eds., World Scientific Publishing Co., Singapore, 1-24.
20
 
21
22
 
23
WEI, Y. C. AND CHENG, C. K. 1989. Towards efficient hierarchical designs by ratio cut partitioning. In Proceedings of the International Conference on Computer-Aided Design (1989). 298-301.
 
24
WEI, Y. C. AND CHENG, C. K. 1991. An improved two-way partitioning algorithm with stable performance. IEEE Trans. Comput.-Aided Des. 10, 12, 1502-1511.


Collaborative Colleagues:
Shantanu Dutt: colleagues
Wenyong Deng: colleagues