| High-quality, deterministic parallel placement for FPGAs on commodity hardware |
| Full text |
Pdf
(405 KB)
|
Source
|
International Symposium on Field Programmable Gate Arrays
archive
Proceedings of the 16th international ACM/SIGDA symposium on Field programmable gate arrays
table of contents
Monterey, California, USA
SESSION: Physical design
table of contents
Pages 14-23
Year of Publication: 2008
ISBN:978-1-59593-934-0
|
|
Authors
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 16, Downloads (12 Months): 87, Citation Count: 0
|
|
|
ABSTRACT
In this paper, we describe the application of two parallelization strategies to the Quartus II FPGA placer. The first uses a pipelining approach and achieves speedups of 1.3x on two processing cores. The second uses a parallel moves approach and achieves speedups of 2.2x on four cores. Unlike all previous parallel moves algorithms, ours is deterministic and always gives the same answer as the serial version of the algorithm, without any significant reduction in performance. We also describe a process to quantify multi-core performance effects, such as memory subsystem limitations and explicit synchronization overhead, and fully describe these effects on a CAD tool for the first time. Memory limitations alone are found to cost up to 35% of total runtime. Unlike previous algorithms, our algorithms have negligible explicit synchronization overhead. These results are relevant to both CAD designers and to any developers seeking to parallelize existing software.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
H. Sutter, "A fundamental turn toward concurrency in software," Dr. Dobb's J. Mar. 2005.
|
| |
2
|
Altera Corp., "Quartus II Incremental Compilation for Hierarchical & Team-Based Design," in The Quartus II Handbook, Version 7.1, Vol. 1, Ch. 2 2007.
|
| |
3
|
J. Chandy, S. Kim, B. Ramkumar, S. Parkes, and P. Banerjee, "An evaluation of parallel simulated annealing strategies with application to standard cell placement," TCAD vol. 16, pp.398--410, Apr. 1997.
|
| |
4
|
M. Sarrafzadeh, M. Wang, and X. Yang, Modern Placement Techniques Boston: Kluwer Academic Publishers, 2003.
|
| |
5
|
C. Alpert, L. Hagen, and A. Kahng, "A hybrid multilevel/genetic approach for circuit partitioning," tech. rep., CS Dept., UCLA, Los Angeles, CA, USA, 1996.
|
| |
6
|
J.M. Kleinhans, G. Sigl, F. Johannes, and K. Antreich, "GORDIAN: VLSI placement by quadratic programming and slicing optimization," TCAD vol. 10, pp.356--365, Mar. 1991.
|
| |
7
|
S.N.R. Borra, A. Muthukaruppan, S. Suresh, and V. Kamakoti, "A parallel genetic approach to the placement problem for field programmable gate arrays," in IPDPS (Nice, France), p.184, 2003.
|
| |
8
|
S. Kirkpatrick, C. Gelatt Jr., and M. Vecchi, "Optimization by simulated annealing," Science pp.671--680, May 1983.
|
| |
9
|
C. Sechen and A. Sangiovanni-Vincentelli, "The TimberWolf placement and routing package," JSSC pp.510--522, Apr. 1985.
|
| |
10
|
|
| |
11
|
M. Hutton and V. Betz, "FPGA synthesis and physical design," in Electronic Design Automation for Integrated Circuits Handbook (L. Scheffer,L. Lavagno, and G. Martin, eds.), vol. 1, ch. 13, pp.13.1--13.32, Taylor and Francis CRC Press, 2006.
|
 |
12
|
|
| |
13
|
S. Kravitz and R. Rutenbar, "Placement by simulated annealing on a multiprocessor," TCAD pp.534--549, Jul. 1987.
|
 |
14
|
Malay Haldar , Anshuman Nayak , Alok Choudhary , Prith Banerjee, Parallel algorithms for FPGA placement, Proceedings of the 10th Great Lakes symposium on VLSI, p.86-94, March 02-04, 2000, Chicago, Illinois, United States
[doi> 10.1145/330855.330988]
|
| |
15
|
|
| |
16
|
|
| |
17
|
|
| |
18
|
|
| |
19
|
|
| |
20
|
|
 |
21
|
|
| |
22
|
S. Vadlamani and S. Jenks, "Architectural considerations for efficient software execution on parallel microprocessors," in IPDPS (Long Beach, CA, USA), 2007.
|
|