|
ABSTRACT
The Field Programmable Counter Array (FPCA) was introduced to improve FPGA performance for arithmetic circuits. An FPCA is a reconfigurable IP core that can be integrated into an FPGA. To exploit the FPCA, a circuit is transformed by merging disparate addition and multiplication operations into large multi-input addition operations, which are synthesized as compressor trees on the FPCA; the remaining portion of the circuit is synthesized on the FPGA. This paper presents a series of architectural improvements to the FPCA that reduce routing delay, increase flexibility and component utilization, and simplify the integration process. Using an FPGA containing six FPCAs, we observed average and maximum speedups of 1.60x and 2.40x on a set of arithmetic benchmarks
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
Altera Corporation, Stratix II Device Handbook, vol. 1 and 2, available online: http://www.altera.com/
|
| |
2
|
Altera Corporation, Stratix II vs. Virtex-4 Performance Comparison, available online: http://www.altera.com/
|
| |
3
|
Altera Corporation, Stratix III Device Handbook, vol. 1 and 2, available online: http://www.altera.com/
|
| |
4
|
|
 |
5
|
|
| |
6
|
Chen, C.-Y., Chien, S.-Y., Huang, Y.-W., Chen, T.-C., Wang, T.-C., and Chen, L.-G. Analysis and architecture design of variable block-size motion estimation for H.264/AVC, IEEE Trans. Circuits and Systems-I, vol. 53, no. 2, February, 2006, 578--593.
|
| |
7
|
Cherepacha, D., and Lewis, D. DP-FPGA: an FPGA architecture optimized for datapaths. VLSI Design, vol. 4, no. 4, 1996, 329--343.
|
| |
8
|
Dadda, L., Some schemes for parallel multipliers, Alta Frequenza, vol. 34, May, 1965, 349--356.
|
| |
9
|
Frederick, M.T., and Somani, A.K. Multi-bit carry chains for high-performance reconfigurable fabrics. Int. Conf. Field Prog. Logic and Applications (FPL'06) (Madrid, Spain, August 28-30, 2006) 1--6.
|
| |
10
|
|
 |
11
|
|
| |
12
|
Kaviani, A., Vranseic, D., and Brown, S. Computational field programmable architecture, IEEE Custom Integrated Circuits, Conf. (CICC'98) (Santa Clara, CA, USA, May 11-14, 1998) 261--264.
|
| |
13
|
Kuon, I., and Rose, J. Measuring the gap between FPGAs and ASICs. IEEE Trans. Computer-Aided Design, vol. 26, no. 2, February, 2007, 203--215.
|
 |
14
|
|
| |
15
|
Mirzaei, S., Hosangadi, A., and Kastner, R. High speed FIR filter implementation using add and shift method, Int. Conf. Computer Design (ICCD'06) (San Jose, CA, USA, October 1-4, 2006).
|
| |
16
|
|
| |
17
|
|
| |
18
|
Sriram, S., Brown, K., Defosseux, R., Moerman, F., Paviot, O., Sundararajan, V., and Gatherer, A. A 64 channel programmable receiver chip for 3G wireless infrastructure, IEEE Custom Integrated Circuits Conf. (CICC'05) (San Jose, CA, USA, September 18-21, 2005) 59--62.
|
| |
19
|
|
| |
20
|
|
| |
21
|
|
| |
22
|
|
| |
23
|
Wallace, C.S. A suggestion for a fast multiplier, IEEE Trans. Elec. Computers, vol. 13, February, 1964, 14--17.
|
| |
24
|
Wang, G., Sivaswamy, S., Ababei, C., Bazargan, K., Kastner, R., and Bozorgzadeh, E. Statistical analysis and design of HARP FPGAs, IEEE Trans. Computer-Aided Design, vol. 25, no. 10, 2006, 2088--2102.
|
| |
25
|
Weinberger, A. 4:2 carry-save adder module, IBM Technical Disclosure Bulletin, vol. 23, Jan. 1981.
|
| |
26
|
Xilinx Corporation, Virtex-4 User Guide, available online: http://www.xilinx.com/
|
| |
27
|
Xilinx Corporation, Virtex-5 User Guide, available online: http://www.xilinx.com/
|
 |
28
|
Paul S. Zuchowski , Christopher B. Reynolds , Richard J. Grupp , Shelly G. Davis , Brendan Cremen , Bill Troxel, A hybrid ASIC and FPGA architecture, Proceedings of the 2002 IEEE/ACM international conference on Computer-aided design, p.187-194, November 10-14, 2002, San Jose, California
[doi> 10.1145/774572.774600]
|
CITED BY 2
|
|
Seyed Hosein Attarzadeh Niaki , Alessandro Cevrero , Philip Brisk , Chrysostomos Nicopoulos , Frank K. Gurkaynak , Yusuf Leblebici , Paolo Ienne, Design space exploration for field programmable compressor trees, Proceedings of the 2008 international conference on Compilers, architectures and synthesis for embedded systems, October 19-24, 2008, Atlanta, GA, USA
|
|
|
Alessandro Cevrero , Panagiotis Athanasopoulos , Hadi Parandeh-Afshar , Ajay K. Verma , Hosein Seyed Attarzadeh Niaki , Chrysostomos Nicopoulos , Frank K. Gurkaynak , Philip Brisk , Yusuf Leblebici , Paolo Ienne, Field Programmable Compressor Trees: Acceleration of Multi-Input Addition on FPGAs, ACM Transactions on Reconfigurable Technology and Systems (TRETS), v.2 n.2, p.1-36, June 2009
|
|