ACM Home Page
Please provide us with feedback. Feedback
Architectural improvements for field programmable counter arrays: enabling efficient synthesis of fast compressor trees on FPGAs
Full text PdfPdf (590 KB)
Source
International Symposium on Field Programmable Gate Arrays archive
Proceedings of the 16th international ACM/SIGDA symposium on Field programmable gate arrays table of contents
Monterey, California, USA
SESSION: Architecture table of contents
Pages 181-190  
Year of Publication: 2008
ISBN:978-1-59593-934-0
Authors
Alessandro Cevrero  Swiss Federal Institute of Technology: Lausanne (EPFL), Lausanne, Switzerland
Panagiotis Athanasopoulos  Swiss Federal Institute of Technology: Lausanne (EPFL), Lausanne, Switzerland
Hadi Parandeh-Afshar  Swiss Federal Institute of Technology: Lausanne (EPFL), Lausanne, Switzerland
Ajay K. Verma  Swiss Federal Institute of Technology: Lausanne (EPFL), Lausanne, Switzerland
Philip Brisk  Swiss Federal Institute of Technology: Lausanne (EPFL), Lausanne, Switzerland
Frank K. Gurkaynak  Swiss Federal Institute of Technology: Lausanne (EPFL), Lausanne, Switzerland
Yusuf Leblebici  Swiss Federal Institute of Technology: Lausanne (EPFL), Lausanne, Switzerland
Paolo Ienne  Swiss Federal Institute of Technology: Lausanne (EPFL), Lausanne, Switzerland
Sponsors
ACM: Association for Computing Machinery
SIGDA: ACM Special Interest Group on Design Automation
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 5,   Downloads (12 Months): 65,   Citation Count: 2
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1344671.1344699
What is a DOI?

ABSTRACT

The Field Programmable Counter Array (FPCA) was introduced to improve FPGA performance for arithmetic circuits. An FPCA is a reconfigurable IP core that can be integrated into an FPGA. To exploit the FPCA, a circuit is transformed by merging disparate addition and multiplication operations into large multi-input addition operations, which are synthesized as compressor trees on the FPCA; the remaining portion of the circuit is synthesized on the FPGA. This paper presents a series of architectural improvements to the FPCA that reduce routing delay, increase flexibility and component utilization, and simplify the integration process. Using an FPGA containing six FPCAs, we observed average and maximum speedups of 1.60x and 2.40x on a set of arithmetic benchmarks


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Altera Corporation, Stratix II Device Handbook, vol. 1 and 2, available online: http://www.altera.com/
 
2
Altera Corporation, Stratix II vs. Virtex-4 Performance Comparison, available online: http://www.altera.com/
 
3
Altera Corporation, Stratix III Device Handbook, vol. 1 and 2, available online: http://www.altera.com/
 
4
5
 
6
Chen, C.-Y., Chien, S.-Y., Huang, Y.-W., Chen, T.-C., Wang, T.-C., and Chen, L.-G. Analysis and architecture design of variable block-size motion estimation for H.264/AVC, IEEE Trans. Circuits and Systems-I, vol. 53, no. 2, February, 2006, 578--593.
 
7
Cherepacha, D., and Lewis, D. DP-FPGA: an FPGA architecture optimized for datapaths. VLSI Design, vol. 4, no. 4, 1996, 329--343.
 
8
Dadda, L., Some schemes for parallel multipliers, Alta Frequenza, vol. 34, May, 1965, 349--356.
 
9
Frederick, M.T., and Somani, A.K. Multi-bit carry chains for high-performance reconfigurable fabrics. Int. Conf. Field Prog. Logic and Applications (FPL'06) (Madrid, Spain, August 28-30, 2006) 1--6.
 
10
11
 
12
Kaviani, A., Vranseic, D., and Brown, S. Computational field programmable architecture, IEEE Custom Integrated Circuits, Conf. (CICC'98) (Santa Clara, CA, USA, May 11-14, 1998) 261--264.
 
13
Kuon, I., and Rose, J. Measuring the gap between FPGAs and ASICs. IEEE Trans. Computer-Aided Design, vol. 26, no. 2, February, 2007, 203--215.
14
 
15
Mirzaei, S., Hosangadi, A., and Kastner, R. High speed FIR filter implementation using add and shift method, Int. Conf. Computer Design (ICCD'06) (San Jose, CA, USA, October 1-4, 2006).
 
16
 
17
 
18
Sriram, S., Brown, K., Defosseux, R., Moerman, F., Paviot, O., Sundararajan, V., and Gatherer, A. A 64 channel programmable receiver chip for 3G wireless infrastructure, IEEE Custom Integrated Circuits Conf. (CICC'05) (San Jose, CA, USA, September 18-21, 2005) 59--62.
 
19
 
20
 
21
 
22
 
23
Wallace, C.S. A suggestion for a fast multiplier, IEEE Trans. Elec. Computers, vol. 13, February, 1964, 14--17.
 
24
Wang, G., Sivaswamy, S., Ababei, C., Bazargan, K., Kastner, R., and Bozorgzadeh, E. Statistical analysis and design of HARP FPGAs, IEEE Trans. Computer-Aided Design, vol. 25, no. 10, 2006, 2088--2102.
 
25
Weinberger, A. 4:2 carry-save adder module, IBM Technical Disclosure Bulletin, vol. 23, Jan. 1981.
 
26
Xilinx Corporation, Virtex-4 User Guide, available online: http://www.xilinx.com/
 
27
Xilinx Corporation, Virtex-5 User Guide, available online: http://www.xilinx.com/
28


Collaborative Colleagues:
Alessandro Cevrero: colleagues
Panagiotis Athanasopoulos: colleagues
Hadi Parandeh-Afshar: colleagues
Ajay K. Verma: colleagues
Philip Brisk: colleagues
Frank K. Gurkaynak: colleagues
Yusuf Leblebici: colleagues
Paolo Ienne: colleagues