|
ABSTRACT
Due to their generic and highly programmable nature, FPGAs provide the ability to implement a wide range of applications. However, it is this nonspecific nature that has limited the use of FPGAs in scientific applications that require floating-point arithmetic. Even simple floating-point operations consume a large amount of computational resources. In this paper, we introduce embedding floating-point multiply-add units in an island style FPGA. This has shown to have an average area savings of 55.0% and an average increase of 40.7% in clock rate over existing architectures.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
|
 |
3
|
|
| |
4
|
G. Govindu, S. Choi, V. K. Prasanna, V. Daga, S. Gangadharpalli, and V. Sridhar. A high-performance and energy efficient architecture for floating-point based lu decomposition on fpgas. In Proceedings of the 11th Reconfigurable Architectures Workshop (RAW), Santa Fe, NM, April 2004.
|
| |
5
|
L. Zhuo and V. K. Prasanna. Scalable and modular algorithms for floating-point matrix multiplication on fpgs. In 18 th International Parallel and Distributed Processing Symposium (IPDPS'04), Santa Fe, NM, April 2004.
|
 |
6
|
|
 |
7
|
|
| |
8
|
IEEE Standards Board. IEEE standard for binary floating-point arithmetic. Technical Report ANSI/IEEE Std. 754-1985, The Institute of Electrical and Electronic Engineers, New York, 1985.
|
| |
9
|
|
| |
10
|
Virtex-II Pro and Virtex-II Pro X Platform FPGAs: Complete Data Sheet. June 2005 (Rev 4.3), {cited Aug 2005}, http://direct.xilinx.com/bvdocs/publications/ds083.pdf.
|
| |
11
|
Virtex-4 Family Overview. June 2005 (Rev 1.4), {cited Sept 2005}, http://direct.xilinx.com/bvdocs/publications/ds112.pdf.
|
| |
12
|
|
| |
13
|
|
| |
14
|
Xilinx: ASMBL Architecture. 2005 {cited Sept 2005}, http://www.xilinx.com/products/silicon_solutions/fpgas/virtex/virtex4/overview/
|
| |
15
|
Virtex-4 Data Sheet: DC and Switching Characteristics. Aug 2005 (Rev 1.9), {cited Sept 2005}, http://direct.xilinx.com/bvdocs/publications/ds302.pdf
|
| |
16
|
Virtex-II Platform FPGAs: Complete Data Sheet. Mar 2005 (Rev 3.4), {cited Aug 2005}, http://direct.xilinx.com/bvdocs/publications/ds031.pdf
|
| |
17
|
MIPS Technologies, Inc. 64-Bit Cores, MIPS64 Family Features. 2005, {cited Jan 2005}, http://www.mips.com/content/Products/Cores/64-BitCores.
|
 |
18
|
Jay B. Brockman , Shyamkumar Thoziyoor , Shannon K. Kuntz , Peter M. Kogge, A low cost, multithreaded processing-in-memory system, Proceedings of the 3rd workshop on Memory performance issues: in conjunction with the 31st international symposium on computer architecture, p.16-22, June 20-20, 2004, Munich, Germany
[doi> 10.1145/1054943.1054946]
|
| |
19
|
Brad Hutchings , Peter Bellows , Joseph Hawkins , Scott Hemmert , Brent Nelson , Mike Rytting, A CAD Suite for High-Performance FPGA Design, Proceedings of the Seventh Annual IEEE Symposium on Field-Programmable Custom Computing Machines, p.12, April 21-23, 1999
|
CITED BY 5
|
|
|
|
|
|
|
|
|
|
|
Jason Luu , Ian Kuon , Peter Jamieson , Ted Campbell , Andy Ye , Wei Mark Fang , Jonathan Rose, VPR 5.0: FPGA cad and architecture exploration tools with single-driver routing, heterogeneity and process scaling, Proceeding of the ACM/SIGDA international symposium on Field programmable gate arrays, February 22-24, 2009, Monterey, California, USA
|
|
|
|
|