| Instruction set extension with shadow registers for configurable processors |
| Full text |
Pdf
(447 KB)
|
| Source
|
International Symposium on Field Programmable Gate Arrays
archive
Proceedings of the 2005 ACM/SIGDA 13th international symposium on Field-programmable gate arrays
table of contents
Monterey, California, USA
SESSION: Computation techniques for FPGAs
table of contents
Pages: 99 - 106
Year of Publication: 2005
ISBN:1-59593-029-9
|
|
Authors
|
|
Jason Cong
|
University of California, Los Angeles, Los Angeles, CA
|
|
Yiping Fan
|
University of California, Los Angeles, Los Angeles, CA
|
|
Guoling Han
|
University of California, Los Angeles, Los Angeles, CA
|
|
Ashok Jagannathan
|
University of California, Los Angeles, Los Angeles, CA
|
|
Glenn Reinman
|
University of California, Los Angeles, Los Angeles, CA
|
|
Zhiru Zhang
|
University of California, Los Angeles, Los Angeles, CA
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 5, Downloads (12 Months): 42, Citation Count: 12
|
|
|
ABSTRACT
Configurable processors are becoming increasingly popular for modern embedded systems (especially for the field-programmable system-on-a-chip). While steady progress has been made in the tools and methodologies of automatic instruction set extension for configurable processors, the limited data bandwidth available in the core processor (e.g., the number of simultaneous accesses to the register file) becomes a potential performance bottleneck. In this paper we first present a quantitative analysis of the data bandwidth limitation in configurable processors, and then propose a novel low-cost architectural extension and associated compilation techniques to address the problem. The application of our approach results in a promising performance improvement.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
Alfred V. Aho , Ravi Sethi , Jeffrey D. Ullman, Compilers: principles, techniques, and tools, Addison-Wesley Longman Publishing Co., Inc., Boston, MA, 1986
|
 |
2
|
|
| |
3
|
D. Burger, T. Austin, and S. Bennett, "Evaluating Future Microprocessors: The SimpleScalar Toolset," Technical Report, CS-TR96-1308, Univ. of Wisconsin - Madison, 1996.
|
| |
4
|
|
 |
5
|
|
 |
6
|
Jason Cong , Yiping Fan , Guoling Han , Zhiru Zhang, Application-specific instruction generation for configurable processor architectures, Proceedings of the 2004 ACM/SIGDA 12th international symposium on Field programmable gate arrays, February 22-24, 2004, Monterey, California, USA
[doi> 10.1145/968280.968307]
|
 |
7
|
R. Cytron , J. Ferrante , B. K. Rosen , M. N. Wegman , F. K. Zadeck, An efficient method of computing static single assignment form, Proceedings of the 16th ACM SIGPLAN-SIGACT symposium on Principles of programming languages, p.25-35, January 11-13, 1989, Austin, Texas, United States
[doi> 10.1145/75277.75280]
|
 |
8
|
Dirk Fischer , Jürgen Teich , Michael Thies , Ralph Weper, Efficient architecture/compiler co-exploration for ASIPs, Proceedings of the 2002 international conference on Compilers, architecture, and synthesis for embedded systems, October 08-11, 2002, Grenoble, France
[doi> 10.1145/581630.581635]
|
| |
9
|
C. Greene and D. Kleitman, "The Structure of Sperner K-Family," J. Combinatorial Theory, Ser. A, vol. 20, pp. 80--88, 1976.
|
| |
10
|
M. R. Guthaus et al., "MiBench: A Free, Commercially Representative Embedded Benchmark Suite," in IEEE 4th Workshop on Workload Characterization, Dec. 2001.
|
| |
11
|
|
| |
12
|
P. Ienne, L. Pozzi, and M. Vuletic, "On the Limits of Processor Specialisation by Mapping Dataflow Sections on Ad-hoc Functional Units," Technical Report 01/376, Swiss Federal Institute of Technology Lausanne, Computer Science Department, Dec. 2001.
|
 |
13
|
|
| |
14
|
|
| |
15
|
Chunho Lee , Miodrag Potkonjak , William H. Mangione-Smith, MediaBench: a tool for evaluating and synthesizing multimedia and communicatons systems, Proceedings of the 30th annual ACM/IEEE international symposium on Microarchitecture, p.330-335, December 01-03, 1997, Research Triangle Park, North Carolina, United States
|
| |
16
|
|
| |
17
|
|
| |
18
|
S. Rixner, W. J. Dally, B. Khailany, P. Mattson, U. J. Kapasi, and J. D. Owens, "Register Organization for Media Processing," in Proc. Sixth International Symposium on High-Performance Computer Architecture, pp. 375--386, Jan. 2000.
|
 |
19
|
Fei Sun , Srivaths Ravi , Anand Raghunathan , Niraj K. Jha, Synthesis of custom processors based on extensible platforms, Proceedings of the 2002 IEEE/ACM international conference on Computer-aided design, p.641-648, November 10-14, 2002, San Jose, California
[doi> 10.1145/774572.774667]
|
| |
20
|
|
| |
21
|
R. D. Wittig and P. Chow, "OneChip: An FPGA Processor with Reconfigurable Logic," in Proc. 4th Annual IEEE Symposium on FPGAs for Custom Computing Machines, pp. 126--135, March 1996.
|
 |
22
|
Zhi Alex Ye , Andreas Moshovos , Scott Hauck , Prithviraj Banerjee, CHIMAERA: a high-performance architecture with a tightly-coupled reconfigurable functional unit, Proceedings of the 27th annual international symposium on Computer architecture, p.225-235, June 2000, Vancouver, British Columbia, Canada
|
| |
23
|
Altera Corp., http://www.altera.com.
|
| |
24
|
Tensilica Inc., http://www.tensilica.com.
|
| |
25
|
Xilinx Inc., http://www.xilinx.com.
|
CITED BY 12
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
David Sheldon , Rakesh Kumar , Frank Vahid , Dean Tullsen , Roman Lysecky, Conjoining soft-core FPGA processors, Proceedings of the 2006 IEEE/ACM international conference on Computer-aided design, November 05-09, 2006, San Jose, California
|
|
|
Kubilay Atasu , Robert G. Dimond , Oskar Mencer , Wayne Luk , Can Özturan , Günhan Dündar, Optimizing instruction-set extensible processors under data bandwidth constraints, Proceedings of the conference on Design, automation and test in Europe, April 16-20, 2007, Nice, France
|
|
|
|
|
|
|
|
|
|
|
|
|
|