| ShiftQ: a bufferred interconnect for custom loop accelerators |
| Full text |
Pdf
(230 KB)
|
| Source
|
International Conference on Compilers, Architecture and Synthesis for Embedded Systems
archive
Proceedings of the 2001 international conference on Compilers, architecture, and synthesis for embedded systems
table of contents
Atlanta, Georgia, USA
Session: Hardware Support
table of contents
Pages: 158 - 167
Year of Publication: 2001
ISBN:1-58113-399-5
|
|
Authors
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 0, Downloads (12 Months): 14, Citation Count: 4
|
|
|
ABSTRACT
ShiftQs are hardware structures consisting of registers and switches which buffer and transport operands among function units within custom hardware loop accelerators. ShiftQs help minimize buffering and interconnect costs by customizing the hardware to the given schedule and by intelligent sharing of register and interconnect resources. This paper describes the ShiftQ schema and a method to automatically synthesize them from modulo-scheduled loops. We also evaluate the cost savings by comparing them against traditional storage and interconnect mechanisms.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
S. Devadas and R. Newton. Algorithms for hardware allocation in data path synthesis. IEEE Transactions on Computer-Aided Design, 8(7), July 1989.
|
 |
3
|
|
| |
4
|
V. Kathail, M. Schlansker, and B. R. Rau. HPL PlayDoh architecture specification: Version 1.0. Technical Report HPL-93-80, Hewlett-Packard Laboratories, Feb. 1994.
|
 |
5
|
|
| |
6
|
S. Mahlke, R. Ravindran, M. Schlansker, R. Schreiber, and T. Sherwood. Bitwidth cognizant architecture synthesis of custom hardware accelerators. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, 20(11), November 2001.
|
 |
7
|
Stefaan Note , Werner Geurts , Francky Catthoor , Hugo De Man, Cathedral-III: Architecture-driven high-level synthesis for high throughput DSP applications, Proceedings of the 28th conference on ACM/IEEE design automation, p.597-602, June 17-22, 1991, San Francisco, California, United States
[doi> 10.1145/127601.127739]
|
| |
8
|
P. G. Paulin and J. P. Knight. Force-directed scheduling for the behavioral synthesis of ASICs. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, 8(6), June 1989.
|
| |
9
|
|
 |
10
|
|
| |
11
|
Robert Schreiber , Shail Aditya , B. Ramakrishna Rau , Vinod Kathail , Scott Mahlke , Santosh Abraham , Greg Snider, High-Level Synthesis of Nonprogrammable Hardware Accelerators, Proceedings of the IEEE International Conference on Application-Specific Systems, Architectures, and Processors, p.113, July 10-12, 2000
|
CITED BY 4
|
|
Alain Darte , Rob Schreiber , Gilles Villard, Lattice-based memory allocation, Proceedings of the 2003 international conference on Compilers, architecture and synthesis for embedded systems, October 30-November 01, 2003, San Jose, California, USA
|
|
|
|
|
|
Vinod Kathail , Shail Aditya , Robert Schreiber , B. Ramakrishna Rau , Darren C. Cronquist , Mukund Sivaraman, PICO: Automatically Designing Custom Computers, Computer, v.35 n.9, p.39-47, September 2002
|
|
|
|
|