ABSTRACT
Future computing systems need to balance flexibility, specialization, and performance in order to meet market demands and the computing power required by new applications. Instruction generation is a vital component for determining these trade-offs. In this work, we present theory and an algorithm for instruction generation. The algorithm profiles a dataflow graph and iteratively contracts edges to create instruction templates. We discuss how to target the algorithm toward the novel problem of instruction generation for hybrid reconfigurable systems. In particular, we target the Strategically Programmable System, which embeds complex computational units, such as ALUs and IP blocks, into a configurable fabric. We argue that instruction generation is an essential compilation step for these systems, as it is needed to specify the functionality of the embedded computational units. In addition, instruction generation can be used to create soft reconfigurable macros: tightly sequenced, prespecified operations placed in the reconfigurable fabric.
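The profile-and-contract loop described above can be illustrated with a minimal sketch. It is not the authors' implementation: the graph encoding (a dict of node id to operation name plus a list of directed edges), the greedy choice of non-overlapping edges, the `min_count` threshold, and all function names are assumptions made for illustration only.

```python
from collections import Counter


def profile(nodes, edges):
    """Count how often each edge type (source op, destination op) occurs."""
    return Counter((nodes[u], nodes[v]) for u, v in edges)


def contract_once(nodes, edges, edge_type):
    """Contract a greedily chosen set of non-overlapping edges of one type.

    Assumption: greedy selection in edge order stands in for whatever
    selection rule the real algorithm uses.
    """
    used = set()          # nodes already absorbed into a contraction
    merged = {}           # absorbed node id -> surviving node id
    new_nodes = dict(nodes)
    for u, v in edges:
        if (nodes[u], nodes[v]) != edge_type or u == v:
            continue
        if u in used or v in used:
            continue
        used.update((u, v))
        merged[v] = u
        new_nodes[u] = f"({nodes[u]}->{nodes[v]})"  # label of the super node
        del new_nodes[v]
    # Redirect edges to surviving nodes; drop internal and duplicate edges.
    redirected = []
    for u, v in edges:
        a, b = merged.get(u, u), merged.get(v, v)
        if a != b:
            redirected.append((a, b))
    return new_nodes, list(dict.fromkeys(redirected))


def generate_templates(nodes, edges, min_count=2):
    """Repeatedly contract the most frequent edge type until none recurs."""
    templates = []
    while True:
        counts = profile(nodes, edges)
        if not counts:
            break
        edge_type, count = max(counts.items(), key=lambda kv: (kv[1], kv[0]))
        if count < min_count:
            break
        new_nodes, new_edges = contract_once(nodes, edges, edge_type)
        if len(new_nodes) == len(nodes):  # nothing contracted, stop
            break
        nodes, edges = new_nodes, new_edges
        templates.append(f"({edge_type[0]}->{edge_type[1]})")
    return templates, nodes, edges


# A hypothetical dataflow graph: three multiplies feeding a chain of adds.
ops = {0: "mul", 1: "add", 2: "mul", 3: "add", 4: "mul", 5: "add"}
deps = [(0, 1), (2, 3), (4, 5), (1, 3), (3, 5)]
templates, final_nodes, final_edges = generate_templates(ops, deps)
print(templates)
```

On this toy graph the first pass contracts the recurring multiply-then-add edge into a multiply-accumulate template, and the second pass contracts a pair of those super nodes. A fuller treatment would also need to check that a contraction keeps the graph acyclic, which this sketch does not do.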