ACM Home Page
Please provide us with feedback. Feedback
Streaming FFT on REDEFINE-v2: an application-architecture design space exploration
Full text PdfPdf (405 KB)
Source
International Conference on Compilers, Architecture and Synthesis for Embedded Systems archive
Proceedings of the 2009 international conference on Compilers, architecture, and synthesis for embedded systems table of contents
Grenoble, France
SESSION: Architectural optimizations table of contents
Pages 127-136  
Year of Publication: 2009
ISBN:978-1-60558-626-7
Authors
Alexander Fell  Indian Institute of Science, Bangalore, India
Mythri Alle  Indian Institute of Science, Bangalore, India
Keshavan Varadarajan  Indian Institute of Science, Bangalore, India
Prasenjit Biswas  Indian Institute of Science, Bangalore, India
Saptarsi Das  Indian Institute of Science, Bangalore, India
Jugantor Chetia  Indian Institute of Science, Bangalore, India
S. K. Nandy  Indian Institute of Science, Bangalore, India
Ranjani Narayan  Morphing Machines, Bangalore, India
Sponsors
SIGDA: ACM Special Interest Group on Design Automation
ACM: Association for Computing Machinery
SIGBED: ACM Special Interest Group on Embedded Systems
SIGMICRO: ACM Special Interest Group on Microarchitectural Research and Processing
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 10,   Downloads (12 Months): 10,   Citation Count: 0
Additional Information:

abstract   references   index terms  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1629395.1629414
What is a DOI?

ABSTRACT

In this paper we explore an implementation of a high-throughput, streaming application on REDEFINE-v2, which is an enhancement of REDEFINE. REDEFINE is a polymorphic ASIC combining the flexibility of a programmable solution with the execution speed of an ASIC. In REDEFINE Compute Elements are arranged in an 8x8 grid connected via a Network on Chip (NoC) called RECONNECT, to realize the various macrofunctional blocks of an equivalent ASIC.

For a 1024-FFT we carry out an application-architecture design space exploration by examining the various characterizations of Compute Elements in terms of the size of the instruction store. We further study the impact by using application specific, vectorized FUs. By setting up different partitions of the FFT algorithm for persistent execution on REDEFINE-v2, we derive the benefits of setting up pipelined execution for higher performance. The impact of the REDEFINE-v2 micro-architecture for any arbitrary N-point FFT (N > 4096) FFT is also analyzed. We report the various algorithm-architecture tradeoffs in terms of area and execution speed with that of an ASIC implementation. In addition we compare the performance gain with respect to a GPP.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Mythri Alle, Keshavan Varadarajan, , Alexander Fell, S. K. Nandy, and Ranjani Narayan. Compiling Techniques for Coarse Grained Runtime Reconfigurable Architectures. In ARC'09: Proceedings of the 5th IEEE International Workshop on Applied Reconfigurable Computing, July 2008.
 
2
Mythri Alle, Keshavan Varadarajan, Alexander Fell, Nimmy Joseph, C. Ramesh Reddy, Saptarsi Das, Prasenjit Biswas, Jugantor Chetia, S. K. Nandy, and Ranjani Narayan. REDEFINE: Runtime Reconfigurable Polymorphic ASIC. IEEE Transactions on Embedded Systems, Special Issue on Configuring Algorithms, Processes and Architecture, 2008.
 
3
Mythri Alle, Keshavan Varadarajan, Nimmy Joseph, C. Ramesh Reddy, Alexander Fell, S. K. Nandy, and Ranjani Narayan. Synthesis of Application Accelerators on Runtime Reconfigurable Hardware. In ASAP '08: Proceedings of the 19th IEEE International Conference on Application specific Systems, Architectures and Processors, July 2008.
 
4
James W. Cooley and John W. Tukey. An algorithm for the machine calculation of complex fourier series. Mathematics of Computation, 19(90):297--301, 1965.
 
5
William J. Dally. Virtual-Channel Flow Control. IEEE Transactions on Parallel and Distributed Systems, 3(2):194--205, 1992.
 
6
Alexander Fell, Prasenjit Biswas, Jugantor Chetia, S. K. Nandy, and Ranjani Narayan. Generic Routing Rules and a Scalable Access Enhancement for the Network on Chip RECONNECT. In 22nd IEEE International SOC Conference, September 2009.
 
7
Yasuhiro Inagami and John F. Foley. The specification of a new Manchester Dataflow Machine. In ICS '89: Proceedings of the 3rd International Conference on Supercomputing, New York, NY, USA, 1989.
 
8
Nimmy Joseph, C. Ramesh Reddy, Keshavan Varadarajan, Mythri Alle, Alexander Fell, S. K. Nandy, and Ranjani Narayan. RECONNECT: A NoC for polymorphic ASICs using a Low Overhead Single Cycle Router. In ASAP '08: Proceedings of the 19th IEEE International Conference on Application specific Systems, Architectures and Processors, July 2008.
 
9
Amar Nath Satrawala, Keshavan Varadarajan, Mythri Alle, S. K. Nandy, and Ranjani Narayan. REDEFINE: Architecture of a SOC Fabric for Runtime Composition of Computation Structures. In FPL '07: Proceedings of the International Conference on Field Programmable Logic and Applications, August 2007.