ACM Home Page
Please provide us with feedback. Feedback
Thread warping: a framework for dynamic synthesis of thread accelerators
Full text PdfPdf (277 KB)
Source
International Conference on Hardware Software Codesign archive
Proceedings of the 5th IEEE/ACM international conference on Hardware/software codesign and system synthesis table of contents
Salzburg, Austria
SESSION: Static and dynamic techniques for partitioning and scheduling table of contents
Pages: 93 - 98  
Year of Publication: 2007
ISBN:978-1-59593-824-4
Authors
Greg Stitt  University of Florida, Gainesville, FL
Frank Vahid  University of California: Riverside, Riverside, CA
Sponsors
SIGDA: ACM Special Interest Group on Design Automation
ACM: Association for Computing Machinery
SIGBED: ACM Special Interest Group on Embedded Systems
SIGMICRO: ACM Special Interest Group on Microarchitectural Research and Processing
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 12,   Downloads (12 Months): 76,   Citation Count: 1
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1289816.1289841
What is a DOI?

ABSTRACT

We present a dynamic optimization technique, thread warping, that uses a single processor on a multiprocessor system to dynamically synthesize threads into custom accelerator circuits on FPGAs (field-programmable gate arrays). Building on dynamic synthesis for single-processor single-thread systems, known as warp processing, thread warping improves performances of multiprocessor systems by speeding up individual threads and by allowing more threads to execute concurrently. Furthermore, thread warping maintains the important separation of function from architecture, enabling portability of applications to architectures with different quantities of microprocessors and FPGA.an advantage not shared by static compilation/synthesis approaches. We introduce a framework of architecture, CAD tools, and operating system that together support thread warping. We summarize experiments on an extensive architectural simulation framework we developed, showing application speedups of 4x to 502x, averaging 130x compared to a multiprocessor system having four ARM11 microprocessors, for eight benchmark applications. Even compared to a 64-processor system, thread warping achieves 11x speedup.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
2
3
 
4
Cifuentes, C. Reverse Compilation Techniques. PhD Thesis, Queensland University of Technology, 1994.
 
5
Cray XD1. http://www.cray.com/products/xd1, 2005.
 
6
Dellson, A., Sandberg, G., and Möhl, S. Turning FPGAs into Supercomputers. Cray User Group, 2006.
 
7
Eles, P., Peng, Z., Kuchchinski, K., and Doboli, A. System level hardware/software partitioning based on simulated annealing and tabu search. Journal on Design Automation for Embedded Systems (DAES), Springer, 2, 1 (1997), 5--32.
8
9
10
 
11
12
 
13
IBM. The Cell Architecture. http://domino.research.ibm.com, 2006.
 
14
Schleupen, K., Lekuch, S., Mannion, R., Guo, Z., Najjar, W., and Vahid, F. Dynamic partial FPGA reconfiguration in a prototype microprocessor system. In Proceedings of Int. Conf. on Field Programmable Logic And Applications, 2007.
 
15
Intel Quad-Core Xeon. http://www.intel.com, 2007.
16
17
 
18
M. LaPedus. Intel Tips Teraflops Programmable Processor. EE Times, September 2006.
 
19
Lu, J., Chen, H., Yew, P., and Hsu, W. Design and implementation of a lightweight dynamic optimization system. Journal of Instruction-Level Parallelism, 6 (Jun 2004), 1--24.
 
20
Ludwig, S. Fast Hardware Synthesis Tools and a Reconfigurable Coprocessor. Ph.D. Thesis, ETH Zurich, 2005.
21
 
22
23
 
24
25
 
26
SGI Altix. http://www.sgi.com/products/servers/altix/
 
27
 
28
VxWorks RTOS. http://www.windriver.com/vxworks/, 2007.
 
29
Xilinx Virtex II Pro, http://www.xilinx.com, 2006.
 
30
Xilinx Virtex IV, http://www.xilinx.com, 2006.
 
31