| Memory and architecture exploration with thread shifting for multithreaded processors in embedded systems |
| Full text |
Pdf
(382 KB)
|
| Source
|
International Conference on Compilers, Architecture and Synthesis for Embedded Systems
archive
Proceedings of the 2004 international conference on Compilers, architecture, and synthesis for embedded systems
table of contents
Washington DC, USA
SESSION: Co-design and synthesis
table of contents
Pages: 230 - 237
Year of Publication: 2004
ISBN:1-58113-890-3
|
|
Authors
|
|
| Sponsor |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 3, Downloads (12 Months): 47, Citation Count: 1
|
|
|
ABSTRACT
In embedded multithreaded architectures, the performance enhancement relative to the base single-threaded architecture is highly dependent on the characteristics of the application and memory configuration. When the application is well parallelized, the multithreading performance may be good even with a small cache since the memory access latency can be hidden. However, if there are complicated dependencies between threads, they cause frequent cache conflicts, so the performance may not be improved. For that reason, not only processor architecture but also memory configuration should be customized to get an optimal solution of an embedded multithreaded system. We suggest a design space exploration algorithm, which considers both memory configuration and multithreaded architecture and a thread shifting technique, which shifts threads in compile time to minimize cache conflict.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
|
| |
3
|
|
| |
4
|
Hirst, K. R., Haskins, J. W., and Skadron, K. dMT: inexpensive throughput enhancement in small-scale embedded microprocessors with differential multithreading. IEE Proc.-Comput. Digit. Tech., vol. 151, no. 1, Jan. 2004.
|
| |
5
|
Hily, S., and Seznec. A. Standard memory hierarchy does not fit simultaneous multithreading. In Proceedings of the Workshop on Multithreaded execution architecture and compilation (MTEAC), 1998.
|
| |
6
|
|
| |
7
|
Panda, P. R., Dutt, N. D., and Nicolau, A. Local memory exploration and optimization in embedded systems. IEEE Trans. on CAD of Integrated Circuits and Systems, vol. 18, no. 1, Jan. 1999.
|
 |
8
|
|
| |
9
|
|
| |
10
|
Wittenburg, J. P, Pirsch, P., and Meyer, G. A multithreaded architecture approach to parallel DSPs for high performance image processing applications. In Proceedings of the IEEE workshop on Signal processing systems (SiPS'99), 1999.
|
| |
11
|
|
| |
12
|
Sigmund, U., Steinhaus, M., and Ungerer, T. On performance, transistor count and chip space assessment of multimedia-enhanced simultaneous multithreaded processors, In Proceedings of the Workshop on Multithreaded execution architecture and compilation (MTEAC-4), 2000.
|
| |
13
|
|
| |
14
|
|
| |
15
|
Lioupis, D., and Milios, S. Exploring cache performance in multithreaded processors, Microprocessors and Microsystems, vol.20, no.10, Jun. 1997.
|
|