| Using fine grain multithreading for energy efficient computing |
| Full text |
Pdf
(2.38 MB)
|
Source
|
Principles and Practice of Parallel Programming
archive
Proceedings of the 12th ACM SIGPLAN symposium on Principles and practice of parallel programming
table of contents
San Jose, California, USA
SESSION: Compilation, performance, and energy
table of contents
Pages: 259 - 269
Year of Publication: 2007
ISBN:978-1-59593-602-8
|
|
Authors
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 31, Downloads (12 Months): 94, Citation Count: 0
|
|
|
ABSTRACT
We investigate extremely fine-grain multithreading as a means for improving energy efficiency of single-task program execution.Our work is based on low-overhead threads executing an explicitly parallel program in a register-sharing context. The thread-based parallelism takes the place of instruction-level parallelism, allowing us to use simple and more energy-efficient in-order pipelines while retaining performance that is characteristic of classical out-of-order processors. Our evaluation shows that in energy terms, the parallelized code running over in-order pipelines can outperform both plain in-order and out-of-order processors.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
|
| |
3
|
|
| |
4
|
|
 |
5
|
|
| |
6
|
D. Burger, T. M. Austin, and S. Bennett. Evaluating future microprocessors: The SimpleScalar tool set. Technical Report CS-TR-1996-1308, University of Wisconsin-Madison, 1996.
|
| |
7
|
Alper Buyuktosunoglu , Stanley Schuster , David Brooks , Pradip Bose , Peter W. Cook , David H. Albonesi, An Adaptive Issue Queue for Reduced Power at High Performance, Proceedings of the First International Workshop on Power-Aware Computer Systems-Revised Papers, p.25-39, November 12, 2000
|
| |
8
|
|
 |
9
|
|
 |
10
|
José-Lorenzo Cruz , Antonio González , Mateo Valero , Nigel P. Topham, Multiple-banked register file architectures, Proceedings of the 27th annual international symposium on Computer architecture, p.316-325, June 2000, Vancouver, British Columbia, Canada
|
| |
11
|
|
 |
12
|
|
 |
13
|
Kourosh Gharachorloo , Daniel Lenoski , James Laudon , Phillip Gibbons , Anoop Gupta , John Hennessy, Memory consistency and event ordering in scalable shared-memory multiprocessors, Proceedings of the 17th annual international symposium on Computer Architecture, p.15-26, May 28-31, 1990, Seattle, Washington, United States
|
| |
14
|
R. Gonzalez and M. Horowitz. Energy dissipation in general purpose microprocessors. IEEE Journal of Solid-State Circuits, 31(9):1277--1284, 31, 1996.
|
| |
15
|
|
| |
16
|
|
| |
17
|
C. Jesshope. Scalable instruction-level parallelism. In 3rd and 4th International Workshops on Computer Systems: Architectures, Modelling and Simulation, 2004.
|
| |
18
|
Chunho Lee , Miodrag Potkonjak , William H. Mangione-Smith, MediaBench: a tool for evaluating and synthesizing multimedia and communicatons systems, Proceedings of the 30th annual ACM/IEEE international symposium on Microarchitecture, p.330-335, December 01-03, 1997, Research Triangle Park, North Carolina, United States
|
| |
19
|
|
 |
20
|
|
 |
21
|
Yingmin Li , David Brooks , Zhigang Hu , Kevin Skadron , Pradip Bose, Understanding the energy efficiency of simultaneous multithreading, Proceedings of the 2004 international symposium on Low power electronics and design, August 09-11, 2004, Newport Beach, California, USA
[doi> 10.1145/1013235.1013251]
|
| |
22
|
C.-c. Lim, X. Feng, and J. Zhang. ORC: Open research compiler: a compiler infrastructure for research. In Proceedings 17th International Workshop on Languages and Compilers for Parallel Computing, 2004.
|
 |
23
|
|
 |
24
|
|
| |
25
|
|
| |
26
|
G. Shklover, A. Gontmakher, A. Schuster, and A. Mendelson. Code generation for fine-granularity register sharing threads. In INTERACT-10, in conjunction with HPCA-12, 2006.
|
 |
27
|
|
 |
28
|
|
 |
29
|
|
 |
30
|
|
 |
31
|
Dean M. Tullsen , Susan J. Eggers , Joel S. Emer , Henry M. Levy , Jack L. Lo , Rebecca L. Stamm, Exploiting choice: instruction fetch and issue on an implementable simultaneous multithreading processor, Proceedings of the 23rd annual international symposium on Computer architecture, p.191-202, May 22-24, 1996, Philadelphia, Pennsylvania, United States
|
| |
32
|
|
| |
33
|
Alpha 21164 Processor Datasheet. Compaq Corporation, 1998.
|
| |
34
|
Alpha 21264 Processor Datasheet. Compaq Corporation, 1999.
|
| |
35
|
MIPS R10000 Microprocessor User's Manual. MIPS Technologies, inc., 1996.
|
| |
36
|
MIPS R5000 Processor. MIPS Technologies, inc., 1996.
|
| |
37
|
The GNU GCC compiler. http://gcc.gnu.org.
|
| |
38
|
ULTRASparc Architecture 2005. Sun Microsystems, Inc., 2006.
|
| |
39
|
Crusoe Processor Model TM5800 Product Brief. Transmeta Corporation, 2003.
|
|