ACM Home Page
Please provide us with feedback. Feedback
Compiler-guided register reliability improvement against soft errors
Full text PdfPdf (126 KB)
Source International Conference On Embedded Software archive
Proceedings of the 5th ACM international conference on Embedded software table of contents
Jersey City, NJ, USA
SESSION: Compilation and power table of contents
Pages: 203 - 209  
Year of Publication: 2005
ISBN:1-59593-091-4
Authors
Jun Yan  Southern Illinois University - Carbondale, Carbondale, IL
Wei Zhang  Southern Illinois University - Carbondale, Carbondale, IL
Sponsors
SIGBED: ACM Special Interest Group on Embedded Systems
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 8,   Downloads (12 Months): 44,   Citation Count: 4
Additional Information:

abstract   references   cited by   index terms   review   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1086228.1086266
What is a DOI?

ABSTRACT

With the scaling of technology, transient errors caused by external particle strikes have become a critical challenge for microprocessor design. As embedded processors are widely used in reliability-sensitive environments, it becomes increasingly important to develop cost-effective techniques to improve the processor reliability against soft errors. This paper focuses on studying the register file immunity against soft errors since modern processors typically employ a large number of registers, which are accessed very frequently. As a result, soft errors occurred in registers can easily propagate to functional units or the memory system, leading to silent data error (SDC) or system crash.To develop cost-effective techniques to fight soft errors for embedded processors, the first step is to understand the register file susceptibility to soft errors and its impact on the system reliability accurately. Toward this goal, this paper proposes the concept of register vulnerability factor (RVF) to characterize the probability that register transient errors can escape the register file and thus potentially impact the system reliability. Built upon the RVF concept, we then propose two cost-effective compiler-guided techniques to improve the register file reliability by lowering the RVF value. Our experiments indicate that on average, the RVF can be reduced to 9.1% and 9.5% by the hyperblock-based instruction re-scheduling and the reliability-oriented register assignment respectively, which can potentially lower the reliability cost significantly while protecting register files against transient errors.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
 
3
C. L. Chen and M.Y Hsiao. Error-correcting codes for semiconductor memory applications: a state of the art review. In Reliable Computer Systems - Design and Evaluation, pages 771--786, Digital Press, 2nd edition, 1992.
 
4
 
5
M. Tremblay and Y. Tamir. Support for fault tolerance in VLSI processors. ISCS, 1989.
 
6
R. Phelan. Addressing soft errors in ARM core-based SoC. ARM White Paper, Dec. 2003.
7
 
8
 
9
10
 
11
 
12
T. J. Dell. A white paper on the benefits of chipkill-correct ECC for PC serve main memory. IBM, Nov 1997.
 
13
 
14
15
 
16
 
17
 
18
 
19
 
20



REVIEW

"Juan A. Carrasco : Reviewer"

As the size of processors shrinks, transient errors caused by external particle strikes, also known as soft errors, are becoming a significant source of system failures. This paper analyzes the impact of such soft errors in the register file of a   more...