ACM Home Page
Please provide us with feedback. Feedback
Direct address translation for virtual memory in energy-efficient embedded systems
Full text PdfPdf (523 KB)
Source
ACM Transactions on Embedded Computing Systems (TECS) archive
Volume 8 ,  Issue 1  (December 2008) table of contents
Article No. 5  
Year of Publication: 2008
ISSN:1539-9087
Authors
Xiangrong Zhou  University of Maryland, College Park
Peter Petrov  University of Maryland, College Park
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 20,   Downloads (12 Months): 211,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1457246.1457251
What is a DOI?

ABSTRACT

This article presents a methodology for virtual memory support in energy-efficient embedded systems. A holistic approach is proposed, where the combined efforts of compiler, operating system, and hardware architecture achieve a significant system power reductions. The application information extracted and analyzed by the compiler is utilized dynamically by the microarchitecture and the operating system to perform energy-efficient and, for many memory references, time-deterministic address translations. We demonstrate that by using application information regarding virtual memory layout, an efficient and conflict-free translation process can be implemented through the utilization of a small hardware direct translation table (DTT) accessed in an application-specific manner. The set of virtual pages is partitioned into groups, such that for each group only a few of the least significant bits are used as an index to obtain the physical page number. We outline an efficient compile-time algorithm for identifying these groups and allocate their translation entries optimally into the DTT. The introduced hardware is minimal in terms of area, performance, and power overhead, while offering the flexibility of software programmability. This is achieved through a small set of registers and tables, which are made software accessible. We have quantitatively evaluated the proposed methodology on a number of embedded applications, including voice, image, and video processing.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
ARM Ltd. ARM920T Technical Reference Manual. ARM Ltd.
 
3
 
4
Baase, S. and Gelder, A. 2000. Computer Algorithms. Addison-Wesley, Boston, MA.
5
6
 
7
 
8
9
10
11
12
13
 
14
15
 
16
 
17
Heckmann, R., Langenbach, M., Thesing, S., and Wilhelm, R. 2003. The influence of processor architecture on the design and the results of wcet tools. IEEE Proc. 91, 7, 1038--1054.
18
 
19
Intel Corporation. Intel XScale microarchitecture. Intel Corporation.
 
20
21
 
22
23
 
24
Kandemir, M., Ramanujam, J., Irwin, M., Vijaykrishnan, N., Kadayif, I., and Parikh, A. 2004. A compiler-based approach for dynamically managing scratch-pad memories in embedded systems. IEEE Trans Comput.-Aid. Design Integr. Circ. Syst. 23, 2, 243--260.
 
25
 
26
27
 
28
 
29
 
30
 
31
Montanaro, J., Witek, R., Anne, K., Black, A., Cooper, E., Dobberpuhl, D., Donahue, P., Eno, J., Farell, A., Hoeppner, G., et al. 1996. A 160mhz, 32b 0.5w cmos risc microprocessor. In Proceedings of the International Symposium on Computers and Communication (ISCC'96). IEEE, Los Alamitos, CA, 214--229.
 
32
 
33
 
34
Shivakumar, P. and Jouppi, N. 2001. Cacti 3.0: An integrated cache timing, power and area model. Tech. rep., Western Research Lab.
35
 
36
Stojanovic, V. and Oklobdzija, V. 1999. Comparative analysis of master-slave latches and flip-flops for high-performance and low-power systems. IEEE J. Solid-State Circ. 34, 4, 536--548.
37
38
39

Collaborative Colleagues:
Xiangrong Zhou: colleagues
Peter Petrov: colleagues