ACM Home Page
Please provide us with feedback. Feedback
Dynamic scratchpad memory management for code in portable systems with an MMU
Full text PdfPdf (1.43 MB)
Source
ACM Transactions on Embedded Computing Systems (TECS) archive
Volume 7 ,  Issue 2  (February 2008) table of contents
Article No. 11  
Year of Publication: 2008
ISSN:1539-9087
Authors
Bernhard Egger  Samsung Advanced Institute of Technology
Jaejin Lee  Seoul National University, Seoul, South Korea
Heonshik Shin  Seoul National University, Seoul, South Korea
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 19,   Downloads (12 Months): 210,   Citation Count: 1
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1331331.1331335
What is a DOI?

ABSTRACT

In this work, we present a dynamic memory allocation technique for a novel, horizontally partitioned memory subsystem targeting contemporary embedded processors with a memory management unit (MMU). We propose to replace the on-chip instruction cache with a scratchpad memory (SPM) and a small minicache. Serializing the address translation with the actual memory access enables the memory system to access either only the SPM or the minicache. Independent of the SPM size and based solely on profiling information, a postpass optimizer classifies the code of an application binary into a pageable and a cacheable code region. The latter is placed at a fixed location in the external memory and cached by the minicache. The former, the pageable code region, is copied on demand to the SPM before execution. Both the pageable code region and the SPM are logically divided into pages the size of an MMU memory page. Using the MMU's pagefault exception mechanism, a runtime scratchpad memory manager (SPMM) tracks page accesses and copies frequently executed code pages to the SPM before they get executed. In order to minimize the number of page transfers from the external memory to the SPM, good code placement techniques become more important with increasing sizes of the MMU pages. We discuss code-grouping techniques and provide an analysis of the effect of the MMU's page size on execution time, energy consumption, and external memory accesses. We show that by using the data cache as a victim buffer for the SPM, significant energy savings are possible. We evaluate our SPM allocation strategy with fifteen applications, including H.264, MP3, MPEG-4, and PGP. The proposed memory system requires 8&percent; less die are compared to a fully-cached configuration. On average, we achieve a 31&percent; improvement in runtime performance and a 35&percent; reduction in energy consumption with an MMU page size of 256 bytes.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
2
 
3
ARM926EJ-S 2002. ARM926EJ-S Jazelle-enhanced macrocell,. http://www.arm.com/products/CPUs/ARM926EJ-S.html.
 
4
ARMv6 2002. ARM Architecture Version 6 (ARMv6),. http://www.arm.com.
5
6
 
7
8
 
9
10
11
12
 
13
 
14
H.264 2003. H.264 Video Codec. http://www.itu.int/rec/T-REC-H.264.
 
15
Intel IXP Network Processor 2002. The Intel IXP Network Processor,. http://developer.intel.com/technology/itj/2002/volume06issue03/.
 
16
Intel XScale 2002. Intel XScale Architecture. http://www.intel.com.
 
17
18
19
 
20
 
21
22
 
23
Micron Technology, Inc. 2003. MT48H8M16LF Mobile SDRAM.
 
24
Micron Technology, Inc. 2004. Mobile SDRAM Power Calc 10.
 
25
Moussouris, J., Crudele, L., Freitas, D., Hansen, C., Hudson, E., Przybylski, S., Riordan, T., and Rowen, C. 1986. A cmos risc processor with integrated system functions. In COMPCON.
 
26
MP3 1996. MP3 Reference Decoder. http://www.mp3-tech.org/programmer/sources/dist10.tgz.
27
 
28
29
30
 
31
PGPi. 2002. Pretty Good Privacy (PGPi). http://www.pgpi.org/.
 
32
Philips LPC3180 2006. Philips LPC3180 microcontroller. http://www.standardics.philips.com/.
33
 
34
Samsung Semiconductor. 2005. K4X51163PC Mobile DDR SRAM.
35
 
36
SNACK 2004. Seoul National University Advanced Compiler Tool Kit. http://aces.snu.ac.kr/snack.html.
37
38
 
39
 
40
Verma, M., Petzold, K., Wehmeyer, L., Falk, H., and Marvedel, P. 2005. Scratchpad sharing strategies for multiprocess embedded systems: A first approach. In 3rd Workshop on Embedded Systems for Real-Time Multimedia.
 
41
Wilton, S. and Jouppi, N. 1996. CACTI: An enhanced cache access and cycle time model. IEEE Journal of Solid State Circuits 31, 5, 677--688.
 
42
Xvid 2005. Xvid MPEG-4 Video Codec. http://www.xvid.org.


Collaborative Colleagues:
Bernhard Egger: colleagues
Jaejin Lee: colleagues
Heonshik Shin: colleagues