ACM Home Page
Please provide us with feedback. Feedback
Speculative DMA for architecturally visible storage in instruction set extensions
Full text PdfPdf (167 KB)
Source
International Conference on Hardware Software Codesign archive
Proceedings of the 6th IEEE/ACM/IFIP international conference on Hardware/Software codesign and system synthesis table of contents
Atlanta, GA, USA
SESSION: System level design: throughput, dependability, coherence, and yield table of contents
Pages 243-248  
Year of Publication: 2008
ISBN:978-1-60558-470-6
Authors
Theo Kluter  Ecole Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland
Philip Brisk  Ecole Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland
Paolo Ienne  Ecole Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland
Edoardo Charbon  Ecole Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland
Sponsors
SIGDA: ACM Special Interest Group on Design Automation
SIGBED: ACM Special Interest Group on Embedded Systems
ACM: Association for Computing Machinery
SIGMICRO: ACM Special Interest Group on Microarchitectural Research and Processing
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 12,   Downloads (12 Months): 84,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1450135.1450191
What is a DOI?

ABSTRACT

Instruction set extensions (ISEs) can accelerate embedded processor performance. Many algorithms for ISE generation have shown good potential; some of them have recently been expanded to include Architecturally Visible Storage (AVS) - compiler-controlled memories, similar to scratchpads, that are accessible only to ISEs. To achieve a speedup using AVS, Direct Memory Access (DMA) transfers are required to move data from the main memory to the AVS; unfortunately, this creates coherence problems between the AVS and the cache, which previous methods for ISEs with AVS failed to address; additionally, these methods need to leave many conservative DMA transfers in place, whose execution significantly limits the achievable speedup. This paper presents a memory coherence scheme for ISEs with AVS, which can ensure execution correctness and memory consistency with minimal area overhead. We also present a method that speculatively removes redundant DMA transfers. Cycle-accurate experimental results were obtained using an FPGA-emulation platform. These results show that the application-specific instruction-set extended processors with speculative DMA-enhanced AVS gain significantly over previous techniques, despite the overhead of the coherence mechanism.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
ARM Ltd. ARM Cortex-A9 MPCore. http://www.arm.com/products/CPUs/.
2
3
 
4
P. Biswas, N. Dutt, L. Pozzi, and P. Ienne. Introduction of architecturally visible storage in instruction set extensions. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, CAD-26(3):435--46, Mar. 2007.
 
5
 
6
 
7
T. R. Halfhill. EEMBC releases first benchmarks. Microprocessor Report, 1 May 2000.
 
8
9
 
10
L. Pozzi, K. Atasu, and P. Ienne. Exact and approximate algorithms for the extension of embedded processor instruction sets. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, CAD-25(7):1209--29, July 2006.
 
11
12
13

Collaborative Colleagues:
Theo Kluter: colleagues
Philip Brisk: colleagues
Paolo Ienne: colleagues
Edoardo Charbon: colleagues