|
ABSTRACT
Reuse of cores can reduce design time for systems-on-a-chip. Such reuse is dependent on being able to easily interface a core to any bus. To enable such interfacing, many propose separating a core's interface from its internals by using a bus wrapper. However, this separation can lead to a performance penalty when reading a core's internal registers. In this paper, we introduce prefetching, which is analogous to caching, as a technique to reduce or eliminate this performance penalty, involving a tradeoff with power and size. We describe the prefetching technique, classify different types of registers, describe our initial prefetching architectures and heuristics for certain classes of registers, and highlight experiments demonstrating the performance improvements and size/power tradeoffs. We further introduce a technique for automatically designing a prefetch unit that satisfies user-imposed register-access constraints. The technique benefits from mapping the prefetching problem to the well-known real-time process scheduling problem. We then extend the technique to allow user-specified register interdependencies, using a Petri net model, resulting in even more efficient prefetch schedules.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
Pai Chou , Ross B. Ortega , Gaetano Borriello, Interface co-synthesis techniques for embedded systems, Proceedings of the 1995 IEEE/ACM international conference on Computer-aided design, p.280-287, November 05-09, 1995, San Jose, California, United States
|
 |
3
|
Benoit Clement , Richard Hersemeule , Etienne Lantreibecq , Bernard Ramanadin , Pierre Coulomb , Francois Pogodalla, Fast prototyping: a system design flow applied to a complex system-on-chip multiprocessor design, Proceedings of the 36th ACM/IEEE conference on Design automation, p.420-424, June 21-25, 1999, New Orleans, Louisiana, United States
[doi> 10.1145/309847.309971]
|
| |
4
|
COWARE. 1999. CoWare N2C design system overview. http://www.coware.com/ products.html.
|
 |
5
|
Josef Fleischmann , Klaus Buchenrieder , Rainer Kress, Java driven codesign and prototyping of networked embedded systems, Proceedings of the 36th ACM/IEEE conference on Design automation, p.794-797, June 21-25, 1999, New Orleans, Louisiana, United States
[doi> 10.1145/309847.310068]
|
| |
6
|
|
| |
7
|
|
| |
8
|
KUHN, T., ROSENSTIEL, W., AND KEBSCHULL, U. 1997. Description and simulation of hardware/software systems with Java. In Design Automation Conference (1997).
|
 |
9
|
|
| |
10
|
|
| |
11
|
MENTOR GRAPHICS. n.d. Inventra core library. http://www.mentorg.com/inventra/.
|
| |
12
|
METAFLOW. 1999. Implosion ARM-based Internet appliance system on a chip platform. http://www.metaflow.com/ html/ implosion.htm.
|
| |
13
|
MOTOROLA. 1999. IP interface standard document. http://www.mot-sps.com/ technology/srs/.
|
| |
14
|
|
| |
15
|
PHILIPS SEMICONDUCTORS. 1999. Introduction to rapid silicon prototyping: hardware-software co-design for embedded systems-on-a-chip ICs. http://www.semiconductors.com/ acrobat/other/ technology/velocity/paper2221.pdf.
|
| |
16
|
RABAEY, J., ABNOUS, A., ICHIKAWA, Y., SENO, K., AND WAN, M. 1997. Heterogeneous reconfigurable systems. In Proceedings of the IEEE Workshop on Design and Implementation of Signal Processing Systems (1997), 24-34.
|
 |
17
|
|
| |
18
|
SEMICONDUCTOR INDUSTRY ASSOCIATION. 1999. International technology roadmap for semiconductors: 1999 edition. International SEMATECH, Austin, TX.
|
 |
19
|
|
| |
20
|
|
| |
21
|
VAN MEERBERGEN, J., TIMMER, A., LEIJTEN, J., HARMSZE, F., AND STRIK, M. 1998. Experiences with system level design for consumer ICs. In Proceedings of the IEEE Computer Society Workshop on VLSI System Level Design (1998).
|
 |
22
|
Steven Vercauteren , Bill Lin , Hugo De Man, Constructing application-specific heterogeneous embedded architectures from custom HW/SW applications, Proceedings of the 33rd annual conference on Design automation, p.521-526, June 03-07, 1996, Las Vegas, Nevada, United States
[doi> 10.1145/240518.240617]
|
| |
23
|
VIRTUAL SOCKET INTERFACE ASSOCIATION. 1997a. Architecture document. http://www.vsi.org.
|
| |
24
|
VIRTUAL SOCKET INTERFACE ASSOCIATION. 1997b. Virtual component interface standard version 1.0. http://www.vsi.org.
|
| |
25
|
VIRTUAL SOCKET INTERFACE ASSOCIATION. 1998. On-chip bus development working group, Specification 1 Version 1.0 (OCB 1 1.0). http://www.vsi.org.
|
CITED BY 4
|
|
|
|
Minas Dasygenis , Erik Brockmeyer , Bart Durinck , Francky Catthoor , Dimitrios Soudris , Antonios Thanailakis, A combined DMA and application-specific prefetching approach for tackling the memory latency bottleneck, IEEE Transactions on Very Large Scale Integration (VLSI) Systems, v.14 n.3, p.279-291, March 2006
|
|
|
|
|
|
|
Peer to Peer - Readers of this Article have also read:
-
Data structures for quadtree approximation and compression
Communications of the ACM
28, 9
Hanan Samet
-
A hierarchical single-key-lock access control using the Chinese remainder theorem
Proceedings of the 1992 ACM/SIGAPP Symposium on Applied computing
Kim S. Lee
, Huizhu Lu
, D. D. Fisher
-
Putting innovation to work: adoption strategies for multimedia communication systems
Communications of the ACM
34, 12
Ellen Francik
, Susan Ehrlich Rudman
, Donna Cooper
, Stephen Levine
-
The GemStone object database management system
Communications of the ACM
34, 10
Paul Butterworth
, Allen Otis
, Jacob Stein
-
An intelligent component database for behavioral synthesis
Proceedings of the 27th ACM/IEEE Design Automation Conference on
Gwo-Dong Chen
, Daniel D. Gajski
|