|
ABSTRACT
Nanoscale systems on chip will integrate billion-gate designs. The challenge is to find a scalable HW/SW design style for future CMOS technologies. Tiled architectures suggest a possible path: "small" processing tiles connected by "short wires". A typical SHAPES tile contains a VLIW floating-point DSP, a RISC, a DNP (Distributed Network Processor), distributed on chip memory, the POT (a set of Peripherals On Tile) plus an interface for DXM (Distributed External Memory). The SHAPES routing fabric connects on-chip and off-chip tiles, weaving a distributed packet switching network. 3D next-neighbours engineering methodologies is adopted for off-chip networking and maximum system density. The SW challenge is to provide a simple and efficient programming environment for tiled architectures. SHAPES will investigate a layered system software, which does not destroy algorithmic and distribution info provided by the programmer and is fully aware of the HW paradigm. For efficiency and QoS, the system SW manages intra-tile and inter-tile latencies, bandwidths, computing resources, using static and dynamic profiling. The SW accesses the on-chip and off-chip networks through a homogeneous interface.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
R. Ho, K. Mai and M. Horowitz, "The Future of Wires", Proc. IEEE, 89-4 (2001)490--504.
|
| |
2
|
D. Sylvester and K. Keutzer, "Impact of Small Process Geometries on Microarchitectures in Systems on a Chip", Proc. IEEE, 89-4(2001)467--489.
|
| |
3
|
|
| |
4
|
Alan Allan , Don Edenfeld , William H. Joyner, Jr. , Andrew B. Kahng , Mike Rodgers , Yervant Zorian, 2001 Technology Roadmap for Semiconductors, Computer, v.35 n.1, p.42-53, January 2002
[doi> 10.1109/2.976918]
|
| |
5
|
|
| |
6
|
Michael Bedford Taylor , Jason Kim , Jason Miller , David Wentzlaff , Fae Ghodrat , Ben Greenwald , Henry Hoffman , Paul Johnson , Jae-Wook Lee , Walter Lee , Albert Ma , Arvind Saraf , Mark Seneski , Nathan Shnidman , Volker Strumpen , Matt Frank , Saman Amarasinghe , Anant Agarwal, The Raw Microprocessor: A Computational Fabric for Software Circuits and General-Purpose Programs, IEEE Micro, v.22 n.2, p.25-35, March 2002
[doi> 10.1109/MM.2002.997877]
|
| |
7
|
P. S. Paolucci et al. "Janus: A gigaflop VLIW+RISC Soc Tile", Hot Chips 15 IEEE Stanford Conference (2003). http://www.hotchips.org (note: Janus was the development name of Diopsis. see www.atmelroma.it).
|
| |
8
|
|
| |
9
|
P. Faraboschi, G. Desoli, J. A. Fisher, "The Latest Word in Digital and Media Processing", IEEE Signal Processing Mag. 15-2(1998)59--85.
|
| |
10
|
R. P. Clowell, J. O'Donnell, D. P. Papworth, P. K. Rodman, "Instruction Storage Method with a Compressed Format Using a Mask Word", U.S. Patent 5057837, (Oct 1991).
|
| |
11
|
P. S. Paolucci, P. Kajfasz et al., "mAgic-FPU and MADE: A customizable VLIW core and the modular VLIW processor architecture description environment", Computer Physics Communication 139(2001)132--143.
|
| |
12
|
A. Bartoloni, P. S. Paolucci et al., "A Hardware Implementation of the APE100 Architecture", Int. Journ. Mod. Phys. C 4(1993)969.
|
| |
13
|
N. Cabibbo and P. S. Paolucci, "SIMD algorithm for Matrix Transposition", Int. Journ. Mod. Phys. C 6(1995)183.
|
| |
14
|
F. Aglietti, P. S. Paolucci, et al. , "The teraflop supercomputer APEmille: architecture, software and project status report" Computer Physics Communications, 110,1-3 (May 1998) 216--219
|
| |
15
|
|
 |
16
|
E. A. de Kock , W. J. M. Smits , P. van der Wolf , J.-Y. Brunel , W. M. Kruijtzer , P. Lieverse , K. A. Vissers , G. Essink, YAPI: application modeling for signal processing systems, Proceedings of the 37th conference on Design automation, p.402-405, June 05-09, 2000, Los Angeles, California, United States
[doi> 10.1145/337292.337511]
|
| |
17
|
G. E. Erwin de Kock. Y-Chart Application Programmer's Interface: The YAPI Programmer's and Reference Guide Version 2.0.0.
|
 |
18
|
Pieter van der Wolf , Erwin de Kock , Tomas Henriksson , Wido Kruijtzer , Gerben Essink, Design and programming of embedded multiprocessors: an interface-centric approach, Proceedings of the 2nd IEEE/ACM/IFIP international conference on Hardware/software codesign and system synthesis, September 08-10, 2004, Stockholm, Sweden
[doi> 10.1145/1016720.1016771]
|
 |
19
|
Tim Kogel , Malte Doerper , Andreas Wieferink , Rainer Leupers , Gerd Ascheid , Heinrich Meyr , Serge Goossens, A modular simulation framework for architectural exploration of on-chip interconnection networks, Proceedings of the 1st IEEE/ACM/IFIP international conference on Hardware/software codesign and system synthesis, October 01-03, 2003, Newport Beach, CA, USA
[doi> 10.1145/944645.944648]
|
| |
20
|
Torsten Kempf , Malte Doerper , R. Leupers , G. Ascheid , H. Meyr , Tim Kogel , Bart Vanthournout, A Modular Simulation Framework for Spatial and Temporal Task Mapping onto Multi-Processor SoC Platforms, Proceedings of the conference on Design, Automation and Test in Europe, p.876-881, March 07-11, 2005
[doi> 10.1109/DATE.2005.21]
|
| |
21
|
Torsten Kempf , Kingshuk Karuri , Stefan Wallentowitz , Gerd Ascheid , Rainer Leupers , Heinrich Meyr, A SW performance estimation framework for early system-level-design using fine-grained instrumentation, Proceedings of the conference on Design, automation and test in Europe: Proceedings, March 06-10, 2006, Munich, Germany
|
| |
22
|
|
| |
23
|
K. Keutzer, S. Malik, A. Newton, J. Rabaey, and A. Sangiovanni-Vincentelli, "System level design: Orthogonalization of concerns and platform-based design," IEEE Trans. on Computer-Aided Design of Integrated Circuits and Systems, vol. 19, no. 12, Dec. 2000.
|
| |
24
|
"Virtual Platform Designer" http://www.CoWare.com
|
| |
25
|
"Chess/Checkers, a retargetable tool-suite for embedded processors", Target Compiler Technologies, http://www.retarget.com/doc/target-whitepaper.pdf.
|
| |
26
|
L. Thiele, S. Chakraborty, and M. Naedele. "Real-time calculus for scheduling hard real-time systems," In Proc. IEEE International Symposium on Circuits and Systems (ISCAS), volume 4, pages 101--104, 2000.
|
| |
27
|
G. Kahn, "The semantics of a simple language for parallel programming," in Proc. of the IFIP Congress 74, (1974).
|
| |
28
|
Francesco Belletti , Sebastiano Fabio Schifano , Raffaele Tripiccione , Francois Bodin , Philippe Boucaud , Jacques Micheli , Olivier Pene , Nicola Cabibbo , Sergio de Luca , Alessandro Lonardo , Davide Rossetti , Piero Vicini , Maxim Lukyanov , Laurent Morin , Norbert Paschedag , Hubert Simma , Vincent Morenas , Dirk Pleiter , Federico Rapuano, Computing for LQCD: apeNEXT, Computing in Science and Engineering, v.8 n.1, p.18-29, January 2006
[doi> 10.1109/MCSE.2006.4]
|
INDEX TERMS
Primary Classification:
C.
Computer Systems Organization
C.1
PROCESSOR ARCHITECTURES
C.1.4
Parallel Architectures
Additional Classification:
B.
Hardware
B.7
INTEGRATED CIRCUITS
B.7.2
Design Aids
Subjects:
Simulation
C.
Computer Systems Organization
C.0
GENERAL
Subjects:
Hardware/software interfaces;
Modeling of computer architecture;
System architectures
C.3
SPECIAL-PURPOSE AND APPLICATION-BASED SYSTEMS
Subjects:
Real-time and embedded systems;
Signal processing systems
F.
Theory of Computation
F.2
ANALYSIS OF ALGORITHMS AND PROBLEM COMPLEXITY
F.2.2
Nonnumerical Algorithms and Problems
Subjects:
Sequencing and scheduling
General Terms:
Design,
Experimentation,
Languages,
Performance
Keywords:
MP-SOC,
RISC,
VLIW,
application mapping,
binding,
distributed network processors,
embedded systems,
hardware dependent software,
model based design,
network of processes,
retargetable compiler,
scheduling,
simulation,
tiled parallel architectures
|