ABSTRACT
Multiprocessor SoC systems have led to the increasing use of parallel hardware along with the associated software. These approaches have included coprocessor, homogeneous processor (e.g. SMP) and application-specific architectures (e.g. DSP, ASIC). ASIPs have emerged as a viable alternative to conventional processing entities (PEs) due to their configurability and programmability. In this work, we introduce a heterogeneous multiprocessor system using ASIPs as processing entities in a pipeline configuration. A streaming application is manually partitioned into a series of algorithmic stages, each of which forms one stage of the pipeline. We formulate the problem of mapping each algorithmic stage in the system to an ASIP configuration, and propose a heuristic to efficiently search the design space for a pipeline-based multi-ASIP system. We have implemented the proposed heterogeneous multiprocessor methodology using a commercial extensible processor (Xtensa LX from Tensilica Inc.). We have evaluated our system by creating two benchmarks (MP3 and JPEG encoders), which are mapped to our proposed design platform. Our multiprocessor design provided a performance improvement of at least 4.11X (JPEG) and 3.36X (MP3) compared to the single-processor design. The minimum cost obtained through our heuristic was within 5.47% and 5.74% of the best possible values for the JPEG and MP3 benchmarks, respectively.