Automatic multithreading and multiprocessing of C programs for IXP
Source
Principles and Practice of Parallel Programming
Proceedings of the tenth ACM SIGPLAN symposium on Principles and practice of parallel programming
Chicago, IL, USA
SESSION: Automatic parallelization
Pages: 132-141
Year of Publication: 2005
ISBN: 1-59593-080-9
Authors
Long Li, Intel China Software Center, Shanghai, PRC
Bo Huang, Intel China Software Center, Shanghai, PRC
Jinquan Dai, Intel China Software Center, Shanghai, PRC
Luddy Harrison, Univ. of Illinois at Urbana-Champaign, Urbana, IL
ABSTRACT
Effective compilation of packet processing applications onto the Intel IXP network processors requires, among other things, the automatic use of multiple threads on one or more processing elements, and the automatic introduction of synchronization as required to correctly enforce dependences between such threads. We describe the program transformation that is used in the Intel Auto-partitioning C Compiler for IXP to automatically multithread and multiprocess a program for the IXP. This transformation consists of steps that introduce inter-thread signaling to enforce dependences, optimize the placement of such signaling, reduce the number of signals in use to the number available in hardware, and transform the initialization code for correct execution in the multithreaded version. Experimental results show that our method provides impressive speedup for six packet processing stages (PPSes) in the widely used NPF IP forwarding benchmarks. For most PPSes, our algorithms achieve almost linear performance improvement after the automatic multithreading transformation. The automatic multiprocessing transformation helps further boost the speedup of two PPSes.
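The idea of enforcing a dependence between threads with inter-thread signaling can be illustrated with a minimal sketch in C. This is not the compiler's output or the paper's algorithm: POSIX condition variables stand in for the IXP hardware signals, and every name here (`signal_t`, `signal_wait`, `signal_send`, `worker`, `run_ordered`) is hypothetical. Each thread handles every n-th packet, waits for a signal from its predecessor before touching the shared state, and then signals its successor, so the dependent section runs in packet order. Thread 0's signal is pre-set during initialization so that the first packet can proceed.

```c
#include <assert.h>
#include <pthread.h>
#include <stdlib.h>

/* Hypothetical software stand-in for one hardware inter-thread signal. */
typedef struct {
    pthread_mutex_t lock;
    pthread_cond_t  cond;
    int             pending;            /* signals sent but not yet consumed */
} signal_t;

static void signal_init(signal_t *s, int pending) {
    pthread_mutex_init(&s->lock, NULL);
    pthread_cond_init(&s->cond, NULL);
    s->pending = pending;
}

static void signal_wait(signal_t *s) {
    pthread_mutex_lock(&s->lock);
    while (s->pending == 0)
        pthread_cond_wait(&s->cond, &s->lock);
    s->pending--;
    pthread_mutex_unlock(&s->lock);
}

static void signal_send(signal_t *s) {
    pthread_mutex_lock(&s->lock);
    s->pending++;
    pthread_cond_signal(&s->cond);
    pthread_mutex_unlock(&s->lock);
}

typedef struct {
    int       id, n_threads, n_packets;
    signal_t *signals;                  /* signals[i] wakes thread i */
    int      *next_seq;                 /* shared state carrying the dependence */
    int      *order;                    /* order[k] = packet given sequence number k */
} worker_t;

static void *worker(void *arg) {
    worker_t *w = arg;
    for (int pkt = w->id; pkt < w->n_packets; pkt += w->n_threads) {
        /* Independent per-packet work would run here, in parallel. */
        signal_wait(&w->signals[w->id]);              /* wait for predecessor */
        w->order[*w->next_seq] = pkt;                 /* dependent section */
        (*w->next_seq)++;
        signal_send(&w->signals[(w->id + 1) % w->n_threads]);
    }
    return NULL;
}

/* Returns 1 if the dependent sections ran in packet order 0..n_packets-1. */
int run_ordered(int n_threads, int n_packets) {
    assert(n_threads > 0 && n_packets > 0);
    signal_t  *signals = malloc(n_threads * sizeof *signals);
    worker_t  *workers = malloc(n_threads * sizeof *workers);
    pthread_t *tids    = malloc(n_threads * sizeof *tids);
    int       *order   = malloc(n_packets * sizeof *order);
    assert(signals && workers && tids && order);
    int next_seq = 0;

    /* Initialization: only thread 0 starts with its signal already set. */
    for (int i = 0; i < n_threads; i++)
        signal_init(&signals[i], i == 0);
    for (int i = 0; i < n_threads; i++) {
        workers[i] = (worker_t){ i, n_threads, n_packets,
                                 signals, &next_seq, order };
        pthread_create(&tids[i], NULL, worker, &workers[i]);
    }
    for (int i = 0; i < n_threads; i++)
        pthread_join(tids[i], NULL);

    int ok = (next_seq == n_packets);
    for (int k = 0; ok && k < n_packets; k++)
        ok = (order[k] == k);

    for (int i = 0; i < n_threads; i++) {
        pthread_cond_destroy(&signals[i].cond);
        pthread_mutex_destroy(&signals[i].lock);
    }
    free(signals); free(workers); free(tids); free(order);
    return ok;
}
```

Calling `run_ordered(4, 16)` from a test driver, for example, runs four threads over sixteen packets and reports whether the shared sequence numbers were assigned in packet order. The sketch uses one signal per thread; the signal placement and signal-count reduction steps mentioned in the abstract are not modeled here.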
CITED BY 4
Xiaofeng Guo, Jinquan Dai, Long Li, Zhiyuan Lv, Prashant R. Chandra, Latency hiding through multithreading on a network processor, Proceedings of the 12th ACM SIGPLAN symposium on Principles and practice of parallel programming, March 14-17, 2007, San Jose, California, USA