|
ABSTRACT
We present a methodology for pipelined software synthesis of streaming applications. First, we develop a versatile task assignment algorithm capable of optimizing realistically-arbitrary cost functions for two cores. The algorithm is exact (i.e., theoretically optimal) contrary to existing heuristics. Second, our approximation technique provides an adjustable knob to trade solution quality with algorithm runtime and memory. Third, we develop a recursive heuristic for more cores. FPGA-based emulated experiments validate our theoretical results. The exact algorithm yields 1.7 × throughput improvement. The approximation method offers a range of tradeoff points (e.g., 3 × faster with 20 × less memory) while degrading the throughput only 1% to 5%.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
|
| |
2
|
Alur, R. 2003. Formal analysis of hierarchical state machines. In Verifcation Theory and Practice. Springer, Berlin, Germany, 42--66.
|
| |
3
|
|
| |
4
|
Angelini, P., Di Battista, G., and Patrignani, M. 2007. Computing a minimum-depth planar graph embedding in O(n4) time. Lecture Notes in Computer Science, vol. 4619, 287.
|
 |
5
|
|
| |
6
|
Felice Balarin , Yosinori Watanabe , Harry Hsieh , Luciano Lavagno , Claudio Passerone , Alberto Sangiovanni-Vincentelli, Metropolis: An Integrated Electronic System Design Environment, Computer, v.36 n.4, p.45-52, April 2003
[doi> 10.1109/MC.2003.1193228]
|
| |
7
|
Benveniste, A., Carloni, L. P., Caspi, P., and Sangiovanni-Vincentelli, A. L. 2003. Heterogeneous reactive systems modeling and correct-by-construction deployment. In Proceedings of the International Conference on Embedded Software (EMSOFT). Springer, Berlin, Germany, 35--50.
|
 |
8
|
|
| |
9
|
|
 |
10
|
|
| |
11
|
|
| |
12
|
Erbas, C., Erbas, S. C., and Pimentel, A. D. 2006. Multiobjective optimization and evolutionary algorithms for the application mapping problem in multiprocessor system-on-chip design. IEEE Trans. Evolut. Comput. 10, 3, 358--374.
|
| |
13
|
|
| |
14
|
|
| |
15
|
|
 |
16
|
Michael I. Gordon , William Thies , Saman Amarasinghe, Exploiting coarse-grained task, data, and pipeline parallelism in stream programs, Proceedings of the 12th international conference on Architectural support for programming languages and operating systems, October 21-25, 2006, San Jose, California, USA
|
| |
17
|
|
| |
18
|
|
| |
19
|
|
| |
20
|
Henzinger, T. A. and Sifakis, J. 2006. The embedded systems design challenge. In Proceedings of the 14th International Symposium on Formal Methods. Springer, Berlin, Germany, 1--15.
|
| |
21
|
Hu, J. and Marculescu, R. 2005. Energy- and performance-aware mapping for regular noc architectures. IEEE Trans. Comput. Aid. Des. Integr. Circ. Syst. 24, 4.
|
| |
22
|
Kahn, G. 1974. The semantics of simple language for parallel programming. In Proceedings of the International Federation for Information Processing (IFIP) Congress. 471--475.
|
| |
23
|
|
| |
24
|
Lee, E. A. 2005. Building unreliable systems out of reliable components: The real time story. Tech. rep. UCB/EECS-2005-5, EECS Department, University of California, Berkeley.
|
| |
25
|
|
| |
26
|
|
| |
27
|
Lee, E. A. and Messerschmitt, D. G. 1987b. Synchronous data ow. Proc. IEEE 75, 9, 1235--1245.
|
| |
28
|
Lipton, R. J. and Tarjan, R. E. 1979. A separator theorem for planar graphs. SIAM J. Applied Mathematics 36, 177--189.
|
 |
29
|
|
| |
30
|
Meeting. 2006. Joint United States-European Union-TEKES workshop: Long term challenges in high con_dence composable embedded systems. http://www.truststc.org/euus/wiki/Euus/HelsinkiMeeting.
|
 |
31
|
Michael I. Gordon , William Thies , Michal Karczmarek , Jasper Lin , Ali S. Meli , Andrew A. Lamb , Chris Leger , Jeremy Wong , Henry Hoffmann , David Maze , Saman Amarasinghe, A stream compiler for communication-exposed architectures, Proceedings of the 10th international conference on Architectural support for programming languages and operating systems, October 05-09, 2002, San Jose, California
|
 |
32
|
John D. Owens , William J. Dally , Ujval J. Kapasi , Scott Rixner , Peter Mattson , Ben Mowery, Polygon rendering on a stream architecture, Proceedings of the ACM SIGGRAPH/EUROGRAPHICS workshop on Graphics hardware, p.23-32, August 21-22, 2000, Interlaken, Switzerland
[doi> 10.1145/346876.346883]
|
| |
33
|
|
 |
34
|
|
| |
35
|
|
| |
36
|
|
 |
37
|
|
| |
38
|
Ram Rangan , Neil Vachharajani , Adam Stoler , Guilherme Ottoni , David I. August , George Z. N. Cai, Support for High-Frequency Streaming in CMPs, Proceedings of the 39th Annual IEEE/ACM International Symposium on Microarchitecture, p.259-272, December 09-13, 2006
[doi> 10.1109/MICRO.2006.47]
|
 |
39
|
|
 |
40
|
|
| |
41
|
Stankovic, J. A. 2007. Keynote speech: Control challenges in wireless sensor networks. In Proceedings of the 10th International Conference on Hybrid Systems: Computation and Control. Springer, Berlin, Germany, 2.
|
| |
42
|
|
 |
43
|
|
 |
44
|
|
 |
45
|
Janos Sztipanovits , John Glossner , Trevor Mudge , Chris Rowen , Alberto Sangiovanni-Vincentelli , Wayne Wolf , Feng Zhao, Grand challenges in embedded systems, Proceedings of the 5th ACM international conference on Embedded software, September 18-22, 2005, Jersey City, NJ, USA
[doi> 10.1145/1086228.1086296]
|
 |
46
|
Michael Bedford Taylor , Walter Lee , Jason Miller , David Wentzlaff , Ian Bratt , Ben Greenwald , Henry Hoffmann , Paul Johnson , Jason Kim , James Psota , Arvind Saraf , Nathan Shnidman , Volker Strumpen , Matt Frank , Saman Amarasinghe , Anant Agarwal, Evaluation of the Raw Microprocessor: An Exposed-Wire-Delay Architecture for ILP and Streams, Proceedings of the 31st annual international symposium on Computer architecture, p.2, June 19-23, 2004, München, Germany
|
| |
47
|
|
| |
48
|
Thies, W., Lin, J., and Amarasinghe, S. 2003. Partitioning a structured stream graph using dynamic programming. Tech. rep., CS Department, Massachusetts Institute of Technology.
|
| |
49
|
|
 |
50
|
|
| |
51
|
Yu, Z., Meeuwsen, M., Apperson, R., Sattari, O., Lai, M., Webb, J., Work, E., Mohsenin, T., Singh, M., and Baas, B. M. 2006. An asynchronous array of simple processors for DSP applications. In Proceedings of the IEEE International Solid-State Circuits Conference (ISSCC). IEEE, Los Alamitos, CA.
|
| |
52
|
|
|