| |
1
|
R. Allen and K. Kennedy. Optimizing Compilers for Modern Architectures. Morgan Kaufmann Publishers, 2001.
|
| |
2
|
R. Barik and V. Sarkar. Interprocedural load elimination for dynamic optimization of parallel programs. In The Eighteenth International Conference on Parallel Architectures and Compilation Techniques (PACT), September 2009. (To appear).
|
| |
3
|
Z. Budimlic et el. Declarative aspects of memory management in the concurrent collections parallel programming model. In DAMP 2009: Workshop on Declarative Aspects of Multicore Programming, January 2009.
|
| |
4
|
Z. Budimlic et al. Multi-core implementations of the concurrent collections programming model. In CPC '09: 14th International Workshop on Compilers for Parallel Computers. Springer, January 2009.
|
| |
5
|
S. Chandra et al. Type inference for locality analysis of distributed data structures. In PPoPP '08: Proceedings of the 13th ACM SIGPLAN Symposium on Principles and practice of parallel programming, pages 11--22, New York, NY, USA, 2008. ACM.
|
| |
6
|
P. Charles et al. X10: an object-oriented approach to non-uniform cluster computing. In OOPSLA '05: Proceedings of the 20th annual ACM SIGPLAN conference on Object-oriented programming, systems, languages, and applications, pages 519--538, New York, NY, USA, 2005. ACM.
|
| |
7
|
J. Ferrante et al. The Program Dependence Graph and its Use in Optimization. ACM Transactions on Programming Languages and Systems, 9(3):319--349, July 1987.
|
| |
8
|
Y. Guo et al. Work-First and Help-First Scheduling Policies for Async-Finish Task Parallelism. In IPDPS '09: International Parallel and Distributed Processing Symposium (To Appear), 2009.
|
| |
9
|
Intel (r) concurrent collections for c/c++. http://softwarecommunity.intel.com/articles/eng/3862.htm.
|
| |
10
|
K. Knobe and C. D. Offner. Tstreams: A model of parallel computation (preliminary report). Technical Report HPL-2004-78, HP Labs, 2004.
|
| |
11
|
K. Knobe and V. Sarkar. The concurrent collections parallel programming model -- foundations and implementation challenges. PLDI 2009 tutorial. http://www.cs.virginia.edu/kim/publicity/pldi09tutorials/CnC-tutorial.pdf.
|
| |
12
|
J. R. Larus and R. Rajwar. Transactional Memory. Morgan & Claypool, 2006.
|
| |
13
|
T. Peierls et al. Java Concurrency in Practice. Addison-Wesley Professional, 2005.
|
| |
14
|
Rice University. Habanero Multicore Software Research project.
|
| |
15
|
J. Shirako et al. Phasers: a unified deadlock-free construct for collective and point-to-point synchronization. In ICS '08: Proceedings of the 22nd annual international conference on Supercomputing, pages 277--288, New York, NY, USA, 2008. ACM.
|
| |
16
|
J. Shirako et al. Phaser Accumulators: a New Reduction Construct for Dynamic Parallelism. In 23rd IEEE IPDPS, 2009.
|
| |
17
|
Habanero Team. Download site for initial release of Concurrent Collections (CnC) and Habanero Java (HJ) integrated runtime system. http://www.cs.rice.edu/~vsarkar/downloads/cnc_distrib_2009_07_21.zip.
|