ABSTRACT
As computational power increases, tele-immersive applications are emerging as a trend. These applications place heavy demands on computational resources through their extensive use of real-time 3D reconstruction algorithms. Since computer vision developers do not necessarily have parallel programming expertise, it is important to give them tools and capabilities that let them express computer vision algorithms naturally while retaining high efficiency on modern GPU and large-scale multi-core platforms. In this paper, we describe our efforts to optimize a tele-immersion application by tuning it for GPU and multi-core platforms. Additionally, we introduce a method that provides portability and high performance while increasing programmer productivity.