ACM Home Page
Please provide us with feedback. Feedback
Performance analysis of accelerated image registration using GPGPU
Full text PdfPdf (791 KB)
Source ACM International Conference Proceeding Series; Vol. 383 archive
Proceedings of 2nd Workshop on General Purpose Processing on Graphics Processing Units table of contents
Washington, D.C.
Pages 38-45  
Year of Publication: 2009
ISBN:978-1-60558-517-8
Authors
Peter Bui  University of Notre Dame
Jay Brockman  University of Notre Dame
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 79,   Downloads (12 Months): 343,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1513895.1513900
What is a DOI?

ABSTRACT

This paper presents a performance analysis of an accelerated 2-D rigid image registration implementation that employs the Compute Unified Device Architecture (CUDA) programming environment to take advantage of the parallel processing capabilities of NVIDIA's Tesla C870 GPU. We explain the underlying structure of the GPU implementation and compare its performance and accuracy against a fast CPU-based implementation. Our experimental results demonstrate that our GPU version is capable of up to 90x speedup with bilinear interpolation and 30x speedup with bicubic interpolation while maintaining a high level of accuracy. This compares favorably to recent image registration studies, but it also indicates that our implementation only reaches about 70% of theorectical peak performance. To analyze our results, we utilize profiling data to identify some of the underlying limitations of CUDA that prohibit peak performance. At the end, we emphasize the need to manage memory resources carefully to fully utilize the GPU and obtain maximum speedup.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Free Software Foundation. GNU scientific library. http://www.gnu.org/software/gsl/.
 
2
J. Fung and S. Mann. Using graphics devices in reverse: GPU-based image processing and computer vision. In IEEE Int'l Conf. on Multimedia & Expo, pages 9--12, 2008.
 
3
A. A. Goshtasby. 2-D and 3-D Image Registration. Wiley-Interscience, 2005.
 
4
M. Harris. Mapping computational concepts to GPUs. In GPU Gems 2, pages 493--508. Addison Wesley, 2005.
 
5
K. E. Hillesland and A. Lastra. GPU floating-point paranoia. In GP2 ACM Workshop on General Purpose Computing on Graphics Processors, page 8, 2004.
 
6
F. Ino, J. Gomita, Y. Kawasaki, and K. Hagihara. A GPGPU approach for accelerating 2-d/3-d rigid registration of medical images. In International Symposium on Parallel and Distributed Processing and Applications (ISPA), pages 939--950, 2006.
 
7
A. Kubias, F. Deinzer, T. Feldmann, D. Paulus, B. Schreiber, and T. Brunner. 2d/3d image registration on the GPU. Pattern Recognition and Image Analysis, 18(3):381--389, 2008.
 
8
 
9
NVIDIA. NVIDIA CUDA Compute Unified Device Architecture, Programming Guide, Version 2.0. NVIDIA, 2008.
 
10
A. Obukhov and A. Kharlamov. Dct8x8. NVIDIA Software Development Kit (SDK), 2008.
 
11
J. D. Owens, M. Houston, D. Luebke, S. Green, J. E. Stone, and J. C. Phillips. GPU computing. Proceedings of the IEEE, 96(5):879--899, 2008.
 
12
W. Plishker, O. Dandekar, S. Bhattacharyya, and R. Shekhar. Towards a heterogeneous medical image registration acceleration platform. Biomedical Circuits and Systems Conference (BIOCAS), pages 231--234, Nov. 2007.
 
13
C. Sigg and M. Hadwiger. Fast third-order texture filtering. In GPU Gems 2, pages 313--317. Addison Wesley, 2005.
 
14
T. Sugiura, D. Deguichi, T. Kitasaka, K. Mori, and Y. Suenaga. A method for accelerating bronchoscope tracking based on image registration by GPGPU. In Augmented environments for Medical Imaging including Augmented Reality in Computer-aided Surgery (AMI-ARCS) 2008 (Medical Image Computing and Computer Assisted Intervention 2008), 2008.
 
15
P. Thévenaz, U. Ruttimann, and M. Unser. A pyramid approach to subpixel registration based on intensity. IEEE Transactions on Image Processing, 7(1):27--41, January 1998.
 
16
 
17
B. Zitová and J. Flusser. Image registration methods: a survey. Image and Vision Computing, 21(11):977--1000, October 2003.