|
ABSTRACT
The advent of new matrix-valued magnetic resonance imaging modalities such as Diffusion Tensor Imaging (DTI) requires extensive computational acceleration. Computational acceleration on graphics processing units (GPUs) can make the regularization (denoising) of DTI images attractive in clinical settings, hence improving the quality of DTI images in a broad range of applications. Construction of DTI images consists of direction-specific Magnetic Resonance (MR) measurements. Compared with conventional MR, direction-sensitive acquisition has a lower signal-to-noise ratio (SNR). Therefore, high noise levels often limit DTI imaging. Advanced post-processing of imaging data can improve the quality of estimated tensors. However, the post-processing problem is only made more computationally difficult when considering matrix-valued imaging data. This paper describes the acceleration of a Total Variation regularization method for matrix-valued images, in particular, for DTI images on NVIDIA Quadro FX 5600. The TV regularization of a 3-D image with 1283 voxels ultimately achieves 266X speedup and requires 1 minute and 30 seconds on the Quadro, while this algorithm on a dual-core CPU completes in more than 3 hours. In this application study we are aimed at analyzing the effective of excessive synchronization, which provides an insight into generally adapting Variational methods to the GPU architecture for other image processing algorithms designed for matrix-valued images.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
D. Manocha, M.C. Lin, N. Govindaraju. GPGPU to Many-Core Processing: Higher Performance for Mass Market Applications. Manycore Computing Workshop, 2007.
|
| |
2
|
NVIDIA Corporation. NVIDIA CUDA Programming Guide, version 1.1, 2007.
|
| |
3
|
AMD Stream Processor. http://ati.amd.com/products/streamprocessor/index.html.
|
| |
4
|
M. Segal and K. Akeley. The OpenGL Graphics System: A Specification (Version 2.0). Silicon Graphics, Inc., October 2004.
|
| |
5
|
DirectX Developer Center. http://www.msdn.com/directx/.
|
| |
6
|
Cg. http://developer.nvidia.com/page/cg main.html.
|
 |
7
|
|
| |
8
|
I. Buck. Brook Specification v0.2, October 2003.
|
| |
9
|
P.J. Basser, J. Mattiello, and D. LeBihan, "MR diffusion tensor spectroscopy and imaging," Biophysical Journal, vol. 66, no. 1, pp. 259--267, 1994.
|
| |
10
|
D. Le Bihan, J.-F. Mangin, C. Poupon, et al., "Diffusion tensor imaging: concepts and applications," Journal of Magnetic Resonance Imaging, vol. 13, no. 4, pp. 534--546, 2001.
|
| |
11
|
C.-F. Westin, S.E. Maier, H. Mamata, A. Nabavi, F.A. Jolesz, and R. Kikinis, "Processing and visualization for diffusion tensor MRI," Medical Image Analysis, vol. 6, no. 2, pp. 93--108, 2002.
|
| |
12
|
S. Mori and P.B. Barker, "Diffusion magnetic resonance imaging: its principle and applications," The Anatomical Record vol. 257, no. 3, pp. 102--109, 1999.
|
| |
13
|
S. Mori and P.C.M. van Zijl, "Fiber tracking: principles and strategies'a technical review," NMR in Biomedicine, vol. 15, no. 7-8, pp. 468--480, 2002.
|
| |
14
|
R. Bammer, "Basic principles of diffusion-weighted imaging," European Journal of Radiology, vol. 45, no. 3, pp. 169--184, 2003.
|
| |
15
|
O. Christiansen, T.M. Lee, J. Lie, U. Sinha, and T.F. Chan, "Total Variation Regularization of Matrix-Valued Images," International Journal of Biomedical Imaging, vol. 2007, Article ID 27432, 11 pages, 2007.
|
| |
16
|
M. Lysaker, S. Osher, and X.-C. Tai, "Noise removal using smoothed normals and surface fitting," IEEE Transactions on Image Processing, vol. 13, no. 10, pp. 1345--1357, 2004.
|
| |
17
|
T.F. Chan and S. Esedoglu, "Aspects of total variation regularized L1 function approximation," SIAM Journal on Applied Mathematics, vol. 65, no. 10, pp. 1345--1357, 2005.
|
| |
18
|
J. Weickert and T. Brox, "Diffusion and regularization of vector- and matrix-valued images," Tech. Rep. preprint no. 58, Fachrichtung 6.1 Mathematik, Universitat des Saarlandes, Saarbrucken, Germany, 2002.
|
| |
19
|
|
| |
20
|
S. Ryoo, C. Rodrigues, S. Stone, S. Baghsorkhi, S. Ueng, and W. Hwu. Program optimization study on a 128-core GPU. First Workshop on General Purpose Processing on Graphics Processing Units (GPGPU), 2007.
|
 |
21
|
Samuel S. Stone , Justin P. Haldar , Stephanie C. Tsao , Wen-mei W. Hwu , Zhi-Pei Liang , Bradley P. Sutton, Accelerating advanced mri reconstructions on gpus, Proceedings of the 5th conference on Computing frontiers, May 05-07, 2008, Ischia, Italy
[doi> 10.1145/1366230.1366276]
|
| |
22
|
Shuai Che , Michael Boyer , Jiayuan Meng , David Tarjan , Jeremy W. Sheaffer , Kevin Skadron, A performance study of general-purpose applications on graphics processors using CUDA, Journal of Parallel and Distributed Computing, v.68 n.10, p.1370-1380, October, 2008
[doi> 10.1016/j.jpdc.2008.05.014]
|
| |
23
|
J. Owens, D. Luebke, N. Govindaraju, M. Harris, J. Kruger, A. Lefohn, and T. Purcell. A survey of general-purpose computation on graphics hardware. Computer Graphics Forum, 26(1):80--113, March 2007.
|
 |
24
|
Brian Cabral , Nancy Cam , Jim Foran, Accelerated volume rendering and tomographic reconstruction using texture mapping hardware, Proceedings of the 1994 symposium on Volume visualization, p.91-98, October 17-18, 1994, Tysons Corner, Virginia, United States
[doi> 10.1145/197938.197972]
|
| |
25
|
Pock, T.; Unger, M.; Cremers, D.; Bischof, H., "Fast and exact solution of Total Variation models on the GPU," Computer Vision and Pattern Recognition Workshops, 2008. CVPRW '08. IEEE Computer Society Conference on , vol., no., pp.1--8, 23-28 June 2008.
|
| |
26
|
T. Pock, M. Grabner, and H. Bischof. "Real-time Computation of Variational Methods on Graphics Hardware," Computer Vision Winter Workshop 2007, Michael Grabner, Helmut Grabner, St. Lambrecht, Austria, February 6-8.
|
|