ABSTRACT
Recent years have witnessed a dramatic increase in the quantity of image data collected, due to advances in fields such as medical imaging, reconnaissance, surveillance, astronomy, and multimedia. With this increase has come the need to store, transmit, and query large volumes of image data efficiently. A common operation on image databases is the retrieval of all images that are similar to a query image. For this, the images in the database are often represented as vectors in a high-dimensional space, and a query is answered by retrieving all image vectors that are close to the query image in this space under a suitable similarity metric. To overcome problems associated with high dimensionality, such as high storage costs and long retrieval times, a dimension reduction step is usually applied to the vectors to concentrate the relevant information in a small number of dimensions. Principal Component Analysis (PCA) is a well-known dimension reduction scheme. However, since it works with vectorized representations of images, PCA does not take into account the spatial locality of pixels in images. In this paper, a new dimension reduction scheme, called Generalized Principal Component Analysis (GPCA), is presented. This scheme works directly with images in their native state, as two-dimensional matrices, by projecting the images to a vector space that is the tensor product of two lower-dimensional vector spaces. Experiments on databases of face images show that, for the same amount of storage, GPCA is superior to PCA in terms of quality of the compressed images, query precision, and computational cost.
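To make the tensor-product projection concrete, the following is a minimal sketch and not the algorithm of the paper. It assumes the projection takes the bilinear form L^T A R, where A is an r x c image and L and R have orthonormal columns; the abstract does not state this formula. The matrices here are random placeholders rather than learned transformations, and the names `random_orthonormal`, `project`, and `reconstruct` are hypothetical.

```python
import numpy as np


def random_orthonormal(n, k, rng):
    """Return an n x k matrix with orthonormal columns (placeholder, not learned)."""
    q, _ = np.linalg.qr(rng.standard_normal((n, k)))
    return q


def project(image, left, right):
    """Reduce an r x c image to an l1 x l2 matrix via the assumed form L^T A R."""
    return left.T @ image @ right


def reconstruct(reduced, left, right):
    """Map the reduced matrix back to image size: L D R^T."""
    return left @ reduced @ right.T


rng = np.random.default_rng(0)
r, c, l1, l2 = 32, 24, 8, 6            # illustrative sizes only
image = rng.standard_normal((r, c))    # stand-in for a real image
left = random_orthonormal(r, l1, rng)
right = random_orthonormal(c, l2, rng)

reduced = project(image, left, right)
approx = reconstruct(reduced, left, right)

# The same reduction written on the vectorized image: the basis is the
# Kronecker (tensor) product of the two small bases, using column-major vec.
kron_basis = np.kron(right, left)                      # (r*c) x (l1*l2)
reduced_vec = kron_basis.T @ image.flatten(order="F")
```

In this sketch the two small matrices hold r*l1 + c*l2 numbers, whereas an explicit basis of the same reduced dimension on vectorized images holds r*c*l1*l2, which is one way to see why working on matrices directly can save storage and computation.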