ACM Home Page
Please provide us with feedback. Feedback
Dimensionality reduction using magnitude and shape approximations
Full text PdfPdf (193 KB)
Source Conference on Information and Knowledge Management archive
Proceedings of the twelfth international conference on Information and knowledge management table of contents
New Orleans, LA, USA
SESSION: Database session 2: querying high-dimensional data II table of contents
Pages: 99 - 107  
Year of Publication: 2003
ISBN:1-58113-723-0
Authors
Ümit Y. Ogras  The Ohio State University
Hakan Ferhatosmanoglu  The Ohio State University
Sponsors
ACM: Association for Computing Machinery
SIGMIS: ACM Special Interest Group on Management Information Systems
SIGIR: ACM Special Interest Group on Information Retrieval
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 7,   Downloads (12 Months): 29,   Citation Count: 3
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/956863.956883
What is a DOI?

ABSTRACT

High dimensional data sets are encountered in many modern database applications. The usual approach is to construct a summary of the data set through a lossy compression technique, and use this lower dimensional synopsis to provide fast, approximate answers to the queries. In this paper, we develop a novel dimensionality reduction technique based on partitioning the high dimensional vector space into orthogonal subspaces. First, we find a relation between the Euclidian distance of two n-dimensional vectors and the Euclidian distances of their projections on the orthogonal subspaces. Then, based on this relation we develop a method to approximate the Euclidian distance using novel inner product approximation. This process allows us to incorporate the shape information of the vectors to this approximation. While the inner product approximation is symmetric, i.e., captures only the magnitude information of the data, the proposed method takes both the magnitude and shape information of the original vectors into account through partitioning. In the experiments, we demonstrate the effectiveness of our technique by comparing it with commonly used methods.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
2
 
3
A. Baruffolo. R-trees for astronomical data indexing. ASP Conf. Ser., Astronomical Data Analysis Software and Systems VIII , 172:375, 1999.
4
 
5
6
7
8
 
9
 
10
 
11
S. Deerwester, S. T. Dumais, G. W. Furnas, T. K. Launder, and R. Harshman. Indexing by latent semantic analysis. Journal of the American Society for Information Science, 41:391--407, 1990.
 
12
 
13
S. T. Dumais. Improving the retrieval of information from external sources. Behavior Research Methods, Instruments and Computers, 23:229--236, 1991.
 
14
15
 
16
 
17
18
 
19
 
20
21
 
22
 
23
Informix. http://www.ibm.com/software/data/informix/blades/spatial/rtree.html, 2002.
 
24
 
25
T. Kailath. Modern Signal Processing. Springer Verlag, 1985.
26
27
 
28
H. Karhunen. Uber lineare methoden in der wahrscheinlich-keitsrechnung. Ann. Acad. Science Fenn , 1947.
29
30
 
31
 
32
 
33
M. Loeve. Fonctions aleatoires de seconde ordre. Processus Stochastiques et Mouvement Brownien , 1948.
 
34
W. Niblack, R. Barber, W. Equitz, M. Flickner, E. Glasman, D. Petkovic, and P. Yanker. The QBIC project: Querying images by content using color, texture and shape. In Proc. of the SPIE Conf. 1908 on Storage and Retrieval for Image and Video Databases , volume 1908, pages 173--187, February 1993.
 
35
 
36
 
37
38
 
39
40
 
41
 
42
A. J. Wicenec and M. Albrecht. Methods for structuring and searching very large catalogs. ASP Conf. Ser., Astronomical Data Analysis Software and Systems VII, 145:512, 1998.
43


Collaborative Colleagues:
Ümit Y. Ogras: colleagues
Hakan Ferhatosmanoglu: colleagues