ACM Home Page
Please provide us with feedback. Feedback
Property testing of data dimensionality
Full text PdfPdf (1.12 MB)
Source Symposium on Discrete Algorithms archive
Proceedings of the fourteenth annual ACM-SIAM symposium on Discrete algorithms table of contents
Baltimore, Maryland
SESSION: Session 1A table of contents
Pages: 18 - 27  
Year of Publication: 2003
ISBN:0-89871-538-5
Authors
Robert Krauthgamer  International Computer Science Institute, Berkeley, CA and University of California, Berkeley, CA
Ori Sasson  The Hebrew University of Jerusalem, Jerusalem, Israel
Sponsors
: SIAM Activity Group on Discrete Mathematics
SIGACT: ACM Special Interest Group on Algorithms and Computation Theory
Publisher
Society for Industrial and Applied Mathematics  Philadelphia, PA, USA
Bibliometrics
Downloads (6 Weeks): 7,   Downloads (12 Months): 32,   Citation Count: 2
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  

ABSTRACT

Data dimensionality is a crucial issue in a variety of settings, where it is desirable to determine whether a data set given in a high-dimensional space adheres to a low-dimensional structure. We study this problem in the framework of property testing: Given a query access to a data set S, we wish to determine whether S is low-dimensional, or whether it should be modified significantly in order to have the property. Allowing a constant probability of error, we aim at algorithms whose complexity does not depend on the size of S.We present algorithms for testing the low-dimensionality of a set of vectors and for testing whether a matrix is of low rank. We then address low-dimensionality in metric spaces. For vectors in the metric space l1, we show that low-dimensionality is not testable. For l2, we show that a data set can be tested for having a low-dimensional structure, but that the property of approximately having such a structure is not testable.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
2
{BCL98} H. J. Bandelt, V. Chepoi, and M. Laurent. Embedding into rectilinear spaces. Discrete Comput. Geom., 19(4):595--604, 1998.
 
3
 
4
{DDL+90} S. C. Deerwester, S. T. Dumais, T. K. Landauer, G. W. Furnas, and R. A. Harshman. Indexing by latent semantic analysis. Journal of the American Society of Information Science, 41(6):391--407, 1990.
 
5
 
6
{DL97} M. Deza and M. Laurent. Geometry of cuts and metrics. Springer-Verlag, Berlin, 1997.
 
7
 
8
 
9
 
10
 
11
{Fis01} E. Fischer. The art of uninformed decisions: A primer to property testing. Bulletin of the European Association for Theoretical Computer Science. EATCS, (75):97--126, 2001.
12
13
 
14
{HH78} F. Hadlock and F. Hoffman. Manhattan trees. Utilitas Math., 13:55--67, 1978.
15
 
16
 
17
{JL84} W. B. Johnson and J. Lindenstrauss. Extensions of Lipschitz mappings into a Hilbert space. In Conference in modern analysis and probability (New Haven, Conn., 1982), pages 189--206. Amer. Math. Soc., Providence, RI, 1984.
 
18
 
19
{Men28} K. Menger. Untersuchungen uber allgemeine metrik. Mathematische Annalen, 100:75--163, 1928.
 
20
 
21
 
22
{Oxl92} J. G. Oxley. Matroid theory. Oxford Science Publications. The Clarendon Press Oxford University Press, New York, 1992.
23
 
24
{Ron01} D. Ron. Property testing. In P. Pardalos, S. Rajasekaran, J. Reif, and J. Rolim, editors, Handbook of Randomized Computing. Kluwer Academic, 2001.
 
25
{RR98} Y. Rabinovich and R. Raz. Lower bounds on the distortion of embedding finite metric spaces in graphs. Discrete Comput. Geom., 19(1):79--94, 1998.
 
26


Collaborative Colleagues:
Robert Krauthgamer: colleagues
Ori Sasson: colleagues