|
ABSTRACT
Data dimensionality is a crucial issue in a variety of settings, where it is desirable to determine whether a data set given in a high-dimensional space adheres to a low-dimensional structure. We study this problem in the framework of property testing: Given a query access to a data set S, we wish to determine whether S is low-dimensional, or whether it should be modified significantly in order to have the property. Allowing a constant probability of error, we aim at algorithms whose complexity does not depend on the size of S.We present algorithms for testing the low-dimensionality of a set of vectors and for testing whether a matrix is of low rank. We then address low-dimensionality in metric spaces. For vectors in the metric space l1, we show that low-dimensionality is not testable. For l2, we show that a data set can be tested for having a low-dimensional structure, but that the property of approximately having such a structure is not testable.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
|
| |
2
|
{BCL98} H. J. Bandelt, V. Chepoi, and M. Laurent. Embedding into rectilinear spaces. Discrete Comput. Geom., 19(4):595--604, 1998.
|
| |
3
|
|
| |
4
|
{DDL+90} S. C. Deerwester, S. T. Dumais, T. K. Landauer, G. W. Furnas, and R. A. Harshman. Indexing by latent semantic analysis. Journal of the American Society of Information Science, 41(6):391--407, 1990.
|
| |
5
|
Yevgeniy Dodis , Oded Goldreich , Eric Lehman , Sofya Raskhodnikova , Dana Ron , Alex Samorodnitsky, Improved Testing Algorithms for Monotonicity, Proceedings of the Third International Workshop on Approximation Algorithms for Combinatorial Optimization Problems: Randomization, Approximation, and Combinatorial Algorithms and Techniques, p.97-108, August 08-11, 1999
|
| |
6
|
{DL97} M. Deza and M. Laurent. Geometry of cuts and metrics. Springer-Verlag, Berlin, 1997.
|
| |
7
|
Tamal K. Dey , Joachim Giesen , Samrat Goswami , Wulue Zhao, Shape dimension and approximation from samples, Proceedings of the thirteenth annual ACM-SIAM symposium on Discrete algorithms, p.772-780, January 06-08, 2002, San Francisco, California
|
| |
8
|
|
| |
9
|
|
| |
10
|
|
| |
11
|
{Fis01} E. Fischer. The art of uninformed decisions: A primer to property testing. Bulletin of the European Association for Theoretical Computer Science. EATCS, (75):97--126, 2001.
|
 |
12
|
|
 |
13
|
|
| |
14
|
{HH78} F. Hadlock and F. Hoffman. Manhattan trees. Utilitas Math., 13:55--67, 1978.
|
 |
15
|
|
| |
16
|
|
| |
17
|
{JL84} W. B. Johnson and J. Lindenstrauss. Extensions of Lipschitz mappings into a Hilbert space. In Conference in modern analysis and probability (New Haven, Conn., 1982), pages 189--206. Amer. Math. Soc., Providence, RI, 1984.
|
| |
18
|
|
| |
19
|
{Men28} K. Menger. Untersuchungen uber allgemeine metrik. Mathematische Annalen, 100:75--163, 1928.
|
| |
20
|
|
| |
21
|
|
| |
22
|
{Oxl92} J. G. Oxley. Matroid theory. Oxford Science Publications. The Clarendon Press Oxford University Press, New York, 1992.
|
 |
23
|
|
| |
24
|
{Ron01} D. Ron. Property testing. In P. Pardalos, S. Rajasekaran, J. Reif, and J. Rolim, editors, Handbook of Randomized Computing. Kluwer Academic, 2001.
|
| |
25
|
{RR98} Y. Rabinovich and R. Raz. Lower bounds on the distortion of embedding finite metric spaces in graphs. Discrete Comput. Geom., 19(1):79--94, 1998.
|
| |
26
|
|
CITED BY 2
|
|
|
|
|
Andris Ambainis , Kazuo Iwama , Akinori Kawachi , Rudy Raymond , Shigeru Yamashita, Improved algorithms for quantum identification of Boolean oracles, Theoretical Computer Science, v.378 n.1, p.41-53, June, 2007
|
|