|
ABSTRACT
During the last decade, multimedia databases have become increasingly important in many application areas such as medicine, CAD, geography, and molecular biology. An important research issue in the field of multimedia databases is the content-based retrieval of similar multimedia objects such as images, text, and videos. However, in contrast to searching data in a relational database, a content-based retrieval requires the search of similar objects as a basic functionality of the database system. Most of the approaches addressing similarity search use a so-called feature transformation that transforms important properties of the multimedia objects into high-dimensional points (feature vectors). Thus, the similarity search is transformed into a search of points in the feature space that are close to a given query point in the high-dimensional feature space. Query processing in high-dimensional spaces has therefore been a very active research area over the last few years. A number of new index structures and algorithms have been proposed. It has been shown that the new index structures considerably improve the performance in querying large multimedia databases. Based on recent tutorials [Berchtold and Keim 1998], in this survey we provide an overview of the current state of the art in querying multimedia databases, describing the index structures and algorithms for an efficient query processing in high-dimensional spaces. We identify the problems of processing queries in high-dimensional space, and we provide an overview of the proposed approaches to overcome these problems.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
ABEL, D. AND SMITH, J. 1983. A data structure and algorithm based on a linear key for a rectangle retrieval problem. Comput. Vis. 24, 1-13.
|
| |
2
|
|
 |
3
|
Rakesh Agrawal , Johannes Gehrke , Dimitrios Gunopulos , Prabhakar Raghavan, Automatic subspace clustering of high dimensional data for data mining applications, Proceedings of the 1998 ACM SIGMOD international conference on Management of data, p.94-105, June 01-04, 1998, Seattle, Washington, United States
|
| |
4
|
Rakesh Agrawal , King-Ip Lin , Harpreet S. Sawhney , Kyuseok Shim, Fast Similarity Search in the Presence of Noise, Scaling, and Translation in Time-Series Databases, Proceedings of the 21th International Conference on Very Large Data Bases, p.490-501, September 11-15, 1995
|
| |
5
|
ALTSCHUL, S., GISH, W., MILLER, W., MYERS, E., AND LIPMAN, D. 1990. A basic local alignment search tool. J. Molecular Biol. 215, 3, 403-410.
|
| |
6
|
|
| |
7
|
|
| |
8
|
|
 |
9
|
Sunil Arya , David M. Mount , Onuttom Narayan, Accounting for boundary effects in nearest neighbor searching, Proceedings of the eleventh annual symposium on Computational geometry, p.336-344, June 05-07, 1995, Vancouver, British Columbia, Canada
[doi> 10.1145/220279.220315]
|
| |
10
|
|
| |
11
|
BAYER,R.AND MCCREIGHT, E. 1977. Organization and maintenance of large ordered indices. Acta Inf. 1, 3, 173-189.
|
 |
12
|
Norbert Beckmann , Hans-Peter Kriegel , Ralf Schneider , Bernhard Seeger, The R*-tree: an efficient and robust access method for points and rectangles, Proceedings of the 1990 ACM SIGMOD international conference on Management of data, p.322-331, May 23-26, 1990, Atlantic City, New Jersey, United States
|
| |
13
|
|
 |
14
|
|
| |
15
|
BENTLEY, J. 1979. Multidimensional binary search in database applications. IEEE Trans. Softw. Eng. 4, 5, 397-409.
|
 |
16
|
|
 |
17
|
Stefan Berchtold , Christian Böhm , Bernhard Braunmüller , Daniel A. Keim , Hans-Peter Kriegel, Fast parallel similarity search in multimedia databases, Proceedings of the 1997 ACM SIGMOD international conference on Management of data, p.1-12, May 11-15, 1997, Tucson, Arizona, United States
|
| |
18
|
BERCHTOLD, S., B~OHM, C., JAGADISH, H., KRIEGEL, H.-P., AND SANDER, J. 2000a. Independent quantization: An index compression technique for highdimensional data spaces. In Proc. 16th Int. Conf. on Data Engineering.
|
 |
19
|
Stefan Berchtold , Christian Böhm , Daniel A. Keim , Hans-Peter Kriegel, A cost model for nearest neighbor search in high-dimensional data space, Proceedings of the sixteenth ACM SIGACT-SIGMOD-SIGART symposium on Principles of database systems, p.78-86, May 11-15, 1997, Tucson, Arizona, United States
[doi> 10.1145/263661.263671]
|
| |
20
|
|
| |
21
|
|
| |
22
|
|
 |
23
|
Stefan Berchtold , Christian Böhm , Hans-Peter Kriegal, The pyramid-technique: towards breaking the curse of dimensionality, Proceedings of the 1998 ACM SIGMOD international conference on Management of data, p.142-153, June 01-04, 1998, Seattle, Washington, United States
|
| |
24
|
|
| |
25
|
BERCHTOLD, S., JAGADISH, H., AND ROSS, K. 1998d. Independence diagrams: A technique for visual data mining. In Proc. 4th Int. Conf. on Knowledge Discovery and Data Mining (New York), 139- 143.
|
| |
26
|
|
| |
27
|
|
| |
28
|
|
| |
29
|
B~OHM, C. 1998. Efficiently indexing highdimensional databases. PhD thesis, University of Munich, Germany.
|
 |
30
|
|
 |
31
|
|
| |
32
|
|
 |
33
|
|
 |
34
|
|
| |
35
|
|
| |
36
|
|
 |
37
|
Paolo Ciaccia , Marco Patella , Pavel Zezula, A cost model for similarity queries in metric spaces, Proceedings of the seventeenth ACM SIGACT-SIGMOD-SIGART symposium on Principles of database systems, p.59-68, June 01-04, 1998, Seattle, Washington, United States
[doi> 10.1145/275487.275495]
|
 |
38
|
|
 |
39
|
|
 |
40
|
Antonio Corral , Yannis Manolopoulos , Yannis Theodoridis , Michael Vassilakopoulos, Closest pair queries in spatial databases, Proceedings of the 2000 ACM SIGMOD international conference on Management of data, p.189-200, May 15-18, 2000, Dallas, Texas, United States
|
| |
41
|
EASTMAN, C. 1981. Optimal bucket size for nearest neighbor searching in kd-trees. Inf. Proc. Lett. 12,4.
|
| |
42
|
EVANGELIDIS, G. 1994. The hB -tree: A concurrent and recoverable multi-attribute index structure. PhD thesis, Northeastern University, Boston, MA.
|
| |
43
|
|
 |
44
|
|
| |
45
|
|
| |
46
|
|
 |
47
|
|
 |
48
|
Christos Faloutsos , King-Ip Lin, FastMap: a fast algorithm for indexing, data-mining and visualization of traditional and multimedia datasets, Proceedings of the 1995 ACM SIGMOD international conference on Management of data, p.163-174, May 22-25, 1995, San Jose, California, United States
|
 |
49
|
|
| |
50
|
C. Faloutsos , R. Barber , M. Flickner , J. Hafner , W. Niblack , D. Petkovic , W. Equitz, Efficient and effective querying by image content, Journal of Intelligent Information Systems, v.3 n.3-4, p.231-262, July 1994
[doi> 10.1007/BF00962238]
|
 |
51
|
Christos Faloutsos , M. Ranganathan , Yannis Manolopoulos, Fast subsequence matching in time-series databases, Proceedings of the 1994 ACM SIGMOD international conference on Management of data, p.419-429, May 24-27, 1994, Minneapolis, Minnesota, United States
|
 |
52
|
Christos Faloutsos , Timos Sellis , Nick Roussopoulos, Analysis of object oriented spatial access methods, Proceedings of the 1987 ACM SIGMOD international conference on Management of data, p.426-439, May 27-29, 1987, San Francisco, California, United States
|
| |
53
|
FINKEL,R.AND BENTLEY, J. 1974. Quad trees: A data structure for retrieval of composite trees. Acta Inf. 4, 1, 1-9.
|
 |
54
|
|
 |
55
|
|
| |
56
|
|
 |
57
|
|
| |
58
|
|
| |
59
|
|
 |
60
|
|
 |
61
|
Joseph M. Hellerstein , Elias Koutsoupias , Christos H. Papadimitriou, On the analysis of indexing schemes, Proceedings of the sixteenth ACM SIGACT-SIGMOD-SIGART symposium on Principles of database systems, p.249-256, May 11-15, 1997, Tucson, Arizona, United States
[doi> 10.1145/263661.263688]
|
| |
62
|
|
| |
63
|
HENRICH, A. 1994. A distance-scan algorithm for spatial access strucures. In Proc. 2nd ACM Workshop on Advances in Geographic Information Systems (Gaithersburg, MD), 136-143.
|
| |
64
|
|
| |
65
|
|
| |
66
|
HINNEBURG,A.AND KEIM, D. 1998. An efficient approach to clustering in large multimedia databases with noise. In Proc. Int. Conf. on Knowledge Discovery in Databases (New York).
|
| |
67
|
|
| |
68
|
|
 |
69
|
|
| |
70
|
|
 |
71
|
Andreas Hutflesz , Hans-Werner Six , Peter Widmayer, Twin grid files: space optimizing access schemes, Proceedings of the 1988 ACM SIGMOD international conference on Management of data, p.183-190, June 01-03, 1988, Chicago, Illinois, United States
|
 |
72
|
|
 |
73
|
|
| |
74
|
JAIN,R.AND WHITE, D. 1996. Similarity indexing: Algorithms and performance. In Proc. SPIE Storage and Retrieval for Image and Video Databases IV (San Jose, CA), 62-75.
|
 |
75
|
|
 |
76
|
|
| |
77
|
|
 |
78
|
|
| |
79
|
|
 |
80
|
|
| |
81
|
|
| |
82
|
|
| |
83
|
|
| |
84
|
|
| |
85
|
|
| |
86
|
KRISHNAMURTHY,R.AND WHANG, K.-Y. 1985. Multilevel Grid Files. IBM Research Center Report, Yorktown Heights, NY.
|
 |
87
|
|
| |
88
|
|
| |
89
|
|
 |
90
|
|
| |
91
|
MANDELBROT, B. 1977. Fractal Geometry of Nature. W.H. Freeman, New York.
|
| |
92
|
|
| |
93
|
|
| |
94
|
MORTON, G. 1966. A Computer Oriented Geodetic Data Base and a New Technique in File Sequencing. IBM Ltd., USA.
|
| |
95
|
MUMFORD, D. 1987. The problem of robust shape descriptors. In Proc. 1st IEEE Int. Conf. on Computer Vision.
|
 |
96
|
|
 |
97
|
|
 |
98
|
|
| |
99
|
|
 |
100
|
|
| |
101
|
|
 |
102
|
Bernd-Uwe Pagel , Hans-Werner Six , Heinrich Toben , Peter Widmayer, Towards an analysis of range query performance in spatial data structures, Proceedings of the twelfth ACM SIGACT-SIGMOD-SIGART symposium on Principles of database systems, p.214-221, May 25-28, 1993, Washington, D.C., United States
[doi> 10.1145/153850.153878]
|
| |
103
|
|
| |
104
|
|
 |
105
|
|
| |
106
|
|
 |
107
|
|
 |
108
|
Nick Roussopoulos , Stephen Kelley , Frédéric Vincent, Nearest neighbor queries, Proceedings of the 1995 ACM SIGMOD international conference on Management of data, p.71-79, May 22-25, 1995, San Jose, California, United States
|
| |
109
|
SAGAN, H. 1994. Space Filling Curves. Springer- Verlag, New York.
|
| |
110
|
SCHR~ODER, M. 1991. Fractals, Chaos, Power Laws: Minutes from an Infinite Paradise. W.H. Freeman, New York.
|
| |
111
|
|
| |
112
|
SEIDL, T. 1997. Adaptable similarity search in 3-d spatial database systems. PhD thesis, University of Munich, Germany.
|
| |
113
|
|
| |
114
|
|
| |
115
|
SHAWNEY,H.AND HAFNER, J. 1994. Efficient color histogram indexing. In Proc. Int. Conf. on Image Processing, 66-70.
|
| |
116
|
|
| |
117
|
SPROULL, R. 1991. Refinements to nearest neighbor searching in k-dimensional trees. Algorithmica, 579-589.
|
| |
118
|
STANOI, I., AGRAWAL,D.,AND ABBADI, A. 2000. Reverse nearest neighbor queries for dynamic databases. In Proc. ACM SIGMOD Workshop on Research Issues in Data Mining and Knowledge Discovery, 44-53.
|
| |
119
|
STONEBRAKER, M., SELLIS,T.,AND HANSON, E. 1986. An analysis of rule indexing implementations in data base systems. In Proc. Int. Conf. on Expert Database Systems.
|
 |
120
|
|
| |
121
|
UHLMANN, J. 1991. Satisfying general proximity/ similarity queries with metric trees. Inf. Proc. Lett. 145-157.
|
| |
122
|
|
| |
123
|
WALLACE,T.AND WINTZ, P. 1980. An efficient threedimensional aircraft recognition algorithm using normalized Fourier descriptors. Comput. Graph. Image Proc. 13, 99-126.
|
| |
124
|
|
| |
125
|
|
 |
126
|
|
| |
127
|
|
| |
128
|
YIANILOS, P. 1999. Excluded middle vantage point forests for nearest neighbor search. In Proc. DIMACS Implementation Challenge (Baltimore, MD).
|
CITED BY 105
|
|
|
|
|
|
|
|
|
|
|
Ling-Yu Duan , Jun-Song Yuan , Qi Tian , Chang-Sheng Xu, Fast and robust video clip search using index structure, Proceedings of the 12th annual ACM international conference on Multimedia, October 10-16, 2004, New York, NY, USA
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Hubert Ka Yau Leung , Ioana Burcea , Hans-Amo Jacobsen, Modeling location-based services with subject spaces, Proceedings of the 2003 conference of the Centre for Advanced Studies on Collaborative research, p.171-181, October 06-09, 2003, Toronto, Ontario, Canada
|
|
|
|
|
|
Jieping Ye , Qi Li , Hui Xiong , Haesun Park , Ravi Janardan , Vipin Kumar, IDR/QR: an incremental dimension reduction algorithm via QR decomposition, Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining, August 22-25, 2004, Seattle, WA, USA
|
|
|
|
|
|
|
|
|
|
|
|
Junsong Yuan , Ling-Yu Duan , Qi Tian , Changsheng Xu, Fast and robust short video clip search using an index structure, Proceedings of the 6th ACM SIGMM international workshop on Multimedia information retrieval, October 15-16, 2004, New York, NY, USA
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Bin Cui , Heng Tao Shen , Jialie Shen , Kian-Lee Tan, Exploring bit-difference for approximate KNN search in high-dimensional databases, Proceedings of the sixteenth Australasian database conference, p.165-174, January 01, 2005, Newcastle, Australia
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Jieping Ye , Qi Li , Hui Xiong , Haesun Park , Ravi Janardan , Vipin Kumar, IDR/QR: An Incremental Dimension Reduction Algorithm via QR Decomposition, IEEE Transactions on Knowledge and Data Engineering, v.17 n.9, p.1208-1222, September 2005
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
B. Vassiliadis , A. Stefani , L. Drossos , K. Ioannou, Knowledge discovery in multimedia repositories: the role of metadata, Proceedings of the 7th WSEAS International Conference on Mathematical Methods and Computational Techniques In Electrical Engineering, p.330-335, October 27-29, 2005, Sofia, Bulgaria
|
|
|
Fabrizio Falchi , Claudio Lucchese , Salvatore Orlando , Raffaele Perego , Fausto Rabitti, Caching content-based queries for robust and efficient image retrieval, Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology, March 24-26, 2009, Saint Petersburg, Russia
|
|
|
|
|
|
Paolo Bolettieri , Fabrizio Falchi , Claudio Gennaro , Fausto Rabitti, Automatic metadata extraction and indexing for reusing e-learning multimedia objects, Workshop on multimedia information retrieval on The many faces of multimedia semantics, September 28-28, 2007, Augsburg, Bavaria, Germany
|
|
|
Qin Lv , William Josephson , Zhe Wang , Moses Charikar , Kai Li, Efficient filtering with sketches in the ferret toolkit, Proceedings of the 8th ACM international workshop on Multimedia information retrieval, October 26-27, 2006, Santa Barbara, California, USA
|
|
|
Bin Cui , Jialie Shen , Gao Cong , Heng Tao Shen , Cui Yu, Exploring composite acoustic features for efficient music similarity query, Proceedings of the 14th annual ACM international conference on Multimedia, October 23-27, 2006, Santa Barbara, CA, USA
|
|
|
Yi Zhuang , Yueting Zhuang , Qing Li , Lei Chen, Towards interactive indexing for large Chinese calligraphic character databases, Proceedings of the 15th ACM international conference on Information and knowledge management, November 06-11, 2006, Arlington, Virginia, USA
|
|
|
|
|
|
Biswanath Panda , Mirek Riedewald , Stephen B. Pope , Johannes Gehrke , L. Paul Chew, Indexing for function approximation, Proceedings of the 32nd international conference on Very large data bases, September 12-15, 2006, Seoul, Korea
|
|
|
Xiangmin Zhou , Xiaofang Zhou , Heng Tao Shen, Efficient similarity search by summarization in large video database, Proceedings of the eighteenth conference on Australasian database, p.161-167, January 30-February 02, 2007, Ballarat, Victoria, Australia
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Bin Cui , Ling Liu , Calton Pu , Jialie Shen , Kian-Lee Tan, QueST: querying music databases by acoustic and textual features, Proceedings of the 15th international conference on Multimedia, September 25-29, 2007, Augsburg, Germany
|
|
|
|
|
|
|
|
|
|
|
|
Guoren Wang , Xiangmin Zhou , Bin Wang , Baiyou Qiao , Donghong Han, A hyperplane based indexing technique for high-dimensional data, Information Sciences: an International Journal, v.177 n.11, p.2255-2268, June, 2007
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Vassilis Athitsos , Panagiotis Papapetrou , Michalis Potamias , George Kollios , Dimitrios Gunopulos, Approximate embedding-based subsequence matching of time series, Proceedings of the 2008 ACM SIGMOD international conference on Management of data, June 09-12, 2008, Vancouver, Canada
|
|
|
|
|
|
Ritendra Datta , Dhiraj Joshi , Jia Li , James Z. Wang, Image retrieval: Ideas, influences, and trends of the new age, ACM Computing Surveys (CSUR), v.40 n.2, p.1-60, April 2008
|
|
|
Zi Huang , Hengtao Shen , Xiaofang Zhou , Dawei Song , Stefan Rüger, Dimensionality reduction for dimension-specific search, Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval, July 23-27, 2007, Amsterdam, The Netherlands
|
|
|
|
|
|
|
|
|
|
|
|
Yi Zhuang , Yueting Zhuang , Qing Li , Lei Chen , Yi Yu, Indexing high-dimensional data in dual distance spaces: a symmetrical encoding approach, Proceedings of the 11th international conference on Extending database technology: Advances in database technology, March 25-29, 2008, Nantes, France
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Bart Thomee , Mark J. Huiskes , Erwin Bakker , Michael S. Lew, Large scale image copy detection evaluation, Proceeding of the 1st ACM international conference on Multimedia information retrieval, October 30-31, 2008, Vancouver, British Columbia, Canada
|
|
|
Fabrizio Falchi , Claudio Lucchese , Salvatore Orlando , Raffaele Perego , Fausto Rabitti, A metric cache for similarity search, Proceeding of the 2008 ACM workshop on Large-Scale distributed systems for information retrieval, October 30-30, 2008, Napa Valley, California, USA
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Zi Huang , Heng Tao Shen , Jie Shao , Stefan Rüger , Xiaofang Zhou, Locality condensation: a new dimensionality reduction method for image retrieval, Proceeding of the 16th ACM international conference on Multimedia, October 26-31, 2008, Vancouver, British Columbia, Canada
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|