| Surrogate subsets: a free space management strategy for the index of a text retrieval system |
| Full text |
Pdf
(1.44 MB)
|
| Source
|
Annual ACM Conference on Research and Development in Information Retrieval
archive
Proceedings of the 13th annual international ACM SIGIR conference on Research and development in information retrieval
table of contents
Brussels, Belgium
Pages: 211 - 226
Year of Publication: 1989
ISBN:0-89791-408-2
|
|
Author
|
|
F. J. Burkowski
|
Department of Computer Science, University of Waterloo, Waterloo, Ontario, Canada
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 9, Downloads (12 Months): 24, Citation Count: 2
|
|
|
ABSTRACT
This paper presents a new data structure and an associated strategy to be utilized by indexing facilities for text retrieval systems. The paper starts by reviewing some of the goals that may be considered when designing such an index and continues with a small survey of various current strategies. It then presents an indexing strategy referred to as surrogate subsets discussing its appropriateness in the light of the specified goals. Various design issues and implementation details are discussed. Our strategy requires that a surrogate file be divided into a large number of subsets separated by free space which will allow the index to expand when new material is appended to the database. Experimental results report on the utilization of free space when the database is enlarged.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
BAY72
|
BAYER, R. AND MCCREIGHT, E., Organization and maintenance of large ordered indexes, Acta Informatica, vol. 1, no. 3, 1972, pp. 173-189. ~
|
| |
BER87
|
|
| |
CHR84
|
CHRISTODOULAKIS, S. AND FALOUTSOS, C., Design considerations for a message file server, IEEE Trans. Sofn#,are Engineering, Vol. SE-10, No. 2, Mar. 1984, pp. 201-210.
|
| |
CHR86
|
|
| |
DEF88
|
DEFAZIO, S. AND GREENWALD, C., The Mead information retrieval system, IEEE Compcon 88, Feb. 1988, pp. 431.
|
| |
DEF89
|
DEFAZIO, S., Private communication.
|
 |
FAL84
|
|
 |
FAL85
|
|
| |
FAL87D
|
FALOUTSOS, C. AND CHAN, R., Fast text access methods for optical disks- designs and performance comparison, UMIACS-TR-87-66, CS-TR- 1958, Dept. of Comp. Sci. and Inst. for Adv. Comp. Studies, Univ. of Maryland, Dec. 1987, 29 pages.
|
 |
FAL87S
|
|
| |
HAS81
|
HASKIN, R. L., Special purpose processors for text retrieval, Database Engineering, Vol. 4, No. 1, Sept. 1981, pp. 16-29.
|
 |
LAR83
|
|
| |
ROB79
|
ROBERTS, C. S., Partial-match retrieval via the method of superimposed codes, Proc. IEEE, 67,12, Dec. 1979, 1624-1642.
|
 |
SAL86
|
|
 |
STA86
|
|
| |
TEO82
|
|
 |
TSI83
|
|
CITED BY 2
|
|
Peter G. Anick , Rex A. Flynn , David R. Hanssen, Addressing the requirements of a dynamic corporate textual information base, Proceedings of the 14th annual international ACM SIGIR conference on Research and development in information retrieval, p.163-172, October 13-16, 1991, Chicago, Illinois, United States
|
|
|
|
|