ACM Home Page
Please provide us with feedback. Feedback
Hardware for searching very large text databases
Full text PdfPdf (812 KB)
Source Computer Architecture Workshop archive
Proceedings of the fifth workshop on Computer architecture for non-numeric processing table of contents
Pacific Grove, California, United States
Pages: 49 - 56  
Year of Publication: 1980
Also published in ...
Author
Sponsors
SIGARCH: ACM Special Interest Group on Computer Architecture
SIGMOD: ACM Special Interest Group on Management of Data
SIGIR: ACM Special Interest Group on Information Retrieval
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 5,   Downloads (12 Months): 18,   Citation Count: 4
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/800083.802691
What is a DOI?

ABSTRACT

This paper discusses the problem of searching very large text databases. It is shown that conventional techniques for searching current databases cannot be scaled up to larger ones, and that it is necessary to build hardware to search the database in parallel if reasonable search times are expected. The part of the search process requiring the highest bandwidth is scanning the database to detect instances of search terms. Methods of doing this in hardware that have been mentioned in the literature are examined, and design criteria for term matchers are discussed. A new design that uses a nondeterministic finite state automaton to control matching, is introduced, its operation is explained, and the practicality of using it in a real system is discussed.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
McCarn, D. B., 'Online Services of the National Library of Medicine,' Digest of Papers, COMPCON Fall 1978, Washington, D. C., pp. 48-53
 
2
Black, D. V., 'System Development Corporation's Search Service,' Digest of Papers, COMPCON Fall 1978, Washington, D. C., pp. 59-64.
 
3
Bayer, M. P., 'Dialog—An Online Retrieval System for Bibliographic Information,' Digest of Papers, COMPCON Fall 1978, Washington, D. C., pp. 54-58.
 
4
Sprowl, J. A., 'Computer-Assisted Legal Research—An Analysis of Full Text Document Retrieval Systems, Particularly the LEXIS System,' American Bar Foundation Research Journal, Jan. 1976, Vol. 1, No. 1, pp. 175-226.
 
5
Hollaar, L. A., 'Text Retrieval Computers,' Computer, March 1979, Vol. 12, No. 3 (ISSN 0018-9162), pp. 40-50.
6
 
7
Ozkarahan, E. A., Schuster, S. A., and Smith, K. C., 'RAP: an Associative processor for Data Base Management,' Proc. 1975 AFIPS Nat. Comp. Conf., Vol. 44, AFIPS Press, Montvale, N. J., pp. 379-387.
 
8
Su, S., and Lipovski, G. J., 'CASSM: A Cellular System for Very Large Databases,' Proc. Conf. on Very Large Data Bases, Sept. 1975, pp. 456-472.
9
 
10
 
11
Hollaar, L. A., 'An Architecture for the Efficient Combining of Linearly Ordered Lists,' Second Workshop on Comp. Arch. for Non-Numeric Processing, Jan. 1976.
 
12
Hurley, B. J., 'Analysis of Computer Architectures for Information Retrieval,' M. S. Thesis, University of Illinois, Urbana, May 1976.
 
13
14
 
15
Cheng, W. K., 'Multiprocessor for String Manipulation,' M. S. Thesis, University of Illinois, Urbana, Oct. 1977, pp. 22-65.
 
16
Stellhorn, W. H., 'A Processor for Direct Scanning of Text,' presented at First Nonnumeric Workshop, Dallas, Oct. 1974.
17
 
18
Foster, M. J., and Kung, H. T., 'Design of Special Purpose VLSI Chips,' Computer, Jan. 1980, Vol. 13, No. 1, (ISSN 0018-9162), pp. 26-40.
 
19
Mules, D. W., and Warter, P. J., 'A String Matcher for an Approximate Matches,' Department of Electrical Engineering, University of Delaware, Newark, April, 1979.
20
21
 
22
Moore, G., et. al., 'High-Speed-Text-Search Design Contract Interim Report,' Operating Systems Incorporated, Woodland Hills, Calif., Jan. 1977, pp. 2-69-2-101.
 
23
'High-Speed-Text-Search Design Contract Final Report', Operating Systems Incorporated, OSI:R77-009, April 1977, pp. 3-1-3-24.
 
24
Booth, T. L., Sequential Machine and Automata Theory, John Wiley and Sons, Inc., New York, 1968.
 
25