|
ABSTRACT
In preparation for the widespread use of automatic scanners which will read documents and transmit their contents to other machines for analysis, this report presents a new concept in automatic analysis: the relative-frequency approach to measuring the significance of words, word groups, and sentences. The relative-frequency approach is discussed in detail, as is its application to problems of automatic indexing and automatic abstracting. Included in the report is a summary of automatic analysis studies published as of the date of writing. Conclusions are drawn that point toward more sophisticated mathematical and linguistic techniques for the solution of problems of automatic analysis.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
BAXENDALE, P. B. Machine-made index for technical liter:qture-an experiment. IBM J. Res. Dev. 2, 4 (Oct. 1958), 354-361.
|
| |
2
|
LUHN, H.P. The automatic creation of literature abstracts. IBM J. Res. Dev. 2, 2 (Apr. 1958), 159-165.
|
| |
3
|
OSWALD, V. A., JR., ET AL. Automatic indexing and abstract.. ing of the contents of documents. RADC-TR-59-208, 3L October 1959, prepared for the Rome Air Development Center, Air Research and Development Command, United States Air Force, pp. 5-34, 59-133.
|
| |
4
|
OSWALD, V. A. JR.; AND LAWSON, R.H. An idioglossary for mechanical translation. Mod. Language Forum 38, 2 (Sept.- Dec. 1953), 1-11.
|
| |
5
|
RATH, G. J.; RESNICK, A.; and SAVAGE, T.R. The formation of abstracts by the selection of sentences. Research Report RC-184, 29 June 1959, IBM Research Center, Yorktown Heights, N. Y.
|
| |
6
|
RATH, G. J.; RESNICK, A.; and SAVAGE, W.R. Comparisons of four types of lexical indicators of contents. Research Report RC-187, 14 August 1959, IBM Research Center, Yorktown Heights, N. Y.
|
| |
7
|
RESNICK, A.; and SAVAGE, T .n . A re-evaluation of machine generated abstracts. Research Report RC-230, 1 March 1960, IBM Research Center, Yorktown Heights, N. Y.
|
CITED BY 22
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Gerard Salton , J. Allan , Chris Buckley, Approaches to passage retrieval in full text information systems, Proceedings of the 16th annual international ACM SIGIR conference on Research and development in information retrieval, p.49-58, June 27-July 01, 1993, Pittsburgh, Pennsylvania, United States
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Terry Noreault , Michael McGill , Matthew B. Koll, A performance evaluation of similarity measures, document term weighting schemes and representations in a Boolean environment, Proceedings of the 3rd annual ACM conference on Research and development in information retrieval, p.57-76, June 23-27, 1980, Cambridge, England
|
|
|
|
|
|
|
|
|
G. N. Arnovick , J. A. Liles , J. S. Wood, Information storage and retrieval-analysis of the state of the art, Proceedings of the April 21-23, 1964, spring joint computer conference, April 21-23, 1964, Washington, D.C.
|
|
|
|
|
|
|
|