ACM Home Page
Please provide us with feedback. Feedback
New Methods in Automatic Extracting
Full text PdfPdf (5.41 MB)
Source Journal of the ACM (JACM) archive
Volume 16 ,  Issue 2  (April 1969) table of contents
Pages: 264 - 285  
Year of Publication: 1969
ISSN:0004-5411
Author
H. P. Edmundson  University of Maryland, Computer Science Center, College Park, Maryland
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 16,   Downloads (12 Months): 143,   Citation Count: 108
Additional Information:

abstract   references   cited by   index terms  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/321510.321519
What is a DOI?

ABSTRACT

This paper describes new methods of automatically extracting documents for screening purposes, i.e. the computer selection of sentences having the greatest potential for conveying to the reader the substance of the document. While previous work has focused on one component of sentence significance, namely, the presence of high-frequency content words (key words), the methods described here also treat three additional components: pragmatic words (cue words); title and heading words; and structural indicators (sentence location). The research has resulted in an operating system and a research methodology. The extracting system is parameterized to control and vary the influence of the above four components. The research methodology includes procedures for the compilation of the required dictionaries, the setting of the control parameters, and the comparative evaluation of the automatic extracts with manually produced extracts. The results indicate that the three newly proposed components dominate the frequency component in the production of better extracts.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Automatic abstracting. RADC-TDR-63-93, TRW Computer Div., Thompsoa-Ramo- Wooldridge, Inc., Canoga Park, Calif., Feb. 1963.
2
3
 
4
Final report on the study for automtic abstracting. Cl07-1U12, Thompson-Ramo- Wooldridge, Inc., Canoga Park, Calif., Sept. 1961.
 
5
KUNs, J.L. An application of logical probability to problems in automatic abstracting and information retrieval. Joint Man-Computer Indexing and Abstracting, Sess. 13, First Congress on the Information System Sciences, Nov. 1962.
 
6
LUHN, H.P. The automatic creation of literature abstracts, iBM J. Res. Develop. 2, 2 (1959), 159-165.
 
7
RATH, G. J., RESNICK, A., AND SAVAGE, T. R. Comparisons of four types of lcxical indicators of content. Amer. Docum. 12, 2 (Apr. 1961), 126-130.

CITED BY  108