|
ABSTRACT
This paper describes new methods of automatically extracting documents for screening purposes, i.e. the computer selection of sentences having the greatest potential for conveying to the reader the substance of the document. While previous work has focused on one component of sentence significance, namely, the presence of high-frequency content words (key words), the methods described here also treat three additional components: pragmatic words (cue words); title and heading words; and structural indicators (sentence location).
The research has resulted in an operating system and a research methodology. The extracting system is parameterized to control and vary the influence of the above four components. The research methodology includes procedures for the compilation of the required dictionaries, the setting of the control parameters, and the comparative evaluation of the automatic extracts with manually produced extracts. The results indicate that the three newly proposed components dominate the frequency component in the production of better extracts.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
Automatic abstracting. RADC-TDR-63-93, TRW Computer Div., Thompsoa-Ramo- Wooldridge, Inc., Canoga Park, Calif., Feb. 1963.
|
 |
2
|
|
 |
3
|
|
| |
4
|
Final report on the study for automtic abstracting. Cl07-1U12, Thompson-Ramo- Wooldridge, Inc., Canoga Park, Calif., Sept. 1961.
|
| |
5
|
KUNs, J.L. An application of logical probability to problems in automatic abstracting and information retrieval. Joint Man-Computer Indexing and Abstracting, Sess. 13, First Congress on the Information System Sciences, Nov. 1962.
|
| |
6
|
LUHN, H.P. The automatic creation of literature abstracts, iBM J. Res. Develop. 2, 2 (1959), 159-165.
|
| |
7
|
RATH, G. J., RESNICK, A., AND SAVAGE, T. R. Comparisons of four types of lcxical indicators of content. Amer. Docum. 12, 2 (Apr. 1961), 126-130.
|
CITED BY 109
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Julian Kupiec , Jan Pedersen , Francine Chen, A trainable document summarizer, Proceedings of the 18th annual international ACM SIGIR conference on Research and development in information retrieval, p.68-73, July 09-13, 1995, Seattle, Washington, United States
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Takehiko Yoshimi , Toshiyuki Okunishi , Takahiro Yamaji , Yoji Fukumochi, Evaluation of importance of sentences based on connectivity to title, Proceedings of the 17th international conference on Computational linguistics, p.1443-1447, August 10-14, 1998, Montreal, Quebec, Canada
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Anton Leuski , Chin-Yew Lin , Liang Zhou , Ulrich Germann , Franz Josef Och , Eduard Hovy, Cross-lingual C*ST*RD: English access to Hindi information, ACM Transactions on Asian Language Information Processing (TALIP), v.2 n.3, p.245-269, September 2003
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Chinatsu Aone , Mary Ellen Okurowski , James Gorlinsky, Trainable, scalable summarization using robust NLP and machine learning, Proceedings of the 17th international conference on Computational linguistics, August 10-14, 1998, Montreal, Quebec, Canada
|
|
|
|
|
|
Inderjeet Mani , David House , Gary Klein , Lynette Hirschman , Therese Firmin , Beth Sundheim, The TIPSTER SUMMAC Text Summarization Evaluation, Proceedings of the ninth conference on European chapter of the Association for Computational Linguistics, June 08-12, 1999, Bergen, Norway
|
|
|
Takahiro Wakao , Terumasa Ehara , Eiji Sawamura , Ichiro Maruyama , Katsuhiko Shirai, Project for production of closed-caption TV programs for the hearing impaired, Proceedings of the 17th international conference on Computational linguistics, p.1340-1344, August 10-14, 1998, Montreal, Quebec, Canada
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Tsutomu Hirao , Hideki Isozaki , Eisaku Maeda , Yuji Matsumoto, Extracting important sentences with support vector machines, Proceedings of the 19th international conference on Computational linguistics, p.1-7, August 24-September 01, 2002, Taipei, Taiwan
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Zhuli Xie , Xin Li , Barbara Di Eugenio , Peter C. Nelson , Weimin Xiao , Thomas M. Tirpak, Using gene expression programming to construct sentence ranking functions for text summarization, Proceedings of the 20th international conference on Computational Linguistics, p.1381-es, August 23-27, 2004, Geneva, Switzerland
|
|
|
|
|
|
|
|
|
Enrique Amigó , Julio Gonzalo , Víctor Peinado , Anselmo Peñas , Felisa Verdejo, Using syntactic information to extract relevant terms for multi-document summarization, Proceedings of the 20th international conference on Computational Linguistics, p.652-es, August 23-27, 2004, Geneva, Switzerland
|
|
|
|
|
|
|
|
|
Hong Yu , Minsuk Lee , David Kaufman , John Ely , Jerome A. Osheroff , George Hripcsak , James Cimino, Development, implementation, and a cognitive evaluation of a definitional question answering system for physicians, Journal of Biomedical Informatics, v.40 n.3, p.236-251, June, 2007
|
|
|
|
|
|
Paul - Alexandru Chirita , Stefania Costache , Wolfgang Nejdl , Siegfried Handschuh, P-TAG: large scale automatic generation of personalized annotation tags for the web, Proceedings of the 16th international conference on World Wide Web, May 08-12, 2007, Banff, Alberta, Canada
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Inderjeet Mani , Gary Klein , David House , Lynette Hirschman , Therese Firmin , Beth Sundheim, SUMMAC: a text summarization evaluation, Natural Language Engineering, v.8 n.1, p.43-68, March 2002
|
|
|
|
|
|
P. V. Sai Teja , T. H. V. Reddy , P. E. Rohit , Prashanth N , S. Mittal , S. Tapaswi, Web content adaptation classification and summarization in mobile devices, Proceedings of the 4th international conference on mobile technology, applications, and systems and the 1st international symposium on Computer human interaction in mobile technology, September 10-12, 2007, Singapore
|
|
|
|
|
|
Mohamed Abdel Fattah , Fuji Ren, GA, MR, FFNN, PNN and GMM based models for automatic text summarization, Computer Speech and Language, v.23 n.1, p.126-144, January, 2009
|
|
|
Lucas Antiqueira , Osvaldo N. Oliveira, Jr. , Luciano da Fontoura Costa , Maria das Graças Volpe Nunes, A complex network approach to text summarization, Information Sciences: an International Journal, v.179 n.5, p.584-599, February, 2009
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Edleno Silva de Moura , Celia Francisca dos Santos , Bruno Dos santos de Araujo , Altigran Soares da Silva , Pavel Calado , Mario A. Nascimento, Locality-Based pruning methods for web search, ACM Transactions on Information Systems (TOIS), v.26 n.2, p.1-28, March 2008
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Jaehui Park , Tomohiro Fukuhara , Ikki Ohmukai , Hideaki Takeda , Sang-goo Lee, Web content summarization using social bookmarks: a new approach for social summarization, Proceeding of the 10th ACM workshop on Web information and data management, October 30-30, 2008, Napa Valley, California, USA
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|