ACM Home Page
Please provide us with feedback. Feedback
Summarization as feature selection for text categorization
Full text PdfPdf (947 KB)
Source Conference on Information and Knowledge Management archive
Proceedings of the tenth international conference on Information and knowledge management table of contents
Atlanta, Georgia, USA
Session: String Match and Text Extraction table of contents
Pages: 365 - 370  
Year of Publication: 2001
ISBN:1-58113-436-3
Authors
Aleksander Kolcz  Personalogy, Inc., Colorado Springs, CO
Vidya Prabakarmurthi  University of Colorado at Colorado Springs, Colorado Springs, CO
Jugal Kalita  University of Colorado at Colorado Springs, Colorado Springs, CO
Sponsors
SIGMIS: ACM Special Interest Group on Management Information Systems
SIGIR: ACM Special Interest Group on Information Retrieval
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 13,   Downloads (12 Months): 98,   Citation Count: 6
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/502585.502647
What is a DOI?

ABSTRACT

We address the problem of evaluating the effectiveness of summarization techniques for the task of document categorization. It is argued that for a large class of automatic categorization algorithms, extraction-based document categorization can be viewed as a particular form of feature selection performed on the full text of the document and, in this context, its impact can be compared with state-of-the-art feature selection techniques especially devised to provide good categorization performance. Such a framework provides for a better assessment of the expected performance of a categorizer if the compression rate of the summarizer is known.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
2
 
3
4
 
5
H. P. Edmundson. New methods in automatic extracting. Technical report, Department of Computer Science, University of Maryland at College Park, 1969.
6
 
7
T. F. Hand and B. Sundheim. TIPSTER-SUMMAC summarization evaluation. In Proceedings of the TIPSTER Text Phase III Workshop, 1998.
 
8
H. Jing, R. Barzilay, K. McKeown, and M. Elhadad. Summarization evaluation methods: Experiments and analysis. In AAAI Intelligent Text Summarization Workshop, pages 60-68, 1998.
9
 
10
 
11
 
12
 
13
J. T. Y. Kwok. Automated text categorization using support vector machine. In Proceedings of the International Conference on Neural Information Processing (ICONIP), pages 347-351, 1999.
 
14
 
15
 
16
II. P. Luhn. The automatic creation of literature abstracts. In IRE National Convention, pages 60-68, 1958.
 
17
K. Mahesh. Hypertext summary extraction for fast document browsing. In Working Notes of the AAAl Spring Symposium on Natural Language Processing for the World Wide Web, pages 95-103, 1997.
 
18
D. MladeniC and M. Grobelnik. Feature selection for classification based on text hierarchy. In Working notes of Learning from Text and the Web: Conference on Automatic Learning and Discovery (CONALD-98), 1998.
 
19
 
20
A. Tombros, M. Sanderson, and P. Gray. Advantages of query based summaries in information retrieval. In Worlcing Notes of the AAAI Spring Symposium on Natural Language Processing for the World Wide Web, pages 44.-52, 1998.
 
21
 
22
23
 
24


Collaborative Colleagues:
Aleksander Kolcz: colleagues
Vidya Prabakarmurthi: colleagues
Jugal Kalita: colleagues