| Summarization as feature selection for text categorization |
| Full text |
Pdf
(947 KB)
|
| Source
|
Conference on Information and Knowledge Management
archive
Proceedings of the tenth international conference on Information and knowledge management
table of contents
Atlanta, Georgia, USA
Session: String Match and Text Extraction
table of contents
Pages: 365 - 370
Year of Publication: 2001
ISBN:1-58113-436-3
|
|
Authors
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 13, Downloads (12 Months): 98, Citation Count: 6
|
|
|
ABSTRACT
We address the problem of evaluating the effectiveness of summarization techniques for the task of document categorization. It is argued that for a large class of automatic categorization algorithms, extraction-based document categorization can be viewed as a particular form of feature selection performed on the full text of the document and, in this context, its impact can be compared with state-of-the-art feature selection techniques especially devised to provide good categorization performance. Such a framework provides for a better assessment of the expected performance of a categorizer if the compression rate of the summarizer is known.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
|
| |
2
|
|
| |
3
|
|
 |
4
|
Susan Dumais , John Platt , David Heckerman , Mehran Sahami, Inductive learning algorithms and representations for text categorization, Proceedings of the seventh international conference on Information and knowledge management, p.148-155, November 02-07, 1998, Bethesda, Maryland, United States
[doi> 10.1145/288627.288651]
|
| |
5
|
H. P. Edmundson. New methods in automatic extracting. Technical report, Department of Computer Science, University of Maryland at College Park, 1969.
|
 |
6
|
Venkatesh Ganti , Johannes Gehrke , Raghu Ramakrishnan, CACTUS—clustering categorical data using summaries, Proceedings of the fifth ACM SIGKDD international conference on Knowledge discovery and data mining, p.73-83, August 15-18, 1999, San Diego, California, United States
[doi> 10.1145/312129.312201]
|
| |
7
|
T. F. Hand and B. Sundheim. TIPSTER-SUMMAC summarization evaluation. In Proceedings of the TIPSTER Text Phase III Workshop, 1998.
|
| |
8
|
H. Jing, R. Barzilay, K. McKeown, and M. Elhadad. Summarization evaluation methods: Experiments and analysis. In AAAI Intelligent Text Summarization Workshop, pages 60-68, 1998.
|
 |
9
|
|
| |
10
|
|
| |
11
|
|
| |
12
|
|
| |
13
|
J. T. Y. Kwok. Automated text categorization using support vector machine. In Proceedings of the International Conference on Neural Information Processing (ICONIP), pages 347-351, 1999.
|
| |
14
|
|
| |
15
|
|
| |
16
|
II. P. Luhn. The automatic creation of literature abstracts. In IRE National Convention, pages 60-68, 1958.
|
| |
17
|
K. Mahesh. Hypertext summary extraction for fast document browsing. In Working Notes of the AAAl Spring Symposium on Natural Language Processing for the World Wide Web, pages 95-103, 1997.
|
| |
18
|
D. MladeniC and M. Grobelnik. Feature selection for classification based on text hierarchy. In Working notes of Learning from Text and the Web: Conference on Automatic Learning and Discovery (CONALD-98), 1998.
|
| |
19
|
|
| |
20
|
A. Tombros, M. Sanderson, and P. Gray. Advantages of query based summaries in information retrieval. In Worlcing Notes of the AAAI Spring Symposium on Natural Language Processing for the World Wide Web, pages 44.-52, 1998.
|
| |
21
|
|
| |
22
|
|
 |
23
|
|
| |
24
|
|
CITED BY 6
|
|
Dou Shen , Zheng Chen , Qiang Yang , Hua-Jun Zeng , Benyu Zhang , Yuchang Lu , Wei-Ying Ma, Web-page classification through summarization, Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval, July 25-29, 2004, Sheffield, United Kingdom
|
|
|
|
|
|
|
|
|
|
|
|
Aris Anagnostopoulos , Andrei Z. Broder , Evgeniy Gabrilovich , Vanja Josifovski , Lance Riedel, Just-in-time contextual advertising, Proceedings of the sixteenth ACM conference on Conference on information and knowledge management, November 06-10, 2007, Lisbon, Portugal
|
|
|
|
|