ACM Home Page
Please provide us with feedback. Feedback
Combining structural and citation-based evidence for text classification
Full text PdfPdf (85 KB)
Source Conference on Information and Knowledge Management archive
Proceedings of the thirteenth ACM international conference on Information and knowledge management table of contents
Washington, D.C., USA
POSTER SESSION: Posters P-1 table of contents
Pages: 162 - 163  
Year of Publication: 2004
ISBN:1-58113-874-1
Authors
Baoping Zhang  Virginia Tech, Blacksburg, VA
Marcos André Gonçalves  Virginia Tech, Blacksburg, VA
Weiguo Fan  Virginia Tech, Blacksburg, VA
Yuxin Chen  Virginia Tech, Blacksburg, VA
Edward A. Fox  Virginia Tech, Blacksburg, VA
Pável Calado  Federal University of Minas Gerais, Belo Horizonte, Brazil
Marco Cristo  Federal University of Minas Gerais, Belo Horizonte, Brazil
Sponsors
SIGIR: ACM Special Interest Group on Information Retrieval
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 1,   Downloads (12 Months): 22,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1031171.1031204
What is a DOI?

ABSTRACT

This paper discusses how citation-based information and structural content (e.g., title, abstract) can be combined to improve classification of text documents into predefined categories. We evaluate different measures of similarity derived from the citation structure and the structural content of the collection, and determine how they can be fused to improve classification effectiveness. To discover the best fusion framework, we apply Genetic Programming (GP) techniques. Our empirical experiments using documents from the ACM Digital Library and the ACM Computing Classification System show that we can discover similarity functions that work better than using evidence in isolation and whose combined performance through a simple majority voting is comparable to that of Support Vector Machine classifiers.



Collaborative Colleagues:
Baoping Zhang: colleagues
Marcos André Gonçalves: colleagues
Weiguo Fan: colleagues
Yuxin Chen: colleagues
Edward A. Fox: colleagues
Pável Calado: colleagues
Marco Cristo: colleagues