ACM Home Page
Please provide us with feedback. Feedback
Intelligent fusion of structural and citation-based evidence for text classification
Full text PdfPdf (81 KB)
Source Annual ACM Conference on Research and Development in Information Retrieval archive
Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval table of contents
Salvador, Brazil
POSTER SESSION: Posters table of contents
Pages: 667 - 668  
Year of Publication: 2005
ISBN:1-59593-034-5
Authors
Baoping Zhang  Virginia Tech, Blacksburg, VA
Yuxin Chen  Virginia Tech, Blacksburg, VA
Weiguo Fan  Virginia Tech, Blacksburg, VA
Edward A. Fox  Virginia Tech, Blacksburg, VA
Marcos Andrép Gonçalves  Federal University of Minas Gerais, Belo Horizonte, MG, Brazil
Marco Cristo  Federal University of Minas Gerais, Belo Horizonte, MG, Brazil
Pàvel Calado  IST/INESC-ID, Lisbon, Portugal
Sponsor
SIGIR: ACM Special Interest Group on Information Retrieval
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 5,   Downloads (12 Months): 50,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1076034.1076181
What is a DOI?

ABSTRACT

This paper shows how different measures of similarity derived from the citation information and the structural content (e.g., title, abstract) of the collection can be fused to improve classification effectiveness. To discover the best fusion framework, we apply Genetic Programming (GP) techniques. Our experiments with the ACM Computing Classification Scheme, using documents from the ACM Digital Library, indicate that GP can discover similarity functions superior to those based solely on a single type of evidence. Effectiveness of the similarity functions discovered through simple majority voting is better than that of content-based as well as combination-based Support Vector Machine classifiers. Experiments also were conducted to compare the performance between GP techniques and other fusion techniques such as Genetic Algorithms (GA) and linear fusion. Empirical results show that GP was able to discover better similarity functions than other fusion techniques.



Collaborative Colleagues:
Baoping Zhang: colleagues
Yuxin Chen: colleagues
Weiguo Fan: colleagues
Edward A. Fox: colleagues
Marcos Andrép Gonçalves: colleagues
Marco Cristo: colleagues
Pàvel Calado: colleagues