|
ABSTRACT
We describe a method for generating accurate, compact, human understandable text classifiers. Text datasets are indexed using Apache Lucene and Genetic Programs are used to construct Lucene search queries. Genetic programs acquire fitness by producing queries that are effective binary classifiers for a particular category when evaluated against a set of training documents. We describe a set of functions and terminals and provide results from classification tasks.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
|
| |
2
|
|
| |
3
|
|
 |
4
|
Agneta Bergström , Patricija Jaksetic , Peter Nordin, Enhancing information retrieval by automatic acquisition of textual relations using genetic programming, Proceedings of the 5th international conference on Intelligent user interfaces, p.29-32, January 09-12, 2000, New Orleans, Louisiana, United States
[doi> 10.1145/325737.325763]
|
 |
5
|
Chris Clack , Johnny Farringdon , Peter Lidwell , Tina Yu, Autonomous document classification for business, Proceedings of the first international conference on Autonomous agents, p.201-208, February 05-08, 1997, Marina del Rey, California, United States
[doi> 10.1145/267658.267716]
|
| |
6
|
Cohen, W. 1995 Fast effective rule induction. In Proceedings of the Twelfth International Conference on Machine Learning, pages 115--123
|
| |
7
|
|
| |
8
|
P. J. Hayes , P. M. Andersen , I. B. Nirenburg , L. M. Schmandt, TCS: a shell for content-based text categorization, Proceedings of the sixth conference on Artificial intelligence applications, p.320-326, January 1990, Santa Barbara, California, United States
|
| |
9
|
Hirsch L., Saeedi M. and Hirsch R., Evolving Text Classification Rules with Genetic Programming Applied Artificial Intelligence, (AAI 19/7), Taylor & Francis, August 2005
|
| |
10
|
|
| |
11
|
|
| |
12
|
Montana, D. 1995. Strongly Typed Genetic Programming. In Evolutionary Computation. 3:2, 199----230. The MIT Press, Cambridge MA.
|
| |
13
|
Quinlan, J. R. Bagging, boosting, and C4.5. In Proceedings, Fourteenth National Conference on Artificial Intelligence, 1996.
|
 |
14
|
Gerard Salton , Amit Singhal , Chris Buckley , Mandar Mitra, Automatic text decomposition using text segments and text themes, Proceedings of the the seventh ACM conference on Hypertext, p.53-65, March 16-20, 1996, Bethesda, Maryland, United States
[doi> 10.1145/234828.234834]
|
 |
15
|
|
| |
16
|
|
| |
17
|
|
| |
18
|
Vasile, F., Silvescu, A., Kang, D-K. and Honavar, V. 2006. TRIPPER: An Attribute Value Taxonomy Guided Rule Learner. Proceedings of the Tenth Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD), Berlin: Springer--Verlag. pp. 55--59
|
|