| Multi-labelled classification using maximum entropy method |
| Full text |
Pdf
(220 KB)
|
| Source
|
Annual ACM Conference on Research and Development in Information Retrieval
archive
Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
table of contents
Salvador, Brazil
SESSION: Categorization and supervised machine learning
table of contents
Pages: 274 - 281
Year of Publication: 2005
ISBN:1-59593-034-5
|
|
Authors
|
|
Shenghuo Zhu
|
NEC Laboratories America, Inc., Cupertino, CA
|
|
Xiang Ji
|
NEC Laboratories America, Inc., Cupertino, CA
|
|
Wei Xu
|
NEC Laboratories America, Inc., Cupertino, CA
|
|
Yihong Gong
|
NEC Laboratories America, Inc., Cupertino, CA
|
|
| Sponsor |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 26, Downloads (12 Months): 205, Citation Count: 13
|
|
|
ABSTRACT
Many classification problems require classifiers to assign each single document into more than one category, which is called multi-labelled classification. The categories in such problems usually are neither conditionally independent from each other nor mutually exclusive, therefore it is not trivial to directly employ state-of-the-art classification algorithms without losing information of relation among categories. In this paper, we explore correlations among categories with maximum entropy method and derive a classification algorithm for multi-labelled documents. Our experiments show that this method significantly outperforms the combination of single label approach.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
Benson, S. J., McInnes, L. C., Moré, J., & Sarich, J. (2004). TAO user manual (revision 1.7) (Technical Report ANL/MCS-TM-242). Mathematics and Computer Science Division, Argonne National Laboratory. http://www.mcs.anl.gov/tao.
|
 |
2
|
|
| |
3
|
Chen, S. F., & Rosenfeld, R. (1999). A Gaussian prior for smoothing maximum entropy models (Technical Report CMU-CS-99-108). School of Computer Science Carnegie Mellon University.
|
| |
4
|
|
| |
5
|
Comite, F. D., Gilleron, R., & Tommasi, M. (2001). Learning multi-label alternating decision trees and applications. Proceedings of CAP'01 (pp. 195--210).
|
 |
6
|
|
| |
7
|
|
| |
8
|
Elisseeff, A., & Weston, J. (2002). A kernel method for multi-labelled classification. Advances in Neural Information Processing Systems 14 (pp. 681--687). Cambridge, MA: MIT Press.
|
 |
9
|
Sheng Gao , Wen Wu , Chin-Hui Lee , Tat-Seng Chua, A MFoM learning approach to robust multiclass multi-label text categorization, Proceedings of the twenty-first international conference on Machine learning, p.42, July 04-08, 2004, Banff, Alberta, Canada
[doi> 10.1145/1015330.1015361]
|
| |
10
|
Godbole, S., & Sarawagi, S. (2004). Discriminative methods for multi-labeled classification. PAKDD.
|
| |
11
|
Har-Peled, S., Roth, D., & Zimak, D. Constraint classification for multiclass classification and ranking. In S. T. S. Becker and K. Obermayer (Eds.), Advances in neural information processing systems 15. MIT Press.
|
| |
12
|
Jaynes, E. T. (1957). Information theory and statistical mechanics. Physical Review, 106, 620--630.
|
| |
13
|
|
| |
14
|
McCallum, A. (1999). Multi-label text classification with a mixture model trained by EM. AAAI'99 Workshop on Text Learning.
|
| |
15
|
Nigam, K., Lafferty, J., & McCallum, A. (1999). Using maximum entropy for text classification. IJCAI-99 Workshop on Machine Learning for Information Filtering (pp. 61--67).
|
| |
16
|
|
| |
17
|
Ueda, N., & Saito, K. Parametric mixture models for multi-labeled text. Advances in Neural Information Processing Systems 15. MIT Press.
|
| |
18
|
Wilcoxon, F. (1945). Individual comparisons by ranking methods. Biometrics, 1, 80--93.
|
 |
19
|
|
| |
20
|
|
| |
21
|
Zhu, J., & Hastie, T. (2003). Classification of gene microarrays by penalized logistic regression. Biostatistics.
|
|