|
ABSTRACT
We empirically study the relationship between supervised and multiple instance (MI) learning. Algorithms to learn various concepts have been adapted to the MI representation. However, it is also known that concepts that are PAC-learnable with one-sided noise can be learned from MI data. A relevant question then is: how well do supervised learners do on MI data? We attempt to answer this question by looking at a cross section of MI data sets from various domains coupled with a number of learning algorithms including Diverse Density, Logistic Regression, nonlinear Support Vector Machines and FOIL. We consider a supervised and MI version of each learner. Several interesting conclusions emerge from our work: (1) no MI algorithm is superior across all tested domains, (2) some MI algorithms are consistently superior to their supervised counterparts, (3) using high false-positive costs can improve a supervised learner's performance in MI domains, and (4) in several domains, a supervised algorithm is superior to any MI algorithm we tested.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
Andrews, S., Tsochantaridis, I., & Hofmann, T. (2003). Support vector machines for multiple-instance learning. In Advances in neural information processing systems, vol. 15. MIT Press.
|
| |
2
|
Auer, P., & Ortner, R. (2004). A boosting approach to multiple instance learning. Proc. of the 15th European Conf. on Machine Learning (pp. 63--74). Springer-Verlag.
|
| |
3
|
Blaschke, C., Andres Leon, E., Krallinger, M., & Valencia, A. (2005). Evaluation of BioCreAtIvE assessment of task 2. BMC Bioinformatics, 6(Suppl. 1).
|
| |
4
|
|
| |
5
|
Bradley, A. P. (1997). The use of the area under the roc curve in the evaluation of machine learning algorithms. Pattern Recognition, 30, 1145--1159.
|
| |
6
|
|
| |
7
|
Fletcher, R. (1980). Practical methods of optimization, vol. 1: Unconstrained Optimization, chapter 3. Wiley.
|
| |
8
|
|
| |
9
|
|
| |
10
|
|
| |
11
|
|
| |
12
|
|
| |
13
|
|
| |
14
|
|
| |
15
|
|
| |
16
|
|
| |
17
|
Ruffo, G. (2000). Learning single and multiple instance decision trees for computer security applications. Doctoral dissertation, Università di Torino, Torino, Italy.
|
 |
18
|
Qingping Tao , Stephen Scott , N. V. Vinodchandran , Thomas Takeo Osugi, SVM-based generalized multiple-instance learning via approximate box counting, Proceedings of the twenty-first international conference on Machine learning, p.101, July 04-08, 2004, Banff, Alberta, Canada
[doi> 10.1145/1015330.1015405]
|
| |
19
|
|
| |
20
|
Xu, X., & Frank, E. (2004). Logistic regression and boosting for labeled bags of instances. Proc. of the Pacific Asia Conf. on Knowledge Discovery and Data Mining. Springer--Verlag.
|
| |
21
|
|
CITED BY 13
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Vikas C. Raykar , Balaji Krishnapuram , Jinbo Bi , Murat Dundar , R. Bharat Rao, Bayesian multiple instance learning: automatic feature selection and inductive transfer, Proceedings of the 25th international conference on Machine learning, p.808-815, July 05-09, 2008, Helsinki, Finland
|
|
|
|
|
|
Zhi-Hua Zhou , Yu-Yin Sun , Yu-Feng Li, Multi-instance learning by treating instances as non-I.I.D. samples, Proceedings of the 26th Annual International Conference on Machine Learning, p.1249-1256, June 14-18, 2009, Montreal, Quebec, Canada
|
|
|
|
|
|
|
|