ACM Home Page
Please provide us with feedback. Feedback
Supervised versus multiple instance learning: an empirical comparison
Full text PdfPdf (935 KB)
Source ACM International Conference Proceeding Series; Vol. 119 archive
Proceedings of the 22nd international conference on Machine learning table of contents
Bonn, Germany
Pages: 697 - 704  
Year of Publication: 2005
ISBN:1-59593-180-5
Authors
Soumya Ray  University of Wisconsin
Mark Craven  University of Wisconsin
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 6,   Downloads (12 Months): 37,   Citation Count: 13
Additional Information:

abstract   references   cited by   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1102351.1102439
What is a DOI?

ABSTRACT

We empirically study the relationship between supervised and multiple instance (MI) learning. Algorithms to learn various concepts have been adapted to the MI representation. However, it is also known that concepts that are PAC-learnable with one-sided noise can be learned from MI data. A relevant question then is: how well do supervised learners do on MI data? We attempt to answer this question by looking at a cross section of MI data sets from various domains coupled with a number of learning algorithms including Diverse Density, Logistic Regression, nonlinear Support Vector Machines and FOIL. We consider a supervised and MI version of each learner. Several interesting conclusions emerge from our work: (1) no MI algorithm is superior across all tested domains, (2) some MI algorithms are consistently superior to their supervised counterparts, (3) using high false-positive costs can improve a supervised learner's performance in MI domains, and (4) in several domains, a supervised algorithm is superior to any MI algorithm we tested.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Andrews, S., Tsochantaridis, I., & Hofmann, T. (2003). Support vector machines for multiple-instance learning. In Advances in neural information processing systems, vol. 15. MIT Press.
 
2
Auer, P., & Ortner, R. (2004). A boosting approach to multiple instance learning. Proc. of the 15th European Conf. on Machine Learning (pp. 63--74). Springer-Verlag.
 
3
Blaschke, C., Andres Leon, E., Krallinger, M., & Valencia, A. (2005). Evaluation of BioCreAtIvE assessment of task 2. BMC Bioinformatics, 6(Suppl. 1).
 
4
 
5
Bradley, A. P. (1997). The use of the area under the roc curve in the evaluation of machine learning algorithms. Pattern Recognition, 30, 1145--1159.
 
6
 
7
Fletcher, R. (1980). Practical methods of optimization, vol. 1: Unconstrained Optimization, chapter 3. Wiley.
 
8
 
9
 
10
 
11
 
12
 
13
 
14
 
15
 
16
 
17
Ruffo, G. (2000). Learning single and multiple instance decision trees for computer security applications. Doctoral dissertation, Università di Torino, Torino, Italy.
18
 
19
 
20
Xu, X., & Frank, E. (2004). Logistic regression and boosting for labeled bags of instances. Proc. of the Pacific Asia Conf. on Knowledge Discovery and Data Mining. Springer--Verlag.
 
21

CITED BY  13