ACM Home Page
Please provide us with feedback. Feedback
Experimental comparisons of online and batch versions of bagging and boosting
Full text PdfPdf (475 KB)
Source International Conference on Knowledge Discovery and Data Mining archive
Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data mining table of contents
San Francisco, California
Pages: 359 - 364  
Year of Publication: 2001
ISBN:1-58113-391-X
Authors
Nikunj C. Oza  University of California, Berkeley, CA
Stuart Russell  University of California, Berkeley, CA
Sponsors
SIGMOD: ACM Special Interest Group on Management of Data
AAAI : American Association for Artificial Intelligence
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 10,   Downloads (12 Months): 72,   Citation Count: 3
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/502512.502565
What is a DOI?

ABSTRACT

Bagging and boosting are well-known ensemble learning methods. They combine multiple learned base models with the aim of improving generalization performance. To date, they have been used primarily in batch mode, i.e., they require multiple passes through the training data. In previous work, we presented online bagging and boosting algorithms that only require one pass through the training data and presented experimental results on some relatively small datasets. Through additional experiments on a variety of larger synthetic and real datasets, this paper demonstrates that our online versions perform comparably to their batch counterparts in terms of classification accuracy. We also demonstrate the substantial reduction in running time we obtain with our online algorithms because they require fewer passes through the training data.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
S.D. Bay. The UCI KDD archive, 1999. (URL: http://kdd.ics.uci.edu).
 
2
C. Blake, E. Keogh, and C.J. Merz. UCI repository of machine learning databases, 1999. (URL: http: / /www.ics.uci.edu /~mlearn /MLRepository.htmI).
 
3
 
4
 
5
O. L. Mangasarian, R. Setiono, and W. H. Wolberg. Pattern recognition via linear programming: Theory and application to medical diagnosis. In Thomas F. Coleman and Yuying Li, editors, Large-Scale Numerical Optimization, pages 22-30. SIAM Publications, 1990.
 
6
Nikunj C. Oza and Stuart Russell. Experimental comparisons of online and batch versions of bagging and boosting. Technical report, Electrical Engineering and Computer Science Department, University of California, Berkeley, CA. In prepaxation.
 
7
Nikunj C. Oza and Stuart Russell. Online bagging and boosting. In Artificial Intelligence and Statistics 2001, pages 105-112. Morgan Kanfmann, 2001.
 
8


Collaborative Colleagues:
Nikunj C. Oza: colleagues
Stuart Russell: colleagues