ACM Home Page
Please provide us with feedback. Feedback
Toward economic machine learning and utility-based data mining
Full text PdfPdf (20 KB)
Source International Conference on Knowledge Discovery and Data Mining archive
Proceedings of the 1st international workshop on Utility-based data mining table of contents
Chicago, Illinois
SESSION: Invited talks table of contents
Pages: 1 - 1  
Year of Publication: 2005
ISBN:1-59593-208-9
Author
Foster Provost  New York University, New York, NY
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 8,   Downloads (12 Months): 26,   Citation Count: 5
Additional Information:

abstract   cited by   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1089827.1089841
What is a DOI?

ABSTRACT

Data mining requires certain information---for example, supervised learning requires training data. Some prior research has recognized that this information often does not simply present itself for free, but involves various acquisition costs. In addition, applying the learned models involves costs and benefits. I introduce a general economic setting that includes as special cases the settings of many different streams of prior research, such as cost-sensitive learning, traditional active learning, semi-supervised learning, active feature acquisition, progressive sampling, and budgeted learning, which are interwoven inextricably. For data mining in the general setting I suggest a strategy of maximum expected-utility data acquisition. Finally, I discuss how there are many open research issues that must be addressed. As a simple example, we must be able to deal with the seemingly straightforward problem of handling missing values in induction and inference.