ACM Home Page
Please provide us with feedback. Feedback
Cross-disciplinary perspectives on meta-learning for algorithm selection
Full text PdfPdf (359 KB)
Source
ACM Computing Surveys (CSUR) archive
Volume 41 ,  Issue 1  (December 2008) table of contents
Article No. 6  
Year of Publication: 2008
ISSN:0360-0300
Author
Kate A. Smith-Miles  Monash University, Australia
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 69,   Downloads (12 Months): 679,   Citation Count: 1
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1456650.1456656
What is a DOI?

ABSTRACT

The algorithm selection problem [Rice 1976] seeks to answer the question: Which algorithm is likely to perform best for my problem? Recognizing the problem as a learning task in the early 1990's, the machine learning community has developed the field of meta-learning, focused on learning about learning algorithm performance on classification problems. But there has been only limited generalization of these ideas beyond classification, and many related attempts have been made in other disciplines (such as AI and operations research) to tackle the algorithm selection problem in different ways, introducing different terminology, and overlooking the similarities of approaches. In this sense, there is much to be gained from a greater awareness of developments in meta-learning, and how these ideas can be generalized to learn about the behaviors of other (nonlearning) algorithms. In this article we present a unified framework for considering the algorithm selection problem as a learning problem, and use this framework to tie together the crossdisciplinary developments in tackling the algorithm selection problem. We discuss the generalization of meta-learning concepts to algorithms focused on tasks including sorting, forecasting, constraint satisfaction, and optimization, and the extension of these ideas to bioinformatics, cryptography, and other fields.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Achlioptas, D., Naor, A., and Peres, Y. 2005. Rigorous location of phase transitions in hard optimization problems. Nature 435 (Jun.), 759--764.
 
2
 
3
Ali, S. and Smith, K. A. 2005. Kernel width selection for SVM classification: A meta learning approach. Int. J. Data Warehousing Mining 1, 78--97.
 
4
Ali, S. and Smith, K. 2006. On learning algorithm selection for classification. Appl. Soft Comput. 6, 2, 119--138.
 
5
Ali, S. and Smith-Miles, K. 2006. A meta-learning approach to automatic kernel selection for support vector machines. Neurocomput. 70, 1-3, 173--186.
 
6
 
7
Arinze, B. 1995. Selecting appropriate forecasting models using rule induction. Omega Int. J. Manage. Sci. 22, 6, 647--658.
 
8
 
9
 
10
Asuncion, A and Newman, D.J. 2007. UCI Machine Learning Repository (http://www.ics.uci.edu/~mlearn/MLRepository.html). Irvine, CA: University of California, Department of Information and Computer Science.
 
11
Bachelet, V. 1999. Métaheuristiques paralleles hybrides: Application au probleme d'affectation quadratique. Ph.D. dissertation, Universite des Sciences et Technologies de Lille, France. December.
 
12
Bensusan, H., Giraud-Carrier, C., and Kennedy, C. 2000. A higher-order approach to meta-learning. In Proceedings of the European Conference on Machine Learning, Workshop on Meta-Learning: Building Automatic Advice Strategies for Model Selection and Method Combination, 109--117.
 
13
 
14
 
15
Berrer, H., Paterson, I., and Keller, J. 2000. Evaluation of machine-learning algorithm ranking advisors. In Proceedings of the PKDD-00 Workshop on Data Mining, Decision Support, Meta-Learning and ILP: Forum for Practical Problem Presentation and Prospective Solutions, P. Brazdil and A. Jorge, eds.
 
16
Borenstein, Y., and Poli, R. 2006. Kolmogorov complexity, optimization and hardness. IEEE Congress on Evol. Comput., 112--119.
 
17
Boukeas, G., Halatsis, C., Zissimpoloulos, V., and Stamatopoulos, P. 2004. Measures of instrinsic hardness for constraint satisfaction problem instances. Lecture Notes in Computer Science, vol. 2932, 184--195.
 
18
 
19
 
20
Brodley, C. E. 1993. Addressing the selective superiority problem: Automatic algorithm/model class selection. In Proceedings of the 10th International Machine Learning Conference, 17--24.
 
21
Burke, E., Kendall, G., Newall, J., Hart, E., Ross, P., and Schulenburg, S. 2003. Hyper-Heuristics: An emerging direction in modern search technology. In Handbook of Metaheuristics, Glover and Kochenberger, eds. Kluwer Academic, Dordrecht, 457--474.
 
22
Castiello, C., Castellano, G., and Fanelli, A. M. 2005. Meta-Data: Characterization of input features for meta-learning. Lecture Notes in Artificial Intelligence, vol. 3558, 457--468.
 
23
 
24
 
25
Desjardins, M and Gordon, D. F. 1995. Special issue on bias evaluation and selection. Mach. Learn. 20, 1--2.
 
26
Duch, W. and Grudzinski, K. 2001. Meta-Learning: Searching in the model space. In Proceedings of the International Conference on Neural Information Processing (ICONIP) I, 235--240.
27
 
28
 
29
Furnkranz, J. and Petrak, J. 2001. An evaluation of landmarking variants. In Proceedings of the ECML/PKDD Workshop on Integrating Aspects of Data Mining, Decision Support and Meta-Learning (IDDM), C. Giraud-Carrier et al., eds., 57--68.
 
30
 
31
 
32
 
33
Glover, F. and Kochenberger, G. 2003. Handbook of Metaheuristics. Kluwer Academic, Dordrecht.
 
34
Gnanadesikan, R. 1997. Methods for Statistical Data Analysis of Multivariate Observations, 2nd ed. Wiley, New York.
 
35
 
36
 
37
 
38
 
39
Hyndman, R. 2002. Time Series Data Library, available from http://www-personal.buseco.monash.edu.au/~hyndman/TSDL/.
 
40
Jones, N. and Pevzner, P. 2004. An Introduction to Bioinformatics Algorithms. MIT Press, Cambridge, MA.
 
41
 
42
Kalousis, A. and Hilario, M. 2001. Model selection via meta-learning. International. Journal on AI Tools. 10, 4.
 
43
Kalousis, A. and Theoharis, T. 1999. Design, implementation and performance results of an intelligent assistant for classifier selection. Intell. Data Anal. 3, 5, 319--337.
 
44
Keogh, E. and Folias, T. 2002. The UCR Time Series Data Mining Archive. http://www.cs.ucr.edu/~eamonn/TSDMA/.
 
45
Knowles, J. D. and Corne, D. W. 2002. Towards landscape analysis to inform the design of a hybrid local search for the multiobjective quadratic assignment problem. In Soft Computing Systems: Design, Management and Applications, A. Abraham et al., eds. IOS Press, Amsterdam, 271--279.
 
46
 
47
Kopf, C., Taylor, C., and Keller, J. 2000. Meta-Analysis: From data characterisation for meta-learning to meta-regression. In Proceedings of the PKDD Workshop on Data Mining, Decision Support, Meta-Learning and ILP, P. Brazdil and A. Jorge, eds.
 
48
Kuba, P., Brázdil, P., Soares, C., and Woznica, A. 2002. Exploiting sampling and meta-learning for parameter setting for support vector machines. In Proceedings of the Workshop Learning and Data Mining Associated with Iberamia VIII Iberoamerican Conference on Artificial Intelligence. F. Herrera et al., eds., 209--216.
 
49
Lagoudakis, M., Littman, M., and Parr, R. 2001. Selecting the right algorithm. In Proceedings of the AAAI Fall Symposium Series: Using Uncertainty within Computation.
 
50
Ler, D., Koprinska, I., and Chawla, S. 2005. Utilising regression-based landmarkers within a meta-learning framework for algorithm selection. In Proceedings of the Workshop on Meta-Learning, 22nd International Conference on Machine Learning (ICML), 44--51.
 
51
 
52
Leyton-Brown, K., Nudelman, E., Andrew, G., Mcfadden, J., and Shoham, Y. 2003a. A portfolio approach to algorithm selection. In Proceedings of the International Joint Conference on Artificial Intelligence, 1542--1543.
 
53
Leyton-Brown, K., Nudelman, E., Andrew, G., Mcfadden, J., and Shoham, Y. 2003b. Boosting as a metaphor for algorithm design. In Proceedings of the Conference on Principles and Practice of Constraint Programming, 899--903.
 
54
 
55
Linder, C. and Studer, R. 1999. AST: Support for algorithm selection with a CBR approach. In Proceedings of the 16th International Conference on Machine Learning.
 
56
Makridakis, S. and Hibon, M. 2000. The M3-competition: Results, conclusions and implications. Int. J. Forecast. 16, 4, 451--476.
 
57
Marom, Y. Zukerman, I., and Japkowicz, N. 2007. A meta-learning approach for selecting between response automation strategies in a help-desk domain. In Proceedings of the 22nd AAAI Conference on Artificial Intelligence, 907--912.
 
58
 
59
 
60
 
61
Nudelman, E., Leyton-Brown, K., Hoos, H., Devkar, A., and Shoham, Y. 2004. Understanding random SAT: Beyond the clauses-to-variables ratio. Lecture Notes in Computer Science, vol. 3258, 438--452.
 
62
Opitz, D. and Maclin, R. 1999. Popular ensemble methods: An empirical study. J. Artif. Intell. Res. 11, 169--198.
 
63
 
64
Peterson, A. H. and Martinez, T. R. 2005. Estimating the potential for combining learning models. In Proceedings of the ICML Workshop on Meta-Learning, 68--75.
 
65
 
66
Prodromidis, A. L., Chan, P., and Stolfo, S. J. 2000. Meta-Learning in distributed data mining systems: Issues and approaches. In Advances of Distributed Data Mining, H. Kargupta and P. Chan, eds. AAAI Press.
 
67
Prudêncio, R. and Ludermir, T. 2004. Meta-Learning approaches to selecting time-series models. Neurocomput. 61, 121--137.
 
68
 
69
Rice, J. R. 1953. Classes of recursively enumerable sets and their decision problems. Trans. Amer. Math. Soc. 89, 29--59.
 
70
Rice, J. R. 1976. The algorithm selection problem. Advances in Comput. 15, 65--118.
 
71
Samulowitz, H. and Memisevic, R. 2007. Learning to solve QBF. In Proceedings of the 22nd AAAI Conference on Artificial Intelligence, 255--260.
 
72
 
73
 
74
Shannon, C. E. 1948. A mathematical theory of communication. Bell Syst. Tech. J. 27 (Jul., Oct.), 379--423 and 623--656.
 
75
Slaney, J. and Walsh, T. 2001. Backbones in optimization and approximation. In Proceedings of the International Joint Conference on Artificial Intelligence.
 
76
Smith, K. A., Woo, F., Ciesielski, V., and Ibrahim, R. 2001. Modelling the relationship between problem characteristics and data mining algorithm performance using neural networks. In Smart Engineering System Design: Neural Networks, Fuzzy Logic, Evolutionary Programming, Data Mining, and Complex Systems, C. Dagli et al., eds. ASME Press, 11, 356--362.
 
77
Smith, K. A., Woo, F., Ciesielski, V., and Ibrahim, R. 2002. Matching data mining algorithm suitability to data characteristics using a self-organising map. In Hybrid Information Systems, A. Abraham and M. Koppen, M., eds. Physica, Heidelberg, 169--180.
 
78
Smith-Miles, K. A. 2008. Towards insightful algorithm selection for optimisation using meta-learning concepts. In Proceedings of the IEEE Joint Conference on Neural Networks. 4118--4124.
 
79
 
80
 
81
Streeter, M., Golovin, D., and Smith, S. F. 2007. Combining multiple heuristics online. In Proceedings of the 22nd AAAI Conference on Artificial Intelligence, 1197--1203.
 
82
Stützle, T. and Fernandes, S. 2004. New benchmark instances for the QAP and the experimental analysis of algorithms. Lecture Notes in Computer Science, vol. 3004, 199--209.
 
83
 
84
 
85
 
86
 
87
Venkatachalam, A. R. and Sohl, J. E. 1999. An intelligent model selection and forecasting system. Int. J. Forecast. 18, 3, 67--180.
 
88
 
89
Vollmann, T. E. and Buffa, E. S. 1966. The facility layout problem in perspective. Manage. Sci. 12, 10, 450--468.
 
90
Wallace, C. S. and Boulton D. M. 1968. An information measure for classification. Comput. J. 11, 2, 185--194.
 
91
 
92
Wang, X. 2005. Characteristic-Based forecasting for time series data, Ph.D. dissertation, Monash University, Australia.
 
93
 
94
 
95
Wolpert, D. and Macready, W. 1997. No free lunch theorems for optimization. IEEE Trans. Evol. Comput. 1, 67--82.
 
96
 
97
Xu, L., Hutter, F., Hoos, H., and Leyton-Brown, K. 2007a. Satzilla-07: The design and analysis of an algorithm portfolio for SAT. In Principles and Practices of Constraint Programming. Lecture Notes in Computer Science, 712--727.
 
98
Xu, L., Hoos, H., and Leyton-Brown, K. 2007b. Hierarchical hardness models for SAT. In Principles and Practices of Constraint Programming. Lecture Notes in Computer Science, 696--711.
 
99
Yang, J. and Jiu, B. 2006. Algorithm selection: A quantitative approach. Algorithmic Trading II: Precision, control, execution. http://www.itg.com/news_events/papers/AlgoSelection20/.


Collaborative Colleagues:
Kate A. Smith-Miles: colleagues