|
ABSTRACT
Is it possible to predict how long an algorithm will take to solve a previously-unseen instance of an NP-complete problem? If so, what uses can be found for models that make such predictions? This article provides answers to these questions and evaluates the answers experimentally. We propose the use of supervised machine learning to build models that predict an algorithm's runtime given a problem instance. We discuss the construction of these models and describe techniques for interpreting them to gain understanding of the characteristics that cause instances to be hard or easy. We also present two applications of our models: building algorithm portfolios that outperform their constituent algorithms, and generating test distributions that emphasize hard problems. We demonstrate the effectiveness of our techniques in a case study of the combinatorial auction winner determination problem. Our experimental results show that we can build very accurate models of an algorithm's running time, interpret our models, build an algorithm portfolio that strongly outperforms the best single algorithm, and tune a standard benchmark suite to generate much harder problem instances.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
|
| |
3
|
|
| |
4
|
Achlioptas, D., Jia, H., and Moore, C. 2005. Hiding satisfying assignments: Two are better than one. J. Artif. Intell. Res. 24, 623--639.
|
| |
5
|
|
 |
6
|
Paul Beame , Richard Karp , Toniann Pitassi , Michael Saks, On the complexity of unsatisfiability proofs for random k-CNF formulas, Proceedings of the thirtieth annual ACM symposium on Theory of computing, p.561-571, May 24-26, 1998, Dallas, Texas, United States
[doi> 10.1145/276698.276870]
|
| |
7
|
|
 |
8
|
|
| |
9
|
Carchrae, T., and Beck, J. C. 2005. Applying machine learning to low knowledge control of optimization algorithms. Computat. Intell. 21, 4, 372--387.
|
| |
10
|
Chaloner, K., and Verdinelli, I. 1995. Bayesian experimental design: A review. Statist. Sci. 10, 273--304.
|
| |
11
|
Cheeseman, P., Kanefsky, B., and Taylor, W. M. 1991. Where the really hard problems are. In IJCAI: Proceedings of the International Joint Conference on Artificial Intelligence. 331--337.
|
| |
12
|
|
| |
13
|
|
| |
14
|
de Farias, D. P., and Megiddo, N. 2004. How to combine expert (or novice) advice when actions impact the environment. In NIPS: Proceedings of the Neural Information Processing Systems Conference.
|
| |
15
|
|
| |
16
|
Diao, Y., Eskesen, F., Proehlich, S., Hellerstein, J., Spainhower, L., and Surendra, M. 2003. Generic online optimization of multiple configuration parameters with application to a database server. Proceedings of the IFIP/IEEE International Workshop on Distributed Systems: Operations and Man- agement.
|
| |
17
|
Doucet, A., de Freitas, N., and Gorden, N. (Eds.) 2001. Sequential Monte Carlo Methods in Practice. Springer-Verlag.
|
| |
18
|
|
| |
19
|
Olivier Dubois , Yacine Boufkhad , Jacques Mandler, Typical random 3-SAT formulae and the satisfiability threshold, Proceedings of the eleventh annual ACM-SIAM symposium on Discrete algorithms, p.126-127, January 09-11, 2000, San Francisco, California, United States
|
| |
20
|
Efron, B., Hastie, T., Johnstone, I., and Tibshirani, R. 2004. Least angle regression. Ann. Stat. 32, 407--451.
|
| |
21
|
|
| |
22
|
|
| |
23
|
Friedman, J. 1991. Multivariate adaptive regression splines. Ann. Stat. 19, 1, 1--141.
|
| |
24
|
|
| |
25
|
|
| |
26
|
|
| |
27
|
Gebruers, C., and Guerri, A. 2004. Machine learning for portfolio selection using structure at the instance level. In Principles and Practice of Constraint Programming, Doctoral Consortium.
|
| |
28
|
Gebruers, C., Guerri, A., Hnich, B., and Milano, M. 2004. Making choices using structure at the instance level within a case based reasoning framework. In CPAIOR: Proceedings of the International Conference on Integration of AI and OR Techniques in Constraint Programming for Combinatorial Optimization Problems. 380--386.
|
| |
29
|
Gebruers, C., Hnich, B., Bridge, D., and Freuder, E. 2005. Using CBR to select solution strategies in constraint programming. In ICCBR: Proceedings of the International Conference on Case-Based Reasoning. 222--236.
|
 |
30
|
|
| |
31
|
Goldszmidt, M. 2007. Making life better one large system at a time: Challenges for UAI research. In UAI: Proceedings of the Conference on Uncertainty in Artificial Intelligence.
|
| |
32
|
Gomes, C., Fernández, C., Selman, B., and Bessière, C. 2004. Statistical regimes across constrainedness regions. In CP: Proceedings of the International Conference on Principles and Practice of Constraint Programming.
|
| |
33
|
|
| |
34
|
|
| |
35
|
Gomes, C. P., and Selman, B. 1997. Problem structure in the presence of perturbations. In AAAI: Proceedings of the AAAI Conference on Artif. Intell. 221--226.
|
 |
36
|
|
| |
37
|
Gonen, R., and Lehmann, D. 2001. Linear programming helps solving large multi-unit combinatorial auctions. Tech. Rep. TR-2001-8, Leibniz Center for Research in Computer Science. April.
|
| |
38
|
Guo, H., and Hsu, W. 2007. A machine learning approach to algorithm selection for NP-hard optimization problems. Ann. Oper. Res. 156, 1 (Dec.), 61--82.
|
| |
39
|
Hastie, T., Tibshirani, R., and Friedman, J. 2001. Elements of Statistical Learning. Springer-Verlang, Berlin, Germany.
|
| |
40
|
|
| |
41
|
|
| |
42
|
|
| |
43
|
Eric Horvitz , Yongshao Ruan , Carla P. Gomes , Henry A. Kautz , Bart Selman , David Maxwell Chickering, A Bayesian Approach to Tackling Hard Computational Problems, Proceedings of the 17th Conference in Uncertainty in Artificial Intelligence, p.235-244, August 02-05, 2001
|
| |
44
|
Huberman, B., Lukose, R., and Hogg, T. 1997. An economics approach to hard computational problems. Science 265, 51--54.
|
| |
45
|
Hutter, F., Hamadi, Y., Hoos, H. H., and Leyton-Brown, K. 2006. Performance prediction and automated tuning of randomized and parametric algorithms. In CP: Proceedings of the International Conference on Principles and Practice of Constraint Programming. Lecture Notes in Computer Science, vol. 4204. Springer-Verlag, Berlin, Germany, 213--228.
|
| |
46
|
Jia, H., Moore, C., and Selman, B. 2004. From spin glasses to hard satisfiable formulas. In SAT: Proceedings of the International Conference on Theory and Applications of Satisfiability Testing. 199--210.
|
| |
47
|
Jia, H., Moore, C., and Strain, D. 2007. Generating hard satisfiable formulas by hiding solutions deceptively. J. Artif. Intell. Res. 28, 107--118.
|
| |
48
|
Knuth, D. 1975. Estimating the efficiency of backtrack programs. Math. Comput. 29, 129, 121--136.
|
| |
49
|
|
| |
50
|
Kolaitis, P. 2003. Constraint satisfaction, databases and logic. In IJCAI: Proceedings of the International Joint Conference on Artificial Intelligence. 1587--1595.
|
| |
51
|
|
| |
52
|
|
| |
53
|
Lagoudakis, M., and Littman, M. 2001. Learning to select branching rules in the DPLL procedure for satisfiability. In SAT: Proceedings of the International Conference on Theory and Applications of Satisfiability Testing. 344--359.
|
| |
54
|
Leyton-Brown, K., Nudelman, E., Andrew, G., McFadden, J., and Shoham, Y. 2003a. Boosting as a metaphor for algorithm design. In CP: Proceedings of the International Conference on Principles and Practice of Constraint Programming. 899--903.
|
| |
55
|
Leyton-Brown, K., Nudelman, E., Andrew, G., McFadden, J., and Shoham, Y. 2003b. A portfolio approach to algorithm selection. In IJCAI: Proceedings of the International Joint Conference on Artificial Intelligence. 1542--1543.
|
| |
56
|
|
| |
57
|
Leyton-Brown, K., Nudelman, E., and Shoham, Y. 2006. Empirical hardness models for combinatorial auctions. In Combinatorial Auctions. MIT Press, Cambridge, Chap. 19, 479--504.
|
 |
58
|
Kevin Leyton-Brown , Mark Pearson , Yoav Shoham, Towards a universal test suite for combinatorial auction algorithms, Proceedings of the 2nd ACM conference on Electronic commerce, p.66-76, October 17-20, 2000, Minneapolis, Minnesota, United States
[doi> 10.1145/352871.352879]
|
| |
59
|
|
| |
60
|
|
| |
61
|
Mason, R. L., Gunst, R. F., and Hess, J. L. 2003. Statistical Design and Analysis of Experiments. Wiley-Interscience, New York.
|
| |
62
|
Monasson, R., Zecchina, R., Kirkpatrick, S., Selman, B., and Troyansky, L. 1998. Determining computational complexity for characteristic ‘phase transitions’. Nature 400, 133--137.
|
 |
63
|
|
| |
64
|
Nudelman, E., Leyton-Brown, K., Devkar, A., Shoham, Y., and Hoos, H. H. 2004a. SATzilla: An algorithm portfolio for SAT. In Proceedings of the International Conference on Theory and Applications of Satisfiability Testing, SAT 2004 Competition: Solver Descriptions. 13--14.
|
| |
65
|
Nudelman, E., Leyton-Brown, K., Devkar, A., Shoham, Y., and Hoos, H. H. 2004b. Understanding random SAT: Beyond the clauses-to-variables ratio. In CP: Proceedings of the International Conference on Principles and Practice of Constraint Programming. 438--452.
|
| |
66
|
Rice, J. R. 1976. The algorithm selection problem. Adv. Comput. 15, 65--118.
|
| |
67
|
|
| |
68
|
|
| |
69
|
|
| |
70
|
|
| |
71
|
Sandholm, T., Suri, S., Gilpin, A., and Levine, D. 2001. CABOB: A fast optimal algorithm for combinatorial auctions. In IJCAI: Proceedings of the International Joint Conference on Artificial Intelligence. 1102--1108.
|
| |
72
|
Sandholm, T., Suri, S., Gilpin, A., and Levine, D. 2005. CABOB: A fast optimal algorithm for winner determination in combinatorial auctions. Manage. Sci. 51, 3, 374--390.
|
| |
73
|
|
| |
74
|
Schmee, J., and Hahn, G. J. 1979. A simple method for regression analysis with censored data. Technometrics 21, 4, 417--432.
|
| |
75
|
|
| |
76
|
Slaney, J., and Walsh, T. 2001. Backbones in optimization and approximation. In IJCAI: Proceedings of the International Joint Conference on Artificial Intelligence. 254--259.
|
| |
77
|
Streeter, M., Golovin, D., and Smith, S. F. 2007. Combining multiple heuristics online. In AAAI: Proceedings of the AAAI Conference on Artificial Intelligence. 1197--1203.
|
| |
78
|
Williams, R., Gomes, C., and Selman, B. 2003. Backdoors to typical case complexity. In IJCAI: Proceedings of the International Joint Conference on Artificial Intelligence. 1173--1178.
|
| |
79
|
Xu, L., Hoos, H. H., and Leyton-Brown, K. 2007a. Hierarchical hardness models for SAT. In CP: Proceedings of the International Conference on Principles and Practice of Constraint Programming. Lecture Notes in Computer Science, vol. 4741. Springer-Verlag, Berlin, Germany, 696--711.
|
| |
80
|
Xu, L., Hutter, F., Hoos, H. H., and Leyton-Brown, K. 2007b. SATzilla-07: The design and analysis of an algorithm portfolio for SAT. In CP: Proceedings of the International Conference on Principles and Practice of Constraint Programming. Lecture Notes in Computer Science, vol. 4741. Springer-Verlag, Berlin, Germany, 712--727.
|
| |
81
|
Xu, L., Hutter, F., Hoos, H. H., and Leyton-Brown, K. 2008. SATzilla: Portfolio-based algorithm selection for SAT. J. Artif. Intell. Res. 32, 565--606.
|
| |
82
|
Zhang, W. 1999. State-Space Search: Algorithms, Complexity, Extensions, and Applications. Springer-Verlag, Berlin, Germany.
|
| |
83
|
|
| |
84
|
Zheng, A., Jordan, M., Liblit, B., and Aiken, A. 2003. Statistical debugging of sampled programs. In NIPS: Proceedings of the Neural Information Processing Systems Conference.
|
|