ACM Home Page
Please provide us with feedback. Feedback
Nonmyopic active learning of Gaussian processes: an exploration-exploitation approach
Full text PdfPdf (370 KB)
Source ICML; Vol. 227 archive
Proceedings of the 24th international conference on Machine learning table of contents
Corvalis, Oregon
Pages: 449 - 456  
Year of Publication: 2007
ISBN:978-1-59593-793-3
Authors
Andreas Krause  Carnegie Mellon University, Pittsburgh, PA
Carlos Guestrin  Carnegie Mellon University, Pittsburgh, PA
Sponsor
: Machine Learning Journal
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 6,   Downloads (12 Months): 56,   Citation Count: 3
Additional Information:

abstract   references   cited by   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1273496.1273553
What is a DOI?

ABSTRACT

When monitoring spatial phenomena, such as the ecological condition of a river, deciding where to make observations is a challenging task. In these settings, a fundamental question is when an active learning, or sequential design, strategy, where locations are selected based on previous measurements, will perform significantly better than sensing at an a priori specified set of locations. For Gaussian Processes (GPs), which often accurately model spatial phenomena, we present an analysis and efficient algorithms that address this question. Central to our analysis is a theoretical bound which quantifies the performance difference between active and a priori design strategies. We consider GPs with unknown kernel parameters and present a nonmyopic approach for trading off exploration, i.e., decreasing uncertainty about the model parameters, and exploitation, i.e., near-optimally selecting observations when the parameters are (approximately) known. We discuss several exploration strategies, and present logarithmic sample complexity bounds for the exploration phase. We then extend our algorithm to handle nonstationary GPs exploiting local structure in the model. We also present extensive empirical evaluation on several real-world problems.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
2
Caselton, W., & Zidek, J. (1984). Optimal monitoring network designs. Statist. Prob. Lett., 2, 223--227.
 
3
Castro, R., Willett, R., & Nowak, R. (2005). Faster rates in regression via active learning. NIPS.
 
4
 
5
Cressie, N. A. (1991). Statistics for spatial data. Wiley.
 
6
 
7
Gretton, A., Borgwardt, K., Rasch, M., Schlkopf, B., & Smola, A. (2006). A kernel method for the two-sample-problem. NIPS.
8
 
9
Harmon, T. C., Ambrose, R. F., Gilbert, R. M., Fisher, J. C., Stealey, M., & Kaiser, W. J. (2006). High resolution river hydraulic and water quality characterization using rapidly deployable networked infomechanical systems (nims rd) (Technical Report 60). CENS.
 
10
Ko, C., Lee, J., & Queyranne, M. (1995). An exact algorithm for maximum entropy sampling. Ops Res, 43.
 
11
Koller, D., & Friedman, N. (2007). Structured probabilistic models. Electronic Preprint.
 
12
Krause, A., & Guestrin, C. (2007). Nonmyopic active learning of gaussian processes: An exploration-exploitation approach (Techn. Report CMU-ML-07-105).
 
13
Nott, D. J., & Dunsmuir, W. T. M. (2002). Estimation of nonstationary spatial covariance structure. Biomet., 89.
 
14
 
15
Paciorek, C. (2003). Nonstationary gaussian processes for regression and spatial mod. Doctoral dissertation, CMU.
 
16
 
17
Seo, S., Wallat, M., Graepel, T., & Obermayer, K. (2000). Gaussian process regression: Active data selection and test point rejection. IJCNN (pp. 241--246).
 
18
Shewry, M., & Wynn, H. (1987). Maximum entropy sampling. Journal of Applied Statistics, 14, 165--170.
 
19
Storkey, A. J. (99). Truncated covariance matrices and toeplitz methods in gaussian processes. ICANN.
 
20
Tresp, V. (2000). Mixtures of gaussian processes. NIPS (pp. 654--660).
 
21
Zhu, Z., & Stein, M. L. (2006). Spatial sampling design for prediction with estimated parameters. J Agric., Biol. Env. Statist., 11, 24--49.

Collaborative Colleagues:
Andreas Krause: colleagues
Carlos Guestrin: colleagues