|
ABSTRACT
When monitoring spatial phenomena, such as the ecological condition of a river, deciding where to make observations is a challenging task. In these settings, a fundamental question is when an active learning, or sequential design, strategy, where locations are selected based on previous measurements, will perform significantly better than sensing at an a priori specified set of locations. For Gaussian Processes (GPs), which often accurately model spatial phenomena, we present an analysis and efficient algorithms that address this question. Central to our analysis is a theoretical bound which quantifies the performance difference between active and a priori design strategies. We consider GPs with unknown kernel parameters and present a nonmyopic approach for trading off exploration, i.e., decreasing uncertainty about the model parameters, and exploitation, i.e., near-optimally selecting observations when the parameters are (approximately) known. We discuss several exploration strategies, and present logarithmic sample complexity bounds for the exploration phase. We then extend our algorithm to handle nonstationary GPs exploiting local structure in the model. We also present extensive empirical evaluation on several real-world problems.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
|
| |
2
|
Caselton, W., & Zidek, J. (1984). Optimal monitoring network designs. Statist. Prob. Lett., 2, 223--227.
|
| |
3
|
Castro, R., Willett, R., & Nowak, R. (2005). Faster rates in regression via active learning. NIPS.
|
| |
4
|
|
| |
5
|
Cressie, N. A. (1991). Statistics for spatial data. Wiley.
|
| |
6
|
|
| |
7
|
Gretton, A., Borgwardt, K., Rasch, M., Schlkopf, B., & Smola, A. (2006). A kernel method for the two-sample-problem. NIPS.
|
 |
8
|
|
| |
9
|
Harmon, T. C., Ambrose, R. F., Gilbert, R. M., Fisher, J. C., Stealey, M., & Kaiser, W. J. (2006). High resolution river hydraulic and water quality characterization using rapidly deployable networked infomechanical systems (nims rd) (Technical Report 60). CENS.
|
| |
10
|
Ko, C., Lee, J., & Queyranne, M. (1995). An exact algorithm for maximum entropy sampling. Ops Res, 43.
|
| |
11
|
Koller, D., & Friedman, N. (2007). Structured probabilistic models. Electronic Preprint.
|
| |
12
|
Krause, A., & Guestrin, C. (2007). Nonmyopic active learning of gaussian processes: An exploration-exploitation approach (Techn. Report CMU-ML-07-105).
|
| |
13
|
Nott, D. J., & Dunsmuir, W. T. M. (2002). Estimation of nonstationary spatial covariance structure. Biomet., 89.
|
| |
14
|
|
| |
15
|
Paciorek, C. (2003). Nonstationary gaussian processes for regression and spatial mod. Doctoral dissertation, CMU.
|
| |
16
|
|
| |
17
|
Seo, S., Wallat, M., Graepel, T., & Obermayer, K. (2000). Gaussian process regression: Active data selection and test point rejection. IJCNN (pp. 241--246).
|
| |
18
|
Shewry, M., & Wynn, H. (1987). Maximum entropy sampling. Journal of Applied Statistics, 14, 165--170.
|
| |
19
|
Storkey, A. J. (99). Truncated covariance matrices and toeplitz methods in gaussian processes. ICANN.
|
| |
20
|
Tresp, V. (2000). Mixtures of gaussian processes. NIPS (pp. 654--660).
|
| |
21
|
Zhu, Z., & Stein, M. L. (2006). Spatial sampling design for prediction with estimated parameters. J Agric., Biol. Env. Statist., 11, 24--49.
|
|