| Gaussian process classification for segmenting and annotating sequences |
| Full text |
Pdf
(204 KB)
|
| Source
|
ACM International Conference Proceeding Series; Vol. 69
archive
Proceedings of the twenty-first international conference on Machine learning
table of contents
Banff, Alberta, Canada
Page: 4
Year of Publication: 2004
ISBN:1-58113-828-5
|
|
Authors
|
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 7, Downloads (12 Months): 39, Citation Count: 2
|
|
|
ABSTRACT
Many real-world classification tasks involve the prediction of multiple, inter-dependent class labels. A prototypical case of this sort deals with prediction of a sequence of labels for a sequence of observations. Such problems arise naturally in the context of annotating and segmenting observation sequences. This paper generalizes Gaussian Process classification to predict multiple labels by taking dependencies between neighboring labels into account. Our approach is motivated by the desire to retain rigorous probabilistic semantics, while overcoming limitations of parametric methods like Conditional Random Fields, which exhibit conceptual and computational difficulties in high-dimensional input spaces. Experiments on named entity recognition and pitch accent prediction tasks demonstrate the competitiveness of our approach.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
Altun, Y., Hofmann, T., & Johnson, M. (2003a). Discriminative learning for label sequences via boosting. Advances in Neural Information Processing Systems.
|
| |
2
|
Altun, Y., Tsochantaridis, I., & Hofmann, T. (2003b). Hidden markov support vector machines. 20th International Conference on Machine Learning.
|
 |
3
|
|
| |
4
|
|
| |
5
|
|
| |
6
|
|
| |
7
|
Gibbs, M. N., & MacKay, D. J. C. (2000). Variational Gaussian Process Classifiers. IEEE-NN, 11, 1458.
|
| |
8
|
|
| |
9
|
Greenberg, S., Ellis, D., & Hollenback, J. (1996). Insights into spoken language gleaned from phonetic transcripti on of the Switchboard corpus. ICSLP96.
|
| |
10
|
Jaakkola, T. S., & Jordan, M. I. (1996). Computing upper and lower bounds on likelihoods in intractable networks. In Proc. of the Twelfth Conf. on UAI.
|
| |
11
|
Kimeldorf, G., & Wahba, G. (1971). A correspondence between Bayesian estimation and on stochastic processes and smoothing by splines. Annals of Math. Stat., 41(2), 495--502.
|
| |
12
|
|
| |
13
|
|
| |
14
|
|
| |
15
|
|
| |
16
|
Punyakanok, V., & Roth, D. (2000). The use of classifiers in sequential inference. Advances in Neural Information Processing Systems.
|
| |
17
|
Schölkopf, B., & Smola, A. J. (2002). Learning with kernels. MIT Press.
|
| |
18
|
Seeger, M., Lawrence, N. D., & Herbrich, R. (2003). Fast sparse Gaussian Process methods: The informative vector machine. Advances in Neural Information Processing Systems.
|
| |
19
|
Smola, A. J., & Bartlett, P. L. (2000). Sparse greedy Gaussian Process regression. Advances in Neural Information Processing Systems.
|
| |
20
|
|
| |
21
|
Taskar, B., Guestrin, C., & Koller, D. (2004). Max-margin markov networks. Advances in Neural Information Processing Systems.
|
| |
22
|
Weston, J., & Watkins, C. (1999). Support vector machines for multi-class pattern recognition. Proceedings European Symposium on Artificial Neural Networks.
|
| |
23
|
|
| |
24
|
Williams, C. K. I., & Seeger, M. (2000). Using the nystrom method to speed up kernel machines. Advances in Neural Information Processing Systems.
|
| |
25
|
Zhu, & Hastie, T. (2001). Kernel logistic regression and the import vector machine. Advances in Neural Information Processing Systems.
|
|