|
Warning: The download time has expired please click on the item to try again.
ABSTRACT
We describe an extension of Earley's parser for stochastic context-free grammars that computes the following quantities given a stochastic context-free grammar and an input string: a) probabilities of successive prefixes being generated by the grammar; b) probabilities of substrings being generated by the nonterminals, including the entire string being generated by the grammar; c) most likely (Viterbi) parse of the string; d) posterior expected number of applications of each grammar production, as required for reestimating rule probabilities. Probabilities (a) and (b) are computed incrementally in a single left-to-right pass over the input. Our algorithm compares favorably to standard bottom-up parsing methods for SCFGs in that it works efficiently on sparse grammars by making use of Earley's top-down control structure. It can process any context-free rule format without conversion to some normal form, and combines computations for (a) through (d) in a single algorithm. Finally, the algorithm has simple extensions for processing partially bracketed inputs, and for finding partial parses and their likelihoods on ungrammatical inputs.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
Bahl, Lalit R.; Jelinek, Frederick; and Mercer, Robert L. (1983). "A maximum likelihood approach to continuous speech recognition." IEEE Transactions on Pattern Analysis and Machine Intelligence, 5(2), 179--190.
|
| |
3
|
Baker, James K. (1979). "Trainable grammars for speech recognition." In Speech Communication Papers for the 97th Meeting of the Acoustical Society of America, edited by Jared J. Wolf and Dennis H. Klatt, 547--550.
|
| |
4
|
Baum, Leonard E.; Petrie, Ted; Soules, George; and Weiss, Norman (1970). "A maximization technique occurring in the statistical analysis of probabilistic functions in Markov chains." The Annals of Mathematical Statistics, 41(1), 164--171.
|
| |
5
|
Booth, Taylor L., and Thompson, Richard A. (1973). "Applying probability measures to abstract languages." IEEE Transactions on Computers, C-22(5), 442--450.
|
| |
6
|
|
| |
7
|
|
| |
8
|
|
| |
9
|
Dempster, A. P.; Laird, N. M.; and Rubin, D. B. (1977). "Maximum likelihood from incomplete data via the EM algorithm." Journal of the Royal Statistical Society, Series B, 34, 1--38.
|
 |
10
|
|
| |
11
|
Fujisaki, T.; Jelinek, F.; Cocke, J.; Black, E.; and Nishino, T. (1991). "A probabilistic parsing method for sentence disambiguation." In Current Issues in Parsing Technology, edited by Masaru Tomita, 139--152. Kluwer Academic Publishers.
|
 |
12
|
|
| |
13
|
Jelinek, Frederick (1985). "Markov source modeling of text generation." In The Impact of Processing Techniques on Communications, Volume E91, NATO ASI Series, edited by J. K. Skwirzynski, 569--598. Nijhoff.
|
| |
14
|
|
| |
15
|
Jelinek, Frederick; Lafferty, John D.; and Mercer, Robert L. (1992). "Basic methods of probabilistic context free grammars." In Speech Recognition and Understanding. Recent Advances, Trends, and Applications, Volume F75, NATO ASI Series, edited by Pietro Laface and Renato De Mori, 345--360. Springer Verlag.
|
| |
16
|
Jones, Mark A., and Eisner, Jason M. (1992). "A probabilistic parser and its applications." In AAAI Workshop on Statistically-Based NLP Techniques, 20--27.
|
| |
17
|
Jurafsky, Daniel; Wooters, Chuck; Segal, Jonathan; Stolcke, Andreas; Fosler, Eric; Tajchman, Gary; and Morgan, Nelson (1995). "Using a stochastic context-free grammar as a language model for speech recognition." In Proceedings, IEEE Conference on Acoustics, Speech and Signal Processing, 189--192. Detroit, Michigan.
|
| |
18
|
Jurafsky, Daniel; Wooters, Chuck; Tajchman, Gary; Segal, Jonathan; Stolcke, Andreas; Fosler, Eric; and Morgan, Nelson (1994). "The Berkeley restaurant project." In Proceedings, International Conference on Spoken Language Processing, 4, 2139--2142. Yokohama, Japan.
|
| |
19
|
Kupiec, Julian (1992). "Hidden Markov estimation for unrestricted stochastic context-free grammars." In Proceedings, IEEE Conference on Acoustics, Speech and Signal Processing, 1, 177--180, San Francisco, California.
|
| |
20
|
Lari, K., and Young, S. J. (1990). "The estimation of stochastic context-free grammars using the Inside-Outside algorithm." Computer Speech and Language, 4, 35--56.
|
| |
21
|
Lari, K., and Young, S. J. (1991). "Applications of stochastic context-free grammars using the Inside-Outside algorithm." Computer Speech and Language, 5, 237--257.
|
| |
22
|
Magerman, David M., and Marcus, Mitchell P. (1991). "Pearl: A probabilistic chart parser." In Proceedings, Second International Workshop on Parsing Technologies. Cancun, Mexico, 193--199.
|
| |
23
|
|
| |
24
|
Nakagawa, Sei-ichi (1987). "Spoken sentence recognition by time-synchronous parsing algorithm of context-free grammar." In Proceedings, IEEE Conference on Acoustics, Speech and Signal Processing, 2, 829--832. Dallas, Texas.
|
| |
25
|
Ney, Hermann (1992). "Stochastic grammars and pattern recognition." In Speech Recognition and Understanding. Recent Advances, Trends, and Applications, volume F75 of NATO ASI Series, edited by Pietro Laface and Renato De Mori, 319--344. Springer Verlag.
|
| |
26
|
|
| |
27
|
|
| |
28
|
|
| |
29
|
Rabiner, L. R., and Juang, B. H. (1986). "An introduction to hidden Markov models." IEEE ASSP Magazine, 3(1), 4--16.
|
| |
30
|
Schabes, Yves (1991). "An inside-outside algorithm for estimating the parameters of a hidden stochastic context-free grammar based on Earley's algorithm." Paper presented at the Second Workshop on Mathematics of Language, Tarrytown, New York.
|
| |
31
|
|
| |
32
|
Stolcke, Andreas (1993). "An efficient probabilistic context-free parsing algorithm that computes prefix probabilities." Technical Report TR-93-065, International Computer Science Institute, Berkeley, CA. Revised 1994.
|
| |
33
|
|
| |
34
|
Wright, J. H. (1990). "LR parsing of probabilistic grammars with input uncertainty for speech recognition." Computer Speech and Language, 4, 297--323.
|
CITED BY 31
|
|
Giancarlo Mauri , Giulio Pavesi , Antonio Piccolboni, Approximation algorithms for protein folding prediction, Proceedings of the tenth annual ACM-SIAM symposium on Discrete algorithms, p.945-946, January 17-19, 1999, Baltimore, Maryland, United States
|
|
|
Alexander Krotov , Mark Hepple , Robert Gaizauskas , Yorick Wilks, Compacting the Penn Treebank grammar, Proceedings of the 17th international conference on Computational linguistics, August 10-14, 1998, Montreal, Quebec, Canada
|
|
|
|
|
|
|
|
|
Scott Miller , David Stallard , Robert Bobrow , Richard Schwartz, A fully statistical approach to natural language interfaces, Proceedings of the 34th annual meeting on Association for Computational Linguistics, p.55-61, June 24-27, 1996, Santa Cruz, California
|
|
|
|
|
|
|
|
|
Mark-Jan Nederhof , Anoop Sarkar , Giorgio Satta, Prefix probabilities from stochastic Tree Adjoining Grammars, Proceedings of the 17th international conference on Computational linguistics, p.953-959, August 10-14, 1998, Montreal, Quebec, Canada
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Amit Dubey , Frank Keller , Patrick Sturt, Integrating syntactic priming into an incremental probabilistic parser, with an application to psycholinguistic modeling, Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the ACL, p.417-424, July 17-18, 2006, Sydney, Australia
|
|
|
Jason Eisner , Eric Goldlust , Noah A. Smith, Compiling Comp Ling: practical weighted dynamic programming and the Dyna language, Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing, p.281-290, October 06-08, 2005, Vancouver, British Columbia, Canada
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Gal Lavee , Ehud Rivlin , Michael Rudzsky, Understanding video events: a survey of methods for automatic interpretation of semantic occurrences in video, IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews, v.39 n.5, p.489-504, September 2009
|
|