A weighted finite state transducer implementation of the alignment template model for statistical machine translation
Source: North American Chapter of the Association for Computational Linguistics
Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology - Volume 1
Edmonton, Canada
Pages: 63 - 70
Year of Publication: 2003
Publisher: Association for Computational Linguistics, Morristown, NJ, USA
ABSTRACT
We present a derivation of the alignment template model for statistical machine translation and an implementation of the model using weighted finite state transducers. The approach we describe allows us to implement each constituent distribution of the model as a weighted finite state transducer or acceptor. We show that bitext word alignment and translation under the model can be performed with standard FSM operations involving these transducers. One of the benefits of using this framework is that it obviates the need to develop specialized search procedures, even for the generation of lattices or N-Best lists of bitext word alignments and translation hypotheses. We evaluate the implementation of the model on the French-to-English Hansards task and report alignment and translation performance.
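As a rough illustration of the idea that translation can be carried out with standard FSM operations, the following minimal sketch composes three toy machines and extracts the lowest-cost path. It is not the authors' implementation: the `WFST` class, the `compose` and `shortest_path` helpers, the vocabulary, and all costs are hypothetical, and the sketch assumes epsilon-free machines with non-negative costs in the tropical (min, +) semiring.

```python
import heapq
from collections import defaultdict


class WFST:
    """Epsilon-free weighted transducer over the tropical semiring (min, +)."""

    def __init__(self, start, finals, arcs):
        self.start = start
        self.finals = dict(finals)      # state -> final cost
        self.arcs = defaultdict(list)   # state -> [(ilabel, olabel, cost, next)]
        for src, i, o, w, dst in arcs:
            self.arcs[src].append((i, o, w, dst))


def compose(a, b):
    """Compose a and b: match a's output labels with b's input labels."""
    start = (a.start, b.start)
    finals, arcs = {}, []
    stack, seen = [start], {start}
    while stack:
        state = stack.pop()
        qa, qb = state
        if qa in a.finals and qb in b.finals:
            finals[state] = a.finals[qa] + b.finals[qb]
        for i, mid, w1, na in a.arcs.get(qa, []):
            for mid2, o, w2, nb in b.arcs.get(qb, []):
                if mid == mid2:
                    nxt = (na, nb)
                    arcs.append((state, i, o, w1 + w2, nxt))
                    if nxt not in seen:
                        seen.add(nxt)
                        stack.append(nxt)
    return WFST(start, finals, arcs)


def shortest_path(t):
    """Dijkstra search; returns (cost, input labels, output labels) or None."""
    counter = 0                          # tie-breaker so states are never compared
    heap = [(0.0, counter, t.start, (), ())]
    done = set()
    while heap:
        cost, _, q, ins, outs = heapq.heappop(heap)
        if q is None:                    # None marks "accepted at a final state"
            return cost, list(ins), list(outs)
        if q in done:
            continue
        done.add(q)
        if q in t.finals:
            counter += 1
            heapq.heappush(heap, (cost + t.finals[q], counter, None, ins, outs))
        for i, o, w, nxt in t.arcs.get(q, []):
            counter += 1
            heapq.heappush(heap, (cost + w, counter, nxt, ins + (i,), outs + (o,)))
    return None


# Hypothetical toy machines; costs stand in for negative log probabilities.
sentence = WFST(0, {2: 0.0}, [(0, "la", "la", 0.0, 1),
                              (1, "maison", "maison", 0.0, 2)])
lexicon = WFST(0, {0: 0.0}, [(0, "la", "the", 0.5, 0),
                             (0, "maison", "house", 0.5, 0),
                             (0, "maison", "home", 0.25, 0)])
language_model = WFST(0, {2: 0.0}, [(0, "the", "the", 0.0, 1),
                                    (1, "house", "house", 0.5, 2),
                                    (1, "home", "home", 2.0, 2)])

lattice = compose(compose(sentence, lexicon), language_model)
cost, source_words, hypothesis = shortest_path(lattice)
print(cost, hypothesis)
```

In this toy setup the lexicon alone prefers "home", but the composed lattice also carries the language model cost, so the best path can differ. The composed machine is itself a lattice of hypotheses, which is why no specialized search procedure is needed beyond composition and a shortest-path search.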