ACM Home Page
Please provide us with feedback. Feedback
A weighted finite state transducer implementation of the alignment template model for statistical machine translation
Full text PdfPdf (176 KB)
Source North American Chapter Of The Association For Computational Linguistics archive
Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology - Volume 1 table of contents
Edmonton, Canada
Pages: 63 - 70  
Year of Publication: 2003
Authors
Shankar Kumar  Johns Hopkins University, Baltimore, MD
William Byrne  Johns Hopkins University, Baltimore, MD
Publisher
Association for Computational Linguistics  Morristown, NJ, USA
Bibliometrics
Downloads (6 Weeks): 4,   Downloads (12 Months): 27,   Citation Count: 10
Additional Information:

abstract   references   cited by   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: 10.3115/1073445.1073464

ABSTRACT

We present a derivation of the alignment template model for statistical machine translation and an implementation of the model using weighted finite state transducers. The approach we describe allows us to implement each constituent distribution of the model as a weighted finite state transducer or acceptor. We show that bitext word alignment and translation under the model can be performed with standard FSM operations involving these transducers. One of the benefits of using this framework is that it obviates the need to develop specialized search procedures, even for the generation of lattices or N-Best lists of bitext word alignments and translation hypotheses. We evaluate the implementation of the model on the French-to-English Hansards task and report alignment and translation performance.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
 
3
G. Doddington. 2002. Automatic evaluation of machine translation quality using n-gram co-occurrence statistics. In Proc. of HLT 2002, San Diego, CA. USA.
 
4
 
5
D. Marcu and U. Germann, 2002. The ISI ReWrite Decoder Release 0.7.0b. http://www.isi.edu/licensedsw/rewrite-decoder/.
 
6
M. Mohri, F. Pereira, and M. Riley, 1997. ATT General-purpose finite-state machine software tools. http://www.research.att.com/sw/tools/fsm/.
 
7
M. Mohri, F. Pereira, and M. Riley. 2002. Weighted finite-state transducers in speech recognition. Computer Speech and Language, 16(1):69--88.
 
8
 
9
F. Och, C. Tillmann, and H. Ney. 1999. Improved alignment models for statistical machine translation. In Proc. of the Joint Conf. of Empirical Methods in Natural Language Processing and Very Large Corpora, pages 20--28, College Park, MD, USA.
 
10
F. Och. 2002. Statistical Machine Translation: From Single Word Models to Alignment Templates. Ph.D. thesis, RWTH Aachen, Germany.
 
11
K. Papineni, S. Roukos, T. Ward, and W. Zhu. 2001. Bleu: a method for automatic evaluation of machine translation. Technical Report RC22176 (W0109-022), IBM Research Division.
 
12
A. Stolcke. 2002. SRILM -- an extensible language modeling toolkit. In Proc. of the International Conference on Spoken Language Processing, pages 901--904, Denver, CO, USA. http://www.speech.sri.com/projects/srilm/.
 
13
 
14
F. Wessel, K. Macherey, and R. Schlueter. 1998. Using word probabilities as confidence measures. In Proc. of ICASSP-98, pages 225--228, Seattle, WA, USA.

CITED BY  10
 
 
 
 
 
 
 
 
Collaborative Colleagues:
Shankar Kumar: colleagues
William Byrne: colleagues