ACM Home Page
Please provide us with feedback. Feedback
Towards a unified approach to memory- and statistical-based machine translation
Full text PdfPdf (101 KB)
Source Annual Meeting of the ACL archive
Proceedings of the 39th Annual Meeting on Association for Computational Linguistics table of contents
Toulouse, France
Pages: 386 - 393  
Year of Publication: 2001
Author
Daniel Marcu  University of Southern California, Marina del Rey, CA
Publisher
Association for Computational Linguistics  Morristown, NJ, USA
Bibliometrics
Downloads (6 Weeks): 2,   Downloads (12 Months): 20,   Citation Count: 7
Additional Information:

abstract   references   cited by   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: 10.3115/1073012.1073062

ABSTRACT

We present a set of algorithms that enable us to translate natural language sentences by exploiting both a translation memory and a statistical-based translation model. Our results show that an automatically derived translation memory can be used within a statistical framework to often find translations of higher probability than those found using solely a statistical model. The translations produced using both the translation memory and the statistical model are significantly better than translations produced by two commercial systems: our hybrid system translated perfectly 58% of the 505 sentences in a test collection, while the commercial systems translated perfectly only 40-42% of them.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Yaser Al-Onaizan, Jan Curin, Michael Jahr, Kevin Knight, John Lafferty, Dan Melamed, Franz-Josef Och, David Purdy, Noah A. Smith, and David Yarowsky. 1999. Statistical machine translation. Final Report, JHU Summer Workshop.
 
2
 
3
Ralph D. Brown. 1999. Adding linguistic knowledge to a lexical example-based translation system. In Proceedings of TMI'99, pages 22--32, Chester, England.
 
4
A. P. Dempster, N. M. Laird, and D. B. Rubin. 1977. Maximum likelihood from incomplete data via the em algorithm. Journal of the Royal Statistical Society, 39(Ser B):1--38.
 
5
 
6
 
7
H. Maruyana and H. Watanabe. 1992. Tree cover search algorithm for example-based translation. In Proceedings of TMI'92, pages 173--184.
 
8
 
9
Franz Josef Och, Christoph Tillmann, and Herman Ney. 1999. Improved alignment models for statistical machine translation. In Proceedings of the EMNLP and VLC, pages 20--28, University of Maryland, Maryland.
 
10
 
11
Robert C. Sprung, editor. 2000. Translating Into Success: Cutting-Edge Strategies For Going Multilingual In A Global Age. John Benjamins Publishers.
 
12
Tony Veale and Andy Way. 1997. Gaijin: A template-based bootstrapping approach to example-based machine translation. In Proceedings of "New Methods in Natural Language Processing", Sofia, Bulgaria.
 
13
 
14
Ye-Yi Wang. 1998. Grammar Inference and Statistical Machine Translation. Ph.D. thesis, Carnegie Mellon University. Also available as CMU-LTI Technical Report 98--160.
 
15
 
16

CITED BY  7