ACM Home Page
Please provide us with feedback. Feedback
How effective is Google's translation service in search?
Full text HtmlHtml (35 KB),  PdfPdf (108 KB)
Source
Communications of the ACM archive
Volume 52 ,  Issue 10  (October 2009) table of contents
A View of Parallel Computing
SECTION: Virtual extension table of contents
Pages 139-143  
Year of Publication: 2009
ISSN:0001-0782
Authors
Jacques Savoy  University of Neuchatel, Switzerland
Ljiljana Dolamic  University of Neuchatel, Switzerland
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 138,   Downloads (12 Months): 138,   Citation Count: 0
Additional Information:

abstract   references   index terms  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1562764.1562799
What is a DOI?

ABSTRACT

Introduction

In multilingual countries (Canada, Hong Kong, India, among others) and large international organizations or companies (such as, WTO, European Parliament), and among Web users in general, accessing information written in other languages has become a real need (news, hotel or airline reservations, or government information, statistics). While some users are bilingual, others can read documents written in another language but cannot formulate a query to search it, or at least cannot provide reliable search terms in a form comparable to those found in the documents being searched. There are also many monolingual users who may want to retrieve documents in another language and then have them translated into their own language, either manually or automatically.

Translation services may however be too expensive, not readily accessible or not available within a short timeframe. On the other hand, many documents contain non-textual information such as images, videos and statistics that do not need translation and can be understood regardless of the language involved. In response to these needs and in order to make the Web universally available regardless of any language barriers, in May 2007 Google launched a translation service that now provides two-way online translation services mainly between English and 41 other languages, for example, Arabic, simplified and traditional Chinese, French, German, Italian, Japanese, Korean, Portuguese, Russian, and Spanish (http://translate.google.com/). Over the last few years other free Internet translation services have been made available as for example by BabelFish (http://babel.altavista.com/) or Yahoo! (http://babelfish.yahoo.com/). These two systems are similar to that used by Google, given they are based on technology developed by Systran, one of the earliest companies to develop machine translation. Also worth mentioning here is the Promt system (also known as Reverso, http://translation2.paralink.com/), which was developed in Russia to provide mainly translation between Russian and other languages.

The question we would like to address here is to what extent a translation service such as Google can produce adequate results in the language other than that being used to write the query. Although we will not evaluate translations per se we will test and analyze various systems in terms of their ability to retrieve items automatically based on a translated query. To be adequate, these tests must be done on a collection of documents written in one given language plus a series of topics (expressing user information needs) written in other languages, plus a series of relevance assessments (relevant documents for each topic).


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Baeza-Yates, R., and Ribeiro-Neto, B. Modern Information Retrieval. ACM Press, 1999.
 
2
Buckley, C., and Voorhees, E. Retrieval system evaluation. In TREC, Experiment and Evaluation in Information Retrieval. MIT Press, Cambridge, MA, 2005, 53--75.
 
3
Harman, D.K. The TREC ad hoc experiments. TREC, Experiment and Evaluation in Information Retrieval. MIT Press, 2005, 79--97.
 
4
Hiemstra, D. Term-specific smoothing for the language modeling approach to information retrieval. In Proceedings of the ACM-SIGIR. ACM Press, 2002, 35--41.
 
5
Noriko, K., (Ed) NTCIR Workshop 6 Meeting. National Institute of Informatics, Tokyo, 2007.
 
6
Peters, C., Clough, P., Gey, F.C., Karlgren, J., Magnini, B., Oard, D.W., de Rijke, M., and Stempfhuber, M. (Eds) Evaluation of multilingual and multi-modal information retrieval. Lecture Notes in Computer Science #4730. Spinger-Verlag, Berlin, 2007.
 
7
Robertson, S.E., Walker, S., and Beaulieu, M. Experimentation as a way of life: Okapi at TREC. Information Processing & Management 36, 1, (2000), 95--108.
 
8
Savoy, J. Statistical inference in retrieval effectiveness evaluation. Information Processing & Management 33, 4, (1997), 495--512.