Using the web for automated translation extraction in cross-language information retrieval
Source Annual ACM Conference on Research and Development in Information Retrieval
Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
Sheffield, United Kingdom
SESSION: Cross-language information retrieval
Pages: 162 - 169  
Year of Publication: 2004
ISBN:1-58113-881-4
Authors
Ying Zhang  RMIT University, Melbourne, Australia
Phil Vines  RMIT University, Melbourne, Australia
Sponsors
ACM: Association for Computing Machinery
SIGIR: ACM Special Interest Group on Information Retrieval
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 7,   Downloads (12 Months): 135,   Citation Count: 19
DOI: http://doi.acm.org/10.1145/1008992.1009022

ABSTRACT

There have been significant advances in Cross-Language Information Retrieval (CLIR) in recent years. One of the major remaining reasons that CLIR does not perform as well as monolingual retrieval is the presence of out-of-vocabulary (OOV) terms. Previous work has either relied on manual intervention or has been only partially successful in solving this problem. We use a method that extends earlier work in this area by augmenting it with statistical analysis and corpus-based translation disambiguation to dynamically discover translations of OOV terms. The method can be applied to both Chinese-English and English-Chinese CLIR, correctly extracting translations of OOV terms from the Web automatically, and is thus a significant improvement on earlier work.
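
The abstract does not spell out the extraction procedure, so the following is only a rough sketch of the general idea of mining candidate translations from mixed-language Web text. It is not the authors' method. It assumes that an English translation often appears shortly after a Chinese OOV term (for example in parentheses) and ranks English strings by how often they occur in that position. The function names, the 40-character window, and the sample snippets are all hypothetical and made up for illustration.

```python
import re
from collections import Counter

# Runs of ASCII letters, spaces and hyphens that start and end with a letter.
ENGLISH_RUN = re.compile(r"[A-Za-z][A-Za-z \-]*[A-Za-z]")


def extract_candidates(oov_term, snippets, window=40):
    """Count English strings found within `window` characters after the OOV term.

    Assumption: a translation tends to follow the term closely in mixed-language text.
    """
    counts = Counter()
    for text in snippets:
        start = 0
        while True:
            idx = text.find(oov_term, start)
            if idx == -1:
                break
            end = idx + len(oov_term)
            context = text[end:end + window]
            for match in ENGLISH_RUN.findall(context):
                counts[match.lower()] += 1
            start = end
    return counts


def best_translation(oov_term, snippets):
    """Return the most frequent candidate, or None if no candidate was found."""
    counts = extract_candidates(oov_term, snippets)
    return counts.most_common(1)[0][0] if counts else None


# Hypothetical snippets standing in for text returned by a Web search.
SNIPPETS = [
    "严重急性呼吸系统综合症 (Severe Acute Respiratory Syndrome) 是一种疾病",
    "关于严重急性呼吸系统综合症(Severe Acute Respiratory Syndrome)的报道",
    "严重急性呼吸系统综合症, 简称 SARS",
]

if __name__ == "__main__":
    print(best_translation("严重急性呼吸系统综合症", SNIPPETS))
```

Raw frequency is the simplest possible ranking signal. The statistical analysis and corpus-based disambiguation mentioned in the abstract would presumably replace or refine it, but the abstract gives no details, so they are not modelled here.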

