| Using the web for automated translation extraction in cross-language information retrieval |
| Full text |
Pdf
(454 KB)
|
| Source
|
Annual ACM Conference on Research and Development in Information Retrieval
archive
Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
table of contents
Sheffield, United Kingdom
SESSION: Cross-language information retrieval
table of contents
Pages: 162 - 169
Year of Publication: 2004
ISBN:1-58113-881-4
|
|
Authors
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 7, Downloads (12 Months): 135, Citation Count: 19
|
|
|
ABSTRACT
There have been significant advances in Cross-Language Information Retrieval (CLIR) in recent years. One of the major remaining reasons that CLIR does not perform as well as monolingual retrieval is the presence of out of vocabulary (OOV) terms. Previous work has either relied on manual intervention or has only been partially successful in solving this problem. We use a method that extends earlier work in this area by augmenting this with statistical analysis, and corpus-based translation disambiguation to dynamically discover translations of OOV terms. The method can be applied to both Chinese-English and English-Chinese CLIR, correctly extracting translations of OOV terms from the Web automatically, and thus is a significant improvement on earlier work.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
|
| |
2
|
|
 |
3
|
|
| |
4
|
A. Chen and F. Gey. Experiments on Cross-language and Patent Retrieval at NTCIR-3 Workshop. In Proceedings of the 3rd NTCIR Workshop Japan, 2003.
|
 |
5
|
|
 |
6
|
|
 |
7
|
Jianfeng Gao , Ming Zhou , Jian-Yun Nie , Hongzhao He , Weijun Chen, Resolving query translation ambiguity using a decaying co-occurrence model and syntactic dependence relations, Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval, August 11-15, 2002, Tampere, Finland
[doi> 10.1145/564376.564409]
|
| |
8
|
|
| |
9
|
S. Kuai, Q. Lu, and L. Yang, editors. A New English-Chinese Dictionary Joint Publishing Company, HongKong, 1st edition, 1984.
|
 |
10
|
|
| |
11
|
H. Meng, B. Chen, S. Khudanpur, G. Levow, W. Lo, D. Oard, P. Schone, K. Tang, H. Wang, and J. Wang. Mandarin-English Information (MEI): investigating translingual speech retrieval. In Computer Speech and Language 2003.
|
| |
12
|
|
| |
13
|
M. F. Porter. An Algorithm for Suffix Stripping. Program (Automated Library and Information Systems) 14(3):130--137, 1980.
|
| |
14
|
|
| |
15
|
Y. Zhang and P. Vinese. Improved Cross-Language Information Retrieval via Disambiguation and Vocabulary Discovery. In Proceedings of the 8th Australasian Document Computing Symposium pages 3--7,2003.
|
CITED BY 19
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Fei Huang , Ying Zhang , Stephan Vogel, Mining key phrase translations from web corpora, Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing, p.483-490, October 06-08, 2005, Vancouver, British Columbia, Canada
|
|
|
Qing Li , Sung-Hyon Myaeng , Yun Jin , Bo-yeong Kang, Concept unification of terms in different languages for IR, Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the ACL, p.641-648, July 17-18, 2006, Sydney, Australia
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Wei Gao , Cheng Niu , Jian-Yun Nie , Ming Zhou , Jian Hu , Kam-Fai Wong , Hsiao-Wuen Hon, Cross-lingual query suggestion using query logs of different languages, Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval, July 23-27, 2007, Amsterdam, The Netherlands
|
|
|
|
|
|
|
|
|
Qing Li , Yuanzhu Peter Chen , Sung-Hyon Myaeng , Yun Jin , Bo-Yeong Kang, Concept unification of terms in different languages via web mining for Information Retrieval, Information Processing and Management: an International Journal, v.45 n.2, p.246-262, March, 2009
|
|