| Exploiting a Chinese-English bilingual wordlist for English-Chinese cross language information retrieval |
| Full text |
Pdf
(589 KB)
|
| Source
|
International Workshop on Information Retrieval with Asia Languages
archive
Proceedings of the fifth international workshop on on Information retrieval with Asian languages
table of contents
Hong Kong, China
Pages: 173 - 179
Year of Publication: 2000
ISBN:1-58113-300-6
|
|
Author
|
|
K. L. Kwok
|
Computer Science Department, Queens College, City University of New York, Flushing, NY
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 15, Downloads (12 Months): 41, Citation Count: 6
|
|
|
ABSTRACT
We investigated using the LDC English/Chinese bilingual wordlists for English-Chinese cross language retrieval. It is shown that the Chinese-to-English wordlist can be considered as both a phrase and word dictionary, and is preferable to the English-to-Chinese version in terms of phrase translations and word translation selection. Additional techniques such as target corpus frequency-based term selection and weighting were employed. Experiments show that over 70% of monolingual effectiveness is achievable for the TREC Chinese corpus and retrieval environment with short queries of a few English words.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
Oard, D & Diekema, A. Cross-language information retrieval. In: Annual Review of Information Science and Technology, Vol. 33. 1998, pp.223-256.
|
| |
3
|
Schauble, P and Sheridan, P. Cross-language information retrieval (CLIR) track overview. In: Information Technology: The Sixth Text REtrieval Conference (TREC-6). Eds. E.M. Voorhees & D.K. Harman, NIST SP 500-240. 1998, pp.31-44.
|
| |
4
|
Kwok, ICL. English-Chinese cross language retrieval based on a translation package. In: MT Summit VII Workshop: Machine Translation for Cross Language lRl. 1999, pp.8-13.
|
| |
5
|
Voorhees, E.M. &Harman, D.K (1998). Overview of the Sixth Text REtrieval Conference (TREC-6). In: Information Technology: The Sixth Text REtrieval Conference (TREC-6). Eds. E.M.Voorhees & D.K. Harman, NIST SP 500-240. 1998, pp.1-24.
|
| |
6
|
Wu, DeKai & Xia, X.Y. Large-scale automatic extraction of an English-Chinese translation lexicon. Machine Translation, 1995, 9:3-4, pp.285-313.
|
 |
7
|
Jian-Yun Nie , Michel Simard , Pierre Isabelle , Richard Durand, Cross-language information retrieval based on parallel texts and automatic mining of parallel texts from the Web, Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval, p.74-81, August 15-19, 1999, Berkeley, California, United States
[doi> 10.1145/312624.312656]
|
| |
8
|
Davies, M. New experirnents in cross-language text retrieval at NMSU's Computing Research Lab. In: Information Technology: The Fifth Text REtrieval Conference (TREC-5). Eds. E.M. Voorhees & D.K. Harman, N1ST SP 500-238. 1997, pp.447-453.
|
| |
9
|
|
 |
10
|
|
 |
11
|
|
 |
12
|
|
| |
13
|
|
CITED BY 6
|
|
Jianfeng Gao , Jian-Yun Nie , Endong Xun , Jian Zhang , Ming Zhou , Changning Huang, Improving query translation for cross-language information retrieval using statistical models, Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval, p.96-104, September 2001, New Orleans, Louisiana, United States
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|