ABSTRACT
This article presents an approach to automatically extracting translations of Web query terms by mining Web anchor texts and link structures. A persistent difficulty in cross-language information retrieval (CLIR) and Web search is the lack of appropriate translations for new terminology and proper names. The proposed approach successfully exploits anchor-text resources to reduce this difficulty in query term translation. As a result, translations can be extracted for many query terms that are not found in general-purpose translation dictionaries.