Translating unknown queries with web corpora for cross-language information retrieval
Source
Annual ACM Conference on Research and Development in Information Retrieval
Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
Sheffield, United Kingdom
SESSION: Cross-language information retrieval
Pages: 146 - 153
Year of Publication: 2004
ISBN: 1-58113-881-4
ABSTRACT
It is crucial for cross-language information retrieval (CLIR) systems to handle the translation of unknown queries, because real queries are often short. The purpose of this paper is to investigate the feasibility of exploiting the Web as the corpus source to translate unknown queries for CLIR. We propose an online translation approach that determines effective translations for unknown query terms by mining bilingual search-result pages obtained from Web search engines. This approach can alleviate the lack of large bilingual corpora, translate many unknown query terms, provide flexible query specifications, and extract semantically close translations to benefit CLIR tasks, especially cross-language Web search.
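The idea of mining bilingual search-result pages can be illustrated with a minimal sketch. This is not the paper's algorithm: it assumes the target language is Chinese, that the search-result snippets have already been fetched as plain strings, and that raw co-occurrence counts stand in for whatever scoring the authors use. The function name `candidate_translations`, the `max_len` parameter, and the sample snippets are all hypothetical.

```python
import re
from collections import Counter

# Assumption: the target language is Chinese, so candidates are runs of CJK ideographs.
CJK_RUN = re.compile(r"[\u4e00-\u9fff]+")


def candidate_translations(term, snippets, max_len=4):
    """Rank target-language n-grams by the number of snippets in which
    they co-occur with the source-language term."""
    counts = Counter()
    for snippet in snippets:
        # Only snippets that mention the source term are treated as evidence.
        if term.lower() not in snippet.lower():
            continue
        seen = set()
        for run in CJK_RUN.findall(snippet):
            # Enumerate every character n-gram of length 2..max_len in the run.
            for n in range(2, max_len + 1):
                for i in range(len(run) - n + 1):
                    seen.add(run[i:i + n])
        # Count each candidate at most once per snippet.
        counts.update(seen)
    return counts.most_common()


if __name__ == "__main__":
    # Made-up snippets standing in for bilingual search results.
    snippets = ["雅虎 Yahoo 搜尋", "Yahoo 雅虎 新聞", "Google 谷歌"]
    print(candidate_translations("Yahoo", snippets)[:3])
```

Counting each candidate once per snippet keeps a single repetitive page from dominating the ranking. A real system would need a stronger association measure and a way to decide where a candidate term begins and ends.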
CITED BY 21
Fei Huang, Ying Zhang, Stephan Vogel, Mining key phrase translations from web corpora, Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing, p.483-490, October 06-08, 2005, Vancouver, British Columbia, Canada
Qing Li, Sung-Hyon Myaeng, Yun Jin, Bo-yeong Kang, Concept unification of terms in different languages for IR, Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the ACL, p.641-648, July 17-18, 2006, Sydney, Australia
Pu-Jen Cheng, Yi-Cheng Pan, Wen-Hsiang Lu, Lee-Feng Chien, Creating multilingual translation lexicons with regional variations using web corpora, Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics, p.534-es, July 21-26, 2004, Barcelona, Spain
Wei Gao, Cheng Niu, Jian-Yun Nie, Ming Zhou, Jian Hu, Kam-Fai Wong, Hsiao-Wuen Hon, Cross-lingual query suggestion using query logs of different languages, Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval, July 23-27, 2007, Amsterdam, The Netherlands
Qing Li, Yuanzhu Peter Chen, Sung-Hyon Myaeng, Yun Jin, Bo-Yeong Kang, Concept unification of terms in different languages via web mining for Information Retrieval, Information Processing and Management: an International Journal, v.45 n.2, p.246-262, March, 2009