ACM Home Page
Please provide us with feedback. Feedback
Quantifying the utility of parallel corpora
Full text PdfPdf (90 KB)
Source Annual ACM Conference on Research and Development in Information Retrieval archive
Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval table of contents
New Orleans, Louisiana, United States
Pages: 398 - 399  
Year of Publication: 2001
ISBN:1-58113-331-6
Authors
Martin Franz  IBM T.J. Watson Research Center, Yorktown Heights, NY
J. Scott McCarley  IBM T.J. Watson Research Center, Yorktown Heights, NY
Todd Ward  IBM T.J. Watson Research Center, Yorktown Heights, NY
Wei-Jing Zhu  IBM T.J. Watson Research Center, Yorktown Heights, NY
Sponsor
SIGIR: ACM Special Interest Group on Information Retrieval
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 10,   Downloads (12 Months): 32,   Citation Count: 4
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/383952.384037
What is a DOI?

ABSTRACT

Our English-Chinese cross-language IR system is trained from parallel corpora; we investigate its performance as a function of training corpus size for three different training corpora. We find that the performance of the system as trained on the three parallel corpora can be related by a simple measure, namely the out-of-vocabulary rate of query words.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
D.Harman and E.Voorhees, ed. The 9th Text Retrieval Conference (TREC-9) and references therein.
 
2
 
3
 
4
S.E. Robertson, S. Walker, S. Jones, M.M. Hancock-Beaulieu, M. Gatford, "Okapi at TREC-3" in Proceedings of the Third Text REtrieval Conference (TREC-3) ed. by D.K. Harman. NIST Special Publication 500-225, 1995.
 
5


Collaborative Colleagues:
Martin Franz: colleagues
J. Scott McCarley: colleagues
Todd Ward: colleagues
Wei-Jing Zhu: colleagues