ACM Home Page
Please provide us with feedback. Feedback
Cross-lingual search over 22 european languages
Full text PdfPdf (183 KB)
Source
Annual ACM Conference on Research and Development in Information Retrieval archive
Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval table of contents
Singapore, Singapore
DEMONSTRATION SESSION: Demonstrations table of contents
Pages 883-883  
Year of Publication: 2008
ISBN:978-1-60558-164-4
Authors
Blaž Fortuna  Institute Jozef Stefan, Ljubljana, Slovenia
Jan Rupnik  Institute Jozef Stefan, Ljubljana, Slovenia
Boštjan Pajntar  Institute Jozef Stefan, Ljubljana, Slovenia
Marko Grobelnik  Institute Jozef Stefan, Ljubljana, Slovenia
Dunja Mladenič  Institute Jozef Stefan, Ljubljana, Slovenia
Sponsors
ACM: Association for Computing Machinery
SIGIR: ACM Special Interest Group on Information Retrieval
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 12,   Downloads (12 Months): 104,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1390334.1390557
What is a DOI?

ABSTRACT

In this paper we present a system for cross-lingual information retrieval, which can handle tens of languages and millions of documents. Functioning of the system is demonstrated on corpus of European Legislation (22 languages, more than 400,000 documents per language). The system uses an interactive web-interface, which can take advantage of a predefined thesaurus allowing the user to dynamically re-rank the retrieval results based on the mapping onto a predefined thesaurus.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Steinberger, R., Pouliquen, B., Widiger, A., Ignat, C., Erjavec, T., Tufiş, D., Varga, D. 2006. The JRC-Acquis: A multilingual aligned parallel corpus with 20+ languages. Proceedings of the 5th International Conference on Language Resources and Evaluation (LREC'2006). Genoa, Italy, 24-26 May 2006.
 
2
Fortuna, B., Cristianini, N., Shawe-Taylor, J. 2006. A Kernel Canonical Correlation Analysis For Learning The Semantics Of Text. Kernel methods in bioengineering, communications and image processing, edited by G. Camps-Valls, J. L. Rojo-Álvarez & M. Martíínez-Ramón.
 
3
Pajntar, B., Grobelnik, M., http://searchpoint.ijs.si/

Collaborative Colleagues:
Blaž Fortuna: colleagues
Jan Rupnik: colleagues
Boštjan Pajntar: colleagues
Marko Grobelnik: colleagues
Dunja Mladenič: colleagues