Management of keyword variation with frequency based generation of word forms in IR
Source
Annual ACM Conference on Research and Development in Information Retrieval
Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
Amsterdam, The Netherlands
POSTER SESSION: Posters
Pages: 691-692
Year of Publication: 2007
ISBN: 978-1-59593-597-7
ABSTRACT
This paper presents a new method for managing morphological variation of keywords, called Frequent Case Generation (FCG). It is based on the skewed distributions of word forms in natural languages and is suitable for languages that either have a fair amount of morphological variation or are morphologically very rich. The method has so far been evaluated with four languages, Finnish, Swedish, German and Russian, which show varying degrees of morphological complexity.
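As a rough illustration of the idea in the abstract, the sketch below selects the most frequent case forms of a noun until a target share of occurrences is covered, then uses only those forms as query variants. The function names, the 80 % coverage threshold, and the case counts for the Finnish noun "talo" (house) are all invented for illustration. They are not taken from the paper, and the paper's actual procedure may differ.

```python
from collections import Counter

# Hypothetical corpus counts per grammatical case (illustrative numbers only).
CASE_COUNTS = {
    "nominative": 30, "genitive": 26, "partitive": 15, "inessive": 10,
    "illative": 6, "elative": 5, "adessive": 4, "allative": 2,
    "essive": 1, "translative": 1,
}

# Singular case forms of the Finnish noun "talo".
FORMS = {
    "nominative": "talo", "genitive": "talon", "partitive": "taloa",
    "inessive": "talossa", "illative": "taloon", "elative": "talosta",
    "adessive": "talolla", "allative": "talolle",
    "essive": "talona", "translative": "taloksi",
}


def frequent_cases(case_counts, coverage=0.8):
    """Return the most frequent cases, in descending order of count,
    stopping as soon as their cumulative share reaches `coverage`."""
    total = sum(case_counts.values())
    selected, covered = [], 0
    for case, count in Counter(case_counts).most_common():
        if covered >= coverage * total:
            break
        selected.append(case)
        covered += count
    return selected


def expand_keyword(forms_by_case, cases):
    """Generate only the word forms that belong to the selected cases."""
    return [forms_by_case[c] for c in cases if c in forms_by_case]


cases = frequent_cases(CASE_COUNTS, coverage=0.8)
print(expand_keyword(FORMS, cases))  # ['talo', 'talon', 'taloa', 'talossa']
```

With a skewed distribution like the one assumed here, four of the ten forms already account for about 80 % of the occurrences, so the query needs far fewer variants than full generation of every inflected form would produce.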