ACM Home Page
Please provide us with feedback. Feedback
Effects of out of vocabulary words in spoken document retrieval (poster session)
Full text PdfPdf (341 KB)
Source Annual ACM Conference on Research and Development in Information Retrieval archive
Proceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval table of contents
Athens, Greece
Pages: 372 - 374  
Year of Publication: 2000
ISBN:1-58113-226-3
Authors
P. C. Woodland  Cambridge University, Engineering Department, Trumpington Street, Cambridge CB2 1PZ, UK
S. E. Johnson  Cambridge University, Engineering Department, Trumpington Street, Cambridge CB2 1PZ, UK
P. Jourlin  Cambridge University, Computer Laboratory, Pembroke Street, Cambridge, CB2 3QG, UK
K. Spärck Jones  Cambridge University, Computer Laboratory, Pembroke Street, Cambridge, CB2 3QG, UK
Sponsors
Athens U of Econ & Business : Athens University of Economics and Business
Greek Com Soc : Greek Computer Society
SIGIR: ACM Special Interest Group on Information Retrieval
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 8,   Downloads (12 Months): 37,   Citation Count: 9
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/345508.345661
What is a DOI?

ABSTRACT

The effects of out-of-vocabulary (OOV) items in spoken document retrieval (SDR) are investigated. Several sets of transcriptions were created for the TREC-8 SDR task using a speech recognition system varying the vocabulary sizes and OOV rates, and the relative retrieval performance measured. The effects of OOV terms on a simple baseline IR system and on more sophisticated retrieval systems are described. The use of a parallel corpus for query and document expansion is found to be especially beneficial, and with this data set, good retrieval performance can be achieved even for fairly high OOV rates.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
J S Garofolo, C G P Auzane & E M Voorhees 1999 TREC-8 Spoken Document Retrieval Track: Overview, Results and Analyses To appear Proc. TREC-8, 2000
 
2
S E Johnson, P Jourlin, K Spi.rrck Jones & P C Woodland. Spoken Document Retrieval for TREC-8 at Cambridge University. To appear Proc. TREC-8, 2000
3
 
4
S E Robertson & K Spirck Jones Simple, Proven Approaches to Test Retrieval Technical Report TR356, Cambridge University Computer Laboratory, May. 1997

CITED BY  9

Collaborative Colleagues:
P. C. Woodland: colleagues
S. E. Johnson: colleagues
P. Jourlin: colleagues
K. Spärck Jones: colleagues