ACM Home Page
Please provide us with feedback. Feedback
A system for retrieving speech documents
Full text PdfPdf (942 KB)
Source Annual ACM Conference on Research and Development in Information Retrieval archive
Proceedings of the 15th annual international ACM SIGIR conference on Research and development in information retrieval table of contents
Copenhagen, Denmark
Pages: 168 - 176  
Year of Publication: 1992
ISBN:0-89791-523-2
Authors
Ulrike Glavitsch  Swiss Federal Institute of Technology, ETH, CH-8092 Zürich, Switzerland
Peter Schäuble  Swiss Federal Institute of Technology, ETH, CH-8092 Zürich, Switzerland
Sponsors
Royal School of Lib. : Royal School of Lib.
SIGIR: ACM Special Interest Group on Information Retrieval
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 6,   Downloads (12 Months): 40,   Citation Count: 11
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/133160.133194
What is a DOI?

ABSTRACT

An information retrieval model is presented for the retrieval of speech documents, i.e. audio recordings containing speech. The indexing vocabulary consists of indexing features that have the following characteristics. First, they are easy to recognize by speech recognition methods. Second, the number of different indexing features is small such that a reasonable amount of training data is sufficent to train the hidden Markov models that are used by the speech recognition process. Third, the retrieval method based on such indexing features achieves an acceptable retrieval effectiveness as shown by experiments on text collections. Fourth, these indexing features cannot only be identified in speech documents but also in text documents. From the last characteristic follows that speech documents and text documents can be retrieved simultaneously. Analogously, the queries may contain either speech or text. Thus, we have a simple multimedia retrieval model where two different medias are indexed coherently. We also describe a prototype retrieval system under development.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
2
de Heer, T. (1974). Experiments with Syntactic Traces in Information Retrieval. Information Storage gj Retrieval, 10,133-144.
 
3
Fox, E. A. (1990). Virginia Disc One. Virginia Polytechnic Institute and State University, Department of Computer Science.
4
 
5
 
6
 
7
8
 
9
Ladefoged, P. (1975). A Course in Phone~ics. Harcourt Brace Jovanovich, Inc., New York.
 
10
 
11
Lee, K. F. (September, 1989b). Hidden Markov Models: Past, Present, Future. In European Conference on Speech Communication and Technology, pp. 148- 155.
 
12
Linde, Y., Buzo, A., & Gray, R. M. (1980). An Algorithm for Vector Quantizer Design. IEEE Transactions on Communication, 28(1),84-95.
 
13
 
14
 
15
 
16
Porter, M. F. (1980). An Algorithm for Suffix Stripping. Program, 14(3),130-137.
 
17
Rabiner, L. R. (1989). Tutorial on Markov Models and Selected Applications in Speech Recognition. Proceedings of the IEEE, 77(2),257-286.
 
18
Robertson, S. E. (1977). The Probability Ranking Principle in IR. Journal of Documentation, 33(4),294- 304.
 
19
Rose, R. C., Chang, E. I., & Lippmann, R. (1991). Techniques for Information Retrieval from Voice Messages. In International Conference on Acoustics, Speech, and Signal Processing, pp. 317-320.
 
20
 
21
Salton, G., & Buckley, C. (1990). Improving Retrieval Performance by Relevance Feedback. Journal of the ASIS, 41(4),288-297.
 
22
 
23
Shore, J. (1988). Interactive Signal Processing with UNIX. Speech Technology, 4(2),70-79.
 
24
Teufel, B. (1989). Informationsspuren zum numerischen und graphischen Vergleich yon reduzierten natiirlichsprachlichen Texten. PhD thesis, Swiss Federal Institute of Technology. VdF-Verlag, Ziirich.
25
 
26
 
27
Wilcox, L. D., & Bush, M. A. (1991). HMM-Based Wordspotting for Voice Editing and Indexing. In European Conference on Speech Communication and Technology, pp. 25-28.
 
28
Willet, P. (1979). Document Retrieval Experiments Using Indexing Vocabularies of Varying Size. II Hashing, Truncation, Digram and Trigram Encoding of Indexing Terms. Journal of Documentation, 35(4),296-305.

CITED BY  11

Collaborative Colleagues:
Ulrike Glavitsch: colleagues
Peter Schäuble: colleagues