|
ABSTRACT
An information retrieval model is presented for the retrieval of speech documents, i.e. audio recordings containing speech. The indexing vocabulary consists of indexing features that have the following characteristics. First, they are easy to recognize by speech recognition methods. Second, the number of different indexing features is small such that a reasonable amount of training data is sufficent to train the hidden Markov models that are used by the speech recognition process. Third, the retrieval method based on such indexing features achieves an acceptable retrieval effectiveness as shown by experiments on text collections. Fourth, these indexing features cannot only be identified in speech documents but also in text documents. From the last characteristic follows that speech documents and text documents can be retrieved simultaneously. Analogously, the queries may contain either speech or text. Thus, we have a simple multimedia retrieval model where two different medias are indexed coherently. We also describe a prototype retrieval system under development.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
|
| |
2
|
de Heer, T. (1974). Experiments with Syntactic Traces in Information Retrieval. Information Storage gj Retrieval, 10,133-144.
|
| |
3
|
Fox, E. A. (1990). Virginia Disc One. Virginia Polytechnic Institute and State University, Department of Computer Science.
|
 |
4
|
|
| |
5
|
|
| |
6
|
|
| |
7
|
|
 |
8
|
|
| |
9
|
Ladefoged, P. (1975). A Course in Phone~ics. Harcourt Brace Jovanovich, Inc., New York.
|
| |
10
|
|
| |
11
|
Lee, K. F. (September, 1989b). Hidden Markov Models: Past, Present, Future. In European Conference on Speech Communication and Technology, pp. 148- 155.
|
| |
12
|
Linde, Y., Buzo, A., & Gray, R. M. (1980). An Algorithm for Vector Quantizer Design. IEEE Transactions on Communication, 28(1),84-95.
|
| |
13
|
|
| |
14
|
|
| |
15
|
|
| |
16
|
Porter, M. F. (1980). An Algorithm for Suffix Stripping. Program, 14(3),130-137.
|
| |
17
|
Rabiner, L. R. (1989). Tutorial on Markov Models and Selected Applications in Speech Recognition. Proceedings of the IEEE, 77(2),257-286.
|
| |
18
|
Robertson, S. E. (1977). The Probability Ranking Principle in IR. Journal of Documentation, 33(4),294- 304.
|
| |
19
|
Rose, R. C., Chang, E. I., & Lippmann, R. (1991). Techniques for Information Retrieval from Voice Messages. In International Conference on Acoustics, Speech, and Signal Processing, pp. 317-320.
|
| |
20
|
|
| |
21
|
Salton, G., & Buckley, C. (1990). Improving Retrieval Performance by Relevance Feedback. Journal of the ASIS, 41(4),288-297.
|
| |
22
|
|
| |
23
|
Shore, J. (1988). Interactive Signal Processing with UNIX. Speech Technology, 4(2),70-79.
|
| |
24
|
Teufel, B. (1989). Informationsspuren zum numerischen und graphischen Vergleich yon reduzierten natiirlichsprachlichen Texten. PhD thesis, Swiss Federal Institute of Technology. VdF-Verlag, Ziirich.
|
 |
25
|
|
| |
26
|
|
| |
27
|
Wilcox, L. D., & Bush, M. A. (1991). HMM-Based Wordspotting for Voice Editing and Indexing. In European Conference on Speech Communication and Technology, pp. 25-28.
|
| |
28
|
Willet, P. (1979). Document Retrieval Experiments Using Indexing Vocabularies of Varying Size. II Hashing, Truncation, Digram and Trigram Encoding of Indexing Terms. Journal of Documentation, 35(4),296-305.
|
|