ACM Home Page
Please provide us with feedback. Feedback
SpeechSkimmer: interactively skimming recorded speech
Full text PdfPdf (1.13 MB)
Source Symposium on User Interface Software and Technology archive
Proceedings of the 6th annual ACM symposium on User interface software and technology table of contents
Atlanta, Georgia, United States
Pages: 187 - 196  
Year of Publication: 1993
ISBN:0-89791-628-X
Author
Barry Arons  Speech Research Group, MIT Media Laboratory, 20 Ames Street, Cambridge, MA
Sponsors
SIGGRAPH: ACM Special Interest Group on Computer Graphics and Interactive Techniques
SIGCHI: ACM Special Interest Group on Computer-Human Interaction
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 5,   Downloads (12 Months): 46,   Citation Count: 18
Additional Information:

references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/168642.168661
What is a DOI?

REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Aaronson, D., Markowitz, N., and Shapiro, H. Perception and Immediate Recall of Normal and Compressed Auditory Sequences. Perception and Psychophysics 9, 4 (1971), 338-344.
2
 
3
Arons, B. Techniques, Perception, and Applications of Time-Compressed Speech. In Proceedings of 1992 Conference, American Voice i/O Society, Sep. 1992, pp. 169-177.
4
 
5
Beasley, D.S. and Maki, J.E. Time- and Frequency- Altered Speech. In Contemporary Issues in Experimental Phonetics. Academic Press, Lass, N.J., editor, Ch. 12, pp. 419--458, 1976.
 
6
Buxton, W., Gaver, B., and Bly, S., The Use of Non- Speech Audio at the Interface, ACM SIGCHI, 199 I, Tutorial Notes.
 
7
Chen, F.R. and Withgott, M. The Use of Emphasis to Automatically Summarize Spoken Discourse. In Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, IEEE, 1992, pp. 229-233.
 
8
De Souza, P. A Statistical Approach to the Design of an Adaptive Self-Normalizing Silence Detector. IEEE Transactions on Acoustics, Speech, and Signal Processing ASSP-31, 3 (Jun. 1983), 678-684.
9
 
10
Fairbanks, G., Everitt, W.L., and Jaeger, R.P. Method for Time or Frequency Compression- Expansion of Speech. Transaction of the Institute of Radio Engineers, Professional Group on Audio A U-2 (1954), 7-12, Reprinted in G. Fairbanks. Experimental Phonetics: Selected Articles, University of Illinois Press, 1966.
 
11
Foulke, E. The Perception of Time Compressed Speech. In Perception of Language. Chm'les E. Merrill Publishing Company, Kjeldergaard, P.M., Horton, D.L., and Jenkins, J.J., editors, Ch. 4, pp. 79-107, 1971.
12
 
13
Gaver, W.W. Auditory Icons: Using Sound in Computer Interfaces. Human-Computer Interaction 2 (1989), 167-177.
 
14
Gerber, S.E. and Wulfeck, B.H. The Limiting Effect of Discard Interval on Time-Compressed Speech. Language and Speech 20, 2 (1977), 108-115.
15
 
16
Gruber, J.G. A Comparison of Measured and Calculated Speech Temporal Parameters Relevant to Speech Activity Detection. iEEE Transactions on Communications COM-30, 4 (Apr. 1982), 728-738.
 
17
Gruber, J.G. and Le, N.H. Performance Requirements for Integrated Voice/Data Networks. IEEE Journal on Selected Areas in Communications SAC-i, 6 (Dec. 1983), 981-1005.
18
 
19
Heiman, G.W., Leo, R.J., Leighbody, G., and Bowler, K. Word Intelligibility Decrements and the Comprehension of Time-Compressed Speech. Perception and Psychophysics 40, 6 (1986), 407- 411.
 
20
Hejna Jr., D.J. Real-Time Time-Scale Modification of Speech via the Synchronized Overlap-Add Algorithm, Master's thesis, Department of Electrical Engineering and Computer Science, MIT, Feb. 1990.
 
21
Houle, G.R., Maksymowicz, A.T., and Penafiel, H.M. Back-End Processing for Automatic Gisting Systems. In Proceedings of 1988 Conference, American Voice I/O Society, 1988.
22
 
23
Lamel, L.F., Rabiner, L.R., Rosenberg, A.E., and Wilpon, J.G. An Improved Endpoint Detec~tor for Isolated Word Recognition. IEEE Transactions on Acoustics, Speech, and Signal Processing ASSP-29, 4 (Aug. 1981), 777-785.
 
24
Lass, N.J. and Leeper, H.A. Listening Rate Preference: Comparison of Two Time Alteration Techniques. Perceptual and Motor Skills 44 (1977), 1163-1168.
 
25
Lee, H.H. and Un, C.K. A Study of on-off Characteristics of Conversational Speech. IEEE Transactions on Communications COM-34, 6 (Jun. 1986), 630-637.
 
26
Levelt, W.J.M. Speaking: From Intention to Articulation, MIT Press (1989).
 
27
Lynch Jr., J.F., Josenhans, J.G., and Crochiere, R.E. Speech/Silence Segmentation for Real-Time Coding via Rule Based Adaptive Endpoint Detection. In Proceedings of the international Conference on Acoustics, Speech, and Signal Processing, IEEE, 1987, pp. 1348-1351.
28
 
29
UnMouse User's Manual, Microtouch Systems Inc., Wilmington, MA.
30
 
31
Minifie, F.D. Durational Aspects of Connected Speech Samples. In Time-Compressed Speech. Scarecrow, Duker, S., editor, pp. 709-715, 1974.
 
32
Neuburg, E.P. Simple Pitch-Dependent Algorithm for High Quality Speech Rate Changing. Journal of the Acoustic Society of America 63, 2 (1978), 624-625.
 
33
O'Shaughnessy, D. Speech Communication: Human and Machine, Addison-Wesley (1987).
 
34
O'Shaughnessy, D. Recognition of Hesitations in Spontaneous Speech. In Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, iEEE, 1992, pp. 1521-1524.
 
35
Rabiner, L.R. and Sambur, M.R. An Algorithm for Determining the Endpoints of Isolated Utterances. The Bell System Technical Journal 54, 2 (Feb. 1975), 297-315.
 
36
Reich, S.S. Significance of Pauses for Speech Perception. Journal of Psycholinguistic Research 9, 4 (1980), 379-389.
37
 
38
 
39
Roucos, S. and Wilgus, A.M. High Quality Time- Scale Modification for Speech. In Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, iEEE, 1985, pp. 493-496.
 
40
 
41
Schmandt, C. and Arons, B. A Conversational Telephone Messaging System. IEEE Transactions on Consumer Electronics CE-30, 3 (Aug. 1984), xxixxiv.
 
42
Scott, R.J. Time Adjustment in Speech Synthesis. Journal of the Acoustic Society of America 41, 1 (1967), 60-65.
 
43
 
44
Wightman, C.W. and Ostendorf, M. Automatic Recognition of Intonational Features. In Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, IEEE, 1992, pp. 1221-1224.
45

CITED BY  18