|
REFERENCES
| |
1
|
Aaronson, D., Markowitz, N., and Shapiro, H. Perception and Immediate Recall of Normal and Compressed Auditory Sequences. Perception and Psychophysics 9, 4 (1971), 338-344.
|
 |
2
|
|
| |
3
|
Arons, B. Techniques, Perception, and Applications of Time-Compressed Speech. In Proceedings of 1992 Conference, American Voice I/O Society, Sep. 1992, pp. 169-177.
|
 |
4
|
|
| |
5
|
Beasley, D.S. and Maki, J.E. Time- and Frequency-Altered Speech. In Contemporary Issues in Experimental Phonetics. Academic Press, Lass, N.J., editor, Ch. 12, pp. 419-458, 1976.
|
| |
6
|
Buxton, W., Gaver, B., and Bly, S. The Use of Non-Speech Audio at the Interface. ACM SIGCHI, 1991, Tutorial Notes.
|
| |
7
|
Chen, F.R. and Withgott, M. The Use of Emphasis to Automatically Summarize Spoken Discourse. In Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, IEEE, 1992, pp. 229-233.
|
| |
8
|
De Souza, P. A Statistical Approach to the Design of an Adaptive Self-Normalizing Silence Detector. IEEE Transactions on Acoustics, Speech, and Signal Processing ASSP-31, 3 (Jun. 1983), 678-684.
|
 |
9
|
Leo Degen, Richard Mander, Gitta Salomon, Working with audio: integrating personal tape recorders and desktop computers, Proceedings of the SIGCHI conference on Human factors in computing systems, pp. 413-418, May 03-07, 1992, Monterey, California, United States
doi: 10.1145/142750.142877
|
| |
10
|
Fairbanks, G., Everitt, W.L., and Jaeger, R.P. Method for Time or Frequency Compression-Expansion of Speech. Transactions of the Institute of Radio Engineers, Professional Group on Audio AU-2 (1954), 7-12. Reprinted in G. Fairbanks, Experimental Phonetics: Selected Articles, University of Illinois Press, 1966.
|
| |
11
|
Foulke, E. The Perception of Time Compressed Speech. In Perception of Language. Charles E. Merrill Publishing Company, Kjeldergaard, P.M., Horton, D.L., and Jenkins, J.J., editors, Ch. 4, pp. 79-107, 1971.
|
 |
12
|
|
| |
13
|
Gaver, W.W. Auditory Icons: Using Sound in Computer Interfaces. Human-Computer Interaction 2 (1989), 167-177.
|
| |
14
|
Gerber, S.E. and Wulfeck, B.H. The Limiting Effect of Discard Interval on Time-Compressed Speech. Language and Speech 20, 2 (1977), 108-115.
|
 |
15
|
|
| |
16
|
Gruber, J.G. A Comparison of Measured and Calculated Speech Temporal Parameters Relevant to Speech Activity Detection. IEEE Transactions on Communications COM-30, 4 (Apr. 1982), 728-738.
|
| |
17
|
Gruber, J.G. and Le, N.H. Performance Requirements for Integrated Voice/Data Networks. IEEE Journal on Selected Areas in Communications SAC-1, 6 (Dec. 1983), 981-1005.
|
 |
18
|
|
| |
19
|
Heiman, G.W., Leo, R.J., Leighbody, G., and Bowler, K. Word Intelligibility Decrements and the Comprehension of Time-Compressed Speech. Perception and Psychophysics 40, 6 (1986), 407-411.
|
| |
20
|
Hejna Jr., D.J. Real-Time Time-Scale Modification of Speech via the Synchronized Overlap-Add Algorithm, Master's thesis, Department of Electrical Engineering and Computer Science, MIT, Feb. 1990.
|
| |
21
|
Houle, G.R., Maksymowicz, A.T., and Penafiel, H.M. Back-End Processing for Automatic Gisting Systems. In Proceedings of 1988 Conference, American Voice I/O Society, 1988.
|
 |
22
|
Robin Jeffries, James R. Miller, Cathleen Wharton, Kathy Uyeda, User interface evaluation in the real world: a comparison of four techniques, Proceedings of the SIGCHI conference on Human factors in computing systems: Reaching through technology, pp. 119-124, April 27-May 02, 1991, New Orleans, Louisiana, United States
doi: 10.1145/108844.108862
|
| |
23
|
Lamel, L.F., Rabiner, L.R., Rosenberg, A.E., and Wilpon, J.G. An Improved Endpoint Detector for Isolated Word Recognition. IEEE Transactions on Acoustics, Speech, and Signal Processing ASSP-29, 4 (Aug. 1981), 777-785.
|
| |
24
|
Lass, N.J. and Leeper, H.A. Listening Rate Preference: Comparison of Two Time Alteration Techniques. Perceptual and Motor Skills 44 (1977), 1163-1168.
|
| |
25
|
Lee, H.H. and Un, C.K. A Study of on-off Characteristics of Conversational Speech. IEEE Transactions on Communications COM-34, 6 (Jun. 1986), 630-637.
|
| |
26
|
Levelt, W.J.M. Speaking: From Intention to Articulation, MIT Press (1989).
|
| |
27
|
Lynch Jr., J.F., Josenhans, J.G., and Crochiere, R.E. Speech/Silence Segmentation for Real-Time Coding via Rule Based Adaptive Endpoint Detection. In Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, IEEE, 1987, pp. 1348-1351.
|
 |
28
|
Jock D. Mackinlay, George G. Robertson, Stuart K. Card, The perspective wall: detail and context smoothly integrated, Proceedings of the SIGCHI conference on Human factors in computing systems: Reaching through technology, pp. 173-176, April 27-May 02, 1991, New Orleans, Louisiana, United States
doi: 10.1145/108844.108870
|
| |
29
|
UnMouse User's Manual, Microtouch Systems Inc., Wilmington, MA.
|
 |
30
|
Michael Mills, Jonathan Cohen, Yin Yin Wong, A magnifier tool for video data, Proceedings of the SIGCHI conference on Human factors in computing systems, pp. 93-98, May 03-07, 1992, Monterey, California, United States
doi: 10.1145/142750.142764
|
| |
31
|
Minifie, F.D. Durational Aspects of Connected Speech Samples. In Time-Compressed Speech. Scarecrow, Duker, S., editor, pp. 709-715, 1974.
|
| |
32
|
Neuburg, E.P. Simple Pitch-Dependent Algorithm for High Quality Speech Rate Changing. Journal of the Acoustical Society of America 63, 2 (1978), 624-625.
|
| |
33
|
O'Shaughnessy, D. Speech Communication: Human and Machine, Addison-Wesley (1987).
|
| |
34
|
O'Shaughnessy, D. Recognition of Hesitations in Spontaneous Speech. In Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, IEEE, 1992, pp. 1521-1524.
|
| |
35
|
Rabiner, L.R. and Sambur, M.R. An Algorithm for Determining the Endpoints of Isolated Utterances. The Bell System Technical Journal 54, 2 (Feb. 1975), 297-315.
|
| |
36
|
Reich, S.S. Significance of Pauses for Speech Perception. Journal of Psycholinguistic Research 9, 4 (1980), 379-389.
|
 |
37
|
|
| |
38
|
|
| |
39
|
Roucos, S. and Wilgus, A.M. High Quality Time-Scale Modification for Speech. In Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, IEEE, 1985, pp. 493-496.
|
| |
40
|
|
| |
41
|
Schmandt, C. and Arons, B. A Conversational Telephone Messaging System. IEEE Transactions on Consumer Electronics CE-30, 3 (Aug. 1984), xxi-xxiv.
|
| |
42
|
Scott, R.J. Time Adjustment in Speech Synthesis. Journal of the Acoustical Society of America 41, 1 (1967), 60-65.
|
| |
43
|
|
| |
44
|
Wightman, C.W. and Ostendorf, M. Automatic Recognition of Intonational Features. In Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, IEEE, 1992, pp. 1221-1224.
|
 |
45
|
Lynn Wilcox, Ian Smith, Marcia Bush, Wordspotting for voice editing and audio indexing, Proceedings of the SIGCHI conference on Human factors in computing systems, pp. 655-656, May 03-07, 1992, Monterey, California, United States
doi: 10.1145/142750.150715
|
CITED BY 18
|
|
|
Scott Minneman, Steve Harrison, Bill Janssen, Gordon Kurtenbach, Thomas Moran, Ian Smith, Bill van Melle, A confederation of tools for capturing and accessing collaborative activity, Proceedings of the third ACM international conference on Multimedia, pp. 523-534, November 05-09, 1995, San Francisco, California, United States
|
|
|
Donald G. Kimber, Lynn D. Wilcox, Francine R. Chen, Thomas P. Moran, Speaker segmentation for browsing recorded audio, Conference companion on Human factors in computing systems, pp. 212-213, May 07-11, 1995, Denver, Colorado, United States
|
|
|
Gregory D. Abowd, Christopher G. Atkeson, Ami Feinstein, Cindy Hmelo, Rob Kooper, Sue Long, Nitin Sawhney, Mikiya Tani, Teaching and learning as multimedia authoring: the classroom 2000 project, Proceedings of the fourth ACM international conference on Multimedia, pp. 187-198, November 18-22, 1996, Boston, Massachusetts, United States
|
|
|
Sunil Vemuri, Philip DeCamp, Walter Bender, Chris Schmandt, Improving speech playback using time-compression and speech recognition, Proceedings of the SIGCHI conference on Human factors in computing systems, pp. 295-302, April 24-29, 2004, Vienna, Austria
|
|
|
Michael G. Christel, Michael A. Smith, C. Roy Taylor, David B. Winkler, Evolving video skims into useful multimedia abstractions, Proceedings of the SIGCHI conference on Human factors in computing systems, pp. 171-178, April 18-23, 1998, Los Angeles, California, United States
|
|
|
Kristin Vadas, Nirmal Patel, Kent Lyons, Thad Starner, Julie Jacko, Reading on-the-go: a comparison of audio and hand-held displays, Proceedings of the 8th conference on Human-computer interaction with mobile devices and services, September 12-15, 2006, Helsinki, Finland
|
|
|
INDEX TERMS
Primary Classification:
H. Information Systems
H.5 INFORMATION INTERFACES AND PRESENTATION (I.7)
H.5.2 User Interfaces (D.2.2, H.1.2, I.3.6)
Subjects: Interaction styles (e.g., commands, menus, forms, direct manipulation)
Additional Classification:
H. Information Systems
H.1 MODELS AND PRINCIPLES
H.1.2 User/Machine Systems
Subjects: Human factors
H.5 INFORMATION INTERFACES AND PRESENTATION (I.7)
H.5.1 Multimedia Information Systems
Subjects: Audio input/output
H.5.2 User Interfaces (D.2.2, H.1.2, I.3.6)
Subjects: Input devices and strategies (e.g., mouse, touchscreen)
General Terms: Algorithms, Design, Languages
Keywords: browsing, interactive listening, non-speech audio, speech as data, speech detection, speech skimming, speech user interfaces, time compression
|