Multi-context voice communication in a SIP/SIMPLE-based shared virtual sound room with early reflections
Source: International Workshop on Network and Operating System Support for Digital Audio and Video
Proceedings of the international workshop on Network and operating systems support for digital audio and video
Stevenson, Washington, USA
SESSION: Audio
Pages: 45-50
Year of Publication: 2005
ISBN:1-58113-987-X
Author
Yasusi Kanada  Hitachi, Ltd., Tokyo, Japan
Sponsors
SIGMULTIMEDIA: ACM Special Interest Group on Multimedia
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
DOI: http://doi.acm.org/10.1145/1065983.1065996
ABSTRACT

An improved prototype of the "voiscape" voice communication medium has been developed and subjectively evaluated. Voiscape enables natural and seamless voice communication by using sound to create a virtual "sound room" in which people, who are represented by different sounds, can move freely. It features low-delay motion-tracking spatial audio with simulated early reflections, which produce out-of-head sound localization and convey the distance of each sound. It also features virtual-location-based selective communication: a user can walk freely in the sound room using a map- and cursor-key-based user interface and can select whom to talk to or which sound sources to listen to. A third feature is SIP-presence-event-notification (SIMPLE)-based sound room management: when users move, their locations and directions are distributed using SIP SUBSCRIBE/NOTIFY messages. The combination of these features creates a natural voice-communication space in which two or more parallel conversation contexts can coexist. Limited, subjective testing by around 200 people showed that this medium can be used for cocktail-party-like conversation; i.e., users could distinguish parallel conversations by paying attention to or by moving toward one of them.
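
The abstract does not say how the early reflections are computed. As a minimal sketch only, the following Python assumes a first-order image-source model in a 2D rectangular room: the source is mirrored across each wall, and each mirrored copy contributes a delayed, attenuated echo. The function name `first_order_reflections`, the `wall_reflection` coefficient, the speed-of-sound constant, and the 1/distance gain law are all illustrative assumptions, not details of the voiscape prototype.

```python
import math

SPEED_OF_SOUND = 343.0  # m/s; assumed room-temperature value


def first_order_reflections(src, lst, room, wall_reflection=0.7):
    """Return a list of (delay_seconds, gain) pairs: the direct path first,
    then the four first-order wall reflections.

    src, lst: (x, y) positions of source and listener in metres.
    room: (width, depth) of the rectangular room in metres.
    wall_reflection: hypothetical amplitude reflection coefficient per wall.
    """
    sx, sy = src
    width, depth = room
    # Mirror the source across the walls x=0, x=width, y=0, y=depth.
    images = [
        (-sx, sy),
        (2 * width - sx, sy),
        (sx, -sy),
        (sx, 2 * depth - sy),
    ]
    paths = [(src, 1.0)] + [(img, wall_reflection) for img in images]
    taps = []
    for (px, py), path_gain in paths:
        # Clamp very short distances so the 1/distance gain stays bounded.
        dist = max(math.hypot(px - lst[0], py - lst[1]), 0.1)
        taps.append((dist / SPEED_OF_SOUND, path_gain / dist))
    return taps
```

Calling `first_order_reflections((1.0, 1.0), (3.0, 1.0), (4.0, 3.0))` yields five taps. In a renderer of this kind, each tap would feed a delay line whose output is spatialized from the direction of its image source; the later-arriving, weaker echoes are the distance cue the abstract refers to.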

