ACM Home Page
Please provide us with feedback. Feedback
Live speaker identification in conversations
Full text PdfPdf (1.13 MB)
Source
International Multimedia Conference archive
Proceeding of the 16th ACM international conference on Multimedia table of contents
Vancouver, British Columbia, Canada
DEMONSTRATION SESSION: Demo session 2 table of contents
Pages 1017-1018  
Year of Publication: 2008
ISBN:978-1-60558-303-7
Authors
Gerald Friedland  International Computer Science Institute, Berkeley, CA, USA
Oriol Vinyals  University of California San Diego, San Diego, CA, USA
Sponsors
ACM: Association for Computing Machinery
SIGMULTIMEDIA: ACM Special Interest Group on Multimedia
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 15,   Downloads (12 Months): 82,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1459359.1459558
What is a DOI?

ABSTRACT

The following article describes our technical demonstration of an online speaker identification system for conversations. A laptop with an internal microphone is centrally placed in the table of a meeting room. The system is able to identify the current speaker independent of spoken text or language with a latency of about 1.5 seconds and an accuracy of about 85% (as evaluated against the NIST RT benchmark). A Java GUI shows the image of the current speaker along with a timeline containing past speakers. Speakers are added to the system's database using a one-minute training procedure.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
X. Anguera, C. Wooters, B. Peskin, and M. Aguilo. Robust speaker segmentation for meetings: The ICSI-SRI spring 2005 diarization system. In Proceeding of the NIST MLMI Meeting Recognition Workshop, Edinburgh, 2005.
 
2
 
3
D. A. Reynolds and P. Torres-Carrasquillo. Approaches and applications of audio diarization. In Proceedings of the IEEE ICASSP, 2005.
 
4
 
5
C. Wooters and M. Huijbregts. The ICSI RT07s speaker diarization system. In Proceedings of the RT07 Meeting Recognition Evaluation Workshop, 2007.

Collaborative Colleagues:
Gerald Friedland: colleagues
Oriol Vinyals: colleagues