| The meeting project at ICSI |
| Full text |
Pdf
(75 KB)
|
| Source
|
Human Language Technology Conference
archive
Proceedings of the first international conference on Human language technology research
table of contents
San Diego
Pages: 1 - 7
Year of Publication: 2001
|
|
Authors
|
|
Nelson Morgan
|
International Computer Science Institute, Berkeley, CA
|
|
Don Baron
|
International Computer Science Institute, Berkeley, CA
|
|
Jane Edwards
|
International Computer Science Institute, Berkeley, CA
|
|
Dan Ellis
|
International Computer Science Institute, Berkeley, CA
|
|
David Gelbart
|
International Computer Science Institute, Berkeley, CA
|
|
Adam Janin
|
International Computer Science Institute, Berkeley, CA
|
|
Thilo Pfau
|
International Computer Science Institute, Berkeley, CA
|
|
Elizabeth Shriberg
|
International Computer Science Institute, Berkeley, CA
|
|
Andreas Stolcke
|
International Computer Science Institute, Berkeley, CA
|
|
| Publisher |
Association for Computational Linguistics
Morristown, NJ, USA
|
| Bibliometrics |
Downloads (6 Weeks): 2, Downloads (12 Months): 9, Citation Count: 12
|
|
|
ABSTRACT
In collaboration with colleagues at UW, OGI, IBM, and SRI, we are developing technology to process spoken language from informal meetings. The work includes a substantial data collection and transcription effort, and has required a nontrivial degree of infrastructure development. We are undertaking this because the new task area provides a significant challenge to current HLT capabilities, while offering the promise of a wide range of potential applications. In this paper, we give our vision of the task, the challenges it represents, and the current state of our development, with particular attention to automatic transcription.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
D. Biber, Variation across speech and writing. 1st pbk. ed. Cambridge {England}; New York: Cambridge University Press, 1991.
|
| |
3
|
W. Chafe, Cognitive constraints on information flow. In R. S. Tomlin (ed.) Coherence and grounding in discourse. Philadelphia: John Benjamins, pp. 21--51, 1987.
|
| |
4
|
J. Edwards, The transcription of Discourse. In D. Tannen, D. Schiffrin, and H. Hamilton (eds). The Handbook of Discourse Analysis. NY: Blackwell (in press).
|
| |
5
|
H. Hermansky, D. Ellis, and S. Sharma, Tandem connectionist feature stream extraction for conventional HMM systems, Proc. ICASSP, pp. III-1635--1638, Istanbul, 2000.
|
| |
6
|
H. Hermansky and N. Morgan, RASTA Processing of Speech, IEEE Trans. Speech and Audio Processing 2(4), 578--589, 1994.
|
| |
7
|
A. Janin and N. Morgan, SpeechCorder, the Portable Meeting Recorder, Workshop on hands-free speech communication, Kyoto, April 9--11, 2001.
|
| |
8
|
|
| |
9
|
E. Shriberg, R. Bates, A. Stolcke, P. Taylor, D. Jurafsky, K. Ries, N. Coccaro, R. Martin, M. Meteer, and C. Van Ess-Dykema. Can prosody aid the automatic classification of dialog acts in conversational speech? Language and Speech, 41(3--4):439--487, 1998.
|
| |
10
|
|
| |
11
|
A. Stolcke, H. Bratt, J. Butzberger, H. Franco, V. R. Rao Gadde, M. Plauché, C. Richey, E. Shriberg, K. Sönmez, F. Weng, and J. Zheng. The SRI March 2000 Hub-5 conversational speech transcription system. Proc. NIST Speech Transcription Workshop, College Park, MD, May 2000.
|
| |
12
|
Andreas Stolcke , Noah Coccaro , Rebecca Bates , Paul Taylor , Carol Van Ess-Dykema , Klaus Ries , Elizabeth Shriberg , Daniel Jurafsky , Rachel Martin , Marie Meteer, Dialogue act modeling for automatic tagging and recognition of conversational speech, Computational Linguistics, v.26 n.3, p.339-373, September 2000
[doi> 10.1162/089120100561737]
|
| |
13
|
|
| |
14
|
A. Waibel, M. Bett, M. Finke, and R. Stiefelhagen, Meeting Browser: Tracking and Summarizing Meetings, Proc. DARPA Broadcast News Transcription and Understanding Workshop, Lansdowne, VA, 1998.
|
| |
15
|
H. Yu, C. Clark, R. Malkin, and A. Waibel, Experiments in Automatic Meeting Transcription Using JRTK, Proc. ICASSP, pp. 921--924, Seattle, 1998.
|
CITED BY 12
|
|
Iain McCowan , Daniel Gatica-Perez , Samy Bengio , Guillaume Lathoud , Mark Barnard , Dong Zhang, Automatic Analysis of Multimodal Group Actions in Meetings, IEEE Transactions on Pattern Analysis and Machine Intelligence, v.27 n.3, p.305-317, March 2005
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Paul M. Aoki , Matthew Romaine , Margaret H. Szymanski , James D. Thornton , Daniel Wilson , Allison Woodruff, The mad hatter's cocktail party: a social mobile audio space supporting multiple simultaneous conversations, Proceedings of the SIGCHI conference on Human factors in computing systems, April 05-10, 2003, Ft. Lauderdale, Florida, USA
|
|
Dong Zhang , Daniel Gatica-Perez , Samy Bengio , Iain McCowan , Guillaume Lathoud, Multimodal group action clustering in meetings, Proceedings of the ACM 2nd international workshop on Video surveillance & sensor networks, October 15-15, 2004, New York, NY, USA
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|