ACM Home Page
Please provide us with feedback. Feedback
Disambiguating speech commands using physical context
Full text PdfPdf (641 KB)
Source
International Conference on Multimodal Interfaces archive
Proceedings of the 9th international conference on Multimodal interfaces table of contents
Nagoya, Aichi, Japan
POSTER SESSION: Poster session 2 table of contents
Pages 247-254  
Year of Publication: 2007
ISBN:978-1-59593-817-6
Authors
Katherine M. Everitt  University of Washington, Seattle, WA
Susumu Harada  University of Washington, Seattle, WA
Jeff Bilmes  University of Washington, Seattle, WA
James A. Landay  University of Washington, Seattle, WA
Sponsors
SIGCHI: ACM Special Interest Group on Computer-Human Interaction
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 8,   Downloads (12 Months): 44,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1322192.1322235
What is a DOI?

ABSTRACT

Speech has great potential as an input mechanism for ubiquitous computing. However, the current requirements necessary for accurate speech recognition, such as a quiet environment and a well-positioned and high-quality microphone, are unreasonable to expect in a realistic setting. In a physical environment, there is often contextual information which can be sensed and used to augment the speech signal. We investigated improving speech recognition rates for an electronic personal trainer using knowledge about what equipment was in use as context. We performed an experiment with participants speaking in an instrumented apartment environment and compared the recognition rates of a larger grammar with those of a smaller grammar that is determined by the context.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
G. S.Aist and J. Mostow. 1997. Adapting Human Tutorial Interventions for a Reading Tutor that Listens: Using Continous Speech Recognition in Interactive Educational Multimedia. In CALL '97 Conference on Multimedia, England.
 
2
Answers Anywhere www.iAnywhere.com
 
3
Chang, K. Chen, M., Canny, J., Towards Balanced Exercise Programs: Tracking Free-weight Exercises, Ubicomp 2007.
 
4
Coen, M.; Weisman, L; Thomas, K; Groh, M. A Context Sensitive Natural Language Modality for the Intelligent Room. In Proc. MANSE'99. Dublin, Ireland. 1999.
 
5
Consolvo, S., Paulos, E., Smith, I. Mobile Persuasion for Everyday Behavior Change. Mobile Persuasion. Stanford Captology Media. 2007
6
 
7
 
8
9
 
10
Microsoft Speech Server http://microsoft.com/speech
 
11
MySportTraining 3.97 by VidaOne, Inc http://www.pocketgear.com/software_detail.asp?id=630
 
12
 
13
 
14
R. Porzel and I. Gurevych, Contextual Coherence in Natural Language Processing, CONTEXT 2003, LNAI 2680, Springer-Verlag, pp 272--285, 2003.
 
15
 
16
Joshua R. Smith, Alanson Sample, Pauline Powledge, Alexander Mamishev, Sumit Roy. A wirelessly powered platform for sensing and computation. In Proc. Ubicomp 2006
 
17
C. Wai, R. Pieraccini, and H. M. Meng, A Dynamic Semantic Model for Re-scoring Recognition Hypotheses, In Proceedings of ICASSP2001, pp 589--592, 2001.
 
18
 
19
Mark Weiser, "The Computer for the Twenty-First Century," Scientific American, pp. 94--10, September 1991

Collaborative Colleagues:
Katherine M. Everitt: colleagues
Susumu Harada: colleagues
Jeff Bilmes: colleagues
James A. Landay: colleagues