ACM Home Page
Please provide us with feedback. Feedback
VoiceLabel: using speech to label mobile sensor data
Full text PdfPdf (2.70 MB)
Source
International Conference on Multimodal Interfaces archive
Proceedings of the 10th international conference on Multimodal interfaces table of contents
Chania, Crete, Greece
POSTER SESSION: Multimodal systems I (poster session) table of contents
Pages 69-76  
Year of Publication: 2008
ISBN:978-1-60558-198-9
Authors
Susumu Harada  University of Washington, Seattle, WA, USA
Jonathan Lester  University of Washington, Seattle, WA, USA
Kayur Patel  University of Washingotn, Seattle, WA, USA
T. Scott Saponas  University of Washington, Seattle, WA, USA
James Fogarty  University of Washington, Seattle, WA, USA
James A. Landay  University of Washington, Seattle, WA, USA
Jacob O. Wobbrock  University of Washington, Seattle, WA, USA
Sponsors
SIGCHI: ACM Special Interest Group on Computer-Human Interaction
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 18,   Downloads (12 Months): 97,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1452392.1452407
What is a DOI?

ABSTRACT

Many mobile machine learning applications require collecting and labeling data, and a traditional GUI on a mobile device may not be an appropriate or viable method for this task. This paper presents an alternative approach to mobile labeling of sensor data called VoiceLabel. VoiceLabel consists of two components: (1) a speech-based data collection tool for mobile devices, and (2) a desktop tool for offline segmentation of recorded data and recognition of spoken labels. The desktop tool automatically analyzes the audio stream to find and recognize spoken labels, and then presents a multimodal interface for reviewing and correcting data labels using a combination of the audio stream, the system's analysis of that audio, and the corresponding mobile sensor data. A study with ten participants showed that VoiceLabel is a viable method for labeling mobile sensor data. VoiceLabel also illustrates several key features that inform the design of other data labeling tools.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Bao, L. and Intille, S. S. 2004. Activity recognition from user-annotated acceleration data. In Proc. of Pervasive 2004.
 
2
 
3
 
4
Chen, M, Sohn, T., Chmelev, D., Haehnel, D., Hightower, J., Hughes, J., and LaMarca, A. 2006. Practical metropolitan-scale positioning for GSM phones. In Proc. of Ubicomp 2006.
 
5
Chen, S., Gaur, A., Muthukrishnan, S., and Rosenbluth, D. 2004. Wireless in loco sensor data collection and applications. In Proc. of WWW 2004 Workshop on Emerging Applications for Wireless and Mobile Access.
6
 
7
 
8
Goto, M., Itou, K., and Kobayashi, T. 2005. Speech interface exploiting intentionally--controlled nonverbal speech information. In Proc. of UIST 2005.
9
 
10
Lamming, M., and Bohm, D. 2003. SPECs: Another approach to human context and activity sensing research, using tiny peer-to-peer wireless computers. In Proc. of Ubicomp 2003.
 
11
Lester, J., Choudhury, T., Kern, N., Borriello, G., and Hannaford, B. 2005. A hybrid discriminative-generative approach for modeling human activities. In Proc. of IJCAI 2005.
 
12
Lester, J., Choudhury, T., and Borriello, G. 2006. A practical approach to recognizing physical activities. In Proc. of Pervasive 2006.
 
13
Logan, B., Healey, J., Philipose, M., Tapia, E.M., and Intille, S. 2007. A long-term evaluation of sensing modalities for activity recognition. In Proc. of Ubicomp 2007.
 
14
Nissenbaum, H. 1998. Protecting privacy in an information age: The problem of privacy in public. In Law and Philosophy 17 (5-6).
15
 
16
Saponas, T., Lester, J., Froehlich, J, Fogarty, J., Landay, J. 2008. iLearn on the iPhone: Real-Time Human Activity Classification on Commodity Mobile Phones. University of Washington CSE Tech Report UW-CSE-08-04-02.
 
17
Tapia, E.M., Intille, S.S., and Larson, K. 2004. Activity recognition in the home setting using simple and ubiquitous sensors. In Proc. of Pervasive 2004.

Collaborative Colleagues:
Susumu Harada: colleagues
Jonathan Lester: colleagues
Kayur Patel: colleagues
T. Scott Saponas: colleagues
James Fogarty: colleagues
James A. Landay: colleagues
Jacob O. Wobbrock: colleagues