ACM Home Page
Please provide us with feedback. Feedback
Minimal-impact audio-based personal archives
Full text PdfPdf (458 KB)
Source International Multimedia Conference archive
Proceedings of the the 1st ACM workshop on Continuous archival and retrieval of personal experiences table of contents
New York, New York, USA
SESSION: Session 2 table of contents
Pages: 39 - 47  
Year of Publication: 2004
ISBN:1-58113-932-2
Authors
Daniel P.W. Ellis  Columbia University, New York, NY
Keansub Lee  Columbia University, New York, NY
Sponsors
SIGMULTIMEDIA: ACM Special Interest Group on Multimedia
ACM: Association for Computing Machinery
SIGGRAPH: ACM Special Interest Group on Computer Graphics and Interactive Techniques
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 5,   Downloads (12 Months): 39,   Citation Count: 15
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1026653.1026659
What is a DOI?

ABSTRACT

Collecting and storing continuous personal archives has become cheap and easy, but we are still farfromcreating a useful, ubiquitous memory aid. We view the inconvenience to the user of being 'instrumented'as one of the key barriers to the broader development and adoption of these technologies. Audio-only recordings,however, can have minimal impact, requiring only that a device the size and weight of a cellphone be carried somewhere on the person. We have conducted some small-scale experiments on collecting continuous personal recordings of this kind, and investigating how they can be automatically analyzed and indexed, visualized, and correlated with other minimal-impact, opportunistic data feeds (such as online calendars and digital photo collections). We describe our unsupervised segmentation and clustering experiments in which we can achieve good agreement with hand-marked environment/situation labels. We al so di scuss some of the broader issues raised by this kind of work including privacy concerns,and describe our future plans to address these and other questions.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
AquaMinds Software.NoteTaker: An outlining program, 2003.http://www.aquaminds.com/
 
2
S. Chen and P. Gopalakrishnan. Speaker, environment and channel change detection and clustering via the Bayesian Information Criterion. In Proc. DARPA Broadcast News Transcription and Understanding Workshop 1998. http://www.nist.gov/speech/publications/darpa98/pdf/bn20.pdf
 
3
B. Clarkson, N. Sawhney, and A. Pentland Auditory context awareness via wearable computing. In Proc. Perceptual User Interfaces Workshop 1998. http://web.media.mit.edu/~nitin/NomadicRadio/PUI98/pui98.pdf.
 
4
B.P. Clarkson. Life patterns: structure from wearable sensors PhD thesis, MIT Media Lab, 2002. http://web.media.mit.edu~clarkson/thesis.pdf
 
5
Digital Innovations. The Neuros digital audio computer, 2003. http://www.neurosaudio.com/
 
6
D.P.W. Ellis and K. Lee. Features for segmenting and classifying long-duration recordings of "personal" audio. In Proc. ISCA Tutorial and Research Workshop on Statistical and Perceptual Audio Processing SAPA-04 Jeju, Korea, October 2004. http://www.ee.columbia.edu~dpwe/pubs/sapa04-persaud.pdf
 
7
M. Flynn, 2004.Personal communication.
8
 
9
Hyun Won Co.(mobiBLU).MusicMasterFM DAH-420 MP3 player,2004. http://itave.com/mp3_players/musicmaster.html
 
10
M. Lamming and M. Flynn.F orget-me-not: Intimate computing in support of human memory. In Proc. FRIEND21, 94 International Symposium on Next Generation Human Interface Meguro Gajoen, Japan, 1994. http://www.lamming.com/mik/Papers/fmn.pdf
 
11
N. Morgan, D. Baron, J. Edwards, D. Ellis, D.Gelbart, A. cJanin, T. Pfau, E.Shriberg, and A. Stolcke. Themeeting project at ICSI. In Proc. HLT pages 246--252, 2001.
 
12
A. Ng, M. Jordan, and Y. Weiss. On spectral clustering: Analysis and an algorithm. In Advances in NIPS MIT Press, Cambridge, MA, 2001. http://citeseer.ist.psu.edu/ng01spectral.html
13
 
14
S. Renals and D.P. Ellis. Audio information access from meeting rooms. In Proc. ICASSP HongKong, 2003. http://www.dcs.shef.ac.uk/~sjr/pubs/2003/icassp03-mtg.html
 
15
D. Reynolds. An overview of automatic speaker recognition technology. In Proc. IEEE ICASSP-02 Orlando, FL, 2002.
 
16
K. Sjölander and J. Beskow. Wavesurfer-anopen source speech tool. In Proc. ICSLP-2000 Beijing, 2000. http://www.speech.kth.se/wavesurfer/wsurf_icslp00.pdf
 
17
Soundman. OKM IMic set with A3 adapter, 2004. http://www.outwardsound.com/product/microphones/72/
18
 
19
A. Stolcke, C. Wooters, N. Mirghafori, T. Pirinen, I. Bulyko, D. Gelbart, M. Graciarena, S. Otterson, B. Peskin, and M. Ostendorf. Progress in meeting recognition: The ICSI-SRI-UW spring 2004 evaluation system. In NIST 2004 Meeting Recognition Workshop Montreal, May 2004. http://www.speech.sri.com/papers/nist2004-meeting-system.ps.gz
 
20
G. Williams and D. Ellis. Speech/music discrimination based on posterior probability features. In Proc. Eurospeech-99 1999. ftp://ftp.icsi.berkeley.edu/pub/speech/papers/euro99-mussp.pdf

CITED BY  15

Collaborative Colleagues:
Daniel P.W. Ellis: colleagues
Keansub Lee: colleagues