ACM Home Page
Please provide us with feedback. Feedback
Efficient retrieval of life log based on context and content
Full text PdfPdf (932 KB)
Source International Multimedia Conference archive
Proceedings of the the 1st ACM workshop on Continuous archival and retrieval of personal experiences table of contents
New York, New York, USA
SESSION: Session 1 table of contents
Pages: 22 - 31  
Year of Publication: 2004
ISBN:1-58113-932-2
Authors
Kiyoharu Aizawa  The University of Tokyo, Chiba, Japan
Datchakorn Tancharoen  The University of Tokyo, Chiba, Japan
Shinya Kawasaki  The University of Tokyo, Chiba, Japan
Toshihiko Yamasaki  The University of Tokyo, Chiba, Japan
Sponsors
SIGMULTIMEDIA: ACM Special Interest Group on Multimedia
ACM: Association for Computing Machinery
SIGGRAPH: ACM Special Interest Group on Computer Graphics and Interactive Techniques
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 16,   Downloads (12 Months): 184,   Citation Count: 15
Additional Information:

abstract   references   cited by   index terms   review   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1026653.1026656
What is a DOI?

ABSTRACT

In this paper, we present continuous capture of our life log with various sensors plus additional data and propose effective retrieval methods using this context and content. Our life log system contains video, audio, acceleration sensor, gyro, GPS, annotations, documents, web pages, and emails. In our previous studies, we showed our retrieval methodology [8], [9], which mainly depends on context information from sensor data. In this paper, we extend our methodology with additional functions. They are (1) spatio-temporal sampling for extraction of key frames for summarization; and (2) conversation scene detection. With the first of these, key frames for the summarization are extracted using time and location data (GPS). Because our life log captures dense location data, we can also make use of derivatives of location data, that is, speed and acceleration in the movement of the person. The summarizing key frames are made using them. We also introduce content analysis for conversation scene detection. In our previous work, we have investigated context-based retrieval, which differs from the majority of studies in image/video retrieval focusing on content-based retrieval. In this paper, we introduce visual and audio data content analysis for conversation scene detection. The detection of conversation scenes will be very important tags for our life log data retrieval. We describe our present system and additional functions, as well as preliminary results for the additional functions.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Lamming, M., and Flynn, M. "Forget-me-not" Intimate Computing in Support of Human Memory. Proceedings of FRIEND21, 94 Int. Symp. Next Generation Human Interface, (Feb. 1994), 125--128.
 
2
 
3
Gemmell, J., Lueder, R., and Bell, G. Living with a Lifetime Store. ATR Workshop on Ubiquitous Experience Media, (Sep. 2003).
 
4
 
5
 
6
Aizawa, K., Ishijima, K., and Shiina, M. Summarizing wearable video. Proceedings of ICIP 2001, (Oct. 2001), 398--401.
 
7
Sawahata, Y., and Aizawa, K. Wearable imaging system for summarizing personal experiences. Proceedings of ICME 2003, (Jul. 2003), I45--I48.
8
 
9
Hori, T. and Aizawa, K., Capturing Life Log and Retrieval based on Context, IEEE ICME2004, June, 2004.
 
10
Pan, H., Liang, Z. P., and Huang, T. S. Fusing Audio and Visual Features of Speech. Proceedings of ICIP 2000 (Sep. 2000), 214--217.
 
11
Cuetos, P., and Neti, C. Audio-Visual Intent-to-Speak Detection For Human-Computer Interaction. Proceedings of ICASSP'00(Jun. 2000), 2373--2376.
 
12
Tancharoen, D., and Jitapunkul, S. Automatic Face Color Segmentation Based Rate Control For Low Bit Rate Video Coding. Proceedings of ISCAS 2003 (May. 2003), 384--387.
 
13
Tsekeridou, S., and Pitas, I. Content-Based Video Parsing and Indexing Based on Audio-Visual Interaction. IEEE Trans. Circuits and System for Video Technology, 11, 4 (Apr. 2001), 522--535.
 
14

CITED BY  15


REVIEW

"Athena Vakali : Reviewer"

People have always been interested in the use of personal media capture to sample and archive their experiences. Nowadays, the technology to support this endeavor has progressed from diaries and painting, through pocket cameras, to the current era  more...

Collaborative Colleagues:
Kiyoharu Aizawa: colleagues
Datchakorn Tancharoen: colleagues
Shinya Kawasaki: colleagues
Toshihiko Yamasaki: colleagues