|
ABSTRACT
In this paper, we propose a video summarization system which takes into account users' individual preferences by using their personal photo libraries. Nowadays it is common, especially among people of younger generations, to store thousands of photos inside their PCs and manage them using software such as iPhoto and Picasa. These personal photo libraries contain rich information about the user's tastes, personalities, and lifestyles. Since still photos are in many aspects similar to video as a medium, we assume that these personal photo libraries can be used to estimate users' preferences on video summarization.Our system first divides a movie into short segments, and uses image classification techniques to judge whether each segment is meaningful to the user or not. If many photos with contents similar to the segment can be found in the user's photo library, the segment is judged as being "important" to the user. Conventional image classification techniques use public or commercial photo databases as training data, while our system uses personal photo libraries. This difference leads to the need of several modifications in the classification process.We have implemented a prototype version of our system, and have validated the effectiveness of our approach through evaluating both the accuracy of our image classification algorithm, and users' subjective satisfaction levels of the summarization results.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
Carson, C., Belongie, S., Greenspan, H. Malik, J. Blobworld: Image segmentation using Expectation-Maximization and its application to image querying. In IEEE Transactions on Pattern Analysis and Machine Intelligence, 1999.
|
| |
2
|
|
 |
3
|
|
| |
4
|
Ju, S. X., Black, M. J., Minneman, S., Kimber, D. Summarization of Video-taped Presentations: Automatic Analysis of Motion and Gesture. In IEEE Transactions on Circuits and Systems for Video Technology, 1998.
|
| |
5
|
|
 |
6
|
|
 |
7
|
|
| |
8
|
Mori, Y., Takahashi, H., Oka, R. Image-to-word transformation based on dividing and vector quantizing images with words. In First International Workshop on Multimedia Intelligent Storage and Retrieval Management (MISRM 99), 1999.
|
| |
9
|
Parshin, V., Chen, L. Video Summarization Based on User-Defined Constraints and Preferences. In Proceedings of RIAO 2004, 2004.
|
| |
10
|
|
| |
11
|
Smith, M. A., Kanade, T. Video Skimming and Characterization through the Combination of Image and Language Understanding Techniques. In CMU Technology Report, 1997.
|
| |
12
|
Yow, D., Yeo, B. L., Yeung, M., Liu, G. Analysis and Presentation of Soccer Highlights from Digital Video. In Proceedings of Second Asian Conference on Computer Vision, 1995.
|
CITED BY
|
Wen-Huang Cheng , Yung-Yu Chuang , Bing-Yu Chen , Ja-Ling Wu , Shao-Yen Fang , Yin-Tzu Lin , Chi-Chang Hsieh , Chen-Ming Pan , Wei-Ta Chu , Min-Chun Tien, Semantic-event based analysis and segmentation of wedding ceremony videos, Proceedings of the international workshop on Workshop on multimedia information retrieval, September 24-29, 2007, Augsburg, Bavaria, Germany
|
|