| Semantic-event based analysis and segmentation of wedding ceremony videos |
| Full text |
Pdf
(1.53 MB)
|
Source
|
International Multimedia Conference
archive
Proceedings of the international workshop on Workshop on multimedia information retrieval
table of contents
Augsburg, Bavaria, Germany
SESSION: Video retrieval
table of contents
Pages: 95 - 104
Year of Publication: 2007
ISBN:978-1-59593-778-0
|
|
Authors
|
|
Wen-Huang Cheng
|
National Taiwan University, Taipei, Taiwan Roc
|
|
Yung-Yu Chuang
|
National Taiwan University, Taipei, Taiwan Roc
|
|
Bing-Yu Chen
|
National Taiwan University, Taipei, Taiwan Roc
|
|
Ja-Ling Wu
|
National Taiwan University, Taipei, Taiwan Roc
|
|
Shao-Yen Fang
|
National Taiwan University, Taipei, Taiwan Roc
|
|
Yin-Tzu Lin
|
National Taiwan University, Taipei, Taiwan Roc
|
|
Chi-Chang Hsieh
|
National Taiwan University, Taipei, Taiwan Roc
|
|
Chen-Ming Pan
|
National Taiwan University, Taipei, Taiwan Roc
|
|
Wei-Ta Chu
|
National Taiwan University, Taipei, Taiwan Roc
|
|
Min-Chun Tien
|
National Taiwan University, Taipei, Taiwan Roc
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 13, Downloads (12 Months): 67, Citation Count: 0
|
|
|
ABSTRACT
Wedding is one of the most important ceremonies in our lives. It symbolizes the birth and creation of a new family. In this paper, we present a system for automatically segmenting a wedding ceremony video into a sequence of recognized wedding events, e.g., the couple's wedding kiss. Our goal is to develop an automatic tool for users to efficiently organize, search, and retrieve his/her treasured wedding memories. Furthermore, the event descriptions could benefit and complement the current research in semantic video understanding. Technically, three kinds of event features, i.e., the speech/music discriminator, flashlight detector, and bride indicator, are exploited to build statistical models for each wedding event. Events are then recognized by a hidden Markov model, which takes into account both the fitness of observed features and the temporal rationality of event ordering to improve the segmentation accuracy. We conducted experiments on a rich set of wedding videos, and the results demonstrate the effectiveness of our approach.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
L. M. Spangenberg. Timeless Traditions: A Couple's Guide To Wedding Customs Around The World. Universe Publishing, 2001.
|
| |
2
|
D. Warner. Diane Warner's Contemporary Guide to Wedding Ceremonies. New Page Books, 2006.
|
| |
3
|
D. Gatica-Perez, A. Loui, and M.-T. Sun. Finding structure in home videos by probabilistic hierarchical clustering. IEEE TCSVT, 13(6):539--548, June 2003.
|
| |
4
|
Y. Zhai and M. Shah. Automatic segmentation of home videos. Proc. IEEE ICME'05, pages 9--12, 2005.
|
| |
5
|
|
| |
6
|
Z. Rasheed and M. Shah. Detection and representation of scenes in videos. IEEE TMM, 7(6):1097--1105, Dec. 2005.
|
| |
7
|
P. Yin, X.-S. Hua, and H.-J. Zhang. Automatic time stamp extraction system for home videos. Proc. IEEE ISCAS'02, pages 73--76, 2002.
|
 |
8
|
Matthew Cooper , Jonathan Foote , Andreas Girgensohn , Lynn Wilcox, Temporal event clustering for digital photo collections, Proceedings of the eleventh ACM international conference on Multimedia, November 02-08, 2003, Berkeley, CA, USA
[doi> 10.1145/957013.957093]
|
| |
9
|
|
| |
10
|
T. Mei, X.-S. Hua, H.-Q. Zhou, and S. Li. Modeling and mining of users' capture intention for home videos. IEEE TMM, 9(1):66--77, Jan. 2007.
|
| |
11
|
|
| |
12
|
|
 |
13
|
|
 |
14
|
|
| |
15
|
W.-H. Cheng, C.-W. Wang, and J.-L. Wu. Video adaptation for small display based on content recomposition. IEEE TCSVT, 17(1):43--58, Jan. 2007.
|
| |
16
|
X.-S. Hua, L. Lu, and H.-J. Zhang. Optimization-based automated home video editing system. IEEE TCSVT, 14(5):572--583, May 2004.
|
 |
17
|
|
| |
18
|
M. Chen, S.-C. Chen, M.-L. Shyu, and K. Wickramaratna. Semantic event detection via multimodal data mining. IEEE SPM, Mar. 2006.
|
| |
19
|
J.-H. Lim, Q. Tian, and P. Mulhem. Home photo content modeling for personalized event-based retrieval. IEEE MM, 10(4):28--37, Oct.-Dec. 2003.
|
| |
20
|
C. Panagiotakis and G. Tziritas. A speech/music discriminator based on rms and zero-crossings. IEEE TMM, 7(1):155--166, Feb. 2005.
|
| |
21
|
|
| |
22
|
Y. Li and C. Dorai. Instructional video content analysis using audio information. IEEE Trans. Audio, Speech, Lang. Process., 14(6):2264--2274, Nov. 2006.
|
| |
23
|
|
| |
24
|
B. T. Truong and S. Venkatesh. Determining dramatic intensification via flashing lights in movies. Proc. IEEE ICME'01, pages 61--64, 2001.
|
| |
25
|
|
| |
26
|
|
| |
27
|
|
|