ACM Home Page
Please provide us with feedback. Feedback
Combining multimodal preferences for multimedia information retrieval
Full text PdfPdf (259 KB)
Source
International Multimedia Conference archive
Proceedings of the international workshop on Workshop on multimedia information retrieval table of contents
Augsburg, Bavaria, Germany
SESSION: Video retrieval table of contents
Pages: 71 - 78  
Year of Publication: 2007
ISBN:978-1-59593-778-0
Authors
Eric Bruno  University of Geneva, Geneva, Switzerland
Jana Kludas  University of Geneva, Geneva, Switzerland
Stephane Marchand-Maillet  University of Geneva, Geneva, Switzerland
Sponsors
SIGMULTIMEDIA: ACM Special Interest Group on Multimedia
ACM: Association for Computing Machinery
SIGGRAPH: ACM Special Interest Group on Computer Graphics and Interactive Techniques
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 16,   Downloads (12 Months): 65,   Citation Count: 1
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1290082.1290095
What is a DOI?

ABSTRACT

Representing and fusing multimedia information is a key issue to discover semantics in multimedia. In this paper we address more specifically the problem of multimedia content retrieval by first defining a novel preference-based representation particularly adapted to the fusion problem, and then, by investigating the RankBoost algorithm to combine those preferences and a learn multimodal retrieval model. The approach has been tested on annotated images and on the complete TRECVID 2005 corpus and compared with SVM-based fusion strategies. The results show that our approach equals SVM performance but, contrary to SVM, is parameter free and faster.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
L. Boldareva and D. Hiemstra. Interactive content-based retrieval using pre-computed object-object similarities. In Conference on Image and Video Retrieval, CIVR'04, pages 308--316, Dublin, Ireland, 2004.
 
2
Eric Bruno, Nicolas Moenne-Loccoz, and Stéphane Marchand Maillet. Learning user queries in multimodal dissimilarity spaces. In Proceedings of the 3rd International Workshop on Adaptive Multimedia Retrieval, AMR'05, Glasgow, UK, July 2005.
 
3
 
4
Eric Bruno, Nicolas Moenne-Loccoz, and Stéphane Marchand-Maillet. Asymmetric learning and dissimilarity spaces for content-based retrieval. In Proc. of International Conference on Image and Video Retrieval (CIVR), pages 330--339, Tempe, AZ, July 2006.
 
5
E. Y. Chang, B. Li, G. Wu, and K. Go. Statistical learning for effective visual information retrieval. In Proceedings of the IEEE International Conference on Image Processing, 2003.
 
6
R.P.W. Duin. The combining classifier: To train or not to train? In Proceedings of the 16th International Conference on Pattern Recognition, ICPR'02, volume II, pages 765--770, Quebec City, 2004. IEEE Computer Socity Press.
 
7
 
8
J. Gu, L. Lu, H.J Zhang, and J. Yang. Dominant feature vectors based audio similarity measure. In PCM, number 2, pages 890--897, 2004.
 
9
D Heesch and S Rueger. NNk networks for content-based image retrieval. In 26th European Conference on Information Retrieval, Sunderland, UK, 2004.
 
10
Winston H. Hsu and Shih-Fu Chang. Generative, discriminative, and ensemble learning on multi-modal perceptual fusion toward news video story segmentation. In ICME, Taipei, Taiwan, June 2004.
 
11
 
12
 
13
 
14
Nicolas Moënne-Loccoz, Eric Bruno, and Stéphane Marchand-Maillet. Interactive partial matching of video sequences in large collections. In IEEE International Conference on Image Processing, Genova, Italy, 11-14 September 2005.
15
 
16
N.C. Oza, R. Polikar, J. Kittler, and F. Roli. Multiple classifier systems. In Series: Lecture Notes in Computer Science, volume 3541. Springer, 2005.
 
17
 
18
C. Petersohn. Fraunhofer HHI at TRECVID 2004: Shot boundary detection system. In TREC Video Retrieval Evaluation Online Proceedings, 2004.
 
19
Philip Resnik. Using information content to evaluate semantic similarity in a taxonomy. In 14th International Joint Conference on Artificial Intelligence, IJCAI, pages 448--453, Montreal, Canada, 1995.
 
20
21
 
22
J. R. Smith, A. Jaimes, C.-Y. Lin, M. Naphade, A. Natsev, and B. Tseng. Interactive search fusion methods for video database retrieval. In IEEE International Conference on Image Processing (ICIP), 2003.
 
23
24
25
 
26
J. Yang and A.G. Hauptmann. Multi-modality analysis for person type classification in news video. In Electronic Imaging'05 - Conference on Storage and Retrieval Methods and Applications for Multimedia, San Jose, USA, Jan 2005.
 
27
X.S. Zhou, A. Garg, and T.S. Huang. A discussion of nonlinear variants of biased discriminant for interactive image retrieval. In Proc. of the 3rd Conference on Image and Video Retrieval, CIVR'04, pages 353--364, 2004.
 
28
X.S. Zhou and T.S. Huang. Small sample learning during multimedia retrieval using biasmap. In Proceedings of the IEEE Conference on Pattern Recognition and Computer Vision, CVPR'01, volume I, pages 11--17, Hawaii, 2004.


Collaborative Colleagues:
Eric Bruno: colleagues
Jana Kludas: colleagues
Stephane Marchand-Maillet: colleagues