|
ABSTRACT
The variety of features available to represent multimedia data constitutes a rich pool of information. However, the plethora of data poses a challenge in terms of feature selection and integration for effective retrieval. Moreover, to further improve effectiveness, the retrieval model should ideally incorporate context-dependent feature representations to allow for retrieval on a higher semantic level. In this paper we present a retrieval model and learning framework for the purpose of interactive information retrieval. We describe how semantic relations between multimedia objects based on user interaction can be learnt and then integrated with visual and textual features into a unified framework. The framework models both feature similarities and semantic relations in a single graph. Querying in this model is implemented using the theory of random walks. In addition, we present ideas to implement short-term learning from relevance feedback. Systematic experimental results validate the effectiveness of the proposed approach for image retrieval. However, the model is not restricted to the image domain and could easily be employed for retrieving multimedia data (and even a combination of different domains, eg images, audio and text documents).
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
 |
2
|
|
| |
3
|
J. Han, M. Li, H. Zhang, and L. Guo. A memory learning framework for effective image retrieval. IEEE Trans. Image Processing, 14(4): 511--524, 2005.
|
 |
4
|
Jingrui He , Hanghang Tong , Mingjing Li , Wei-Ying Ma , Changshui Zhang, Multiple random walk and its application in content-based image retrieval, Proceedings of the 7th ACM SIGMM international workshop on Multimedia information retrieval, November 10-11, 2005, Hilton, Singapore
[doi> 10.1145/1101826.1101852]
|
| |
5
|
M.-K. Hu. Visual pattern recognition by moment invariants. IEEE Trans. Information Theory, 8(2): 179--187, Feb. 1962.
|
 |
6
|
G. Iyengar , P. Duygulu , S. Feng , P. Ircing , S. P. Khudanpur , D. Klakow , M. R. Krause , R. Manmatha , H. J. Nock , D. Petkova , B. Pytlik , P. Virga, Joint visual-text modeling for automatic retrieval of multimedia documents, Proceedings of the 13th annual ACM international conference on Multimedia, November 06-11, 2005, Hilton, Singapore
[doi> 10.1145/1101149.1101154]
|
 |
7
|
|
| |
8
|
A. N. Langville and C. D. Meyer. Deeper inside pagerank. Internet Mathematics, 1(3):335--400, 2004.
|
| |
9
|
|
| |
10
|
L. Lovasz. Random walks on graphs: A survey. Combinatorics, Paul Erdos is Eighty, 2:353--398, 1993.
|
| |
11
|
NIST. Proc. of the TREC Video Retrieval Evaluation Conference (TRECVID2005), Gaithersburgh, MD, USA, Nov. 2005.
|
| |
12
|
|
| |
13
|
J. J. Rocchio. Relevance feedback in information retrieval. In The SMART retrieval system: experiments in automatic document processing, pages 313--323. Prentice-Hall, Englewood Cliffs, US, 1971.
|
| |
14
|
Y. Rui and T. S. Huang. Optimizing learning in image retrieval. In IEEE Proc. of Conf. on Computer Vision and Pattern Recognition (CVPR-00), pages 236--245, Los Alamitos, June 2000.
|
| |
15
|
|
| |
16
|
M. Sonka, V. Hlavac, and R. Boyle. Image Processing, Analysis, and Machine Vision. Thomson-Engineering, Toronto, Canada, 2nd edition, 1998.
|
| |
17
|
M. Stricker and M. Orengo. Similarity of color images. In Proc. of the SPIE: Storage and Retrieval for Image and Video Databases, volume 2420, pages 381--392, Feb. 1995.
|
| |
18
|
|
 |
19
|
Hanghang Tong , Jingrui He , Mingjing Li , Changshui Zhang , Wei-Ying Ma, Graph based multi-modality learning, Proceedings of the 13th annual ACM international conference on Multimedia, November 06-11, 2005, Hilton, Singapore
[doi> 10.1145/1101149.1101337]
|
| |
20
|
J. Urban and J. M. Jose. EGO: A personalised multimedia management tool. In Proc. of the 2nd Int. Workshop on Adaptive Multimedia Retrieval, pages 3--17, 2004.
|
| |
21
|
|
| |
22
|
J. Urban and J. M. Jose. Evaluating a workspace's usefulness for image retrieval. ACM Multimedia Sytems Journal (Special Issue on User-Centered Multimedia), 2006. accepted for publication.
|
| |
23
|
|
| |
24
|
Berkley's list of Corel CDs, and image annotations. Available from http://elib.cs.berkeley.edu/corel/, last accessed in July 2006.
|
|