ACM Home Page
Please provide us with feedback. Feedback
Diversifying image search with user generated content
Full text PdfPdf (324 KB)
Source
International Multimedia Conference archive
Proceeding of the 1st ACM international conference on Multimedia information retrieval table of contents
Vancouver, British Columbia, Canada
SESSION: Image retrieval 1 table of contents
Pages 67-74  
Year of Publication: 2008
ISBN:978-1-60558-312-9
Authors
Roelof van Zwol  Yahoo! Research, Barcelona, Spain
Vanessa Murdock  Yahoo! Research, Barcelona, Spain
Lluis Garcia Pueyo  Yahoo! Research, Barcelona, Spain
Georgina Ramirez  Yahoo! Research, Barcelona, Spain
Sponsors
SIGMULTIMEDIA: ACM Special Interest Group on Multimedia
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 10,   Downloads (12 Months): 173,   Citation Count: 2
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1460096.1460109
What is a DOI?

ABSTRACT

Large-scale image retrieval on the Web relies on the availability of short snippets of text associated with the image. This user-generated content is a primary source of information about the content and context of an image. While traditional information retrieval models focus on finding the most relevant document without consideration for diversity, image search requires results that are both diverse and relevant. This is problematic for images because they are represented very sparsely by text, and as with all user-generated content the text for a given image can be extremely noisy.

The contribution of this paper is twofold. First, we present a retrieval model which provides diverse results as a property of the model itself, rather than in a post-retrieval step. Relevance models offer a unified framework to afford the greatest diversity without harming precision. Second, we show that it is possible to minimize the trade-offs between precision and diversity, and estimating the query model from the distribution of tags favors the dominant sense of a query. Relevance models operating only on tags offers the highest level of diversity with no significant decrease in precision.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
J. Allan, J. Callan, K. Collins-Thompson, W. B. Croft, F. Feng, D. Fisher, J. Lafferty, L. Larkey, T. N. Truong, P. Ogilvie, L. Si, T. Strohman, H. Turtle, and C. Zhai. The lemur toolkit for language modeling and information retrieval, 2005. http://www.cs.cmu.edu/ lemur.
3
4
 
5
D. Harman. Overview of the TREC 2002 novelty track. In Proceedings of the Eleventh Text Retrieval Conference (TREC), 2002.
6
7
8
9
 
10
V. Lavrenko, R. Manmatha, and J. Jeon. A model for learning the semantics of pictures. In Proceedings of the 17th Annual Conference on Neural Information Processing Systems, 2003.
11
12
 
13
 
14
15
16
 
17
I. Soboroff. Overview of the TREC 2004 novelty track. In Proceedings of the Thirteenth Text Retrieval Conference (TREC), 2004.
 
18
I. Soboroff and D. Harman. Overview of the TREC 2003 novelty track. In Proceedings of the Twelfth Text Retrieval Conference (TREC), 2003.
19
 
20
S. A. Yahia, P. Bhat, J. Shanmugasundaram, U. Srivastava, and E. Vee. Efficient online computation of diverse query results. In Proceedings of VLDB, 2007.
21
22


Collaborative Colleagues:
Roelof van Zwol: colleagues
Vanessa Murdock: colleagues
Lluis Garcia Pueyo: colleagues
Georgina Ramirez: colleagues