ACM Home Page
Please provide us with feedback. Feedback
Clustering near-duplicate images in large collections
Full text PdfPdf (647 KB)
Source
International Multimedia Conference archive
Proceedings of the international workshop on Workshop on multimedia information retrieval table of contents
Augsburg, Bavaria, Germany
SESSION: Image retrieval and multimedia modeling table of contents
Pages: 21 - 30  
Year of Publication: 2007
ISBN:978-1-59593-778-0
Authors
Jun Jie Foo  RMIT University, Melbourne, Australia
Justin Zobel  RMIT University, Melbourne, Australia
Ranjan Sinha  University of Melbourne, Melbourne, Australia
Sponsors
SIGMULTIMEDIA: ACM Special Interest Group on Multimedia
ACM: Association for Computing Machinery
SIGGRAPH: ACM Special Interest Group on Computer Graphics and Interactive Techniques
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 16,   Downloads (12 Months): 112,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1290082.1290089
What is a DOI?

ABSTRACT

Near-duplicate images introduce problems of redundancy and copyright infringement in large image collections. The problem is acute on the web, where appropriation of images without acknowledgment of source is prevalent. In this paper, we present an effective clustering approach for near-duplicate images, using a combination of techniques from invariant image local descriptors and an adaptation of near-duplicate text-document clustering techniques; we extend our earlier approach of near-duplicate image pairwise identification for this clustering approach. We demonstrate that our clustering approach is highly effective for collections of up to a few hundred thousand images. We also show --- via experimentation with real examples --- that ourapproach presents a viable solution for clustering near-duplicate images on the Web.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
2
H. Bay, T. Tuytelaars, and L. V. Gool. Surf: Speeded up robust features. In Proc. ECCV, May 2006.
 
3
Y. Bernstein and J. Zobel. A scalable system for identifying co-derivative documents. In Proc. SPIRE, pages 55--67, October 2004.
 
4
 
5
E. Y. Chang, C. Li, J. Z. Wang, P. Mork, and G. Wiederhold. Searching near-replicas of images via clustering. In Proc. SPIE, pages 281--292, September 1999.
 
6
Y. Chen, J. Z. Wang, and R. Krovetz. CLUE: Cluster-based retrieval of images by unsupervised learning. IEEE Trans. on Image Processing, 14(8):1187--1201, 2005.
7
 
8
 
9
J. J. Foo, R. Sinha, and J. Zobel. Discovery of image versions in large collections. In Proc. MMM, pages 433--442, Jaunary 2007.
10
 
11
 
12
13
14
15
 
16
Y. Ke and R. Sukthankar. PCA-SIFT: A more distinctive representation for local image descriptors. In Proc. CVPR, pages 506--513, June 2004.
17
 
18
 
19
C.-S. Lu and C.-Y. Hsu. Geometric distortion-resilient image hashing scheme and its applications on copy detection and authentication. Multimedia Systems, 11(2):159--173, 2005.
 
20
Y. Meng, E. Y. Chang, and B. Li. Enhancing DPF for near-replica image recognition. In Proc. CVPR, pages 416--423, June 2003.
 
21
K. Mikolajczyk and C. Schmid. A performance evaluation of local descriptors. In Proc. CVPR, pages 257--263, June 2003.
 
22
 
23
 
24
 
25
26
27

Collaborative Colleagues:
Jun Jie Foo: colleagues
Justin Zobel: colleagues
Ranjan Sinha: colleagues