| Clustering near-duplicate images in large collections |
| Full text |
Pdf
(647 KB)
|
Source
|
International Multimedia Conference
archive
Proceedings of the international workshop on Workshop on multimedia information retrieval
table of contents
Augsburg, Bavaria, Germany
SESSION: Image retrieval and multimedia modeling
table of contents
Pages: 21 - 30
Year of Publication: 2007
ISBN:978-1-59593-778-0
|
|
Authors
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 16, Downloads (12 Months): 112, Citation Count: 0
|
|
|
ABSTRACT
Near-duplicate images introduce problems of redundancy and copyright infringement in large image collections. The problem is acute on the web, where appropriation of images without acknowledgment of source is prevalent. In this paper, we present an effective clustering approach for near-duplicate images, using a combination of techniques from invariant image local descriptors and an adaptation of near-duplicate text-document clustering techniques; we extend our earlier approach of near-duplicate image pairwise identification for this clustering approach. We demonstrate that our clustering approach is highly effective for collections of up to a few hundred thousand images. We also show --- via experimentation with real examples --- that ourapproach presents a viable solution for clustering near-duplicate images on the Web.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
|
| |
2
|
H. Bay, T. Tuytelaars, and L. V. Gool. Surf: Speeded up robust features. In Proc. ECCV, May 2006.
|
| |
3
|
Y. Bernstein and J. Zobel. A scalable system for identifying co-derivative documents. In Proc. SPIRE, pages 55--67, October 2004.
|
| |
4
|
|
| |
5
|
E. Y. Chang, C. Li, J. Z. Wang, P. Mork, and G. Wiederhold. Searching near-replicas of images via clustering. In Proc. SPIE, pages 281--292, September 1999.
|
| |
6
|
Y. Chen, J. Z. Wang, and R. Krovetz. CLUE: Cluster-based retrieval of images by unsupervised learning. IEEE Trans. on Image Processing, 14(8):1187--1201, 2005.
|
 |
7
|
Ondřej Chum , James Philbin , Michael Isard , Andrew Zisserman, Scalable near identical image and shot detection, Proceedings of the 6th ACM international conference on Image and video retrieval, p.549-556, July 09-11, 2007, Amsterdam, The Netherlands
[doi> 10.1145/1282280.1282359]
|
| |
8
|
|
| |
9
|
J. J. Foo, R. Sinha, and J. Zobel. Discovery of image versions in large collections. In Proc. MMM, pages 433--442, Jaunary 2007.
|
 |
10
|
Jun Jie Foo , Justin Zobel , Ranjan Sinha , S. M. M. Tahaghoghi, Detection of near-duplicate images for web search, Proceedings of the 6th ACM international conference on Image and video retrieval, p.557-564, July 09-11, 2007, Amsterdam, The Netherlands
[doi> 10.1145/1282280.1282360]
|
| |
11
|
|
| |
12
|
|
 |
13
|
|
 |
14
|
|
 |
15
|
|
| |
16
|
Y. Ke and R. Sukthankar. PCA-SIFT: A more distinctive representation for local image descriptors. In Proc. CVPR, pages 506--513, June 2004.
|
 |
17
|
|
| |
18
|
|
| |
19
|
C.-S. Lu and C.-Y. Hsu. Geometric distortion-resilient image hashing scheme and its applications on copy detection and authentication. Multimedia Systems, 11(2):159--173, 2005.
|
| |
20
|
Y. Meng, E. Y. Chang, and B. Li. Enhancing DPF for near-replica image recognition. In Proc. CVPR, pages 416--423, June 2003.
|
| |
21
|
K. Mikolajczyk and C. Schmid. A performance evaluation of local descriptors. In Proc. CVPR, pages 257--263, June 2003.
|
| |
22
|
|
| |
23
|
|
| |
24
|
|
| |
25
|
|
 |
26
|
|
 |
27
|
|
|