Sample-based creation of peer summaries for efficient similarity search in scalable peer-to-peer networks
Source
International Multimedia Conference
Proceedings of the international workshop on Workshop on multimedia information retrieval
Augsburg, Bavaria, Germany
POSTER SESSION: Multimedia retrieval and modeling
Pages: 143 - 152
Year of Publication: 2007
ISBN: 978-1-59593-778-0
ABSTRACT
In this paper we introduce a simple yet experimentally convincing approach to source selection for content-based similarity search in P2P networks, specifically in summary-based P2P systems. In these systems, summaries are used to select data sources when performing k-NN queries on distributed collections of documents represented by feature vectors. We introduce a new type of cluster-based summary for source selection that can be calculated and distributed efficiently and cheaply in P2P networks. To generate the summaries, a very large number of sample points is used. Each peer in the network assigns each item of its indexing data to the closest sample point and publishes the resulting summary. Using real-world image feature data obtained from a large crawl of the Flickr web photo community, we evaluate how the quality of these summaries changes with the number of sample points and show that higher numbers of sample points yield better retrieval performance. Our experiments show that the proposed summaries perform four times better than previous methods. Intuitively, the large size of the generated summaries is a disadvantage of this approach. We show experimentally that this disadvantage can easily be overcome by simple compression techniques, because the generated summaries are sparse.
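The summary construction described in the abstract can be illustrated with a minimal Python sketch. It is an illustration under stated assumptions, not the authors' implementation: the function names build_summary and rank_peers, the Euclidean distance, the toy 2-D vectors, and in particular the ranking rule (order peers by their count in the query's nearest cell) are all hypothetical choices that the abstract does not specify. Storing only the non-empty cells in a dictionary is one simple way to exploit the sparsity the abstract mentions.

```python
from collections import Counter
from math import dist  # Euclidean distance, Python 3.8+


def nearest_sample(vector, sample_points):
    """Return the index of the sample point closest to `vector`."""
    return min(range(len(sample_points)),
               key=lambda i: dist(vector, sample_points[i]))


def build_summary(vectors, sample_points):
    """Count, per sample point, how many of a peer's vectors fall closest to it.

    Only non-empty cells are stored, so the summary stays small
    even when the number of sample points is very large.
    """
    counts = Counter(nearest_sample(v, sample_points) for v in vectors)
    return dict(counts)


def rank_peers(query, sample_points, summaries):
    """ASSUMED ranking rule (not from the source): peers with more
    vectors in the query's nearest cell are contacted first."""
    cell = nearest_sample(query, sample_points)
    return sorted(summaries,
                  key=lambda peer: summaries[peer].get(cell, 0),
                  reverse=True)


# Toy data: three shared sample points and two peers (hypothetical values).
sample_points = [(0, 0), (10, 10), (0, 10)]
summaries = {
    "peer_a": build_summary([(1, 1), (0, 2), (9, 9)], sample_points),
    "peer_b": build_summary([(10, 9), (11, 10), (1, 9)], sample_points),
}
ranking = rank_peers((9, 10), sample_points, summaries)
```

In this toy setup every peer uses the same list of sample points, so the published summaries are directly comparable; a real system would also need a way to agree on and distribute those sample points.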