ACM Home Page
Please provide us with feedback. Feedback
Sample-based creation of peer summaries for efficient similarity search in scalable peer-to-peer networks
Full text PdfPdf (213 KB)
Source
International Multimedia Conference archive
Proceedings of the international workshop on Workshop on multimedia information retrieval table of contents
Augsburg, Bavaria, Germany
POSTER SESSION: Multimedia retrieval and modeling table of contents
Pages: 143 - 152  
Year of Publication: 2007
ISBN:978-1-59593-778-0
Authors
Daniel Blank  University of Bamberg, Bamberg, Germany
Soufyane El Allali  University of Bamberg, Bamberg, Germany
Wolfgang Mueller  University of Bamberg, Bamberg, Germany
Andreas Henrich  University of Bamberg, Bamberg, Germany
Sponsors
SIGMULTIMEDIA: ACM Special Interest Group on Multimedia
ACM: Association for Computing Machinery
SIGGRAPH: ACM Special Interest Group on Computer Graphics and Interactive Techniques
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 4,   Downloads (12 Months): 52,   Citation Count: 1
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1290082.1290104
What is a DOI?

ABSTRACT

In this paper we introduce a simple yet experimentally convincing approach in the research field of source selection for content-based similarity search in P2P networks or, more concretely, in summary-based P2P systems. In these systems, summaries are used for data source selection when performing k-NN queries on distributed collections of documents represented by feature vectors.

We introduce a new type of cluster-based summaries for source selection that can efficiently and cheaply be calculated and distributed in P2P networks. For the summaries generation, a very large number of sample points is used. Each peer in the network assigns its indexing data to their corresponding closest sample points and publishes its constructed summary. We evaluate the quality of these summaries when changing the number of sample points used in experiments on real-world image feature data obtained from a large crawl of the flickr web photo community and show that for higher numbers of sample points we achieve a better retrieval performance. Our experiments show that the proposed summaries yield four times better performance with respect to previous methods. Intuitively, there are some disadvantages to this approach due to the large size of the generated summaries. We show experimentally, that these disadvantages can easily be overcome due to the sparse nature of the generated summaries by simple compression techniques.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
2
M. Bender et al. The Minerva Project: Database Selection in the Context of P2P Search. In BTW, Karlsruhe, pages 125--144, 2005.
3
4
 
5
I. Cox, M. Miller, T. Minka, T. Papathornas, and P. Yianilos. The Bayesian Image Retrieval System, PicHunter: Theory, Implementation, and Psychophysical Experiments. IEEE TIP, 9(1):20--37, 2000.
 
6
F. M. Cuenca-Acuna and T. Nguyen. Text-Based Content Search and Retrieval in ad hoc P2P Communities. Technical Report DCS-TR-483, Dept. for CS, Rutgers University, 2002.
 
7
 
8
S. El Allali, D. Blank, M. Eisenhardt, A. Henrich, and W. Müller. Untersuchung des Einflusses verschiedener Bild-Features und Distanzmaβe im inhaltsbasierten P2P Information Retrieval. BTW 2007, 12th GI-Fachtagung für Datenbanksysteme in Business, Technologie und Web, pages 382--396, 2007.
 
9
S. El Allali, D. Blank, W. Müller, and A. Henrich. Image data source selection using Gaussian Mixture Models. 5th International Workshop on Adaptive Multimedia Retrieval AMR'07, 2007.
10
 
11
 
12
Gnutella. URL: http://www.gnutella.com.
 
13
A. Henrich and H.-W. Six. How to split buckets in spatial data structures . Int. Conf. on Geographic Database Management Systems, pages 212--244, 1991.
14
15
 
16
17
 
18
 
19
W. Müller, M. Eisenhardt, and A. Henrich. Scalable summary based retrieval in P2P networks. In CIKM, pages 586--593, 2005.
20
21
22
 
23
 
24
25
 
26
 
27
R. C. Veltkamp and M. Tanase. Content-based image retrieval systems: A survey. Technical Report UU-CS-2000-34, Department of Computing Science, Utrecht University, 2002.
 
28
 
29
B. Yang and H. Garcia-Molina. Designing a Super-peer Network. In 17th International Conference on Data Engineering, ICDE, pages 49--60, 2003.


Collaborative Colleagues:
Daniel Blank: colleagues
Soufyane El Allali: colleagues
Wolfgang Mueller: colleagues
Andreas Henrich: colleagues