Processing complex similarity queries in peer-to-peer networks
Source
Symposium on Applied Computing
Proceedings of the 2008 ACM symposium on Applied computing
Fortaleza, Ceara, Brazil
SESSION: Engineering large-scale distributed systems
Pages 473-478
Year of Publication: 2008
ISBN: 978-1-59593-753-7
Authors
Claudio Gennaro, ISTI - CNR, Pisa, Italy
Matteo Mordacchini, ISTI - CNR, Pisa, Italy
Salvatore Orlando, ISTI - CNR, Pisa, Italy and Università Ca' Foscari, Via Torino, Venezia, Italy
Fausto Rabitti, ISTI - CNR, Pisa, Italy
Bibliometrics
Downloads (6 Weeks): 5, Downloads (12 Months): 54, Citation Count: 2
ABSTRACT
Similarity search for content-based retrieval (where content can be any combination of text, image, audio/video, etc.) has gained importance in recent years, in part because it allows the retrieved results to be ranked according to their proximity to a query. However, to use similarity search in real-world applications, we need to tackle two problems: the huge volume of such mixed multimedia data (e.g., coming from Web sites) and its distribution across multiple cooperating nodes. In this paper we approach these problems by considering a network of autonomous peers, each maintaining a local collection of metric objects (i.e., mixed-mode multimedia content). This network forms a distributed Peer-to-Peer (P2P) search engine for similarity search based on the Routing Index paradigm. Each peer in the network thus maintains both an index of its local resources and a table for every neighbor, summarizing the objects that are reachable through that neighbor. The paper presents techniques that aim to make our P2P similarity-based search system viable, trading exactness of results for scalability. Results of simulations that use real collections of images are discussed. The proposed approach is being used in two running projects: SAPIR and NeP4B.
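To make the Routing Index idea in the abstract concrete, the following is a minimal sketch and not the authors' implementation. It assumes that metric objects are plain coordinate tuples compared with Euclidean distance, and that each per-neighbor summary is a single ball (a center and a covering radius) around the objects reachable through that neighbor. The names `Peer`, `Summary`, `summarize`, and `range_query` are hypothetical, and the approximation techniques the abstract refers to are not shown.

```python
import math
from dataclasses import dataclass, field


def dist(a, b):
    """Euclidean distance, standing in for an arbitrary metric (assumption)."""
    return math.dist(a, b)


@dataclass
class Summary:
    """Hypothetical summary: a ball covering the objects behind one neighbor."""
    center: tuple
    radius: float


def summarize(objects):
    """Use the first object as center; the radius covers all the others."""
    center = objects[0]
    return Summary(center, max(dist(center, o) for o in objects))


@dataclass
class Peer:
    name: str
    objects: list                                  # local collection
    neighbors: dict = field(default_factory=dict)  # neighbor name -> Peer
    routing: dict = field(default_factory=dict)    # neighbor name -> Summary

    def range_query(self, q, r, visited):
        """Return every reachable object within distance r of q."""
        visited.add(self.name)
        hits = [o for o in self.objects if dist(q, o) <= r]
        for name, peer in self.neighbors.items():
            s = self.routing[name]
            # Forward only if the query ball can intersect the summary ball
            # (triangle inequality); otherwise that neighbor is skipped.
            if name not in visited and dist(q, s.center) <= r + s.radius:
                hits.extend(peer.range_query(q, r, visited))
        return hits


# Line topology a - b - c, with routing tables filled in by hand.
a = Peer("a", [(0.0, 0.0), (1.0, 0.0)])
b = Peer("b", [(10.0, 0.0), (11.0, 0.0)])
c = Peer("c", [(20.0, 0.0)])
a.neighbors, a.routing = {"b": b}, {"b": summarize(b.objects + c.objects)}
b.neighbors = {"a": a, "c": c}
b.routing = {"a": summarize(a.objects), "c": summarize(c.objects)}
c.neighbors, c.routing = {"b": b}, {"b": summarize(b.objects + a.objects)}

visited = set()
hits = a.range_query((10.5, 0.0), 1.0, visited)
print(hits, sorted(visited))
```

In this toy run the query issued at peer `a` is forwarded to `b`, whose objects match, while `c` is never contacted because its summary ball lies too far from the query. A real system would also need to build and maintain the summaries as peers join, leave, or update their collections.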
CITED BY 2
Stanislav Barton, Vlastislav Dohnal, Jan Sedmidubsky, Pavel Zezula, Building self-organized image retrieval network, Proceeding of the 2008 ACM workshop on Large-Scale distributed systems for information retrieval, October 30, 2008, Napa Valley, California, USA