ABSTRACT
Hybrid peer-to-peer architectures use special nodes to provide directory services for regions of the network ("regional directory services"). Hybrid peer-to-peer architectures are a potentially powerful model for developing large-scale networks of complex digital libraries, but peer-to-peer networks have so far tended to use very simple methods of resource selection and document retrieval. In this paper, we study the application of content-based resource selection and document retrieval to hybrid peer-to-peer networks. The directory nodes that provide regional directory services construct and use the content models of neighboring nodes to determine how to route query messages through the network. The leaf nodes that provide information use content-based retrieval to decide which documents to retrieve for queries. The experimental results demonstrate that using content-based retrieval in hybrid peer-to-peer networks is both more accurate and more efficient for some digital library environments than more common alternatives such as Gnutella 0.6.
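As an illustration of the routing idea described above, the following is a minimal sketch of how a directory node might rank neighboring nodes by their content models and forward a query only to the best matches. It is not the paper's implementation: the function names (`build_content_model`, `score_neighbor`, `route_query`), the unigram term-count representation, the smoothed query-likelihood score, and the parameters `mu` and `top_k` are all assumptions chosen for illustration.

```python
import math
from collections import Counter


def build_content_model(documents):
    """Aggregate term counts over a neighbor's documents (hypothetical unigram content model)."""
    counts = Counter()
    for doc in documents:
        counts.update(doc.lower().split())
    return counts


def score_neighbor(query, model, background, mu=10.0):
    """Log-likelihood of the query under the neighbor's model, smoothed with a background model.

    The smoothing scheme and the value of mu are illustrative assumptions.
    """
    total = sum(model.values())
    bg_total = sum(background.values())
    vocab_size = len(background)
    score = 0.0
    for term in query.lower().split():
        # Add-one smoothed background probability, so unseen terms never yield log(0).
        p_bg = (background.get(term, 0) + 1) / (bg_total + vocab_size + 1)
        p = (model.get(term, 0) + mu * p_bg) / (total + mu)
        score += math.log(p)
    return score


def route_query(query, neighbor_models, top_k=2):
    """Return the ids of the top_k neighbors whose content models best match the query."""
    # The background model is the sum of all neighbor models known to this directory node.
    background = Counter()
    for model in neighbor_models.values():
        background.update(model)
    ranked = sorted(
        neighbor_models,
        key=lambda node: score_neighbor(query, neighbor_models[node], background),
        reverse=True,
    )
    return ranked[:top_k]
```

In this sketch a directory node would call `route_query` with a dictionary mapping neighbor ids to their content models and forward the query only to the returned neighbors, rather than flooding it to every connection.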