|
ABSTRACT
Peer-to-Peer networks have become very popular on the Internet, with millions of peers all over the world sharing large volumes of data. In the assistive healthcare sector, it is likely that P2P networks will develop that interconnect and allow the controlled sharing of patient databases of various hospitals, clinics, and research laboratories. However, the sheer scale of these networks has made it difficult to gather statistics that could be used for building new features. In this paper, we present a technique to obtain estimations of the number of distinct values matching a query on the network. We evaluate the technique experimentally and provide a set of results that demonstrate its effectiveness, as well as its flexibility in supporting a variety of queries and applications.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
|
| |
2
|
Arai, B., Das, G., Gunopulos, D., and Kalogeraki, V. Approximating Aggregation Queries in Peer-to-Peer Networks. ICDE 2006 (April 3--8, Atlanta, GA, 2006)
|
 |
3
|
|
 |
4
|
Moses Charikar , Surajit Chaudhuri , Rajeev Motwani , Vivek Narasayya, Towards estimation error guarantees for distinct values, Proceedings of the nineteenth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems, p.268-279, May 15-18, 2000, Dallas, Texas, United States
[doi> 10.1145/335168.335230]
|
 |
5
|
Qin Lv , Pei Cao , Edith Cohen , Kai Li , Scott Shenker, Search and replication in unstructured peer-to-peer networks, Proceedings of the 16th international conference on Supercomputing, June 22-26, 2002, New York, New York, USA
[doi> 10.1145/514191.514206]
|
| |
6
|
Java Universal Network/Graph Framework (JUNG) Website. http://jung.sourceforge.net.
|
| |
7
|
Metropolis, N., Rosenbluth A, Rosenbluth, M., Teller, A., and Teller, E., Equations of state calculations by fast computing machines. J. of Chemical Physics, 21:1087--1091, 1953.
|
| |
8
|
W. Hastings. Monte Carlo sampling methods using Markov chains and their applications. Biometrika, 57(1):97--109, 1970.
|
| |
9
|
|
| |
10
|
Gnutella Development website: rfc-gnutella.sourceforge.net
|
 |
11
|
Daniel Stutzbach , Reza Rejaie , Nick Duffield , Subhabrata Sen , Walter Willinger, On unbiased sampling for unstructured peer-to-peer networks, Proceedings of the 6th ACM SIGCOMM conference on Internet measurement, October 25-27, 2006, Rio de Janeriro, Brazil
[doi> 10.1145/1177080.1177084]
|
| |
12
|
Baset, S. A., and Schulzrinne, H. An analysis of the Skype peer-to-peer Internet telephony protocol. Technical Report CUCS-039-04, Computer Science Department, Columbia University, September 2004.
|
| |
13
|
KazaA website: kazaa.com
|
 |
14
|
Surajit Chaudhuri , Rajeev Motwani , Vivek Narasayya, Random sampling for histogram construction: how much is enough?, Proceedings of the 1998 ACM SIGMOD international conference on Management of data, p.436-447, June 01-04, 1998, Seattle, Washington, United States
|
| |
15
|
|
| |
16
|
Tsoumakos, D., and Roussopoulos, N. - A Comparison of Peer-to-Peer Search Methods In Proceedings of the Sixth International Workshop on Web and Databases, San Diego, California (June 12--13, 2003)
|
 |
17
|
|
| |
18
|
|
 |
19
|
|
| |
20
|
|
| |
21
|
|
 |
22
|
Wen-Chi Hou , Gultekin Ozsoyoglu , Erdogan Dogdu, Error-constrained COUNT query evaluation in relational databases, Proceedings of the 1991 ACM SIGMOD international conference on Management of data, p.278-287, May 29-31, 1991, Denver, Colorado, United States
|
 |
23
|
|
| |
24
|
|
| |
25
|
|
| |
26
|
|
| |
27
|
L. Goodman. On the estimation of the number of classes in a population. Annals of Math. Stat., 20:572--579, 1949.
|
| |
28
|
K. Burnham and W. Overton. Robust estimation of population size when capture probabilities vary among animals. Ecology, 60:927--936, 1979.
|
| |
29
|
Shlosser A. On estimation of the size of the dictionary of a long text on the basis of a sample. Engrg. Cybernetics, 19:97--102, 1981.
|
| |
30
|
Zipf, G. E. Human Behavior and the Principle of Least Effort. Addison-Wesley Press, Inc., 1949.
|
 |
31
|
Richard J. Lipton , Jeffrey F. Naughton , Donovan A. Schneider, Practical selectivity estimation through adaptive sampling, Proceedings of the 1990 ACM SIGMOD international conference on Management of data, p.1-11, May 23-26, 1990, Atlantic City, New Jersey, United States
|
 |
32
|
|
 |
33
|
|
| |
34
|
Le Fessant, F., Handurukande, S., Kermarrec, A.-M., and Massoulié, L. Clustering in Peer-to-Peer File Sharing Workloads. 3rd Intl. Workshop on Peer-to-Peer Systems IPTPS 2004
|
 |
35
|
|
| |
36
|
|
|