|
ABSTRACT
Network analysis has proved to be very useful in many social and natural sciences, and in particular Small World topologies have been exploited in many application fields. In this article, we focus on P2P file sharing applications, where spontaneous communities of users are studied and analyzed. We define a family of structures that we call “Affinity Networks” (or even Graphs) that show self-organized interest-based clusters. Empirical evidence proves that affinity networks are small worlds and shows scale-free features. The relevance of this finding is augmented with the introduction of a proactive recommendation scheme, namely DeHinter, that exploits this natural feature. The intuition behind this scheme is that a user would trust her network of “elective affinities” more than anonymous and generic suggestions made by impersonal entities. The accuracy of the recommendation is evaluated by way of a 10-fold cross validation, and a prototype has been implemented for further feedbacks from the users.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
Achacoso, T. B. and Yamamoto, W. S. 1991. AYs Neuroanatomy of C Elegans for Computation. CRC-Press.
|
| |
3
|
Adamic, L. A. and Huberman, B. A. 2000. The nature of markets in the world wide web. Quarterly J. Electron. Commerce 1, 512.
|
| |
4
|
Albert, R. and Barabasi, A. L. 2002. Statistical mechanics of complex networks. Rev. Modern Physics 74, 1.
|
 |
5
|
|
| |
6
|
|
| |
7
|
Barabási, A.-L. and Albert, R. 1999. Emergence of scaling in random networks. Science 286, 509.
|
| |
8
|
|
| |
9
|
Bhalla, U. S. and Iyengar, R. 1999. Emergent properties of networks of biological signaling pathways. Science 283, 381--387.
|
| |
10
|
Breslau, L., Cao, P., Fan, L., Phillips, G., and Shenker, S. 1999. Web caching and zipf-like distributions: Evidence and implications. In Proceedings of INFOCOM. 126--134.
|
| |
11
|
Andrei Broder , Ravi Kumar , Farzin Maghoul , Prabhakar Raghavan , Sridhar Rajagopalan , Raymie Stata , Andrew Tomkins , Janet Wiener, Graph structure in the Web, Computer Networks: The International Journal of Computer and Telecommunications Networking, v.33 n.1-6, p.309-320, June 2000
|
| |
12
|
Claypool, M., Gokhale, A., Miranda, T., Murnikov, P., Netes, D., and Sartin, M. 1999. Combining content-based and collaborative filters in an online newspaper. In Proceedings of the ACM SIGIR Workshop on Recommender Systems: Algorithms and Evaluation. ACM.
|
| |
13
|
Cohen, J. E., Briand, F., and Newman, C. M. 1986. A stochastic theory of community food webs III. Predicted and observed lengths of food. Royal Soc. London Proc. Series B 228, 317--353.
|
| |
14
|
Cox, R. A. K., Felton, J. M., and Chung, K. C. 1995. The concentration of commercial success in popular music: an analysis of the distribution of gold records. J. Cultural Economics 19, 333--340.
|
| |
15
|
|
| |
16
|
de Sola Pool, I. and Kochen, M. 1978. Contacts and influence. Social Netw. 1, 1--48.
|
| |
17
|
D. DeRoure , W. Hall , S. Reich , G. Hill , A. Pikrakis , M. Stairmand, MEMOIR — an open framework for enhanced navigation of distributed information, Information Processing and Management: an International Journal, v.37 n.1, p.53-74, 2001
[doi> 10.1016/S0306-4573(00)00019-4]
|
| |
18
|
deSolla Price, D. J. 1967. Networks of scientific papers. Science 155, 3767, 1213--1219.
|
| |
19
|
Erdős, P. and Rényi, A. 1959. On random graphs. Publicationes Mathematicae 6.
|
| |
20
|
Erdős, P. and Rényi, A. 1960. On the evolution of random graphs. Publications of the Mathematical Institute of the Hungarian Academy of Sciences 5.
|
| |
21
|
Erdős, P. and Rényi, A. 1961. On the strength of connectedness of a random graph. Acta Mathematica Scientia Hungary 12.
|
| |
22
|
Estoup, J. B. 1916. Les gammes stenographiques. Institut Stenographique de France.
|
 |
23
|
Michalis Faloutsos , Petros Faloutsos , Christos Faloutsos, On power-law relationships of the Internet topology, Proceedings of the conference on Applications, technologies, architectures, and protocols for computer communication, p.251-262, August 30-September 03, 1999, Cambridge, Massachusetts, United States
|
 |
24
|
|
| |
25
|
Gutenberg, B. and Richter, R. F. 1944. Frequency of earthquakes in california. Bul. Seismological Soc. Amer. 34, 185--188.
|
| |
26
|
Han, P., Xie, B., Yang, F., and Shen, R. 2004. A scalable p2p recommender system based on distributed collaborative filtering. Expert Syst. Appl. 27, 2, 203--210.
|
| |
27
|
Hartwell, L. H., Hopfield, J. J., Leibler, S., and Murray, A. W. 1999. From molecular to modular cell biology. Nature 402, 6761 Suppl.
|
 |
28
|
|
| |
29
|
Howe, A. E. and Dreilinger, D. 1997. SAVVYSEARCH: A metasearch engine that learns which search engines to query. AI Mag. 18, 2, 19--25.
|
| |
30
|
Iamnitchi, A., Ripeanu, M., and Foster, I. 2004. Small-world file-sharing communities. In The 23rd Conference of the IEEE Communications Society (INFOCOM'04). Vol. 2. 952--963.
|
| |
31
|
Jeong, H., Tombor, B., Albert, R., Oltvai, Z. N., and Barabási, A. L. 2000. The large-scale organization of metabolic networks. Nature 407, 6804, 651--654.
|
| |
32
|
Karinthy, F. 1929. Chains. Everything is Different. Atheneum Press.
|
 |
33
|
|
| |
34
|
Kleinfeld, J. S. 2001. Could it be a big world after all? the “six degrees of separation” myth. Society.
|
| |
35
|
Klingberg, T. and Manfredi, R. 2002. Gnutella protocol development. http://rfc-gnutella.sourceforge.net/src/rfc-0_6-draft.html (last access:1/13/09).
|
| |
36
|
Kohli, R. and Sah, R. 2003. Market shares: Some power law results and observations. Working paper 04.01, School of Public Policy, University of Chicago.
|
| |
37
|
Kohn, K. W. 1999. Molecular interaction map of the mammalian cell cycle control and DNA repair systems. Mol. Biol. Cell 10, 8, 2703--2734.
|
 |
38
|
Joseph A. Konstan , Bradley N. Miller , David Maltz , Jonathan L. Herlocker , Lee R. Gordon , John Riedl, GroupLens: applying collaborative filtering to Usenet news, Communications of the ACM, v.40 n.3, p.77-87, March 1997
[doi> 10.1145/245108.245126]
|
| |
39
|
Krulwich, B. 1997. Lifestyle finder: Intelligent user profiling using large-scale demographic data. AI Maga. 18, 2, 37--45.
|
| |
40
|
Lang, K. 1995. NewsWeeder: learning to filter netnews. In Proceedings of the 12th International Conference on Machine Learning. Morgan Kaufmann Publishers Inc. 331--339.
|
| |
41
|
|
| |
42
|
Lotka, A. J. 1926. The frequency distribution of scientific production. J. Wash. Acad. Sci. 16, 317--323.
|
| |
43
|
Lu, E. T. and Hamilton, R. J. 1991. Avalanches and the distribution of solar flares. Astrophysical J. 380, L89--L92.
|
| |
44
|
Milgram, S. 1967. The small world problem. Psych. Today 2, 60--67.
|
 |
45
|
|
| |
46
|
Monasson, R. 1999. Diffusion, localization and dispersion relations on “small-world” lattices. European Physical J. B 12, 4, 555--567.
|
| |
47
|
|
| |
48
|
Neukum, G. and Ivanov, B. A. 1994. Crater size distributions and impact probabilities on earth from lunar, terrestrial-planet, and asteroid cratering Data. In Hazards Due to Comets and Asteroids, T. Gehrels, M. S. Matthews, and A. M. Schumann, Eds. The University of Arizona Press, 359--416.
|
| |
49
|
Newman, M. E. 2001. The structure of scientific collaboration networks. Proc. Nat. Acad. Sci. 98, 2, 404--409.
|
| |
50
|
Newman, M. E. J. 2003. The structure and function of complex networks. SIAM Rev. 45, 167.
|
| |
51
|
Newman, M. E. J. 2005. Power laws, pareto distributions and zipf's law. Contemp. Physics 46, 323.
|
| |
52
|
Newman, M. E. J. and Watts, D. J. 1999. Renormalization group analysis of the small-world network model. Physics Lett. A 263, 341--346.
|
| |
53
|
|
| |
54
|
Pagallo, U. 2006. Teoria giundica della Complessità. Dalla “Polis primitiva” di Socrate ai “mondi piccoli” dellinformatica—Un approccio evolutivo. Giapichelli, Torino, Italy.
|
| |
55
|
|
| |
56
|
Pazzani, M. J., Muramatsu, J., and Billsus, D. 1996. Syskill webert: Identifying interesting web sites. In Proceedings of AAAI/IAAI, Vol. 1. 54--61.
|
| |
57
|
Phex Team. 2003. Phex file-sharing gnutella client. http://www.phex.org/mambo/(last access: 1/13/09).
|
| |
58
|
|
| |
59
|
|
| |
60
|
Redner, S. 1998. How popular is your paper? An empirical study of the citation distribution. European Physical J. B 4, 131.
|
| |
61
|
|
 |
62
|
|
| |
63
|
Rich, E. 1979. User modeling via stereotypes. Cognitive Sci. 3, 329--354.
|
| |
64
|
Roberts, D. C. and Turcotte, D. L. 1998. Fractality and selforganized criticality of wars. Fractals 6, 351--357.
|
 |
65
|
|
| |
66
|
Ruffo, G., Schifanella, R., and Ghiringhello, E. 2006. A decentralized recommendation system based on self-organizing partnerships. Lecture Notes in Computer Science, vol. 3976. Springer, 618--629.
|
 |
67
|
Badrul M. Sarwar , Joseph A. Konstan , Al Borchers , Jon Herlocker , Brad Miller , John Riedl, Using filtering agents to improve prediction quality in the GroupLens research collaborative filtering system, Proceedings of the 1998 ACM conference on Computer supported cooperative work, p.345-354, November 14-18, 1998, Seattle, Washington, United States
[doi> 10.1145/289444.289509]
|
 |
68
|
|
| |
69
|
Seglen, P. O. 1992. The skewness of science. J. Amer. Soc. Inform. Sci. 43, 9, 628--638.
|
| |
70
|
|
| |
71
|
Sripanidkulchai, K., Maggs, B., and Zhang, H. 2003. Efficient content location using interest-based locality in peer-topeer systems. In Proceedings of the InfoCom.
|
| |
72
|
|
| |
73
|
Terveen, L. and Hill, W. 2001. Beyond recommender systems: Helping people help each other. In HCI in the New Millennium. Addison-Wesley, 487--509.
|
 |
74
|
|
| |
75
|
von Goethe, J. W. 1809. Die Wahlverwandtschaften. http://en.wikipedia.org/wiki/Elective_Affinities.
|
 |
76
|
|
| |
77
|
Wasserman, S. and Faust, K. 1994. Social Network Analysis. Cambridge University Press, Cambridge, U.K.
|
| |
78
|
|
| |
79
|
Watts, D. J. and Strogatz, S. H. 1998. Collective dynamics of ‘small-world’ networks. Nature 393, 6684, 440--442.
|
 |
80
|
|
| |
81
|
Williams, R. J. and Martinez, N. D. 2000. Simple rules yield complex food webs. Nature 404, 6774, 180--183.
|
| |
82
|
Willis, J. C. and Yule, G. U. 1922. Some statistics of evolution and geographical distribution in plants and animals, and their significance. Nature 109, 177--179.
|
 |
83
|
|
| |
84
|
|
| |
85
|
Zanette, D. H. and Manrubia, S. C. 2001. Vertical transmission of culture and the distribution of family names. Physica A: Statist. Mechanics Appl. 295, 1-2, 1--8.
|
|