|
ABSTRACT
The unabated growth and increasing significance of the World Wide Web has resulted in a flurry of research activity to improve its capacity for serving information more effectively. But at the heart of these efforts lie implicit assumptions about "quality" and "usefulness" of Web resources and services. This observation points towards measurements and models that quantify various attributes of web sites. The science of measuring all aspects of information, especially its storage and retrieval or informetrics has interested information scientists for decades before the existence of the Web. Is Web informetrics any different, or is it just an application of classical informetrics to a new medium? In this article, we examine this issue by classifying and discussing a wide ranging set of Web metrics. We present the origins, measurement functions, formulations and comparisons of well-known Web metrics for quantifying Web graph properties, Web page significance, Web page similarity, search and retrieval, usage characterization and information theoretic properties. We also discuss how these metrics can be applied for improving Web information access and use.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
Albert, R. and Barabasi, A. 2000. Topology of evolving networks: Local events and uncertainty. Phys. Rev. Lett. 84, 56--60.
|
| |
2
|
Albert, R., Jeong, H., and Barabasi, A. 1999. The diameter of the world wide web. Nature 401, 130--131.
|
| |
3
|
Barabasi, A. and Albert, R. 1999. Emergence of scaling in random networks. Science 286 (Oct.), 509--512.
|
| |
4
|
Barabasi, A., Albert, R., and Jeong, A. 1999. Mean-field theory for scale free random networks. Phys. A 272, 173--187.
|
| |
5
|
Barabasi, A., Albert, R., and Jeong, J. 2000. Scale-free characteristics of random networks: The topology of the world wide web. Phys. A, 281, 69--77.
|
| |
6
|
|
| |
7
|
|
| |
8
|
Andrei Z. Broder , Steven C. Glassman , Mark S. Manasse , Geoffrey Zweig, Syntactic clustering of the Web, Selected papers from the sixth international conference on World Wide Web, p.1157-1166, September 1997, Santa Clara, California, United States
|
| |
9
|
Andrei Broder , Ravi Kumar , Farzin Maghoul , Prabhakar Raghavan , Sridhar Rajagopalan , Raymie Stata , Andrew Tomkins , Janet Wiener, Graph structure in the Web, Proceedings of the 9th international World Wide Web conference on Computer networks : the international journal of computer and telecommunications netowrking, p.309-320, June 2000, Amsterdam, The Netherlands
|
| |
10
|
Boyce, B. R., Meadow, C. T., and Kraft, D. H. 1994. Measurement in Information Science. Academic Press Inc. Orlando, Fla.
|
| |
11
|
|
 |
12
|
Allan Borodin , Gareth O. Roberts , Jeffrey S. Rosenthal , Panayiotis Tsaparas, Finding authorities and hubs from link structures on the World Wide Web, Proceedings of the 10th international conference on World Wide Web, p.415-429, May 01-05, 2001, Hong Kong, Hong Kong
[doi> 10.1145/371920.372096]
|
 |
13
|
|
| |
14
|
|
| |
15
|
|
| |
16
|
Chakrabarti, S., Dom, B., Gibson, D., Kumar, R., Raghavan, P., Rajagopalan, S., and Tomkins, A. 1998a. Experiments in topic distillation. In Proceedings of the SIGIR Workshop on Hypertext IR.
|
| |
17
|
Soumen Chakrabarti , Byron Dom , Prabhakar Raghavan , Sridhar Rajagopalan , David Gibson , Jon Kleinberg, Automatic resource compilation by analyzing hyperlink structure and associated text, Proceedings of the seventh international conference on World Wide Web 7, p.65-74, April 1998, Brisbane, Australia
|
| |
18
|
|
| |
19
|
|
| |
20
|
|
| |
21
|
Dhyani, D. 2001. Measuring the web: Metrics, models and methods. Master's Dissertation, School of Computer Engineering, Nanyang Technological University, Singapore.
|
| |
22
|
Egghe, L. and Rousseau, R. 1990. Introduction to Informetrics. Elsevier Science Publishers. Amsterdam, The Netherlands.
|
 |
23
|
David Gibson , Jon Kleinberg , Prabhakar Raghavan, Inferring Web communities from link topology, Proceedings of the ninth ACM conference on Hypertext and hypermedia : links, objects, time and space---structure in hypermedia systems: links, objects, time and space---structure in hypermedia systems, p.225-234, June 20-24, 1998, Pittsburgh, Pennsylvania, United States
[doi> 10.1145/276627.276652]
|
| |
24
|
|
| |
25
|
|
| |
26
|
|
| |
27
|
Kleinberg, J., Kumar, R., Raghavan, P., Rajagopalan, S., and Tomkins, A. 1999. The web as a graph: Measurements, models, and methods. In Proceedings of the 5th International Conference on Computing and Combinatorics (COCOON).
|
| |
28
|
|
| |
29
|
Larson, R. 1996. Bibliometrics of the world wide web: An exploratory analysis of the intellectual structure of cyberspace. In Annual Meeting of the American Society of Information Science.
|
| |
30
|
Lawrence, S. and Giles, C. L. 1998. Searching the world wide web. Science 280 (Apr.).
|
| |
31
|
Lawrence, S. and Giles, C. L. 1999. Searching the web: General and scientific information access. IEEE Commun. 37, 1, 116--122.
|
| |
32
|
|
| |
33
|
|
 |
34
|
|
| |
35
|
|
| |
36
|
Montgomery, D. C. and Runger, G. C. 1994. Applied Statistics and Probability for Engineers. Wiley, New York.
|
| |
37
|
Murray, B. H. and Moore, A. 2000. Sizing the internet. White paper. Available from http:// www.cyveillance.com/web/us/downloads/Sizing_the_Internet.pdf (July).
|
| |
38
|
Perkowitz, M. and Etzioni, O. 1997. Adaptive web sites: An AI challenge. In Proceedings of the 15th International Joint Conference on Artificial Intelligence.
|
| |
39
|
|
| |
40
|
|
 |
41
|
Peter Pirolli , James Pitkow , Ramana Rao, Silk from a sow's ear: extracting usable structures from the Web, Proceedings of the SIGCHI conference on Human factors in computing systems: common ground, p.118-125, April 13-18, 1996, Vancouver, British Columbia, Canada
[doi> 10.1145/238386.238450]
|
| |
42
|
|
 |
43
|
James Pitkow , Peter Pirolli, Life, death, and lawfulness on the electronic frontier, Proceedings of the SIGCHI conference on Human factors in computing systems, p.383-390, March 22-27, 1997, Atlanta, Georgia, United States
[doi> 10.1145/258549.258805]
|
| |
44
|
|
 |
45
|
|
| |
46
|
Ross, S. 1983. Stochastic Processes. Wiley, New York.
|
| |
47
|
Selberg, E. and Etzioni, O. 1995. Multi-service search and comparison using the MetaCrawler. In Proceedings of the 4th International World Wide Web Conference.
|
| |
48
|
|
| |
49
|
Snell, L. 1998. Introduction to Probability. McGraw-Hill International Edition, Englewood Cliffs, N.J.
|
 |
50
|
Ron Weiss , Bienvenido Vélez , Mark A. Sheldon, HyPursuit: a hierarchical network search engine that exploits content-link hypertext clustering, Proceedings of the the seventh ACM conference on Hypertext, p.180-193, March 16-20, 1996, Bethesda, Maryland, United States
[doi> 10.1145/234828.234846]
|
| |
51
|
|
| |
52
|
|
| |
53
|
Yuwono, B., Lam, S., Ying, J., and Lee, D. 1995. A world wide web resource discovery system. In Proceedings of the 4th International World Wide Web Conference.
|
CITED BY 29
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Roberto Willrich , Rafael de Moura Speroni , Christopher Viana Lima , André Luiz de Oliveira Diaz , Sérgio Murilo Penedo, Adaptive information retrieval system applied to digital libraries, Proceedings of the 12th Brazilian symposium on Multimedia and the web, November 19-22, 2006, Natal, Rio Grande do Norte, Brazil
|
|
|
|
|
|
|
|
|
|
|
|
Tim Berners-Lee , Wendy Hall , James A. Hendler , Kieron O'Hara , Nigel Shadbolt , Daniel J. Weitzner, A framework for web science, Foundations and Trends in Web Science, v.1 n.1, p.1-130, January 2006
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Enrique Herrera-Viedma , Eduardo Peis , José M. Morales-del-Castillo , Sergio Alonso , Karina Anaya, A fuzzy linguistic model to evaluate the quality of Web sites that store XML documents, International Journal of Approximate Reasoning, v.46 n.1, p.226-253, September, 2007
|
|
|
Javier Ortiz-Hernández , Erika M. Nieto-Ariza , Hugo Estrada-Esquivel , Guillermo Rodríguez-Ortiz , Azucena Montes-Rendon, A theoretical evaluation for assessing the relevance of modeling techniques in business process modeling, Fourth international workshop on Software quality assurance: in conjunction with the 6th ESEC/FSE joint meeting, September 03-04, 2007, Dubrovnik, Croatia
|
|
|
|
|
|
Ion Ivan , Catalin Boja , Adrian Visoiu , Mihai Doinea, Optimization of distributed software, Proceedings of the 7th WSEAS International Conference on Software Engineering, Parallel and Distributed Systems, p.132-137, February 20-22, 2008, Cambridge, UK
|
|
|
|
|
|
Atsuyuki Morishima , Akiyoshi Nakamizo , Toshinari Iida , Shigeo Sugimoto , Hiroyuki Kitagawa, Bringing your dead links back to life: a comprehensive approach and lessons learned, Proceedings of the 20th ACM conference on Hypertext and hypermedia, June 29-July 01, 2009, Torino, Italy
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|