| Re-ranking search results using network analysis a case study with google: a case study with Google |
| Full text |
Pdf
(186 KB)
|
| Source
|
IBM Centre for Advanced Studies Conference
archive
Proceedings of the 2002 conference of the Centre for Advanced Studies on Collaborative research
table of contents
Toronto, Ontario, Canada
Page: 14
Year of Publication: 2002
|
|
Authors
|
|
Behnak Yaltaghian
|
Interactive Media Laboratory, Bahen Center for Information Technology, University of Toronto, Toronto, Ontario, M5S 2E4
|
|
Mark Chignell
|
Interactive Media Laboratory, Bahen Center for Information Technology, University of Toronto, Toronto, Ontario, M5S 2E4
|
|
| Sponsors |
|
| Publisher |
IBM Press
|
| Bibliometrics |
Downloads (6 Weeks): 10, Downloads (12 Months): 70, Citation Count: 2
|
|
|
ABSTRACT
In this paper we review methods of structured search for information on the World Wide Web. We propose new methods based on co-citation and network analysis. We describe a set of 21 measures based on these methods and examine the factor structure of those measures. We then report on a recent study that we have conducted at the University of Toronto. Human judges rated the relevance of a selection of Web pages returned by the Google search engine for each of seven queries. We compared the average judged relevance of the top 20 search results selected by Google vs. the top 20 results as selected by each of the 21 network analysis measures. All but one of the network analysis measures ("inlink") showed significantly (p<.05) better (as compared to Google) average judged relevance amongst their top 20 selections. Stepwise regression analysis was then used to identify a linear model with three network analysis measures as predictors, which accounted for roughly 17% of the variance in relevance judgments. While these results need to be extended with more detailed analysis of a wide range of queries and topics, they suggest that network analysis of search output adjacency matrices (where adjacency/proximity is based on web-wide co-citations) may significantly improve search engine rankings.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
|
 |
2
|
Allan Borodin , Gareth O. Roberts , Jeffrey S. Rosenthal , Panayiotis Tsaparas, Finding authorities and hubs from link structures on the World Wide Web, Proceedings of the 10th international conference on World Wide Web, p.415-429, May 01-05, 2001, Hong Kong, Hong Kong
[doi> 10.1145/371920.372096]
|
| |
3
|
{3} Borgatti S., Everett M. and Freeman L. UCINET5, Software for social Network Analysis: User Guide, 2001.
|
| |
4
|
|
 |
5
|
|
| |
6
|
Soumen Chakrabarti , Byron Dom , Prabhakar Raghavan , Sridhar Rajagopalan , David Gibson , Jon Kleinberg, Automatic resource compilation by analyzing hyperlink structure and associated text, Proceedings of the seventh international conference on World Wide Web 7, p.65-74, April 1998, Brisbane, Australia
|
| |
7
|
Soumen Chakrabarti , Byron E. Dom , S. Ravi Kumar , Prabhakar Raghavan , Sridhar Rajagopalan , Andrew Tomkins , David Gibson , Jon Kleinberg, Mining the Web's Link Structure, Computer, v.32 n.8, p.60-67, August 1999
[doi> 10.1109/2.781636]
|
| |
8
|
{8} Everett Martin G. and Borgatti Stephen P. Analyzing Clique Overlap. Connections, 21 (1), 49-61, 1998.
|
| |
9
|
{9} Google Search Engine: www.google.com
|
| |
10
|
|
 |
11
|
|
| |
12
|
{12} Hanneman, Robert. Introduction to Social Network Methods. Self published at: http://faculty.ucr.edu/~hanneman/Soc157/TEX T/TextIndex.html. 2000
|
| |
13
|
|
 |
14
|
|
 |
15
|
|
| |
16
|
{16} Larson R. Bibliometrics of the World Wide Web: An Exploratory Analysis of the Intellectual Structure of Cyberspace, Proceedings of ASIS' 1996.
|
| |
17
|
{17} Small H. and Griffith B. The structure of scientific literatures: identifying and graphing specialties. Science Studies, 4 (17), 17-40, 1974.
|
| |
18
|
{18} Yaltaghian B. and Chignell M. Facilitation of Browsing the Search Engine Results: Using Co-Citation Analysis to Organize & Present the Search Results, Knowledge Network Conference - Beyond The Edge: Road Mapping Innovation, CITO, Ottawa, Oct. 2000
|
| |
19
|
{19} Yaltaghian B. and Chignell M. How Good is Search Engine Ranking?: A Validation Study with Human Judges, To appear in the Annual Meeting of the Human Factors and Ergonomics Society, Baltimore, MD, Sep 2002.
|
 |
20
|
|
Peer to Peer - Readers of this Article have also read:
-
Data structures for quadtree approximation and compression
Communications of the ACM
28, 9
Hanan Samet
-
A hierarchical single-key-lock access control using the Chinese remainder theorem
Proceedings of the 1992 ACM/SIGAPP Symposium on Applied computing
Kim S. Lee
, Huizhu Lu
, D. D. Fisher
-
The GemStone object database management system
Communications of the ACM
34, 10
Paul Butterworth
, Allen Otis
, Jacob Stein
-
Putting innovation to work: adoption strategies for multimedia communication systems
Communications of the ACM
34, 12
Ellen Francik
, Susan Ehrlich Rudman
, Donna Cooper
, Stephen Levine
-
An intelligent component database for behavioral synthesis
Proceedings of the 27th ACM/IEEE Design Automation Conference on
Gwo-Dong Chen
, Daniel D. Gajski
|