ACM Home Page
Please provide us with feedback. Feedback
Re-ranking search results using network analysis a case study with google: a case study with Google
Full text PdfPdf (186 KB)
Source IBM Centre for Advanced Studies Conference archive
Proceedings of the 2002 conference of the Centre for Advanced Studies on Collaborative research table of contents
Toronto, Ontario, Canada
Page: 14  
Year of Publication: 2002
Authors
Behnak Yaltaghian  Interactive Media Laboratory, Bahen Center for Information Technology, University of Toronto, Toronto, Ontario, M5S 2E4
Mark Chignell  Interactive Media Laboratory, Bahen Center for Information Technology, University of Toronto, Toronto, Ontario, M5S 2E4
Sponsors
IBM Canada : IBM Canada
NRC : National Research Council - Canada
Publisher
IBM Press 
Bibliometrics
Downloads (6 Weeks): 10,   Downloads (12 Months): 70,   Citation Count: 2
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues   peer to peer  

Tools and Actions: Review this Article  

ABSTRACT

In this paper we review methods of structured search for information on the World Wide Web. We propose new methods based on co-citation and network analysis. We describe a set of 21 measures based on these methods and examine the factor structure of those measures. We then report on a recent study that we have conducted at the University of Toronto. Human judges rated the relevance of a selection of Web pages returned by the Google search engine for each of seven queries. We compared the average judged relevance of the top 20 search results selected by Google vs. the top 20 results as selected by each of the 21 network analysis measures. All but one of the network analysis measures ("inlink") showed significantly (p<.05) better (as compared to Google) average judged relevance amongst their top 20 selections. Stepwise regression analysis was then used to identify a linear model with three network analysis measures as predictors, which accounted for roughly 17% of the variance in relevance judgments. While these results need to be extended with more detailed analysis of a wide range of queries and topics, they suggest that network analysis of search output adjacency matrices (where adjacency/proximity is based on web-wide co-citations) may significantly improve search engine rankings.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
2
 
3
{3} Borgatti S., Everett M. and Freeman L. UCINET5, Software for social Network Analysis: User Guide, 2001.
 
4
5
 
6
 
7
 
8
{8} Everett Martin G. and Borgatti Stephen P. Analyzing Clique Overlap. Connections, 21 (1), 49-61, 1998.
 
9
{9} Google Search Engine: www.google.com
 
10
11
 
12
{12} Hanneman, Robert. Introduction to Social Network Methods. Self published at: http://faculty.ucr.edu/~hanneman/Soc157/TEX T/TextIndex.html. 2000
 
13
14
15
 
16
{16} Larson R. Bibliometrics of the World Wide Web: An Exploratory Analysis of the Intellectual Structure of Cyberspace, Proceedings of ASIS' 1996.
 
17
{17} Small H. and Griffith B. The structure of scientific literatures: identifying and graphing specialties. Science Studies, 4 (17), 17-40, 1974.
 
18
{18} Yaltaghian B. and Chignell M. Facilitation of Browsing the Search Engine Results: Using Co-Citation Analysis to Organize & Present the Search Results, Knowledge Network Conference - Beyond The Edge: Road Mapping Innovation, CITO, Ottawa, Oct. 2000
 
19
{19} Yaltaghian B. and Chignell M. How Good is Search Engine Ranking?: A Validation Study with Human Judges, To appear in the Annual Meeting of the Human Factors and Ergonomics Society, Baltimore, MD, Sep 2002.
20


Collaborative Colleagues:
Behnak Yaltaghian: colleagues
Mark Chignell: colleagues

Peer to Peer - Readers of this Article have also read: