ACM Home Page
Please provide us with feedback. Feedback
Agreeing to disagree: search engines and their public interfaces
Full text PdfPdf (309 KB)
Source
International Conference on Digital Libraries archive
Proceedings of the 7th ACM/IEEE-CS joint conference on Digital libraries table of contents
Vancouver, BC, Canada
SESSION: User studies and user interfaces table of contents
Pages: 309 - 318  
Year of Publication: 2007
ISBN:978-1-59593-644-8
Authors
Frank McCown  Old Dominion University, Norfolk, VA
Michael L. Nelson  Old Dominion University, Norfolk, VA
Sponsors
ACM: Association for Computing Machinery
SIGIR: ACM Special Interest Group on Information Retrieval
SIGWEB: ACM Special Interest Group on Hypertext, Hypermedia, and Web
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 22,   Downloads (12 Months): 121,   Citation Count: 7
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1255175.1255237
What is a DOI?

ABSTRACT

Google, Yahoo and MSN all provide both web user interfaces (WUIs) and application programming interfaces (APIs) to their collections. Whether building collections of resources or studying the search engines themselves, the search engines request that researchers use their APIs and not "scrape" the WUIs. However, anecdotal evidence suggests the interfaces produce different results. We provide the first in depth quantitative analysis of the results produced by the Google, MSN and Yahoo API and WUI interfaces. We have queried both interfaces for five months and found significant discrepancies between the interfaces in several categories. In general, we found MSN to produce the most consistent results between their two interfaces. Our findings suggest that the API indexes are not older, but they are probably smaller for Google and Yahoo. We also examined how search results decay over time and built predictive models based on the observed decay rates. Based on our findings, it can take over a year for half of the top 10 results to a popular query to be replaced in Google and Yahoo; for MSN it may take only 2-3 months.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Ask terms of service, 2006. http://sp.ask.com/en/docs/about/terms_of_service.shtml.
 
2
J. Bar-Ilan. Search engine results over time-A case study on search engine stability. Cybermetrics, 2/3(1), 1998/99.
 
3
 
4
 
5
J. Bar-Ilan. Expectations versus reality-search engine features needed for web research at mid 2005. Cybermetrics, 9(1), 2005.
 
6
J. Bar-Ilan, M. Levene, and M. Mat-Hassan. Dynamics of search engine rankings-A case study. In Proceedings of the 3rd International Workshop on Web Dynamics, May 2004.
 
7
8
9
 
10
 
11
12
 
13
D. Clinton. Beyond the SOAP search API, Dec. 2006. http://google-code-updates.blogspot.com/2006/12/beyond-soap-search-api.html.
 
14
K. Curran and A. Doherty. Automated broadcast media monitoring using the Google API. In Proceedings of CCNC 2006, volume 2, pages 1098--1102, 2006.
 
15
M. Cutts. GoogleGuy's posts, June 2005. http://www.webmasterworld.com/forum30/29720.htm.
 
16
M. Cutts. Google datacenters. Video, July 31 2006. http://video.google.com/videoplay?docid=8726665066825965913.
 
17
Did-it, Enquiro, and Eyetools uncover search's Golden Triangle, 2005. http://www.enquiro.com/eye-tracking-pr.asp.
 
18
W. Ding and G. Marchionini. A comparative study of web search service performance. In Proceedings of the ASIS Annual Meeting, volume 33, pages 136--142, 1996.
 
19
 
20
P. Festa. Google worm targets AOL, Yahoo. Dec. 28 2004. http://news.com.com/Google+worm+targets+AOL%2C+Yahoo/2100-7349_3-5504769.html.
 
21
S. Gauch, G. Wang, and M. Gomez. Profusion: Intelligent fusion from multiple, distributed search engines. Journal of Universal Computer Science, 2(9):637--649, 1996.
 
22
B. Gillette. Google blacklisting researchers? Dec. 14 2004. http://www.emailbattles.com/2005/12/14/virus_aacdehdcic_ei/.
 
23
Google privacy center: Terms of service, 2006. http://www.google.com/terms_of_service.html.
24
 
25
T. G. Habing, T. W. Cole, and W. H. Mischo. Developing a technical registry of OAI data providers. In Proceedings of ECDL '04, pages 400--410, 2004.
 
26
N. Jain, M. Dahlin, and R. Tewari. Using Bloom filters to refine web search results. In Proceedings of the 8th International Workshop on the Web and Databases, 2005.
27
 
28
W. Koehler. A longitudinal study of web pages continued: A consideration of document persistence. Information Research, 9(2), 2004.
 
29
M. Koo and H. Skinner. Improving web searches: Case study of quit-smoking web sites for teenagers. Journal of Medical Internet Research, 5(4), Nov. 2003.
 
30
31
 
32
The Lycos 50, 2006. http://50.lycos.com/.
 
33
P. Mayr and F. Tosques. Google Web APIs - an instrument for webometric analyses? In Proceedings of the 10th International Conference of the International Society for Scientometrics and Informetrics (ISSI '05), 2005.
 
34
F. McCown. Comparison of search engine interfaces, 2006. http://www.cs.odu.edu/~fmccown/research/se_apis/.
 
35
F. McCown, J. Bollen, and M. L. Nelson. Evaluation of the NSDL and Google for obtaining pedagogical resources. In Proceedings of ECDL '05, pages 344--355, 2005.
 
36
F. McCown, S. Chan, M. L. Nelson, and J. Bollen. The availability and persistence of web references in D-Lib Magazine. In Proceedings of the 5th International Web Archiving Workshop (IWAW '05), Sept. 2005.
 
37
 
38
M. Moffatt. Yahoo error: Unable to process request at this time - error 999. Feb. 14 2005. http://murraymoffatt.com/software-problem-0011.html.
 
39
MSN terms of service, 2006. http://tou.live.com/en-us/default.aspx.
 
40
MSN Web Search API. http://msdn.microsoft.com/msn/msnsearch/.
41
42
 
43
 
44
C. Snelson. Sampling the Web: The development of a custom search tool for research. Library and Information Science Research Electronic Journal, 16(1), Dec. 2005.
 
45
 
46
K. C. Sua, S. E. Waldren, and T. B. Patrick. Differences inthe effects of filters on health information retrieval from the internet in three languages from three countries: A comparative study. In Proceedings of MEDINFO 2004, 2004.
 
47
M. Thelwall. Can the Web give useful information about commercial uses of scientific research? Online Information Review, 28:120--130, 2004.
 
48
 
49
What's a "supplemental result?" Google Webmaster Help Center, 2006. http://www.google.com/support/webmasters/bin/answer.py?answer=34473.
 
50
Wikipedia: List of basic computer science topics, 2006. http://en.wikipedia.org/wiki/List_of_basic_computer_science_topics.
 
51
Yahoo! Web Search APIs. http://developer.yahoo.net/search/web/.
52

CITED BY  7

Collaborative Colleagues:
Frank McCown: colleagues
Michael L. Nelson: colleagues