ACM Home Page
Please provide us with feedback. Feedback
Mining anchor text for query refinement
Full text PdfPdf (100 KB)
Source International World Wide Web Conference archive
Proceedings of the 13th international conference on World Wide Web table of contents
New York, NY, USA
SESSION: Query result processing table of contents
Pages: 666 - 674  
Year of Publication: 2004
ISBN:1-58113-844-X
Authors
Reiner Kraft  IBM Almaden Research Center, San Jose, CA
Jason Zien  IBM Almaden Research Center, San Jose, CA
Sponsor
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 25,   Downloads (12 Months): 174,   Citation Count: 20
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/988672.988763
What is a DOI?

ABSTRACT

When searching large hypertext document collections, it is often possible that there are too many results available for ambiguous queries. Query refinement is an interactive process of query modification that can be used to narrow down the scope of search results. We propose a new method for automatically generating refinements or related terms to queries by mining anchor text for a large hypertext document collection. We show that the usage of anchor text as a basis for query refinement produces high quality refinement suggestions that are significantly better in terms of perceived usefulness compared to refinements that are derived using the document content. Furthermore, our study suggests that anchor text refinements can also be used to augment traditional query refinement algorithms based on query logs, since they typically differ in coverage and produce different refinements. Our results are based on experiments on an anchor text collection of a large corporate intranet.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
2
 
3
4
 
5
E. W. Brown and H. A. Chong. The GURU system in TREC-6. In Text REtrieval Conference, pages 535--540, 1997.
 
6
 
7
C. Buckley, G. Salton, J. Allan, and A. Singhal. Automatic query expansion using SMART: TREC 3. In Text REtrieval Conference, pages 69--80, 1994.
8
 
9
 
10
11
12
13
14
15
 
16
17
18
19
 
20
O. A. McBryan. GENVL and WWWW: Tools for taming the web. In World Wide Web Conference (WWW'94), Geneva, Switzerland, 1994.
 
21
L. Page, S. Brin, R. Motwani, and T. Winograd. The pagerank citation ranking: Bringing order to the web. Technical report, Stanford Digital Library Technologies Project, 1998.
22
23
24
25
 
26
J. Zien, J. Meyer, J. Tomlin, and J. Liu. Web query characteristics and their implications on search engines. IBM Research Report, RJ 10199, November 2000.

CITED BY  20

Collaborative Colleagues:
Reiner Kraft: colleagues
Jason Zien: colleagues