ACM Home Page
Please provide us with feedback. Feedback
Predicting query reformulation during web searching
Full text PdfPdf (672 KB)
Source
Conference on Human Factors in Computing Systems archive
Proceedings of the 27th international conference extended abstracts on Human factors in computing systems table of contents
Boston, MA, USA
SESSION: Spotlight on work in progress session 1 table of contents
Pages 3907-3912  
Year of Publication: 2009
ISBN:978-1-60558-247-4
Authors
Bernard J. Jansen  The Pennsylvania State University, University Park, PA, USA
Danielle Booth  The Pennsylvania State University, University Park, PA, USA
Amanda Spink  Queensland University of Technology, Brisbane, PQ, Australia
Sponsors
ACM: Association for Computing Machinery
SIGCHI: ACM Special Interest Group on Computer-Human Interaction
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 12,   Downloads (12 Months): 73,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1520340.1520592
What is a DOI?

ABSTRACT

his paper reports results from a study in which we automatically classified the query reformulation patterns for 964,780 Web searching sessions (composed of 1,523,072 queries) in order to predict what the next query reformulation would be. We employed an n-gram modeling approach to describe the probability of searchers transitioning from one query reformulation state to another and predict their next state. We developed first, second, third, and fourth order models and evaluated each model for accuracy of prediction. Findings show that Reformulation and Assistance account for approximately 45 percent of all query reformulations. Searchers seem to seek system searching assistant early in the session or after a content change. The results of our evaluations show that the first and second order models provided the best predictability, between 28 and 40 percent overall, and higher than 70 percent for some patterns. Implications are that the n-gram approach can be used for improving searching systems and searching assistance in real time.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
2
3
 
4
Belkin, N., Oddy, R. and Brooks, H. Ask for information retrieval, parts 1 & 2. Journal of Documentation, 38, 2 1982), 61--71, 145--164.
 
5
 
6
 
7
Meadow, C. T., Hewett, T. T. and Aversa, E. A computer intermediary for interactive database searching ii: Evaluation. Journal of the American Society for Information Science, 331982), 357--364.
 
8
Park, S., Bae, H. and Lee, J. End user searching: A Web log analysis of NAVER, a Korean Web search engine. Library & Information Science Research, 27, 2 2005), 203--221.
 
9
10
 
11

Collaborative Colleagues:
Bernard J. Jansen: colleagues
Danielle Booth: colleagues
Amanda Spink: colleagues