ACM Home Page
Please provide us with feedback. Feedback
Sources of evidence for vertical selection
Full text PdfPdf (1.95 MB)
Source
Annual ACM Conference on Research and Development in Information Retrieval archive
Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval table of contents
Boston, MA, USA
SESSION: Vertical search table of contents
Pages 315-322  
Year of Publication: 2009
ISBN:978-1-60558-483-6
Authors
Jaime Arguello  Carnegie Mellon University, Pittsburgh, PA, USA
Fernando Diaz  Yahoo! Labs Montreal, Montreal, PQ, Canada
Jamie Callan  Carnegie Mellon University, Pittsburgh, PA, USA
Jean-Francois Crespo  Yahoo! Labs Montreal, Montreal, PQ, Canada
Sponsors
SIGIR: ACM Special Interest Group on Information Retrieval
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 66,   Downloads (12 Months): 345,   Citation Count: 1
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1571941.1571997
What is a DOI?

ABSTRACT

Web search providers often include search services for domain-specific subcollections, called verticals, such as news, images, videos, job postings, company summaries, and artist profiles. We address the problem of vertical selection, predicting relevant verticals (if any) for queries issued to the search engine's main web search page. In contrast to prior query classification and resource selection tasks, vertical selection is associated with unique resources that can inform the classification decision. We focus on three sources of evidence: (1) the query string, from which features are derived independent of external resources, (2) logs of queries previously issued directly to the vertical, and (3) corpora representative of vertical content. We focus on 18 different verticals, which differ in terms of semantics, media type, size, and level of query traffic. We compare our method to prior work in federated search and retrieval effectiveness prediction. An in-depth error analysis reveals unique challenges across different verticals and provides insight into vertical selection for future work.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
2
 
3
A. Bhattacharyya. On a measure of divergence between two statistical populations defined by probability distributions. Bull. Calcutta Math. Soc., 35:99 -- 109, 1943.
 
4
J. Callan. Distributed information retrieval. In W. B. Croft, editor, Advances in Information Retrieval, pages 127--150. Kluwer Academic Publishers, 2000.
5
6
7
8
9
10
11
 
12
V. Murdock and M. Lalmas, editors. SIGIR 2008 Workshop on Aggregated Search, 2008.
13
14
 
15
 
16
L. Si. Federated Search of Text Search Engines in Uncooperative Environments. PhD thesis, Carnegie Mellon University, Pittsburgh, PA, 2006.
17
 
18
I. H. Witten and T. C. Bell. The zero-frequency problem: Estimating the probabilities of novel events in adaptive text compression. Transactions on Information Theory, 37, 1991.
19
 
20


Collaborative Colleagues:
Jaime Arguello: colleagues
Fernando Diaz: colleagues
Jamie Callan: colleagues
Jean-Francois Crespo: colleagues