|
ABSTRACT
We present a new algorithm for improving retrieval results by combining document ranking functions: Condorcet-fuse. Beginning with one of the two major classes of voting procedures from Social Choice Theory, the Condorcet procedure, we apply a graph-theoretic analysis that yields a sorting-based algorithm that is elegant, efficient, and effective. The algorithm performs very well on TREC data, often outperforming existing metasearch algorithms whether or not relevance scores and training data is available. Condorcet-fuse significantly outperforms Borda-fuse, the analogous representative from the other major class of voting algorithms.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
|
| |
2
|
|
| |
3
|
|
| |
4
|
N. Belkin, P. Kantor, C. Cool, and R. Quatrain. Combining evidence for information retrieval. In Harman {15}, pages 35--43.
|
| |
5
|
N. Craswell, D. Hawking, and P. Thistlewaite. Merging results from isolated search engines. In Proceedings of the Tenth Australasian Database Conference, Aukland, New Zealand, Jan. 1999. Springer-Verlag.
|
| |
6
|
W. B. Croft. Combining approaches to information retrieval. In W. B. Croft, editor, Advances in Information Retrieval: Recent Research from the Center for Intelligent Information Retrieval, The Kluwer International Series on Information Retrieval, chapter~1. Kluwer Academic Publishers, 2000.
|
| |
7
|
W. B. Croft, D. J. Harper, D. H. Kraft, and J. Zobel, editors. SIGIR'01, Proceedings of the 24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, New Orleans, Louisiana, USA, Sept. 2001. ACM Press, New York.
|
| |
8
|
J. C. de~Borda. Mémoire sur les élections au scrutin. In Histoire de l'Academie Royale des Sciences. Paris, 1781.
|
| |
9
|
M. de~Condorcet. Essai sur l'application de l'analyse à la probabilité des decisions rendues à la pluralité des voix, 1785.
|
| |
10
|
H. L. Fisher and D. R. Elchesen. Effectiveness of combining title words and index terms in machine retrieval searches. Nature, 238:109--110, July 1972.
|
 |
11
|
|
| |
12
|
E. A. Fox, M. P. Koushik, J. Shaw, R. Modlin, and D. Rao. Combining evidence from multiple searches. In D. Harman, editor, The First Text REtrieval Conference (TREC-1), pages 319--328, Gaithersburg, MD, USA, Mar. 1993. U.S. Government Printing Office, Washington D.C.
|
| |
13
|
E. A. Fox and J. A. Shaw. Combination of multiple searches. In Harman {15}, pages 243--249.
|
| |
14
|
|
| |
15
|
D. Harman, editor. The Second Text REtrieval Conference (TREC-2), Gaithersburg, MD, USA, Mar. 1994. U.S. Government Printing Office, Washington D.C.
|
 |
16
|
|
| |
17
|
J. S. Kelly. Social Choice Theory: An Introduction. Springer-Verlag, 1988.
|
 |
18
|
|
 |
19
|
|
 |
20
|
|
| |
21
|
H. Moulin. Axioms of Cooperative Decision Making. Cambridge University Press, 1988.
|
| |
22
|
K. B. Ng. An Investigation of the Conditions for Effective Data Fusion in Information Retrieval. PhD thesis, School of Communication, Information, and Library Studies, Rutgers University, 1998.
|
| |
23
|
K. B. Ng and P. B. Kantor. An investigation of the preconditions for effective data fusion in IR: A pilot study. In Proceedings of the 61th Annual Meeting of the American Society for Information Science, 1998.
|
| |
24
|
K. B. Ng, D. Loewenstern, C. Basu, H. Hirsh, and P. B. Kantor. Data fusion of machine learning methods for the TREC5 routing task (and other work). In Voorhees and Harman {35}, pages 477--487.
|
| |
25
|
W. H. Riker. Liberalism Against Populism. Waveland Press, 1982.
|
| |
26
|
|
| |
27
|
J. A. Shaw and E. A. Fox. Combination of multiple searches. In D. Harman, editor, Overview of the Third Text REtrieval Conference (TREC-3), pages 105--108, Gaithersburg, MD, USA, Apr. 1995. U.S. Government Printing Office, Washington D.C.
|
| |
28
|
|
| |
29
|
|
| |
30
|
|
| |
31
|
|
| |
32
|
C. C. Vogt. How much more is better? Characterizing the effects of adding more IR systems to a combination. In Content-Based Multimedia Information Access (RIAO), pages 457--475, Paris, France, Apr. 2000.
|
| |
33
|
|
| |
34
|
C. C. Vogt, G. W. Cottrell, R. K. Belew, and B. T. Bartell. Using relevance to train a linear mixture of experts. In Voorhees and Harman {35}, pages 503--515.
|
| |
35
|
E. Voorhees and D. Harman, editors. The Fifth Text REtrieval Conference (TREC-5), Gaithersburg, MD, USA, 1997. U.S. Government Printing Office, Washington D.C.
|
 |
36
|
Ellen M. Voorhees , Narendra K. Gupta , Ben Johnson-Laird, Learning collection fusion strategies, Proceedings of the 18th annual international ACM SIGIR conference on Research and development in information retrieval, p.172-179, July 09-13, 1995, Seattle, Washington, United States
[doi> 10.1145/215206.215357]
|
CITED BY 27
|
|
|
|
Steven M. Beitzel , Eric C. Jensen , Ophir Frieder , Abdur Chowdhury , Greg Pass, Surrogate scoring for improved metasearch precision, Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval, August 15-19, 2005, Salvador, Brazil
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Javed A. Aslam , Virgiliu Pavlu , Robert Savell, A unified model for metasearch, pooling, and system evaluation, Proceedings of the twelfth international conference on Information and knowledge management, November 03-08, 2003, New Orleans, LA, USA
|
|
Steven M. Beitzel , Ophir Frieder , Eric C. Jensen , David Grossman , Abdur Chowdhury , Nazli Goharian, Disproving the fusion hypothesis: an analysis of data fusion via effective information retrieval strategies, Proceedings of the 2003 ACM symposium on Applied computing, March 09-12, 2003, Melbourne, Florida
|
|
|
|
|
|
Thomas R. Lynam , Chris Buckley , Charles L. A. Clarke , Gordon V. Cormack, A multi-system analysis of document and term selection for blind feedback, Proceedings of the thirteenth ACM international conference on Information and knowledge management, November 08-13, 2004, Washington, D.C., USA
|
|
|
D. Lillis , F. Toolan , A. Mur , L. Peng , R. Collier , J. Dunnion, Probability-based fusion of information retrieval result sets, Artificial Intelligence Review, v.25 n.1-2, p.179-191, April 2006
|
|
Ronald Fagin , Ravi Kumar , Mohammad Mahdian , D. Sivakumar , Erik Vee, Comparing and aggregating rankings with ties, Proceedings of the twenty-third ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems, June 14-16, 2004, Paris, France
|
|
|
|
|
|
|
|
David Lillis , Fergus Toolan , Rem Collier , John Dunnion, ProbFuse: a probabilistic approach to data fusion, Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval, August 06-11, 2006, Seattle, Washington, USA
|
|
|
|
|
Steven M. Beitzel , Eric C. Jensen , Abdur Chowdhury , David Grossman , Ophir Frieder , Nazli Goharian, Fusion of effective retrieval strategies in the same information retrieval system, Journal of the American Society for Information Science and Technology, v.55 n.10, p.859-868, August 2004
|
|
|
|
|
|
|
|
|
|
Ronald Fagin , Ravi Kumar , Kevin S. McCurley , Jasmine Novak , D. Sivakumar , John A. Tomlin , David P. Williamson, Searching the workplace web, Proceedings of the 12th international conference on World Wide Web, May 20-24, 2003, Budapest, Hungary
|
|
Dimitrios Skoutas , Dimitris Sacharidis , Alkis Simitsis , Verena Kantere , Timos Sellis, Top-k dominant web services under multi-criteria matching, Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology, March 24-26, 2009, Saint Petersburg, Russia
|
Peer to Peer - Readers of this Article have also read:
-
M4: a metamodel for data preprocessing
Proceedings of the 4th ACM international workshop on Data warehousing and OLAP
Anca Vaduva
, Jörg-Uwe Kietz
, Regina Zücker
-
Data structures for quadtree approximation and compression
Communications of the ACM
28, 9
Hanan Samet
-
A hierarchical single-key-lock access control using the Chinese remainder theorem
Proceedings of the 1992 ACM/SIGAPP Symposium on Applied computing
Kim S. Lee
, Huizhu Lu
, D. D. Fisher
-
The GemStone object database management system
Communications of the ACM
34, 10
Paul Butterworth
, Allen Otis
, Jacob Stein
-
Putting innovation to work: adoption strategies for multimedia communication systems
Communications of the ACM
34, 12
Ellen Francik
, Susan Ehrlich Rudman
, Donna Cooper
, Stephen Levine
|