ACM Home Page
Please provide us with feedback. Feedback
Comparing top k lists
Full text PdfPdf (1.02 MB)
Source Symposium on Discrete Algorithms archive
Proceedings of the fourteenth annual ACM-SIAM symposium on Discrete algorithms table of contents
Baltimore, Maryland
SESSION: Session 1A table of contents
Pages: 28 - 36  
Year of Publication: 2003
ISBN:0-89871-538-5
Authors
Ronald Fagin  IBM Almaden Research Center, San Jose, CA
Ravi Kumar  IBM Almaden Research Center, San Jose, CA
D. Sivakumar  IBM Almaden Research Center, San Jose, CA
Sponsors
: SIAM Activity Group on Discrete Mathematics
SIGACT: ACM Special Interest Group on Algorithms and Computation Theory
Publisher
Society for Industrial and Applied Mathematics  Philadelphia, PA, USA
Bibliometrics
Downloads (6 Weeks): 23,   Downloads (12 Months): 192,   Citation Count: 35
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  

ABSTRACT

Motivated by several applications, we introduce various distance measures between "top k lists." Some of these distance measures are metrics, while others are not. For each of these latter distance measures: we show that it is "almost" a metric in the following two seemingly unrelated aspects:step-(i) it satisfies a relaxed version of the polygonal (hence, triangle) inequality, andstep-(ii) there is a metric with positive constant multiples that bounds our measure above and below.This is not a coincidence---we show that these two notions of almost being a metric are the same. Based on the second notion, we define two distance measures to be equivalent if they are bounded above and below by constant multiples of each other. We thereby identify a large and robust equivalence class of distance measures.Besides the applications to the task of identifying good notions of (dis-)similarity between two top k lists, our results imply polynomial-time constant-factor approximation algorithms for the rank aggregation problem with respect to a large class of distance measures.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
3
 
4
 
5
D. E. Critchlow. Metric Methods for Analyzing Partially Ranked Data. Number 34 in Lecture Notes in Statistics. Springer-Verlag, Berlin, 1980.
 
6
P. Diaconis. Group Representation in Probability and Statistics. Number 11 in IMS Lecture Series. Institute of Mathematical Statistics, 1988.
 
7
P. Diaconis and R. Graham. Spearman's footrule as a measure of disarray. Journal of the Royal Statistical Society, Series B, 39(2):262--268, 1977.
8
 
9
 
10
 
11
M. Kendall and J. D. Gibbons. Rank Correlation Methods. Edward Arnold, London, 1990.
12
 
13

CITED BY  35

Collaborative Colleagues:
Ronald Fagin: colleagues
Ravi Kumar: colleagues
D. Sivakumar: colleagues