ACM Home Page
Please provide us with feedback. Feedback
Approximate counting of inversions in a data stream
Full text PdfPdf (257 KB)
Source Annual ACM Symposium on Theory of Computing archive
Proceedings of the thiry-fourth annual ACM symposium on Theory of computing table of contents
Montreal, Quebec, Canada
SESSION: Session 7A table of contents
Pages: 370 - 379  
Year of Publication: 2002
ISBN:1-58113-495-9
Authors
Miklós Ajtai  IBM Almaden Research Center, San Jose, CA
T. S. Jayram  IBM Almaden Research Center, San Jose, CA
Ravi Kumar  IBM Almaden Research Center, San Jose, CA
D. Sivakumar  IBM Almaden Research Center, San Jose, CA
Sponsor
SIGACT: ACM Special Interest Group on Algorithms and Computation Theory
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 6,   Downloads (12 Months): 59,   Citation Count: 10
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues   peer to peer  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/509907.509964
What is a DOI?

ABSTRACT

(MATH) Inversions are used as a fundamental quantity to measure the sortedness of data, to evaluate different ranking methods for databases, and in the context of rank aggregation. Considering the volume of the data sets in these applications, the data stream model {14, 2] is a natural setting to design efficient algorithms.We obtain a suite of space-efficient streaming algorithms for approximating the number of inversions in a permutation. The best space bound we achieve is $O(\log n \log \log n)$ through a deterministic algorithm. In contrast, we derive an $\Omega(n)$ lower bound for randomized exact computation for this problem; thus approximation is essential.(MATH) We also consider two generalizations of this problem: (1) approximating the number of inversions between two permutations, for which we obtain a randomized $O(\sqrt{n} \log n)$-space algorithm, and (2) approximating the number of inversions in a general list, for which we obtain a randomized $O(\sqrt{n} \log^2 n)$-space two-pass algorithm. In contrast, we derive $\Omega(n)$-space lower bounds for deterministic approximate computation for these problems; thus both randomization and approximation are essential.All our algorithms use only O(log n) time per data item.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
 
3
 
4
 
5
 
6
P. Diaconis. Group Representation in Probability and Statistics. IMS Lecture Series 11, IMS, 1988.
 
7
P. Diaconis and R. Graham. Spearman's footrule as a measure of disarray. J. of the Royal Statistical Society, Series B, 39(2):262--268, 1977.
 
8
9
 
10
11
 
12
 
13
 
14
A. C. Gilbert, S. Guha, P. Indyk, Y. Kotidis, S. Muthukrishnan, and M. J. Strauss. A few good terms: Efficient streaming computation of wavelet decompositions. Manuscript, 2001.
15
 
16
 
17
 
18
 
19
 
20
21

CITED BY  10
 
 
 
 
 
 
 
 

Collaborative Colleagues:
Miklós Ajtai: colleagues
T. S. Jayram: colleagues
Ravi Kumar: colleagues
D. Sivakumar: colleagues

Peer to Peer - Readers of this Article have also read: