cited by index terms review collaborative colleagues
Classification: F. Theory of Computation F.2 ANALYSIS OF ALGORITHMS AND PROBLEM COMPLEXITY F.2.1 Numerical Algorithms and Problems Subjects: Number-theoretic computations (e.g., factoring, primality testing) G. Mathematics of Computing G.3 PROBABILITY AND STATISTICS Subjects: Probabilistic algorithms (including Monte Carlo) H. Information Systems H.2 DATABASE MANAGEMENT H.3 INFORMATION STORAGE AND RETRIEVAL
General Terms: Algorithms, Measurement
The algorithm presented here bounds the number of distinct elements (n>) in a large collection of data records. The essence of the algorithm is to “or” to a bit vector the least significant one bit of a hashed version of ea more...