|
ABSTRACT
For a massive I/O array of size n, we want to compute the first N local moments, for some constant N. Our simpler algorithms partition the array into consecutive ranges called bins, and apply not only to local-moment queries, but also to algebraic queries. With N buffers of size &sqrt;n, time complexity drops to O(&sqrt;n). A more sophisticated approach uses hierarchical buffering and has a logarithmic time complexity (O(b logb n)), when using N hierarchical buffers of size n/b. Using overlapped bin buffering, we show that only one buffer is needed, as with wavelet-based algorithms, but using much less storage.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
Noga Alon , Yossi Matias , Mario Szegedy, The space complexity of approximating the frequency moments, Proceedings of the twenty-eighth annual ACM symposium on Theory of computing, p.20-29, May 22-24, 1996, Philadelphia, Pennsylvania, United States
[doi> 10.1145/237814.237823]
|
| |
2
|
|
| |
3
|
|
| |
4
|
Cleveland, W., and Loader, C. 1995. Smoothing by local regression: Principles and methods. Tech. Rep., AT&T Bell Laboratories.
|
| |
5
|
Codd, E. F., Codd, S., and Salley, C. 1993. Providing OLAP (on-line analytical processing) to user-analysts: An IT mandate. Tech. Rep., E. F. Codd and Associates.
|
 |
6
|
|
| |
7
|
|
| |
8
|
Jim Gray , Adam Bosworth , Andrew Layman , Hamid Pirahesh, Data Cube: A Relational Aggregation Operator Generalizing Group-By, Cross-Tab, and Sub-Total, Proceedings of the Twelfth International Conference on Data Engineering, p.152-159, February 26-March 01, 1996
|
 |
9
|
Ching-Tien Ho , Rakesh Agrawal , Nimrod Megiddo , Ramakrishnan Srikant, Range queries in OLAP data cubes, Proceedings of the 1997 ACM SIGMOD international conference on Management of data, p.73-88, May 11-15, 1997, Tucson, Arizona, United States
|
| |
10
|
IEC. 1999. Letter symbols to be used in electrical technology---Part 2: Telecommunications and electronics. Tech. Rep. IEC 60027-2 Second ed. International Electrotechnical Commission.
|
 |
11
|
|
| |
12
|
|
| |
13
|
Lemire, D. 2007. A better alternative to piecewise linear time series segmentation. In Proceedings of the SIAM International Conference on Data Mining (SDM).
|
| |
14
|
Lemire, D., and Kaser, O. 2007. Hierarchical bin buffering library in C++. http://code.google.com/p/hierarchicalbinbuffering/. Last checked on 15-7-2007.
|
| |
15
|
Li, B.-C., and Shen, J. 1992. Fast calculation of local moments and application to range image segmentation. In Proceedings of the International Conference on Pattern Recognition, 298--301.
|
| |
16
|
|
 |
17
|
|
| |
18
|
|
| |
19
|
|
| |
20
|
|
| |
21
|
Scott, D., and Sagae, M. 1997. Adaptive density estimation with massive data sets. In Proceedings of the Statistical Computing Section, ASA, 104--108.
|
| |
22
|
Silva, C., Chiang, Y., El-Sana, J., and Lindstrom, P. 2002. Out-of-Core algorithms for scientific visualization and computer graphics. In Visualization Course Notes, Tutorial.
|
| |
23
|
|
 |
24
|
Jeffrey Scott Vitter , Min Wang , Bala Iyer, Data cube approximation and histograms via wavelets, Proceedings of the seventh international conference on Information and knowledge management, p.96-104, November 02-07, 1998, Bethesda, Maryland, United States
[doi> 10.1145/288627.288645]
|
| |
25
|
|
|