APPENDICES and SUPPLEMENTS
|
|
Online appendix to designing mediation for context-aware applications. The appendix supports the information on page 1.
|
ABSTRACT
Given an OLAP query expressed over multiple source OLAP databases, we study the problem of estimating the resulting OLAP target database. The problem arises when it is not possible to derive the result from a single database. The method we use is linear indirect estimation, commonly used for statistical estimation. We examine two obvious computational methods for computing such a target database, called the full cross-product (F) and preaggregation (P) methods. We study the accuracy and computational cost of these methods. While the F method provides a more accurate estimate, it is more expensive computationally than P. Our contribution is in proposing a third, new method, called the partial preaggregation method (PP), which is significantly less expensive than F, but just as accurate. We prove formally that the PP method yields the same results as the F method, and provide analytical and experimental results on the accuracy and computational benefits of the PP method.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
|
| |
3
|
Chan, P. and Shoshani, A. 1981. SUBJECT: A directory driven system for organizing and accessing large statistical databases. In Proceedings of the 7th International Conference on Very Large Data Bases (VLDB) (Cannes, France), C. Zaniolo and C. Delobel Eds. IEEE Computer Society Press, Los Alamitos, CA. 553--563.
|
| |
4
|
Chand, N. and Alexander, C. H. 1996. Small area estimation with administrative records and continuous measurement. In Proceedings of the Survey Research Methods Section, Annual Meeting of the American Statistical Association (ASA) (Chicago, IL). 870--875.
|
| |
5
|
Codd, E. F., Codd, S. B., and Salley, C. T. 1993. Providing OLAP (on-line analytical processing) to user-analysts: An IT mandate. Tech. Rep., E.F. Codd and Associates.
|
| |
6
|
Elliott, P., Cuzick, J., English, D., and Stern, R. 1996. Geographical and Environmental Epidemiology: Methods for Small-Area Studies. Oxford University Press, New York.
|
| |
7
|
|
| |
8
|
Ghosh, M. and Rao, J. N. K. 1994. Small area estimation: An appraisal. Statis. Sci. 9, 55--93.
|
| |
9
|
Jim Gray , Adam Bosworth , Andrew Layman , Hamid Pirahesh, Data Cube: A Relational Aggregation Operator Generalizing Group-By, Cross-Tab, and Sub-Total, Proceedings of the Twelfth International Conference on Data Engineering, p.152-159, February 26-March 01, 1996
|
| |
10
|
|
| |
11
|
|
 |
12
|
|
 |
13
|
|
| |
14
|
Pfeffermann, D. 2002. Small area estimation-New developments and directions. Int. Statis. Revi. 70, 1, 125--143.
|
| |
15
|
Pourabbas, E. and Shoshani, A. 2003. Answering joint queries from multiple aggregate OLAP databases. In Data Warehousing and Knowledge Discovery, 5th International Conference (DaWaK), Y. Kambayashi et al., Eds. Lecture Notes Notes in Computer Science vol. 2737. Springer Verlag. 24--34.
|
| |
16
|
Rao, J. N. K. 2003. Small Area Estimation. John Wiley, New York.
|
| |
17
|
RTI International. 2007. RTI webpage. http://www.rti.org/page.cfm?nav=497.
|
| |
18
|
Schaible, W. L. 1996. Indirect estimators in U.S. federal programs, bureau of labor statistics. Lecture Notes Stat. 108, 1--212.
|
 |
19
|
|
|