ACM Home Page
Please provide us with feedback. Feedback
Efficient estimation of joint queries from multiple OLAP databases
Full text PdfPdf (816 KB)
Source
ACM Transactions on Database Systems (TODS) archive
Volume 32 ,  Issue 1  (March 2007) table of contents
Article No. 2  
Year of Publication: 2007
ISSN:0362-5915
Authors
Elaheh Pourabbas  National Research Council, Rome, Italy
Arie Shoshani  Lawrence Berkeley National Laboratory, Berkeley, CA
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 12,   Downloads (12 Months): 107,   Citation Count: 1
Additional Information:

appendices and supplements   abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1206049.1206051
What is a DOI?

APPENDICES and SUPPLEMENTS
Online appendix to designing mediation for context-aware applications. The appendix supports the information on page 1.


ABSTRACT

Given an OLAP query expressed over multiple source OLAP databases, we study the problem of estimating the resulting OLAP target database. The problem arises when it is not possible to derive the result from a single database. The method we use is linear indirect estimation, commonly used for statistical estimation. We examine two obvious computational methods for computing such a target database, called the full cross-product (F) and preaggregation (P) methods. We study the accuracy and computational cost of these methods. While the F method provides a more accurate estimate, it is more expensive computationally than P. Our contribution is in proposing a third, new method, called the partial preaggregation method (PP), which is significantly less expensive than F, but just as accurate. We prove formally that the PP method yields the same results as the F method, and provide analytical and experimental results on the accuracy and computational benefits of the PP method.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
 
3
Chan, P. and Shoshani, A. 1981. SUBJECT: A directory driven system for organizing and accessing large statistical databases. In Proceedings of the 7th International Conference on Very Large Data Bases (VLDB) (Cannes, France), C. Zaniolo and C. Delobel Eds. IEEE Computer Society Press, Los Alamitos, CA. 553--563.
 
4
Chand, N. and Alexander, C. H. 1996. Small area estimation with administrative records and continuous measurement. In Proceedings of the Survey Research Methods Section, Annual Meeting of the American Statistical Association (ASA) (Chicago, IL). 870--875.
 
5
Codd, E. F., Codd, S. B., and Salley, C. T. 1993. Providing OLAP (on-line analytical processing) to user-analysts: An IT mandate. Tech. Rep., E.F. Codd and Associates.
 
6
Elliott, P., Cuzick, J., English, D., and Stern, R. 1996. Geographical and Environmental Epidemiology: Methods for Small-Area Studies. Oxford University Press, New York.
 
7
 
8
Ghosh, M. and Rao, J. N. K. 1994. Small area estimation: An appraisal. Statis. Sci. 9, 55--93.
 
9
 
10
 
11
12
13
 
14
Pfeffermann, D. 2002. Small area estimation-New developments and directions. Int. Statis. Revi. 70, 1, 125--143.
 
15
Pourabbas, E. and Shoshani, A. 2003. Answering joint queries from multiple aggregate OLAP databases. In Data Warehousing and Knowledge Discovery, 5th International Conference (DaWaK), Y. Kambayashi et al., Eds. Lecture Notes Notes in Computer Science vol. 2737. Springer Verlag. 24--34.
 
16
Rao, J. N. K. 2003. Small Area Estimation. John Wiley, New York.
 
17
RTI International. 2007. RTI webpage. http://www.rti.org/page.cfm?nav=497.
 
18
Schaible, W. L. 1996. Indirect estimators in U.S. federal programs, bureau of labor statistics. Lecture Notes Stat. 108, 1--212.
19


Collaborative Colleagues:
Elaheh Pourabbas: colleagues
Arie Shoshani: colleagues