ACM Home Page
Please provide us with feedback. Feedback
DSQoS-distributed architecture providing QoS in summary warehouses
Full text PdfPdf (306 KB)
Source Data Warehousing and OLAP archive
Proceedings of the 6th ACM international workshop on Data warehousing and OLAP table of contents
New Orleans, Louisiana, USA
SESSION: XML and architecture table of contents
Pages: 41 - 47  
Year of Publication: 2003
ISBN:1-58113-727-3
Authors
João Pedro Costa  Instituto Superior de Engenharia de Coimbra, Coimbra, Portugal
Pedro Furtado  Universidade de Coimbra, Coimbra, Portugal
Sponsors
SIGIR: ACM Special Interest Group on Information Retrieval
ACM: Association for Computing Machinery
SIGMIS: ACM Special Interest Group on Management Information Systems
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 5,   Downloads (12 Months): 40,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/956060.956068
What is a DOI?

ABSTRACT

Data warehouses (DW) that store enormous quantities of data put a major challenge in what concerns performance and scalability, as users request instant answers to their queries. Traditional solutions rely on very expensive architectures and structures for speedup and scale-up. The Summary Warehouse (SW) is an inexpensive solution that has the potential to deliver very fast approximate answers to aggregate queries using only general-purpose sampling summaries.Although summaries are expected to be extremely fast, some analysis requires larger summaries to estimate individual group results, compromising the speedup advantage. This is the accuracy/speedup (A/S) tradeoff.In this paper we propose the "Distributed Set-of-Summaries for Quality of Service" (DSQoS) that solves the A/S issue by optimizing the accuracy and response time for each query pattern in order to guarantee a desired Quality of Service (QoS). This QoS is defined in terms of response time and accuracy bounds. The strategy determines the required summary size to guarantee the accuracy targets and then dynamically select a set of summaries, distributed in various nodes, which can ensure the QoS constraints (time and accuracy). The strategy presents enormous possibilities since each node can contain summaries with different sizes, depending on the node characteristics, and can dynamically be added and removed from the system.We discuss the design of the approach and the strategies used to process queries. In the experimental section we show how the approach is able to deliver almost instant and accurate answers without employing expensive architectures, which would be impossible using other strategies.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
2
 
3
 
4
 
5
 
6
Pedro Furtado, João Pedro Costa: "Distributed Summaries: Fast Accurate Summary Warehouses" IDEAS 2003, July 2003.
7
 
8
P. B. Gibbons and Y. Matias. "AQUA: System and Techniques for Approximate Query Answering". Bell Labs TR 1998.
 
9
10
 
11
TPC Benchmark H, Transaction Processing Council, June 1999. Available at http://www.tpc.org/
12
13

Collaborative Colleagues:
João Pedro Costa: colleagues
Pedro Furtado: colleagues