ACM Home Page
Please provide us with feedback. Feedback
Query sampling in DB2 Universal Database
Full text PdfPdf (98 KB)
Source International Conference on Management of Data archive
Proceedings of the 2004 ACM SIGMOD international conference on Management of data table of contents
Paris, France
SESSION: Industrial sessions: database internals - I table of contents
Pages: 839 - 843  
Year of Publication: 2004
ISBN:1-58113-859-8
Authors
Jarek Gryz  York University
Junjie Guo  York University
Linqi Liu  IBM Toronto Lab
Calisto Zuzarte  IBM Toronto Lab
Sponsor
SIGMOD: ACM Special Interest Group on Management of Data
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 12,   Downloads (12 Months): 55,   Citation Count: 1
Additional Information:

abstract   references   cited by   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1007568.1007664
What is a DOI?

ABSTRACT

Executing ad hoc queries against large databases can be prohibitively expensive. Exploratory analysis of data may not require exact answers to queries, however: results based on sampling the data are often satisfactory. Supporting sampling as a primitive SQL operator turns out to be difficult because sampling does not commute with many SQL operators.In this paper, we describe an implementation in IBM® DB2® Universal Database (UDB) of a sampling operator that commutes with some SQL operators. As a result, the query with the sampling operator always returns a random sample of the answers and in many cases runs faster than it would have without such an operator.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
2
3
 
4
5
6
 
7
8
 
9
10
11
12
 
13
14
 
15
 
16
Frank Olken. Random Sampling from Databases. PhD thesis, University of California at Berkeley, 1993.
 
17
18
 
19
 
20
Transaction Processing Performance Council, 777 No. First Street, Suite 600, San Jose, CA 95112--6311, www.tpc.org. TPC Benchmark#8482;, 2.1.0 edition.
 
21

Collaborative Colleagues:
Jarek Gryz: colleagues
Junjie Guo: colleagues
Linqi Liu: colleagues
Calisto Zuzarte: colleagues