ACM Home Page
Please provide us with feedback. Feedback
A stratified approach to progressive approximate joins
Full text PdfPdf (285 KB)
Source ACM International Conference Proceeding Series; Vol. 261 archive
Proceedings of the 11th international conference on Extending database technology: Advances in database technology table of contents
Nantes, France
SESSION: Research sessions: Join processing table of contents
Pages 582-593  
Year of Publication: 2008
ISBN:978-1-59593-926-5
Authors
Wee Hyong Tok  National University of Singapore
Stéphane Bressan  National University of Singapore
Mong-Li Lee  National University of Singapore
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 7,   Downloads (12 Months): 42,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1353343.1353414
What is a DOI?

ABSTRACT

Users often do not require a complete answer to their query but rather only a sample. They expect the sample to be either the largest possible or the most representative (or both) given the resources available. We call the query processing techniques that deliver such results 'approximate'. Processing of queries to streams of data is said to be 'progressive' when it can continuously produce results as data arrives. In this paper, we are interested in the progressive and approximate processing of queries to data streams when processing is limited to main memory. In particular, we study one of the main building blocks of such processing: the progressive approximate join. We devise and present several novel progressive approximate join algorithms. We empirically evaluate the performance of our algorithms and compare them with algorithms based on existing techniques. In particular we study the trade-off between maximization of throughput and maximization of representativeness of the sample.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
2
3
 
4
5
 
6
W. G. Cochran. Sampling Techniques, 3rd Edition. John Wiley, 1977.
7
 
8
 
9
10
 
11
C. J. Hahn, S. G. Warren, and J. London. Edited synoptic cloud reports from ships and land stations over the globe, 1982--1991, http://cdiac.esd.ornl.gov/ftp/ndp026b, 1996.
 
12
J. Lin. Divergence measures based on the shannon entropy. IEEE Transactions on Information Theory, 37(1):145--151, 1991.
 
13
 
14
 
15
F. Olken. Random Sampling from Databases. Ph.D. dissertation, Computer Science, University of California, 1993.
 
16
17
 
18
W. H. Tok, S. Bressan, and M.-L. Lee. RRPJ : Result-rate based progressive relational join. In DASFAA, pages 43--54, 2007.
 
19
T. Urhan and M. J. Franklin. XJoin: Getting fast answers from slow and bursty networks. Technical Report CS-TR-3994, University of Maryland, 1999.
20

Collaborative Colleagues:
Wee Hyong Tok: colleagues
Stéphane Bressan: colleagues
Mong-Li Lee: colleagues