ACM Home Page
Please provide us with feedback. Feedback
Approximate quantiles and the order of the stream
Full text PdfPdf (192 KB)
Source Symposium on Principles of Database Systems archive
Proceedings of the twenty-fifth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems table of contents
Chicago, IL, USA
SESSION: Stream algorithms and complexity table of contents
Pages: 273 - 279  
Year of Publication: 2006
ISBN:1-59593-318-2
Authors
Sudipto Guha  University of Pennsylvania, Philadelphia, PA
Andrew McGregor  University of Pennsylvania, Philadelphia, PA
Sponsors
SIGMOD: ACM Special Interest Group on Management of Data
SIGART: ACM Special Interest Group on Artificial Intelligence
ACM: Association for Computing Machinery
SIGACT: ACM Special Interest Group on Algorithms and Computation Theory
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 10,   Downloads (12 Months): 41,   Citation Count: 7
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1142351.1142390
What is a DOI?

ABSTRACT

Recently, there has been an increased focus on modeling uncertainty by distributions. Suppose we wish to compute a function of a stream whose elements are samples drawn independently from some distribution. The distribution is unknown, but the order in which the samples are presented to us will not be completely adversarial. In this paper, we investigate the importance of the ordering of a data stream, without making any assumptions about the actual distribution of the data. Using quantiles as an example application, we show that we can design provably better algorithms, and settle several open questions on the impact of order on streams. With the recent impetus in the investigation of models for sensor networks, we believe that our approach will allow the construction of novel and significantly improved algorithms.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
A. Arasu, B. Babcock, S. Babu, M. Datar, K. Ito, R. Motwani, I. Nishizawa, U. Srivastava, D. Thomas, R. Varma, and J. Widom. Stream: The Stanford stream data manager. IEEE Data Engineering Bulletin, 26(1):19--26, 2003.
3
 
4
5
 
6
 
7
A. Deshpande, C. Guestrin, S. Madden, J. M. Hellerstein, and W. Hong. Model-driven data acquisition in sensor networks. Proc. of VLDB Conference (Best Paper), pages 588--599, 2004.
 
8
 
9
10
 
11
A. C. Gilbert, Y. Kotidis, S. Muthukrishnan, and M. Strauss. How to summarize the universe: Dynamic maintenance of quantiles. In Proc. 28th International Conference on Very Large Data Bases, pages 454--465, 2002.
12
13
 
14
15
 
16
 
17
M. R. Henzinger, P. Raghavan, and S. Rajagopalan. Computing on data streams. Technical Report 1998-001, DEC Systems Research Center, 1998.
18
 
19
S. Kannan. Open problems in streaming. PPT Slides, (request source).
20
21
22
 
23
J. Munro and M. Paterson. Selection and sorting with limited storage. Theoretical Computer Science, 12:315--323, 1980.
 
24
S. Muthukrishnan. Data streams: Algorithms and applications. Survey available on request at muthu@research.att.com, 2003.

CITED BY  7

Collaborative Colleagues:
Sudipto Guha: colleagues
Andrew McGregor: colleagues