ACM Home Page
Please provide us with feedback. Feedback
Identifying frequent items in sliding windows over on-line packet streams
Full text PdfPdf (212 KB)
Source Internet Measurement Conference archive
Proceedings of the 3rd ACM SIGCOMM conference on Internet measurement table of contents
Miami Beach, FL, USA
SESSION: Algorithms table of contents
Pages: 173 - 178  
Year of Publication: 2003
ISBN:1-58113-773-7
Authors
Lukasz Golab  University of Waterloo
David DeHaan  University of Waterloo
Erik D. Demaine  M.I.T
Alejandro Lopez-Ortiz  University of Waterloo
J. Ian Munro  University of Waterloo
Sponsors
SIGCOMM: ACM Special Interest Group on Data Communication
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 6,   Downloads (12 Months): 53,   Citation Count: 13
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/948205.948227
What is a DOI?

ABSTRACT

Internet traffic patterns are believed to obey the power law, implying that most of the bandwidth is consumed by a small set of heavy users. Hence, queries that return a list of frequently occurring items are important in the analysis of real-time Internet packet streams. While several results exist for computing frequent item queries using limited memory in the infinite stream model, in this paper we consider the limited-memory sliding window model. This model maintains the last $N$ items that have arrived at any given time and forbids the storage of the entire window in memory. We present a deterministic algorithm for identifying frequent items in sliding windows defined over real-time packet streams. The algorithm uses limited memory, requires constant processing time per packet (amortized), makes only one pass over the data, and is shown to work well when tested on TCP traffic logs.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
2
3
4
 
5
 
6
7
 
8
9
10
 
11
 
12
G. S. Manku and R. Motwani. Approximate frequency counts over data streams. Proc. 28th Int. Conf. on Very Large Data Bases, pages 346--357, 2002.
 
13
 
14
 
15
M. Sullivan and A. Heybey. Tribeca: A system for managing large databases of network traffic. Proc. USENIX Annual Technical Conf., 1998.
 
16
Y. Zhu and D. Shasha. StatStream: Statistical monitoring of thousands of data streams in real time. Proc. 28th Int. Conf. on Very Large Data Bases, pages 358--369, 2002.

CITED BY  13

Collaborative Colleagues:
Lukasz Golab: colleagues
David DeHaan: colleagues
Erik D. Demaine: colleagues
Alejandro Lopez-Ortiz: colleagues
J. Ian Munro: colleagues