|
ABSTRACT
Frequent elements and top-k queries constitute an important class of queries for data stream analysis applications. Certain applications require answers for both frequent elements and top-k queries on the same stream. In addition, the ever increasing data rates call for providing fast answers to the queries, and researchers have been looking towards exploiting specialized hardware for this purpose. Content Addressable Memory(CAM) provides an efficient way of looking up elements and hence are well suited for the class of algorithms that involve lookups. In this paper, we present a fast and efficient CAM conscious integrated solution for answering both frequent elements and top-k queries on the same stream. We call our scheme CAM conscious Space Saving with Stream Summary (CSSwSS), and it can efficiently answer continuous queries. We provide an implementation of the proposed scheme using commodity CAM chips, and the experimental evaluation demonstrates that not only does the proposed scheme outperforms existing CAM conscious techniques by an order of magnitude at query loads of about 10%, but the proposed scheme can also efficiently answer continuous queries.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
|
| |
2
|
N. Bandi, S. Schnieder, D. Agrawal, and A. E. Abbadi. Hardware Acceleration of Database operations using Content Addressable Memories. In DaMoN, 2005.
|
 |
3
|
Bishwaranjan Bhattacharjee , Naoki Abe , Kenneth Goldman , Bianca Zadrozny , Vamsavardhana R. Chillakuru , Marysabel del Carpio , Chid Apte, Using secure coprocessors for privacy preserving collaborative data mining and analysis, Proceedings of the 2nd international workshop on Data management on new hardware, June 25-25, 2006, Chicago, Illinois
[doi> 10.1145/1140402.1140404]
|
| |
4
|
|
| |
5
|
|
| |
6
|
|
| |
7
|
|
 |
8
|
Rui Fang , Bingsheng He , Mian Lu , Ke Yang , Naga K. Govindaraju , Qiong Luo , Pedro V. Sander, GPUQP: query co-processing using graphics processors, Proceedings of the 2007 ACM SIGMOD international conference on Management of data, June 11-14, 2007, Beijing, China
[doi> 10.1145/1247480.1247606]
|
| |
9
|
M. Fischer and S. Salzberg. Finding a majority among n votes: Solution to problem 81--5. In Journal of Algorithms 3(4), pages 376--379, 1982.
|
 |
10
|
|
 |
11
|
|
| |
12
|
N. Hardavellas, I. Pandis, R. Johnson, N. G. Mancheril, A. Ailamaki, and B. Falsafi. Database Servers on Chip Multiprocessors: Limitations and Opportunities. In CIDR, pages 79--87, 2007.
|
| |
13
|
Integrated Devices Technologies, Integrated IP Co-processor IDT 75K62134. http://idt.com/?genID=75K62134&source=products_genericPart_75K62134, 2006.
|
| |
14
|
Intel Internet Exchange Architecture for Network Processors. Technical report, Intel Corp., 2002.
|
| |
15
|
Intel IXP 2800 Network Processor Product Brief. http://www.intel.com/design/network/prodbrf/279054.htm, 2006.
|
| |
16
|
|
 |
17
|
|
 |
18
|
|
| |
19
|
R. Panigrahy and D. Thomas. Finding Frequent Elements in Non-Bursty Streams. In ESA, pages 53--62, October 2007.
|
| |
20
|
|
| |
21
|
Teja networking systems and teja np application development platform. http://www.teja.com, 2006.
|
| |
22
|
S. Venkataraman, D. Song, P. Gibbons, and A. Blum. New streaming algorithms for fast detection of superspreaders. In NDSS, 2005.
|
| |
23
|
|
| |
24
|
G. K. Zipf. Human Behavior and The Principle of Least Effort. Addison-Wesley, Cambridge, MA, 1949.
|
 |
25
|
|
|