|
ABSTRACT
In the emerging area of sensor-based systems, a significant challenge is to develop scalable, fault-tolerant methods to extract useful information from the data the sensors collect. An approach to this data management problem is the use of sensor database systems, which allow users to perform aggregation queries such as MIN, COUNT, and AVG on the readings of a sensor network. In addition, more advanced queries such as frequency counting and quantile estimation can be supported. Due to energy limitations in sensor-based networks, centralized data collection is generally impractical, so most systems use in-network aggregation to reduce network traffic. However, even these aggregation strategies remain bandwidth-intensive when combined with the fault-tolerant, multipath routing methods often used in these environments. To avoid this expense, we investigate the use of approximate in-network aggregation using small sketches. We present duplicate-insensitive sketching techniques that can be implemented efficiently on small sensor devices with limited hardware support and we analyze both their performance and accuracy. Finally, we present an experimental evaluation that validates the effectiveness of our methods.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
 |
2
|
Noga Alon , Yossi Matias , Mario Szegedy, The space complexity of approximating the frequency moments, Proceedings of the twenty-eighth annual ACM symposium on Theory of computing, p.20-29, May 22-24, 1996, Philadelphia, Pennsylvania, United States
[doi> 10.1145/237814.237823]
|
| |
3
|
|
| |
4
|
Bawa, M., Garcia-Molina, H., Gionis, A., and Motwani, R. 2003. Estimating aggregates on a peer-to-peer network. Tech. rep., Stanford University.
|
| |
5
|
Chakrabarti, D., Leskovec, J., Faloutsos, C., Madden, S., Guestrin, C., and Faloutsos, M. 2007. Information survival threshold in sensor and p2p networks. In Proceedings of the Annual Joint Conference of the IEEE Computer and Communications Societies (InfoCom'07), 1316--1324.
|
| |
6
|
|
| |
7
|
|
| |
8
|
|
 |
9
|
|
 |
10
|
|
| |
11
|
Deligiannakis, A., Kotidis, Y., and Roussopoulos, N. 2004b. Hierarchical in-network data aggregation with quality guarantees. In Proceedings of the International Conference on Extending Database Technology (EDBT'04), 658--675.
|
| |
12
|
|
| |
13
|
Amol Deshpande , Carlos Guestrin , Samuel R. Madden , Joseph M. Hellerstein , Wei Hong, Model-driven data acquisition in sensor networks, Proceedings of the Thirtieth international conference on Very large data bases, p.588-599, August 31-September 03, 2004, Toronto, Canada
|
 |
14
|
|
| |
15
|
Devroye, L. 1986. Non-Uniform Random Variate Generation. Springer.
|
| |
16
|
Durand, M. and Flajolet, P. 2003. Loglog counting of large cardinalities. In Proceedings of the European Symposium on Algorithms (ESA'03), 605--617.
|
| |
17
|
|
| |
18
|
|
 |
19
|
|
 |
20
|
|
 |
21
|
|
| |
22
|
Garofalakis, M., Hellerstein, J. M., and Maniatis, P. 2007. Proof sketches: Verifiable in- network aggregation. In Proceedings of the International Conference on Data Engineering (ICDE'07), 996--1005.
|
 |
23
|
|
| |
24
|
Jim Gray , Surajit Chaudhuri , Adam Bosworth , Andrew Layman , Don Reichart , Murali Venkatrao , Frank Pellow , Hamid Pirahesh, Data Cube: A Relational Aggregation Operator Generalizing Group-By, Cross-Tab, and Sub-Totals, Data Mining and Knowledge Discovery, v.1 n.1, p.29-53, 1997
[doi> 10.1023/A:1009726021843]
|
 |
25
|
|
| |
26
|
Horton, M., Culler, D., Pister, K., Hill, J., Szewczyk, R., and Woo, A. 2002. Mica, the commercialization of microsensor motes. IEEE Sensors J. 19, 4, 40--48.
|
| |
27
|
|
 |
28
|
|
| |
29
|
|
| |
30
|
|
 |
31
|
|
 |
32
|
|
 |
33
|
|
 |
34
|
|
| |
35
|
Munro, J. I. and Paterson, M. 1980. Selection and sorting with limited storage. Theor. Comput. Sci. 12, 315--323.
|
 |
36
|
Suman Nath , Phillip B. Gibbons , Srinivasan Seshan , Zachary R. Anderson, Synopsis diffusion for robust aggregation in sensor networks, Proceedings of the 2nd international conference on Embedded networked sensor systems, November 03-05, 2004, Baltimore, MD, USA
[doi> 10.1145/1031495.1031525]
|
 |
37
|
|
| |
38
|
Papoulis, A. 1965. Probability, Random Variables, and Stochastic Processes. McGraw-Hill, New York.
|
 |
39
|
|
| |
40
|
|
 |
41
|
Nisheeth Shrivastava , Chiranjeeb Buragohain , Divyakant Agrawal , Subhash Suri, Medians and beyond: new aggregation techniques for sensor networks, Proceedings of the 2nd international conference on Embedded networked sensor systems, November 03-05, 2004, Baltimore, MD, USA
[doi> 10.1145/1031495.1031524]
|
| |
42
|
Stoev, S., Hadjieleftheriou, M., Kollios, G., and Taqqu, M. S. 2007. Norm, point, and distance estimation over multiple signals using max-stable distributions. In Proceedings of the International Conference on Data Engineering (ICDE'07), 1006--1015.
|
| |
43
|
U.S. Census Bureau. 2009.TIGER/Line datasets. http://www.census.gov/geo/www/tiger/.
|
 |
44
|
|
 |
45
|
|
| |
46
|
Yao, Y. and Gehrke, J. 2003. Query processing in sensor networks. In Proceedings of the Biennial Conference on Innovative Data Systems Research (CIDR'03).
|
| |
47
|
|
| |
48
|
Zhao, J., Govindan, R., and Estrin, D. 2003. Computing aggregates for monitoring wireless sensor networks. In Proceedings of the IEEE International Workshop on Sensor Network Protocols and Applications (SNPA'03),139--148.
|
|