ACM Home Page
Please provide us with feedback. Feedback
Power-conserving computation of order-statistics over sensor networks
Full text PdfPdf (235 KB)
Source Symposium on Principles of Database Systems archive
Proceedings of the twenty-third ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems table of contents
Paris, France
SESSION: Data streams table of contents
Pages: 275 - 285  
Year of Publication: 2004
ISBN:158113858X
Authors
Michael B. Greenwald  University of Pennsylvania, Philadelphia, PA
Sanjeev Khanna  University of Pennsylvania, Philadelphia, PA
Sponsors
SIGACT: ACM Special Interest Group on Algorithms and Computation Theory
SIGMOD: ACM Special Interest Group on Management of Data
SIGART: ACM Special Interest Group on Artificial Intelligence
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 6,   Downloads (12 Months): 46,   Citation Count: 21
Additional Information:

abstract   references   cited by   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1055558.1055597
What is a DOI?

ABSTRACT

We study the problem of power-conserving computation of order statistics in sensor networks. Significant power-reducing optimizations have been devised for computing simple aggregate queries such as COUNT, AVERAGE, or MAX over sensor networks. In contrast, aggregate queries such as MEDIAN have seen little progress over the brute force approach of forwarding all data to a central server. Moreover, battery life of current sensors seems largely determined by communication costs --- therefore we aim to minimize the number of bytes transmitted. Unoptimized aggregate queries typically impose extremely high power consumption on a subset of sensors located near the server. Metrics such as total communication cost underestimate the penalty of such imbalance: network lifetime may be dominated by the worst-case replacement time for depleted batteries.In this paper, we design the first algorithms for computing order-statistics such that power consumption is balanced across the entire network. Our first main result is a distributed algorithm to compute an ε-approximate quantile summary of the sensor data such that each sensor transmits only O(log2 n/ε) data values, irrespective of the network topology, an improvement over the current worst-case behavior of Ω(n). Second, we show an improved result when the height, h, of the network is significantly smaller than n. Our third result is that we can exactly compute any order statistic (e.g., median) in a distributed manner such that each sensor needs to transmit O(log3 n) values.Further, we design the aggregates used by our algorithms to be decomposable. An aggregate Q over a set S is decomposable if there exists a function, f, such that for all S = S1S2, Q(S) = f(Q(S1), Q(S2)). We can thus directly apply existing optimizations to decomposable aggregates that increase error-resilience and reduce communication cost.Finally, we validate our results empirically, through simulation. When we compute the median exactly, we show that, even for moderate size networks, the worst communication cost for any single node is several times smaller than the corresponding cost in prior median algorithms. We show similar cost reductions when computing approximate order-statistic summaries with guaranteed precision. In all cases, our total communication cost over the entire network is smaller than or equal to the total cost of prior algorithms.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Daniel Barbara, William DuMouchel, Christos Faloutsos, Peter J. Haas, Joseph M. Hellerstein, Yannis Ioannidis, H. V. Jagadish, Theodore Johnson, Raymond Ng, Viswanath Poosala, Kenneth A. Ross, and Kenneth C. Sevcik. The New Jersey Data Reduction Report. Data Engineering Bulletin, 20(4):3--45, December 1997.
 
2
 
3
Graham Cormode and S. Muthukrishnan. An improved data stream summary: The count-min sketch and its applications. In to appear in Proceedings of Latin American Theoretical Informatics (LATIN 2004), April 2004. Buenos Aires, Argentina (also available as DIMACS Tech Report TR-2003-20).
 
4
Crossbow Technology, Inc. Wireless sensor networks: MICA, MICA2, and MICA2DOT. http://www.xbow.com/Products/Wireless_Sensor_Networks.htm.
5
 
6
Deepak Ganesan, Sylvia Ratnasamy, Hanbiao Wang, and Deborah Estrin. Coping with irregular spatio-temporal sampling in sensor networks. In Proceedings of Second Annual Workshop on Hot Topics in Networking (HOTNETS-II), November 20--21 2003. Cambridge, MA.
7
 
8
9
 
10
Joseph M. Hellerstein, Wei Hong, Samuel Madden, and Kyle Stanek. Beyond average: Toward sophisticated sensing with queries. In Proceedings of 2nd International Workshop on Information Processing in Sensor Networks (IPSN '03), pages 63--79. (Lecture Notes in Computer Science 2634), Springer Verlag, April 2003. Palo Alto, CA.
11
 
12
13
14
 
15
 
16
Stephane Mallat. A Wavelet tour of signal processing. Academic Press, 2nd edition, 1999. London, UK.
17
 
18
Rajeev Motwani, Jennifer Widom, Arvind Arasu, Brian Babcock, Shivnath Babu, Mayur Datar, Gurmeet Singh Manku, Chris Olston, Justin Rosenstein, and Rohit Varma. Query processing, approximation, and resource management in a data stream management system. In CIDR, 2003.
 
19
J. I. Munro and M. S. Paterson. Selection and sorting with limited storage. Theoretical Computer Science, 12:315--323, 1980.
20
 
21
Yong Yao and Johannes Gehrke. Query processing for sensor networks. In CIDR, 2003.
 
22
Jerry Zhao, Ramesh Govindan, and Deborah Estrin. Computing aggregates for monitoring wireless sensor networks. In Proceedings of the 1st IEEE International Workshop on Sensor Net Protocols and Applications (SNPA 2003), May 11 2003. Anchorage, AK.

CITED BY  21
Collaborative Colleagues:
Michael B. Greenwald: colleagues
Sanjeev Khanna: colleagues