Implementing an autonomic architecture for fault-tolerance in a wireless sensor network testbed for at-scale experimentation
Source
Symposium on Applied Computing
Proceedings of the 2008 ACM symposium on Applied computing
Fortaleza, Ceara, Brazil
SESSION: Autonomic computing
Pages 1670-1676
Year of Publication: 2008
ISBN: 978-1-59593-753-7
ABSTRACT
The wireless sensor networking (WSN) community has increasingly come to rely on experimentation with large-scale testbeds as a means of verifying protocols, middleware, and applications. These testbeds need to be highly available in order to support this community, but they are themselves complex and complex to manage, being prone to faults in hardware, software specification, and software implementation. In this paper we report on our experience in designing Kansei, a WSN testbed for experimentation at scale, to be autonomic, i.e., self-healing and self-managing. We implement autonomic management in Kansei through an architecture that consists of a hierarchy of self-contained components, extended with detectors for discovering faults and correctors for subsequent stabilization. We find that our invariant-based architecture is well suited for large, complex systems with an unpredictable fault model, and that its fault monitoring framework can be extended to include user programs.
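The detector/corrector idea described in the abstract can be illustrated with a minimal sketch. This is not Kansei's implementation: the `Component` class, its fields, the `stabilize` method, and the example component names and state keys are all hypothetical, chosen only to show how a hierarchy of components might each carry an invariant (checked by a detector) and a corrector that restores it.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List

State = Dict[str, object]


@dataclass
class Component:
    """A self-contained component guarded by an invariant (hypothetical model)."""
    name: str
    state: State
    invariant: Callable[[State], bool]   # detector: True when the state is healthy
    corrector: Callable[[State], None]   # corrector: mutates state toward the invariant
    children: List["Component"] = field(default_factory=list)

    def stabilize(self) -> List[str]:
        """Check children first, then this component; return names of repaired components."""
        repaired: List[str] = []
        for child in self.children:
            repaired.extend(child.stabilize())
        if not self.invariant(self.state):
            self.corrector(self.state)
            if not self.invariant(self.state):
                raise RuntimeError(f"{self.name}: corrector failed to restore invariant")
            repaired.append(self.name)
        return repaired


def restart_daemon(state: State) -> None:
    # Assumed corrective action: bring a stopped daemon back up.
    state["daemon_running"] = True


# Hypothetical two-level hierarchy: a gateway that manages one sensor node.
node = Component(
    name="node-1",
    state={"daemon_running": False},          # injected fault
    invariant=lambda s: s["daemon_running"] is True,
    corrector=restart_daemon,
)
gateway = Component(
    name="gateway",
    state={"queue_len": 3},
    invariant=lambda s: s["queue_len"] <= 10,
    corrector=lambda s: s.update(queue_len=0),
    children=[node],
)

repaired = gateway.stabilize()
print(repaired)  # ['node-1']
```

Checking children before the parent is one possible design choice: a parent's invariant often depends on its children being healthy, so repairing bottom-up avoids correcting the parent for a fault that a child corrector would have resolved anyway.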