ACM Home Page
Please provide us with feedback. Feedback
Implementing an autonomic architecture for fault-tolerance in a wireless sensor network testbed for at-scale experimentation
Full text PdfPdf (608 KB)
Source Symposium on Applied Computing archive
Proceedings of the 2008 ACM symposium on Applied computing table of contents
Fortaleza, Ceara, Brazil
SESSION: Autonomic computing table of contents
Pages 1670-1676  
Year of Publication: 2008
ISBN:978-1-59593-753-7
Authors
Mukundan Sridharan  The Ohio State University, Columbus, OH
Sandip Bapat  The Ohio State University, Columbus, OH
Rajiv Ramnath  The Ohio State University, Columbus, OH
Anish Arora  The Ohio State University, Columbus, OH
Sponsor
SIGAPP: ACM Special Interest Group on Applied Computing
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): n/a,   Downloads (12 Months): n/a,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1363686.1364082
What is a DOI?

ABSTRACT

The wireless sensor networking (WSN) community has increasingly grown to rely on experimentation with large-scale test-beds as a means of verifying protocols, middleware and applications. These testbeds need to be highly available in order to support this community, but are themselves complex, and complex to manage, being prone to faults in hardware, software specification and software implementation. In this paper we report on our experience in designing Kansei, a WSN testbed for experimentation at scale, to be autonomic - i.e. self-healing and self-managing. We implement autonomic management in Kansei through an architecture that consists of a hierarchy of self-contained components, extended with detectors for discovering faults and correctors for subsequent stabilization. We find that our invariant based architecture is well suited for large complex systems with unpredictable fault model and its fault monitoring framework can be extended to include user programs.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
J. Heidemann, N. Bulusu, J. Elson, C. Intanagonwiwat, K. Lan, Y. Xu, W. Ye, D. Estrin, and R. Govindan, "Effects of detail in wireless network simulation", SCS Multi-conference on Distributed Simulation, 2001.
2
 
3
K. Pawlikowski, H.-D. J. Jeong, and J.-S. R. Lee, "On credibility of simulation studies of telecommunication networks", IEEE Communications Magazine, vol. 40, 2002.
 
4
"Kansei: A Sensor Testbed for At-Scale Experiments", http://ceti.cse.ohio-state.edu/kansei/.
 
5
"TinyOS: An Operating System for Wireless Embedded Sensor Network", http://tinyos.net
6
 
7
 
8
 
9
A, Arora and M. Theimer, "On modeling and tolerating incorrect software", Technical Report, MSR-TR-003-7, Microsoft Research, 2003.
 
10
 
11
S. Bapat, W. Leal, T. Kwon, P. Wei, and A. Arora, "Chowkidar: A health monitor for Wireless Sensor Networks", TridentCom, 2007.
 
12
B. Randell, "System Structure for Software Fault-Tolerance", IEEE Transactions on Software Engineering, No. 2, 1975.
 
13
 
14
J. Gray, "Why Do Computers Stop and What Can We Do About It", 6th International Conference on Reliability and Distributed Databases, 1987.

Collaborative Colleagues:
Mukundan Sridharan: colleagues
Sandip Bapat: colleagues
Rajiv Ramnath: colleagues
Anish Arora: colleagues