ABSTRACT
Scalable management and self-organizational capabilities are
emerging as central requirements for a generation of large-scale,
highly dynamic, distributed applications. We have developed an
entirely new distributed information management system called
Astrolabe. Astrolabe collects large-scale system state, permitting
rapid updates and providing on-the-fly attribute aggregation. The
latter capability lets an application locate a resource and
also offers a scalable way to track system state as it evolves over
time. The combination of features makes it possible to solve a wide
variety of management and self-configuration problems. This paper
describes the design of the system with a focus on its
scalability. After describing the Astrolabe service, we present
examples of the use of Astrolabe for locating resources,
publish-subscribe, and distributed synchronization in large
systems. Astrolabe is implemented using a peer-to-peer protocol,
and uses a restricted form of mobile code based on the SQL query
language for aggregation. This protocol gives rise to a novel
consistency model. Astrolabe addresses several security
considerations using a built-in PKI. The scalability of the system
is evaluated using both simulation and experiments; these confirm
that Astrolabe could scale to thousands and perhaps millions of
nodes, with information propagation delays in the tens of seconds.
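
The on-the-fly attribute aggregation mentioned above can be illustrated with a minimal sketch. It assumes a hypothetical tree of zones whose leaves are hosts, and uses a plain Python function as a stand-in for an SQL aggregation query; the names Zone, aggregate, load, and nmembers are illustrative only and are not taken from Astrolabe. The sketch shows only the bottom-up summarization step, not the peer-to-peer protocol that would propagate the summaries.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class Zone:
    """A node in a hypothetical zone hierarchy; leaves represent hosts."""
    name: str
    attrs: Dict[str, float] = field(default_factory=dict)  # leaf attributes
    children: List["Zone"] = field(default_factory=list)


def aggregate(zone: Zone) -> Dict[str, float]:
    """Summarize a zone bottom-up.

    Stands in for a query of the form
    SELECT MIN(load) AS load, SUM(nmembers) AS nmembers
    applied to the rows of the child zones (an assumed example query).
    """
    if not zone.children:
        # A leaf contributes its own load and counts as one member.
        return {"load": zone.attrs["load"], "nmembers": 1}
    rows = [aggregate(child) for child in zone.children]
    return {
        "load": min(row["load"] for row in rows),
        "nmembers": sum(row["nmembers"] for row in rows),
    }


root = Zone("root", children=[
    Zone("east", children=[Zone("a", {"load": 0.7}), Zone("b", {"load": 0.2})]),
    Zone("west", children=[Zone("c", {"load": 0.5})]),
])
summary = aggregate(root)
```

Under these assumptions, each internal zone holds only a small summary of its children, so locating the least-loaded resource means following the minimum down the tree rather than scanning every host.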
INDEX TERMS
Primary Classification:
C. Computer Systems Organization
C.2 COMPUTER-COMMUNICATION NETWORKS
C.2.1 Network Architecture and Design
Subjects: Network communications
Additional Classification:
C. Computer Systems Organization
C.2 COMPUTER-COMMUNICATION NETWORKS
C.2.4 Distributed Systems
Subjects: Distributed applications
D. Software
D.1 PROGRAMMING TECHNIQUES
D.1.3 Concurrent Programming
Subjects: Distributed programming
D.4 OPERATING SYSTEMS
D.4.4 Communications Management
Subjects: Network communication
D.4.5 Reliability
Subjects: Fault-tolerance
D.4.6 Security and Protection
Subjects: Authentication
D.4.7 Organization and Design
Subjects: Distributed systems
H. Information Systems
H.3 INFORMATION STORAGE AND RETRIEVAL
H.3.3 Information Search and Retrieval
Subjects: Information filtering
H.3.4 Systems and Software
Subjects: Distributed systems
H.3.5 On-line Information Services
Subjects: Data sharing
General Terms: Algorithms, Design, Management, Performance, Reliability, Security
Keywords: Aggregation, epidemic protocols, failure detection, gossip, membership, publish-subscribe, scalability