|
ABSTRACT
The Beowulf Distributed Process Space (BProc) is a set of Linux kernel modifications which provides a single system image and process migration facilities for processes running in a Beowulf style cluster. With BProc, all the processes running in a cluster are visible on the cluster front end machine and are controllable via existing UNIX process control mechanisms. Process creation is done on the front end machine and the processes are placed on the nodes where they will run with BProc's process migration mechanism.These two features combined greatly simplify creating and cleaning up parallel jobs as well as removing the necessity of a user login to remote nodes in the cluster. Removing the need for user logins drastically reduces the mount of software required on cluster nodes.Job startup with BProc's process migration mechanism is faster than the traditional method of logging into a node and starting the process with rsh. BProc does not affect file or network I/O of processes running on remote nodes so the vast majority of MPI applications will experience no performance loss as a result of being managed by BProc.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
Amnon Barak, Oren La'adan, and Amnon Shiloh. Scalable cluster computing with MOSIX for Linux. In Proceedings of the Linux Expo '99, pages 95--100, Raleigh, NC, May 1999.
|
| |
2
|
Greg Bruno and Philip M. Papadopoulos. NPACI Rocks: Tools and Techniques for Easily Deploying Manageable Linux Clusters. October 2001.
|
| |
3
|
Al Geist , Adam Beguelin , Jack Dongarra , Weicheng Jiang , Robert Manchek , Vaidy Sunderam, PVM: Parallel virtual machine: a users' guide and tutorial for networked parallel computing, MIT Press, Cambridge, MA, 1995
|
| |
4
|
|
| |
5
|
Miron Livny, Jim Basney, Rajesh Raman, and Todd Tannenbaum. Mechanisms for high throughput computing. SPEEDUP Journal, Vol. 11, No.1, June 1997.
|
| |
6
|
R.M. Stallman. GDB manual. Second edition, Free Software Foundation, Inc., February 1988.
|
| |
7
|
The Open Cluster Group. OSCAR: A packaged cluster software stack for high performance computing. January 2001.
|
CITED BY 6
|
|
|
|
|
|
Richard L. Graham , Sung-Eun Choi , David J. Daniel , Nehal N. Desai , Ronald G. Minnich , Craig E. Rasmussen , L. Dean Risinger , Mitchel W. Sukalski, A network-failure-tolerant message-passing system for terascale clusters, Proceedings of the 16th international conference on Supercomputing, June 22-26, 2002, New York, New York, USA
|
|
|
Eitan Frachtenberg , Fabrizio Petrini , Juan Fernandez , Scott Pakin , Salvador Coll, STORM: lightning-fast resource management, Proceedings of the 2002 ACM/IEEE conference on Supercomputing, p.1-26, November 16, 2002, Baltimore, Maryland
|
|
|
Richard L. Graham , Sung-Eun Choi , David J. Daniel , Nehal N. Desai , Ronald G. Minnich , Craig E. Rasmussen , L. Dean Risinger , Mitchel W. Sukalski, A network-failure-tolerant message-passing system for terascale clusters, International Journal of Parallel Programming, v.31 n.4, p.285-303, August 2003
|
|
|
|
Peer to Peer - Readers of this Article have also read:
-
Improving the granularity of access control in Windows NT
Proceedings of the sixth ACM symposium on Access control models and technologies
Michael M. Swift
, Peter Brundrett
, Cliff Van Dyke
, Praerit Garg
, Anne Hopkins
, Shannon Chan
, Mario Goertzel
, Gregory Jensenworth
-
Efficient, DoS-resistant, secure key exchange for internet protocols
Proceedings of the 9th ACM conference on Computer and communications security
William Aiello
, Steven M. Bellovin
, Matt Blaze
, John Ioannidis
, Omer Reingold
, Ran Canetti
, Angelos D. Keromytis
-
Data structures for quadtree approximation and compression
Communications of the ACM
28, 9
Hanan Samet
-
A hierarchical single-key-lock access control using the Chinese remainder theorem
Proceedings of the 1992 ACM/SIGAPP Symposium on Applied computing
Kim S. Lee
, Huizhu Lu
, D. D. Fisher
-
The GemStone object database management system
Communications of the ACM
34, 10
Paul Butterworth
, Allen Otis
, Jacob Stein
|