|
ABSTRACT
Processable Bulk Data Transfer (PBDT) tasks are resource intensive concurrent tasks which involve transfer of a very large amount of data that has to be processed in some way before it can be used at a remote set of destination nodes called the sink nodes. A distributed computing environment, such as the Grid is a popular way to perform these tasks. Focusing on the execution of PBDT tasks in a Grid computing environment, this paper presents an efficient resource allocation mechanism. Due to the resource thirsty nature of these tasks, an efficient resource allocation is essential to perform these tasks while achieving satisfactory performance. The time-complexity of the resource allocation algorithm rises sharply as the available number of resources in the given Grid computing environment is increased making efficient resource allocation a challenge. To meet this challenge, this paper investigates the use of approximate algorithms for the resource allocation. The benefits obtained by using the reduced complexity of the algorithm are weighed against the increased costs incurred during the task execution (due to the inaccuracies in resource allocation introduced by the approximations). This paper describes a number of approximations and discusses under which circumstances such approximations are to be used. The techniques presented in this research can be extended to non-PBDT tasks and other distributed computing environments.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
|
| |
3
|
|
| |
4
|
Ahmad I. and Majumdar S. Policies for Efficient Allocation of Grid Resources using a Bi-level Decision-making Architecture of "Processable" Bulk Data. Department of Systems and Computer Engineering, Carleton University, 2007.
|
| |
5
|
Ahmad I. and Majumdar S. An adaptive high performance architecture for "processable" bulk data transfers on a Grid. In 2nd International Conference on Broadband Networks (Broadnets). (3-7 Oct. 2005). IEEE, Boston, MA, USA, 2005, 1482--91.
|
| |
6
|
Ahmad I. and Majumdar S. Efficient Allocation of Grid Resources Using a Bi-level Decision-Making Architecture for "Processable" Bulk Data. On the Move to Meaningful Internet Systems 2007: CoopIS, DOA, ODBASE, GADA, and IS, (2007), 1313--1321.
|
| |
7
|
Allcock B., Chervenak A., Foster I., Kesselman C. and Livny M. Data Grid tools: enabling science on big distributed data. Journal of Physics: Conference Series, 16, 1 (2005), 571--5.
|
| |
8
|
Bunn J. and Newman H. Data-intensive Grids for high energy physics. In Berman G. and Hey E. eds.Grid Computing: Making the Global Infrastructure a Reality. John Wiley & Sons, Inc., New York, 2003.
|
| |
9
|
F. Berman , H. Casanova , A. Chien , K. Cooper , H. Dail , A. Dasgupta , W. Deng , J. Dongarra , L. Johnsson , K. Kennedy , C. Koelbel , B. Liu , X. Liu , A. Mandal , G. Marin , M. Mazina , J. Mellor-Crummey , C. Mendes , A. Olugbile , Jignesh M. Patel , D. Reed , Z. Shi , O. Sievert , H. Xia , A. YarKhan, New grid scheduling and rescheduling methods in the GrADS project, International Journal of Parallel Programming, v.33 n.2, p.209-229, June 2005
[doi> 10.1007/s10766-005-3584-4]
|
| |
10
|
Candler W. and Townsley R. A linear two-level programming problem, Computers & Operations Research, 9, 1 (1982), 59--76.
|
| |
11
|
Chvatal V. Linear Programming. W.H. Freeman and Company Press, 1980.
|
| |
12
|
Devpura A. Scheduling Parallel and Single Batch Machines to Minimize Total Weighted Tardiness. Ph.D. Dissertation, Computer Science Department, Arizona State University, 2003.
|
| |
13
|
Dimitris Bertsimas, John N. Tsitsiklis. Introduction to Linear Programming. Athena Scientific, Belmont, Massachusetts, 1998.
|
| |
14
|
Downey A. B. Lognormal and Pareto distributions in the Internet. Comput. Commun., 28, 7 (05/02 2005), 790--801.
|
| |
15
|
|
| |
16
|
Gzara F. Large scale integer programming: A novel solution method and application. McGill University, 2004.
|
| |
17
|
Jeong S., Chan-Hyun Youn and Hyoug-Jun Kim. Optimal file replication scheme (CO-RLS) for data Grids. In Anonymous 6th International Conference on Advanced Communication Technology. (9-11 Feb. 2004). IEEE, Phoenix Park, South Korea, 2004, 1055--9.
|
| |
18
|
Foster I., Kesselman C., Lee C., Lindell B., Nahrstedt K. and Roy A. A distributed resource management architecture that supports advance reservations and co-allocation. In Anonymous Proceedings of IWQoS'99 - Seventh International Workshop on Quality of Service. (31 May-4 June 1999). IEEE, London, UK, 1999, 27--36.
|
| |
19
|
M. Elayeb. Efficient Data Scheduling For Real-Time Large Scale Data Intensive Distributed Applications. (Masters Dissertation, The Ohio State University).
|
| |
20
|
Mathur K. and Puri M. C. A bilevel bottleneck programming problem. European Journal of Operational Research, 86, 2 (10/19 1995), 337--344.
|
| |
21
|
|
| |
22
|
|
| |
23
|
|
| |
24
|
Venugopal S. Scheduling Distributed Data-Intensive Applications on Global Grids. Ph.D. Dissertation, Department of Computer Science and Software Engineering, The University of Melbourne, 2006.
|
| |
25
|
|
| |
26
|
Jing Wu , Michel Savoie , Scott Campbell , Hanxi Zhang , Gregor V. Bochmann , Bill St. Arnaud, Customer-managed end-to-end lightpath provisioning, International Journal of Network Management, v.15 n.5, p.349-362, September 2005
[doi> 10.1002/nem.581]
|
| |
27
|
Yang K., Galis A. and Todd C. Policy-based active Grid Allocation architecture. In Towards Network Superiority. (27-30 Aug. 2002). IEEE, Singapore, 2002, 243--8.
|
|