SlideShare a Scribd company logo
1 of 11
Download to read offline
International journal on applications of graph theory in wireless ad hoc networks and sensor networks
(GRAPH-HOC) Vol.2, No.3, September 2010
DOI : 10.5121/jgraphoc.2010.2302 15
THE IMPACT OF DATA REPLICATION ON JOB
SCHEDULING PERFORMANCE IN HIERARCHICAL
DATA GRID
Somayeh Abdi1
, Somayeh Mohamadi2
1
Department of Computer Engineering, Eslamabad Gharb branch Islamic Azad
University, Eslamabad Gharb,Kermanshah, Iran
Somayeh.abdi@gmail.com
1
Department of Computer Engineering, Ghasre Shirin branch Islamic Azad University,
Ghasre Shirin, Kermanshah, Iran
s.mohamadi@gmail.com
ABSTRACT
In data-intensive applications data transfer is a primary cause of job execution delay. Data access time
depends on bandwidth. The major bottleneck to supporting fast data access in Grids is the high latencies
of Wide Area Networks and Internet. Effective scheduling can reduce the amount of data transferred
across the internet by dispatching a job to where the needed data are present. Another solution is to use a
data replication mechanism. Objective of dynamic replica strategies is reducing file access time which
leads to reducing job runtime. In this paper we develop a job scheduling policy and a dynamic data
replication strategy, called HRS (Hierarchical Replication Strategy), to improve the data access
efficiencies. We study our approach and evaluate it through simulation. The results show that our
algorithm has improved 12% over the current strategies.
KEYWORDS
Grid, Data Grid, Job Scheduling, Data Replication, Simulation
1. INTRODUCTION
In the increasing demand of scientific and large-scale business application, a large amount of
data are generated and spread for using by users around the world. Many good examples can be
listed such as High Energy Physics, meteorology, computational genomics which processes and
results large amount of data. Such data cannot be stored centralized in any site but distributed
among centre around the world [1]. Data grid tries to store this data in decentralize sites and
then for each application retrieves it from these sites. Data grid as an important branch of grid
computing focuses on supporting an efficient management mechanism for controlled sharing
and large amounts of distributed data. In Data Grid, for each incoming job, the Grid scheduler
decides where to run the job based on the job requirements and the system status. Scheduling
jobs to suitable grid sites is necessary because data movement between different grid sites is
time consuming. If a job is scheduled to a site where the required data are present, the job can
process data in this site without any transmission delay for getting data from a remote site.
International journal on applications of graph theory in wireless ad hoc networks and sensor networks
(GRAPH-HOC) Vol.2, No.3, September 2010
16
Data replication is an important optimization step to manage large data by replicating data in
geographically distributed data stores. When users’ jobs access a large amount of data from
remote sites, dynamic replica optimizer running in the site tries to store replicas on local storage
for future possible repeated requests.
We proposed a new job scheduling algorithm and data replication policy that reduce job
execution time by reducing job data access time. Our new scheduling policy considers the
locations of required data and the job queue length of a computing node. We develop a
replication strategy, called HRS (Hierarchical Replication Strategy). It takes into account
bandwidth as an important factor for replica selection and replica placement. It also increases
the chances of accessing data at a nearby node.
The rest of the paper is organized as follows, in section 2 we present a summary of existing and
related work. In section 3, a hierarchical structure is proposed for data grid based on
classification of networks, along with a scheduling algorithm and replication algorithm for this
structure, Section 4 describes our experiments and the results achieved followed by conclusion
in section 5.
2. RELATED WORK
There are some recent works that address the problem of scheduling and/ or replication in Data
Grid as well as the combination between them.
In [3], it considers two centralized and decentralized replication algorithms. In centralized
method, replica master uses a table that ranks each file access in descending order. If a file
access is less than the average, it will be removed from the table. Then it pop files from top and
replicates using a response-time oriented replica placement algorithm. In the decentralized
method, every site records file access in its table and exchange this table with neighbours. Since
every domain knows average number of access for each file and then deletes those files whose
access is less than the average, and replicates other files in its local storage.
In [5], an algorithm for a two-level hierarchical structure based on internet hierarchy (BHR) has
been introduced which only considers dynamic replication and does not consider scheduling.
Nodes in the first level are connected to each other with high speed networks and in the second
level via internet. The algorithm replicates the file to the site if there is enough space. Next it,
accesses the file remotely if the file is available in the sites that are in the same region.
Otherwise it tries to make available space by deleting files using LRU (Least Recently Used)
method, and replicates the file. It assumes that master site always has a safe copy of file before
deleting.
In [6], a structure with few networks connected via internet has been presented and an algorithm
similar to [5], along with scheduling is proposed. For replicating a file, first computes the total
transfer time, then it selects the best node with shortest transfer time.
In [7] authors introduce dynamic replication placement (RP) that categorizes the data based on
their property. This category is used for job scheduling and replication. Then a job is allocated
to a site which has the file in the required category, this leads to reduce the cost for file transfer.
In [8] a Genetic Algorithm based co-scheduling of data and jobs of independent nature was
proposed; the GA is executed to converge to a schedule by looking at the jobs in the scheduler
queue as well as the replicated data objects at once. A performance overhead might be incurred
by the system as a result of delaying the replication of the data until the scheduling time. The
authors also use an objective function that assumes infinite availability of storage for all data
objects which is infeasible in a realistic grid setting.
International journal on applications of graph theory in wireless ad hoc networks and sensor networks
(GRAPH-HOC) Vol.2, No.3, September 2010
17
3. THE PROPOSED METHOD
In this section, we present hierarchical network structure and then we proposed two strategies
for job scheduling and data replication by considering the hierarchical network structure.
3.1. NETWORK STRUCTURE
The proposed Data Grid structure is shown in Figure 1. This structure has two levels. Regions
comprise the first level and sites that are located in a region comprise the second level. A region
represents an organization unit which is a group of sites that are geographically close to each
other, each region comprises the computers which are connected by a high bandwidth. We
define two kinds of communications between sites, inter-communication and intra-
communication. Intra-communication is the communication between sites within the same
region and inter-communication is the communication between sites across regions. Network
bandwidth between sites within a region will be larger than across regions.
In communication networks, the performance of system is underlying available network
bandwidth and data access latency, especially in networks that hierarchy of bandwidth appears.
Therefore, to reduce access latency and to avoid WAN bandwidth bottleneck, it is important to
reduce the number of inter-communications.
Data Grid Information Service (DGIS) providing resource registration services and keeping
track of a list of resources available in the Data Grid. The Grid Scheduler can query this for
resource contact, configuration, and status information; Resource discovery identify resources
that can be used with their capability through Data Grid Information Service.
In this structure we apply distributed dynamic data replication infrastructure. Replica Manager
at each site manages the data movement between sites. When a job is assigned to Local
Scheduler of site, those required data that do not exist in local site will be transfer to it. Data
request by the job is generated as soon as job is scheduled into Local Scheduler queue and
Replica Manager controls data movement at each site.
In proposed structure there is a centralized Replica Catalogue. It is responsible for indexing
available files on the resources and handles queries from users and resources about location of
replicas.
International journal on applications of graph theory in wireless ad hoc networks and sensor networks
(GRAPH-HOC) Vol.2, No.3, September 2010
18
Figure 1. Data Grid Architecture.
3.2. SCHEDULING ALGORITHM
Our new scheduling policy considers the locations of required data, computing capacity and
number of jobs that allocate on computing nodes. For efficient scheduling of any job, the
algorithm determines the best site and then submit job to Local scheduler. A best site is a site
that holds most of the requested files (from size point of view). This will significantly reduce
total transfer time, and reduce the job execution time.
Each job requires some replicas and the replicas is needed to execute job j are represented as: Rj
= {LFN1, LFN2,…., LFNn}. We assume that jS
is total size of the requested files available in
site j, Ci is computing capacity (in MIPS1) of site i and RelativeLoadi is relative load of site i.
1
. Million Instruction Per Second
International journal on applications of graph theory in wireless ad hoc networks and sensor networks
(GRAPH-HOC) Vol.2, No.3, September 2010
19
∑=
SsiteinLFNiavailableallfor
iS LFNS (1)
RelativeLoadi= SizeofJobsi / Ci (2)
SizeofJobsi is the lengths of queued jobs (in MIPS) on site i and iLFN present size of file
LFNi .
The scheduling algorithm can be summarized as follow:
1- Compute SC for each site from (1)
2- Select the best site i
q
i
SMAXS
1
max
=
= ; where q is the number of site (i.e. site with largest
available requested data files)
3- If there are several sites with most available data, select the site with minimum relative
load from (2).
When a job is assigned to local scheduler, the Replica Manager transfers all the requested files
that are not exist in local site. The objective of file replicating is to transferring required data to
local site before job execution. Therefore, data replication improves job scheduling performance
by reducing job execution time.
The proposed scheduling algorithm takes into consideration the requested data files.
Additionally, where there are the same data requirements in several sites (from size point of
view); it takes into consideration the computing capacity of sites with a view to reducing the
queue waiting time and relative load of sites.
3.3. HIERARCHICAL REPLICATION STRATEGY
After a job is scheduled to site Sj , the requested data will be transferred to Sj to become
replicas. HRS (hierarchical replication Strategy) determines wich replica will be transferred to
Sj and how to handle this new replica. HRS considers the bandwidth between the regions as the
main factor for replica selection / deletion.
When each site stores a new replica, replica manager sends a file register request to RC and then
RC adds this site to the list of sites that holds the replica. Replica Manager controls data
transferring in each site and provides a mechanism for accessing the Replica Catalogue. Data
Grid Architecture for Replica Catalogue and Replica Manager is shwon in in Figure 2.
International journal on applications of graph theory in wireless ad hoc networks and sensor networks
(GRAPH-HOC) Vol.2, No.3, September 2010
20
Figure 2. Data Grid Architecture for replica cataloge and replica manager.
For each required file of the job, Replica Manager controls the existence of the file in local site.
If file doesn’t exist in the local site HRS first searches the file in local region. If the file
duplicated in the same region, then it will creates a list of candidate replicas and selects a replica
with the maximum bandwidth available for transferring it. If there is enough space for new
replica, then stores it in the local site, otherwise it is only stored in the temporary buffer and will
be deleted after the job completes.
If the file doesn’t exit in the same region, then HRS creates a list of replicas in other regions and
selects the replica with the maximum bandwidth available for transferring it. If there is enough
space for new replica, then stores it in the local site, otherwise occupided space will be released
to make engough room for this new replica. First, it removes the replicas that already exist in
other sites in the same region based on LRU(least recently used) replacement algorithm. After
all these replicas are deleted, if the space is still insufficient, the HRS use LRU replacement
algorithm to delete replica in local storage which is duplicated in other regions , till it has
enough room for the new replica.
4. SIMULATIONS
Gridsim is used as the simulation tool to evaluate the performance of the proposed replication
and scheduling algorithms. The Java-based GridSim discrete event simulation toolkit provides
Java classes that represent entities essential for application, resource modelling, scheduling of
jobs to resources, and their execution along with management [9]. Its Java-based design makes
it portable and available on all computational platforms. The components of Gridsim are as
follow and also depicted in Figure 3.
International journal on applications of graph theory in wireless ad hoc networks and sensor networks
(GRAPH-HOC) Vol.2, No.3, September 2010
21
Figure 3. Simulator architecture
Resource Broker: It receives jobs from user and sends them to the best node according to
proposed algorithm; Storage Element (SE): Storage resource in grid; Computing Element (CE):
Computing resource in grid; Replica Manager:controls data transfering at each site; Replica
Catalogue: stores list of sites that holds replicas.
Based on the scheduling algorithm the broker sends jobs to a node. Each job needs a list of files
to run. Reducing file access time is the final objective of optimization algorithms.
4.1. Simulation Environment
There are four regions in our configuration and each region has an average of 13 sites, which all
have CE with associated SE. Table1 specifies the simulation parameters used in our study.
There are 5 job types; each job type requires 12 files to execute. While running, jobs were
randomly picked from 5 job types, then submitted to the Resource Broker. Files are accessed
sequentially within a job without any access pattern. To simplify the requirements, data
replication approaches in Data Grid environments commonly assume that the data is read-only.
International journal on applications of graph theory in wireless ad hoc networks and sensor networks
(GRAPH-HOC) Vol.2, No.3, September 2010
22
4.2. Simulation results and discussion
HRS will be compared with LRU (Least Recently Used) and BHR (Bandwidth Hierarchy based
Replication). The LRU algorithm always replicates and then deletes those files that have been
used least recently. Figure 4 shows the Average job time based on changing number of jobs for
3 algorithms. Figure 5 shows the Average job time for 1000 jobs by 3 mentioned algorithms.
HRS replication strategy uses the concept of “network locality” as BHR [5].
0
5000
10000
15000
20000
25000
100 200 300 400 500
Number of Jobs
AverageJobTime(Sec)
HRS
BHR
LRU
Figure 4. Average job Time based on varying number of jobs.
TABLE 1. Simulation parameters
valueTopology Parameters
4Number of region
13Number of sites in each region
10 GBStorage space at each site
10 MbpsConnectivity bandwidth(WAN)
1000MbpsConnectivity bandwidth(LAN)
valueJob parameters
500Number of jobs
5Number of job types
12Number of file accessed per job
500 MBSize of single file
50GBTotal size of files
International journal on applications of graph theory in wireless ad hoc networks and sensor networks
(GRAPH-HOC) Vol.2, No.3, September 2010
23
The difference between HRS and BHR is that required replica within the same region is
always the top priority used in HRS, while BHR searches all sites to find the best replica and
has no distinction between intra-region communication and inter-region comuuniation. It could
be anticipated that HRS will avoid inter-region communications and be stable in hierarchical
network architecture with variable bandwidth. Our method takes benefit from network level
locality of BHR. Thus, total job execution time is about 12% faster using HRS optimizer than
BHR.
0
10000
20000
30000
40000
50000
MeanJobsTime(sec)
1000
Number of Jobs
LRU
BHR
HRS
Figure 5. Average Job Time for 1000 jobs.
The job execution time is the Max{file transmission time, queue time} plus job processing time.
Since the file transmission time is the most important factor to influence the job execution time
for data-intensive jobs in data grids, the proposed scheduling algorithm with HRS can reduce
the file transmission time effectively by virtue of valid scheduling and proper data replication,
as can be seen from the experiments.
The average number of inter-communications for a job execution is illustrated in Figure 6. By
selecting the best site based on location of required data by the job, the proposed scheduling
algorithm with HRS can decrease the number of inter-communications effectively. Overall the
simulation results with Gridsim show better performance (over 12%) comparing to current
algorithms.
International journal on applications of graph theory in wireless ad hoc networks and sensor networks
(GRAPH-HOC) Vol.2, No.3, September 2010
24
0
1
2
3
4
5
6
7
8
NumberofAverage
inter-communication
100 300 500
Number of jobs
BHR
HRS
Figure 6. Average number of inter-communications
Figures 7 shows the average job time for 500 jobs. We compare HRS, BHR and LRU
algorithms for varying inter-communication bandwidth. As inter-communication bandwidth
increase 3 mentioned algorithms will converge. We can conclude that HRS strategy can be
effectively utilized when hierarchy of bandwidth appears because BHR has no distinction
between intra-region and inter-region. It could be anticipated that HRS will avoid inter-region
communications and be stable in hierarchical network architecture with variable bandwidth.
0
5000
10000
15000
20000
25000
10 50 100 300 600 1000
inter-communication Bandwidth
AverageJobTime(Sec)
HRS
BHR
LRU
Figure 7. Average Jobs Time with varying inter-communication bandwidth for 500 jobs
Overall the simulation results with Gridsim show better performance (over 12%) comparing to
current algorithms.
International journal on applications of graph theory in wireless ad hoc networks and sensor networks
(GRAPH-HOC) Vol.2, No.3, September 2010
25
5. CONCLUSION AND FUTURE WORK
In this paper a hierarchical structure for dynamic replicating file and scheduling in data grids
was proposed. To achieve good network bandwidth utilization and reduce data access time, we
propose a job scheduling policy that considers computational capability, job type and data
location in job placement decision. We study and evaluate the performance of various replica
strategies. The simulation results show, first of all, that proposed scheduling algorithm and HRS
both get better performances. Second, we can achieve particularly good performance with
scheduling algorithm where jobs are always scheduled to site with most of the data needed, and
a separate HRS process at each site for replication management. Experimental data show
proposed scheduling algorithm with HRS replica strategy outperforms others combinations in
total job execution time.
REFERENCES
[1] K. Ranganathan and I. Foster, “Identifying Dynamic Replication Strategies for a High
Performance Data Grid”. In Proceedings of the International Grid Computing Workshop,
Denver, Colorado, USA, 2001.
[2] Jianhua Jiang, Huifang Ji, “scheduling algorithm with potential behaviors”, Journal of
Computers, VOL. 3 , NO. 12 , December 2008.
[3] Ming Tang, Bu-Sung Lee, Xueyan Tang, Chai-Kiat Yeo .” The Impact of Data Replication on
Job Scheduling Performance in the Data Grid”, Future Generation Computer Systems, Volume
22, Issue 3, February 2006, Pages 254-268
[4] Ali Elghirani, Riky Subrata, Albert Y. Zomaya, and Ali Al Mazari., “Performance Enhancement
through Hybrid Replication and Genetic Algorithm Co-Scheduling in Data Grids”, Advanced
Networks Research Group, School of Information Technologies, University of Sydney, NSW
2006 Australia.
[5] Sang-Min Park, Jai-Hoon Kim, Young-Bae Ko: “Dynamic Grid Replication Strategy based on
Internet Hierarchy”, Book Series Lecture Notes in Computer Science, Grid and Cooperative
omputing book,Publisher Springer, August 2005, Volume 3033/2004, Pages 838-846
[6] Ruay-Shiung Chang, Jih-Sheng Chang, Shin-Yi Lin, “Job scheduling and data replication on
data grids”, Future Generation Computer Systems, Volume 23, Issue 7, August 2007, Pages 846-
860
[7] Nhan Nguyen Dang, Sang Boem Lim2: “Combination of Replication and Scheduling in Data
Grids”, IJCSNS International Journal of Computer Science and Network Security, VOL.7 No.3,
March 2007.
[8] T. Phan, K. Ranganathan, and R. Sion, “Evolving toward the perfect schedule: Co-scheduling
job assignments and data replication in wide-area systems using a genetic algorithm”, Job
scheduling strategies for parallel processing (11th international workshop), JSSPP 2005,
Cambridge MA, 2005.
[9] Klaus Krauter, Rajkumar Buyya and Muthucumaru Maheswaran, “A taxonomy and survey of
grid resource management systems for distributed computing”, SOFTWARE—PRACTICE
AND EXPERIENCE Softw. Pract. Exper. 2002; 32:135–164 (DOI: 10.1002/spe.432).
[10] Klaus Krauter and Manzur Murshed, GridSim a toolkit for the modelling and simulation of
distributed resource management and scheduling for Grid computing, CONCURRENCY AND
COMPUTATION: PRACTICE AND EXPERIENCE Concurrency Computat.: Pract. Exper.
2002; 14:1175–1220 (DOI: 10.1002/cpe.710).
[11] The Data Grid Project. http://www.eu-datagrid.org
[12] Parallel workload Project. http://www.parrallelworkload.org.

More Related Content

What's hot

ODRS: Optimal Data Replication Scheme for Time Efficiency In MANETs
ODRS: Optimal Data Replication Scheme for Time Efficiency In  MANETsODRS: Optimal Data Replication Scheme for Time Efficiency In  MANETs
ODRS: Optimal Data Replication Scheme for Time Efficiency In MANETsIOSR Journals
 
Survey on Synchronizing File Operations Along with Storage Scalable Mechanism
Survey on Synchronizing File Operations Along with Storage Scalable MechanismSurvey on Synchronizing File Operations Along with Storage Scalable Mechanism
Survey on Synchronizing File Operations Along with Storage Scalable MechanismIRJET Journal
 
B036407011
B036407011B036407011
B036407011theijes
 
THRESHOLD BASED VM PLACEMENT TECHNIQUE FOR LOAD BALANCED RESOURCE PROVISIONIN...
THRESHOLD BASED VM PLACEMENT TECHNIQUE FOR LOAD BALANCED RESOURCE PROVISIONIN...THRESHOLD BASED VM PLACEMENT TECHNIQUE FOR LOAD BALANCED RESOURCE PROVISIONIN...
THRESHOLD BASED VM PLACEMENT TECHNIQUE FOR LOAD BALANCED RESOURCE PROVISIONIN...IJCNCJournal
 
Dynamic selection of cluster head in in networks for energy management
Dynamic selection of cluster head in in networks for energy managementDynamic selection of cluster head in in networks for energy management
Dynamic selection of cluster head in in networks for energy managementeSAT Journals
 
Dynamic selection of cluster head in in networks for energy management
Dynamic selection of cluster head in in networks for energy managementDynamic selection of cluster head in in networks for energy management
Dynamic selection of cluster head in in networks for energy managementeSAT Publishing House
 
A survey of various scheduling algorithm in cloud computing environment
A survey of various scheduling algorithm in cloud computing environmentA survey of various scheduling algorithm in cloud computing environment
A survey of various scheduling algorithm in cloud computing environmenteSAT Publishing House
 
An efficient approach on spatial big data related to wireless networks and it...
An efficient approach on spatial big data related to wireless networks and it...An efficient approach on spatial big data related to wireless networks and it...
An efficient approach on spatial big data related to wireless networks and it...eSAT Journals
 
An adaptive algorithm for task scheduling for computational grid
An adaptive algorithm for task scheduling for computational gridAn adaptive algorithm for task scheduling for computational grid
An adaptive algorithm for task scheduling for computational grideSAT Journals
 
DYNAMIC TASK SCHEDULING BASED ON BURST TIME REQUIREMENT FOR CLOUD ENVIRONMENT
DYNAMIC TASK SCHEDULING BASED ON BURST TIME REQUIREMENT FOR CLOUD ENVIRONMENTDYNAMIC TASK SCHEDULING BASED ON BURST TIME REQUIREMENT FOR CLOUD ENVIRONMENT
DYNAMIC TASK SCHEDULING BASED ON BURST TIME REQUIREMENT FOR CLOUD ENVIRONMENTIJCNCJournal
 
ORCHESTRATING BULK DATA TRANSFERS ACROSS GEO-DISTRIBUTED DATACENTERS
ORCHESTRATING BULK DATA TRANSFERS ACROSS GEO-DISTRIBUTED DATACENTERSORCHESTRATING BULK DATA TRANSFERS ACROSS GEO-DISTRIBUTED DATACENTERS
ORCHESTRATING BULK DATA TRANSFERS ACROSS GEO-DISTRIBUTED DATACENTERSNexgen Technology
 
NETWORK-AWARE DATA PREFETCHING OPTIMIZATION OF COMPUTATIONS IN A HETEROGENEOU...
NETWORK-AWARE DATA PREFETCHING OPTIMIZATION OF COMPUTATIONS IN A HETEROGENEOU...NETWORK-AWARE DATA PREFETCHING OPTIMIZATION OF COMPUTATIONS IN A HETEROGENEOU...
NETWORK-AWARE DATA PREFETCHING OPTIMIZATION OF COMPUTATIONS IN A HETEROGENEOU...IJCNCJournal
 

What's hot (15)

ODRS: Optimal Data Replication Scheme for Time Efficiency In MANETs
ODRS: Optimal Data Replication Scheme for Time Efficiency In  MANETsODRS: Optimal Data Replication Scheme for Time Efficiency In  MANETs
ODRS: Optimal Data Replication Scheme for Time Efficiency In MANETs
 
Survey on Synchronizing File Operations Along with Storage Scalable Mechanism
Survey on Synchronizing File Operations Along with Storage Scalable MechanismSurvey on Synchronizing File Operations Along with Storage Scalable Mechanism
Survey on Synchronizing File Operations Along with Storage Scalable Mechanism
 
B036407011
B036407011B036407011
B036407011
 
THRESHOLD BASED VM PLACEMENT TECHNIQUE FOR LOAD BALANCED RESOURCE PROVISIONIN...
THRESHOLD BASED VM PLACEMENT TECHNIQUE FOR LOAD BALANCED RESOURCE PROVISIONIN...THRESHOLD BASED VM PLACEMENT TECHNIQUE FOR LOAD BALANCED RESOURCE PROVISIONIN...
THRESHOLD BASED VM PLACEMENT TECHNIQUE FOR LOAD BALANCED RESOURCE PROVISIONIN...
 
Dynamic selection of cluster head in in networks for energy management
Dynamic selection of cluster head in in networks for energy managementDynamic selection of cluster head in in networks for energy management
Dynamic selection of cluster head in in networks for energy management
 
Dynamic selection of cluster head in in networks for energy management
Dynamic selection of cluster head in in networks for energy managementDynamic selection of cluster head in in networks for energy management
Dynamic selection of cluster head in in networks for energy management
 
A survey of various scheduling algorithm in cloud computing environment
A survey of various scheduling algorithm in cloud computing environmentA survey of various scheduling algorithm in cloud computing environment
A survey of various scheduling algorithm in cloud computing environment
 
An efficient approach on spatial big data related to wireless networks and it...
An efficient approach on spatial big data related to wireless networks and it...An efficient approach on spatial big data related to wireless networks and it...
An efficient approach on spatial big data related to wireless networks and it...
 
An adaptive algorithm for task scheduling for computational grid
An adaptive algorithm for task scheduling for computational gridAn adaptive algorithm for task scheduling for computational grid
An adaptive algorithm for task scheduling for computational grid
 
Grid computing for load balancing strategies
Grid computing for load balancing strategiesGrid computing for load balancing strategies
Grid computing for load balancing strategies
 
DYNAMIC TASK SCHEDULING BASED ON BURST TIME REQUIREMENT FOR CLOUD ENVIRONMENT
DYNAMIC TASK SCHEDULING BASED ON BURST TIME REQUIREMENT FOR CLOUD ENVIRONMENTDYNAMIC TASK SCHEDULING BASED ON BURST TIME REQUIREMENT FOR CLOUD ENVIRONMENT
DYNAMIC TASK SCHEDULING BASED ON BURST TIME REQUIREMENT FOR CLOUD ENVIRONMENT
 
ORCHESTRATING BULK DATA TRANSFERS ACROSS GEO-DISTRIBUTED DATACENTERS
ORCHESTRATING BULK DATA TRANSFERS ACROSS GEO-DISTRIBUTED DATACENTERSORCHESTRATING BULK DATA TRANSFERS ACROSS GEO-DISTRIBUTED DATACENTERS
ORCHESTRATING BULK DATA TRANSFERS ACROSS GEO-DISTRIBUTED DATACENTERS
 
50120130405014 2-3
50120130405014 2-350120130405014 2-3
50120130405014 2-3
 
[IJET V2I2P18] Authors: Roopa G Yeklaspur, Dr.Yerriswamy.T
[IJET V2I2P18] Authors: Roopa G Yeklaspur, Dr.Yerriswamy.T[IJET V2I2P18] Authors: Roopa G Yeklaspur, Dr.Yerriswamy.T
[IJET V2I2P18] Authors: Roopa G Yeklaspur, Dr.Yerriswamy.T
 
NETWORK-AWARE DATA PREFETCHING OPTIMIZATION OF COMPUTATIONS IN A HETEROGENEOU...
NETWORK-AWARE DATA PREFETCHING OPTIMIZATION OF COMPUTATIONS IN A HETEROGENEOU...NETWORK-AWARE DATA PREFETCHING OPTIMIZATION OF COMPUTATIONS IN A HETEROGENEOU...
NETWORK-AWARE DATA PREFETCHING OPTIMIZATION OF COMPUTATIONS IN A HETEROGENEOU...
 

Similar to The Impact of Data Replication on Job Scheduling Performance in Hierarchical Data Grid

An enhanced adaptive scoring job scheduling algorithm with replication strate...
An enhanced adaptive scoring job scheduling algorithm with replication strate...An enhanced adaptive scoring job scheduling algorithm with replication strate...
An enhanced adaptive scoring job scheduling algorithm with replication strate...eSAT Publishing House
 
A Survey of File Replication Techniques In Grid Systems
A Survey of File Replication Techniques In Grid SystemsA Survey of File Replication Techniques In Grid Systems
A Survey of File Replication Techniques In Grid SystemsEditor IJCATR
 
A Survey of File Replication Techniques In Grid Systems
A Survey of File Replication Techniques In Grid SystemsA Survey of File Replication Techniques In Grid Systems
A Survey of File Replication Techniques In Grid SystemsEditor IJCATR
 
A Survey of Job Scheduling Algorithms Whit Hierarchical Structure to Load Ba...
A Survey of Job Scheduling Algorithms Whit  Hierarchical Structure to Load Ba...A Survey of Job Scheduling Algorithms Whit  Hierarchical Structure to Load Ba...
A Survey of Job Scheduling Algorithms Whit Hierarchical Structure to Load Ba...Editor IJCATR
 
Analyse the performance of mobile peer to Peer network using ant colony optim...
Analyse the performance of mobile peer to Peer network using ant colony optim...Analyse the performance of mobile peer to Peer network using ant colony optim...
Analyse the performance of mobile peer to Peer network using ant colony optim...IJCI JOURNAL
 
GROUPING BASED JOB SCHEDULING ALGORITHM USING PRIORITY QUEUE AND HYBRID ALGOR...
GROUPING BASED JOB SCHEDULING ALGORITHM USING PRIORITY QUEUE AND HYBRID ALGOR...GROUPING BASED JOB SCHEDULING ALGORITHM USING PRIORITY QUEUE AND HYBRID ALGOR...
GROUPING BASED JOB SCHEDULING ALGORITHM USING PRIORITY QUEUE AND HYBRID ALGOR...ijgca
 
GROUPING BASED JOB SCHEDULING ALGORITHM USING PRIORITY QUEUE AND HYBRID ALGOR...
GROUPING BASED JOB SCHEDULING ALGORITHM USING PRIORITY QUEUE AND HYBRID ALGOR...GROUPING BASED JOB SCHEDULING ALGORITHM USING PRIORITY QUEUE AND HYBRID ALGOR...
GROUPING BASED JOB SCHEDULING ALGORITHM USING PRIORITY QUEUE AND HYBRID ALGOR...ijgca
 
An asynchronous replication model to improve data available into a heterogene...
An asynchronous replication model to improve data available into a heterogene...An asynchronous replication model to improve data available into a heterogene...
An asynchronous replication model to improve data available into a heterogene...Alexander Decker
 
Efficient Resource Management Mechanism with Fault Tolerant Model for Computa...
Efficient Resource Management Mechanism with Fault Tolerant Model for Computa...Efficient Resource Management Mechanism with Fault Tolerant Model for Computa...
Efficient Resource Management Mechanism with Fault Tolerant Model for Computa...Editor IJCATR
 
Use of genetic algorithm for
Use of genetic algorithm forUse of genetic algorithm for
Use of genetic algorithm forijitjournal
 
A survey of various scheduling algorithm in cloud computing environment
A survey of various scheduling algorithm in cloud computing environmentA survey of various scheduling algorithm in cloud computing environment
A survey of various scheduling algorithm in cloud computing environmenteSAT Journals
 
T AXONOMY OF O PTIMIZATION A PPROACHES OF R ESOURCE B ROKERS IN D ATA G RIDS
T AXONOMY OF  O PTIMIZATION  A PPROACHES OF R ESOURCE B ROKERS IN  D ATA  G RIDST AXONOMY OF  O PTIMIZATION  A PPROACHES OF R ESOURCE B ROKERS IN  D ATA  G RIDS
T AXONOMY OF O PTIMIZATION A PPROACHES OF R ESOURCE B ROKERS IN D ATA G RIDSijcsit
 
A Platform for Large-Scale Grid Data Service on Dynamic High-Performance Netw...
A Platform for Large-Scale Grid Data Service on Dynamic High-Performance Netw...A Platform for Large-Scale Grid Data Service on Dynamic High-Performance Netw...
A Platform for Large-Scale Grid Data Service on Dynamic High-Performance Netw...Tal Lavian Ph.D.
 
QoS_Aware_Replica_Control_Strategies_for_Distributed_Real_time_dbms.pdf
QoS_Aware_Replica_Control_Strategies_for_Distributed_Real_time_dbms.pdfQoS_Aware_Replica_Control_Strategies_for_Distributed_Real_time_dbms.pdf
QoS_Aware_Replica_Control_Strategies_for_Distributed_Real_time_dbms.pdfneju3
 
AN ENTROPIC OPTIMIZATION TECHNIQUE IN HETEROGENEOUS GRID COMPUTING USING BION...
AN ENTROPIC OPTIMIZATION TECHNIQUE IN HETEROGENEOUS GRID COMPUTING USING BION...AN ENTROPIC OPTIMIZATION TECHNIQUE IN HETEROGENEOUS GRID COMPUTING USING BION...
AN ENTROPIC OPTIMIZATION TECHNIQUE IN HETEROGENEOUS GRID COMPUTING USING BION...ijcsit
 
Cache mechanism to avoid dulpication of same thing in hadoop system to speed ...
Cache mechanism to avoid dulpication of same thing in hadoop system to speed ...Cache mechanism to avoid dulpication of same thing in hadoop system to speed ...
Cache mechanism to avoid dulpication of same thing in hadoop system to speed ...eSAT Journals
 
MAP/REDUCE DESIGN AND IMPLEMENTATION OF APRIORIALGORITHM FOR HANDLING VOLUMIN...
MAP/REDUCE DESIGN AND IMPLEMENTATION OF APRIORIALGORITHM FOR HANDLING VOLUMIN...MAP/REDUCE DESIGN AND IMPLEMENTATION OF APRIORIALGORITHM FOR HANDLING VOLUMIN...
MAP/REDUCE DESIGN AND IMPLEMENTATION OF APRIORIALGORITHM FOR HANDLING VOLUMIN...acijjournal
 
ANALYSE THE PERFORMANCE OF MOBILE PEER TO PEER NETWORK USING ANT COLONY OPTIM...
ANALYSE THE PERFORMANCE OF MOBILE PEER TO PEER NETWORK USING ANT COLONY OPTIM...ANALYSE THE PERFORMANCE OF MOBILE PEER TO PEER NETWORK USING ANT COLONY OPTIM...
ANALYSE THE PERFORMANCE OF MOBILE PEER TO PEER NETWORK USING ANT COLONY OPTIM...ijcsity
 

Similar to The Impact of Data Replication on Job Scheduling Performance in Hierarchical Data Grid (20)

An enhanced adaptive scoring job scheduling algorithm with replication strate...
An enhanced adaptive scoring job scheduling algorithm with replication strate...An enhanced adaptive scoring job scheduling algorithm with replication strate...
An enhanced adaptive scoring job scheduling algorithm with replication strate...
 
A Survey of File Replication Techniques In Grid Systems
A Survey of File Replication Techniques In Grid SystemsA Survey of File Replication Techniques In Grid Systems
A Survey of File Replication Techniques In Grid Systems
 
A Survey of File Replication Techniques In Grid Systems
A Survey of File Replication Techniques In Grid SystemsA Survey of File Replication Techniques In Grid Systems
A Survey of File Replication Techniques In Grid Systems
 
A Survey of Job Scheduling Algorithms Whit Hierarchical Structure to Load Ba...
A Survey of Job Scheduling Algorithms Whit  Hierarchical Structure to Load Ba...A Survey of Job Scheduling Algorithms Whit  Hierarchical Structure to Load Ba...
A Survey of Job Scheduling Algorithms Whit Hierarchical Structure to Load Ba...
 
50120130406035
5012013040603550120130406035
50120130406035
 
Analyse the performance of mobile peer to Peer network using ant colony optim...
Analyse the performance of mobile peer to Peer network using ant colony optim...Analyse the performance of mobile peer to Peer network using ant colony optim...
Analyse the performance of mobile peer to Peer network using ant colony optim...
 
GROUPING BASED JOB SCHEDULING ALGORITHM USING PRIORITY QUEUE AND HYBRID ALGOR...
GROUPING BASED JOB SCHEDULING ALGORITHM USING PRIORITY QUEUE AND HYBRID ALGOR...GROUPING BASED JOB SCHEDULING ALGORITHM USING PRIORITY QUEUE AND HYBRID ALGOR...
GROUPING BASED JOB SCHEDULING ALGORITHM USING PRIORITY QUEUE AND HYBRID ALGOR...
 
GROUPING BASED JOB SCHEDULING ALGORITHM USING PRIORITY QUEUE AND HYBRID ALGOR...
GROUPING BASED JOB SCHEDULING ALGORITHM USING PRIORITY QUEUE AND HYBRID ALGOR...GROUPING BASED JOB SCHEDULING ALGORITHM USING PRIORITY QUEUE AND HYBRID ALGOR...
GROUPING BASED JOB SCHEDULING ALGORITHM USING PRIORITY QUEUE AND HYBRID ALGOR...
 
An asynchronous replication model to improve data available into a heterogene...
An asynchronous replication model to improve data available into a heterogene...An asynchronous replication model to improve data available into a heterogene...
An asynchronous replication model to improve data available into a heterogene...
 
Efficient Resource Management Mechanism with Fault Tolerant Model for Computa...
Efficient Resource Management Mechanism with Fault Tolerant Model for Computa...Efficient Resource Management Mechanism with Fault Tolerant Model for Computa...
Efficient Resource Management Mechanism with Fault Tolerant Model for Computa...
 
Use of genetic algorithm for
Use of genetic algorithm forUse of genetic algorithm for
Use of genetic algorithm for
 
A survey of various scheduling algorithm in cloud computing environment
A survey of various scheduling algorithm in cloud computing environmentA survey of various scheduling algorithm in cloud computing environment
A survey of various scheduling algorithm in cloud computing environment
 
T AXONOMY OF O PTIMIZATION A PPROACHES OF R ESOURCE B ROKERS IN D ATA G RIDS
T AXONOMY OF  O PTIMIZATION  A PPROACHES OF R ESOURCE B ROKERS IN  D ATA  G RIDST AXONOMY OF  O PTIMIZATION  A PPROACHES OF R ESOURCE B ROKERS IN  D ATA  G RIDS
T AXONOMY OF O PTIMIZATION A PPROACHES OF R ESOURCE B ROKERS IN D ATA G RIDS
 
A Platform for Large-Scale Grid Data Service on Dynamic High-Performance Netw...
A Platform for Large-Scale Grid Data Service on Dynamic High-Performance Netw...A Platform for Large-Scale Grid Data Service on Dynamic High-Performance Netw...
A Platform for Large-Scale Grid Data Service on Dynamic High-Performance Netw...
 
QoS_Aware_Replica_Control_Strategies_for_Distributed_Real_time_dbms.pdf
QoS_Aware_Replica_Control_Strategies_for_Distributed_Real_time_dbms.pdfQoS_Aware_Replica_Control_Strategies_for_Distributed_Real_time_dbms.pdf
QoS_Aware_Replica_Control_Strategies_for_Distributed_Real_time_dbms.pdf
 
AN ENTROPIC OPTIMIZATION TECHNIQUE IN HETEROGENEOUS GRID COMPUTING USING BION...
AN ENTROPIC OPTIMIZATION TECHNIQUE IN HETEROGENEOUS GRID COMPUTING USING BION...AN ENTROPIC OPTIMIZATION TECHNIQUE IN HETEROGENEOUS GRID COMPUTING USING BION...
AN ENTROPIC OPTIMIZATION TECHNIQUE IN HETEROGENEOUS GRID COMPUTING USING BION...
 
J41046368
J41046368J41046368
J41046368
 
Cache mechanism to avoid dulpication of same thing in hadoop system to speed ...
Cache mechanism to avoid dulpication of same thing in hadoop system to speed ...Cache mechanism to avoid dulpication of same thing in hadoop system to speed ...
Cache mechanism to avoid dulpication of same thing in hadoop system to speed ...
 
MAP/REDUCE DESIGN AND IMPLEMENTATION OF APRIORIALGORITHM FOR HANDLING VOLUMIN...
MAP/REDUCE DESIGN AND IMPLEMENTATION OF APRIORIALGORITHM FOR HANDLING VOLUMIN...MAP/REDUCE DESIGN AND IMPLEMENTATION OF APRIORIALGORITHM FOR HANDLING VOLUMIN...
MAP/REDUCE DESIGN AND IMPLEMENTATION OF APRIORIALGORITHM FOR HANDLING VOLUMIN...
 
ANALYSE THE PERFORMANCE OF MOBILE PEER TO PEER NETWORK USING ANT COLONY OPTIM...
ANALYSE THE PERFORMANCE OF MOBILE PEER TO PEER NETWORK USING ANT COLONY OPTIM...ANALYSE THE PERFORMANCE OF MOBILE PEER TO PEER NETWORK USING ANT COLONY OPTIM...
ANALYSE THE PERFORMANCE OF MOBILE PEER TO PEER NETWORK USING ANT COLONY OPTIM...
 

More from graphhoc

ON THE PROBABILITY OF K-CONNECTIVITY IN WIRELESS AD HOC NETWORKS UNDER DIFFER...
ON THE PROBABILITY OF K-CONNECTIVITY IN WIRELESS AD HOC NETWORKS UNDER DIFFER...ON THE PROBABILITY OF K-CONNECTIVITY IN WIRELESS AD HOC NETWORKS UNDER DIFFER...
ON THE PROBABILITY OF K-CONNECTIVITY IN WIRELESS AD HOC NETWORKS UNDER DIFFER...graphhoc
 
DISTANCE TWO LABELING FOR MULTI-STOREY GRAPHS
DISTANCE TWO LABELING FOR MULTI-STOREY GRAPHSDISTANCE TWO LABELING FOR MULTI-STOREY GRAPHS
DISTANCE TWO LABELING FOR MULTI-STOREY GRAPHSgraphhoc
 
Impact of Mobility for Qos Based Secure Manet
Impact of Mobility for Qos Based Secure Manet  Impact of Mobility for Qos Based Secure Manet
Impact of Mobility for Qos Based Secure Manet graphhoc
 
A Transmission Range Based Clustering Algorithm for Topology Control Manet
A Transmission Range Based Clustering Algorithm for Topology Control ManetA Transmission Range Based Clustering Algorithm for Topology Control Manet
A Transmission Range Based Clustering Algorithm for Topology Control Manetgraphhoc
 
A Battery Power Scheduling Policy with Hardware Support In Mobile Devices
A Battery Power Scheduling Policy with Hardware Support In Mobile Devices A Battery Power Scheduling Policy with Hardware Support In Mobile Devices
A Battery Power Scheduling Policy with Hardware Support In Mobile Devices graphhoc
 
A Review of the Energy Efficient and Secure Multicast Routing Protocols for ...
 A Review of the Energy Efficient and Secure Multicast Routing Protocols for ... A Review of the Energy Efficient and Secure Multicast Routing Protocols for ...
A Review of the Energy Efficient and Secure Multicast Routing Protocols for ...graphhoc
 
Case Study On Social Engineering Techniques for Persuasion Full Text
Case Study On Social Engineering Techniques for Persuasion   Full Text Case Study On Social Engineering Techniques for Persuasion   Full Text
Case Study On Social Engineering Techniques for Persuasion Full Text graphhoc
 
Breaking the Legend: Maxmin Fairness notion is no longer effective
Breaking the Legend: Maxmin Fairness notion is no longer effective  Breaking the Legend: Maxmin Fairness notion is no longer effective
Breaking the Legend: Maxmin Fairness notion is no longer effective graphhoc
 
I-Min: An Intelligent Fermat Point Based Energy Efficient Geographic Packet F...
I-Min: An Intelligent Fermat Point Based Energy Efficient Geographic Packet F...I-Min: An Intelligent Fermat Point Based Energy Efficient Geographic Packet F...
I-Min: An Intelligent Fermat Point Based Energy Efficient Geographic Packet F...graphhoc
 
Fault tolerant wireless sensor mac protocol for efficient collision avoidance
Fault tolerant wireless sensor mac protocol for efficient collision avoidanceFault tolerant wireless sensor mac protocol for efficient collision avoidance
Fault tolerant wireless sensor mac protocol for efficient collision avoidancegraphhoc
 
Enhancing qo s and qoe in ims enabled next generation networks
Enhancing qo s and qoe in ims enabled next generation networksEnhancing qo s and qoe in ims enabled next generation networks
Enhancing qo s and qoe in ims enabled next generation networksgraphhoc
 
Simulated annealing for location area planning in cellular networks
Simulated annealing for location area planning in cellular networksSimulated annealing for location area planning in cellular networks
Simulated annealing for location area planning in cellular networksgraphhoc
 
Secure key exchange and encryption mechanism for group communication in wirel...
Secure key exchange and encryption mechanism for group communication in wirel...Secure key exchange and encryption mechanism for group communication in wirel...
Secure key exchange and encryption mechanism for group communication in wirel...graphhoc
 
Simulation to track 3 d location in gsm through ns2 and real life
Simulation to track 3 d location in gsm through ns2 and real lifeSimulation to track 3 d location in gsm through ns2 and real life
Simulation to track 3 d location in gsm through ns2 and real lifegraphhoc
 
Performance Analysis of Ultra Wideband Receivers for High Data Rate Wireless ...
Performance Analysis of Ultra Wideband Receivers for High Data Rate Wireless ...Performance Analysis of Ultra Wideband Receivers for High Data Rate Wireless ...
Performance Analysis of Ultra Wideband Receivers for High Data Rate Wireless ...graphhoc
 
Coverage and Connectivity Aware Neural Network Based Energy Efficient Routing...
Coverage and Connectivity Aware Neural Network Based Energy Efficient Routing...Coverage and Connectivity Aware Neural Network Based Energy Efficient Routing...
Coverage and Connectivity Aware Neural Network Based Energy Efficient Routing...graphhoc
 
An Overview of Mobile Ad Hoc Networks for the Existing Protocols and Applicat...
An Overview of Mobile Ad Hoc Networks for the Existing Protocols and Applicat...An Overview of Mobile Ad Hoc Networks for the Existing Protocols and Applicat...
An Overview of Mobile Ad Hoc Networks for the Existing Protocols and Applicat...graphhoc
 
An Algorithm for Odd Graceful Labeling of the Union of Paths and Cycles
An Algorithm for Odd Graceful Labeling of the Union of Paths and Cycles  An Algorithm for Odd Graceful Labeling of the Union of Paths and Cycles
An Algorithm for Odd Graceful Labeling of the Union of Paths and Cycles graphhoc
 
ACTOR GARBAGE COLLECTION IN DISTRIBUTED SYSTEMS USING GRAPH TRANSFORMATION
ACTOR GARBAGE COLLECTION IN DISTRIBUTED SYSTEMS USING GRAPH TRANSFORMATIONACTOR GARBAGE COLLECTION IN DISTRIBUTED SYSTEMS USING GRAPH TRANSFORMATION
ACTOR GARBAGE COLLECTION IN DISTRIBUTED SYSTEMS USING GRAPH TRANSFORMATIONgraphhoc
 
A Proposal Analytical Model and Simulation of the Attacks in Routing Protocol...
A Proposal Analytical Model and Simulation of the Attacks in Routing Protocol...A Proposal Analytical Model and Simulation of the Attacks in Routing Protocol...
A Proposal Analytical Model and Simulation of the Attacks in Routing Protocol...graphhoc
 

More from graphhoc (20)

ON THE PROBABILITY OF K-CONNECTIVITY IN WIRELESS AD HOC NETWORKS UNDER DIFFER...
ON THE PROBABILITY OF K-CONNECTIVITY IN WIRELESS AD HOC NETWORKS UNDER DIFFER...ON THE PROBABILITY OF K-CONNECTIVITY IN WIRELESS AD HOC NETWORKS UNDER DIFFER...
ON THE PROBABILITY OF K-CONNECTIVITY IN WIRELESS AD HOC NETWORKS UNDER DIFFER...
 
DISTANCE TWO LABELING FOR MULTI-STOREY GRAPHS
DISTANCE TWO LABELING FOR MULTI-STOREY GRAPHSDISTANCE TWO LABELING FOR MULTI-STOREY GRAPHS
DISTANCE TWO LABELING FOR MULTI-STOREY GRAPHS
 
Impact of Mobility for Qos Based Secure Manet
Impact of Mobility for Qos Based Secure Manet  Impact of Mobility for Qos Based Secure Manet
Impact of Mobility for Qos Based Secure Manet
 
A Transmission Range Based Clustering Algorithm for Topology Control Manet
A Transmission Range Based Clustering Algorithm for Topology Control ManetA Transmission Range Based Clustering Algorithm for Topology Control Manet
A Transmission Range Based Clustering Algorithm for Topology Control Manet
 
A Battery Power Scheduling Policy with Hardware Support In Mobile Devices
A Battery Power Scheduling Policy with Hardware Support In Mobile Devices A Battery Power Scheduling Policy with Hardware Support In Mobile Devices
A Battery Power Scheduling Policy with Hardware Support In Mobile Devices
 
A Review of the Energy Efficient and Secure Multicast Routing Protocols for ...
 A Review of the Energy Efficient and Secure Multicast Routing Protocols for ... A Review of the Energy Efficient and Secure Multicast Routing Protocols for ...
A Review of the Energy Efficient and Secure Multicast Routing Protocols for ...
 
Case Study On Social Engineering Techniques for Persuasion Full Text
Case Study On Social Engineering Techniques for Persuasion   Full Text Case Study On Social Engineering Techniques for Persuasion   Full Text
Case Study On Social Engineering Techniques for Persuasion Full Text
 
Breaking the Legend: Maxmin Fairness notion is no longer effective
Breaking the Legend: Maxmin Fairness notion is no longer effective  Breaking the Legend: Maxmin Fairness notion is no longer effective
Breaking the Legend: Maxmin Fairness notion is no longer effective
 
I-Min: An Intelligent Fermat Point Based Energy Efficient Geographic Packet F...
I-Min: An Intelligent Fermat Point Based Energy Efficient Geographic Packet F...I-Min: An Intelligent Fermat Point Based Energy Efficient Geographic Packet F...
I-Min: An Intelligent Fermat Point Based Energy Efficient Geographic Packet F...
 
Fault tolerant wireless sensor mac protocol for efficient collision avoidance
Fault tolerant wireless sensor mac protocol for efficient collision avoidanceFault tolerant wireless sensor mac protocol for efficient collision avoidance
Fault tolerant wireless sensor mac protocol for efficient collision avoidance
 
Enhancing qo s and qoe in ims enabled next generation networks
Enhancing qo s and qoe in ims enabled next generation networksEnhancing qo s and qoe in ims enabled next generation networks
Enhancing qo s and qoe in ims enabled next generation networks
 
Simulated annealing for location area planning in cellular networks
Simulated annealing for location area planning in cellular networksSimulated annealing for location area planning in cellular networks
Simulated annealing for location area planning in cellular networks
 
Secure key exchange and encryption mechanism for group communication in wirel...
Secure key exchange and encryption mechanism for group communication in wirel...Secure key exchange and encryption mechanism for group communication in wirel...
Secure key exchange and encryption mechanism for group communication in wirel...
 
Simulation to track 3 d location in gsm through ns2 and real life
Simulation to track 3 d location in gsm through ns2 and real lifeSimulation to track 3 d location in gsm through ns2 and real life
Simulation to track 3 d location in gsm through ns2 and real life
 
Performance Analysis of Ultra Wideband Receivers for High Data Rate Wireless ...
Performance Analysis of Ultra Wideband Receivers for High Data Rate Wireless ...Performance Analysis of Ultra Wideband Receivers for High Data Rate Wireless ...
Performance Analysis of Ultra Wideband Receivers for High Data Rate Wireless ...
 
Coverage and Connectivity Aware Neural Network Based Energy Efficient Routing...
Coverage and Connectivity Aware Neural Network Based Energy Efficient Routing...Coverage and Connectivity Aware Neural Network Based Energy Efficient Routing...
Coverage and Connectivity Aware Neural Network Based Energy Efficient Routing...
 
An Overview of Mobile Ad Hoc Networks for the Existing Protocols and Applicat...
An Overview of Mobile Ad Hoc Networks for the Existing Protocols and Applicat...An Overview of Mobile Ad Hoc Networks for the Existing Protocols and Applicat...
An Overview of Mobile Ad Hoc Networks for the Existing Protocols and Applicat...
 
An Algorithm for Odd Graceful Labeling of the Union of Paths and Cycles
An Algorithm for Odd Graceful Labeling of the Union of Paths and Cycles  An Algorithm for Odd Graceful Labeling of the Union of Paths and Cycles
An Algorithm for Odd Graceful Labeling of the Union of Paths and Cycles
 
ACTOR GARBAGE COLLECTION IN DISTRIBUTED SYSTEMS USING GRAPH TRANSFORMATION
ACTOR GARBAGE COLLECTION IN DISTRIBUTED SYSTEMS USING GRAPH TRANSFORMATIONACTOR GARBAGE COLLECTION IN DISTRIBUTED SYSTEMS USING GRAPH TRANSFORMATION
ACTOR GARBAGE COLLECTION IN DISTRIBUTED SYSTEMS USING GRAPH TRANSFORMATION
 
A Proposal Analytical Model and Simulation of the Attacks in Routing Protocol...
A Proposal Analytical Model and Simulation of the Attacks in Routing Protocol...A Proposal Analytical Model and Simulation of the Attacks in Routing Protocol...
A Proposal Analytical Model and Simulation of the Attacks in Routing Protocol...
 

Recently uploaded

Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsSnow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsHyundai Motor Group
 
Artificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraArtificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraDeakin University
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksSoftradix Technologies
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 

Recently uploaded (20)

Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptxVulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsSnow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
 
Artificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraArtificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning era
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
The transition to renewables in India.pdf
The transition to renewables in India.pdfThe transition to renewables in India.pdf
The transition to renewables in India.pdf
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other Frameworks
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort ServiceHot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping Elbows
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 

The Impact of Data Replication on Job Scheduling Performance in Hierarchical Data Grid

  • 1. International journal on applications of graph theory in wireless ad hoc networks and sensor networks (GRAPH-HOC) Vol.2, No.3, September 2010 DOI : 10.5121/jgraphoc.2010.2302 15 THE IMPACT OF DATA REPLICATION ON JOB SCHEDULING PERFORMANCE IN HIERARCHICAL DATA GRID Somayeh Abdi1 , Somayeh Mohamadi2 1 Department of Computer Engineering, Eslamabad Gharb branch Islamic Azad University, Eslamabad Gharb,Kermanshah, Iran Somayeh.abdi@gmail.com 1 Department of Computer Engineering, Ghasre Shirin branch Islamic Azad University, Ghasre Shirin, Kermanshah, Iran s.mohamadi@gmail.com ABSTRACT In data-intensive applications data transfer is a primary cause of job execution delay. Data access time depends on bandwidth. The major bottleneck to supporting fast data access in Grids is the high latencies of Wide Area Networks and Internet. Effective scheduling can reduce the amount of data transferred across the internet by dispatching a job to where the needed data are present. Another solution is to use a data replication mechanism. Objective of dynamic replica strategies is reducing file access time which leads to reducing job runtime. In this paper we develop a job scheduling policy and a dynamic data replication strategy, called HRS (Hierarchical Replication Strategy), to improve the data access efficiencies. We study our approach and evaluate it through simulation. The results show that our algorithm has improved 12% over the current strategies. KEYWORDS Grid, Data Grid, Job Scheduling, Data Replication, Simulation 1. INTRODUCTION In the increasing demand of scientific and large-scale business application, a large amount of data are generated and spread for using by users around the world. Many good examples can be listed such as High Energy Physics, meteorology, computational genomics which processes and results large amount of data. Such data cannot be stored centralized in any site but distributed among centre around the world [1]. Data grid tries to store this data in decentralize sites and then for each application retrieves it from these sites. Data grid as an important branch of grid computing focuses on supporting an efficient management mechanism for controlled sharing and large amounts of distributed data. In Data Grid, for each incoming job, the Grid scheduler decides where to run the job based on the job requirements and the system status. Scheduling jobs to suitable grid sites is necessary because data movement between different grid sites is time consuming. If a job is scheduled to a site where the required data are present, the job can process data in this site without any transmission delay for getting data from a remote site.
  • 2. International journal on applications of graph theory in wireless ad hoc networks and sensor networks (GRAPH-HOC) Vol.2, No.3, September 2010 16 Data replication is an important optimization step to manage large data by replicating data in geographically distributed data stores. When users’ jobs access a large amount of data from remote sites, dynamic replica optimizer running in the site tries to store replicas on local storage for future possible repeated requests. We proposed a new job scheduling algorithm and data replication policy that reduce job execution time by reducing job data access time. Our new scheduling policy considers the locations of required data and the job queue length of a computing node. We develop a replication strategy, called HRS (Hierarchical Replication Strategy). It takes into account bandwidth as an important factor for replica selection and replica placement. It also increases the chances of accessing data at a nearby node. The rest of the paper is organized as follows, in section 2 we present a summary of existing and related work. In section 3, a hierarchical structure is proposed for data grid based on classification of networks, along with a scheduling algorithm and replication algorithm for this structure, Section 4 describes our experiments and the results achieved followed by conclusion in section 5. 2. RELATED WORK There are some recent works that address the problem of scheduling and/ or replication in Data Grid as well as the combination between them. In [3], it considers two centralized and decentralized replication algorithms. In centralized method, replica master uses a table that ranks each file access in descending order. If a file access is less than the average, it will be removed from the table. Then it pop files from top and replicates using a response-time oriented replica placement algorithm. In the decentralized method, every site records file access in its table and exchange this table with neighbours. Since every domain knows average number of access for each file and then deletes those files whose access is less than the average, and replicates other files in its local storage. In [5], an algorithm for a two-level hierarchical structure based on internet hierarchy (BHR) has been introduced which only considers dynamic replication and does not consider scheduling. Nodes in the first level are connected to each other with high speed networks and in the second level via internet. The algorithm replicates the file to the site if there is enough space. Next it, accesses the file remotely if the file is available in the sites that are in the same region. Otherwise it tries to make available space by deleting files using LRU (Least Recently Used) method, and replicates the file. It assumes that master site always has a safe copy of file before deleting. In [6], a structure with few networks connected via internet has been presented and an algorithm similar to [5], along with scheduling is proposed. For replicating a file, first computes the total transfer time, then it selects the best node with shortest transfer time. In [7] authors introduce dynamic replication placement (RP) that categorizes the data based on their property. This category is used for job scheduling and replication. Then a job is allocated to a site which has the file in the required category, this leads to reduce the cost for file transfer. In [8] a Genetic Algorithm based co-scheduling of data and jobs of independent nature was proposed; the GA is executed to converge to a schedule by looking at the jobs in the scheduler queue as well as the replicated data objects at once. A performance overhead might be incurred by the system as a result of delaying the replication of the data until the scheduling time. The authors also use an objective function that assumes infinite availability of storage for all data objects which is infeasible in a realistic grid setting.
  • 3. International journal on applications of graph theory in wireless ad hoc networks and sensor networks (GRAPH-HOC) Vol.2, No.3, September 2010 17 3. THE PROPOSED METHOD In this section, we present hierarchical network structure and then we proposed two strategies for job scheduling and data replication by considering the hierarchical network structure. 3.1. NETWORK STRUCTURE The proposed Data Grid structure is shown in Figure 1. This structure has two levels. Regions comprise the first level and sites that are located in a region comprise the second level. A region represents an organization unit which is a group of sites that are geographically close to each other, each region comprises the computers which are connected by a high bandwidth. We define two kinds of communications between sites, inter-communication and intra- communication. Intra-communication is the communication between sites within the same region and inter-communication is the communication between sites across regions. Network bandwidth between sites within a region will be larger than across regions. In communication networks, the performance of system is underlying available network bandwidth and data access latency, especially in networks that hierarchy of bandwidth appears. Therefore, to reduce access latency and to avoid WAN bandwidth bottleneck, it is important to reduce the number of inter-communications. Data Grid Information Service (DGIS) providing resource registration services and keeping track of a list of resources available in the Data Grid. The Grid Scheduler can query this for resource contact, configuration, and status information; Resource discovery identify resources that can be used with their capability through Data Grid Information Service. In this structure we apply distributed dynamic data replication infrastructure. Replica Manager at each site manages the data movement between sites. When a job is assigned to Local Scheduler of site, those required data that do not exist in local site will be transfer to it. Data request by the job is generated as soon as job is scheduled into Local Scheduler queue and Replica Manager controls data movement at each site. In proposed structure there is a centralized Replica Catalogue. It is responsible for indexing available files on the resources and handles queries from users and resources about location of replicas.
  • 4. International journal on applications of graph theory in wireless ad hoc networks and sensor networks (GRAPH-HOC) Vol.2, No.3, September 2010 18 Figure 1. Data Grid Architecture. 3.2. SCHEDULING ALGORITHM Our new scheduling policy considers the locations of required data, computing capacity and number of jobs that allocate on computing nodes. For efficient scheduling of any job, the algorithm determines the best site and then submit job to Local scheduler. A best site is a site that holds most of the requested files (from size point of view). This will significantly reduce total transfer time, and reduce the job execution time. Each job requires some replicas and the replicas is needed to execute job j are represented as: Rj = {LFN1, LFN2,…., LFNn}. We assume that jS is total size of the requested files available in site j, Ci is computing capacity (in MIPS1) of site i and RelativeLoadi is relative load of site i. 1 . Million Instruction Per Second
  • 5. International journal on applications of graph theory in wireless ad hoc networks and sensor networks (GRAPH-HOC) Vol.2, No.3, September 2010 19 ∑= SsiteinLFNiavailableallfor iS LFNS (1) RelativeLoadi= SizeofJobsi / Ci (2) SizeofJobsi is the lengths of queued jobs (in MIPS) on site i and iLFN present size of file LFNi . The scheduling algorithm can be summarized as follow: 1- Compute SC for each site from (1) 2- Select the best site i q i SMAXS 1 max = = ; where q is the number of site (i.e. site with largest available requested data files) 3- If there are several sites with most available data, select the site with minimum relative load from (2). When a job is assigned to local scheduler, the Replica Manager transfers all the requested files that are not exist in local site. The objective of file replicating is to transferring required data to local site before job execution. Therefore, data replication improves job scheduling performance by reducing job execution time. The proposed scheduling algorithm takes into consideration the requested data files. Additionally, where there are the same data requirements in several sites (from size point of view); it takes into consideration the computing capacity of sites with a view to reducing the queue waiting time and relative load of sites. 3.3. HIERARCHICAL REPLICATION STRATEGY After a job is scheduled to site Sj , the requested data will be transferred to Sj to become replicas. HRS (hierarchical replication Strategy) determines wich replica will be transferred to Sj and how to handle this new replica. HRS considers the bandwidth between the regions as the main factor for replica selection / deletion. When each site stores a new replica, replica manager sends a file register request to RC and then RC adds this site to the list of sites that holds the replica. Replica Manager controls data transferring in each site and provides a mechanism for accessing the Replica Catalogue. Data Grid Architecture for Replica Catalogue and Replica Manager is shwon in in Figure 2.
  • 6. International journal on applications of graph theory in wireless ad hoc networks and sensor networks (GRAPH-HOC) Vol.2, No.3, September 2010 20 Figure 2. Data Grid Architecture for replica cataloge and replica manager. For each required file of the job, Replica Manager controls the existence of the file in local site. If file doesn’t exist in the local site HRS first searches the file in local region. If the file duplicated in the same region, then it will creates a list of candidate replicas and selects a replica with the maximum bandwidth available for transferring it. If there is enough space for new replica, then stores it in the local site, otherwise it is only stored in the temporary buffer and will be deleted after the job completes. If the file doesn’t exit in the same region, then HRS creates a list of replicas in other regions and selects the replica with the maximum bandwidth available for transferring it. If there is enough space for new replica, then stores it in the local site, otherwise occupided space will be released to make engough room for this new replica. First, it removes the replicas that already exist in other sites in the same region based on LRU(least recently used) replacement algorithm. After all these replicas are deleted, if the space is still insufficient, the HRS use LRU replacement algorithm to delete replica in local storage which is duplicated in other regions , till it has enough room for the new replica. 4. SIMULATIONS Gridsim is used as the simulation tool to evaluate the performance of the proposed replication and scheduling algorithms. The Java-based GridSim discrete event simulation toolkit provides Java classes that represent entities essential for application, resource modelling, scheduling of jobs to resources, and their execution along with management [9]. Its Java-based design makes it portable and available on all computational platforms. The components of Gridsim are as follow and also depicted in Figure 3.
  • 7. International journal on applications of graph theory in wireless ad hoc networks and sensor networks (GRAPH-HOC) Vol.2, No.3, September 2010 21 Figure 3. Simulator architecture Resource Broker: It receives jobs from user and sends them to the best node according to proposed algorithm; Storage Element (SE): Storage resource in grid; Computing Element (CE): Computing resource in grid; Replica Manager:controls data transfering at each site; Replica Catalogue: stores list of sites that holds replicas. Based on the scheduling algorithm the broker sends jobs to a node. Each job needs a list of files to run. Reducing file access time is the final objective of optimization algorithms. 4.1. Simulation Environment There are four regions in our configuration and each region has an average of 13 sites, which all have CE with associated SE. Table1 specifies the simulation parameters used in our study. There are 5 job types; each job type requires 12 files to execute. While running, jobs were randomly picked from 5 job types, then submitted to the Resource Broker. Files are accessed sequentially within a job without any access pattern. To simplify the requirements, data replication approaches in Data Grid environments commonly assume that the data is read-only.
  • 8. International journal on applications of graph theory in wireless ad hoc networks and sensor networks (GRAPH-HOC) Vol.2, No.3, September 2010 22 4.2. Simulation results and discussion HRS will be compared with LRU (Least Recently Used) and BHR (Bandwidth Hierarchy based Replication). The LRU algorithm always replicates and then deletes those files that have been used least recently. Figure 4 shows the Average job time based on changing number of jobs for 3 algorithms. Figure 5 shows the Average job time for 1000 jobs by 3 mentioned algorithms. HRS replication strategy uses the concept of “network locality” as BHR [5]. 0 5000 10000 15000 20000 25000 100 200 300 400 500 Number of Jobs AverageJobTime(Sec) HRS BHR LRU Figure 4. Average job Time based on varying number of jobs. TABLE 1. Simulation parameters valueTopology Parameters 4Number of region 13Number of sites in each region 10 GBStorage space at each site 10 MbpsConnectivity bandwidth(WAN) 1000MbpsConnectivity bandwidth(LAN) valueJob parameters 500Number of jobs 5Number of job types 12Number of file accessed per job 500 MBSize of single file 50GBTotal size of files
  • 9. International journal on applications of graph theory in wireless ad hoc networks and sensor networks (GRAPH-HOC) Vol.2, No.3, September 2010 23 The difference between HRS and BHR is that required replica within the same region is always the top priority used in HRS, while BHR searches all sites to find the best replica and has no distinction between intra-region communication and inter-region comuuniation. It could be anticipated that HRS will avoid inter-region communications and be stable in hierarchical network architecture with variable bandwidth. Our method takes benefit from network level locality of BHR. Thus, total job execution time is about 12% faster using HRS optimizer than BHR. 0 10000 20000 30000 40000 50000 MeanJobsTime(sec) 1000 Number of Jobs LRU BHR HRS Figure 5. Average Job Time for 1000 jobs. The job execution time is the Max{file transmission time, queue time} plus job processing time. Since the file transmission time is the most important factor to influence the job execution time for data-intensive jobs in data grids, the proposed scheduling algorithm with HRS can reduce the file transmission time effectively by virtue of valid scheduling and proper data replication, as can be seen from the experiments. The average number of inter-communications for a job execution is illustrated in Figure 6. By selecting the best site based on location of required data by the job, the proposed scheduling algorithm with HRS can decrease the number of inter-communications effectively. Overall the simulation results with Gridsim show better performance (over 12%) comparing to current algorithms.
  • 10. International journal on applications of graph theory in wireless ad hoc networks and sensor networks (GRAPH-HOC) Vol.2, No.3, September 2010 24 0 1 2 3 4 5 6 7 8 NumberofAverage inter-communication 100 300 500 Number of jobs BHR HRS Figure 6. Average number of inter-communications Figures 7 shows the average job time for 500 jobs. We compare HRS, BHR and LRU algorithms for varying inter-communication bandwidth. As inter-communication bandwidth increase 3 mentioned algorithms will converge. We can conclude that HRS strategy can be effectively utilized when hierarchy of bandwidth appears because BHR has no distinction between intra-region and inter-region. It could be anticipated that HRS will avoid inter-region communications and be stable in hierarchical network architecture with variable bandwidth. 0 5000 10000 15000 20000 25000 10 50 100 300 600 1000 inter-communication Bandwidth AverageJobTime(Sec) HRS BHR LRU Figure 7. Average Jobs Time with varying inter-communication bandwidth for 500 jobs Overall the simulation results with Gridsim show better performance (over 12%) comparing to current algorithms.
  • 11. International journal on applications of graph theory in wireless ad hoc networks and sensor networks (GRAPH-HOC) Vol.2, No.3, September 2010 25 5. CONCLUSION AND FUTURE WORK In this paper a hierarchical structure for dynamic replicating file and scheduling in data grids was proposed. To achieve good network bandwidth utilization and reduce data access time, we propose a job scheduling policy that considers computational capability, job type and data location in job placement decision. We study and evaluate the performance of various replica strategies. The simulation results show, first of all, that proposed scheduling algorithm and HRS both get better performances. Second, we can achieve particularly good performance with scheduling algorithm where jobs are always scheduled to site with most of the data needed, and a separate HRS process at each site for replication management. Experimental data show proposed scheduling algorithm with HRS replica strategy outperforms others combinations in total job execution time. REFERENCES [1] K. Ranganathan and I. Foster, “Identifying Dynamic Replication Strategies for a High Performance Data Grid”. In Proceedings of the International Grid Computing Workshop, Denver, Colorado, USA, 2001. [2] Jianhua Jiang, Huifang Ji, “scheduling algorithm with potential behaviors”, Journal of Computers, VOL. 3 , NO. 12 , December 2008. [3] Ming Tang, Bu-Sung Lee, Xueyan Tang, Chai-Kiat Yeo .” The Impact of Data Replication on Job Scheduling Performance in the Data Grid”, Future Generation Computer Systems, Volume 22, Issue 3, February 2006, Pages 254-268 [4] Ali Elghirani, Riky Subrata, Albert Y. Zomaya, and Ali Al Mazari., “Performance Enhancement through Hybrid Replication and Genetic Algorithm Co-Scheduling in Data Grids”, Advanced Networks Research Group, School of Information Technologies, University of Sydney, NSW 2006 Australia. [5] Sang-Min Park, Jai-Hoon Kim, Young-Bae Ko: “Dynamic Grid Replication Strategy based on Internet Hierarchy”, Book Series Lecture Notes in Computer Science, Grid and Cooperative omputing book,Publisher Springer, August 2005, Volume 3033/2004, Pages 838-846 [6] Ruay-Shiung Chang, Jih-Sheng Chang, Shin-Yi Lin, “Job scheduling and data replication on data grids”, Future Generation Computer Systems, Volume 23, Issue 7, August 2007, Pages 846- 860 [7] Nhan Nguyen Dang, Sang Boem Lim2: “Combination of Replication and Scheduling in Data Grids”, IJCSNS International Journal of Computer Science and Network Security, VOL.7 No.3, March 2007. [8] T. Phan, K. Ranganathan, and R. Sion, “Evolving toward the perfect schedule: Co-scheduling job assignments and data replication in wide-area systems using a genetic algorithm”, Job scheduling strategies for parallel processing (11th international workshop), JSSPP 2005, Cambridge MA, 2005. [9] Klaus Krauter, Rajkumar Buyya and Muthucumaru Maheswaran, “A taxonomy and survey of grid resource management systems for distributed computing”, SOFTWARE—PRACTICE AND EXPERIENCE Softw. Pract. Exper. 2002; 32:135–164 (DOI: 10.1002/spe.432). [10] Klaus Krauter and Manzur Murshed, GridSim a toolkit for the modelling and simulation of distributed resource management and scheduling for Grid computing, CONCURRENCY AND COMPUTATION: PRACTICE AND EXPERIENCE Concurrency Computat.: Pract. Exper. 2002; 14:1175–1220 (DOI: 10.1002/cpe.710). [11] The Data Grid Project. http://www.eu-datagrid.org [12] Parallel workload Project. http://www.parrallelworkload.org.