SlideShare a Scribd company logo
1 of 1
Download to read offline
Metascheduling on the Grid
In the grid environment, a metascheduler is also known as global scheduler, which 
coordinates  communications  between  multiple  heterogeneous  schedulers  (local 
schedulers) that operate at the local or cluster level. 
    By cooperating with Grid Information Services, which abstract the information of 
all the available resources on the grid into a sing le resource perspective, the grid 
metascheduler presents the end user a single virtual resource pool, which hides all 
the  details  of  scheduling  and  monitoring  jobs  in  the  dynamic  and  heterogeneous 
environment.
    From  the  end  user's  viewpoint,  a  fully  virtualized  grid  is  like  a  single 
supercomputer, and as a proxy or entry point to the grid, the metascheduler should 
hide grid­specific technical details to the most extent. A good metascheduler should 
only  ask  the  end  user  (without  any  grid  specific  knowledge)  three  questions  (see 
Figure 3 for an example description)
WHAT  DO  YOU  NEED?  (Requirements)WHAT  DO  YOU  NEED?  (Requirements)  This  usually  involves 
requirement matchmaking between known resources on the grid and the resources 
required by the end user, such as a particular software, disk space, and the forth.
WHAT  DO  YOU  HAVE?  (Input)WHAT  DO  YOU  HAVE?  (Input)  This  usually  needs  the  end  user  to 
specify the input of the job, such as input files on the local or remote machine, or 
just the arguments of the executable, or both.
WHO ARE YOU? (VO Identity*)WHO ARE YOU? (VO Identity*) This usually needs the user to declare 
his or her VO identity, which enables the scheduler to analyse resource information 
from the user's view point.
* A grid user's VO identity is not used for authentication or authorization, instead, by providing 
the VO identity, we can retrieve user­specific resource information like the accessible free cups, 
disk space and software tools.
Improving the Functionality and Customization of Scheduling
 Services for Grid Computing
Jingjing Sun, Supervised by
Dr. Paul Coddington, Dr. Andrew Wendelborn
School of Computer Science, University of Adelaide
Figure 1. Computing resources across ANG Grid, discovered by the customized metascheduler Figure 3. A typical customized job template 
including all kinds of user requirements
Acknowledgements
This project would not have been possible without Dr. Paul Coddington and Dr. Andrew 
Wendelborn's wise and patient guidance. Special thanks to the cooperation of the excellent 
SAPAC staff: Daniel Cox, Gerson Galang and Shunde Zhang, they made the whole things 
happen.
For more information about our project, please refer to the ANG­GridWay wiki:  
http://www.grid.apac.edu.au/repository/trac/APACGridway/wiki
Overview and Aims
By aggregating computing power, software tools, data storage systems and scientific 
instruments that are distributed in heterogeneous systems across multiple locations, 
Grid  Computing  promises  a  global  virtual  supercomputer  where  users  at  different 
physical locations can cooperate for a specific problem in a high performance, secure, 
reliable and cost­effective way. 
  Via grid resource virtualization technologies, Grid hides the large­scale, distributed 
and dynamic nature of the grid computing environment, thus creating a single system 
image, i.e., a single yet powerful virtual computer, and enabling the end­users (i.e., 
domain  experts)  to  fully  focus  on  problem  solving  rather  than  the  underlying 
technical details.
   As the underlying infrastructure of the Australian National Grid (ANG) is becoming 
increasingly complex and dynamic, it is no longer suitable and efficient for its users to 
manually  perform  computational  tasks  on  the  large­scale  and  heterogeneous  ANG 
Grid, therefore a metascheduling system is needed by the ANG Grid to provided its 
end­users  with  an  easy­to­use  and  automatic  job  execution  environment.  However, 
none  of  current  major  metaschedulers  (such  as  Condor­G,  GridBus  Broker  and 
GridWay,  etc)  support  the  latest  grid  information  standard,  i.e.,  Globus  MDS  4 
(Monitoring and Discovery Services) with GLUE Schema Specification 1.2, which is 
used  by  ANG  to  describe  and  publish  resource  information.  Moreover,  none  of  the 
these  metaschedulers  offer  the  functionalities  that  enable  the  end­users  to  specify 
software  requirements  and  other  desired  resources  that  they  are  allowed  to  use 
according to the local domain policies (which can be considered as a quota system), 
and  the  ability  which  helps  the  grid  administrator  to  appropriately  allocate  the 
resource  such  as  software  and  storage  usage  priority  to  different  ANG  user  group. 
These functionalities are highly desirable for building a completely virtualized grid 
environment in which the end­users are able to fully focus on problem solving rather 
than the  grid­specific technical details.
  This project aims to customize and deploy a metascheduling system (GridWay) for 
the ANG Grid by cooperating with the ANG Grid resource information service, thus 
virtualizing  the  high  performance  computing  resources  across  the  ANG  Grid  and 
providing  its  users  with  an  automatic  and  intelligent  job  execution  environment. 
Based on GridWay's basic scheduling framework, we added very important features 
to  achieve  the  real  virtualization  of  the  ANG  Grid  resource(see  Section 
Implementation  for  details).  By  using  the  GridWay  modified  and  customized  for 
ANG(i.e., ANG­GridWay), the end user just needs to specify the input data (what I 
have), the required resources (What I need) and optionally the user's user group(i.e., 
Virtual  Organization,  a.k.a.,  VO*)  identity  (Who  I  am),  the  scheduler  then 
appropriately  schedules  the  tasks  through  ANG  grid  and  hides  all  the  technical 
details such as requirements matchmaking, VO View checking, and failure handling. 
Our scheduling policies also allows the scheduler to choose a “preferred” queue for a 
specific  software.  *VO:  An  administrative  domain  with  a  separate  and  distinct  set  of 
administrative policies such as access control and resource usage quota allocation
Implementation
As mentioned in the first section, our implementation focused on adapting the original GridWay scheduling framework to 
current ANG Grid infrastructure, and virtualizing all computing resources across the ANG Grid to a unified resource pool. 
Based on GridWay’s original scheduling framework, we developed advanced functionalities to provide our end­users with an 
easy­to­use and intelligent job execution environment. We summarized our work that has been done so far into the following 
six categories: 
Information ModelInformation Model Work in this part mainly focused on adapting GridWay to smoothly cooperate with the information 
published by the ANG MDS information service. The ANG­GridWay was customized to cooperate with an extended version of 
MDS4(GLUE1.2) Schema (see Figure 4 for the information model) according to real experience of ANG users, the extended 
MDS4(GLUE1.2) resource information model allows our scheduler to accurately locate the target resource according to user 
requirements and grid domain policies. Figure 1 and Figure 2 show all the ANG computing resources discovered by the 
customized scheduler and a complete view of a particular resource, respectively.
Software RequirementsSoftware Requirements One of the most important functionalities the ANG­GridWay provides is enabling the end 
user to specify required software (then generating corresponding module extension in the corresponding RSL file).
VOView RequirementsVOView Requirements The VOView entity of GLUE Schema describes the resource information from a specific grid 
user's viewpoint, by making use of the VOView information, our scheduler presents the end user a “personalized” resource 
perspective, enabling more precise metascheduling.
Requirement Matchmaking AlgorithmRequirement Matchmaking Algorithm Based on the above work, an advanced matchmaking algorithm was 
developed to enable the scheduler to automatically search the resources that accurately satisfy user requirements. Our 
matchmaking algorithm allows user to specify multiple software requirements in a single task, which is a highly desirable 
feature which enables our user to design simple workflow requiring multiple softwares without the need of knowing where 
these softwares are located on the grid.
Software Priority on the ResourceSoftware Priority on the Resource According to the real cases of resource allocation and software usage on the 
ANG Grid, ANG­GridWay introduces a special functionality which enables software to run on a “preferred” resource according 
to a specified priority assigned to the given grid user.
Resource Failure HandlingResource Failure Handling Our new failure handling policy performs on queue level for a given user, which avoids 
waste of resources and enables fine­grained failure handling control on the resources.
The Grid and Grid Information Service
The most common description of Grid Computing is often compared with an electric 
power  grid,  through  which  we  consume  the  electrical  power  on  demand,  without 
knowing  where  and  how  the  energy  is  generated.  Similarly,  Grid  Computing 
technologies  hide  the  details  of  the  underlying  computing  resources  and  the 
complexity  of  how  these  resources  are  organized  and  how  computation  jobs  are 
scheduled,  thus  creating  a  single  and  unified  system  image,  as  a  result,  the  end 
users are able to perform computational tasks on the grid as if they were using a 
single yet powerful virtual computer. 
        The  Globus  MDS  services  provide  a  standardized  approach  for  grid  resource 
discovery  and  monitoring,  thus  playing  a  key  role  in  computing  resource 
virtualization.  More  detailly,  Grid  Laboratory  Uniform  Environment  (GLUE) 
schema  is  used  by  MDS  service  to  describe  the  grid  resources  in  a  precise  and 
systematic  manner,  thus  enabling  them  to  be  discovered  for  subsequent 
management or use such as task scheduling. By defining an information model at 
the conceptual level, GLUE schema specification abstracts the real world computing 
resources into constructs which can be represented in computer systems.
      Our metascheduler was customized to cooperate with MDS4(GLUE1.2) Schema 
extended  by  the  ANG  Grid  according  to  the  real  user  experience.  The  extended 
model of resource information is illustrated in Figure 4, the minimum discovery unit 
is  SubCluster  (a  homogeneous  computing  environment),  which  help  the 
metascheduler precisely locate the target resource according to user requirements. A 
complete view of a SubCluster (from ANG­GridWay's view point) is also shown in 
Figure 4.
Conclusion and Future Work
We successfully adapted GridWay to current ANG Grid infrastructure, our customizations 
have enabled GridWay to smoothly cooperate with the ANG MDS information service. The 
customized scheduler has very important and useful features that are not provided by the 
original  one  but  needed  by  the  ANG  Grid,  such  as  supporting  the  latest  information 
standards  and  the heterogeneous  information  model,  multiple software  requirements,  VO 
policies, software priority and fine­grained failure handling.
        The  result  of  our  work  is  a  metascheduling  system  which  fully  virtualizes  resources 
across the ANG Grid. All the details of resource discovery, requirements matchmaking, job 
submission  and  execution,  usage  and  scheduling  policy  control  and  failure  handling  are 
successfully hidden from the ANG users. The bottom line of what a user needs to do is just 
to  declare  the  required  software(s)  and  input  data  through  the  grid  portal.  Through  the 
resource virtualization of our metascheduling system, the ANG Grid presents its end users 
a  unified  super  virtual  computer  behind  which  are  various  dynamic,  heterogeneous  and 
geographically distributed grid sites. 
        As  the  underlying  grid  infrastructure  is  becoming  increasingly  complex  and  the  MDS 
GLUE is evolving towards a mature and flexible resource description model, there are a lot 
of  potential  improvements  based  on  current  ANG­GridWay,  like  adaptive  scheduling  by 
using  the  VOView  information,  advanced  resource  reservation  and  scientific  workflow 
planning and scheduling.
Figure 2. The complete view of a homogeneous 
cluster presented by ANG­GridWay, including 
environment information, queues and user VO 
views, available software tools and storage 
information.
Figure 4. The whole picture: The information model used by the ANG Grid and the 
resource virtualization via ANG­GridWay

More Related Content

What's hot

IEEE 2014 JAVA CLOUD COMPUTING PROJECTS Performance and cost evaluation of an...
IEEE 2014 JAVA CLOUD COMPUTING PROJECTS Performance and cost evaluation of an...IEEE 2014 JAVA CLOUD COMPUTING PROJECTS Performance and cost evaluation of an...
IEEE 2014 JAVA CLOUD COMPUTING PROJECTS Performance and cost evaluation of an...IEEEGLOBALSOFTSTUDENTPROJECTS
 
5. the grid implementing production grid
5. the grid implementing production grid5. the grid implementing production grid
5. the grid implementing production gridDr Sandeep Kumar Poonia
 
MongoDB .local London 2019: Best Practices for Working with IoT and Time-seri...
MongoDB .local London 2019: Best Practices for Working with IoT and Time-seri...MongoDB .local London 2019: Best Practices for Working with IoT and Time-seri...
MongoDB .local London 2019: Best Practices for Working with IoT and Time-seri...MongoDB
 
MongoDB .local Houston 2019: Best Practices for Working with IoT and Time-ser...
MongoDB .local Houston 2019: Best Practices for Working with IoT and Time-ser...MongoDB .local Houston 2019: Best Practices for Working with IoT and Time-ser...
MongoDB .local Houston 2019: Best Practices for Working with IoT and Time-ser...MongoDB
 
Workflow Scheduling Techniques and Algorithms in IaaS Cloud: A Survey
Workflow Scheduling Techniques and Algorithms in IaaS Cloud: A Survey Workflow Scheduling Techniques and Algorithms in IaaS Cloud: A Survey
Workflow Scheduling Techniques and Algorithms in IaaS Cloud: A Survey IJECEIAES
 
Mining Of Big Data Using Map-Reduce Theorem
Mining Of Big Data Using Map-Reduce TheoremMining Of Big Data Using Map-Reduce Theorem
Mining Of Big Data Using Map-Reduce TheoremIOSR Journals
 
Data-Intensive Technologies for Cloud Computing
Data-Intensive Technologies for CloudComputingData-Intensive Technologies for CloudComputing
Data-Intensive Technologies for Cloud Computinghuda2018
 
Data Partitioning in Mongo DB with Cloud
Data Partitioning in Mongo DB with CloudData Partitioning in Mongo DB with Cloud
Data Partitioning in Mongo DB with CloudIJAAS Team
 
First review presentation
First review presentationFirst review presentation
First review presentationArvind Krishnaa
 
Combiningefficiencyfidelityandflexibilityin 150511053028-lva1-app6892
Combiningefficiencyfidelityandflexibilityin 150511053028-lva1-app6892Combiningefficiencyfidelityandflexibilityin 150511053028-lva1-app6892
Combiningefficiencyfidelityandflexibilityin 150511053028-lva1-app6892Shakas Technologies
 
IEEE 2015 - 2016 | Combining Efficiency, Fidelity, and Flexibility in Resource...
IEEE 2015 - 2016 | Combining Efficiency, Fidelity, and Flexibility in Resource...IEEE 2015 - 2016 | Combining Efficiency, Fidelity, and Flexibility in Resource...
IEEE 2015 - 2016 | Combining Efficiency, Fidelity, and Flexibility in Resource...1crore projects
 
Comparative Analysis, Security Aspects & Optimization of Workload in Gfs Base...
Comparative Analysis, Security Aspects & Optimization of Workload in Gfs Base...Comparative Analysis, Security Aspects & Optimization of Workload in Gfs Base...
Comparative Analysis, Security Aspects & Optimization of Workload in Gfs Base...IOSR Journals
 
MUSYOP: Towards a Query Optimization for Heterogeneous Distributed Database S...
MUSYOP: Towards a Query Optimization for Heterogeneous Distributed Database S...MUSYOP: Towards a Query Optimization for Heterogeneous Distributed Database S...
MUSYOP: Towards a Query Optimization for Heterogeneous Distributed Database S...Institute of Information Systems (HES-SO)
 
COMBINING EFFICIENCY, FIDELITY, AND FLEXIBILITY IN RESOURCE INFORMATION SERV...
 COMBINING EFFICIENCY, FIDELITY, AND FLEXIBILITY IN RESOURCE INFORMATION SERV... COMBINING EFFICIENCY, FIDELITY, AND FLEXIBILITY IN RESOURCE INFORMATION SERV...
COMBINING EFFICIENCY, FIDELITY, AND FLEXIBILITY IN RESOURCE INFORMATION SERV...Nexgen Technology
 

What's hot (18)

IEEE 2014 JAVA CLOUD COMPUTING PROJECTS Performance and cost evaluation of an...
IEEE 2014 JAVA CLOUD COMPUTING PROJECTS Performance and cost evaluation of an...IEEE 2014 JAVA CLOUD COMPUTING PROJECTS Performance and cost evaluation of an...
IEEE 2014 JAVA CLOUD COMPUTING PROJECTS Performance and cost evaluation of an...
 
5. the grid implementing production grid
5. the grid implementing production grid5. the grid implementing production grid
5. the grid implementing production grid
 
MongoDB .local London 2019: Best Practices for Working with IoT and Time-seri...
MongoDB .local London 2019: Best Practices for Working with IoT and Time-seri...MongoDB .local London 2019: Best Practices for Working with IoT and Time-seri...
MongoDB .local London 2019: Best Practices for Working with IoT and Time-seri...
 
MongoDB .local Houston 2019: Best Practices for Working with IoT and Time-ser...
MongoDB .local Houston 2019: Best Practices for Working with IoT and Time-ser...MongoDB .local Houston 2019: Best Practices for Working with IoT and Time-ser...
MongoDB .local Houston 2019: Best Practices for Working with IoT and Time-ser...
 
Ogsa
OgsaOgsa
Ogsa
 
Workflow Scheduling Techniques and Algorithms in IaaS Cloud: A Survey
Workflow Scheduling Techniques and Algorithms in IaaS Cloud: A Survey Workflow Scheduling Techniques and Algorithms in IaaS Cloud: A Survey
Workflow Scheduling Techniques and Algorithms in IaaS Cloud: A Survey
 
Cs6703 grid and cloud computing unit 2
Cs6703 grid and cloud computing unit 2Cs6703 grid and cloud computing unit 2
Cs6703 grid and cloud computing unit 2
 
Mining Of Big Data Using Map-Reduce Theorem
Mining Of Big Data Using Map-Reduce TheoremMining Of Big Data Using Map-Reduce Theorem
Mining Of Big Data Using Map-Reduce Theorem
 
Fault tolerance on cloud computing
Fault tolerance on cloud computingFault tolerance on cloud computing
Fault tolerance on cloud computing
 
Data-Intensive Technologies for Cloud Computing
Data-Intensive Technologies for CloudComputingData-Intensive Technologies for CloudComputing
Data-Intensive Technologies for Cloud Computing
 
Data Partitioning in Mongo DB with Cloud
Data Partitioning in Mongo DB with CloudData Partitioning in Mongo DB with Cloud
Data Partitioning in Mongo DB with Cloud
 
First review presentation
First review presentationFirst review presentation
First review presentation
 
International Journal of Engineering Inventions (IJEI)
International Journal of Engineering Inventions (IJEI)International Journal of Engineering Inventions (IJEI)
International Journal of Engineering Inventions (IJEI)
 
Combiningefficiencyfidelityandflexibilityin 150511053028-lva1-app6892
Combiningefficiencyfidelityandflexibilityin 150511053028-lva1-app6892Combiningefficiencyfidelityandflexibilityin 150511053028-lva1-app6892
Combiningefficiencyfidelityandflexibilityin 150511053028-lva1-app6892
 
IEEE 2015 - 2016 | Combining Efficiency, Fidelity, and Flexibility in Resource...
IEEE 2015 - 2016 | Combining Efficiency, Fidelity, and Flexibility in Resource...IEEE 2015 - 2016 | Combining Efficiency, Fidelity, and Flexibility in Resource...
IEEE 2015 - 2016 | Combining Efficiency, Fidelity, and Flexibility in Resource...
 
Comparative Analysis, Security Aspects & Optimization of Workload in Gfs Base...
Comparative Analysis, Security Aspects & Optimization of Workload in Gfs Base...Comparative Analysis, Security Aspects & Optimization of Workload in Gfs Base...
Comparative Analysis, Security Aspects & Optimization of Workload in Gfs Base...
 
MUSYOP: Towards a Query Optimization for Heterogeneous Distributed Database S...
MUSYOP: Towards a Query Optimization for Heterogeneous Distributed Database S...MUSYOP: Towards a Query Optimization for Heterogeneous Distributed Database S...
MUSYOP: Towards a Query Optimization for Heterogeneous Distributed Database S...
 
COMBINING EFFICIENCY, FIDELITY, AND FLEXIBILITY IN RESOURCE INFORMATION SERV...
 COMBINING EFFICIENCY, FIDELITY, AND FLEXIBILITY IN RESOURCE INFORMATION SERV... COMBINING EFFICIENCY, FIDELITY, AND FLEXIBILITY IN RESOURCE INFORMATION SERV...
COMBINING EFFICIENCY, FIDELITY, AND FLEXIBILITY IN RESOURCE INFORMATION SERV...
 

Viewers also liked

Topology Aware Resource Allocation
Topology Aware Resource AllocationTopology Aware Resource Allocation
Topology Aware Resource AllocationSujith Jay Nair
 
Realtime scheduling for virtual machines in SKT
Realtime scheduling for virtual machines in SKTRealtime scheduling for virtual machines in SKT
Realtime scheduling for virtual machines in SKTThe Linux Foundation
 
Toward a practical “HPC Cloud”: Performance tuning of a virtualized HPC cluster
Toward a practical “HPC Cloud”: Performance tuning of a virtualized HPC clusterToward a practical “HPC Cloud”: Performance tuning of a virtualized HPC cluster
Toward a practical “HPC Cloud”: Performance tuning of a virtualized HPC clusterRyousei Takano
 
Cloud Comp Challenges
Cloud Comp ChallengesCloud Comp Challenges
Cloud Comp Challengesbefreax
 
High Performance Computing - Cloud Point of View
High Performance Computing - Cloud Point of ViewHigh Performance Computing - Cloud Point of View
High Performance Computing - Cloud Point of Viewaragozin
 
хичээлийн тодорхойлолт
хичээлийн тодорхойлолтхичээлийн тодорхойлолт
хичээлийн тодорхойлолтdavaa627
 
Key Challenges in Cloud Computing and How Yahoo! is Approaching Them
Key Challenges in Cloud Computing and How Yahoo! is Approaching ThemKey Challenges in Cloud Computing and How Yahoo! is Approaching Them
Key Challenges in Cloud Computing and How Yahoo! is Approaching ThemYahoo Developer Network
 
Challenges and Issues of Next Cloud Computing Platforms
Challenges and Issues of Next Cloud Computing PlatformsChallenges and Issues of Next Cloud Computing Platforms
Challenges and Issues of Next Cloud Computing PlatformsFrederic Desprez
 
Optimization of Resource Provisioning Cost in Cloud Computing
Optimization of Resource Provisioning Cost in Cloud ComputingOptimization of Resource Provisioning Cost in Cloud Computing
Optimization of Resource Provisioning Cost in Cloud ComputingAswin Kalarickal
 
Cloud computing: Trends and Challenges
Cloud computing: Trends and ChallengesCloud computing: Trends and Challenges
Cloud computing: Trends and ChallengesBig Data Colombia
 
Ramco ERP on Cloud - The Best Cloud Computing Solution Worldwide
Ramco ERP on Cloud - The Best Cloud Computing Solution Worldwide Ramco ERP on Cloud - The Best Cloud Computing Solution Worldwide
Ramco ERP on Cloud - The Best Cloud Computing Solution Worldwide Ramco Systems
 

Viewers also liked (13)

Topology Aware Resource Allocation
Topology Aware Resource AllocationTopology Aware Resource Allocation
Topology Aware Resource Allocation
 
Realtime scheduling for virtual machines in SKT
Realtime scheduling for virtual machines in SKTRealtime scheduling for virtual machines in SKT
Realtime scheduling for virtual machines in SKT
 
Toward a practical “HPC Cloud”: Performance tuning of a virtualized HPC cluster
Toward a practical “HPC Cloud”: Performance tuning of a virtualized HPC clusterToward a practical “HPC Cloud”: Performance tuning of a virtualized HPC cluster
Toward a practical “HPC Cloud”: Performance tuning of a virtualized HPC cluster
 
Cloud Comp Challenges
Cloud Comp ChallengesCloud Comp Challenges
Cloud Comp Challenges
 
Virtualization for HPC at NCI
Virtualization for HPC at NCIVirtualization for HPC at NCI
Virtualization for HPC at NCI
 
High Performance Computing - Cloud Point of View
High Performance Computing - Cloud Point of ViewHigh Performance Computing - Cloud Point of View
High Performance Computing - Cloud Point of View
 
хичээлийн тодорхойлолт
хичээлийн тодорхойлолтхичээлийн тодорхойлолт
хичээлийн тодорхойлолт
 
Key Challenges in Cloud Computing and How Yahoo! is Approaching Them
Key Challenges in Cloud Computing and How Yahoo! is Approaching ThemKey Challenges in Cloud Computing and How Yahoo! is Approaching Them
Key Challenges in Cloud Computing and How Yahoo! is Approaching Them
 
Challenges and Issues of Next Cloud Computing Platforms
Challenges and Issues of Next Cloud Computing PlatformsChallenges and Issues of Next Cloud Computing Platforms
Challenges and Issues of Next Cloud Computing Platforms
 
"Cloud Computing for HPC"
"Cloud Computing for HPC""Cloud Computing for HPC"
"Cloud Computing for HPC"
 
Optimization of Resource Provisioning Cost in Cloud Computing
Optimization of Resource Provisioning Cost in Cloud ComputingOptimization of Resource Provisioning Cost in Cloud Computing
Optimization of Resource Provisioning Cost in Cloud Computing
 
Cloud computing: Trends and Challenges
Cloud computing: Trends and ChallengesCloud computing: Trends and Challenges
Cloud computing: Trends and Challenges
 
Ramco ERP on Cloud - The Best Cloud Computing Solution Worldwide
Ramco ERP on Cloud - The Best Cloud Computing Solution Worldwide Ramco ERP on Cloud - The Best Cloud Computing Solution Worldwide
Ramco ERP on Cloud - The Best Cloud Computing Solution Worldwide
 

Similar to ANG-GridWay-Poster-Final-Colorful-Bright-Final0

Inroduction to grid computing by gargi shankar verma
Inroduction to grid computing by gargi shankar vermaInroduction to grid computing by gargi shankar verma
Inroduction to grid computing by gargi shankar vermagargishankar1981
 
Distributed computing file
Distributed computing fileDistributed computing file
Distributed computing fileberasrujana
 
An Exploration of Grid Computing to be Utilized in Teaching and Research at TU
An Exploration of Grid Computing to be Utilized in Teaching and Research at TUAn Exploration of Grid Computing to be Utilized in Teaching and Research at TU
An Exploration of Grid Computing to be Utilized in Teaching and Research at TUEswar Publications
 
B036407011
B036407011B036407011
B036407011theijes
 
cuSTINGER: Supporting Dynamic Graph Aigorithms for GPUs : NOTES
cuSTINGER: Supporting Dynamic Graph Aigorithms for GPUs : NOTEScuSTINGER: Supporting Dynamic Graph Aigorithms for GPUs : NOTES
cuSTINGER: Supporting Dynamic Graph Aigorithms for GPUs : NOTESSubhajit Sahu
 
cuSTINGER: Supporting Dynamic Graph Aigorithms for GPUs (NOTES)
cuSTINGER: Supporting Dynamic Graph Aigorithms for GPUs (NOTES)cuSTINGER: Supporting Dynamic Graph Aigorithms for GPUs (NOTES)
cuSTINGER: Supporting Dynamic Graph Aigorithms for GPUs (NOTES)Subhajit Sahu
 
A CLOUD BASED ARCHITECTURE FOR WORKING ON BIG DATA WITH WORKFLOW MANAGEMENT
A CLOUD BASED ARCHITECTURE FOR WORKING ON BIG DATA WITH WORKFLOW MANAGEMENTA CLOUD BASED ARCHITECTURE FOR WORKING ON BIG DATA WITH WORKFLOW MANAGEMENT
A CLOUD BASED ARCHITECTURE FOR WORKING ON BIG DATA WITH WORKFLOW MANAGEMENTIJwest
 
Grid computing standards
Grid computing standardsGrid computing standards
Grid computing standardsPooja Dixit
 
Unit i introduction to grid computing
Unit i   introduction to grid computingUnit i   introduction to grid computing
Unit i introduction to grid computingsudha kar
 
Bt9002 grid computing 1
Bt9002 grid computing 1Bt9002 grid computing 1
Bt9002 grid computing 1Techglyphs
 
Grid Computing
Grid ComputingGrid Computing
Grid Computingabhiritva
 
Grid proxy architecture for network resources
Grid proxy architecture for network resourcesGrid proxy architecture for network resources
Grid proxy architecture for network resourcesTal Lavian Ph.D.
 
Implementation of p pic algorithm in map reduce to handle big data
Implementation of p pic algorithm in map reduce to handle big dataImplementation of p pic algorithm in map reduce to handle big data
Implementation of p pic algorithm in map reduce to handle big dataeSAT Publishing House
 
IRJET - Evaluating and Comparing the Two Variation with Current Scheduling Al...
IRJET - Evaluating and Comparing the Two Variation with Current Scheduling Al...IRJET - Evaluating and Comparing the Two Variation with Current Scheduling Al...
IRJET - Evaluating and Comparing the Two Variation with Current Scheduling Al...IRJET Journal
 

Similar to ANG-GridWay-Poster-Final-Colorful-Bright-Final0 (20)

Inroduction to grid computing by gargi shankar verma
Inroduction to grid computing by gargi shankar vermaInroduction to grid computing by gargi shankar verma
Inroduction to grid computing by gargi shankar verma
 
Distributed computing file
Distributed computing fileDistributed computing file
Distributed computing file
 
Grid computing
Grid computingGrid computing
Grid computing
 
An Exploration of Grid Computing to be Utilized in Teaching and Research at TU
An Exploration of Grid Computing to be Utilized in Teaching and Research at TUAn Exploration of Grid Computing to be Utilized in Teaching and Research at TU
An Exploration of Grid Computing to be Utilized in Teaching and Research at TU
 
Grid Presentation
Grid PresentationGrid Presentation
Grid Presentation
 
GRID COMPUTING
GRID COMPUTINGGRID COMPUTING
GRID COMPUTING
 
B036407011
B036407011B036407011
B036407011
 
cuSTINGER: Supporting Dynamic Graph Aigorithms for GPUs : NOTES
cuSTINGER: Supporting Dynamic Graph Aigorithms for GPUs : NOTEScuSTINGER: Supporting Dynamic Graph Aigorithms for GPUs : NOTES
cuSTINGER: Supporting Dynamic Graph Aigorithms for GPUs : NOTES
 
cuSTINGER: Supporting Dynamic Graph Aigorithms for GPUs (NOTES)
cuSTINGER: Supporting Dynamic Graph Aigorithms for GPUs (NOTES)cuSTINGER: Supporting Dynamic Graph Aigorithms for GPUs (NOTES)
cuSTINGER: Supporting Dynamic Graph Aigorithms for GPUs (NOTES)
 
A CLOUD BASED ARCHITECTURE FOR WORKING ON BIG DATA WITH WORKFLOW MANAGEMENT
A CLOUD BASED ARCHITECTURE FOR WORKING ON BIG DATA WITH WORKFLOW MANAGEMENTA CLOUD BASED ARCHITECTURE FOR WORKING ON BIG DATA WITH WORKFLOW MANAGEMENT
A CLOUD BASED ARCHITECTURE FOR WORKING ON BIG DATA WITH WORKFLOW MANAGEMENT
 
Grid computing standards
Grid computing standardsGrid computing standards
Grid computing standards
 
A01260104
A01260104A01260104
A01260104
 
Unit i introduction to grid computing
Unit i   introduction to grid computingUnit i   introduction to grid computing
Unit i introduction to grid computing
 
Bt9002 grid computing 1
Bt9002 grid computing 1Bt9002 grid computing 1
Bt9002 grid computing 1
 
Grid Computing
Grid ComputingGrid Computing
Grid Computing
 
Grid computing 12
Grid computing 12Grid computing 12
Grid computing 12
 
Grid Computing
Grid ComputingGrid Computing
Grid Computing
 
Grid proxy architecture for network resources
Grid proxy architecture for network resourcesGrid proxy architecture for network resources
Grid proxy architecture for network resources
 
Implementation of p pic algorithm in map reduce to handle big data
Implementation of p pic algorithm in map reduce to handle big dataImplementation of p pic algorithm in map reduce to handle big data
Implementation of p pic algorithm in map reduce to handle big data
 
IRJET - Evaluating and Comparing the Two Variation with Current Scheduling Al...
IRJET - Evaluating and Comparing the Two Variation with Current Scheduling Al...IRJET - Evaluating and Comparing the Two Variation with Current Scheduling Al...
IRJET - Evaluating and Comparing the Two Variation with Current Scheduling Al...
 

ANG-GridWay-Poster-Final-Colorful-Bright-Final0

  • 1. Metascheduling on the Grid In the grid environment, a metascheduler is also known as global scheduler, which  coordinates  communications  between  multiple  heterogeneous  schedulers  (local  schedulers) that operate at the local or cluster level.      By cooperating with Grid Information Services, which abstract the information of  all the available resources on the grid into a sing le resource perspective, the grid  metascheduler presents the end user a single virtual resource pool, which hides all  the  details  of  scheduling  and  monitoring  jobs  in  the  dynamic  and  heterogeneous  environment.     From  the  end  user's  viewpoint,  a  fully  virtualized  grid  is  like  a  single  supercomputer, and as a proxy or entry point to the grid, the metascheduler should  hide grid­specific technical details to the most extent. A good metascheduler should  only  ask  the  end  user  (without  any  grid  specific  knowledge)  three  questions  (see  Figure 3 for an example description) WHAT  DO  YOU  NEED?  (Requirements)WHAT  DO  YOU  NEED?  (Requirements)  This  usually  involves  requirement matchmaking between known resources on the grid and the resources  required by the end user, such as a particular software, disk space, and the forth. WHAT  DO  YOU  HAVE?  (Input)WHAT  DO  YOU  HAVE?  (Input)  This  usually  needs  the  end  user  to  specify the input of the job, such as input files on the local or remote machine, or  just the arguments of the executable, or both. WHO ARE YOU? (VO Identity*)WHO ARE YOU? (VO Identity*) This usually needs the user to declare  his or her VO identity, which enables the scheduler to analyse resource information  from the user's view point. * A grid user's VO identity is not used for authentication or authorization, instead, by providing  the VO identity, we can retrieve user­specific resource information like the accessible free cups,  disk space and software tools. Improving the Functionality and Customization of Scheduling  Services for Grid Computing Jingjing Sun, Supervised by Dr. Paul Coddington, Dr. Andrew Wendelborn School of Computer Science, University of Adelaide Figure 1. Computing resources across ANG Grid, discovered by the customized metascheduler Figure 3. A typical customized job template  including all kinds of user requirements Acknowledgements This project would not have been possible without Dr. Paul Coddington and Dr. Andrew  Wendelborn's wise and patient guidance. Special thanks to the cooperation of the excellent  SAPAC staff: Daniel Cox, Gerson Galang and Shunde Zhang, they made the whole things  happen. For more information about our project, please refer to the ANG­GridWay wiki:   http://www.grid.apac.edu.au/repository/trac/APACGridway/wiki Overview and Aims By aggregating computing power, software tools, data storage systems and scientific  instruments that are distributed in heterogeneous systems across multiple locations,  Grid  Computing  promises  a  global  virtual  supercomputer  where  users  at  different  physical locations can cooperate for a specific problem in a high performance, secure,  reliable and cost­effective way.    Via grid resource virtualization technologies, Grid hides the large­scale, distributed  and dynamic nature of the grid computing environment, thus creating a single system  image, i.e., a single yet powerful virtual computer, and enabling the end­users (i.e.,  domain  experts)  to  fully  focus  on  problem  solving  rather  than  the  underlying  technical details.    As the underlying infrastructure of the Australian National Grid (ANG) is becoming  increasingly complex and dynamic, it is no longer suitable and efficient for its users to  manually  perform  computational  tasks  on  the  large­scale  and  heterogeneous  ANG  Grid, therefore a metascheduling system is needed by the ANG Grid to provided its  end­users  with  an  easy­to­use  and  automatic  job  execution  environment.  However,  none  of  current  major  metaschedulers  (such  as  Condor­G,  GridBus  Broker  and  GridWay,  etc)  support  the  latest  grid  information  standard,  i.e.,  Globus  MDS  4  (Monitoring and Discovery Services) with GLUE Schema Specification 1.2, which is  used  by  ANG  to  describe  and  publish  resource  information.  Moreover,  none  of  the  these  metaschedulers  offer  the  functionalities  that  enable  the  end­users  to  specify  software  requirements  and  other  desired  resources  that  they  are  allowed  to  use  according to the local domain policies (which can be considered as a quota system),  and  the  ability  which  helps  the  grid  administrator  to  appropriately  allocate  the  resource  such  as  software  and  storage  usage  priority  to  different  ANG  user  group.  These functionalities are highly desirable for building a completely virtualized grid  environment in which the end­users are able to fully focus on problem solving rather  than the  grid­specific technical details.   This project aims to customize and deploy a metascheduling system (GridWay) for  the ANG Grid by cooperating with the ANG Grid resource information service, thus  virtualizing  the  high  performance  computing  resources  across  the  ANG  Grid  and  providing  its  users  with  an  automatic  and  intelligent  job  execution  environment.  Based on GridWay's basic scheduling framework, we added very important features  to  achieve  the  real  virtualization  of  the  ANG  Grid  resource(see  Section  Implementation  for  details).  By  using  the  GridWay  modified  and  customized  for  ANG(i.e., ANG­GridWay), the end user just needs to specify the input data (what I  have), the required resources (What I need) and optionally the user's user group(i.e.,  Virtual  Organization,  a.k.a.,  VO*)  identity  (Who  I  am),  the  scheduler  then  appropriately  schedules  the  tasks  through  ANG  grid  and  hides  all  the  technical  details such as requirements matchmaking, VO View checking, and failure handling.  Our scheduling policies also allows the scheduler to choose a “preferred” queue for a  specific  software.  *VO:  An  administrative  domain  with  a  separate  and  distinct  set  of  administrative policies such as access control and resource usage quota allocation Implementation As mentioned in the first section, our implementation focused on adapting the original GridWay scheduling framework to  current ANG Grid infrastructure, and virtualizing all computing resources across the ANG Grid to a unified resource pool.  Based on GridWay’s original scheduling framework, we developed advanced functionalities to provide our end­users with an  easy­to­use and intelligent job execution environment. We summarized our work that has been done so far into the following  six categories:  Information ModelInformation Model Work in this part mainly focused on adapting GridWay to smoothly cooperate with the information  published by the ANG MDS information service. The ANG­GridWay was customized to cooperate with an extended version of  MDS4(GLUE1.2) Schema (see Figure 4 for the information model) according to real experience of ANG users, the extended  MDS4(GLUE1.2) resource information model allows our scheduler to accurately locate the target resource according to user  requirements and grid domain policies. Figure 1 and Figure 2 show all the ANG computing resources discovered by the  customized scheduler and a complete view of a particular resource, respectively. Software RequirementsSoftware Requirements One of the most important functionalities the ANG­GridWay provides is enabling the end  user to specify required software (then generating corresponding module extension in the corresponding RSL file). VOView RequirementsVOView Requirements The VOView entity of GLUE Schema describes the resource information from a specific grid  user's viewpoint, by making use of the VOView information, our scheduler presents the end user a “personalized” resource  perspective, enabling more precise metascheduling. Requirement Matchmaking AlgorithmRequirement Matchmaking Algorithm Based on the above work, an advanced matchmaking algorithm was  developed to enable the scheduler to automatically search the resources that accurately satisfy user requirements. Our  matchmaking algorithm allows user to specify multiple software requirements in a single task, which is a highly desirable  feature which enables our user to design simple workflow requiring multiple softwares without the need of knowing where  these softwares are located on the grid. Software Priority on the ResourceSoftware Priority on the Resource According to the real cases of resource allocation and software usage on the  ANG Grid, ANG­GridWay introduces a special functionality which enables software to run on a “preferred” resource according  to a specified priority assigned to the given grid user. Resource Failure HandlingResource Failure Handling Our new failure handling policy performs on queue level for a given user, which avoids  waste of resources and enables fine­grained failure handling control on the resources. The Grid and Grid Information Service The most common description of Grid Computing is often compared with an electric  power  grid,  through  which  we  consume  the  electrical  power  on  demand,  without  knowing  where  and  how  the  energy  is  generated.  Similarly,  Grid  Computing  technologies  hide  the  details  of  the  underlying  computing  resources  and  the  complexity  of  how  these  resources  are  organized  and  how  computation  jobs  are  scheduled,  thus  creating  a  single  and  unified  system  image,  as  a  result,  the  end  users are able to perform computational tasks on the grid as if they were using a  single yet powerful virtual computer.          The  Globus  MDS  services  provide  a  standardized  approach  for  grid  resource  discovery  and  monitoring,  thus  playing  a  key  role  in  computing  resource  virtualization.  More  detailly,  Grid  Laboratory  Uniform  Environment  (GLUE)  schema  is  used  by  MDS  service  to  describe  the  grid  resources  in  a  precise  and  systematic  manner,  thus  enabling  them  to  be  discovered  for  subsequent  management or use such as task scheduling. By defining an information model at  the conceptual level, GLUE schema specification abstracts the real world computing  resources into constructs which can be represented in computer systems.       Our metascheduler was customized to cooperate with MDS4(GLUE1.2) Schema  extended  by  the  ANG  Grid  according  to  the  real  user  experience.  The  extended  model of resource information is illustrated in Figure 4, the minimum discovery unit  is  SubCluster  (a  homogeneous  computing  environment),  which  help  the  metascheduler precisely locate the target resource according to user requirements. A  complete view of a SubCluster (from ANG­GridWay's view point) is also shown in  Figure 4. Conclusion and Future Work We successfully adapted GridWay to current ANG Grid infrastructure, our customizations  have enabled GridWay to smoothly cooperate with the ANG MDS information service. The  customized scheduler has very important and useful features that are not provided by the  original  one  but  needed  by  the  ANG  Grid,  such  as  supporting  the  latest  information  standards  and  the heterogeneous  information  model,  multiple software  requirements,  VO  policies, software priority and fine­grained failure handling.         The  result  of  our  work  is  a  metascheduling  system  which  fully  virtualizes  resources  across the ANG Grid. All the details of resource discovery, requirements matchmaking, job  submission  and  execution,  usage  and  scheduling  policy  control  and  failure  handling  are  successfully hidden from the ANG users. The bottom line of what a user needs to do is just  to  declare  the  required  software(s)  and  input  data  through  the  grid  portal.  Through  the  resource virtualization of our metascheduling system, the ANG Grid presents its end users  a  unified  super  virtual  computer  behind  which  are  various  dynamic,  heterogeneous  and  geographically distributed grid sites.          As  the  underlying  grid  infrastructure  is  becoming  increasingly  complex  and  the  MDS  GLUE is evolving towards a mature and flexible resource description model, there are a lot  of  potential  improvements  based  on  current  ANG­GridWay,  like  adaptive  scheduling  by  using  the  VOView  information,  advanced  resource  reservation  and  scientific  workflow  planning and scheduling. Figure 2. The complete view of a homogeneous  cluster presented by ANG­GridWay, including  environment information, queues and user VO  views, available software tools and storage  information. Figure 4. The whole picture: The information model used by the ANG Grid and the  resource virtualization via ANG­GridWay