Korea Institute of Marine Science & Technology Promotion (KIMST)
Cluster and Grid Computing
2013.10.6
Sayed Chhattan Shah, PhD
Senior Researcher
Electronics and Telecommunications Research Institute, Korea
Outline
 Cluster Computing
 Architecture
 Key Components
 Grid Computing
 Architecture
 Key Components
 Resource Management
• Discovery
• QoS Support
• Scheduling
Cluster Computing
Cluster
A type of distributed system
A collection of workstations or PCs that are
interconnected by a high-speed network
Work as an integrated collection of resources
Have a single system image spanning all its nodes
Cluster Computer Architecture
[Figure: cluster computer architecture with the following components]
 Sequential Applications
 Parallel Applications
 Parallel Programming Environment
 Cluster Middleware (Single System Image and Availability Infrastructure)
 Cluster Interconnection Network/Switch
 PC/Workstation nodes, each with Network Interface Hardware and Communications Software
Prominent Components of Cluster Computers
Multiple High Performance Computers
 PCs
 Workstations
State of the art Operating Systems
 Linux (MOSIX, Beowulf, and many more)
 Microsoft NT (Illinois HPVM, Cornell Velocity)
 SUN Solaris (Berkeley NOW, C-DAC PARAM)
 IBM AIX (IBM SP2)
High Performance Networks
 Ethernet (10 Mbps)
 Fast Ethernet (100 Mbps)
 Gigabit Ethernet (1 Gbps)
 SCI (Scalable Coherent Interface; 12 µs MPI latency)
 ATM (Asynchronous Transfer Mode)
 Myrinet (1.2 Gbps)
 Digital Memory Channel
 FDDI (Fiber Distributed Data Interface)
 InfiniBand
Fast Communication Protocols and Services
 Active Messages (Berkeley)
 Fast Messages (Illinois)
 U-net (Cornell)
 XTP (Virginia)
 Virtual Interface Architecture (VIA)
Interconnect | Myrinet | QSnet | Giganet | ServerNet2 | SCI | Gigabit Ethernet
Bandwidth (MBytes/s) | 140 at 33 MHz, 215 at 66 MHz | 208 | ~105 | 165 | ~80 | 30 - 50
MPI latency (µs) | 16.5 at 33 MHz, 11 at 66 MHz | 5 | ~20 - 40 | 20.2 | 6 | 100 - 200
List price/port | $1.5K | $6.5K | $1.5K | ~$1.5K | ~$1.5K | ~$1.5K
Hardware availability | Now | Now | Now | Q2'00 | Now | Now
Linux support | Now | Late'00 | Now | Q2'00 | Now | Now
Maximum #nodes | 1000's | 1000's | 1000's | 64K | 1000's | 1000's
Protocol implementation | Firmware on adapter | Firmware on adapter | Firmware on adapter | Implemented in hardware | Firmware on adapter | Implemented in hardware
VIA support | Soon | None | NT/Linux | Done in hardware | Software TCP/IP, VIA | NT/Linux
MPI support | 3rd party | Quadrics/Compaq | 3rd party | Compaq/3rd party | 3rd party | MPICH - TCP/IP
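To see how the latency and bandwidth figures in the interconnect comparison interact, a rough sketch can estimate message transfer time as latency plus size divided by bandwidth. This first-order model is an assumption and is not stated in the slides. The function name `transfer_time_us` is hypothetical, and the Gigabit Ethernet entry uses the best-case ends of the quoted ranges.

```python
def transfer_time_us(message_bytes, latency_us, bandwidth_mbytes_per_s):
    """Estimated one-way transfer time in microseconds.

    Assumed model: time = latency + size / bandwidth.
    1 MByte/s equals 1 byte/µs when 1 MByte is taken as 10**6 bytes,
    so bytes / (MBytes/s) is already in microseconds.
    """
    return latency_us + message_bytes / bandwidth_mbytes_per_s


# (MPI latency in µs, bandwidth in MBytes/s) as quoted in the comparison.
interconnects = {
    "QSnet": (5.0, 208.0),
    "SCI": (6.0, 80.0),
    "Gigabit Ethernet": (100.0, 50.0),  # best case of 100-200 µs, 30-50 MBytes/s
}

for size in (1_000, 1_000_000):
    for name, (latency, bandwidth) in interconnects.items():
        estimate = transfer_time_us(size, latency, bandwidth)
        print(f"{name:17s} {size:>9d} B: {estimate:10.1f} µs")
```

Under this model, small messages are dominated by latency and large messages by bandwidth, which is why both rows of the comparison matter when choosing an interconnect.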
Cluster Middleware
 Resource management and scheduling
 Fault handling
 Migration
 Load balancing
Grid Computing
Overview: Clusters x GridsCluster - How can we use local networked resources
to achieve better performance for large scale
applications?
 High speed networks
 Centralized resource and task management
How can we put together geographically distributed
resources to achieve even better results?
 Distributed resource and task management
 No high speed connections
Grid Computing: Basic Idea
 Computing power should be available on demand, for a fee
 Just like the electrical power grid
[Figure: information generators, information distributed over the Grid, customer access to information]
Grid and Cluster
Core networking technology now accelerates at a much
faster rate than advances in microprocessor speeds
Exploiting underutilized resources
Parallel CPU capacity
Access to additional resources
Why Grid Computing?
Grid Computing
 Several clusters in Grid
 May include supercomputers, desktops, laptops, mobile devices
1800 Physicists, 150 Institutes, 32 Countries
100 PB of data by 2010; 50,000 CPUs?
CERN's Large Hadron Collider
Data Grids for High Energy Physics
[Figure: tiered data grid]
 Online System, feeding an Offline Processor Farm (~20 TIPS)
 Tier 0: CERN Computer Centre
 Tier 1: regional centres (FermiLab ~4 TIPS, France, Italy, Germany)
 Tier 2: Tier2 Centres (~1 TIPS each), including Caltech (~1 TIPS)
 Institutes (~0.25 TIPS), each with a physics data cache
 Tier 4: physicist workstations
 Link rates labelled in the figure: ~PBytes/sec, ~100 MBytes/sec, ~622 Mbits/sec (or Air Freight, deprecated), ~1 MBytes/sec
 There is a "bunch crossing" every 25 nsecs
 There are 100 "triggers" per second
 Each triggered event is ~1 MByte in size
 Physicists work on analysis "channels"
 Each institute will have ~10 physicists working on one or more
channels; data for these channels should be cached by the
institute server
 1 TIPS is approximately 25,000 SpecInt95 equivalents
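The trigger figures quoted for the LHC data grid imply a sustained data rate that can be worked out directly. The short calculation below is only an illustration: the constant and function names are hypothetical, and 1 TByte is taken as 10**6 MBytes.

```python
TRIGGERS_PER_SECOND = 100      # "100 triggers per second"
EVENT_SIZE_MBYTES = 1          # "each triggered event is ~1 MByte"
SECONDS_PER_DAY = 24 * 60 * 60

# Sustained rate of triggered data, in MBytes per second.
rate_mbytes_per_s = TRIGGERS_PER_SECOND * EVENT_SIZE_MBYTES

# Volume accumulated in one day, in TBytes (1 TByte = 10**6 MBytes assumed).
per_day_tbytes = rate_mbytes_per_s * SECONDS_PER_DAY / 1_000_000


def crossings_per_second(spacing_ns=25):
    """Number of bunch crossings per second for a given spacing in ns."""
    return 1_000_000_000 // spacing_ns


print(rate_mbytes_per_s, "MBytes/s")
print(per_day_tbytes, "TBytes/day")
print(crossings_per_second(), "bunch crossings/s")
```

The resulting 100 MBytes/s is consistent with the ~100 MBytes/sec rate labelled in the figure, and the 40 million crossings per second against 100 triggers per second shows how strongly the trigger filters the raw event stream.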
Grid Components
[Figure: layered grid architecture]
 Grid Fabric: Networked Resources across Organisations (Computers, Clusters, Data Sources, Scientific Instruments, Storage Systems) and Local Resource Managers (Operating Systems, Queuing Systems, TCP/IP & UDP, Libraries & App Kernels, …)
 Grid Middleware: Distributed Resources Coupling Services (Security, Information, Process, QoS, Resource Trading, Market Info, …)
 Grid Tools: Development Environments and Tools (Languages, Libraries, Debuggers, Web tools, Resource Brokers, Monitoring, …)
 Grid Apps.: Applications and Portals (Prob. Solving Env., Scientific, Collaboration, Engineering, Web enabled Apps, …)
Overview: Clusters x GridsA large proportion of personal computer’s
computational power is left unused
A desktop grid takes this unused capacity
 Local Desktop Grid
• Comprised mainly of a set of computers at one location
 Volunteer Desktop Grid
• Resources in a volunteer desktop grid are provided by citizens
all over the world
Desktop Grid
Types of Grids
 Computational Grid
 Processing power is the main computing resource shared
amongst nodes
 Distributed Supercomputing
• Executes the application in parallel on multiple machines to reduce
the completion time
 High throughput
• Increases the completion rate of a stream of jobs
 Data Grid
 Data storage capacity as the main shared resource amongst
nodes
Resource Management
Manages the pool of resources available to the Grid
 Processors
 Network bandwidth
 Disk storage
The pool includes resources from different providers
 RMS should maintain the required level of trust
• Without affecting performance
 RMS should adhere to different policies
 RMS should meet QoS requirements
Resource Management System
Core Functions of Resource Management System
Resource Dissemination and Discovery Protocols
 Used to determine the state of the resources
• Resource Dissemination Protocol
• Provides information about the resources
• Discovery Protocol
• Provides a mechanism by which resource information can be found
Resource resolution and co-allocation protocols
 To schedule the job at the remote resource
 Simultaneously acquire multiple resources
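Co-allocation, as described above, means acquiring several resources at the same time. A minimal sketch of that idea is an all-or-nothing check: either every requested resource is granted or none is. The function `co_allocate` and the dictionary layout are hypothetical, and a real system would also need locking or reservations across sites, which this sketch leaves out.

```python
def co_allocate(free, request):
    """Acquire every requested amount, or nothing at all.

    free:    dict mapping resource name -> units currently free (updated on success)
    request: dict mapping resource name -> units needed
    Returns True if the whole request was granted, False otherwise.
    """
    # First pass: verify that every resource can satisfy its share.
    if any(free.get(name, 0) < units for name, units in request.items()):
        return False
    # Second pass: commit the allocation only after all checks passed.
    for name, units in request.items():
        free[name] -= units
    return True


free_nodes = {"cluster-a": 8, "cluster-b": 4}
print(co_allocate(free_nodes, {"cluster-a": 4, "cluster-b": 4}))  # granted
print(co_allocate(free_nodes, {"cluster-a": 2, "cluster-b": 1}))  # refused, cluster-b is full
print(free_nodes)
```

Checking everything before committing anything avoids holding part of a request while waiting for the rest.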
Machine Organization
 Organization of the machines in the Grid affects the
communication patterns and thus
• determines the scalability
 Centralized Organization
• A single controller or designated set of controllers performs the
scheduling for all machines
• Suffers from scalability issues
 Decentralized Organization
• Roles are distributed among machines
• Sender initiated
• Receiver initiated
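The two decentralized variants differ in who starts a job transfer. The sketch below illustrates one common reading: in the sender-initiated case an overloaded machine pushes work away, and in the receiver-initiated case an underloaded machine pulls work in. The threshold policy, the one-job-per-step rule, and the function names are assumptions for illustration; the slides only name the two variants.

```python
def sender_initiated(loads, threshold):
    """Overloaded machines look for a lightly loaded machine and push one job."""
    moves = []
    for sender in sorted(loads):
        if loads[sender] > threshold:
            target = min(loads, key=loads.get)  # least loaded machine
            if loads[target] < threshold:
                loads[sender] -= 1
                loads[target] += 1
                moves.append((sender, target))
    return moves


def receiver_initiated(loads, threshold):
    """Underloaded machines look for a heavily loaded machine and pull one job."""
    moves = []
    for receiver in sorted(loads):
        if loads[receiver] < threshold:
            source = max(loads, key=loads.get)  # most loaded machine
            if loads[source] > threshold:
                loads[source] -= 1
                loads[receiver] += 1
                moves.append((source, receiver))
    return moves


print(sender_initiated({"a": 5, "b": 1, "c": 2}, threshold=3))
print(receiver_initiated({"a": 5, "b": 1, "c": 2}, threshold=3))
```

With the same starting loads, the receiver-initiated pass moves more jobs here because two machines are underloaded while only one is overloaded.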
 Flat Organization
• All machines can directly communicate with each other without going
through an intermediary
 Hierarchical Organization
• Machines in the same level can directly communicate with the
machines directly above them or below them
 Cell or Group Organization
• Machines within the cell communicate between themselves using flat
organization
• Designated machines within the cell act as boundary elements
that are responsible for all communication outside the cell
• Flat cell structure has only one level of cells
• Hierarchical cell structure can have cells that contain other cells
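The cell organization can be illustrated by tracing the path a message takes in a flat cell structure: machines in the same cell talk directly, while traffic between cells passes through each cell's boundary machine. This is a sketch under assumptions: one boundary machine per cell, and the names `route`, `cell_of`, and `boundary_of` are hypothetical.

```python
def route(src, dst, cell_of, boundary_of):
    """Return the list of machines a message visits from src to dst.

    cell_of:     dict mapping machine -> cell name
    boundary_of: dict mapping cell name -> its designated boundary machine
    """
    if cell_of[src] == cell_of[dst]:
        # Flat organization inside a cell: direct communication.
        return [src, dst]
    path = [src, boundary_of[cell_of[src]], boundary_of[cell_of[dst]], dst]
    # Drop consecutive duplicates, since src or dst may itself be a boundary machine.
    return [m for i, m in enumerate(path) if i == 0 or m != path[i - 1]]


cell_of = {"a1": "A", "a2": "A", "b1": "B", "b2": "B"}
boundary_of = {"A": "a1", "B": "b1"}

print(route("a2", "a1", cell_of, boundary_of))
print(route("a2", "b2", cell_of, boundary_of))
print(route("a1", "b2", cell_of, boundary_of))
```

A hierarchical cell structure would repeat the same step at each level, with a boundary machine of an inner cell forwarding to the boundary machine of the enclosing cell.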
QoS Support
 QoS is not limited to network bandwidth but extends to the
processing and storage capabilities of the nodes
 Resource reservation is one of the ways of providing guaranteed
QoS
 Key components of QoS
• Admission control determines whether the requested level of service can be given
• Policing ensures that a job does not violate the agreed-upon level of service
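Admission control and policing can be sketched together around a simple reservation table. The class below is a minimal illustration, assuming a single resource measured in abstract units; the class and method names are hypothetical and not taken from any particular RMS.

```python
class ReservationTable:
    """Tracks reservations against a fixed capacity (abstract units)."""

    def __init__(self, capacity):
        self.capacity = capacity
        self.reserved = {}  # job name -> units reserved

    def admit(self, job, requested):
        """Admission control: accept only if the request still fits."""
        if sum(self.reserved.values()) + requested > self.capacity:
            return False
        self.reserved[job] = requested
        return True

    def within_agreement(self, job, observed_usage):
        """Policing: check that a job stays within what it reserved."""
        return observed_usage <= self.reserved[job]


table = ReservationTable(capacity=10)
print(table.admit("job-1", 6))             # fits
print(table.admit("job-2", 5))             # would exceed capacity
print(table.within_agreement("job-1", 7))  # job-1 uses more than it reserved
```

What happens to a job that violates its agreement (throttling, termination, or billing) is a policy decision outside this sketch.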
Resource Discovery and Dissemination
 Discovery is initiated by applications to find suitable resources
 Dissemination is initiated by resources to find suitable applications
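The difference in who initiates the exchange can be shown with a small in-memory information service: resources push their descriptions (dissemination), and applications pull matches (discovery). This is a hedged sketch; the class `InformationService`, its methods, and the attribute names are hypothetical.

```python
class InformationService:
    """Toy directory of resource advertisements."""

    def __init__(self):
        self.adverts = {}  # resource name -> attribute dict

    def disseminate(self, resource, attributes):
        """Called by a resource to advertise its current state (push)."""
        self.adverts[resource] = dict(attributes)

    def discover(self, **minimum):
        """Called by an application to find resources meeting minimums (pull)."""
        return sorted(
            name
            for name, attrs in self.adverts.items()
            if all(attrs.get(key, 0) >= value for key, value in minimum.items())
        )


service = InformationService()
service.disseminate("node-a", {"cpus": 8, "disk_gb": 100})
service.disseminate("node-b", {"cpus": 2, "disk_gb": 500})

print(service.discover(cpus=4))
print(service.discover(disk_gb=50))
```

A real grid would distribute this directory and expire stale advertisements; both are left out here.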
Scheduling
 Determining when and where the jobs are executed and how
many resources are allocated
 Time-shared job-scheduling approaches
• Multiple jobs share the same resources
 Space-shared job-scheduling approaches
• Multiple jobs run at the same time on disjoint subsets of the available nodes
 Gang or Synchronous Scheduling
• Schedules all tasks of an application at the same time
 Loosely coordinated co-scheduling
• Schedules communicating tasks of an application at the same time
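The contrast between space sharing and time sharing can be sketched in a few lines. In the space-shared version each job receives its own nodes; in the time-shared version jobs take turns on the same resource. Both functions are illustrative assumptions (first-come-first-served placement and round-robin slicing are just one possible choice), and their names are hypothetical.

```python
from collections import deque


def space_share(total_nodes, jobs):
    """Give each job its own disjoint set of nodes, first come first served.

    jobs: list of (name, nodes_needed).
    Returns (allocation dict, list of jobs left waiting).
    """
    free = list(range(total_nodes))
    allocation, waiting = {}, []
    for name, needed in jobs:
        if needed <= len(free):
            allocation[name], free = free[:needed], free[needed:]
        else:
            waiting.append(name)
    return allocation, waiting


def time_share(work, quantum):
    """Round-robin on one shared resource; returns the order of time slices.

    work: dict mapping job name -> units of work remaining.
    """
    queue = deque(work.items())
    timeline = []
    while queue:
        name, remaining = queue.popleft()
        timeline.append(name)
        if remaining > quantum:
            queue.append((name, remaining - quantum))
    return timeline


print(space_share(8, [("a", 4), ("b", 3), ("c", 2)]))
print(time_share({"a": 3, "b": 1}, quantum=1))
```

Gang scheduling combines the two ideas: all tasks of one application get the same time slice across their nodes.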
Scheduling Objectives
 Minimize response time
 Maximize system utilization
 Trade-off
• Maximizing system utilization may increase response time
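The trade-off between the two objectives can be made concrete by computing both metrics for a simple schedule. The sketch below assumes a single machine serving jobs first come first served, with utilization measured from time zero until the last job finishes; the function name `fcfs_metrics` and the job lists are hypothetical.

```python
def fcfs_metrics(jobs):
    """Mean response time and utilization for FCFS on one machine.

    jobs: list of (arrival_time, run_time), sorted by arrival time.
    Response time of a job = completion time - arrival time.
    Utilization = busy time / time until the last job completes.
    """
    clock, busy, total_response = 0.0, 0.0, 0.0
    for arrival, run_time in jobs:
        start = max(clock, arrival)  # wait if the machine is still busy
        clock = start + run_time
        busy += run_time
        total_response += clock - arrival
    return total_response / len(jobs), busy / clock


# Same three jobs; in the second case they arrive closer together.
print(fcfs_metrics([(0, 4), (1, 2), (10, 2)]))
print(fcfs_metrics([(0, 4), (1, 2), (2, 2)]))
```

In the second case the machine is never idle, so utilization rises, but jobs queue behind each other and the mean response time rises too.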
Job Requirements
 Independent jobs
 Dependent jobs
• Precedence dependency
• Parallel dependency
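A precedence dependency means a job may start only after the jobs it depends on have finished, so a valid execution order is a topological order of the dependency graph. The sketch below uses `graphlib.TopologicalSorter` from the Python standard library (Python 3.9 or later); the job names are made up for illustration, and parallel dependencies are not modelled.

```python
from graphlib import TopologicalSorter


def execution_order(depends_on):
    """Return one valid execution order for jobs with precedence dependencies.

    depends_on: dict mapping job -> set of jobs that must finish first.
    Raises graphlib.CycleError if the dependencies are circular.
    """
    return list(TopologicalSorter(depends_on).static_order())


jobs = {
    "prepare": set(),
    "simulate": {"prepare"},
    "render": {"simulate"},
}
print(execution_order(jobs))
```

Independent jobs have no entries in each other's dependency sets, so the scheduler is free to place them anywhere in the order or run them side by side.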
State Estimation
 Predictive state estimation uses current and historical job and
resource status information
 Non-predictive state estimation uses only the current job and
resource status information
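The two estimation styles can be contrasted on a series of load samples. The non-predictive estimate simply takes the latest sample, while the predictive one blends current and historical samples. Exponential smoothing is used here only as one possible predictor; it is an assumption, as are the function names and the `alpha` parameter.

```python
def non_predictive(samples):
    """Use only the current (most recent) status sample."""
    return samples[-1]


def predictive(samples, alpha=0.5):
    """Blend current and historical samples with exponential smoothing.

    alpha close to 1 trusts recent samples more; close to 0 trusts history more.
    """
    estimate = samples[0]
    for value in samples[1:]:
        estimate = alpha * value + (1 - alpha) * estimate
    return estimate


load_history = [0.2, 0.4, 0.8]  # oldest sample first
print(non_predictive(load_history))
print(predictive(load_history))
```

The smoothed estimate reacts more slowly to the latest spike, which can help when single samples are noisy.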
Rescheduling
 To improve utilization, balance load, etc.
 Periodic or batch rescheduling approaches group resource
requests and system events, which are then processed at
intervals
 Event-driven online rescheduling performs rescheduling as soon
as the RMS receives a resource request or system event
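The two rescheduling styles differ only in when the rescheduling routine runs. The sketch below makes that visible: the periodic variant queues events until a timer fires, while the event-driven variant acts on each event at once. The class names and the `reschedule` callback are hypothetical, and the timer itself is left to the caller.

```python
class PeriodicRescheduler:
    """Collects requests and events, then processes them as a batch."""

    def __init__(self, reschedule):
        self.reschedule = reschedule  # callback taking a list of events
        self.pending = []

    def on_event(self, event):
        self.pending.append(event)    # only queue; no rescheduling yet

    def on_timer(self):
        if self.pending:
            batch, self.pending = self.pending, []
            self.reschedule(batch)


class EventDrivenRescheduler:
    """Reschedules as soon as a request or event arrives."""

    def __init__(self, reschedule):
        self.reschedule = reschedule

    def on_event(self, event):
        self.reschedule([event])


log = []
periodic = PeriodicRescheduler(log.append)
periodic.on_event("node-3 failed")
periodic.on_event("job-7 submitted")
periodic.on_timer()                   # both events handled in one batch

online = EventDrivenRescheduler(log.append)
online.on_event("job-8 submitted")    # handled immediately
print(log)
```

Batching reduces how often the scheduler runs at the cost of slower reaction; the event-driven form reacts quickly but may reschedule very often under load.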
Vikramjit Singh
 
TESDA TM1 REVIEWER FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
TESDA TM1 REVIEWER  FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...TESDA TM1 REVIEWER  FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
TESDA TM1 REVIEWER FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
EugeneSaldivar
 
Azure Interview Questions and Answers PDF By ScholarHat
Azure Interview Questions and Answers PDF By ScholarHatAzure Interview Questions and Answers PDF By ScholarHat
Azure Interview Questions and Answers PDF By ScholarHat
Scholarhat
 
Operation Blue Star - Saka Neela Tara
Operation Blue Star   -  Saka Neela TaraOperation Blue Star   -  Saka Neela Tara
Operation Blue Star - Saka Neela Tara
Balvir Singh
 
How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...
Jisc
 
Synthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptxSynthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptx
Pavel ( NSTU)
 

Recently uploaded (20)

How to Make a Field invisible in Odoo 17
How to Make a Field invisible in Odoo 17How to Make a Field invisible in Odoo 17
How to Make a Field invisible in Odoo 17
 
S1-Introduction-Biopesticides in ICM.pptx
S1-Introduction-Biopesticides in ICM.pptxS1-Introduction-Biopesticides in ICM.pptx
S1-Introduction-Biopesticides in ICM.pptx
 
Unit 2- Research Aptitude (UGC NET Paper I).pdf
Unit 2- Research Aptitude (UGC NET Paper I).pdfUnit 2- Research Aptitude (UGC NET Paper I).pdf
Unit 2- Research Aptitude (UGC NET Paper I).pdf
 
Multithreading_in_C++ - std::thread, race condition
Multithreading_in_C++ - std::thread, race conditionMultithreading_in_C++ - std::thread, race condition
Multithreading_in_C++ - std::thread, race condition
 
Marketing internship report file for MBA
Marketing internship report file for MBAMarketing internship report file for MBA
Marketing internship report file for MBA
 
Honest Reviews of Tim Han LMA Course Program.pptx
Honest Reviews of Tim Han LMA Course Program.pptxHonest Reviews of Tim Han LMA Course Program.pptx
Honest Reviews of Tim Han LMA Course Program.pptx
 
Introduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp NetworkIntroduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp Network
 
Guidance_and_Counselling.pdf B.Ed. 4th Semester
Guidance_and_Counselling.pdf B.Ed. 4th SemesterGuidance_and_Counselling.pdf B.Ed. 4th Semester
Guidance_and_Counselling.pdf B.Ed. 4th Semester
 
Francesca Gottschalk - How can education support child empowerment.pptx
Francesca Gottschalk - How can education support child empowerment.pptxFrancesca Gottschalk - How can education support child empowerment.pptx
Francesca Gottschalk - How can education support child empowerment.pptx
 
Home assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdfHome assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdf
 
A Survey of Techniques for Maximizing LLM Performance.pptx
A Survey of Techniques for Maximizing LLM Performance.pptxA Survey of Techniques for Maximizing LLM Performance.pptx
A Survey of Techniques for Maximizing LLM Performance.pptx
 
1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx
 
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup   New Member Orientation and Q&A (May 2024).pdfWelcome to TechSoup   New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
 
The French Revolution Class 9 Study Material pdf free download
The French Revolution Class 9 Study Material pdf free downloadThe French Revolution Class 9 Study Material pdf free download
The French Revolution Class 9 Study Material pdf free download
 
Digital Tools and AI for Teaching Learning and Research
Digital Tools and AI for Teaching Learning and ResearchDigital Tools and AI for Teaching Learning and Research
Digital Tools and AI for Teaching Learning and Research
 
TESDA TM1 REVIEWER FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
TESDA TM1 REVIEWER  FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...TESDA TM1 REVIEWER  FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
TESDA TM1 REVIEWER FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
 
Azure Interview Questions and Answers PDF By ScholarHat
Azure Interview Questions and Answers PDF By ScholarHatAzure Interview Questions and Answers PDF By ScholarHat
Azure Interview Questions and Answers PDF By ScholarHat
 
Operation Blue Star - Saka Neela Tara
Operation Blue Star   -  Saka Neela TaraOperation Blue Star   -  Saka Neela Tara
Operation Blue Star - Saka Neela Tara
 
How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...
 
Synthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptxSynthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptx
 

Cluster and Grid Computing

  • 1. 한국해양과학기술진흥원 (Korea Institute of Marine Science & Technology Promotion). Cluster and Grid Computing, 2013.10.6. Sayed Chhattan Shah, PhD, Senior Researcher, Electronics and Telecommunications Research Institute, Korea
  • 2. Outline
      - Cluster Computing
          - Architecture
          - Key Components
      - Grid Computing
          - Architecture
          - Key Components
          - Resource Management: Discovery, QoS Support, Scheduling
  • 4. Cluster
      - A type of distributed system
      - A collection of workstations or PCs interconnected by a high-speed network
      - Works as an integrated collection of resources
      - Has a single system image spanning all its nodes
  • 5. Cluster Computer Architecture (figure). Layers shown: Sequential Applications and Parallel Applications; Parallel Programming Environment; Cluster Middleware (Single System Image and Availability Infrastructure); PC/Workstation nodes, each with Communications Software and Network Interface Hardware; Cluster Interconnection Network/Switch
  • 6. Prominent Components of Cluster Computers
      - Multiple High Performance Computers
          - PCs
          - Workstations
      - State-of-the-art Operating Systems
          - Linux (MOSIX, Beowulf, and many more)
          - Microsoft NT (Illinois HPVM, Cornell Velocity)
          - SUN Solaris (Berkeley NOW, C-DAC PARAM)
          - IBM AIX (IBM SP2)
  • 7. Prominent Components of Cluster Computers
      - High Performance Networks
          - Ethernet (10 Mbps)
          - Fast Ethernet (100 Mbps)
          - Gigabit Ethernet (1 Gbps)
          - SCI (Scalable Coherent Interface; MPI latency 12 µs)
          - ATM (Asynchronous Transfer Mode)
          - Myrinet (1.2 Gbps)
          - Digital Memory Channel
          - FDDI (Fiber Distributed Data Interface)
          - InfiniBand
  • 8. Prominent Components of Cluster Computers
      - Fast Communication Protocols and Services
          - Active Messages (Berkeley)
          - Fast Messages (Illinois)
          - U-net (Cornell)
          - XTP (Virginia)
          - Virtual Interface Architecture (VIA)
  • 9. Prominent Components of Cluster Computers: interconnect comparison
      Columns, in order: Myrinet | QSnet | Giganet | ServerNet2 | SCI | Gigabit Ethernet
      - Bandwidth (MBytes/s): 140 at 33 MHz, 215 at 66 MHz | 208 | ~105 | 165 | ~80 | 30-50
      - MPI latency (µs): 16.5 at 33 MHz, 11 at 66 MHz | 5 | ~20-40 | 20.2 | 6 | 100-200
      - List price/port: $1.5K | $6.5K | $1.5K | ~$1.5K | ~$1.5K | ~$1.5K
      - Hardware availability: Now | Now | Now | Q2'00 | Now | Now
      - Linux support: Now | Late '00 | Now | Q2'00 | Now | Now
      - Maximum #nodes: 1000's | 1000's | 1000's | 64K | 1000's | 1000's
      - Protocol implementation: Firmware on adapter | Firmware on adapter | Firmware on adapter | Implemented in hardware | Firmware on adapter | Implemented in hardware
      - VIA support: Soon | None | NT/Linux | Done in hardware | Software TCP/IP, VIA | NT/Linux
      - MPI support: 3rd party | Quadrics/Compaq | 3rd party | Compaq/3rd party | 3rd party | MPICH (TCP/IP)
  • 10. Prominent Components of Cluster Computers
      - Cluster Middleware
          - Resource management and scheduling
          - Fault handling
          - Migration
          - Load balancing
  • 12. Grid Computing. Overview: Clusters x Grids
      - Cluster: How can we use local networked resources to achieve better performance for large-scale applications?
          - High-speed networks
          - Centralized resource and task management
      - Grid: How can we put together geographically distributed resources to achieve even better results?
          - Distributed resource and task management
          - No high-speed connections
  • 13. Basic Idea
      - Computing power should be available on demand, for a fee, just like the electrical power grid
      - Figure labels: Information Generators; Information Distributed Over the Grid; Customer Access to Information
  • 15. Why Grid Computing?
      - Core networking technology now advances at a much faster rate than microprocessor speeds
      - Exploiting underutilized resources
      - Parallel CPU capacity
      - Access to additional resources
  • 16. Grid Computing
      - Several clusters in a grid
      - May include supercomputers, desktops, laptops, and mobile devices
  • 17. CERN's Large Hadron Collider
      - 1800 physicists, 150 institutes, 32 countries
      - 100 PB of data by 2010; 50,000 CPUs?
  • 18. Data Grids for High Energy Physics (figure)
      - Nodes shown: Online System; Offline Processor Farm (~20 TIPS); CERN Computer Centre; regional centres (FermiLab ~4 TIPS, France, Italy, Germany); Tier2 Centres (~1 TIPS each, including Caltech ~1 TIPS); Institutes (~0.25 TIPS) with a physics data cache; physicist workstations
      - Tier labels shown: Tier 0, Tier 1, Tier 2, Tier 4
      - Link rates shown: ~PBytes/sec; ~100 MBytes/sec; ~622 Mbits/sec or air freight (deprecated); ~622 Mbits/sec; ~1 MBytes/sec
      - There is a "bunch crossing" every 25 ns. There are 100 "triggers" per second. Each triggered event is ~1 MByte in size.
      - Physicists work on analysis "channels". Each institute will have ~10 physicists working on one or more channels; data for these channels should be cached by the institute server.
      - 1 TIPS is approximately 25,000 SpecInt95 equivalents.
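The trigger figures on this slide can be checked with a few lines of arithmetic. The sketch below uses only the numbers quoted on the slide (25 ns bunch spacing, 100 triggers per second, ~1 MByte per event); the variable names are illustrative.

```python
# Figures quoted on the slide; everything else is plain arithmetic.
BUNCH_CROSSING_INTERVAL_S = 25e-9   # one bunch crossing every 25 ns
TRIGGERS_PER_SECOND = 100           # events kept by the trigger each second
EVENT_SIZE_MBYTES = 1.0             # approximate size of one triggered event

# How many bunch crossings happen per second.
crossings_per_second = 1 / BUNCH_CROSSING_INTERVAL_S

# Data rate leaving the trigger, in MBytes per second.
stored_rate_mbytes_s = TRIGGERS_PER_SECOND * EVENT_SIZE_MBYTES

# How many crossings occur for each event that is kept.
crossings_per_trigger = crossings_per_second / TRIGGERS_PER_SECOND

print(f"bunch crossings per second: {crossings_per_second:,.0f}")
print(f"stored data rate: {stored_rate_mbytes_s:.0f} MBytes/s")
print(f"crossings per kept event: {crossings_per_trigger:,.0f}")
```

The stored rate of about 100 MBytes/s agrees with the ~100 MBytes/sec link rate shown in the figure, while the detector itself sees roughly 40 million crossings per second.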
  • 19. Grid Components (figure)
      - Grid Apps: Applications and Portals (Scientific, Engineering, Collaboration, Prob. Solving Env., Web enabled Apps, …)
      - Grid Tools: Development Environments and Tools (Languages, Libraries, Debuggers, Monitoring, Resource Brokers, Web tools, …)
      - Grid Middleware: Distributed Resources Coupling Services (Security, Information, Process, QoS, …)
      - Grid Fabric: Local Resource Managers (Operating Systems, Queuing Systems, Libraries & App Kernels, TCP/IP & UDP, …) and Networked Resources across Organisations (Computers, Clusters, Storage Systems, Data Sources, Scientific Instruments)
      - Also shown: Resource Trading, Market Info
  • 20. Desktop Grid
      - A large proportion of a personal computer's computational power is left unused; a desktop grid puts this unused capacity to work
      - Local Desktop Grid: comprised mainly of a set of computers at one location
      - Volunteer Desktop Grid: resources are provided by citizens all over the world
  • 21. Types of Grids
      - Computational Grid: processing power is the main computing resource shared amongst nodes
          - Distributed Supercomputing: executes the application in parallel on multiple machines to reduce the completion time
          - High Throughput: increases the completion rate of a stream of jobs
      - Data Grid: data storage capacity is the main resource shared amongst nodes
  • 23. Resource Management System
      - Manages the pool of resources available to the Grid
          - Processors
          - Network bandwidth
          - Disk storage
      - The pool includes resources from different providers
          - RMS should maintain the required level of trust, without affecting performance
          - RMS should adhere to different policies
          - RMS should meet QoS requirements
  • 24. Core Functions of Resource Management System
  • 25. Core Functions of Resource Management System
      - Resource dissemination and discovery protocols: used to determine the state of the resources
          - Resource Dissemination Protocol: provides information about the resources
          - Discovery Protocol: provides a mechanism by which resource information can be found
      - Resource resolution and co-allocation protocols
          - To schedule the job at the remote resource
          - To simultaneously acquire multiple resources
  • 26. Resource Management System: Machine Organization
      - The organization of the machines in the Grid affects the communication patterns and thus determines the scalability
  • 27. Resource Management System
      - Centralized Organization
          - A single controller or designated set of controllers performs the scheduling for all machines
          - Suffers from scalability issues
      - Decentralized Organization
          - Roles are distributed among machines
          - Sender initiated
          - Receiver initiated
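The slide names sender-initiated and receiver-initiated variants without defining them. A minimal sketch, assuming the usual load-sharing reading (an overloaded machine pushes work out, or an underloaded machine pulls work in); the function names, the job-count load metric, and the single threshold are all hypothetical.

```python
def sender_initiated(loads, sender, threshold):
    """An overloaded `sender` pushes one job to the least-loaded other machine.

    `loads` maps machine name -> number of queued jobs (hypothetical metric).
    Returns the chosen target, or None if no transfer takes place.
    """
    if loads[sender] <= threshold:
        return None                      # sender is not overloaded
    target = min((m for m in loads if m != sender), key=loads.get)
    if loads[target] >= threshold:
        return None                      # nobody has spare capacity
    loads[sender] -= 1
    loads[target] += 1
    return target


def receiver_initiated(loads, receiver, threshold):
    """An underloaded `receiver` pulls one job from the most-loaded other machine."""
    if loads[receiver] >= threshold:
        return None                      # receiver has no spare capacity
    source = max((m for m in loads if m != receiver), key=loads.get)
    if loads[source] <= threshold:
        return None                      # nobody is overloaded
    loads[source] -= 1
    loads[receiver] += 1
    return source


loads = {"a": 5, "b": 1, "c": 3}
print(sender_initiated(loads, "a", threshold=3), loads)
print(receiver_initiated(loads, "b", threshold=3), loads)
```

In both variants the decision is made locally by one machine, which is what distinguishes them from the centralized organization on the same slide.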
  • 28. Resource Management System
      - Flat Organization
          - All machines can directly communicate with each other without going through an intermediary
      - Hierarchical Organization
          - Machines in the same level can directly communicate with the machines directly above them or below them
      - Cell or Group Organization
          - Machines within the cell communicate between themselves using flat organization
          - Designated machines within the cell function as boundary elements that are responsible for all communication outside the cell
          - A flat cell structure has only one level of cells
          - A hierarchical cell structure can have cells that contain other cells
  • 29. Resource Management System: QoS Support
      - QoS is not limited to network bandwidth but extends to the processing and storage capabilities of the nodes
      - Resource reservation is one of the ways of providing guaranteed QoS
      - Key components of QoS
          - Admission control determines if the requested level of service can be given
          - Policing ensures that the job does not violate the agreed-upon level of service
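Admission control and policing from this slide can be sketched on a single node with a CPU reservation. This is a minimal illustration, not any particular RMS's interface; the `Node` class, the use of cores as the reserved quantity, and the method names are assumptions.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    cpu_capacity: float                              # total cores on the node (hypothetical unit)
    reserved: dict = field(default_factory=dict)     # job id -> reserved cores

    def admit(self, job_id: str, cores: float) -> bool:
        """Admission control: accept the job only if the request fits
        into the capacity that has not been reserved yet."""
        free = self.cpu_capacity - sum(self.reserved.values())
        if cores > free:
            return False
        self.reserved[job_id] = cores                # resource reservation
        return True

    def police(self, job_id: str, observed_cores: float) -> bool:
        """Policing: True while the job stays within its agreed reservation."""
        return observed_cores <= self.reserved[job_id]


node = Node(cpu_capacity=8)
print(node.admit("a", 6))    # True: 6 of 8 cores are free
print(node.admit("b", 4))    # False: only 2 cores remain
print(node.police("a", 7))   # False: job "a" exceeds its 6-core reservation
```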
  • 30. Resource Management System: Resource Discovery and Dissemination
      - Discovery is initiated by applications to find suitable resources
      - Dissemination is initiated by resources to find suitable applications
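The difference in who initiates the exchange can be shown with a toy in-memory directory. This is a sketch under stated assumptions: the `Directory` class, its method names, and the attribute names (`cores`, `disk_gb`) are hypothetical, and a real grid would use a distributed information service rather than one dictionary.

```python
class Directory:
    """Toy resource directory shared by resources and applications."""

    def __init__(self):
        self.ads = {}   # resource name -> advertised attributes

    def advertise(self, name, **attrs):
        """Dissemination: the resource itself publishes its state."""
        self.ads[name] = attrs

    def discover(self, **minimum):
        """Discovery: an application asks for resources that meet its needs."""
        return sorted(
            name for name, attrs in self.ads.items()
            if all(attrs.get(key, 0) >= value for key, value in minimum.items())
        )


directory = Directory()
directory.advertise("cluster-1", cores=64, disk_gb=500)   # resource-initiated
directory.advertise("desktop-7", cores=4, disk_gb=100)    # resource-initiated
print(directory.discover(cores=16))                       # application-initiated
print(directory.discover(cores=2, disk_gb=50))
```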
  • 31. Resource Management System: Scheduling
      - Determining when and where the jobs are executed and how many resources are allocated
      - Time-shared job-scheduling approaches
          - Multiple jobs share the same resources
      - Space-shared job-scheduling approaches
          - Multiple jobs can run at the same time, each on its own share of the available nodes
      - Gang or Synchronous Scheduling
          - Schedules all tasks of an application at the same time
      - Loosely coordinated co-scheduling
          - Schedules communicating tasks of an application at the same time
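Gang scheduling is often pictured as a matrix of time slots by processors in which all tasks of a job sit in the same row. The sketch below builds such a matrix with a simple first-fit rule; the function name, the first-fit choice, and the job sizes are assumptions made for illustration.

```python
def gang_schedule(jobs, num_procs):
    """Place every job so that all of its tasks run in the same time slot.

    jobs: dict mapping job name -> number of tasks.
    Returns a list of time slots; each slot maps job name -> processor ids.
    """
    slots = []   # each entry: {"free": processors left, "jobs": {name: [ids]}}
    for name, tasks in jobs.items():
        if tasks > num_procs:
            raise ValueError(f"job {name} needs more than {num_procs} processors")
        # First fit: reuse the earliest slot with enough free processors.
        for slot in slots:
            if slot["free"] >= tasks:
                break
        else:
            slot = {"free": num_procs, "jobs": {}}
            slots.append(slot)
        start = num_procs - slot["free"]
        slot["jobs"][name] = list(range(start, start + tasks))
        slot["free"] -= tasks
    return [slot["jobs"] for slot in slots]


schedule = gang_schedule({"A": 3, "B": 2, "C": 1}, num_procs=4)
for index, slot in enumerate(schedule):
    print(f"time slot {index}: {slot}")
```

Jobs that share a slot are space-shared (disjoint processors), while the slots themselves alternate over time, so the same processors are time-shared between slots.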
  • 32. Resource Management System: Scheduling Objectives
      - Minimize response time
      - Maximize system utilization
      - Trade-off: maximizing system utilization may increase response time
  • 33. Resource Management System: Job Requirements
      - Independent jobs
      - Dependent jobs
          - Precedence dependency
          - Parallel dependency
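A precedence dependency means a job may start only after the jobs it depends on have finished. One way to derive a valid execution order is a topological sort; the sketch below uses Python's standard `graphlib` module (Python 3.9+), and the job names are made up for the example.

```python
from graphlib import TopologicalSorter

# job -> set of jobs that must finish first (hypothetical workflow)
dependencies = {
    "preprocess": set(),
    "simulate": {"preprocess"},
    "analyze": {"simulate"},
    "archive": {"simulate"},
    "plot": {"analyze"},
}

sorter = TopologicalSorter(dependencies)
sorter.prepare()

waves = []                       # jobs in one wave have no dependency on each other
while sorter.is_active():
    ready = sorted(sorter.get_ready())
    waves.append(ready)
    sorter.done(*ready)          # mark the whole wave as finished

for index, wave in enumerate(waves):
    print(f"wave {index}: {wave}")
```

Jobs in the same wave are independent of one another, so a scheduler is free to run them on different machines at the same time.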
  • 34. Resource Management System: Scheduling
  • 35. Resource Management System: State Estimation
      - Predictive state estimation uses current and historical job and resource status information
      - Non-predictive state estimation uses only the current job and resource status information
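The two kinds of estimator differ only in how much of the history they look at. In the sketch below the predictive estimator is an exponentially weighted moving average, which is just one possible choice and is an assumption of this example; the slide does not prescribe a method. Load values are hypothetical CPU utilizations between 0 and 1.

```python
def non_predictive(load_history):
    """Use only the most recent observation."""
    return load_history[-1]


def predictive(load_history, alpha=0.5):
    """Blend current and historical observations.

    Exponentially weighted moving average: `alpha` is the weight of the
    newest sample (assumed smoothing method, chosen for illustration).
    """
    estimate = load_history[0]
    for sample in load_history[1:]:
        estimate = alpha * sample + (1 - alpha) * estimate
    return estimate


history = [0.2, 0.2, 0.2, 0.9]          # a sudden spike in the latest sample
print(non_predictive(history))           # follows the spike immediately
print(round(predictive(history), 2))     # damped by the quieter history
```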
  • 36. Resource Management System: Rescheduling
      - Done to improve utilization, balance load, etc.
      - Periodic or batch rescheduling approaches group resource requests and system events, which are then processed at intervals
      - Event-driven online rescheduling performs rescheduling as soon as the RMS receives the resource request or system event
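The two rescheduling styles can be contrasted with a pair of small classes. This is a sketch only: the class names, the `reschedule` callback, and the explicit `now` argument (used instead of a real clock so that the example is deterministic) are assumptions.

```python
class EventDrivenRescheduler:
    """Reschedules immediately for every resource request or system event."""

    def __init__(self, reschedule):
        self.reschedule = reschedule

    def on_event(self, event):
        self.reschedule([event])


class PeriodicRescheduler:
    """Groups events and processes them together once per interval."""

    def __init__(self, reschedule, interval):
        self.reschedule = reschedule
        self.interval = interval
        self.pending = []
        self.last_run = 0.0

    def on_event(self, event):
        self.pending.append(event)       # only queued, not acted on yet

    def tick(self, now):
        if self.pending and now - self.last_run >= self.interval:
            self.reschedule(self.pending)
            self.pending = []
            self.last_run = now


batches = []
periodic = PeriodicRescheduler(batches.append, interval=10)
for event in ("job-arrived", "node-failed", "job-finished"):
    periodic.on_event(event)
periodic.tick(now=5)     # too early: nothing is processed
periodic.tick(now=10)    # one rescheduling pass covering all three events
print(batches)
```

The periodic variant makes one rescheduling pass per interval regardless of how many events arrived, while the event-driven variant reacts sooner at the cost of one pass per event.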