SlideShare a Scribd company logo
VHPC 2010
              August 31st, Ischia, Italy

Providing Performance Guarantees to
  Virtual Machines using Real-Time
             Scheduling


            Tommaso Cucinotta, Dhaval Giani,
              Dario Faggioli, Fabio Checconi

               Real-Time Systems Laboratory
                Scuola Superiore Sant'Anna
                        Pisa, Italy




Tommaso Cucinotta – ReTiS Lab – Scuola Superiore Sant'Anna – Pisa – Italy   1/24
Introduction and Motivations




Tommaso Cucinotta – ReTiS Lab – Scuola Superiore Sant'Anna – Pisa – Italy   2/24
Introduction

“Traditional” worlds of computing
   General-Purpose Computing (GPC) and servers
       Low-cost, low parallelism degree equipment
       Focused on mixing batch and interactive workloads
   High-Performance Computing (HPC)
       High-cost, massively parallel and vector-based equipment
       Focused on batch computing, tightly coupled parallel tasks,
        scientific applications
New trends
 Affordable many-core systems also for GPC
 Cloud-Computing: world-level scalability and replicability
       Use of high-performance hardware in CC applications
       Interest in mixing interactive/real-time and HPC workloads
         Tommaso Cucinotta – ReTiS Lab – Scuola Superiore Sant'Anna – Pisa – Italy   3/24
Introduction

Virtualization is a key technology
 For IaaS providers (Cloud Computing)
 For server consolidation




    Physical Host
    Physical Host
           OS
           OS
           ...

    Physical Host
    Physical Host
           OS
           OS
     Tommaso Cucinotta – ReTiS Lab – Scuola Superiore Sant'Anna – Pisa – Italy   4/24
Introduction

Virtualization is a key technology
 For IaaS providers (Cloud Computing)
 For server consolidation




    Physical Host
    Physical Host                                   Physical Host
                                                    Physical Host
           OS
           OS                                              VM/OS
                                                           VM/OS
           ...                                            ...

    Physical Host                                          VM/OS
                                                           VM/OS
    Physical Host
           OS
           OS
     Tommaso Cucinotta – ReTiS Lab – Scuola Superiore Sant'Anna – Pisa – Italy   5/24
Need for Performance Isolation

Resource sharing
 → Temporal interference

        Physical Host
        Physical Host
         VM
         VM

         VM
         VM

         VM
         VM

         VM
         VM

    Tommaso Cucinotta – ReTiS Lab – Scuola Superiore Sant'Anna – Pisa – Italy   6/24
Need for Performance Isolation
                                                   ~30ms
Resource sharing                                              VM Alone

 → Temporal interference

        Physical Host
        Physical Host
         VM
         VM

         VM
         VM

         VM
         VM

         VM
         VM

    Tommaso Cucinotta – ReTiS Lab – Scuola Superiore Sant'Anna – Pisa – Italy   7/24
Need for Performance Isolation
                                                   ~30ms
Resource sharing                                              VM Alone

 → Temporal interference

        Physical Host
        Physical Host
         VM
         VM

         VM
         VM
                                                                   ~120ms
                                                                2 VMs

         VM
         VM

         VM
         VM

    Tommaso Cucinotta – ReTiS Lab – Scuola Superiore Sant'Anna – Pisa – Italy   8/24
Possible Solution

Hardware replication and                                     Physical Host
                                                             Physical Host
static partitioning                                           VM
                                                              VM
   Computing
       Multi-core (1 core per VM)
                                                              VM
                                                              VM
   Networking
       Multiple network adapters
        (1 network adapter per VM)                            VM
                                                              VM
       Multi-queue adapters
Drawbacks                                                     VM
                                                              VM
 Limitation of flexibility
 Under-utilization of resources
       e.g., with real-time/interactive workloads
        (cloud computing)
         Tommaso Cucinotta – ReTiS Lab – Scuola Superiore Sant'Anna – Pisa – Italy   9/24
Possible Solution

Another approach
 Let multiple VMs use the same resources
 Use proper resource scheduling strategies
For example
   Computing
       Xen credit-based and SEDF schedulers
   Networking
       QoS-aware protocols (IntServ, MPLS)




        Tommaso Cucinotta – ReTiS Lab – Scuola Superiore Sant'Anna – Pisa – Italy   10/24
What is still missing ?

Most approaches
   Investigate on high-level load balancing techniques
       Without the necessary care for fine-grain resource scheduling
   Focus on fairness among multiple VMs
Only a few works
 Focus on providing precise QoS guarantees to VMs
 For example
       Gupta et al., “[...] Performance Isolation […] in Xen”
       Cherkasova et al., “Comparison of the 3 CPU Schedulers in Xen”
Our focus
 Network performance isolation among concurrent VMs
 Virtualization with a host OS
       Specifically, on Linux as host OS with KVM
         Tommaso Cucinotta – ReTiS Lab – Scuola Superiore Sant'Anna – Pisa – Italy   11/24
Proposed Approach




Tommaso Cucinotta – ReTiS Lab – Scuola Superiore Sant'Anna – Pisa – Italy   12/24
Proposed Approach

Use of real-time CPU scheduling
   As basic mechanism for isolating VMs concurrently
    running on the same CPU and core
IRMOS Real-Time Scheduler
 For the Linux kernel
 Provides hierarchical EDF/FP scheduling
       EDF-based resource reservations
        – (Q, P): a budget Q is granted every period P
        – Both a guarantee and a limitation
       FP scheduling within each EDF reservation
   Provides temporal isolation


         Tommaso Cucinotta – ReTiS Lab – Scuola Superiore Sant'Anna – Pisa – Italy   13/24
What can we achieve ?
                    (by CPU real-time scheduling)

CPU real-time scheduling achieves
   Performance isolation of
    compute-intensive VMs
       Shown in other papers
   What about network-intensive VMs ?
Problems
   On the host OS (Linux)
       network-intensive VMs impose a big interrupt workload
        – difficult to quantify and keep under control
        – steals CPU from reservations of other VMs
   How can we fix this ?


         Tommaso Cucinotta – ReTiS Lab – Scuola Superiore Sant'Anna – Pisa – Italy   14/24
Preliminary Solution

Preliminary solution (subject of this paper)
   Temporal isolation of VMs by real-time scheduling
       Scheduling parameters tuned according to benchmarked figures
        – Reservation period set according to responsiveness requirements
        – Budget needed for computations, plus
        – Budget needed for sustaining network traffic
   Budget over-provisioning
       According to the foreseen interference due to
        network-intensive VMs




         Tommaso Cucinotta – ReTiS Lab – Scuola Superiore Sant'Anna – Pisa – Italy   15/24
Experimental Results




Tommaso Cucinotta – ReTiS Lab – Scuola Superiore Sant'Anna – Pisa – Italy   16/24
Experimental Results
                 (Q6600 @ 2.4 GHz, 1-Gbit Card)

Achievable network throughput as a function of
the CPU share reserved to the VM
   Measured via iperf: almost linear, as expected




       Tommaso Cucinotta – ReTiS Lab – Scuola Superiore Sant'Anna – Pisa – Italy   17/24
Experimental Results
                      (2 VMs on the same core)

Throughput as a function of its own budget
   13% drop when
       own reservation is 35% (from ~300 Mbps to ~260 Mbps)
       and reservation of other VM increased from 10% to 40%




        Tommaso Cucinotta – ReTiS Lab – Scuola Superiore Sant'Anna – Pisa – Italy   18/24
Experimental Results
                      (2 VMs on the same core)

Throughput as a function of its own budget
   ~14% budget over-provisioning needed
       for sustaining a ~300 Mbps throughput
       when reservation of other VM increased from 10% to 40%




        Tommaso Cucinotta – ReTiS Lab – Scuola Superiore Sant'Anna – Pisa – Italy   19/24
Experimental Results
                     (2 VMs on the same core)

Throughput as a function of the budget reserved
to the other interfering VM
   Drop due to compute-intensive interfering VM lower than
    the one due to network-intensive interfering VM




       Tommaso Cucinotta – ReTiS Lab – Scuola Superiore Sant'Anna – Pisa – Italy   20/24
Experimental Results
                (application-level benchmark)

Download time for a 100 KB file from Apache
 Periodic download requests every 20ms
 Response-times may be kept much more stable
  by real-time scheduling




     Tommaso Cucinotta – ReTiS Lab – Scuola Superiore Sant'Anna – Pisa – Italy   21/24
Conclusions and Future Work




Tommaso Cucinotta – ReTiS Lab – Scuola Superiore Sant'Anna – Pisa – Italy   22/24
Conclusions and Future Work

Conclusions
 We showed how to achieve network
  performance isolation by CPU
  real-time scheduling
 Technique to be used jointly with
  traffic-shaping techniques
Planned Future Work
   Experiment with PREEMPT_RT
       IRQs handled in kernel threads
 Investigate on task-level isolation of network traffic of
  different VMs (via real-time scheduling)
 Make KVM a QoS-aware hypervisor
 Investigate on the use of Adaptive Reservations
        Tommaso Cucinotta – ReTiS Lab – Scuola Superiore Sant'Anna – Pisa – Italy   23/24
Thanks for your attention!
                        Questions ?




Tommaso Cucinotta – ReTiS Lab – Scuola Superiore Sant'Anna – Pisa – Italy   24/24

More Related Content

Viewers also liked

Watering down a colossal crisis
Watering down a colossal crisisWatering down a colossal crisis
Watering down a colossal crisisguest5b5af7
 
The Wizard of OS: a Heartbeat for Legacy Multimedia Applications
The Wizard of OS: a Heartbeat for Legacy Multimedia ApplicationsThe Wizard of OS: a Heartbeat for Legacy Multimedia Applications
The Wizard of OS: a Heartbeat for Legacy Multimedia Applications
tcucinotta
 
Research in Soft Real-Time and Virtualized Applications on Linux
Research in Soft Real-Time and Virtualized Applications on LinuxResearch in Soft Real-Time and Virtualized Applications on Linux
Research in Soft Real-Time and Virtualized Applications on Linuxtcucinotta
 
Virtualised e-Learning with Real-Time Guarantees on the IRMOS Platform
Virtualised e-Learning with Real-Time Guarantees on the IRMOS PlatformVirtualised e-Learning with Real-Time Guarantees on the IRMOS Platform
Virtualised e-Learning with Real-Time Guarantees on the IRMOS Platform
tcucinotta
 
Aflac Power Point Presentation
Aflac Power Point PresentationAflac Power Point Presentation
Aflac Power Point Presentation
wyakin
 
Study: The Future of VR, AR and Self-Driving Cars
Study: The Future of VR, AR and Self-Driving CarsStudy: The Future of VR, AR and Self-Driving Cars
Study: The Future of VR, AR and Self-Driving Cars
LinkedIn
 

Viewers also liked (6)

Watering down a colossal crisis
Watering down a colossal crisisWatering down a colossal crisis
Watering down a colossal crisis
 
The Wizard of OS: a Heartbeat for Legacy Multimedia Applications
The Wizard of OS: a Heartbeat for Legacy Multimedia ApplicationsThe Wizard of OS: a Heartbeat for Legacy Multimedia Applications
The Wizard of OS: a Heartbeat for Legacy Multimedia Applications
 
Research in Soft Real-Time and Virtualized Applications on Linux
Research in Soft Real-Time and Virtualized Applications on LinuxResearch in Soft Real-Time and Virtualized Applications on Linux
Research in Soft Real-Time and Virtualized Applications on Linux
 
Virtualised e-Learning with Real-Time Guarantees on the IRMOS Platform
Virtualised e-Learning with Real-Time Guarantees on the IRMOS PlatformVirtualised e-Learning with Real-Time Guarantees on the IRMOS Platform
Virtualised e-Learning with Real-Time Guarantees on the IRMOS Platform
 
Aflac Power Point Presentation
Aflac Power Point PresentationAflac Power Point Presentation
Aflac Power Point Presentation
 
Study: The Future of VR, AR and Self-Driving Cars
Study: The Future of VR, AR and Self-Driving CarsStudy: The Future of VR, AR and Self-Driving Cars
Study: The Future of VR, AR and Self-Driving Cars
 

Similar to Providing Performance Guarantees to Virtual Machines using Real-Time Scheduling

Self-tuning Schedulers for Legacy Real-Time Applications
Self-tuning Schedulers for Legacy Real-Time ApplicationsSelf-tuning Schedulers for Legacy Real-Time Applications
Self-tuning Schedulers for Legacy Real-Time Applications
tcucinotta
 
Virtual Network Functions as Real-Time Containers in Private Clouds
Virtual Network Functions as Real-Time Containers in Private CloudsVirtual Network Functions as Real-Time Containers in Private Clouds
Virtual Network Functions as Real-Time Containers in Private Clouds
tcucinotta
 
The IRMOS Real-Time Scheduler
The IRMOS Real-Time SchedulerThe IRMOS Real-Time Scheduler
The IRMOS Real-Time Schedulertcucinotta
 
Self-tuning Schedulers for Legacy Real-Time Applications
Self-tuning Schedulers for Legacy Real-Time ApplicationsSelf-tuning Schedulers for Legacy Real-Time Applications
Self-tuning Schedulers for Legacy Real-Time Applications
guestbbe1c83
 
SLAs in Virtualized Cloud Computing Infrastructures with QoS Assurance
SLAs in Virtualized Cloud Computing Infrastructures with QoS AssuranceSLAs in Virtualized Cloud Computing Infrastructures with QoS Assurance
SLAs in Virtualized Cloud Computing Infrastructures with QoS Assurance
tcucinotta
 
Real-Time API
Real-Time APIReal-Time API
Real-Time API
tcucinotta
 
Modeling and simulation of power consumption and execution times for real-tim...
Modeling and simulation of power consumption and execution times for real-tim...Modeling and simulation of power consumption and execution times for real-tim...
Modeling and simulation of power consumption and execution times for real-tim...
tcucinotta
 
Mpls conference 2016-data center virtualisation-11-march
Mpls conference 2016-data center virtualisation-11-marchMpls conference 2016-data center virtualisation-11-march
Mpls conference 2016-data center virtualisation-11-march
Aricent
 
A checkpointing mechanism for virtual clusters using memory- bound time-multi...
A checkpointing mechanism for virtual clusters using memory- bound time-multi...A checkpointing mechanism for virtual clusters using memory- bound time-multi...
A checkpointing mechanism for virtual clusters using memory- bound time-multi...
IJECEIAES
 
ICALEPCS 2011: Testing Environments using Virtualization
ICALEPCS 2011: Testing Environments using VirtualizationICALEPCS 2011: Testing Environments using Virtualization
ICALEPCS 2011: Testing Environments using VirtualizationOmer Khalid
 
Optimum Scalability Point for Parallelisable Real-Time Components
Optimum Scalability Point for Parallelisable Real-Time ComponentsOptimum Scalability Point for Parallelisable Real-Time Components
Optimum Scalability Point for Parallelisable Real-Time Components
tcucinotta
 
Comparison of Open Source Virtualization Technology
Comparison of Open Source Virtualization TechnologyComparison of Open Source Virtualization Technology
Comparison of Open Source Virtualization Technology
Benoit des Ligneris
 
Tommaso Cucinotta - Low-latency and power-efficient audio applications on Linux
Tommaso Cucinotta - Low-latency and power-efficient audio applications on LinuxTommaso Cucinotta - Low-latency and power-efficient audio applications on Linux
Tommaso Cucinotta - Low-latency and power-efficient audio applications on Linux
linuxlab_conf
 
An Evaluation of Adaptive Partitioning of Real-Time Workloads on Linux
An Evaluation of Adaptive Partitioning of Real-Time Workloads on LinuxAn Evaluation of Adaptive Partitioning of Real-Time Workloads on Linux
An Evaluation of Adaptive Partitioning of Real-Time Workloads on Linux
tcucinotta
 
A Casual Teaching Tool for Large Size Computer Laboratories ans Small Size Se...
A Casual Teaching Tool for Large Size Computer Laboratories ans Small Size Se...A Casual Teaching Tool for Large Size Computer Laboratories ans Small Size Se...
A Casual Teaching Tool for Large Size Computer Laboratories ans Small Size Se...
Takashi Yamanoue
 
High Performance Parallel Computing with Clouds and Cloud Technologies
High Performance Parallel Computing with Clouds and Cloud TechnologiesHigh Performance Parallel Computing with Clouds and Cloud Technologies
High Performance Parallel Computing with Clouds and Cloud Technologies
jaliyae
 
Embedded Intro India05
Embedded Intro India05Embedded Intro India05
Embedded Intro India05
Rajesh Gupta
 
XPDS13: Enabling Fast, Dynamic Network Processing with ClickOS - Joao Martins...
XPDS13: Enabling Fast, Dynamic Network Processing with ClickOS - Joao Martins...XPDS13: Enabling Fast, Dynamic Network Processing with ClickOS - Joao Martins...
XPDS13: Enabling Fast, Dynamic Network Processing with ClickOS - Joao Martins...
The Linux Foundation
 
Why AIOps Matters For Kubernetes
Why AIOps Matters For KubernetesWhy AIOps Matters For Kubernetes
Why AIOps Matters For Kubernetes
Timothy Chen
 
How Adobe Built An OpenStack Cloud
How Adobe Built An OpenStack CloudHow Adobe Built An OpenStack Cloud
How Adobe Built An OpenStack Cloud
Jun Park
 

Similar to Providing Performance Guarantees to Virtual Machines using Real-Time Scheduling (20)

Self-tuning Schedulers for Legacy Real-Time Applications
Self-tuning Schedulers for Legacy Real-Time ApplicationsSelf-tuning Schedulers for Legacy Real-Time Applications
Self-tuning Schedulers for Legacy Real-Time Applications
 
Virtual Network Functions as Real-Time Containers in Private Clouds
Virtual Network Functions as Real-Time Containers in Private CloudsVirtual Network Functions as Real-Time Containers in Private Clouds
Virtual Network Functions as Real-Time Containers in Private Clouds
 
The IRMOS Real-Time Scheduler
The IRMOS Real-Time SchedulerThe IRMOS Real-Time Scheduler
The IRMOS Real-Time Scheduler
 
Self-tuning Schedulers for Legacy Real-Time Applications
Self-tuning Schedulers for Legacy Real-Time ApplicationsSelf-tuning Schedulers for Legacy Real-Time Applications
Self-tuning Schedulers for Legacy Real-Time Applications
 
SLAs in Virtualized Cloud Computing Infrastructures with QoS Assurance
SLAs in Virtualized Cloud Computing Infrastructures with QoS AssuranceSLAs in Virtualized Cloud Computing Infrastructures with QoS Assurance
SLAs in Virtualized Cloud Computing Infrastructures with QoS Assurance
 
Real-Time API
Real-Time APIReal-Time API
Real-Time API
 
Modeling and simulation of power consumption and execution times for real-tim...
Modeling and simulation of power consumption and execution times for real-tim...Modeling and simulation of power consumption and execution times for real-tim...
Modeling and simulation of power consumption and execution times for real-tim...
 
Mpls conference 2016-data center virtualisation-11-march
Mpls conference 2016-data center virtualisation-11-marchMpls conference 2016-data center virtualisation-11-march
Mpls conference 2016-data center virtualisation-11-march
 
A checkpointing mechanism for virtual clusters using memory- bound time-multi...
A checkpointing mechanism for virtual clusters using memory- bound time-multi...A checkpointing mechanism for virtual clusters using memory- bound time-multi...
A checkpointing mechanism for virtual clusters using memory- bound time-multi...
 
ICALEPCS 2011: Testing Environments using Virtualization
ICALEPCS 2011: Testing Environments using VirtualizationICALEPCS 2011: Testing Environments using Virtualization
ICALEPCS 2011: Testing Environments using Virtualization
 
Optimum Scalability Point for Parallelisable Real-Time Components
Optimum Scalability Point for Parallelisable Real-Time ComponentsOptimum Scalability Point for Parallelisable Real-Time Components
Optimum Scalability Point for Parallelisable Real-Time Components
 
Comparison of Open Source Virtualization Technology
Comparison of Open Source Virtualization TechnologyComparison of Open Source Virtualization Technology
Comparison of Open Source Virtualization Technology
 
Tommaso Cucinotta - Low-latency and power-efficient audio applications on Linux
Tommaso Cucinotta - Low-latency and power-efficient audio applications on LinuxTommaso Cucinotta - Low-latency and power-efficient audio applications on Linux
Tommaso Cucinotta - Low-latency and power-efficient audio applications on Linux
 
An Evaluation of Adaptive Partitioning of Real-Time Workloads on Linux
An Evaluation of Adaptive Partitioning of Real-Time Workloads on LinuxAn Evaluation of Adaptive Partitioning of Real-Time Workloads on Linux
An Evaluation of Adaptive Partitioning of Real-Time Workloads on Linux
 
A Casual Teaching Tool for Large Size Computer Laboratories ans Small Size Se...
A Casual Teaching Tool for Large Size Computer Laboratories ans Small Size Se...A Casual Teaching Tool for Large Size Computer Laboratories ans Small Size Se...
A Casual Teaching Tool for Large Size Computer Laboratories ans Small Size Se...
 
High Performance Parallel Computing with Clouds and Cloud Technologies
High Performance Parallel Computing with Clouds and Cloud TechnologiesHigh Performance Parallel Computing with Clouds and Cloud Technologies
High Performance Parallel Computing with Clouds and Cloud Technologies
 
Embedded Intro India05
Embedded Intro India05Embedded Intro India05
Embedded Intro India05
 
XPDS13: Enabling Fast, Dynamic Network Processing with ClickOS - Joao Martins...
XPDS13: Enabling Fast, Dynamic Network Processing with ClickOS - Joao Martins...XPDS13: Enabling Fast, Dynamic Network Processing with ClickOS - Joao Martins...
XPDS13: Enabling Fast, Dynamic Network Processing with ClickOS - Joao Martins...
 
Why AIOps Matters For Kubernetes
Why AIOps Matters For KubernetesWhy AIOps Matters For Kubernetes
Why AIOps Matters For Kubernetes
 
How Adobe Built An OpenStack Cloud
How Adobe Built An OpenStack CloudHow Adobe Built An OpenStack Cloud
How Adobe Built An OpenStack Cloud
 

Recently uploaded

Welocme to ViralQR, your best QR code generator.
Welocme to ViralQR, your best QR code generator.Welocme to ViralQR, your best QR code generator.
Welocme to ViralQR, your best QR code generator.
ViralQR
 
Elevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object CalisthenicsElevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object Calisthenics
Dorra BARTAGUIZ
 
By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024
Pierluigi Pugliese
 
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Thierry Lestable
 
DevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA ConnectDevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA Connect
Kari Kakkonen
 
Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™
Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™
Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™
UiPathCommunity
 
The Future of Platform Engineering
The Future of Platform EngineeringThe Future of Platform Engineering
The Future of Platform Engineering
Jemma Hussein Allen
 
The Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and SalesThe Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and Sales
Laura Byrne
 
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdfFIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance
 
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdfSmart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
91mobiles
 
Assure Contact Center Experiences for Your Customers With ThousandEyes
Assure Contact Center Experiences for Your Customers With ThousandEyesAssure Contact Center Experiences for Your Customers With ThousandEyes
Assure Contact Center Experiences for Your Customers With ThousandEyes
ThousandEyes
 
Accelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish CachingAccelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish Caching
Thijs Feryn
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
DanBrown980551
 
PCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase TeamPCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase Team
ControlCase
 
Assuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyesAssuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyes
ThousandEyes
 
UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3
DianaGray10
 
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdfSAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
Peter Spielvogel
 
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
UiPathCommunity
 
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdfFIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance
 
GraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge GraphGraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge Graph
Guy Korland
 

Recently uploaded (20)

Welocme to ViralQR, your best QR code generator.
Welocme to ViralQR, your best QR code generator.Welocme to ViralQR, your best QR code generator.
Welocme to ViralQR, your best QR code generator.
 
Elevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object CalisthenicsElevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object Calisthenics
 
By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024
 
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
 
DevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA ConnectDevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA Connect
 
Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™
Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™
Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™
 
The Future of Platform Engineering
The Future of Platform EngineeringThe Future of Platform Engineering
The Future of Platform Engineering
 
The Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and SalesThe Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and Sales
 
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdfFIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdf
 
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdfSmart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
 
Assure Contact Center Experiences for Your Customers With ThousandEyes
Assure Contact Center Experiences for Your Customers With ThousandEyesAssure Contact Center Experiences for Your Customers With ThousandEyes
Assure Contact Center Experiences for Your Customers With ThousandEyes
 
Accelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish CachingAccelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish Caching
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
 
PCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase TeamPCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase Team
 
Assuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyesAssuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyes
 
UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3
 
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdfSAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
 
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
 
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdfFIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
 
GraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge GraphGraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge Graph
 

Providing Performance Guarantees to Virtual Machines using Real-Time Scheduling

  • 1. VHPC 2010 August 31st, Ischia, Italy Providing Performance Guarantees to Virtual Machines using Real-Time Scheduling Tommaso Cucinotta, Dhaval Giani, Dario Faggioli, Fabio Checconi Real-Time Systems Laboratory Scuola Superiore Sant'Anna Pisa, Italy Tommaso Cucinotta – ReTiS Lab – Scuola Superiore Sant'Anna – Pisa – Italy 1/24
  • 2. Introduction and Motivations Tommaso Cucinotta – ReTiS Lab – Scuola Superiore Sant'Anna – Pisa – Italy 2/24
  • 3. Introduction “Traditional” worlds of computing  General-Purpose Computing (GPC) and servers  Low-cost, low parallelism degree equipment  Focused on mixing batch and interactive workloads  High-Performance Computing (HPC)  High-cost, massively parallel and vector-based equipment  Focused on batch computing, tightly coupled parallel tasks, scientific applications New trends  Affordable many-core systems also for GPC  Cloud-Computing: world-level scalability and replicability  Use of high-performance hardware in CC applications  Interest in mixing interactive/real-time and HPC workloads Tommaso Cucinotta – ReTiS Lab – Scuola Superiore Sant'Anna – Pisa – Italy 3/24
  • 4. Introduction Virtualization is a key technology  For IaaS providers (Cloud Computing)  For server consolidation Physical Host Physical Host OS OS ... Physical Host Physical Host OS OS Tommaso Cucinotta – ReTiS Lab – Scuola Superiore Sant'Anna – Pisa – Italy 4/24
  • 5. Introduction Virtualization is a key technology  For IaaS providers (Cloud Computing)  For server consolidation Physical Host Physical Host Physical Host Physical Host OS OS VM/OS VM/OS ... ... Physical Host VM/OS VM/OS Physical Host OS OS Tommaso Cucinotta – ReTiS Lab – Scuola Superiore Sant'Anna – Pisa – Italy 5/24
  • 6. Need for Performance Isolation Resource sharing → Temporal interference Physical Host Physical Host VM VM VM VM VM VM VM VM Tommaso Cucinotta – ReTiS Lab – Scuola Superiore Sant'Anna – Pisa – Italy 6/24
  • 7. Need for Performance Isolation ~30ms Resource sharing VM Alone → Temporal interference Physical Host Physical Host VM VM VM VM VM VM VM VM Tommaso Cucinotta – ReTiS Lab – Scuola Superiore Sant'Anna – Pisa – Italy 7/24
  • 8. Need for Performance Isolation ~30ms Resource sharing VM Alone → Temporal interference Physical Host Physical Host VM VM VM VM ~120ms 2 VMs VM VM VM VM Tommaso Cucinotta – ReTiS Lab – Scuola Superiore Sant'Anna – Pisa – Italy 8/24
  • 9. Possible Solution Hardware replication and Physical Host Physical Host static partitioning VM VM  Computing  Multi-core (1 core per VM) VM VM  Networking  Multiple network adapters (1 network adapter per VM) VM VM  Multi-queue adapters Drawbacks VM VM  Limitation of flexibility  Under-utilization of resources  e.g., with real-time/interactive workloads (cloud computing) Tommaso Cucinotta – ReTiS Lab – Scuola Superiore Sant'Anna – Pisa – Italy 9/24
  • 10. Possible Solution Another approach  Let multiple VMs use the same resources  Use proper resource scheduling strategies For example  Computing  Xen credit-based and SEDF schedulers  Networking  QoS-aware protocols (IntServ, MPLS) Tommaso Cucinotta – ReTiS Lab – Scuola Superiore Sant'Anna – Pisa – Italy 10/24
  • 11. What is still missing ? Most approaches  Investigate on high-level load balancing techniques  Without the necessary care for fine-grain resource scheduling  Focus on fairness among multiple VMs Only a few works  Focus on providing precise QoS guarantees to VMs  For example  Gupta et al., “[...] Performance Isolation […] in Xen”  Cherkasova et al., “Comparison of the 3 CPU Schedulers in Xen” Our focus  Network performance isolation among concurrent VMs  Virtualization with a host OS  Specifically, on Linux as host OS with KVM Tommaso Cucinotta – ReTiS Lab – Scuola Superiore Sant'Anna – Pisa – Italy 11/24
  • 12. Proposed Approach Tommaso Cucinotta – ReTiS Lab – Scuola Superiore Sant'Anna – Pisa – Italy 12/24
  • 13. Proposed Approach Use of real-time CPU scheduling  As basic mechanism for isolating VMs concurrently running on the same CPU and core IRMOS Real-Time Scheduler  For the Linux kernel  Provides hierarchical EDF/FP scheduling  EDF-based resource reservations – (Q, P): a budget Q is granted every period P – Both a guarantee and a limitation  FP scheduling within each EDF reservation  Provides temporal isolation Tommaso Cucinotta – ReTiS Lab – Scuola Superiore Sant'Anna – Pisa – Italy 13/24
  • 14. What can we achieve ? (by CPU real-time scheduling) CPU real-time scheduling achieves  Performance isolation of compute-intensive VMs  Shown in other papers  What about network-intensive VMs ? Problems  On the host OS (Linux)  network-intensive VMs impose a big interrupt workload – difficult to quantify and keep under control – steals CPU from reservations of other VMs  How can we fix this ? Tommaso Cucinotta – ReTiS Lab – Scuola Superiore Sant'Anna – Pisa – Italy 14/24
  • 15. Preliminary Solution Preliminary solution (subject of this paper)  Temporal isolation of VMs by real-time scheduling  Scheduling parameters tuned according to benchmarked figures – Reservation period set according to responsiveness requirements – Budget needed for computations, plus – Budget needed for sustaining network traffic  Budget over-provisioning  According to the foreseen interference due to network-intensive VMs Tommaso Cucinotta – ReTiS Lab – Scuola Superiore Sant'Anna – Pisa – Italy 15/24
  • 16. Experimental Results Tommaso Cucinotta – ReTiS Lab – Scuola Superiore Sant'Anna – Pisa – Italy 16/24
  • 17. Experimental Results (Q6600 @ 2.4 GHz, 1-Gbit Card) Achievable network throughput as a function of the CPU share reserved to the VM  Measured via iperf: almost linear, as expected Tommaso Cucinotta – ReTiS Lab – Scuola Superiore Sant'Anna – Pisa – Italy 17/24
  • 18. Experimental Results (2 VMs on the same core) Throughput as a function of its own budget  13% drop when  own reservation is 35% (from ~300 Mbps to ~260 Mbps)  and reservation of other VM increased from 10% to 40% Tommaso Cucinotta – ReTiS Lab – Scuola Superiore Sant'Anna – Pisa – Italy 18/24
  • 19. Experimental Results (2 VMs on the same core) Throughput as a function of its own budget  ~14% budget over-provisioning needed  for sustaining a ~300 Mbps throughput  when reservation of other VM increased from 10% to 40% Tommaso Cucinotta – ReTiS Lab – Scuola Superiore Sant'Anna – Pisa – Italy 19/24
  • 20. Experimental Results (2 VMs on the same core) Throughput as a function of the budget reserved to the other interfering VM  Drop due to compute-intensive interfering VM lower than the one due to network-intensive interfering VM Tommaso Cucinotta – ReTiS Lab – Scuola Superiore Sant'Anna – Pisa – Italy 20/24
  • 21. Experimental Results (application-level benchmark) Download time for a 100 KB file from Apache  Periodic download requests every 20ms  Response-times may be kept much more stable by real-time scheduling Tommaso Cucinotta – ReTiS Lab – Scuola Superiore Sant'Anna – Pisa – Italy 21/24
  • 22. Conclusions and Future Work Tommaso Cucinotta – ReTiS Lab – Scuola Superiore Sant'Anna – Pisa – Italy 22/24
  • 23. Conclusions and Future Work Conclusions  We showed how to achieve network performance isolation by CPU real-time scheduling  Technique to be used jointly with traffic-shaping techniques Planned Future Work  Experiment with PREEMPT_RT  IRQs handled in kernel threads  Investigate on task-level isolation of network traffic of different VMs (via real-time scheduling)  Make KVM a QoS-aware hypervisor  Investigate on the use of Adaptive Reservations Tommaso Cucinotta – ReTiS Lab – Scuola Superiore Sant'Anna – Pisa – Italy 23/24
  • 24. Thanks for your attention! Questions ? Tommaso Cucinotta – ReTiS Lab – Scuola Superiore Sant'Anna – Pisa – Italy 24/24