CHARACTERIZING THE CACHING BEHAVIOUR OF DISTRIBUTED CLOUD APPLICATIONS
Kanwar Gill, Ioan Stefanovici (Mentor), Bianca Schroeder (Supervisor)
Cloud Data Centers
• Enterprise applications are migrating to hosted platforms in cloud data centers
  ◦ Such platforms are virtualized
  ◦ Sharing of resources with other tenants is mediated by hypervisors such as Hyper-V, VMware ESX, and Xen
• One important such resource: storage caches
  ◦ Provide quick access to frequently accessed data
  ◦ Caches exist in multiple places along the IO stack
• Problem: Storage caches along the IO stack are not designed for multi-tenancy and are not workload-aware
[Figure: VMs running on hypervisors above shared storage servers, with caches along the IO stack between application and storage]
Problems with Cloud Storage Caches
• Lack of performance isolation between workloads
  ◦ The cache space a workload is allocated depends on its access patterns and on how aggressively it issues requests
• Lack of customization
  ◦ Workload performance can vary significantly with different caching policies
• Lack of coordination
  ◦ Each cache makes local decisions, agnostic to other caches on the IO stack
• Lack of adaptability
  ◦ Hard (or impossible) to respond to changes in workload and tenant/provider objectives
• Waste of system resources
  ◦ Lack of control over all of the above can lead to wasted resources
  ◦ E.g.: cache space allocated to a sequential workload that makes no use of it
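
The isolation and waste problems can be illustrated with a small sketch. It is not taken from the slides: the shared LRU cache, the two tenants, and all sizes and request rates below are made-up assumptions chosen only to show the effect. Tenant A loops over a small hot set, while tenant B runs an aggressive sequential scan whose blocks are never reused.

```python
from collections import OrderedDict
from itertools import count


class LRUCache:
    """Minimal LRU cache that tracks block identities only (no data)."""

    def __init__(self, capacity):
        self.capacity = capacity
        self.blocks = OrderedDict()

    def access(self, key):
        """Return True on a hit; on a miss, insert key and evict the LRU block if full."""
        if key in self.blocks:
            self.blocks.move_to_end(key)
            return True
        self.blocks[key] = None
        if len(self.blocks) > self.capacity:
            self.blocks.popitem(last=False)
        return False


def hot_set_hit_ratio(capacity, with_scan):
    """Hit ratio seen by tenant A, optionally sharing the cache with a scanning tenant B."""
    cache = LRUCache(capacity)
    scan_block = count()
    hits = 0
    requests = 10_000
    for i in range(requests):
        # Tenant A: cycles over a hot set of 50 blocks (hypothetical workload).
        hits += cache.access(("A", i % 50))
        if with_scan:
            # Tenant B: 20 sequential, never-reused blocks per request from A.
            for _ in range(20):
                cache.access(("B", next(scan_block)))
    return hits / requests


alone = hot_set_hit_ratio(100, with_scan=False)
shared = hot_set_hit_ratio(100, with_scan=True)
print(f"tenant A alone:  {alone:.3f}")
print(f"tenant A shared: {shared:.3f}")
```

In this toy setup the scan gets no hits of its own, yet it pushes tenant A's hot set out of the cache, so the space it occupies is wasted.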
Software-Defined Caching
• Moirai: a system developed by my mentor Ioan and researchers at Microsoft Research Cambridge that addresses these problems
  ◦ To appear at the Symposium on Cloud Computing (SoCC) 2015 in August
  ◦ Comprises three components: controller, metrics engine, and programmable caches
  ◦ Provides: performance isolation, cache policy customization, distributed cache coordination
[Figure: Moirai architecture. Control plane: a controller (cache allocation, configuration, placement) and a metrics engine. Data plane: VMs on a compute server (hypervisor) and a storage server, each with a cache exposing a CtrlAPI. Panel title: Programmable Cache API]
My Research:
• Characterize IO behavior of cloud applications to inform cache configuration decisions in data centers employing software-defined caching.
• Configuration parameters:
  ◦ Cache size
  ◦ Eviction policy
  ◦ Write policy
  ◦ Placement of data in caches along the IO stack
• Methodology:
  ◦ IO traces collected from workloads running on real systems
  ◦ OLTP, Microsoft enterprise
  ◦ Simulation
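
One way to picture the configuration space that such a simulation study sweeps over is sketched below. Every name here (`CacheConfig`, the enum members, `config_grid`) is hypothetical and is not Moirai's API; the sketch only records the four parameters listed above and enumerates their combinations.

```python
from dataclasses import dataclass
from enum import Enum
from itertools import product


class EvictionPolicy(Enum):
    LRU = "lru"
    MRU = "mru"
    LFU = "lfu"
    CLOCK = "clock"


class WritePolicy(Enum):
    WRITE_BACK = "write-back"
    WRITE_THROUGH = "write-through"
    WRITE_AROUND = "write-around"


class Placement(Enum):
    # Assumed cache locations along the IO stack.
    COMPUTE_SERVER = "compute-server"
    STORAGE_SERVER = "storage-server"


@dataclass(frozen=True)
class CacheConfig:
    """One point in the (hypothetical) configuration space."""
    size_bytes: int
    eviction: EvictionPolicy
    write_policy: WritePolicy
    placement: Placement


def config_grid(sizes):
    """Enumerate every combination of the four parameters for the given sizes."""
    return [
        CacheConfig(size, eviction, write_policy, placement)
        for size, eviction, write_policy, placement in product(
            sizes, EvictionPolicy, WritePolicy, Placement
        )
    ]


grid = config_grid([64 * 2**20, 256 * 2**20, 1024 * 2**20])
print(len(grid), "configurations, e.g.", grid[0])
```

Even this small grid has 72 points per workload.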
Example Cache Configurations
• Consolidating Memory Over Fast Networks
• Scale-Out LRU Cache + Bandwidth Guarantees
[Figure: IO-stack diagrams (VM, Hypervisor, Storage) for the two configurations. Labels: Workload A, A.file, caches C1 and C2; VM1 ... VMn, DATA.file, cache C, Default IO]
Current Progress
• Automated parallel test harness
• Single-cache simulator implementing LRU, MRU, LFU, CLOCK
• TPC-C and MSR Cambridge Enterprise IO traces
• Trace Operations: (Timestamp, Hostname, DiskNumber, Type, Offset, Size, Response Time)
• Simulator currently outputs
  ◦ Cache size (bytes) vs. Hit Ratio (%) graphs
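
A minimal sketch of the kind of single-cache LRU simulation described above, not the actual simulator. It assumes the trace is comma-separated with the seven fields in the order listed, that Offset and Size are in bytes, a 4 KiB cache block, and that reads and writes are both cached on access. The sample records are synthetic and exist only to make the example runnable.

```python
import csv
import io
from collections import OrderedDict

BLOCK_SIZE = 4096  # assumed cache block size in bytes


def blocks_from_trace(lines):
    """Yield one (hostname, disk, block_number) key per block touched by each record."""
    for row in csv.reader(lines):
        _timestamp, hostname, disk, _op_type, offset, size, _response_time = row
        offset, size = int(offset), int(size)
        first = offset // BLOCK_SIZE
        last = (offset + max(size, 1) - 1) // BLOCK_SIZE
        for block in range(first, last + 1):
            yield (hostname, disk, block)


def lru_hit_ratio(accesses, cache_bytes):
    """Replay block accesses through an LRU cache and return the hit ratio in percent."""
    capacity = cache_bytes // BLOCK_SIZE
    cache = OrderedDict()
    hits = total = 0
    for key in accesses:
        total += 1
        if key in cache:
            hits += 1
            cache.move_to_end(key)
        elif capacity > 0:
            cache[key] = None
            if len(cache) > capacity:
                cache.popitem(last=False)  # evict the least recently used block
    return 100.0 * hits / total if total else 0.0


# Synthetic records in the assumed field order (not real trace data).
SAMPLE_TRACE = """\
128166372003061629,usr,0,Read,0,8192,1000
128166372003061700,usr,0,Read,8192,4096,900
128166372003061800,usr,0,Write,0,4096,1200
128166372003061900,usr,0,Read,4096,4096,800
128166372003062000,usr,0,Read,8192,4096,850
128166372003062100,usr,0,Read,0,4096,700
"""

accesses = list(blocks_from_trace(io.StringIO(SAMPLE_TRACE)))
curve = [(size, lru_hit_ratio(accesses, size)) for size in (4096, 8192, 12288)]
for size, ratio in curve:
    print(f"{size:>6} bytes -> {ratio:5.1f}% hits")
```

The sample cycles over three blocks, so LRU gets no hits until the cache holds all three. This is the kind of cliff a cache size vs. hit ratio graph makes visible.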
Future Work:
• More sophisticated cache eviction policies
  ◦ LRU-K, ARC, etc.
• Write policies
  ◦ Write-back, write-through, write-around
• Multi-cache behavior (e.g. scale-out caches)
• [Potentially] collect & investigate traces from other cloud workloads
  ◦ E.g. CloudSuite
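
To show what distinguishes the three write policies named above, here is a rough sketch. It is not part of the existing simulator; the LRU cache model, the operation format, and the counters are assumptions made for the example. It replays a short read/write sequence and counts read hits and writes that reach backend storage.

```python
from collections import OrderedDict


def simulate(ops, capacity, policy):
    """Replay (op, block) pairs through an LRU cache; return (read_hits, backend_writes)."""
    cache = OrderedDict()  # block -> dirty flag
    read_hits = 0
    backend_writes = 0

    def insert(block, dirty):
        nonlocal backend_writes
        cache[block] = dirty
        cache.move_to_end(block)
        if len(cache) > capacity:
            _, was_dirty = cache.popitem(last=False)
            if was_dirty:
                backend_writes += 1  # dirty block is flushed when evicted

    for op, block in ops:
        if op == "R":
            if block in cache:
                read_hits += 1
                cache.move_to_end(block)
            else:
                insert(block, dirty=False)
        elif policy == "write-through":
            backend_writes += 1          # write goes to storage immediately
            insert(block, dirty=False)   # and the cache keeps a clean copy
        elif policy == "write-back":
            insert(block, dirty=True)    # storage is updated later, on eviction
        elif policy == "write-around":
            backend_writes += 1          # write bypasses the cache
            cache.pop(block, None)       # drop any stale cached copy
        else:
            raise ValueError(f"unknown policy: {policy}")

    backend_writes += sum(cache.values())  # final flush of remaining dirty blocks
    return read_hits, backend_writes


# Hypothetical sequence: repeated writes to block 1, then reads of recently written data.
ops = [("W", 1), ("W", 1), ("W", 1), ("R", 1), ("W", 2), ("R", 2), ("R", 1)]
results = {p: simulate(ops, capacity=2, policy=p)
           for p in ("write-through", "write-back", "write-around")}
for policy, (hits, writes) in results.items():
    print(f"{policy:14s} read hits={hits} backend writes={writes}")
```

On this sequence write-back absorbs the repeated writes to the same block, while write-around saves cache space at the cost of read misses on recently written data. Which trade-off wins depends on the workload.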
Questions?
• Thank you for listening to my awesome research this summer!
• Any questions?
