SlideShare a Scribd company logo
CloudHealth
A Model-Driven Approach to Watch the
Health of Cloud Services
by Anas Shatnawi, Ph.D.
University of Milan-Bicocca
Milan, Italy
Matteo Orru, Ph.D. Marco Mobilio, Ph.D. Prof. Leonardo Mariani, Ph.D.Oliviero Riganelli, Ph.D.
CloudHealth
International Workshop on Software Health 2018
C loudHealth
Problem and Motivation
Primitive KPIs:
• CPU utilization
• Mem consumption
• Network
• …
Ship
Monitoring goals:
• Performance
• Efficiency
• Reliability
• …
How to map primitive KPIs
into monitoring goals
Monitoring goals:
• Availability
• Adaptability
• Reliability
• …
Different operators with
different interest, thus
different monitoring goals
C loudHealth
The Main Idea of CloudHealth
Primitive KPIs:
• CPU utilization
• Mem consumption
• Network
• …
Ship
Monitoring goals:
• Performance
• Efficiency
• Reliability
• …
From monitoring goals to probes to collect primitive KPIs to be collected
From primitive KPIs to dashboard based on monitoring goals
Model
Reliability
Efficiency
Performance
… …
CPU
Memory
Network
C loudHealth
CloudHealth Monitoring Model (CHMM)
ISO/IEC 25010:2011 and 25011:2017
• IT service quality
• Product quality
• Quality in use model
• Quality attributes that could be adapted for cloud
services
• Definition offered by the ISO standards
ReliabilityResponsivenessAdaptabilityEffectivenessEfficiencyCompatibilityPerformance
Continuity
Recoverability
Availability
Reactiveness
Timeliness
Customizability
Initiative
Accuracy
Completeness
Positive
resource
consumption
Nigative
resource
consumption
Co-existence
Timebehaviour
Resource
utilization
Capacity
Timebetweenfailures
Timetorecovera
failure
Consistencyofreplicas
Numberoffailures
Durationoffailures
Waitingtimetoaccept
requests
Waitingtimetoreceive
response
Scalabilityofresources
Timetoperform
adaptation
Successfullyprovided
services
Faultyservices
Resourceconsumption
forsuccessfully
providedservices
Resourceconsumption
forfaultyprovided
services
Resourceconflict
Responsetime
Latency
Throughput
Resourcesconsumed
perservice
Sizeofworkload
comparetoavailable
infrastructure
Quality models
Monitoring
goals
Monitoring
subgoals
Cloud service
proprieties
Probes to
collect KPIs
Level of consistency and
stability of service outcomes.
Level to which a system can restore services (including
functions and data) and making them available, in the
event of interruptions, failures or disasters.
C loudHealth
The Process of CloudHealth
Selecting
monitoring goals
Cloud operators
Configure Deploy Operate
CHMM Monitoring goals
C loudHealth
Selecting Monitoring Goals and Services
ReliabilityResponsivenessAdaptabilityEffectivenessEfficiencyCompatibilityPerformance
Continuity
Recoverability
Availability
Reactiveness
Timeliness
Customizability
Initiative
Accuracy
Completeness
Positive
resource
consumption
Nigative
resource
consumption
Co-existence
Timebehaviour
Resource
utilization
Capacity
Timebetweenfailures
Timetorecovera
failure
Consistencyof
replicas
Numberoffailures
Durationoffailures
Waitingtimeto
acceptrequests
Waitingtimeto
receiveresponse
Scalabilityof
resources
Timetoperform
adaptation
Successfullyprovided
services
Faultyservices
Resource
consumptionfor
successfullyprovided
services
Resource
consumptionfor
faultyprovided
services
Resourceconflict
Responsetime
Latency
Throughput
Resourcesconsumed
perservice
Sizeofworkload
comparetoavailable
infrastructure
CHMM
C loudHealth
The Process of CloudHealth
Selecting
monitoring goals
Mapping
monitoring goals to
Probes
Cloud operators Monitoring goals Probes
Configure Deploy Operate
CHMM
C loudHealth
Mapping Monitoring Goals to Probes
ReliabilityResponsivenessAdaptabilityEffectivenessEfficiencyCompatibilityPerformance
Continuity
Recoverability
Availability
Reactiveness
Timeliness
Customizability
Initiative
Accuracy
Completeness
Positive
resource
consumption
Nigative
resource
consumption
Co-existence
Timebehaviour
Resource
utilization
Capacity
Timebetweenfailures
Timetorecovera
failure
Consistencyof
replicas
Numberoffailures
Durationoffailures
Waitingtimeto
acceptrequests
Waitingtimeto
receiveresponse
Scalabilityof
resources
Timetoperform
adaptation
Successfullyprovided
services
Faultyservices
Resource
consumptionfor
successfullyprovided
services
Resource
consumptionfor
faultyprovided
services
Resourceconflict
Responsetime
Latency
Throughput
Resourcesconsumed
perservice
Sizeofworkload
comparetoavailable
infrastructure
CHMM
C loudHealth
The Process of CloudHealth
Selecting
monitoring goals
Mapping
monitoring goals to
Probes
Cloud operators Monitoring goals
Attaching probes
to services
Probes Cloud services Probes attached
to services
Configure Deploy Operate
Probes
catalog
Cloud
architecture
CHMM
C loudHealth
Probing Approaches
Target service
Ship
Service host Database host
Another
service
Target
service
API
Query
KPIs
ShipKPIs
Service host Probe host Database host
1) Inside the host
2) Outside the host
Rest or SNMP APIs
C loudHealth
ELK Stack (Elasticsearch, Logstash, Kibana)
Service 1
Service 2
Service N
Ship
Logstash Elasticsearch
Parse, filter and
transform
Kibana
Query KPIs
• Pulling data from various sources
• Execute different transformations
• Exports the data to various targets
• Indexes and stores the
information in a NoSQL
database
• Visualization layer that works
on top of Elasticsearch
KPIs
C loudHealth
ELK Probes
• Lightweight agents
• Monitoring service and system levels
• Deployable on Linux, Windows, and Mac hosts
• Connectable to ELK stack
Metricbeat Filebeat Packetbeat
C loudHealth
Metricbeat
Host
VM
Apache
VM
VM KPIs
Apache-specific
KPIs
Host KPIs
KPIs:
- CPU usage
- Memory
- File system
- Disk IO
- Network IO
If some of your services are not, build your own Metricbeat module 
Apache Module configuration
C loudHealth
The Process of CloudHealth
Selecting
monitoring goals
Mapping
monitoring goals to
Probes
Cloud operators Monitoring goals
Attaching probes
to services
Probes Cloud services Probes attached
to services
Customizing the
dashboard
Dashboard
Configure Deploy Operate
Probes
catalog
Cloud
architecture
CHMMCHMM
C loudHealth
Customizing the Dashboard
Kibana
Kibana APIs
Vega APIs
Monitoring
goals CHMM
Elasticsearch
Query KPIs
KPIs
Visualize
Calculating
monitoring goals
values from KPIs
C loudHealth
Related Work
• There are not approaches that use models to automatically configure,
deploy and operate a complete monitoring solution
• Existing work concentrates on some aspects of this process only
C loudHealth
Related Work
• There are not approaches that use models to automatically configure,
deploy and operate a complete monitoring solution
• Existing work concentrates on some aspects of this process only
C loudHealth
Conclusions
C loudHealth
Future Direction
C loudHealth
Future Direction
C loudHealth
Thank you… Questions?

More Related Content

What's hot

Capgemini: Observability within the Dutch government
Capgemini: Observability within the Dutch governmentCapgemini: Observability within the Dutch government
Capgemini: Observability within the Dutch government
Elasticsearch
 
Cloud migration process simplified, innovate vancouver
Cloud migration process   simplified, innovate vancouverCloud migration process   simplified, innovate vancouver
Cloud migration process simplified, innovate vancouver
Innovate Vancouver
 
OpenStack Marketing Meeting - Icehouse Release
OpenStack Marketing Meeting - Icehouse ReleaseOpenStack Marketing Meeting - Icehouse Release
OpenStack Marketing Meeting - Icehouse Release
OpenStack Foundation
 
Oracle Cloud Native
Oracle Cloud NativeOracle Cloud Native
Oracle Cloud Native
Neagu Alexandru Cristian
 
(ATS3-APP06) Integrating Lab Equipment with the Accelrys Enterprise Platform
(ATS3-APP06) Integrating Lab Equipment with the Accelrys Enterprise Platform(ATS3-APP06) Integrating Lab Equipment with the Accelrys Enterprise Platform
(ATS3-APP06) Integrating Lab Equipment with the Accelrys Enterprise Platform
BIOVIA
 
QUDOS2015
QUDOS2015QUDOS2015
QUDOS2015
Danilo Ardagna
 
Intern Report
Intern ReportIntern Report
Intern Report
Shivang Singh
 
SPACE4Clouds: a DevOps Environment for multi-Clouds Applications
SPACE4Clouds: a DevOps Environment for multi-Clouds ApplicationsSPACE4Clouds: a DevOps Environment for multi-Clouds Applications
SPACE4Clouds: a DevOps Environment for multi-Clouds Applications
Michele Guerriero
 
How eStruxture Data Centers is Using ECE to Rapidly Scale Their Business
How eStruxture Data Centers is Using ECE to Rapidly Scale Their BusinessHow eStruxture Data Centers is Using ECE to Rapidly Scale Their Business
How eStruxture Data Centers is Using ECE to Rapidly Scale Their Business
Elasticsearch
 
Elastic @ John Deere
Elastic @ John DeereElastic @ John Deere
Elastic @ John Deere
Elasticsearch
 
Centralized logging in a changing environment at the UK’s DVLA
Centralized logging in a changing environment at the UK’s DVLACentralized logging in a changing environment at the UK’s DVLA
Centralized logging in a changing environment at the UK’s DVLA
Elasticsearch
 
Automatize a detecção de ameaças e evite falsos positivos
Automatize a detecção de ameaças e evite falsos positivosAutomatize a detecção de ameaças e evite falsos positivos
Automatize a detecção de ameaças e evite falsos positivos
Elasticsearch
 
使用 Elastic Stack 进行端对端安全分析
使用 Elastic Stack 进行端对端安全分析 使用 Elastic Stack 进行端对端安全分析
使用 Elastic Stack 进行端对端安全分析
Elasticsearch
 
Autonomous Database Explained
Autonomous Database ExplainedAutonomous Database Explained
Autonomous Database Explained
Neagu Alexandru Cristian
 
Getting Started with Infrastructure as Code (IaC)
Getting Started with Infrastructure as Code (IaC)Getting Started with Infrastructure as Code (IaC)
Getting Started with Infrastructure as Code (IaC)
Noor Basha
 
Elastic Cloud Enterprise @ Cisco
Elastic Cloud Enterprise @ CiscoElastic Cloud Enterprise @ Cisco
Elastic Cloud Enterprise @ Cisco
Elasticsearch
 
Logging using ELK Stack for Microservices
Logging using ELK Stack for MicroservicesLogging using ELK Stack for Microservices
Logging using ELK Stack for Microservices
Vineet Sabharwal
 
Leveraging Operational Data in the Cloud
Leveraging Operational Data in the CloudLeveraging Operational Data in the Cloud
Leveraging Operational Data in the Cloud
Inductive Automation
 
Elastic APM : développez vos logs et vos indicateurs pour obtenir une vue com...
Elastic APM : développez vos logs et vos indicateurs pour obtenir une vue com...Elastic APM : développez vos logs et vos indicateurs pour obtenir une vue com...
Elastic APM : développez vos logs et vos indicateurs pour obtenir une vue com...
Elasticsearch
 
Security Events Logging at Bell with the Elastic Stack
Security Events Logging at Bell with the Elastic StackSecurity Events Logging at Bell with the Elastic Stack
Security Events Logging at Bell with the Elastic Stack
Elasticsearch
 

What's hot (20)

Capgemini: Observability within the Dutch government
Capgemini: Observability within the Dutch governmentCapgemini: Observability within the Dutch government
Capgemini: Observability within the Dutch government
 
Cloud migration process simplified, innovate vancouver
Cloud migration process   simplified, innovate vancouverCloud migration process   simplified, innovate vancouver
Cloud migration process simplified, innovate vancouver
 
OpenStack Marketing Meeting - Icehouse Release
OpenStack Marketing Meeting - Icehouse ReleaseOpenStack Marketing Meeting - Icehouse Release
OpenStack Marketing Meeting - Icehouse Release
 
Oracle Cloud Native
Oracle Cloud NativeOracle Cloud Native
Oracle Cloud Native
 
(ATS3-APP06) Integrating Lab Equipment with the Accelrys Enterprise Platform
(ATS3-APP06) Integrating Lab Equipment with the Accelrys Enterprise Platform(ATS3-APP06) Integrating Lab Equipment with the Accelrys Enterprise Platform
(ATS3-APP06) Integrating Lab Equipment with the Accelrys Enterprise Platform
 
QUDOS2015
QUDOS2015QUDOS2015
QUDOS2015
 
Intern Report
Intern ReportIntern Report
Intern Report
 
SPACE4Clouds: a DevOps Environment for multi-Clouds Applications
SPACE4Clouds: a DevOps Environment for multi-Clouds ApplicationsSPACE4Clouds: a DevOps Environment for multi-Clouds Applications
SPACE4Clouds: a DevOps Environment for multi-Clouds Applications
 
How eStruxture Data Centers is Using ECE to Rapidly Scale Their Business
How eStruxture Data Centers is Using ECE to Rapidly Scale Their BusinessHow eStruxture Data Centers is Using ECE to Rapidly Scale Their Business
How eStruxture Data Centers is Using ECE to Rapidly Scale Their Business
 
Elastic @ John Deere
Elastic @ John DeereElastic @ John Deere
Elastic @ John Deere
 
Centralized logging in a changing environment at the UK’s DVLA
Centralized logging in a changing environment at the UK’s DVLACentralized logging in a changing environment at the UK’s DVLA
Centralized logging in a changing environment at the UK’s DVLA
 
Automatize a detecção de ameaças e evite falsos positivos
Automatize a detecção de ameaças e evite falsos positivosAutomatize a detecção de ameaças e evite falsos positivos
Automatize a detecção de ameaças e evite falsos positivos
 
使用 Elastic Stack 进行端对端安全分析
使用 Elastic Stack 进行端对端安全分析 使用 Elastic Stack 进行端对端安全分析
使用 Elastic Stack 进行端对端安全分析
 
Autonomous Database Explained
Autonomous Database ExplainedAutonomous Database Explained
Autonomous Database Explained
 
Getting Started with Infrastructure as Code (IaC)
Getting Started with Infrastructure as Code (IaC)Getting Started with Infrastructure as Code (IaC)
Getting Started with Infrastructure as Code (IaC)
 
Elastic Cloud Enterprise @ Cisco
Elastic Cloud Enterprise @ CiscoElastic Cloud Enterprise @ Cisco
Elastic Cloud Enterprise @ Cisco
 
Logging using ELK Stack for Microservices
Logging using ELK Stack for MicroservicesLogging using ELK Stack for Microservices
Logging using ELK Stack for Microservices
 
Leveraging Operational Data in the Cloud
Leveraging Operational Data in the CloudLeveraging Operational Data in the Cloud
Leveraging Operational Data in the Cloud
 
Elastic APM : développez vos logs et vos indicateurs pour obtenir une vue com...
Elastic APM : développez vos logs et vos indicateurs pour obtenir une vue com...Elastic APM : développez vos logs et vos indicateurs pour obtenir une vue com...
Elastic APM : développez vos logs et vos indicateurs pour obtenir une vue com...
 
Security Events Logging at Bell with the Elastic Stack
Security Events Logging at Bell with the Elastic StackSecurity Events Logging at Bell with the Elastic Stack
Security Events Logging at Bell with the Elastic Stack
 

Similar to CloudHealth: A Model-Driven Approach to Watch the Health of Cloud Services

Integration in the Cloud, by Rob Davies
Integration in the Cloud, by Rob DaviesIntegration in the Cloud, by Rob Davies
Integration in the Cloud, by Rob Davies
Judy Breedlove
 
ThroughTheLookingGlass_EffectiveObservability.pptx
ThroughTheLookingGlass_EffectiveObservability.pptxThroughTheLookingGlass_EffectiveObservability.pptx
ThroughTheLookingGlass_EffectiveObservability.pptx
Grace Jansen
 
Analyzing the World's Largest Security Data Lake!
Analyzing the World's Largest Security Data Lake!Analyzing the World's Largest Security Data Lake!
Analyzing the World's Largest Security Data Lake!
DataWorks Summit
 
Monitoring & Securing Microservices in Kubernetes
Monitoring & Securing Microservices in KubernetesMonitoring & Securing Microservices in Kubernetes
Monitoring & Securing Microservices in Kubernetes
Michael Ducy
 
Simplify Your Way To Expert Kubernetes Management
Simplify Your Way To Expert Kubernetes ManagementSimplify Your Way To Expert Kubernetes Management
Simplify Your Way To Expert Kubernetes Management
DevOps.com
 
Modernizing Testing as Apps Re-Architect
Modernizing Testing as Apps Re-ArchitectModernizing Testing as Apps Re-Architect
Modernizing Testing as Apps Re-Architect
DevOps.com
 
Monitoring Containerized Micro-Services In Azure
Monitoring Containerized Micro-Services In AzureMonitoring Containerized Micro-Services In Azure
Monitoring Containerized Micro-Services In Azure
Alex Bulankou
 
Azure meetup cloud native concepts - may 28th 2018
Azure meetup   cloud native concepts - may 28th 2018Azure meetup   cloud native concepts - may 28th 2018
Azure meetup cloud native concepts - may 28th 2018
Jim Bugwadia
 
ADDO Open Source Observability Tools
ADDO Open Source Observability Tools ADDO Open Source Observability Tools
ADDO Open Source Observability Tools
Mickey Boxell
 
Avanttic tech dates - de la monitorización a la 'observabilidad'
Avanttic tech dates - de la monitorización a la 'observabilidad'Avanttic tech dates - de la monitorización a la 'observabilidad'
Avanttic tech dates - de la monitorización a la 'observabilidad'
avanttic Consultoría Tecnológica
 
DevOps in the Cloud with Microsoft Azure
DevOps in the Cloud with Microsoft AzureDevOps in the Cloud with Microsoft Azure
DevOps in the Cloud with Microsoft Azure
gjuljo
 
Regulated Reactive - Security Considerations for Building Reactive Systems in...
Regulated Reactive - Security Considerations for Building Reactive Systems in...Regulated Reactive - Security Considerations for Building Reactive Systems in...
Regulated Reactive - Security Considerations for Building Reactive Systems in...
Ryan Hodgin
 
Containers as Infrastructure for New Gen Apps
Containers as Infrastructure for New Gen AppsContainers as Infrastructure for New Gen Apps
Containers as Infrastructure for New Gen Apps
Khalid Ahmed
 
Cloud-native Data
Cloud-native DataCloud-native Data
Cloud-native Data
cornelia davis
 
Cloud-Native-Data with Cornelia Davis
Cloud-Native-Data with Cornelia DavisCloud-Native-Data with Cornelia Davis
Cloud-Native-Data with Cornelia Davis
VMware Tanzu
 
IBM Monitoring and Event Management Solutions
IBM Monitoring and Event Management SolutionsIBM Monitoring and Event Management Solutions
IBM Monitoring and Event Management Solutions
IBM Danmark
 
From Containerized Application to Secure and Scaling With Kubernetes
From Containerized Application to Secure and Scaling With KubernetesFrom Containerized Application to Secure and Scaling With Kubernetes
From Containerized Application to Secure and Scaling With Kubernetes
Shikha Srivastava
 
Business and IT agility through DevOps and microservice architecture powered ...
Business and IT agility through DevOps and microservice architecture powered ...Business and IT agility through DevOps and microservice architecture powered ...
Business and IT agility through DevOps and microservice architecture powered ...
Lucas Jellema
 
Keynote : évolution et vision d'Elastic Observability
Keynote : évolution et vision d'Elastic ObservabilityKeynote : évolution et vision d'Elastic Observability
Keynote : évolution et vision d'Elastic Observability
Elasticsearch
 
GOTOpia 2020: "The Past, Present, and Future of Cloud Native API Gateways"
GOTOpia 2020: "The Past, Present, and Future of Cloud Native API Gateways"GOTOpia 2020: "The Past, Present, and Future of Cloud Native API Gateways"
GOTOpia 2020: "The Past, Present, and Future of Cloud Native API Gateways"
Daniel Bryant
 

Similar to CloudHealth: A Model-Driven Approach to Watch the Health of Cloud Services (20)

Integration in the Cloud, by Rob Davies
Integration in the Cloud, by Rob DaviesIntegration in the Cloud, by Rob Davies
Integration in the Cloud, by Rob Davies
 
ThroughTheLookingGlass_EffectiveObservability.pptx
ThroughTheLookingGlass_EffectiveObservability.pptxThroughTheLookingGlass_EffectiveObservability.pptx
ThroughTheLookingGlass_EffectiveObservability.pptx
 
Analyzing the World's Largest Security Data Lake!
Analyzing the World's Largest Security Data Lake!Analyzing the World's Largest Security Data Lake!
Analyzing the World's Largest Security Data Lake!
 
Monitoring & Securing Microservices in Kubernetes
Monitoring & Securing Microservices in KubernetesMonitoring & Securing Microservices in Kubernetes
Monitoring & Securing Microservices in Kubernetes
 
Simplify Your Way To Expert Kubernetes Management
Simplify Your Way To Expert Kubernetes ManagementSimplify Your Way To Expert Kubernetes Management
Simplify Your Way To Expert Kubernetes Management
 
Modernizing Testing as Apps Re-Architect
Modernizing Testing as Apps Re-ArchitectModernizing Testing as Apps Re-Architect
Modernizing Testing as Apps Re-Architect
 
Monitoring Containerized Micro-Services In Azure
Monitoring Containerized Micro-Services In AzureMonitoring Containerized Micro-Services In Azure
Monitoring Containerized Micro-Services In Azure
 
Azure meetup cloud native concepts - may 28th 2018
Azure meetup   cloud native concepts - may 28th 2018Azure meetup   cloud native concepts - may 28th 2018
Azure meetup cloud native concepts - may 28th 2018
 
ADDO Open Source Observability Tools
ADDO Open Source Observability Tools ADDO Open Source Observability Tools
ADDO Open Source Observability Tools
 
Avanttic tech dates - de la monitorización a la 'observabilidad'
Avanttic tech dates - de la monitorización a la 'observabilidad'Avanttic tech dates - de la monitorización a la 'observabilidad'
Avanttic tech dates - de la monitorización a la 'observabilidad'
 
DevOps in the Cloud with Microsoft Azure
DevOps in the Cloud with Microsoft AzureDevOps in the Cloud with Microsoft Azure
DevOps in the Cloud with Microsoft Azure
 
Regulated Reactive - Security Considerations for Building Reactive Systems in...
Regulated Reactive - Security Considerations for Building Reactive Systems in...Regulated Reactive - Security Considerations for Building Reactive Systems in...
Regulated Reactive - Security Considerations for Building Reactive Systems in...
 
Containers as Infrastructure for New Gen Apps
Containers as Infrastructure for New Gen AppsContainers as Infrastructure for New Gen Apps
Containers as Infrastructure for New Gen Apps
 
Cloud-native Data
Cloud-native DataCloud-native Data
Cloud-native Data
 
Cloud-Native-Data with Cornelia Davis
Cloud-Native-Data with Cornelia DavisCloud-Native-Data with Cornelia Davis
Cloud-Native-Data with Cornelia Davis
 
IBM Monitoring and Event Management Solutions
IBM Monitoring and Event Management SolutionsIBM Monitoring and Event Management Solutions
IBM Monitoring and Event Management Solutions
 
From Containerized Application to Secure and Scaling With Kubernetes
From Containerized Application to Secure and Scaling With KubernetesFrom Containerized Application to Secure and Scaling With Kubernetes
From Containerized Application to Secure and Scaling With Kubernetes
 
Business and IT agility through DevOps and microservice architecture powered ...
Business and IT agility through DevOps and microservice architecture powered ...Business and IT agility through DevOps and microservice architecture powered ...
Business and IT agility through DevOps and microservice architecture powered ...
 
Keynote : évolution et vision d'Elastic Observability
Keynote : évolution et vision d'Elastic ObservabilityKeynote : évolution et vision d'Elastic Observability
Keynote : évolution et vision d'Elastic Observability
 
GOTOpia 2020: "The Past, Present, and Future of Cloud Native API Gateways"
GOTOpia 2020: "The Past, Present, and Future of Cloud Native API Gateways"GOTOpia 2020: "The Past, Present, and Future of Cloud Native API Gateways"
GOTOpia 2020: "The Past, Present, and Future of Cloud Native API Gateways"
 

Recently uploaded

Developing Distributed High-performance Computing Capabilities of an Open Sci...
Developing Distributed High-performance Computing Capabilities of an Open Sci...Developing Distributed High-performance Computing Capabilities of an Open Sci...
Developing Distributed High-performance Computing Capabilities of an Open Sci...
Globus
 
Gamify Your Mind; The Secret Sauce to Delivering Success, Continuously Improv...
Gamify Your Mind; The Secret Sauce to Delivering Success, Continuously Improv...Gamify Your Mind; The Secret Sauce to Delivering Success, Continuously Improv...
Gamify Your Mind; The Secret Sauce to Delivering Success, Continuously Improv...
Shahin Sheidaei
 
Dominate Social Media with TubeTrivia AI’s Addictive Quiz Videos.pdf
Dominate Social Media with TubeTrivia AI’s Addictive Quiz Videos.pdfDominate Social Media with TubeTrivia AI’s Addictive Quiz Videos.pdf
Dominate Social Media with TubeTrivia AI’s Addictive Quiz Videos.pdf
AMB-Review
 
Enhancing Research Orchestration Capabilities at ORNL.pdf
Enhancing Research Orchestration Capabilities at ORNL.pdfEnhancing Research Orchestration Capabilities at ORNL.pdf
Enhancing Research Orchestration Capabilities at ORNL.pdf
Globus
 
GraphSummit Paris - The art of the possible with Graph Technology
GraphSummit Paris - The art of the possible with Graph TechnologyGraphSummit Paris - The art of the possible with Graph Technology
GraphSummit Paris - The art of the possible with Graph Technology
Neo4j
 
Pro Unity Game Development with C-sharp Book
Pro Unity Game Development with C-sharp BookPro Unity Game Development with C-sharp Book
Pro Unity Game Development with C-sharp Book
abdulrafaychaudhry
 
Quarkus Hidden and Forbidden Extensions
Quarkus Hidden and Forbidden ExtensionsQuarkus Hidden and Forbidden Extensions
Quarkus Hidden and Forbidden Extensions
Max Andersen
 
Lecture 1 Introduction to games development
Lecture 1 Introduction to games developmentLecture 1 Introduction to games development
Lecture 1 Introduction to games development
abdulrafaychaudhry
 
Graphic Design Crash Course for beginners
Graphic Design Crash Course for beginnersGraphic Design Crash Course for beginners
Graphic Design Crash Course for beginners
e20449
 
Globus Connect Server Deep Dive - GlobusWorld 2024
Globus Connect Server Deep Dive - GlobusWorld 2024Globus Connect Server Deep Dive - GlobusWorld 2024
Globus Connect Server Deep Dive - GlobusWorld 2024
Globus
 
Vitthal Shirke Microservices Resume Montevideo
Vitthal Shirke Microservices Resume MontevideoVitthal Shirke Microservices Resume Montevideo
Vitthal Shirke Microservices Resume Montevideo
Vitthal Shirke
 
Orion Context Broker introduction 20240604
Orion Context Broker introduction 20240604Orion Context Broker introduction 20240604
Orion Context Broker introduction 20240604
Fermin Galan
 
Text-Summarization-of-Breaking-News-Using-Fine-tuning-BART-Model.pptx
Text-Summarization-of-Breaking-News-Using-Fine-tuning-BART-Model.pptxText-Summarization-of-Breaking-News-Using-Fine-tuning-BART-Model.pptx
Text-Summarization-of-Breaking-News-Using-Fine-tuning-BART-Model.pptx
ShamsuddeenMuhammadA
 
Top 7 Unique WhatsApp API Benefits | Saudi Arabia
Top 7 Unique WhatsApp API Benefits | Saudi ArabiaTop 7 Unique WhatsApp API Benefits | Saudi Arabia
Top 7 Unique WhatsApp API Benefits | Saudi Arabia
Yara Milbes
 
Field Employee Tracking System| MiTrack App| Best Employee Tracking Solution|...
Field Employee Tracking System| MiTrack App| Best Employee Tracking Solution|...Field Employee Tracking System| MiTrack App| Best Employee Tracking Solution|...
Field Employee Tracking System| MiTrack App| Best Employee Tracking Solution|...
informapgpstrackings
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
Safe Software
 
Large Language Models and the End of Programming
Large Language Models and the End of ProgrammingLarge Language Models and the End of Programming
Large Language Models and the End of Programming
Matt Welsh
 
APIs for Browser Automation (MoT Meetup 2024)
APIs for Browser Automation (MoT Meetup 2024)APIs for Browser Automation (MoT Meetup 2024)
APIs for Browser Automation (MoT Meetup 2024)
Boni García
 
Enterprise Software Development with No Code Solutions.pptx
Enterprise Software Development with No Code Solutions.pptxEnterprise Software Development with No Code Solutions.pptx
Enterprise Software Development with No Code Solutions.pptx
QuickwayInfoSystems3
 
Exploring Innovations in Data Repository Solutions - Insights from the U.S. G...
Exploring Innovations in Data Repository Solutions - Insights from the U.S. G...Exploring Innovations in Data Repository Solutions - Insights from the U.S. G...
Exploring Innovations in Data Repository Solutions - Insights from the U.S. G...
Globus
 

Recently uploaded (20)

Developing Distributed High-performance Computing Capabilities of an Open Sci...
Developing Distributed High-performance Computing Capabilities of an Open Sci...Developing Distributed High-performance Computing Capabilities of an Open Sci...
Developing Distributed High-performance Computing Capabilities of an Open Sci...
 
Gamify Your Mind; The Secret Sauce to Delivering Success, Continuously Improv...
Gamify Your Mind; The Secret Sauce to Delivering Success, Continuously Improv...Gamify Your Mind; The Secret Sauce to Delivering Success, Continuously Improv...
Gamify Your Mind; The Secret Sauce to Delivering Success, Continuously Improv...
 
Dominate Social Media with TubeTrivia AI’s Addictive Quiz Videos.pdf
Dominate Social Media with TubeTrivia AI’s Addictive Quiz Videos.pdfDominate Social Media with TubeTrivia AI’s Addictive Quiz Videos.pdf
Dominate Social Media with TubeTrivia AI’s Addictive Quiz Videos.pdf
 
Enhancing Research Orchestration Capabilities at ORNL.pdf
Enhancing Research Orchestration Capabilities at ORNL.pdfEnhancing Research Orchestration Capabilities at ORNL.pdf
Enhancing Research Orchestration Capabilities at ORNL.pdf
 
GraphSummit Paris - The art of the possible with Graph Technology
GraphSummit Paris - The art of the possible with Graph TechnologyGraphSummit Paris - The art of the possible with Graph Technology
GraphSummit Paris - The art of the possible with Graph Technology
 
Pro Unity Game Development with C-sharp Book
Pro Unity Game Development with C-sharp BookPro Unity Game Development with C-sharp Book
Pro Unity Game Development with C-sharp Book
 
Quarkus Hidden and Forbidden Extensions
Quarkus Hidden and Forbidden ExtensionsQuarkus Hidden and Forbidden Extensions
Quarkus Hidden and Forbidden Extensions
 
Lecture 1 Introduction to games development
Lecture 1 Introduction to games developmentLecture 1 Introduction to games development
Lecture 1 Introduction to games development
 
Graphic Design Crash Course for beginners
Graphic Design Crash Course for beginnersGraphic Design Crash Course for beginners
Graphic Design Crash Course for beginners
 
Globus Connect Server Deep Dive - GlobusWorld 2024
Globus Connect Server Deep Dive - GlobusWorld 2024Globus Connect Server Deep Dive - GlobusWorld 2024
Globus Connect Server Deep Dive - GlobusWorld 2024
 
Vitthal Shirke Microservices Resume Montevideo
Vitthal Shirke Microservices Resume MontevideoVitthal Shirke Microservices Resume Montevideo
Vitthal Shirke Microservices Resume Montevideo
 
Orion Context Broker introduction 20240604
Orion Context Broker introduction 20240604Orion Context Broker introduction 20240604
Orion Context Broker introduction 20240604
 
Text-Summarization-of-Breaking-News-Using-Fine-tuning-BART-Model.pptx
Text-Summarization-of-Breaking-News-Using-Fine-tuning-BART-Model.pptxText-Summarization-of-Breaking-News-Using-Fine-tuning-BART-Model.pptx
Text-Summarization-of-Breaking-News-Using-Fine-tuning-BART-Model.pptx
 
Top 7 Unique WhatsApp API Benefits | Saudi Arabia
Top 7 Unique WhatsApp API Benefits | Saudi ArabiaTop 7 Unique WhatsApp API Benefits | Saudi Arabia
Top 7 Unique WhatsApp API Benefits | Saudi Arabia
 
Field Employee Tracking System| MiTrack App| Best Employee Tracking Solution|...
Field Employee Tracking System| MiTrack App| Best Employee Tracking Solution|...Field Employee Tracking System| MiTrack App| Best Employee Tracking Solution|...
Field Employee Tracking System| MiTrack App| Best Employee Tracking Solution|...
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
 
Large Language Models and the End of Programming
Large Language Models and the End of ProgrammingLarge Language Models and the End of Programming
Large Language Models and the End of Programming
 
APIs for Browser Automation (MoT Meetup 2024)
APIs for Browser Automation (MoT Meetup 2024)APIs for Browser Automation (MoT Meetup 2024)
APIs for Browser Automation (MoT Meetup 2024)
 
Enterprise Software Development with No Code Solutions.pptx
Enterprise Software Development with No Code Solutions.pptxEnterprise Software Development with No Code Solutions.pptx
Enterprise Software Development with No Code Solutions.pptx
 
Exploring Innovations in Data Repository Solutions - Insights from the U.S. G...
Exploring Innovations in Data Repository Solutions - Insights from the U.S. G...Exploring Innovations in Data Repository Solutions - Insights from the U.S. G...
Exploring Innovations in Data Repository Solutions - Insights from the U.S. G...
 

CloudHealth: A Model-Driven Approach to Watch the Health of Cloud Services

  • 1. CloudHealth A Model-Driven Approach to Watch the Health of Cloud Services by Anas Shatnawi, Ph.D. University of Milan-Bicocca Milan, Italy Matteo Orru, Ph.D. Marco Mobilio, Ph.D. Prof. Leonardo Mariani, Ph.D.Oliviero Riganelli, Ph.D. CloudHealth International Workshop on Software Health 2018
  • 2. C loudHealth Problem and Motivation Primitive KPIs: • CPU utilization • Mem consumption • Network • … Ship Monitoring goals: • Performance • Efficiency • Reliability • … How to map primitive KPIs into monitoring goals Monitoring goals: • Availability • Adaptability • Reliability • … Different operators with different interest, thus different monitoring goals
  • 3. C loudHealth The Main Idea of CloudHealth Primitive KPIs: • CPU utilization • Mem consumption • Network • … Ship Monitoring goals: • Performance • Efficiency • Reliability • … From monitoring goals to probes to collect primitive KPIs to be collected From primitive KPIs to dashboard based on monitoring goals Model Reliability Efficiency Performance … … CPU Memory Network
  • 4. C loudHealth CloudHealth Monitoring Model (CHMM) ISO/IEC 25010:2011 and 25011:2017 • IT service quality • Product quality • Quality in use model • Quality attributes that could be adapted for cloud services • Definition offered by the ISO standards ReliabilityResponsivenessAdaptabilityEffectivenessEfficiencyCompatibilityPerformance Continuity Recoverability Availability Reactiveness Timeliness Customizability Initiative Accuracy Completeness Positive resource consumption Nigative resource consumption Co-existence Timebehaviour Resource utilization Capacity Timebetweenfailures Timetorecovera failure Consistencyofreplicas Numberoffailures Durationoffailures Waitingtimetoaccept requests Waitingtimetoreceive response Scalabilityofresources Timetoperform adaptation Successfullyprovided services Faultyservices Resourceconsumption forsuccessfully providedservices Resourceconsumption forfaultyprovided services Resourceconflict Responsetime Latency Throughput Resourcesconsumed perservice Sizeofworkload comparetoavailable infrastructure Quality models Monitoring goals Monitoring subgoals Cloud service proprieties Probes to collect KPIs Level of consistency and stability of service outcomes. Level to which a system can restore services (including functions and data) and making them available, in the event of interruptions, failures or disasters.
  • 5. C loudHealth The Process of CloudHealth Selecting monitoring goals Cloud operators Configure Deploy Operate CHMM Monitoring goals
  • 6. C loudHealth Selecting Monitoring Goals and Services ReliabilityResponsivenessAdaptabilityEffectivenessEfficiencyCompatibilityPerformance Continuity Recoverability Availability Reactiveness Timeliness Customizability Initiative Accuracy Completeness Positive resource consumption Nigative resource consumption Co-existence Timebehaviour Resource utilization Capacity Timebetweenfailures Timetorecovera failure Consistencyof replicas Numberoffailures Durationoffailures Waitingtimeto acceptrequests Waitingtimeto receiveresponse Scalabilityof resources Timetoperform adaptation Successfullyprovided services Faultyservices Resource consumptionfor successfullyprovided services Resource consumptionfor faultyprovided services Resourceconflict Responsetime Latency Throughput Resourcesconsumed perservice Sizeofworkload comparetoavailable infrastructure CHMM
  • 7. C loudHealth The Process of CloudHealth Selecting monitoring goals Mapping monitoring goals to Probes Cloud operators Monitoring goals Probes Configure Deploy Operate CHMM
  • 8. C loudHealth Mapping Monitoring Goals to Probes ReliabilityResponsivenessAdaptabilityEffectivenessEfficiencyCompatibilityPerformance Continuity Recoverability Availability Reactiveness Timeliness Customizability Initiative Accuracy Completeness Positive resource consumption Nigative resource consumption Co-existence Timebehaviour Resource utilization Capacity Timebetweenfailures Timetorecovera failure Consistencyof replicas Numberoffailures Durationoffailures Waitingtimeto acceptrequests Waitingtimeto receiveresponse Scalabilityof resources Timetoperform adaptation Successfullyprovided services Faultyservices Resource consumptionfor successfullyprovided services Resource consumptionfor faultyprovided services Resourceconflict Responsetime Latency Throughput Resourcesconsumed perservice Sizeofworkload comparetoavailable infrastructure CHMM
  • 9. C loudHealth The Process of CloudHealth Selecting monitoring goals Mapping monitoring goals to Probes Cloud operators Monitoring goals Attaching probes to services Probes Cloud services Probes attached to services Configure Deploy Operate Probes catalog Cloud architecture CHMM
  • 10. C loudHealth Probing Approaches Target service Ship Service host Database host Another service Target service API Query KPIs ShipKPIs Service host Probe host Database host 1) Inside the host 2) Outside the host Rest or SNMP APIs
  • 11. C loudHealth ELK Stack (Elasticsearch, Logstash, Kibana) Service 1 Service 2 Service N Ship Logstash Elasticsearch Parse, filter and transform Kibana Query KPIs • Pulling data from various sources • Execute different transformations • Exports the data to various targets • Indexes and stores the information in a NoSQL database • Visualization layer that works on top of Elasticsearch KPIs
  • 12. C loudHealth ELK Probes • Lightweight agents • Monitoring service and system levels • Deployable on Linux, Windows, and Mac hosts • Connectable to ELK stack Metricbeat Filebeat Packetbeat
  • 13. C loudHealth Metricbeat Host VM Apache VM VM KPIs Apache-specific KPIs Host KPIs KPIs: - CPU usage - Memory - File system - Disk IO - Network IO If some of your services are not, build your own Metricbeat module  Apache Module configuration
  • 14. C loudHealth The Process of CloudHealth Selecting monitoring goals Mapping monitoring goals to Probes Cloud operators Monitoring goals Attaching probes to services Probes Cloud services Probes attached to services Customizing the dashboard Dashboard Configure Deploy Operate Probes catalog Cloud architecture CHMMCHMM
  • 15. C loudHealth Customizing the Dashboard Kibana Kibana APIs Vega APIs Monitoring goals CHMM Elasticsearch Query KPIs KPIs Visualize Calculating monitoring goals values from KPIs
  • 16. C loudHealth Related Work • There are not approaches that use models to automatically configure, deploy and operate a complete monitoring solution • Existing work concentrates on some aspects of this process only
  • 17. C loudHealth Related Work • There are not approaches that use models to automatically configure, deploy and operate a complete monitoring solution • Existing work concentrates on some aspects of this process only