OpenStack reliability metrics

•

2 likes•313 views

Ilya Shakhat

How to measure performance impact of different faults in OpenStack

Engineering

Reliability
Metrics
fault-injection and performance
impact analysis

User story
As performance engineer, I would like to measure
how different faults impact availability and
performance of OpenStack services.
Examples:
● What is the impact of Keystone restart
to Nova API operations?
● How loss of one of RabbitMQ servers
affects VM instances creation time?

Hypothesis
A particular failure may cause errors and/or
performance degradation
Measurements:
● Service downtime (seconds)
● MTTR (seconds)
● Absolute performance degradation
(seconds)
● Relative performance degradation
(ratio)

Implementation
1. Rally hooks
an entry-point to call plugins at specified
points of scenario execution
2. OS-Faults lib
fault-injection library
3. Stats processing
results visualization and report
generation

Rally hooks
● Hook is a new type of plugins.
● Hooks can be called at specific point of
scenario execution.
● Available in Rally 0.7.0

os-faults ● Generalized fault injection library
● DevStack, Fuel, libvirt and IPMI drivers
are already in
● Rally hook plugin is on review
https://review.openstack.org/384483
Simplified API:
● restart rabbitmq service
● reboot one node with mysql service

Stats processing ● Time-based vs iteration-based in Rally -
accurate look on service state
● Anomaly analysis - highlight areas where
performance differs

Reports http://docs.openstack.org/developer/
performance-docs/test_results/reliability/
version_2/index.html

What's hot

Nova Updates - Kilo EditionOpenStack Foundation

Using Rally for OpenStack certification at ScaleBoris Pavlovic

How OpenStack is Built - Anton Weiss - OpenStack Day Israel 2016Cloud Native Day Tel Aviv

Operator development made easy with helmConSol Consulting & Solutions Software GmbH

OpenNebula Conf 2014 | OpenNebula as Open Replacement of vCloud by Javier FontanNETWAYS

stackconf 2021 | How we finally migrated an eCommerce-Platform to GCPNETWAYS

KubeOne loodse

Kubermatic How to Migrate 100 Clusters from On-Prem to Google Cloud Without D...Tobias Schneck

Monitoring with prometheus at scaleJuraj Hantak

Serverless Workflow: New approach to Kubernetes service orchestration | DevNa...Red Hat Developers

OpenContrail ImplementationsJakub Pavlik

CI, CD, CT, Deploy, IaaS, DevOps, StageArtur Basak

Orchestrating VM & Container DeploymentsLars Wander

Documentation Updates - Kilo EditionOpenStack Foundation

The Kubernetes Operator Pattern - ContainerConf Nov 2017Jakob Karalus

Cncf k8s_network_02Erhwen Kuo

Cloud Native APIs: The API Operator for KubernetesWSO2

Remote debugging of Application in KubernetesConSol Consulting & Solutions Software GmbH

Data Engineer's Lunch #47: Airflow on KubernetesAnant Corporation

What's hot (19)

Nova Updates - Kilo Edition

Using Rally for OpenStack certification at Scale

How OpenStack is Built - Anton Weiss - OpenStack Day Israel 2016

Operator development made easy with helm

OpenNebula Conf 2014 | OpenNebula as Open Replacement of vCloud by Javier Fontan

stackconf 2021 | How we finally migrated an eCommerce-Platform to GCP

KubeOne

Kubermatic How to Migrate 100 Clusters from On-Prem to Google Cloud Without D...

Monitoring with prometheus at scale

Serverless Workflow: New approach to Kubernetes service orchestration | DevNa...

OpenContrail Implementations

CI, CD, CT, Deploy, IaaS, DevOps, Stage

Orchestrating VM & Container Deployments

Documentation Updates - Kilo Edition

The Kubernetes Operator Pattern - ContainerConf Nov 2017

Cncf k8s_network_02

Cloud Native APIs: The API Operator for Kubernetes

Remote debugging of Application in Kubernetes

Data Engineer's Lunch #47: Airflow on Kubernetes

Similar to OpenStack reliability metrics

Developing Microservices using Spring - Beginner's GuideMohanraj Thirumoorthy

Openshift service broker and catalog ocp-meetup july 2018Michael Calizo

Openshift serverless SolutionRyan ZhangCheng

Spring boot microservice metrics monitoringOracle Korea

Spring Boot - Microservice Metrics MonitoringDonghuKIM2

Why use Gitlababenyeung1

Why and How to Run Your Own Gitlab Runners as Your Company GrowsNGINX, Inc.

How's relevant JMeter to me - DevConf (Letterkenny)Giulio Vian

What's new in confluent platform 5.4 online talkconfluent

Functioning incessantly of Data Science Platform with Kubeflow - Albert Lewan...GetInData

Session 41 - Struts 2 IntroductionPawanMM

MuleSoft Sizing Guidelines - VirtualMuleysAngel Alberici

Monitoring in Big Data Platform - Albert Lewandowski, GetInDataGetInData

Ultimate Guide to Microservice Architecture on Kuberneteskloia

What's new in OpenStack LibertyMichael Solberg

Kirill Rozin - Practical Wars for AutomatizationSergey Arkhipov

Cloud APIs Overview TuckerInfrastructure 2.0

Cloud-Native Progressive DeliveryMatt Turner

Struts 2 - Introduction Hitesh-Java

PAC 2019 virtual Bruno Audoux Neotys

Similar to OpenStack reliability metrics (20)

Developing Microservices using Spring - Beginner's Guide

Openshift service broker and catalog ocp-meetup july 2018

Openshift serverless Solution

Spring boot microservice metrics monitoring

Spring Boot - Microservice Metrics Monitoring

Why use Gitlab

Why and How to Run Your Own Gitlab Runners as Your Company Grows

How's relevant JMeter to me - DevConf (Letterkenny)

What's new in confluent platform 5.4 online talk

Functioning incessantly of Data Science Platform with Kubeflow - Albert Lewan...

Session 41 - Struts 2 Introduction

MuleSoft Sizing Guidelines - VirtualMuleys

Monitoring in Big Data Platform - Albert Lewandowski, GetInData

Ultimate Guide to Microservice Architecture on Kubernetes

What's new in OpenStack Liberty

Kirill Rozin - Practical Wars for Automatization

Cloud APIs Overview Tucker

Cloud-Native Progressive Delivery

Struts 2 - Introduction

PAC 2019 virtual Bruno Audoux

Recently uploaded

KubeKraft presentation @CloudNativeHooghlysanyuktamishra911

Unit 1 - Soil Classification and Compaction.pdfRagavanV2

Generative AI or GenAI technology based PPTbhaskargani46

chapter 5.pptx: drainage and irrigation engineeringmulugeta48

Call Girls in Netaji Nagar, Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service9953056974 Low Rate Call Girls In Saket, Delhi NCR

XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXssuser89054b

Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak HamilCara Menggugurkan Kandungan 087776558899

Employee leave management system project.Kamal Acharya

Call Girls Wakad Call Me 7737669865 Budget Friendly No Advance Bookingroncy bisnoi

Introduction to Serverless with AWS LambdaOmar Fathy

University management System project report..pdfKamal Acharya

data_management_and _data_science_cheat_sheet.pdfJiananWang21

Minimum and Maximum Modes of microprocessor 8086anil_gaur

Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar ≼🔝 Delhi door step de...9953056974 Low Rate Call Girls In Saket, Delhi NCR

Hostel management system project report..pdfKamal Acharya

Water Industry Process Automation & Control Monthly - April 2024Water Industry Process Automation & Control

Double Revolving field theory-how the rotor develops torqueBhangaleSonal

Thermal Engineering Unit - I & II . pptDineshKumar4165

A Study of Urban Area Plan for Pabna MunicipalityMorshed Ahmed Rahath

ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdfKamal Acharya

Recently uploaded (20)

KubeKraft presentation @CloudNativeHooghly

Unit 1 - Soil Classification and Compaction.pdf

Generative AI or GenAI technology based PPT

chapter 5.pptx: drainage and irrigation engineering

Call Girls in Netaji Nagar, Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service

XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil

Employee leave management system project.

Call Girls Wakad Call Me 7737669865 Budget Friendly No Advance Booking

Introduction to Serverless with AWS Lambda

University management System project report..pdf

data_management_and _data_science_cheat_sheet.pdf

Minimum and Maximum Modes of microprocessor 8086

Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar ≼🔝 Delhi door step de...

Hostel management system project report..pdf

Water Industry Process Automation & Control Monthly - April 2024

Double Revolving field theory-how the rotor develops torque

Thermal Engineering Unit - I & II . ppt

A Study of Urban Area Plan for Pabna Municipality

ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf

OpenStack reliability metrics

1. Reliability Metrics fault-injection and performance impact analysis

2. User story As performance engineer, I would like to measure how different faults impact availability and performance of OpenStack services. Examples: ● What is the impact of Keystone restart to Nova API operations? ● How loss of one of RabbitMQ servers affects VM instances creation time?

3. Hypothesis A particular failure may cause errors and/or performance degradation Measurements: ● Service downtime (seconds) ● MTTR (seconds) ● Absolute performance degradation (seconds) ● Relative performance degradation (ratio)

4. Implementation 1. Rally hooks an entry-point to call plugins at specified points of scenario execution 2. OS-Faults lib fault-injection library 3. Stats processing results visualization and report generation

5. Rally hooks ● Hook is a new type of plugins. ● Hooks can be called at specific point of scenario execution. ● Available in Rally 0.7.0

6. os-faults ● Generalized fault injection library ● DevStack, Fuel, libvirt and IPMI drivers are already in ● Rally hook plugin is on review https://review.openstack.org/384483 Simplified API: ● restart rabbitmq service ● reboot one node with mysql service

7. Stats processing ● Time-based vs iteration-based in Rally - accurate look on service state ● Anomaly analysis - highlight areas where performance differs

8. Reports http://docs.openstack.org/developer/ performance-docs/test_results/reliability/ version_2/index.html

OpenStack reliability metrics

Recommended

Recommended

More Related Content

What's hot

What's hot (19)

Similar to OpenStack reliability metrics

Similar to OpenStack reliability metrics (20)

Recently uploaded

Recently uploaded (20)

OpenStack reliability metrics