SlideShare a Scribd company logo
SLALOM Webinar
George Kousiouris, Andreas Menychtas,
Dimosthenis Kyriazis
ICCS/NTUA
Outline
• SLALOM Project
• Our background
• Overview of contributions
• SLA specification / reference model
• Abstract metric function / definition
• Conclusions
2
SLALOM Project
• EU funded project started on January 2015
• Aims at developing a core SLA specification
– As a basis for service interactions between providers and customers
– Considering current SLA landscape and research outcomes from
various projects and initiatives
• Current status
– Analysis of SLA landscape, standardization efforts, relevant research
outcomes
– Development of SLA specification (v1) including main blocks and
components for each component
– Development of abstract metric / function (v1) applicable to different
metrics
– Submission of our work to ISO SLA WG for standardization
• SLALOM is officially accepted as liaison body to ISO SLA WG
• Attended and presented our work in the last ISO SLA WG meeting (Dublin)
3
Our Background
• European Commission Expert Group on SLAs,
http://ec.europa.eu/digital-agenda/en/news/cloud-computing-
service-level-agreements-exploitation-research-results
• Real time SLAs (IRMOS FP7 project, 2008-2011)
– Real Time Cloud for enabling performance guarantees on soft real-
time applications via SLAs
• Admission Control and Legal Aspects (OPTIMIS FP7 project, 2010-
2013)
– Risk, Eco-efficiency and Cost as parameters
– Data Location considerations, Contractual terms
• Abstracted Auditing/Monitoring of SLAs (ARTIST FP7 project, 2012-
2015)
– 3ALib abstracted library implementation
– CloudML@ARTIST definition of a UML profile for SLA descriptions
4
Outline
• SLALOM Project
• Our background
• Overview of contributions
• SLA specification / reference model
• Abstract metric function / definition
• Conclusions
5
SLA Specification Contribution (1/2)
• The proposed specification / reference model takes into
account
– Standardization approaches and working groups outcomes
– Current SLAs offered by commercial cloud providers
– Expressed views by cloud providers and adopters
– Research outcomes
• With respect to ISO SLA WG outcomes
– Follows ISO 19086-2: Core blocks (i.e. metric definition,
parameters definition, rule definition) and the corresponding
elements (e.g. ID, name, unit, scale, etc) of the SLA
– Follows ISO 3534-2: Metric in different scales (such as interval,
ratio, nominal or ordinal)
6
SLA Specification Contribution (2/2)
• With respect to ISO WG outcomes (cont.)
– Suggests changes to ISO 19086-2: Naming of specific
elements (e.g. referenceId is used both for metricId
for parameterId and for ruleId, while SLALOM
proposes the use of different identifiers)
– Suggests changes to ISO 19086-2: Inclusion of
additional elements and blocks:
• Dependencies with other metrics - e.g. availability of storage
service and dependency to latency or response time and
dependency to bandwidth
• Importance of a metric (i.e. gradeOfImportance)
7
SLA Specification / Reference model:
Main blocks
• Follows ISO 19086-2 SLA specification
• Metric
– Corresponds to the service metric / objective (e.g. availability)
• Parameter
– Links the metric with a parameter to express how it can be
monitored and validated (e.g. time to provide resources
following an elasticity trigger)
• Rule
– Refers to metric “constraints” (e.g. number of concurrent
connections for a number of users metric)
• SLALOM proposed the addition of the Dependency block
– Captures dependencies between metrics (e.g. response time
and bandwidth)
8
Components
• Specific components are used for all building blocks
– ID
– Name
– Definition / Expression
– Unit
– Notes
• SLALOM proposed the following additions to the ISO
specification
– gradeOfImportance component for the Metric definition, to
define the metrics importance within an SLA
– consequenceOfViolation component for the Rule definition, to
define the potential consequence of violation on the service
provisioning
9
Outline
• SLALOM Project
• Our background
• Overview of contributions
• SLA specification / reference model
• Abstract metric function / definition
• Conclusions
10
Abstract metric function / definition primary goal
• Have a standard that forces ambiguities to be clarified
• Help in the measurement/auditing process of an SLA
– Especially by 3rd party providers
– What is the purpose of having SLAs if one is not able to measure them
non-repudiably?
• Abstract the process
11
Example of ambiguity in the measurement process: AWS EC2 SLA
• ““Unavailable” and “Unavailability” mean:
– For Amazon EC2, when all of your running instances have no
external connectivity.”
• Determination of external connectivity. How?
– E.g. pinging (ICMP)?
• Security threat
– Application layer (endpoint checking)?
• Includes application downtime (Not the responsibility of AWS EC2)
12
SLALOM 3-Layer Definition
Metric (Ratio) Layer
Final ratio calculation of the defined metric Correlates individual periods to the overall metric
Period (Time) Layer
Size of period, limit of base period and on the error
rate needed
Aggregates samples at a time interval granularity,
deciding on this level
Sample (Measurement) layer
Boolean expression based on concrete measurement
constraints
Dictates success or not of a sample
13
Sample Layer
• Sample Condition - sc: the condition stating whether a sample has been successful.
– operator: the operator can either be a boolean one (i.e. AND, OR, NOT) or a comparison
operator (<, >, <=, >=, ==, !=).
– value: the actual value of the condition that can be arithmetic, non-arithmetic (e.g. a
string such as “exception”) or an enumeration (e.g. HTTP response code == 200).
– unit: the unit for the value of the condition.
• Sample - s: the sample used to evaluate a parameter against the condition sc.
• Successful Sample - ss: the sample satisfying the condition sc.
• Unsuccessful Sample - us: the sample not satisfying the condition sc.
• Type of Operation field: defined sampling process e.g. pair of protocol- response
Sample definition
For a given type of operation as specified in the corresponding field (described
previously)
sc = operator + value + unit
ss = s if (sc is true)
us = s if (sc is false)
14
Period Layer
• Boundary Period - bp: the period for which the analysis of a parameter (through samples) should be
taken into account. Any sample that is not meeting this criterion (i.e. falls within the period) is
excluded even though if it is successful (i.e. ss according to the sample definition).
– operator: a comparison operator (<, >, <=, >=, ==, !=).
– value: the actual arithmetic value of the condition.
– unit: the unit in this case is always a time unit (e.g. seconds, minutes, etc).
• Error Condition - ec: the error condition ratio for which the analysis of a parameter (through samples)
should be taken into account. The ratio is always expressed in a percentage (%) format.
– operator: a comparison operator (<, >, <=, >=, ==, !=).
– value: the actual arithmetic value of the condition.
• Error Ratio - er: the error ratio calculated based on the total set of samples and the successful
samples.
• Period - p: the period in which samples (sc and uc) are examined according to the boundary period
and the error condition.
• Valid Period - vp: the period for which the error ratio value meets the error condition ratio and the
boundary period condition is also satisfied.
• Non-valid Period - np: the period for which the error ratio value does not meet the error condition
ratio (the boundary period condition is satisfied). Boundary period and error definitions
bp = operator + value + unit
ec = operator + value + %
er = Σus/Σs ∀us ∈ p
vp = p if ((er<=ec) && (p>=bp))
np = p if ((er>=ec) && (p>=bp))
15
Metric Layer
• Metric Condition - mc: the condition regarding a specific metric. The
condition is always expressed in a percentage (Ratio Type?) (%) format to
enable its evaluation as proposed through the metric evaluation.
– operator: a comparison operator (<, >, <=, >=, ==, !=).
– value: the actual arithmetic value of the condition.
• Metric Evaluation - me: the evaluation of the metric based on the valid
and non-valid period samples. The evaluation should be smaller than the
condition (i.e. me <mc).
Abstract metric definition
mc = operator + value + %
me = Σnp / (Σvp+Σnp)
16
Mapping of EC2 SLA@SLALOM
Amazon EC2
Level / definition Expression Notes
Sample definition sc: UNDEFINED (assumed ‘ping’->
ICMP)
The sampling condition is not defined in the
Amazon EC2 SLA. The concrete wording is “when
all of your running instances have no external
connectivity”. Nonetheless, the way to specify /
measure “external connectivity” is not defined.
For example a customer could use a ping
operation or a custom monitoring mechanism.
Type of operation: ping Not defined how the condition of connectivity
can be actually measured (e.g. the ping
operation mentioned previously).
Boundary period
and error
definitions
bp > 60 sec The exact wording is “the percentage of
minutes”, thus the period is 60 seconds.
ec = 100% Error condition reflecting that the error ratio is
that for the entire bp the resource must be
continuously “unavailable”.
Abstract metric
definition
availability < 99.95 % Availability metric definition given the boundary
period and error condition.
17
Mapping of SLALOM @ GAE Datastore
Google AppEngine Datastore
Level / definition Expression Notes
Sample definition sc: INTERNAL_ERROR Several sampling conditions are
defined per type of operation. For
example it is specified (exact wording)
“INTERNAL_ERROR, TIMEOUT, …” for
API calls.
Type of operation: API calls Several type of operations are defined.
An example is provided here.
Boundary period
and error
definitions
bp > 300 sec The exact wording is “five consecutive
minutes”.
ec > 10% Error condition reflecting that the
error ratio is (exact wording) “ten
percent Error Rate”.
Abstract metric
definition
availability < 99.95 % Availability metric definition given the
boundary period and error condition.
18
Mapping of SLALOM @ Microsoft
Azure SLA
19
Microsoft Azure Storage
Level / definition Expression Notes
Sample definition sc = 60 sec Several sampling conditions are defined
per type of operation. For example it is
specified (exact wording) “Sixty (60)
seconds” for PutBlockList and
GetBlockList.
Type of operation: PutBlockList and
GetBlockList
Several type of operations are defined.
An example is provided here.
Boundary period
and error
definitions
bp > 3600 sec The exact wording is “given one-hour
interval”.
ec > 0% Error condition reflecting that all periods
should be taken into account for the
availability metric evaluation (exact
wording) “is the sum of Error Rates for
each hour”.
Abstract metric
definition
availability < 99.9 % Availability metric definition given the
boundary period and error condition.
Preconditions
• For any SLA to apply, a number of preconditions
typically exist per provider
• Examples
– Deployment: Number of Availability Zones used
– Deployment: Replication options used
– Usage/Measurement: Restarting of resources when
unavailable
– Usage/Measurement: Applied Throttling of requests
20
Outline
• SLALOM Project
• Our background
• Overview of contributions
• SLA specification / reference model
• Abstract metric function / definition
• Conclusions
21
Snapshot of SLA contributions
• An SLA specification / reference model (captured in
SLALOM SLA Specification and Reference Model v1:
http://slalom-project.eu/content/d32-%E2%80%93-sla-
specification-and-reference-model)
– Follows and proposes changes to ISO 19086-2
– Follows ISO 3534-2
• An abstract metric function / definition that can be
exploited to specify any SLA metric (captured in
SLALOM SLA Metric Specification v1: http://slalom-
project.eu/content/slalom-sla-specification-v1-sep-
2015)
– Submitted as contribution to ISO 19086-2
22
Feedback
• Please provide us feedback on the presented
concepts at:
• https://docs.google.com/forms/d/1Ljnc2x2WS
aAXrWzHglDyy31xdCNC3qDiQqhC1FoxDZI/vie
wform
23
Any questions?
24
www.slalom-project.eu
3ALib (Availability SLA Benchmark and Auditor)
• Abstracted Availability Auditor Library (Java-based) Implementation:
– Based on the conceptual abstractions of different providers SLAs
– Abstracted at the code level, for efficient replacement of processes
and inclusion of providers
• Purpose:
– Align monitoring with specific provider definitions and adapt
availability calculations
– Check preconditions of SLA applicability for a specific deployment and
give feedback
– Adapt to dynamic Cloud Services user behavior
25
Details for GAE Datastore SLA
26
Details for Azure SLA
Transaction Type Maximum Processing Time*
 PutBlob and GetBlob (includes blocks
and pages)
 Get Valid Page Blob Ranges
Two (2) seconds multiplied by the number of MBs
transferred in the course of processing the
request
 Copy Blob Ninety (90) seconds (where the source and
destination blobs are within the same storage
account)
 PutBlockList
 GetBlockList
Sixty (60) seconds
 Table Query
 List Operations
Ten (10) seconds (to complete processing or
return a continuation)
 Batch Table Operations Thirty (30) seconds
 All Single Entity Table Operations
 All other Blob and Message Operations
Two (2) seconds
27

More Related Content

Viewers also liked (8)

Fibre Optics Seminar Ise09
Fibre Optics Seminar Ise09Fibre Optics Seminar Ise09
Fibre Optics Seminar Ise09
 
Call Sheets for "Street Style"
Call Sheets for "Street Style"Call Sheets for "Street Style"
Call Sheets for "Street Style"
 
Location research
Location researchLocation research
Location research
 
Google I/O 2016: What to expect from Android N to virtual reality?
Google I/O 2016: What to expect from Android N to virtual reality?Google I/O 2016: What to expect from Android N to virtual reality?
Google I/O 2016: What to expect from Android N to virtual reality?
 
Service level management
Service level managementService level management
Service level management
 
Project tango
Project tangoProject tango
Project tango
 
Section A Revision: Ethnicity
Section A Revision: EthnicitySection A Revision: Ethnicity
Section A Revision: Ethnicity
 
TEDxTaipei-Animal Protect
TEDxTaipei-Animal Protect TEDxTaipei-Animal Protect
TEDxTaipei-Animal Protect
 

Similar to SLALOM Project Technical Webinar 20151111

Requirement verification & validation
Requirement verification & validationRequirement verification & validation
Requirement verification & validation
Abdul Basit
 
Process Capability.ppt related to quality
Process Capability.ppt related to qualityProcess Capability.ppt related to quality
Process Capability.ppt related to quality
nikhilyadav365577
 
VCS_QAPerformanceSlides
VCS_QAPerformanceSlidesVCS_QAPerformanceSlides
VCS_QAPerformanceSlides
Michael Cowan
 

Similar to SLALOM Project Technical Webinar 20151111 (20)

SLALOM Webinar Final Technical Outcomes Explanined "Using the SLALOM Technica...
SLALOM Webinar Final Technical Outcomes Explanined "Using the SLALOM Technica...SLALOM Webinar Final Technical Outcomes Explanined "Using the SLALOM Technica...
SLALOM Webinar Final Technical Outcomes Explanined "Using the SLALOM Technica...
 
Modeling and Testing Dovetail in MagicDraw
Modeling and Testing Dovetail in MagicDrawModeling and Testing Dovetail in MagicDraw
Modeling and Testing Dovetail in MagicDraw
 
Process capability relation between yield and number of parts in assembly und...
Process capability relation between yield and number of parts in assembly und...Process capability relation between yield and number of parts in assembly und...
Process capability relation between yield and number of parts in assembly und...
 
Elements of a Test Framework
Elements of a Test FrameworkElements of a Test Framework
Elements of a Test Framework
 
Agile testing approach
Agile testing approachAgile testing approach
Agile testing approach
 
John Kent - An Entity Model for Software Testing
John Kent - An Entity Model for Software TestingJohn Kent - An Entity Model for Software Testing
John Kent - An Entity Model for Software Testing
 
Agile Estimation for Fixed Price Model
Agile Estimation for Fixed Price ModelAgile Estimation for Fixed Price Model
Agile Estimation for Fixed Price Model
 
Requirement verification & validation
Requirement verification & validationRequirement verification & validation
Requirement verification & validation
 
Automating Speed: A Proven Approach to Preventing Performance Regressions in ...
Automating Speed: A Proven Approach to Preventing Performance Regressions in ...Automating Speed: A Proven Approach to Preventing Performance Regressions in ...
Automating Speed: A Proven Approach to Preventing Performance Regressions in ...
 
Process Capability.ppt related to quality
Process Capability.ppt related to qualityProcess Capability.ppt related to quality
Process Capability.ppt related to quality
 
Behavioral Analytics and Blockchain Applications – a Reliability View. Keynot...
Behavioral Analytics and Blockchain Applications – a Reliability View. Keynot...Behavioral Analytics and Blockchain Applications – a Reliability View. Keynot...
Behavioral Analytics and Blockchain Applications – a Reliability View. Keynot...
 
Performance Testing
Performance Testing Performance Testing
Performance Testing
 
Effect of Temporal Collaboration Network, Maintenance Activity, and Experienc...
Effect of Temporal Collaboration Network, Maintenance Activity, and Experienc...Effect of Temporal Collaboration Network, Maintenance Activity, and Experienc...
Effect of Temporal Collaboration Network, Maintenance Activity, and Experienc...
 
Model Driven Testing: requirements, models & test
Model Driven Testing: requirements, models & test Model Driven Testing: requirements, models & test
Model Driven Testing: requirements, models & test
 
Past Experiences and Future Challenges using Automatic Performance Modelling ...
Past Experiences and Future Challenges using Automatic Performance Modelling ...Past Experiences and Future Challenges using Automatic Performance Modelling ...
Past Experiences and Future Challenges using Automatic Performance Modelling ...
 
Verifying and Validating Requirements
Verifying and Validating RequirementsVerifying and Validating Requirements
Verifying and Validating Requirements
 
Role+Of+Testing+In+Sdlc
Role+Of+Testing+In+SdlcRole+Of+Testing+In+Sdlc
Role+Of+Testing+In+Sdlc
 
Revolutionise your Machine Learning Workflow using Scikit-Learn Pipelines
Revolutionise your Machine Learning Workflow using Scikit-Learn PipelinesRevolutionise your Machine Learning Workflow using Scikit-Learn Pipelines
Revolutionise your Machine Learning Workflow using Scikit-Learn Pipelines
 
The RuleML Perspective on Reaction Rule Standards
The RuleML Perspective on Reaction Rule StandardsThe RuleML Perspective on Reaction Rule Standards
The RuleML Perspective on Reaction Rule Standards
 
VCS_QAPerformanceSlides
VCS_QAPerformanceSlidesVCS_QAPerformanceSlides
VCS_QAPerformanceSlides
 

More from Oliver Barreto Rodríguez

More from Oliver Barreto Rodríguez (20)

H-HW&SW Alliance Presentation
H-HW&SW Alliance PresentationH-HW&SW Alliance Presentation
H-HW&SW Alliance Presentation
 
Ditas factsheet h2020 v1.1
Ditas factsheet h2020  v1.1Ditas factsheet h2020  v1.1
Ditas factsheet h2020 v1.1
 
Ditas Poster v1.1
Ditas  Poster v1.1Ditas  Poster v1.1
Ditas Poster v1.1
 
Ditas Flyer v2.1
Ditas  Flyer v2.1Ditas  Flyer v2.1
Ditas Flyer v2.1
 
Ditas Project Presentation v1.0
Ditas Project Presentation v1.0Ditas Project Presentation v1.0
Ditas Project Presentation v1.0
 
Heterogeneous Hardware & Software Alliance...
Heterogeneous Hardware & Software Alliance... Heterogeneous Hardware & Software Alliance...
Heterogeneous Hardware & Software Alliance...
 
SLALOM Webinar Final Legal Outcomes Explanined "Using the SLALOM Contract Ser...
SLALOM Webinar Final Legal Outcomes Explanined "Using the SLALOM Contract Ser...SLALOM Webinar Final Legal Outcomes Explanined "Using the SLALOM Contract Ser...
SLALOM Webinar Final Legal Outcomes Explanined "Using the SLALOM Contract Ser...
 
TANGO Project in a Nutshell Presentation
TANGO Project in a Nutshell PresentationTANGO Project in a Nutshell Presentation
TANGO Project in a Nutshell Presentation
 
TANGO Project Poster v1
TANGO Project Poster v1TANGO Project Poster v1
TANGO Project Poster v1
 
TANGO Project in a Nutshell Flyer
TANGO Project in a Nutshell FlyerTANGO Project in a Nutshell Flyer
TANGO Project in a Nutshell Flyer
 
TANGO Project Poster v2
TANGO Project Poster v2TANGO Project Poster v2
TANGO Project Poster v2
 
SLALOM Best Practice DOs & DON'Ts Guide on Cloud SLAs for Project Researchers
SLALOM Best Practice DOs & DON'Ts Guide on Cloud SLAs for Project ResearchersSLALOM Best Practice DOs & DON'Ts Guide on Cloud SLAs for Project Researchers
SLALOM Best Practice DOs & DON'Ts Guide on Cloud SLAs for Project Researchers
 
SLALOM Project Legal Webinar Introduction 20151019 Legal Aspects
SLALOM Project Legal Webinar Introduction 20151019 Legal AspectsSLALOM Project Legal Webinar Introduction 20151019 Legal Aspects
SLALOM Project Legal Webinar Introduction 20151019 Legal Aspects
 
SLALOM Project Legal Webinar Introduction 20151019 Introduction
SLALOM Project Legal Webinar Introduction 20151019 IntroductionSLALOM Project Legal Webinar Introduction 20151019 Introduction
SLALOM Project Legal Webinar Introduction 20151019 Introduction
 
MODAClouds Value - Solving Top Problems of Cloud Dev Lifecycle
MODAClouds Value - Solving Top Problems of Cloud Dev LifecycleMODAClouds Value - Solving Top Problems of Cloud Dev Lifecycle
MODAClouds Value - Solving Top Problems of Cloud Dev Lifecycle
 
Solving Top Cloud Problems MODAClouds Toolbox
Solving Top Cloud Problems MODAClouds ToolboxSolving Top Cloud Problems MODAClouds Toolbox
Solving Top Cloud Problems MODAClouds Toolbox
 
H2020 Research Projects Elevator Ptch
H2020 Research Projects Elevator PtchH2020 Research Projects Elevator Ptch
H2020 Research Projects Elevator Ptch
 
MODAClouds - Underpinning the Leap to DevOps Movement on Clouds scenarios
MODAClouds - Underpinning the Leap to DevOps Movement on Clouds scenariosMODAClouds - Underpinning the Leap to DevOps Movement on Clouds scenarios
MODAClouds - Underpinning the Leap to DevOps Movement on Clouds scenarios
 
Cloud Interoperability and Portability at Future Pre-FIA 2013 Multi-Clouds Wo...
Cloud Interoperability and Portability at Future Pre-FIA 2013 Multi-Clouds Wo...Cloud Interoperability and Portability at Future Pre-FIA 2013 Multi-Clouds Wo...
Cloud Interoperability and Portability at Future Pre-FIA 2013 Multi-Clouds Wo...
 
OPTIMIS in a Nutshell
OPTIMIS in a NutshellOPTIMIS in a Nutshell
OPTIMIS in a Nutshell
 

Recently uploaded

Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo DiehlFuture Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Peter Udo Diehl
 

Recently uploaded (20)

IoT Analytics Company Presentation May 2024
IoT Analytics Company Presentation May 2024IoT Analytics Company Presentation May 2024
IoT Analytics Company Presentation May 2024
 
Demystifying gRPC in .Net by John Staveley
Demystifying gRPC in .Net by John StaveleyDemystifying gRPC in .Net by John Staveley
Demystifying gRPC in .Net by John Staveley
 
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo DiehlFuture Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
 
Salesforce Adoption – Metrics, Methods, and Motivation, Antone Kom
Salesforce Adoption – Metrics, Methods, and Motivation, Antone KomSalesforce Adoption – Metrics, Methods, and Motivation, Antone Kom
Salesforce Adoption – Metrics, Methods, and Motivation, Antone Kom
 
Knowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and backKnowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and back
 
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
 
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
 
Free and Effective: Making Flows Publicly Accessible, Yumi Ibrahimzade
Free and Effective: Making Flows Publicly Accessible, Yumi IbrahimzadeFree and Effective: Making Flows Publicly Accessible, Yumi Ibrahimzade
Free and Effective: Making Flows Publicly Accessible, Yumi Ibrahimzade
 
Optimizing NoSQL Performance Through Observability
Optimizing NoSQL Performance Through ObservabilityOptimizing NoSQL Performance Through Observability
Optimizing NoSQL Performance Through Observability
 
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
 
IESVE for Early Stage Design and Planning
IESVE for Early Stage Design and PlanningIESVE for Early Stage Design and Planning
IESVE for Early Stage Design and Planning
 
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMsTo Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
 
Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...
 
Behind the Scenes From the Manager's Chair: Decoding the Secrets of Successfu...
Behind the Scenes From the Manager's Chair: Decoding the Secrets of Successfu...Behind the Scenes From the Manager's Chair: Decoding the Secrets of Successfu...
Behind the Scenes From the Manager's Chair: Decoding the Secrets of Successfu...
 
How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...
 
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
 
UiPath Test Automation using UiPath Test Suite series, part 1
UiPath Test Automation using UiPath Test Suite series, part 1UiPath Test Automation using UiPath Test Suite series, part 1
UiPath Test Automation using UiPath Test Suite series, part 1
 
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
 
Connector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a buttonConnector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a button
 
When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...
 

SLALOM Project Technical Webinar 20151111

  • 1. SLALOM Webinar George Kousiouris, Andreas Menychtas, Dimosthenis Kyriazis ICCS/NTUA
  • 2. Outline • SLALOM Project • Our background • Overview of contributions • SLA specification / reference model • Abstract metric function / definition • Conclusions 2
  • 3. SLALOM Project • EU funded project started on January 2015 • Aims at developing a core SLA specification – As a basis for service interactions between providers and customers – Considering current SLA landscape and research outcomes from various projects and initiatives • Current status – Analysis of SLA landscape, standardization efforts, relevant research outcomes – Development of SLA specification (v1) including main blocks and components for each component – Development of abstract metric / function (v1) applicable to different metrics – Submission of our work to ISO SLA WG for standardization • SLALOM is officially accepted as liaison body to ISO SLA WG • Attended and presented our work in the last ISO SLA WG meeting (Dublin) 3
  • 4. Our Background • European Commission Expert Group on SLAs, http://ec.europa.eu/digital-agenda/en/news/cloud-computing- service-level-agreements-exploitation-research-results • Real time SLAs (IRMOS FP7 project, 2008-2011) – Real Time Cloud for enabling performance guarantees on soft real- time applications via SLAs • Admission Control and Legal Aspects (OPTIMIS FP7 project, 2010- 2013) – Risk, Eco-efficiency and Cost as parameters – Data Location considerations, Contractual terms • Abstracted Auditing/Monitoring of SLAs (ARTIST FP7 project, 2012- 2015) – 3ALib abstracted library implementation – CloudML@ARTIST definition of a UML profile for SLA descriptions 4
  • 5. Outline • SLALOM Project • Our background • Overview of contributions • SLA specification / reference model • Abstract metric function / definition • Conclusions 5
  • 6. SLA Specification Contribution (1/2) • The proposed specification / reference model takes into account – Standardization approaches and working groups outcomes – Current SLAs offered by commercial cloud providers – Expressed views by cloud providers and adopters – Research outcomes • With respect to ISO SLA WG outcomes – Follows ISO 19086-2: Core blocks (i.e. metric definition, parameters definition, rule definition) and the corresponding elements (e.g. ID, name, unit, scale, etc) of the SLA – Follows ISO 3534-2: Metric in different scales (such as interval, ratio, nominal or ordinal) 6
  • 7. SLA Specification Contribution (2/2) • With respect to ISO WG outcomes (cont.) – Suggests changes to ISO 19086-2: Naming of specific elements (e.g. referenceId is used both for metricId for parameterId and for ruleId, while SLALOM proposes the use of different identifiers) – Suggests changes to ISO 19086-2: Inclusion of additional elements and blocks: • Dependencies with other metrics - e.g. availability of storage service and dependency to latency or response time and dependency to bandwidth • Importance of a metric (i.e. gradeOfImportance) 7
  • 8. SLA Specification / Reference model: Main blocks • Follows ISO 19086-2 SLA specification • Metric – Corresponds to the service metric / objective (e.g. availability) • Parameter – Links the metric with a parameter to express how it can be monitored and validated (e.g. time to provide resources following an elasticity trigger) • Rule – Refers to metric “constraints” (e.g. number of concurrent connections for a number of users metric) • SLALOM proposed the addition of the Dependency block – Captures dependencies between metrics (e.g. response time and bandwidth) 8
  • 9. Components • Specific components are used for all building blocks – ID – Name – Definition / Expression – Unit – Notes • SLALOM proposed the following additions to the ISO specification – gradeOfImportance component for the Metric definition, to define the metrics importance within an SLA – consequenceOfViolation component for the Rule definition, to define the potential consequence of violation on the service provisioning 9
  • 10. Outline • SLALOM Project • Our background • Overview of contributions • SLA specification / reference model • Abstract metric function / definition • Conclusions 10
  • 11. Abstract metric function / definition primary goal • Have a standard that forces ambiguities to be clarified • Help in the measurement/auditing process of an SLA – Especially by 3rd party providers – What is the purpose of having SLAs if one is not able to measure them non-repudiably? • Abstract the process 11
  • 12. Example of ambiguity in the measurement process: AWS EC2 SLA • ““Unavailable” and “Unavailability” mean: – For Amazon EC2, when all of your running instances have no external connectivity.” • Determination of external connectivity. How? – E.g. pinging (ICMP)? • Security threat – Application layer (endpoint checking)? • Includes application downtime (Not the responsibility of AWS EC2) 12
  • 13. SLALOM 3-Layer Definition Metric (Ratio) Layer Final ratio calculation of the defined metric Correlates individual periods to the overall metric Period (Time) Layer Size of period, limit of base period and on the error rate needed Aggregates samples at a time interval granularity, deciding on this level Sample (Measurement) layer Boolean expression based on concrete measurement constraints Dictates success or not of a sample 13
  • 14. Sample Layer • Sample Condition - sc: the condition stating whether a sample has been successful. – operator: the operator can either be a boolean one (i.e. AND, OR, NOT) or a comparison operator (<, >, <=, >=, ==, !=). – value: the actual value of the condition that can be arithmetic, non-arithmetic (e.g. a string such as “exception”) or an enumeration (e.g. HTTP response code == 200). – unit: the unit for the value of the condition. • Sample - s: the sample used to evaluate a parameter against the condition sc. • Successful Sample - ss: the sample satisfying the condition sc. • Unsuccessful Sample - us: the sample not satisfying the condition sc. • Type of Operation field: defined sampling process e.g. pair of protocol- response Sample definition For a given type of operation as specified in the corresponding field (described previously) sc = operator + value + unit ss = s if (sc is true) us = s if (sc is false) 14
  • 15. Period Layer • Boundary Period - bp: the period for which the analysis of a parameter (through samples) should be taken into account. Any sample that is not meeting this criterion (i.e. falls within the period) is excluded even though if it is successful (i.e. ss according to the sample definition). – operator: a comparison operator (<, >, <=, >=, ==, !=). – value: the actual arithmetic value of the condition. – unit: the unit in this case is always a time unit (e.g. seconds, minutes, etc). • Error Condition - ec: the error condition ratio for which the analysis of a parameter (through samples) should be taken into account. The ratio is always expressed in a percentage (%) format. – operator: a comparison operator (<, >, <=, >=, ==, !=). – value: the actual arithmetic value of the condition. • Error Ratio - er: the error ratio calculated based on the total set of samples and the successful samples. • Period - p: the period in which samples (sc and uc) are examined according to the boundary period and the error condition. • Valid Period - vp: the period for which the error ratio value meets the error condition ratio and the boundary period condition is also satisfied. • Non-valid Period - np: the period for which the error ratio value does not meet the error condition ratio (the boundary period condition is satisfied). Boundary period and error definitions bp = operator + value + unit ec = operator + value + % er = Σus/Σs ∀us ∈ p vp = p if ((er<=ec) && (p>=bp)) np = p if ((er>=ec) && (p>=bp)) 15
  • 16. Metric Layer • Metric Condition - mc: the condition regarding a specific metric. The condition is always expressed in a percentage (Ratio Type?) (%) format to enable its evaluation as proposed through the metric evaluation. – operator: a comparison operator (<, >, <=, >=, ==, !=). – value: the actual arithmetic value of the condition. • Metric Evaluation - me: the evaluation of the metric based on the valid and non-valid period samples. The evaluation should be smaller than the condition (i.e. me <mc). Abstract metric definition mc = operator + value + % me = Σnp / (Σvp+Σnp) 16
  • 17. Mapping of EC2 SLA@SLALOM Amazon EC2 Level / definition Expression Notes Sample definition sc: UNDEFINED (assumed ‘ping’-> ICMP) The sampling condition is not defined in the Amazon EC2 SLA. The concrete wording is “when all of your running instances have no external connectivity”. Nonetheless, the way to specify / measure “external connectivity” is not defined. For example a customer could use a ping operation or a custom monitoring mechanism. Type of operation: ping Not defined how the condition of connectivity can be actually measured (e.g. the ping operation mentioned previously). Boundary period and error definitions bp > 60 sec The exact wording is “the percentage of minutes”, thus the period is 60 seconds. ec = 100% Error condition reflecting that the error ratio is that for the entire bp the resource must be continuously “unavailable”. Abstract metric definition availability < 99.95 % Availability metric definition given the boundary period and error condition. 17
  • 18. Mapping of SLALOM @ GAE Datastore Google AppEngine Datastore Level / definition Expression Notes Sample definition sc: INTERNAL_ERROR Several sampling conditions are defined per type of operation. For example it is specified (exact wording) “INTERNAL_ERROR, TIMEOUT, …” for API calls. Type of operation: API calls Several type of operations are defined. An example is provided here. Boundary period and error definitions bp > 300 sec The exact wording is “five consecutive minutes”. ec > 10% Error condition reflecting that the error ratio is (exact wording) “ten percent Error Rate”. Abstract metric definition availability < 99.95 % Availability metric definition given the boundary period and error condition. 18
  • 19. Mapping of SLALOM @ Microsoft Azure SLA 19 Microsoft Azure Storage Level / definition Expression Notes Sample definition sc = 60 sec Several sampling conditions are defined per type of operation. For example it is specified (exact wording) “Sixty (60) seconds” for PutBlockList and GetBlockList. Type of operation: PutBlockList and GetBlockList Several type of operations are defined. An example is provided here. Boundary period and error definitions bp > 3600 sec The exact wording is “given one-hour interval”. ec > 0% Error condition reflecting that all periods should be taken into account for the availability metric evaluation (exact wording) “is the sum of Error Rates for each hour”. Abstract metric definition availability < 99.9 % Availability metric definition given the boundary period and error condition.
  • 20. Preconditions • For any SLA to apply, a number of preconditions typically exist per provider • Examples – Deployment: Number of Availability Zones used – Deployment: Replication options used – Usage/Measurement: Restarting of resources when unavailable – Usage/Measurement: Applied Throttling of requests 20
  • 21. Outline • SLALOM Project • Our background • Overview of contributions • SLA specification / reference model • Abstract metric function / definition • Conclusions 21
  • 22. Snapshot of SLA contributions • An SLA specification / reference model (captured in SLALOM SLA Specification and Reference Model v1: http://slalom-project.eu/content/d32-%E2%80%93-sla- specification-and-reference-model) – Follows and proposes changes to ISO 19086-2 – Follows ISO 3534-2 • An abstract metric function / definition that can be exploited to specify any SLA metric (captured in SLALOM SLA Metric Specification v1: http://slalom- project.eu/content/slalom-sla-specification-v1-sep- 2015) – Submitted as contribution to ISO 19086-2 22
  • 23. Feedback • Please provide us feedback on the presented concepts at: • https://docs.google.com/forms/d/1Ljnc2x2WS aAXrWzHglDyy31xdCNC3qDiQqhC1FoxDZI/vie wform 23
  • 25. 3ALib (Availability SLA Benchmark and Auditor) • Abstracted Availability Auditor Library (Java-based) Implementation: – Based on the conceptual abstractions of different providers SLAs – Abstracted at the code level, for efficient replacement of processes and inclusion of providers • Purpose: – Align monitoring with specific provider definitions and adapt availability calculations – Check preconditions of SLA applicability for a specific deployment and give feedback – Adapt to dynamic Cloud Services user behavior 25
  • 26. Details for GAE Datastore SLA 26
  • 27. Details for Azure SLA Transaction Type Maximum Processing Time*  PutBlob and GetBlob (includes blocks and pages)  Get Valid Page Blob Ranges Two (2) seconds multiplied by the number of MBs transferred in the course of processing the request  Copy Blob Ninety (90) seconds (where the source and destination blobs are within the same storage account)  PutBlockList  GetBlockList Sixty (60) seconds  Table Query  List Operations Ten (10) seconds (to complete processing or return a continuation)  Batch Table Operations Thirty (30) seconds  All Single Entity Table Operations  All other Blob and Message Operations Two (2) seconds 27