SlideShare a Scribd company logo
Reliable Capacity Provisioning and Enhanced
Remediation for Distributed Cloud Applications
http://recap-project.eu recap2020
THIS PROJECT HAS RECEIVED FUNDING FROM THE EUROPEAN UNION’S HORIZON 2020
RESEARCH AND INNOVATION PROGRAMME UNDER GRANT AGREEMENT NUMBER 732667
REliable CApacity Provisioning for
Distributed Cloud/Edge/Fog
Computing Applications
(RECAP)
Sergej Svorobej
Dublin City University
Agenda
• Introduction
• The RECAP Vision
• Use Cases for RECAP
• DCU - Simulation Framework
• Intel
2
What is RECAP?
• Reliable Capacity Provisioning and Enhanced Remediation for
Distributed Cloud/Edge/Fog Applications
• Founded under European Horizon 2020 framework
• Jan 2017 – Dec 2019
• Consortium comprises many academic and industrial partners
⁃ 9 partners from 5 countries (Germany, Ireland, Spain, Sweden, UK)
• http://recap-project.eu
3
Partner Location
4
Project Motivation
• Large-scale systems are typically built as
distributed systems
• Tradeoffs in the placement of application
components:
⁃ Data center High latency, high power
⁃ Fog/Edge Low latency, low power
• Cloud computing capacity is provisioned
using best-effort models and coarsed-
grained QoS mechanisms
⁃ Not a sustainable way as the number of
connected ”things” increases
5
Realisation Approach (1/2)
• An architecture for cloud-edge computing capacity provisioning and
remediation
⁃ Fine-grained and accurate models of application behaviour and deployment
⁃ Model of QoS requirements at application component-level
⁃ Model of workloads
• To understand and predict the behaviour of applications (users)
• To enhance the proactive remediation of systems
6
Realisation Approach (2/2)
7
Architecture
• Feedback Loop
⁃ Collector
⁃ Application Modeler
⁃ Workload Modeler
⁃ Optimizer
⁃ Simulator
8
The RECAP Collector
• Gathers, synthesizes and analyzes metrics to be monitored across
the infrastructure
• Acquires, characterizes, and analyzes data
⁃ Workload patterns and their relationship
⁃ Status of the infrastructure
⁃ …
• Visualizes, annotates,archives and manages the collected data
9
Knowledge
Discovery
The RECAP Application Modeler
• With the knowledge provided by the Collector, the Modeler
discovers and defines
⁃ The internal structure of cloud applications
⁃ The QoS requirements for each applications
• To support intelligent decision making
⁃ Application/Component placement and autoscaling
10
Application
Modeling
The RECAP Workload Modeler
• Decomposes, classifies, and predicts the workloads in the network,
and the load propagation in applications
⁃ CPU, memory, network traffic, …
• Models the workload distribution and
load propagation patterns
• To improve planning decision and ensure QoS
• To support the construction of an artificial workload generation tool
⁃ Validating and training the learning models
11
Workload
Modeling
The RECAP Optimizer
• With the models provided by the Modelers, the Optimizer performs
optimization tasks
⁃ Application/Component placement: scaling vs. migration
⁃ Infrastructure management decisions: energy, utilization rate,
load balancing
• To improve the efficiency in resources utilzation
while maintaining QoS
12
Optimization
The RECAP Simulator
• Simulation is needed due to the size and complexity of the
intended target systems
⁃ Simulateinteractions of distributed cloud application behaviors
⁃ Emulate data center and network systems
• To assist the Optimizer with the evaluation of different
deployments of applications and infrastructures
• To feed back to the Data Collector with simulation results
13
Testing &
Improvement
Use Cases from Industry
• BT
⁃ NFV, QoS Management and
Remediation
• Linknovate
⁃ Complex Big Data Analytics
Engine
• Satec
⁃ Fog Computing and Large
Scale IoT Scenario for
supporting Smart Cities
• Tieto
⁃ Infrastructure and Network
Management
14
Reliable Capacity Provisioning and Enhanced
Remediation for Distributed Cloud Applications
http://recap-project.eu recap2020
THIS PROJECT HAS RECEIVED FUNDING FROM THE EUROPEAN UNION’S HORIZON 2020
RESEARCH AND INNOVATION PROGRAMME UNDER GRANT AGREEMENT NUMBER 732667
WP7 – Large Scale Simulation Framework
Sergej Svorobej
Dublin City University
16
Role of simulation
• Reason on implementation effectiveness
• Show optimisation effects
⁃ Placement
⁃ Consolidation
⁃ Elasticity
⁃ Infrastructure and Application configuration
⁃ Cost
• Provide value to the use case owners
17
18
API
API
APPLICATION
COMPONENT
1
A 300
B 500
C 1000
LB
COMPONENT
1’
A 300
B 500
C 1000
LB
RequestDevice
API
API
COMPONENT
2
A 500
B 450
C 100
COMPONENT
2’
A 500
B 450
C 100
High level model
DeviceDevice/
User
Request
Request
Network
Node Node Node
Node
Node
Node
Node
Node
Node
Network Network
Tier 1 Metro Core
Simulation challenges
• Experiment model size
⁃ Network topology (Graph)
⁃ Infrastructure (Physical and Virtual)
⁃ Workload
⁃ Application
• Speed and resource demand
• Accuracy and granularity
19
Research direction
• Discrete Event Simulation (DES)
⁃ Event Queue simulation engine
⁃ Java 8
⁃ CloudSimPlus based (http://cloudsimplus.org/)
• “Time step” simulation
⁃ Calculation loop
⁃ C/C++, MPI based
⁃ CLSim (https://cloudlightning.eu/)
20
THANK YOU
http://recap-project.eu recap2020
RECAP Project ■ H2020 ■ Grant Agreement #732667
Call: H2020-ICT-2016-2017 ■Topic: ICT-06-2016
THIS PROJECT HAS RECEIVED FUNDING FROM THE EUROPEAN UNION’S HORIZON 2020
RESEARCH AND INNOVATION PROGRAMME UNDER GRANT AGREEMENT NUMBER 732667
Additional Slides
22
BT
23
• NFV infrastructure and virtual CDNs
⁃ Softwarization of network appliances
⁃ Mulitiple distributed applications per
CDN operator
• RECAP:
⁃ Automated decision making
• Optimization of placement and
scaling vCDN systems
• Monitoring and Remediation
⁃ Improves resource utilization while
guaranteeing SLAs
Linknovate
• Big data analytics engine
⁃ Data acquisition, data aggregation,
data processing, data visualization
• RECAP:
⁃ Characterizes workload and
models workload distribution
⁃ Automatically and dynamically
allocates computing resources
⁃ To reduce costs and improve
performance
24
Satec
• IoT, smart city
⁃ Data management system: to collect sensory data, and to provide data (in
a distributed manner)
• RECAP:
⁃ Optimizes the placement of IoT
resources such as computation,
storage for cost and latency
⁃ Automated reallocation of resources
⁃ Automated deployment of IoT
applications
25
Tieto
26
• Telecommunication
infrastructure systems
and applications
• RECAP:
⁃ Automated profiling and
simulating the
infrastructure & VNFs
⁃ QoS and low latency

More Related Content

What's hot

Martin Brooks Green It Workshop Final
Martin Brooks Green It Workshop FinalMartin Brooks Green It Workshop Final
Martin Brooks Green It Workshop Final
Bill St. Arnaud
 
HPC Midlands - Update for Bull eXtreme Computing User Group 2012 meeting
HPC Midlands - Update for Bull eXtreme Computing User Group 2012 meetingHPC Midlands - Update for Bull eXtreme Computing User Group 2012 meeting
HPC Midlands - Update for Bull eXtreme Computing User Group 2012 meeting
Martin Hamilton
 

What's hot (20)

Deep Hybrid DataCloud
Deep Hybrid DataCloudDeep Hybrid DataCloud
Deep Hybrid DataCloud
 
Martin Brooks Green It Workshop Final
Martin Brooks Green It Workshop FinalMartin Brooks Green It Workshop Final
Martin Brooks Green It Workshop Final
 
Edge optimized architecture for fabric defect detection in real-time
Edge optimized architecture for fabric defect detection in real-timeEdge optimized architecture for fabric defect detection in real-time
Edge optimized architecture for fabric defect detection in real-time
 
Postcard: NECOS
Postcard: NECOSPostcard: NECOS
Postcard: NECOS
 
HNSciCloud Prototype Phase Award - Marc-Elian Begin
HNSciCloud Prototype Phase Award - Marc-Elian Begin HNSciCloud Prototype Phase Award - Marc-Elian Begin
HNSciCloud Prototype Phase Award - Marc-Elian Begin
 
Helix Nebula - The Science Cloud - Lessons learned
Helix Nebula - The Science Cloud - Lessons learned Helix Nebula - The Science Cloud - Lessons learned
Helix Nebula - The Science Cloud - Lessons learned
 
FogFlow: Cloud-Edge Orchestrator in FIWARE
FogFlow: Cloud-Edge Orchestrator in FIWAREFogFlow: Cloud-Edge Orchestrator in FIWARE
FogFlow: Cloud-Edge Orchestrator in FIWARE
 
IoT Slam Keynote: Harnessing the Flood of Data with Heterogeneous Computing a...
IoT Slam Keynote: Harnessing the Flood of Data with Heterogeneous Computing a...IoT Slam Keynote: Harnessing the Flood of Data with Heterogeneous Computing a...
IoT Slam Keynote: Harnessing the Flood of Data with Heterogeneous Computing a...
 
Control of computing systems
Control of computing systemsControl of computing systems
Control of computing systems
 
HPC Midlands - Update for Bull eXtreme Computing User Group 2012 meeting
HPC Midlands - Update for Bull eXtreme Computing User Group 2012 meetingHPC Midlands - Update for Bull eXtreme Computing User Group 2012 meeting
HPC Midlands - Update for Bull eXtreme Computing User Group 2012 meeting
 
Rain technology ppt
Rain technology pptRain technology ppt
Rain technology ppt
 
Stream Processing
Stream Processing Stream Processing
Stream Processing
 
FIWARE Global Summit - FogFlow, a new GE for IoT Edge Computing
FIWARE Global Summit - FogFlow, a new GE for IoT Edge ComputingFIWARE Global Summit - FogFlow, a new GE for IoT Edge Computing
FIWARE Global Summit - FogFlow, a new GE for IoT Edge Computing
 
FIWARE Tech Summit - FogFlow - New GE for IoT Edge Computing
FIWARE Tech Summit - FogFlow - New GE for IoT Edge ComputingFIWARE Tech Summit - FogFlow - New GE for IoT Edge Computing
FIWARE Tech Summit - FogFlow - New GE for IoT Edge Computing
 
SPARK USE CASE- Distributed Reinforcement Learning for Electricity Market Bi...
SPARK USE CASE-  Distributed Reinforcement Learning for Electricity Market Bi...SPARK USE CASE-  Distributed Reinforcement Learning for Electricity Market Bi...
SPARK USE CASE- Distributed Reinforcement Learning for Electricity Market Bi...
 
GaruaGeo: Global Scale Data Aggregation in Hybrid Edge and Cloud Computing En...
GaruaGeo: Global Scale Data Aggregation in Hybrid Edge and Cloud Computing En...GaruaGeo: Global Scale Data Aggregation in Hybrid Edge and Cloud Computing En...
GaruaGeo: Global Scale Data Aggregation in Hybrid Edge and Cloud Computing En...
 
Integration of mixed-criticality subsystems on multicore and manycore processors
Integration of mixed-criticality subsystems on multicore and manycore processorsIntegration of mixed-criticality subsystems on multicore and manycore processors
Integration of mixed-criticality subsystems on multicore and manycore processors
 
Increasing ROI Through Simulation and the 'Digital Twin'
Increasing ROI Through Simulation and the 'Digital Twin'Increasing ROI Through Simulation and the 'Digital Twin'
Increasing ROI Through Simulation and the 'Digital Twin'
 
Telvent Big Data Approach and Case Studies
Telvent Big Data Approach and Case StudiesTelvent Big Data Approach and Case Studies
Telvent Big Data Approach and Case Studies
 
Excellerat CoE
Excellerat CoEExcellerat CoE
Excellerat CoE
 

Similar to The RECAP Project: Large Scale Simulation Framework

IWSM2014 MEGSUS14 - GQM on energy for SaaS - CETIC
IWSM2014   MEGSUS14 - GQM on energy for SaaS - CETICIWSM2014   MEGSUS14 - GQM on energy for SaaS - CETIC
IWSM2014 MEGSUS14 - GQM on energy for SaaS - CETIC
Nesma
 
module1st-cloudcomputing-180131063409 - Copy.pdf
module1st-cloudcomputing-180131063409 - Copy.pdfmodule1st-cloudcomputing-180131063409 - Copy.pdf
module1st-cloudcomputing-180131063409 - Copy.pdf
BenakappaSM
 

Similar to The RECAP Project: Large Scale Simulation Framework (20)

RECAP at ETSI Experiential Network Intelligence (ENI) Meeting
RECAP at ETSI Experiential Network Intelligence (ENI) MeetingRECAP at ETSI Experiential Network Intelligence (ENI) Meeting
RECAP at ETSI Experiential Network Intelligence (ENI) Meeting
 
Webinar: Burst ANSYS Workloads to the Cloud with Univa & UberCloud
Webinar: Burst ANSYS Workloads to the Cloud with Univa & UberCloudWebinar: Burst ANSYS Workloads to the Cloud with Univa & UberCloud
Webinar: Burst ANSYS Workloads to the Cloud with Univa & UberCloud
 
Univa Presentation at DAC 2020
Univa Presentation at DAC 2020 Univa Presentation at DAC 2020
Univa Presentation at DAC 2020
 
IWSM2014 MEGSUS14 - GQM on energy for SaaS - CETIC
IWSM2014   MEGSUS14 - GQM on energy for SaaS - CETICIWSM2014   MEGSUS14 - GQM on energy for SaaS - CETIC
IWSM2014 MEGSUS14 - GQM on energy for SaaS - CETIC
 
Service Engineering, ZHAW for CeBIT
Service Engineering, ZHAW for CeBITService Engineering, ZHAW for CeBIT
Service Engineering, ZHAW for CeBIT
 
01-06 OCRE Test Suite - Fernandes.pdf
01-06 OCRE Test Suite - Fernandes.pdf01-06 OCRE Test Suite - Fernandes.pdf
01-06 OCRE Test Suite - Fernandes.pdf
 
Navops talk at hpc in the cloud meetup 19 march 2019
Navops talk at hpc in the cloud meetup 19 march 2019Navops talk at hpc in the cloud meetup 19 march 2019
Navops talk at hpc in the cloud meetup 19 march 2019
 
Introduction to Cloud Computing
Introduction to Cloud ComputingIntroduction to Cloud Computing
Introduction to Cloud Computing
 
Green cloud computing
Green  cloud computingGreen  cloud computing
Green cloud computing
 
Modern Context-Aware Data Center Design
Modern Context-Aware Data Center DesignModern Context-Aware Data Center Design
Modern Context-Aware Data Center Design
 
module1st-cloudcomputing-180131063409 - Copy.pdf
module1st-cloudcomputing-180131063409 - Copy.pdfmodule1st-cloudcomputing-180131063409 - Copy.pdf
module1st-cloudcomputing-180131063409 - Copy.pdf
 
Singapore oif transport-sdn-junjie li
Singapore oif transport-sdn-junjie liSingapore oif transport-sdn-junjie li
Singapore oif transport-sdn-junjie li
 
Enterprise Cloud Transformation
Enterprise Cloud TransformationEnterprise Cloud Transformation
Enterprise Cloud Transformation
 
Design and inplementation of hybrid cloud computing architecture based on clo...
Design and inplementation of hybrid cloud computing architecture based on clo...Design and inplementation of hybrid cloud computing architecture based on clo...
Design and inplementation of hybrid cloud computing architecture based on clo...
 
Design and implementation of hybrid cloud computing architecture based on clo...
Design and implementation of hybrid cloud computing architecture based on clo...Design and implementation of hybrid cloud computing architecture based on clo...
Design and implementation of hybrid cloud computing architecture based on clo...
 
Basics of Cloud Computing- 5 th semester
Basics of Cloud Computing- 5 th semesterBasics of Cloud Computing- 5 th semester
Basics of Cloud Computing- 5 th semester
 
Cloud Computing and Agile Product Line Engineering Integration
Cloud Computing and Agile Product Line Engineering IntegrationCloud Computing and Agile Product Line Engineering Integration
Cloud Computing and Agile Product Line Engineering Integration
 
CloudLighting - A Brief Overview
CloudLighting - A Brief OverviewCloudLighting - A Brief Overview
CloudLighting - A Brief Overview
 
Overview of CloudLightning
Overview of CloudLightningOverview of CloudLightning
Overview of CloudLightning
 
On-Demand Production Infrastructure delivered Just In Time By Shane Guthrie o...
On-Demand Production Infrastructure delivered Just In Time By Shane Guthrie o...On-Demand Production Infrastructure delivered Just In Time By Shane Guthrie o...
On-Demand Production Infrastructure delivered Just In Time By Shane Guthrie o...
 

Recently uploaded

Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo DiehlFuture Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Peter Udo Diehl
 
Search and Society: Reimagining Information Access for Radical Futures
Search and Society: Reimagining Information Access for Radical FuturesSearch and Society: Reimagining Information Access for Radical Futures
Search and Society: Reimagining Information Access for Radical Futures
Bhaskar Mitra
 

Recently uploaded (20)

JMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and GrafanaJMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and Grafana
 
Salesforce Adoption – Metrics, Methods, and Motivation, Antone Kom
Salesforce Adoption – Metrics, Methods, and Motivation, Antone KomSalesforce Adoption – Metrics, Methods, and Motivation, Antone Kom
Salesforce Adoption – Metrics, Methods, and Motivation, Antone Kom
 
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...
 
Powerful Start- the Key to Project Success, Barbara Laskowska
Powerful Start- the Key to Project Success, Barbara LaskowskaPowerful Start- the Key to Project Success, Barbara Laskowska
Powerful Start- the Key to Project Success, Barbara Laskowska
 
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
 
In-Depth Performance Testing Guide for IT Professionals
In-Depth Performance Testing Guide for IT ProfessionalsIn-Depth Performance Testing Guide for IT Professionals
In-Depth Performance Testing Guide for IT Professionals
 
How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...
 
What's New in Teams Calling, Meetings and Devices April 2024
What's New in Teams Calling, Meetings and Devices April 2024What's New in Teams Calling, Meetings and Devices April 2024
What's New in Teams Calling, Meetings and Devices April 2024
 
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualitySoftware Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
 
Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...
 
Custom Approval Process: A New Perspective, Pavel Hrbacek & Anindya Halder
Custom Approval Process: A New Perspective, Pavel Hrbacek & Anindya HalderCustom Approval Process: A New Perspective, Pavel Hrbacek & Anindya Halder
Custom Approval Process: A New Perspective, Pavel Hrbacek & Anindya Halder
 
Demystifying gRPC in .Net by John Staveley
Demystifying gRPC in .Net by John StaveleyDemystifying gRPC in .Net by John Staveley
Demystifying gRPC in .Net by John Staveley
 
The architecture of Generative AI for enterprises.pdf
The architecture of Generative AI for enterprises.pdfThe architecture of Generative AI for enterprises.pdf
The architecture of Generative AI for enterprises.pdf
 
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo DiehlFuture Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
 
Search and Society: Reimagining Information Access for Radical Futures
Search and Society: Reimagining Information Access for Radical FuturesSearch and Society: Reimagining Information Access for Radical Futures
Search and Society: Reimagining Information Access for Radical Futures
 
Knowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and backKnowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and back
 
WSO2CONMay2024OpenSourceConferenceDebrief.pptx
WSO2CONMay2024OpenSourceConferenceDebrief.pptxWSO2CONMay2024OpenSourceConferenceDebrief.pptx
WSO2CONMay2024OpenSourceConferenceDebrief.pptx
 
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
 
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
 
UiPath Test Automation using UiPath Test Suite series, part 2
UiPath Test Automation using UiPath Test Suite series, part 2UiPath Test Automation using UiPath Test Suite series, part 2
UiPath Test Automation using UiPath Test Suite series, part 2
 

The RECAP Project: Large Scale Simulation Framework

  • 1. Reliable Capacity Provisioning and Enhanced Remediation for Distributed Cloud Applications http://recap-project.eu recap2020 THIS PROJECT HAS RECEIVED FUNDING FROM THE EUROPEAN UNION’S HORIZON 2020 RESEARCH AND INNOVATION PROGRAMME UNDER GRANT AGREEMENT NUMBER 732667 REliable CApacity Provisioning for Distributed Cloud/Edge/Fog Computing Applications (RECAP) Sergej Svorobej Dublin City University
  • 2. Agenda • Introduction • The RECAP Vision • Use Cases for RECAP • DCU - Simulation Framework • Intel 2
  • 3. What is RECAP? • Reliable Capacity Provisioning and Enhanced Remediation for Distributed Cloud/Edge/Fog Applications • Founded under European Horizon 2020 framework • Jan 2017 – Dec 2019 • Consortium comprises many academic and industrial partners ⁃ 9 partners from 5 countries (Germany, Ireland, Spain, Sweden, UK) • http://recap-project.eu 3
  • 5. Project Motivation • Large-scale systems are typically built as distributed systems • Tradeoffs in the placement of application components: ⁃ Data center High latency, high power ⁃ Fog/Edge Low latency, low power • Cloud computing capacity is provisioned using best-effort models and coarsed- grained QoS mechanisms ⁃ Not a sustainable way as the number of connected ”things” increases 5
  • 6. Realisation Approach (1/2) • An architecture for cloud-edge computing capacity provisioning and remediation ⁃ Fine-grained and accurate models of application behaviour and deployment ⁃ Model of QoS requirements at application component-level ⁃ Model of workloads • To understand and predict the behaviour of applications (users) • To enhance the proactive remediation of systems 6
  • 8. Architecture • Feedback Loop ⁃ Collector ⁃ Application Modeler ⁃ Workload Modeler ⁃ Optimizer ⁃ Simulator 8
  • 9. The RECAP Collector • Gathers, synthesizes and analyzes metrics to be monitored across the infrastructure • Acquires, characterizes, and analyzes data ⁃ Workload patterns and their relationship ⁃ Status of the infrastructure ⁃ … • Visualizes, annotates,archives and manages the collected data 9 Knowledge Discovery
  • 10. The RECAP Application Modeler • With the knowledge provided by the Collector, the Modeler discovers and defines ⁃ The internal structure of cloud applications ⁃ The QoS requirements for each applications • To support intelligent decision making ⁃ Application/Component placement and autoscaling 10 Application Modeling
  • 11. The RECAP Workload Modeler • Decomposes, classifies, and predicts the workloads in the network, and the load propagation in applications ⁃ CPU, memory, network traffic, … • Models the workload distribution and load propagation patterns • To improve planning decision and ensure QoS • To support the construction of an artificial workload generation tool ⁃ Validating and training the learning models 11 Workload Modeling
  • 12. The RECAP Optimizer • With the models provided by the Modelers, the Optimizer performs optimization tasks ⁃ Application/Component placement: scaling vs. migration ⁃ Infrastructure management decisions: energy, utilization rate, load balancing • To improve the efficiency in resources utilzation while maintaining QoS 12 Optimization
  • 13. The RECAP Simulator • Simulation is needed due to the size and complexity of the intended target systems ⁃ Simulateinteractions of distributed cloud application behaviors ⁃ Emulate data center and network systems • To assist the Optimizer with the evaluation of different deployments of applications and infrastructures • To feed back to the Data Collector with simulation results 13 Testing & Improvement
  • 14. Use Cases from Industry • BT ⁃ NFV, QoS Management and Remediation • Linknovate ⁃ Complex Big Data Analytics Engine • Satec ⁃ Fog Computing and Large Scale IoT Scenario for supporting Smart Cities • Tieto ⁃ Infrastructure and Network Management 14
  • 15. Reliable Capacity Provisioning and Enhanced Remediation for Distributed Cloud Applications http://recap-project.eu recap2020 THIS PROJECT HAS RECEIVED FUNDING FROM THE EUROPEAN UNION’S HORIZON 2020 RESEARCH AND INNOVATION PROGRAMME UNDER GRANT AGREEMENT NUMBER 732667 WP7 – Large Scale Simulation Framework Sergej Svorobej Dublin City University
  • 16. 16
  • 17. Role of simulation • Reason on implementation effectiveness • Show optimisation effects ⁃ Placement ⁃ Consolidation ⁃ Elasticity ⁃ Infrastructure and Application configuration ⁃ Cost • Provide value to the use case owners 17
  • 18. 18 API API APPLICATION COMPONENT 1 A 300 B 500 C 1000 LB COMPONENT 1’ A 300 B 500 C 1000 LB RequestDevice API API COMPONENT 2 A 500 B 450 C 100 COMPONENT 2’ A 500 B 450 C 100 High level model DeviceDevice/ User Request Request Network Node Node Node Node Node Node Node Node Node Network Network Tier 1 Metro Core
  • 19. Simulation challenges • Experiment model size ⁃ Network topology (Graph) ⁃ Infrastructure (Physical and Virtual) ⁃ Workload ⁃ Application • Speed and resource demand • Accuracy and granularity 19
  • 20. Research direction • Discrete Event Simulation (DES) ⁃ Event Queue simulation engine ⁃ Java 8 ⁃ CloudSimPlus based (http://cloudsimplus.org/) • “Time step” simulation ⁃ Calculation loop ⁃ C/C++, MPI based ⁃ CLSim (https://cloudlightning.eu/) 20
  • 21. THANK YOU http://recap-project.eu recap2020 RECAP Project ■ H2020 ■ Grant Agreement #732667 Call: H2020-ICT-2016-2017 ■Topic: ICT-06-2016 THIS PROJECT HAS RECEIVED FUNDING FROM THE EUROPEAN UNION’S HORIZON 2020 RESEARCH AND INNOVATION PROGRAMME UNDER GRANT AGREEMENT NUMBER 732667
  • 23. BT 23 • NFV infrastructure and virtual CDNs ⁃ Softwarization of network appliances ⁃ Mulitiple distributed applications per CDN operator • RECAP: ⁃ Automated decision making • Optimization of placement and scaling vCDN systems • Monitoring and Remediation ⁃ Improves resource utilization while guaranteeing SLAs
  • 24. Linknovate • Big data analytics engine ⁃ Data acquisition, data aggregation, data processing, data visualization • RECAP: ⁃ Characterizes workload and models workload distribution ⁃ Automatically and dynamically allocates computing resources ⁃ To reduce costs and improve performance 24
  • 25. Satec • IoT, smart city ⁃ Data management system: to collect sensory data, and to provide data (in a distributed manner) • RECAP: ⁃ Optimizes the placement of IoT resources such as computation, storage for cost and latency ⁃ Automated reallocation of resources ⁃ Automated deployment of IoT applications 25
  • 26. Tieto 26 • Telecommunication infrastructure systems and applications • RECAP: ⁃ Automated profiling and simulating the infrastructure & VNFs ⁃ QoS and low latency