SlideShare a Scribd company logo
University of Stuttgart
Universitätsstr. 38
70569 Stuttgart
Germany
Phone +49-711-685 88337
Fax +49-711-685 88472
Research
Santiago Gómez Sáez, Vasilios Andrikopoulos, Frank Leymann, and Steve Strauch
Institute of Architecture of Application Systems
{gomez-saez, andrikopoulos, leymann, strauch}@iaas.uni-stuttgart.de
Evaluating Caching Strategies
for Cloud Data Access using an
Enterprise Service Bus
IEEE IC2E 2014
Research
© Santiago Gómez Sáez 2
Agenda
 Motivating Scenario
 CDASMix Architecture & Realization
 Evaluation
 Conclusion and Future Work
33
Research
© Santiago Gómez Sáez
Motivating Scenario
Presentation
Layer
Application
Business
Layer
SQL
Data Access
Layer
SQL
Data Access LayerCloud-Enabled Data Access Layer
SQL
Registry
Public Cloud Public CloudTraditional
Application Layers
Deployment
Models
Assumptions
 Database layer has already been
migrated
 Focus on Relational Databases
44
Research
© Santiago Gómez Sáez
CDASMix - Architecture
Presentation
Business
Logic
Resources
Web Service API
Configuration Registry Manager
Tenant Registry Manager
Service Registry Manager
JBI Container Manager
Service Assembly Manager
Service Registry
Database Cluster
Configuration
Registry Database
JBI Container
Instance Cluster
Access Layer
Web UI
Tenant Registry
Database
Message Broker
(1) Strauch et al.: Transparent Access to Relational Databases in the Cloud Using a Multi-tenant ESB. CLOSER’14
(2) ESBMT Project: www.iaas.uni-stuttgart.de/esbmt/
55
Research
© Santiago Gómez Sáez
CDASMix – Cloud Data Access ESB Instance
OSGi Environment
JBI Environment
Standardized Interfaces for Service Engines
Standardized Interfaces for Binding Components
Normalized Message Router
External
Application
SMX-Camel
-mt
MySQL
Proxy
SMX-
Camel
Camel
cdasmixJDBC
Backend Cloud Data Store Provider
Legend
Message Flow
OSGi Component
JBI Component
NMR API
Cache Cluster
Instance 1Instance 1Instance 1
• Ehcache 2.6.0
• LRU, LFU & FIFO
• Multi-tenancy Awareness
66
Research
© Santiago Gómez Sáez
Evaluation – Methodology & Data Set
 Measure how caching mitigates the performance degradation
when accessing data through CDASMix
 Analyze the optimal cache eviction algorithm (in tandem with the
MySQL instances)
 Cache Hit rate in % and throughput in Req./s
 TPC-H 1 GB data distributed in 8 tables
 Discrete uniform (1/N) generated workload from 5 adapted TPC-
H queries -> read intensive (2.5 MB per query) constituted by 100
queries from initial sample of 9 queries
 Generated Load publicly available at
https://santiago.studiforge.informatik.uni-
stuttgart.de/svn/publications/IC2E14/queries4Load/generatedLoad 5-100.csv
77
Research
© Santiago Gómez Sáez
Evaluation Setup
VM0 (Flexiscale)
Apache JMeter
2.9
CDASMix
MySQL 5.1
TPC-H
Amazon RDS
MySQL 5.1 instance
VM1 (Amazon EC2)
MySQL 5.1
D1D2 D3
E3 E2 E1
Legend
Message Flow
Measurement Point
Throughput and
Transfer Rate
Built-in Cache
E
TPC-H
TPC-H
QueryGen.shload.csv
MySQL & Ehcache cache size 16MB
88
Research
© Santiago Gómez Sáez
Evaluation – MySQL in IaaS Flexiscale & AWS EC2
Flexiscale AWS EC2
-51%
-14%
-10%
+17%
-39%
+21%
+30%
+16%
99
Research
© Santiago Gómez Sáez
Evaluation – MySQL in IaaS Flexiscale & AWS EC2
Flexiscale AWS EC2
-51%
+43%
+46%
+42%
-39%
+50%
+53%
+48%
1010
Research
© Santiago Gómez Sáez
Evaluation – MySQL in AWS RDS
-93%
-89% -89%
-89%
1111
Research
© Santiago Gómez Sáez
Evaluation – MySQL in AWS RDS
-93%
+37% +37%
+37%
1212
Research
© Santiago Gómez Sáez
Evaluation – CDASMix Cache Hit Ratio
Flexiscale AWS EC2 AWS RDS
1313
Research
© Santiago Gómez Sáez
Conclusion & Future Work
 Design and realization of CDASMix, a multi-tenant aware ESB
solution that enables transparent data access
 Caching support for ameliorating the performance
 Evaluation based on
 different database deployment scenarios
 the utilization of different caching eviction algorithms
 Extend CDASMix towards supporting PostgreSQL
 CDASMix horizontal scalability & distributed caching
 Further evaluation
 + Caching Eviction Algorithms
 Different workloads
14
Thanks for your attention!!

More Related Content

What's hot

ACACES 2019: Towards Energy Efficient Deep Learning
ACACES 2019: Towards Energy Efficient Deep LearningACACES 2019: Towards Energy Efficient Deep Learning
ACACES 2019: Towards Energy Efficient Deep Learning
LEGATO project
 
Hpc Cloud project Overview
Hpc Cloud project OverviewHpc Cloud project Overview
Hpc Cloud project Overview
Floris Sluiter
 
HybridAzureCloud
HybridAzureCloudHybridAzureCloud
HybridAzureCloudChris Condo
 
An introduction to Workload Modelling for Cloud Applications
An introduction to Workload Modelling for Cloud ApplicationsAn introduction to Workload Modelling for Cloud Applications
An introduction to Workload Modelling for Cloud Applications
Ravi Yogesh
 
An optimized scientific workflow scheduling in cloud computing
An optimized scientific workflow scheduling in cloud computingAn optimized scientific workflow scheduling in cloud computing
An optimized scientific workflow scheduling in cloud computing
DIGVIJAY SHINDE
 
Analyse de sécurité de bout en bout avec la Suite Elastic
Analyse de sécurité de bout en bout avec la Suite ElasticAnalyse de sécurité de bout en bout avec la Suite Elastic
Analyse de sécurité de bout en bout avec la Suite Elastic
Elasticsearch
 
Compose hardware resources on the fly with openstack valence
Compose hardware resources on the fly with openstack valenceCompose hardware resources on the fly with openstack valence
Compose hardware resources on the fly with openstack valence
Shuquan Huang
 
Enabling Efficient and Geometric Range Query with Access Control over Encrypt...
Enabling Efficient and Geometric Range Query with Access Control over Encrypt...Enabling Efficient and Geometric Range Query with Access Control over Encrypt...
Enabling Efficient and Geometric Range Query with Access Control over Encrypt...
JAYAPRAKASH JPINFOTECH
 
Presentation fyp1automationreplicationinopenstack
Presentation fyp1automationreplicationinopenstackPresentation fyp1automationreplicationinopenstack
Presentation fyp1automationreplicationinopenstack
athiqah
 
Ultra Fast Deep Learning in Hybrid Cloud using Intel Analytics Zoo & Alluxio
Ultra Fast Deep Learning in Hybrid Cloud using Intel Analytics Zoo & AlluxioUltra Fast Deep Learning in Hybrid Cloud using Intel Analytics Zoo & Alluxio
Ultra Fast Deep Learning in Hybrid Cloud using Intel Analytics Zoo & Alluxio
Alluxio, Inc.
 
Build bare metal kubernetes cluster for hpc on open stack in translational me...
Build bare metal kubernetes cluster for hpc on open stack in translational me...Build bare metal kubernetes cluster for hpc on open stack in translational me...
Build bare metal kubernetes cluster for hpc on open stack in translational me...
Shuquan Huang
 
IoT Event Processing and Analytics with InfluxDB in Google Cloud | Christoph ...
IoT Event Processing and Analytics with InfluxDB in Google Cloud | Christoph ...IoT Event Processing and Analytics with InfluxDB in Google Cloud | Christoph ...
IoT Event Processing and Analytics with InfluxDB in Google Cloud | Christoph ...
InfluxData
 
Distributed, concurrent, and independent access to encrypted cloud databases
Distributed, concurrent, and independent access to encrypted cloud databasesDistributed, concurrent, and independent access to encrypted cloud databases
Distributed, concurrent, and independent access to encrypted cloud databases
Papitha Velumani
 
Deep Learning and Gene Computing Acceleration with Alluxio in Kubernetes
Deep Learning and Gene Computing Acceleration with Alluxio in KubernetesDeep Learning and Gene Computing Acceleration with Alluxio in Kubernetes
Deep Learning and Gene Computing Acceleration with Alluxio in Kubernetes
Alluxio, Inc.
 
DATACUBES: Conquering Space & Time
DATACUBES: Conquering Space & TimeDATACUBES: Conquering Space & Time
DATACUBES: Conquering Space & Time
plan4all
 
distributed, concurrent, and independent access to encrypted cloud databases
distributed, concurrent, and independent access to encrypted cloud databasesdistributed, concurrent, and independent access to encrypted cloud databases
distributed, concurrent, and independent access to encrypted cloud databases
swathi78
 
The Past, Present, and Future of OpenACC
The Past, Present, and Future of OpenACCThe Past, Present, and Future of OpenACC
The Past, Present, and Future of OpenACC
inside-BigData.com
 
Towards Enabling Mid-Scale Geo-Science Experiments Through Microsoft Trident ...
Towards Enabling Mid-Scale Geo-Science Experiments Through Microsoft Trident ...Towards Enabling Mid-Scale Geo-Science Experiments Through Microsoft Trident ...
Towards Enabling Mid-Scale Geo-Science Experiments Through Microsoft Trident ...
Eran Chinthaka Withana
 

What's hot (18)

ACACES 2019: Towards Energy Efficient Deep Learning
ACACES 2019: Towards Energy Efficient Deep LearningACACES 2019: Towards Energy Efficient Deep Learning
ACACES 2019: Towards Energy Efficient Deep Learning
 
Hpc Cloud project Overview
Hpc Cloud project OverviewHpc Cloud project Overview
Hpc Cloud project Overview
 
HybridAzureCloud
HybridAzureCloudHybridAzureCloud
HybridAzureCloud
 
An introduction to Workload Modelling for Cloud Applications
An introduction to Workload Modelling for Cloud ApplicationsAn introduction to Workload Modelling for Cloud Applications
An introduction to Workload Modelling for Cloud Applications
 
An optimized scientific workflow scheduling in cloud computing
An optimized scientific workflow scheduling in cloud computingAn optimized scientific workflow scheduling in cloud computing
An optimized scientific workflow scheduling in cloud computing
 
Analyse de sécurité de bout en bout avec la Suite Elastic
Analyse de sécurité de bout en bout avec la Suite ElasticAnalyse de sécurité de bout en bout avec la Suite Elastic
Analyse de sécurité de bout en bout avec la Suite Elastic
 
Compose hardware resources on the fly with openstack valence
Compose hardware resources on the fly with openstack valenceCompose hardware resources on the fly with openstack valence
Compose hardware resources on the fly with openstack valence
 
Enabling Efficient and Geometric Range Query with Access Control over Encrypt...
Enabling Efficient and Geometric Range Query with Access Control over Encrypt...Enabling Efficient and Geometric Range Query with Access Control over Encrypt...
Enabling Efficient and Geometric Range Query with Access Control over Encrypt...
 
Presentation fyp1automationreplicationinopenstack
Presentation fyp1automationreplicationinopenstackPresentation fyp1automationreplicationinopenstack
Presentation fyp1automationreplicationinopenstack
 
Ultra Fast Deep Learning in Hybrid Cloud using Intel Analytics Zoo & Alluxio
Ultra Fast Deep Learning in Hybrid Cloud using Intel Analytics Zoo & AlluxioUltra Fast Deep Learning in Hybrid Cloud using Intel Analytics Zoo & Alluxio
Ultra Fast Deep Learning in Hybrid Cloud using Intel Analytics Zoo & Alluxio
 
Build bare metal kubernetes cluster for hpc on open stack in translational me...
Build bare metal kubernetes cluster for hpc on open stack in translational me...Build bare metal kubernetes cluster for hpc on open stack in translational me...
Build bare metal kubernetes cluster for hpc on open stack in translational me...
 
IoT Event Processing and Analytics with InfluxDB in Google Cloud | Christoph ...
IoT Event Processing and Analytics with InfluxDB in Google Cloud | Christoph ...IoT Event Processing and Analytics with InfluxDB in Google Cloud | Christoph ...
IoT Event Processing and Analytics with InfluxDB in Google Cloud | Christoph ...
 
Distributed, concurrent, and independent access to encrypted cloud databases
Distributed, concurrent, and independent access to encrypted cloud databasesDistributed, concurrent, and independent access to encrypted cloud databases
Distributed, concurrent, and independent access to encrypted cloud databases
 
Deep Learning and Gene Computing Acceleration with Alluxio in Kubernetes
Deep Learning and Gene Computing Acceleration with Alluxio in KubernetesDeep Learning and Gene Computing Acceleration with Alluxio in Kubernetes
Deep Learning and Gene Computing Acceleration with Alluxio in Kubernetes
 
DATACUBES: Conquering Space & Time
DATACUBES: Conquering Space & TimeDATACUBES: Conquering Space & Time
DATACUBES: Conquering Space & Time
 
distributed, concurrent, and independent access to encrypted cloud databases
distributed, concurrent, and independent access to encrypted cloud databasesdistributed, concurrent, and independent access to encrypted cloud databases
distributed, concurrent, and independent access to encrypted cloud databases
 
The Past, Present, and Future of OpenACC
The Past, Present, and Future of OpenACCThe Past, Present, and Future of OpenACC
The Past, Present, and Future of OpenACC
 
Towards Enabling Mid-Scale Geo-Science Experiments Through Microsoft Trident ...
Towards Enabling Mid-Scale Geo-Science Experiments Through Microsoft Trident ...Towards Enabling Mid-Scale Geo-Science Experiments Through Microsoft Trident ...
Towards Enabling Mid-Scale Geo-Science Experiments Through Microsoft Trident ...
 

Similar to Evaluating Caching Strategies for Cloud Data Access using an Enterprise Service Bus

Performance_and_Cost_Evaluation
Performance_and_Cost_EvaluationPerformance_and_Cost_Evaluation
Performance_and_Cost_Evaluation
Santiago Gómez Sáez
 
Den Datenschatz heben und Zeit- und Energieeffizienz steigern: Mathematik und...
Den Datenschatz heben und Zeit- und Energieeffizienz steigern: Mathematik und...Den Datenschatz heben und Zeit- und Energieeffizienz steigern: Mathematik und...
Den Datenschatz heben und Zeit- und Energieeffizienz steigern: Mathematik und...
Joachim Schlosser
 
Tool-Driven Technology Transfer in Software Engineering
Tool-Driven Technology Transfer in Software EngineeringTool-Driven Technology Transfer in Software Engineering
Tool-Driven Technology Transfer in Software Engineering
Heiko Koziolek
 
Webinar: Cutting Time, Complexity and Cost from Data Science to Production
Webinar: Cutting Time, Complexity and Cost from Data Science to ProductionWebinar: Cutting Time, Complexity and Cost from Data Science to Production
Webinar: Cutting Time, Complexity and Cost from Data Science to Production
iguazio
 
Processing Large Datasets for ADAS Applications using Apache Spark
Processing Large Datasets for ADAS Applications using Apache SparkProcessing Large Datasets for ADAS Applications using Apache Spark
Processing Large Datasets for ADAS Applications using Apache Spark
Databricks
 
IEEE 2014 JAVA CLOUD COMPUTING PROJECTS Adaptive algorithm for minimizing clo...
IEEE 2014 JAVA CLOUD COMPUTING PROJECTS Adaptive algorithm for minimizing clo...IEEE 2014 JAVA CLOUD COMPUTING PROJECTS Adaptive algorithm for minimizing clo...
IEEE 2014 JAVA CLOUD COMPUTING PROJECTS Adaptive algorithm for minimizing clo...
IEEEGLOBALSOFTSTUDENTPROJECTS
 
2014 IEEE JAVA CLOUD COMPUTING PROJECT Adaptive algorithm for minimizing clou...
2014 IEEE JAVA CLOUD COMPUTING PROJECT Adaptive algorithm for minimizing clou...2014 IEEE JAVA CLOUD COMPUTING PROJECT Adaptive algorithm for minimizing clou...
2014 IEEE JAVA CLOUD COMPUTING PROJECT Adaptive algorithm for minimizing clou...
IEEEFINALSEMSTUDENTPROJECTS
 
Design_Support_Cloud_Application_Redistribution
Design_Support_Cloud_Application_RedistributionDesign_Support_Cloud_Application_Redistribution
Design_Support_Cloud_Application_Redistribution
Santiago Gómez Sáez
 
Dynamic_Cloud_Application_Redistribution_Performance_Optimization
Dynamic_Cloud_Application_Redistribution_Performance_OptimizationDynamic_Cloud_Application_Redistribution_Performance_Optimization
Dynamic_Cloud_Application_Redistribution_Performance_Optimization
Santiago Gómez Sáez
 
Privacy preserving public auditing for regenerating code based cloud storage
Privacy preserving public auditing for regenerating code based cloud storagePrivacy preserving public auditing for regenerating code based cloud storage
Privacy preserving public auditing for regenerating code based cloud storage
kitechsolutions
 
AIST Super Green Cloud: lessons learned from the operation and the performanc...
AIST Super Green Cloud: lessons learned from the operation and the performanc...AIST Super Green Cloud: lessons learned from the operation and the performanc...
AIST Super Green Cloud: lessons learned from the operation and the performanc...
Ryousei Takano
 
Managing and Deploying High Performance Computing Clusters using Windows HPC ...
Managing and Deploying High Performance Computing Clusters using Windows HPC ...Managing and Deploying High Performance Computing Clusters using Windows HPC ...
Managing and Deploying High Performance Computing Clusters using Windows HPC ...
Saptak Sen
 
OS for AI: Elastic Microservices & the Next Gen of ML
OS for AI: Elastic Microservices & the Next Gen of MLOS for AI: Elastic Microservices & the Next Gen of ML
OS for AI: Elastic Microservices & the Next Gen of ML
Nordic APIs
 
Providing user security guarantees in public infrastructure clouds
Providing user security guarantees in public infrastructure cloudsProviding user security guarantees in public infrastructure clouds
Providing user security guarantees in public infrastructure clouds
Finalyearprojects Toall
 
BDW16 London - William Vambenepe, Google - 3rd Generation Data Platform
BDW16 London - William Vambenepe, Google - 3rd Generation Data PlatformBDW16 London - William Vambenepe, Google - 3rd Generation Data Platform
BDW16 London - William Vambenepe, Google - 3rd Generation Data Platform
Big Data Week
 
Path to continuous delivery
Path to continuous deliveryPath to continuous delivery
Path to continuous delivery
Anirudh Bhatnagar
 
Why AIOps Matters For Kubernetes
Why AIOps Matters For KubernetesWhy AIOps Matters For Kubernetes
Why AIOps Matters For Kubernetes
Timothy Chen
 
MongoDB World 2018: MongoDB for High Volume Time Series Data Streams
MongoDB World 2018: MongoDB for High Volume Time Series Data StreamsMongoDB World 2018: MongoDB for High Volume Time Series Data Streams
MongoDB World 2018: MongoDB for High Volume Time Series Data Streams
MongoDB
 
Automated ML Workflow for Distributed Big Data Using Analytics Zoo (CVPR2020 ...
Automated ML Workflow for Distributed Big Data Using Analytics Zoo (CVPR2020 ...Automated ML Workflow for Distributed Big Data Using Analytics Zoo (CVPR2020 ...
Automated ML Workflow for Distributed Big Data Using Analytics Zoo (CVPR2020 ...
Jason Dai
 
Foundstone scq cypherpath
Foundstone scq cypherpathFoundstone scq cypherpath
Foundstone scq cypherpath
Learn24x7
 

Similar to Evaluating Caching Strategies for Cloud Data Access using an Enterprise Service Bus (20)

Performance_and_Cost_Evaluation
Performance_and_Cost_EvaluationPerformance_and_Cost_Evaluation
Performance_and_Cost_Evaluation
 
Den Datenschatz heben und Zeit- und Energieeffizienz steigern: Mathematik und...
Den Datenschatz heben und Zeit- und Energieeffizienz steigern: Mathematik und...Den Datenschatz heben und Zeit- und Energieeffizienz steigern: Mathematik und...
Den Datenschatz heben und Zeit- und Energieeffizienz steigern: Mathematik und...
 
Tool-Driven Technology Transfer in Software Engineering
Tool-Driven Technology Transfer in Software EngineeringTool-Driven Technology Transfer in Software Engineering
Tool-Driven Technology Transfer in Software Engineering
 
Webinar: Cutting Time, Complexity and Cost from Data Science to Production
Webinar: Cutting Time, Complexity and Cost from Data Science to ProductionWebinar: Cutting Time, Complexity and Cost from Data Science to Production
Webinar: Cutting Time, Complexity and Cost from Data Science to Production
 
Processing Large Datasets for ADAS Applications using Apache Spark
Processing Large Datasets for ADAS Applications using Apache SparkProcessing Large Datasets for ADAS Applications using Apache Spark
Processing Large Datasets for ADAS Applications using Apache Spark
 
IEEE 2014 JAVA CLOUD COMPUTING PROJECTS Adaptive algorithm for minimizing clo...
IEEE 2014 JAVA CLOUD COMPUTING PROJECTS Adaptive algorithm for minimizing clo...IEEE 2014 JAVA CLOUD COMPUTING PROJECTS Adaptive algorithm for minimizing clo...
IEEE 2014 JAVA CLOUD COMPUTING PROJECTS Adaptive algorithm for minimizing clo...
 
2014 IEEE JAVA CLOUD COMPUTING PROJECT Adaptive algorithm for minimizing clou...
2014 IEEE JAVA CLOUD COMPUTING PROJECT Adaptive algorithm for minimizing clou...2014 IEEE JAVA CLOUD COMPUTING PROJECT Adaptive algorithm for minimizing clou...
2014 IEEE JAVA CLOUD COMPUTING PROJECT Adaptive algorithm for minimizing clou...
 
Design_Support_Cloud_Application_Redistribution
Design_Support_Cloud_Application_RedistributionDesign_Support_Cloud_Application_Redistribution
Design_Support_Cloud_Application_Redistribution
 
Dynamic_Cloud_Application_Redistribution_Performance_Optimization
Dynamic_Cloud_Application_Redistribution_Performance_OptimizationDynamic_Cloud_Application_Redistribution_Performance_Optimization
Dynamic_Cloud_Application_Redistribution_Performance_Optimization
 
Privacy preserving public auditing for regenerating code based cloud storage
Privacy preserving public auditing for regenerating code based cloud storagePrivacy preserving public auditing for regenerating code based cloud storage
Privacy preserving public auditing for regenerating code based cloud storage
 
AIST Super Green Cloud: lessons learned from the operation and the performanc...
AIST Super Green Cloud: lessons learned from the operation and the performanc...AIST Super Green Cloud: lessons learned from the operation and the performanc...
AIST Super Green Cloud: lessons learned from the operation and the performanc...
 
Managing and Deploying High Performance Computing Clusters using Windows HPC ...
Managing and Deploying High Performance Computing Clusters using Windows HPC ...Managing and Deploying High Performance Computing Clusters using Windows HPC ...
Managing and Deploying High Performance Computing Clusters using Windows HPC ...
 
OS for AI: Elastic Microservices & the Next Gen of ML
OS for AI: Elastic Microservices & the Next Gen of MLOS for AI: Elastic Microservices & the Next Gen of ML
OS for AI: Elastic Microservices & the Next Gen of ML
 
Providing user security guarantees in public infrastructure clouds
Providing user security guarantees in public infrastructure cloudsProviding user security guarantees in public infrastructure clouds
Providing user security guarantees in public infrastructure clouds
 
BDW16 London - William Vambenepe, Google - 3rd Generation Data Platform
BDW16 London - William Vambenepe, Google - 3rd Generation Data PlatformBDW16 London - William Vambenepe, Google - 3rd Generation Data Platform
BDW16 London - William Vambenepe, Google - 3rd Generation Data Platform
 
Path to continuous delivery
Path to continuous deliveryPath to continuous delivery
Path to continuous delivery
 
Why AIOps Matters For Kubernetes
Why AIOps Matters For KubernetesWhy AIOps Matters For Kubernetes
Why AIOps Matters For Kubernetes
 
MongoDB World 2018: MongoDB for High Volume Time Series Data Streams
MongoDB World 2018: MongoDB for High Volume Time Series Data StreamsMongoDB World 2018: MongoDB for High Volume Time Series Data Streams
MongoDB World 2018: MongoDB for High Volume Time Series Data Streams
 
Automated ML Workflow for Distributed Big Data Using Analytics Zoo (CVPR2020 ...
Automated ML Workflow for Distributed Big Data Using Analytics Zoo (CVPR2020 ...Automated ML Workflow for Distributed Big Data Using Analytics Zoo (CVPR2020 ...
Automated ML Workflow for Distributed Big Data Using Analytics Zoo (CVPR2020 ...
 
Foundstone scq cypherpath
Foundstone scq cypherpathFoundstone scq cypherpath
Foundstone scq cypherpath
 

Recently uploaded

Into the Box 2024 - Keynote Day 2 Slides.pdf
Into the Box 2024 - Keynote Day 2 Slides.pdfInto the Box 2024 - Keynote Day 2 Slides.pdf
Into the Box 2024 - Keynote Day 2 Slides.pdf
Ortus Solutions, Corp
 
Accelerate Enterprise Software Engineering with Platformless
Accelerate Enterprise Software Engineering with PlatformlessAccelerate Enterprise Software Engineering with Platformless
Accelerate Enterprise Software Engineering with Platformless
WSO2
 
Quarkus Hidden and Forbidden Extensions
Quarkus Hidden and Forbidden ExtensionsQuarkus Hidden and Forbidden Extensions
Quarkus Hidden and Forbidden Extensions
Max Andersen
 
Vitthal Shirke Microservices Resume Montevideo
Vitthal Shirke Microservices Resume MontevideoVitthal Shirke Microservices Resume Montevideo
Vitthal Shirke Microservices Resume Montevideo
Vitthal Shirke
 
Globus Compute wth IRI Workflows - GlobusWorld 2024
Globus Compute wth IRI Workflows - GlobusWorld 2024Globus Compute wth IRI Workflows - GlobusWorld 2024
Globus Compute wth IRI Workflows - GlobusWorld 2024
Globus
 
Advanced Flow Concepts Every Developer Should Know
Advanced Flow Concepts Every Developer Should KnowAdvanced Flow Concepts Every Developer Should Know
Advanced Flow Concepts Every Developer Should Know
Peter Caitens
 
Why React Native as a Strategic Advantage for Startup Innovation.pdf
Why React Native as a Strategic Advantage for Startup Innovation.pdfWhy React Native as a Strategic Advantage for Startup Innovation.pdf
Why React Native as a Strategic Advantage for Startup Innovation.pdf
ayushiqss
 
2024 RoOUG Security model for the cloud.pptx
2024 RoOUG Security model for the cloud.pptx2024 RoOUG Security model for the cloud.pptx
2024 RoOUG Security model for the cloud.pptx
Georgi Kodinov
 
Exploring Innovations in Data Repository Solutions - Insights from the U.S. G...
Exploring Innovations in Data Repository Solutions - Insights from the U.S. G...Exploring Innovations in Data Repository Solutions - Insights from the U.S. G...
Exploring Innovations in Data Repository Solutions - Insights from the U.S. G...
Globus
 
Dominate Social Media with TubeTrivia AI’s Addictive Quiz Videos.pdf
Dominate Social Media with TubeTrivia AI’s Addictive Quiz Videos.pdfDominate Social Media with TubeTrivia AI’s Addictive Quiz Videos.pdf
Dominate Social Media with TubeTrivia AI’s Addictive Quiz Videos.pdf
AMB-Review
 
Innovating Inference - Remote Triggering of Large Language Models on HPC Clus...
Innovating Inference - Remote Triggering of Large Language Models on HPC Clus...Innovating Inference - Remote Triggering of Large Language Models on HPC Clus...
Innovating Inference - Remote Triggering of Large Language Models on HPC Clus...
Globus
 
Enhancing Research Orchestration Capabilities at ORNL.pdf
Enhancing Research Orchestration Capabilities at ORNL.pdfEnhancing Research Orchestration Capabilities at ORNL.pdf
Enhancing Research Orchestration Capabilities at ORNL.pdf
Globus
 
BoxLang: Review our Visionary Licenses of 2024
BoxLang: Review our Visionary Licenses of 2024BoxLang: Review our Visionary Licenses of 2024
BoxLang: Review our Visionary Licenses of 2024
Ortus Solutions, Corp
 
In 2015, I used to write extensions for Joomla, WordPress, phpBB3, etc and I ...
In 2015, I used to write extensions for Joomla, WordPress, phpBB3, etc and I ...In 2015, I used to write extensions for Joomla, WordPress, phpBB3, etc and I ...
In 2015, I used to write extensions for Joomla, WordPress, phpBB3, etc and I ...
Juraj Vysvader
 
Explore Modern SharePoint Templates for 2024
Explore Modern SharePoint Templates for 2024Explore Modern SharePoint Templates for 2024
Explore Modern SharePoint Templates for 2024
Sharepoint Designs
 
Strategies for Successful Data Migration Tools.pptx
Strategies for Successful Data Migration Tools.pptxStrategies for Successful Data Migration Tools.pptx
Strategies for Successful Data Migration Tools.pptx
varshanayak241
 
Corporate Management | Session 3 of 3 | Tendenci AMS
Corporate Management | Session 3 of 3 | Tendenci AMSCorporate Management | Session 3 of 3 | Tendenci AMS
Corporate Management | Session 3 of 3 | Tendenci AMS
Tendenci - The Open Source AMS (Association Management Software)
 
Field Employee Tracking System| MiTrack App| Best Employee Tracking Solution|...
Field Employee Tracking System| MiTrack App| Best Employee Tracking Solution|...Field Employee Tracking System| MiTrack App| Best Employee Tracking Solution|...
Field Employee Tracking System| MiTrack App| Best Employee Tracking Solution|...
informapgpstrackings
 
WSO2Con2024 - WSO2's IAM Vision: Identity-Led Digital Transformation
WSO2Con2024 - WSO2's IAM Vision: Identity-Led Digital TransformationWSO2Con2024 - WSO2's IAM Vision: Identity-Led Digital Transformation
WSO2Con2024 - WSO2's IAM Vision: Identity-Led Digital Transformation
WSO2
 
A Comprehensive Look at Generative AI in Retail App Testing.pdf
A Comprehensive Look at Generative AI in Retail App Testing.pdfA Comprehensive Look at Generative AI in Retail App Testing.pdf
A Comprehensive Look at Generative AI in Retail App Testing.pdf
kalichargn70th171
 

Recently uploaded (20)

Into the Box 2024 - Keynote Day 2 Slides.pdf
Into the Box 2024 - Keynote Day 2 Slides.pdfInto the Box 2024 - Keynote Day 2 Slides.pdf
Into the Box 2024 - Keynote Day 2 Slides.pdf
 
Accelerate Enterprise Software Engineering with Platformless
Accelerate Enterprise Software Engineering with PlatformlessAccelerate Enterprise Software Engineering with Platformless
Accelerate Enterprise Software Engineering with Platformless
 
Quarkus Hidden and Forbidden Extensions
Quarkus Hidden and Forbidden ExtensionsQuarkus Hidden and Forbidden Extensions
Quarkus Hidden and Forbidden Extensions
 
Vitthal Shirke Microservices Resume Montevideo
Vitthal Shirke Microservices Resume MontevideoVitthal Shirke Microservices Resume Montevideo
Vitthal Shirke Microservices Resume Montevideo
 
Globus Compute wth IRI Workflows - GlobusWorld 2024
Globus Compute wth IRI Workflows - GlobusWorld 2024Globus Compute wth IRI Workflows - GlobusWorld 2024
Globus Compute wth IRI Workflows - GlobusWorld 2024
 
Advanced Flow Concepts Every Developer Should Know
Advanced Flow Concepts Every Developer Should KnowAdvanced Flow Concepts Every Developer Should Know
Advanced Flow Concepts Every Developer Should Know
 
Why React Native as a Strategic Advantage for Startup Innovation.pdf
Why React Native as a Strategic Advantage for Startup Innovation.pdfWhy React Native as a Strategic Advantage for Startup Innovation.pdf
Why React Native as a Strategic Advantage for Startup Innovation.pdf
 
2024 RoOUG Security model for the cloud.pptx
2024 RoOUG Security model for the cloud.pptx2024 RoOUG Security model for the cloud.pptx
2024 RoOUG Security model for the cloud.pptx
 
Exploring Innovations in Data Repository Solutions - Insights from the U.S. G...
Exploring Innovations in Data Repository Solutions - Insights from the U.S. G...Exploring Innovations in Data Repository Solutions - Insights from the U.S. G...
Exploring Innovations in Data Repository Solutions - Insights from the U.S. G...
 
Dominate Social Media with TubeTrivia AI’s Addictive Quiz Videos.pdf
Dominate Social Media with TubeTrivia AI’s Addictive Quiz Videos.pdfDominate Social Media with TubeTrivia AI’s Addictive Quiz Videos.pdf
Dominate Social Media with TubeTrivia AI’s Addictive Quiz Videos.pdf
 
Innovating Inference - Remote Triggering of Large Language Models on HPC Clus...
Innovating Inference - Remote Triggering of Large Language Models on HPC Clus...Innovating Inference - Remote Triggering of Large Language Models on HPC Clus...
Innovating Inference - Remote Triggering of Large Language Models on HPC Clus...
 
Enhancing Research Orchestration Capabilities at ORNL.pdf
Enhancing Research Orchestration Capabilities at ORNL.pdfEnhancing Research Orchestration Capabilities at ORNL.pdf
Enhancing Research Orchestration Capabilities at ORNL.pdf
 
BoxLang: Review our Visionary Licenses of 2024
BoxLang: Review our Visionary Licenses of 2024BoxLang: Review our Visionary Licenses of 2024
BoxLang: Review our Visionary Licenses of 2024
 
In 2015, I used to write extensions for Joomla, WordPress, phpBB3, etc and I ...
In 2015, I used to write extensions for Joomla, WordPress, phpBB3, etc and I ...In 2015, I used to write extensions for Joomla, WordPress, phpBB3, etc and I ...
In 2015, I used to write extensions for Joomla, WordPress, phpBB3, etc and I ...
 
Explore Modern SharePoint Templates for 2024
Explore Modern SharePoint Templates for 2024Explore Modern SharePoint Templates for 2024
Explore Modern SharePoint Templates for 2024
 
Strategies for Successful Data Migration Tools.pptx
Strategies for Successful Data Migration Tools.pptxStrategies for Successful Data Migration Tools.pptx
Strategies for Successful Data Migration Tools.pptx
 
Corporate Management | Session 3 of 3 | Tendenci AMS
Corporate Management | Session 3 of 3 | Tendenci AMSCorporate Management | Session 3 of 3 | Tendenci AMS
Corporate Management | Session 3 of 3 | Tendenci AMS
 
Field Employee Tracking System| MiTrack App| Best Employee Tracking Solution|...
Field Employee Tracking System| MiTrack App| Best Employee Tracking Solution|...Field Employee Tracking System| MiTrack App| Best Employee Tracking Solution|...
Field Employee Tracking System| MiTrack App| Best Employee Tracking Solution|...
 
WSO2Con2024 - WSO2's IAM Vision: Identity-Led Digital Transformation
WSO2Con2024 - WSO2's IAM Vision: Identity-Led Digital TransformationWSO2Con2024 - WSO2's IAM Vision: Identity-Led Digital Transformation
WSO2Con2024 - WSO2's IAM Vision: Identity-Led Digital Transformation
 
A Comprehensive Look at Generative AI in Retail App Testing.pdf
A Comprehensive Look at Generative AI in Retail App Testing.pdfA Comprehensive Look at Generative AI in Retail App Testing.pdf
A Comprehensive Look at Generative AI in Retail App Testing.pdf
 

Evaluating Caching Strategies for Cloud Data Access using an Enterprise Service Bus

  • 1. University of Stuttgart Universitätsstr. 38 70569 Stuttgart Germany Phone +49-711-685 88337 Fax +49-711-685 88472 Research Santiago Gómez Sáez, Vasilios Andrikopoulos, Frank Leymann, and Steve Strauch Institute of Architecture of Application Systems {gomez-saez, andrikopoulos, leymann, strauch}@iaas.uni-stuttgart.de Evaluating Caching Strategies for Cloud Data Access using an Enterprise Service Bus IEEE IC2E 2014
  • 2. Research © Santiago Gómez Sáez 2 Agenda  Motivating Scenario  CDASMix Architecture & Realization  Evaluation  Conclusion and Future Work
  • 3. 33 Research © Santiago Gómez Sáez Motivating Scenario Presentation Layer Application Business Layer SQL Data Access Layer SQL Data Access LayerCloud-Enabled Data Access Layer SQL Registry Public Cloud Public CloudTraditional Application Layers Deployment Models Assumptions  Database layer has already been migrated  Focus on Relational Databases
  • 4. 44 Research © Santiago Gómez Sáez CDASMix - Architecture Presentation Business Logic Resources Web Service API Configuration Registry Manager Tenant Registry Manager Service Registry Manager JBI Container Manager Service Assembly Manager Service Registry Database Cluster Configuration Registry Database JBI Container Instance Cluster Access Layer Web UI Tenant Registry Database Message Broker (1) Strauch et al.: Transparent Access to Relational Databases in the Cloud Using a Multi-tenant ESB. CLOSER’14 (2) ESBMT Project: www.iaas.uni-stuttgart.de/esbmt/
  • 5. 55 Research © Santiago Gómez Sáez CDASMix – Cloud Data Access ESB Instance OSGi Environment JBI Environment Standardized Interfaces for Service Engines Standardized Interfaces for Binding Components Normalized Message Router External Application SMX-Camel -mt MySQL Proxy SMX- Camel Camel cdasmixJDBC Backend Cloud Data Store Provider Legend Message Flow OSGi Component JBI Component NMR API Cache Cluster Instance 1Instance 1Instance 1 • Ehcache 2.6.0 • LRU, LFU & FIFO • Multi-tenancy Awareness
  • 6. 66 Research © Santiago Gómez Sáez Evaluation – Methodology & Data Set  Measure how caching mitigates the performance degradation when accessing data through CDASMix  Analyze the optimal cache eviction algorithm (in tandem with the MySQL instances)  Cache Hit rate in % and throughput in Req./s  TPC-H 1 GB data distributed in 8 tables  Discrete uniform (1/N) generated workload from 5 adapted TPC- H queries -> read intensive (2.5 MB per query) constituted by 100 queries from initial sample of 9 queries  Generated Load publicly available at https://santiago.studiforge.informatik.uni- stuttgart.de/svn/publications/IC2E14/queries4Load/generatedLoad 5-100.csv
  • 7. 77 Research © Santiago Gómez Sáez Evaluation Setup VM0 (Flexiscale) Apache JMeter 2.9 CDASMix MySQL 5.1 TPC-H Amazon RDS MySQL 5.1 instance VM1 (Amazon EC2) MySQL 5.1 D1D2 D3 E3 E2 E1 Legend Message Flow Measurement Point Throughput and Transfer Rate Built-in Cache E TPC-H TPC-H QueryGen.shload.csv MySQL & Ehcache cache size 16MB
  • 8. 88 Research © Santiago Gómez Sáez Evaluation – MySQL in IaaS Flexiscale & AWS EC2 Flexiscale AWS EC2 -51% -14% -10% +17% -39% +21% +30% +16%
  • 9. 99 Research © Santiago Gómez Sáez Evaluation – MySQL in IaaS Flexiscale & AWS EC2 Flexiscale AWS EC2 -51% +43% +46% +42% -39% +50% +53% +48%
  • 10. 1010 Research © Santiago Gómez Sáez Evaluation – MySQL in AWS RDS -93% -89% -89% -89%
  • 11. 1111 Research © Santiago Gómez Sáez Evaluation – MySQL in AWS RDS -93% +37% +37% +37%
  • 12. 1212 Research © Santiago Gómez Sáez Evaluation – CDASMix Cache Hit Ratio Flexiscale AWS EC2 AWS RDS
  • 13. 1313 Research © Santiago Gómez Sáez Conclusion & Future Work  Design and realization of CDASMix, a multi-tenant aware ESB solution that enables transparent data access  Caching support for ameliorating the performance  Evaluation based on  different database deployment scenarios  the utilization of different caching eviction algorithms  Extend CDASMix towards supporting PostgreSQL  CDASMix horizontal scalability & distributed caching  Further evaluation  + Caching Eviction Algorithms  Different workloads
  • 14. 14 Thanks for your attention!!

Editor's Notes

  1. 1- In the last years Cloud computing has become popular among IT organizations aiming to reduce its operational costs 2- Applications can be designed to run in the Cloud, or can be partially or completely migrated to the Cloud. 3- Focusing on the three layered application pattern, in other works we have focused on migrating the application data to the Cloud. Migrating the application data to the Cloud requires adaptations, e.g. rewiring to access the migrated to the Cloud databases. 4- In this work we target how to mitigate the performance degradation due to accessing the migrated to the Cloud data through CDASMix
  2. Contributions of this work: The design and realization of CDASMix, a multi-tenant aware ESB solution with caching support that enables transparent data access to databases both on-premise and off-premise.Design and realization of CDASMix, a multi-tenant aware ESB solution that enables transparent access to databases hosted on or off-premise A performance evaluation of our proposal, with the dual purpose of showing the impact of introducing CDASMix to the performance of the application, and identifying the optimal caching strategy for CDASMix for different deployment options across Cloud service providers. A set of initial findings stemming from this evaluation, that can be valuable for related efforts
  3. 1- Focus on the three layered application pattern proposed by Fowler 2- Data layer is subdivided into the data access layer and the database layer 3.1- Consider an application whose stack is completely hosted on-premise. 3.2- Data is partially or completely migrated to the Cloud, e.g. to Amazon RDS 3.3- The data Access layer must be adapted and rewired in order to access the migrated to the Cloud database 3.4- If we assume 3 different scenarios, e.g. data partially hosted between on-premise, and DBaaS or IaaS solutions, the data access layer must be aware of such locations towards retrieving the data from the different backend data sources. 3.5- Therefore, there is a need of a Cloud enabled data access layer able to redirect the data storage and retrieval requests to the different databases. 3.6- For example, being able to redirect SQL requests to data migrated to AWS RDS.
  4. Presentation layer: Extended to provide a larger amount of operations not only for multi-tenant aware administration and management, but also to enable the registration of the necessary information for routing requests between multiple backend data sources. Business Logic Layer: encapsulates the business logic of the ESBmt administration and management. Have extended to incorporate cloud data access awareness. Access layer: based on role based access control. Tenant and users access the system with an unique tenant id and user id. Tenant Registry Manager, Configuration Registry Manager, and Service Registry Manager: wrap the interaction functionalities with the persistent resources, tenant registry, configuration registry, and service registry. JBI Container Manager and Service Assembly Managers contain the necessary functionalities to interact with the JBI Container Cluster for deployment and undeployment of message adapters and transformers. Resources Layer: encapsulates the persistancy resources, and the resources which are managed and administered through the upper layers. ESB Instance Cluster: multiple ESB instaces which perform the tasks associated with ESB solutions, e.g. message routing and transformation. Each ESB instances can be seen as three main components: Message adapters, message processors, and a normalized message router. Tenant Registry: contains information related to the tenants and users, e.g. id, email, etc. Configuration Registry: contains information related to the configuration of each tenant, e.g. tenant operator permissions, used jbi clusters, quota for message adapters, etc. Service Registry: tenant’s services in the ESB cluster, as well as the configuration of each message adapter deployed to the ESB instance. Message broker is an intermediate component for communicating with the ESB instances based on topic subscription.
  5. - MySQL Proxy: OSGi and JBI compliant version of Java MySQL Proxy implementing native MySQL communication protocol, providing one endpoint - Caching: EhCache realizing Least Recently Used (LRU) caching policy and deleting cach records when SQL statements involve data modifications - NMR: enables integration of OSGi Proxy and NMR - SMX-Camel-mt: multi-tenancy, integration between JBI and Enterprise Integration Patterns provided by Apache Camel CamelcdasmixJDBC: dynamically connecting to backend data stores via corresponding database communication protocol JNDI: to register database connections in order to reduce latency when creating a database connection per user SMX-Camel: enables loading CamelcedasmixJDBC packages at runtime, e.g. updates for supporting a new backend data store or data service
  6. MySQL query cache uses the an improved LRU eviction algorithm incorporating a midpoint insertion strategy. Following a temporal storage based on lists, the list is divided into the most recently accessed and the oldest values which are less recently used. With this approach, the list contains blocks which are the most recently used. TPC-H benchmark is a database decision support benchmar which comprises a set of queries with a high degree of complexity that run over a large volume of data. 9 Adapted queries which are distributed among a workload constituted by 100 queries distributed with probability 1/9 Average of 10 Rounds per scenario
  7. RDS and EC2 m1.xlarge EC2 and a db.m1.xlarge instances Amazon instances in the EU zone.
  8. Refer the RDS results in the paper First comparison is based on the performance degradation Second comparison is based on how the degraded performance is mitigated by introducing the cache. In previous papers we identified the network latency as approx 3 % of the total throughput.
  9. Refer the RDS results in the paper First comparison is based on the performance degradation Second comparison is based on how the degraded performance is mitigated by introducing the cache.
  10. Performs better in Tandem with the Optimized LRU caching strategy implemented in MySQL, as it relies on the LRU. The combination of both provide better performance results.