SlideShare a Scribd company logo
1 of 32
Download to read offline
Refining the Estimation of
the Available Bandwidth in Inter-Cloud
Links for Task Scheduling
Thiago A. L. Genez, Luiz F. Bittencourt,
Nelson L. S. da Fonseca, Edmundo R. M. Madeira
Institute of Computing (IC)
University of Campinas (UNICAMP)
Campinas, SP, Brazil
December 10, 2014
IEEE GLOBECOM 2014
1 / 22
Outline
Introduction
Related Works
Procedure for Deflating Estimates of the Available Bandwidth
Evaluation
Final Considerations
2 / 22
Introduction
Workflow Scheduling Problem in Hybrid Clouds
Peak demand time:
• Private resources → overloaded or insufficient
• Hybrid Cloud: Public resources + private resources
What are the advantages of using public clouds?
• Elasticity
• Pay-as-you-go basis
Workflow scheduling problem
3 / 22
Introduction
Current schedulers
Not designed to cope with imprecise information
Produce schedules without taking into account the variability of the
available bandwidth in inter-cloud links
Available bandwidth can increase or decrease at the running time
Application execution can lead
• Violation of deadlines
4 / 22
Introduction
Purpose of this work
How to reduce the negative impact of imprecise information about the
inter-cloud available bandwidth on the production of schedules by a
scheduler that was not designed to address with such imprecise
information?
Challenge
Use the original scheduling algorithm
Proposed Mechanism
Deflating the estimate of the inter-cloud available bandwidth based on
the expected imprecision of such estimate and provide a deflated
bandwidth estimate as an input to the scheduler
5 / 22
Introduction
Purpose of this work
How to reduce the negative impact of imprecise information about the
inter-cloud available bandwidth on the production of schedules by a
scheduler that was not designed to address with such imprecise
information?
Challenge
Use the original scheduling algorithm
Proposed Mechanism
Deflating the estimate of the inter-cloud available bandwidth based on
the expected imprecision of such estimate and provide a deflated
bandwidth estimate as an input to the scheduler
5 / 22
Introduction
Purpose of this work
How to reduce the negative impact of imprecise information about the
inter-cloud available bandwidth on the production of schedules by a
scheduler that was not designed to address with such imprecise
information?
Challenge
Use the original scheduling algorithm
Proposed Mechanism
Deflating the estimate of the inter-cloud available bandwidth based on
the expected imprecision of such estimate and provide a deflated
bandwidth estimate as an input to the scheduler
5 / 22
Outline
Introduction
Related Works
Procedure for Deflating Estimates of the Available Bandwidth
Evaluation
Final Considerations
6 / 22
Related Works
Rahman et al.
– Performance of the network of the Amazon EC2
(2010)
– Analysis of the packets delay of VMs to/from Amazon EC2
– Large delay variations
– Negatively impact the performance of scientific applications
Batista et al. – Describe tools for estimating available bandwidth
2010 – Produce estimations with large variability
8 / 22
Outline
Introduction
Related Works
Procedure for Deflating Estimates of the Available Bandwidth
Evaluation
Final Considerations
9 / 22
Procedure for Deflating Estimates of the Available
Bandwidth in Inter-cloud Links
Available bandwidth
estimation tool
Scheduler
Estimate of the
Available Bandwidth
Hybrid Cloud
Application workflow
and
Deadline Value
Schedule
10 / 22
Procedure for Deflating Estimates of the Available
Bandwidth in Inter-cloud Links
Available bandwidth
estimation tool
Scheduler
Estimate of
the Available
Bandwidth
Expected
uncertainty
value
Hybrid Cloud
Procedure
Deflated
Available
Bandwidth
Estimate
Application workflow
and
Deadline Value
Schedule
10 / 22
Procedure for Deflating Estimates of the Available
Bandwidth in Inter-cloud Links
Procedure
History of past executions of the target workflow
When a workflow is about to be scheduled
1. Estimate of the available bandwidth
2. Expected uncertainty value
3. Query the history of past executions of the target workflow
4. Calculates the deflating factor U
U = 10 ⇒ 90% of the estimate of the available bandwidth
Schedule produced is based on the expected uncertainty of the
estimate of available bandwidth in inter-cloud links
11 / 22
Procedure for Deflating Estimates of the Available
Bandwidth in Inter-cloud Links
Procedure
History of past executions of the target workflow
When a workflow is about to be scheduled
1. Estimate of the available bandwidth
2. Expected uncertainty value
3. Query the history of past executions of the target workflow
4. Calculates the deflating factor U
U = 10 ⇒ 90% of the estimate of the available bandwidth
Schedule produced is based on the expected uncertainty of the
estimate of available bandwidth in inter-cloud links
11 / 22
Procedure for Deflating Estimates of the Available
Bandwidth in Inter-cloud Links
Procedure
History of past executions of the target workflow
When a workflow is about to be scheduled
1. Estimate of the available bandwidth
2. Expected uncertainty value
3. Query the history of past executions of the target workflow
4. Calculates the deflating factor U
U = 10 ⇒ 90% of the estimate of the available bandwidth
Schedule produced is based on the expected uncertainty of the
estimate of available bandwidth in inter-cloud links
11 / 22
Procedure for Deflating Estimates of the Available
Bandwidth in Inter-cloud Links
Database
Available bandwidth
estimation tool
Scheduler
Observed
Available
Bandwidth
value
Expected
uncertainty
value
Hybrid Cloud
Procedure
Deflated
Available
Bandwidth
Application workflow
and
Deadline Value
Schedule
12 / 22
Procedure for Deflating Estimates of the Available
Bandwidth in Inter-cloud Links
Database
Available bandwidth
estimation tool
Scheduler
Estimate of
the Available
Bandwidth
Expected
uncertainty
value
Hybrid Cloud
Procedure
Deflated
Available
Bandwidth
Application workflow
and
Deadline Value
Schedule
12 / 22
Procedure for Deflating Estimates of the Available
Bandwidth in Inter-cloud Links
Database
Available bandwidth
estimation tool
Scheduler
Estimate of
the Available
Bandwidth
Expected
uncertainty
value
Hybrid Cloud
Procedure
Deflated
Available
Bandwidth
Application workflow
and
Deadline Value
Schedule
12 / 22
Procedure for Deflating Estimates of the Available
Bandwidth in Inter-cloud Links
Database
Available bandwidth
estimation tool
Scheduler
Estimate of
the Available
Bandwidth
Expected
uncertainty
value
Hybrid Cloud
Procedure
Deflated
Available
Bandwidth
Application workflow
and
Deadline Value
Schedule
12 / 22
Procedure for Deflating Estimates of the Available
Bandwidth in Inter-cloud Links
Database
Available bandwidth
estimation tool
Scheduler
Estimate of
the Available
Bandwidth
Expected
uncertainty
value
Hybrid Cloud
Procedure
Deflated
Available
Bandwidth
Application workflow
and
Deadline Value
Schedule
12 / 22
Procedure for Deflating Estimates of the Available
Bandwidth in Inter-cloud Links
Database
Available bandwidth
estimation tool
Scheduler
Estimate of
the Available
Bandwidth
Expected
uncertainty
value
Hybrid Cloud
Procedure
Deflated
Available
Bandwidth
Application workflow
and
Deadline Value
Schedule
Untouched Qualifed
Solution
12 / 22
Procedure for Deflating Estimates of the Available
Bandwidth in Inter-cloud Links
Computation of the Deflating factor U for the Target
Workflow
Multiple Linear Regression: f(x, y) = ax + by + c
• x: Current estimate of the available bandwidth
• y: Current expected uncertainty
• Deflating factor U = f(x, y)
Computation of the coefficients a, b and c
Target workflow G: dataset HG
• 5-tuple hi = bw, p, U, errorm
G , error$
G
Subset Hk ⊆ HG
• For each pair (bw, p) in HG
• bw, p, Um
and bw, p, U$
are added into Hk
Subset Hk is used by the multiple linear regression
13 / 22
Outline
Introduction
Related Works
Procedure for Deflating Estimates of the Available Bandwidth
Evaluation
Final Considerations
14 / 22
Evaluation
Experimental Parameters
Scheduler
• HCOC scheduling algorithm
Hybrid Cloud Scenario
• 1 private cloud and 2 public clouds
• Inter-cloud bandwidths of 10 to 60 Mbps
• Intra-cloud bandwidths of 1 Gbps
Simulator
• Estimates the makespan and cost of the execution of the workflow
15 / 22
Evaluation
Scheduler
DAX
File
VMs
File
Schedule Simulator
Reduction factor Uncertain
Makespan
and
Cost ($)
Available
Bandwidth
Makespan
and
Cost ($)
U p
b
Database
16 / 22
Evaluation
Experimental Steps
1. History of execution was created
• Fixed bandwidth deflating factors U ∈ {0, 25, 50}
• p varying from 45% to 99%
• 100 simulations
2. Multiple linear regression (MLR) procedure
• f(x, y) = ax + by + c
• Employs using 50% and 100% of the dataset
3. Use the equation f(x, y) to calculate the deflating factor U
• 100 simulations
17 / 22
Evaluation
40
50
60
70
80
90
100
0 45 50 60 70 80 90 100
%ofqualifiedsolutions
Uncertainty p
Montage DAG
U=0
U=25
U=50
MLR 50%
MLR 100%
Inter-cloud available bandwidth of 60Mbps
D = Tmax × 3/7
18 / 22
Evaluation
25
30
35
40
45
50
55
60
0 45 50 60 70 80 90 100
Averagemakespanestimation
Uncertainty p
Montage DAG
U=0
U=25
U=50
MLR 50%
MLR 100%
Inter-cloud available bandwidth of 60Mbps
D = Tmax × 3/7
19 / 22
Outline
Introduction
Related Works
Procedure for Deflating Estimates of the Available Bandwidth
Evaluation
Final Considerations
20 / 22
Final Considerations
Conclusion
Current scheduler
• Estimated available bandwidth is precise at the scheduling time
• Produce inefficient scheduling decisions
• Missing deadlines, increasing costs and makespan more than expected
The proposed procedure
• Deflates the estimate of the available bandwidth in inter-cloud links
• Multiple linear regression approach
• Increases the number of qualified solutions
21 / 22
Final Considerations
Conclusion
Current scheduler
• Estimated available bandwidth is precise at the scheduling time
• Produce inefficient scheduling decisions
• Missing deadlines, increasing costs and makespan more than expected
The proposed procedure
• Deflates the estimate of the available bandwidth in inter-cloud links
• Multiple linear regression approach
• Increases the number of qualified solutions
21 / 22
Thank You!
Questions?
thiagogenez@ic.unicamp.br
Acknowledgment:
grant #2014/08607-4
Sao Paulo Research Foundation
22 / 22

More Related Content

What's hot

Master Thesis Presentation
Master Thesis PresentationMaster Thesis Presentation
Master Thesis PresentationEhab Qadah
 
A Framework for Protecting Worker Location Privacy in Spatial Crowdsourcing
A Framework for Protecting Worker Location Privacy in Spatial CrowdsourcingA Framework for Protecting Worker Location Privacy in Spatial Crowdsourcing
A Framework for Protecting Worker Location Privacy in Spatial CrowdsourcingUniversity of Southern California
 
Balman dissertation Copyright @ 2010 Mehmet Balman
Balman dissertation Copyright @ 2010 Mehmet BalmanBalman dissertation Copyright @ 2010 Mehmet Balman
Balman dissertation Copyright @ 2010 Mehmet Balmanbalmanme
 
DYNAMIC TASK SCHEDULING BASED ON BURST TIME REQUIREMENT FOR CLOUD ENVIRONMENT
DYNAMIC TASK SCHEDULING BASED ON BURST TIME REQUIREMENT FOR CLOUD ENVIRONMENTDYNAMIC TASK SCHEDULING BASED ON BURST TIME REQUIREMENT FOR CLOUD ENVIRONMENT
DYNAMIC TASK SCHEDULING BASED ON BURST TIME REQUIREMENT FOR CLOUD ENVIRONMENTIJCNCJournal
 
QoS-Aware Data Replication for Data-Intensive Applications in Cloud Computing...
QoS-Aware Data Replication for Data-Intensive Applications in Cloud Computing...QoS-Aware Data Replication for Data-Intensive Applications in Cloud Computing...
QoS-Aware Data Replication for Data-Intensive Applications in Cloud Computing...Papitha Velumani
 
A location based least-cost scheduling for data-intensive applications
A location based least-cost scheduling for data-intensive applicationsA location based least-cost scheduling for data-intensive applications
A location based least-cost scheduling for data-intensive applicationsIAEME Publication
 
Data Retrieval Scheduling For Unsynchronized Channel in Wireless Broadcast Sy...
Data Retrieval Scheduling For Unsynchronized Channel in Wireless Broadcast Sy...Data Retrieval Scheduling For Unsynchronized Channel in Wireless Broadcast Sy...
Data Retrieval Scheduling For Unsynchronized Channel in Wireless Broadcast Sy...IJERA Editor
 
Challenges in Dynamic Resource Allocation and Task Scheduling in Heterogeneou...
Challenges in Dynamic Resource Allocation and Task Scheduling in Heterogeneou...Challenges in Dynamic Resource Allocation and Task Scheduling in Heterogeneou...
Challenges in Dynamic Resource Allocation and Task Scheduling in Heterogeneou...rahulmonikasharma
 
Benchmarking Elastic Cloud Big Data Services under SLA Constraints
Benchmarking Elastic Cloud Big Data Services under SLA ConstraintsBenchmarking Elastic Cloud Big Data Services under SLA Constraints
Benchmarking Elastic Cloud Big Data Services under SLA ConstraintsNicolas Poggi
 
Energy efficient reverse skyline query processing over wireless sensor networks
Energy efficient reverse skyline query processing over wireless sensor networksEnergy efficient reverse skyline query processing over wireless sensor networks
Energy efficient reverse skyline query processing over wireless sensor networksFinalyear Projects
 
Modeling and Optimization of Resource Allocation in Cloud [PhD Thesis Progres...
Modeling and Optimization of Resource Allocation in Cloud [PhD Thesis Progres...Modeling and Optimization of Resource Allocation in Cloud [PhD Thesis Progres...
Modeling and Optimization of Resource Allocation in Cloud [PhD Thesis Progres...AtakanAral
 
Ieeepro techno solutions 2014 ieee java project - cloud bandwidth and cost ...
Ieeepro techno solutions   2014 ieee java project - cloud bandwidth and cost ...Ieeepro techno solutions   2014 ieee java project - cloud bandwidth and cost ...
Ieeepro techno solutions 2014 ieee java project - cloud bandwidth and cost ...hemanthbbc
 
Ieeepro techno solutions 2014 ieee java project - deadline based resource p...
Ieeepro techno solutions   2014 ieee java project - deadline based resource p...Ieeepro techno solutions   2014 ieee java project - deadline based resource p...
Ieeepro techno solutions 2014 ieee java project - deadline based resource p...hemanthbbc
 
Resource Mapping Optimization for Distributed Cloud Services - PhD Thesis Def...
Resource Mapping Optimization for Distributed Cloud Services - PhD Thesis Def...Resource Mapping Optimization for Distributed Cloud Services - PhD Thesis Def...
Resource Mapping Optimization for Distributed Cloud Services - PhD Thesis Def...AtakanAral
 
Time Efficient VM Allocation using KD-Tree Approach in Cloud Server Environment
Time Efficient VM Allocation using KD-Tree Approach in Cloud Server EnvironmentTime Efficient VM Allocation using KD-Tree Approach in Cloud Server Environment
Time Efficient VM Allocation using KD-Tree Approach in Cloud Server Environmentrahulmonikasharma
 
accessible-streaming-algorithms
accessible-streaming-algorithmsaccessible-streaming-algorithms
accessible-streaming-algorithmsFarhan Zaki
 

What's hot (19)

Master Thesis Presentation
Master Thesis PresentationMaster Thesis Presentation
Master Thesis Presentation
 
A Framework for Protecting Worker Location Privacy in Spatial Crowdsourcing
A Framework for Protecting Worker Location Privacy in Spatial CrowdsourcingA Framework for Protecting Worker Location Privacy in Spatial Crowdsourcing
A Framework for Protecting Worker Location Privacy in Spatial Crowdsourcing
 
Balman dissertation Copyright @ 2010 Mehmet Balman
Balman dissertation Copyright @ 2010 Mehmet BalmanBalman dissertation Copyright @ 2010 Mehmet Balman
Balman dissertation Copyright @ 2010 Mehmet Balman
 
Resource management
Resource managementResource management
Resource management
 
Cloud Computing and PSo
Cloud Computing and PSoCloud Computing and PSo
Cloud Computing and PSo
 
DYNAMIC TASK SCHEDULING BASED ON BURST TIME REQUIREMENT FOR CLOUD ENVIRONMENT
DYNAMIC TASK SCHEDULING BASED ON BURST TIME REQUIREMENT FOR CLOUD ENVIRONMENTDYNAMIC TASK SCHEDULING BASED ON BURST TIME REQUIREMENT FOR CLOUD ENVIRONMENT
DYNAMIC TASK SCHEDULING BASED ON BURST TIME REQUIREMENT FOR CLOUD ENVIRONMENT
 
QoS-Aware Data Replication for Data-Intensive Applications in Cloud Computing...
QoS-Aware Data Replication for Data-Intensive Applications in Cloud Computing...QoS-Aware Data Replication for Data-Intensive Applications in Cloud Computing...
QoS-Aware Data Replication for Data-Intensive Applications in Cloud Computing...
 
A location based least-cost scheduling for data-intensive applications
A location based least-cost scheduling for data-intensive applicationsA location based least-cost scheduling for data-intensive applications
A location based least-cost scheduling for data-intensive applications
 
Data Retrieval Scheduling For Unsynchronized Channel in Wireless Broadcast Sy...
Data Retrieval Scheduling For Unsynchronized Channel in Wireless Broadcast Sy...Data Retrieval Scheduling For Unsynchronized Channel in Wireless Broadcast Sy...
Data Retrieval Scheduling For Unsynchronized Channel in Wireless Broadcast Sy...
 
[IJET-V1I5P2] Authors :Hind HazzaAlsharif , Razan Hamza Bawareth
[IJET-V1I5P2] Authors :Hind HazzaAlsharif , Razan Hamza Bawareth[IJET-V1I5P2] Authors :Hind HazzaAlsharif , Razan Hamza Bawareth
[IJET-V1I5P2] Authors :Hind HazzaAlsharif , Razan Hamza Bawareth
 
Challenges in Dynamic Resource Allocation and Task Scheduling in Heterogeneou...
Challenges in Dynamic Resource Allocation and Task Scheduling in Heterogeneou...Challenges in Dynamic Resource Allocation and Task Scheduling in Heterogeneou...
Challenges in Dynamic Resource Allocation and Task Scheduling in Heterogeneou...
 
Benchmarking Elastic Cloud Big Data Services under SLA Constraints
Benchmarking Elastic Cloud Big Data Services under SLA ConstraintsBenchmarking Elastic Cloud Big Data Services under SLA Constraints
Benchmarking Elastic Cloud Big Data Services under SLA Constraints
 
Energy efficient reverse skyline query processing over wireless sensor networks
Energy efficient reverse skyline query processing over wireless sensor networksEnergy efficient reverse skyline query processing over wireless sensor networks
Energy efficient reverse skyline query processing over wireless sensor networks
 
Modeling and Optimization of Resource Allocation in Cloud [PhD Thesis Progres...
Modeling and Optimization of Resource Allocation in Cloud [PhD Thesis Progres...Modeling and Optimization of Resource Allocation in Cloud [PhD Thesis Progres...
Modeling and Optimization of Resource Allocation in Cloud [PhD Thesis Progres...
 
Ieeepro techno solutions 2014 ieee java project - cloud bandwidth and cost ...
Ieeepro techno solutions   2014 ieee java project - cloud bandwidth and cost ...Ieeepro techno solutions   2014 ieee java project - cloud bandwidth and cost ...
Ieeepro techno solutions 2014 ieee java project - cloud bandwidth and cost ...
 
Ieeepro techno solutions 2014 ieee java project - deadline based resource p...
Ieeepro techno solutions   2014 ieee java project - deadline based resource p...Ieeepro techno solutions   2014 ieee java project - deadline based resource p...
Ieeepro techno solutions 2014 ieee java project - deadline based resource p...
 
Resource Mapping Optimization for Distributed Cloud Services - PhD Thesis Def...
Resource Mapping Optimization for Distributed Cloud Services - PhD Thesis Def...Resource Mapping Optimization for Distributed Cloud Services - PhD Thesis Def...
Resource Mapping Optimization for Distributed Cloud Services - PhD Thesis Def...
 
Time Efficient VM Allocation using KD-Tree Approach in Cloud Server Environment
Time Efficient VM Allocation using KD-Tree Approach in Cloud Server EnvironmentTime Efficient VM Allocation using KD-Tree Approach in Cloud Server Environment
Time Efficient VM Allocation using KD-Tree Approach in Cloud Server Environment
 
accessible-streaming-algorithms
accessible-streaming-algorithmsaccessible-streaming-algorithms
accessible-streaming-algorithms
 

Similar to Refining the Estimation of the Available Bandwidth in Inter-Cloud Links for Task SchedulingPresentation

Extended summary of "Cloudy with a chance of short RTTs Analyzing Cloud Conne...
Extended summary of "Cloudy with a chance of short RTTs Analyzing Cloud Conne...Extended summary of "Cloudy with a chance of short RTTs Analyzing Cloud Conne...
Extended summary of "Cloudy with a chance of short RTTs Analyzing Cloud Conne...IsabellaFilippo
 
Modeling and Optimization of Resource Allocation in Cloud [PhD Thesis Proposal]
Modeling and Optimization of Resource Allocation in Cloud [PhD Thesis Proposal]Modeling and Optimization of Resource Allocation in Cloud [PhD Thesis Proposal]
Modeling and Optimization of Resource Allocation in Cloud [PhD Thesis Proposal]AtakanAral
 
Presented by Ahmed Abdulhakim Al-Absi - Scaling map reduce applications acro...
Presented by Ahmed Abdulhakim Al-Absi -  Scaling map reduce applications acro...Presented by Ahmed Abdulhakim Al-Absi -  Scaling map reduce applications acro...
Presented by Ahmed Abdulhakim Al-Absi - Scaling map reduce applications acro...Absi Ahmed
 
JAVA 2013 IEEE DATAMINING PROJECT Distributed web systems performance forecas...
JAVA 2013 IEEE DATAMINING PROJECT Distributed web systems performance forecas...JAVA 2013 IEEE DATAMINING PROJECT Distributed web systems performance forecas...
JAVA 2013 IEEE DATAMINING PROJECT Distributed web systems performance forecas...IEEEGLOBALSOFTTECHNOLOGIES
 
Distributed web systems performance forecasting
Distributed web systems performance forecastingDistributed web systems performance forecasting
Distributed web systems performance forecastingIEEEFINALYEARPROJECTS
 
PERFORMANCE FACTORS OF CLOUD COMPUTING DATA CENTERS USING [(M/G/1) : (∞/GDM O...
PERFORMANCE FACTORS OF CLOUD COMPUTING DATA CENTERS USING [(M/G/1) : (∞/GDM O...PERFORMANCE FACTORS OF CLOUD COMPUTING DATA CENTERS USING [(M/G/1) : (∞/GDM O...
PERFORMANCE FACTORS OF CLOUD COMPUTING DATA CENTERS USING [(M/G/1) : (∞/GDM O...ijgca
 
CONTEXT-AWARE DECISION MAKING SYSTEM FOR MOBILE CLOUD OFFLOADING
CONTEXT-AWARE DECISION MAKING SYSTEM FOR MOBILE CLOUD OFFLOADINGCONTEXT-AWARE DECISION MAKING SYSTEM FOR MOBILE CLOUD OFFLOADING
CONTEXT-AWARE DECISION MAKING SYSTEM FOR MOBILE CLOUD OFFLOADINGIJCNCJournal
 
Ieeepro techno solutions 2014 ieee dotnet project - deadline based resource...
Ieeepro techno solutions   2014 ieee dotnet project - deadline based resource...Ieeepro techno solutions   2014 ieee dotnet project - deadline based resource...
Ieeepro techno solutions 2014 ieee dotnet project - deadline based resource...ASAITHAMBIRAJAA
 
Ieeepro techno solutions 2014 ieee dotnet project - deadline based resource...
Ieeepro techno solutions   2014 ieee dotnet project - deadline based resource...Ieeepro techno solutions   2014 ieee dotnet project - deadline based resource...
Ieeepro techno solutions 2014 ieee dotnet project - deadline based resource...ASAITHAMBIRAJAA
 
COST-EFFECTIVE LOW-DELAY DESIGN FOR MULTI-PARTY CLOUD VIDEO CONFERENCING
 COST-EFFECTIVE LOW-DELAY DESIGN FOR MULTI-PARTY CLOUD VIDEO CONFERENCING COST-EFFECTIVE LOW-DELAY DESIGN FOR MULTI-PARTY CLOUD VIDEO CONFERENCING
COST-EFFECTIVE LOW-DELAY DESIGN FOR MULTI-PARTY CLOUD VIDEO CONFERENCINGnexgentechnology
 
Principles of Computing Resources Planning in Cloud-Based Problem Solving Env...
Principles of Computing Resources Planning in Cloud-Based Problem Solving Env...Principles of Computing Resources Planning in Cloud-Based Problem Solving Env...
Principles of Computing Resources Planning in Cloud-Based Problem Solving Env...Ural-PDC
 
Kickstart your Kafka with Faker Data | Francesco Tisiot, Aiven.io
Kickstart your Kafka with Faker Data | Francesco Tisiot, Aiven.ioKickstart your Kafka with Faker Data | Francesco Tisiot, Aiven.io
Kickstart your Kafka with Faker Data | Francesco Tisiot, Aiven.ioHostedbyConfluent
 
Download-manuals-surface water-manual-45howtoreviewmonitoringnetworks
 Download-manuals-surface water-manual-45howtoreviewmonitoringnetworks Download-manuals-surface water-manual-45howtoreviewmonitoringnetworks
Download-manuals-surface water-manual-45howtoreviewmonitoringnetworkshydrologyproject001
 
Download-manuals-surface water-manual-45howtoreviewmonitoringnetworks
 Download-manuals-surface water-manual-45howtoreviewmonitoringnetworks Download-manuals-surface water-manual-45howtoreviewmonitoringnetworks
Download-manuals-surface water-manual-45howtoreviewmonitoringnetworkshydrologywebsite1
 
Paper sharing_resource optimization scheduling and allocation for hierarchica...
Paper sharing_resource optimization scheduling and allocation for hierarchica...Paper sharing_resource optimization scheduling and allocation for hierarchica...
Paper sharing_resource optimization scheduling and allocation for hierarchica...YOU SHENG CHEN
 
IRJET- Improving Data Availability by using VPC Strategy in Cloud Environ...
IRJET-  	  Improving Data Availability by using VPC Strategy in Cloud Environ...IRJET-  	  Improving Data Availability by using VPC Strategy in Cloud Environ...
IRJET- Improving Data Availability by using VPC Strategy in Cloud Environ...IRJET Journal
 

Similar to Refining the Estimation of the Available Bandwidth in Inter-Cloud Links for Task SchedulingPresentation (20)

Extended summary of "Cloudy with a chance of short RTTs Analyzing Cloud Conne...
Extended summary of "Cloudy with a chance of short RTTs Analyzing Cloud Conne...Extended summary of "Cloudy with a chance of short RTTs Analyzing Cloud Conne...
Extended summary of "Cloudy with a chance of short RTTs Analyzing Cloud Conne...
 
Distributed Web System Performance Improving Forecasting Accuracy
Distributed Web System Performance Improving Forecasting  AccuracyDistributed Web System Performance Improving Forecasting  Accuracy
Distributed Web System Performance Improving Forecasting Accuracy
 
Modeling and Optimization of Resource Allocation in Cloud [PhD Thesis Proposal]
Modeling and Optimization of Resource Allocation in Cloud [PhD Thesis Proposal]Modeling and Optimization of Resource Allocation in Cloud [PhD Thesis Proposal]
Modeling and Optimization of Resource Allocation in Cloud [PhD Thesis Proposal]
 
Presented by Ahmed Abdulhakim Al-Absi - Scaling map reduce applications acro...
Presented by Ahmed Abdulhakim Al-Absi -  Scaling map reduce applications acro...Presented by Ahmed Abdulhakim Al-Absi -  Scaling map reduce applications acro...
Presented by Ahmed Abdulhakim Al-Absi - Scaling map reduce applications acro...
 
JAVA 2013 IEEE DATAMINING PROJECT Distributed web systems performance forecas...
JAVA 2013 IEEE DATAMINING PROJECT Distributed web systems performance forecas...JAVA 2013 IEEE DATAMINING PROJECT Distributed web systems performance forecas...
JAVA 2013 IEEE DATAMINING PROJECT Distributed web systems performance forecas...
 
Distributed web systems performance forecasting
Distributed web systems performance forecastingDistributed web systems performance forecasting
Distributed web systems performance forecasting
 
PERFORMANCE FACTORS OF CLOUD COMPUTING DATA CENTERS USING [(M/G/1) : (∞/GDM O...
PERFORMANCE FACTORS OF CLOUD COMPUTING DATA CENTERS USING [(M/G/1) : (∞/GDM O...PERFORMANCE FACTORS OF CLOUD COMPUTING DATA CENTERS USING [(M/G/1) : (∞/GDM O...
PERFORMANCE FACTORS OF CLOUD COMPUTING DATA CENTERS USING [(M/G/1) : (∞/GDM O...
 
D04573033
D04573033D04573033
D04573033
 
CONTEXT-AWARE DECISION MAKING SYSTEM FOR MOBILE CLOUD OFFLOADING
CONTEXT-AWARE DECISION MAKING SYSTEM FOR MOBILE CLOUD OFFLOADINGCONTEXT-AWARE DECISION MAKING SYSTEM FOR MOBILE CLOUD OFFLOADING
CONTEXT-AWARE DECISION MAKING SYSTEM FOR MOBILE CLOUD OFFLOADING
 
Ieeepro techno solutions 2014 ieee dotnet project - deadline based resource...
Ieeepro techno solutions   2014 ieee dotnet project - deadline based resource...Ieeepro techno solutions   2014 ieee dotnet project - deadline based resource...
Ieeepro techno solutions 2014 ieee dotnet project - deadline based resource...
 
Ieeepro techno solutions 2014 ieee dotnet project - deadline based resource...
Ieeepro techno solutions   2014 ieee dotnet project - deadline based resource...Ieeepro techno solutions   2014 ieee dotnet project - deadline based resource...
Ieeepro techno solutions 2014 ieee dotnet project - deadline based resource...
 
COST-EFFECTIVE LOW-DELAY DESIGN FOR MULTI-PARTY CLOUD VIDEO CONFERENCING
 COST-EFFECTIVE LOW-DELAY DESIGN FOR MULTI-PARTY CLOUD VIDEO CONFERENCING COST-EFFECTIVE LOW-DELAY DESIGN FOR MULTI-PARTY CLOUD VIDEO CONFERENCING
COST-EFFECTIVE LOW-DELAY DESIGN FOR MULTI-PARTY CLOUD VIDEO CONFERENCING
 
Principles of Computing Resources Planning in Cloud-Based Problem Solving Env...
Principles of Computing Resources Planning in Cloud-Based Problem Solving Env...Principles of Computing Resources Planning in Cloud-Based Problem Solving Env...
Principles of Computing Resources Planning in Cloud-Based Problem Solving Env...
 
Kickstart your Kafka with Faker Data | Francesco Tisiot, Aiven.io
Kickstart your Kafka with Faker Data | Francesco Tisiot, Aiven.ioKickstart your Kafka with Faker Data | Francesco Tisiot, Aiven.io
Kickstart your Kafka with Faker Data | Francesco Tisiot, Aiven.io
 
Download-manuals-surface water-manual-45howtoreviewmonitoringnetworks
 Download-manuals-surface water-manual-45howtoreviewmonitoringnetworks Download-manuals-surface water-manual-45howtoreviewmonitoringnetworks
Download-manuals-surface water-manual-45howtoreviewmonitoringnetworks
 
Download-manuals-surface water-manual-45howtoreviewmonitoringnetworks
 Download-manuals-surface water-manual-45howtoreviewmonitoringnetworks Download-manuals-surface water-manual-45howtoreviewmonitoringnetworks
Download-manuals-surface water-manual-45howtoreviewmonitoringnetworks
 
Resisting skew accumulation
Resisting skew accumulationResisting skew accumulation
Resisting skew accumulation
 
Fault tolerance on cloud computing
Fault tolerance on cloud computingFault tolerance on cloud computing
Fault tolerance on cloud computing
 
Paper sharing_resource optimization scheduling and allocation for hierarchica...
Paper sharing_resource optimization scheduling and allocation for hierarchica...Paper sharing_resource optimization scheduling and allocation for hierarchica...
Paper sharing_resource optimization scheduling and allocation for hierarchica...
 
IRJET- Improving Data Availability by using VPC Strategy in Cloud Environ...
IRJET-  	  Improving Data Availability by using VPC Strategy in Cloud Environ...IRJET-  	  Improving Data Availability by using VPC Strategy in Cloud Environ...
IRJET- Improving Data Availability by using VPC Strategy in Cloud Environ...
 

Refining the Estimation of the Available Bandwidth in Inter-Cloud Links for Task SchedulingPresentation

  • 1. Refining the Estimation of the Available Bandwidth in Inter-Cloud Links for Task Scheduling Thiago A. L. Genez, Luiz F. Bittencourt, Nelson L. S. da Fonseca, Edmundo R. M. Madeira Institute of Computing (IC) University of Campinas (UNICAMP) Campinas, SP, Brazil December 10, 2014 IEEE GLOBECOM 2014 1 / 22
  • 2. Outline Introduction Related Works Procedure for Deflating Estimates of the Available Bandwidth Evaluation Final Considerations 2 / 22
  • 3. Introduction Workflow Scheduling Problem in Hybrid Clouds Peak demand time: • Private resources → overloaded or insufficient • Hybrid Cloud: Public resources + private resources What are the advantages of using public clouds? • Elasticity • Pay-as-you-go basis Workflow scheduling problem 3 / 22
  • 4. Introduction Current schedulers Not designed to cope with imprecise information Produce schedules without taking into account the variability of the available bandwidth in inter-cloud links Available bandwidth can increase or decrease at the running time Application execution can lead • Violation of deadlines 4 / 22
  • 5. Introduction Purpose of this work How to reduce the negative impact of imprecise information about the inter-cloud available bandwidth on the production of schedules by a scheduler that was not designed to address with such imprecise information? Challenge Use the original scheduling algorithm Proposed Mechanism Deflating the estimate of the inter-cloud available bandwidth based on the expected imprecision of such estimate and provide a deflated bandwidth estimate as an input to the scheduler 5 / 22
  • 6. Introduction Purpose of this work How to reduce the negative impact of imprecise information about the inter-cloud available bandwidth on the production of schedules by a scheduler that was not designed to address with such imprecise information? Challenge Use the original scheduling algorithm Proposed Mechanism Deflating the estimate of the inter-cloud available bandwidth based on the expected imprecision of such estimate and provide a deflated bandwidth estimate as an input to the scheduler 5 / 22
  • 7. Introduction Purpose of this work How to reduce the negative impact of imprecise information about the inter-cloud available bandwidth on the production of schedules by a scheduler that was not designed to address with such imprecise information? Challenge Use the original scheduling algorithm Proposed Mechanism Deflating the estimate of the inter-cloud available bandwidth based on the expected imprecision of such estimate and provide a deflated bandwidth estimate as an input to the scheduler 5 / 22
  • 8. Outline Introduction Related Works Procedure for Deflating Estimates of the Available Bandwidth Evaluation Final Considerations 6 / 22
  • 9. Related Works Rahman et al. – Performance of the network of the Amazon EC2 (2010) – Analysis of the packets delay of VMs to/from Amazon EC2 – Large delay variations – Negatively impact the performance of scientific applications Batista et al. – Describe tools for estimating available bandwidth 2010 – Produce estimations with large variability 8 / 22
  • 10. Outline Introduction Related Works Procedure for Deflating Estimates of the Available Bandwidth Evaluation Final Considerations 9 / 22
  • 11. Procedure for Deflating Estimates of the Available Bandwidth in Inter-cloud Links Available bandwidth estimation tool Scheduler Estimate of the Available Bandwidth Hybrid Cloud Application workflow and Deadline Value Schedule 10 / 22
  • 12. Procedure for Deflating Estimates of the Available Bandwidth in Inter-cloud Links Available bandwidth estimation tool Scheduler Estimate of the Available Bandwidth Expected uncertainty value Hybrid Cloud Procedure Deflated Available Bandwidth Estimate Application workflow and Deadline Value Schedule 10 / 22
  • 13. Procedure for Deflating Estimates of the Available Bandwidth in Inter-cloud Links Procedure History of past executions of the target workflow When a workflow is about to be scheduled 1. Estimate of the available bandwidth 2. Expected uncertainty value 3. Query the history of past executions of the target workflow 4. Calculates the deflating factor U U = 10 ⇒ 90% of the estimate of the available bandwidth Schedule produced is based on the expected uncertainty of the estimate of available bandwidth in inter-cloud links 11 / 22
  • 14. Procedure for Deflating Estimates of the Available Bandwidth in Inter-cloud Links Procedure History of past executions of the target workflow When a workflow is about to be scheduled 1. Estimate of the available bandwidth 2. Expected uncertainty value 3. Query the history of past executions of the target workflow 4. Calculates the deflating factor U U = 10 ⇒ 90% of the estimate of the available bandwidth Schedule produced is based on the expected uncertainty of the estimate of available bandwidth in inter-cloud links 11 / 22
  • 15. Procedure for Deflating Estimates of the Available Bandwidth in Inter-cloud Links Procedure History of past executions of the target workflow When a workflow is about to be scheduled 1. Estimate of the available bandwidth 2. Expected uncertainty value 3. Query the history of past executions of the target workflow 4. Calculates the deflating factor U U = 10 ⇒ 90% of the estimate of the available bandwidth Schedule produced is based on the expected uncertainty of the estimate of available bandwidth in inter-cloud links 11 / 22
  • 16. Procedure for Deflating Estimates of the Available Bandwidth in Inter-cloud Links Database Available bandwidth estimation tool Scheduler Observed Available Bandwidth value Expected uncertainty value Hybrid Cloud Procedure Deflated Available Bandwidth Application workflow and Deadline Value Schedule 12 / 22
  • 17. Procedure for Deflating Estimates of the Available Bandwidth in Inter-cloud Links Database Available bandwidth estimation tool Scheduler Estimate of the Available Bandwidth Expected uncertainty value Hybrid Cloud Procedure Deflated Available Bandwidth Application workflow and Deadline Value Schedule 12 / 22
  • 18. Procedure for Deflating Estimates of the Available Bandwidth in Inter-cloud Links Database Available bandwidth estimation tool Scheduler Estimate of the Available Bandwidth Expected uncertainty value Hybrid Cloud Procedure Deflated Available Bandwidth Application workflow and Deadline Value Schedule 12 / 22
  • 19. Procedure for Deflating Estimates of the Available Bandwidth in Inter-cloud Links Database Available bandwidth estimation tool Scheduler Estimate of the Available Bandwidth Expected uncertainty value Hybrid Cloud Procedure Deflated Available Bandwidth Application workflow and Deadline Value Schedule 12 / 22
  • 20. Procedure for Deflating Estimates of the Available Bandwidth in Inter-cloud Links Database Available bandwidth estimation tool Scheduler Estimate of the Available Bandwidth Expected uncertainty value Hybrid Cloud Procedure Deflated Available Bandwidth Application workflow and Deadline Value Schedule 12 / 22
  • 21. Procedure for Deflating Estimates of the Available Bandwidth in Inter-cloud Links Database Available bandwidth estimation tool Scheduler Estimate of the Available Bandwidth Expected uncertainty value Hybrid Cloud Procedure Deflated Available Bandwidth Application workflow and Deadline Value Schedule Untouched Qualifed Solution 12 / 22
  • 22. Procedure for Deflating Estimates of the Available Bandwidth in Inter-cloud Links Computation of the Deflating factor U for the Target Workflow Multiple Linear Regression: f(x, y) = ax + by + c • x: Current estimate of the available bandwidth • y: Current expected uncertainty • Deflating factor U = f(x, y) Computation of the coefficients a, b and c Target workflow G: dataset HG • 5-tuple hi = bw, p, U, errorm G , error$ G Subset Hk ⊆ HG • For each pair (bw, p) in HG • bw, p, Um and bw, p, U$ are added into Hk Subset Hk is used by the multiple linear regression 13 / 22
  • 23. Outline Introduction Related Works Procedure for Deflating Estimates of the Available Bandwidth Evaluation Final Considerations 14 / 22
  • 24. Evaluation Experimental Parameters Scheduler • HCOC scheduling algorithm Hybrid Cloud Scenario • 1 private cloud and 2 public clouds • Inter-cloud bandwidths of 10 to 60 Mbps • Intra-cloud bandwidths of 1 Gbps Simulator • Estimates the makespan and cost of the execution of the workflow 15 / 22
  • 25. Evaluation Scheduler DAX File VMs File Schedule Simulator Reduction factor Uncertain Makespan and Cost ($) Available Bandwidth Makespan and Cost ($) U p b Database 16 / 22
  • 26. Evaluation Experimental Steps 1. History of execution was created • Fixed bandwidth deflating factors U ∈ {0, 25, 50} • p varying from 45% to 99% • 100 simulations 2. Multiple linear regression (MLR) procedure • f(x, y) = ax + by + c • Employs using 50% and 100% of the dataset 3. Use the equation f(x, y) to calculate the deflating factor U • 100 simulations 17 / 22
  • 27. Evaluation 40 50 60 70 80 90 100 0 45 50 60 70 80 90 100 %ofqualifiedsolutions Uncertainty p Montage DAG U=0 U=25 U=50 MLR 50% MLR 100% Inter-cloud available bandwidth of 60Mbps D = Tmax × 3/7 18 / 22
  • 28. Evaluation 25 30 35 40 45 50 55 60 0 45 50 60 70 80 90 100 Averagemakespanestimation Uncertainty p Montage DAG U=0 U=25 U=50 MLR 50% MLR 100% Inter-cloud available bandwidth of 60Mbps D = Tmax × 3/7 19 / 22
  • 29. Outline Introduction Related Works Procedure for Deflating Estimates of the Available Bandwidth Evaluation Final Considerations 20 / 22
  • 30. Final Considerations Conclusion Current scheduler • Estimated available bandwidth is precise at the scheduling time • Produce inefficient scheduling decisions • Missing deadlines, increasing costs and makespan more than expected The proposed procedure • Deflates the estimate of the available bandwidth in inter-cloud links • Multiple linear regression approach • Increases the number of qualified solutions 21 / 22
  • 31. Final Considerations Conclusion Current scheduler • Estimated available bandwidth is precise at the scheduling time • Produce inefficient scheduling decisions • Missing deadlines, increasing costs and makespan more than expected The proposed procedure • Deflates the estimate of the available bandwidth in inter-cloud links • Multiple linear regression approach • Increases the number of qualified solutions 21 / 22