What they looking for?
What is Multi Data Center?
Managing Multi DCs
Modeling the System
Cloud Computing, has become crucial for the externalization of IT resources for
business, organizations and people.
“everything as a service”
(plataform, infrastructure and service)
Providers want in turn to optimize the use of the resources they have deployed
with their own metrics
Factors to be optimized
• came from servicing the clients of the hosted
web-services with reasonable Quality of
• operational costs for the infrastructure
Consolidation - Set the maximum number of services in the least viable amount
of hosting machines, so the number of on-line machines and resources is
Virtualization technology has made consolidation easier,
2. WHAT THEY LOOKING FOR?
“Build a model to automate (AC) an improve the process of achieve
allocation of virtualized web-services, using a Machine Learning (ML)
and Data Mining, to predict behavior and select “policies” to be applied
in a multi-DC”
Energy Saving in Cloud Self-management
3. WHAT IS MULTI DATA CENTER?
• Its a Networking of Data Centers (DCs) interconected
Must be considerate
Energy cost at diferent locations
Modularity between inter-DC relations an
4. MANAGING MULTI DCS
Multi-DataCenter Business Model
SLA (Service Level Agreement)
ensure the agreed QoS for
de VM, while minimizing the
cost by reducing the
5. MODELING THE SYSTEM
In this case
Quality of Service = Response Time
(monitoring PM resources
and adjusting VM placements
Machine Learning + Data Mining
Around the world
Predict behavior and
Scheduling the VM
Across de DC networks
5. MODELING THE SYSTEM
The Machine Learning decisive
Questions to predict
How good will each VM behave?
How much CPU/Mem/IO… will each
Optimizing the schedule and management of multi-DC systems requires balancing
several factors, like economic revenues, Quality of Service and operational costs
such as energy.
Using virtualization technology is presented a model to solve a multi-DC
scheduling problem which balances and optimizes the economic factors above.
A few issues for future study are:
How decide which VMs are excluded from inter-DC scheduling or which PMs are offered
as host candidates for scheduling;
The inclusion of more operational costs (networking, bandwidth management,etc.)
The green energy into the scheme and the environmental impact of computation.
The use of online learning methods to make the system react quickly to changes
(application behavior, hardware or middleware changes, or workload characteristics