Dynamic Fractional Resource Scheduling -- 2010, ARCS

Introduction Off-Line Problem On-Line Problem Summary Appendix
Dynamic Fractional Resource Scheduling
for Cluster Platforms
Mark Stillwell
Department of Information and Computer Sciences
University of Hawai’i at M¯anoa
Achievement Rewards for College Scientists
2010 Scholarship Awards Program
Mark Stillwell UH M¯anoa ICS
DFRS for Clusters

Clusters
Deﬁnition
A cluster is a group of independent computers, or nodes,
working together closely, usually connected by a high-speed
network.
DFRS for Clusters

Jobs
The system can accept user requests to run jobs or the
administrator can instantiate jobs directly
Running jobs are made up of nearly identical tasks
The number of tasks is speciﬁed by user/administrator
Tasks can block while communicating with each other
The assignment of resources to tasks is called scheduling
DFRS for Clusters

Current Approaches
Service Hosting (Off-Line scheduling)
traditionally, dedicated machines
current interest in server consolidation
few good theoretical models
heavy “engineering” bias
High-Performance Computing (On-Line Scheduling)
Usually First-Come-First-Served (FCFS) with backfilling
Backfilling needs (unreliable) compute time estimates
Unbounded wait times
Inefficient use of nodes/resources
DFRS for Clusters

Our Proposal
Use virtual machine technology.
Multiple tasks on one node
Performance isolation
Sharing of fractional resources
Deﬁne a run-time computable metric that captures notions
of performance and fairness.
Design heuristics that allocate resources to jobs while
explicitly trying to achieve high ratings by our metric.
DFRS for Clusters

Requirements, Needs, and Yield
Tasks have memory requirements and CPU needs
All tasks of a job have the same requirements and needs
For a task to be placed on a node there must be memory
available at least equal to its requirements
A task can be allocated less CPU than its need, and the
ratio of the allocation to the need is the yield
All tasks of a job must have the same yield, so we can also
speak of the yield of a job
The yield of a job gives its performance relative to if it were
run on a dedicated system
DFRS for Clusters

Off-Line Problem Assumptions
Steady-state execution with inﬁnite jobs
Models an ideal service hosting environment
Makes the problem more tractable [Marchal et al., 2006]
Good when job duration longer than schedule time
Jobs are serial (single task)
DFRS for Clusters

Objective Function
The performance of a job is correlated to its yield
Maximizing the average or sum of the yields may be unfair
to some jobs
Instead, we seek to maximize the minimum yield
DFRS for Clusters

Task Placement Heuristics
Greedy Task Placement – Incremental, puts each task on
the node with the lowest computational load on which it
can ﬁt without violating memory constraints
Randomized Rounding Task Placement – Based on
relaxing an MILP to an LP and rounding probabilistically
MCB Task Placement – Global, iteratively applies
multi-capacity (vector) bin-packing heuristics during a
binary search for the maximized minimum yield
DFRS for Clusters

Large Problem Set: Minimum Yield vs. Free Memory
0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9
0.3
0.35
0.4
0.45
0.5
0.55
Slack
MinimumYield
bound
MCB8
GR
GB
SG
SGB
RRND
RRNZ
DFRS for Clusters

Conclusions
The problem is tractable
Multi-capacity bin packing algorithms perform well
Greedy algorithms perform okay, and are very fast
Randomized rounding approaches are not very promising
DFRS for Clusters

On-Line Problem Assumptions
Job arrival/completion times are not known in advance
Jobs are temporary
The user wants a ﬁnal result
Quick turnaround relative to runtime is desired
Jobs are not interactive
So can wait until resources are available to start
We avoid the use of runtime estimates
DFRS for Clusters

Stretch
Our goal: minimize maximum stretch (aka slowdown)
Stretch: the time a job spends in the system divided by the
time that would be spent in a dedicated
system [Bender et al., 1998]
Popular to quantify schedule quality post-mortem
Not generally used to make scheduling decisions
Runtime computation requires (unreliable) user estimates.
Minimizing average stretch prone to starvation
Minimizing maximum stretch captures notions of both
performance and fairness.
Our approach: try to maximize minimum yield
Similar, but not the same, as minimizing maximum stretch
DFRS for Clusters

Max Stretch Degradation vs. Load, no migration cost
1
10
100
1000
0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9
StretchDegradationFactor
Load
EASY
FCFS
GREEDY
GREEDYP
GREEDYPM
DynMCB8
MCB8P 600
MCB8PG 600
DFRS for Clusters

Conclusions
DFRS approaches can signiﬁcantly outperform traditional
approaches
Aggressive repacking can lead to much better resource
allocations
But also to heavy migration costs
Greedy migration is not that useful
DFRS for Clusters

Summary
We have proposed a novel approach to job scheduling on
clusters, Dynamic Fractional Resource Scheduling, that
makes use of modern virtual machine technology and
seeks to optimize a runtime-computable, user-centric
measure of performance called the minimum yield
Multi-capacity bin packing heuristics can be used to ﬁnd
good solutions
Our approach avoids the use of unreliable runtime
estimates
This approach has the potential to lead to
order-of-magnitude improvements in performance over
current technology
DFRS for Clusters

References I
Bender, M. A., Chakrabarti, S., and Muthukrishnan, S.
(1998).
Flow and Stretch Metrics for Scheduling Continuous Job
Streams.
In Proc. of the 9th ACM-SIAM Symp. On Discrete
Algorithms, pages 270–279.
Marchal, L., Yang, Y., Casanova, H., and Robert, Y. (2006).
Steady-state scheduling of multiple divisible load
applications on wide-area distributed computing platforms.
Intl. J. of High Performance Computing Applications,
20(3):365–381.
DFRS for Clusters

References II
Stillwell, M., Shanzenbach, D., Vivien, F., and Casanova,
H. (2009).
Resource Allocation using Virtual Clusters.
In CCGrid, pages 260–267. IEEE.
Stillwell, M., Vivien, F., and Casanova, H. (2010).
Dynamic fractional resource scheduling for HPC workloads.
In IPDPS.
to appear.
DFRS for Clusters

Dynamic Fractional Resource Scheduling -- 2010, ARCS

Recommended

Recommended

More Related Content

Viewers also liked

Viewers also liked (9)

Similar to Dynamic Fractional Resource Scheduling -- 2010, ARCS

Similar to Dynamic Fractional Resource Scheduling -- 2010, ARCS (20)

Recently uploaded

Recently uploaded (20)

Dynamic Fractional Resource Scheduling -- 2010, ARCS