In recent years, the energy efficiency of large-scale infrastructures has gained a lot of attention, as power consumption has become one of the most significant contributors to the operating costs of a data center and to its Total Cost of Ownership. Power consumption can be observed at different layers of the data center: from the overall power grid, down to each rack and each individual machine and system. Given the rise of application containers in cloud computing, it becomes more and more important to measure power consumption at the application level as well, where power-aware schedulers and orchestrators can optimize the execution of workloads not only from a performance perspective, but also by considering performance/power trade-offs. DEEP-mon is a novel monitoring tool able to measure power consumption and attribute it to each thread and application container running in the system, without any prior knowledge of the application's characteristics and without any kind of workload instrumentation. DEEP-mon aggregates data for threads, application containers and hosts with a negligible impact on the monitored system and on the running workloads.
The information obtained with DEEP-mon opens the way for a wide set of applications exploiting the capabilities offered by the monitoring tool, from power (and hence cost) metering of new software components deployed in the data center, to fine-grained power capping and power-aware scheduling and co-location.
DEEP-mon: Dynamic and Energy Efficient Power monitoring for container-based infrastructures
1. DEEP-mon: Dynamic and Energy Efficient Power
monitoring for container-based infrastructures
NECST Group Conference 2018 @ Facebook
06/01/2018
Rolando Brondolin, Tommaso Sardelli, Marco D. Santambrogio
<rolando.brondolin@polimi.it>
2. DEEP-mon at a glance
• DEEP-mon is an HT-aware, fine-grained power monitor for container-based
environments
- precise power attribution to containers
- instrumentation free: it watches workloads from the outside
- lightweight, with little overhead on the target workloads and systems
- scalable and distributed, to observe Kubernetes clusters
• Monitoring ingredients:
- container execution ← context switch tracing
- resource usage ← performance counters (cycles)*
- power consumption ← Intel RAPL

* cycle counts show a 99% correlation with CPU power usage
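Intel RAPL exposes cumulative energy counters; on Linux they are typically read from the powercap sysfs files (e.g. /sys/class/powercap/intel-rapl:0/energy_uj). The sketch below shows, under stated assumptions, how such a counter can be turned into an average power figure; the paths, names and wrap-around constant are illustrative, not taken from DEEP-mon:

```python
# Minimal RAPL energy reader sketch. Assumes the Linux powercap sysfs
# layout (e.g. /sys/class/powercap/intel-rapl:0/energy_uj); the actual
# maximum counter value is exposed in max_energy_range_uj per domain.
def read_energy_uj(path):
    """Read a cumulative RAPL energy counter, in microjoules."""
    with open(path) as f:
        return int(f.read().strip())

def power_watts(e_start_uj, e_end_uj, interval_s, max_energy_uj=2**32):
    """Average power over an interval, handling counter wrap-around."""
    delta = e_end_uj - e_start_uj
    if delta < 0:                      # counter wrapped during the interval
        delta += max_energy_uj
    return (delta / 1e6) / interval_s  # uJ -> J, then J/s = W
```

Two successive reads of the counter, divided by the elapsed time, give the socket-level power that is then split among threads.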
3. Anatomy of a power monitoring agent
[Architecture diagram]
user-space: DEEP-mon (power attribution, Docker and Kubernetes metrics), reporting to a monitoring back-end
kernel-space: kernel tracing (context switch via Linux CFS, PMC, Intel RAPL)
4. Anatomy of a power monitoring agent
[Same architecture diagram, annotated with the event rate]
user-space: DEEP-mon (power attribution, Docker and Kubernetes metrics), reporting to a monitoring back-end
kernel-space: kernel tracing (context switch via Linux CFS, PMC, Intel RAPL)
kernel tracing produces roughly 200K events/s toward DEEP-mon
5. Kernel level data acquisition (1)
• We cannot send each context switch to user-space
- too many events per second to process
- too much overhead
• Introduce in-kernel data aggregation:
eBPF and BCC: build, inject and execute code in an in-kernel VM
- trace context switches, count PMCs on the fly
- store data in eBPF data structures
- send one big event from the kernel to DEEP-mon instead of many small ones
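The actual aggregation runs inside the kernel as eBPF code; the following pure-Python sketch only mimics that logic (class and method names are illustrative), to show why batching counts in a map beats emitting one event per context switch:

```python
from collections import defaultdict

class ContextSwitchAggregator:
    """Mimics DEEP-mon's in-kernel aggregation: instead of emitting one
    event per context switch, accumulate per-(pid, cpu) cycle counts in
    a map (a stand-in for an eBPF hash map) and flush them in one batch."""

    def __init__(self):
        self.cycles = defaultdict(int)   # (pid, cpu) -> cycle count

    def on_context_switch(self, pid, cpu, cycles_delta):
        # In eBPF this would read a performance counter and update
        # a hash-map entry, entirely inside the kernel.
        self.cycles[(pid, cpu)] += cycles_delta

    def flush(self):
        # One big event to user-space instead of many small ones.
        batch = dict(self.cycles)
        self.cycles.clear()
        return batch
```

With 200K context switches per second, flushing the map at a fixed interval reduces the kernel-to-user traffic to one event per interval.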
6. Kernel level data acquisition (2)
eBPF is event-based: we leverage context switches to trigger PMC measurements
[Diagram] Example: 2 threads running on 2 HT cores
- on each context switch, store cycles (thread + core) data in a per-hyper-thread processor map (HT1 data, HT2 data)
- account cycles (alone + overlap) to thread1 and thread2 in a thread map
- account overlap cycles for the co-running thread (e.g. thread2)
Thread execution overlap:
power consumption of T1 + T2 is ≈1.1× w.r.t. T1 + idle
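The overlap accounting above can be expressed as a small weighting function. This is an illustrative sketch: the 1.1 HT ratio comes from the slide, and the rule, overlapping cycles weighted by the HT ratio and split evenly between the two co-running threads, follows the paper excerpt later in the deck:

```python
def weighted_cycles(cycles_alone, cycles_overlap, ht_ratio=1.1):
    """Effective cycles of a thread on an HT core: cycles run alone count
    fully; cycles overlapped with the sibling hyper-thread are weighted
    by the HT power ratio and divided by 2 to split them evenly between
    the two co-running threads."""
    return cycles_alone + (cycles_overlap * ht_ratio) / 2.0
```

These weighted cycles are what the power attribution formula later divides among threads.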
7. Correlate power and performance
• At fixed time intervals we collect the thread map
- extraction time depends on the number of context switches
• Then we extract the power measurement from RAPL and
account it to each thread:
[Diagram] eBPF output: the thread map (Thread1, Thread2, Thread3)
[Partially cut-off paper excerpt: benchmarks from NPB, with HT experiments pinning two threads to one physical core; runs on a Dell PowerEdge with a Xeon E5-2680 v2 (10 cores) under Ubuntu Linux; the first experiment shows that power consumption is ≈1.15 when two logical cores are mapped on the same physical core.]
Equation (1) computes a thread's weighted cycles as the cycles it ran alone plus the cycles of
execution periods in which the thread was co-running on the
same physical core via HT, weighted by the HT ratio and
divided by 2 to equally divide the overlapping cycles among
the two threads. In this context, an execution period is defined
as the time between context switches on the physical core
where the thread is scheduled.
Starting from Equation (1), we can now attribute the power
measured by RAPL for our thread T1 following Equation (2),
where |K| is the cardinality of the set K of threads running in
the server in a given period of time and |S| is the cardinality
of the set S of sockets in the system.
P_{T1}(t) = \sum_{s=0}^{|S|} \left( RAPL_{core}(t,s) \cdot \frac{Cycles_{TW_1}(t,s)}{\sum_{k=0}^{|K|} Cycles_{TW_k}(t,s)} \right) \qquad (2)
Starting from this result, the next sections will provide
details on how we implemented power attribution for each
thread and container running in the system.
In Equation (2): P_{T1} is the power of thread 1; the sum runs over all sockets; RAPL_{core}(t,s) is the RAPL measurement of socket s; the fraction is the thread's weight inside the socket power consumption.
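Equation (2) translates almost directly into code; a minimal sketch, where the data layout and names are illustrative assumptions:

```python
def thread_power(rapl_core, thread_cycles, thread_id):
    """Attribute socket power to one thread following Equation (2).

    rapl_core:     {socket: measured core power (W) from RAPL}
    thread_cycles: {socket: {thread_id: weighted cycles in the interval}}
    """
    power = 0.0
    for s, socket_power in rapl_core.items():
        total = sum(thread_cycles[s].values())
        if total > 0:
            # The thread's share of this socket's power is its fraction
            # of all weighted cycles executed on the socket.
            power += socket_power * thread_cycles[s].get(thread_id, 0) / total
    return power
```

Summing thread_power over all threads of a socket returns exactly the RAPL measurement, so no power is lost or double-counted.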
• Finally, we group each thread by its container
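Grouping threads by container can rely on the cgroup hierarchy: with the cgroup v1 Docker layout, a thread's /proc/&lt;pid&gt;/cgroup entries embed the container ID. A hedged sketch (the path format is an assumption about that layout; other runtimes and cgroup v2 differ):

```python
import re

def container_id_from_cgroup(cgroup_line):
    """Extract a Docker container ID from one /proc/<pid>/cgroup line,
    e.g. '12:cpu,cpuacct:/docker/<64-hex-id>'. Returns None for host
    threads. (cgroup v1 Docker layout assumed; other setups differ.)"""
    m = re.search(r'/docker/([0-9a-f]{64})', cgroup_line)
    return m.group(1) if m else None

def group_power_by_container(thread_power, thread_cgroup):
    """Sum per-thread power into per-container power; threads outside
    any container are grouped under 'host'."""
    totals = {}
    for tid, power in thread_power.items():
        cid = container_id_from_cgroup(thread_cgroup.get(tid, "")) or "host"
        totals[cid] = totals.get(cid, 0.0) + power
    return totals
```

The same per-container totals are what gets shipped to the back-end for cluster-level aggregation.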
8. Monitoring containers at scale
• Once power data is collected, we can send it to a back-end
on a regular basis
- further aggregation of metrics data
- Kubernetes cluster-level view
• The back-end exposes data for visualization and autonomic power management
9. Benchmarks
Cloud benchmarks: Phoronix Test Suite pts/apache, pts/nginx, pts/fio
HPC benchmarks: NAS Parallel Benchmarks EP, MG, CG
Experimental results
Evaluation goals: monitoring should introduce minimum overhead; we evaluated DEEP-mon w.r.t. its overhead on applications and on the target system.

Cloud benchmarks (network and syscall intensive): application overhead < 3.3%, power overhead 1.74% avg
HPC benchmarks (CPU and memory intensive): application overhead < 4%, power overhead 0.90% avg
10. Conclusion
• We saw the main aspects of power monitoring for Docker containers
- fine-grained and accurate monitoring
- low overhead power measurement
• Dynamic and Energy Efficient Power monitor (DEEP-mon)
- kernel-level performance data aggregation
- per-thread power attribution
- container aggregation
- Kubernetes cluster power visibility
11. Thanks for your attention
NECST Group Conference 2018 @ Facebook
06/01/2018
Rolando Brondolin, Tommaso Sardelli, Marco D. Santambrogio
<rolando.brondolin@polimi.it>
DEEP-mon: Dynamic and Energy Efficient Power
monitoring for container-based infrastructures