Linux System Monitoring with eBPF

Heinrich.Hartmann@Circonus.com
Linux System Monitoring with eBPF
DevOpsDays Kiel, 2018-05-16
Heinrich Hartmann

System Monitoring is about Kernel & Hardware

Best Practice: The USE Method
https://www.circonus.com/2017/08/system-monitoring-with-the-use-dashboard
CPU
Memory
Network
Disks
Utilization Saturation Errors

Lot’s of Unknowns remaining
https://www.circonus.com/2017/08/system-monitoring-with-the-use-dashboard
?
?
?
~
~ ~
CPU
Memory
Network
Disks
Utilization Saturation Errors

eBPF allows unparalleled insights
https://github.com/iovisor/bcc
Credits:
- Brendan Gregg @ Netflix (Sun)
- Sasha Goldshtein @ Sela, Microsoft
- Brenden Blanco @ VMWare
- Linus Torvalds, et. al.

CPU: Scheduling Latency

Disk: Block-I/O Latency

Disk: Block-I/O Latency over time

Don’t shout in the Datacenter
Brendan Gregg (2008) https://www.youtube.com/watch?v=tDacjrSCeq4

System Calls: The Kernel API
Monitor
Rate
Errors
Duration
System Call API

Syscalls: Rate / Count
sched_yield (2tn)
clock_time (1.5tn)
recvfrom (300bn)
394 Metrics

Syscalls: Duration
1
us
10
us

Syscall durations span >8 orders of magnitude
1s
100
ms
10
us 1.5 tn
events total

File System: Latency

Memory: Allocation Latency

Further Reading
Slides: @HeinrichHartman / #dodkiel18
Code: https://github.com/circonus-labs/nad/.../bccbpf
Blog: http://www.circonus.com/2018/05/linux-system-monitoring-with-ebpf/

Linux System Monitoring with eBPF

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Similar to Linux System Monitoring with eBPF

Similar to Linux System Monitoring with eBPF (20)

More from Heinrich Hartmann

More from Heinrich Hartmann (20)

Recently uploaded

Recently uploaded (20)

Linux System Monitoring with eBPF