Building a KVM-based Hypervisor for a Heterogeneous System Architecture Compliant System

National Chiao Tung University & National Tsing Hua University & National Taiwan University
Yu-Ju Huang, Hsuan-Heng Wu, Yeh-Ching Chung, Wei-Chung Hsu

Agenda
• Motivation
• Background
• HSA features
• AMD’s implementation on Kaveri, the HSA-compliant platform
• Design and Implementation
• Evaluation
• Conclusion

Motivation
• Problem of heterogeneous computing
• Data communication between CPU & GPU
• Inefficiency
• Programmability inconvenience
• Heterogeneous System Architecture (HSA)
• Developed by HSA Foundation
• Goal
• Improving computation efficiency for heterogeneous computing
• Reducing programmability barrier
• Let virtual machines benefit from HSA as well
[Figure: a hypervisor running on HSA hardware hosts two guest OSes, each running applications that want to use HSA.]

HSA Features
• Shared virtual memory
• I/O page faulting
• User-level queueing
• Memory based signaling
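[Figure: shared virtual memory. Before HSA, the CPU and the GPU each have their own memory and data must be copied between them; with HSA, the CPU and the GPU share one virtual memory on top of the same physical memory.]
[Figure: user-level queueing. Before HSA, an application's queues reach the GPU through the operating system and the GPU driver; with HSA, the application's queues go directly to the GPU.]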

Shared Virtual Memory - IOMMU
• The process page table is set in the IOMMU, which carries out virtual-to-physical address translation
• The CPU and the GPU share the same process page table
[Figure: the CPU (through the MMU) and the GPU (through the IOMMU) access system memory using the same process page table.]
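As a minimal sketch of the idea on this slide, the following Python toy model lets a CPU-side MMU and a GPU-side IOMMU translate through one shared process page table. The `Translator` class, the 4 KiB page size, and the dictionary-based page table are illustrative assumptions, not the Kaveri hardware interface.

```python
PAGE_SIZE = 4096  # assumed page size for this toy model


class Translator:
    """Toy MMU/IOMMU: looks up virtual page -> physical frame in a page table."""

    def __init__(self, name, page_table):
        self.name = name
        self.page_table = page_table  # shared by reference, not copied

    def translate(self, vaddr):
        vpn, offset = divmod(vaddr, PAGE_SIZE)
        frame = self.page_table[vpn]  # a KeyError here models a page fault
        return frame * PAGE_SIZE + offset


# One process page table, used by both translators.
process_page_table = {0x10: 0x2A, 0x11: 0x07}
mmu = Translator("MMU", process_page_table)
iommu = Translator("IOMMU", process_page_table)

# A pointer produced on the CPU resolves to the same physical address on the GPU.
ptr = 0x10 * PAGE_SIZE + 0x123
cpu_phys = mmu.translate(ptr)
gpu_phys = iommu.translate(ptr)

# A mapping added later is visible to both sides at once.
process_page_table[0x12] = 0x33
```

Because both translators hold a reference to the same table, no data copy or pointer conversion is needed when work moves from the CPU to the GPU.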

I/O Page Faulting - PPR
• A PPR (peripheral page service request) is issued by the IOMMU as an interrupt
• PPR log entries contain the faulting process ID and the faulting address
• The get_user_pages API can be used to fix the page fault
[Figure: PPR handling between the IOMMU and the CPU in five numbered steps: PPR interrupt, call PPR handler, get PPR logs, fix page fault, COMPLETE command.]
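The five steps in the figure can be sketched as a small Python simulation. Everything here is hypothetical scaffolding: `PprEntry`, `ToyIommu`, and `resolve_fault` stand in for the IOMMU's PPR log, its command interface, and a `get_user_pages`-style fix-up. They are not the Linux or AMD IOMMU driver API.

```python
from collections import deque
from dataclasses import dataclass

PAGE_SIZE = 4096  # assumed page size


@dataclass
class PprEntry:
    process_id: int  # faulting process ID
    address: int     # faulting virtual address


class ToyIommu:
    def __init__(self):
        self.ppr_log = deque()
        self.completed = []

    def raise_ppr(self, process_id, address):
        # Step 1: the device faults and the IOMMU logs a request
        # (the interrupt itself is implied in this toy model).
        self.ppr_log.append(PprEntry(process_id, address))

    def complete(self, entry):
        # Step 5: tell the device that the request has been served.
        self.completed.append(entry)


def resolve_fault(page_table, address, free_frames):
    # Step 4: hypothetical stand-in for a get_user_pages-style fix-up.
    vpn = address // PAGE_SIZE
    if vpn not in page_table:
        page_table[vpn] = free_frames.pop()


def ppr_handler(iommu, page_tables, free_frames):
    # Steps 2 and 3: the handler runs and drains the PPR log.
    while iommu.ppr_log:
        entry = iommu.ppr_log.popleft()
        resolve_fault(page_tables[entry.process_id], entry.address, free_frames)
        iommu.complete(entry)


page_tables = {7: {}}  # process ID -> page table
iommu = ToyIommu()
iommu.raise_ppr(process_id=7, address=0x5000)
ppr_handler(iommu, page_tables, free_frames=[0x40, 0x41])
```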

User Level Queueing - Kernel Fusion Driver (KFD)
• KFD helps applications pass the addresses of their user-level queues to the GPU
[Figure: a userspace application submits computation through user-level queues; KFD in kernel space passes the address of the user-level queue to the GPU.]
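To make user-level queueing and memory-based signaling concrete, here is a toy Python model, assuming a ring buffer in application memory and a completion signal that the GPU decrements. `UserQueue`, `Signal`, and `ToyGpu` are hypothetical names for illustration and do not reflect the KFD or HSA runtime interfaces.

```python
class Signal:
    """Memory-based signal: a value that both sides can read and write."""

    def __init__(self, value):
        self.value = value


class UserQueue:
    """Ring buffer living in application memory."""

    def __init__(self, size):
        self.ring = [None] * size
        self.write_index = 0  # advanced by the application
        self.read_index = 0   # advanced by the GPU

    def enqueue(self, kernel, args, signal):
        if self.write_index - self.read_index == len(self.ring):
            raise BufferError("queue full")
        self.ring[self.write_index % len(self.ring)] = (kernel, args, signal)
        self.write_index += 1


class ToyGpu:
    def __init__(self):
        self.queues = []

    def register_queue(self, queue):
        # The one step that goes through the kernel driver (KFD on the slide).
        self.queues.append(queue)

    def run(self):
        for q in self.queues:
            while q.read_index < q.write_index:
                kernel, args, signal = q.ring[q.read_index % len(q.ring)]
                kernel(*args)
                q.read_index += 1
                signal.value -= 1  # completion is reported through memory


out = []
gpu = ToyGpu()
queue = UserQueue(size=4)
gpu.register_queue(queue)               # done once, with the driver's help
done = Signal(1)
queue.enqueue(out.append, (42,), done)  # no system call on the dispatch path
gpu.run()
```

After registration, every dispatch is a plain memory write by the application, which is why only the queue address has to be handed to the GPU by the driver.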

Design - How to Virtualize
• User-level queueing: VirtIO-KFD
• Shared virtual memory: shadow page table
• Why not hardware-assisted nested paging?
• I/O page faulting: shadow PPR and VirtIO-IOMMU

Virtualize User Level Queueing
VirtIO-KFD
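[Figure: guest apps use the HSA Runtime Library and the VirtIO-KFD front-end in the guest OS; a shared virtqueue connects the front-end to the VirtIO-KFD back-end in QEMU, which uses KFD in the host OS (with KVM) to reach the GPU. The steps are numbered 1 to 4.]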
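The front-end/back-end split can be sketched as follows. This is a toy Python model under stated assumptions: `Virtqueue` is a pair of in-process deques rather than a real virtio ring, and `ToyKfd.create_queue` is a hypothetical stand-in for whatever call the host KFD exposes.

```python
from collections import deque


class ToyKfd:
    """Hypothetical stand-in for the host KFD: records queue addresses for the GPU."""

    def __init__(self):
        self.gpu_queues = {}
        self._next_id = 0

    def create_queue(self, queue_addr):
        queue_id = self._next_id
        self._next_id += 1
        self.gpu_queues[queue_id] = queue_addr
        return queue_id


class Virtqueue:
    """Toy shared ring: requests flow one way, responses the other."""

    def __init__(self):
        self.requests = deque()
        self.responses = deque()


class FrontEnd:
    """Guest-side driver: forwards requests instead of touching hardware."""

    def __init__(self, vq):
        self.vq = vq

    def submit_create_queue(self, queue_addr):
        self.vq.requests.append(("create_queue", queue_addr))

    def poll_response(self):
        return self.vq.responses.popleft()


class BackEnd:
    """Host-side (QEMU) device: replays guest requests against the host KFD."""

    def __init__(self, vq, kfd):
        self.vq = vq
        self.kfd = kfd

    def process(self):
        while self.vq.requests:
            op, arg = self.vq.requests.popleft()
            if op == "create_queue":
                self.vq.responses.append(self.kfd.create_queue(arg))


vq = Virtqueue()
kfd = ToyKfd()
front, back = FrontEnd(vq), BackEnd(vq, kfd)
front.submit_create_queue(0x7F0000001000)  # address of a guest user-level queue
back.process()
queue_id = front.poll_response()
```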

Virtualize Shared Virtual Memory
Shadow Page Table
[Figure: guest apps, the HSA Runtime Library, and the VirtIO-KFD front-end in the guest OS; a shared virtqueue to the VirtIO-KFD back-end in QEMU; KFD, the IOMMU driver, and KVM in the host OS. The address of the shadow page table is set in the IOMMU. The steps are numbered 1 to 6.]
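A shadow page table can be illustrated by composing two mappings. The sketch below assumes single-level dictionaries keyed by page number and introduces a guest-physical (GPA) layer as an explanatory assumption; the slides themselves only name guest virtual (GVA) and machine physical (MPA) addresses. It does not reproduce KVM's actual shadow paging code.

```python
def build_shadow_page_table(guest_pt, gpa_to_mpa):
    """Compose GVA->GPA (guest page table) with GPA->MPA (host mapping)
    into a single GVA->MPA table that a device can walk directly."""
    shadow = {}
    for gva_page, gpa_page in guest_pt.items():
        if gpa_page in gpa_to_mpa:  # skip pages the host has not backed yet
            shadow[gva_page] = gpa_to_mpa[gpa_page]
    return shadow


# Hypothetical example mappings (page numbers, not byte addresses).
guest_pt = {0x100: 0x20, 0x101: 0x21, 0x102: 0x22}
gpa_to_mpa = {0x20: 0x9A0, 0x21: 0x9B7}

spt = build_shadow_page_table(guest_pt, gpa_to_mpa)
```

Pages missing from the composed table are the ones that would later surface as I/O page faults when the GPU touches them.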

IOMMU Snapshot During GPU Execution
ID   System               Page table
1    Host, process 1      Addr of PT
2    Guest 1, process 1   Addr of SPT
• Native scenario (ID=1): the page table translates HVA to MPA
• Guest scenario (ID=2): the shadow page table translates GVA to MPA
• More guest processes in different guest OSes are also allowed.
[Figure: the GPU accesses memory through the IOMMU, which selects the page table by ID.]
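The snapshot can be read as a lookup keyed by ID. The sketch below models that in Python, taking HVA, GVA, and MPA to mean host virtual, guest virtual, and machine physical addresses; the table layout, the frame numbers, and the 4 KiB page size are assumptions made for illustration.

```python
PAGE_SIZE = 4096  # assumed page size

# ID -> owner and the page table the IOMMU should walk for that ID.
iommu_table = {
    1: {"owner": "Host, process 1", "page_table": {0x10: 0x2A}},     # HVA -> MPA
    2: {"owner": "Guest 1, process 1", "page_table": {0x10: 0x5C}},  # GVA -> MPA (shadow)
}


def iommu_translate(table, ident, vaddr):
    """Translate a device-issued virtual address using the table bound to `ident`."""
    vpn, offset = divmod(vaddr, PAGE_SIZE)
    return table[ident]["page_table"][vpn] * PAGE_SIZE + offset


# The same virtual address resolves differently for the host and guest process.
vaddr = 0x10 * PAGE_SIZE + 8
host_mpa = iommu_translate(iommu_table, 1, vaddr)
guest_mpa = iommu_translate(iommu_table, 2, vaddr)
```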

Virtualize I/O Page Faulting
VirtIO-IOMMU, Shadow PPR
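[Figure: components involved in virtualized I/O page faulting: the IOMMU raising an interrupt, the IOMMU driver, shadow PPR, and KVM in the host OS, VirtIO-IOMMU in QEMU and in the guest OS, and guest apps on the HSA Runtime Library. The steps are numbered 1 to 5.]
PPR: Peripheral Page Request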
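One plausible reading of the figure is that the host forwards a guest-owned fault to the guest and then brings the shadow page table up to date. The Python sketch below models only that reading; the control flow, the `Guest` class, and the `shadow_ppr` function are assumptions, not the authors' implementation.

```python
from dataclasses import dataclass, field


@dataclass
class Guest:
    name: str
    page_table: dict = field(default_factory=dict)  # GVA page -> GPA page
    free_gpa: list = field(default_factory=list)

    def handle_ppr(self, gva_page):
        # Guest-side fix-up, reached through a VirtIO-IOMMU-style notification.
        if gva_page not in self.page_table:
            self.page_table[gva_page] = self.free_gpa.pop()
        return self.page_table[gva_page]


def shadow_ppr(ident, gva_page, id_to_guest, gpa_to_mpa, shadow_tables):
    """Host-side handler (assumed flow): forward a guest-owned fault,
    then sync the matching shadow page table entry."""
    guest = id_to_guest.get(ident)
    if guest is None:
        return "handled-by-host"  # native fault, nothing to forward
    gpa_page = guest.handle_ppr(gva_page)
    shadow_tables[ident][gva_page] = gpa_to_mpa[gpa_page]  # GVA -> MPA
    return "forwarded-to-guest"


guest1 = Guest("Guest 1", free_gpa=[0x31, 0x30])
id_to_guest = {2: guest1}                # ID 2 belongs to a guest process
gpa_to_mpa = {0x30: 0x9A0, 0x31: 0x9B7}  # hypothetical host backing
shadow_tables = {2: {}}

result_guest = shadow_ppr(2, 0x100, id_to_guest, gpa_to_mpa, shadow_tables)
result_host = shadow_ppr(1, 0x100, id_to_guest, gpa_to_mpa, shadow_tables)
```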

System Architecture
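[Figure: overall architecture. Guest apps and the HSA Runtime Library run in the guest OS together with VirtIO-KFD and VirtIO-IOMMU; QEMU (a host process) contains the matching VirtIO-KFD and VirtIO-IOMMU parts; the host OS contains KVM, shadow PPR, KFD, and the IOMMU driver, above the IOMMU and the GPU. The paths are labeled user-level queueing, shared virtual memory, and I/O page faulting.]
• KFD: Kernel Fusion Driver
• PPR: Peripheral Page Request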

Evaluation
• Queue initialization time
• Measures the overhead of VirtIO-KFD
• GPU execution time
• Measures the overhead of the shadow page table and shadow PPR
Configuration        Native          Guest
Hardware platform    Kaveri          Kaveri
Memory               8 GB            4 GB
Number of CPUs       4               4
OS                   Ubuntu 13.10    Ubuntu 13.10

Queue Initialization Time
Queue initialization in the guest shows an average performance drop of 30% relative to native.

GPU Execution Time
The guest achieves on average 95% of native performance in most cases.
GPU time (sec)
Benchmark               Native    Guest
BinarySearch            0.0108    0.0113
FastWalshTransform      0.0018    0.0019
BitonicSort             0.014     0.016
FloydWarshall           16.094    16.603
MatrixMultiplication    8.012     8.286
MatrixTranspose         0.502     0.538
MonteCarloAsian         17.458    18.342
[Figure: timeline of a small benchmark in the guest. The guest application enqueues a task, kicks the GPU, and waits for the signal; world switches to the host and back introduce a signal delay, which recurs when the application enqueues many times.]
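The GPU times reported on this slide can be turned into per-benchmark relative performance with a few lines of Python. The numbers are copied from the slide; defining relative performance as native time divided by guest time is an assumption about how the percentage was computed.

```python
# GPU time in seconds, as reported on the slide.
native = {
    "BinarySearch": 0.0108,
    "FastWalshTransform": 0.0018,
    "BitonicSort": 0.014,
    "FloydWarshall": 16.094,
    "MatrixMultiplication": 8.012,
    "MatrixTranspose": 0.502,
    "MonteCarloAsian": 17.458,
}
guest = {
    "BinarySearch": 0.0113,
    "FastWalshTransform": 0.0019,
    "BitonicSort": 0.016,
    "FloydWarshall": 16.603,
    "MatrixMultiplication": 8.286,
    "MatrixTranspose": 0.538,
    "MonteCarloAsian": 18.342,
}

# Assumed definition: a value of 1.0 means the guest matches native speed.
relative = {name: native[name] / guest[name] for name in native}
mean_relative = sum(relative.values()) / len(relative)
slowest = min(relative, key=relative.get)

for name, ratio in relative.items():
    print(f"{name:22s} {ratio:.1%}")
print(f"{'mean':22s} {mean_relative:.1%}")
```

Under this definition the shortest-running benchmarks show the largest relative gap, which matches the slide's remark about small benchmarks.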

Conclusion
• We successfully implemented a hypervisor that virtualizes HSA features.
• Guest systems can benefit from HSA and carry out heterogeneous computing.
• The GPU in Kaveri can be shared among multiple guest OSes and the host OS.

Thanks!
Q&A
gic4107@gmail.com
