Serverless computing is increasingly popular because of its promise of lower cost and the convenience it offers to users who do not need to focus on server management. This has resulted in the availability of a number of proprietary and open-source serverless solutions. We seek to understand how the performance of serverless computing depends on a number of design issues, using several popular open-source serverless platforms. We identify the idiosyncrasies affecting performance (throughput and latency) of the different open-source serverless platforms. Further, we observe that having either resource-based (CPU and memory) or workload-based (requests per second (RPS) or concurrent requests) auto-scaling alone is inadequate to address the needs of serverless platforms.
Understanding Open Source Serverless Platforms: Design Considerations and Performance
Slide 1
Junfeng Li, Sameer G. Kulkarni,
K. K. Ramakrishnan, Dan Li
hotjunfeng@163.com
Slide 3
Motivation and Goals
❖ Develop an understanding of the open-source serverless platforms:
➢Perform measurements to understand the impact of key configuration parameters of different components (platform, gateway, controller, and function)
❖ Evaluate and compare the performance of open-source serverless platforms:
➢Different workloads
➢Different auto-scaling modes
Slide 6
Service Exporting and Routing
[Fig. Service exporting and routing: the public NodePort is mapped to the internal Pod_IP by configuring mapping rules for exporting the service (Netfilter NAT rules); the gateway gets events and pushes them to a worker, routing and load-balancing requests (encapsulating and decapsulating packets); the function pod executes the functions.]
Slide 7
Motivation and Goals
❖ Develop an understanding of the open-source serverless platforms:
➢Perform measurements to understand the impact of key configuration parameters of different components (gateway, controller, and function)
❖ Evaluate and compare the performance of open-source serverless platforms:
➢Different workloads
➢Different auto-scaling modes
[Platform logos with versions in the original slide: v1.1.16; OpenFaaS Gateway v0.17.0, Faas-netes v0.8.6, Faas-cli v0.9.2; v0.8; v1.0.4]
Slide 8
Experiment Setup
❖ Topology: Kubernetes cluster (1 master, 2 workers) on CloudLab [1]
➢Hardware: Intel Xeon E5-2640 v4, 20 hyperthreaded cores
➢Operating System: Ubuntu 16.04.1 LTS
➢Kubernetes v1.16.1, Docker v18.09.2
❖ Functions and Workload:
➢Python ‘Hello-world’ function
➢Python ‘HTTP’ function
➢Workload generator: wrk
[1] Duplyakin, Dmitry, et al. "The design and operation of CloudLab." 2019 USENIX Annual Technical Conference (USENIX ATC 19). 2019.
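The ‘Hello-world’ function used in the measurements can be sketched as below. This is illustrative only: the handler does no real work, so measured latency is dominated by platform overheads. The `(event, context)` signature shown is Kubeless-style; each platform defines its own handler interface.

```python
def handler(event, context):
    # Minimal 'Hello-world' function body (illustrative sketch).
    # Because it returns immediately, end-to-end latency mostly
    # reflects the serverless platform's own overheads.
    return "Hello, world!"
```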
Slide 12
Knative
❖ Salient parameter: the number of workers within one pod
Performance improves with more workers, but remains lower than Nuclio's.
Fig. Throughput of Knative. Fig. Throughput of Nuclio.
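The effect of multiple workers within one pod can be sketched as below: with more than one worker, requests are processed concurrently instead of queueing behind a single worker. This is a simplified model, not any platform's actual worker implementation; the `handle` body stands in for the deployed function.

```python
from concurrent.futures import ThreadPoolExecutor

def handle(request):
    # Stand-in for the function body.
    return request + 1

def serve(requests, num_workers):
    # With num_workers > 1 inside a single function pod, requests are
    # handled concurrently, which is what improves throughput.
    with ThreadPoolExecutor(max_workers=num_workers) as pool:
        return list(pool.map(handle, requests))
```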
Slide 13
OpenFaaS
❖ Working model
The of-watchdog gets events and invokes the function through one worker. It supports multiple modes:
(1) Fork-per-request: cold start for every request;
(2) Pre-fork: start the function once and keep it warm.
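The two modes above can be contrasted with a minimal sketch. This is a toy model, not of-watchdog's actual code: the cold-start cost is an assumed constant, and the function body is a placeholder.

```python
import time

COLD_START = 0.001  # assumed per-launch cost (seconds), for illustration

def run_fork_per_request(requests):
    # Mode (1) Fork-per-request: every request pays the cold-start
    # cost of launching a fresh function process.
    results = []
    for r in requests:
        time.sleep(COLD_START)   # cold start on every request
        results.append(r * 2)    # function body (placeholder)
    return results

def run_pre_fork(requests):
    # Mode (2) Pre-fork: the function starts once and stays warm,
    # so only the first request pays the cold-start cost.
    time.sleep(COLD_START)       # one-time start
    return [r * 2 for r in requests]
```

In this model, fork-per-request spends `len(requests) * COLD_START` on startup versus a single `COLD_START` for pre-fork, which is why the warm mode shows higher throughput and lower latency.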
Slide 18
Kubeless
The performance of Kubeless is similar to that of OpenFaaS in "Fork-per-request" mode.
Fig. Throughput of Kubeless. Fig. Throughput of OpenFaaS (fork-per-request).
Slide 19
Motivation and Goals
❖ Develop an understanding of the open-source serverless platforms:
➢Describe how different components work
➢Perform measurements to understand the impact of key configuration parameters of different components (platform, gateway, controller, and function)
❖ Evaluate and compare the performance of open-source serverless platforms:
➢Different workloads
➢Different auto-scaling modes
Slide 23
Performance
Latency breakdown of the ‘Hello-world’ function:
[Fig. Latency breakdown under fork-per-request (cold start on every request): the execution time is the same across platforms (same Python runtime), while the remaining latency components are platform-specific.]
Slide 25
Performance
HTTP workload: fetch a web page (5 bytes) from a local server.
Nuclio: no ingress controller ⇒ requests bypass the ingress controller's queue ⇒ highest throughput.
Slide 27
Performance
Different modes of exporting services:
[Fig. Direct call to the function pod (NodePort) vs. invoking through the IC/GW; the gap shows the overhead of the Ingress Controller/API Gateway.]
Slide 28
Performance: Auto-scaling
Resource-based auto-scaling depends on the Kubernetes HPA (Horizontal Pod Autoscaler). In spite of using the same function and the same HPA settings, platform characteristics govern auto-scaling:
(Different performance ⇒ Different resource utilization ⇒ Different auto-scaling rates)
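The HPA's core scaling rule explains why platform characteristics matter: the replica count is driven by observed resource utilization, so a platform that burns more CPU per request scales out faster. A minimal sketch of the rule:

```python
import math

def hpa_desired_replicas(current_replicas, current_metric, target_metric):
    # Kubernetes HPA scaling rule:
    # desired = ceil(current_replicas * current_metric / target_metric)
    return math.ceil(current_replicas * current_metric / target_metric)

# Example: 2 pods at 90% CPU with a 50% target scale out to
# ceil(2 * 90 / 50) = 4 pods.
```

Two platforms serving the same workload at different CPU efficiencies feed different `current_metric` values into this formula, and so scale at different rates.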
Slide 32
Performance: Auto-scaling
Load-balancing issue in OpenFaaS: if the client enables keep-alive, OpenFaaS does not set up connections with newly created function pods, which hinders performance improvement.
Behavior: auto-scaling happens, but NO performance improvement!
Fig. RPS-based auto-scaling in OpenFaaS (pods auto-scale, yet throughput does not improve).
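The observed behavior can be sketched with a simplified traffic model (an assumption-laden toy, not OpenFaaS code): a keep-alive client reuses the connections it opened before scale-out, so pods created afterwards receive no traffic and throughput stays flat.

```python
def distribute_requests(num_requests, connected_pods, all_pods):
    # Model of the observed behavior: requests only round-robin over
    # the pods the keep-alive connections were originally opened to;
    # pods added later by auto-scaling get nothing.
    served = {pod: 0 for pod in all_pods}
    for i in range(num_requests):
        served[connected_pods[i % len(connected_pods)]] += 1
    return served

# After scaling from 1 pod to 2, all 100 requests still land on pod-0.
```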
Slide 34
Performance: Auto-scaling
Concurrency-based auto-scaling issue: misconfiguration inhibits auto-scaling. If the observed concurrency never exceeds the threshold, no auto-scaling occurs, even for a workload with low concurrency but high RPS.
Traffic: Conc = 9, RPS = 400; configuration: Conc_Threshold = 10.
Behavior: no auto-scaling! The platform is unable to scale to 400 RPS (actual RPS ≈ 220).
Fig. Concurrency-based auto-scaling in Knative (no auto-scaling, no throughput improvement).
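The misconfiguration reduces to a single comparison. A simplified sketch of the trigger condition (not Knative's actual autoscaler code, which averages concurrency over time windows):

```python
def scales_out(observed_concurrency, concurrency_threshold):
    # Simplified concurrency-based trigger: scale out only when the
    # observed concurrency exceeds the configured threshold.
    # With the experiment's workload (Conc = 9) and Conc_Threshold = 10,
    # this never fires, so no pods are added despite 400 RPS of demand.
    return observed_concurrency > concurrency_threshold
```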
Slide 35
Key Observations
❖ Function processing:
➢Multiple workers within one function pod contribute to performance improvement.
➢Pre-fork mode (warm worker) increases the throughput and reduces the latency.
❖ Load balancing:
➢Plays an important role in performance and scalability.
➢Coupling routing with load balancing can adversely affect performance -- needs greater attention!
Slide 36
Key Observations
❖ Auto-scaling:
➢For resource-based auto-scaling, in spite of using the same function and HPA, platform characteristics govern auto-scaling.
➢Misconfiguration of auto-scaling rules can severely degrade performance and system utilization.
➢Current auto-scaling approaches are based only on the total processed requests, while dropped requests are missed -- the incoming request rate needs to be accounted for.
Slide 37
More Details
❖ Paper title:
➢Understanding Open Source Serverless Platforms: Design Considerations and Performance.
❖ Paper links:
➢https://dl.acm.org/doi/10.1145/3366623.3368139
➢https://arxiv.org/abs/1911.07449
➢https://www.researchgate.net/publication/337362475_Understanding_Open_Source_Serverless_Platforms_Design_Considerations_and_Performance
Slide 39
Serverless 2020 and Beyond
❖ Load balancing:
➢Improper load balancing prevents performance from improving with scale -- needs greater attention!
❖ Auto-scaling:
➢Misconfiguration of auto-scaling rules can severely degrade performance and system utilization.
➢Current auto-scaling approaches are based only on the total processed requests, while dropped requests are missed -- the incoming request rate needs to be accounted for.
Slide 40
Motivation
❖ How do the serverless platforms work?
❖ What is the impact of configuration parameters?
❖ What is the performance of serverless platforms?
❖ How does auto-scaling behave?