AWS Lambda is a serverless, event-driven compute service that lets you run code for virtually any type of application or backend service without provisioning or managing servers. Teams choose it for its flexibility, its easy integration with other AWS services, and because it reduces the amount of infrastructure you and your team own. But as the number of clients and requests grows and latency starts to matter, you may discover that there is no free lunch: clients complain about latency, assumptions that held when running your software on EC2 or Fargate no longer apply, and costs start to ramp up. In this talk, I describe lessons learned from working on multiple services backed by AWS Lambda: what cold starts are and how to reduce them, how the JVM makes them even more problematic, when AWS Lambda is more expensive than less abstract platforms, how to use provisioned concurrency, and why one of the biggest problems in computer science (caching) is even bigger on Lambda.
3. About me
• A decade of experience with JVM languages, mostly Java and Scala
• Working with server-based environments since the beginning
• Practical experience with AWS Lambda since February 2021
• Maintenance and development of two existing services
• Development of two additional services from scratch
Andrzej Dębski
4. Agenda
1. Cold starts (generic and those specific to JVM)
2. AWS Lambda Pricing
3. In-memory caching on Lambda
10. Cold start type 1 (container recycle)
1. 18:16:32 Duration: 85.24 ms  Max Memory Used: 109 MB  Init Duration: 1821.08 ms
2. 18:18:50 Duration: 1.95 ms  Max Memory Used: 109 MB
3. 18:24:05 Duration: 1.74 ms  Max Memory Used: 109 MB
4. 18:30:40 Duration: 72.91 ms  Max Memory Used: 109 MB  Init Duration: 1870.40 ms
12. Cold start type 2 (JVM JIT and lazy load)
1. Code path 1: Duration: 731.32 ms
2. Code path 1: Duration: 19.86 ms
3. Code path 1: Duration: 4.31 ms
4. Code path 2: Duration: 1261.40 ms (new code path executed for the first time)
5. Code path 2: Duration: 2.12 ms
13. Reducing cold starts
1. Provisioned concurrency (best but costly)
   1. Keeps a set of instances ready to respond to requests
2. "Background" traffic (best effort)
3. Use a different language (e.g. Go) or use AOT compilation through GraalVM
   1. https://shinesolutions.com/2021/08/30/improving-cold-start-times-of-java-aws-lambda-functions-using-graalvm-and-native-images/
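The "background" traffic option is typically implemented by pointing a scheduled rule (e.g. EventBridge every few minutes) at the function and short-circuiting in the handler. A minimal sketch, where the `warmup` marker field is an assumption of this example, not a Lambda convention:

```java
import java.util.Map;

public class WarmableHandler {
    public String handleRequest(Map<String, Object> event) {
        // The scheduled "background" invocation carries a marker payload;
        // return immediately so the warm container stays alive without
        // doing any real work.
        if (Boolean.TRUE.equals(event.get("warmup"))) {
            return "warmed";
        }
        return doRealWork(event);
    }

    private String doRealWork(Map<String, Object> event) {
        return "handled:" + event.get("payload");
    }
}
```

Note this is best effort only: it keeps some containers warm but does not prevent cold starts when concurrency spikes above the warmed pool.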
15. Cold start for provisioned concurrency
1. Code path 1: Duration: 33.12 ms
2. Code path 1: Duration: 2.09 ms
3. Code path 1: Duration: 2.34 ms
4. Code path 2: Duration: 2.03 ms (new code path executed for the first time)
5. Code path 2: Duration: 2.40 ms
16. Provisioned concurrency best practices
1. Make sure the function is invoked using the alias.
2. Monitor (and alarm on) metrics
   1. ProvisionedConcurrencySpilloverInvocations
   2. ProvisionedConcurrencyUtilization
3. Pre-warm the code paths in the constructor
4. Use CodeDeploy policies to gradually deploy new function revisions.
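Pre-warming code paths in the constructor might look like the following sketch; `OrderService` and its two "code paths" are hypothetical stand-ins for the real latency-sensitive code:

```java
public class PrewarmedHandler {
    private final OrderService service = new OrderService();

    public PrewarmedHandler() {
        // The constructor runs during the init phase (free for on-demand
        // functions, and ahead of traffic for provisioned concurrency),
        // so exercise every latency-sensitive code path here to trigger
        // class loading and JIT compilation before the first real request.
        service.handleQuery("warmup");   // code path 1
        service.handleCommand("warmup"); // code path 2
    }

    public String handleRequest(String input) {
        return input.startsWith("q:") ? service.handleQuery(input)
                                      : service.handleCommand(input);
    }
}

// Hypothetical service with two distinct code paths.
class OrderService {
    String handleQuery(String q)   { return "query:" + q; }
    String handleCommand(String c) { return "command:" + c; }
}
```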
17. Recap
1. Cold starts are real, and they affect JVM Lambdas even more
2. Either accept them or pay for provisioned concurrency (and use it well)
3. Monitor and instrument your functions to understand the bottlenecks in your code
19. Lambda pricing
1. Lambda pricing (https://aws.amazon.com/lambda/pricing/):
   1. On demand: billed per GB-second (sum of invocation durations × allocated memory) plus a per-request charge
   2. Provisioned concurrency is more complicated: we additionally pay for keeping the containers "warm"
2. Free tier: 1M requests and 400,000 GB-seconds per month
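The on-demand formula can be sketched in code. The per-GB-second and per-request prices are assumptions (roughly the us-east-1 x86 rates at the time of the talk), and the free tier is ignored; always check the pricing page for current numbers:

```java
public class LambdaCost {
    // Assumed us-east-1 x86 rates; verify against the Lambda pricing page.
    static final double PRICE_PER_GB_SECOND = 0.0000166667;
    static final double PRICE_PER_REQUEST   = 0.20 / 1_000_000;

    // On-demand cost = GB-seconds * rate + request-count charge,
    // where GB-seconds = sum(duration) * allocated memory in GB.
    static double onDemandMonthlyUsd(long requests, double avgDurationSec, int memoryMb) {
        double gbSeconds = requests * avgDurationSec * (memoryMb / 1024.0);
        return gbSeconds * PRICE_PER_GB_SECOND + requests * PRICE_PER_REQUEST;
    }

    public static void main(String[] args) {
        // 1 request per second over a 30-day month, 200 ms, 256 MB
        long requests = 30L * 24 * 3600;
        System.out.printf("%.2f USD (before free tier)%n",
                onDemandMonthlyUsd(requests, 0.2, 256));
    }
}
```

Note the raw formula gives a higher number than the "Hello world" slide because the monthly free tier absorbs most of this small workload.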
20. How to calculate AWS costs
1. AWS pricing calculator
1. https://calculator.aws/#/
2. Cost estimates for the following examples:
1. https://tinyurl.com/yk7v3wek
22. Assumptions
1. "Average" request time: 200 ms
2. Monthly costs
3. Region: us-east-1
4. Focus on compute cost, ignore everything else
23. Scenarios considered
1. "Hello world": on-demand vs provisioned capacity
2. Smallest Fargate vs Lambda
3. StackExchange on Lambda vs Fargate
24. "Hello world", 1 request per second (256 MB RAM)

                On demand    Provisioned capacity of 1
Monthly cost    $0.33        $4.55
25. Fargate, 1 request per second

                Lambda on demand    Lambda provisioned capacity of 1    Fargate
                (1769 MB RAM)       (1769 MB RAM)                       (1 vCPU, 2 GB RAM)
Monthly cost    $8.80               $28.28                              $36.04
27. Simulating Stack Exchange
1. https://stackexchange.com/performance
   1. 1.3 × 10^9 monthly page views means ~495 requests per second
2. For Fargate we assume 9 tasks to protect from an AZ outage, 82.5 RPS per task.
3. For Lambda provisioned concurrency, try to handle every request with provisioned capacity.
   1. 495 (RPS) / 5 (requests per second per container) = 99 instances
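The sizing numbers follow from simple arithmetic; a quick sanity check (the 5 requests per second per container figure is the talk's assumption, matching roughly one concurrent request per instance at ~200 ms per request):

```java
public class StackExchangeSizing {
    // Monthly page views spread evenly over a 30-day month.
    static double requestsPerSecond(double monthlyViews) {
        return monthlyViews / (30.0 * 24 * 3600);
    }

    // Provisioned instances needed to absorb the load at a given
    // per-container throughput.
    static int provisionedInstances(double rps, double rpsPerContainer) {
        return (int) Math.ceil(rps / rpsPerContainer);
    }

    public static void main(String[] args) {
        System.out.printf("%.0f RPS -> %d provisioned instances%n",
                requestsPerSecond(1.3e9),       // ~501, rounded to ~495 on the slide
                provisionedInstances(495, 5));  // 99
    }
}
```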
28. Stack Exchange, 495 requests per second

                Lambda on demand    Lambda provisioned capacity of 99    Fargate, 9 tasks
                (1769 MB RAM)       (1769 MB RAM)                        (1 vCPU, 2 GB RAM each)
Monthly cost    $7,744.27           $6,502.63                            $324.36
30. Recap
1. Remember the free tier
2. Use the AWS Pricing Calculator
3. With more traffic, Lambda gets really costly
4. Take advantage of the "free" compute during the init phase for on-demand Lambdas
5. For consistent workloads, provisioned concurrency may be cheaper
6. Always consider factors other than $$$ when evaluating technologies
36. Caching in Lambda – challenges
1. No hard guarantees on when the container will be recycled
2. Best-effort async updates
3. No control over the routing algorithm (no sticky sessions)
37. Caching in Lambda – approaches
1. (Mostly) rely on an out-of-process cache
2. Use a different compute platform if an L1 cache is critical
3. Use Lambda extensions
   1. That's how AWS AppConfig does it
38. Caching in Lambda – extensions
1. Code that can execute alongside your Lambda function
2. An extension continues to execute AFTER the Lambda handler has returned a response.
3. Implementing a cache in an extension
   1. Save the data on the Lambda filesystem
   2. Expose a local HTTP server and serve the data
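The "local HTTP server" variant can be sketched as follows. This is only the serving half: a real extension must also register with the Lambda Extensions API and refresh the cached value between invocations; the port, endpoint path, and `fetchLatestValue()` are illustrative assumptions:

```java
import com.sun.net.httpserver.HttpServer;
import java.io.OutputStream;
import java.net.InetSocketAddress;
import java.nio.charset.StandardCharsets;
import java.util.concurrent.atomic.AtomicReference;

public class ExtensionCacheSketch {
    // The cached value the extension keeps fresh in the background.
    private static final AtomicReference<String> cached =
            new AtomicReference<>(fetchLatestValue());

    // Stand-in for the real origin call (e.g. AppConfig, DynamoDB).
    static String fetchLatestValue() { return "config-v1"; }

    public static HttpServer start(int port) throws Exception {
        HttpServer server = HttpServer.create(new InetSocketAddress("localhost", port), 0);
        // The function handler reads the value with a cheap localhost call
        // instead of going to the origin on every invocation.
        server.createContext("/cache", exchange -> {
            byte[] body = cached.get().getBytes(StandardCharsets.UTF_8);
            exchange.sendResponseHeaders(200, body.length);
            try (OutputStream os = exchange.getResponseBody()) { os.write(body); }
        });
        // A background thread would periodically call
        // cached.set(fetchLatestValue()) between invocations.
        server.start();
        return server;
    }
}
```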
40. Extensions for caching
1. (+) "Asynchronous" cache updates
2. (-) Overhead (in ms):
   1. Avg (no ext / ext): 1.88 ms / 14.1 ms
   2. Tm99.9 (no ext / ext): 1.88 ms / 14.2 ms
3. (-) A Lambda instance can't handle new requests until the extension is done
4. (-) Extension instances are not shared between containers
41. Recap
1. In-memory caching in Lambda is not straightforward
   1. Small window of time where the cache is "warm"
   2. Asynchronous updates are best effort only
2. Extensions improve the situation but have their own downsides
   1. Runtime overhead
   2. They complicate the flow
3. Use an L2 cache or a different compute platform