4. What is scalability?
• We can’t predict the load
• Scaling is the process of managing your resources to help your application meet a set of
performance requirements
• Ability to handle increased load
• Add resources without modifying the system
5. What is scaling up?
• Scaling up is the process where you increase the capacity of a given instance
• Scaling down is the process where you decrease the capacity of a given instance
• Application does not have to be designed for scalability
• Easy to implement
• Costly
• Not linear performance growth
• Restart or short interruption of the resources
6. What is scaling out or in?
• Scaling out is the process of adding more instances to support the load of your solution
• Scaling in is the process of removing instances
• Is an elastic process
• Isn’t a magic fix
• Application has to be designed for horizontal scalability
• Requires more investment to implement
• Introduces additional complexity
• Nearly linear performance increase
• Application issues: File access, Session State, Shared resources, Bottlenecks (database)
7. Autoscale
• A primary advantage of the cloud is elastic scaling.
• Many Microsoft Azure services provide the capability to scale both manually and automatically
• Autoscale refers to the capability of many of these services to monitor the application instances and
automatically scale appropriately to handle the current usage of the application
• Scale based on:
• Metrics
• Schedules
• Consider startup time
• Handle state (to Azure Cache for Redis or SQL Database)
• Serverless – infrastructure isn’t your responsibility; scaling is handled automatically
8. Application Gateway
• OWASP Protection
• URL-based routing
• Application Gateway Ingress Controller
• Scaling:
• Autoscaling
• Manual (1-125 instances)
• Creating a new instance can take some time (around six or seven minutes)
• Scaling does not cause downtime
9. App Service
• Scale out (horizontal scaling) – increases number of VM instances depending on pricing tier
• Manually
• Automatic
• Scale up (vertical scaling)
• Create own App Service Plan for the apps that need scaling
10. Autoscale metrics
Metric | Metric identifier | Description
CPU | CpuPercentage | The average amount of CPU time used across all instances of the plan
Memory | MemoryPercentage | The average amount of memory used across all instances of the plan
Data in | BytesReceived | The average incoming bandwidth used across all instances of the plan
Data out | BytesSent | The average outgoing bandwidth used across all instances of the plan
HTTP queue | HttpQueueLength | The average number of HTTP requests that had to sit in the queue before being fulfilled. A high or increasing HTTP queue length is a symptom of a plan under heavy load.
Disk queue | DiskQueueLength | The average number of both read and write requests that were queued on storage. A high disk queue length is an indication of an application that might be slowing down due to excessive disk I/O.
11. Autoscale patterns
Scale based on CPU
Scale differently on weekdays vs. weekends
Scale differently during holidays
Scale based on custom metric
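The schedule-based patterns above (weekdays vs. weekends, holidays) can be sketched as a profile picker. This is an illustrative model only, not the Azure Monitor API; the profile names, bounds, and holiday dates are all made up for the example.

```python
from datetime import date

# Hypothetical autoscale profiles: names and instance bounds are illustrative.
PROFILES = {
    "weekday": {"min": 3, "max": 10},
    "weekend": {"min": 1, "max": 4},
    "holiday": {"min": 6, "max": 20},
}

# Example fixed holiday dates (placeholders).
HOLIDAYS = {date(2024, 12, 25), date(2024, 1, 1)}

def pick_profile(day: date) -> str:
    """Mimic profile selection: holiday profiles win, then weekend, then weekday."""
    if day in HOLIDAYS:
        return "holiday"
    if day.weekday() >= 5:  # Saturday=5, Sunday=6
        return "weekend"
    return "weekday"

print(pick_profile(date(2024, 12, 25)))  # holiday
print(pick_profile(date(2024, 6, 8)))    # a Saturday -> weekend
```

A real autoscale setting expresses the same idea declaratively: one profile per schedule, each with its own rules and instance bounds.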
12. Autoscale concepts
• Each resource can have one autoscale setting:
• Autoscale settings can have one-to-many profiles
• Profiles can have one-to-many rules
• Autoscale increases instances horizontally within bounds:
• Bounds are set by using the minimum, maximum, and default values
• Thresholds are calculated at an instance level
• Autoscale successful actions and failures are logged to the Activity Log
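The one-setting → many-profiles → many-rules shape, with horizontal scaling constrained by minimum and maximum bounds, can be modeled in a few lines. All names here are invented for the sketch; this is not how the Azure Monitor autoscale engine is implemented.

```python
# Illustrative model of autoscale rule evaluation within bounds.

def evaluate(rules, metrics, current, minimum, maximum):
    """Apply scale-out/scale-in rules, then clamp to the profile bounds."""
    target = current
    for rule in rules:
        value = metrics[rule["metric"]]
        if rule["direction"] == "out" and value > rule["threshold"]:
            target += rule["step"]
        elif rule["direction"] == "in" and value < rule["threshold"]:
            target -= rule["step"]
    return max(minimum, min(maximum, target))  # bounds are inclusive

# A profile with a paired scale-out / scale-in rule combination.
rules = [
    {"metric": "cpu", "direction": "out", "threshold": 80, "step": 1},
    {"metric": "cpu", "direction": "in",  "threshold": 30, "step": 1},
]

print(evaluate(rules, {"cpu": 90}, current=2, minimum=1, maximum=5))  # scale out -> 3
print(evaluate(rules, {"cpu": 20}, current=1, minimum=1, maximum=5))  # clamped at min -> 1
```

Note how the clamp at the end is what "autoscale increases instances horizontally within bounds" means in practice.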
13. Autoscale thresholds
Scale is constrained to a minimum and maximum:
◦ Your current instance count must be between the minimum and maximum:
◦ Minimum can help guarantee availability
◦ Maximum can help control costs
14. Best practices
• Ensure that the maximum and minimum values are different and have an adequate margin
between them
• Manual scaling is reset by autoscale min and max
• Always use a scale-out and scale-in rule combination that performs an increase and decrease
• Choose the appropriate statistic for your diagnostics metric
• Choose the thresholds carefully for all metric types
15. Azure Functions
• Consumption plan
• Scales automatically
• Apps may scale to zero when idle
• Scales CPU and memory
• Max instances
• Windows 200
• Linux 100
• Premium plan
• Scales automatically – no delay (pre-warmed workers)
• Max instances
• Windows 100
• Linux 40
• Dedicated plan
• Requires predictive scaling
16. Azure Functions scalability best practices
• Share and manage connections
• Avoid sharing storage accounts between function apps
• Don't host production code in shared function app
• Use async code but avoid blocking calls
• Use multiple worker processes
• Configure host behaviors to better handle concurrency
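The "share and manage connections" guidance can be sketched as follows: create the client once at module scope so every invocation reuses it, instead of opening a new connection per call. `FakeHttpClient` is a stand-in for a real SDK client (e.g., an HTTP or database client), used here so the sketch runs without any Azure dependencies.

```python
# Sketch of connection sharing in a function app. FakeHttpClient is a
# hypothetical stand-in that counts how many clients were constructed.

class FakeHttpClient:
    instances_created = 0

    def __init__(self):
        FakeHttpClient.instances_created += 1

    def get(self, url):
        return f"response from {url}"

client = FakeHttpClient()  # module scope: shared across all invocations

def handler(url):
    # Reuses the shared client; avoids socket exhaustion under scale-out.
    return client.get(url)

def bad_handler(url):
    # Anti-pattern: a new client (and connection pool) on every invocation.
    return FakeHttpClient().get(url)

for _ in range(3):
    handler("https://example.invalid/api")
print(FakeHttpClient.instances_created)  # still 1 for the shared client
```

With a real SDK client the same pattern applies: the object holds a connection pool, so per-invocation construction multiplies open sockets as the app scales out.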
17. API Management
• Policies
• API documentation
• Rate limiting
• Health monitoring
• Modern formats like JSON
• Security
• Scale (manual) process can take from 15 to 45 minutes to apply
• Support autoscale, based on capacity metric (takes at least 20 minutes)
• API Management service in the Consumption tier scales automatically based on the traffic
• Add caching to improve performance (reduce latency for API callers and backend load)
• If you're scaling from or to the Developer tier, there will be downtime; otherwise there is no downtime
[Diagram: APIM fronting both modern and legacy APIs]
18. Azure Static Web App
• Globally distributed content
• Integration with serverless APIs powered by Azure Functions
• Access to a variety of authentication providers
• First-class GitHub and Azure DevOps integration
• Free SSL certificates, which are automatically renewed
• Add managed Azure Front Door to reduce latency
19. Azure Service Bus
• Decouple services for greater scalability and reliability
• Supports larger message sizes of 256 KB (standard tier) or 100 MB (premium tier) per message
• Supports both at-most-once and at-least-once delivery
• Guarantees first-in, first-out (FIFO) order
• Supports role-based security
◦ Premium tier supports manual and autoscaling
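The decoupling idea above can be sketched with an in-process queue: the sender and receiver never talk directly, so receivers can be scaled out without changing the sender. Python's `queue.Queue` stands in for a Service Bus queue here; it also shows the FIFO ordering the slide mentions.

```python
import queue

bus = queue.Queue()  # stand-in for an Azure Service Bus queue

def send(message):
    bus.put(message)   # the producer just enqueues and moves on

def receive():
    return bus.get()   # each message is handled by exactly one receiver

for i in range(3):
    send(f"order-{i}")

received = [receive() for _ in range(3)]
print(received)  # FIFO: messages come out in the order they went in
```

Because the queue absorbs bursts, the destination component can be scaled in or out independently of the load the sender generates.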
20. Storage account
• Depending on the region, maximum ingress (general-purpose v2) can vary from 30 to 60 Gbps
• Default maximum request rate per storage account – 20,000 requests per second
• If your application must exceed one of the scalability targets, then create multiple storage
accounts
• Blob type will affect the performance and scalability of your solution
• Connect with Azure CDN
• CDN can typically support much higher egress limits than a single storage account
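The "create multiple storage accounts" guidance can be sketched as hash-based distribution: spread blobs across N accounts with a stable hash so each account stays under its per-account request-rate target. The account names below are hypothetical.

```python
import hashlib

# Hypothetical storage account names used only for the sketch.
ACCOUNTS = ["appdata001", "appdata002", "appdata003"]

def account_for(blob_name: str) -> str:
    """Deterministically map a blob to one of the accounts."""
    digest = hashlib.sha256(blob_name.encode()).hexdigest()
    return ACCOUNTS[int(digest, 16) % len(ACCOUNTS)]

# The same blob name always routes to the same account.
print(account_for("images/logo.png") == account_for("images/logo.png"))  # True
```

A stable hash matters: if routing were random, reads could not find the account a blob was written to.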
21. Azure SQL Database
• Scale on the fly with minimal downtime
• Elastic pools can only be scaled manually
• Expect a short connection break when the scale-up/scale-down operation completes
• Database sharding - split your data into several databases and scale them independently
• SQL Managed Instance doesn’t support serverless mode
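Database sharding as described above can be sketched as a range-based shard map: each tenant id routes to the database holding its range, and each shard can then be scaled independently. The shard names and ranges are placeholders; a real app would typically use the Elastic Database client library for this.

```python
# Hypothetical shard map: (low tenant id, high tenant id, database name).
SHARD_MAP = [
    (0,    999,  "sqldb-shard-0"),
    (1000, 1999, "sqldb-shard-1"),
    (2000, 2999, "sqldb-shard-2"),
]

def shard_for(tenant_id: int) -> str:
    """Route a tenant to the shard whose range contains its id."""
    for low, high, db in SHARD_MAP:
        if low <= tenant_id <= high:
            return db
    raise KeyError(f"no shard covers tenant {tenant_id}")

print(shard_for(42))    # sqldb-shard-0
print(shard_for(1500))  # sqldb-shard-1
```

Range-based maps make it easy to split a hot range into a new shard later without rehashing every tenant.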
22. Azure PostgreSQL
• Scale separately vCores and storage
• The number of vCores can be scaled up or down – restarts the server
• Storage can only be scaled up, not down
• Perform scale operations during non-peak hours
23. Azure Cosmos DB
• Scale the throughput (RU/s) manually or autoscale
• When storage grows, autoscale increases the maximum RU/s accordingly
• Logical and physical partition
• Choose good partition key
[Diagram: Azure Cosmos DB multi-model support – document, key-value, column family, graph, Table API, and MongoDB – with turnkey global distribution, elastic scale-out of storage and throughput, guaranteed low latency at the 99th percentile, comprehensive SLAs, and five well-defined consistency models]
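The autoscale throughput behavior can be sketched numerically. As I understand it, Cosmos DB autoscale varies RU/s between roughly 10% of the configured maximum and the maximum itself; treat the exact band and billing rules as an assumption and check the current service documentation.

```python
# Sketch of Cosmos DB autoscale throughput (band assumed to be 10%-100%
# of the configured maximum; verify against current service docs).

def autoscale_range(max_rus: int) -> tuple:
    """Return the assumed (floor, ceiling) RU/s band for a given maximum."""
    return (max_rus // 10, max_rus)

def effective_rus(observed: int, max_rus: int) -> int:
    """Clamp observed usage into the autoscale band."""
    low, high = autoscale_range(max_rus)
    return min(max(observed, low), high)

print(autoscale_range(4000))    # (400, 4000)
print(effective_rus(150, 4000))   # below the floor -> 400
print(effective_rus(2500, 4000))  # within the band -> 2500
```

This is why the slide's partition-key advice matters: throughput is divided across physical partitions, so a skewed key can starve hot partitions even when total RU/s looks sufficient.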
24. Optimize performance
• Create a read-replica of the database
• Use a globally distributed database such as Azure Cosmos DB
• Use a CDN to cache content close to users
[Diagram: a CDN serves a cached image to the client in about 40 milliseconds, versus about 120 milliseconds when fetched from the origin server]
25. Azure Cache for Redis
• Fully Managed Service
• High Performance
• Built-in Reliability
• Choose an appropriate tier
• Flexible Scaling
• Scaling takes approximately 20 minutes
• Caches remain available during the scaling operation
• You can't scale from a higher pricing tier to a lower pricing tier
26. Containers
• Containers are lightweight and well suited to scale-out scenarios
• Azure Kubernetes Service (AKS) offers two levels of autoscale:
• Horizontal autoscale: Can be enabled on service containers to add more or fewer pod
instances within the cluster
• Cluster autoscale: Can be enabled on the agent VM instances running in a node pool to
dynamically add or remove VM instances
• Azure Container Instances
27. Application Insights
• Use Application Insights Profiler
• Performance visualization and details
• Smart detection automatically warns about potential problems
• Response time degradation
• Dependency duration degradation
• Slow performance pattern
28. Tips and tricks
• Autoscale
• Move load to blobs and CDN
• Use Redis for caching
• Load test and measure performance
• Use Async in code
• Use queues
• Database Sharding and Partitioning
https://learn.microsoft.com/en-us/training/modules/azure-well-architected-performance-efficiency/2-scaling-up-and-scaling-out
Scaling is the process of managing your resources to help your application meet a set of performance requirements
Virtual machine scale set – automated scaling
Azure SQL Database – sharding
Azure App Service – automated scaling
When exposing a Web Application towards the Internet, always place an Application Gateway or Azure Front Door in front of the Web App.
The Web Application Firewall (WAF) must be enabled on the Application Gateway whenever using public endpoints for web applications.
The WAF must be configured to use the latest OWASP Core Rule Set.
The WAF must be configured to “detect and block”, the so-called “prevention mode”.
It is acceptable to have the WAF configured in “detect and log” for fine-tuning purposes and for investigating issues for a temporary period. Once the aforementioned activities have concluded, “detect and block” must be activated again.
Always ensure that traffic to the backend systems is re-encrypted, once it has been terminated by the Application Gateway to ensure end-to-end encryption.
Transform http traffic to https using redirection
Application Gateways should have both private and public IP addresses
Enable Application Insights
Enable diagnostic settings
Azure Monitor autoscaling allows you to scale the number of running instances up or down, based on telemetry data (metrics).
By default, Resource Manager-based virtual machines (VMs) and virtual machine scale sets (VMSSs) emit basic (host-level) metrics. In addition, when you configure diagnostics data collection for an Azure VM and VMSS, the Azure diagnostic extension also emits guest-OS performance counters (commonly known as guest-OS metrics). You use all these metrics in autoscale rules.
You can also perform autoscale based on common web server metrics such as the HTTP queue length. Its metric name is HttpQueueLength. This table lists available server farm (Web Apps) metrics.
You can scale by Storage queue length, which is the number of messages in the storage queue. Storage queue length is a special metric and the threshold is the number of messages per instance. For example, if there are two instances and if the threshold is set to 100, scaling occurs when the total number of messages in the queue is 200. That can be 100 messages per instance, 120 and 80, or any other combination that adds up to 200 or more.
You can scale by Service Bus queue length, which is the number of messages in the Service Bus queue. Service Bus queue length is a special metric and the threshold is the number of messages per instance. For example, if there are two instances and if the threshold is set to 100, scaling occurs when the total number of messages in the queue is 200. That can be 100 messages per instance, 120 and 80, or any other combination that adds up to 200 or more.
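The worked example above (two instances, threshold 100, scaling at 200 total messages) reduces to one comparison: for queue-length metrics the threshold is per instance, so scale-out triggers when total messages reach the threshold multiplied by the current instance count. A minimal sketch:

```python
# Per-instance threshold check for Storage/Service Bus queue-length metrics.

def should_scale_out(total_messages: int, instances: int, threshold: int) -> bool:
    """Scale out when messages-per-instance meets the configured threshold."""
    return total_messages >= threshold * instances

print(should_scale_out(200, instances=2, threshold=100))  # True  (100 per instance)
print(should_scale_out(180, instances=2, threshold=100))  # False (90 per instance)
```

Any split of the 200 messages across the two instances (100/100, 120/80, and so on) produces the same result, since only the total is compared.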
Azure Monitor Autoscale applies only to Virtual Machine Scale Sets, Cloud Services, App Service - Web Apps, and API Management services.
A resource can have only one autoscale setting.
An autoscale setting can have one or more profiles, and each profile can have one or more autoscale rules.
An autoscale setting scales instances horizontally, which is moved out by increasing the instances and in by decreasing the number of instances. An autoscale setting has a maximum, minimum, and default value of instances.
An autoscale job always reads the associated metric to scale by checking if it has crossed the configured threshold for scale-out or scale-in.
All thresholds are calculated at an instance level. For example, "scale out by one instance when average CPU > 80% when instance count is 2", means scale-out when the average CPU across all instances is greater than 80%.
All autoscale failures are logged to the Activity Log. You can then configure an activity log alert so that you can be notified via email, SMS, or webhooks whenever there is an autoscale failure.
Similarly, all successful scale actions are posted to the Activity Log. You can then configure an activity log alert so that you can be notified via email, SMS, or webhooks whenever there is a successful autoscale action. You can also configure email or webhook notifications to get notified for successful scale actions via the notifications tab on the autoscale setting.
With autoscale, you can add the right amount of resources to handle increased load on your application. It also helps you save money by removing idle resources.
You specify a minimum and maximum number of instances to run, and the system will automatically add or remove VMs based on a set of rules.
Having a minimum ensures that an application is always running, even under no load.
Having a maximum limits your total possible hourly cost.
Ensure that the maximum and minimum values are different and have an adequate margin between them
If you have a setting that has minimum=2, maximum=2 and the current instance count is 2, no scale action can occur. Keep an adequate margin between the maximum and minimum instance counts, which are inclusive. Autoscale always scales between these limits.
Manual scaling is reset by autoscale min and max
If you manually update the instance count to a value above or below the maximum, the autoscale engine automatically scales back to the minimum (if below) or the maximum (if above). For example, you set the range between 3 and 6. If you have one running instance, the autoscale engine scales to three instances on its next run. Likewise, if you manually set the scale to eight instances, on the next run autoscale will scale it back to six instances. Manual scaling is temporary unless you also reset the autoscale rules.
Always use a scale-out and scale-in rule combination that performs an increase and decrease
If you use only one part of the combination, autoscale will only take action in a single direction (scale out, or in) until it reaches the maximum, or minimum instance counts defined in the profile. This is not optimal because ideally, you want your resource to scale up at times of high usage to ensure availability. Similarly, at times of low usage you want your resource to scale down, so you can realize cost savings.
Choose the appropriate statistic for your diagnostics metric
For diagnostics metrics, you can choose among Average, Minimum, Maximum and Total as a metric to scale by. The most common statistic is Average.
Choose the thresholds carefully for all metric types
We recommend carefully choosing different thresholds for scale-out and scale-in based on practical situations.
We do not recommend autoscale settings like the examples below with the same or very similar threshold values for out and in conditions:
Increase instances by 1 count when Thread Count >= 600
Decrease instances by 1 count when Thread Count <= 600
Scale-in estimation is intended to avoid "flapping" situations, where scale-in and scale-out actions continually go back and forth. Keep this behavior in mind when you choose the same thresholds for scale-out and scale-in.
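The flapping hazard can be shown with a small simulation (an illustrative model, not the real autoscale engine). With 1,200 total threads and identical out/in thresholds of 600 per instance, the instance count oscillates: 2 instances see 600 each and scale out, 3 instances see 400 each and scale in, forever.

```python
# Simulation of "flapping" when scale-out and scale-in share one threshold.

def next_count(count, total_threads, out_at=600, in_at=600):
    """One autoscale evaluation using the per-instance metric value."""
    value = total_threads / count
    if value >= out_at:
        return count + 1              # scale out
    if value <= in_at:
        return max(1, count - 1)      # scale in
    return count

counts = [2]
for _ in range(4):                    # workload steady at 1200 total threads
    counts.append(next_count(counts[-1], total_threads=1200))
print(counts)  # [2, 3, 2, 3, 2] - scale-out and scale-in ping-pong
```

Separating the thresholds (for example, out at 700 and in at 400) leaves a dead band where the count stays put, which is exactly what the best practice above recommends.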
Considerations for scaling threshold values for special metrics
For special metrics such as Storage or Service Bus Queue length metric, the threshold is the average number of messages available per current number of instances. Carefully choose the threshold value for this metric.
API documentation. Documentation of APIs enables calling clients to quickly integrate their solutions. API Management allows you to quickly expose the structure of your API to calling clients through modern standards like Open API. You can have more than one version of an API. With multiple versions, you can stage app updates as your consuming apps don't have to use the new version straight away.
Rate limiting access. If your API could potentially access a large amount of data, it's a good idea to limit the rate at which clients can request data. Rate limiting helps maintain optimal response times for every client. API Management lets you set rate limits as a whole or for specific individual clients.
Health monitoring. APIs are consumed by remote clients. So it can be difficult to identify potential problems or errors. API Management lets you view error responses and log files, and filter by types of responses.
Modern formats like JSON. APIs have used many different data exchange formats over the years from XML to CSV and many more. API Management enables you to expose these formats using modern data models like JSON.
Connections to any API. In many businesses, APIs are located across different countries and use different formats. API Management lets you add all of these disparate APIs into a single modern interface.
Analytics. As you develop your APIs, it's useful to see how often your APIs are being called and by which types of systems. API Management allows you to visualize this data within the Azure portal.
Security. Security is paramount when dealing with system data. Unauthorized breaches can cost companies money, time lost in reworking code, and reputational loss. Security tools that you can use with Azure API management include OAuth 2.0 user authorization, and integration with Azure Active Directory.
https://docs.microsoft.com/en-us/learn/modules/publish-manage-apis-with-azure-api-management/2-create-an-api-gateway
Static web apps are commonly built using libraries and frameworks like Angular, React, Svelte, or Vue. These apps include HTML, CSS, JavaScript, and image assets that make up the application. When using a traditional web server architecture, these files are served from a single server alongside any required API endpoints.
Additional Talk:
With Static Web Apps, developers can use modular and extensible patterns to deploy apps in minutes while taking advantage of the built-in scaling and cost-savings offered by serverless technologies. Pre-rendering static content (including HTML, CSS, JavaScript, and image files) and leveraging global content distribution to serve this content removes the need for traditional web servers generating the content with every request. Moving dynamic logic to serverless APIs unlocks dynamic scale that can adjust to demand in real time and can empower developers to access the benefits of microservices as they evolve and extend individual app components.
https://docs.microsoft.com/en-us/learn/modules/publish-app-service-static-web-app-api/1-introduction?ns-enrollment-type=LearningPath&ns-enrollment-id=learn.azure-static-web-apps&pivots=angular
https://techcommunity.microsoft.com/t5/apps-on-azure-blog/introducing-app-service-static-web-apps/ba-p/1394451
Azure Static Web Apps is a service that automatically builds and deploys full stack web apps to Azure from a code repository.
The workflow of Azure Static Web Apps is tailored to a developer's daily workflow. Apps are built and deployed based off code changes.
When you create an Azure Static Web Apps resource, Azure interacts directly with GitHub or Azure DevOps to monitor a branch of your choice. Every time you push commits or accept pull requests into the watched branch, a build is automatically run and your app and API is deployed to Azure.
https://docs.microsoft.com/en-us/azure/static-web-apps/overview?WT.mc_id=dotnet-00000-cephilli
Globally distributed web hosting puts static content like HTML, CSS, JavaScript, and images closer to your users
Integrated API support provided by Azure Functions
First-class GitHub and Azure DevOps integration where repository changes trigger builds and deployments.
Free SSL certificates, which are automatically renewed
Unique preview URLs for previewing pull requests
https://docs.microsoft.com/en-us/learn/modules/publish-app-service-static-web-app-api/1-introduction?ns-enrollment-type=LearningPath&ns-enrollment-id=learn.azure-static-web-apps&pivots=angular
Image link: https://docs.microsoft.com/en-us/shows/on-net/getting-started-with-azure-static-web-apps (@05:51)
A Service Bus queue is a simple temporary storage location for messages. A sending component adds a message to the queue. A destination component picks up the message at the front of the queue. Under ordinary circumstances, each message is received by only one receiver.
Queues decouple the source and destination components to insulate destination components from high demand.
Additional Talk:
A queue responds to high demand without needing to add resources to the system. However, for messages that need to be handled quickly, creating additional instances of your destination component can allow them to share the load. Each message is handled by only one instance.
https://docs.microsoft.com/en-us/learn/modules/implement-message-workflows-with-service-bus/2-choose-a-messaging-platform
The key advantages of Service Bus queues include:
Supports larger message sizes of 256 KB (standard tier) or 100 MB (premium tier) per message versus 64 KB for Azure Storage queue messages.
Supports both at-most-once and at-least-once delivery. Choose between a very small chance that a message is lost or a very small chance it's handled twice.
Guarantees first-in, first-out (FIFO) order. Messages are handled in the same order they are added. Note that although FIFO is the normal operation of a queue, the default FIFO pattern is altered if the organization sets up sequenced or scheduled messages or during interruptions like a system crash.
Can group multiple messages in one transaction. If one message in the transaction fails to be delivered, all messages in the transaction aren't delivered.
Supports role-based security.
Does not require destination components to continuously poll the queue.
https://docs.microsoft.com/en-us/learn/modules/implement-message-workflows-with-service-bus/2-choose-a-messaging-platform
Image link: https://docs.microsoft.com/en-us/azure/service-bus-messaging/service-bus-quickstart-portal
A storage account is a container that groups a set of Azure Storage services together. Only data services from Azure Storage can be included in a storage account (Azure Blobs, Azure Files, Azure Queues, and Azure Tables). The following illustration shows a storage account containing several data services.
A storage account is an Azure resource and is part of a resource group. The following illustration shows an Azure subscription containing multiple resource groups, where each group contains one or more storage accounts.
https://docs.microsoft.com/en-us/learn/modules/create-azure-storage-account/2-decide-how-many-storage-accounts-you-need
As a fully managed service, Azure Cosmos DB takes database administration off your hands with automatic management, updates and patching. It also handles capacity management with cost-effective serverless and automatic scaling options that respond to application needs to match capacity with demand.
Azure Cosmos DB is a globally distributed and elastically scalable database. It has a guaranteed low latency that is backed by a comprehensive set of Service Level Agreements (SLAs). Consistency can sometimes be an issue when you are working with distributed systems, but Azure Cosmos DB alleviates this situation by offering you five different consistency levels: strong, bounded staleness, session, consistent prefix, and eventual.
All of the above is supported by Azure Cosmos DB's multi-model approach, which provides you with the ability to use document, key-value, wide-column, or graph-based data.
The final choice you have is how to access and manipulate your data. Azure Cosmos DB was built to support multiple different models, and you can continue to use industry standard APIs if they are already part of your application or database design.
https://docs.microsoft.com/en-us/azure/cosmos-db/introduction#:~:text=As%20a%20fully%20managed%20service,to%20match%20capacity%20with%20demand.
https://docs.microsoft.com/en-us/learn/modules/choose-api-for-cosmos-db/2-identify-the-technology-options
Image link: https://devblogs.microsoft.com/cosmosdb/
Azure Cache for Redis
Fully managed, open source–compatible in-memory data store to power fast, scalable applications
Fully managed service
Enjoy a fully managed version of the popular open-source Redis server with a turnkey caching solution. Harness the benefits without the need to become an expert in deploying and managing it.
High performance
Azure Cache for Redis achieves superior throughput and latency performance by storing data in memory instead of on disk. It consistently serves read and write requests within single-digit milliseconds, delivering exceedingly fast cache operations to scale data tiers as application loads increase.
Built-in reliability
Standard and Premium tiers include a redundant pair of virtual machines (VMs) configured for data replication to ensure maximum reliability. Premium caches also can replicate data across Azure regions as part of an application’s disaster-recovery implementation.
Flexible scaling
With three tiers, Azure Cache for Redis fits your needs. Start with any cache size and scale up to a larger one later without any service downtime or scale down a cache within the same tier.
Enterprise-grade security
Azure Cache for Redis supports industry-standard SSL to secure your data in transit and Azure Storage disk encryption at rest. Premium caches can be placed in your own Azure Virtual Network (VNet) so that you can further restrict traffic routes to and from your cache through your VNet topology and access policies.
Open source compatible
At its core, Azure Cache for Redis is backed by the open-source Redis server and natively supports data structures such as strings, hashes, lists, sets and sorted sets. If your application uses Redis, it will work as-is with Azure Cache for Redis.
Source:
https://azure.microsoft.com/en-in/services/cache/