Defining scalability
Scalability is the ability to handle increased workload
by repeatedly applying a cost-effective strategy for
extending a system’s capacity.
(CMU paper, 2006)
How well a solution to some problem will work when
the size of the problem increases. When the size
decreases, the solution should still fit. (dictionary.com
and Theo Schlossnagle, 2006)
Disposability Maximize robustness with
fast startup and graceful
shutdown
Disposable processes
Graceful shutdown on
SIGTERM
Handling sudden death:
robust queue backend
Startup and
Shutdown
Automate all the things
Chef
Docker
Gold image based
deployment
Immutable
Handling tasks before
shutdown
Backing Services Treat backing services as
attached resources
No distinction between
local and third-party
services
Easily swap out resources
Export services via port
binding
Become the backing
service for another app
Processes,
concurrency
Stateless processes (not
even sticky sessions)
Process types by work type
We <3 Linux processes
Shared-nothing: adding
concurrency is safe
Process distribution
spanning machines
Statelessness Store everything in a
datastore
Aggregate data
Chandra
Aggregator / map &
reduce
Scalable datastores
Handling user sessions
Monitoring Application state and
metrics
Dashboards
Alerting
Health
Remove failing nodes
Capacity
Act on trends
Monitoring Metrics collection
Graphite, New Relic
Self-aware checks
Cluster state
Zookeeper, Consul
Scaling decision types
Capacity amount
Graph derivative
App requests
Load Balance and
Resource
Allocation
Load Balance: distribute
tasks
Utilize machines
efficiently
VM compatible apps
Flexibility
Adapting to available
resources
Load Balance DNS or API
App level balance
Uniform entry point or
proxy
Balance decisions
Load
Zookeeper state
Resource policies
Service
Separation
Failure is inevitable
Protect from failing
components
Cascading failure
Fail fast
Decoupling
Asynchronous operations
Message queues
Extras Debugging features
Logs
Clojure / JS consoles
Runtime configuration
via env
Scaling API
Integrating several
cloud providers
Automatic start / stop
Reading
Scalable Internet Architectures by Theo Schlossnagle
The 12-factor App: http://12factor.net/
Carnegie Mellon Paper: http://www.sei.cmu.edu/reports/06tn012.pdf
Circuit Breaker: http://martinfowler.com/bliki/CircuitBreaker.html
Release It! by Michael T. Nygard
Definition
Requirements coming from 12-factor, and some added by us
Some more detail and tools on selected requirements
30 day viewer graph. Clear peaks -> need for scaling
Quick description of the streaming stack, roles of components, how they require scaling
- Transcontroller/transcoder scaling
- UMS scaling
Carnegie Mellon University paper by Charles B. Weinstock, John B. Goodenough: On System Scalability
LINFO: The Linux Information Project http://www.linfo.org/
Next: principles
Example: calling imagemagick or curl from code – they might be there or might not be
Bundle everything into the app instead
Disposable process: they can be started or stopped at a moment’s notice
For a web process, graceful shutdown is achieved by ceasing to listen on the service port (thereby refusing any new requests), allowing any current requests to finish, and then exiting. Implicit in this model is that HTTP requests are short (no more than a few seconds), or in the case of long polling, the client should seamlessly attempt to reconnect when the connection is lost.
For a worker process, graceful shutdown is achieved by returning the current job to the work queue.
Docker: build images from dockerfile, deploy from repository
Tasks before shutdown: moving jobs, log collection, sleep
A backing service is any service the app consumes over the network as part of its normal operation. Examples include datastores (such as MySQL or CouchDB), messaging/queueing systems (such as RabbitMQ or Beanstalkd), SMTP services for outbound email (such as Postfix), and caching systems (such as Memcached).
Put a resource locator in the config only – environment variables
Example: Easily swap out a local mysql to a remote service
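The swap works because only a resource handle lives in config: changing `DATABASE_URL` repoints the app from a local MySQL to a remote one with no code change (a sketch; `DATABASE_URL` is the conventional 12-factor name, and the default URL is illustrative):

```python
import os
from urllib.parse import urlparse

def database_config(env):
    # The only thing the app knows about its database is a URL handle
    # taken from the environment, e.g.:
    #   DATABASE_URL=mysql://user:pass@db.example.com:3306/app
    url = urlparse(env.get("DATABASE_URL", "mysql://localhost:3306/app"))
    return url.hostname, url.port or 3306

host, port = database_config(os.environ)
```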
The app does not rely on runtime injection of a webserver into the execution environment to create a web-facing service. The web app exports HTTP as a service by binding to a port, and listening to requests coming in on that port.
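Port binding in a minimal sketch: the app itself opens the socket and serves HTTP, with no webserver injected by the environment (standard library only; the `PORT` variable is the common convention, and the default of 0 is used here just so the sketch can bind anywhere):

```python
import os
from http.server import BaseHTTPRequestHandler, HTTPServer

class Handler(BaseHTTPRequestHandler):
    def do_GET(self):
        # The app exports HTTP as a service on its own port.
        self.send_response(200)
        self.end_headers()
        self.wfile.write(b"ok")

def make_server():
    # Bind the port given by the environment; 0 lets the OS pick a free one.
    port = int(os.environ.get("PORT", 0))
    return HTTPServer(("127.0.0.1", port), Handler)

# make_server().serve_forever()  # a real app would block here
```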
One app can become the backing service for another app, by providing the URL to the backing app as a resource handle in the config for the consuming app
Handle diverse workloads by assigning each type of work to a process type. For example, HTTP requests may be handled by a web process, and long-running background tasks handled by a worker process
An individual VM can only grow so large (vertical scale), so the application must also be able to span multiple processes running on multiple physical machines.
Aggregate everything within the app and write it out in bulk – be careful about write frequency; must not lose too much data on a crash
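The aggregate-then-write-in-bulk idea can be sketched as a buffer that flushes to the datastore at a size threshold, trading write frequency against how much data a crash can lose (a sketch; the class name and threshold are illustrative):

```python
class Aggregator:
    """Collect data points in memory and write them out in bulk."""

    def __init__(self, store, flush_at=100):
        self.store = store        # any callable that persists a batch
        self.flush_at = flush_at  # smaller -> more writes, less loss on crash
        self.buffer = []

    def add(self, point):
        self.buffer.append(point)
        if len(self.buffer) >= self.flush_at:
            self.flush()

    def flush(self):
        if self.buffer:
            self.store(self.buffer)
            self.buffer = []
```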
Aggregator / map-reduce
Redis: scales reads, writes problematic
Cassandra: quick scaling questionable
Aerospike: scales reads and writes, working together with their eng team
User sessions: persistent connection, NIO+
Alerting -> openduty
Two important groups: Health vs capacity
Report everything to graphite, constantly check graph trends automatically
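Reporting to Graphite uses its plaintext protocol: one `metric value timestamp` line per data point, sent over TCP to the carbon port (a sketch; the host name and metric name are assumptions, port 2003 is Graphite's default plaintext port):

```python
import socket
import time

def format_metric_line(metric, value, timestamp):
    # Graphite plaintext protocol: "<metric> <value> <timestamp>\n"
    return f"{metric} {value} {timestamp}\n"

def send_to_graphite(metric, value, host="graphite.local", port=2003):
    line = format_metric_line(metric, value, int(time.time()))
    with socket.create_connection((host, port), timeout=5) as sock:
        sock.sendall(line.encode("ascii"))
```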
Apps are self-aware, they know their health
App instances report into Zookeeper and thus know about each other
Central logic can request resource based on capacity or graph, app can request based on self-check or zookeeper
Zookeeper, Consul: why use them, what their advantages are
Load balancing distributes workloads across multiple computing resources
Flexibility: can increase or decrease its own size, example: Threadpools
Adapting to CPU, RAM, disk, network
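The threadpool example of flexibility can be sketched as a pool that adjusts its own worker-count target from observed load (a minimal sketch, not a full pool implementation; the thresholds are illustrative):

```python
import threading

class ResizablePool:
    """Tracks a worker-count target that scaling logic adjusts at runtime."""

    def __init__(self, size):
        self.size = size
        self.lock = threading.Lock()

    def adapt(self, queue_depth):
        # Grow when work piles up, shrink when idle (illustrative thresholds).
        with self.lock:
            if queue_depth > self.size * 10:
                self.size += 1
            elif queue_depth == 0 and self.size > 1:
                self.size -= 1
            return self.size
```

The same adapt-to-load shape applies to CPU, RAM, disk, and network: measure, compare against the current allocation, grow or shrink.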
App level: transcontroller selects transcoder
App level balance with proxy can be SPOF, careful
Resource policies: even distribution, keep large chunks free for possible large tasks (transcoder use case), group requests together on some attribute (pro, etc)
Failure is inevitable because of: large numbers, hardware issues, independent networks
Decoupling: serving one request should not wait on others
Hystrix by Netflix 2011/12
Circuit Breaker: Martin Fowler post from 2014
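The Circuit Breaker pattern can be sketched minimally: after a threshold of consecutive failures the breaker opens and subsequent calls fail fast instead of waiting on the broken component, then a timeout lets one trial call through (a sketch of the pattern as described in Fowler's post, not the Hystrix API; names and defaults are illustrative):

```python
import time

class CircuitBreaker:
    def __init__(self, max_failures=3, reset_timeout=30.0):
        self.max_failures = max_failures
        self.reset_timeout = reset_timeout
        self.failures = 0
        self.opened_at = None

    def call(self, func, *args):
        if self.opened_at is not None:
            if time.time() - self.opened_at < self.reset_timeout:
                raise RuntimeError("circuit open: failing fast")
            self.opened_at = None  # half-open: let one trial call through
        try:
            result = func(*args)
        except Exception:
            self.failures += 1
            if self.failures >= self.max_failures:
                self.opened_at = time.time()
            raise
        self.failures = 0
        return result
```

Failing fast here is what prevents cascading failure: callers stop queuing up behind a dead dependency.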
Service decoupling example: inserting layers between DB and UMS -> RGW. Then another layer between RGW and UMS -> Queue
Logs: logs as an event stream on stdout (factor #11), collect / transport / process
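Treating logs as a stream means the app just writes events to stdout and leaves collection, transport, and processing to the environment (a minimal sketch; the logger name and format shown are one common setup, not the only one):

```python
import logging
import sys

# The app never manages logfiles: it writes its event stream to stdout
# and lets the execution environment collect, transport, and process it.
handler = logging.StreamHandler(sys.stdout)
handler.setFormatter(logging.Formatter("%(asctime)s %(levelname)s %(message)s"))

logger = logging.getLogger("app")
logger.addHandler(handler)
logger.setLevel(logging.INFO)

logger.info("stream started")
```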
Scaling API: Other considerations: price, network line to the cloud provider, instance type (spot vs normal)
Openstack, Ganeti