How to scale up, out or down in Windows Azure - Webinar

1. How to scale up, out or down in Windows Azure. Juan De Abreu, VP - Delivery Director, jdeabreu@getcs.com

2. #CSwebinar

3. Outline. Scalability: achieving linear scale; scale up vs. scale out in Windows Azure; choosing VM sizes. Caching: approaches to caching; cache storage. Elasticity: scale out, scale back; automation of scaling.

4. A Primer on Scale. Scalability is the ability to add capacity to a computing system so that it can process more work.

5. A Primer on Scalability. Vertical (scale up): add more resources to a single computation unit, i.e. buy a bigger box; or move a workload to a computation unit with more resources, e.g. Windows Azure Storage moving a partition. Horizontal (scale out): add computation units and have them act in concert, splitting the workload across multiple computation units.

6. Vertical vs. Horizontal. For small scenarios, scale up is cheaper and the code 'just works'. For larger scenarios, scale out is the only solution: scaling up hits massive diseconomies of scale (1 x 64-way server costs far more than 64 x 1-way servers), and shared resource contention becomes a problem. Scale out offers the promise of linear, infinite scale.

7. [Chart: throughput against number of computation units.] Roughly linear scale, i.e. the additional throughput achieved by each additional unit remains constant. Non-linear scale, i.e. the additional throughput achieved by each additional unit decreases as more are added.
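
The two definitions on this slide can be checked against measured throughput. The following is a minimal Python sketch, not something from the deck: the function names, the 10% tolerance and the sample numbers are all assumptions chosen for illustration.

```python
def marginal_gains(throughput):
    """Additional throughput contributed by each added computation unit."""
    return [later - earlier for earlier, later in zip(throughput, throughput[1:])]


def is_roughly_linear(throughput, tolerance=0.1):
    """True if every marginal gain stays within `tolerance` of the first gain."""
    gains = marginal_gains(throughput)
    first = gains[0]
    return all(abs(gain - first) <= tolerance * first for gain in gains)


# Hypothetical measurements: requests/second with 1, 2, 3 and 4 units.
linear = [100, 198, 301, 400]
diminishing = [100, 180, 240, 280]

print(is_roughly_linear(linear))       # gains stay close to constant
print(is_roughly_linear(diminishing))  # gains shrink with each added unit
```

Running a check like this on load-test results shows whether adding instances is still paying off at the same rate.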

8. Scalability != Performance. Often you will sacrifice raw speed for scalability. For example, ASP.NET session state: in-process session state is faster but tied to a single server, while SQL Server session state is slower but can be shared by many instances.

9. Achieving Linear Scale Out. Reduce or eliminate shared resources. Minimize reliance on transactions or transactional-type behaviour. Use homogeneous, stateless computation nodes. We can then use simple work distribution methods (load balancers, queue distribution) and rely less on expensive hardware H/A.

10. Units of Scale. Consolidation of roles provides more redundancy for same. Create as many roles as you need 'knobs' to adjust scale. [Diagram labels: Web Driven Role, WCF Role, Web Site Role, Cache Build Role, Clean Up Role, Queue Driven Role.] Diagram captions: in one layout, loss of an instance results in just 25% capacity loss in the web site; in the other, loss of an instance results in 50% capacity loss in the web site.
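
The 25% and 50% figures follow from how many instances share the load. A minimal Python sketch of the arithmetic, assuming load is spread evenly across identical instances (the function name is hypothetical):

```python
def capacity_loss_percent(instances):
    """Share of capacity lost when one of `instances` identical instances fails."""
    if instances < 1:
        raise ValueError("need at least one instance")
    return 100.0 / instances


print(capacity_loss_percent(4))  # four instances serving the web site
print(capacity_loss_percent(2))  # two instances serving the web site
```

Four instances behind the web site give the 25% case and two give the 50% case.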

11. VM Size in Windows Azure. Windows Azure supports various VM sizes. ~800mb/s NIC shared across the machine. Set in the service definition (*.csdef); all instances of a role will be equi-sized: <WorkerRole name="myRole" vmsize="ExtraLarge">

12. Remember: if it doesn't run faster on multiple cores on your desktop, it's not going to run faster on multiple cores in the cloud!

13. Choosing Your VM Size. Don't just throw big VMs at every problem: scale-out architectures have natural parallelism. Test various configurations under load. Some scenarios will benefit from more cores: where moving data costs more than the parallel overhead (e.g. video processing), stateful services, and database servers requiring full network bandwidth.

14. Caching

15. Caching. Caching can improve both performance and scalability: moving data closer to the consumer (web/worker) improves performance, and reducing load on the hard-to-scale data tier improves scalability. Caching is the easiest way to add performance and scalability to your application. In Windows Azure, caching will save you money!

16. Caching Scenario: Website UI Images. Website UI images: largely static data; included in every page. Goal, a better UI: serve content once; avoid a round trip unless content changes; minimise traffic over the wire; fewer storage transactions; lower load on web roles.

17. Caching Scenario: RSS Feeds. Regular RSS feed: data delivered from database/storage; large content payload (>1 MB); data changes irregularly; cost determined by client voracity. Goal, a better RSS feed: minimise traffic over the wire; fewer storage transactions; fewer hits on the database.

18. Caching Strategies: client-side caching; static content generation.

19. Client Side Caching. [Diagram: Client, Web Roles, Worker Roles, BLOBs, Queues, Tables, SQL Azure.]

20. Client Caching - ETags. ETag == soft caching. A header added on the HTTP response (ETag: "ABCDEFG"). The client does a conditional HTTP GET (If-None-Match: "ABCDEFG"), and the server returns content only if the ETag no longer matches. Implemented natively by Windows Azure Storage. Supports client-side caching; also used for optimistic concurrency control.
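
The conditional GET exchange can be sketched as follows. This is a minimal Python illustration under stated assumptions, not the deck's or Windows Azure Storage's implementation: `make_etag` and `conditional_get` are hypothetical names, and deriving the ETag from a SHA-256 hash of the content is just one possible scheme.

```python
import hashlib


def make_etag(body: bytes) -> str:
    """Derive a quoted ETag from the content (assumed scheme: SHA-256 prefix)."""
    return '"' + hashlib.sha256(body).hexdigest()[:16] + '"'


def conditional_get(body: bytes, if_none_match=None):
    """Return (status, headers, payload) for a GET on `body`.

    If the client's If-None-Match value equals the current ETag, answer
    304 Not Modified with an empty payload; otherwise send the content.
    """
    etag = make_etag(body)
    if if_none_match == etag:
        return 304, {"ETag": etag}, b""
    return 200, {"ETag": etag}, body


# First request has no ETag; the repeat request sends the one it was given.
status, headers, payload = conditional_get(b"<rss>v1</rss>")
repeat_status, _, repeat_payload = conditional_get(b"<rss>v1</rss>", headers["ETag"])
```

The repeat request still costs a round trip, but no content is transferred while the ETag matches.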

21. Client Caching - ETags. Benefits: prevents the client downloading unnecessary data; out-of-the-box support for simple 'static content' scenarios. Problems: still requires a round trip to the server; may require execution of server-side code to re-create the ETag before checking: string etag = Request.Headers["If-None-Match"]; if (String.Compare(etag, GetLastBlogPostIDAzTable()) == 0) { Response.StatusCode = 304; /* Not Modified: the client's cached copy is still current */ return; }

22. Client Caching – Cache-Control. Cache-Control: max-age == hard caching. A header added on the HTTP response (Cache-Control: max-age=2592000): the client may cache the file without further requests for 30 days and will not re-check on every request. Very useful for static files such as header_logo.png. Used to determine TTL on CDN edge nodes. Set this on a blob using x-ms-blob-cache-control.
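
The max-age value is simply the cache lifetime in seconds. A minimal Python sketch of the conversion (the function name is hypothetical):

```python
SECONDS_PER_DAY = 24 * 60 * 60


def cache_control_max_age(days):
    """Build a Cache-Control header value for a lifetime given in whole days."""
    return f"Cache-Control: max-age={days * SECONDS_PER_DAY}"


print(cache_control_max_age(30))
```

For 30 days this yields the 2592000 seconds shown on the slide.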

23. Client Caching – Cache-Control. Benefits: prevents unnecessary HTTP requests; prevents unnecessary downloads. Problems: what if files do change in the 30 days? Windows Azure technique: put static files in Blob storage and use Cache-Control + URL flipping. Simple randomization == simple but no versioning: <img src="http://*.blob.*/Container/header_logo.png?random=<rnd>" />. Container-level flipping == simple but more expensive: <img src="http://*.blob.*/Containerv1.0/header_logo.png" /> then <img src="http://*.blob.*/Containerv2.0/header_logo.png" />. Snapshot-level flipping == more complex but lower cost: <img src="http://*.blob.*/Container/header_logo.png?snapshot=<DT1>" /> then <img src="http://*.blob.*/Container/header_logo.png?snapshot=<DT2>" />.
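
URL flipping means changing the URL whenever the content changes, so a long max-age never serves a stale file. A minimal Python sketch of the container-level and snapshot-level variants follows. The base address is a placeholder because the slide elides the real host, and the function names are hypothetical.

```python
from urllib.parse import urlencode

# Placeholder host: the slide elides the real storage account address.
BASE = "http://account.blob.example"


def container_flip_url(container, version, blob_name):
    """Container-level flipping: each release lives in its own container."""
    return f"{BASE}/{container}v{version}/{blob_name}"


def snapshot_flip_url(container, blob_name, snapshot):
    """Snapshot-level flipping: same blob URL, different snapshot parameter."""
    return f"{BASE}/{container}/{blob_name}?{urlencode({'snapshot': snapshot})}"


old_logo = container_flip_url("Container", "1.0", "header_logo.png")
new_logo = container_flip_url("Container", "2.0", "header_logo.png")
```

Pages are regenerated with the new URL on each release, so clients fetch the new file once and cache it again for the full lifetime.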

24. Static Content Generation. [Diagram: Web Roles, Worker Roles, BLOBs, Queues, Tables, SQL Azure.]

25. Static Content Generation. Generate content periodically in a worker role: can spin up workers just for generation, or generate as a triggered async operation. Content may be full pages, resources (CSS sprites, PDF/XPS, images etc.) or content fragments. Push static content into Blob storage and serve it directly out of Blob storage. May also be able to use persistent local storage.

26. Static Content Generation. Benefits: reduces load on web roles; potentially reduces load on the data tier; improves response times; can be combined with Cache-Control and ETags. Problems: need to deal with stale data (manage/refresh, or ignore).

27. A Better RSS Feed? Build a standard RSS feed in a web role: generate content dynamically from storage, serialize it as RSS using feed formatters, and place it on an obfuscated (hidden) URL. Build a worker role to poll the hidden RSS feed: retrieve RSS content at certain intervals or on an event, and push the content into a blob if it has changed. Serve RSS to users from Blob storage: take advantage of ETags; zero load on the database or RSS tables to serve content.
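
The worker's "push content into a blob if changed" step can be sketched as below. This is a minimal Python illustration under stated assumptions: `BlobStoreStub` is an in-memory stand-in rather than the Windows Azure Storage client, and comparing SHA-256 digests is one assumed way to detect a change.

```python
import hashlib


class BlobStoreStub:
    """In-memory stand-in for Blob storage (hypothetical, not the Azure SDK)."""

    def __init__(self):
        self.blobs = {}
        self.writes = 0

    def put(self, name, data):
        self.blobs[name] = data
        self.writes += 1


def publish_if_changed(store, name, content, last_digest):
    """Write `content` only when its digest differs; return the new digest."""
    digest = hashlib.sha256(content).hexdigest()
    if digest != last_digest:
        store.put(name, content)
    return digest


# One polling cycle: the same feed polled twice is written only once.
store = BlobStoreStub()
digest = publish_if_changed(store, "feed.xml", b"<rss>post 1</rss>", None)
digest = publish_if_changed(store, "feed.xml", b"<rss>post 1</rss>", digest)
```

Skipping unchanged writes keeps storage transactions down and leaves the blob's ETag stable, so client caches stay valid.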

28. BLOBs vs. Compute Instances. BLOB storage: disk based; 15c/GB/month; 1c/10,000 requests. Compute instances: RAM and disk based; 12c/hr per 1 GB RAM, per 250 GB disk. Dedicated compute cache roles must serve at least 120,000 cache requests per hour to be cheaper than Windows Azure storage. Outside the USA and Europe, use the CDN for caching due to much lower bandwidth costs.
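
The 120,000 requests per hour break-even follows directly from the two prices on this slide. A minimal Python sketch of the arithmetic, using the 2010 prices quoted above and ignoring storage capacity and bandwidth charges (constant and function names are hypothetical):

```python
COMPUTE_CENTS_PER_HOUR = 12          # one compute instance, from the slide
STORAGE_CENTS_PER_BATCH = 1          # storage transactions, from the slide
REQUESTS_PER_BATCH = 10_000


def storage_cost_cents(requests):
    """Transaction cost of serving `requests` straight from Blob storage."""
    return requests * STORAGE_CENTS_PER_BATCH / REQUESTS_PER_BATCH


def break_even_requests_per_hour():
    """Hourly request count at which storage transactions cost as much as one instance."""
    return COMPUTE_CENTS_PER_HOUR * REQUESTS_PER_BATCH // STORAGE_CENTS_PER_BATCH


print(break_even_requests_per_hour())
```

Below that request rate, paying per storage transaction is cheaper than keeping a dedicated cache instance running.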

29. Elastic Scale Out

30. Elastic Cloud Workflow Patterns. [Four charts of compute usage over time, each shown against average usage.] "On and Off" (with inactivity periods): on and off workloads (e.g. batch jobs); over-provisioned capacity is wasted; time to market can be cumbersome. "Growing Fast": successful services need to grow/scale; keeping up with growth is a big IT challenge; cannot provision hardware fast enough. "Unpredictable Bursting": unexpected/unplanned peak in demand; sudden spike impacts performance; can't over-provision for extreme cases. "Predictable Bursting": services with micro-seasonality trends; peaks due to periodic increased demand; IT complexity and wasted capacity.

32. Faster availability

34. Requires management: human or automated

35. Pre-emptive or metric driven

36. Head Room in Windows Azure. Web roles: run additional web roles to handle additional load before performance degrades. Worker roles: if possible, just buffer into queues; head room will be driven by the tolerable level of latency; start additional roles only if queues are not clearing; use generic workers to pool resources.
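
For worker roles, "queues not clearing" can be turned into a simple test against the tolerable latency. A minimal Python sketch, assuming the backlog drains at the currently measured processing rate (the function name and the numbers are hypothetical):

```python
def needs_more_workers(queue_length, messages_per_second, tolerable_latency_seconds):
    """True if the current backlog cannot be drained within the tolerable latency."""
    if messages_per_second <= 0:
        # Nothing is being processed: any backlog at all is a problem.
        return queue_length > 0
    estimated_drain_seconds = queue_length / messages_per_second
    return estimated_drain_seconds > tolerable_latency_seconds


# 600 queued messages at 10 msg/s drain in 60 s, within a 120 s tolerance.
print(needs_more_workers(600, 10, 120))
# 3000 queued messages at 10 msg/s need 300 s, so more workers are warranted.
print(needs_more_workers(3000, 10, 120))
```

The tolerable latency is a business decision; the longer a job may wait, the more the queue alone can absorb before extra instances are needed.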

37. Head Room in Windows Azure Services. Windows Azure Storage: storage nodes serve many partitions; each partition is served by a single storage node; the fabric can move a partition to a different storage node; this is opaque to the Windows Azure customer. SQL Azure: the non-deterministic throttle gives little indication; running extra instances requires DB sharding.

38. Adding Capacity in Windows Azure. Web roles/worker roles: enable more instances (API or *.config), since editing the instance count in config leaves existing instances running; changing to larger VMs will require a redeploy. Windows Azure Storage: opaque to the user; partition aggressively; can 'heat up' a partition to encourage scale up.

39. Adding Capacity in SQL Azure. Add more databases (more partitions). Very difficult to achieve mid-stream: requires moving hot data and maintaining consistency across multiple DBs without DTC. Will depend on the partitioning strategy.

40. Rule Based Scaling. Use the Service Management and Diagnostics APIs. On/off and predictable bursting: time-based rules. Unpredictable demand and fast growth: monitor metrics and react accordingly. [Diagram, all via the Diagnostics & Management APIs: Monitor inputs (historical data, transactions, perf counters, business KPIs) -> Evaluate biz rules (latency too high/low, how much $ spent, are we at the limit, predicted load) -> Action (+/- instance count, deploy new service, increase queues, send notifications).]
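
The monitor, evaluate, act loop can be sketched as a small rule function. This is a minimal Python illustration, not the Service Management API: the `Inputs` fields, the thresholds and the action strings are all assumptions standing in for real metrics and real management calls.

```python
from dataclasses import dataclass


@dataclass
class Inputs:
    latency_ms: float   # monitored response time
    spend: float        # money spent so far this month
    instances: int      # current instance count


def decide(inputs, high_ms=500.0, low_ms=100.0, budget=1000.0,
           min_instances=2, max_instances=10):
    """Evaluate the business rules and return the action to take."""
    if inputs.latency_ms > high_ms:
        if inputs.instances >= max_instances:
            return "notify: at instance limit"
        if inputs.spend >= budget:
            return "notify: budget reached"
        return "add instance"
    if inputs.latency_ms < low_ms and inputs.instances > min_instances:
        return "remove instance"
    return "no action"


print(decide(Inputs(latency_ms=800, spend=200, instances=4)))
```

In a real deployment the returned action would be carried out through the management API or sent as a notification; here it is only a string so the rules can be tested in isolation.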

41. Monitor Metrics. Primary metrics (actual work done): requests per second; queue messages processed per interval. Secondary metrics: CPU utilization; queue length; response time. Derivative metrics: rate of change of queue length. Use 'historical' data to help predict requirements.
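
A derivative metric such as the rate of change of queue length can be computed from timestamped samples. A minimal Python sketch, assuming samples are (seconds, queue length) pairs in time order (the function name and sample values are hypothetical):

```python
def rate_of_change(samples):
    """Average change in queue length per second across the sample window.

    `samples` is a time-ordered list of (timestamp_seconds, queue_length).
    A positive result means the queue is growing; negative means it is draining.
    """
    (start_time, start_length) = samples[0]
    (end_time, end_length) = samples[-1]
    if end_time == start_time:
        raise ValueError("samples must span a non-zero time window")
    return (end_length - start_length) / (end_time - start_time)


growing = [(0, 100), (60, 160), (120, 220)]
draining = [(0, 300), (60, 240)]
print(rate_of_change(growing), rate_of_change(draining))
```

A queue that is long but draining may need no action, while a short queue that is growing quickly may justify adding capacity early.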

42. Gathering Metrics. Use Microsoft.WindowsAzure.Diagnostics.*; capture various metrics via the Management API. Sources: diagnostics infrastructure logs, event logs, performance counters, IIS logs. May need to smooth/average some measures. Remember the cost of gathering data, both performance and financial: would you use perf counters 24/7 on a production system? http://technet.microsoft.com/en-us/library/cc938553.aspx
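
Smoothing a noisy measure can be as simple as a moving average over the last few samples. A minimal Python sketch, with the class name and window size chosen only for illustration:

```python
from collections import deque


class MovingAverage:
    """Average of the most recent `window` samples."""

    def __init__(self, window):
        self.values = deque(maxlen=window)  # old samples fall off automatically

    def add(self, value):
        """Record a sample and return the current smoothed value."""
        self.values.append(value)
        return sum(self.values) / len(self.values)


cpu = MovingAverage(window=3)
for sample in (10, 20, 90, 10):
    smoothed = cpu.add(sample)
```

A single spike (the 90 above) moves the smoothed value far less than the raw reading, which helps scaling rules avoid reacting to momentary noise.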

43. Evaluating Business Rules. Are requests taking too long? Do I have too many jobs in my queue? How much money have I spent this month? Could write these into code, build some sort of rules engine, or use the WF rules engine.

44. Take Action. Add/remove instances: use the Service Management API. Change role size: requires a change to *.csdef; most suited to worker roles. Send notifications: email, IM. Manage momentum: be careful not to overshoot.
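
One way to manage momentum is to limit how far and how often the instance count may change. The deck does not prescribe a mechanism, so the following Python sketch is an assumption: a cooldown period plus a maximum step size, with hypothetical names throughout.

```python
class CooldownScaler:
    """Applies scaling requests with a step limit and a cooldown between actions."""

    def __init__(self, cooldown_seconds, max_step=1):
        self.cooldown_seconds = cooldown_seconds
        self.max_step = max_step
        self.last_action_time = None

    def apply(self, now, current_instances, desired_delta):
        """Return the new instance count after applying the limits."""
        if desired_delta == 0:
            return current_instances
        in_cooldown = (self.last_action_time is not None
                       and now - self.last_action_time < self.cooldown_seconds)
        if in_cooldown:
            return current_instances  # let the previous change take effect first
        step = max(-self.max_step, min(self.max_step, desired_delta))
        self.last_action_time = now
        return current_instances + step


scaler = CooldownScaler(cooldown_seconds=300)
count = scaler.apply(now=0, current_instances=2, desired_delta=5)
```

New instances take time to start and absorb load, so waiting before the next change avoids adding more capacity than the spike actually needs.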

45. Summary. Designing for multiple instances provides scale out, availability and elasticity options. Caching should be a key component of any Windows Azure application. Various options for variable load: spare capacity; scale out/back; automation possible.

46. Resources. www.msteched.com/Australia: sessions on-demand & community. www.microsoft.com/australia/learning: Microsoft certification & training resources. http://technet.microsoft.com/en-au: resources for IT professionals. http://msdn.microsoft.com/en-au: resources for developers.

47. Thanks! How can we help? Juan De Abreu, VP - Delivery Director, jdeabreu@getcs.com, blog.getcs.com. © 2010 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product names are or may be registered trademarks and/or trademarks in the U.S. and/or other countries. The information herein is for informational purposes only and represents the current view of Microsoft Corporation as of the date of this presentation. Because Microsoft must respond to changing market conditions, it should not be interpreted to be a commitment on the part of Microsoft, and Microsoft cannot guarantee the accuracy of any information provided after the date of this presentation. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.
