Capacity Planning For Web Operations Presentation

Loading...

Flash Player 9 (or above) is needed to view presentations.
We have detected that you do not have it on your computer. To install it, go here.

0 comments

Post a comment

    Post a comment
    Embed Video
    Edit your comment Cancel

    1 Favorite

    Capacity Planning For Web Operations Presentation - Presentation Transcript

    1. Capacity Planning for Web Operations John Allspaw Operations Engineering
    2. Are you tracking how your servers are performing? ? Do you know how many servers do you have? Do you know how much Are you tracking how tra!c can your application is being your servers used ? handle? (without dying)
    3. monitoring testing deployment forecasting architecture metrics product planning capex procurement
    4. monitoring testing deployment go see Adam Jacob’s talk! forecasting architecture metrics product planning capex procurement
    5. traditional capacity planning
    6. capacity planning for web
    7. Why capacity planning is important Hardware* costs $$ (Cloudware costs $$, too) Having too little is bad (!@#!!) -> ($$$) Having too much is bad ($$$$!) * and network, datacenter space, power, etc.
    8. Growth “Normal” projected planned (yay!) expected hoped for “Instantaneous” spikes (yay?) unexpected external events (omg! wtf!) digg, etc.
    9. “Normal” growth at Flickr in a year.... 4x increase in photo requests/sec 2.2x increase in uploads/day 3x increase in database queries/sec
    10. Yahoo! FrontPage link XMas lights “Instantaneous”
    11. “Instantaneous” coping - Disabling “heavier” features on the site - Cache aggressively or serve stale data - Bake dynamic pages into static ones
    12. capacity != performance Making something fast doesn’t necessarily make it last Performance tuning = good, just don’t count on it Accept (for now) the performance you have, not the performance you wished you had, or you think you might have later
    13. Stewart: “Allspaw!!!! OMG!!!” How many servers will we need next year?! (we need to tell finance by 2pm today)
    14. “Ah, just buy twice as much as we need” 2 x (how much we need) = ?
    15. measurement
    16. Good capacity measurement tools can... Easily measure and record any number that changes over time Easily compare metrics to any other metrics from anywhere else (import/export) Easily make graphs
    17. good tools are out there cacti.net munin.projects.linpro.no hyperic.com ganglia.info
    18. good tools are out there cacti.net munin.projects.linpro.no hyperic.com Flickr uses ganglia.info
    19. photo uploads via email per minute hour application metrics
    20. your stu!, not just system stu! photos uploaded (and processed) per minute average photo processing time per minute average photo size disk space consumed per day user registrations per day etc etc etc
    21. your stu!, not just system stu! photos uploaded (and processed) per minute average photo processing time per minute average photo size disk space consumed per day user registrations per day etc etc etc
    22. Tie application metrics to system metrics Pretty!! But what does this mean?
    23. It means that with about 60% total CPU... It means we can process ~120 images per minute ...and we can process them in ~3.5 seconds (on average)
    24. Benchmarking Great for comparing hardware platforms and configurations BUT Doesn’t represent real workloads
    25. not exactly like a bike messenger
    26. Finding your ceilings • Use real data, from production servers (if at all possible) • No, really
    27. How much traffic can each webserver take before it dies? How many webservers can fail before we’re screwed? When should I add more webservers?
    28. people LBs webservers (databases, etc.)
    29. people network LBs cpu memory webservers disk usage disk i/o (databases, etc.)
    30. people network LBs cpu memory webservers disk usage disk i/o (databases, etc.) - comments/min - photos/min - videos/min - kittens/min - etc etc etc/min
    31. people LBs webservers (databases, etc.)
    32. people LBs webservers (databases, etc.)
    33. people LBs webservers (databases, etc.)
    34. what happens here? Ceiling = upper limit of “work” (and resources)
    35. Trends of peaks Time
    36. Benchmarking Might be your only option if you have a single server. some good benchmarking tools: Siege http://www.joedog.org/JoeDog/Siege httperf/autobench http://www.hpl.hp.com/research/linux/httperf/ http://www.xenoclast.org/autobench sysbench http://sysbench.sf.net
    37. Economics
    38. Time makes everything cheaper (the Moore’s Law thing) BUT you don’t have a lot of time to wait around, do you?
    39. Vertical scaling
    40. Horizontal architectures
    41. Diagonal scaling
    42. Diagonal scaling Replacing 67 dual-core webservers with 18 dual quads
    43. Diagonal scaling more tra\"c from less machines
    44. Diagonal Scaling servers CPUs RAM drives total power (W) @60% peak per server per server per server 67 2 4GB 1x80GB 8763.6 18 8 4GB 1x146GB 2332.8 ~70% less power 49U less rack space
    45. Utility Computing Disclosure: We don’t use clouds at Flickr. (but we know folks who do)
    46. clouds Help with deployment timelines Help with procurement timelines BUT Still have to pay attention Many people use the same forecasting methods
    47. Use Common Sense(tm) Pay attention to the right metrics Don’t pretend to know the exact future Measure constantly, adapt constantly Complex simulation and modeling is rarely worth it Don’t expect tuning and tweaking will ever win you any excess capacity
    48. Some more stats Serving 32,000 photos per second at peak Consuming 6-8TB per day Consumed >34TB per day during Y!Photos migration ~3M uploads per day, 60 per second at peak
    49. June 23-24, 2008 20% off discount: “vel08js”
    50. We Are Hiring! (DBA, engineers) http://flickr.com/photos/85013738@N00/542591121/ http://flickr.com/photos/formatc1/2301500208/ http://flickr.com/photos/mikefats/11546240/ http://flickr.com/photos/kanaka/491064256/ http://flickr.com/photos/randysonofrobert/1035003071/ http://flickr.com/photos/halcyonsnow/446166047/ http://flickr.com/photos/wwworks/2313927146/ http://flickr.com/photos/sunxez/1392677065/ http://flickr.com/photos/spacesuitcatalyst/536389937/ http://flickr.com/photos/theklan/1276710183/

    + jward5519jward5519, 2 years ago

    custom

    824 views, 1 favs, 0 embeds more stats

    More info about this document

    © All Rights Reserved

    Go to text version

    • Total Views 824
      • 824 on SlideShare
      • 0 from embeds
    • Comments 0
    • Favorites 1
    • Downloads 17
    Most viewed embeds

    more

    All embeds

    less

    Flagged as inappropriate Flag as inappropriate
    Flag as inappropriate

    Select your reason for flagging this presentation as inappropriate. If needed, use the feedback form to let us know more details.

    Cancel
    File a copyright complaint
    Having problems? Go to our helpdesk?

    Categories