Slideshare.net (beta)

 
Post: 
Myspace Hi5 Friendster Xanga LiveJournal Facebook Blogger Tagged Typepad Freewebs BlackPlanet gigya icons



All comments

Add a comment on Slide 1

If you have a SlideShare account, login to comment; else you can comment as a guest


Showing 1-50 of 7 (more)

Set Amazons Servers on Fire

From aws, 10 months ago

A talk given by Don MacAskill at the Amazon Startup Project, held more

4496 views  |  1 comment  |  5 favorites  |  15 embeds (Stats)
 

Privacy InfoNew!

This slideshow is Public

 
Embed in your blog
Embed (wordpress.com)
custom

Slideshow Statistics
Total Views: 4496
on Slideshare: 4110
from embeds: 386* * Views from embeds since 21 Aug, 07

Slideshow transcript

Slide 1: Scalability Set Amazon’s Servers on Fire, Not Yours Parks Hall Fire, July 3, 2002 - http://www.acadweb.wwu.edu/dbrunner/

Slide 2: Why trust us? Bootstrapped. $10M+. Profitable. No debt. ~195M photos. ~300TB at S3. Doubling yearly.

Slide 3: Why trust us? Bootstrapped. $10M+. Profitable. No debt. ~195M photos. ~300TB at S3. Doubling yearly. Super heroes.

Slide 4: Show me the money! Photo by Kirk Tanner - http://kirktanner.smugmug.com/

Slide 5: Show me the money! (as of Apr 07) Guesstimate: ~$500K saved per year. Actual: Growth: 64M photos -> 140M photos Disks would cost: $40K -> $100K/month. $922K would have been spent. $230K spent instead. $692K in cold, hard savings. Nasty taxes! $295K ‘saved’ in cash flow. Bonus! Reselling disks - recouping sunk cost.

Slide 6: $ sweet spots Perfect for startups & small companies. Ideal for ‘store lots, serve little’ businesses of all sizes. Not so great (yet?) for serving lots if you’re a medium or large sized business. Transfer costs high if you can buy GigE links. We’re a ‘store lots, serve lots’ company. What to do?

Slide 7: Our S3 evolution Started just doing secondary storage. Too cold! Tried out as solely using S3. Too hot! Finally, hot & cold model = Just right! Amazon gets 100% of the data - Primary datastore! SmugMug keeps “hot” data local (roughly 10%). 95% reduction in # of disks bought.

Slide 8: Reliability Not 100%. Close, though. More reliable than SmugFS which is quite reliable. Lots of failure points: SmugMug’s datacenter Internet backbones Amazon’s datacenter No other software, hardware, or service we use is 100%, either.

Slide 9: Outages & Problems Not perfect. 5 major issues. 3 outages (15-30 mins). 2 core switch failures and one DNS problem. Amazon.com affected. 2 performance degradations. One, our customers noticed. Second, they didn’t. Not a big deal - everything fails. Expect it.

Slide 10: SLA, Service, & Support We don’t care about SLA, but you may. Service Support: One area where Amazon is weak. This is a utility. They need a service status dashboard. Pro-active customer notifications. Ability to get ahold of a human. Amazon.com’s customer service is good, AWS will likely catch up.

Slide 11: Flirting with the other services.

Slide 12: Elastic Compute Cloud (EC2) Launching large EC2 implementation “soon” Image processing. 1M+ photos/day. BIG photos. Up to 24MB, 48Mpix. 20+ Terapixels/day processed Peaky traffic on weekends, holidays Ridiculously parallel

Slide 13: Flexible Payments Service (FPS) Tens of thousands of Pro photographers. Built their businesses on SmugMug. Need to get paid. Writing checks sucks. Manual human job. Mail gets lost. Harder to track. Everyone has an Amazon account.

Slide 14: Missing Pieces Database API and/or DB grade EC2 instances. Fast (lots of local spindles, lots of RAM) Persistent. Load balancer API. Single IP in front of lots of EC2 instances. Programmable to add/remove/change clusters. Can be done with software on an EC2 instance, but painful. CDN

Slide 15: Questions? Blog: http://blogs.smugmug.com/onethumb Slides: See the blog. Posting soon. Email: don AT smugmug Twitter: http://twitter.com/DonMacAskill Photo sharing: http://www.smugmug.com/ Thanks!