OSOM Operations in the Cloud

Published in: Technology, Business

  1. Operations in the Cloud
     Marius Stuparu, DevOps @ SDL Language Weaver
  2. Summary
     ● The Cloud, Amazon Web Services
     ● Best Practices in the Cloud
     ● High Availability and Scalability in the Cloud
     ● Alternative Open Source Solutions
  3. What is cloud computing? IaaS, PaaS, SaaS
  4. Who is using it?
  5. Why?
     - Zynga thought 200 thousand daily active users on FarmVille would count as a success (2009).
     - 1 million new net users every single week
     - Within a year, FarmVille had more than 50 million monthly active users
     - CityVille on AWS was able to scale up to ~60 million monthly active users in the first 2 months!
  6. AWS
     ● Amazon started really simple
     ● EC2 - Virtual Machines service
     ● S3 - Storage service
  7. Elastic Block Store (EBS)
     ● Local storage is volatile
     ● Use EBS for persistent storage (network-accessible block storage volumes)
     ● Try to keep persistent data on S3 or RDS; EBS performance varies
  8. Elastic Block Store (EBS)
  9. Elastic Load Balancers
     Cool things:
     ● ELB - Availability zones
     ● SSL termination
  10. Relational Database Service
      ● RDS (Multi-AZ availability, failover ~5 min)
      ● Easy to launch replicas and offload read traffic (3 clicks away)
      ● Backup using PITR (point-in-time recovery) and snapshots
  11. ● Infrastructure as code
      ● Configuration management
      ● Orchestration
      ● Automated Provisioning/Auto Scaling
      ● Repeatable/Reproducible Servers (cloning your servers)
  12. April 21, 2011: Server Down
  13. Eliminate Single Points of Failure
      ● Architect around these problems
      ● Decouple your components (queues)
      ● Build asynchronous systems and scale horizontally
      ● Make your applications as stateless as possible
      ● Use multiple cloud providers (AWS, Rackspace, GoGrid, Linode)
  14. Netflix Chaos Monkey
  15. All about being fast
      ● Cache database responses, objects, fully formed HTML (ElastiCache)
      ● CDN
      ● Follow the Sun
      ● Try to touch metal only when necessary; use local storage or SAN, avoid NFS
  16. DNS Management
      ● Route 53 LBR
      ● LBR = Latency Based Routing
      What it does:
      ● Routes each user to the server running your application that answers with the lowest latency (usually the closest one)
  17. Monitor and graph everything
      ● RightScale collectd, AWS CloudWatch
      ● New Relic
      ● Pingdom, Catchpoint, Uptrends
      ● Nagios, Cacti, Zabbix
      ● Splunk
  18. AWS CloudWatch
  19. New Relic - RUM (Real User Monitoring)
  20. New Relic - Application Monitoring
  21. Open Source Alternatives
      Foreman (no logo) - web UI for Puppet
  22. Other useful tools
      ● Git (GitHub)
      ● Vagrant
      ● If you have a Python stack, look at boto (Python interface to Amazon Web Services)
  23. Q&A
  24. Thank You!
  25. Slides: http://www.slideshare.net/mstuparu/osom-operations-in-the-cloud
      Contact information: marius@ec2.ro / mstuparu@sdl.com
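
Slide 13's advice to decouple components with queues and build asynchronous systems can be illustrated with a minimal sketch. It uses Python's standard-library queue.Queue as an in-process stand-in for a real message queue. The names worker and process_async, and the upper-casing "work", are hypothetical and not from the slides.

```python
import queue
import threading


def worker(jobs, results):
    """Consume jobs from the queue until a None sentinel arrives."""
    while True:
        job = jobs.get()
        if job is None:
            jobs.task_done()
            break
        # Stand-in for slow work (image resize, email send, ...).
        results.append(job.upper())
        jobs.task_done()


def process_async(items, num_workers=2):
    """Hypothetical producer: enqueue items and let workers drain them."""
    jobs = queue.Queue()
    results = []
    threads = [
        threading.Thread(target=worker, args=(jobs, results))
        for _ in range(num_workers)
    ]
    for t in threads:
        t.start()
    # The producer only talks to the queue, never to a worker directly.
    for item in items:
        jobs.put(item)
    # One sentinel per worker tells it to shut down.
    for _ in threads:
        jobs.put(None)
    jobs.join()
    for t in threads:
        t.join()
    # Completion order is not deterministic, so sort for a stable result.
    return sorted(results)
```

In a real deployment the producer and the workers would run on separate servers and talk through a network queue service, so workers can be added (scaling horizontally) without touching the producer.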

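Slide 15 suggests caching database responses. A rough sketch of that idea is a small time-to-live cache in plain Python. This is an illustration under assumptions, not the ElastiCache API: TTLCache, get_or_load and the loader callback are hypothetical names, and in practice a shared cache server would replace the in-memory dict.

```python
import time


class TTLCache:
    """Hypothetical read-through cache with a fixed time-to-live."""

    def __init__(self, ttl_seconds, clock=time.monotonic):
        self._ttl = ttl_seconds
        self._clock = clock      # injectable so expiry can be tested
        self._entries = {}       # key -> (expires_at, value)

    def get_or_load(self, key, loader):
        """Return the cached value, calling loader(key) on a miss or expiry."""
        now = self._clock()
        entry = self._entries.get(key)
        if entry is not None and entry[0] > now:
            return entry[1]      # fresh hit: the database is not touched
        value = loader(key)      # miss or stale entry: go to the database
        self._entries[key] = (now + self._ttl, value)
        return value
```

A caller would wrap its query function, for example cache.get_or_load(user_id, fetch_user_from_db), where fetch_user_from_db is whatever function already runs the query. The TTL trades freshness for load: a longer TTL means fewer database hits but staler data.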