DevOps sensors 360° high availability in the cloud

830 views

Published on

This presentation describes technics that helps Emind DevOps team to keep applications always-on.

Enjoy.
Lahav

Published in: Technology
0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total views
830
On SlideShare
0
From Embeds
0
Number of Embeds
226
Actions
Shares
0
Downloads
3
Comments
0
Likes
0
Embeds 0
No embeds

No notes for slide

DevOps sensors 360° high availability in the cloud

  1. 1. DevOps Sensors 360°High Availability in the CloudLahav Savir, Architect & CEOEmind systems Ltd.lahavs@emind.co
  2. 2. AboutLahav Savir• 15+ years’ experience• Architect and CEO @ Emind SystemsEmind Systems (est. 2006)• Highly professional system integrator• Dedicated Cloud Architecture and DevOps teams• 24x7 SLA by DevOps Specialists• ~100 AWS customers• Partnerships with leading cloud vendors
  3. 3. PartnershipsAmazon Advanced Consulting Partnerhttps://aws.amazon.com/solution-providers/si/emind-systems-ltd
  4. 4. What is Availability?The ability to provide quality servicethat can support the online service
  5. 5. DowntimeThe term downtime refers to periods when asystem is unavailable.Downtime or outage duration refers to a periodof time that a system/service fails to provide orperform its primary function.
  6. 6. Unavailability / Causes of Downtime• Hardware failure – 55%• Human error – 22%• Software failure – 18%• Natural disasters – 5%http://www.continuitycentral.com/news06645.htmlReputable studies have concluded that as much as75% of downtime is the result of some sortof human error.http://searchdatacenter.techtarget.com/feature/The-causes-and-costs-of-data-center-system-downtime-Advisory-Board-QA
  7. 7. Hardware / InfrastructureAWS SLA – 99.9 – 100%• Redundant servers– Multiple servers of each type– EBS, Snapshots• Multi AZ– ELB, VPC• Multi Region• PaaS– S3, SQS, DynamoDB, RDS, Route53 . . .
  8. 8. Architect• Plan based on experience and best practices• Continuously review and correct
  9. 9. Knowledge Sharing
  10. 10. Preinstalled / PreconfiguredPredefined procedures / scriptsPreinstalled instances / AMIs
  11. 11. AutomateAutomate repeating tasks
  12. 12. Scheduler
  13. 13. Orchestration Tools
  14. 14. Orchestration Tools
  15. 15. Track Changes• Diff your changes# List all application folders# Iterate through this list and# cd <folder># svn ci . -m "[<timestamp>] Checkin <folder>"• Ready for rollback
  16. 16. Security Group Changes
  17. 17. Servers Monitoring
  18. 18. Log monitoringGraylog => Nagios checkshttp://www.emind.co/open-source/• check_graylog2_stream
  19. 19. Applications Counters / Metricsnet-snmp sub-agenthttp://www.emind.co/open-source/• net-snmp_shell_subagent# Syntax# < oid > ; < type > ; < script path >.1.3.6.1.4.1.39731.2100.1:string:/usr/local/emind/sync_manager/sync_manager.sh status status.1.3.6.1.4.1.39731.2100.2:string:/usr/local/emind/sync_manager/sync_manager.sh status state.1.3.6.1.4.1.39731.2100.3:integer:/usr/local/emind/sync_manager/sync_manager.sh status sync_duration_min.1.3.6.1.4.1.39731.2100.4:integer:/usr/local/emind/sync_manager/sync_manager.sh status idle_duration_h.1.3.6.1.4.1.39731.2100.5:string:/usr/local/emind/sync_manager/sync_manager.sh status last
  20. 20. Application performance
  21. 21. CloudWatchCloudWatch MetricsCloudWatch Alarmshttp://www.emind.co/open-source• check_elb_instances_health• check_cloudwatch_alarm
  22. 22. RobotsgoCloud DevOps Central
  23. 23. NewVem• Volume backup• Security groups• Instanceutilization• IO utilization• ReservedInstances• Plannedmaintenance
  24. 24. Security SensorsSnort, IPTables, Dome9
  25. 25. Human FeedsMail Notifications Phone Calls
  26. 26. Support Router APITurning events to ACTIONS
  27. 27. lahavs@emind.co+972 (54) 4321688http://www.emind.co

×