This document provides best practices for making your first cloud deployment successful. It discusses organizational structure, management tools, understanding virtual machine density limits, network operations, storage choices, templates, and defining service offerings. Key lessons are to know your application requirements, optimize for your needs rather than just using public clouds, ensure backup plans, and clearly define service level agreements and compliance expectations. Infrastructure choices like hypervisors and primary storage should fit the defined service offerings. Operations should focus on monitoring, maintainability with no maintenance windows, and adapting as usage grows over time.
2. whoami
• Name: Tim Mackey
• Current roles: XenServer Community Manager and Evangelist; occasional coder
• Cool things I’ve done
– Designed laser communication systems
– Early designer of retail self-checkout machines
– Embedded special relativity algorithms into industrial control system
• Find me
– Twitter: @XenServerArmy
– SlideShare: slideshare.net/TimMackey
4. Who owns what?
• Organizational structure matters
– Team buy-in (no “mine, mine, mine”)
– Management of key components
– Understanding of “as-a-service”
• Management toolset
– Beware of overlap
– Ensure runbooks reflect tooling
• If you build it, they will come …
– Growth will challenge everything
– Success can be worst case
6. Traditional Server Virtualization
• Core Objectives
– Server consolidation
– Power and cooling savings
– Hardware independence
• Looks Like
– VM Density < 20
– vCPU = pCPU
– vRAM = pRAM
– Low IOPS
– Redundancy matters
– No templates
7. Desktop Virtualization
• Core Objectives
– Control of IP
– Ensuring patch compliance
– Supporting mobile workstyles
• Looks Like
– 50 -100 VMs per host
– 2-4 vCores = pCore
– 1-2 vRAM = pRAM
– High IOPS
– Boot storms
– Network contention
– Highly templated
11. Along Comes Server Virtualization
• Multiple VMs/host
– Loss of visibility
– Loss of control
• Edge moves into host
– Network admins need to understand
server virtualization
12. Example 1 – Mirroring Traffic
• Without virtualization this is pretty easy
• With virtualization you now have multiple VMs
13. Example 1 – Mirroring Traffic
• Without virtualization this is pretty easy
• With virtualization you now have multiple VMs
– Plus VMs can move
• Better to monitor at virtual switch
14. Example 2 – Network Policies
• Server admins have significant impact on the network
– IP and MAC Address
– Virtual NICs
– Protocols and ports
• Granular network control requires awareness of virtual machines
– Define policies at virtual switch
15. Network Management Tools Lag
• Assumptions of fixed topology
– Fine for physical
– Challenge for dynamic environment
• Not virtualization aware
– Incorrect topology
– Incomplete topology
– VM actions obsolete data
X
16. Virtual Machine Density Planning
• Host capacities are growing rapidly
– XenServer 6.2 > 500 VMs
– vSphere 5 > 512 VMs
– RHEV 3 > 1000 VMs
– Hyper-V > 2048 VMs
• Clouds and VDI push limits
• Top of rack switch selection matters?
– ARP table
– Switching performance drops
– VM starts, but can’t connect
VM
VM
VM
VM
VM
VM
VM
VM
VM
VM
Host 1
Host 2
VM
VM
VM
VM
VM
VM
VM
VM
VM
18. Design Phase – Expected Storage Growth
1,000
500
VMs
Cost,
AU
100 200
500
VMs
Provisioning efficiency
AU – arbitrary units
19. Storage Scalability During Usage
Redesign
1,000
500
VMs
100 200 Cost, AU
VMs
1,000
500
Cost, AU100 200
?
Alternatives
AU – arbitrary units
20. Redesign
Efficiency and Pod Storage
1,000
500
VMs
100 200 Cost, AU
POD #1
POD #2
POD #3
1,000
500
VMs
100 200 Cost, AU
AU – arbitrary units
No redesign
21. What about local storage?
1,000
500
VMs
Cost, AU100 200
50
VMs
Provisioning efficiency
AU – arbitrary units
27. Cloud Builder Lessons from Zynga
• Public clouds are minivans
• zCloud is a race car
– zCloud is optimized for social gaming
– Know your application requirements
• Don’t rent what you can own cheaper
– Cloud operator doesn’t care about your success
– Optimized applications might be key
• Ensure you have backup plans
– Usage can and does spike
– Outages can and do happen
vs.
28. Cloud Builder Lessons From Telcos
• Utility computing fits business model
– Traditionally operate a low margin business model
– Understand tiered service offerings
– Have a history with instant provisioning
• Tiered service demands infrastructure flexibility
– “Cost per instance” is paramount
– Charge extra for premium features
– Instance doesn’t imply virtualization
– Be prepared to change vendors if better model appears
• Provisioning agility expected
– Customers expect instant self service access and detailed billing
29. Service Offerings
• Clearly define what you want to offer
– What types of applications
– Who has access, and who owns them
– What type of access
• Define how templates need to be managed
– Operating system support
– Patching requirements
• Define expectations around compliance and availability
– Who owns backup and monitoring
30. Define Tenancy Requirements
• Department data local to department
– Where is the application data stored
• Data and service isolation
– VM migration and host HA
– Network services
• Encryption of PII/PCI
– Where do keys live when data location unknown
– Need encryption designed for the cloud
• Showback to stakeholders
– More than just usage, compliance and audits
31. Virtualization Infrastructure
• Hypervisor defined by service offerings
– Don’t select hypervisor based on “standards”
– Understand true costs of virtualization
– Multiple hypervisors are “OK”
– Bare metal can be a hypervisor
• To “Pool” resources or not
– Is there a real requirement for pooled resources
– Can the cloud management solution do better?
• Primary storage defined by hypervisor
• Template storage defined by solution
– Typically low cost options like NFS
32. Cloud Operations
• Design for maintainability
• Monitor critical components
– Management servers and system support VMs
– Hypervisor hosts, and critical infrastructure
– End user deployment environments
If your cloud has maintenance windows, you’re doing it wrong.
- Allan Leinwand Former CTO Zynga