© 2010 VMware Inc. All rights reserved
Welcome to the
CZ VMUG
Meeting
4 December 2017
2
The dark side of stretched clusters
Andrea Mauro – VCDX & vExpert
http://vinfrastructure.it/en/
Italian VMUG Founder and Board Member
http://www.vmug.it
3
Stretched cluster
▪ Two active sites (+1?)
▪ Storage architecture cross-site
▪ Hypervisor architecture cross-site
• vSphere Metro Storage Cluster (vMSC)
4
HA and vMotion
▪ vMotion for planned failover or failback
▪ HA for unplanned failover
5
Requirements and limitations
▪ Networking
• Higher latency in vMotion supported in vSphere Enterprise Plus
• although Enterprise Plus requirement is no longer indicated in vSphere 6.x
• a stretched cluster is not a geographic cluster?
• ESXi vSphere vMotion network requirements:
• minimum link bandwidth of 250Mbps
• maximum supported network latency between sites should be around 10ms round-trip
time (RTT)
• Note that vSphere vMotion supports a maximum of 150ms latency as of vSphere
6.0, but this is not intended for stretched clustering usage
• VMs networks should be the “same” on both sites
• stretched L2 network
• or some network virtualization techniques
• Note that the ESXi Management network and the vMotion network could also be L3
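The thresholds above can be turned into a quick sanity check for a measured inter-site link. This is only a minimal sketch: the `InterSiteLink` class and `classify_link` function are hypothetical names, and the only numbers used are the ones quoted on this slide (250 Mbps, about 10 ms RTT, 150 ms for long-distance vMotion).

```python
from dataclasses import dataclass

# Thresholds quoted on the slide for a stretched cluster.
MIN_VMOTION_BANDWIDTH_MBPS = 250
MAX_STRETCHED_RTT_MS = 10
# Long-distance vMotion limit (vSphere 6.0+); not intended for stretched clusters.
MAX_LONG_DISTANCE_RTT_MS = 150


@dataclass
class InterSiteLink:
    """Hypothetical container for values measured between the two sites."""
    bandwidth_mbps: float
    rtt_ms: float


def classify_link(link: InterSiteLink) -> str:
    """Return a coarse verdict for the measured inter-site link."""
    if link.bandwidth_mbps < MIN_VMOTION_BANDWIDTH_MBPS:
        return "insufficient bandwidth"
    if link.rtt_ms <= MAX_STRETCHED_RTT_MS:
        return "ok for stretched cluster"
    if link.rtt_ms <= MAX_LONG_DISTANCE_RTT_MS:
        return "long-distance vMotion only"
    return "latency too high"
```

A link at 1000 Mbps and 4 ms RTT falls in the stretched-cluster range, while the same bandwidth at 40 ms RTT only qualifies for long-distance vMotion.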
6
Requirements and limitations
▪ Storage
• Storage must be certified for vMSC architecture
• Maximum supported latency for synchronous storage replication links
• 5ms RTT?
• Vendor specific requirements and architectures
• Supported storage protocols are Fibre Channel, iSCSI, NFS, and FCoE
• Hyper-converged solutions?
• vSAN is supported
• single vSAN stretched cluster
• Other solutions can have different architecture
• two storage clusters
7
Requirements and limitations
▪ Other
• Cluster size?
• 64?
• in a vSAN Stretched Cluster: 30+1 (15+15+1)
• 3rd site?
• Number of shared datastores?
• vCenter location?
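As a rough illustration of the 15+15+1 figure above, a planned vSAN stretched cluster layout could be checked like this. The function name and messages are hypothetical, and the limit is taken from the slide as-is; it may differ in other vSAN releases.

```python
# Limits taken from the "30+1 (15+15+1)" figure on the slide.
MAX_DATA_HOSTS_PER_SITE = 15
REQUIRED_WITNESS_HOSTS = 1


def check_vsan_stretched_layout(preferred_hosts, secondary_hosts, witness_hosts):
    """Return a list of problems found in the planned layout (empty if none)."""
    problems = []
    for name, count in (("preferred", preferred_hosts), ("secondary", secondary_hosts)):
        if count < 1:
            problems.append(f"{name} site needs at least one data host")
        elif count > MAX_DATA_HOSTS_PER_SITE:
            problems.append(f"{name} site exceeds {MAX_DATA_HOSTS_PER_SITE} data hosts")
    if witness_hosts != REQUIRED_WITNESS_HOSTS:
        problems.append("exactly one witness host is expected")
    return problems
```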
8
Synchronous vs. Asynchronous
▪ Synchronous
▪ High consistency?
▪ Highly available?
▪ Asynchronous
▪ RPO depends on the schedule
▪ Nearline sync?
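The trade-off between the two modes can be put into numbers with a deliberately simplified model. Both functions below are illustrative assumptions, not vendor formulas: a synchronous write is assumed to cost roughly one extra inter-site round trip, and the worst-case RPO of scheduled replication is assumed to be the schedule interval plus the transfer time.

```python
def sync_write_latency_ms(local_write_ms: float, inter_site_rtt_ms: float) -> float:
    """Approximate latency of a synchronous write.

    The write is acknowledged only after the remote copy confirms it,
    so roughly one inter-site round trip is added (simplified model).
    """
    return local_write_ms + inter_site_rtt_ms


def async_worst_case_rpo_minutes(schedule_interval_min: float,
                                 transfer_time_min: float) -> float:
    """Approximate worst-case RPO for scheduled asynchronous replication.

    Data written just after a cycle starts waits a full interval and is
    safe only once the next transfer completes (simplified model).
    """
    return schedule_interval_min + transfer_time_min
```

With a 1 ms local write and a 5 ms RTT, the synchronous write takes about 6 ms; with a 15-minute schedule and a 5-minute transfer, the worst-case RPO is about 20 minutes.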
9
Uniform vs. non-uniform
▪ Uniform
• «full access»
▪ Non-uniform
• «LUN locality»
• «VM locality»
10
Disaster recovery vs. disaster avoidance
▪ Disaster avoidance prevents, or significantly reduces the probability of,
a disaster (for example, one caused by human error)
• if such an event does occur (for example, a natural disaster), the effects on
the organization’s technology systems are minimized as much as possible
▪ Disaster avoidance provides better "resilience" rather than good
recovery
• infrastructure availability solutions?
• application availability and redundancy?
▪ Multi-datacenter (or multi-region cloud) replication is one part
• the second part is having active-active datacenters, or applications that
span multiple sites, to provide service availability
▪ Stretched cluster is an example of disaster avoidance at the
infrastructure layer
11
Application vs. infrastructure resiliency
▪ Most new cloud-native applications are designed for high
availability and resiliency
▪ Fault domain or availability zone concepts
▪ There are also examples of traditional applications with high
availability at the application level that can also work across
geographic sites
• DNS services
• Active Directory Domain Controllers
• Exchange DAG
• SQL Always-On clusters
12
Disaster recovery vs. Stretched cluster
▪ A stretched cluster can provide both disaster recovery and disaster
avoidance in some cases
▪ There are some possible limitations when a stretched cluster is also
used for disaster recovery:
• Stretched cluster is coupled, disaster recovery site is de-coupled
• Stretched cluster can’t protect you from site link failures and can be affected by
the split-brain scenario
• A witness can minimize this problem
• A stretched cluster usually works with synchronous replication, which means
• limited distance
• really high bandwidth requirements, to minimize storage latency
• difficulty providing multiple consistent restore points at different times
▪ In most cases where a stretched cluster is used, there can also be a
third site acting as traditional DR, giving a multi-level protection
approach
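How a witness limits the split-brain problem mentioned above can be sketched with a simple three-party model (site A, site B, witness). This is an illustrative assumption, not any vendor's actual algorithm: when the inter-site link is down, the witness is assumed to side with exactly one site, favouring a configured preferred site.

```python
def active_sites(inter_site_link_up: bool,
                 witness_reaches_a: bool,
                 witness_reaches_b: bool,
                 preferred: str = "A") -> set:
    """Return the set of sites allowed to keep running workloads.

    Simplified tie-breaker model: `preferred` must be "A" or "B".
    """
    if inter_site_link_up:
        # Both sites see each other: two of three parties agree,
        # even if the witness itself is unreachable.
        return {"A", "B"}
    # Link down: the witness joins exactly one side, never both,
    # so at most one site continues and split-brain is avoided.
    reachable = {"A": witness_reaches_a, "B": witness_reaches_b}
    other = "B" if preferred == "A" else "A"
    if reachable[preferred]:
        return {preferred}
    if reachable[other]:
        return {other}
    # Neither site reaches the witness: no side can prove it is the survivor.
    return set()
```

In this model a link failure with a healthy witness leaves only the preferred site active, while losing both the link and the witness stops both sites, which is the cost of avoiding split-brain.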
13
Only Stretched storage
▪ DR at virtualization layer
14
vSAN Stretched Cluster
15
Design aspects
▪ Split-brain scenario
• How to avoid it
• Networking considerations
▪ Dependencies
▪ Availability & Resiliency
• Host failure
• Storage failure
• Site failure
▪ Data resiliency
• Local resiliency, not only cross-site resiliency
▪ Data locality
• Block storage and paths
• NFS and IPs/networks
• vSAN and other hyper-converged solutions
16
External dependencies
▪ DNS
▪ Witness
▪ PSC
▪ vCenter Server
• Distributed virtual switches
• vSAN
• Storage policies
• vVols
• Storage policies
• VM Encryption
▪ vCenter HA?
• vCenter HA network latency between Active, Passive, and Witness nodes
must be less than 10 ms
17
vSphere HA
▪ VMware recommends enabling vSphere HA admission control in all
clusters, especially in a stretched cluster
▪ Workload availability is the primary driver for most stretched
cluster environments, so providing sufficient capacity for a full
site failure can be crucial
▪ To ensure that all workloads can be restarted by vSphere HA on
just one site, configuring the admission control policy to 50
percent for both memory and CPU is recommended
• VMware recommends using a percentage-based policy because it offers the
most flexibility and reduces operational overhead
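The 50 percent figure follows from two equally sized sites. A minimal sketch of the arithmetic, assuming identical hosts and a hypothetical helper name, shows how the reserved percentage would change if the sites were not symmetrical:

```python
def failover_reserve_percent(hosts_site_a: int, hosts_site_b: int) -> int:
    """Percentage of cluster CPU/memory to reserve to survive a full site loss.

    Assumes all hosts are identical, so capacity is proportional to host count.
    The reservation must cover the loss of the larger site.
    """
    if hosts_site_a <= 0 or hosts_site_b <= 0:
        raise ValueError("each site needs at least one host")
    total = hosts_site_a + hosts_site_b
    largest = max(hosts_site_a, hosts_site_b)
    # Integer ceiling of 100 * largest / total.
    return (100 * largest + total - 1) // total
```

For 8+8 hosts this gives 50, matching the recommendation above; for an asymmetric 6+4 layout it gives 60.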
18
VM Component Protection (VMCP)
▪ The typical configuration for PDL events is Power off and restart VMs
▪ For APD events, VMware recommends selecting Power off and
restart VMs (conservative)
• Refer to specific storage vendor requirements
▪ For vSphere 5.5?
19
Network heartbeat
▪ VMware vSphere HA network heartbeat
• if a host is not receiving any heartbeats, it uses a fail-safe mechanism to detect
if it is merely isolated from its master node or completely isolated from the
network
• By default, it does this by pinging the default gateway
• In addition to this mechanism, one or more isolation addresses can be
specified manually to enhance reliability of isolation validation
▪ VMware recommends specifying a minimum of two additional
isolation addresses, with each address site local
• This enables vSphere HA validation for complete network isolation, even in
case of a connection failure between sites
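The site-local isolation addresses are set through vSphere HA advanced options. The sketch below only builds the key/value pairs; applying them to a cluster is left out. The helper name and the example addresses are hypothetical, and keeping the default gateway check enabled is an assumption based on the word "additional" above.

```python
import ipaddress


def isolation_address_options(site_a_addr: str, site_b_addr: str,
                              use_default_gateway: bool = True) -> dict:
    """Build vSphere HA advanced options for two site-local isolation addresses."""
    for addr in (site_a_addr, site_b_addr):
        # Raises ValueError if the string is not a valid IP address.
        ipaddress.ip_address(addr)
    return {
        "das.isolationaddress0": site_a_addr,
        "das.isolationaddress1": site_b_addr,
        "das.usedefaultisolationaddress": str(use_default_gateway).lower(),
    }
```

Calling `isolation_address_options("10.0.1.1", "10.0.2.1")` yields one address per site, so a host can still validate isolation locally when the inter-site link is down.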
20
Storage heartbeat
▪ VMware vSphere HA storage heartbeat
• the minimum number of heartbeat datastores is two and the maximum is five
▪ Stretched cluster specific hints
• For vSphere HA datastore heartbeating to function correctly in any type of
failure scenario, VMware recommends increasing the number of heartbeat
datastores from two to four
• This provides full redundancy for both data center locations
• Defining four specific datastores as preferred heartbeat datastores is also
recommended, selecting two from one site and two from the other
• This enables vSphere HA to heartbeat to a datastore even in the case of a connection
failure between sites
• Subsequently, it enables vSphere HA to determine the state of a host in any scenario
▪ VMware recommends selecting two datastores in each location to
ensure that datastores are available at each site in the case of a
site partition
▪ vSAN?
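Picking two preferred heartbeat datastores per site can be expressed as a small selection routine. This is a sketch under stated assumptions: the helper name and the datastore names in the usage example are hypothetical, and the 2 to 5 range comes from the limits quoted on this slide.

```python
def pick_heartbeat_datastores(datastores_by_site: dict, per_site: int = 2) -> list:
    """Choose `per_site` heartbeat datastores from every site.

    `datastores_by_site` maps a site name to the datastores local to it.
    """
    chosen = []
    for site in sorted(datastores_by_site):
        names = sorted(datastores_by_site[site])
        if len(names) < per_site:
            raise ValueError(f"site {site} has fewer than {per_site} datastores")
        chosen.extend(names[:per_site])
    # vSphere HA accepts between two and five heartbeat datastores.
    if not 2 <= len(chosen) <= 5:
        raise ValueError("heartbeat datastore count must be between 2 and 5")
    return chosen
```

For example, `pick_heartbeat_datastores({"A": ["ds-a1", "ds-a2", "ds-a3"], "B": ["ds-b1", "ds-b2"]})` returns four datastores, two from each site. The HA advanced option `das.heartbeatDsPerHost` would then be raised to 4 to match.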
21
vSphere FT
▪ VMware vSphere FT 6.x also replicates the storage
▪ Can function in clusters with nonuniform hosts, but it works best in
clusters with compatible nodes
▪ vSMP FT is explicitly not supported in a stretched environment
▪ Legacy FT?
▪ vSAN?
• https://cormachogan.com/2017/09/26/supporting-fault-tolerance-vms-vsan-stretched-cluster/
22
vSphere DRS
▪ To provide VM locality, you should build specific VM-to-host
affinity rules
▪ VMware recommends implementing “should rules” because
vSphere HA can violate them in the case of a full site failure
• Note that vSphere DRS communicates these rules to vSphere HA, and these
are stored in a “compatibility list” governing allowed start-up
• If a single host fails, VM-to-host “should rules” are ignored by default
▪ For vSAN, VMware recommends that DRS is placed in partially
automated mode if there is an outage
• Customers will continue to be informed about DRS recommendations when
the hosts on the recovered site are online, but can now wait until vSAN has
fully resynced the virtual machine components
• DRS can then be changed back to fully automated mode, which will allow virtual
machine migrations to take place to conform to the VM/Host affinity rules
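Why "should rules" are preferred over "must rules" can be shown with a toy model of where vSphere HA may restart a VM. This is a simplified illustration, not the actual HA placement logic: a "must" rule is assumed to restrict restarts to hosts in the VM's own site group, while a "should" rule places no restriction on HA. Host and site names are hypothetical.

```python
def ha_restart_hosts(vm_site: str, rule: str, surviving_hosts: dict) -> list:
    """Return the hosts on which HA may restart the VM (simplified model).

    `surviving_hosts` maps each host that is still up to its site.
    """
    if rule not in ("should", "must"):
        raise ValueError("rule must be 'should' or 'must'")
    if rule == "must":
        # A must rule is never violated: only hosts in the VM's site group qualify.
        return sorted(h for h, site in surviving_hosts.items() if site == vm_site)
    # A should rule can be violated by HA, so any surviving host qualifies.
    return sorted(surviving_hosts)
```

After a full failure of site A, with only `{"esx-b1": "B", "esx-b2": "B"}` surviving, a VM pinned to site A by a "should" rule can restart on either site B host, while the same VM under a "must" rule has no eligible host.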
23
vSphere Storage DRS
▪ For Storage DRS (if applicable), this should be configured in
manual mode or partially automated
▪ This enables human validation per recommendation and allows
recommendations to be applied during off-peak hours
▪ Note that the use of I/O Metric or VMware vSphere Storage I/O
Control is not supported in a vMSC configuration
• VMware KB article 2042596 - https://kb.vmware.com/kb/2042596
▪ Also SIOC is not supported!
Multi-path
▪ Uniform
▪ Non-uniform
Conclusions
▪ Stretched cluster vs. disaster recovery
▪ Stretched cluster + disaster recovery
▪ Applications & services first
▪ Business driven
▪ Design considerations
▪ More sites for campus deployment?
26
Enjoy The Day!
Join the Conversation!
#VMUGCZ
www.vmug.com

 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 

Recently uploaded (20)

SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
The transition to renewables in India.pdf
The transition to renewables in India.pdfThe transition to renewables in India.pdf
The transition to renewables in India.pdf
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food Manufacturing
 
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxMaking_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 

Maximizing Availability in vSphere Stretched Clusters

  • 1. © 2010 VMware Inc. All rights reserved Welcome to the CZ VMUG Meeting 4 December 2017
  • 2. The dark side of stretched clusters
      Andrea Mauro – VCDX & vExpert, http://vinfrastructure.it/en/
      Italian VMUG Founder and Board Member, http://www.vmug.it
  • 3. Stretched cluster
      ▪ Two active sites (+1?)
      ▪ Storage architecture cross-site
      ▪ Hypervisor architecture cross-site
        • vSphere Metro Storage Cluster (vMSC)
  • 4. HA and vMotion
      ▪ vMotion for planned failover or failback
      ▪ HA for unplanned failover
  • 5. Requirements and limitations
      ▪ Networking
        • Higher latency in vMotion supported in vSphere Enterprise Plus
          • although the Enterprise Plus requirement is no longer indicated in vSphere 6.x
          • stretched cluster, not geographic?
        • ESXi vSphere vMotion network requirements:
          • minimum link bandwidth of 250 Mbps
          • maximum supported network latency between sites should be around 10 ms round-trip time (RTT)
          • Note that vSphere vMotion supports a maximum of 150 ms latency as of vSphere 6.0, but this is not intended for stretched clustering usage
        • VM networks should be the “same” on both sites
          • stretched L2 network
          • or some network virtualization techniques
        • Note that the ESXi Management network and the vMotion network can also be L3
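The vMotion figures on slide 5 can be turned into a quick pre-check of the inter-site link. This is a minimal Python sketch, not a VMware tool: the `SiteLink` type, the function name and the sample values are hypothetical, and the thresholds are simply the numbers quoted on the slide (250 Mbps minimum bandwidth, around 10 ms RTT).

```python
from dataclasses import dataclass
from typing import List

# Thresholds quoted on the slide for vMotion between stretched-cluster sites.
MIN_BANDWIDTH_MBPS = 250.0
MAX_RTT_MS = 10.0


@dataclass
class SiteLink:
    """Measured inter-site link values (hypothetical type for this sketch)."""
    bandwidth_mbps: float
    rtt_ms: float


def check_vmotion_link(link: SiteLink) -> List[str]:
    """Return the violated requirements; an empty list means the link fits."""
    problems = []
    if link.bandwidth_mbps < MIN_BANDWIDTH_MBPS:
        problems.append(
            f"bandwidth {link.bandwidth_mbps} Mbps is below {MIN_BANDWIDTH_MBPS} Mbps"
        )
    if link.rtt_ms > MAX_RTT_MS:
        problems.append(f"RTT {link.rtt_ms} ms is above {MAX_RTT_MS} ms")
    return problems


if __name__ == "__main__":
    # Sample measurements are made up for illustration.
    print(check_vmotion_link(SiteLink(bandwidth_mbps=1000, rtt_ms=4.0)))
    print(check_vmotion_link(SiteLink(bandwidth_mbps=100, rtt_ms=25.0)))
```

Storage replication links usually have their own, stricter limits (the deck mentions 5 ms RTT with a question mark), so a real check would take the storage vendor's figures as additional inputs.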
  • 6. Requirements and limitations
      ▪ Storage
        • Storage must be certified for vMSC architecture
        • Maximum supported latency for synchronous storage replication links
          • 5 ms RTT?
          • Vendor-specific requirements and architectures
        • Supported storage protocols are Fibre Channel, iSCSI, NFS, and FCoE
        • Hyper-converged solutions?
          • vSAN is supported
            • single vSAN stretched cluster
          • Other solutions can have a different architecture
            • two storage clusters
  • 7. Requirements and limitations
      ▪ Other
        • Cluster size?
          • 64?
          • in a vSAN Stretched Cluster: 30+1 (15+15+1)
        • 3rd site?
        • Number of shared datastores?
        • vCenter location?
  • 8. Synchronous vs. Asynchronous
      ▪ Synchronous
      ▪ High consistency?
      ▪ Highly available?
      ▪ Asynchronous
      ▪ RPO depends on the schedule
      ▪ Nearline sync?
  • 9. Uniform vs. non-uniform
      ▪ Uniform
        • «full access»
      ▪ Non-uniform
        • «LUN locality»
        • «VM locality»
  • 10. Disaster recovery vs. disaster avoidance
      ▪ Disaster avoidance prevents or significantly reduces the probability that a disaster will occur (as with human errors)
        • if such an event does occur (as with a natural disaster), the effects on the organization’s technology systems are minimized as much as possible
      ▪ Disaster avoidance provides better "resilience" rather than good recovery
        • infrastructure availability solutions?
        • application availability and redundancy?
      ▪ Multi-datacenter (or multi-region cloud) replication is one part
        • the second part is having active-active datacenters, or applications spanning multiple sites, that provide service availability
      ▪ A stretched cluster is an example of disaster avoidance at the infrastructure layer
  • 11. Application vs. infrastructure resiliency
      ▪ Most new cloud-native applications are designed for high availability and resiliency
      ▪ Fault domain or availability zone concepts
      ▪ There are also some examples of traditional applications with high availability concepts at the application level that can also work geographically
        • DNS services
        • Active Directory Domain Controllers
        • Exchange DAG
        • SQL Always-On clusters
  • 12. Disaster recovery vs. Stretched cluster
      ▪ A stretched cluster can provide both disaster recovery and disaster avoidance in some cases
      ▪ There are some possible limitations when using a stretched cluster also for disaster recovery:
        • A stretched cluster is coupled, a disaster recovery site is de-coupled
        • A stretched cluster can’t protect you from site link failures and can be affected by the split-brain scenario
          • A witness can minimize this problem
        • A stretched cluster usually works with synchronous replication, which means
          • limited distance
          • bandwidth requirements are really high, to minimize storage latency
          • it is difficult to provide multiple consistent restore points at different times
      ▪ In most cases where a stretched cluster is used, there can be a third site acting as a traditional DR site, giving a multi-level protection approach
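Why a witness helps against split-brain can be illustrated with a simple majority vote. This is a generic quorum sketch in Python, assuming three voters (site A, site B and a witness at a third site); it is not the actual arbitration logic of vSAN or of any storage vendor, and the function name is hypothetical.

```python
def site_has_quorum(sees_peer_site: bool, sees_witness: bool) -> bool:
    """A site keeps serving I/O only if it holds a majority of the three votes.

    Voters: this site, the peer site, and the witness (third site).
    """
    votes = 1 + int(sees_peer_site) + int(sees_witness)
    return votes >= 2


if __name__ == "__main__":
    # The inter-site link is down; only site A can still reach the witness.
    print("site A active:", site_has_quorum(sees_peer_site=False, sees_witness=True))
    print("site B active:", site_has_quorum(sees_peer_site=False, sees_witness=False))
```

With the link between the two sites down, at most one site can still reach a second voter, so both sites cannot stay active on the same data at the same time.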
  • 13. Only stretched storage
      ▪ DR at virtualization layer
  • 15. Design aspects
      ▪ Split-brain scenario
        • How to avoid it
        • Networking considerations
      ▪ Dependencies
      ▪ Availability & Resiliency
        • Host failure
        • Storage failure
        • Site failure
      ▪ Data resiliency
        • Local resiliency, not only cross-site resiliency
      ▪ Data locality
        • Block storage and paths
        • NFS and IPs/networks
        • vSAN and other hyper-converged solutions
  • 16. External dependencies
      ▪ DNS
      ▪ Witness
      ▪ PSC
      ▪ vCenter Server
        • Distributed virtual switches
        • vSAN
          • Storage policies
        • vVols
          • Storage policies
        • VM Encryption
      ▪ vCenter HA?
        • vCenter HA network latency between Active, Passive, and Witness nodes must be less than 10 ms
  • 17. vSphere HA
      ▪ VMware recommends enabling vSphere HA admission control in all clusters, especially in a stretched cluster
      ▪ Workload availability is the primary driver for most stretched cluster environments, so providing sufficient capacity for a full site failure can be crucial
      ▪ To ensure that all workloads can be restarted by vSphere HA on just one site, configuring the admission control policy to 50 percent for both memory and CPU is recommended
        • VMware recommends using a percentage-based policy because it offers the most flexibility and reduces operational overhead
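The 50 percent figure on slide 17 follows from simple arithmetic: the reserve must cover the capacity lost with a whole site. The Python sketch below shows that calculation under the assumption that capacity can be summed per site as a single number (for example GB of memory); the function names and sample figures are hypothetical and this is not how vSphere HA itself computes admission control.

```python
from typing import Dict


def failover_reserve_percent(site_capacity: Dict[str, float]) -> float:
    """Percentage of cluster capacity to reserve so the largest site can fail."""
    total = sum(site_capacity.values())
    largest = max(site_capacity.values())
    return 100.0 * largest / total


def survives_site_failure(site_capacity: Dict[str, float], used: float) -> bool:
    """True if current usage fits on what remains after losing the largest site."""
    remaining = sum(site_capacity.values()) - max(site_capacity.values())
    return used <= remaining


if __name__ == "__main__":
    # Two symmetric sites with 4096 GB of memory each (made-up figures).
    memory_gb = {"site-a": 4096.0, "site-b": 4096.0}
    print(failover_reserve_percent(memory_gb))
    print(survives_site_failure(memory_gb, used=3500.0))
    print(survives_site_failure(memory_gb, used=5000.0))
```

For two equally sized sites the result is 50 percent, which matches the recommendation; with asymmetric sites the same calculation gives a reserve above 50 percent.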
  • 18. VM Component Protection (VMCP)
      ▪ The typical configuration for PDL events is Power off and restart VMs
      ▪ For APD events, VMware recommends selecting Power off and restart VMs (conservative)
        • Refer to specific storage vendor requirements
      ▪ For vSphere 5.5?
  • 19. Network heartbeat
      ▪ VMware vSphere HA network heartbeat
        • if a host is not receiving any heartbeats, it uses a fail-safe mechanism to detect whether it is merely isolated from its master node or completely isolated from the network
        • By default, it does this by pinging the default gateway
        • In addition to this mechanism, one or more isolation addresses can be specified manually to enhance the reliability of isolation validation
      ▪ VMware recommends specifying a minimum of two additional isolation addresses, with each address site local
        • This enables vSphere HA validation for complete network isolation, even in case of a connection failure between sites
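Additional isolation addresses are set through the vSphere HA advanced options named `das.isolationaddress0`, `das.isolationaddress1`, and so on. The Python sketch below only builds that option dictionary from a per-site address list and checks that every site contributes at least one address; the helper name, the site labels and the IP addresses (documentation ranges) are hypothetical, and applying the options to a cluster is left out.

```python
from typing import Dict, List


def build_isolation_options(addresses_by_site: Dict[str, List[str]]) -> Dict[str, str]:
    """Map site-local isolation addresses to das.isolationaddressN options."""
    if len(addresses_by_site) < 2 or any(not a for a in addresses_by_site.values()):
        raise ValueError("need at least one isolation address in each of two sites")
    options: Dict[str, str] = {}
    index = 0
    # Sort the sites so the numbering is stable between runs.
    for site in sorted(addresses_by_site):
        for address in addresses_by_site[site]:
            options[f"das.isolationaddress{index}"] = address
            index += 1
    return options


if __name__ == "__main__":
    # Hypothetical site-local addresses, one per site.
    print(build_isolation_options({
        "site-a": ["192.0.2.1"],
        "site-b": ["198.51.100.1"],
    }))
```

Keeping one address in each site means a host can still validate its isolation state against a local address when the inter-site connection is down.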
  • 20. Storage heartbeat
      ▪ VMware vSphere HA storage heartbeat
        • the minimum number of heartbeat datastores is two and the maximum is five
      ▪ Stretched cluster specific hints
        • For vSphere HA datastore heartbeating to function correctly in any type of failure scenario, VMware recommends increasing the number of heartbeat datastores from two to four
        • This provides full redundancy for both data center locations
        • Defining four specific datastores as preferred heartbeat datastores is also recommended, selecting two from one site and two from the other
        • This enables vSphere HA to heartbeat to a datastore even in the case of a connection failure between sites
        • Subsequently, it enables vSphere HA to determine the state of a host in any scenario
      ▪ VMware recommends selecting two datastores in each location to ensure that datastores are available at each site in the case of a site partition
      ▪ vSAN?
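The "two datastores per site" rule on slide 20 can be expressed as a small selection helper. This Python sketch assumes the datastores are already grouped by site in a plain dictionary; the function name and datastore names are hypothetical, and it only produces the list of preferred heartbeat datastores rather than configuring anything in vSphere.

```python
from typing import Dict, List


def pick_heartbeat_datastores(
    datastores_by_site: Dict[str, List[str]], per_site: int = 2
) -> List[str]:
    """Pick `per_site` preferred heartbeat datastores from every site."""
    selection: List[str] = []
    for site in sorted(datastores_by_site):
        candidates = sorted(datastores_by_site[site])
        if len(candidates) < per_site:
            raise ValueError(f"{site} has fewer than {per_site} datastores")
        selection.extend(candidates[:per_site])
    return selection


if __name__ == "__main__":
    # Hypothetical datastore inventory for a two-site stretched cluster.
    print(pick_heartbeat_datastores({
        "site-a": ["ds-a1", "ds-a2", "ds-a3"],
        "site-b": ["ds-b1", "ds-b2"],
    }))
```

With two sites and the default `per_site=2` the result contains four datastores, which is within the two-to-five range quoted on the slide.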
  • 21. vSphere FT
      ▪ VMware vSphere FT 6.x also replicates the storage
      ▪ Can function in clusters with non-uniform hosts, but it works best in clusters with compatible nodes
      ▪ vSMP FT is explicitly not supported in a stretched environment
      ▪ Legacy FT?
      ▪ vSAN?
        • https://cormachogan.com/2017/09/26/supporting-fault-tolerance-vms-vsan-stretched-cluster/
  • 22. vSphere DRS
      ▪ To provide VM locality you should build specific VM-to-host affinity rules
      ▪ VMware recommends implementing “should rules” because these can be violated by vSphere HA in the case of a full site failure
        • Note that vSphere DRS communicates these rules to vSphere HA, and these are stored in a “compatibility list” governing allowed start-up
        • If a single host fails, VM-to-host “should rules” are ignored by default
      ▪ For vSAN, VMware recommends that DRS is placed in partially automated mode if there is an outage
        • Customers will continue to be informed about DRS recommendations when the hosts on the recovered site are online, but can now wait until vSAN has fully resynced the virtual machine components
        • DRS can then be changed back to fully automated mode, which will allow virtual machine migrations to take place to conform to the VM/Host affinity rules
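The difference between "should" and "must" VM-to-host rules during a site failure can be shown with a deliberately simplified model. The Python sketch below is only an illustration of the idea on slide 22, assuming a rule is just a set of preferred hosts; the function name and host names are hypothetical and the real DRS and HA placement logic is far more involved.

```python
from typing import Set


def restart_candidates(preferred: Set[str], alive: Set[str], rule: str) -> Set[str]:
    """Hosts on which a VM may be restarted under a 'must' or 'should' rule."""
    preferred_alive = preferred & alive
    if rule == "must":
        # A must rule is never violated: no preferred host left, no restart.
        return preferred_alive
    if rule == "should":
        # A should rule is a preference: fall back to any surviving host.
        return preferred_alive if preferred_alive else alive
    raise ValueError(f"unknown rule type: {rule}")


if __name__ == "__main__":
    site_a = {"esx-a1", "esx-a2"}
    survivors = {"esx-b1", "esx-b2"}  # site A has failed completely
    print(restart_candidates(site_a, survivors, "must"))
    print(restart_candidates(site_a, survivors, "should"))
```

In this model a VM pinned to site A with a must rule has nowhere to restart after a full site failure, while the should rule lets it come back on the surviving site.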
  • 23. vSphere Storage DRS
      ▪ For Storage DRS (if applicable), this should be configured in manual mode or partially automated
      ▪ This enables human validation per recommendation and allows recommendations to be applied during off-peak hours
      ▪ Note that the use of I/O Metric or VMware vSphere Storage I/O Control is not supported in a vMSC configuration
        • VMware KB article 2042596 - https://kb.vmware.com/kb/2042596
      ▪ Also SIOC is not supported!
  • 25. Conclusions
      ▪ Stretched cluster vs. disaster recovery
      ▪ Stretched cluster + disaster recovery
      ▪ Applications & services first
      ▪ Business driven
      ▪ Design considerations
      ▪ More sites for campus deployment?
  • 26. Enjoy The Day! Join the Conversation! #VMUGCZ www.vmug.com